Monarch geneset OGS2.0

DPOGS213721
TranscriptDPOGS213721-TA1836 bp
ProteinDPOGS213721-PA611 aa
Genomic positionDPSCF300310 - 92417-94701
RNAseq coverage188x (Rank: top 48%)
Annotation
HeliconiusHMEL0152224e-1141.13% 
BombyxBGIBMGA011851-TA1e-3570.59% 
DrosophilaMur89F-PA2e-0933.61% 
EBI UniRef50UniRef50_D5KU196e-3365.88%Peritrophin type-A domain protein 4 n=1 Tax=Mamestra configurata RepID=D5KU19_9NEOP
NCBI RefSeqXP_002099964.11e-0747.47%GE16785 [Drosophila yakuba]
NCBI nr blastpgi|2914806432e-3265.88%peritrophin type-A domain protein 4 [Mamestra configurata]
NCBI nr blastxgi|2914806431e-6732.23%peritrophin type-A domain protein 4 [Mamestra configurata]
Group
Gene OntologyGO:00080612.5e-14chitin binding
GO:00060302.5e-14chitin metabolic process
GO:00055762.5e-14extracellular region
KEGG pathway 
InterPro domain[24-97] IPR0025572.5e-14Chitin binding domain
[441-462] IPR0023958.2e-11HMW kininogen
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213721-TA
ATGAACACTGCCATGAGGCTTTCGTGCCTTTTTTTGGCGTTGGCTGGCGCTTCCGCCTTCTCAATTAATTTGAGACAAAATCCGGAGACTCTCTGCTTAGGCCAAGACAACTTCTTACGTATCGCCAATGTTGATAGCTGCCAGTTGTATTATCAGTGCTATGCTGGACGTTCATATCTAATGCAATGCCCCGAAGAGACGTGGTTTAATGAAGAGATACAGGTTTGTGACCGATCGCCAATTCCAAGGAGCTGTCGCGCAGATGAATCAGAGCCGGAAACAGAACCTGAGCCTGAAGAAGAACAAGAACCGGAACCTGAACCTGAAGAAGAACAAGAACCGGAACCTGAACCTGAGCCTGAAGAAGAACAAGAACCGGAACCAGAACCTGAAGAAGAACAAGAACCGGAACCAGAACCTGAAGAAGAACAAGAACCGAAACCAGAACCTGAAGAAGAACAAGAGCCGGAACCTGAACCTGAAGAAGAACCAGAACCGAAACCAGAACCTGAAGAAGAACAAGAGCCGGAACCTGAACCTGAAGAAGAACCAGAAAATGGAGAGTCTGAGCCAGAAATAGAGGAGCCTGAAGAAGATCAAAAACCAGAGGAATCTGAACCAGACCAAGAAGTTGAAGAACCGGAGCAAGAACAAGAAGAAGAGAAACCAGAACAAGAGCCAGAGGCCGAAGAACCAGAGCCAGAGGAAGAAGCTGAGCAACCCGAAGTAGACGAGGAGGCAGTGCCTGAAGAGGATGCAGATCAACAAAATCCAGAGGAGTCTGATCCTGAGGATCAACTCCCCGAAGAGGAAATTCCTGAGCAAGAACAAGAAGAAGAAGAAGAACAAGTAGGCGAAGAAAACGAAGGAGGCTCAGAAGATGGAGAGGAAATAAAAAGATTAGGACGAAACGTTCATAGACAAGAAGAACAAGAAGGAGAGGAGAAAGACTCTGAAGAAGCTGAACATAATCATGATCATGACCACGGCCACAACCACGACCACCAACATGACCACGACCACGATCACGACCATGAACACAATCACGATCACGACCACGGCCATGGACATGACCATGCCCACGACCACGATCATGCCCACAACCACGACCACGACCACGAACACGACGACAATCATGATCACAGCCATGACCACGGCCACGACCACGGTCACGGTCACGACCACGACCATGACCATGATCATGATCATGAACACAATCACGATCACGATCACGACCACGACCACGGCCATGGCCATGACCATGCCCACGACCACGATCACGACCACGAACACGACGACAATCATGATCACAGCCACGAGCACGACCACGATCATGACCACGGCCACGACCATGACCACGACCATGACCACGGCCACGACCACGACCACGACCACGACCATGGCCATGATCACGACCACGACCACGGCCATGACCACGACCACGACCACGGCCATGATCACGACCATGACCACGACCATGGCCATGATCACGACTATGACCACGACCATGGCCATGACCACGACCATGGCCATGATCACGACCATGACCACGACCATGGCCATGATCACGACCATGACCACGACCATGGCCAAGATGAGGGTTCCGGTTCAGATATTGAAAAGGATTCTGTTGTGAAGTTTATCAAAAGAAAACTTGAACAGAGTGGTGATGGAAGTTTGGAGGATAACGTTTGGATAGATGAAGTCATTATAGATGATGAGTGGGATGTTGAAGGTGGTGTAGATGGGGAACAAGAAGTAGAAAACCCAATTGAAGCTCCAGAAGATTTCAGATGGGGAAAAATTTTCAAAAGCTGGCTCGATGGACAAGCTTAG

Protein sequence:

>DPOGS213721-PA
MNTAMRLSCLFLALAGASAFSINLRQNPETLCLGQDNFLRIANVDSCQLYYQCYAGRSYLMQCPEETWFNEEIQVCDRSPIPRSCRADESEPETEPEPEEEQEPEPEPEEEQEPEPEPEPEEEQEPEPEPEEEQEPEPEPEEEQEPKPEPEEEQEPEPEPEEEPEPKPEPEEEQEPEPEPEEEPENGESEPEIEEPEEDQKPEESEPDQEVEEPEQEQEEEKPEQEPEAEEPEPEEEAEQPEVDEEAVPEEDADQQNPEESDPEDQLPEEEIPEQEQEEEEEQVGEENEGGSEDGEEIKRLGRNVHRQEEQEGEEKDSEEAEHNHDHDHGHNHDHQHDHDHDHDHEHNHDHDHGHGHDHAHDHDHAHNHDHDHEHDDNHDHSHDHGHDHGHGHDHDHDHDHDHEHNHDHDHDHDHGHGHDHAHDHDHDHEHDDNHDHSHEHDHDHDHGHDHDHDHDHGHDHDHDHDHGHDHDHDHGHDHDHDHGHDHDHDHDHGHDHDYDHDHGHDHDHGHDHDHDHDHGHDHDHDHDHGQDEGSGSDIEKDSVVKFIKRKLEQSGDGSLEDNVWIDEVIIDDEWDVEGGVDGEQEVENPIEAPEDFRWGKIFKSWLDGQA-