Monarch geneset OGS2.0

DPOGS214134
TranscriptDPOGS214134-TA2673 bp
ProteinDPOGS214134-PA890 aa
Genomic positionDPSCF300014 - 1340053-1358426
RNAseq coverage555x (Rank: top 23%)
Annotation
HeliconiusHMEL0025050.083.50% 
BombyxBGIBMGA006181-TA0.079.85% 
DrosophilaCG17684-PC0.051.80% 
EBI UniRef50UniRef50_UPI00022471530.054.14%UPI0002247153 related cluster n=2 Tax=unknown RepID=UPI0002247153
NCBI RefSeqXP_001688473.10.050.63%AGAP005043-PB [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3407216390.057.24%PREDICTED: dipeptidyl aminopeptidase-like protein 6-like isoform 1 [Bombus terrestris]
NCBI nr blastxgi|3504048530.057.24%PREDICTED: dipeptidyl aminopeptidase-like protein 6-like isoform 1 [Bombus impatiens]
Group
Gene OntologyGO:00160201.9e-53membrane
GO:00065081.9e-53proteolysis
GO:00082362.5e-29serine-type peptidase activity
KEGG pathway 
InterPro domain[195-560] IPR0024691.9e-53Peptidase S9B, dipeptidylpeptidase IV N-terminal
[687-877] IPR0013752.5e-29Peptidase S9, prolyl oligopeptidase, catalytic domain
Orthology groupMCL16464 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214134-TA
ATGACCTCCGTGTCCGCAGACTCGCAGACTCGCCACAATCCGACCGAACATGCATCACCTGATGGGGCTGTCCGTGTTCCGGTGTGTCGCGAGTGTCGCGTCTGTGGCGTCCGTGGCGTCTGTGGCGTTTGTGGCGCGCGTGTCAGTGTCGCGGGCGAGGTGGTGTGCGGGCGCGTAGGGCTGCGCGGTGGCGGGTGCGCGCGCGGCTCGGCCGGCTCGGGCGGCGGCGCATTATCTGCGAAGGAAGAGGACCTGTACCCCGGCGATGGACACAATTGGAGAAGCATCATATTCTCATTGATGGTCATCAGTTTTGTGATAGCAGGCATTGTCACAGCGATTTACTTATTGGGATATGTGGACGAGCTGCTGTACTGGTCGGGGCGTCGTATGAGGCTGGACGAGTTCCTGCGAGGAGACCTGACTGGCGAACGTCTGCCGACCACGTGGGTCAGCTCACACCAGCTCGTGTACCAGGCGGACGACGGAGGTCTACTCGCCCTGGACACCTTCAACAATACGCTATACGTGCTCGTCACAAACCACACTCTCAGGCAGCTCAATGTGCGAGGGTATCAATGCTCGCCAAATCTACGCTTCGTGCTGTTCCAACACAACATCAAAGAGGTTTACCGACGGACTTTTACTGCGCATTACACGGTTTATGACGTCACAAATGACCACCATATCCCCCTCTTCGGCGAGGGTCAGAGCAGTTGGGAGTGGCAGCACGCGGCTTGGCTCGGTACAGAAGGTGCTATCGTACTGGCTGCAGACAACGAGGTCTTGGCGAGACCAGCACCTCCTGCACGAAGAGCTCCCTTACTTCGACTTACAAATGATGCTGTCGCGGGCAGCGTTTATAACGGGGTGTCGGATTGGTTGTACCAAGAGGAGGTGACAAAAGAATCATCAGCGACTTGGGGATCATCAGACGGGGCTTTCGTGTTATATGTCCAGTACGATGACAGGAAGGTGTCTCAGATGAGGTTCCCACATATTTCCTCCGGGATAGGTGGCGCTGGTGCCTCCAGATCAGGGTTTTTGCTTCCCGCTTTCAACAACAGTAACCCAACCATTTTCCCTGATCACGTAACGATCAGATATCCCACGCCCGGTAGTTCAATACCTCTCGTTAAGTTGTGGATAGTTGCGGTACAAAACGTAACATCTCCACCCAGGTGGGAAGTAAAACCTCCAAGTACATTAGATGGCATGGAATATTATCTCATATCAGCCCAATGGGTGGGCAAAGAAAATTCTCATATAGGAGTAGTTTGGATGAACCGTGCACAGAATCTAACTACCCATTCAGAAAAAGCCACTGATGAGCCTTGGTTAGAAGTTCATCAGCGTCCGGTTTACTCAGAAGACGGCAGTGCTTTTCTACTTCTAGCAGCAGTCCAAGAAGGCGGAGGCCAATACTATACTCATATTAAGCATGTTGATGTCCTCCGTCAACGCATAGCTGTTCTATCGCACGGTAAAGTGGAGGTGGCGAAGATCCTGGCGTGGGACCAGGAAAACAATTTAGTTTATTATTTAGGTAGCGCAGATAGACCGGGCCAGAGGCAGGTGTACGTGGTACGTGATCCAAGCTACGGAGGAGCTAGCAACTCCGTCAGAGCTAGAGCTGAACGCGAGGAACCACGTTGCCTTACGTGTGAGTTGGCTGTATGGCCTGCTCGGCTTCATTACGCCAACTGCACTTTCTGGAGCGCGACGTTCCCACCTCCTAAGCCGAAGCGTGGTATAACTCATTACGTTCTGGAGTGCAGAGGTCCTGGGCCTCCACTCGCAGGTCTTCACGATGCCAAGACTCATAAGTTAGAGAGAATTTTATACGATACGAGGCCTTATAGATCTGTACGATTACGTGAGTTGGCATTACCTTCTCGTAGATCATTTGATGTACAATTGAGTAGTGGCTCTAAAGCCCGTGTACAGCTTCTGCTGCCGCCGTCTTGGAGAGAAGAACTCCGTGACGCAGCATTTCCTGTACTAGTTCACGTAGACGGTCGCCCTGGCAGTCAACAAGTGACAGATGAATTCCTGGTAGACTGGGGAACGTATATGTCCTCACGTAACGACGTAGTTTACGTTAAATTAGATGTAGCAGGCGCTAAAGGGCTACCCCGAGCGCTGTTACGAGGTCGCCTCGGTGGGGTCGAGGTGGCCGATCAATTGGCTGTTATTAGATATTTATTAGAAACATTTAAATTCTTGGATGTAACTCGTGTTGCTGTTTGGGGATGGGGTTATGGTGGATATGTAACGTCAATGTTGTTGGGGTCTCAGCAGTCTACTTTAAAGTGTGGTATAGCGGTGTCACCGATCACAGACTGGCTGTATTACAACGCAGCATTCACGGAGCGTATCCTGGGCCAACCGTCAGTTAATTATAAAGGGTATGTGGAGGCTGATGCGTCCCAGCGCGCGCACCACGTGCCGCCGCACGCGTTGTACCTCGTGCACGGGATGGCAGACATGAGCGCGCCGTACCCTCACGCTCTGCAGTTGGCTAGGGCCTTGACTGATGCTGGAGCGTATGCTGATGAAGGACACGACCTTGAAGGTGTTATCGAGCATGTTTACCGGTCAATGGAAGATTACCTCCTAGAGTGCTTGTCCCTCGACCCAGAAGACACCAAGCTGCCTCCGCCAGATAGATAA

Protein sequence:

>DPOGS214134-PA
MTSVSADSQTRHNPTEHASPDGAVRVPVCRECRVCGVRGVCGVCGARVSVAGEVVCGRVGLRGGGCARGSAGSGGGALSAKEEDLYPGDGHNWRSIIFSLMVISFVIAGIVTAIYLLGYVDELLYWSGRRMRLDEFLRGDLTGERLPTTWVSSHQLVYQADDGGLLALDTFNNTLYVLVTNHTLRQLNVRGYQCSPNLRFVLFQHNIKEVYRRTFTAHYTVYDVTNDHHIPLFGEGQSSWEWQHAAWLGTEGAIVLAADNEVLARPAPPARRAPLLRLTNDAVAGSVYNGVSDWLYQEEVTKESSATWGSSDGAFVLYVQYDDRKVSQMRFPHISSGIGGAGASRSGFLLPAFNNSNPTIFPDHVTIRYPTPGSSIPLVKLWIVAVQNVTSPPRWEVKPPSTLDGMEYYLISAQWVGKENSHIGVVWMNRAQNLTTHSEKATDEPWLEVHQRPVYSEDGSAFLLLAAVQEGGGQYYTHIKHVDVLRQRIAVLSHGKVEVAKILAWDQENNLVYYLGSADRPGQRQVYVVRDPSYGGASNSVRARAEREEPRCLTCELAVWPARLHYANCTFWSATFPPPKPKRGITHYVLECRGPGPPLAGLHDAKTHKLERILYDTRPYRSVRLRELALPSRRSFDVQLSSGSKARVQLLLPPSWREELRDAAFPVLVHVDGRPGSQQVTDEFLVDWGTYMSSRNDVVYVKLDVAGAKGLPRALLRGRLGGVEVADQLAVIRYLLETFKFLDVTRVAVWGWGYGGYVTSMLLGSQQSTLKCGIAVSPITDWLYYNAAFTERILGQPSVNYKGYVEADASQRAHHVPPHALYLVHGMADMSAPYPHALQLARALTDAGAYADEGHDLEGVIEHVYRSMEDYLLECLSLDPEDTKLPPPDR-