Monarch geneset OGS2.0

DPOGS201115
TranscriptDPOGS201115-TA2532 bp
ProteinDPOGS201115-PA843 aa
Genomic positionDPSCF300137 + 144707-195238
RNAseq coverage361x (Rank: top 33%)
Annotation
HeliconiusHMEL0179862e-17173.46% 
BombyxBGIBMGA006181-TA3e-7627.88% 
DrosophilaCG11319-PA3e-13734.67% 
EBI UniRef50UniRef50_D6WQR40.042.71%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WQR4_TRICA
NCBI RefSeqXP_975282.10.042.71%PREDICTED: similar to CG9059 CG9059-PB [Tribolium castaneum]
NCBI nr blastpgi|910871410.042.71%PREDICTED: similar to CG9059 CG9059-PB [Tribolium castaneum]
NCBI nr blastxgi|910871410.042.69%PREDICTED: similar to CG9059 CG9059-PB [Tribolium castaneum]
Group
Gene OntologyGO:00160201.9e-65membrane
GO:00065081.9e-65proteolysis
GO:00082361.5e-28serine-type peptidase activity
KEGG pathway 
InterPro domain[120-477] IPR0024691.9e-65Peptidase S9B, dipeptidylpeptidase IV N-terminal
[631-825] IPR0013751.5e-28Peptidase S9, prolyl oligopeptidase, catalytic domain
Orthology groupMCL17800 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201115-TA
ATGACAGACATTATAACGTGTGAGGAACTAGCTGCAGCAACACCAACGCAACGGAACTGGCGCGGTATACTGATCGCATTGCTGGTTATAGCAGCGGTGTTGGGGCTGATCGTGTTCAGCATAGCGCTGCTGACTCCGGCTGGTGATGATGGCCGGGGCAGGGGCAGAAGACCGACACTCGCGGATATAATGACAGAGCACTTTAAGCCGTTTAATGGCACATGGCTATCAGACGAGGAGTTGGTGTTCCGCGACCGTTGGGGAGGATTGACGCTGTTCAACGTCAAGAACTTAACAACCAGACTGCTTATGAACAATTCTACCTTTAGGGAGCTGAATGCTGTAGACTTTAAAGTCTCTTCCGATCTGAAGTTCGTTCTGCTGATATCCGACGTGCGACCGGGCTGGCGACACGCAAGACTAGCAAGATACCACGTGTATGATGTTATAACTAGGAACAAAATCCCCATTTCGCCGATAGAGGACGACAGGTCTGCTCCCTTGCTGCAGTATGCAGAATGGTCTCCTGTCGGCTCCGGGCTGGTGTTCGTATATGACAACGACATTTACTACAAGCCTAAGGTTTTAAAGGCCTTGGTTTGCAGAATCACTAGTAACGGAGTTCCAGGTGTAATCTTTAATGGAGTACCAGATTTCCTTTACGAGACCGAGGTGTTGCGATTGGACCGCGCCCTGTGGTTCAGCCCCGACGGACAGACGCTCATGTACGTGACCTACAATGACAGCCTGGTCCAACAACACAAATATCCTTGGTATGGTTTGGATCAACAGGAACCGCCCGCCTACCCTGCCATACGGACCCTGAGATATCCGAAGATGAACACTAATAACCCAGCAGTAACGGTGTACGTGGTCAGTCTGAAGACTCCAAAGTTTCTGTTCCCACATGCTATACAGTTTAATTCACCCTTTGACTCTGGCTGGTATGTTCGTTGGACAAGTTGGGTGTCTGAGCGTCAGATAGCTGCTCTCCTTCTCAACAGACCTCAGAATCTATCCATCATCGCGACATGCAACGCTGTGTCTTACAACTGCCAAGATATCTATCGCGACGAATCTGACGGTTCGCGATGGTCAGGTCTTGGGTCAGACCCGGAGGAAGAGTGCGGCTGGTGCGGGGGGGCAGCGCTCGTGGGGGGGAGGAGCGGCATCTTCACGTCCATACCTGTCACCGACCAGGGAGGCGTGTGGAGACACGCTATACATCTCACCCAGGAGACCAGAACGACCATTACCCAGGGGAACTTCGAAATAACACAGCTGATTGGATGGGATGAGAAACGAAGACTGCTCTACGTGATCGGTACCGCCCCTGACAAAGCCGGAGAGCGTCACCTCTACCGCGTGTCGGTGCCTGTGGACGGCTGGCCTCCTCCGCCCGTCTGCCTCACCTGCCCTGGACGGATGATTGAAGCCACCGCTGAACCCAGCACGGAGGAACCGGAATACGACAACAGCACGTCGTCATTACCCGCGTGGCCGACGTCCACAGTTCTGCCTCACGTGGCCGACGAGAACGACTCGCTTCCCACCGCCTGTCTCTACAACAGAGTTATATTTAGCAAAAATTTCTCGTACTACGTTCAGGAGTGCCTCGGTCCAGAGCCCCCGGCGATATTCCTTTGTACGTCAGCGGGGTCTCGCCGAGCCGTGCTGTGGGACGGGGCGCCGCTCAGACAAAAGTTCGCGGCCCTCGCTTCCCCCCAGGCCAAAGTGTTCAGAGTCGAGGTCCAAGCGCAAAGATCAGCACGCGTAAGGTTGCTGCTGCCTCCGGGCTTGCGGGACACTGACGACTTGCCGCTACCACTCGTATTGCATTTGTCTTCGGCTCCAGGTTCCCAGCTGGTGACTGAGCAGTGGGCACCTGGTTGGGGCTGGTATCTCGCCGCCGCAAGGAACTTTATAATTGCAGAAATAGACGCGAGAGGATCCGGCGGACAGGGAGAGGAGTTACGAACAGAGATATACCAGAAACTGCTCTCAGTAGACGTCGAAGACCAAATAGCTGTTTTATCCTACCTCCGTGACAACCTGAAGATGGTGGATGGGAGCCGTACTGGTGCCTGGGGGAGCGGGTACGGCGCGGGCGCGGCGCTCGCCCTCGCTGCGGGAGACGCCGCTAACCTCACCAGGTGCCTGGCCCTGCTGGCGCCCCTCGCCGACTTACGACACCACAACTCGTTCTGGTCGGAGCGCTACTCCGGCCTGGGCGGCGGAGCGTCGCTGGGTGTGTGGCGCCGGGCGTCCTCGGTGCCTCCTCGGCGCGTGTTGCTCGCGCACGCCACCGCGGACGTGCGGGCGCCTCCGCCCCATGCTCTCGCTCTTGCACGAGCTCTCATACAAGCCAGAGCCGTGTACTCTCATCAGGTGTATCCTGATGAGGGTCACAACTTCGAGCGCTCGTTCCTGCACGTGTACTCGACTATGGAGCAGTTCTTCGACGAATGTTTCGGGCCCGTGGAGCTCGCGGACTGGGACAATCCCGGAGGACTGTTTCCATTTAGGGATTGA

Protein sequence:

>DPOGS201115-PA
MTDIITCEELAAATPTQRNWRGILIALLVIAAVLGLIVFSIALLTPAGDDGRGRGRRPTLADIMTEHFKPFNGTWLSDEELVFRDRWGGLTLFNVKNLTTRLLMNNSTFRELNAVDFKVSSDLKFVLLISDVRPGWRHARLARYHVYDVITRNKIPISPIEDDRSAPLLQYAEWSPVGSGLVFVYDNDIYYKPKVLKALVCRITSNGVPGVIFNGVPDFLYETEVLRLDRALWFSPDGQTLMYVTYNDSLVQQHKYPWYGLDQQEPPAYPAIRTLRYPKMNTNNPAVTVYVVSLKTPKFLFPHAIQFNSPFDSGWYVRWTSWVSERQIAALLLNRPQNLSIIATCNAVSYNCQDIYRDESDGSRWSGLGSDPEEECGWCGGAALVGGRSGIFTSIPVTDQGGVWRHAIHLTQETRTTITQGNFEITQLIGWDEKRRLLYVIGTAPDKAGERHLYRVSVPVDGWPPPPVCLTCPGRMIEATAEPSTEEPEYDNSTSSLPAWPTSTVLPHVADENDSLPTACLYNRVIFSKNFSYYVQECLGPEPPAIFLCTSAGSRRAVLWDGAPLRQKFAALASPQAKVFRVEVQAQRSARVRLLLPPGLRDTDDLPLPLVLHLSSAPGSQLVTEQWAPGWGWYLAAARNFIIAEIDARGSGGQGEELRTEIYQKLLSVDVEDQIAVLSYLRDNLKMVDGSRTGAWGSGYGAGAALALAAGDAANLTRCLALLAPLADLRHHNSFWSERYSGLGGGASLGVWRRASSVPPRRVLLAHATADVRAPPPHALALARALIQARAVYSHQVYPDEGHNFERSFLHVYSTMEQFFDECFGPVELADWDNPGGLFPFRD-