Monarch geneset OGS2.0

DPOGS215364
Transcript: DPOGS215364-TA, 2169 bp
Protein: DPOGS215364-PA, 722 aa
Genomic position: DPSCF300351 + 63281-75717
RNAseq coverage: 759x (Rank: top 17%)
Annotation
Heliconius: HMEL005223, 2e-124, 69.81%
Bombyx: BGIBMGA008734-TA, 3e-137, 75.16%
Drosophila: CG3744-PB, 1e-148, 39.15%
EBI UniRef50: UniRef50_F4X388, 1e-158, 44.48%, Dipeptidyl peptidase 9 n=10 Tax=Pancrustacea RepID=F4X388_ACREC
NCBI RefSeq: XP_971949.1, 8e-174, 46.62%, PREDICTED: similar to AGAP003138-PA [Tribolium castaneum]
NCBI nr blastp: gi|91076698, 2e-172, 46.62%, PREDICTED: similar to AGAP003138-PA [Tribolium castaneum]
NCBI nr blastx: gi|91076698, 2e-168, 46.62%, PREDICTED: similar to AGAP003138-PA [Tribolium castaneum]
Group:
Gene Ontology:
    GO:0016020, 2e-52, membrane
    GO:0006508, 2e-52, proteolysis
    GO:0008236, 2.1e-08, serine-type peptidase activity
KEGG pathway:
InterPro domain:
    [173-567] IPR002469, 2e-52, Peptidase S9B, dipeptidylpeptidase IV N-terminal
    [649-693] IPR001375, 2.1e-08, Peptidase S9, prolyl oligopeptidase, catalytic domain
Orthology group: MCL10962, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215364-TA
ATGGAGTTAAGGGAGGCTGGTGATGGGGCGGCGGGCGAATATGAACGTCCTCCCAAGAAATATAGCTGGTCGGAAGTCAGGCAGGCGGTTCACGATCTTCGGAAGGAGCTATCGTCATTATCCACCATGGTGCCAATGGCTATATCGTTCAGGAAGCTCAGCAATGGAAAGATGAGGATATATTTCTTACGGACACCTCAAAACGGATGGGAGGTCACGCTTCTGTACACCGACGTGACGCCTTCGCAGACCGCTAGTAATATGAGGTTAGACTGGCGGCCGTTGATTGAGTCCAACGTAGCGCTGGGCGTGTCATCAGGGAAGTGGTCTCGTGAGGAACAGCTGCTATGGGAGCGGCAGAGGGTCGCCGCTTGGGGTATAGCATCTTATGAACTGCACCCCAAGACCGGGAGAGTGCTGTTCCCATGCGCTTCATCTTTGTTTATAGCGGAAGAGGCCCCGAACCAGTCCCCTCCCCTAGTACCAAAGTCGCTCAGCACGGGGTGGGGGGCCCCGCTGACACCAGCGATGTGTCCCGCAATGCCGTCCCTGGTAGCGCACGCTGCCCGTGGTGATGTGTGGCTGGCGGGCGATTCTCTAAGGCGGCCCGCAAGACTTACGTACGCGTGTAAAGGGAGGGAAGAACGCTTATCAGATGATCCTAAGCAGGCTGGGGTGCCGTGCTACGTGACTCAGGAGGAGTTTTCGAGATATACCGGAATATGGTGGCAGCCGCAGTCCACAGATAATGTATTTAGAATAGTGTACGAGGAGGTGGATGAGGGTGAGGTGAAGATATACAGCTTCCCATCATCACAGAGCTCCAGCGGGGAGGTCGAGGAGTTCAGGTTTCCCCGCGCCGGCACCCCTAATGCTAAATCAGTCCTGAAAATGGTGACCTTCAGATTACAGAAAGCTCCCCCCACCACCGTCCTTGATTATTACCAAGAAGGGAACTCTAATACTGTTGCATCAGAGAGCCCCGGGAACAGTTCGGATCCCTTGGAGGTGGTCGATGTAAGATGGTATGAACTGAGACATTCGCTGAAAGAGGTGTTCCCCTGGTTTGAATACCTGGCCAGAGTCGGTTGGACCCCGTGCTCTCAATACGTTTGGGTCCAGGTGTTGGACAGGAAGCAGCAGAGGTTAGAACTGGCCCTGGTGCCGGTTAGTGAGTTCAATGTCCCCGTGAGGTATGAGCAGGGGTCTGATGGAGGAAGACTGGATGAGGAATCTCCAGCTTCAGGGAGTAGACAGGGAGACAGGACACAGATCCAGGTGTTGGTGTCTGAGACGGCTCCCGACGCGTGGGTCAACGTCCACGACATACTGCACTTCCTGCCCTCAGAACCTGGTATTGTGAGGTTCATCTGGGCTTCAGAGGAAACCGGACACCTGCACCTGTATCTCATCACCTGCGCTGTCAACGGACAGAGGGCTATGACAGTAACTGATATAATGGCTGAGGATGAGTCAAATGCTGCAGTCCCTCGGGTGATCAGCAAGGAACCCCTCACTGATGGGGACTGGGAGGTCATGGGAAGAAAGATATGGGTGGACGAGCCGCGCGGTCTGGTGTATTTCGTAGGGCTCCGTGAGACGCCGCTGGAGCGCCACCTGTACGTGGTGTCAATGTCCGCGCCCAGGCAGGTCGTCCTGCTCACTAAGCCGGGACATTCACACAGTGTTGACATGGACGAGTCACCGGAACCTCGTTCGTTCAACGGTTCCTGGGACTGTCGTCCTGATGAGGAGGAGTCGCCCAGCACCCGCCCTCCCCCGGTTCCCCCTCCACAGATACTATCGACTCGTCTGTCTTGCGGAGCCCTAGCATACTGCACACTTTGGCGGAGCGCCGTCCCAGGGCGAAGGCCGACTGTCTTACACGTTTACGGAGGGCCCGAGGTTCAAACGGTCACTAATAGTTACAAGGGTGTACGACAGTTGAGAATGCATATGCTGGCTGCCCGAGGGTTCACAGTGGTGTCCGTGGACTCGAG
GGGGTCCAAGCACAGAGGGAGGTTGTGGGAAGCAGCTATCAAAGGAAAGATGGGACAAGTGGAGCTGGACGATCAGGTTTACCCGGGTGAGAGGCATTCGCTGCGAGCTATGCACGCGGCTAAGCATTACGAGGCGACACTGCTGCACTTCCTACACGAGAACCTGTAG

Protein sequence:

>DPOGS215364-PA
MELREAGDGAAGEYERPPKKYSWSEVRQAVHDLRKELSSLSTMVPMAISFRKLSNGKMRIYFLRTPQNGWEVTLLYTDVTPSQTASNMRLDWRPLIESNVALGVSSGKWSREEQLLWERQRVAAWGIASYELHPKTGRVLFPCASSLFIAEEAPNQSPPLVPKSLSTGWGAPLTPAMCPAMPSLVAHAARGDVWLAGDSLRRPARLTYACKGREERLSDDPKQAGVPCYVTQEEFSRYTGIWWQPQSTDNVFRIVYEEVDEGEVKIYSFPSSQSSSGEVEEFRFPRAGTPNAKSVLKMVTFRLQKAPPTTVLDYYQEGNSNTVASESPGNSSDPLEVVDVRWYELRHSLKEVFPWFEYLARVGWTPCSQYVWVQVLDRKQQRLELALVPVSEFNVPVRYEQGSDGGRLDEESPASGSRQGDRTQIQVLVSETAPDAWVNVHDILHFLPSEPGIVRFIWASEETGHLHLYLITCAVNGQRAMTVTDIMAEDESNAAVPRVISKEPLTDGDWEVMGRKIWVDEPRGLVYFVGLRETPLERHLYVVSMSAPRQVVLLTKPGHSHSVDMDESPEPRSFNGSWDCRPDEEESPSTRPPPVPPPQILSTRLSCGALAYCTLWRSAVPGRRPTVLHVYGGPEVQTVTNSYKGVRQLRMHMLAARGFTVVSVDSRGSKHRGRLWEAAIKGKMGQVELDDQVYPGERHSLRAMHAAKHYEATLLHFLHENL-
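
The transcript and protein records above can be cross-checked against each other. The following is a minimal Python sketch, not part of the database record, that translates a coding sequence with the standard genetic code and tests whether the stated lengths agree (722 codons plus one stop codon gives 2169 bp). The function names `translate` and `lengths_consistent` are hypothetical helpers introduced here for illustration. The sketch assumes the transcript is a complete coding sequence in frame from its first base, and it writes the stop codon as `-` to match the trailing character of the protein record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated
# in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence, reading from the first base.

    Whitespace and line breaks are removed first, so a sequence that is
    wrapped over several lines can be passed in as-is.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript length equals 3 * (residues + 1 stop codon)."""
    return transcript_bp == 3 * (protein_aa + 1)


if __name__ == "__main__":
    # First 27 bases and last 12 bases of DPOGS215364-TA, copied from above.
    print(translate("ATGGAGTTAAGGGAGGCTGGTGATGGG"))  # MELREAGDG
    print(translate("GAGAACCTGTAG"))                 # ENL-
    print(lengths_consistent(2169, 722))             # True
```

Passing the full nucleotide sequence (both wrapped lines) to `translate` and comparing the result with the protein sequence would extend the same check to the whole record.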