Monarch geneset OGS2.0

DPOGS215679
TranscriptDPOGS215679-TA3165 bp
ProteinDPOGS215679-PA1054 aa
Genomic positionDPSCF300041 - 964344-971328
RNAseq coverage184x (Rank: top 49%)
Annotation
HeliconiusHMEL0040850.069.67% 
BombyxBGIBMGA003578-TA0.067.70% 
Drosophilabsf-PA4e-10827.54% 
EBI UniRef50UniRef50_D2A3F73e-10727.79%Putative uncharacterized protein GLEAN_07529 n=1 Tax=Tribolium castaneum RepID=D2A3F7_TRICA
NCBI RefSeqXP_001657384.14e-10929.12%leucine rich protein, putative [Aedes aegypti]
NCBI nr blastpgi|1571120347e-10829.12%leucine rich protein, putative [Aedes aegypti]
NCBI nr blastxgi|910807135e-11027.91%PREDICTED: similar to bicoid stability factor CG10302-PA [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL18281 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215679-TA
ATGTTAGCCCGTCAATGTAATAAATTAAGGTCGTCTGTAAAGGTCATATCTTGTAGTAAAATTGCCTATAGTAATAAAGTTAACTTACAGAAATCGTGTTTTTTACTAGGCTCATCAAAAAGGTTTTTAGGAAATCAAAATCCAATAAGCAAAAGTGACATATTCGTGTCTACTGGACCCACAAATAAAAATGTTGTAGGAGGAAATAAAAGAAGTAGAATATCCTCTGTGGAATTTCTTGAAAAACTAATAGCTGGAATATCAACAGACATACACCAAAAAAATAGGGTTTATAAAAATGATTTTATTAGAGTTATAGATAAAATTAAAGAAACTAATATGTCATCAAGAAAGCAAGGGTTGCTACTTATAAAATGTTGCACAGAACTAATGCCTGATGAAATGCCATCGACCAGAATGGCTTTAGTCCAAGAGTTATGGAATGCTATTAAACCTCATACCACTTTTGATGTTGAGCATTACAATGAATTGCTGAGGGTATACATTGCAAACAATAAAAAACTTGTAGTCAGTAAGTTTATTGGCGAAATGGGTGCTGTGAAACCTAATTTGACTACATTCGAACTGATCCTAAGGACTCTTGGAGAGGCCGGAGACATCAATCAGTGTACTGAGGTTATTTCTAATATGAAAGCCCAAGGACTGCCAGCGACTGAAAATGTATTTAACTCATTAATAATTTGTCATGGAAAATCTGGGAATTTAGATAATATCCCTGAAATTTTAACAATGATGAAATCTCTGCAACTGGACTACTCTGTCAATACATATACAGCAATCGCTAGAGCATATGGATGGAATAAAAAGAACCAACAACTTATAAAGGAAATGAATACAGCCGTAAAAAACGGAATTACTTTTGGGGAGGTCCATATTATGGATATAGTAAAAACCTTGGCATTTGTTGGAAATCATACTTCTATACCGGAGGTACTTAAGTATCTGCCTAGACAGATTTTAGAATCTCCATCTATTACTCCTACAATGCAGAGTGTGGCAACCCTACTGGTATTTCAAAGTCACCCGATGGCAGCCCTGGAAATATACAAAATTTTACCATTACCTTCTTTCGGACCAAAGGATGATATGGGACTACATGGAAGGAGTTTAGTTCGAGATTGTGTAAAGGCCTCAACACCATCCAGCGTAATAGCATTAATATCGCAAGAGCTTATGTCGTCCGGTCGTAACGCTATAGCTCTGCAGAACGCAGCCGAAGCGGCTTTGCAGTTAGGAAAAGTTCCCTTAGCTTTGGACTTGTTTACACGAATGAAGCAGTTCGGGATGCCTGTTAGACCACACTACTTCTGGCCGATATTGCTGCAAACATCTAAAACTTATGGAGAGAAAGGCGTAATGAACACTTTATCGACAATGTCAGAAATGGAAGTCAAACCGGACTATGAGACTATCATGGACTATACACTGCCTTATGTCAGTTTCACCTCACCACAGAACTTAATGAAGAAGTTTCAAGAGGCAGGTCTAACAGTGACCAGTGTATTGACCCCTATGATGGTTACCTTACTAAATACAGGACAAGTCAGAGCTGCCAGTGAGATATGTGAACTGTTTGATGGGAAGGTAGATGCAGAGTTAGTCATTAAACCTTTATTAAAAGGTTTCTTAATAAGCTCCGATATAAAGTCTGTTATACATATATTAGAAGATATAACAGCTAAGGCTCTGGACAAGAAAAAAGATTGGGTCGGACGGTTTTTGTGTGCCCTCATAAAAAATAAGAGAGTCAAAGAGGATTTTCCCGACATTCTGAGTATTGTTAAGGCGCTGCAAGGGAAAGGAATTAAAATATCAACATCAGCAGCCGACTATTGCTTGTCCCGTGTACCGGAACATCCCAAGAAGGAACTCATTGACAGTTTTAAAGAGGCTCTGGTGAATATAACAGACGAGAGATTAGTCGATGAAGGAGAGATGTTTGTGCAACAGATTCAGCACCCGAAGCAAATGAACGAAGAGGCCTTGTCTGCTCACCTGGTGGAGCTTGAATCTAAGGGGATGAACACACGGGGCGTCCTGCGGAAACTGTTGCAGCAGTACTGCAGGTCTGGAAATCTGGCTGCCGCGAGGGAGATTCTGGAAAAGTGCGAGAGGGAGGGGGTGTTCCTATCAGCTGGAATGAAAGCATCAATATTCGATCTTCACGTGAAACAGGGGGAGTTGGATATGGCTGAATTGGTGTTAGCGGATTTGAACAAATCATCACCAAATTTTACTTTGGATGAATACAAAGTCATAGACTTCGCGACCCTCATGGTATATAGAAAGAAAATCGACAAGGCTTTCGAGTTAATCAACGAGCAGTCGAGAAAGCGGCATATAACTGGTGGACGTAATCTCTCAATGAACTGTTGGCGGCTGATGGACGCGGTAGCGGCTCAAGGCTCAGTGACTGAGGCCAGGCGGATGTTCACTCTGCTTACTTCCTTACGTTACTGCAAACCCTCGAACACACTTCTGGGACCAGTAGTACGAGCTCATCTAAAGAACGGTGACTTGGAACAGGCGGTACAGGAGTTCGTAAGTCTGGCGGAAAAATACAAAAAGACACCCTTGAAACACGAGTTGCTCTGTAAAGTTTTGCTGGCTATGAACGAGGGAAAATCTGAAGAAAGGTTCATAAGTAACGAGCAATCAAACGGGAAACTAAACAAATTAGCTCAAACTATACTCAATGTGGACAGACAAGTCCACGGGGCGAGTGATGTTCAGGTAACCCTAATAGCGGCGCTGGCGGATGTGGGTTACAAGATGACACTCCGAAAAATTTTCCTAGACCCTACTACGAAGTTCCATCCGGATGCTTTGCTAAGACATTGCGAGAGATTCGCGGATGAGAAGAAAATTCACGCGTTGGAAGCCATCGCTGATACCTCCAAAGACCTGAGACATGTCCATATTGAAGAGATTTATAACCTGATCCTGGATATATATCAACGAGAAGACAATTGCAAGGACGCGTTAACATTATTCTACAAAATGCAAGAAAACGACATAGAACCCTCCAAGAAGTTCATAGAGAATATTTGTTCGCTATTCAAGTCCAATAATAAACTCGTACCGCCGGATGTAGCGCTGATTAGAGACAAACTAACAAAAGGCGCCTCTAAGAAAGCTGTCTAG

Protein sequence:

>DPOGS215679-PA
MLARQCNKLRSSVKVISCSKIAYSNKVNLQKSCFLLGSSKRFLGNQNPISKSDIFVSTGPTNKNVVGGNKRSRISSVEFLEKLIAGISTDIHQKNRVYKNDFIRVIDKIKETNMSSRKQGLLLIKCCTELMPDEMPSTRMALVQELWNAIKPHTTFDVEHYNELLRVYIANNKKLVVSKFIGEMGAVKPNLTTFELILRTLGEAGDINQCTEVISNMKAQGLPATENVFNSLIICHGKSGNLDNIPEILTMMKSLQLDYSVNTYTAIARAYGWNKKNQQLIKEMNTAVKNGITFGEVHIMDIVKTLAFVGNHTSIPEVLKYLPRQILESPSITPTMQSVATLLVFQSHPMAALEIYKILPLPSFGPKDDMGLHGRSLVRDCVKASTPSSVIALISQELMSSGRNAIALQNAAEAALQLGKVPLALDLFTRMKQFGMPVRPHYFWPILLQTSKTYGEKGVMNTLSTMSEMEVKPDYETIMDYTLPYVSFTSPQNLMKKFQEAGLTVTSVLTPMMVTLLNTGQVRAASEICELFDGKVDAELVIKPLLKGFLISSDIKSVIHILEDITAKALDKKKDWVGRFLCALIKNKRVKEDFPDILSIVKALQGKGIKISTSAADYCLSRVPEHPKKELIDSFKEALVNITDERLVDEGEMFVQQIQHPKQMNEEALSAHLVELESKGMNTRGVLRKLLQQYCRSGNLAAAREILEKCEREGVFLSAGMKASIFDLHVKQGELDMAELVLADLNKSSPNFTLDEYKVIDFATLMVYRKKIDKAFELINEQSRKRHITGGRNLSMNCWRLMDAVAAQGSVTEARRMFTLLTSLRYCKPSNTLLGPVVRAHLKNGDLEQAVQEFVSLAEKYKKTPLKHELLCKVLLAMNEGKSEERFISNEQSNGKLNKLAQTILNVDRQVHGASDVQVTLIAALADVGYKMTLRKIFLDPTTKFHPDALLRHCERFADEKKIHALEAIADTSKDLRHVHIEEIYNLILDIYQREDNCKDALTLFYKMQENDIEPSKKFIENICSLFKSNNKLVPPDVALIRDKLTKGASKKAV-