Monarch geneset OGS2.0

DPOGS214092
Transcript: DPOGS214092-TA, 6525 bp
Protein: DPOGS214092-PA, 2174 aa
Genomic position: DPSCF300014 - 2252778-2262759
RNAseq coverage: 128x (Rank: top 57%)
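
The lengths reported above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the transcript consists of the coding sequence only (the nucleotide sequence in this record begins with ATG and ends with TAA) and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical.

```python
# Values copied from the record.
TRANSCRIPT_BP = 6525
PROTEIN_AA = 2174
GENOMIC_START, GENOMIC_END = 2252778, 2262759


def coding_length_bp(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues, optionally plus one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


def genomic_span_bp(start: int, end: int) -> int:
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# 2174 codons plus one stop codon should account for the whole transcript.
print(coding_length_bp(PROTEIN_AA) == TRANSCRIPT_BP)

# Genomic span, and the part of it not covered by the transcript.
span = genomic_span_bp(GENOMIC_START, GENOMIC_END)
print(span, span - TRANSCRIPT_BP)
```

Under these assumptions the transcript length matches the protein length exactly, and the remainder of the genomic span would be sequence not present in the transcript, for example introns.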
Annotation
Heliconius: HMEL011428, 0.0, 44.86%
Bombyx: BGIBMGA014306-TA, 2e-57, 86.21%
Drosophila: (no hit listed)
EBI UniRef50: (no hit listed)
NCBI RefSeq: XP_969118.2, 3e-10, 34.78%, PREDICTED: similar to GA18707-PA [Tribolium castaneum]
NCBI nr blastp: (no hit listed)
NCBI nr blastx: gi|123438677, 1e-80, 21.27%, viral A-type inclusion protein [Trichomonas vaginalis G3]
Group
KEGG pathway: (none listed)
Orthology group: MCL22358, Insect specific

Nucleotide sequence:

>DPOGS214092-TA
ATGTCTTTTATCACGAAAAGGGAACGAGTATTCAATGAATACGACCTATTTGACTTACCGGCTAGGAATGAGGGTAAACTAAAAACGTACGCATCATGTACGGACGCAAGAGCTTGGTCCCATTTCTGTTCAGATAAAACTGAGCATTTGGTTGAAATTATAAAGCGAAATAATGAAACTGTCCGTCCGAAATACGTACCGACCGACGTCATCGTCAACACTCTGATGACGGTGAAGAATGCGGAAATAGCAAGATTGAGGCGTAAGATCGACGAATTTGAACAAATGATAGCAGCTTACGATCAATTGGAACTTACTTGTGAACAAAAATGTGAAATAGCAAGTGCTCACGCCGCAATAAAAGCGGCCAACAAAGAGTTGGATGATATGTGTTTGGACTTGGATCTTTCTGGATACACCGAGGGCGTTGATTCTGAGGCTTATGAGACAGGAAAATCACGAGGCGATGAGTCCACAAAATACTCCCAAGAGGTTGAGAAATTTAAGGAGAAAGACATAGAAAAAGAGAAAGAGACTGCAAAATCTGGAGGCGATGAGTGGATTGTGGAAAAAGAACCGATGTCTAAGATGGAAGCTAAATGTAACCAGATAGGATCTCCGCCTTGCGTATGTGATGCGAGCACTTCTGCTTATGACAGCAGAATAGACGAAATGAAAGAACTGATTATAAACAAAGACGCGAAATTAAGTGCTATGCAGAACACTATAGCGGTAATGGAGAACGACGTATGTGAACCTTATTGTATATACGCTCACATATATACGGCTCTTGAAAAGATATTTGGCATTTTATGTCAAAACAAGAAATACAAACAATATTTAGATTTATTGACTGCTGGAAAAGACACGAGATGTATAGATATAACAGGCAAAATTTTATTTAAAATGAAAGTATTAGAGAAATTTAGCCTCGCTCTGATAGCTCCTTGCACACAGGAACGTTCAACACACGAGGATTGTTCGTGTTATAGAGCCGAAATTCTAACACATGTAGAAACAACATTTGCTTTAACATCTATGGAAAATAATAAACCCAATATAGATATTGATAGTAAAAGAGCCCAACTAGTTGCAGACATCATGCAACATCAGGAAATGCAAGAAATTTTAAGCAAAGAAGATATAACGTCAAGAAATGAAGATGAGCAATTGGATGATCCTTATAATATTGACAATTATAATATAGACGCTGAAAATATCAATCGATTAAAAAATTTACAAGCAAACTATGATGACTTGTTAAATTGCTACGAGACTTTGAAGCATGAACGAGATAACATATTTATTCAGTGCCAAAAATATGTAAATTTAGAGCAGGAATGTCAATGTTTACAAAATCAACTTCAAGAGTATAACCAGATGTGGAAAGAAAAAGAAATTTTCAAGAAACGTTCAGAAGATCTGGATAAGTTAAAAGAAAATTATTATATCCTCACGGAGGAAACATTAAACTTAGAAACAAAATTAAAAGCTGAGCAAGAAATCAATAAGATAAAATGTGAGGCTATAGATGAACTCAGAAATGAGAATGTTAGGTTAGAGAAAAAAATTACGGAAGCCTCTTTAATGTTTGAAAAACAAAAAAATAACTTAGTGTGCAAAGTCAAAGAATACGAGTGCAAAATAATGTGTCAGGAACAGCAAATAAGAAGTTTATCACAACAAATAGATAACTTTTTAGAGCAAGAAATTAATAAAACACCAACATCTGAAGAAGCATCACGTTCCATAGAGCTTCTTGACAATATCGAAGCTCAAAAGGAACAAATTAAAAACCTCAAAGACGCGATATGCTGCAATGAAGAAGAAAAACAATACCTTCAGGAAGAGTACCAAAAGAAACTTGAACTTATTAATGAACTCAAGTTTGATGTAGAGGATTTGAAAGGTAAATATGAAACAGCTGTGCAAAGAAATCATTATTTAGAAGAATATTTAGAGGAATTTCAAGATCAAATCAGTAAATTGGAAGATAAAAA
TACTCAACTCAATCATGACATAGATATAAAATCAAAGGCAATAGAAAATTTGAATACAATTTTATCCTCTAAATCTCAAGAAGTAAATAATTTGATGAATGAAGTTGATCATAAACGTAGCGAAAATAAAGAACTGTTCAATAAAATAATAGATATGGAAAGAAACTTTAGTAACAGTTTAACATCACTTAAAAATGAACAAAGAGTTGCATTAAGTTCTATTCGTTTAGCGAAACAAGAAAGTTTGGAGATTTTAAAAAGTATACAATTTGATGACTTACAAAAACAACCGAGCAATACTCAAAATAATATCAATGAAGTTAATAGTAAATTAGATGTACCAACTGTTACAAAAGACTTCTTAGACACGGAGAGTATTAATGAAAATTTATTAAAAGAAGTTCAGGGATTACGAGACATTCATTTTGAGAGTATACAAAGCTTACAAGACGAAAATAGAAATATGAAAAGATCTTTGGATGTAGCAAGTAAAAGCAGCATAGTTTTAGAATCTAAACTTAAAGACTTTGAAGAAATTAAGTGTAAGTTACACAAATTACAAGAGGATAACAATAGGCTAATTGATGAAAACGAAGATCTAAAACGAGCTTTAGACTTAAAAAATGACGAAATTAATGGTATGTTACATGTTGTTGAGCTTACTAAGAAGAATAGTGACATCTTAATAGATCAGTTACATCAATCTGAAAATATACAAGACGAATTTACTAAATTAAATAAAGCTTACCAAAATTTAATTGGTACAAAAAATATCTTGGAAGAAAAAGTTTTGCTAAAGGATCATAAGATTCAAGAACTTCTTAAAAATGTAAGCAATCTAGAAGAAGAAAATAATCGTAAAAATTTGGATCTTATAAAAATTAAATCTATTGAAAAAGAATTAATTGATTTGCATGACAAATATTCAGAATTATCCGAGGAAAAGCAAAAATTGTTGGAAGACTTCAACAATAAAACTTCTGAAATGAATAGTCTCTATAATAACTTGGAAAAAGTTATAGAAGAAAATCAAATTCTAAATAATAATATTAAAACATTGCAGTTCAGAGAAACATCAGCAAAAAGTAATCTCTCTGCCCTACAGAATGAAAATGAAACTATAAATAATAATTTCCAAGCTTTAAAAAAAGAAAGTGCAGCTTTATTGGAAAAGATTAAATTTTATGAGCCTTTAGAAACTGAATTAAATGAATTAAAAAGAGCATACCAACAAACTATTATAGAAAAAGAAAAATTGCAGCAGGATTTGAATGAACAACTAAGCGATTTACATAAATTAGAACAAGATAATAACCAAAAAGAAGAAATACTTGAGAGTGTTTTAATACAAGCTCGAGATAAGAAAGACAATTATGATGAAGTTAAAAAAGAAATAGCTTTGTTGAAAGAAGAAAAGCATTCTCAATACAAAAGAATAGAAGATTTACTAAATAAACTCGAAGAATCGGACTATTTAATAAACAACTTAAATGAAGACATTATTGCAAGAGATAAAAAAATTACAACACTCGAAAATCATATTAATGAACTAGAAGACGAAATAAGGAAGTTGCATAAAGATTTAGAAGAAGTCGTAGAAACAGGAGAAGAAATAAAACATTTGAGTTACGAAAACCTGGATCAGTGTCTTAAAACTGTGGAGGCTCATCAATCGAAGGCGACACATAATATTAAATTAGAGTTAACGAAATTACAAGATGAAAAAGCATATCTCGAAAGTCAACTTTCAAATACAAAACTAGAGTCTGAACAATCCATTCAAGATAAATACAAACTTATTGCTCAAATTGAACATCTCCAAAATGAAAGACTCATTTTGGTATCAGAAATAAAGCAATTAGAACTTAAAAGTACTGGGGACAGTACTTTAAACCCCTCGAGCAAAATAAATGATATAGTGACATCTCTTGATCGCATTAGCAAATCAATAAATAATAAAAACACTTCTTTGGAGAAAACTCTATTAAATGTGCAAGCAT
CTTACCAGCTACTTAAAACAAAAGCTAATGAAGCTAAAATATTAGCTGAAAAGGAAAGACAAAAAATTCTAGAGGAAAAGGAAGAAGCCAAAGCCGCTAGAACACGCCTTGAACAACAGGTAGAATATTTTGAAAATAAATTGAAAGAAGAGGAAAATAATCGTGAAAATATAATTCAAGACTTAGAAAGAGAAATGTTAAATCAAACTTTAATATCTGACAAAATAAAACAGTCAAAAGAAGATGAAATCCTTAAATTAAAAAATGAGATTTCGATTCTACAAAAGAAAATAAAAGAAAGCAATGAAAACAATCAGAACGAAAAAATAAAATACGCTGACATCATAAACGGCTTAGAATTAACTATTCAAGAAAAGGTAGATAAATGCCACCTTCTTGAGGATAATCTCAAGAGGTTTAATAATAAACAAATGAGTGATTTCGGCAATCAAACAAATTTGTTTGATAGAAAAATGAATTTGGAAGAAAAATCAACACAAACGAATTTTAATAACTCAAAAGCAACCCAAAATGACATCGGTTTCTCATTTCATGGCAATAAAGCACATAAATATACAATGTATAAATTACCAGAACCCCCCGTCAATTTAAACGATAGGTCTGACAATAATAATATAAAGTCTGATTCGATCTCTCCCCCAAGACATCTAAATGAAGTTCAAATATTATCAGCGGCGGTTGAACCAAATATAAATGTTGTAAAAGACATATATATTAATTTTAAAATAAAAAGACTAAGTACCAGCAAGGTAGAGCAATGTTCGATCAACTTATTAAAAGAAAATAAAACTCAAATTGAAAATCAAGAAACTATAAATTTAAGCAATGAGCTTTTAAGGCATTACGCTCCCGAGATAACTCCTGGTTATGCGCCACTAATTCCACAAAATAACAACATTCTTAGTATTTATAATACAACACTTAAAGCACCAAGTCCAATTGAAACAAAATCGAACAAAAGAACATTTTGGGATTCTAAATCGCTTGCCGGTACGAATGAAAACATTAATATCGATGACATTGATGAAAGTATCAATAAGGATAACGAAGTTAGACACAGTAGTCCAGTAATTGACTCTCAAACAGATAAAGATTTCTTCGTTATTTACACAGATACAGAAAGCATTTACGACTATCGAAGTGGAAATAAAAATGAATCGCCACAATCCCCTAATTTACCATTTGACAATGAAACTGCTACTTCAATTGAACCCCTCAGCCTAGCATTAGCTGAAAGGAAAGATATTAGACGCAAAAATAATCTAATTCCAAAATATGAATACGAAGACAGAGATGACGACAGCATAAAACCAAGATTACATATAAAAATGCCAAGAGTAACAGGCGGCAGCCCTTCAACGATAACATCCATAGCTGACAAAAGATCACTAGATTCATATACCAAAGAAAATATAGAAAAAGATTACAAAAAGAAAATAGATGCTGTTAAAATGCAATATGATTGTAATGTTAAAAGTATTATAAATGAGCACAATCAAGGAGTTGCAAGCATTCAAAGTTTACACGAGGAAACGTTACACGATATTGTAAAAATACATGAGAATGAAATTGAAAATTTAAGATCCATGAGTATAGAAGCAATGCGTAAAGCGGAGAAGCTGGAAAAAGAAAACCAATCTCTTAAAATGAAACTAAAGCTACAATATCCAGAAAAAGTAGATGAGGAACCAAAAATGTTAAGTCACGAAGGTAAAAGAAAGAAAAACAAAGGACGTATAGAGGAGAGATTGTTAACGAAAACTGATATAGAGGCATATAATGTTAGACCAAAACGTCGGTCCCATGGACCCTGCACTTGTTCTCTTGATATGAATATATCGGATACAATACGCAACATTTTCGAACAAGTCGATGTTGAACAAAGAAAGAATGCGGAGCAAACATATTTAAGGTACATTGCTAATAAGATCTTGGGGGCCAATGTTGAGTCTCTGGATGCGCAAGAATTATCATTTCTT
CATCTAAAGGTTTGTAACACATGGAAAATGAAGTTAAATAAAGAAGAAGCTCTTCAAAAAAAGTTTGACTTCCTAGAAAACGAATTGATAAACAAACAACGAAAAACAGAAAAGCATATGGCTGAACTTGATCGTAAAGTGGCTGAAGAGTATAGACGGTTACAAGAGGTAAGGGAAGCCGTGTGCAAGACTCCTCCTGAAACGCATGATGACAGTCCAATGGAGGAGGCAACTCAACGGCCCTGTACTACCAGAAAAGAAATGTGTAATTGCAATTCGGTTAGTTGCAATATTACCTTGGGAACGAGATGTTCTGCAGGGGACTTACAACCGCAAATGTCAAAACCAGGTTTAAAACTAAAACGTACGAAAATGGAGAGCAATAGAGCTGTGCTTGCTAAGTTGGACGCCGATGAGGACAGAGACAAAAAACTGTTAAATGACGAAACACCTACTAGACTGAAGCGCTCTCACGATCGTCAACATTTTCGAATTTACAGAAAAAGAGAAAGATATAACATATAA

Protein sequence:

>DPOGS214092-PA
MSFITKRERVFNEYDLFDLPARNEGKLKTYASCTDARAWSHFCSDKTEHLVEIIKRNNETVRPKYVPTDVIVNTLMTVKNAEIARLRRKIDEFEQMIAAYDQLELTCEQKCEIASAHAAIKAANKELDDMCLDLDLSGYTEGVDSEAYETGKSRGDESTKYSQEVEKFKEKDIEKEKETAKSGGDEWIVEKEPMSKMEAKCNQIGSPPCVCDASTSAYDSRIDEMKELIINKDAKLSAMQNTIAVMENDVCEPYCIYAHIYTALEKIFGILCQNKKYKQYLDLLTAGKDTRCIDITGKILFKMKVLEKFSLALIAPCTQERSTHEDCSCYRAEILTHVETTFALTSMENNKPNIDIDSKRAQLVADIMQHQEMQEILSKEDITSRNEDEQLDDPYNIDNYNIDAENINRLKNLQANYDDLLNCYETLKHERDNIFIQCQKYVNLEQECQCLQNQLQEYNQMWKEKEIFKKRSEDLDKLKENYYILTEETLNLETKLKAEQEINKIKCEAIDELRNENVRLEKKITEASLMFEKQKNNLVCKVKEYECKIMCQEQQIRSLSQQIDNFLEQEINKTPTSEEASRSIELLDNIEAQKEQIKNLKDAICCNEEEKQYLQEEYQKKLELINELKFDVEDLKGKYETAVQRNHYLEEYLEEFQDQISKLEDKNTQLNHDIDIKSKAIENLNTILSSKSQEVNNLMNEVDHKRSENKELFNKIIDMERNFSNSLTSLKNEQRVALSSIRLAKQESLEILKSIQFDDLQKQPSNTQNNINEVNSKLDVPTVTKDFLDTESINENLLKEVQGLRDIHFESIQSLQDENRNMKRSLDVASKSSIVLESKLKDFEEIKCKLHKLQEDNNRLIDENEDLKRALDLKNDEINGMLHVVELTKKNSDILIDQLHQSENIQDEFTKLNKAYQNLIGTKNILEEKVLLKDHKIQELLKNVSNLEEENNRKNLDLIKIKSIEKELIDLHDKYSELSEEKQKLLEDFNNKTSEMNSLYNNLEKVIEENQILNNNIKTLQFRETSAKSNLSALQNENETINNNFQALKKESAALLEKIKFYEPLETELNELKRAYQQTIIEKEKLQQDLNEQLSDLHKLEQDNNQKEEILESVLIQARDKKDNYDEVKKEIALLKEEKHSQYKRIEDLLNKLEESDYLINNLNEDIIARDKKITTLENHINELEDEIRKLHKDLEEVVETGEEIKHLSYENLDQCLKTVEAHQSKATHNIKLELTKLQDEKAYLESQLSNTKLESEQSIQDKYKLIAQIEHLQNERLILVSEIKQLELKSTGDSTLNPSSKINDIVTSLDRISKSINNKNTSLEKTLLNVQASYQLLKTKANEAKILAEKERQKILEEKEEAKAARTRLEQQVEYFENKLKEEENNRENIIQDLEREMLNQTLISDKIKQSKEDEILKLKNEISILQKKIKESNENNQNEKIKYADIINGLELTIQEKVDKCHLLEDNLKRFNNKQMSDFGNQTNLFDRKMNLEEKSTQTNFNNSKATQNDIGFSFHGNKAHKYTMYKLPEPPVNLNDRSDNNNIKSDSISPPRHLNEVQILSAAVEPNINVVKDIYINFKIKRLSTSKVEQCSINLLKENKTQIENQETINLSNELLRHYAPEITPGYAPLIPQNNNILSIYNTTLKAPSPIETKSNKRTFWDSKSLAGTNENINIDDIDESINKDNEVRHSSPVIDSQTDKDFFVIYTDTESIYDYRSGNKNESPQSPNLPFDNETATSIEPLSLALAERKDIRRKNNLIPKYEYEDRDDDSIKPRLHIKMPRVTGGSPSTITSIADKRSLDSYTKENIEKDYKKKIDAVKMQYDCNVKSIINEHNQGVASIQSLHEETLHDIVKIHENEIENLRSMSIEAMRKAEKLEKENQSLKMKLKLQYPEKVDEEPKMLSHEGKRKKNKGRIEERLLTKTDIEAYNVRPKRRSHGPCTCSLDMNISDTIRNIFEQVDVEQRKNAEQTYLRYIANKILGANVESLDAQELSFL
HLKVCNTWKMKLNKEEALQKKFDFLENELINKQRKTEKHMAELDRKVAEEYRRLQEVREAVCKTPPETHDDSPMEEATQRPCTTRKEMCNCNSVSCNITLGTRCSAGDLQPQMSKPGLKLKRTKMESNRAVLAKLDADEDRDKKLLNDETPTRLKRSHDRQHFRIYRKRERYNI-
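
The protein sequence above can be checked against the nucleotide sequence by translating codons. The following is a minimal sketch, not part of the original record. It assumes the standard genetic code and writes the stop codon as "-", as the record does. The `translate` helper and its `stop_symbol` parameter are hypothetical names.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nt: str, stop_symbol: str = "-") -> str:
    """Translate a nucleotide string in frame 1, ending at the first stop codon.

    Whitespace (such as FASTA line breaks) is ignored; codons that are not
    in the table (for example, containing N) are written as "X".
    """
    nt = "".join(nt.split()).upper()
    residues = []
    for i in range(0, len(nt) - len(nt) % 3, 3):
        aa = CODON_TABLE.get(nt[i:i + 3], "X")
        if aa == "*":
            residues.append(stop_symbol)
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons and last five codons of DPOGS214092-TA.
print(translate("ATGTCTTTTATCACGAAAAGG"))  # MSFITKR
print(translate("AGATATAACATATAA"))        # RYNI-
```

Both fragments agree with the start (MSFITKR) and end (RYNI-) of DPOGS214092-PA. Passing the full transcript to the same function would allow a residue-by-residue comparison.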