Monarch geneset OGS2.0

DPOGS216115
Transcript: DPOGS216115-TA, 3903 bp
Protein: DPOGS216115-PA, 1300 aa
Genomic position: DPSCF300182 + 81034-99460
RNAseq coverage: 375x (Rank: top 32%)
Annotation
Heliconius: HMEL002196; 0.0; 61.76%
Bombyx: BGIBMGA009279-TA; 0.0; 71.19%
Drosophila: CG34126-PB; 2e-162; 41.05%
EBI UniRef50: UniRef50_D1ZZL6; 0.0; 40.80%; Putative uncharacterized protein GLEAN_08067 n=1 Tax=Tribolium castaneum RepID=D1ZZL6_TRICA
NCBI RefSeq: XP_973878.2; 0.0; 40.80%; PREDICTED: similar to AGAP008379-PA [Tribolium castaneum]
NCBI nr blastp: gi|189236457; 0.0; 40.80%; PREDICTED: similar to AGAP008379-PA [Tribolium castaneum]
NCBI nr blastx: gi|189236457; 0.0; 40.82%; PREDICTED: similar to AGAP008379-PA [Tribolium castaneum]
Group
KEGG pathway 
Orthology group: MCL10984, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species
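
The lengths and coordinates listed for this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the genomic coordinates are 1-based and inclusive, and that the 3903 bp transcript consists entirely of coding sequence including the stop codon. Neither assumption is stated in the record, and the function names are hypothetical.

```python
def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS of cds_bp bases, assuming the final codon is a stop."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Values taken from the record for DPOGS216115.
span = genomic_span(81034, 99460)
protein_aa = expected_protein_length(3903)
print(span, protein_aa)
```

Under these assumptions the transcript length agrees with the listed 1300 aa protein, and the locus spans far more bases on the scaffold than the transcript contains, which is what one would expect if the gene has introns.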

Nucleotide sequence:

>DPOGS216115-TA
ATGGTAACTATTATAAAAAACCAACTGCTTAAACACTTATCAAGGTACACAAAAAACTTAAATCCCGAGCAAATATCCTTATCAGCATTACGAGGTTCTGGTGAACTCCAAGACCTGACTCTGGATGAGGATCTGTTAACTGATCTACTTGAGCTGCCAGGCTGGGTGCGCTTGACTTCCGCAAGATGCAATCGAGCCTCATTCAGGATTCAATGGACAAAACTGAAGACAGTTCCTATTGTACTGAATCTAGACGAAGTTCACATATCGCTGGAGGTATGCACGGAGCCTCGTGTCATGAAACCAGGGGCTGGTGGTGCTATGCCTATACCGGGGAAGTATAGCTATATACATAAGGTGATAGACGGTATATCAGTGGCTGTGAACCAAGTACAGATAAACTTCAAATGTGATGCCTTCACCAGTAGCGTTCAGATCTCAAGAGTGACGGTCGAGTCTCGGACGCCGGAAGGCAAGAAAGGTGATCTAAGACTGACTAGAATCAAATGCCCTGACACGGGACAGTTACTTATATTTAAGGAATTGGAGTGGCAGAGCGCCCGTATAGAGGCGAAGGCTCACGGCGCGGCCGCGGCCAGCCTGCCCCCGTTAAGGCTTCTCCTGGGGACTACTTACTGTAGGATCGTTATTAAGAAGAGACTATCAGACTGCGCGGTTCTTGGGTCCCGCCTGGTGCTGCGCCCTGAGCCGGTGGCGTGGGCTTTGACTGACGGACAGTTGAGGGCGGCACTCGCCTGTGGAGCGGCTCTCGCTGGACCTGTCAAAAGGGCCACCGAAATGGCCACTAGGACCAAAGCTGCTCATAAGATAGAGGAGCCCCGTGAGCAAATCCAGACGCGATCGTCGTCTAGCGAGCGTGACATTCTCGCACGTATGTTCGCTAAACACGACGTTCGGGAGACTTCTTATCACCTGCTCGCTCCCAGAATAGACCTGCACCTGTGTGATGACCCTGGATTGGGTAGGTCTGAAATGCCACAACTAGCCAATGGAGGAGCTCTTCAAGTGACTCTGGTGAGCATGCAATGTGACCTGTTCCCGTACCATAAAGCATCAGCTGACAGAAGACATTGGAGAGGCTACAGGGAGGCAGCAACACCTCACAGTCAGTGGCTCTCTCAAGCTTTATCTTCATTCTGCACCACTCTGTTAGAGACATTGGATCCTAGACCTATTATACAGACAAATAAGCCGAGTCAACATGAAACAAAACCGAGCCAAGAACCAGTGTCTAATAAAGAGAATCATCGTACTACCACCACTCCAACCACTACCACTACCACCACCACCACCACACAAGTGTCTCCAACGAGGACACGGATTCTACAACAGTTGGGCAGACTCATGACAACCTGTCTCGTGTTGAGGATAGAGGATTTCACTGTTTATAAGGTGTCTACAGGGTCCAAGTCTCGTGAAGCCCCCAGACCTTTGGTGAGCGCTGAGAAGGCGACTCTGCCAGGTGACGCTGGTCTCCTTCACGCTGAACTGACATTCTTCTACTATCCCGGGGACATCTGCTTCCCTGTGCCAGCCCCGAAGCTCTACGTGCAGCTGAGTCCGGTTCGTGTATCGGTGGACGTGACGAGTCTGTTGTGGTTGACAGCCTTCCTTCCTCATGTGGGTGCGGCCGTCGTGCACACCGATGACGATTCATCCTCGTATATGGACGTGCGGGCGGAGGCGATTATGCCCAAGATAGTTTTGGAGGCTGGTCCCGAGCACGTGTCGCAGCAGAGAGATCGGCCCAAGGAACTGCAGATATGTACCGCCAGGGCTACCATCACTAACATAAGGGAGTCACCCAGAGCTAATACAGCGGGGACCCGGGCTGATCTGGCGTGCATTATAGCGTCGATCCGCGAGCGAGCGCCCCCGAGAGGAAAGTTCCCAACGTCTACCCAAGACATGGACCCGGTTCACGAGAACTTCGTGCTACACGCGGAACACTTGGACGATATCGACCGTGGTACAGCCAT
AAGCCCTGAGCTGTTGTGGCGTGAGAACAGATCTATTTGGTGTGCGAGAGTGGAACCTCTGTGGGCGGATTTCTGTGGCGCCAGAGCCACGAATTATAAACCGTCGCCGTTGTTGGATGCTACGCCGCTTACTGCGTGGATTTACCAGGAGGATGGCTTTTCTCGTATCTGGGTGATAGCTCGTACATCTGGCCTCAGTGGCCTCCAGCTGCATCACTACCAGCTGCTGTTCTTAATGCGTCAGCTGGAACGGATCAGCGAGCTGACCACCTGGATGGCGCACCAGGCCAGCCGCCTGGAAGATGACCAGGGAACTATGGTGGTGGGTCTAGTAGTGCCGGCGGTGGAGTTGACGCTTGTTCTTCCCACTAACTGCCCTGGACAAGAGTCTTCTAGGGATCTGGATAGTGTTCCTCTAGATTCCTCCAGTCTTAATGATATGAAACTAGGTTCCGAGGCTACAATGGCTCCATCTATGTTGGATCGTGATAGCGGTGTTTTGGCGACGCAGGCGTCTGTGGAGGTGTTCTGTAGTCAGCCCCTACCCGCTGAAGAGATTCCTCCATCCAGCCCCGGGTTGAGTTTTGGAGGGTTCACGTCCATGCGTCGCGGCCTGACCTCCCTGGTCAGCTCCATAGACAGCGCCCTGACCCGTGACGACGGCCGCAGCGACGCCGCGTCCACCGCCAGCTCCGACAGCGACCGGTACGTGGTGGTGGGACTCGCGGCGGAGTCGCCGGACGACGCGGACGTAGCATTCAGGGAGTTCGAACACGGTCGTTTGTCCAGCGGCGTGGAGGTGGCTGCCGAGGTGATGGAACGATCCTCATCACCGAGCGACCACTCCATCACCAGCTCCTGTAAACGACGAGACGTTATATCTACATGCACGATTCGTCTGAACGGCATCCACGTGGTGCAGCAGAGTAACGCCGGCACCACCAGCATGAGATTAGCGGCCGATGATGTAAAACTGGACGAGTGCCCCGCCATACCCTGGGACGAGTTCCAGAATAAGTTCTCTATGAGGGCGCGCGCTTGGTCGGACCTGGACGAGGGGGAGAAGACTGGTGACGCACCCAAGGTTACACTTAGGCTGCTCAGGACTGAGCTGCCGCGGACTGAAGAGGAGAAGAGAACGCCCGGGGCTCTAGCTAGGGCGTCAGAGTTGCTGGAGGGTCAAATACGCTGTCTGAACCTCTCTCTCGGTATGAGCACGGCACTCGCCCTCTCAGAGTTCATAGAGGACGAGGTCATCGCGCCTCCGATGCCTCTAGAGGTTCTAATAGAGAATTTAAAACTGCATCTCATAGAGGACAGGCCGACCCGATCCATTTCATCGCCGCCTCCTCAGCCTTTAGACCTCAACCTGTCCACTATCAAACTGAGCCGGGATTCCTCGGGGGTCGTGCACCTGGGACCGCCCACCATCGATGAACCGTCCCCGGAGACGACGTCCCCACAACAGTCCATCGCGGATGAAGTACAAAGACTGAACGAAGAGAACGAGGAGCTCAAGAAACGTTTGGCGACACTCAACAGAATAGCAGAAGACAATAGGGAATTGAGAGCTAAGCTGGAGGAGGCGTCGGTTCTTCGTCAATGCGCTCACGCAGCCCAACAGGAAGCAGAGCGACTCCTGGCTGACAAACACGACCTGTTGCAGACAGTCAGTGTACTTAAGGATCATTTCGTAGCAAGAGGTAGCGCTCCCGCAGCGGGCACGGCTGTCTCGATTGCACTGCCTGCTGCCCTGACCACCGACTGTAGACCTCTGACACCACCACAGATGTTACAGGACGAGGACCCAGTTATATTGGAAATTATGATGCCAGTGTTAACGCCCGGTGTTGATTACATTGGACTTTATATACCAAATACCGTAGATCCAGTTGAACAGTGA

Protein sequence:

>DPOGS216115-PA
MVTIIKNQLLKHLSRYTKNLNPEQISLSALRGSGELQDLTLDEDLLTDLLELPGWVRLTSARCNRASFRIQWTKLKTVPIVLNLDEVHISLEVCTEPRVMKPGAGGAMPIPGKYSYIHKVIDGISVAVNQVQINFKCDAFTSSVQISRVTVESRTPEGKKGDLRLTRIKCPDTGQLLIFKELEWQSARIEAKAHGAAAASLPPLRLLLGTTYCRIVIKKRLSDCAVLGSRLVLRPEPVAWALTDGQLRAALACGAALAGPVKRATEMATRTKAAHKIEEPREQIQTRSSSSERDILARMFAKHDVRETSYHLLAPRIDLHLCDDPGLGRSEMPQLANGGALQVTLVSMQCDLFPYHKASADRRHWRGYREAATPHSQWLSQALSSFCTTLLETLDPRPIIQTNKPSQHETKPSQEPVSNKENHRTTTTPTTTTTTTTTTQVSPTRTRILQQLGRLMTTCLVLRIEDFTVYKVSTGSKSREAPRPLVSAEKATLPGDAGLLHAELTFFYYPGDICFPVPAPKLYVQLSPVRVSVDVTSLLWLTAFLPHVGAAVVHTDDDSSSYMDVRAEAIMPKIVLEAGPEHVSQQRDRPKELQICTARATITNIRESPRANTAGTRADLACIIASIRERAPPRGKFPTSTQDMDPVHENFVLHAEHLDDIDRGTAISPELLWRENRSIWCARVEPLWADFCGARATNYKPSPLLDATPLTAWIYQEDGFSRIWVIARTSGLSGLQLHHYQLLFLMRQLERISELTTWMAHQASRLEDDQGTMVVGLVVPAVELTLVLPTNCPGQESSRDLDSVPLDSSSLNDMKLGSEATMAPSMLDRDSGVLATQASVEVFCSQPLPAEEIPPSSPGLSFGGFTSMRRGLTSLVSSIDSALTRDDGRSDAASTASSDSDRYVVVGLAAESPDDADVAFREFEHGRLSSGVEVAAEVMERSSSPSDHSITSSCKRRDVISTCTIRLNGIHVVQQSNAGTTSMRLAADDVKLDECPAIPWDEFQNKFSMRARAWSDLDEGEKTGDAPKVTLRLLRTELPRTEEEKRTPGALARASELLEGQIRCLNLSLGMSTALALSEFIEDEVIAPPMPLEVLIENLKLHLIEDRPTRSISSPPPQPLDLNLSTIKLSRDSSGVVHLGPPTIDEPSPETTSPQQSIADEVQRLNEENEELKKRLATLNRIAEDNRELRAKLEEASVLRQCAHAAQQEAERLLADKHDLLQTVSVLKDHFVARGSAPAAGTAVSIALPAALTTDCRPLTPPQMLQDEDPVILEIMMPVLTPGVDYIGLYIPNTVDPVEQ-
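
The nucleotide sequence above is wrapped over more than one line, so it has to be joined before use. The following is a minimal sketch, not part of the original record, of reading FASTA text into a dictionary and checking that a sequence looks like a complete coding sequence (ATG start, stop codon at the end, length divisible by 3). The function names are hypothetical and only the standard genetic code's stop codons are assumed.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of the standard genetic code


def read_fasta(text: str) -> dict:
    """Parse FASTA text into {record_id: sequence}, joining wrapped sequence lines."""
    chunks = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The record ID is the first whitespace-delimited token after '>'.
            name = line[1:].split()[0]
            chunks[name] = []
        elif name is not None:
            chunks[name].append(line)
    return {key: "".join(parts) for key, parts in chunks.items()}


def looks_like_complete_cds(seq: str) -> bool:
    """True if seq starts with ATG, ends with a stop codon, and has length divisible by 3."""
    seq = seq.upper()
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Toy input for illustration; substitute the record's FASTA text to check DPOGS216115-TA.
example = ">toy-TA\nATGAAA\nCCCTGA\n"
records = read_fasta(example)
print(records["toy-TA"], looks_like_complete_cds(records["toy-TA"]))
```

Applied to the record above, the joined DPOGS216115-TA sequence begins with ATG and ends with TGA, which matches the trailing `-` on the protein sequence if that character marks the stop codon.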