Monarch geneset OGS2.0

DPOGS211604
TranscriptDPOGS211604-TA3351 bp
ProteinDPOGS211604-PA1116 aa
Genomic positionDPSCF300232 - 91111-108149
RNAseq coverage177x (Rank: top 50%)
Annotation
HeliconiusHMEL0034040.078.83% 
BombyxBGIBMGA008219-TA3e-16868.79% 
Drosophila% 
EBI UniRef50UniRef50_D2A3B92e-10152.20%Putative uncharacterized protein GLEAN_07953 n=2 Tax=Tribolium castaneum RepID=D2A3B9_TRICA
NCBI RefSeqXP_001815376.13e-10252.20%PREDICTED: hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|2700058416e-10152.20%hypothetical protein TcasGA2_TC007953 [Tribolium castaneum]
NCBI nr blastxgi|2700058411e-14140.66%hypothetical protein TcasGA2_TC007953 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL15557 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211604-TA
ATGGCCAGGACTGTAAGGAAACTAAAACGAAGTAAAGGGAAAGAGGCAAAACAGCTCCCGGCCATAATAATAACCCCGCCCGAGTTAGACATGAAATTCAAATCTACCAACTACAAGGAAAACTCATCCACAACCGACATGGAGTTCAGTGTTCACGATCTGATAGCTCAGCTCTCCGAGAACTCGAAGGCATCGAAGGAGGAAATAGAGAGCATACAGAGGAAGCTGCTCCATCAGGCCAGCGAGATCCTGAAAGTGGACAGGTTCGCCAACCAACCGTCTCCAGACCCACGTGTCGTGAGCGCCACCCAAAAATACAACGGGCCGATCTACGGCAGACCCATATCGATCCGGAGACAATCGGTGATCAATCAAGACGCTCAATTAGTGCCCGAGAAATTACAAGAGAATGTCAGCGCGGTAAACAAGCATCAGGTCAACGTGGTCGGCGCGGGAAGAGGAGATGTCTTTAACGGAACGGGTTCTGAGGCTAGAGTCGCCAGTGCGCGCGACGCGGCCGTCCGAGGTACCTCAGAGCCGGTGCCTCCGTACAGAATGCCGCCAGCGCCCGAGGCGGCTCTACCCGGTGCCCCCGCGCCCCCCGTCCAAGGCATACACACGCACGCCAAGTTTCCCATAGAGCGAGATGTAATTCTGTCGTCTGGAGATAAGAATTCAAAAGTGCCGATATTACAAGAGGCTAGAAAGAGAGGGAGGCTGGGAGCAGTGGCTCCCGATACACCCCCCAAAGCTCTGTCAGCGTGGACTCATACTGAGAATCAACAAGCCGGCCCATCTGGTGACTTTAGACAATACGAACAAAACTACGGGTATGCTATGCACCAAAACGTTAATCCGAATCAGGGTGCTGTGGCTAGAAACTTACAGAACTATGACTCACCAAAACCTCCTGTACCAGCCAAGAACGCTTACAAACCGGAACAAAACAATTCTCATATAACAGTCACAGTCGAAACCGGAAAAGATAACACAAAAGATGCTCCTAAGAGCAACAAGTCGGTTAGCAAAACTTATCACACTTTAAAAGACATGATATCGAGTAGATTTAAAAATAAAGACGGGAACGATGCAGAGAAAAACAACGAGGAAGCCAGACTAAATAATAACGAAGAACGGAAAAAGCCAGATCAAGAACCAGTAACACCACGAGAGACACCGAGGAAAGTTGAACAAGGCATTTATGGAAGACCAATGCCACAAAATCGACCAGATATGCAATACAATCAAGGCATGCCAAACAATATGGCATATCACAGCCCTTCCCCTCATAGACAGTTAATTCACCAGCAACAGCAAATAGTCCAACAGCAAATGATGTTGAACCATCAGGCTCGCTCTCAGGAGATGTTGGCTCACCGACCACAGGCTTTGGGTGCAGACGCTCTTTACCAATACGGGCCGCCAGGAAGACGTAGTGCTGTTTATCAAAGGGAAGATTTGAGGTCTTTGGCAAATTTCACATCGCTGAAAACAACTCCACAACCACAATTCGAAAATTCACATTCGAGGCTTGATCTCAGAAGTCCACAACAATTAGAAAGGGATATTGGACGGCAAAGAGGATTGGGTGAAGGAAGGCGAGCAGCGTCACATCCACATCTTCTAGAAGAGATACAACATCGACAAGAAATAGTCAGCCCCCAAATTCATAATCGATCTAGGAGAAATTCCCAAGCAAACTTGTTAGATGGAATATCTCATGAAAATGATCCCATAAGAAATAACGAAGAGCGGGAGTCTGATGACGGTGGTTTTAGACTGAGGCATGCTACCCAAAGTAGGTTAAGTTATGAAGAAAGAATTCGAACCAGTGGTCGGTCCTTAGAATCACACCACGAAAGATCTCATGAAATGTATCGAAGGACACCCGATAGCCATAAAGAATCAAGAAGAACGCCCGATTCCTTAAGCCTAAGACAAAAAGATGAACCTTCAACATCCAGAGAAATAGAGAGGAATGACGAAAGTGCGAGTCAAAAATCAGCAGACAGTGTGTATAACTCCAGTGGAAAAGCCGAGGCGTACACTCCCCAACCGTCTTCCTCAAGACAAACACCGAGTAGGATCGAAGACTTAAAGGCTCATGGAAAGAAAGGACCCAGTGGATCAGGAGCCAGTTCGGATTATGATAAAACCGGCGGTCAATCTTCCAACGTGGATTCAGGTCGTGGGAGCGCTGCGAACTCGAGCGGGAGACGCGCAGAGACCACACGAGCACCTCCGCATGATGCCACAGCTGCACCAGAAAACGAATGGGCAGATTTAGTGGAATGCGAGTTGCGTCAAATCCTGGAGCCGAAGCTCTCCAGCATGAGGTTGGACAGCTCGGCCAGTTCGGATGGATCGGTCACGCCTCCACTACCACCGCTGTCTCCATCTTCAGATCTTCACAAACGGAACAGTCTTCCCGGCCGTGTTGAGTATTCTGACGATCGACGTCGCCGCGAGTCCCCTCGCTGGCCCTCTCACTCACACTCGCACTCACACAAGAAATCGTCAAAAAGAGATCATCATTACAAGAAGCACTCCTTTGGCCCTGACACAACGGACGTCACTTCAACGACGACACGCAGTCTGGATCTGTCTTCCTTGTTAGATGCAAGAACAGACAGCGACGCATCCACAGATGCACGCGCCATACGAAGGCAGCTCCGAGGACTGGAGAACATGTACGGGGAGGTGCTGCAGTTGTTGGGGGTCAGGAAACCAGCTGGAAAGAACTCCTGGGAGGCACGGTTAACTTCCAAGCGTCGTTATGGCAGCATGTCCTCGCTGCCGTCCAGCTCCGTCAGCAGTCGACCTGTCAGGGATAAACGAAGGTCATCCAACGAACATCGGAAGAAGAATGATTATAAGGGCATCAACAAGCGCTTCCAGCGGCTTGAATCCCACGTGGTGACACTGGCTCGGTCGGTGGCGCACTTGTCGTCCGAGATGAGAACACACCACTTGGTGCTGCAAGAGATGGACACCATCCGCGCCGAACTGGCCGCCCTCAGGCACATGTACAGATCTGGCGCCCCAAGTCGAAGACGCACTTCAGGGTTCAGTGACCCCGAGCGTGTGAAACGTCTCACCAAATTCTTTGGAGATGAACCACCGCTCATGAGACTGTTCCTCAAGAAACTTGGATACGAGAAATATGCAGCTCTTCTTGAAAAGGAGAAGGTGGGCGCGGCGGAACTGCCCTACGTCGGGGAGGACAAACTCAGAGCCCTAGGAGTTCCATTAGGTCCTAGGATGAGAATACTCAAAGAGGCTGGGATCCATCAGGACCTACATTTATCTAGAGATGATCATAACACAACGACTACTTTGGCTATAGTGTAA

Protein sequence:

>DPOGS211604-PA
MARTVRKLKRSKGKEAKQLPAIIITPPELDMKFKSTNYKENSSTTDMEFSVHDLIAQLSENSKASKEEIESIQRKLLHQASEILKVDRFANQPSPDPRVVSATQKYNGPIYGRPISIRRQSVINQDAQLVPEKLQENVSAVNKHQVNVVGAGRGDVFNGTGSEARVASARDAAVRGTSEPVPPYRMPPAPEAALPGAPAPPVQGIHTHAKFPIERDVILSSGDKNSKVPILQEARKRGRLGAVAPDTPPKALSAWTHTENQQAGPSGDFRQYEQNYGYAMHQNVNPNQGAVARNLQNYDSPKPPVPAKNAYKPEQNNSHITVTVETGKDNTKDAPKSNKSVSKTYHTLKDMISSRFKNKDGNDAEKNNEEARLNNNEERKKPDQEPVTPRETPRKVEQGIYGRPMPQNRPDMQYNQGMPNNMAYHSPSPHRQLIHQQQQIVQQQMMLNHQARSQEMLAHRPQALGADALYQYGPPGRRSAVYQREDLRSLANFTSLKTTPQPQFENSHSRLDLRSPQQLERDIGRQRGLGEGRRAASHPHLLEEIQHRQEIVSPQIHNRSRRNSQANLLDGISHENDPIRNNEERESDDGGFRLRHATQSRLSYEERIRTSGRSLESHHERSHEMYRRTPDSHKESRRTPDSLSLRQKDEPSTSREIERNDESASQKSADSVYNSSGKAEAYTPQPSSSRQTPSRIEDLKAHGKKGPSGSGASSDYDKTGGQSSNVDSGRGSAANSSGRRAETTRAPPHDATAAPENEWADLVECELRQILEPKLSSMRLDSSASSDGSVTPPLPPLSPSSDLHKRNSLPGRVEYSDDRRRRESPRWPSHSHSHSHKKSSKRDHHYKKHSFGPDTTDVTSTTTRSLDLSSLLDARTDSDASTDARAIRRQLRGLENMYGEVLQLLGVRKPAGKNSWEARLTSKRRYGSMSSLPSSSVSSRPVRDKRRSSNEHRKKNDYKGINKRFQRLESHVVTLARSVAHLSSEMRTHHLVLQEMDTIRAELAALRHMYRSGAPSRRRTSGFSDPERVKRLTKFFGDEPPLMRLFLKKLGYEKYAALLEKEKVGAAELPYVGEDKLRALGVPLGPRMRILKEAGIHQDLHLSRDDHNTTTTLAIV-