Monarch geneset OGS2.0

DPOGS206407
TranscriptDPOGS206407-TA4533 bp
ProteinDPOGS206407-PA1510 aa
Genomic positionDPSCF300181 - 302704-314487
RNAseq coverage164x (Rank: top 51%)
Annotation
HeliconiusHMEL0068920.055.45% 
BombyxBGIBMGA013871-TA5e-15552.84% 
Drosophilassp3-PA3e-4542.35% 
EBI UniRef50UniRef50_D6WRG63e-5754.26%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WRG6_TRICA
NCBI RefSeqXP_973792.24e-5854.26%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|1892387679e-5754.26%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|2700100072e-5629.48%hypothetical protein TcasGA2_TC009337 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL22340 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206407-TA
ATGGAAGAAGTGAGACAACTGGTACAGGAAGAGGGCAGGGAAGCGAGAAACCTGGTCGCCTTCCATGTTTCCGTGGACACGCCAGCTCGTGTTTCTCGGAAACCTCCTCGAGCATTACCACCTCCTCGTCGAGCTGCGCCGCGGACCTCTCGTCTGAGATCCGCGTCAGCTGGTCGCGATAAGCGTTCCGAATTACGAGCGAGATACTGGGCTCTATTGTTCGGAGACTTGCAACGTGCAGTTGGCGAAATCTACAATACTGTGGAAGCTCACGAAAACTTAAACGAGTGCCAAGAAGTTATACTAGTACTTGAGAACTATACGCGAGATTTCAAGGCTTTAGGCGAATGGTTTCGATTGAAATGGGAATACGACAATACGCCACCACCACAAAGACCGCAGAGTTTAGCTTGGGAGATACGAAAAACTGATTTTGTTAGACCAGAATCACGACGTACTTATTCCATGAAAAGTTCACCTTCAGTGTCAGGGAAGAATAGTCCCTGTTTATCAGGACATAACACCCCAAGTGGCAAGAATAGTCCCAATCTGACGGGAAAAGCCAGCCCAGGTAGCGGAAAAATAAGTCCAAGGATCACTCATCCTACCTCTAGTCCTAAAGCTAACGTGGAGTTTTTTACCAATAAGCTGGTCGCTTCTAAAATCACAGCAAAACCGGAGCCGAAGATCGCTAAAGAATCAAGGCTATCTGATATCAAAGAAAAAGAGGTGAAAGTAACATGCGATGTGAGTGACACTGAAGAGAATTTGCCAGACGAAAATAATAAAGTAACTACCATTGAAACTGGCCTCACTAAAGAAGTTAAGAACGCTCTTCCTCAAAGAGTTGCTATAAAATTTGCCAAATCAAAAATATCAGAGACAAAAACTACCGAAAATGAAATTTCAGATCAATGCAAAACAACTATTGCACAGTTGGATGCAATGGAAAACGAATTTTTAAAGAAAAATCTTATTGAAAGTCAAGTCAAGAATGATAATATGATCAACCAAGAGATGGTCATTAAATCTGATCCTACATTTACGTCTGCCGAAGATAAGAAAATTGAAATTAAGATTGAAGAAAATGATAACACGTGTTCGGAGTCGCCGACCAAGTCTGAGGACAATTTCGCAGATATATCAAGTGAAAGCTCTCCGAAGTCTGAAGTGGAGAAACCAGATAATAAAAAAGATCAAAAAAACAAATGCAAAAAAACATCGAATTGCAATACAAAGCAAATCGAAGAAAAGAAGGTCGCTCAACCCAAAAAACAGATAATTGAAATAAAAGAAGCTATTGCTACAAAAGAAGAACATCAAGTGGAAGAACTGGAAAATGATATTGATACGAACATTAGCAATAATAATAACATAGAGCCAGTTAATCAAAAAGACATAAAAGATATATCAAATAGCGAAACTGCCATTAGTAATGGCTGGGGCGACGTAAATTCTGAAAAGATGAATACTGGATTAAACTTTAATAATGTTGAAGAAATAAGAAAAATCTCGGCCTTAGTGGCCAAAACGGTTGCTAATGGCCCTTCTCACCTAAGTAATGAAATCAAAACTAAAAATGATTTTTCTTTGGCATTAGATAACGTACAGGAAGAAAATATAAAAAATCACATTCAAGATGTGCAAAAAATTCTCGAAAATGATAAGAAATCGAAGGAAATAAAAGATATTACGAAGGAAGTGATCATGTTGTCTAAAGACGAGGTTGACGTCAAAAAAGAAATAGAAGTGAAAAAATTAGACAACAAGCAAAATAATGATTCAAAACAGTCGAAGCCTGCTTATTCACAAGCAGTTGCGAAACCCAAAGCCGTAAATATAACGAAAAATGAAGTGAAAACACCCACACGACTCATCCGAAGCTCAACTGTCGTAGAAATACGACCGAGTACTGTAGCAAAGCAACCAAAAACCTGCCAAAAGAAAACCAATAAATGCAGTTACCCATTCAATTTGACTACAGGTCGCACCACACTATTTGATGGTGTTCCAGCTACTCCTGCGACCAAACCTGTCGTAACAAAACGCCCAATAATAAAAAGCAACCAAAATATGGCAGGTGGTGATGCTAAACCCGTAACTGGTAAAGAGAATCTAAAAAAACGTCCGCCAAGACCGATTTCACTGATCGCTAAAGAAAAAGAAGAGAAGAAGAATAGAGTCAATCAAGCTCTGGAGAAAGTAAAGGATTGCAAAGATTACAGTAGCTCTGACAGCGTTATAACTCAGATCGACATGTCACGTATAGAATCAAGTGAGAGCCTGAAAACTCTTGTATCCGAAGATCATCAGGTTTTAAAAGATGTTGCCAATTCTATCGAGGTTCTAAATGTTGATAGCAATAAAAATGATAACGAAGGCTGGCTTACAGTCAAATCTCGTCGTCTCAGTCGTGAATCCAAAAAGCATTCAAAATCCCACTGGGCTAACAGATTCAATCAGCCTTCTGCCACGACAAGTCTACCGACATTAAACATGCTCGAATCTCCTAAACAGGAAGATAACAGCATCTTAAATCCAAAACCTGTCAATGTAGACCGAGCTAAATCTGAACAACCACAAACTCAAGATGACCCACCGCTTACTAAAATTATAAATGGTTCAGAACAACCACAGAAACCTCAACTGCAAGAGAAGAAAACAGAGAAACCAGAAAAAGTTGTTAAGGTTGTCGCAAAACTGAAGTCAGAGAACAAAACTAACGATCCATCAATGATTAGACAAAAATCTGACGTCACTGGCTTAAAGACAAAATCGTCTAGAAACAAAGTGCTGACTAAAAAAGCAGAAAAAGTCAAAGACTCCCACAAAGACGACGATATGTCAGACTTGACTAAGAACCGCTTGCATTCCTCGTTAGAAAGTCTAACATCCGCCTTGGCCAGATCACAGGAAAGCACCGAAGAGGCGTTCGATTTCGACAAATGGAAAGCTGAATTCAAATCTACTTTCAAGTATCTAGAAGAAGACGATCAGATCGCTGACCAGACCGAGATATTGAAGTCCGCGGATCCTTCGGAGATGTCCGAAATAGCCGCGATGACGTCACAGATTGAAGAGAACGAGAGGAAAATAAGCCTGGCATTAGATTTCCAGTCCGAAGTCGATCAGAGGAAACTTTGCGCTGAGGAGGATTTATTGAACAGGCAAATCCTGGAGCTGCAACAAGTATCGGACATTGATATCGACACCGAAACCGATGACACAGAGGCGAGTATCAGCTTTGAGACCGATGCCGAGATCCTCATAGAAGATCCGCCTCAGGTTCTATCAGAGAACTTGGGTTCGCTAGCGGCAAGTCTAGAAGATCAGTACGAGACGGCGCTAGCTGGGATGACCTGGGCGGAGAGAGTGGACACGCTCGTAGTACTGGAGGCTTTGGTGGCTAGAGATCCTGTTAAAGCCGTTTTCTTAAATTTCCAGGAACAAAAAGCTGCACGTGAAGCCGCTGTTGGCGCTCGTCGTCAAGCTATTGAAGCGGCTCGAAAGTTGCACGTCGAGAACTTGATGGAGAGGCGGAAGGAGAGAGACGCCAGGGTTGAAGAGATGAAGGGCTTGAAGAAACAGGAGAGAGATGACGCAGCCAGGGAGAAGGCTGAAGGTGTAAAGTCCCGTCTATCGGCATTAGCAGCGGCCGCTGAGTTGGAGGCAGATGCTTTAAGATCTCGTATAAGGGACAAACAGGACGCCAGCCAGCAGAGGCTCGAGTCACACTTACAGGCCATACGGGAGAAAGCTACCGGGCCCAGACCCGCAACAATATCCGAAAGCCAGGACCAAGCTGAGAAGGAAGATTTAGAATCGAAGAGAATAGACCAGGAAAGGAAAGAGAGAGAGAAGCAGAAGGCTGTTAAGCGGAAGGCGAGGAAGATCAAGCAGAAACTACTGGCTGGGCTCGTACTGAGAAGCCAAGCCCCAGTGTTCGAGGACAATATAACGACGTTGGACACAATGCTGAACCCACTCACAAGCAAAGTATACAGAGCTATTAGTTCCATACGTAACATAATGGCGCCTTTCGTACCGGAATGCGAGAAAGATGACGATGAAAAGGAAAAGGAAAAGGAAAAAGAAAAGAAGCAAACCCCGCAAGTGAATGGAGAAGCGGAGAAGAACAAAGAAGATCCGAGTGTTAATGGAGAATGTTCCAAGTTCTGCGAGCCTAGCACCAGCGGGACGCAGAATGTTCCAGAAGTTAAATCCAAGAAGAAGAAGAAAAAGAAGAACAATAACAATAATAATGATGATAAAGGGAGATGGAGTAAAGCGAGTTCTGTTTGCAGTTCTTTGATTGAAATGAATAGGAACGCTAGGATTGAGAGGAAGATAGACATTGATCTGCCGTATCTGGAGAAGCAGATGAATGAACTGTACAGGATGATGGAGAAGTTTGAGAAGCACTTCCTAGAGAAGGCGTCAGTGCAAAGATTCCAAGCTGTGAAAGTCATTAGTCAAATGCTAGCTGTCGCTGCGGAGAAAGAAGATCTCAACCCGCCTATATCAGCCAAGTAA

Protein sequence:

>DPOGS206407-PA
MEEVRQLVQEEGREARNLVAFHVSVDTPARVSRKPPRALPPPRRAAPRTSRLRSASAGRDKRSELRARYWALLFGDLQRAVGEIYNTVEAHENLNECQEVILVLENYTRDFKALGEWFRLKWEYDNTPPPQRPQSLAWEIRKTDFVRPESRRTYSMKSSPSVSGKNSPCLSGHNTPSGKNSPNLTGKASPGSGKISPRITHPTSSPKANVEFFTNKLVASKITAKPEPKIAKESRLSDIKEKEVKVTCDVSDTEENLPDENNKVTTIETGLTKEVKNALPQRVAIKFAKSKISETKTTENEISDQCKTTIAQLDAMENEFLKKNLIESQVKNDNMINQEMVIKSDPTFTSAEDKKIEIKIEENDNTCSESPTKSEDNFADISSESSPKSEVEKPDNKKDQKNKCKKTSNCNTKQIEEKKVAQPKKQIIEIKEAIATKEEHQVEELENDIDTNISNNNNIEPVNQKDIKDISNSETAISNGWGDVNSEKMNTGLNFNNVEEIRKISALVAKTVANGPSHLSNEIKTKNDFSLALDNVQEENIKNHIQDVQKILENDKKSKEIKDITKEVIMLSKDEVDVKKEIEVKKLDNKQNNDSKQSKPAYSQAVAKPKAVNITKNEVKTPTRLIRSSTVVEIRPSTVAKQPKTCQKKTNKCSYPFNLTTGRTTLFDGVPATPATKPVVTKRPIIKSNQNMAGGDAKPVTGKENLKKRPPRPISLIAKEKEEKKNRVNQALEKVKDCKDYSSSDSVITQIDMSRIESSESLKTLVSEDHQVLKDVANSIEVLNVDSNKNDNEGWLTVKSRRLSRESKKHSKSHWANRFNQPSATTSLPTLNMLESPKQEDNSILNPKPVNVDRAKSEQPQTQDDPPLTKIINGSEQPQKPQLQEKKTEKPEKVVKVVAKLKSENKTNDPSMIRQKSDVTGLKTKSSRNKVLTKKAEKVKDSHKDDDMSDLTKNRLHSSLESLTSALARSQESTEEAFDFDKWKAEFKSTFKYLEEDDQIADQTEILKSADPSEMSEIAAMTSQIEENERKISLALDFQSEVDQRKLCAEEDLLNRQILELQQVSDIDIDTETDDTEASISFETDAEILIEDPPQVLSENLGSLAASLEDQYETALAGMTWAERVDTLVVLEALVARDPVKAVFLNFQEQKAAREAAVGARRQAIEAARKLHVENLMERRKERDARVEEMKGLKKQERDDAAREKAEGVKSRLSALAAAAELEADALRSRIRDKQDASQQRLESHLQAIREKATGPRPATISESQDQAEKEDLESKRIDQERKEREKQKAVKRKARKIKQKLLAGLVLRSQAPVFEDNITTLDTMLNPLTSKVYRAISSIRNIMAPFVPECEKDDDEKEKEKEKEKKQTPQVNGEAEKNKEDPSVNGECSKFCEPSTSGTQNVPEVKSKKKKKKKNNNNNNDDKGRWSKASSVCSSLIEMNRNARIERKIDIDLPYLEKQMNELYRMMEKFEKHFLEKASVQRFQAVKVISQMLAVAAEKEDLNPPISAK-