Monarch geneset OGS2.0

DPOGS201241
TranscriptDPOGS201241-TA4563 bp
ProteinDPOGS201241-PA1520 aa
Genomic positionDPSCF300037 + 1435-20059
RNAseq coverage195x (Rank: top 48%)
Annotation
HeliconiusHMEL0032120.053.48% 
BombyxBGIBMGA012465-TA0.047.67% 
DrosophilaCAP-PT1e-1729.06% 
EBI UniRef50UniRef50_B4J6W84e-1427.63%GH21778 n=1 Tax=Drosophila grimshawi RepID=B4J6W8_DROGR
NCBI RefSeqNP_001137637.14e-1629.06%CAP, isoform L [Drosophila melanogaster]
NCBI nr blastpgi|1950286362e-1327.63%GH21778 [Drosophila grimshawi]
NCBI nr blastxgi|1544134682e-2920.12%viral A-type inclusion protein [Trichomonas vaginalis G3]
Group
KEGG pathway 
Orthology groupMCL17554 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201241-TA
ATGAGTTTGTTCCAAAAAAGACCATCTCAATTGATTGACGGTGCCGAGAAGGCGACCCCACACCCCCCGGTCTACGCAACCATCAAACAACCGAACTCGCCAAACAAACTGCCAGCGAAATCTCCGTCGAACAAATCAGTACAACTCGTGAAATCCTTCGAAGACACACAGAATTTTCATCCAACCAACCCTTTCTATTCAACACTGCCTAACACTAAATTACAAATTACAAACGGCAAATCGAATTCACTCCACAGATCTGTAAATAGAACCAATTACGACGACCTGAAAATGTCATCGTTCTTTAAGAAAAATGATCCCGGAGCTGGTTCACCTAACGCTGGTTATCAAACGAGCGAGACCTCGCCGAGACGGTTTTCGTTCGACAAATTCAGTTATGACGTTAAGGATTCAAATAACGAAGATCAGATAAAAAAGAATCCATTCAATAATGGGACAAATGATTTCAATCCAACTGACAAATTGGACGGGTCGCCCAACGGTGCTAGCATGATATATTCCAAGAGCGACAGTAACTTCCGTAGAAACGAATTTACCGAATACACCGAACGAAAATCAGTAACGGAGGAGATGATTAGCGAGACAGAGGAAATTAAAACCATTAAAAAGATAACACTCAACGGTTCTAGTGAAAGTGATAAACCAAACGGTGTGCACAAATTAGACAGTGATCGTAATAATAAAGAGCGAATCGTTCCTGTGACGCTGTATGATGAAGGTATCAAGGTCAACGAGAACAGAAGGAACGTGGGAGAAAACACACCAAGAGAAGAATATAAAGAAGCGACCGAAACAGAAGACAGTCTAGACTCGGGCTGCGAGAAAACAGTCGGGCCACGAACGAAATACAACGACTCTAAGAGAACTATAAACAAAGTTAGCGAGAAAATTCTGAAGGCGATTGAGCGATTAACATGTAATATTGGCAAGGTGTCGAAAGTTAAGAAATCTGAGAAGATTAAAAAAGTACGAAGCAGTCGGCTGCCGCTATCGGACTCCCCTCGTCGCACCGTTCGTACGTCGCCGCTTAAAAATAAAGTCCACGGCCCGCGCCTGCAATGCAGCGGATCTAGTACTAGTACGAATGAAAATTTAAAAGAAGCTACCGAAAATATCAATCAAATTAACGTTGACCTTGATGAGCCAAACTCAAAAGATAAATCCTTGAATAATCTTGAAATTTCTAACGCGGGGCATGAGGCTAAAAATGAAGGCTTCGAAGAATCTTTGAAAAGTAGTGACATGGAAACAGGTTTGTCTGAAAAATACATCGTTGAAACTCCAGATCAAAATTCAAAATCTATACCTAAATTAAATAATGCAAAGAAAAGTAAGAGTTGTGACGATGATGATTCTCCTAAAATTATTGAAATCACAGATGACAGTTCTCAAATAGATGACATCAGCTTGTCTTTAGTTGATTCTAATAACGTTATTGTTACTACTGAAGACCCTGAAGTTGAGTGGGAGAAGGCCGAACTATTAAATTTGCCACAAACTGAAATCATAAAAGGTATACTTTCTGAAAGCCATAATATAAGTCTCAATACAGCACAATGTGAACAAACTAAATCTCTCAGTCCAGAAGAAGAATTATCTTTACGTCATTATCTTCAAACCCTCAATTTATCAACAAATCCAAACGTGAATAATGCAGATATTAAATCGGAAATCGAACATATAATTAACCGTGAAATCAAATATCGCTTGAGGAGAAAAGGATTGGCCGAAGAATCATTTTTTGAGCGTTCCGGACCGTCGAGATCTTTAGCTGTGATTGATGAGGAAGGGAGTGGAGATTCGTCTAAAACATCAAGACGTCATTCATATTTGAGTGATAAAAAGAGCGATACTGAGGAATTGGAGGACGAAGTTTTTGAAAGTAAAGATTCACAAATAAAAAGGAGAAGAGATACAAATTTCCCACACCAGTCAAATTTAGTTTCACGTCATGTCATTCCACAGCAATGCATAAAAGTTGATGCCAAAATCATGGAGCCAGAACTATGTGAAGCTCGTGGAGACTGGAAAATGGAAACGATTGAAAAAATTACAGGAGCAGAGTTGGTTTACTTAACAGACTCATCAAGTAGTACAAGTTCAATATACGAGATGAGCGATGATGCGGATAAAGGTCATGAAACTGATGTTTCTGTTCGTATGATAACTCCAACAATTGAGGTTACTGATACAGAATCGCTACTAAAGAACACATTTTTATCAAAAGGCACTGAAAATAAATATAAGAATGTGATTGACAGCGATAACAATATAACTAACGTTGAAGAAAGAAATGTTCATCAAGATAGTAATTTAATTATGGGTAAAGAAGATATCATTGATACCGAGAAATCAAAAATAGATATTCATGAAATAATAGTATTGAGTACAGATGATATAAAAACAGATCTGTTTGTTAAAGATATTAATGTAAACGAAAAAAGTACGGATGCTTTAGAACTCAACAAGAAAAACTCAAATGAAATTGATAAATATGATCTTGAAATAAAAGTTTTAAAATGTGAACTCAACGATGCCATAAATAACTTGATTAAAGAAGTTTCCGACTCAGAAAATGCTAGTGATATTAATATGAGTGATTCTAAAGAATTGTTCACTAGACAAGACTCTTCAAGTAGCCTCGATTCCTCACAATGTACAGCCAAATATAATCCTACAAGTTCATCCCTAAATGATGTTTCTAGTTTTCTGTGTAATGAATCACAGGAAGAGATTAAAGAAGAGAAAATAGTAAGGAAAACATTGAGTCATGTTAAGGATGTTATTCAGCATGTAGATCAAAAGTTAGTACCAGATATAGAGGCGGACTTTGTAACTGAAAAGCCATCTACGTTACGAGATATATGTTTAAAAAGGATATCTTCTTTTCCTTTTGGTGAAAAAGTTTTGGAAGAATTGGCTAACGTGTCAAAACGTCTGCAGGATATAAGTAAGTTGTCAACGAATAACAAAAATATTAATATTCAAGATATTAACTGTCAGTTCCACATTAAGGACGATGTGTTGTCTCAACAAAAGCAATTACATAATAAAGAAAATAAACATGCCCCACCCCCCGTACAACCTAGAAAGTCATCATTAAAAAAGGCTAAAGAAGAACAAAGCGTTTCAATACCACCCTTACCGGAAAAGGTTTATGAATGTTTATCTCCTTCACAGAAAATGTTAATGGAAAAGACCAATACTGTTATAAATAGAGACGACATAATTGCTCCTTCTAAACCAAAATTTGAACAAACTGAAAGATCACGAATTGATAGAAAAAGGGAAAGTGATCCTTGTGTGGTACCGATGAAGTCGGAAACTGGAAGTAGATTACTAGCGTTACTTCGTAATACATCCTCTCCGAAAAAATTAAGTTTACCCACCTTAATTGATGATACTTACTTTCAAAACCCAAAACCGTCATCGCCTCGTACCGATTCTTTCTCTGTACGAATGTTGCAAGAACTAGATGCATCAAATAATTTCAAAGAAACGATAAACAATCAAAAGTATATGTCTTCCATCCCTCCAATAAATTTTAAACCTATTCCGCCACCAAAACCAAGAAAAATATACACATATGAAAGTGATTCTGAATTTCTAAGTGATAGTTCATTCAGGTCCATGCGTAGTGACAAAAAAGTATTTCATTACTCTACTGGGAATTTAAATGAAGAAATAGAGAATGATATTTCTTCAATACAGAATATGCATAGGCAATGTACAAACTTACGAAATAAAAATTCAAACATTGTGTATCCACGACGACCTTCGTTACCCAAAGATTTATGTGATCAACAAATGGAATACATACGTCAAAAAGAAAAAGAAGTTGACGCAGAAATAAAACGGCTTGAACAGGAGCAACTAAAGTCTTCACAGAGACGGGGTCCTAGAGCGCCAATGATTTCTGAAAAAGATAATAATTACAGTGGACTTTTTGAGACTGAAAAATCTTTTAAAAAAGAAAGAATCTCCAATCACCCCAATTCTAAAAATGTTGAAAACAGAAAGTTACATTCATTCTTCAGTAGTAGCGAAGAAGAATTATTAAGAGAAAAAATGTATACCGAATATATTAATCAAATGGCTGAAAGAGAACAGAGAAAGCATCACAGAGTTATAAAAGTATCACAGACACCCTCTAGTAGCAATTTGATATCAAAAAGCATGCCATCATTAAATTATTTGGATTCCAAAGTGAATAATCGTATAGAACAGGAATTTATATCGAAGGCAAAGGAAAGGTGGAATAAATTAGGTATAAGAGATCCAGAAACTGAGGACGAAAGAGAACCTAGAGACGTGTATAAAGAGCCCAAAGTGATAGAACATAAAATACGAGTGATTGAAGACGGTCAAGAAAAAGATGTGCGAAAATTGCCTAGTCATATGCAAGAATTTGTAAGGTTTACAGTGAGAGATAAGGATAAGGACGAAAATACAGGTAGCACCGTTTACAATTACAGTCCCGATGATAGACAGCTGTTAGCGCTGGGTCACGTGATAACTACATTATCAGAACGAGGATGTTGA

Protein sequence:

>DPOGS201241-PA
MSLFQKRPSQLIDGAEKATPHPPVYATIKQPNSPNKLPAKSPSNKSVQLVKSFEDTQNFHPTNPFYSTLPNTKLQITNGKSNSLHRSVNRTNYDDLKMSSFFKKNDPGAGSPNAGYQTSETSPRRFSFDKFSYDVKDSNNEDQIKKNPFNNGTNDFNPTDKLDGSPNGASMIYSKSDSNFRRNEFTEYTERKSVTEEMISETEEIKTIKKITLNGSSESDKPNGVHKLDSDRNNKERIVPVTLYDEGIKVNENRRNVGENTPREEYKEATETEDSLDSGCEKTVGPRTKYNDSKRTINKVSEKILKAIERLTCNIGKVSKVKKSEKIKKVRSSRLPLSDSPRRTVRTSPLKNKVHGPRLQCSGSSTSTNENLKEATENINQINVDLDEPNSKDKSLNNLEISNAGHEAKNEGFEESLKSSDMETGLSEKYIVETPDQNSKSIPKLNNAKKSKSCDDDDSPKIIEITDDSSQIDDISLSLVDSNNVIVTTEDPEVEWEKAELLNLPQTEIIKGILSESHNISLNTAQCEQTKSLSPEEELSLRHYLQTLNLSTNPNVNNADIKSEIEHIINREIKYRLRRKGLAEESFFERSGPSRSLAVIDEEGSGDSSKTSRRHSYLSDKKSDTEELEDEVFESKDSQIKRRRDTNFPHQSNLVSRHVIPQQCIKVDAKIMEPELCEARGDWKMETIEKITGAELVYLTDSSSSTSSIYEMSDDADKGHETDVSVRMITPTIEVTDTESLLKNTFLSKGTENKYKNVIDSDNNITNVEERNVHQDSNLIMGKEDIIDTEKSKIDIHEIIVLSTDDIKTDLFVKDINVNEKSTDALELNKKNSNEIDKYDLEIKVLKCELNDAINNLIKEVSDSENASDINMSDSKELFTRQDSSSSLDSSQCTAKYNPTSSSLNDVSSFLCNESQEEIKEEKIVRKTLSHVKDVIQHVDQKLVPDIEADFVTEKPSTLRDICLKRISSFPFGEKVLEELANVSKRLQDISKLSTNNKNINIQDINCQFHIKDDVLSQQKQLHNKENKHAPPPVQPRKSSLKKAKEEQSVSIPPLPEKVYECLSPSQKMLMEKTNTVINRDDIIAPSKPKFEQTERSRIDRKRESDPCVVPMKSETGSRLLALLRNTSSPKKLSLPTLIDDTYFQNPKPSSPRTDSFSVRMLQELDASNNFKETINNQKYMSSIPPINFKPIPPPKPRKIYTYESDSEFLSDSSFRSMRSDKKVFHYSTGNLNEEIENDISSIQNMHRQCTNLRNKNSNIVYPRRPSLPKDLCDQQMEYIRQKEKEVDAEIKRLEQEQLKSSQRRGPRAPMISEKDNNYSGLFETEKSFKKERISNHPNSKNVENRKLHSFFSSSEEELLREKMYTEYINQMAEREQRKHHRVIKVSQTPSSSNLISKSMPSLNYLDSKVNNRIEQEFISKAKERWNKLGIRDPETEDEREPRDVYKEPKVIEHKIRVIEDGQEKDVRKLPSHMQEFVRFTVRDKDKDENTGSTVYNYSPDDRQLLALGHVITTLSERGC-