Monarch geneset OGS2.0

DPOGS214207
Transcript: DPOGS214207-TA, 3372 bp
Protein: DPOGS214207-PA, 1123 aa
Genomic position: DPSCF300014 + 302425-306846
RNAseq coverage: 86x (Rank: top 63%)
Annotation
Heliconius: HMEL006807; 0.0; 91.19%
Bombyx: BGIBMGA005934-TA; 0.0; 86.49%
Drosophila: CG8211-PA; 0.0; 44.41%
EBI UniRef50: UniRef50_E0VMM5; 0.0; 59.44%; Integrator complex subunit, putative n=13 Tax=Neoptera RepID=E0VMM5_PEDHC
NCBI RefSeq: XP_623755.2; 0.0; 60.91%; PREDICTED: similar to integrator complex subunit 2 [Apis mellifera]
NCBI nr blastp: gi|383858000; 0.0; 61.52%; PREDICTED: integrator complex subunit 2 [Megachile rotundata]
NCBI nr blastx: gi|383858000; 0.0; 61.44%; PREDICTED: integrator complex subunit 2 [Megachile rotundata]
Group
KEGG pathway 
Orthology group: MCL13596; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214207-TA
ATGGATATCGAATTTATGAAACCCGTTAAGCCTCTAGTTTTCAAGGCTTTAAAAGATGTCGATATTGAAACTTTAATAAAATGCACACCGGATGAAATAAGACCGATCATACCATGTCTAGTCCGTATGGCTCTTATAGCACCTCTTGATATAACTAGATATTGTGCCGAGGCTAAAAAAGACATCCTGACTCTACTATCTGGGATTGATCTAGTAAATTTCATCGTATCTTTACTGTCTATTGAATTTCATGCTCTAGAAGTGGATCTAAAGAAAGAACAACAAATGCGCTTAAAAAGTGGATCCCAGAATACTGAATCCTTTTTAATACAGAATGTAGTAAATGGAATTGCAAATGACTTTGAACAGTCAGATTCCGCAAGAAGAGTCCGACTTGTTCTCTCTGAGTTGCTGCAGATGCAAGCGCAGTTGGCAGAGTATAATCAGAATAAAAATTCAAATTCTGAATCTTCTATAAAACCATCGGAACTCTTTGATAATGAGGTGTATCTTGAAGAAATTACAGATGTTATTTGTATAAGTCTTGCTGAATTACCAAATCTTTTAAATATATGTGAAATTGTTGAAGTATTACTGCATGTGAACAAGGGACCAATTATTATTTCTTGGGTTGTAGCAAATATGCCTGACACACTTTTAGATGTGGCAGAATCTTTGGTTTTAAATGCTGAAAGAGGAGAAGAAGGTGGCATTAGAGCCAAAACTTTATCCACATTATGTGACGCCTGTCCCTATATTGCAACAGCTGTTAGAGCAAAAGCTGTATCTGCTTCCAGACTACCGTGTTTAATAATAAACCTCACTTTGACACATCATCAAGACTTGGTATCCTTTATATCTGGTTTGCTATTGGGTTCAGACCAGAGTACCAGAACATGGTTTGCAACATTCTTACGTAACTCCCATAAAAGGGGGAAAGGAGATGGCCATGCAATATTGGTGAAGTTACGCCAAGAACTTCTGATTAGATTAAAAGAAGCTTCAGCTGGGGTTGATGCCTCTGCATTATTAAGGTTATACTGTGCCTTGAGAGGAATCGCGGGAATAAAGTTCCAAGATGATGAGGTGTCAGGACTCTTACGACTTGTGACACAAAAGCCACCGCCAACTCCAGCTGGTGTGAGATTTGTTTCCTTGAGTTTATGTATGATCCTAGCATGTCCTTCACTTATGGCTGCTCCTGAATATGAGAAGAAAGCAATAGAATGGGTACAATGGCTTGTAAAGGAAGAAGCTTATTTTGAAAGCAATTCAGGCGTCACAGCTTCGTTTGGGGAGATGTTGCTGCTAATAGCAATCCACTTCCACTCTGGACAGCTGACGGCCGTCGGTGAACTAGTCTGTGCTACACTTGGCATGAGGGTCCCCGTGCGACCAAACGGACTTGCGAGGATCAAGCAGGCCTTCACACAGGAAATATTTACTGAGCAGGTCGTCACTGCACATGCTGTTAAAGTACCTGTCACTGCAAATCTCAACAGCAACATATCCGGTTATTTGCCTGTGCATTGTATTCACCAATTACTGAAGTCGCGAGCATTTTCGAAACATAAAGTGCCAATAAAAAATTGGATATATAGTCAAATTTGCAACTGTATTGCTCCCTTACACCCTGTAATGCCAGCCCTCGTCGAAGTTTACGTCAATTCTATTCTGGTTATTAATAATAAAGGAACAAATGAATACTTCAACAAGCCAATAACAGAAGAAGAAATACGCAGGGTATTCCGAAAATCTATTTTTGGTGTTAATTATGACTCAAACAGCAAACCATTTACTTCTATGGATGTTGATAGTGATTCCACAGTTGACATAAACATTGAGAAACCAACTCTAGCCTCACAACTATTATTGATCTATTACCTGCTCCTGTATGAAGATGTAAGATTGGCTAATACAGCTATACTGATTGCCAATGGAAGAAAAGTGAAAAGTTATTCAACAACATTTCTTTCCGAATTGCCAATAAAGTATTTGCT
ACATCAAGCCCAGAAAGATCAAATGAGTTATGGTGGTCTTTTCAGCCCGCTGCTTCGTTTGCTTGCGACTCATTTTCCGCAGCTATCGCTTGTAGATGATTGGATGGATGACCAGGTCTTTGGAGATTCCTGTCGTCACCAAATAGACATTAATCTTTCAGAAGTATCTATAACTGAAGCATTCCAGTGCATCGAAGAAAATCCATATAAAACGGGTAAAATATTAAAAGCCATGCTTAATAAAAATCCTACTGACATATGGCCTTTTGCAGAAATATTTGTTAAATACGTGAAGAGTGTGTTAGGAGGTAGAGTCCCAAGACATATACAAGAACTCTACAGAGAGGTTTGGTTGCGTTTAAACACGGTTCTACCCCGATGTTTGTGGATATTGACAATTAACGCGTTGCTGGATATAAATAATGGATGCGGTAAATACGTTACCATAACACAGGAAAACGTTCTAGTTGATCCTTTACAAGTCTTAAGATGTGATATAAGAGTATTTAGATGTGGTCCTATATTAAAAATAATTCTGAGAATTTTAGAAGCGAGCTTAGCTGCATCGAGAAGCCAGTTAAGTCGCCATTTATTGGACAAGCCACTTCTTGAAAAAAGCGGCCAATTGACATCAGACTCCGAGAGGGAAGAATTGAAAAATGCCTTAGTTGCCGCTCAAGAAAGTGCAGCACTACAAATTTTACTAGAAGCTTGTTTGGAGACTGAAGAAGACCAATCTAAACCCGAACTAATGTGGTCTTTGAAAGAAGTACGAAGTATAATATGTTCGTTTTTACATCAAGTGTTTATAGCTGAGCCATCACTTGCAAAATTAGTACACTTCCAAGGATATCCGAGGGAATTATTGACAGTAACCGTCCAAGGCATACCGTCAATGCACATATGTTTAGATTTTATTCCTGAACTTCTAAGTCAAGCTTCTCTAGAGAAACAAATTTTTGCTGTGGACTTGGTATCTCATTTATCAATTCAGTATGCTTTACCCAAAGCTATGTCCATTGCGAGGTTATGCGTGAATACTCTATCCACCCTCCTATCTGTCCTACCAAGTGACCTGCGTCTGGAACTCTTCCAACCAGTTTTAAAATCGCTCGTACGGATTTGTATCGCATTTCCCTCCTTACTTGAAGATATTACATCGTTATTGTTACAGTTAGGTCGAATTTGTGAATCTCAGGTATCACTTGGCCATTGTTGGAATGACACAAATATATTGGGCGAAGGAGCTTATGTATCCTCTGAAGTTCACAATGACAGTAAAGTATTACTCGCCGAGGTTTTATGTAGGGACATTAAATCAACAATGTCAGAAATTATACAGAAAGCACTTTTAAATGATAAACTGTATTGA

Protein sequence:

>DPOGS214207-PA
MDIEFMKPVKPLVFKALKDVDIETLIKCTPDEIRPIIPCLVRMALIAPLDITRYCAEAKKDILTLLSGIDLVNFIVSLLSIEFHALEVDLKKEQQMRLKSGSQNTESFLIQNVVNGIANDFEQSDSARRVRLVLSELLQMQAQLAEYNQNKNSNSESSIKPSELFDNEVYLEEITDVICISLAELPNLLNICEIVEVLLHVNKGPIIISWVVANMPDTLLDVAESLVLNAERGEEGGIRAKTLSTLCDACPYIATAVRAKAVSASRLPCLIINLTLTHHQDLVSFISGLLLGSDQSTRTWFATFLRNSHKRGKGDGHAILVKLRQELLIRLKEASAGVDASALLRLYCALRGIAGIKFQDDEVSGLLRLVTQKPPPTPAGVRFVSLSLCMILACPSLMAAPEYEKKAIEWVQWLVKEEAYFESNSGVTASFGEMLLLIAIHFHSGQLTAVGELVCATLGMRVPVRPNGLARIKQAFTQEIFTEQVVTAHAVKVPVTANLNSNISGYLPVHCIHQLLKSRAFSKHKVPIKNWIYSQICNCIAPLHPVMPALVEVYVNSILVINNKGTNEYFNKPITEEEIRRVFRKSIFGVNYDSNSKPFTSMDVDSDSTVDINIEKPTLASQLLLIYYLLLYEDVRLANTAILIANGRKVKSYSTTFLSELPIKYLLHQAQKDQMSYGGLFSPLLRLLATHFPQLSLVDDWMDDQVFGDSCRHQIDINLSEVSITEAFQCIEENPYKTGKILKAMLNKNPTDIWPFAEIFVKYVKSVLGGRVPRHIQELYREVWLRLNTVLPRCLWILTINALLDINNGCGKYVTITQENVLVDPLQVLRCDIRVFRCGPILKIILRILEASLAASRSQLSRHLLDKPLLEKSGQLTSDSEREELKNALVAAQESAALQILLEACLETEEDQSKPELMWSLKEVRSIICSFLHQVFIAEPSLAKLVHFQGYPRELLTVTVQGIPSMHICLDFIPELLSQASLEKQIFAVDLVSHLSIQYALPKAMSIARLCVNTLSTLLSVLPSDLRLELFQPVLKSLVRICIAFPSLLEDITSLLLQLGRICESQVSLGHCWNDTNILGEGAYVSSEVHNDSKVLLAEVLCRDIKSTMSEIIQKALLNDKLY-
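
The lengths reported in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the function names are hypothetical, the genomic coordinates are assumed to be 1-based and inclusive, and the trailing "-" in the protein sequence is assumed to mark the stop codon.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Coding length in bp implied by a protein length (3 bases per codon)."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 3372
protein_aa = 1123
genomic_start, genomic_end = 302425, 306846

# 1123 residues plus one stop codon should account for the whole transcript.
cds_matches = expected_cds_length(protein_aa) == transcript_bp

# Genomic bases in the span that are not part of the transcript.
extra_genomic_bp = span_length(genomic_start, genomic_end) - transcript_bp

print(cds_matches, extra_genomic_bp)
```

Under these assumptions the transcript length agrees with the protein length plus a stop codon, and the genomic span is longer than the transcript, which would be consistent with the gene model containing introns.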