Monarch geneset OGS2.0

DPOGS212437
TranscriptDPOGS212437-TA5352 bp
ProteinDPOGS212437-PA1783 aa
Genomic positionDPSCF300258 + 150816-162433
RNAseq coverage342x (Rank: top 34%)
Annotation
HeliconiusHMEL0086400.092.10% 
BombyxBGIBMGA002895-TA0.093.05% 
Drosophilapea-PA0.092.43% 
EBI UniRef50UniRef50_Q145620.082.97%ATP-dependent RNA helicase DHX8 n=200 Tax=Eukaryota RepID=DHX8_HUMAN
NCBI RefSeqXP_001661730.10.093.05%ATP-dependent RNA helicase [Aedes aegypti]
NCBI nr blastpgi|1571295710.093.05%ATP-dependent RNA helicase [Aedes aegypti]
NCBI nr blastxgi|1954362340.093.27%GK22168 [Drosophila willistoni]
Group
Gene OntologyGO:00043864.4e-38helicase activity
GO:00055241.6e-20ATP binding
GO:00036761.6e-20nucleic acid binding
GO:00037239.4e-09RNA binding
GO:00080263.3e-07ATP-dependent helicase activity
KEGG pathwayaag:AaeL_AAEL0115340.0 
 K12818 (DHX8, PRP22)maps-> Spliceosome
InterPro domain[1518-1608] IPR0075024.4e-38Helicase-associated domain
[605-786] IPR0140016.5e-32DEAD-like helicase
[1642-1741] IPR0117097.6e-29Domain of unknown function DUF1605
[1353-1457] IPR0016501.6e-20Helicase, C-terminal
[308-394] IPR0160273.4e-13Nucleic acid-binding, OB-fold-like
[296-383] IPR0123403.5e-13Nucleic acid-binding, OB-fold
[309-381] IPR0229679.8e-13RNA-binding domain, S1
[308-369] IPR0030299.4e-09Ribosomal protein S1, RNA-binding domain
[612-760] IPR0115453.3e-07DNA/RNA helicase, DEAD/DEAH box type, N-terminal
Orthology groupMCL10030 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212437-TA
ATGGATGAAGTTTCAAAGCTAGAACATCTTTCTCTGGTGTCCAAAATATGCACAGAGTTGGACAACCATTTGGGATTGAATGATAAAGATTTAGCTGAGTTCATTATCGACTTGGCCGACAAGAACCCAAACTTTGACAATTTTAAAAAAGCACTTATCGAGAATGGTGCGGAATTTTCTGACTCATTCATGACAAACCTGCTACGTATTATTCAACATATGAAACCTTCTGAAAATCAAGCTGACGGGCCACAGAAAGAGATCAAGAGCAGTAATCCTCTGGCAAGCAAATTTCCAGGTTTGGCTATTCCCAATGATAAACCTAGTAAATTTTCATCAGATGATGAAAGCGAAGATGATAATAAAAATACGAGAATAACATCAAAGGACATATTTAAAACTGAATCTAAAGCAAAGGAGTCGTGTGTCGATGTTGAGGATGCGATGGCAGCGCTAGAGGCGCTAGCACCGTCCAATATAAACAAAGGCAAGGATGAGTCAAAGAAAGATATCATAAAGAAGAGAGACCAGAGCGGCGATCGTAGCGAAACTATTAGGAAACGAGAAAGATCGAGGGAAAGAAAGCGCAGTAGAAGTAGGGAGAGGAGACGCCGCAGTAAAGATAGAGACAGACGTAGCCGTAGTCGTAAGAGAAGTCGCAGTAGAAGCAGAAGAAGTCACAGCAAAGAAAGAACAAATAGAAGTAGAGATCAGAGAAGTAGAGACAGGGATAGACGAAGACGTTCCAGAAGTAAACCTCGACATAGATCCAGGTCTAGAGATAGGCAACGGAGATCCAGATCTACTAATAGAAGATCTAGATCCAGATCCCGTTCATATGAAAGAAAAGAAAGAAATAGATACAATGATTATGGCAGAAATCAGAAAAGAAGAAGCGCTGAGGTGGAGATGACCGATGATCCTGAACCGGGGAAAATTTACAATGGACGAGTGGCAAACATAGTACCGTTTGGATGCTTCGTACAGATGGAAGGGCTGCGGAAGAGATGGGAGGGTCTCGTACACATCTCTCAACTCAGAGCTGAAGGTAGAGTCACGAATGTATCCGATGTAGTGTCCAGGGGTGACAAAGTTAAGGTATTACTGTCAGTGACCGGACAAAAGGTATCACTGACGATGAAGGATGTCTGTCAAGAGTCCGGCAAGGATTTAAATCCGACTTCACATGCACATCTAGAGGTGGAGCGTTCAGGTCGCAACCCGGACCGTCCCCCGGCCGTGTTGGCGGGACTCCAACTAGACCCTGATGAAGACTCCAGCCGCAAACGGGTCACCAGGATATCCAGTCCCGAGCGATGGGAGATCAAACAGATGATATCATCAGGTGTGATCGATAAAAGCGAGTTGCCAGATTTCGATGAAGAAACGGGTCTTCTGCCCAAAGAGGAGGACGGAGAAGCGGACATCGAGATAGAACTGGTCGAGGAGGAACCGCCCTTCCTACAAGGTCACGGGCGAGCTCTACACGACCTGTCCCCTGTTAGAATAGTCAAGAATCCTGATGGATCACTAGCGCAGGCCGCCATGATGCAGTCCGCTCTGGCGAAGGAGAGAAGAGAACAGAAGATGATACAGAGAGAACAGGAGATGGAGAGTCTGCCGACCGGTCTCAACAAAAACTGGATCGATCCTCTACCGGAAGCGGACGGGAGGGCGTTAGCGGCTAACATGCGAGGCTCGGGCATAACACCGCAGGACTTGCCCGAGTGGAAGAAACACGTCATCGGGGGGAAGAAATCTTCATTCGGCAAGAAAACTAACCTGTCCCTCCTGGAGCAGAGGCAGTCCCTGCCCATTTACAAGTTGAGAGACGAATTGACCAAGGCCATATCCGACAACCAGATCCTGATAGTGATAGGAGAGACGGGTTCCGGGAAGACGACTCAAATCACGCAGTACGTCTGCGAGTGTGGCGTGTCCGGGCGGGGCCGTGTGGCGTGCACCCAGCCCAGGAGAGTGGCCGCCATGTCCGTCGCCAAGAGGGTCGCTGAGGAGTTCGGCTGCAGGCTGGGTCAAGAGGTCGGCTACACCATACGATTTGAGGACTGCACCGGACCCGACACGGTCATCAAGTACATGACAGACGGTATGTTGCTCCGCGAGTGTCTGATGGATCTGGACCTGAAGAGCTACTCCGTCATCATGCTGGACGAGGCCCACGAGCGCACCATACACACGGACGTGCTGTTCGGCCTCCTCAAACAAGCGGTCCAGAAACGACCGGAACTCAAACTGATCGTGACATCCGCCACCCTGGACGCCGTGAAATTCTCCCAGTACTTCTTCGAGGCCCCCATCTTCACCATACCCGGACGGACCTTCCCCGTCGAGGTTCTGTACACAAAGGAACCGGAAACGGATTACCTGGACGCCTCCTTAATAACCGTCATGCAAATACATCTGCGTGAACCGCCCGGGGATATTCTGCTGTTTTTGACCGGCCAGGAGGAAATCGACACCGCCTGCGAGATACTGTACGAGAGGATGAAGTCCCTCGGCCCGGATGTACCTGAGCTGATCATTCTTCCGGTTTACTCCGCCCTTCCGTCTGAGATGCAGACCAGAATCTTCGAACCCGCTCCGCCTGGCTCGAGGAAGGTGGTGATAGCTACCAACATAGCGGAGACCTCGCTCACCATAGACGGCATTTACTACGTAGTGGACCCCGGGTTCGTCAAACAGAAGGTCTACAATTCAAAGACCGGTATGGACTCGTTGGTCGTCACCCCGATCTCACAGGCGGCGGCGAAAGTCGCTCGGCCAGCACGCCCAGCGACTGGCCCGGGGAAGTGTTACCGACTGTACACGGAGCGCGCATACCGGGATGAAATGTTGCCCACCCCTGTCCCGGAAATACAAAGGACTAATCTCGCCACTACAATGATATCATCAGGTGTGATCGATAAAAGCGAGTTGCCAGATTTCGATGAAGAAACGGGTCTTCTGCCCAAAGAGGAGGACGGAGAAGCGGACATCGAGATAGAACTGGTCGAGGAGGAACCGCCCTTCCTACAAGGTCACGGGCGAGCTCTACACGACCTGTCCCCTGTTAGAATAGTCAAGAATCCTGATGGATCACTAGCGCAGGCCGCCATGATGCAGTCCGCTCTGGCGAAGGAGAGAAGAGAACAGAAGATGATACAGAGAGAACAGGAGATGGAGAGTCTGCCGACCGGTCTCAACAAAAACTGGATCGATCCTCTACCGGAAGCGGACGGGAGGGCGTTAGCGGCTAACATGCGAGGCTCGGGCATAACACCGCAGGACTTGCCCGAGTGGAAGAAACACGTCATCGGGGGGAAGAAATCTTCATTCGGCAAGAAAACTAACCTGTCCCTCCTGGAGCAGAGGCAGTCCCTGCCTATTTACAAGTTGAGAGACGAATTGACCAAGGCCATATCCGACAACCAGATCCTGATAGTGATAGGAGAGACGGGTTCCGGGAAGACGACTCAAATCACGCAGTACGTCTGCGAGTGTGGCGTGTCCGGGCGGGGCCGGGTGGCGTGCACCCAGCCCAGGAGAGTGGCCGCCATGTCCGTCGCCAAGAGGGTCGCTGAGGAGTTCGGCTGCAGGCTGGGTCAAGAGGTCGGCTACACCATACGATTTGAGGACTGCACCGGACCCGACACGGTCATCAAGTACATGACAGACGGTATGTTGCTCCGCGAGTGTCTGATGGATCTGGACCTGAAGAGCTACTCCGTCATCATGCTGGACGAGGCCCACGAGCGCACCATACACACGGACGTGCTGTTCGGCCTCCTCAAACAAGCGGTCCAGAAACGACCGGAACTCAAACTGATCGTGACATCCGCCACCCTGGACGCCGTGAAATTCTCCCAGTACTTCTTCGAGGCCCCCATCTTCACCATACCCGGACGGACCTTCCCCGTCGAGGTTCTGTACACAAAGGAACCGGAAACGGATTACCTGGACGCCTCCTTAATAACCGTCATGCAAATACATCTGCGTGAACCGCCCGGGGATATTCTGCTGTTTTTGACCGGCCAGGAGGAAATCGACACCGCCTGCGAGATACTGTACGAGAGGATGAAGTCCCTCGGCCCGGATGTACCTGAGCTGATCATTCTTCCGGTTTACTCCGCCCTTCCGTCTGAGATGCAGACCAGAATCTTCGAACCCGCTCCGCCTGGCTCGAGGAAGGTGGTGATAGCTACCAACATAGCGGAGACCTCGCTCACCATAGACGGCATTTACTACGTAGTGGACCCCGGGTTCGTCAAACAGAAGGTCTACAATTCAAAGACCGGTATGGACTCGTTGGTCGTCACCCCGATCTCACAGGCGGCGGCGAAACAGCGAGCCGGTCGTGCGGGTCGGACTGGCCCGGGGAAGTGTTACCGACTGTACACGGAGCGCGCATACCGGGATGAAATGTTGCCCACCCCTGTCCCGGAAATACAAAGGACTAATCTCGCCACTACAGTGCTGCAACTCAAGACGATGGGTATAAACGACTTGCTGCACTTCGACTTCATGGACGCCCCGCCCGTGGAGTCCCTCATCATGGCGCTGGAACAGCTACACTCCCTGTCTGCCCTCGACGCCGAGGGGCTGCTCACCAGACTCGGGAGACGGATGGCGGAATTCCCCTTAGAGCCGAACCTGTCCAAGATTCTGATTATGTCCGTAGCTCTGCAGTGCTCCGACGAAATACTGACAATAGTGTCGATGTTGAGCGTACAAAATGTCTTCTACCGACCCAAAGACAAACAGGCCTTGGCCGATCAGAAGAAAGCTAAGTTCAACCAGGCGGAGGGTGATCATCTGACGTTGTTGGCTGTCTATAACAGCTGGAAAAATAATAAATTCTCCAACGCCTGGTGTTATGAGAACTTTGTCCAGATACGTACGTTGAAACGGGCGCAGGACGTTAGAAAACAACTGCTGGGGATTATGGATAGGCATAAGTTGGACGTAGTGTCAGCAGGGAAGAACACGGTGAGGATACAGAAGACCATCTGTTCAGGGTTCTTCAGGAATGCAGCAAAGAAAGACCCGCAAGAAGGGTACAGAACACTAGTTGATAGTCAGGTTGTCTATATTCACCCCTCCAGCGCCCTATTTAATCGGCAGCCAGAATGGGTAATTTACCATGAGTTAGTACAAACAACAAAAGAATATATGAGAGAAGTAACTACAATCGATCCCAAATGGCTCGTCGACTTCGCCCCGGCATTCTTCAAGTTCTCAGACCCGACGAAACTGTCCAAATTCAAGAAAAATCAACGATTGGAGCCTCTGTATAATAAATATGAGGAACCGAATGCTTGGCGTATATCACGCGTTAGGAGACGTAGAAATTAA

Protein sequence:

>DPOGS212437-PA
MDEVSKLEHLSLVSKICTELDNHLGLNDKDLAEFIIDLADKNPNFDNFKKALIENGAEFSDSFMTNLLRIIQHMKPSENQADGPQKEIKSSNPLASKFPGLAIPNDKPSKFSSDDESEDDNKNTRITSKDIFKTESKAKESCVDVEDAMAALEALAPSNINKGKDESKKDIIKKRDQSGDRSETIRKRERSRERKRSRSRERRRRSKDRDRRSRSRKRSRSRSRRSHSKERTNRSRDQRSRDRDRRRRSRSKPRHRSRSRDRQRRSRSTNRRSRSRSRSYERKERNRYNDYGRNQKRRSAEVEMTDDPEPGKIYNGRVANIVPFGCFVQMEGLRKRWEGLVHISQLRAEGRVTNVSDVVSRGDKVKVLLSVTGQKVSLTMKDVCQESGKDLNPTSHAHLEVERSGRNPDRPPAVLAGLQLDPDEDSSRKRVTRISSPERWEIKQMISSGVIDKSELPDFDEETGLLPKEEDGEADIEIELVEEEPPFLQGHGRALHDLSPVRIVKNPDGSLAQAAMMQSALAKERREQKMIQREQEMESLPTGLNKNWIDPLPEADGRALAANMRGSGITPQDLPEWKKHVIGGKKSSFGKKTNLSLLEQRQSLPIYKLRDELTKAISDNQILIVIGETGSGKTTQITQYVCECGVSGRGRVACTQPRRVAAMSVAKRVAEEFGCRLGQEVGYTIRFEDCTGPDTVIKYMTDGMLLRECLMDLDLKSYSVIMLDEAHERTIHTDVLFGLLKQAVQKRPELKLIVTSATLDAVKFSQYFFEAPIFTIPGRTFPVEVLYTKEPETDYLDASLITVMQIHLREPPGDILLFLTGQEEIDTACEILYERMKSLGPDVPELIILPVYSALPSEMQTRIFEPAPPGSRKVVIATNIAETSLTIDGIYYVVDPGFVKQKVYNSKTGMDSLVVTPISQAAAKVARPARPATGPGKCYRLYTERAYRDEMLPTPVPEIQRTNLATTMISSGVIDKSELPDFDEETGLLPKEEDGEADIEIELVEEEPPFLQGHGRALHDLSPVRIVKNPDGSLAQAAMMQSALAKERREQKMIQREQEMESLPTGLNKNWIDPLPEADGRALAANMRGSGITPQDLPEWKKHVIGGKKSSFGKKTNLSLLEQRQSLPIYKLRDELTKAISDNQILIVIGETGSGKTTQITQYVCECGVSGRGRVACTQPRRVAAMSVAKRVAEEFGCRLGQEVGYTIRFEDCTGPDTVIKYMTDGMLLRECLMDLDLKSYSVIMLDEAHERTIHTDVLFGLLKQAVQKRPELKLIVTSATLDAVKFSQYFFEAPIFTIPGRTFPVEVLYTKEPETDYLDASLITVMQIHLREPPGDILLFLTGQEEIDTACEILYERMKSLGPDVPELIILPVYSALPSEMQTRIFEPAPPGSRKVVIATNIAETSLTIDGIYYVVDPGFVKQKVYNSKTGMDSLVVTPISQAAAKQRAGRAGRTGPGKCYRLYTERAYRDEMLPTPVPEIQRTNLATTVLQLKTMGINDLLHFDFMDAPPVESLIMALEQLHSLSALDAEGLLTRLGRRMAEFPLEPNLSKILIMSVALQCSDEILTIVSMLSVQNVFYRPKDKQALADQKKAKFNQAEGDHLTLLAVYNSWKNNKFSNAWCYENFVQIRTLKRAQDVRKQLLGIMDRHKLDVVSAGKNTVRIQKTICSGFFRNAAKKDPQEGYRTLVDSQVVYIHPSSALFNRQPEWVIYHELVQTTKEYMREVTTIDPKWLVDFAPAFFKFSDPTKLSKFKKNQRLEPLYNKYEEPNAWRISRVRRRRN-