Monarch geneset OGS2.0

DPOGS205379
TranscriptDPOGS205379-TA4575 bp
ProteinDPOGS205379-PA1524 aa
Genomic positionDPSCF300373 + 51448-70574
RNAseq coverage922x (Rank: top 14%)
Annotation
HeliconiusHMEL0135940.033.56% 
BombyxBGIBMGA008779-TA0.059.78% 
DrosophilaCG5931-PA0.073.58% 
EBI UniRef50UniRef50_A3AMK80.043.89%Putative uncharacterized protein n=3 Tax=Oryza sativa RepID=A3AMK8_ORYSJ
NCBI RefSeqXP_970554.10.074.85%PREDICTED: similar to pre-mRNA-splicing helicase BRR2 [Tribolium castaneum]
NCBI nr blastpgi|910938510.074.85%PREDICTED: similar to pre-mRNA-splicing helicase BRR2 [Tribolium castaneum]
NCBI nr blastxgi|3504122760.069.54%PREDICTED: putative U5 small nuclear ribonucleoprotein 200 kDa helicase-like [Bombus impatiens]
Group
Gene OntologyGO:00055245.8e-22ATP binding
GO:00080265.8e-22ATP-dependent helicase activity
GO:00036765.8e-22nucleic acid binding
GO:00043862.2e-16helicase activity
KEGG pathwaysbi:SORBI_02g0407000.0 
 K12854 (SNRNP200, BRR2)maps-> Spliceosome
InterPro domain[902-1165] IPR0041798.2e-91Sec63 domain
[476-660] IPR0140017.3e-23DEAD-like helicase
[482-647] IPR0115455.8e-22DNA/RNA helicase, DEAD/DEAH box type, N-terminal
[692-784] IPR0016502.2e-16Helicase, C-terminal
Orthology groupMCL10167 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205379-TA
ATGGCTGACGCGGCCGCCAGACAGTTGCAATATGAATACAAGGCGAACTCGAACCTTGTACTCCAAGCTGATGTGCGCCTCATAGAGCGTCGTGGTCGTGATGAGGCCACGGGGGAGGTGTTGTCTCTGTCCGGCAAGTTGGGTGGCACGAAGATGGGGGATAGGGCTCAACGGACAAAACCTGATAAGGCCGAGGAGAGGAAAGCCAAACGTCAGAAGCGTGATGAGGCAACATATGAGTTGACCAAGTCCAAATCCCGAGCTGCATGGGCTGATGAAACTCCCGGGGCTTTGTATAGACCGAGAGCCCAGCACACTCGACACGCCTATGAGCTGCTGTTAGCCTTCATGCAAAGCGCCTTGGGCGACCAGCCCAGGGATGTTCTGTGCGGAGCTTGCGATGAGGTGTTGGTGGTGTTGAAAAACGAAAAGATGAAGGACCTGGAAAGGAAGAAAGAGATCGAGTTACTCCTCGGTCCGATAGCAGATGAAAGATTCGCTTTATTAGTTAATTTAGGTAAAAAAATCACCGACTTCACAATAAGTACATCCACCGAAGGCAACACGGAGATCGATGAAACATATGGCATTAATGTGCAGTTTGAGGAGTCGGAGGAGGAAGATGACGAGGATGCTTATGGAGAGGTCCGTGATGAAGAGAGGGAGGAGGGAGATGCTTCAGAGGCGGAGGGTGGGGAGAGTTCCGGGGAGGAGAGTGGGGGGGAGAAGAAAAACGCCATACACGCTAATCTGTCAGAGGAGCAGTCCAGCAAGCGTCGGGACGCCGTCCTCCATGCGATGGATATAGACGCCTACTGGCTGCAGCGGAGACTCTCAAGACACTTCCCGGATGCTATGTTATCACAGGCGAAGTCCTCCGAAGTGCTGCAAGCTCTGGCGGATGCGGCTGATGAAAGAGACCTGGAGAACAGGCTAGTGTTGCTGCTGGGTTACGACTGTTTCGACCTGGTCAAGACGCTCAACAAGTACAGATACACGGTTCTGTACTGCACGAAACTGGCATCGTCACAGTCTGAGAGCGAGAGGGCTGCTATAAGAGAGGAGATGGCAACAAAACCACACCTGCAGAAGATCCTGGCACAGCTGGACACCGGGAAAGGGGAGGATGAGAGCACGGTGCAATCGGAGCAGCCCCGCAAGCGTTCCAGGTCGGTGCTCGAGGGAGGCTGGGCTGGCAGCGTGGCGGGGAACCGGAAACAGCTGCAGCTAGACGAGCTGGTGTTTGCAGCTGGGGCTCACTTCATGGCCAACAAGCGCTGTCAGCTACCACCTGGATCCTTCAGGAAGCAGCGGAAAGGCTACGAGGAAGTCCACGTCCCGGCGCTGAAACCAAAGCCGTTCGAGGAGAACGAGGTGTTGCTACCCATAGAGAAGCTGCCCAAGTATGTACAACCCGCCTTCGAGGGCTTCAAGACGCTCAACCGGATACAGAGTCGCATCTCTACAGCGGCCCTGGAGTCGGATGACAATCTGCTGGTCTGCGCGCCCACGGGAGCGGGCAAGACGAACGTGGCGCTGCTGTGCATCCTGAGGGCGTTGGGCAAACACGTCAACCACGACGGCAGCGTCAACGTGGGCGACTTCAAGGTGGTGTACGTGGCGCCCATGAGGTCGCTGGTGCAGGAGATGGTGGGCAATTTCAGCAAGCGGCTGGCGGCTTACAACATGAAGGTGTCGGAGCTGACGGGTGACCACCAGCTCACCAGGGAACAGATAGACGACACGCAGCTCATAGTGTGCACGCCTGAAAAATGGGACATCGTCACTAGGAAAGGTGGCGAGCGTTCCTTCACCAACCTGGTCCGCCTCATCATCATAGACGAGATTCATCTCCTGCACGACGAGCGCGGTCCAGTTCTGGAGGCGTTGGTCGCCCGCACGCTGAGGACGGTGGAGCACACACAGGAAGAGATATTGGTGTTCGTGCACTCCAGGAAGGAGACGGGGAAGACGGCGCGGGCTGTCCGCGACACCTGCCTGGAGAAGGACACGCTCGGACATTTCCTTAGGGAGGGTTCCGCCAGCGTGGAGGTCCTACGTACCGAGGCTGAGCAGGTGAAGAACCCTGAACTCCGTGAGTTACTACCCTACGGTTTCGCCATACATCACGCGGGTATGAGCCGAGTGGACAGGACCCTGGTAGAAGATCTGTTCGCTGACAGACATATACAGGTGTTGGTATCAACGGCTACCCTGGCGTGGGGTGTGAACCTGCCAGCACACACGGTCATAGTGAAGGGGACGCAGGTGTACAGCCCCGAGAAGGGCCGCTGGAGCGAGCTGGGGGCGCTGGACGTGCTTCAGATGCTGGGAAGAGCTGGACGGCCGCAGTACGACACCAAGGGAGAAGGCATTCTGATAACAAATCACTCCGAGCTGCAATACTACCTGTCGCTGTTGAACCAGCAATTGCCGATCGAATCACAGCTCGTCTCCAAGCTACCGGATATGCTGAACGCGGAGATAGTGCTGGGTTCCGTACAGAGCGTGAGGGATGCTGTCACGTGGCTCGGTTACACGTACCTGTATGTGCGCATGCTCCGCCAGCCCGCTCTGTACGGTGTGAGCGAGGACAGGCTGAAGGACGACGCGTTGTTGGAACTACACCGGGCCGACCTCGTTCACACCGCCGCCTCGCTGCTCGACAAAGCCGGTCTAATTAAATACGAGCGGAAATCCGGTCACTTCCAAGCCACTGAGCTGGGTCGTATAGCTTCTCACTACTACTGCACCTATGAGACCATGCAGAACTACAACCAGCTGTTGAAGCCCACGTTGGCTGAGATAGAGTTGTTCAGAGTGTTCTCGCTGTCTGCTGAGTTCAAGCACATAACTGTCAGGGACGAGGAGAAGCTGGAGCTGCACAAGCTCATGGAGCGGGTTCAAAATCATTCCACATATGCAGACCGACCTCTGACCCGCTGGGCGCAGCTGGTCGATAAGACGCTGGCTCTGTGCAAGATGGTAGACAAACGGATGTGGCAGTCCATGTCGCCGTTGAGACAATTCAGGAAAATGCCGGAGGAGGTGATCAAAAAGCTGGAGAAAAAGAACTTCCCCTGGGAGAAGCTGTACGAGCTGGGCCCCAATGAGATCGGTGAACTGGTCCGAGCACCCAAGTTAGGGAAGATGATACACAAATATGTACACCAGTTCCCCAAGTTAGAACTGGCAACACACATACAGCCCATCACGAGGTCCACACTGAGGGTCGAGCTTACTATAACTCCTGACTTCCAATGGGATGAGAAGATCCACGGCCAGTCTGAAGCTTTCTGGATCCTGGTCGAGGATGTGGACTCTGAGACCGTGCTGCACCACGAGCAGCTGCTGCTGAAGCACAAGTACTGCCGCGACGAGCAGCACGTGAAGCTGTTCGTGCCGGTCTTCGAGCCACTGCCTCCACAGTACTTCCTCAGGGTTGTATCAGATCGGTGGATAGCCGCGGAAACGCAGCTTCCAGTGTCATTCCGGCACCTCATCCTTCCGGAGAAAAACCTGCCGCCCACCGAACTTCTGGACCTGCAACCTCTGCCGATATCCGCCCTCCGGAACGAGAAATGGGAACAGCTGTACAAAGACGCCTTCCCGCAATTCAATCCAGTGCAGACACAGGTCTTCAACGCTGTATACAACTCGGACGACAACGTGTTCGTGGGTGCTCCATCAGGGTCTGGCAAGTCGGTGATCGCTGAGCTGGCACTTCTCCGACTCCTGACCCACAGCCCAGCCTCCAGAGCGGTGTACCTGGTCCCTCATGATGCACTCGCAGATATAGTGTTTGCGGACTGGTATCACAAGTTCGGTGCCCGCTTCAATCTGAAGGGCTTCAACATCTCCCACGCGGGGTCCCGCCTAGCGGCCATGACCCGTCCCATCTACAATGCGATCCTCCGCCACGCGGGCTCGCGTCCCGTGGCGGTGTTCGTCCCGTCCCGGCGTCACGCTCGCGTCCTGGCGGCAGACCTGCTAGCCCTGGCCGGGGCACACGACACGCCGGGGAGATTCCTCCGCGCTCGCCCTGACCTAGTCCAGCCCTTCCTTGATAAGGTGCAGGATAGGACGCTCCGCGAGACGCTAGCAGCTGGGGTGGCGTATCTGCATGCTGGGGTTTGTGCTGGAGATCGCCGTGCAGCGCTTCAACTTCTGGAATCCGGCGCGGCCCAGCTGTGCGTGGCGGCGGCTGAACTGGCGTACGCCTTCACCGCTCACGTCCACACCGTCATAGTGGCTGATACTAGTGTCTACAACGGTAAGCTGCACTGCTACGAGCAGTACCCGGTGACGACGGTGCTGCAAATGCTGGGCCGGGCCTGCAGACCTCTGGAAGACGAGCACGCAGTCGCAGTACTCATGTGCGCTCAGCATCACAAGACGTTCTTCACGAAGCTTCTCAATGACTGCCTACCCTTAGAGAGTCACTTGGACCACCGTCTCCACGACCACATGAACGCTGAGATCGTGACGAAGACGATAGAGAACAAACAGGACGCGGTGGACTACCTCACCTGGACCTTCCTCTACAGGAGACTCACGCAGAATCCCAACTACTAG

Protein sequence:

>DPOGS205379-PA
MADAAARQLQYEYKANSNLVLQADVRLIERRGRDEATGEVLSLSGKLGGTKMGDRAQRTKPDKAEERKAKRQKRDEATYELTKSKSRAAWADETPGALYRPRAQHTRHAYELLLAFMQSALGDQPRDVLCGACDEVLVVLKNEKMKDLERKKEIELLLGPIADERFALLVNLGKKITDFTISTSTEGNTEIDETYGINVQFEESEEEDDEDAYGEVRDEEREEGDASEAEGGESSGEESGGEKKNAIHANLSEEQSSKRRDAVLHAMDIDAYWLQRRLSRHFPDAMLSQAKSSEVLQALADAADERDLENRLVLLLGYDCFDLVKTLNKYRYTVLYCTKLASSQSESERAAIREEMATKPHLQKILAQLDTGKGEDESTVQSEQPRKRSRSVLEGGWAGSVAGNRKQLQLDELVFAAGAHFMANKRCQLPPGSFRKQRKGYEEVHVPALKPKPFEENEVLLPIEKLPKYVQPAFEGFKTLNRIQSRISTAALESDDNLLVCAPTGAGKTNVALLCILRALGKHVNHDGSVNVGDFKVVYVAPMRSLVQEMVGNFSKRLAAYNMKVSELTGDHQLTREQIDDTQLIVCTPEKWDIVTRKGGERSFTNLVRLIIIDEIHLLHDERGPVLEALVARTLRTVEHTQEEILVFVHSRKETGKTARAVRDTCLEKDTLGHFLREGSASVEVLRTEAEQVKNPELRELLPYGFAIHHAGMSRVDRTLVEDLFADRHIQVLVSTATLAWGVNLPAHTVIVKGTQVYSPEKGRWSELGALDVLQMLGRAGRPQYDTKGEGILITNHSELQYYLSLLNQQLPIESQLVSKLPDMLNAEIVLGSVQSVRDAVTWLGYTYLYVRMLRQPALYGVSEDRLKDDALLELHRADLVHTAASLLDKAGLIKYERKSGHFQATELGRIASHYYCTYETMQNYNQLLKPTLAEIELFRVFSLSAEFKHITVRDEEKLELHKLMERVQNHSTYADRPLTRWAQLVDKTLALCKMVDKRMWQSMSPLRQFRKMPEEVIKKLEKKNFPWEKLYELGPNEIGELVRAPKLGKMIHKYVHQFPKLELATHIQPITRSTLRVELTITPDFQWDEKIHGQSEAFWILVEDVDSETVLHHEQLLLKHKYCRDEQHVKLFVPVFEPLPPQYFLRVVSDRWIAAETQLPVSFRHLILPEKNLPPTELLDLQPLPISALRNEKWEQLYKDAFPQFNPVQTQVFNAVYNSDDNVFVGAPSGSGKSVIAELALLRLLTHSPASRAVYLVPHDALADIVFADWYHKFGARFNLKGFNISHAGSRLAAMTRPIYNAILRHAGSRPVAVFVPSRRHARVLAADLLALAGAHDTPGRFLRARPDLVQPFLDKVQDRTLRETLAAGVAYLHAGVCAGDRRAALQLLESGAAQLCVAAAELAYAFTAHVHTVIVADTSVYNGKLHCYEQYPVTTVLQMLGRACRPLEDEHAVAVLMCAQHHKTFFTKLLNDCLPLESHLDHRLHDHMNAEIVTKTIENKQDAVDYLTWTFLYRRLTQNPNY-