Monarch geneset OGS2.0

DPOGS212385
Transcript        DPOGS212385-TA   1875 bp
Protein           DPOGS212385-PA   624 aa
Genomic position  DPSCF300019 + 779288-804096
RNAseq coverage   208x (Rank: top 46%)
Annotation
Heliconius        HMEL013943         0.0      81.40%
Bombyx            BGIBMGA002895-TA   2e-146   47.44%
Drosophila        l(1)G0007-PA       0.0      73.20%
EBI UniRef50      UniRef50_Q9VY54    0.0      73.20%   LD24737p n=12 Tax=Bilateria RepID=Q9VY54_DROME
NCBI RefSeq       XP_969616.2        0.0      74.91%   PREDICTED: similar to pre-mRNA splicing factor ATP-dependent RNA helicase PRP16 [Tribolium castaneum]
NCBI nr blastp    gi|189235866       0.0      74.91%   PREDICTED: similar to pre-mRNA splicing factor ATP-dependent RNA helicase PRP16 [Tribolium castaneum]
NCBI nr blastx    gi|189235866       0.0      74.91%   PREDICTED: similar to pre-mRNA splicing factor ATP-dependent RNA helicase PRP16 [Tribolium castaneum]
Group
Gene Ontology     GO:0004386   4.9e-37   helicase activity
                  GO:0005524   3.2e-20   ATP binding
                  GO:0003676   3.2e-20   nucleic acid binding
KEGG pathway      tca:658113   0.0
                  K12815 (DHX38, PRP16) maps-> Spliceosome
InterPro domain   [329-419]   IPR007502   4.9e-37   Helicase-associated domain
                  [453-553]   IPR011709   5.1e-23   Domain of unknown function DUF1605
                  [161-268]   IPR001650   3.2e-20   Helicase, C-terminal
Orthology group   MCL10030   Multiple-copy universal gene

Nucleotide sequence:

>DPOGS212385-TA
ATGTTTTATAATACCATTATAAAATTTAATCAACCATCCTCCCTCGCTTTCCTTAAGCCGGCGTTATCTCGTCGGTCAACGACCTTCAGCTGGTACAAGGCTTACTTAAATGGGTCAAGGAAAAATATTATTAAGCGAACTGTGGTAGCCCGTCGTTCGGATCTAAAGCTGATCGTGACCTCAGCGACAATGGACTCCAGCAAGTTCTCCACGTTCTTCGGCAACGTGCCAACCTTCACCATCCCTGGGAGGACCTTCCCCGTCGAGACTTTCTTCTCCAAGAACGTCTGCGAGGACTACGTCGACGGTGCTGTCAAACAGGCGATAATTTATGTGTGTGTTCGTGTGCGTGTGCGTGTGCGTGCGTATGCATGTGTGGTCATAAATAATGGTCCTTATCTAGCGGAAGAGGCGCTGCAGATCCATTTACAACCGGATGAAGGAGACATCTTGATTTTCATGCCAGGTCAAGAAGACATTGAAGTCACTTGCGAGGTCCTAACAGAACGTTTGGGTGATCTGGATAACGCGCCGCCACTCACCGTACTGCCGATATATTCGCAACTTCCAGCTGACCTGCAAGCGAAGATATTCCAGCGTGCTCCTCCAGGGCAAAGAAAATGCATCGTGGCTACCAATATAGCTGAGACGTCTCTGACTGTGGATGGCATCATGTACGTGATCGACTGTGGTTATTGTAAACTGAAGGTCTACAATCCACGGATAGGCATGGACGCTCTACAGATCTACCCGGTGAGCCAGGCGAACGCTCGCCAGCGCGCAGGTCGAGCGGGTCGTACAGGTCCCGGGCGGGCGTTCTGCCTGTACACGGAGCGGCAGTTCAGCCAGGAGCTGCTGCCGGCTACCGTGCCGGAGATCCAGCGCACCAACCTCGCCAACACCGTCCTGCTGCTCAAGTCGCTGGGGGTCGACGACCTGCTCGCCTTCCACTTCATGGACCCGCCGCCGCAGGACACAATCCTGAACTCGATGTACCAACTGTGGATCCTGGGAGCGCTGGACGGCACGGGAGCCCTGACGCCTCTCGGGAGACAGATGGCGGAGTTCCCCCTGGACCCCCCGCAATGTCACATGCTCATAGTCTCCGCTGAGATGGGTTGCAGCGCTGAGATGCTCATCATAGTGTCGATGCTGTCGGTGCCGTCCGTGTTCTACCGGCCCCAGGGTCGGGAGGAGGACGCCGACACGGCCAAGGAGAAGTTCCAAGTGGCGGAATCGGACCACCTCACCTTACTCCACTTATATAACCAGTGGAAATCCAACAATTATTCGAGCGCGTGGTGTACGGAACACTTCGTCCACGCGAAGGCGATGCGGAAGGTCCGCGAGGTCCGCCAGCAGCTGAGGGACATCCTCACGCAGCAGAGACTGCCGCTGCTGTCGTGCGGCACGGACTGGGACACTGTGCGGAAATGTATCTGCTCAGCGTACTTCCAACAAGCCGCTCGTCTGAAAGGCATCGGCGAGTACGTGAACTGTCGGACGGGTATGCCGTGTCACCTGCACCCTACCAGCGCCCTGTTCGGAGCGGGCAGCGCGCCGGACTACGTGGTGTACCACGAGCTGATGATGACCTCGCGGGAGTACATGCACTGCGTCACCGCCGTCGACGGCCGCTGGCTCGCCGAGTTGGGACCCATGTTCTTCTCGGTTAAAGAAACTGGCAAGTCCAATCGCGACAAACGCAAGGAGGCCGCGGTCCATCTGCAAAGGATGGAAGAAGAGATGAAGATGGCGGAACAGAAGATGGCAGAAGAGAAAAAGAAGAGGGATCAAGAGGTTCCCGTGAAACAGGAAGTGGCCACGCCCGGCCTGAACACGCCGCGACGGACGCCGCATACGCTCGGCTTGTAG

Protein sequence:

>DPOGS212385-PA
MFYNTIIKFNQPSSLAFLKPALSRRSTTFSWYKAYLNGSRKNIIKRTVVARRSDLKLIVTSATMDSSKFSTFFGNVPTFTIPGRTFPVETFFSKNVCEDYVDGAVKQAIIYVCVRVRVRVRAYACVVINNGPYLAEEALQIHLQPDEGDILIFMPGQEDIEVTCEVLTERLGDLDNAPPLTVLPIYSQLPADLQAKIFQRAPPGQRKCIVATNIAETSLTVDGIMYVIDCGYCKLKVYNPRIGMDALQIYPVSQANARQRAGRAGRTGPGRAFCLYTERQFSQELLPATVPEIQRTNLANTVLLLKSLGVDDLLAFHFMDPPPQDTILNSMYQLWILGALDGTGALTPLGRQMAEFPLDPPQCHMLIVSAEMGCSAEMLIIVSMLSVPSVFYRPQGREEDADTAKEKFQVAESDHLTLLHLYNQWKSNNYSSAWCTEHFVHAKAMRKVREVRQQLRDILTQQRLPLLSCGTDWDTVRKCICSAYFQQAARLKGIGEYVNCRTGMPCHLHPTSALFGAGSAPDYVVYHELMMTSREYMHCVTAVDGRWLAELGPMFFSVKETGKSNRDKRKEAAVHLQRMEEEMKMAEQKMAEEKKKRDQEVPVKQEVATPGLNTPRRTPHTLGL-
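
The transcript and protein records above can be cross-checked against each other. The following is a minimal Python sketch, not part of the MonarchBase record: it assumes the standard genetic code and assumes that the trailing "-" in the protein sequence marks the stop codon. The names translate and expected_cds_length are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the record uses the standard nuclear code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, matching the "-" that ends
    the protein sequence in this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(protein_length):
    """Length in bp of a CDS encoding protein_length residues plus a stop codon."""
    return 3 * (protein_length + 1)


# First 30 and last 12 nucleotides of DPOGS212385-TA, copied from the record.
print(translate("ATGTTTTATAATACCATTATAAAATTTAAT"))
print(translate("CTCGGCTTGTAG"))
print(expected_cds_length(624))
```

Under these assumptions, the first ten codons translate to the first ten residues of DPOGS212385-PA, the final codon TAG is the stop, and a 624 aa protein plus one stop codon accounts for the 1875 bp transcript length.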