Monarch geneset OGS2.0

DPOGS214155
TranscriptDPOGS214155-TA2091 bp
ProteinDPOGS214155-PA696 aa
Genomic positionDPSCF300014 - 746110-751210
RNAseq coverage252x (Rank: top 42%)
Annotation
HeliconiusHMEL0124733e-14042.07% 
BombyxBGIBMGA002895-TA2e-14142.41% 
DrosophilaCG4901-PA4e-17549.36% 
EBI UniRef50UniRef50_E0VC350.049.77%ATP-dependent RNA helicase, putative n=2 Tax=Neoptera RepID=E0VC35_PEDHC
NCBI RefSeqXP_002423679.10.049.77%ATP-dependent RNA helicase, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2700067410.047.18%hypothetical protein TcasGA2_TC013109 [Tribolium castaneum]
NCBI nr blastxgi|3838565420.049.27%PREDICTED: putative ATP-dependent RNA helicase DHX33-like [Megachile rotundata]
Group
Gene OntologyGO:00043861.4e-28helicase activity
GO:00055241.7e-13ATP binding
GO:00036761.7e-13nucleic acid binding
GO:00080262.3e-08ATP-dependent helicase activity
KEGG pathwaycal:CaO19.40332e-151 
 K12818 (DHX8, PRP22)maps-> Spliceosome
InterPro domain[67-256] IPR0140012.3e-31DEAD-like helicase
[462-554] IPR0075021.4e-28Helicase-associated domain
[588-688] IPR0117095.1e-18Domain of unknown function DUF1605
[294-403] IPR0016501.7e-13Helicase, C-terminal
[76-232] IPR0115452.3e-08DNA/RNA helicase, DEAD/DEAH box type, N-terminal
Orthology groupMCL14525 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214155-TA
ATGGATTCTAAGTACGCTTCCATAGGGAGAGAAAAACCGAAAATTACAAACGATTTAACAGTTAAAAGGTTAAAACTATGTGTTAATAACGCAAACACAAAAAAAGATAAAGAGCAGACAACCAATGGAGCAACCAATAATAATAAACTTGAAAATAATTTTCCTAAAGAGAACACAGAAGACCTCAAGGAAGCTAGAAAATCATTACCTGTTTATTTAGTGCGAACCAGAATTATAGAAGAAATAAAAAAACATGATACAATGATATTAATTGGTGAGACAGGAAGTGGTAAGACCACACAAATACCTCAGCTTATTCATGAATATCGACTAGAGGGTAAGAGTTGTGTGGCTGTGACACAACCGAGAAGGGTGGCTGCAATCACTTTAGCTTTACGAGTGGCAGCTGAAATGAATACTGATATTGGATCTATAGTTGGGTATTCAGTAAGATTTGAAGATGTAACAAGTCCAAGAACGAAAGTAAAATATTTGACAGATGGAATGTTATTGAGAGAAGCTGTAACAGATCCATTGCTGAAGAAATATTCCATTATTGTGTTAGATGAAGCCCATGAACGCACTGTAAATACAGATGTCTTATTTGGAATTGTTAAACTCGCCCAAAAGGAAAGAAATGGACAGAAACAAAATCCGCTAAAGGTTATAGTTATGTCGGCGACAATGGATGTGGATTCTTTTAGAAAATATTATGATAATTGTCCTGTTATATACTTAGAGGGTAGAACTTACCCTGTAACAATTTATCACTCTAAGATTAAACATGAGGATTATCAGTATGCTGCTATATGTACAATATTCCAGCTACATACAACAACTCCAGCCAATGAAGATTTCCTTGTGTTTTTAACTGGACAGGAAGAAATTGAAACAGTTATGTCCAATATAAAGCAAATAGCTAAAGAAACTGTTGGTCCTCAAATACGAGTATGTCCTCTATATGCTGGTTTACCCGCCGCTAAACAGTTGTTGGTATGGAAAAAAACACCCCCTGGAATGAGAAAAATTGTGTTAGCAACAAACATAGCTGAAGCTTCTGTGACGATACCAGAAATAAGATATGTTATCGATACGGGTGTTGTAAAAGAGAGGACGTGGTGTACTCGTACCGGTGCTGAGCGTTTGTCAGTGGTGCCGTGTTCTCAGGCGGCTTCCTGGCAGCGAGCGGGGCGCGCTGGACGGACTGCAGCCGGCGCCTCCTACAGACTATACACAGCCACCGATTTTAAATGCAGGCGGCAACACAACATACCGGAAATAGTACGTTGTCCGCTCACGTCAACAGTGCTGATGTTGATAGCAACTGGCTTGGATCCCGGGACATTTCCATTGATAGACACGCCGCCGAAAGATTCGATACATGCTGCTTTATTGTTATTAAAAGAATTAGGTGCCGTAGATAATGAAAGTAATCCAAAATTAACGGTTCTCGGAAAGAAGATGACGGCATTCCCAATAGACCCAAAATATGCCAAGATCCTTTTATGTGCTCCAGAGTATGGATGTTTGGAAGAGGCTCTGAGCTTGGTGGCGGTAATGTCCAGCGAAAATGTGTTTCACACGCCATTACACAAAAGGGAAGACGCTTTGAAAGTAAAGCAAAAATTTATATCGCCTCTCGGAGATCATATCACGCTTCTAAACGTTTATAGAGCGTTCTGTAAGGCGCCTCTTAAAAAGCAATGGTGCCGTGAAAACTATTTGAATCACAAGAATTTGTCTTACGCTTTTGATGTACGTCAGCAGTTACTGCTAGTTTGCCAGCGATTAAATTTGGCAGTATCGAGTTGTGGAAATGCCACGGATCAGTTGCTGAAGTGTCTGCTGAGCGGTCTGTTCACGAACTGCGCGTGGTCCCGGGCGGGCGGGCGGTACGTGACGAGTGCGGGCGCGGCGGCGTCTCTGCACCCGTCCAGCGTGCTGCACGGACGTTCTGCGAACCCTGCACTAGTGTACACGGAGTTACTGCACACGCAGCGGTCGTTCTTAATCAACGTGTCCGTGGTGCAACCGCAGTGGCTGCAGCAGGTTGCGCCCGAATACGCCAGGCGGTGTCGCTCGAACCGGTGA

Protein sequence:

>DPOGS214155-PA
MDSKYASIGREKPKITNDLTVKRLKLCVNNANTKKDKEQTTNGATNNNKLENNFPKENTEDLKEARKSLPVYLVRTRIIEEIKKHDTMILIGETGSGKTTQIPQLIHEYRLEGKSCVAVTQPRRVAAITLALRVAAEMNTDIGSIVGYSVRFEDVTSPRTKVKYLTDGMLLREAVTDPLLKKYSIIVLDEAHERTVNTDVLFGIVKLAQKERNGQKQNPLKVIVMSATMDVDSFRKYYDNCPVIYLEGRTYPVTIYHSKIKHEDYQYAAICTIFQLHTTTPANEDFLVFLTGQEEIETVMSNIKQIAKETVGPQIRVCPLYAGLPAAKQLLVWKKTPPGMRKIVLATNIAEASVTIPEIRYVIDTGVVKERTWCTRTGAERLSVVPCSQAASWQRAGRAGRTAAGASYRLYTATDFKCRRQHNIPEIVRCPLTSTVLMLIATGLDPGTFPLIDTPPKDSIHAALLLLKELGAVDNESNPKLTVLGKKMTAFPIDPKYAKILLCAPEYGCLEEALSLVAVMSSENVFHTPLHKREDALKVKQKFISPLGDHITLLNVYRAFCKAPLKKQWCRENYLNHKNLSYAFDVRQQLLLVCQRLNLAVSSCGNATDQLLKCLLSGLFTNCAWSRAGGRYVTSAGAAASLHPSSVLHGRSANPALVYTELLHTQRSFLINVSVVQPQWLQQVAPEYARRCRSNR-