Monarch geneset OGS2.0

DPOGS206984
TranscriptDPOGS206984-TA3495 bp
ProteinDPOGS206984-PA1164 aa
Genomic positionDPSCF300001 + 439224-448199
RNAseq coverage218x (Rank: top 45%)
Annotation
HeliconiusHMEL0021260.075.91% 
BombyxBGIBMGA012946-TA0.069.59% 
Drosophilakz-PA0.050.08% 
EBI UniRef50UniRef50_D2A3P70.054.10%Putative uncharacterized protein GLEAN_15731 n=1 Tax=Tribolium castaneum RepID=D2A3P7_TRICA
NCBI RefSeqXP_001844430.10.050.33%ATP-dependent RNA helicase DHX8 [Culex quinquefasciatus]
NCBI nr blastpgi|1700331260.050.33%ATP-dependent RNA helicase DHX8 [Culex quinquefasciatus]
NCBI nr blastxgi|3504112810.051.21%PREDICTED: probable ATP-dependent RNA helicase kurz-like [Bombus impatiens]
Group
Gene OntologyGO:00043861.9e-13helicase activity
GO:00055242.5e-12ATP binding
GO:00036762.5e-12nucleic acid binding
GO:00080266e-11ATP-dependent helicase activity
KEGG pathwaypyo:PY008353e-90 
 K12815 (DHX38, PRP16)maps-> Spliceosome
InterPro domain[264-455] IPR0140011.2e-26DEAD-like helicase
[887-1019] IPR0117092.5e-19Domain of unknown function DUF1605
[763-838] IPR0075021.9e-13Helicase-associated domain
[600-701] IPR0016502.5e-12Helicase, C-terminal
[273-428] IPR0115456e-11DNA/RNA helicase, DEAD/DEAH box type, N-terminal
Orthology groupMCL14883 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206984-TA
ATGGGTAAAAGACGATACAATGCAAAGGCTCGCCAAGTGGTCAAAACAGATATTGACGATTCCAAAACTAATGAGATAAAATTAGAATTCACATCAAATGAATATGGAACAAGTGATACTGCAAATGCATTGGCCTTGCCTTCAAGAAAGAGAGAAACTAAGATTATTGCAGACAAAAAAGAAAAGACTAGATTTCTATCAAAGGCCCAAAGGAAACGGTTAGAAAAGATAGTTGATAAGAAGAAAAAGAAAGAAAATAGAGCTGCGCTTTTGGAATCCTTGTCACAAGTACAAGCAACTCCTGAGGAAGTGAAACAATTGACTACAATATCCTCTGTCCAGACACTCGGTTTGAGAAAGTTGACAGAGTTTAATTTAGAAACTACTCAAAATGCTACTGAAGTTAGTGAACAAAAGAAGTTTAGTAGTATTGCAGGTGCAAAGAAAAGATTGAGACTTTTAAAAATGGATAGATCTGATGACAATAAAAAGAAGAAATATGATCCTAATGTTGTTGGTCTCGAAGAATCTTCTGATGATTCAAGTATAGAAACTGATGATAGCGAAGGCAAAGACACATGTGAAGTAATTGAAACCACAGATAAAATAATCAATGTTGAAAAAGGTGCTACATCATCAGCCCATAGAAGTAATGAAACAAATACTAAGGCAATTGAAATTAAAGAAAGTGAACCTGTAGAAGTGGAAGCGGAAGTAAAAAAGCCTATTCTAGAACATCCTACCATAAATGTTCAAGTTAAAAGAGACCCCAAAGTTCAAGTGGCTAGATTGAAATTACCAATATTAGGAGAGGAACAAAGAGTTATGGAGTTGATAAACGAAAATGAATTTGTCATCGTCGCTGGTGAAACTGGTAGCGGTAAAACTACTCAGATACCTCAGTTTCTCTATGAAGCTGGGTATACAGAAAATAAAATGATAGCAGTTACTGAACCTCGAAGAGTGGCAACTGTGGCTATGTCAGCTCGAGTCGGTTATGAACTGGGCTTGAGCAGCAAAGAGGTTTCATATCTGATGAGATTTGAAGGAAATGTCACCAAAGACACTAAGATTAAGTTTATGACTGACGGTGTTCTCTTAAAAGAGATCCAATCGGATTTTCTCTTAAGCAAATATTCAGTTGTTATTATAGACGAAGCTCATGAAAGAAGCATGTACACAGATATACTTCTGGGTTTGCTATCTAGAATTGTACCGTTGAGACGTAAAAGGGGATGTCCATTGAGACTCATTATAATGTCAGCTACCCTCAGAGTTGAGGATTTCACAGAAAATACGAGACTTTTCAAAGTTCCACCTCCTGTTATAGAGATACAATCTAGGCAGTTCCCTGTGACGGTACATTTCAATAAACACACATACAGTGATTATTTGAAAGAAGCTTTTAAGAAAACAGTCAAAATACACACCAGGCTACCGGAAGGTGGAATTTTGATATTTGTGACTGGCCAGCAAGAGGTAAACTATCTAGTAAGGAAATTAAGAGCGTCTTTTCCATACCACAAAGGTGTTGATTATTCGTCATTAATAAACAAAAAGGTGAATGTAGTTGACACCTCCTTGGACTCGGAGCCTGATGATATTGAATCGGATGATGATGAGGTGGAAAAGGAAATGAAGCGGATACGCAAAGCCAGGAAAAAAGCGAAGCGCAAGACTATAAAGTTACCAAAAATTAGTTTGGACGATTTTGACATGCCGGAGGACGATGGCCAGCCAGATTTAGTCAGTGACGCTGACAGCGAGGGTCATTTGAGCGATTCAGATGCCGATGAATCAACTCTGACACCTATCGTAAAATCCAGCCAACAGCCGCTGTGGGTGTTGCCGTTGTATTCTATGTTGAGTACAGCGAAACAGGGTCGGGTGTTCGAAACCCCGCCCGCTGGCACAAGACTGTGCGTCGTCAGCACCGACGTCGCGGAGACATCCCTCACGATACCCAGCATAAAGTACGTCGTTGATACCGGCAAGAAGAAGATGCGTATATACGATCACGTGACCGGCGCAAGCGCATGGCGTGTAGTGTGGACGTCTCAGGCCAGCGCGGAACAACGTTCGGGCCGAGCGGGACGGACGGGTCCGGGACACGTATATAGATTGTACAGTAGTGCGGTGTACCAACACGAGTGTGTACCACAGTACAAACCCGACCTCTGTACGAGGCCCGTAGATCATTTGATGCTGACCCTAAAATGTATGGGCATCGATAAGGTGGTTAACTTTCCATATCCCACCGCACCAGACAGGATGCAGTTGCGTTTAGCTGAGAAGCGACTAGAAGTTTTGGGAATTTTGGAGAAGGTTGAAATGAGGAACAGGAGAAAAGATGACGAAGAGGTGTTAAAAGTAACTCCGCTGGGGAAGGCTGTGTCGGCGTTTCCGCTGCTTCCTCGCTACGGCAAGATGTTGGCTCTCAGTCATCAATATACTTTACTACCGTACGCTATAACCATCGTTTCTGCTTTAACTGTGCCGGAGGTAATGTCGGGTAAAACCGATAGTTGGCCTGCGACCGGTAATATGTTATTGCTCGGCGACCCCGGCGTACTACTGAGAGCGGTCGGAGCTTGCGATCATAGTACGGAGGCGTTGTCTGTGTTCTGTGCCAAGTATGGACTAAGGGAGAAGGCCATAATAGAAATTAGGAAGCTTCGGAAGCAGCTAGCATCTGAGATAAATCTCAGCGTGTCCGGTGTTAATCTTGTTGTGGATCCTAAACTGCAGCCTCCTGATGACAAACAGGCGAAATTGTTGAGACAGTTGTTGCTCAGCGGTCTAGGAGACCAGGTCGCGAGGAAGATTAGTATGGCAGAAGTTAAAGAAGGTGAGGACAAACGGAAATACAAATACGCCTACCGCTGTTCTGATCTGGACGAGCCGGTGTTCATCCACTCGGAGTCCATATTACGCAAAGTGATACCCGAATGGGTGATATACCAGGAGCTGTACGAGACCGGCCCGGATGACAGGAAGAAGATGATCATGAGGAACATTACCGCCGTAGAACCTGAATGGCTGCCAGTTTATGTGCCGTTCCTATGTAATTTAGGTGAACCGCTATCGGAGCCCGAGCCGCGATACGACGCGAGGTCGGGCAGGGTGAAATGTCACTTTAAAGGGACATTCGGTAAAAGTTGTTGGGAACTACCCACAGTTGAGATCGATTATCCGGAGAAAATAGACAAATACAGATGGTTCGCAAGATTCCTTTTGGAAGGTTCAGTATTCGTGAAGCTCAAGAAATACGCGGCGTCCCTACTGTCACCCCCCTCCACTATGATAAAGAGCTGGGCGAAATTACAACCAAGGACGGAAATACTATTGAAGGCTCTGATAGACAAGAAAATAGGCTCCAAAGATAAGATGGAGCAGATTTGGAAAGAAAAACCCACTTATTTGCAGGACGAATATTTGAAATGGGTCCCGGAGTCGGCTCACAACGAAGTTATATTGTACTGGCCGCCGTTGTAG

Protein sequence:

>DPOGS206984-PA
MGKRRYNAKARQVVKTDIDDSKTNEIKLEFTSNEYGTSDTANALALPSRKRETKIIADKKEKTRFLSKAQRKRLEKIVDKKKKKENRAALLESLSQVQATPEEVKQLTTISSVQTLGLRKLTEFNLETTQNATEVSEQKKFSSIAGAKKRLRLLKMDRSDDNKKKKYDPNVVGLEESSDDSSIETDDSEGKDTCEVIETTDKIINVEKGATSSAHRSNETNTKAIEIKESEPVEVEAEVKKPILEHPTINVQVKRDPKVQVARLKLPILGEEQRVMELINENEFVIVAGETGSGKTTQIPQFLYEAGYTENKMIAVTEPRRVATVAMSARVGYELGLSSKEVSYLMRFEGNVTKDTKIKFMTDGVLLKEIQSDFLLSKYSVVIIDEAHERSMYTDILLGLLSRIVPLRRKRGCPLRLIIMSATLRVEDFTENTRLFKVPPPVIEIQSRQFPVTVHFNKHTYSDYLKEAFKKTVKIHTRLPEGGILIFVTGQQEVNYLVRKLRASFPYHKGVDYSSLINKKVNVVDTSLDSEPDDIESDDDEVEKEMKRIRKARKKAKRKTIKLPKISLDDFDMPEDDGQPDLVSDADSEGHLSDSDADESTLTPIVKSSQQPLWVLPLYSMLSTAKQGRVFETPPAGTRLCVVSTDVAETSLTIPSIKYVVDTGKKKMRIYDHVTGASAWRVVWTSQASAEQRSGRAGRTGPGHVYRLYSSAVYQHECVPQYKPDLCTRPVDHLMLTLKCMGIDKVVNFPYPTAPDRMQLRLAEKRLEVLGILEKVEMRNRRKDDEEVLKVTPLGKAVSAFPLLPRYGKMLALSHQYTLLPYAITIVSALTVPEVMSGKTDSWPATGNMLLLGDPGVLLRAVGACDHSTEALSVFCAKYGLREKAIIEIRKLRKQLASEINLSVSGVNLVVDPKLQPPDDKQAKLLRQLLLSGLGDQVARKISMAEVKEGEDKRKYKYAYRCSDLDEPVFIHSESILRKVIPEWVIYQELYETGPDDRKKMIMRNITAVEPEWLPVYVPFLCNLGEPLSEPEPRYDARSGRVKCHFKGTFGKSCWELPTVEIDYPEKIDKYRWFARFLLEGSVFVKLKKYAASLLSPPSTMIKSWAKLQPRTEILLKALIDKKIGSKDKMEQIWKEKPTYLQDEYLKWVPESAHNEVILYWPPL-