Monarch geneset OGS2.0

DPOGS214267
TranscriptDPOGS214267-TA2958 bp
ProteinDPOGS214267-PA985 aa
Genomic positionDPSCF300014 + 1679768-1684146
RNAseq coverage500x (Rank: top 25%)
Annotation
HeliconiusHMEL0113920.097.73% 
BombyxBGIBMGA001336-TA0.097.15% 
DrosophilaCG10077-PA0.069.30% 
EBI UniRef50UniRef50_Q928410.069.68%Probable ATP-dependent RNA helicase DDX17 n=75 Tax=Amniota RepID=DDX17_HUMAN
NCBI RefSeqXP_972501.10.083.85%PREDICTED: similar to DEAD-box RNA-dependent helicase p68 [Tribolium castaneum]
NCBI nr blastpgi|2700046640.083.92%hypothetical protein TcasGA2_TC010324 [Tribolium castaneum]
NCBI nr blastxgi|3454932180.076.92%PREDICTED: probable ATP-dependent RNA helicase DDX17-like [Nasonia vitripennis]
Group
Gene OntologyGO:00055241e-48ATP binding
GO:00080261e-48ATP-dependent helicase activity
GO:00036761e-48nucleic acid binding
GO:00043869.5e-35helicase activity
KEGG pathway 
InterPro domain[125-328] IPR0140011.2e-63DEAD-like helicase
[130-300] IPR0115451e-48DNA/RNA helicase, DEAD/DEAH box type, N-terminal
[367-448] IPR0016509.5e-35Helicase, C-terminal
Orthology groupMCL10060 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214267-TA
ATGGGACGTGGAAGGGATCGCAGTCGAGATAGACGTCGGAGCCGTAGTCGTAGCCGGAGCAGGAGTCGCAGCCGAAGCCCTCGCTACGGTCTTGGTGGAGGTGGTGGTGGTGGTGGATATAGCGGAGGCGGACGCAGGAGTAATCCCGGAGCCAATCTGCGCAAACCGAAATGGGACCTTAACAGGCTTAAGCCTTTTAAGAAAGATTTCTACGTGCCTCACCCAGATGTTGAAAGTAGACTTGAATCAGATGTCGAGGCTTGGAGGAGTGAAAATGAAATAACGTTGAAAGGGCGTAATATACCAAAACCCACACTAACTTTCGATGAAGCCGGATTTCCTGATTATGTTATGGATGAAATTGATAAAATGGGATTTTCCAAACCAACACCAATTCAGGCACAAGGTTGGCCTATAGCCTTAAGTGGATGTGATATGGTTGGCATTGCTTCGACAGGTTCAGGAAAAACTTTATCTTATATTTTACCTGCAATAGTTCACATTAATAATCAGCCCAAGTCAAGTAGAGGAGATGGACCAATTGCTTTAGTATTGGCTCCAACAAGAGAACTCGCCCAACAAATTCAAGAAGTGTGTGATAAGTTCGCTAACACCTCCAAAATTCACAACACATGTTTGTTTGGTGGAGCTCCCAAAGGTCCACAAGCTAGAGATTTGGATGCTGGTGTTGAAATTGTAATTGCAACACCGGGCCGTCTATTAGACTTTTTAGAGAGTGGTCGGACAAATCTTAAAAGATGCACATATTTGGTACTTGATGAAGCAGATCGAATGTTGGATATGGGATTTGAACCTCAAATTAGAAAAATCATAGAACAAATACGACCTGATAGACAAACACTCATGTGGTCTGCTACATGGCCTAGAGAAGTACAGAGTTTAGCAGCAGAATTTTTGAAAGATTATTTACAAATCAATGTTGGTTCCTTACAATTAGCAGCCAATCACAACATCCTTCAGATCATTGATGTTTGTATGGAATATGAAAAAGAAACTAAACTTAGTACATTGTTAAAAGAAATTATGGCCGAAAAGGAAAATAAAACTATCATATTCATTGAAACAAAACGCAGAGTTGATGATATCACAAGAAAAATGAAACGCGATGGATGGCCCGCTGTGTGTATTCATGGTGATAAGTCACAAAATGAACGTGACTGGGTATTACAAGATTTCCGTAGTGGAAAAGCACCCATTCTTGTAGCTACAGATGTTGCTGCCAGGGGTTTAGATGTCGATGACGTAAAGTTTGTTATTAATTTTGACTATCCTAGTAACTCTGAAGATTATGTGCATAGAATTGGAAGAACGGGTCGAACTAATAAAACCGGAACTGCTTATACGTTTTTCACACCATCGAATGCGGCAAAGGCGGCAGATTTGGTTTCAGTGCTAAAAGAAGCCAAACAAGTCGTGAATCCTAAATTACAAGAGTTAGCCGAACGCGGCGGTGGCGGTGGACGAAGACACCGTGGTCGTGGCGGTAGATATCGCAGAGGGGGACGCCGTTCGAGGTCCCGTTCTCGCTCGCGTGATCGTCGTCGGCGTTCACGTACACGATCCCGTTCCCGAGACCGCCGTCGTCGCAGACACAGCTCCTCGCGTTCATCACGCAGCAGGTCGTCGAGATCGTCGAGAAGCCATTCACGCTCGCGCTCCAGATCAAGAAGCCGCTCTCGCTCAGGCAAGTGCAGCCCACTGAAGGATAATACTGTTGGGCCTCAGCCAGCACCCCAGGCACTTCTACCGACGCCGAAACCACTTCTGCCTACCCCGATCGGTCCGCAGCTACCTCCCCCACATTTCGATAAACACACTAGTAAAAATTCTTTACCGCCTTCTAATGGTGACGACGAATCTCGCTCAAATAAAGATCTCTCCCAATGTTATAATAAAACTAACAGTCACAATAGTAGCAATAACAAACAAACCCACAATGACAACAGGTTACAGCAACAGCAACCTGTCAACTCAATTCCGCCTTTGATGGCCATAAATCCTCAGATGAATGTATGTGTTCCGCCACCACCTCTTAATGGACAAGGCTTTGTTATGCCTCCATATTTTCCTTCCGACCAATATGGAATGATGATGCCTAATTTCGGCCCAAATCCTATGCTCAATGGACACAGTTGGGGAGCTCCGCCTCCTCCGCCTCCACCCCCTCCTCCGTCTTCAGATCCCTCTAACGGGTCTCAAAATAACTATAATTATGGTCAAAGCGGCTCGGGACAAAGTAGCCTGGAGTCAGATTCCAGAAAGCGGGGTGGCCGTTCTGGTGGCCTCGGAAGTTCGAGTTCATACGGCTTGGGCTCTGGAAGTGGTGGCCTTAGCTCTGCTGGAGGTGCCGGTTCCTGCGGCAGGGGCCTGGGCCTTGCCGATGGCCCCACTCAAGGTGGGGGCCTCGGCACTGGTGGCCTAGGGTCCGATAGCCTCTACGGAGGTGGCGGTGGCCTCGGCTCCGACGACGGCCAAGGCAGTCAACGGTCGCGCGGCGGCCGCGATCGGCGGCGACGAGGCAGAGGACGGGACTACGACGACCACAACTCGGATAATCCGAGCGGTGGTCTCGGCTCTTACGGCTCGGGCGGCTTCGAAGGTGGCCTGGGCCAGGCCTCTGGCGGTCCCAGTGTGCCCCATGGGAGCCTATTGCCGCGCATGCTGCCGCAGAATACTGGTGACTTCGGTGGACAACAAAACGCTAACTTTACCGCTTTCGGTTCTTATGGGCCTAAGTTTAAAAAAAATCAGGGTGGTTACGATAACGGAGATTACGACGAGGGTCCCGTTAACGGGGTCGAATATTACGGCCACCAAAATATGGGACCAGGCCGACCGTTAAATTCTAACATGGATCGAGGTGTTTTTAACGATGCGCAGGCATACGGCTCCATGGGGTACAGAAATGACCGCCAGGGCAATCGCCAACGGCGGTGA

Protein sequence:

>DPOGS214267-PA
MGRGRDRSRDRRRSRSRSRSRSRSRSPRYGLGGGGGGGGYSGGGRRSNPGANLRKPKWDLNRLKPFKKDFYVPHPDVESRLESDVEAWRSENEITLKGRNIPKPTLTFDEAGFPDYVMDEIDKMGFSKPTPIQAQGWPIALSGCDMVGIASTGSGKTLSYILPAIVHINNQPKSSRGDGPIALVLAPTRELAQQIQEVCDKFANTSKIHNTCLFGGAPKGPQARDLDAGVEIVIATPGRLLDFLESGRTNLKRCTYLVLDEADRMLDMGFEPQIRKIIEQIRPDRQTLMWSATWPREVQSLAAEFLKDYLQINVGSLQLAANHNILQIIDVCMEYEKETKLSTLLKEIMAEKENKTIIFIETKRRVDDITRKMKRDGWPAVCIHGDKSQNERDWVLQDFRSGKAPILVATDVAARGLDVDDVKFVINFDYPSNSEDYVHRIGRTGRTNKTGTAYTFFTPSNAAKAADLVSVLKEAKQVVNPKLQELAERGGGGGRRHRGRGGRYRRGGRRSRSRSRSRDRRRRSRTRSRSRDRRRRRHSSSRSSRSRSSRSSRSHSRSRSRSRSRSRSGKCSPLKDNTVGPQPAPQALLPTPKPLLPTPIGPQLPPPHFDKHTSKNSLPPSNGDDESRSNKDLSQCYNKTNSHNSSNNKQTHNDNRLQQQQPVNSIPPLMAINPQMNVCVPPPPLNGQGFVMPPYFPSDQYGMMMPNFGPNPMLNGHSWGAPPPPPPPPPPSSDPSNGSQNNYNYGQSGSGQSSLESDSRKRGGRSGGLGSSSSYGLGSGSGGLSSAGGAGSCGRGLGLADGPTQGGGLGTGGLGSDSLYGGGGGLGSDDGQGSQRSRGGRDRRRRGRGRDYDDHNSDNPSGGLGSYGSGGFEGGLGQASGGPSVPHGSLLPRMLPQNTGDFGGQQNANFTAFGSYGPKFKKNQGGYDNGDYDEGPVNGVEYYGHQNMGPGRPLNSNMDRGVFNDAQAYGSMGYRNDRQGNRQRR-