Monarch geneset OGS2.0

DPOGS200832
TranscriptDPOGS200832-TA1422 bp
ProteinDPOGS200832-PA473 aa
Genomic positionDPSCF300071 - 439032-442485
RNAseq coverage692x (Rank: top 19%)
Annotation
HeliconiusHMEL0126330.081.14% 
BombyxBGIBMGA009885-TA0.084.18% 
DrosophilaCG9253-PA0.073.95% 
EBI UniRef50UniRef50_Q9H0S40.072.97%Probable ATP-dependent RNA helicase DDX47 n=41 Tax=Amniota RepID=DDX47_HUMAN
NCBI RefSeqXP_969791.10.080.76%PREDICTED: similar to GA21647-PA [Tribolium castaneum]
NCBI nr blastpgi|3503989970.080.51%PREDICTED: probable ATP-dependent RNA helicase DDX47-like [Bombus impatiens]
NCBI nr blastxgi|910881150.082.50%PREDICTED: similar to GA21647-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055242.1e-51ATP binding
GO:00080262.1e-51ATP-dependent helicase activity
GO:00036762.1e-51nucleic acid binding
GO:00043867.7e-32helicase activity
KEGG pathway 
InterPro domain[56-254] IPR0140013.7e-60DEAD-like helicase
[62-228] IPR0115452.1e-51DNA/RNA helicase, DEAD/DEAH box type, N-terminal
[290-371] IPR0016507.7e-32Helicase, C-terminal
Orthology groupMCL12130 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200832-TA
ATGACGGAAGATAAAGAAAAAAGTGGTAGTGAAAAGAGTGATAGTGAGGATCAGGGCTCCGACGAAGAACAGCAAGTTGAAATTAGAGAGAACCATGAAGAAGACACAGGGACTTTTCAAGAACTTGGTGTCGTTGACGTTCTGTGTGAAGCTTGTGCAGAGCTGAAATGGAAACATCCATCCAAGATCCAAAAAGAAGCTATACCTGTAGCATTACAGGGTAAAGACATAATTGGCTTAGCGGAAACTGGTTCAGGCAAAACAGGAGCATTTGCTTTACCCATACTCCAAGCGCTGTTGGAAAATCCACAAAGATACTTTGCTCTGATTCTCACACCCACAAGAGAATTGGCATTCCAAATATCTGAGCAATTTGAAGCCCTTGGTGCCAGTATAGGTGTGAAGTGTGCAGTAATAGTTGGAGGTATGGACATGGTGGCACAAGCTCTCATCCTGTCTAAGAAGCCTCACATCATCATCGCCACCCCTGGCCGTCTAGTGGACCACTTGGAGAATACAAAGGGGTTCAATTTAAAGGCACTTAAATATCTTGTGATGGATGAAGCAGATAGAATATTAAACATGGACTTTGAAGTGGAAGTTGATAAAATACTACGCGTCATTCCTCGCGAACGCCGCACTTACCTGTTCTCGGCGACCATGACCAAGAAAGTGCAAAAGTTACAGCGAGCCTCCCTTCAAGACCCTGTCAAGGTGGAAGTTTCCACTAAATATCAAACTGTGGAGAAGCTGCAGCAGTACTACATATTCATACCCGTGAAGTTCAAGGACGTATACCTGGTTCACATCCTGAACGAGCTGGCGGGCAATTCGTTCATAGTGTTCGTGTCGACATGCGCGGGCGCATTGCGTGTGGCGTTGTTGCTGCGAGCGCTGGGAGTGGGCGCTGTGCCGCTACATGGACAGATGTCGCAACAGAAAAGATTGGCCGCACTTAATAAGTTTAAAAGCAAAGCTCGCTCGGTGCTGATATGTACTGATGTCGCTTCTAGAGGTCTGGACATCCCTCATGTCGACGTGGTAGTCAATTTGGACATTCCTTTGCACTCCAAGGACTATATACATCGTGTTGGAAGAACTGCACGTGCGGGTAGAGCTGGGAAAGCTATTACTTTTGTTTCACAGTATGACGTGGAGTTGTACCAGCGTATAGAACAGCTGATCGGCAAACAACTGCCGCTGTACAAGACGGACGAGAACGAGGTCATGGTGCTGCAGGAGAGAGTCGCCGAGGCGCAGAGGCTTACTAAGATTGAAATGAAAGAGTTGGAAGACAAGAAAGGCTCCAGAGGAAAGAAACGCGGCGCCGACAGCGACGACGACACCGAGGAGGCGGTCGGAGTCAGGAGGCGGATCAAGGGCAAGAACAAGAACAAACATGGCGGGAAGAGGAAACGATGA

Protein sequence:

>DPOGS200832-PA
MTEDKEKSGSEKSDSEDQGSDEEQQVEIRENHEEDTGTFQELGVVDVLCEACAELKWKHPSKIQKEAIPVALQGKDIIGLAETGSGKTGAFALPILQALLENPQRYFALILTPTRELAFQISEQFEALGASIGVKCAVIVGGMDMVAQALILSKKPHIIIATPGRLVDHLENTKGFNLKALKYLVMDEADRILNMDFEVEVDKILRVIPRERRTYLFSATMTKKVQKLQRASLQDPVKVEVSTKYQTVEKLQQYYIFIPVKFKDVYLVHILNELAGNSFIVFVSTCAGALRVALLLRALGVGAVPLHGQMSQQKRLAALNKFKSKARSVLICTDVASRGLDIPHVDVVVNLDIPLHSKDYIHRVGRTARAGRAGKAITFVSQYDVELYQRIEQLIGKQLPLYKTDENEVMVLQERVAEAQRLTKIEMKELEDKKGSRGKKRGADSDDDTEEAVGVRRRIKGKNKNKHGGKRKR-