Monarch geneset OGS2.0

DPOGS201182
Transcript: DPOGS201182-TA, 1680 bp
Protein: DPOGS201182-PA, 559 aa
Genomic position: DPSCF300262 - 31818-35633
RNAseq coverage: 525x (Rank: top 24%)
Annotation
Heliconius       HMEL017134        9e-157  82.09%
Bombyx           BGIBMGA014271-TA  0.0     84.47%
Drosophila       Hlc-PA            5e-170  52.87%
EBI UniRef50     UniRef50_Q6SC69   0.0     76.72%  RNA helicase n=20 Tax=Coelomata RepID=Q6SC69_CHOFU
NCBI RefSeq      XP_966623.1       0.0     64.07%  PREDICTED: similar to ATP-dependent RNA helicase DBP9 [Tribolium castaneum]
NCBI nr blastp   gi|42539171       0.0     76.72%  RNA helicase [Choristoneura fumiferana]
NCBI nr blastx   gi|42539171       0.0     77.25%  RNA helicase [Choristoneura fumiferana]
Group
Gene Ontology
  GO:0005524  8.6e-40  ATP binding
  GO:0008026  8.6e-40  ATP-dependent helicase activity
  GO:0003676  8.6e-40  nucleic acid binding
  GO:0004386  1.3e-20  helicase activity
KEGG pathway
  dpo:Dpse_GA14086  4e-162
  K01509 (E3.6.1.3)  maps-> Purine metabolism
InterPro domain
  [26-232]   IPR014001  3e-41    DEAD-like helicase
  [32-202]   IPR011545  8.6e-40  DNA/RNA helicase, DEAD/DEAH box type, N-terminal
  [269-381]  IPR001650  1.3e-20  Helicase, C-terminal
Orthology group
  MCL12302  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201182-TA
ATGAGTGAAGACAAAAAGGTTATGTTCCATGAAATGGAACTGGATGATCGATTATTAAAGGCTATATCTCAGTTGGGATGGCCTCACCCGACACTGATACAAGAAACGGCTATCCCATTGTTATTAGAGGGCAAAGATGTACTCATGAGAGCCAGGACGGGATCAGGCAAGACAGCAGCTTTCACAATACCTGTCATACAAAAGATTTTGAATCTTAAAAATACCAGTGCACACCAATGTATAAGAGCCCTTATATTATCTCCAAGTAAGGAGCTGTGCGGACAGATAACTTCTGTGGTTGGTCATTTAACACTGAAATGTGCAAGAGAAGTCCGTTGTATAGACATTTCCTCCAACGGTGACATGCAGATACAGAAGTCTTTACTGGCTGACAAGCCTGATATAGTAGTGTCCACACCATCACGAGTATTGGCCCACTTGAAGGCTAATAATGTAAGGTTGAAGGAGGATATAGCCATGTTGGTTGTGGATGAAGCCGATTTGGTATTCTCATTTGGTTATGAAAACGAAATTAAGGAACTTCTTGAACATTTGCCGAAGATATATCAAGCTGTTCTAGCCTCAGCTACACTTTCCGACGATGTTTTAAGTCTGAAAAAGATAGTCCTCAGAAATCCGGTGACATTAAAGCTCGAAGAACCAGAGCTGGCACCGTCTACACAATTACAGCATTATCATTTGTTTGCCGAAGAAGATGATAAAGCGGCCATACTCTATGCATTGCTGAAATTAAATCTCATCAGAGGAAAGACCATCATATTTGTTAGGACGGTTGACCGATGTTACAAATTGAAGTTATACTTGGAGCAGTTTAAAATCGGCTCATGTGTACTGAATTCTGAGTTGCCGGCGGCTGTGCGGTGTATGTCTGTGGAGCAGTTTAACAGAGGTCGGTACCAGATTATCGTGGCCTCTGACGAGAAGGCTTTGGAGGAACCGGACGGGGGCATGATGCTGGAGGAGACGGGCAAGAAGAAGCAAAAATCAAAACGTAGGAAAGACAAGGAGTCGGGCGTGTCCCGGGGCATCGATTTCCAGCATGTGTCTAACGTTATAAACTTCGACTTCCCCCTGGATGTGACGGCCTACGTGCATCGCGCGGGCCGGACGGCTAGAGGGACTAGTCAGGGCTCCGTGCTGTCGTTTGTGTCCATCAGAGAGAAACCGCTCATGAATGCTGTGAAAGAACATCTAACTAAATGTTTCAATGGGCAGAAAGTTTTACAGAAGTATTCTTTCGCGTTGGACGAGGTGGAGGGTTTCCGGTACCGGTCACGGGACGCGTGGCGCGCCGTCACACGGGTGGCCGTCAGGGAGGCGAGGCTGAGCGAGATCAGGAGGGAGCTGCTCAACTGCAAGAGACTACAGGGCTACTTCGAGGAGAACCCCACAGACCTGGCCGCCTTGAAGCGCGATAAGGCCCTTCACACCGTGCGCCTGCAGCCGCAGCTGGCTCACGTGCCGGAGTACCTCCTGCCGGCCGCGCTCCGGACGGACGGCCCCGAGCCCGAGCCGGCGGCGCCGGACGCGCCCCCCGCCAAGAAAAAGAAGCAGCAGAACTTCGGCAGCGTGAAGAGACATAAGTACCAAGCCCGGCAGCGAGATCCCCTCAAGAGCTTCGCGGTGAAGGCTGCCAGCGCGTCGCCGTCCAAGACCTAG

Protein sequence:

>DPOGS201182-PA
MSEDKKVMFHEMELDDRLLKAISQLGWPHPTLIQETAIPLLLEGKDVLMRARTGSGKTAAFTIPVIQKILNLKNTSAHQCIRALILSPSKELCGQITSVVGHLTLKCAREVRCIDISSNGDMQIQKSLLADKPDIVVSTPSRVLAHLKANNVRLKEDIAMLVVDEADLVFSFGYENEIKELLEHLPKIYQAVLASATLSDDVLSLKKIVLRNPVTLKLEEPELAPSTQLQHYHLFAEEDDKAAILYALLKLNLIRGKTIIFVRTVDRCYKLKLYLEQFKIGSCVLNSELPAAVRCMSVEQFNRGRYQIIVASDEKALEEPDGGMMLEETGKKKQKSKRRKDKESGVSRGIDFQHVSNVINFDFPLDVTAYVHRAGRTARGTSQGSVLSFVSIREKPLMNAVKEHLTKCFNGQKVLQKYSFALDEVEGFRYRSRDAWRAVTRVAVREARLSEIRRELLNCKRLQGYFEENPTDLAALKRDKALHTVRLQPQLAHVPEYLLPAALRTDGPEPEPAAPDAPPAKKKKQQNFGSVKRHKYQARQRDPLKSFAVKAASASPSKT-
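
Consistency check (illustrative sketch):

The listed transcript length (1680 bp) and protein length (559 aa) agree if the transcript is a complete coding sequence ending in a single stop codon, since 3 x (559 + 1) = 1680. The Python sketch below shows one way to check this against the two sequences above. It assumes the standard genetic code and assumes that the trailing "-" in the protein record marks the stop codon; neither is stated in the record. The `translate` helper and the variable names are hypothetical, and only the first and last few codons of DPOGS201182-TA are used to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, on the assumption that the
    trailing '-' in the protein record above stands for the stop codon.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six and last four codons of DPOGS201182-TA, copied from the record.
first_codons = "ATGAGTGAAGACAAAAAG"
last_codons = "TCCAAGACCTAG"

print(translate(first_codons))  # MSEDKK
print(translate(last_codons))   # SKT-

# Length check: 559 residues plus one stop codon, three bases each.
transcript_bp, protein_aa = 1680, 559
print(transcript_bp == 3 * (protein_aa + 1))  # True
```

Passing the full nucleotide sequence to `translate` and comparing the result with the DPOGS201182-PA string would extend the same check to the whole record.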