Monarch geneset OGS2.0

DPOGS208171
TranscriptDPOGS208171-TA1947 bp
ProteinDPOGS208171-PA648 aa
Genomic positionDPSCF300207 - 183319-185265
RNAseq coverage608x (Rank: top 21%)
Annotation
HeliconiusHMEL0157180.074.45% 
BombyxBGIBMGA010259-TA0.075.47% 
DrosophilaCG7878-PA5e-17648.39% 
EBI UniRef50UniRef50_E9IRZ50.054.95%Putative uncharacterized protein (Fragment) n=1 Tax=Solenopsis invicta RepID=E9IRZ5_SOLIN
NCBI RefSeqXP_001602397.10.055.92%PREDICTED: similar to DEAD box ATP-dependent RNA helicase [Nasonia vitripennis]
NCBI nr blastpgi|3071883100.058.51%Probable ATP-dependent RNA helicase DDX43 [Camponotus floridanus]
NCBI nr blastxgi|3071883100.056.75%Probable ATP-dependent RNA helicase DDX43 [Camponotus floridanus]
Group
Gene OntologyGO:00055241.1e-45ATP binding
GO:00080261.1e-45ATP-dependent helicase activity
GO:00036761.1e-45nucleic acid binding
GO:00043865.6e-29helicase activity
GO:00037232e-12RNA binding
KEGG pathwayssc:1001551742e-154 
 K12823 (DDX5, DBP2)maps-> Spliceosome
InterPro domain[259-462] IPR0140011.9e-61DEAD-like helicase
[265-434] IPR0115451.1e-45DNA/RNA helicase, DEAD/DEAH box type, N-terminal
[499-580] IPR0016505.6e-29Helicase, C-terminal
[70-129] IPR0181112e-12K Homology, type 1, subgroup
[66-134] IPR0040873.8e-12K Homology
Orthology groupMCL12008 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208171-TA
ATGGCGGACGATTGGGAAGAGAGTGAACCTGTTGTTGTGGTACCGGCATCAACTACCTACATCCAAAATTCACGTGTGTTTCATCAAACTGGTAGAGGAAGAGGAAGAGGAGGATGGAGGACTCCCAGTGACGATTCCAATCAGAGATTCCGTGGCCAAGACAACAGGGATAACCGCCGAAGTTTCAACAATGATCACAATAGTAAAAAAATTATAACGGTGTCATCAGATAAAGTTGGTCGCATTATTGGGAAAGGTGGAAATAAAATACGTGATCTCGAATATGAATCCGAGGCTAAGATAAGGATTGGGGACTCCAGCGGCAGTTTGACCTCCATAACACTGTTCGGTTCCCCTGAAGCGACCGCTAAGGTCGAAGATATGATAAATAAACTGATCGAGGATCGTAAACCGCCAACGAGGGAACATTTTAACAACAGTGACACTGGAAGTGGTTCTAATGATTTTATGAAACAAAGTCAAGACGGTGTTGAGATTATTGATTGGGATAAATTAAATGCTGCACATGACGAAGAGCAAAAAAAGAGATGGGATAGCTTACCTCCAATCATTAAAGATTTCTATAAGGAAGATCCCACTGTGGCCGGTATGACCCCAGCAGATGTAACTCGCTGGAGGTTGGCTAACCATGACATTCAAGTAAAAAGAACATTTGATGACAAACCAGAGCTCCGTCCCATTCCCAATCCTGTTTTAACATTTGAACAGGCATTTCATCAATATCCAGAAATTTTGGAAGAAATTTATAAGCAAGGCTTTAAGCAACCATCTCCAATTCAGAGTCAAGCTTGGCCAATTTTGCTCAGGGGTGATGATATGATTGGTATAGCACAAACTGGAACTGGGAAGACATTAGCTTTTCTGTTGCCTGCTTTAATACATATTGATGGACAAACCATTCCGAGGGAAGAGAGAGAAGGTCCAACTGTATTAATATTAGCACCAACTCGGGAACTCGCTCTGCAGATTGAGAAGGAAACATTAAAATACCAATACAAGGGAATAACTTCTGTCTGCCTATACGGCGGTGGAGATAGAAAGGAACAAATAAAAATGTGTAAAGGTGGTGTAGATATTGTCATTGCTACACCAGGAAGATTGAATGATCTTGTCTTGGCTCGTCATCTGAATATAATAAATTTTTCATACATCGTTCTTGATGAAGCAGACAGAATGTTGGATATGGGTTTTGAACCACAAATCAGAAAGTCCTTATATGATGTGAGACCAGACCGCCAAACTGTTATGACATCTGCCACTTGGCCTGCAGGTGTACGTCGTCTAGCCGAGTCATATATGAAAGATCCAATACAGGTAAATGTAGGATCTTTAGACCTTGCAGCTGTACACACTGTAACTCAGAAAATTGTGTTCCTCGAGGAAGATGACAAGGAGGCAGCTCTTTTTGAATTCATTCAAAACATGGATAAAAATGATAAAGTCATCATATTCTGTGGGAAGAAAGCTACAGCCAGGCATATTTCGACGGAACTCTGCTTAAAAGGCATTGAGTGTCAATCTCTTCATGGGGATAGGGAGCAAATTGACCGGGAAGCTGCTTTGGAAGAAATGGTTGATGGCACAGTTAATATTCTAGTAGCTACTGATGTAGCTTCTAGGGGCATTGATATAAAAGACTTAACACATGTTGTCAACCTGGATTTCCCCCGTCACATTGAAGAGTATGTCCACAGAGTTGGTAGAACTGGAAGAGCTGGAAAGACAGGCATATCTCTGTCTTTCATAACAAGACAGGACTGGGCTCACGCTCAGGATTTGATAAAAATCTTAGAAGAAGCTAATCAAGAAATTCCAGATGAGTTGTTATCTATGGCTAACAGATTTGAGGCTATGAAAATTCGTAGGGAACAAGAGGGTGGGGATAGAAGGGGCCGAGGTGGAAGGGGTAGGAGGGGTAGATACTGA

Protein sequence:

>DPOGS208171-PA
MADDWEESEPVVVVPASTTYIQNSRVFHQTGRGRGRGGWRTPSDDSNQRFRGQDNRDNRRSFNNDHNSKKIITVSSDKVGRIIGKGGNKIRDLEYESEAKIRIGDSSGSLTSITLFGSPEATAKVEDMINKLIEDRKPPTREHFNNSDTGSGSNDFMKQSQDGVEIIDWDKLNAAHDEEQKKRWDSLPPIIKDFYKEDPTVAGMTPADVTRWRLANHDIQVKRTFDDKPELRPIPNPVLTFEQAFHQYPEILEEIYKQGFKQPSPIQSQAWPILLRGDDMIGIAQTGTGKTLAFLLPALIHIDGQTIPREEREGPTVLILAPTRELALQIEKETLKYQYKGITSVCLYGGGDRKEQIKMCKGGVDIVIATPGRLNDLVLARHLNIINFSYIVLDEADRMLDMGFEPQIRKSLYDVRPDRQTVMTSATWPAGVRRLAESYMKDPIQVNVGSLDLAAVHTVTQKIVFLEEDDKEAALFEFIQNMDKNDKVIIFCGKKATARHISTELCLKGIECQSLHGDREQIDREAALEEMVDGTVNILVATDVASRGIDIKDLTHVVNLDFPRHIEEYVHRVGRTGRAGKTGISLSFITRQDWAHAQDLIKILEEANQEIPDELLSMANRFEAMKIRREQEGGDRRGRGGRGRRGRY-