Monarch geneset OGS2.0

DPOGS210654
TranscriptDPOGS210654-TA1842 bp
ProteinDPOGS210654-PA613 aa
Genomic positionDPSCF300401 + 93318-99224
RNAseq coverage335x (Rank: top 34%)
Annotation
HeliconiusHMEL0043075e-8140.14% 
BombyxBGIBMGA001799-TA0.085.92% 
Drosophilaabs-PA0.072.54% 
EBI UniRef50UniRef50_Q9V3C00.072.54%ATP-dependent RNA helicase abstrakt n=34 Tax=Eukaryota RepID=DDX41_DROME
NCBI RefSeqXP_392069.30.074.15%PREDICTED: similar to ATP-dependent RNA helicase abstrakt (DEAD box protein abstrakt) isoform 1 [Apis mellifera]
NCBI nr blastpgi|3838585650.073.99%PREDICTED: ATP-dependent RNA helicase abstrakt-like [Megachile rotundata]
NCBI nr blastxgi|2700142040.073.70%hypothetical protein TcasGA2_TC016289 [Tribolium castaneum]
Group
Gene OntologyGO:00055246.4e-45ATP binding
GO:00080266.4e-45ATP-dependent helicase activity
GO:00036766.4e-45nucleic acid binding
GO:00043865.9e-34helicase activity
KEGG pathway 
InterPro domain[194-405] IPR0140016.3e-51DEAD-like helicase
[199-378] IPR0115456.4e-45DNA/RNA helicase, DEAD/DEAH box type, N-terminal
[440-521] IPR0016505.9e-34Helicase, C-terminal
Orthology groupMCL14781 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210654-TA
ATGTCGGAACCTCAAGTGAAACGATATAGAAGAGAAGAGAAATCGTCAGAATCTGAAGAAGATATAGATAATTACGAGCCATACGTGCCTGTGAAAGATAGGAAAAAGCAGAAATTGTTGGTGTTGGGTCGCCTTGGCCAGTTGGCTGCTGAGGCTGTAGCTGAAACCAAGAGCTCCAGCGAGAACGACCCAGAAGACGAGGGATCACAGGAAGAATGGGGCAGACGTTACAATGTGTCATTGCTGGATCAACACAGTGAACTGAAGAGGTTGGCGGAGGCGAGGGCCCTCTCTGCAGCGGAGAGACAGGCTAAGGAGGAAGAACATATACTTGATAGTGTAGCACAGAGCAAGGCTTTGATGGGAGTCGCTGAATTGGCTAAAGGTATTCAATACTCGGAACCTATAAAGACGTCGTGGAAACCTCCTGGCTGTATCAGCTCCTTACCTCCAGAGCGTCACGAGAGGGTGAGGAGGGAGCTGAGGATACAGGTGGAGGGTGAAAACGTCCCTCCACCTATAAGGACTTTCAGACATATGAAATTTCCCAAAGGTATTCTCCAAGGTCTGGAGGCGAAGGGCATAAAGAAGCCCACGCCCATCCAAGTTCAAGGTATCCCAGCGGTTCTGAGCGGTCGTGACATGATCGGTATAGCCTTCACCGGCTCCGGGAAGACGCTGGTCTTCACGCTGCCCATCATCATGATGTGTCTGGAACAGGAGATAGAGATGCCGTTCATCAGAAACGAAGGTCCCTACGGCCTCATAATATGCCCGTCCAGGGAACTCGCCAAGCAGACCCATGATATCATAATGCATTTCGTTAAACATCTCAAAATGGCGGGACATCCGGAGATAAGGAGCTGTCTGGCGATCGGTGGTGTGTCCGTCTCCGAGTGCATGGAGGTGGTGCAGCGAGGAGTTCACATCATGGTGGCTACACCTGGAAGATTAATGGACATGCTGGATAAGAAGATGGTCCGACTGAACGTGTGCCGCTACCTGTGTATGGACGAAGCTGATCGGATGATAGACATGGGCTTTGAAGAGGACGTGCGGACCATCTTCTCCTACTTCGCCGGGCAGCGACAGACTTTACTGTTCAGCGCCACCATGCCCAAGAAGATACAGAACTTCGCCAGGTCAGCGTTGGTCCAACCGGTGACGTTGAACGTGGGTCGCGCCGGGGCCGCGGCGCTCGCGGTCCGCCAGGAGCTGGAGCCCGTGAAGGCCGAGGCGAGGACGGTCCACCTGCTGCAGTGTCTCCAGAAAACGCCTCCTCCCGTCCTAGTGTTCGCTGAACGGAAACAGCACGTGGACGCTATACATGAGTACCTGCTGCTCAAGGGGGTGGAGGCCGTGGCGATACACGGCGGCAAGGACCAGGAGGAGAGGTCCCGGGCCGTGGAGGCCTTCAGGAGGGGCGAGAAGGACGTGCTCGTGGCTACCGATGTTGCCAGTAAAGGTCTCGACTTCGAGAACATCCAGCACGTGATCAACTACGACATGCCGGAGGATATCGAAAACTACGTCCACAGGATAGGTCGTACTGGGCGGGCGGGGACACAGGGCGTAGCGAGCACATTACTAGGGCGGGCGGCGGATTCTAGCGTTCTACGAGATTTGGCCCATTTACTGGTCGAAGCTGGGCAAAAAGTCCCGCAATTTCTTCTAGAGATGATCGGTGAGGACGGGCCCCTGTCCGGGGGACCTGGCTGCGCGTACTGCGGGGGCCTGGGACATAGGATCACTGAGTGCCCCAAGCTAGAGGCAGTCCAGAACAAGCAGGCCTCCAACATCGGCAGGAGGGACTACCTCGCCAACACAGCAGCAGACTATTGA

Protein sequence:

>DPOGS210654-PA
MSEPQVKRYRREEKSSESEEDIDNYEPYVPVKDRKKQKLLVLGRLGQLAAEAVAETKSSSENDPEDEGSQEEWGRRYNVSLLDQHSELKRLAEARALSAAERQAKEEEHILDSVAQSKALMGVAELAKGIQYSEPIKTSWKPPGCISSLPPERHERVRRELRIQVEGENVPPPIRTFRHMKFPKGILQGLEAKGIKKPTPIQVQGIPAVLSGRDMIGIAFTGSGKTLVFTLPIIMMCLEQEIEMPFIRNEGPYGLIICPSRELAKQTHDIIMHFVKHLKMAGHPEIRSCLAIGGVSVSECMEVVQRGVHIMVATPGRLMDMLDKKMVRLNVCRYLCMDEADRMIDMGFEEDVRTIFSYFAGQRQTLLFSATMPKKIQNFARSALVQPVTLNVGRAGAAALAVRQELEPVKAEARTVHLLQCLQKTPPPVLVFAERKQHVDAIHEYLLLKGVEAVAIHGGKDQEERSRAVEAFRRGEKDVLVATDVASKGLDFENIQHVINYDMPEDIENYVHRIGRTGRAGTQGVASTLLGRAADSSVLRDLAHLLVEAGQKVPQFLLEMIGEDGPLSGGPGCAYCGGLGHRITECPKLEAVQNKQASNIGRRDYLANTAADY-