Monarch geneset OGS2.0

DPOGS207438
TranscriptDPOGS207438-TA1266 bp
ProteinDPOGS207438-PA421 aa
Genomic positionDPSCF300051 - 810234-813437
RNAseq coverage169x (Rank: top 51%)
Annotation
HeliconiusHMEL0123322e-17171.43% 
BombyxBGIBMGA009910-TA7e-15965.32% 
DrosophilaCG5589-PA8e-9143.74% 
EBI UniRef50UniRef50_E0VC782e-9552.44%Predicted protein n=10 Tax=Eukaryota RepID=E0VC78_PEDHC
NCBI RefSeqXP_975300.22e-10649.29%PREDICTED: similar to DEAD box ATP-dependent RNA helicase [Tribolium castaneum]
NCBI nr blastpgi|3838552212e-10748.03%PREDICTED: probable ATP-dependent RNA helicase DDX52-like [Megachile rotundata]
NCBI nr blastxgi|1892353293e-10550.12%PREDICTED: similar to DEAD box ATP-dependent RNA helicase [Tribolium castaneum]
Group
Gene OntologyGO:00055242.1e-38ATP binding
GO:00080262.1e-38ATP-dependent helicase activity
GO:00036762.1e-38nucleic acid binding
KEGG pathway 
InterPro domain[153-357] IPR0140012e-47DEAD-like helicase
[158-328] IPR0115452.1e-38DNA/RNA helicase, DEAD/DEAH box type, N-terminal
Orthology groupMCL11771 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207438-TA
ATGGATGCTTACGATATATTTAAGAAACTGACCAAGGGTTTGACTTTTAAGCATCGCGTCTTGGGAGTAAAAAACAAAGATGAGCCTATTAAATCAAGTTTAAAAAATGAAGTTATCAAAGAAGAACGTGATATTAAAGAGGAGATTGAGAGTTGTTCAGATAATGAAGAGTCAAGTGTACCCCAGACGGATTCAGAACAAGATGTCAGTGACCCCGAACTACAAATTATAGAAGGTGACAAGGGTCAGTCGACAAGAAAAGAAAAGAAAAAGAAATCAAAATTAACTCCTGAAGAGTTACAAAAGAAGCTGGAACTAGAAGAGCAAAACCATTTCCGTAATGAGCATGGGATAAAAGCAGTTGGCCGTCACATTCCTAACGCATTAAGAGACTTCAACGAACTGATTACAAAGTATAATATATCCCCGTCAATGGTGGAAGTGCTTTTACAGTGTGGTTACTCTGAACCCACACCCATTCAACGTCAAGCTGTACCATGTTTATTGGAGAATCGTCAAATCGTAGCCTGTGCTCCGACTGGATCTGGTAAGACAGCAGCATTCTTAATGCCTCTCTTACACATGCTCGGAGCTCCTCAAGGCGGTCCGAGAGCCCTGGTGCTTTGTCCCACCAGAGAACTGGCAAACCAGATATACCGCGAGGCCATAAGACTCTCGGCATCCACTCAGTTGAGATGCTCAGTCATCAGGAGTCTCAAGGAAAGTAAGATCAAGGAGCGAGAAGCTACGATAAGGAAGAGTGATTTAGTGATAAGTACACCGAATCGTCTCTGTTACTTGCTGAAACAGGAAACAGTGGGAATAAACATGGACAAAGTACAATGGCTGGTCATAGATGAAGCTGACAAGTTATTCGAGGGCTCCCAGGAGGAGGTGGACACCTTCCGCCAACAGCTGGACATTATCCTCAGCAGCTGCAAGTCCCGCCTGGCCATGTTCAGCGCGACACACACGCCATCTATCGCCAAGTGGGCGAGACACAACATGAGAGGGCTCATTAACATCACCGTCGGACACAGGAACGCAGCTTCCTCATCGGTGGAGCAGGAACTTCTGTTCTGTGGCAACGAGAGTGGGAAGCTGGTGGCTTTCAGACAACTCATTCAGAAGGGTCTCAAACCCCCCGTACTAGTGTTCGTCCAAAGTAAAGATCGCGCCAAAGAGCTGTTCAAGGAACTATTATACGACGGGATCCAGGTGGACGTCATACACGGAGACAGGACACAGGCACAGGTGGAGTTATAG

Protein sequence:

>DPOGS207438-PA
MDAYDIFKKLTKGLTFKHRVLGVKNKDEPIKSSLKNEVIKEERDIKEEIESCSDNEESSVPQTDSEQDVSDPELQIIEGDKGQSTRKEKKKKSKLTPEELQKKLELEEQNHFRNEHGIKAVGRHIPNALRDFNELITKYNISPSMVEVLLQCGYSEPTPIQRQAVPCLLENRQIVACAPTGSGKTAAFLMPLLHMLGAPQGGPRALVLCPTRELANQIYREAIRLSASTQLRCSVIRSLKESKIKEREATIRKSDLVISTPNRLCYLLKQETVGINMDKVQWLVIDEADKLFEGSQEEVDTFRQQLDIILSSCKSRLAMFSATHTPSIAKWARHNMRGLINITVGHRNAASSSVEQELLFCGNESGKLVAFRQLIQKGLKPPVLVFVQSKDRAKELFKELLYDGIQVDVIHGDRTQAQVEL-