Monarch geneset OGS2.0

DPOGS209762
Transcript        DPOGS209762-TA  1155 bp
Protein           DPOGS209762-PA  384 aa
Genomic position  DPSCF300314 + 53220-58980
RNAseq coverage   1x (Rank: top 94%)
Annotation
Heliconius      HMEL011907        8e-121  60.42%
Bombyx          BGIBMGA010171-TA  3e-64   36.10%
Drosophila      CG5205-PA         2e-68   37.60%
EBI UniRef50    UniRef50_D6WVU9   1e-110  50.65%  Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WVU9_TRICA
NCBI RefSeq     XP_970333.2       2e-111  50.65%  PREDICTED: similar to HFM1 protein [Tribolium castaneum]
NCBI nr blastp  gi|189240009      4e-110  50.65%  PREDICTED: similar to HFM1 protein [Tribolium castaneum]
NCBI nr blastx  gi|270011790      9e-107  50.65%  hypothetical protein TcasGA2_TC005866 [Tribolium castaneum]
Group
Gene Ontology  GO:0005524  1.5e-23  ATP binding
               GO:0008026  1.5e-23  ATP-dependent helicase activity
               GO:0003676  1.5e-23  nucleic acid binding
               GO:0004386  1.8e-16  helicase activity
KEGG pathway 
InterPro domain  [4-165]    IPR011545  1.5e-23  DNA/RNA helicase, DEAD/DEAH box type, N-terminal
                 [3-196]    IPR014001  4.1e-17  DEAD-like helicase
                 [269-359]  IPR001650  1.8e-16  Helicase, C-terminal
Orthology group  MCL18043  Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209762-TA
ATGGATGACGCGTTGTATTCAAATAAATCTATGGTGGTATGCGCCCCGACCGGGTCAGGCAAGACAGTTATATTCGAAATGGCCATAGTACAGTTGTTAATGGAACTTGAAGATAAAAACTGCAATGACGATTACAAGATTATTTACATGGCCCCTGTTAAGGCTCTGTGCACAGAACGTATTACCGAATGGTATTCAAAGTACATGAAGCTTGGCCTCCTATGTATAGAAGTAACTGGCGACACCGACGTAGACTTCTCACAGTTACAACCCTACAGAATAATAATAACAACACCCGAAAAATGGGATTTGATAACGAGACGTTGTAACGACCTGTCCCTCGTGAAGCTCTTCCTTATAGACGAGGTACATTCCTTGAATGACGAATCGAGAGGGCCCGTCCTAGAAGCTGTTGTTAGTAGAGTGAAGACTGTACAGTGTTCAATACAATGGTGCGTCCGTCTTGTGGCTGTGTCCGCTACCATCAGCAACCCTGAAGATGTAGCCACTTGGCTCGGGGGATCGCAGGCTGTACACTATAAATTTGGTGACGAGTGTCGACCAGTTAAGTTGAACCGCGTAGTGGAGGGTTATCCGTGCTCGCCAGGGACCAGCATATTCAAGTTCGACATTATTCTAAACTACAAACTTTGGCCGGTCATACAGAAATATTACAACGGGAAACCAACTTTGATATTTTGCAACACTAGAAAAAGCGTGATGTTCACCGCTGAAACACTGTCCAAAGAGATTACTGTTGGCTTCAGTCCTGACCAAAGAGCGAAACTAACAACCATAGCGTCTTCAATAAGAAACAAGAAATTACAGTCGTTGGTGTTGTCCGGCGTGGGTTGCCATCACGCCGGTCTCTTACTAGACGAAAGGAACGCCATTGAACGTGCGTTCAGGAACAGAGATCTTCCTATACTCATAACAACAACAACATTAGCCATGGGCGTTAATTTACCAGCTCATCTTGTTATTATTAAGAACACACAGCAATATGTCAATGGCGCTTATAAGGAGTACAGTATAAGTACTGTGTTACAAATGATCGGTAGAGCGGGCAGACCGCAGTACGATCGTGAGGCCACCGCCGTTATTATGACCAGACTCCAGGATAAGGTAAGTATAACTATGATTACGTATATATAA

Protein sequence:

>DPOGS209762-PA
MDDALYSNKSMVVCAPTGSGKTVIFEMAIVQLLMELEDKNCNDDYKIIYMAPVKALCTERITEWYSKYMKLGLLCIEVTGDTDVDFSQLQPYRIIITTPEKWDLITRRCNDLSLVKLFLIDEVHSLNDESRGPVLEAVVSRVKTVQCSIQWCVRLVAVSATISNPEDVATWLGGSQAVHYKFGDECRPVKLNRVVEGYPCSPGTSIFKFDIILNYKLWPVIQKYYNGKPTLIFCNTRKSVMFTAETLSKEITVGFSPDQRAKLTTIASSIRNKKLQSLVLSGVGCHHAGLLLDERNAIERAFRNRDLPILITTTTLAMGVNLPAHLVIIKNTQQYVNGAYKEYSISTVLQMIGRAGRPQYDREATAVIMTRLQDKVSITMITYI-
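
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and assuming that the trailing "-" in the protein record marks the stop codon. The names `translate`, `HEAD` and `TAIL` are illustrative and not part of the database; `HEAD` and `TAIL` are the first 66 and last 18 nucleotides of DPOGS209762-TA copied from the record.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(nt):
    """Translate a nucleotide string in frame 0; '*' marks a stop codon."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3  # ignore any incomplete trailing codon
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


# First 66 nt and last 18 nt of DPOGS209762-TA, copied from the record above.
HEAD = "ATGGATGACGCGTTGTATTCAAATAAATCTATGGTGGTATGCGCCCCGACCGGGTCAGGCAAGACA"
TAIL = "ATGATTACGTATATATAA"

# Lengths reported in the record: 1155 bp transcript, 384 aa protein.
TRANSCRIPT_BP = 1155
PROTEIN_AA = 384

# 384 coding codons plus one stop codon should account for the full transcript.
lengths_consistent = TRANSCRIPT_BP == 3 * (PROTEIN_AA + 1)

print(translate(HEAD))
print(translate(TAIL))
print(lengths_consistent)
```

Under these assumptions the translated head and tail should agree with the start and end of DPOGS209762-PA, with "*" standing in for the "-" that closes the protein record.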