Monarch geneset OGS2.0

DPOGS208980
Transcript        DPOGS208980-TA  2571 bp
Protein           DPOGS208980-PA  856 aa
Genomic position  DPSCF300009 + 1268833-1273521
RNAseq coverage   51x (Rank: top 70%)
Annotation
Heliconius      HMEL003895              0.0     67.36%
Bombyx          BGIBMGA008087-TA        0.0     65.14%
Drosophila      CG11403-PA              3e-155  35.62%
EBI UniRef50    UniRef50_UPI0001791E1F  0.0     41.82%  UPI0001791E1F related cluster n=1 Tax=unknown RepID=UPI0001791E1F
NCBI RefSeq     XP_001949260.1          0.0     41.82%  PREDICTED: similar to DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 11 (CHL1-like helicase homolog, S. cerevisiae) [Acyrthosiphon pisum]
NCBI nr blastp  gi|193608349            0.0     41.82%  PREDICTED: probable ATP-dependent RNA helicase DDX11-like [Acyrthosiphon pisum]
NCBI nr blastx  gi|350399366            0.0     42.05%  PREDICTED: probable ATP-dependent RNA helicase DDX11-like [Bombus impatiens]
Group
Gene Ontology  GO:0016817  1.5e-141  hydrolase activity, acting on acid anhydrides
               GO:0016818  1e-62     hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides
               GO:0004003  1e-62     ATP-dependent DNA helicase activity
               GO:0005524  6.4e-58   ATP binding
               GO:0006139  6.4e-58   nucleobase, nucleoside, nucleotide and nucleic acid metabolic process
               GO:0008026  6.4e-58   ATP-dependent helicase activity
               GO:0003676  6.4e-58   nucleic acid binding
               GO:0003677  3.7e-41   DNA binding
KEGG pathway 
InterPro domain  [143-837]  IPR013020  1.5e-141  DNA helicase (DNA repair), Rad3 type
                 [3-396]    IPR006554  1e-62     Helicase-like, DEXD box c2 type
                 [665-813]  IPR006555  6.4e-58   Helicase, ATP-dependent, c2 type
                 [190-374]  IPR010614  3.7e-41   DEAD2
Orthology group  MCL13788  Single-copy universal gene

Nucleotide sequence:

>DPOGS208980-TA
ATGAATTCACATTTTTCTTTCCCTTTTGAACCTTACGATATTCAAGTAAAATTTATGACCGACCTATATTCGACAATTGAGAACAAAAAACTTGGAATATTCGAAAGTCCAACAGGAACGGGTAAATCATTAAGCATTTGTTGTGGATCTCTGCAGTGGCTTAGAGATACAAACAAAAAAATTGAAGATGATTTAATTAAAAATATAGATGATCTTAAGTATCAGATAGCAAATACTTCGGGTTCTGATGATTGGCTTCAAGACGAATATAATAAAATACAGAAAAACCAGTTATTGTTAAAATTACAAAACAAACTCTCCAAAATAAAAAAAATGCAGGAGGAGTTTCAGGAATTGAAAAACAAAGTTGCAAAACAAAATCTACATAGTAGTAAAATTGATAGCTTTTTCATAAAGCAGAATAAAGATGAAAAGTCTGAGGCAACAGATGAGACAGAGAATGTGGTTGATGATGATTTGATTGTGGAAATTGAAAATAACAAAGATGATTCTGATGATGAGTCGCCTTTGGAAGATGTAGAAGACAGTTTTACAAAGATTTACATAAGCAGTAGGACACATAGTCAGCTATCCCAATTTATTGGTGAAATTAAAAGGACTATTTTTAATAAGGGTACAAAAGTAGTAACACTCGCATCTCGACAACATTATTGTATAAACTCTGAAGTGAATAGATTGAAAAATGTTAGTCTCATTAATGAAAGGTGTTTAGAAATGCAAAAATCTAAAACTAAATCCACAGCCAAAGACGAAGAGGGCAAAGTTGTGAAGAAAAGTAAGACCAAGGCCTGTTCGGCATGTCCATACTACAATCAGAATAATATAAAAAAATTAAAGGAAAAATTGCTTGTTGAAGTTATGGATATGGAAGAAATTGTGAAAAGTGGAAAACAAATGAAAGCATGCCCTTATTATGCTTCAAGAATGGCTTTAGAAGATGCAGAGGTTGCTTTGATCAGTCATGCGGGTATTGTAAGTCATGGCGCCAGAACGGGTGTATCAATAAAATTGCAAAATAATATATTGATAATGGACGAAGCTCATGGGCTATCAGCGGCTTTGGAAAATGCACATTCTGCACCAGTTTCACATAAACAATTGTCATGTGTTAAAACCTTCTTGGAGTTTTACATCAATAAATACAGATCATTATTGAGCAGCAGGAATTTGTTGCAGCTGAATCAAATTAATTTTGTTGTATACATTGTTTTAATTGTTTTAGGTATGGTATGTCCAAAATTAAATGACAAAGATGGTTCAAAAATATATACATTAGAAGATTTTGTCATAGAGGCTGAAATAGATCACCTTCATTTACACCCTCTTGTTGAGTTTTGTAGAAATGTCAGATTGGCACCGAAATTGCATGGTTTTTGTATGAGGTACAGTCAACAAGCTCTAGCGGAAGAACATAAAAAGGATACCAATAAAAAAGGTTCCTTTAAGGATTTTTTAAGCAACATATCTAAAAAGAGAACACAAAATGATAGTTTGAAATGTGCTCCTTTAGACGTCCCACCGACTGAGATATCTGCGGGCAGTAGTTTATATGCAGTATTAGATTTCTTAGAGAGGCTTTGTGATCGCAGTGAAAACGGTCGGGTCTTAACACAGAGTGATTCCGGCCTTCTGAAATATCTACTATTAAATCCCGCTGAACATTTTGCGGATGTAGTTAGCCAGTGTCGATCAGTAATTTTAGCGGGTGGTACAATGGAGCCTATAAGTGAATTCCAAGAACTTCTTGCTTCTGATAAAACACAATTGGATAGAGTGAATGTTGTAAAGTGCGGACATGTTGTGCCTGCTGATAACGTGTTAGGAATATGTCTTTCAAAGGGTCCATCTAAGCTTAATTTAAATTTCTCATATGAAAATAGATCCTCATTTGAGTTGCTAAATGAAATCGGTCGCATTTTGAGAAATCTTTGTAATGTCGTGCCTGACGGCGTCGTATGTTTTTTGCCATCTTATTCATT
TGAACAAACTTTGTATGAACATTTAAAGAGCACGGGTGTGTTAGAATCTGTAAGCAAAAAGAAGATTATATTTAGGGAGCCTAAGTCAGCCTCGGAAGTTGAACAGGTGTTACAAAAATATTCTGCAGCTGTTAAAAGTAAAGTAGGCGATATCAACGGCGCGCTCATGTTGAGTGTAGTTGGAGGGAAATTAAGTGAGGGATTAAATTTCAGCGATGAATTAGGTCGATGTGTGTTGGTAGTTGGAATGCCGTATCCAAATATAAAATCCTTGGAGCTTCAAGAGAAGATGAAGTATTTGAATAAATCTACTCCTGGTTCAGGAAGCATATATTATGAAAACCTTTGTATGAAAGCAGTGAATCAGTGCATTGGCAGAGCCGTTAGACATGCAAATGATTACGCCTGTGTCATTTTAGTGGATGAACGGTATTCTAGATCTCAAACAATATCAGCATTACCATCCTTTGTTCAGAAATCTTTAATTTCGAATTGCGTATTCGGACAAGCAATGGGCAGTATAGCGAAGTTCTTTGCTAGACACAAGAAGAAAAATAATGATAACAAATGA

Protein sequence:

>DPOGS208980-PA
MNSHFSFPFEPYDIQVKFMTDLYSTIENKKLGIFESPTGTGKSLSICCGSLQWLRDTNKKIEDDLIKNIDDLKYQIANTSGSDDWLQDEYNKIQKNQLLLKLQNKLSKIKKMQEEFQELKNKVAKQNLHSSKIDSFFIKQNKDEKSEATDETENVVDDDLIVEIENNKDDSDDESPLEDVEDSFTKIYISSRTHSQLSQFIGEIKRTIFNKGTKVVTLASRQHYCINSEVNRLKNVSLINERCLEMQKSKTKSTAKDEEGKVVKKSKTKACSACPYYNQNNIKKLKEKLLVEVMDMEEIVKSGKQMKACPYYASRMALEDAEVALISHAGIVSHGARTGVSIKLQNNILIMDEAHGLSAALENAHSAPVSHKQLSCVKTFLEFYINKYRSLLSSRNLLQLNQINFVVYIVLIVLGMVCPKLNDKDGSKIYTLEDFVIEAEIDHLHLHPLVEFCRNVRLAPKLHGFCMRYSQQALAEEHKKDTNKKGSFKDFLSNISKKRTQNDSLKCAPLDVPPTEISAGSSLYAVLDFLERLCDRSENGRVLTQSDSGLLKYLLLNPAEHFADVVSQCRSVILAGGTMEPISEFQELLASDKTQLDRVNVVKCGHVVPADNVLGICLSKGPSKLNLNFSYENRSSFELLNEIGRILRNLCNVVPDGVVCFLPSYSFEQTLYEHLKSTGVLESVSKKKIIFREPKSASEVEQVLQKYSAAVKSKVGDINGALMLSVVGGKLSEGLNFSDELGRCVLVVGMPYPNIKSLELQEKMKYLNKSTPGSGSIYYENLCMKAVNQCIGRAVRHANDYACVILVDERYSRSQTISALPSFVQKSLISNCVFGQAMGSIAKFFARHKKKNNDNK-
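
The two sequences above are consistent with each other: 2571 bp is 857 codons, that is 856 amino acids plus one stop codon, which this record writes as a trailing "-". The following is a minimal sketch of how that relationship could be checked, assuming the transcript is a plain coding sequence read in frame from its first base and that the standard genetic code (NCBI translation table 1) applies. The names CODON_TABLE and translate are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, codon by codon.

    Stop codons are written as stop_symbol, matching the trailing "-"
    used in the protein sequence of this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 bases and last 15 bases of DPOGS208980-TA, copied from the record.
first_codons = "ATGAATTCACATTTTTCTTTCCCTTTTGAA"
last_codons = "AATGATAACAAATGA"

print(translate(first_codons))  # start of the protein
print(translate(last_codons))   # end of the protein, including the stop

# Length bookkeeping: 856 residues plus one stop codon, three bases each.
print(3 * (856 + 1))
```

Running the same function over the full nucleotide sequence, with the line break removed, would be expected to reproduce the full protein sequence if the assumptions above hold.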