Monarch geneset OGS2.0

DPOGS207561
TranscriptDPOGS207561-TA1332 bp
ProteinDPOGS207561-PA443 aa
Genomic positionDPSCF300072 - 781606-784703
RNAseq coverage615x (Rank: top 21%)
Annotation
HeliconiusHMEL0171435e-13380.59% 
BombyxBGIBMGA004711-TA0.091.67% 
Drosophilapont-PA0.076.70% 
EBI UniRef50UniRef50_Q9Y2650.076.97%RuvB-like 1 n=198 Tax=Eukaryota RepID=RUVB1_HUMAN
NCBI RefSeqXP_971596.10.081.54%PREDICTED: similar to pontin [Tribolium castaneum]
NCBI nr blastpgi|910898730.081.54%PREDICTED: similar to pontin [Tribolium castaneum]
NCBI nr blastxgi|3228001560.083.99%hypothetical protein SINV_01535 [Solenopsis invicta]
Group
Gene OntologyGO:00055249.6e-174ATP binding
GO:00036789.6e-174DNA helicase activity
KEGG pathwaytca:6602540.0 
 K04499 (RUVBL1, RVB1, INO80H)maps-> Wnt signaling pathway
InterPro domain[14-400] IPR0103399.6e-174TIP49, C-terminal
[112-198] IPR0160271.3e-26Nucleic acid-binding, OB-fold-like
Orthology groupMCL14974 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207561-TA
ATGAAAATTGAAGAAGTTAAAAGCACTGCAAAAACGCAAAGAATATCTGCACACACTCATATAAAAGGTCTAGGTCTGGATGAAAATGGAGTACCAATTCAAATGGCAGCCGGTCTCGTGGGCCAAGAATCAGCGCGAGAGGCTGCAGGGATAGTAGTGGATATGATAAGGAGCAAGAAAATGGCGGGACGTGCGTTATTACTCGCTATAGCACAGGAACTTGGTAACAAAGTACCATTTTGTCCGATGGTAGGAAGTGAAGTATACAGTACTGAGATCAAAAAGACAGAAGTATTAATGGAAAACTTTCGTAGAGCTATTGGCCTACGAATCAGAGAAACAAAAGAGGTGTATGAGGGGGAAGTTACTGAACTGACTCCTGTAGAGACTGAAAATCCTGCTGGCGGTTATGGCAAAACTGTTTCCCATGTGATTATTGGACTTAAGACAGCAAAAGGTACAAAAAACTTGAAGCTTGACCCTACAATATATGAGTCACTACAAAAGGAGAAGGTGGAAGTTGGGGATGTTATTTATATAGAGGCAAATTCCGGAGCTGTGAAAAGACAGGGAAGAAGTGATACTTTTGCTACCGAATTTGATCTAGAGGCTGAAGAGTATGTTCCTTTACCAAAGGGTGATGTGCATAAAAAGAAGGAAGTTGTCCAGGATGTGACTCTTCATGACTTAGATTGTGCAAATGCTAGGCCACAGGGAGGCCATGATATTATGTCTATGATGGGACAACTGATGAAGCCCAAGAAAACTGAAATTACTGATAAGCTTAGGAAAGAAATAAACAAAGTAGTAAATAAATATATTGACCAAGGTATAGCAGAATTAGTGCCTGGGGTTTTATTTATAGATGAGGTTCATATGCTAGATATAGAAACATTTACCTACCTTCACCGTGCCTTAGAATCAGCAATTGCTCCCATAGTTATATTTGCTACCAACAGAGGTCGTTGTCAAATAAGAGGAACTGAAGATGTAATCTCTCCACATGGTATTCCATTGGATCTCTTAGATAGACTGTTGATTATCCGCACCCTCCCATACAATAAATCTGAACTTTTACAGATATTAAAGCTCCGTGCTAATACAGAGGGTATATCTATAGATGACGAGGCTTTGACAGCTCTCTCTGAAGTTGGTGCCAACAGCACTCTCAGGTATGCTGCCCAACTACTGACACCTTCATGGCTGGCGGCCCGTGCAGAGGGCGCCACACGTATCGCACCTTCCCACGTCCGTTCAGTGCATGCATTGTTCCTCGACGCCAAGTCTTCCGCACGCATTCTCACACAACACTCAGACAAATATATGAAGTAA

Protein sequence:

>DPOGS207561-PA
MKIEEVKSTAKTQRISAHTHIKGLGLDENGVPIQMAAGLVGQESAREAAGIVVDMIRSKKMAGRALLLAIAQELGNKVPFCPMVGSEVYSTEIKKTEVLMENFRRAIGLRIRETKEVYEGEVTELTPVETENPAGGYGKTVSHVIIGLKTAKGTKNLKLDPTIYESLQKEKVEVGDVIYIEANSGAVKRQGRSDTFATEFDLEAEEYVPLPKGDVHKKKEVVQDVTLHDLDCANARPQGGHDIMSMMGQLMKPKKTEITDKLRKEINKVVNKYIDQGIAELVPGVLFIDEVHMLDIETFTYLHRALESAIAPIVIFATNRGRCQIRGTEDVISPHGIPLDLLDRLLIIRTLPYNKSELLQILKLRANTEGISIDDEALTALSEVGANSTLRYAAQLLTPSWLAARAEGATRIAPSHVRSVHALFLDAKSSARILTQHSDKYMK-