Monarch geneset OGS2.0

DPOGS212434
TranscriptDPOGS212434-TA2358 bp
ProteinDPOGS212434-PA785 aa
Genomic positionDPSCF300258 + 124936-128051
RNAseq coverage90x (Rank: top 63%)
Annotation
HeliconiusHMEL0123558e-10346.60% 
BombyxBGIBMGA002892-TA0.056.06% 
DrosophilaDNApol-eta-PA8e-12250.90% 
EBI UniRef50UniRef50_Q17CG94e-15340.16%DNA polymerase eta n=2 Tax=Aedes aegypti RepID=Q17CG9_AEDAE
NCBI RefSeqXP_001649408.17e-15440.16%DNA polymerase eta [Aedes aegypti]
NCBI nr blastpgi|1571066211e-15240.16%DNA polymerase eta [Aedes aegypti]
NCBI nr blastxgi|1571066213e-14940.02%DNA polymerase eta [Aedes aegypti]
Group
Gene OntologyGO:00038876.7e-50DNA-directed DNA polymerase activity
GO:00062816.7e-50DNA repair
GO:00036846.7e-50damaged DNA binding
KEGG pathway 
InterPro domain[2-786] IPR0170611.3e-159DNA polymerase eta
[11-215] IPR0011266.7e-50DNA-repair protein, UmuC-like
[302-431] IPR0179611.3e-17DNA polymerase, Y-family, little finger domain
Orthology groupMCL14358 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212434-TA
ATGGATCAGGAGAACAGGATTGTTGTACTGATAGATATGGACTGTTTTTATTGTCAAGTAGAAGAAAAATTAAATCCACAATTGAAAGGCAAACCAATTGCTGTCGTGCAATATAATCCCTGGAGAGGAGGAGGAATTATAGCCGTGAACTATGTTGCTCGAGCCATGGGAGTAACCAGGCACATGAGAGGTAATGAAGCTAAGCAAAAGTGTCCAGAAATACAACTACCATCGGTGCCATGTTTCAGAGGGAAGGCTGATATAACCAAGTACAGGGAAGCGGGCAAAGATGTTGCTAAGGTCCTACAAAGGTTTACACCCTTATTGGAAAGAGCTTCGATTGATGAGGCATATTTAGATATCACAGACCCGGTGCGGAAGAGAATTCTAAACATTGATGTCAGGGACATAAATTCTAACATGCTACCAAATAATTTTGCCCTCGGTTATGATACCTTAGATTCCTTCATATCTGATGTACATAGCTGTGGCCTGTCGTCTATGGAGTTTGATTATGAACACTCAAAACATCTTCTTGTCGGTGCTCTCATAGTTAGCGAGATAAGGGCTGCGGTATACGCTGAAACTGGCTACCAATGTTCAGCCGGGATAGCTCATAATAAAATCTTGGCAAAGCTCGTGTGTGGTATGAACAAGCCCAACAAACAGACAGTGTTACCAAAACATTCTGTTAACATTCTATACAAGACATTGTCACTCAAAAAAGTAAAGCACTTGGGTGGGAAGTTTGGGGATCACGTCGCTGAAACTCTTAATATTAGTACGATGGGACAACTACAGAGATTCACGGAAAAGGATCTTCAGGCGAGATTTGATGAAAAGAACGGTTCCTGGTTGTACAATATTGCCCGCGGCGTTGACTTGGAACCAGTCCAAGCTAGATTTAACCCTAAAAGTATCGGTTGTTGCAAACAGCTGAGAGGCAAAGCGGCTCTGCAGGATTTAGTCAGCCTCAGGAAGTGGCTTCGAGATCTAGGCGATGAAATCGAGAACCGATTGGAACAGGACTCATTAGAAAATAATCGGATCCCGAAACAAATGGTTGTTAGTTTTTCTTTACAAGCTTCCAAAGGGAAGAGAGATATAAGTAGTTCAAGGTCTTACAATTTCAGCCCCGAAGATGAATTATGTGGAGAAATATTTTCGAGTAAGGCCTTGGAGCTAGTGATGGACAGTGCCGAAGGCTGCAAACCGACAGATGGCGAACTCAACAGGATGTTGAAATCACCGATAACGTTTTTAGGCATAAGTGTTGGGAAATTTGATGATAATATTGATGCGAAGAAAACGAAAAAGATCAAAGATTATTTCAGTGCCGGGTCGTCTAAGGATGTGTCACAAACCGATGAAAGCGTTAGGATAAAACTTGAGAGATGTGTTGAGAAGGATGGAACCAACGCTGGCAAAGAATACGTTTTAGAAAAATACTTTGAATCATCTGATGACGTTAGAAAAGAAAATATTACGAGTAAAATAAGTACAGAAACAGAACAACGTAAAGAGACGGTATACCAGTCAAGTTTGGACAGACAGGAGTCATTTTTTGCAAAATATTTAAACAGTGGAAGATCAAATGTTGCGAATGACAGAACGCCCTGTACACGCGCGGCCTGCGGACAAACATTGCACCTTAGTAACGCTGAAGCCAGTAACGACACGGACTATTCAGGTTCAACAATAAATGAGGAAATTAACAGGAGTATAGCTCTGTTTGAAGATGATCCAGATGATGTAACGCGAGTTGTTAATATGAGGCAGCTGTTGAAGACATCGGAAGCGAAGTTAGAAGATGTGGAGGATGGAGACAGAGCTCAGACAGGAACAGCGCCGATAGAACCTGAACGAAATAAATCTCCCGATATAAACAGCGTTGAATGCTCTGAATGTGGCAAGACAGTGTCTTTGGACAAATTCGATGAACACTCAGATTATCATTTAGCACTTAAATTAAGAGAGGAAATGAGACAGGAGGTCAGAAGAGAACAGAACAGAACAAAATCTGTTCTGTCAGAAACTAATAAGAATTCTCCAAATAAAAAAGAAACACCAGAAAAACAATGCAACAGGAGTGACAATGTACCTTCAATAGTCAATTTTTTTACAAAATTCGACAGATCCATTGAAACAAAATTATGTGCGGAATGTGGAAAGAAAGTCCCCATTAACAAACTCCCTGAACATCTAGATTTCCACGAAGCTCAGAAATTGAGCAGAGAAATAAACAACCGGTCAAGTGTAGTGAATGTAACGAGTGCTAAAAGAAAAAGAAAGTCGTCATCTCCAGTAAAAAAAAACAAAGTGCCTTGTAAGTCAATAGATCTGTTCTTTAGACAATAG

Protein sequence:

>DPOGS212434-PA
MDQENRIVVLIDMDCFYCQVEEKLNPQLKGKPIAVVQYNPWRGGGIIAVNYVARAMGVTRHMRGNEAKQKCPEIQLPSVPCFRGKADITKYREAGKDVAKVLQRFTPLLERASIDEAYLDITDPVRKRILNIDVRDINSNMLPNNFALGYDTLDSFISDVHSCGLSSMEFDYEHSKHLLVGALIVSEIRAAVYAETGYQCSAGIAHNKILAKLVCGMNKPNKQTVLPKHSVNILYKTLSLKKVKHLGGKFGDHVAETLNISTMGQLQRFTEKDLQARFDEKNGSWLYNIARGVDLEPVQARFNPKSIGCCKQLRGKAALQDLVSLRKWLRDLGDEIENRLEQDSLENNRIPKQMVVSFSLQASKGKRDISSSRSYNFSPEDELCGEIFSSKALELVMDSAEGCKPTDGELNRMLKSPITFLGISVGKFDDNIDAKKTKKIKDYFSAGSSKDVSQTDESVRIKLERCVEKDGTNAGKEYVLEKYFESSDDVRKENITSKISTETEQRKETVYQSSLDRQESFFAKYLNSGRSNVANDRTPCTRAACGQTLHLSNAEASNDTDYSGSTINEEINRSIALFEDDPDDVTRVVNMRQLLKTSEAKLEDVEDGDRAQTGTAPIEPERNKSPDINSVECSECGKTVSLDKFDEHSDYHLALKLREEMRQEVRREQNRTKSVLSETNKNSPNKKETPEKQCNRSDNVPSIVNFFTKFDRSIETKLCAECGKKVPINKLPEHLDFHEAQKLSREINNRSSVVNVTSAKRKRKSSSPVKKNKVPCKSIDLFFRQ-