Monarch geneset OGS2.0

DPOGS202630
Transcript        DPOGS202630-TA  2964 bp
Protein           DPOGS202630-PA  987 aa
Genomic position  DPSCF300371 - 9867-15768
RNAseq coverage   15x (Rank: top 81%)
Annotation
Heliconius      HMEL010220        0.0     62.22%
Bombyx          BGIBMGA008315-TA  0.0     63.91%
Drosophila      Rev1-PA           0.0     46.89%
EBI UniRef50    UniRef50_Q9W0P2   2e-180  46.89%  DREV1 n=8 Tax=Drosophila RepID=Q9W0P2_DROME
NCBI RefSeq     XP_001849153.1    0.0     41.98%  terminal deoxycytidyl transferase rev1 [Culex quinquefasciatus]
NCBI nr blastp  gi|170042914      0.0     41.98%  terminal deoxycytidyl transferase rev1 [Culex quinquefasciatus]
NCBI nr blastx  gi|170042914      1e-175  42.13%  terminal deoxycytidyl transferase rev1 [Culex quinquefasciatus]
Group
Gene Ontology  GO:0000287  1.7e-168  magnesium ion binding
               GO:0006281  1.7e-168  DNA repair
               GO:0003684  1.7e-168  damaged DNA binding
               GO:0016779  1.7e-168  nucleotidyltransferase activity
               GO:0003887  1.8e-38   DNA-directed DNA polymerase activity
               GO:0005622  1.9e-19   intracellular
KEGG pathway 
InterPro domain  [1-948]    IPR012112  1.7e-168  DNA repair protein, Rev1
                 [296-497]  IPR001126  1.8e-38   DNA-repair protein, UmuC-like
                 [554-682]  IPR017961  1.7e-19   DNA polymerase, Y-family, little finger domain
                 [11-125]   IPR001357  1.9e-19   BRCT
Orthology group  MCL13797  Single-copy universal gene

Nucleotide sequence:

>DPOGS202630-TA
ATGAGTAGAAGAGATGAAAGATTCCCTGACAATGGTTTTGAAGCTTGGGGTGGGTATATGCATGCTAAAATTGCGAAACTTGAAGAGCAATTTACAGCAGGAATAGGAAAAGAAAATAAATTGACAGATATATTTAAAGGTGTCAGTATTTATGTTAATGGTTTCACGGTGCCTTCTGCAGACGAATTGAAAGAATTAATGGCTATACATGGTGGAGTGTATCACACTTATCAAAGAAGTAATGACTTCATTATAGCTTCTAATTTACCAGACACAAAAGTAAAAAAAATGTGCTTGGCAAAAGTAGTTAAGCCAGATTGGATCACTGATAGTATAAAGGCTAAAAAATTGTTAAACTATAAAGAATATTTACTCTATCGAAATTCAAGAACACAGTCTACAATAAAATTTAATCAAGAAAATAATAAAAGTCAAATTGGTAATGTTGGTGATACAAAAAAAATGCTAGAAGAGACCAGTTTATTAAAAGTATGCCAGATATCTTCGAAGGCAGAAGATGTGGTTTCAAAAAATTCATATACTGATACAAATTCAGCTGAATGTTCAAATCTTCAGTCTGCTGATTCCAAAAGTAGAGATAACTCTTTAAAAACTTTTAACTGTCAACCCAGTGACTCAGTTAATAAAGATAAGTATGCCAAAACTGCAGCCGACCCAAATTTTATCTCAGAATTTTACAACAACTCAAGATTACATCATATTTCTCAACTCGGTGCATGTTTTAAACAACATGTCAATGATTTGAGAGAAACTAGTGACTTTAGTTTTCCTGCAAGAGAAAGTTTAAAACACAAAATTTTAGCTTTAAATAATGAAAACAGATATTCAAATGTTTGTTTTGAGAATGGTAAGACTATTATGCATATTGACATGGATTGCTTTTTTGTGTCTGTTGGTTTGAGAAATAGACCAGAATTGAGAGGAAAACCGGTAGCTGTCACACATTCAAAAGGAGGCCAGCCTGGAAGTAAACGGTCTGGCAATGATGCAATTACTGAATCCAACTTATACAAACAAAGGCAAGCCAAAAAAATTGGTATTGCTCTGGATATCAAAGACGATATTGATACAGAATCTGCTGAAAGTGGTTATGAGGAGGAATCATCATATGGGTCTATGAGTGAAATAGCTTCTTGTTCTTATGAAGCAAGGGCTAAGGGCATCAAAAATGGAATGTTTATGGGGAAAGCTCTGAAGCTTTGCCCCGAATTAAGGACAATTCCATATGATTTTGATGGTTACAAAGATGTTGCTTTTAAATTATACAACACTATAGCTAATTACACATTAGATATTGAAGCTGTGTCATGTGATGAAATGTATGTTGATTGTACAGAACTCTTGAAATCAATGAATGTAAGTGTTACTGACTTTGCCACCGCTTTGAGAGAGGAGATAAAAAGTATAACTAAATGTCCCTGCTCAACAGGGTTTGGTGGTAATAGATTGCAAGCTCGCTTGGCTACGAAGAAGGCTAAGCCCAATGGACAATTCTTTCTTACAGCTGATATAGTTAACGATTTTATGTACAATATACAACTTAGTGATCTACCAGGTGTTGGGTATCAGACTTCACATAAATTAGAATCTTTAGGGTACCAGACTTGTGGATCCCTGTTAAGCTTGAGCTTAATAAACCTTCAACAACATTTAGGAAAAAAGACTGGCGCACAATTATATGAACAGATTCGTGGACAAGATTCACACCCTTTATCTTTCCACACAGTGAGAAAATCTGTTTCCGCGGAAGTAAATTATGGTATTCGTTTTGAGAATAATGATCAATGCAAAGAGTTTTTAAAGCAACTTTCTGCTGAGGTCCATTCTCGGATGCAACAATTCAAAGTAATTGGCAAATGTATTACCTTAAAATTGATGGTCAGGGCTGAAAATGCTCCGGTACAAACTGCAAAGTTCATGGGTCATGGCTACTGTGACGTCATAAACAAGTCTACAACATTGCAAAATGCAACTAATGA
TGTTGAAATAATAACAAAGGAAGTTATTTCAATATGTAAGAAACAAAACATAGATCCAAAAGAAATGCGTGGAATAGGAATTCAAGTCACTAAGCTAGAACCAATCAATATTAAGCCAATAAAAGGAGCAATTAACAAATTTTTGACGTCCAAACCCGTCCCTAAATCTGAAAAAAATATTTCAAATGAAAACATTGTTGACATAGGTGTTAAAGTTAAAGTACCTACGACTCCGAAAAAAGTTACAACTTGCACAACCCTACAAAAATCTCCTATTTTAAATATATCTAAATCTCCTAAAGGGAAGAGAAGAGGACGACCACCCAAACATTCTAAACCTCAGATATCTTTTAACCCACTTAGTAGATTTTTTCATTCAAATACTGAAATAACTGTTAAAAGTGAAATAAAAACAGAAGAATTAAGCAAAATCGTCATAAAAGAAGATATATCTAAAGAAGAAAAGCCGAAGCCCCAGGGTTTACTCGGATTACCGTGGGATAAAATAAGAGAATTACTCCGAGCCTGGTTTGAAAGCGGACAAACTCCTAAACATTGTGATATCCAATTAATTGCTGGTTACATGCGAGATATGGAAGACAAGGAAACTGATGCTGGACCATCTCAAGCACAAAAAGCAGAAAAAGAAAATTTACAATCAGACAATACAGACAATAGTTGGAGAAATTGGACACCAGCAGCATTGAAGACCAAGGCGTCTAGTACTCTTAAACGGAAAAATAATCCATCATCATCATCACCATCATCATGGCTTCATCGAAGAAGACAAAGAACCTATCTACAGGAGAAAGTTTTACAAAATAAAATTGGATTATTAGAAATATTGAAACAAAATGCTCATAGAGAAGCTGAATTAAAAACTAAACTGCTCGAGGAACAAATTAAGCAGGAACTAATAAGAACGAAAATTTTAACATTGGAACTACAAAAACTGCAACAGTAA

Protein sequence:

>DPOGS202630-PA
MSRRDERFPDNGFEAWGGYMHAKIAKLEEQFTAGIGKENKLTDIFKGVSIYVNGFTVPSADELKELMAIHGGVYHTYQRSNDFIIASNLPDTKVKKMCLAKVVKPDWITDSIKAKKLLNYKEYLLYRNSRTQSTIKFNQENNKSQIGNVGDTKKMLEETSLLKVCQISSKAEDVVSKNSYTDTNSAECSNLQSADSKSRDNSLKTFNCQPSDSVNKDKYAKTAADPNFISEFYNNSRLHHISQLGACFKQHVNDLRETSDFSFPARESLKHKILALNNENRYSNVCFENGKTIMHIDMDCFFVSVGLRNRPELRGKPVAVTHSKGGQPGSKRSGNDAITESNLYKQRQAKKIGIALDIKDDIDTESAESGYEEESSYGSMSEIASCSYEARAKGIKNGMFMGKALKLCPELRTIPYDFDGYKDVAFKLYNTIANYTLDIEAVSCDEMYVDCTELLKSMNVSVTDFATALREEIKSITKCPCSTGFGGNRLQARLATKKAKPNGQFFLTADIVNDFMYNIQLSDLPGVGYQTSHKLESLGYQTCGSLLSLSLINLQQHLGKKTGAQLYEQIRGQDSHPLSFHTVRKSVSAEVNYGIRFENNDQCKEFLKQLSAEVHSRMQQFKVIGKCITLKLMVRAENAPVQTAKFMGHGYCDVINKSTTLQNATNDVEIITKEVISICKKQNIDPKEMRGIGIQVTKLEPINIKPIKGAINKFLTSKPVPKSEKNISNENIVDIGVKVKVPTTPKKVTTCTTLQKSPILNISKSPKGKRRGRPPKHSKPQISFNPLSRFFHSNTEITVKSEIKTEELSKIVIKEDISKEEKPKPQGLLGLPWDKIRELLRAWFESGQTPKHCDIQLIAGYMRDMEDKETDAGPSQAQKAEKENLQSDNTDNSWRNWTPAALKTKASSTLKRKNNPSSSSPSSWLHRRRQRTYLQEKVLQNKIGLLEILKQNAHREAELKTKLLEEQIKQELIRTKILTLELQKLQQ-
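
The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database. It assumes that the transcript entry holds the coding sequence including its stop codon, and that the genomic coordinates are 1-based and inclusive. The function name `coding_length_bp` and the variable names are hypothetical.

```python
def coding_length_bp(protein_aa: int, include_stop: bool = True) -> int:
    """Return the number of nucleotides needed to encode a protein.

    Each amino acid takes one codon (3 nt); the stop codon adds one more
    codon when include_stop is True.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


# Values copied from the record above.
transcript_bp = 2964          # Transcript DPOGS202630-TA
protein_aa = 987              # Protein DPOGS202630-PA
start, end = 9867, 15768      # Genomic position on DPSCF300371

# Assumption: coordinates are 1-based and inclusive.
span_bp = end - start + 1

print(coding_length_bp(protein_aa) == transcript_bp)  # True: 988 codons * 3 = 2964
print(span_bp)                                        # 5902
```

Under these assumptions the 987 aa protein plus one stop codon accounts for all 2964 bp of the transcript, which matches the transcript ending in TAA and the trailing `-` in the protein sequence. The genomic span (5902 bp) is longer than the transcript, as would be expected if the gene contains introns.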