Monarch geneset OGS2.0

DPOGS200570
Transcript: DPOGS200570-TA; 3534 bp
Protein: DPOGS200570-PA; 1177 aa
Genomic position: DPSCF300119 + 462796-475263
RNAseq coverage: 82x (Rank: top 64%)
Annotation
Heliconius: HMEL005491; 3e-88; 54.86%
Bombyx: BGIBMGA009359-TA; 0.0; 59.44%
Drosophila: tam-PA; 0.0; 44.56%
EBI UniRef50: UniRef50_E2A680; 0.0; 46.04%; DNA polymerase subunit gamma-1, mitochondrial n=4 Tax=Formicidae RepID=E2A680_CAMFO
NCBI RefSeq: XP_395230.3; 0.0; 42.56%; PREDICTED: similar to tamas CG8987-PA [Apis mellifera]
NCBI nr blastp: gi|328789753; 0.0; 42.56%; PREDICTED: DNA polymerase subunit gamma-1, mitochondrial [Apis mellifera]
NCBI nr blastx: gi|328789753; 0.0; 41.60%; PREDICTED: DNA polymerase subunit gamma-1, mitochondrial [Apis mellifera]
Group
Gene Ontology:
GO:0005760; 0; gamma DNA polymerase complex
GO:0003887; 6.5e-52; DNA-directed DNA polymerase activity
GO:0003677; 6.5e-52; DNA binding
GO:0006260; 6.5e-52; DNA replication
GO:0003676; 8.2e-07; nucleic acid binding
KEGG pathway 
InterPro domain:
[48-1167]; IPR002297; 0; DNA-directed DNA-polymerase, family A, mitochondria
[741-1084]; IPR001098; 6.5e-52; DNA-directed DNA polymerase, family A, palm domain
[166-362]; IPR012337; 8.2e-07; Ribonuclease H-like
Orthology group: MCL10155; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200570-TA
ATGAGGAATATTAAAAAACTGTGCCACATTTATCGTCGGGGTTACAGCAATGATATATTGCCATCAAGTGAATCCGTGATAATAAGACAGAATAAACCTGAAAAGGATATTGATAACCTCGAAACACCTGTCGAAATTCATCCTGAAAATGATCCTAAAGAATTCAGAGTCAATGATGTGAATATACAAATGATATCAAAAAATATTTACGACCAGCTCTTTCGGACTCCGCAACCCACTTTAGATGCTGGTTTGATTAAAAGTTGCCTAAACCATTTAGAAAAGCATGGCATAGATATCAAAAAGAGTACATACTTACCAGATGTACAACTTAAAATACCAAAGTTACAAGGAAAGGATATTGAGGAACATTTTTTTAATATCGGAGAGACACAGTGTGCCCCATACAGAACCCTGTTACAAAAACTATCAACAAACAACTTACCTAAATTACCAAAGAGCGGATGCATTGTCGAAGATAACTCCATAATCAATGTAACACTGATTCAGATATATAAAAATACGTATGGCTGGGTCAGTGAACCTCTGGCAAATGATAACAAACATGAACATCACAACTCTGTGAGGTACGAAGACCTGATACCTTTAGAAACCGATGGCTTTGAACCAGTAGGTGATATAACGAGACCGAGGATGGTGGTTGGTCACAATGTATCATATGACAGGTCCAAAATTAAAGAACAATATTGGTTGAACAAGACTGGTGTACGTTTCATGGACACAATGTCAATGCACATATGTGTGTCAGGAGTAACCAGCTATCAGCGAACAGTTCTCAAAGACAAAAACAAAGAACCTCATCCTCATGATGACGAATGGAGAGAAGTCAGCTCCCTCAATGGACTCTCTGATGTGCATAGATTATACTGCGGTGTGGCCATAGACAAGCAGACTCGAGATGTATTCGTAGAGGGCTGCCTAGAGGATGTCCAGAACAACTTCCAAGACCTCATGAAGTACTGTGCCGGTGACGTCATCGCTACCCACAACGTACTCAGAGCGCTACTACCCTTGTTCCTAGAAAGATTTCCGCATCCTGTCACATTCGCGGGAATGTTAGAATTGGGTTCTGCTTATCTTCCTGTCAATTCGAATTGGTCACAGTACCTAGATTCAGCCGATACAGTTTTTGAGGATTTGAAATTGGAATCCCAACAGATACTCTCATCTAAAGCGGATGAAGCGTGTCGAATGATGGAGAACGAAGGTTACAAGGAGGATCCATGGATGTGGGATCAGGACTGGTCGGTACAGAAACTAAAATTAAAAAAAAATGCGACCAAGAAAAAAGATACTATAGAGTCAAAAAAACCAGCCAGTGAAGATGTAAATAGAATTAGTTCTAAATTTGAATTATTAAGTGACGAATATGTTGATAGTCTAAAGACCAATGAGAAAGTTACACACGATTTTGAGAATTTGAACAAAAAGTTTAAGTATTTATATGATTTGGGTGATTTATTGCCTGTAAAAAGACCTTTTTTGGCCGGATATCCAGCGTGGTATAGGAAATTGTGCACTAAACCAGGAAAGGACCCCGACTGGAGGGCCGGCGCTAATAATATTACTACCAGTATGCAGATAACTCCAAAGCTCCTGCGTCTCTCATGGGAAGGCTATCCTCTACATCACCTGCAGTCCGAAGGTTGGGGATTTCTAGTTCCATACAGTAGACATATAGCTGAAGACGCCAAGGAACCTCTGGTGCCGTTGGAGAAGATTCTGGAAATTTGTCCATTAATGACCTCTAAAGCTGATTACGTTGAGAGCGAACTACATATGCTGCCTCAGAATGTGGAGGAAGATTTAAGCAGGAGGGCGTATTATGCTCGTAAGAAGAAGGAAGAACAGGCGGTGGCCAATCAATACCATGGACTCGGGGTTTGGTGTGGAGTACAGATACAGGGTTGCTGTCATTTCCTGCGTCTCCCTCACAAAGATGGTCCGAAATATAAAGTAGGGAATCCGCTGGCGAGGGA
TTTCCTAAACATGTTCAGTCAGAACGTACTCTCTGCTCAAGGGAACGAGGCTGAGAAGGTTAGTTGGGAGTTGTATTACAACGCTACCACAGTTAGAGGTTTCCGATTGATACCACAAAAAACTACTCTTGATACATTATTACAATGTTCTCGTATCGTAAATCTTTTCGCGTCGGAACCAACTTGGATGACCGCTAGTAATGCTCAGAGCGACCGCGTGGGCTCCGAACTACGGTCTATGGTCACCGCTCCTCCAGGGTACCGCTTCGTTGGTGCTGATGTGGACTCTCAGGAGTTGTGGATAGCCGCTCTATTGGGTGACAGTTCGTTGATGTGTGGGGGGAGCGCGTTCGGGTGGGCGGTGCTAGCGGGGGACAAACGGACTAACACCGACCTACACTCACTGACAGCAGTCGCTGCCGGAGTGAGGAGGGATCACGCTAAAGTCATCAATTACGCCAGGATATATGGTGCCGGGCAGAATTTCGCTGAGAGATTGCTGAAACAGTTCAACCCGACTATGACGATATCTGAGGCGAAGAGCAAAGCCGCCAAAATGTTTACTACAACCAAAGGAAAAAGGGTTTACAGGCTGAAGGAAAAATTTATGGAGGGTTTCATGGACGAGGAGTCAGGAGATCAGGCGTACCGTCCCCAGACTGTAGAGATGACTTCGTACCAAGCGATGCGTCTGGCTAAACTGAGCGGCCGAACTACAGATGAGATGTTCGAGCGACCTCGGTGGGAGGGGGGAACAGAGTCGCACATGTTCAACAAACTAGAGGAAATAGCAGACGTATGTAACTATGGCCATGATAGGAACGCGTCCGGACAAAAACAAAATATTTGCATTGAAGAGTCTGAGACCATGCCCATTACACCGATCCGTCACATCGCCGTCACGTGGCCTATATGGAGCGTTAAAAATCGTCGGCGAAAATGTTTTGTTGGTGACGACGCGCTCCATGCACGACACGCGACGTTATCACAGCGATACGAGGTGGAGACGAGACGAAAATTGATTCTATCTGTAGAGTCATCGTGTCCTCGTACAAGTTTCCTCGAGGGGCGTCTGTCTCGTTCTTTGGAGGCGGCACATAACGGAGGCGGCACTAAACTGAACTGGGTGGTCCAGAGCGCTGCGGCTGACTTCCTACATCTCATGCTGGTTAGCATGACACATCTGCAGCCTTCAGCCAGATTTTGTCTCAGTTTCCACGACGAGGTCAGATACTTAGTTCCCGAGGAACATAAATACGAGGCAGCCTTGGCATTACAAATAACAAACCTTCTCACTAGAGCTTTCTGTTCACAGAGGGTTGGGATACACGATCTACCGATGTCAGTAGCTTTCTTCTCATCTGTGGAAGTCGATCGAGTTTTAAGAAAGGAAGCCAATATGGACTGTGTGACCCCTTCAAACCCTCATGGACTGGAAAAAGGCTATGGTATACCTAATGGAGAGTCATTGACGATATATGATGTCTTAGAGAAAACCGATTGTAACAGATTTTGTACCAAAGAAACGAAATAA

Protein sequence:

>DPOGS200570-PA
MRNIKKLCHIYRRGYSNDILPSSESVIIRQNKPEKDIDNLETPVEIHPENDPKEFRVNDVNIQMISKNIYDQLFRTPQPTLDAGLIKSCLNHLEKHGIDIKKSTYLPDVQLKIPKLQGKDIEEHFFNIGETQCAPYRTLLQKLSTNNLPKLPKSGCIVEDNSIINVTLIQIYKNTYGWVSEPLANDNKHEHHNSVRYEDLIPLETDGFEPVGDITRPRMVVGHNVSYDRSKIKEQYWLNKTGVRFMDTMSMHICVSGVTSYQRTVLKDKNKEPHPHDDEWREVSSLNGLSDVHRLYCGVAIDKQTRDVFVEGCLEDVQNNFQDLMKYCAGDVIATHNVLRALLPLFLERFPHPVTFAGMLELGSAYLPVNSNWSQYLDSADTVFEDLKLESQQILSSKADEACRMMENEGYKEDPWMWDQDWSVQKLKLKKNATKKKDTIESKKPASEDVNRISSKFELLSDEYVDSLKTNEKVTHDFENLNKKFKYLYDLGDLLPVKRPFLAGYPAWYRKLCTKPGKDPDWRAGANNITTSMQITPKLLRLSWEGYPLHHLQSEGWGFLVPYSRHIAEDAKEPLVPLEKILEICPLMTSKADYVESELHMLPQNVEEDLSRRAYYARKKKEEQAVANQYHGLGVWCGVQIQGCCHFLRLPHKDGPKYKVGNPLARDFLNMFSQNVLSAQGNEAEKVSWELYYNATTVRGFRLIPQKTTLDTLLQCSRIVNLFASEPTWMTASNAQSDRVGSELRSMVTAPPGYRFVGADVDSQELWIAALLGDSSLMCGGSAFGWAVLAGDKRTNTDLHSLTAVAAGVRRDHAKVINYARIYGAGQNFAERLLKQFNPTMTISEAKSKAAKMFTTTKGKRVYRLKEKFMEGFMDEESGDQAYRPQTVEMTSYQAMRLAKLSGRTTDEMFERPRWEGGTESHMFNKLEEIADVCNYGHDRNASGQKQNICIEESETMPITPIRHIAVTWPIWSVKNRRRKCFVGDDALHARHATLSQRYEVETRRKLILSVESSCPRTSFLEGRLSRSLEAAHNGGGTKLNWVVQSAAADFLHLMLVSMTHLQPSARFCLSFHDEVRYLVPEEHKYEAALALQITNLLTRAFCSQRVGIHDLPMSVAFFSSVEVDRVLRKEANMDCVTPSNPHGLEKGYGIPNGESLTIYDVLEKTDCNRFCTKETK-
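
Consistency check:

The lengths in this record agree with each other: 3534 bp is 1178 codons, which is 1177 amino acids plus one stop codon, written as "-" at the end of the protein sequence. The sketch below shows one way to reproduce the protein from the nucleotide sequence. It assumes the standard genetic code and that the transcript as listed is coding sequence only; the record does not state either. The names `CODON_TABLE` and `translate` are hypothetical helpers for illustration, not part of the database.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence; stops are written as stop_symbol.

    Whitespace (including line breaks inside the sequence) is removed
    first. Codons not in the table (e.g. containing N) become "X".
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


if __name__ == "__main__":
    # First eight codons and last four codons of DPOGS200570-TA.
    print(translate("ATGAGGAATATTAAAAAACTGTGC"))  # MRNIKKLC
    print(translate("GAAACGAAATAA"))              # ETK-
    # Length relationship stated in the record header.
    print(3534 == 3 * (1177 + 1))                 # True
```

Passing the full DPOGS200570-TA sequence to `translate` and comparing the result with DPOGS200570-PA would extend the same check to the whole record.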