Monarch geneset OGS2.0

DPOGS214944
TranscriptDPOGS214944-TA6249 bp
ProteinDPOGS214944-PA2082 aa
Genomic positionDPSCF300280 - 74912-93682
RNAseq coverage153x (Rank: top 53%)
Annotation
HeliconiusHMEL0155880.086.25% 
BombyxBGIBMGA004825-TA0.078.87% 
DrosophilaDNApol-epsilon-PA0.067.75% 
EBI UniRef50UniRef50_D6WSX80.071.14%DNA polymerase n=5 Tax=Coelomata RepID=D6WSX8_TRICA
NCBI RefSeqXP_969200.20.071.14%PREDICTED: similar to DNA polymerase epsilon CG6768-PA [Tribolium castaneum]
NCBI nr blastpgi|1892389150.071.14%PREDICTED: similar to DNA polymerase epsilon CG6768-PA [Tribolium castaneum]
NCBI nr blastxgi|2700102240.070.97%hypothetical protein TcasGA2_TC009600 [Tribolium castaneum]
Group
Gene OntologyGO:00056344.1e-124nucleus
GO:00038874.1e-124DNA-directed DNA polymerase activity
GO:00062604.1e-124DNA replication
GO:00082704.1e-124zinc ion binding
GO:00036769.2e-80nucleic acid binding
GO:00001661.4e-76nucleotide binding
GO:00061391.4e-76nucleobase, nucleoside, nucleotide and nucleic acid metabolic process
GO:00036771.5e-16DNA binding
KEGG pathwaytca:6576600.0 
 K02324 (POLE1)maps-> Purine metabolism
    Base excision repair
    DNA replication
    Nucleotide excision repair
    Pyrimidine metabolism
InterPro domain[1394-1793] IPR0136974.1e-124DNA polymerase epsilon, catalytic subunit A, C-terminal
[64-495] IPR0123379.2e-80Ribonuclease H-like
[263-822] IPR0061721.4e-76DNA-directed DNA polymerase, family B
[88-424] IPR0061331.9e-65DNA-directed DNA polymerase, family B, exonuclease domain
[619-1014] IPR0061341.5e-16DNA-directed DNA polymerase, family B, multifunctional domain
Orthology groupMCL13310 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214944-TA
ATGGTTCTTCAAAATACTGGTAAATACAGGACAGACAAAAACGATTCAAATAAAAAAGATCCCGGGACATCTTCTGCACCCAGGGAAGAATCATCTGAATCACGACTGCGAAATGCCTTGGATAACGATAAAATTGATTCTAAATATGGGTTCGATAGAGTCAGAGATACTAAGGAACGAACGGGTTACTTAATAAATATGCATACGGCTGAAATATTGGATGAAGACAAAAGACTCGTAGCAGCTGTAGATTATTACTTCATTGAAATGGATGGATCTAGGTTTAAAGTGTCTCTGGTGTTCCAGCCTTATTTTCTTGTTCTTGCTAGGAAGGAATGTGAACAAGAAGTTATACAATACTTATCTAAGAAGTTCGCTGGAACGATTCACAAGATAGAAGTAGTAGAAAAGGAAGATCTAGATTTGCCTAATCATTTGTCAGGCATAAAACAGAGATACATTAAATTGTCGTTCATGTCGCAAAACGAGATGATGAAAGTCAGAAAAGAATTATTAACAGCAGCAAACAAAAGCAAAGAAAGGGAGAAAAAGGATGCGATTTACTCCGAAATGTTGACAAATGTATTGACAAATGCAGCTGCTATCGAACATGCTAAGAAAACTACAGATCATTTGGAAAATATTATCGACATAAGAGAACATGATGTGCCGTACCACATAAGAGTGTCAATAGACATGAAGATATTCTGTGGAACCTGGTACACTGTTAAAAGCAGGGGGAATGACACTCCAATATTTACTAAACGTGACGACATTCTTGAGAGACCGGACCCTATTGTTCTGGCATTTGATATTGAGACAACAAAGTTACCGCTGAAGTTCCCAGACTCACAGACGGATCAGATTATGATGATATCTTACATGATAGATGCTCAGGGATACTTGATTACCAACAGGGAAATTATATCCACTGATGTAGAGGATTTTGAATACACCCCTAAACCCGAGTTTGAAGGGCAATTTATAGTCTTCAACGAGCCCAATGAACTTGCCTTAATTCAAAAATTCTTTGATCACATAATGGATATTAAGCCACACATATTTGTCACTTACAATGGAGACTTTTTCGACTGGCCGTTCGTGGAGGCCCGTGCGGCAGCACTAGGGCTGGATATGCGTCAAGAGATTGGATTCAGTAGACTTACTAGGGATGGGACATACGCTTGCAGACCGGCCATGCATATGGACTGTTTGTGCTGGGTGAAGAGAGATTCATACCTGCCAGTGGGTTCCCAAGGTCTGAAGGCGGTCGCCAAGGCCAAGCTGAGATATGACCCAGTGGAATTGGACCCTGAAGATATGTGCAGGATGGCCTCTGAACAACCACAGGTGTTGTCAAACTACTCGGTGTCAGATGCGGTGGCCACATACTACTTGTATATGAAGTATGTACAACCTTTCATATTCGCCCTCTGCACCATCATACCACTGGAACCGGATGAGGTACTCCGTAAGGGCTCGGGCACACTGTGCGAGTCATTGCTCATGGTGGAAGCCTTCCACGCTAACATCGTGTTCCCCAATAAGCAAGTGGACGAACTCAACAAGCTGACAAGCGACGGACACGTCCTGGAAACGGAGACGTACGTTGGAGGGCACGTTGAGGCGCTGGAATCTGGCGTATTCCGCGCCGATATCAAATGCAAGTTCCGCATCGTACCATCGGCTGTGGAAAAACTGATAGAAAACATAGAGAAGACCATGAAACACGCGATAGAAGTGGAGGAAGCTGTTCCCTTAGAGCTGGTTACCAACTTTGACGAAGTGTGCTCAGAGATAAGAGAGAAATTGCAACATATGAAGGACCATCCCAGGAGGGACGAGAACCCACTGATATATCATTTGGATGTGGGAGCAATGTATCCCAATATTATTTTAACTAATAGACTACAACCGTCGGCCATGGTGAGTGCCAGCGCGTGCGCCGCCTGCGACCACAACAGACCGGGGGCTCGCTGTCAGAGACGCATGGACTGGACGTGGCGGGGGGACTACCTGCCGGCGACCAGGAGCGAGTACCAGCGGATACAGCAGCAACTGGAAACGGAGAAGTTCCCTCCCCTACATCCCGGGGGACCCAATATATATGAATATACTCTTCACCTCCCTTATATTATAATGGACCGTCGCTATGAATACAAGGCGTTGAACAAGCAGGCCAAGGCGCTGGTGGCGGAGGCGGTCCGCGCCGGGGACGCCGCGGACATCAAGAACGCGCGCTCCAGGGAAGTGCTGTACGACTCCCTGCAGCTCGCGCACAAGTGCATCCTCAACTCCTTCTATGGATACGTTATGAGGAGGGGAGCTCGCTGGCACAGTATGGAGATGGCGGGCATCGTGTGTCACACGGGCGCTAACATCATCATGAGGGCGAGGGAGATCATCGAGCAGGTGGGGCGGCCGCTGGAACTCGATACAGACGGTATCTGGTGCATACTACCCTCGTCGTTTCCAGAGAACGTCACTATCAATACCACTCATCACAAGAAGAAGAAAATTAATGTCTCATATCCGAATGCCGTTCTGAACGCTATGGTTAAGGATTATTTCACCAACGATCAATATTACGAACTAGTGGATCCTGTGGAGAAGAAGTATGAAGTTCGCTCTGAGAATTCTATATTCTTTGAGGTTGATGGACCTTACTTGGCGATGGTGCTGCCAGCCTCCAAAGAAGAAGGGAAGAAGCTCAAGAAGAGATACGCTGTGTTCAATTTCGATGGTTCCTTGGCTGAACTCAAAGGTTTCGAGGTGAAGCGTCGCGGTGAGCTGCAGCTGATCAAGATCTTCCAGTCATCAGTATTCGAGGCGTTCCTCAAAGGGAACGATCTGAAGTCGTGTTACGGCGCCGTAGCGAAGGTTGCCGACTACTGGCTCGACGTGTTATACAGCAAGGGATCCAACATGCCGGACTCGGAACTGTTCGAGCTGATATCAGAAAACAGATCTATGTCCAAGAAGCTAGAAGACTACGGCGGGCAGAAGTCTACGTCTATATCCACTGCCAAGAGATTAGCGGAGTTTCTCGGTGACGAGATGCTCGACAACCCAGTCCCCCGAGTGCAGTATCCAGACTGGCTTCACAAGAAAATACTGGCCAAGACAGACAAGTACAAGACGAGGAAAATCACAGAGATGTTTACATCAAAGCCCAAAGAAATCAGGAACTGCGAGGACGCGGACGCCGCGTGTGAGAGTACAGAAGTGGATATCGAAGACATTGGTAAAGAGTGCACGGCCAAACCTATTCAACCCGTGGCGCACCGCGTGAAGCGGAAACGAGAACCGTCCCCGAGGAGAGCGAGCTCGTGGAGGGAGGCTCTGGGACCACCCCCCAGCTTCGGCGATACCAAGGAGCAGAGAACGGCGTGGATAGTGTTCCAGAAGAAAAAATGGATGTGGCAGATGGAGCAACGTGGAGTCAGAGACAGAAATAAAAGGGGCAAGGTCGACAGTGACGTTTCACTGATGTCGAAGAGCCTCGGCCCGGCCAATACTCTGGGGGGATTCATTAAGAGAGCGCAGAGGACTCTGCTGAACACACCGTGGCAGGTTATACAGATCCAGGAGACGGCTGAGCCGGGAGTGTTCCGTGTGTGGTCGCTGGTAGGGTCGGAGCTGCACCAGCTGAAGCTCACCGTCCCCAGACAGTTCTACGTGAACCGGCGCGTGCCGCTGACCTGCGACACGGGACCCCACTGGCGCCGCGTGTCCCGCGTGCTGCCCCGCTCCCGGCCGGCGCACCATCTCTACTTATACACCGTCCCTGAAGCTCTATACAGGGAGCGATGCCAGGAGCTGATGAGTGAGCTGTCGGCTCCAGAGATCGAGGGTATCTACGAGACACAGATGAGTCTGGAGTTCCGAGCGCTCGTGACCATCGGATGTGTGTGTGTCGTAGACCCCCAGGAGGCACGGAGATTGATACAGTTCGGCTCAAACAACATGGACTCGTTTAATCTGAGCCAGCTCCAATTCAAGAGTGTGGCTCACCAGCCGTACCTACAGAAACAGGACGAGGTGTCTCCTATAAAGCACATCTTCCTATATCAACACTCGTCTCCCAACTCGAGCCGCAGTATGTGGTCGCTGGTGCTGGCTCCTATTAAGAAAGGCTATATTTTTGTCCTGGACACCGTCAAGACGAACCAAGTGCCGAACATGAACACCCTGTATAACGCTGAGAGGACCGCTAAAATTAATCTAGGTACGGATGAGAAGAGTCTGCCAAACGCGGATCTGTCGTGGGAGGTGGTTGTGGAGACCGAGCCTCGGGCTGTGTACCGCGGGGTCCAGCGAGCGTTACAGAGGTACAGGGACGAGAGGTGCGGACCCACCATGGTGGCCCTTCAGACCACCCTATCATCACACGCGCTCCTGGCACTCATGCCCGGACTTTCTGAATTCCCGCTTGTGCCACTGCACGTGAGAGACGTAGAGACTCTGTACAACACCTTGGAGTGGCAGCGGATAGGAGCTAGAGCCATCGTCCGGCACTACCTCAACTTGGACTCCGTTCTGGAACTCACTATAGAGCAGTGCAGGTACTTCCACCTCCCCCTCGGCAACACGTCGTCCGACCCGACCCAGTTCGGTGCCGATGTGTTCTTCGCACGTCACCTCCTCAAACATAACTTCGTGCTGTGGTGCTCCGGCTCCGAGCGACCAGACCTCGGCGGCCGGGAGATTGATGACAATAGGCTTGTGAGTGAGTCCGACGAGTGGTCTAGCTGCCGGGTGTGTTCCCGGGGATCCTACGGCGGGGTCTGCGTGGCTCTGACGGGAGACGCGCTGGCCGTGTCCGCTCTGCTGCAGGCGCATACTATATTACAGACTGAAGGCACCAGCGTCGCCACGTCCTTCGGAGCGCAGCACGCCAACATACAGGACGTGATGGCCTCCACAGGTTCAAACGCCACAGAGACGTATGACGAGACAGCTCAATGCAGCGCAGCCTTCAAGATTCTGAGGACGATGGTGGCTTCATGGCTGCGTGACGTCACTCACTACAAAAACGTCTTCGCGGACTTCCAGATATCACACTTCTACAGATGGCTCAAGTCGCCCACCTCTCTATTGTACGACCCGGCCCTGCGCCGCACCTTGTACAGCCTCATGAAGAAGCTGTTCCTCATGCTGGTAGCGGAGTTCAGGAGACTCGGCTCGCAGATCATCTACGCCGACTTCAACAAGATCATTCTGTATACCAAGAAACACTCGCTCATGGACGGCATCGGATACGTGGAGTTTGTGGTTCAGAGTATCAGGAACAAGGAGCTGTTCCACGGCATTGACATCCGGTACAAACAGTGCTGGACCTACCTGCTGTGGCTCGATGAAGCCAACTATGCGGGGATTGAGGGCAAACTGCCAGCGGGTCTGGTGGAGGTGGGGTCGTCTCAGATACCAGATAACACGGACGTGGAGGAGGAGGGTGAGATAACTATGGAGTGGAACATATCCCGGTTCATGCCGGGCATCACCCGCGAGTCGTTCGTGTCGTGCGTGGCGGGCTTCTTGTCCGCGGCACACGCCGATTCCGGGACACTCAAGGCGATGCTGGCCGGGGAAATGACGCAGAAATTTTTCCAGTTAGCAGAGAAAATAAATCGTCGTATGCCAACTCTGAGCCTGGAAGATATCGGTCCCCAGGCTGGGCTCCGCGAGGAGTCGCTAGAGGGCGCCACGCCCGCGCTGCTGTTCGTTAACGCGCTCTGTCATGTACTGGGACTTGACACAAACTTGGAAGATGAGGTGACTCTCCTCCGTCGCAACCTCTTACGGTTGCTGTCAGTGGGCGAGTTCAGTTCTTTAGCCGAGTGGCGGGACCCCTGCGCCTCGTGTGTGTTGGCCGAACTCAGCTGCCGGGTCTGCAACCACTCTCAAGACCTGGACCTGTGTCGCACGCAGCATCATGACAATGAACAGCCAGTGTGCTTGTGTCCAACTTGCGGCACAGCATATGATAACCAGGAGTTGGAATGGAAACTCATTGAAACTATGAACAGGAGAGCTATGAGTTATACCTTACAAGATCTTATATGCACCAGATGTCATCAGGTGAAGCGTGAGAATCTATCAACGGTGTGCGACTGTGCCGGTGAATTCACAACAATGGTATCCCTCAAGGAAATAAGAACGCAGCTCATGACCTACAAGACTATAGCCGAGTACTATAAAATGCCCTTACTAATGGAACTCATAGCGTATAATTTAAGTAACATGTAA

Protein sequence:

>DPOGS214944-PA
MVLQNTGKYRTDKNDSNKKDPGTSSAPREESSESRLRNALDNDKIDSKYGFDRVRDTKERTGYLINMHTAEILDEDKRLVAAVDYYFIEMDGSRFKVSLVFQPYFLVLARKECEQEVIQYLSKKFAGTIHKIEVVEKEDLDLPNHLSGIKQRYIKLSFMSQNEMMKVRKELLTAANKSKEREKKDAIYSEMLTNVLTNAAAIEHAKKTTDHLENIIDIREHDVPYHIRVSIDMKIFCGTWYTVKSRGNDTPIFTKRDDILERPDPIVLAFDIETTKLPLKFPDSQTDQIMMISYMIDAQGYLITNREIISTDVEDFEYTPKPEFEGQFIVFNEPNELALIQKFFDHIMDIKPHIFVTYNGDFFDWPFVEARAAALGLDMRQEIGFSRLTRDGTYACRPAMHMDCLCWVKRDSYLPVGSQGLKAVAKAKLRYDPVELDPEDMCRMASEQPQVLSNYSVSDAVATYYLYMKYVQPFIFALCTIIPLEPDEVLRKGSGTLCESLLMVEAFHANIVFPNKQVDELNKLTSDGHVLETETYVGGHVEALESGVFRADIKCKFRIVPSAVEKLIENIEKTMKHAIEVEEAVPLELVTNFDEVCSEIREKLQHMKDHPRRDENPLIYHLDVGAMYPNIILTNRLQPSAMVSASACAACDHNRPGARCQRRMDWTWRGDYLPATRSEYQRIQQQLETEKFPPLHPGGPNIYEYTLHLPYIIMDRRYEYKALNKQAKALVAEAVRAGDAADIKNARSREVLYDSLQLAHKCILNSFYGYVMRRGARWHSMEMAGIVCHTGANIIMRAREIIEQVGRPLELDTDGIWCILPSSFPENVTINTTHHKKKKINVSYPNAVLNAMVKDYFTNDQYYELVDPVEKKYEVRSENSIFFEVDGPYLAMVLPASKEEGKKLKKRYAVFNFDGSLAELKGFEVKRRGELQLIKIFQSSVFEAFLKGNDLKSCYGAVAKVADYWLDVLYSKGSNMPDSELFELISENRSMSKKLEDYGGQKSTSISTAKRLAEFLGDEMLDNPVPRVQYPDWLHKKILAKTDKYKTRKITEMFTSKPKEIRNCEDADAACESTEVDIEDIGKECTAKPIQPVAHRVKRKREPSPRRASSWREALGPPPSFGDTKEQRTAWIVFQKKKWMWQMEQRGVRDRNKRGKVDSDVSLMSKSLGPANTLGGFIKRAQRTLLNTPWQVIQIQETAEPGVFRVWSLVGSELHQLKLTVPRQFYVNRRVPLTCDTGPHWRRVSRVLPRSRPAHHLYLYTVPEALYRERCQELMSELSAPEIEGIYETQMSLEFRALVTIGCVCVVDPQEARRLIQFGSNNMDSFNLSQLQFKSVAHQPYLQKQDEVSPIKHIFLYQHSSPNSSRSMWSLVLAPIKKGYIFVLDTVKTNQVPNMNTLYNAERTAKINLGTDEKSLPNADLSWEVVVETEPRAVYRGVQRALQRYRDERCGPTMVALQTTLSSHALLALMPGLSEFPLVPLHVRDVETLYNTLEWQRIGARAIVRHYLNLDSVLELTIEQCRYFHLPLGNTSSDPTQFGADVFFARHLLKHNFVLWCSGSERPDLGGREIDDNRLVSESDEWSSCRVCSRGSYGGVCVALTGDALAVSALLQAHTILQTEGTSVATSFGAQHANIQDVMASTGSNATETYDETAQCSAAFKILRTMVASWLRDVTHYKNVFADFQISHFYRWLKSPTSLLYDPALRRTLYSLMKKLFLMLVAEFRRLGSQIIYADFNKIILYTKKHSLMDGIGYVEFVVQSIRNKELFHGIDIRYKQCWTYLLWLDEANYAGIEGKLPAGLVEVGSSQIPDNTDVEEEGEITMEWNISRFMPGITRESFVSCVAGFLSAAHADSGTLKAMLAGEMTQKFFQLAEKINRRMPTLSLEDIGPQAGLREESLEGATPALLFVNALCHVLGLDTNLEDEVTLLRRNLLRLLSVGEFSSLAEWRDPCASCVLAELSCRVCNHSQDLDLCRTQHHDNEQPVCLCPTCGTAYDNQELEWKLIETMNRRAMSYTLQDLICTRCHQVKRENLSTVCDCAGEFTTMVSLKEIRTQLMTYKTIAEYYKMPLLMELIAYNLSNM-