Monarch geneset OGS2.0

DPOGS208948
Transcript         DPOGS208948-TA    3840 bp
Protein            DPOGS208948-PA    1279 aa
Genomic position   DPSCF300009 + 333593-340601
RNAseq coverage    56x (Rank: top 69%)
Annotation
Heliconius         HMEL002638          0.0    80.91%
Bombyx             BGIBMGA002418-TA    0.0    76.44%
Drosophila         mus308-PA           0.0    51.31%
EBI UniRef50       UniRef50_D6X4C0     0.0    57.71%    Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X4C0_TRICA
NCBI RefSeq        XP_969311.1         0.0    57.71%    PREDICTED: similar to DNA polymerase theta [Tribolium castaneum]
NCBI nr blastp     gi|91091764         0.0    57.71%    PREDICTED: similar to DNA polymerase theta [Tribolium castaneum]
NCBI nr blastx     gi|91091764         0.0    57.71%    PREDICTED: similar to DNA polymerase theta [Tribolium castaneum]
Group
Gene Ontology      GO:0003887    3.8e-79    DNA-directed DNA polymerase activity
                   GO:0003677    3.8e-79    DNA binding
                   GO:0006260    3.8e-79    DNA replication
                   GO:0005524    6.1e-20    ATP binding
                   GO:0004386    6.1e-20    helicase activity
                   GO:0003676    6.1e-20    nucleic acid binding
                   GO:0008026    6.1e-17    ATP-dependent helicase activity
KEGG pathway
InterPro domain    [1039-1240]    IPR001098    3.8e-79    DNA-directed DNA polymerase, family A, palm domain
                   [993-1015]     IPR002298    1.3e-22    DNA polymerase A
                   [282-382]      IPR001650    6.1e-20    Helicase, C-terminal
                   [18-172]       IPR011545    6.1e-17    DNA/RNA helicase, DEAD/DEAH box type, N-terminal
                   [4-210]        IPR014001    3.5e-11    DEAD-like helicase
Orthology group    MCL14405    Single-copy universal gene
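
The InterPro rows above are listed by score, not by position, so the domain layout along the 1279 aa protein is easier to read once sorted. The following is a minimal sketch that orders the hits from N-terminus to C-terminus and reports each span. It assumes the bracketed coordinates are 1-based, inclusive amino-acid positions, which the record itself does not state; the names DOMAINS, domain_span and sorted_by_position are illustrative only.

```python
# InterPro hits copied from the record: (start, end, accession, description).
# Assumption: coordinates are 1-based, inclusive amino-acid positions.
DOMAINS = [
    (1039, 1240, "IPR001098", "DNA-directed DNA polymerase, family A, palm domain"),
    (993, 1015, "IPR002298", "DNA polymerase A"),
    (282, 382, "IPR001650", "Helicase, C-terminal"),
    (18, 172, "IPR011545", "DNA/RNA helicase, DEAD/DEAH box type, N-terminal"),
    (4, 210, "IPR014001", "DEAD-like helicase"),
]
PROTEIN_LENGTH = 1279  # aa, from the Protein row of the record


def domain_span(start, end):
    """Return the domain length under the inclusive-coordinate assumption."""
    return end - start + 1


def sorted_by_position(domains):
    """Order domains from N-terminus to C-terminus (by start, then end)."""
    return sorted(domains, key=lambda d: (d[0], d[1]))


for start, end, accession, description in sorted_by_position(DOMAINS):
    span = domain_span(start, end)
    print(f"{start:>5}-{end:<5} {span:>4} aa  {accession}  {description}")
```

Sorted this way, the helicase-related hits fall in the N-terminal part of the protein and the family A polymerase hits in the C-terminal part.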

Nucleotide sequence:

>DPOGS208948-TA
ATGTTCGATTGGCAAGTTGAATGTCTCAGCAATCCAAAAGTGCTTATAGATTGTCAAAATCTGTTATATTCGGCACCAACATCTGCTGGTAAGACACTTGTTGCTGAATTATTGACCATTAAGACTGTTCTGGAAAGACAGAAAAAAGTCATAATCATATTACCCTTTGTATCAATTGTGAGAGAGAAAATGTTTTATTTGCAAGACATATTATCTAGTTCAGGTATCAGGGTAGAAGGATTCATGGGCTCCCAGACTCCACCTGGTGGTTTACAGGCAGTACACATTGCGATATGTACAATTGAAAAAGCGAATAGTTTAATCAATAAACTTTTAGATGAAGGAAATATATCAGAATTGGGTGCTGTAGTTGTTGATGAATTACATTTACTTGGAGATCCACATAGAGGATATATTCTGGAGCTTCTTTTAACTAAAATTAAATATACAGCATCTAAATTAAATGATCTCTCAATACAAATAATAGGAATGTCTGCAACTTTACCAAATTTAAAAATGTTGGCGGATTGGTTGGAAGCTCATTTATTTATAACAGAATTTCGGCCCATACCTCTAATTGAATCATGTTTGGTCGGAGACAAGTATTATAATAAAAAAGGTGAACACATAGGCATGCTGTGTAAGTCAAATTTAAAAGAAATTGATGATGATAGTGTCCTTTTGATTTGTCTGGAAACAATAAAAAGCAGTTGTTCTGTTCTTATATTTTGTATGACTAAGAATAGATGTGAAAACTTAGCACAGAGCATTGCATCATCATTTTTTAAATTGGGTTGTATGAATAATGAACAAGGTATGATTTTAAGAGAACAATTAAAGACTTCAAGTATTCTCGAAGTTTTAGAACAATTGAAAGGTTGTCCTGTTGGTTTGGATCCAGTATTAAAAAATATTATCTCATTTGGAGTTGCATATCATCATGCTGGACTTACATTCGATGAGAGGGACATAATAGAAGGGGCATTCAAATCTGGTGCTGTGAGAGTACTCGTTGCTACATCCACCTTGAGTTCCGGTGTTAATTTACCTGCTAGAAAAGTAATCATCAGGTGCCCCATGTTCCAGAAGCAACCAATTAATATTTTGACCTATAAACAAATGGTTGGCAGAGCTGGGCGTATGGGAAAAGATACAAAGGGAGAAAGTATTCTAATATGCACTCCAAATGAACAAAAAATTGGATTTGATCTGATGATGGGGGATCTGGATCCTGTAAAAAGTTGCATAGAGACTGAAGATAAATTTATGAGAGCTGTATTAGAAATGATTGCTAGTCAAGATGTTTGTACGGAAGAACAGTTAGATTTGTACTCTAAAAGTACACTATTATTTAGCCAACAAAGTCTCCATCCATCCCAAAACTTTTTATTAAATGACACTCTAAAGGAACTCGTCAATTATGAACTTGTGAGAATACAAAAAGATGGAGAAGAAATAAGATATGTAGCCACTTCATTAGGGAAAGCCTGTTTGTCATCTTCCATGTCGCCAAACGATGGAATATCTTTGTTTTGTGAGTTACAAAAAGCTCGACAATGTTTAGTCTTAGAAACAGACTTACATCTTATTTATTTAGTGACGCCATATAGCGTTAGTAATCAATGGAATAATATAGATTGGTTACATCTGCTCACTCTTTGGGAAAGTCTCACATCCGCCATGAAAAGAGTTGGCGAGCTTGTTGGTGTCCAAGAGAGTTTTATAATTCGTTGCTTAAGGGGAACAAACAAAAATAATAATAACCAAAATAAACTTAATATACATAAGAGATTTTATACAGCACTAGCATTACAGGATTTAGTGAATGAAGTGCCACTCTCTGAAGTTGCTGGTAAATTTCAGTGTGCTAGAGGTTTCTTACAAGGTTTACAGCAAGCTTCCGCTACATTTGCCGGAATGGTAACATCATTTTGTCATCAACTTGGGTGGAAAAACATGGAAATGATTATATCGCAATTTCAAGATCGTTTGCATTTTGG
TATACATTCAGAGTTATTAGAACTCATGAAACTATCCTCCCTAAACGGCGTTCGAGCGAGAACTTTATTTAATGCGGGTTTTGAAACTGTTGCAAGCATTGCATCAGCTGAAGTTAATGTTATAGAAAATGCACTTCATAAATCCGTACCATTCCAAAGTGAAAAACAAAGAGACGAAGATGATATGAGCGATTTAAGAAAAAGGAATAAAATCAAGAATATATGGATAACAGGCTACTGTGGCGAACACGAGCAGATATTTAAAACAAAGATGTCGGAGATTCTATCAAACGATTCCCTTCAGTTGGATATGCTGTCGATAAAGACGTATTACGCTGAAATCAAGAAATATTTTGGAGTTAATTTGTCTTATTGTAACGACGTGTCTTTAGCTGAGTGGCTTCTAGATAGTGAGGAGAAAATATCGACAATCGCTGATCTGGCGTTCAAGTACTGTGATCTAGATTTACAAAAGATGGAAATAAAAATTGACAATCAGATAAAAAGTTACAAATCCTTGAACATGCATGAGATGAATTGTTTAAGGGCATGGTGTTTATGCGATATAGTAAAACAACAGGAGAAAAAAATATCGCAAGAAACATTGGTCATGGAGAAGATCTTAAATACAGAGATCCAAGTTTGCAAGATCCTTGGGGATTGCGAGTATCACGGCATTACGGTGGATAAAGATCTCGTGTCGAGATTTTTGATTGATGTGAAAAATTCTCAAGAGATCTTACAGAAGAAGGCATTTAAGATATGCGGATACCATTTTAATTTCAACTCATCCAAGGATGTAGCTAAAGTTTTAGGACTTTACAAGGGTCGTAAGACCAGCACTAGGAAGAGTGTTCTTTCGGCGCACAACAGTCCTATGTCTAGTATTATAATATACTGGCGGAAACTCAACTCCATACTCACTAAGAGTCTTTATCCCATCACTGAACAAGCCTGTGTATACACTGAAGATAATAGGATATCTCCATCTTATACCATGTACACATGCACGGGACGCATTAGCATGCACGAGCCGAATTTGCAAAACTTACCGCGGAAATTCACGATACCGGCAAACTATTTATGTGATAATGAATCTTGTGACGACGTAATAGAGTTCAATTGTAGGAAAATATTCAGAGCAGCGCCCGGTTACGTTTTCATATCGGCTGATTACTGCCAGTTGGAAATGAGGATTCTGACACACTTTTCCAAGGACGTTACTCTAACTAGGATAATGGGTTCGGATGTTGACGTTTTTAAATCGATTGCAGCGTCTTGGAGTGGTGTGCCCGAGCACGAGGTAGACGAAGATTTACGTCATAAAGCCAAGCAGCTTTGTTACGGTATATTATACGGAATGGGTAATAGGACTCTGTCTCAACATTTAAACGTTACAGAATTAGAGGCTGCATATTTTATGGATATGTTTTATAAGACCTATCCATCGATAAAGGTTTTTACAGCGAGTCTGATAGAGGAGTGTAGGAAGAAAGGTTACGTGGAAACTTTGATGAAGAGGAGAAGATATCTTCCTAACATCAACAGCAGTGTTCCTTCAAAGAGGAGTGCAGCTGAAAGGCAAGCTGTTAACACGACCATCCAAGGATCGGCCGCAGACATAGCGAAGTCAGCGATGTGTTCCATACAACAAAGCACTTCATCACGTCTGATATTACAAATGCACGATGAACTTATATACGAAGTACCGGTTAATAATAAACAAGATTTTATAGTTATTTTAAAAAAATCTATGGAAAATACCGTCCGTCTGAACGTACCTTTACCGGTCAAAATAAAGTGTGGGCAGACCTGGGGTACAATGGAGGACGTCAAATAA

Protein sequence:

>DPOGS208948-PA
MFDWQVECLSNPKVLIDCQNLLYSAPTSAGKTLVAELLTIKTVLERQKKVIIILPFVSIVREKMFYLQDILSSSGIRVEGFMGSQTPPGGLQAVHIAICTIEKANSLINKLLDEGNISELGAVVVDELHLLGDPHRGYILELLLTKIKYTASKLNDLSIQIIGMSATLPNLKMLADWLEAHLFITEFRPIPLIESCLVGDKYYNKKGEHIGMLCKSNLKEIDDDSVLLICLETIKSSCSVLIFCMTKNRCENLAQSIASSFFKLGCMNNEQGMILREQLKTSSILEVLEQLKGCPVGLDPVLKNIISFGVAYHHAGLTFDERDIIEGAFKSGAVRVLVATSTLSSGVNLPARKVIIRCPMFQKQPINILTYKQMVGRAGRMGKDTKGESILICTPNEQKIGFDLMMGDLDPVKSCIETEDKFMRAVLEMIASQDVCTEEQLDLYSKSTLLFSQQSLHPSQNFLLNDTLKELVNYELVRIQKDGEEIRYVATSLGKACLSSSMSPNDGISLFCELQKARQCLVLETDLHLIYLVTPYSVSNQWNNIDWLHLLTLWESLTSAMKRVGELVGVQESFIIRCLRGTNKNNNNQNKLNIHKRFYTALALQDLVNEVPLSEVAGKFQCARGFLQGLQQASATFAGMVTSFCHQLGWKNMEMIISQFQDRLHFGIHSELLELMKLSSLNGVRARTLFNAGFETVASIASAEVNVIENALHKSVPFQSEKQRDEDDMSDLRKRNKIKNIWITGYCGEHEQIFKTKMSEILSNDSLQLDMLSIKTYYAEIKKYFGVNLSYCNDVSLAEWLLDSEEKISTIADLAFKYCDLDLQKMEIKIDNQIKSYKSLNMHEMNCLRAWCLCDIVKQQEKKISQETLVMEKILNTEIQVCKILGDCEYHGITVDKDLVSRFLIDVKNSQEILQKKAFKICGYHFNFNSSKDVAKVLGLYKGRKTSTRKSVLSAHNSPMSSIIIYWRKLNSILTKSLYPITEQACVYTEDNRISPSYTMYTCTGRISMHEPNLQNLPRKFTIPANYLCDNESCDDVIEFNCRKIFRAAPGYVFISADYCQLEMRILTHFSKDVTLTRIMGSDVDVFKSIAASWSGVPEHEVDEDLRHKAKQLCYGILYGMGNRTLSQHLNVTELEAAYFMDMFYKTYPSIKVFTASLIEECRKKGYVETLMKRRRYLPNINSSVPSKRSAAERQAVNTTIQGSAADIAKSAMCSIQQSTSSRLILQMHDELIYEVPVNNKQDFIVILKKSMENTVRLNVPLPVKIKCGQTWGTMEDVK-
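
The stated lengths can be cross-checked with simple arithmetic: the nucleotide sequence starts with ATG and ends with the stop codon TAA, and the protein ends with "-" in place of that stop. The sketch below assumes the 3840 bp transcript consists only of coding sequence (no UTRs), which is an inference from the sequences rather than something the record states; expected_protein_length is a hypothetical helper name.

```python
def expected_protein_length(transcript_bp, includes_stop=True):
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the whole transcript is coding sequence; when includes_stop is
    True, the final codon is a stop codon and encodes no amino acid.
    """
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    codons = transcript_bp // 3
    return codons - 1 if includes_stop else codons


TRANSCRIPT_BP = 3840  # from the Transcript row
PROTEIN_AA = 1279     # from the Protein row

# 3840 / 3 = 1280 codons; removing the stop codon leaves 1279 amino acids.
lengths_agree = expected_protein_length(TRANSCRIPT_BP) == PROTEIN_AA
print(lengths_agree)
```

Under that assumption the transcript and protein lengths in the record agree.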