DPGLEAN09159 in OGS1.0

New model in OGS2.0  DPOGS204200
Genomic Position  scaffold119:+ 90906-97567
CDS Length  1761
Paired RNAseq reads  673
Single RNAseq reads  1767
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA005158 (0.0)
Best Drosophila hit  DNA polymerase alpha 73kD (8e-67)
Best Human hit  DNA polymerase alpha subunit B (7e-72)
Best NR hit (blastp)  DNA-directed DNA polymerase alpha 2 [Xenopus (Silurana) tropicalis] (8e-83)
Best NR hit (blastx)  polymerase (DNA directed) alpha 2 [synthetic construct] (3e-84)
Gene Ontology terms
GO:0000060 protein import into nucleus, translocation
GO:0006260 DNA replication
GO:0005515 protein binding
GO:0005634 nucleus
GO:0005654 nucleoplasm
GO:0003887 DNA-directed DNA polymerase activity
GO:0005658 alpha DNA polymerase:primase complex
GO:0046982 protein heterodimerization activity
GO:0003674 molecular_function
GO:0003677 DNA binding
InterPro families
IPR007185 DNA polymerase alpha/epsilon, subunit B
IPR013627 DNA polymerase alpha, subunit B N-terminal
IPR016722 DNA polymerase alpha, subunit B
Orthology group  MCL12238

Nucleotide sequence:

ATGGCTACCGAAGAATTGGTGACCGAGCAATTTAAATTTTTAGGAATAGAAGTATCAAAA
GAAGTTTTATCAAAATGTGTGAGTATATGCACAGAATACGATGTAGACGCTGAAACTTTC
ATCGAACAATGGATGGCGTTTAGTTTGACTACTTTAAACGGTGCTTCGCCGACTTTAGAT
AACTTAGAATTACTTGAGAAGAGGGAATTTTCCAAACGGTCTGCGAGTCGACTCACCGCT
GTCGCTAATGAAGTTACACATTCAAGTACCGGCAGAAGTTTGACTGTTTACGGAGCGCCT
GTTGCTAAACAGTCAGAAAATGAACTTCTGTCTAATTACATGACAACTACACCAAAGAGA
ATTAAAATAGAATGTGAGACGGGGCCGAAACAGAATGAGCTCCATCCTGCTACATATTCA
CCGAAAGTAGAGCACTCCGTTAAATACACATCGAGAACGAACCAAGGGACAGTGGTGCAT
TCATTCGGTGAAGATAAGCTGGTGGAAGTGATCACGAATACAACGGCCTTGGATAATTTA
CTCGAACTAAATATTGTACAAGTCCCCAACGACGACGGCGAACTATACAACAAAGCGAAG
TACGGCTTTGAACTGCTGCATGAAAAAGCGAGTGTATTTGATAACCACATCCGATATATA
TCACAATGTATCATGAAGAAGAACGGCTTTTCCGAGACGGTCTCGGTGAGGCATAAAACA
CAGACCGAAGTTTTGGTAGCTGGTCGTATTGAATGCGACGCAGATGCTAGACTAAACTCG
AAAAGTGTAATCTTACAAGGCACATGGGAGGATTCACTGAGCCAGACCGTCCCTCTAGAT
TTGGACAGCGTGCAGCAGTATTCTCTGTTCCCGGGCCAGGTGGTGGTGGTGCGTGGTATA
AATCCACGCGGCAACAAGTTTGTGGCTCGGGAGTTGTTCTGCGACGCGGCCCGCCCTGTG
CCGGATCCAACATCAGATATCACGAACACGCTAAAAGGTACATTGTCAATGGTTGTAGCG
GCCGGCCCGTACACCACGTCTAATAACATGTCGTACGAGCCGTTGAAGGACTTCATAGCG
TATTTGAACACTCACAAACCACACGTAGTCATAATGACGGGACCATTCGTGGACTGTGAG
CACGAGAAAGTCAAAGATAACTCTATGGCTGAAACATATAAATCTTTCTTCGACAAACTT
ATTGATAGTCTAGCTGATATCGGCAACACAAGTCCTTTTACAAAAATTTACATAGTGTCA
AGTCACAAGGACGCTTTTCATGTAAATATCTACCCGACGCCGCCCTATAGCAGTCGAAAG
AAATATCCCAACATACAATTTCTACCAGATCCCAGCACATTAAACATCAATGGATATATA
GTTGGCATCACCAGTTACGATGTGCTTATGAGCATCAATCAAGAAGAAATATCACATGGT
TCAGTCGGCGACAAGTTATCTCGTCTGTCCGGGCACGTGTTGCGGCAACAGTGCTATTAT
CCAACGGCTGGCTCTCTCGGCTCGCTGGCGGCGGACGGATCGCTGTGGGCGGCGCACGCA
CAACTACCCGCAACTCCTCACATACTAGTAGTGCCCTCCAACTTCAGATACTTCGTTAAG
GAAGTGAACGGCTGTATAGTCATAAACCCTGAGCATCTCAGTAAAGGTGCCGGTGGCGGG
ACGTTCGCACGACTCGTCGTTCGTCCGCCGACAGAAGATAAAACTAATAGTAATATAGCC
GCACAGATAGTACGCATTTAA

Protein sequence:

MATEELVTEQFKFLGIEVSKEVLSKCVSICTEYDVDAETFIEQWMAFSLTTLNGASPTLD
NLELLEKREFSKRSASRLTAVANEVTHSSTGRSLTVYGAPVAKQSENELLSNYMTTTPKR
IKIECETGPKQNELHPATYSPKVEHSVKYTSRTNQGTVVHSFGEDKLVEVITNTTALDNL
LELNIVQVPNDDGELYNKAKYGFELLHEKASVFDNHIRYISQCIMKKNGFSETVSVRHKT
QTEVLVAGRIECDADARLNSKSVILQGTWEDSLSQTVPLDLDSVQQYSLFPGQVVVVRGI
NPRGNKFVARELFCDAARPVPDPTSDITNTLKGTLSMVVAAGPYTTSNNMSYEPLKDFIA
YLNTHKPHVVIMTGPFVDCEHEKVKDNSMAETYKSFFDKLIDSLADIGNTSPFTKIYIVS
SHKDAFHVNIYPTPPYSSRKKYPNIQFLPDPSTLNINGYIVGITSYDVLMSINQEEISHG
SVGDKLSRLSGHVLRQQCYYPTAGSLGSLAADGSLWAAHAQLPATPHILVVPSNFRYFVK
EVNGCIVINPEHLSKGAGGGTFARLVVRPPTEDKTNSNIAAQIVRI
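
Consistency check (illustrative): the nucleotide and protein sequences above can be cross-checked by translating the CDS in frame. The sketch below is a minimal example, not part of the original record. It assumes the standard genetic code (NCBI translation table 1), since the record does not state which table was used, and the names `translate`, `first_line` and `last_line` are hypothetical helpers introduced here. Only the first and last lines of the CDS are shown; the full sequence can be pasted in the same way.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record does not say which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored. Ambiguous bases such as N
    are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGCTACCGAAGAATTGGTGACCGAGCAATTTAAATTTTTAGGAATAGAAGTATCAAAA"
last_line = "GCACAGATAGTACGCATTTAA"

print(translate(first_line))  # MATEELVTEQFKFLGIEVSK
print(translate(last_line))   # AQIVRI

# A 1761 nt CDS holds 587 codons; one is the stop codon, leaving 586 residues.
expected_protein_length = 1761 // 3 - 1
print(expected_protein_length)  # 586
```

The two translated fragments match the start and end of the protein sequence listed above, and the expected length of 586 residues agrees with the listed protein (nine lines of 60 residues plus a final line of 46).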