DPGLEAN01329 in OGS1.0

New model in OGS2.0  DPOGS203531
Genomic Position  scaffold2436:+ 17635-30761
CDS Length  1695
Paired RNAseq reads  472
Single RNAseq reads  1422
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA009175 (2e-110)
Best Drosophila hit  Trf4-1, isoform G (4e-124)
Best Human hit  PAP-associated domain-containing protein 5 isoform a (9e-114)
Best NR hit (blastp)  PREDICTED: similar to CG11265-PA, isoform A [Apis mellifera] (4e-168)
Best NR hit (blastx)  PREDICTED: similar to CG11265-PA, isoform A [Apis mellifera] (2e-155)
GeneOntology terms
GO:0003916 DNA topoisomerase activity
GO:0007062 sister chromatid cohesion
GO:0003887 DNA-directed DNA polymerase activity
GO:0043630 ncRNA polyadenylation involved in polyadenylation-dependent ncRNA catabolic process
GO:0004652 polynucleotide adenylyltransferase activity
InterPro families
IPR002058 PAP/25A-associated
IPR002934 Nucleotidyl transferase domain
Orthology group  MCL11856

Nucleotide sequence:

ATGGATCCCTACGTCGGGTGGTATCAGCCCGAGCAAGAAGGACCCGCGAAGCGCTTGTGG
CTTCGTATCTGGGAAACTCAAAGTGAAACCGACAAAATGAACCTCAAAAACTTAGAAAAT
GCTAATCCTAACGTGAATAAAACACCAGACTTTATACCCTTAAACGAGGTGAACGGTGAA
AATAGGAATAATAATTACTTTAACCCTACACGAAGGAAAGTGAACGACAATCGTGCTTCT
ACGTTTAACCTGAACCAAAATCACGATGCTCTCATAGGTGAATACGGTGGCTGTCCTTGG
AGAATACCCAACTATAATTACAAGCCTGGAGTTCTGGGTCTGCACGAGGAGATCGAGCAC
TTCTACATGTACATGTCCCCGTCTGAGACTGAACACCTGGTCAGAACCACTGTCGTGACA
CGTATCAGGAGCGCCATCCTGTCCCTGTGGCCCCAGGCACGCGTCGAGGTCTTCGGGAGC
TTCCGGACTGGCCTCTATCTGCCGACCAGTGACATCGACCTCGTGGTTATAGGTCAATGG
GAGAAGCTGCCTCTGTGGACCTTGGAGCGAGAGCTGGTAGCCCAGGACATCGCGGAGCAA
GACAGCATTAAAGTATTGGAAAAGGCGACCGTTCCTATTGTCAAGATGACCGACAAGTAC
TCCGACGTTAAGGTGGACATTTCTTTCAACATGAGCAGTGGCGTCAAGAGTGCGGAGCTG
ATCAAACAGTTCAAGGAGCAATATCCAGAACTGTCGAGGCTGGTGATGGTGTTGAAGCAG
TTCCTGCTGCAGCGCGACCTGAACGAGGTGTTCACCGGAGGCATCTCCTCCTACTCCCTC
ATCCTCATGTGCATCAGCTTCCTGCAGCTACACCCGCGGCCGGAGAGACTCCGCCAGAGA
CACAACCTCGGAGTGTTACTGATCGAGTTCTTTGAATTGTACGGAAGGAAATTCAATTAT
GTGAAGACAGCCATCAGGGTCAAGAACGGAGGCTCATATGTATCTAAGGACGAGATCTCA
AAGGAGATGAACGACGGCCATAGACCCTCGCTCCTGTGCATCGAGGACCCGCTGACGCCC
GGCAACGACATCGGCCGGTCCAGCTACGGAGCCATACAAGTTAAACAGGCGTTCGACTAC
GGCTACATAATTCTCCAGCAGGCCGTGGCGCCGCACAACGCGTTACTCGCCCGTCACAGC
GTTCTAGGTCGTGTCGTGCGCGTCACGGATCACGTGTTACAATATAGACGCTGGGTGAGA
GACACCTTCGAGCCGTTCTTCTTCCCGCACCGCGTGAGGCCGAGGCGGGTAGGGAACACC
CGCTCGCCCACCCCCGACCCCACACCCACACCGACGCCCACGCCCTCCGACACTGATCCT
GAGTGGTCGGACGGTTCGGGTCCGTCAGGCCCGGCCCGCACGTCGCCGCCGCCGCTGTCG
GCTCTGCAGTGCTCGTCGCCCACCCCCCGCAGGGTGTCCGCCCACCAGTCCCTCATAATA
CACCACATAACAAGCAACTCGGATTTCAACAACATACCATCGGACCCGCTTGCAGGGGTG
CTCCGCCCCCGGCCGCGGCCCCGCCGTCGCGCGTCGCCCCCCCGCGGCCGAGCCCAACGC
AACGACCGCTCCGACAGACAGGACCGGAACGACAGACCCGACCGCCACCGCAAGAGGCGC
GGCTACACCCGATGA

Protein sequence:

MDPYVGWYQPEQEGPAKRLWLRIWETQSETDKMNLKNLENANPNVNKTPDFIPLNEVNGE
NRNNNYFNPTRRKVNDNRASTFNLNQNHDALIGEYGGCPWRIPNYNYKPGVLGLHEEIEH
FYMYMSPSETEHLVRTTVVTRIRSAILSLWPQARVEVFGSFRTGLYLPTSDIDLVVIGQW
EKLPLWTLERELVAQDIAEQDSIKVLEKATVPIVKMTDKYSDVKVDISFNMSSGVKSAEL
IKQFKEQYPELSRLVMVLKQFLLQRDLNEVFTGGISSYSLILMCISFLQLHPRPERLRQR
HNLGVLLIEFFELYGRKFNYVKTAIRVKNGGSYVSKDEISKEMNDGHRPSLLCIEDPLTP
GNDIGRSSYGAIQVKQAFDYGYIILQQAVAPHNALLARHSVLGRVVRVTDHVLQYRRWVR
DTFEPFFFPHRVRPRRVGNTRSPTPDPTPTPTPTPSDTDPEWSDGSGPSGPARTSPPPLS
ALQCSSPTPRRVSAHQSLIIHHITSNSDFNNIPSDPLAGVLRPRPRPRRRASPPRGRAQR
NDRSDRQDRNDRPDRHRKRRGYTR
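
Consistency check (illustrative sketch, not part of the original record):

The CDS length of 1695 nt and the 564-residue protein listed above can be cross-checked by translating the nucleotide sequence. The sketch below assumes the standard genetic code (NCBI translation table 1); the function name `translate` and the variable names are hypothetical and chosen only for illustration. Only the first and last lines of the CDS are translated here; the full sequence can be passed in the same way.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
# Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGATCCCTACGTCGGGTGGTATCAGCCCGAGCAAGAAGGACCCGCGAAGCGCTTGTGG"
last_line = "GGCTACACCCGATGA"

print(translate(first_line))  # MDPYVGWYQPEQEGPAKRLW
print(translate(last_line))   # GYTR*

# 1695 nt = 565 codons; minus the terminal stop codon = 564 residues.
cds_length = 1695
print(cds_length // 3 - 1)    # 564
```

The translated fragments match the start (MDPYVGWYQPEQEGPAKRLW) and end (GYTR) of the protein sequence, and 564 residues equals nine full 60-residue lines plus the final 24-residue line.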