DPGLEAN05732 in OGS1.0

New model in OGS2.0DPOGS205534 
Genomic Positionscaffold2284:- 29868-38227
See gene structure
CDS Length1848
Paired RNAseq reads  2124
Single RNAseq reads  5061
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000084 (0.0)
Best Drosophila hit  hiiragi, isoform C (0.0)
Best Human hitpoly(A) polymerase gamma (3e-179)
Best NR hit (blastp)  poly a polymerase [Aedes aegypti] (0.0)
Best NR hit (blastx)  PREDICTED: similar to hiiragi CG9854-PA, isoform A [Apis mellifera] (0.0)
GeneOntology terms



  
GO:0004652 polynucleotide adenylyltransferase activity
GO:0006378 mRNA polyadenylation
GO:0003723 RNA binding
GO:0006350 transcription
GO:0005634 nucleus
InterPro families



  
IPR007012 Poly(A) polymerase, central domain
IPR007010 Poly(A) polymerase, RNA-binding domain
IPR002934 Nucleotidyl transferase domain
IPR011068 Nucleotidyltransferase, class I, C-terminal-like
IPR014492 Poly(A) polymerase
Orthology groupMCL10628

Nucleotide sequence:

ATGTGGCCGGCATCTCAATATTCGCATACAAATCACCAGGCCAACGCCTCCAAGTCCAAT
GAACACCAAAATCAACAGAACCTGAAGACGCTCGGCATGACTTCAGCTATTTCTATGGCA
GGTCCGAAACCCATCGACATTGAAAAGACAAATGAGCTCAAGGAATCCCTGGTGCCGTTT
GGTGTGTTTGAATCCGAGGCTGAGATGCATCACAGGATGGAGGTGCTCGGATCCTTACAT
CGGCTGGTCAGGCAGTGGATAAGAGACGAATCCTTGAGGAAGAACATGCCACCCAGCGTA
GCTGACACAGTCGGAGGCAATATATATACATTCGGATCATACAGGCTCGGGGTGCACCAC
CGAGGCGCGGATATTGACGCCTTGTGCGTGGCTCCAAGACATATCGACCGGTCGGACTAC
TTCCAGTCATTCTACGAACTGCTCAAGGAACAACCTCAAGTGAAAGATCTCCGAGCTGTG
GAGGACGCGTTCGTGCCCGTCATTAAGATGAACTTCGACGGTATCGAAATAGATCTGTTG
TTTGCCAGACTAGCTCTCAAGGAAATACCAGATTCCTTCGACCTCCGAGACGACATGCTC
CTCAAGAACCTGGACCAGAAGTGCGTGAGGTCGCTGAACGGGTGTAGAGTCACCGATGAA
ATACTGAGATTGGTCCCCGATATAAATACCTTTAGACTCACCTTGAGGGCTATCAAGCTG
TGGGCCAAACGGCATGGGATATATTCTAATACCCTGGGCTACCTCGGCGGAGTGTCCTGG
GCCATGCTAGTGGCGCGAACCTGTCAGTTGTATCCCAATGCGTTACCAGCTACATTACTA
CACAAGTTCTTCCTCGTCTTCAGCCAGTGGAAGTGGCCGCAGCCAGTACTCCTCAAACCA
CCGGACTCAGTCAATCTGGGATTCCCCGTTTGGGATCCGAGGGTTAACATGTCGGATCGC
TACCACCTGATGCCCATCATAACACCGGCTTACCCACAACAGAACTCCACGTTCAATGTG
TCGTCATCCACGAGGACGGTCATCATGGAGGAGTTCAGGCTGGGTCTTGCTATAACTGAT
GAGATAATGCTCGGAAAGTGTGGCTGGGAACGGTTGTTTGAAGCTGCAAATTTCTTCTCC
CGCTACAAACACTTCATAGTACTGCTTGCATCATCGGCTAACACCCTGGATCAGCTGCCC
TGGTGCGGGCTGGTCGAGAGCAAGATACGACACCTCATCACCACACTGGAACGAAACCAG
CATATAACAATTGCTCATGTGAACCCGGAGTGTTACAACTCCGTGCCTCTCAATACTAAC
AACGGACATCCGCTCGCCTTACCTCCAGGTACACCAGTACAAACAGAGGAACACGGCGCC
GCTGAAGTTAAAAATGATAAGGGCGAGATAGTGGCAAACGTCTGCTCAATGTGGTTCATA
GGTCTGGTGTTTGACAAGACCAATGTCAATGTTGACCTCACATATGATATATCGTCATTC
ACAAAGGCCGTACACTACCAGGCCGAGAACACTAATGTACTTAGAGAGGGAATGACTATA
GAGGCTCGTCATGTTCGTCGTAAGCAACTTCATCAATACCTGTCTCCGTCACTACTAAGG
AGAGAAAAAGTTAACAAGAGAAAGAATGAAACACTCGCTGTTCATACAAAGAAGGCTAAG
AGGGTATCGGAAAGCAGTGCGGATGAGGTGAGCGTGCTATCGTACACCGAGGACTCGAAC
TCGTCTAACATGTATGAAGTGAACGTACAGAACGGCGCGCATCAAGAACAGAAGACGAGC
GAGAAGGTCGACAGGGGGTCCAGCTCGAGCGGCATAGCGTGCACGTAG

Protein sequence:

MWPASQYSHTNHQANASKSNEHQNQQNLKTLGMTSAISMAGPKPIDIEKTNELKESLVPF
GVFESEAEMHHRMEVLGSLHRLVRQWIRDESLRKNMPPSVADTVGGNIYTFGSYRLGVHH
RGADIDALCVAPRHIDRSDYFQSFYELLKEQPQVKDLRAVEDAFVPVIKMNFDGIEIDLL
FARLALKEIPDSFDLRDDMLLKNLDQKCVRSLNGCRVTDEILRLVPDINTFRLTLRAIKL
WAKRHGIYSNTLGYLGGVSWAMLVARTCQLYPNALPATLLHKFFLVFSQWKWPQPVLLKP
PDSVNLGFPVWDPRVNMSDRYHLMPIITPAYPQQNSTFNVSSSTRTVIMEEFRLGLAITD
EIMLGKCGWERLFEAANFFSRYKHFIVLLASSANTLDQLPWCGLVESKIRHLITTLERNQ
HITIAHVNPECYNSVPLNTNNGHPLALPPGTPVQTEEHGAAEVKNDKGEIVANVCSMWFI
GLVFDKTNVNVDLTYDISSFTKAVHYQAENTNVLREGMTIEARHVRRKQLHQYLSPSLLR
REKVNKRKNETLAVHTKKAKRVSESSADEVSVLSYTEDSNSSNMYEVNVQNGAHQEQKTS
EKVDRGSSSSGIACT