DPGLEAN22719 in OGS1.0

New model in OGS2.0DPOGS203580 
Genomic Positionscaffold110:+ 113116-116462
See gene structure
CDS Length1779
Paired RNAseq reads  458
Single RNAseq reads  1429
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008573 (2e-144)
Best Drosophila hit  CG11418, isoform A (3e-97)
Best Human hitpoly(A) RNA polymerase, mitochondrial precursor (6e-57)
Best NR hit (blastp)  GL14387 [Drosophila persimilis] (4e-115)
Best NR hit (blastx)  GL14387 [Drosophila persimilis] (2e-107)
GeneOntology terms









  
GO:0071044 histone mRNA catabolic process
GO:0003723 RNA binding
GO:0006397 mRNA processing
GO:0004652 polynucleotide adenylyltransferase activity
GO:0005524 ATP binding
GO:0016779 nucleotidyltransferase activity
GO:0005739 mitochondrion
GO:0000166 nucleotide binding
GO:0006350 transcription
GO:0016740 transferase activity
GO:0005737 cytoplasm
InterPro families  IPR002058 PAP/25A-associated
Orthology groupMCL14825

Nucleotide sequence:

ATGTCTGTATTATTTCAGACAGGAAAATATTGTCTTAGAAAATCTTACAGTTATAGCTTT
TGTAAATATGTATTGTTGAGAAAAAAATCAGATGTTCCTAGGAAATTCATAACTTTCGAC
GAGATCGTAACCCAGAGAAGAGCTGAGGCACGACGCAGCCTTGTTGTACAAGTTAACTCT
GAATCATCCTTTGACGAGTTGTATGGTTATTGCTCTAAGTATAGCTCAATAAAAGATGTT
TACCATTATAAGAATTCAGGAGAAGAGCATTTCATGCTAATAGAATTTAGTTCAGAAGAA
AATCTTCAGAGTATTCTGCAATCATGTTGTTCTCATCAAAAAGACCTCGAAGTGATGGCG
GTGCAATCCCCATTTGTTTGGTTCCGAGCAGCATCAGATGCCAAACAGAAACTCACTGCC
AATGGACTGGAACTGAGAGTAAAAGACGGCAACAACAAACATAGTGAAGATTTGTTATTT
GAAGATTTAATGAAATGTCAGACAGTGTCAGAACAAATTCAGATGTTGTATGACAAAACT
ATACTGAATGATGTAGGAGCTAGGCTGAGGTTCATGGTGGCGAGACAGTTGGAGGTGATA
CTGAGCAGCTTGTACACTAATATCCAAGTACTACCATTTGGTTCCTCCGTTAATGGTTTC
GGTAAAATGGGCTGTGATTTGGATTTAGTTCTGACAAACTCATTGACTGATGGAATGATG
TCACCTACAAATCGCCTGGTGTACCAGGAGAAGAGGTCGGAGGGGAGTCGTGGTCCGTGG
CAGCGTCACATGGAGCTGGTGGGAGCGTTGTTGGAGCTGCGGGTCCCCGGGGCCACGAGG
GTGCAGCGGATACTCAACGCACGGGTGCCCATTGTGAAGTACTCACAGGAACTGGCCGAT
GTCGATGTGGACCTCTGCTTCAAAAACATGTCCGGCGTCCACATGTCGGCCCTGCTGTAC
TCCCTGGGGGCGCTGGATCCCGCGGGCCCGGCGTTGGCCGTCTCGGTCCGGCGCTGGGCC
GCGGCGGTGCAACTCACGCAGCCTCACCCCGGCCGATGGATCACCAACTTCCCGCTCACA
CTCATGGTGCTGTTCTTCCTCATGACACAGAAGATCCTGCCCACCTTCAGGTGTCTACTA
GAATGTGCCGGTCGGTTATATACTGATAACATAAACTGTACGTTTGTCCGCGACCTGTCC
CGCCTGCCTCCGCATTCGTACCGACCTTCTTCGGACGACCTCCAAACTCTTCTGCTCAAA
TTCTTCGAGTTCTACTCCCAGTTCGACTTCCAAGAGCACGCGATATCGGTCATCGAAGGT
AAACCGATCAGGAAGCCCAACACCCTACCGCTGTACATAGTCAACCCCCTGGAGCCGGCT
CTCAACGTCAGCAGGAACGTCAGCTACGAGGAGTGCGAGAGACTCAAGATGGAGGTGAGG
AACGCGGCCTGGCATCTGGAGGCGTGTCTCGACAACAACAGGGGGGACGACTGGGGAATA
CTGGGCCTCGTGGAGAAGAAAACCACCAGGGGACTCAAGAAGCTGCTGCGGGCGGGGAAC
CAGCACAGGCTCGTGTCCGTCAAGGATCTGTTCAAGGACGACGAGCAACCCGCGTCCGCC
ACGAGCCCCAGGAAGACACATACCGACACCGACAGAGAAGGACAGTCCAGCGGGAGACAT
CACAGACACGGGGCGGACGACAAACACGACAAAGTAAAACTCAAAAACACACAGACGGCG
AAGGAAGTGTACCGGATAAGGAGAGACAAACTGATCTGA

Protein sequence:

MSVLFQTGKYCLRKSYSYSFCKYVLLRKKSDVPRKFITFDEIVTQRRAEARRSLVVQVNS
ESSFDELYGYCSKYSSIKDVYHYKNSGEEHFMLIEFSSEENLQSILQSCCSHQKDLEVMA
VQSPFVWFRAASDAKQKLTANGLELRVKDGNNKHSEDLLFEDLMKCQTVSEQIQMLYDKT
ILNDVGARLRFMVARQLEVILSSLYTNIQVLPFGSSVNGFGKMGCDLDLVLTNSLTDGMM
SPTNRLVYQEKRSEGSRGPWQRHMELVGALLELRVPGATRVQRILNARVPIVKYSQELAD
VDVDLCFKNMSGVHMSALLYSLGALDPAGPALAVSVRRWAAAVQLTQPHPGRWITNFPLT
LMVLFFLMTQKILPTFRCLLECAGRLYTDNINCTFVRDLSRLPPHSYRPSSDDLQTLLLK
FFEFYSQFDFQEHAISVIEGKPIRKPNTLPLYIVNPLEPALNVSRNVSYEECERLKMEVR
NAAWHLEACLDNNRGDDWGILGLVEKKTTRGLKKLLRAGNQHRLVSVKDLFKDDEQPASA
TSPRKTHTDTDREGQSSGRHHRHGADDKHDKVKLKNTQTAKEVYRIRRDKLI