DPGLEAN14569 in OGS1.0

New model in OGS2.0  DPOGS206062
Genomic Position  scaffold84:+ 1687-12335
CDS Length  1446
Paired RNAseq reads  978
Single RNAseq reads  2445
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006845 (1e-77)
Best Drosophila hit  Smg6 (3e-12)
Best Human hit  telomerase-binding protein EST1A isoform 2 (7e-15)
Best NR hit (blastp)  hypothetical protein Phum_PHUM014230 [Pediculus humanus corporis] (2e-74)
Best NR hit (blastx)  hypothetical protein Phum_PHUM014230 [Pediculus humanus corporis] (6e-55)
GeneOntology terms
GO:0016787 hydrolase activity
GO:0005634 nucleus
GO:0005694 chromosome
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0000184 nuclear-transcribed mRNA catabolic process, nonsense-mediated decay
GO:0004518 nuclease activity
GO:0004519 endonuclease activity
GO:0000781 chromosome, telomeric region
GO:0003677 DNA binding
GO:0046872 metal ion binding
GO:0008150 biological_process
InterPro families  IPR006596 Nucleotide binding protein, PINc
Orthology group  MCL10459

Nucleotide sequence:

ATGCCATTCCATAGCTTGTGTGGTATCAGGAATCTTCTGGTTCACACGGCTTGGTGTTCG
TTCCGAGGAGTCAGCGGCGTCACTGGTAGTAACGGCGAGGAGCGCTCGGCTTGGTTAGAA
TGCGCTTTAGGTGTGTCCTTGCTAATGTTCGGTGCGCTGCTCGAGAGGTGCTGCGCGCTG
CTACCCGAGCCGCAGCACACCCAGCAGCACGCTGATGCATTGCTTCTTCTGCCTGCCATT
AAGATGTGGTCGGACTGGATGTTATGTCATAGCAGTATTTGGAATCCGCCACCTAGTTTT
GACAATTTCGAATTAGAAAGCGAAAACGATCCTTGGGATTGGCTCGCGAAGCTTATGAAC
ATATTGGAAGCCCTTGACGACAAATCATTAGAATTTGAGAACGAACCGAAGGAAGGATAT
ATTCCCGTAAGGCTGCCGGAGGATGCTTCCCTCGCTGGGTTCACGCCGCTCATGTACATG
GAGCCGGCGCTGGCGCACGTGGCGCCGCACGCGCACGTCGCGCCTCACGCCGCTGAACAC
GCGCTGCGCATGCGCAAGCTGCTGTTCTTCGGAACCGAATACCTGGTGGGTGTGGAGCCT
CCGGTGCTCAAGCTGGAGTACCCGGTCGGTGAACAGCCGCGGTATGTGAGTGCTGTACAG
CGAGCACAAGCAGACTACCCACCATTACAACATCTTTCTGAAGACTCGGAAGTCGACGAG
AGTGTGAGTACGACCAGCGGTCCGACGAGTCTGAGCGAGGCTGTGGAGGCCCCGGACGAC
GACACCAGGGACCTGCTGAGACGGAGGGACCTTCTGGAGAACAGGAGGGCCACCATAGAG
AAACGACGACAACGGATGCAGGAGATGCTTAGCACCGGATGGGTGAGCGTAGAAGTGGAA
GTTCGTCCGCGTTGGTTGGTACCCGACACCAACTGCTTCATCGACCACCTGCCGTTGTTA
CAAGCGGTGGCCAGCGCACCATCGCAGCCTTACAACCTCGCTGTGCCGCTCGTTGTTGTA
ACGGAGCTCGAGGGTTTGAAGAAGTGTTCTCGTTTGGGGGGTAAGGCGAGGGACGCCCTG
GCCTGGGTGTGCGGTGGTGGTGCGGGGGGTGCGGGAGGAGGTACTGTTAGGTTGGCTACA
GCCCGGGGCTCTCTGCTGACGTCACGGACTTTCACGGCCGAACAGGACGAGGGTAGAGCC
ACCAACGACGACCGCGTGCTGGCCACCGCTCTCAACCTGCAGGCGAATCTCACAGCTGAC
GGAGCGGAGGTCCGGGCGGACGGCGGCTGTGTTCGCGTGGTGGTGTTGGTGACGGACGAC
CGCAACCTGCGCGTGAAGGCTTTGTCGGCAGACCTGCCAGCCAAGGACCTGCTGTCATTC
GCGCAGTGGGCGGGACTGCACGCTGATACGAAACCCAAGTCGGACGCCCGCAGTGCCGAT
GTTTAA

Protein sequence:

MPFHSLCGIRNLLVHTAWCSFRGVSGVTGSNGEERSAWLECALGVSLLMFGALLERCCAL
LPEPQHTQQHADALLLLPAIKMWSDWMLCHSSIWNPPPSFDNFELESENDPWDWLAKLMN
ILEALDDKSLEFENEPKEGYIPVRLPEDASLAGFTPLMYMEPALAHVAPHAHVAPHAAEH
ALRMRKLLFFGTEYLVGVEPPVLKLEYPVGEQPRYVSAVQRAQADYPPLQHLSEDSEVDE
SVSTTSGPTSLSEAVEAPDDDTRDLLRRRDLLENRRATIEKRRQRMQEMLSTGWVSVEVE
VRPRWLVPDTNCFIDHLPLLQAVASAPSQPYNLAVPLVVVTELEGLKKCSRLGGKARDAL
AWVCGGGAGGAGGGTVRLATARGSLLTSRTFTAEQDEGRATNDDRVLATALNLQANLTAD
GAEVRADGGCVRVVVLVTDDRNLRVKALSADLPAKDLLSFAQWAGLHADTKPKSDARSAD
V
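
The CDS length and the two sequences in this record can be cross-checked against each other. The following is a minimal Python sketch of such a check, assuming the standard genetic code and a sequence pasted in as plain text; the function name `check_cds` and the returned keys are illustrative and not part of the database.

```python
def check_cds(cds: str) -> dict:
    """Basic sanity checks on a coding sequence (standard stop codons assumed)."""
    # Drop line breaks and spaces so a multi-line sequence block can be pasted in.
    seq = "".join(cds.split()).upper()
    stops = {"TAA", "TAG", "TGA"}
    in_frame = len(seq) % 3 == 0
    # Split into complete codons only; a trailing partial codon is ignored.
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    return {
        "length": len(seq),
        "multiple_of_three": in_frame,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": in_frame and bool(codons) and codons[-1] in stops,
        # Stop codons anywhere before the final codon.
        "internal_stops": sum(c in stops for c in codons[:-1]),
        # One residue per codon, minus the terminal stop codon.
        "expected_protein_length": len(seq) // 3 - 1,
    }


if __name__ == "__main__":
    # Toy input, not the record's sequence: Met, Val, stop.
    print(check_cds("ATGGTT\nTAA"))
```

Applied to the nucleotide sequence above, the reported CDS length of 1446 would imply 1446 / 3 - 1 = 481 residues, which can be compared with the length of the protein sequence.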