DPGLEAN03049 in OGS1.0

New model in OGS2.0  DPOGS215847
Genomic Position  scaffold56:+ 194962-197481
CDS Length  2520
Paired RNAseq reads  487
Single RNAseq reads  1305
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA002947 (0.0)
Best Drosophila hit  CG9247 (1e-35)
Best Human hit  probable exonuclease mut-7 homolog (3e-38)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC002596 [Tribolium castaneum] (4e-86)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC002596 [Tribolium castaneum] (4e-84)
GeneOntology terms

GO:0006139 nucleobase, nucleoside, nucleotide and nucleic acid metabolic process
GO:0003676 nucleic acid binding
GO:0005622 intracellular
GO:0008408 3'-5' exonuclease activity
GO:0016787 hydrolase activity
InterPro families

IPR002562 3'-5' exonuclease
IPR002782 Protein of unknown function DUF82
IPR012337 Ribonuclease H-like
Orthology group  MCL15912
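
The Genomic Position and CDS Length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_locus` and `span_length` are hypothetical helper names, not part of any database tool.

```python
import re

def parse_locus(text):
    """Split a 'scaffold:strand start-end' string into its four parts."""
    m = re.fullmatch(r"(\S+):([+-])\s*(\d+)-(\d+)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised locus string: {text!r}")
    return m.group(1), m.group(2), int(m.group(3)), int(m.group(4))

def span_length(start, end):
    """Length of a closed interval (assumption: 1-based, inclusive)."""
    return end - start + 1

scaffold, strand, start, end = parse_locus("scaffold56:+ 194962-197481")
print(scaffold, strand, span_length(start, end))  # scaffold56 + 2520
```

Under that assumption the genomic span equals the listed CDS length of 2520, which would be consistent with a gene model whose coding sequence is not interrupted by introns.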

Nucleotide sequence:

ATGGACTTAAATAAGTTAGTCCATCAAAATCAAACAATAAAAATCATACCATCGTTAGAG
GACTCTCTACGAGGACTTGGCCTGAATAGTGATCTAGATGAAGAAACTGAGATTTGGTTT
AATCAATTGAAAATCAAATGGAAAACATGGAAGAAAAGCCCTACGATTGAGCGTCACTTT
GACTCATTTTTTCAGTTCTGTCAAGATCCCTTCAGAGTTGCCCTAGTGTGCATTGTCAAA
TGTGATGAACCAAAAGATCGTAAGCCTAAATCTCTCTCTTATTGCATACTTGAAATTATA
TTTAAATGGTCCCAGACAAATGGCAGATTACCTGAAGAGACTCTGAAACTTCCAGCGTAC
AATATAGCAACACAACAGAGAAATCAACATTTTCTTTATTTAGCAGTTAAAACTTATCAA
TTGTATACTATAAAAGAGACTGTACTTCCTCTTGTAAAAGATATGATAAGAAATGATAAT
TGCAAACAAGCATCACAAATTGTAATTGCAATGGAACTCTTTGATGAAATTCCTGTTGAA
GATTTACTGTTTCCATTGGTTTTGCAAGATACGCCAAACCTAATTGATGAATATTTGTCA
GAATCTCCAAATCAAATTCAACCATTTTTATTATTTTTAGATAGACTGTTAGACAAAAAC
TTTAGTATAAGAGACTATGCTCAAAAATTTATTGAAGAAAATAAAATTTATAATGTTAAA
TATGATAAAATTCATTATAAACCTCTGGGAAAATTAGTCGCTCGGCTTTGTAATAAATTT
AATGTTCCAATAGAGTCATGTAAAAACTTGAGTAAGAATCGTACCACAGGAGGGCTGAGG
TATTTAATTCATCAGAGATATGTTGAACACAACTTGAGCCCCTCAGTTTGGGATGATTTG
GTAAAAGATTCTTTGAAGCAAAGTACTGACTGTGCTAAAGAATTTGTTGATATGTTAGCA
ATGTATGACATAAATGAATCACTTAAGTGGTCTTCATATTTCGAAATATCAAATGATTGT
CTTCCTCATGCTCTTCAGAATTTAACAATAAAAGATAATCCTATAGAAGAAGAAAATTGG
GACTCAACTGACAATGCAGCTCAGAACTATTATAGACTTCCAATATCAGAAGAAAATATT
TTAATTATTGACACAGCAGAAAAGTTTGATGAATTAATCTCAAAGTTGTCAAATTGTCCT
ATCATCAGCTTTGATTGCGAATGGAAACCATCATTTGGTGCTGCTAAATCTCGAATGGCT
CTCATTCAAATTGGTACATTTGATCAAGTTTATCTTATTGACACTCTTATATTAAACAAC
AAGCAATACATGGGTAGTTGGTGCCGGTTTAATAAATATGTATTAGATAATGCGGAGATA
ATAAAATTGGGTTTTGGAGTTGAACAGGATCTGAATGAAATGAAGTCTTTAATTATTGGT
TTGAATAATATCAAGGTTAAAGGTGAAGGACTTTTAGATTTAGGTTTACTGTGGAAAAAT
CTTGTCAAATGTGGCTTGTCATTACCAAGTAACAGTGATAATGGAGGTAACAGTCTCAGC
TCTTTGGTCCAAACTTGCTTTGGATTGCCCTTGGAAAAATCTGAGCAATGTTCAAATTGG
GAGTTAAGGCCCTTAAGAAATACTCAGATTCACTATGCTGCTTTGGATGCTTTTGTTTTG
TTAGAGATATACAAATACCTTCAAAATCTTTGTGTAGAACAACATATTAATTTTGAGGAA
ATTTGTAATGATGTAATGTTGGATAGAAAACTGAAATGTCTAAAAAAGAATAAAGTAGTT
GATTGTCTGCAGACAACAAAAAATATAAAGGTGAGAACTCCTATGGACGTTAAAATTCTT
CTTGAACATGACAATGCACATTTACGATATTATCTAAGATACTGTGGTATTGACACTACT
ATTACAACTTCCCATATGTTATGGCACGATACTATTAAATTAGCCACATCTGAAAATCGT
TTAATATTGACATCTAAATTGAAGTTTTCACCATCTAGCAGATTTTCACAAAACTTTATC
TTAGATATAGGTAAAGGAAGCATCAAGGATCAATTATTAAAAATTCTTAAACATTTTAAT
GTGGGCCTTCAAAAGAATTATATTTTGACAAGATGTATAGAATGCAATTCTACAGATGTA
AAATATTACTCTATTAATGATCTCAAAGATATATGTAGAAAATATAATGGTGGTAGCCAC
AAGTCTTCCGATCAGATCAGAAGGAGTGCTAGTGACAATGAAGATGATAATGATTATTCT
GAAAACTTTCTCAGTGATTCAGAAGGGGAAGACATACATTTATACAAACCATTTCCAATA
CAGGACAAATGGTATACATCTAGCAGTGGAGCTAAAATTAATATGAATCAGATTGAAAAG
TTATGTGCTTCCAATAAAACTTCACATATTTGTGAAAATTGTGGAAAACTATATTGCGAT
GAAGAGCCGTTGCTTAAATCAATACACGAAGTAATCATGTCTATAACAAATTTTAATTAG
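
The relationship between the nucleotide listing above and the protein listing can be illustrated by translating a line of the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly; `translate` is a hypothetical helper written for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 60 nt of the nucleotide listing.
first_line = "ATGGACTTAAATAAGTTAGTCCATCAAAATCAAACAATAAAAATCATACCATCGTTAGAG"
print(translate(first_line))  # MDLNKLVHQNQTIKIIPSLE
```

The 20 residues produced from the first line match the start of the protein listing, and the final codon of the nucleotide listing (TAG) is a stop codon under the same table.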

Protein sequence:

MDLNKLVHQNQTIKIIPSLEDSLRGLGLNSDLDEETEIWFNQLKIKWKTWKKSPTIERHF
DSFFQFCQDPFRVALVCIVKCDEPKDRKPKSLSYCILEIIFKWSQTNGRLPEETLKLPAY
NIATQQRNQHFLYLAVKTYQLYTIKETVLPLVKDMIRNDNCKQASQIVIAMELFDEIPVE
DLLFPLVLQDTPNLIDEYLSESPNQIQPFLLFLDRLLDKNFSIRDYAQKFIEENKIYNVK
YDKIHYKPLGKLVARLCNKFNVPIESCKNLSKNRTTGGLRYLIHQRYVEHNLSPSVWDDL
VKDSLKQSTDCAKEFVDMLAMYDINESLKWSSYFEISNDCLPHALQNLTIKDNPIEEENW
DSTDNAAQNYYRLPISEENILIIDTAEKFDELISKLSNCPIISFDCEWKPSFGAAKSRMA
LIQIGTFDQVYLIDTLILNNKQYMGSWCRFNKYVLDNAEIIKLGFGVEQDLNEMKSLIIG
LNNIKVKGEGLLDLGLLWKNLVKCGLSLPSNSDNGGNSLSSLVQTCFGLPLEKSEQCSNW
ELRPLRNTQIHYAALDAFVLLEIYKYLQNLCVEQHINFEEICNDVMLDRKLKCLKKNKVV
DCLQTTKNIKVRTPMDVKILLEHDNAHLRYYLRYCGIDTTITTSHMLWHDTIKLATSENR
LILTSKLKFSPSSRFSQNFILDIGKGSIKDQLLKILKHFNVGLQKNYILTRCIECNSTDV
KYYSINDLKDICRKYNGGSHKSSDQIRRSASDNEDDNDYSENFLSDSEGEDIHLYKPFPI
QDKWYTSSSGAKINMNQIEKLCASNKTSHICENCGKLYCDEEPLLKSIHEVIMSITNFN
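
As a final consistency check, the protein length implied by the CDS Length field can be compared with the length of the protein listing. This is a small sketch with hypothetical helper names; it assumes the CDS length includes the terminal stop codon and that the listing is wrapped at 60 residues per line, with 13 full lines and a final line of 59 residues as it appears here.

```python
def expected_protein_length(cds_length, includes_stop=True):
    """Residues encoded by a CDS of the given length in nucleotides."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    return codons - 1 if includes_stop else codons

def wrapped_length(full_lines, width, last_line):
    """Total length of a sequence printed as fixed-width lines plus a remainder."""
    return full_lines * width + last_line

print(expected_protein_length(2520))  # 839
print(wrapped_length(13, 60, 59))     # 839
```

Both values agree at 839 residues under these assumptions.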