DPGLEAN12395 in OGS1.0

New model in OGS2.0DPOGS214793 
Genomic Positionscaffold907:- 3946-8815
See gene structure
CDS Length1965
Paired RNAseq reads  405
Single RNAseq reads  990
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012103 (0.0)
Best Drosophila hit  tosca (2e-127)
Best Human hitexonuclease 1 isoform a (4e-87)
Best NR hit (blastp)  exonuclease [Aedes aegypti] (2e-147)
Best NR hit (blastx)  exonuclease [Aedes aegypti] (4e-141)
GeneOntology terms


  
GO:0008852 exodeoxyribonuclease I activity
GO:0004518 nuclease activity
GO:0006281 DNA repair
GO:0003677 DNA binding
InterPro families




  
IPR006086 XPG/RAD2 endonuclease
IPR006085 XPG N-terminal
IPR006084 DNA repair protein (XPGC)/yeast Rad
IPR020045 5'-3' exonuclease, C-terminal subdomain
IPR019974 XPG conserved site
IPR008918 Helix-hairpin-helix motif, class 2
Orthology groupMCL15563

Nucleotide sequence:

ATGGGTATTACTGGTTTGATACCCTTTTTGGACAAAGCCTCTAGAAGGGCAAATGTAAGC
GAATTCAGTGGATCATCAGTTGCAATTGATACTTATTGCTGGCTTCACAAGGGGGCTTTT
GCTTGTGCTGATAAACTAGTTCGGGGAGAAGAAACCGATATACATATAAAATATTGCTTG
AAATATGTAACGATGTTGCAATCAAGGAACATAAAACCAATTTTAGTTTTTGACGGGAGA
CATCTTCCTGCTAAGGCAATGACTGAGCTGAAAAGAAGAGAAACACGAGATATATCTAAG
AAAAGAGCAGCGGAATTACTTAGCTTAGGAAAGATTGACGAAGCGAGGTCCTTCATGCGT
CGCAGTGTAGACATAACTCATGCAATGGCCCTAGCTTTGATAAAAGAATGTAGGAAACGT
AACATAGACTGTATAGTAGCGCCATACGAAGCTGATGCACAACTAGCATACTTAAATATC
AAGAACTACGCTCAGTTAGTTATAACAGAAGATTCCGATTTAATACTTTTTGGTTGCACA
AAGGTCCTATTCAAAATGGATTTAGAGGGTAATGGAACTTTAGTAGAAACTATCAAGCTG
CCTCTAGTTATGAAATGTCCCATAGAACATTACACATTTGATAAATTCAGGAGAATGTGT
ATATTGTCGGGGTGTGATTATTTAAACTCGCTACCCGGCATCGGTTTGGCCAAAGCGCGT
CAGTTTGTCAATGCTTCACAGGACACAAATTTCGCTAACGCCCTAAAAAAGCTGCCAAGT
TTTTTCAACAGATCATTGCAAGTGAGTGATGATTATAGAGAGAATTTTCTCAAAGCCGAA
GCAACATTCAAACATCAGTACGTCTATGACCCTTCACAGAGATGTATGACCCGACTCACA
CCTGTTTATGATGAAGAAATCGAAGCGGCTTTGTGTTCTAATGCCGGAGAGCTTTTGGAT
CCTCAGATAGCCTTCCAGTTAGCTTTGGGTAACTTAGACCCTTTCACATTGAAGAAGATG
GATAATTGGGATCCCGATAGTAGAAGTGATGTGACAGATCATATAAGGAGTTCAAATTGG
AAGGATGCGGGGGTATCAAATAAGCCAAGTATATGGAGTGAGTCTTACAAGGAATATTTA
GATGAATCTCAACCTTGGATGAAAAAGGTTCAAAAACAGGAACCCATTATATCAACACAA
ACACGATCAAGGAAGAAGGTTGTCACCTTAACAACTAAATATGTACCGGAGACACAGGAT
GATAGTTTATCGATAGAGACACTCAGCGGCATGTACTGTATGGAACCAGCCAGCAAGAAA
CAGAAAGTTGAACAAAAGAAAAATAACATCAATATAGACTATGATAGAAATAATTTTAAT
CTCAAACAGAAATCACCCATACTGGAAAACAAAGGCAGATCGTTCAAGAAGTGTCTCAGT
TCCGGTAGTTTTTCAGTTTTGAAGAAATTGAGCGCTTTCCCAAGGACAGTTCTGGATGAT
GATATCATTGAAAGCAAATTCTTCAGTTCGTGCGAAAAGGACTCCAATGATACGTGTAAC
AGAGTTGATAATCAGACGATAATACAGGAATCACCGGAAAAAGATTTAGATACTGCTATG
ATAGACACATGCACAGGTTCCAGCTCACAGAAAGAGAATTCTCCCAGTCCCGCAAAGAAA
AGTCCTATATTAGTGAGCCCTAGAACAAGAAATCCGTTCAAGTTAAAAGACTCACAGTCA
ACAAACGACACGGGTTTCAGTGAGTCCGTCATAGAAAATACATGTCCCATTGAATCTGAG
CCACCCATATGTACCTCGCCCATCAAACCCGCTGTTCCTCAGGATAAGTTTAAGTTCAAT
CAAAATATAAAGAAGACAAAAGCCCCTGCTAAAAAGGTTCAAAGTTCTCAACCAACTCTC
TTAAGTATGTTCGGTTTTCAGAAAAAGCCAGTATTAAAAAGGTGA

Protein sequence:

MGITGLIPFLDKASRRANVSEFSGSSVAIDTYCWLHKGAFACADKLVRGEETDIHIKYCL
KYVTMLQSRNIKPILVFDGRHLPAKAMTELKRRETRDISKKRAAELLSLGKIDEARSFMR
RSVDITHAMALALIKECRKRNIDCIVAPYEADAQLAYLNIKNYAQLVITEDSDLILFGCT
KVLFKMDLEGNGTLVETIKLPLVMKCPIEHYTFDKFRRMCILSGCDYLNSLPGIGLAKAR
QFVNASQDTNFANALKKLPSFFNRSLQVSDDYRENFLKAEATFKHQYVYDPSQRCMTRLT
PVYDEEIEAALCSNAGELLDPQIAFQLALGNLDPFTLKKMDNWDPDSRSDVTDHIRSSNW
KDAGVSNKPSIWSESYKEYLDESQPWMKKVQKQEPIISTQTRSRKKVVTLTTKYVPETQD
DSLSIETLSGMYCMEPASKKQKVEQKKNNINIDYDRNNFNLKQKSPILENKGRSFKKCLS
SGSFSVLKKLSAFPRTVLDDDIIESKFFSSCEKDSNDTCNRVDNQTIIQESPEKDLDTAM
IDTCTGSSSQKENSPSPAKKSPILVSPRTRNPFKLKDSQSTNDTGFSESVIENTCPIESE
PPICTSPIKPAVPQDKFKFNQNIKKTKAPAKKVQSSQPTLLSMFGFQKKPVLKR