New model in OGS2.0 | DPOGS214793  |
---|---|
Genomic Position | scaffold907:- 3946-8815 |
See gene structure | |
CDS Length | 1965 |
Paired RNAseq reads   | 405 |
Single RNAseq reads   | 990 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012103 (0.0) |
Best Drosophila hit   | tosca (2e-127) |
Best Human hit | exonuclease 1 isoform a (4e-87) |
Best NR hit (blastp)   | exonuclease [Aedes aegypti] (2e-147) |
Best NR hit (blastx)   | exonuclease [Aedes aegypti] (4e-141) |
GeneOntology terms    | GO:0008852 exodeoxyribonuclease I activity GO:0004518 nuclease activity GO:0006281 DNA repair GO:0003677 DNA binding |
InterPro families    | IPR006086 XPG/RAD2 endonuclease IPR006085 XPG N-terminal IPR006084 DNA repair protein (XPGC)/yeast Rad IPR020045 5'-3' exonuclease, C-terminal subdomain IPR019974 XPG conserved site IPR008918 Helix-hairpin-helix motif, class 2 |
Orthology group | MCL15563 |
Nucleotide sequence:
ATGGGTATTACTGGTTTGATACCCTTTTTGGACAAAGCCTCTAGAAGGGCAAATGTAAGC
GAATTCAGTGGATCATCAGTTGCAATTGATACTTATTGCTGGCTTCACAAGGGGGCTTTT
GCTTGTGCTGATAAACTAGTTCGGGGAGAAGAAACCGATATACATATAAAATATTGCTTG
AAATATGTAACGATGTTGCAATCAAGGAACATAAAACCAATTTTAGTTTTTGACGGGAGA
CATCTTCCTGCTAAGGCAATGACTGAGCTGAAAAGAAGAGAAACACGAGATATATCTAAG
AAAAGAGCAGCGGAATTACTTAGCTTAGGAAAGATTGACGAAGCGAGGTCCTTCATGCGT
CGCAGTGTAGACATAACTCATGCAATGGCCCTAGCTTTGATAAAAGAATGTAGGAAACGT
AACATAGACTGTATAGTAGCGCCATACGAAGCTGATGCACAACTAGCATACTTAAATATC
AAGAACTACGCTCAGTTAGTTATAACAGAAGATTCCGATTTAATACTTTTTGGTTGCACA
AAGGTCCTATTCAAAATGGATTTAGAGGGTAATGGAACTTTAGTAGAAACTATCAAGCTG
CCTCTAGTTATGAAATGTCCCATAGAACATTACACATTTGATAAATTCAGGAGAATGTGT
ATATTGTCGGGGTGTGATTATTTAAACTCGCTACCCGGCATCGGTTTGGCCAAAGCGCGT
CAGTTTGTCAATGCTTCACAGGACACAAATTTCGCTAACGCCCTAAAAAAGCTGCCAAGT
TTTTTCAACAGATCATTGCAAGTGAGTGATGATTATAGAGAGAATTTTCTCAAAGCCGAA
GCAACATTCAAACATCAGTACGTCTATGACCCTTCACAGAGATGTATGACCCGACTCACA
CCTGTTTATGATGAAGAAATCGAAGCGGCTTTGTGTTCTAATGCCGGAGAGCTTTTGGAT
CCTCAGATAGCCTTCCAGTTAGCTTTGGGTAACTTAGACCCTTTCACATTGAAGAAGATG
GATAATTGGGATCCCGATAGTAGAAGTGATGTGACAGATCATATAAGGAGTTCAAATTGG
AAGGATGCGGGGGTATCAAATAAGCCAAGTATATGGAGTGAGTCTTACAAGGAATATTTA
GATGAATCTCAACCTTGGATGAAAAAGGTTCAAAAACAGGAACCCATTATATCAACACAA
ACACGATCAAGGAAGAAGGTTGTCACCTTAACAACTAAATATGTACCGGAGACACAGGAT
GATAGTTTATCGATAGAGACACTCAGCGGCATGTACTGTATGGAACCAGCCAGCAAGAAA
CAGAAAGTTGAACAAAAGAAAAATAACATCAATATAGACTATGATAGAAATAATTTTAAT
CTCAAACAGAAATCACCCATACTGGAAAACAAAGGCAGATCGTTCAAGAAGTGTCTCAGT
TCCGGTAGTTTTTCAGTTTTGAAGAAATTGAGCGCTTTCCCAAGGACAGTTCTGGATGAT
GATATCATTGAAAGCAAATTCTTCAGTTCGTGCGAAAAGGACTCCAATGATACGTGTAAC
AGAGTTGATAATCAGACGATAATACAGGAATCACCGGAAAAAGATTTAGATACTGCTATG
ATAGACACATGCACAGGTTCCAGCTCACAGAAAGAGAATTCTCCCAGTCCCGCAAAGAAA
AGTCCTATATTAGTGAGCCCTAGAACAAGAAATCCGTTCAAGTTAAAAGACTCACAGTCA
ACAAACGACACGGGTTTCAGTGAGTCCGTCATAGAAAATACATGTCCCATTGAATCTGAG
CCACCCATATGTACCTCGCCCATCAAACCCGCTGTTCCTCAGGATAAGTTTAAGTTCAAT
CAAAATATAAAGAAGACAAAAGCCCCTGCTAAAAAGGTTCAAAGTTCTCAACCAACTCTC
TTAAGTATGTTCGGTTTTCAGAAAAAGCCAGTATTAAAAAGGTGA
Protein sequence:
MGITGLIPFLDKASRRANVSEFSGSSVAIDTYCWLHKGAFACADKLVRGEETDIHIKYCL
KYVTMLQSRNIKPILVFDGRHLPAKAMTELKRRETRDISKKRAAELLSLGKIDEARSFMR
RSVDITHAMALALIKECRKRNIDCIVAPYEADAQLAYLNIKNYAQLVITEDSDLILFGCT
KVLFKMDLEGNGTLVETIKLPLVMKCPIEHYTFDKFRRMCILSGCDYLNSLPGIGLAKAR
QFVNASQDTNFANALKKLPSFFNRSLQVSDDYRENFLKAEATFKHQYVYDPSQRCMTRLT
PVYDEEIEAALCSNAGELLDPQIAFQLALGNLDPFTLKKMDNWDPDSRSDVTDHIRSSNW
KDAGVSNKPSIWSESYKEYLDESQPWMKKVQKQEPIISTQTRSRKKVVTLTTKYVPETQD
DSLSIETLSGMYCMEPASKKQKVEQKKNNINIDYDRNNFNLKQKSPILENKGRSFKKCLS
SGSFSVLKKLSAFPRTVLDDDIIESKFFSSCEKDSNDTCNRVDNQTIIQESPEKDLDTAM
IDTCTGSSSQKENSPSPAKKSPILVSPRTRNPFKLKDSQSTNDTGFSESVIENTCPIESE
PPICTSPIKPAVPQDKFKFNQNIKKTKAPAKKVQSSQPTLLSMFGFQKKPVLKR