DPGLEAN15708 in OGS1.0

Genomic Position  scaffold1535:+ 23412-24845
CDS Length  1434
Paired RNAseq reads  3
Single RNAseq reads  20
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012141 (8e-48)
Best Drosophila hit  ND
Best Human hit  general transcription factor II-I repeat domain-containing protein 2A (1e-74)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC012929 [Tribolium castaneum] (1e-122)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC012929 [Tribolium castaneum] (2e-119)
GeneOntology terms
GO:0005634 nucleus
GO:0003677 DNA binding
GO:0005622 intracellular
GO:0008270 zinc ion binding
GO:0045449 regulation of transcription
InterPro families  IPR012337 Ribonuclease H-like
Orthology group  MCL16751

Nucleotide sequence:

ATGGCTTCAAAAAAAAGGAAGTTAGCAGAAGAAAATAGGGTATTTAACGATGCATGGACA
GATTTATACTTTTTTATTAATTGCAATGGCAAACCGCTATGCTTAATATGCCAAAAAACC
CTTACTATTCAAAAAGAATATAATGTCAAACGCCATTATGATAGCGAGCATAAAGCTAAA
TTTGCATGCCTAGTAGGTGAATCTCGAAAAAATAAAATTAACGCACTGAAATCTTCAGTT
AAGAACCAACAAAATGTTTTTAAGGTTCAAGTACAAAGTAACGAGTCGAGCATACGTGCA
AGTTTAAGGGTAGCGGAAATTCTTGCGAAGTCTGGCCGACCATTTACCGACAGCGAACTT
ATAAAACAATGTGCTTTAGTCATGGCTGAGGAGGTGTGCCCCGACCTAAAAAAAAAATTT
GAAAATATAAGTCTTTCTGCAAGAACATGTACTCGCCGAACTGAAGATTTGGGCGATAAT
TTATGCCAGCAACTCCGGGAAAAAGCACAACAATTTGAATGGTTCTCTTTAGCTACCGAT
GAAAGTAATGATGTTACTGACACTGCCCAGTTTTTAATATTTGTTCGCGGAATCGATAAG
AATTTTAATGTCTATGAGGAGCTTTTACAGTTATGTAGTTTGAAGGGCACGACTACCGGA
GAAGATTTGTTTTGTAACTTAGAACAAGCGCTAATATCAATGCAATTACCATGGGAAAAA
TTGGTAAGTGTTACAACTGACGGAGGCAGAAACATGAGTGGACAAAATAAAGGTTTGGTA
GGTAGAATTAAATCAAAACTGGCAGAGATAGGATGTGCCATCCCATTATTTTTCCATTGC
ATCATCCACCAAGAAGCGTTATGTTCCAAGGTCGTATCATGGAAAGAAGTAATGGACATC
GTGGTATCGACTGTAAACTACATCAGAAAAAATGGATTGACTCATCGTCAATTTCAACAG
TTTCTGTCAGATACGGAAGCAGATCATCGAGATGTTGTTTATTATTCTGAAGTACGATGG
TTAAGTAGAGGCGCAGTGCTGAAAAGATTTTTTGATCTCAGAAAAGAAATCAATACTTTT
ATGAATGAAAAAGGCAAATCTATTCCAGAGCTGACAGATACTCAATGGTTGATAGACGTT
GGATTTTTGACAGATGTCACTCATGAATTAAATACGCTAAATTTGAGGCTTCAAGGTAAA
AAAAAAATAATTTCTGATATGCACACAGACGTAAAAGCATTTCAAATGAAAATTAAACTC
TTTATAAAGCATATTGATGAAAAAAAGTTGGATCATTTTCCGAATTGTAAAGAGGCTGTT
GAGGAAGCTAGAATTAATTTTCATTGGAACAGTGATAACATGAAGGACATTTTAATTGAA
CTGCAAACTCAATTCCAGCAATCTGAAGGTGTTTGCGAATCCTTTTTCATGTGA

Protein sequence:

MASKKRKLAEENRVFNDAWTDLYFFINCNGKPLCLICQKTLTIQKEYNVKRHYDSEHKAK
FACLVGESRKNKINALKSSVKNQQNVFKVQVQSNESSIRASLRVAEILAKSGRPFTDSEL
IKQCALVMAEEVCPDLKKKFENISLSARTCTRRTEDLGDNLCQQLREKAQQFEWFSLATD
ESNDVTDTAQFLIFVRGIDKNFNVYEELLQLCSLKGTTTGEDLFCNLEQALISMQLPWEK
LVSVTTDGGRNMSGQNKGLVGRIKSKLAEIGCAIPLFFHCIIHQEALCSKVVSWKEVMDI
VVSTVNYIRKNGLTHRQFQQFLSDTEADHRDVVYYSEVRWLSRGAVLKRFFDLRKEINTF
MNEKGKSIPELTDTQWLIDVGFLTDVTHELNTLNLRLQGKKKIISDMHTDVKAFQMKIKL
FIKHIDEKKLDHFPNCKEAVEEARINFHWNSDNMKDILIELQTQFQQSEGVCESFFM