New model in OGS2.0 | DPOGS210058  |
---|---|
Genomic Position | scaffold162:- 95501-99603 |
See gene structure | |
CDS Length | 1347 |
Paired RNAseq reads   | 111 |
Single RNAseq reads   | 289 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012698 (0.0) |
Best Drosophila hit   | CG9973 (4e-36) |
Best Human hit | methylcytosine dioxygenase TET1 (3e-12) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC013796 [Tribolium castaneum] (4e-84) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC013796 [Tribolium castaneum] (1e-84) |
GeneOntology terms    | GO:0008270 zinc ion binding GO:0003677 DNA binding |
InterPro families   | IPR002857 Zinc finger, CXXC-type |
Orthology group | MCL18019 |
Nucleotide sequence:
ATGAGCGATACGCTCAGGAGCGAGGCGGGCGCGGATAGCGCGCACTTACCGCCCTTCTCG
ACCTTCGGTGAAATGGCGGAGAACGAGCAGAGGATCCTGACGGCCGACGCCAGGCTGCTG
GATCCCGCGTGGGAGTACTACGAGAGGACCGGGGACACAGTCTCCGTCATCGCCAGCCAG
CCGCAGTACCGCCCCTGGGAGTCGATGCCTATCACGGTCAATTCCAAGGATGCAATACTA
CGGGCGGGTTTCTCATCTCCTCTCGAATACCAGTCGATTACGTTACAACCGATACCAAAC
AAGCTACCGTCCTTCCAAAGCCAATTTCAGACGTTCCCAGAGACCACAGTCATACCGGAG
ACTGGTCTGCCGAGTGTGACGCCAGTTCCTGTCACCACGAGCCCCACACCCAGCGCGAGT
CCTAGTCAGTTAACCCAGCTCACGCAACTAACGACACCTTCATCGCCAGCACATTTAACT
ACCTTGGCCCAGGTGGCGCCCTTGTCCAGTACTTTGACTACGCTTTCACCGGTCAATGCA
ACGACATTCCACACTCTCACCGCTGTGAACGCTCGGAGTTACCCAATCGTTCCGGCGCCC
TTACAAGCCAGAGAGCTAGCACCGACAGGCCAAGCGTACATTGACGACCGACACATACAG
CTTTACCAACCGAATATTGCAACCATTAACGCATTTCCGACACAAAATGGTATACTGCAT
CAGAACGGGGCGCTTCTACACCAGAACGGTAGCTTAATACAGAACATTCAAAGTCCAACA
GTCGTTCATGTATTGAAAAACGAGCCGTTCGATATGAAATCATTACAAGACAAGTACACG
CCGAACGGGCTGCATCACAGTAATTTTCAAAATCCAATGTTAATTGATAATAGCTACGAG
AAGAAAGTGAACGGTTTCGGGAGTGGTTCATCGCCGACCAGGTCGGACTTCAGGAAAAAG
GAGAGACGGAAAATGAGAGCGAATAGCTCGGAATCAGACTGTTCTAATATGGAGATGGGT
TCAGAGAGTAGTGGACAGGTGGCAGCGGTGTCATCCACAGCAGGGTTCAAGTCCCCGATG
CACGGCGCGCCGCCAATGAACACGGGACCCATGGAACTCGACGACATATCCAGCGAAAAA
CAGACTAAAAAGAAAAGAAAGAGATGTGGCGAGTGTATAGGCTGCCAACGAAAAGATAAC
TGTGGTGACTGCGCTCCGTGCAGGAACGACAAGTCACATCAGATATGCAAGCAAAGAAGA
TGCGAGAAGCTGACGGAGAAGAAGAATTATTTCACCCAACGACATATAATCGACTTTGAC
AGCTCAGTGACGCTGTCAAAGTACTGA
Protein sequence:
MSDTLRSEAGADSAHLPPFSTFGEMAENEQRILTADARLLDPAWEYYERTGDTVSVIASQ
PQYRPWESMPITVNSKDAILRAGFSSPLEYQSITLQPIPNKLPSFQSQFQTFPETTVIPE
TGLPSVTPVPVTTSPTPSASPSQLTQLTQLTTPSSPAHLTTLAQVAPLSSTLTTLSPVNA
TTFHTLTAVNARSYPIVPAPLQARELAPTGQAYIDDRHIQLYQPNIATINAFPTQNGILH
QNGALLHQNGSLIQNIQSPTVVHVLKNEPFDMKSLQDKYTPNGLHHSNFQNPMLIDNSYE
KKVNGFGSGSSPTRSDFRKKERRKMRANSSESDCSNMEMGSESSGQVAAVSSTAGFKSPM
HGAPPMNTGPMELDDISSEKQTKKKRKRCGECIGCQRKDNCGDCAPCRNDKSHQICKQRR
CEKLTEKKNYFTQRHIIDFDSSVTLSKY