New model in OGS2.0 | DPOGS201360  |
---|---|
Genomic Position | scaffold13:394594-430550 (- strand) |
CDS Length | 1692 |
Paired RNAseq reads   | 11 |
Single RNAseq reads   | 39 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA000604 (4e-67) |
Best Drosophila hit   | scratch (3e-61) |
Best Human hit | transcriptional repressor scratch 1 (6e-53) |
Best NR hit (blastp)   | hypothetical protein AaeL_AAEL009230 [Aedes aegypti] (1e-66) |
Best NR hit (blastx)   | AGAP006791-PA [Anopheles gambiae str. PEST] (2e-63) |
GeneOntology terms    | GO:0003676 nucleic acid binding; GO:0005622 intracellular; GO:0008270 zinc ion binding |
InterPro families    | IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding; IPR007087 Zinc finger, C2H2-type; IPR015880 Zinc finger, C2H2-like |
Orthology group | ND |
Nucleotide sequence:
ATGCCGCGCGCGTGCGTCCCTCGTGCTTTATCCGCGTGTCGCCGCCGGACCTCGCCGCCG
GACCGCCGCCGTCGACGCACGCCCCCGACCCGTAGGAGGACGGACTTGTTGCGAAGCCCA
CCTGTTAACACATGGAGGCGCCAACCAGAATCATCAGCAGTCAACTATGAGGAGATAGGA
AGCACACCCATTACTGAACGAACTGCTGAAGAGACAGAGGCGGCTCATGAATTATTGTCT
CTCGCGCACAGCTTGCCACCTCTGCCGCCAGTCCCACCCCTAACACCAGCAACAACCGTG
CCGGCGTTGCCGCAGTTCCCATCGCTGCCATCTCTGCCACAGGTGACGCAGCTGCCTCCG
CTGCCGTCTCTGTCGTCACTTCCATCAATATCACCACTGTCATCGCTGTCAGGACTGCCT
TCTGTCCCTCTCGTCTCCGTGCTACCACCGAATGAACCTGTCGTCCCAATTTACACCTAT
ACCATACATCCTACTAATATATATATAATAGCTGAAGAGTCACGTGACCCCACGTATAAT
AACTCTGTGCCAACAATAACACCGATTCCTTGTGGCATCGAATATACAATTGAACCTCAA
TTAAGCTATCTTGCTTATCAACATGTACCGGAGGCGGTCATGCCAGTGGGTCATATATTG
CTCCCTGCAGCCGAAATAATACCTAATAATAATCGACCAGAACCAAGCAGAGTCACTGAT
GTAGTGGAGCCACCAAGCAGGCTCCCTTGTTTAATGGAGCCCAAACGTCCAAAAATCAAA
CCGATTAATGCTCCAAGAGGGAAAAATGCTAAATATGACTGCAAAGAATGTGGCAAGCGG
TACGCTACATCTTCAAACCTGTCACGCCATAAGCAAACACATCGCAGCCTTGATTCAGTT
GCAGCGAAACACTGCGAGGATTGTGGCAAAGTATATGTGTCAATGCCAGCACTTGCGATG
CACGTTCTGACTCATAGAATGGGCCACGTTTGCGGTATATGTGGTAAACAATTCTCAAGA
CCCTGGCTTTTGCGTGGTCACTTACGTTCCCACACGGGAGAGAAGCCTTATGACTGTCCA
TACGAAGGATGCCCTAAGGCGTTTGCGGATCGATCAAATTTACGTGCACATCTGCAGACT
CATACAGGTGACAAAAAATTTGAATGCTCAAAGTGCCACAAAACTTTTGCTCTAAAAAGT
TACTTGGCCAAGCATGAGGAAACGGTGTGCTTTCGAGATGAGATTGCTTGTGATCCGGAG
TTAATTCAACCTACAATACCTGAAACATCGAGTATACAGTCTGACAAGCCATCAGAGCAG
CTTAATACTCCCTCAGTGGGATTTGAACAGCCTGAGACGTCTCATAACCAGCTAAACTCT
CCCCGTATTCAGGCTCCGCTTGCAGATGAAGCTCGAATAGACTCAGATTGTATTCAGCTT
GAGACTAGAGAAATTAGTGATCAAATTATGTATACTGATTCTCAGACTGAAGGTATCCAG
GCGGAAACAGCAGTGAGACGGTATGAGCCTCTACGTCCCATCGGTATACGATCTGATTTG
GAACCTTTGTCACTGGAATTAGAAGATCCGCCGCAACCAGCAGTGATACGTTTTGATCCG
TCTTGCGTTTTGCCCGAGCCAGAAATTATACGATACGATACGATGAATCTGGTGCCTGTG
TTCGCGGAATAA
Protein sequence:
MPRACVPRALSACRRRTSPPDRRRRRTPPTRRRTDLLRSPPVNTWRRQPESSAVNYEEIG
STPITERTAEETEAAHELLSLAHSLPPLPPVPPLTPATTVPALPQFPSLPSLPQVTQLPP
LPSLSSLPSISPLSSLSGLPSVPLVSVLPPNEPVVPIYTYTIHPTNIYIIAEESRDPTYN
NSVPTITPIPCGIEYTIEPQLSYLAYQHVPEAVMPVGHILLPAAEIIPNNNRPEPSRVTD
VVEPPSRLPCLMEPKRPKIKPINAPRGKNAKYDCKECGKRYATSSNLSRHKQTHRSLDSV
AAKHCEDCGKVYVSMPALAMHVLTHRMGHVCGICGKQFSRPWLLRGHLRSHTGEKPYDCP
YEGCPKAFADRSNLRAHLQTHTGDKKFECSKCHKTFALKSYLAKHEETVCFRDEIACDPE
LIQPTIPETSSIQSDKPSEQLNTPSVGFEQPETSHNQLNSPRIQAPLADEARIDSDCIQL
ETREISDQIMYTDSQTEGIQAETAVRRYEPLRPIGIRSDLEPLSLELEDPPQPAVIRFDP
SCVLPEPEIIRYDTMNLVPVFAE
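
The CDS length and the two sequences above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the standard genetic code (NCBI translation table 1), and the `translate` helper and its variable names are hypothetical. Only the first and last codons of the CDS are used as inputs here; the full sequence can be pasted in the same way.

```python
import itertools

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
# Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = ["".join(c) for c in itertools.product(BASES, repeat=3)]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First nine codons and last four codons of the nucleotide sequence above.
cds_start = "ATGCCGCGCGCGTGCGTCCCTCGTGCT"
cds_end = "TTCGCGGAATAA"

print(translate(cds_start))  # MPRACVPRA, the start of the protein sequence
print(translate(cds_end))    # FAE, the end of the protein sequence

# 563 residues plus one stop codon account for the reported CDS length.
print(3 * (563 + 1))  # 1692
```

The protein sequence has 563 residues, so 563 codons plus the terminal TAA stop codon give the 1692 nt reported in the CDS Length field.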