New model in OGS2.0 | DPOGS210231  |
---|---|
Genomic Position | scaffold42:- 32824-40012 |
See gene structure | |
CDS Length | 2673 |
Paired RNAseq reads   | 1536 |
Single RNAseq reads   | 3989 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002551 (0.0) |
Best Drosophila hit   | Zn finger homeodomain 1, isoform B (7e-93) |
Best Human hit | zinc finger E-box-binding homeobox 2 isoform 1 (3e-35) |
Best NR hit (blastp)   | zinc finger protein, putative [Pediculus humanus corporis] (7e-146) |
Best NR hit (blastx)   | zinc finger protein, putative [Pediculus humanus corporis] (7e-130) |
GeneOntology terms    | GO:0007399 nervous system development GO:0007498 mesoderm development GO:0003677 DNA binding GO:0008354 germ cell migration GO:0008406 gonad development GO:0007280 pole cell migration GO:0003702 RNA polymerase II transcription factor activity GO:0005634 nucleus GO:0007507 heart development GO:0007417 central nervous system development GO:0007517 muscle organ development GO:0006355 regulation of transcription, DNA-dependent GO:0008270 zinc ion binding GO:0003700 sequence-specific DNA binding transcription factor activity GO:0043565 sequence-specific DNA binding GO:0008045 motor axon guidance GO:0019730 antimicrobial humoral response GO:0007514 garland cell differentiation GO:0048542 lymph gland development |
InterPro families    | IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding IPR012287 Homeodomain-related IPR001356 Homeobox IPR007087 Zinc finger, C2H2-type IPR009057 Homeodomain-like IPR015880 Zinc finger, C2H2-like IPR017970 Homeobox, conserved site |
Orthology group | MCL16518 |
Nucleotide sequence:
ATGACATGTTCGGCTGCTACGGTGACAGCTACTCGTATCGCCTGTGGAGCCTGGCCAACG
TGTGGGGCGCCTGTCGATGTGAGTAGGAGGCCGTGTCGTGGCCGCGGCCGAGTGTCTCTC
AGTGAGGTGATGATAGTGCACAGCTCTCTTATGGGAGTTGTCGGTTGGTCTGAACGTTCG
TCAGCGCCCTGGCTGGCCGACATGAGGAGTAAGCAGGTCCCCACGTCGACGACCTCCGCC
AGGCGGCGAGCTGCCTTCATCCGGCCTCTAGCTGGTGGAGGTTCCGAGATCAAGACGTGT
GAGGAGGAAGAGGATCGTGGGTCTTCGCCTCTGGGTCCCGGCGGGCCTTTCCCCTGCAGC
CACTGTCGCGGAGCCTACCCGACCCGCGAACAGCTCGAGCGACATGAGACGCTGCACGCG
CCGAGCACTCAGACGTGCAAAATATGCCACAAGAGTTTTCAAAATGTCTACAGACTTCAG
CGTCACATGATAAGTCATGATGACAGCGCGAAGTTACGCAAATATAAATGCAATGACTGC
GATAAAGCATTTAAATTTAAACATCATCTCAAAGAGCATCTCCGGATACACAGTGGAGAA
AAACCGTTCGAATGCGCTAACTGTGGAAAAAAGTTTTCACATTCCGGCTCTTATTCGTCT
CATATGACTTCCAAGAAATGTCTTGTTATGAATCTCAAAATGGGAAGAATAAAGCCAAAT
AATCCGGCATTGAATCCAGACCGGAGTCCATCTCGGAAACGTGCAAACGCCATGGCAGCT
AGTCAGTTGAATAATAATATCGCGCCAAACGGCAATTCGTTTTTGCCAATATTGCCAAAG
TATAACGAAGCTGCAGCATTTTTCGCATCTATGTCATCTCAAGAAAATAATTTCCTGAGG
CCCCCGTTGGGTCAACCTGGATTAAATCCTTTCTATATGCCTCCTGGTATGCCAATGAGC
CCAGCCAACGGTATTGCACCTTACACCTTCCCTACTTCATTAAGCCAATTATTTGAGCAA
CTGGCCTCTCAACATTATCAACAACGAAAAATAGAGATTCCTAGTCCAAAGCTTGTGAGT
CCACCCGCAAACCCTGAAGACTTAATCGAGGAAGTAGTGGATGAGGAAGATAAGCGCTCC
GAGGCCAGCGCAGAACTAGTGATGGATATAGACGACGATGACAACGTTACCGTAAAGAAA
GAACAAGAAGACCGGGAAACTGAGGCGAGTTCTCCTTCCCGTAATTACGAATCTATCTTA
AGTAGCAATGAACGCGGCGAGTCCGACATCAATCATTCGGATTTTAATACTGTTAAAGCA
TCCGATACTAAATATTATTTTAAGACGCACAATGATCAGCAGTCTCCTATATCTGGCCAG
GAATACCCCGGTGAAGCTTTACCATTGACACAGATAAATGTTAAAGAGGAACCTGATATT
GATACCCTACGTTGCTTAAAATGTAACGTATTGTTCAATGACAAAAATGATTTATTGGAA
CATGACAAAGCAGCGTGTGGTAATATTTTTAGAAAACATGAAGGACTAGCTGCTCAAGTG
GCTGAGACGGTGGCTCTGAATAGATTAGAAGCTGAAATGCGCGCATCTATACAAAGTGGG
GTAAGTGCGAGTGAAGATGAGGATTTCGGGAGAGAGGATCGGGAAGACAAAGCCTCTATA
AATGAAAATGACAGAAAAATCAGAGTTCGCACCGCACTTACCGAAGAACAACAGATGGTG
TTAAAAAGGCACTATTCGATCAACCCTCGACCGAATCGAGAGGAATTTAAGAAGATCGCA
CAGCAGATAGGCTTAGATAACCGAGTAGTACAAGTTTGGTTCCAAAATAATAGAGCCAGA
GTACGGAGGATGACTCAGGCGGTCGCGATATCTGATCAACCTCTAGATTTATCTACAAAA
AAATCGAATACCTCCGTTACTTCAAGCCCGTCACCTTCACCGACTTGCAGCATTTCAGTA
ACACATTCCGATTCCGAGGAAGCGGTTAATTTAAGTCAAAAATCTTCTCGCAGCACGACC
CCACATCGCGCTAACTACATAAATACGTATCCACATTCCAACTGCTCGTCCTCATCGTTC
ACGGATTTTCGGTTATCACCCTCACCAGGTGAAACTATGAACGGTTACAAAAGAATGTTG
CAACAGAAAATGCCCATCAATCCTATGATGCCGATGGACAAACTTCTTCATTACAACGAC
CTGAGTAACGGACGATCTCCAATTCTTAACATGCAAGTGCCCGAGAGGCAAGAATCCAGT
CCGTCTTACGACCGCCCAATATGGAACGAGGATCTCCAGACTCAAATCGAATTAGAAGAT
GAAACCACTGTACTTAAAAAGAGCAAAATAAAGGCTGGAAATGAGTTGAAAGAGGGAGAA
GGACAATTCGTTTGTGATCAATGTGATAAAACTTTTGTCAAGCAAAGTTCTCTCGCGAGG
CATAAATACGAGCACTCAGGCCAGCGACCTTACAAATGCTTGGAATGTCCTAAGGCTTTC
AAGCATAAGCACCACCTGACTGAACACAAGCGGTTGCACACCGGCGAGAAGCCGTTCCAG
TGCTGCAAGTGTCTCAAGAAGTTCTCTCACTCCGGCTCCTACAGCCAGCACATGAACCAC
AGGTTCGCGATCTGCAAGCCATACAGAGACTAG
Protein sequence:
MTCSAATVTATRIACGAWPTCGAPVDVSRRPCRGRGRVSLSEVMIVHSSLMGVVGWSERS
SAPWLADMRSKQVPTSTTSARRRAAFIRPLAGGGSEIKTCEEEEDRGSSPLGPGGPFPCS
HCRGAYPTREQLERHETLHAPSTQTCKICHKSFQNVYRLQRHMISHDDSAKLRKYKCNDC
DKAFKFKHHLKEHLRIHSGEKPFECANCGKKFSHSGSYSSHMTSKKCLVMNLKMGRIKPN
NPALNPDRSPSRKRANAMAASQLNNNIAPNGNSFLPILPKYNEAAAFFASMSSQENNFLR
PPLGQPGLNPFYMPPGMPMSPANGIAPYTFPTSLSQLFEQLASQHYQQRKIEIPSPKLVS
PPANPEDLIEEVVDEEDKRSEASAELVMDIDDDDNVTVKKEQEDRETEASSPSRNYESIL
SSNERGESDINHSDFNTVKASDTKYYFKTHNDQQSPISGQEYPGEALPLTQINVKEEPDI
DTLRCLKCNVLFNDKNDLLEHDKAACGNIFRKHEGLAAQVAETVALNRLEAEMRASIQSG
VSASEDEDFGREDREDKASINENDRKIRVRTALTEEQQMVLKRHYSINPRPNREEFKKIA
QQIGLDNRVVQVWFQNNRARVRRMTQAVAISDQPLDLSTKKSNTSVTSSPSPSPTCSISV
THSDSEEAVNLSQKSSRSTTPHRANYINTYPHSNCSSSSFTDFRLSPSPGETMNGYKRML
QQKMPINPMMPMDKLLHYNDLSNGRSPILNMQVPERQESSPSYDRPIWNEDLQTQIELED
ETTVLKKSKIKAGNELKEGEGQFVCDQCDKTFVKQSSLARHKYEHSGQRPYKCLECPKAF
KHKHHLTEHKRLHTGEKPFQCCKCLKKFSHSGSYSQHMNHRFAICKPYRD