New model in OGS2.0 | DPOGS205244  |
---|---|
Genomic Position | scaffold1389:+ 7905-11933 |
See gene structure | |
CDS Length | 3276 |
Paired RNAseq reads   | 179 |
Single RNAseq reads   | 531 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008745 (3e-68) |
Best Drosophila hit   | crooked legs, isoform A (2e-35) |
Best Human hit | zinc finger protein 208 (5e-68) |
Best NR hit (blastp)   | PREDICTED: similar to mCG7830 [Acyrthosiphon pisum] (2e-72) |
Best NR hit (blastx)   | hypothetical protein BRAFLDRAFT_71028 [Branchiostoma floridae] (6e-90) |
GeneOntology terms    | GO:0008270 zinc ion binding GO:0006355 regulation of transcription, DNA-dependent GO:0005622 intracellular GO:0046872 metal ion binding GO:0003677 DNA binding GO:0005634 nucleus |
InterPro families    | IPR007087 Zinc finger, C2H2-type IPR015880 Zinc finger, C2H2-like IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding |
Orthology group | MCL20861 |
Nucleotide sequence:
ATGCCTCTTAGAAAGTTACCGTTTCCACATACGGATTTGCAAAAACATCGCTATAATATT
AGAGAAATATTGCACAGCTCTAATGCGACGCCCATACGCGGCCACAAAGACCGCGGCTAC
ACTTGTAATTTTTGCGAAAATGAATTCGAGAATCCCGCCGATTTAAAACAGCACAATTTA
GATTGGCACACTTATAACTTAAGCACCGACGACATTGAAGTTTTGGTCCCATCCAACCTA
CCCCAGGTCTGTGTAAAATTGGACATCACAGGTCTCACTTGCAATTTATGCCAAGAAAGC
GTAGACTCTGTCGAAGAGCTATGCGACCACCTGCAACAAGCTCATGATAGGACAATTTTT
GACGACATAATACATCAAATACTACCATTTAAATTCAATGATCGAATCCTCCAATGCCAC
ATATGTAATAATATATACAATAAATTCAAAAAACTCATGGAACATATGAACAATCATTAC
AGTAACTATGTGTGTGACGTGTGCGAAGCTGGTTTCGTCAATAAGAGAAAACTTCATTAT
CACAAGATCGGACACAACGACGGTGATTTCAAGTGTGACGTGTGCGTAAAACGCTTCCGC
ACTTTCGGCAGCAAAAGATATCACGAACGGCGATACCATGACAAGAACAGGCTGTTTAAC
AAATGCGGATACTGCAATGCGCTGTTCAAGGGCATGAGGGAAAAGGATCTTCATCTGGCT
AAAGAGCACGGTGTGGCGTCATTTGCGAGAAAATGTCAAGCCTGTGACAAAATTTTCTCT
CATCAGAACGCTTTGCACACGCACATGAGACGATATCACTTGGTGGAAAGGAAGCACAAG
TGCCCAGAATGTGACAAAACATTCTTCTCTACATCCGACGTCAAGATGCACATGGTGAAA
CATACGGGCGAGCGTGAATACAAGTGTGAAATATGCCTTAAATCTTATGCGAGGAAGTGG
ACTCTGAGCGAGCATATGCGCATTCACTCGGACGATAGGCGGTTCAAGTGCGAGCATTGT
GGACAATCGTTCATACAGAAATGTATACTACGTAAACTCAAAAAGGAAAAGTTAAATAAG
GACGACGACTTCGTTCAGAGAACAGATCTGCCGAGAGGTAGACAAGGAGAAATTTTAGAC
AAACATCGCACTAATATACGCGAAATTATCAAATGGTCGAACGCTACGCCGATACGATGC
CGAGGCGGGATCGGATATGCGTGCTGTTTCTGTTCAGACCAATTTCCAAATCCTGCTGAC
TTAAAACATCACACCATCAAAGCTCACGATGATATCACCAAGTCCAAGTTCATGAAGGGT
AGGGATATGTACGGATATTTCGTCAAACTAGATATAACCTCTCTACAATGCAACATTTGT
GGGCTGGAAAATGACACGTTAGAACAAATTATGTTGCATTTGAAAGACGAACATGGTAAA
AGCATCAGTACTGACATATCGAATCACATATTACCTTTCAAATTTGACAGCGAGATTTTG
CGCTGTTTCATGTGCCACAACGTGTTCAATAGGTTCAAGGCATTGCAAGAACACATGAAC
TTACACTATAGGAATTACGTCTGTGAAGTTTGCGACGCTGGTTTCGTCAATCGCCACCTT
CTTTTATGTCACAACGAGGGTCACAAAACGGGTACATTTGCGTGCGATCAATGCGGAAAA
ATCTTCGACACGCTGAGGAAGAGAAAGTTGCACGAAAGGAAAATTCACAACGGTCTGAAC
ATGCCGCATAAATGCGGTTATTGCAACGAGAGGTTTAAGGAGAACTGTTACAAAAATGAG
CATCTCGCCAAAGTTCACGGCATAATCGGCCCTTCTATCAAATGTCAAGCCTGCGAGAAG
ACGTTTTCGACGCAGCAGACCTGGCTCTTGCACATGAAGAAGTACCATTTGATGCAGAGA
CAGCACAAATGCACAAAATGCGAAATGGATTTCTTCTCAAAGAGGGAACTCACCGACCAC
ATGGTGAAACACACCGGAACTAGGGACTACAGATGTGAAATGTGTTTCAAATCGTATGGC
AGATTGAAGACTCTGAAAGAACACATAAGACGGCTCCATCCAGAAGGCAGAGACATCAAA
TGCGCTCATTGTGGTCAAGCTTTTGCGAAGAGATTTGCTTTGAAAAGTATGAAAACGATA
AAAATTGAAAGAAAAACGGACGTCCCACGCGTGCAGTTGAAGCTGGTTAGCCGGAAACAT
CGGAAGCTGTCCGACAACAAGAAAAACCAGGAGAATCTCACGCACATATTACTCAATTCA
AACGCGAGCCCCATAAGGAATAAGGACAGCCTCGGCTACGGTTGCGCCTTTTGTTCTGAA
CAGTTCGTTGATCCTAAAAATCTAAAGAGGCATTTTTTGAGCGAACATCAGAGCGACAGG
CTGATAAAATTAATGTCCAGCAAACTTTTTGAACATGTCGTCAAATTGGATATCACGTAT
TTGCATTGTTCGCTGTGCGATAAAAAGATACCTCACTTAGATGAACTAATGGCGCATCTG
AAAACGGAACACAGAAAGGACTTGCACACGGACATCAAGAGTTCAATAGTACCTTTCAGT
TTTGATACTCCGCAGTTGCAATGCGCTGTTTGCAAGATAGAGTATTCAAACTTCAAGCTA
TTACAAGAGCACATGAATTCGCATTTCGGTAATCACATCTGTGCTATGTGCGGCGGCGGC
TTCGTTACGGAACGTCTGCTGACAACGCACATGAAGAGGCACAAAACCGGCGAGTACAAA
TGCGAAGATTGCGACAAGGTTTTCGATAATGAAGAGAAAATGAAGGAGCACCAGAAGCGT
TCACACCTCGGTCATAATAAAAGGAACAAGTGTCTTATATGTGAGGAGAGATTTGTGGAT
TACTGGAAGAAAGTCGAGCACATGGTCGAAATACACGGCGCTCCGCCGGTCGTGCTGAAA
TGTCAGGCTTGTGACCGTACATTCAGAAATCAGAGAGCTCTGTCCAGACACACGAAAAAG
GACCACCTGATGGAGAGGAAAAACAAATGTCCGGAATGCGATATGAGATTCTTCAGCAAG
AGCAGTCTGCAGCGGCACATGGCGAAGCACACGGGGTTACGGCAGTTTTCGTGCGACGTT
TGTTTTAAATCCTACGGAAGAAAGAACACTCTGAGGGAACATATGAGGATTCACGCCGAC
GACAGGAGGTTTGCGTGCATCCATTGCGGACAGGCCTTCGTTCAGAAGTGCAGTTGGCGG
AGTCACATGAGGTCCAAGCATGGCGATGACGTTTAG
Protein sequence:
MPLRKLPFPHTDLQKHRYNIREILHSSNATPIRGHKDRGYTCNFCENEFENPADLKQHNL
DWHTYNLSTDDIEVLVPSNLPQVCVKLDITGLTCNLCQESVDSVEELCDHLQQAHDRTIF
DDIIHQILPFKFNDRILQCHICNNIYNKFKKLMEHMNNHYSNYVCDVCEAGFVNKRKLHY
HKIGHNDGDFKCDVCVKRFRTFGSKRYHERRYHDKNRLFNKCGYCNALFKGMREKDLHLA
KEHGVASFARKCQACDKIFSHQNALHTHMRRYHLVERKHKCPECDKTFFSTSDVKMHMVK
HTGEREYKCEICLKSYARKWTLSEHMRIHSDDRRFKCEHCGQSFIQKCILRKLKKEKLNK
DDDFVQRTDLPRGRQGEILDKHRTNIREIIKWSNATPIRCRGGIGYACCFCSDQFPNPAD
LKHHTIKAHDDITKSKFMKGRDMYGYFVKLDITSLQCNICGLENDTLEQIMLHLKDEHGK
SISTDISNHILPFKFDSEILRCFMCHNVFNRFKALQEHMNLHYRNYVCEVCDAGFVNRHL
LLCHNEGHKTGTFACDQCGKIFDTLRKRKLHERKIHNGLNMPHKCGYCNERFKENCYKNE
HLAKVHGIIGPSIKCQACEKTFSTQQTWLLHMKKYHLMQRQHKCTKCEMDFFSKRELTDH
MVKHTGTRDYRCEMCFKSYGRLKTLKEHIRRLHPEGRDIKCAHCGQAFAKRFALKSMKTI
KIERKTDVPRVQLKLVSRKHRKLSDNKKNQENLTHILLNSNASPIRNKDSLGYGCAFCSE
QFVDPKNLKRHFLSEHQSDRLIKLMSSKLFEHVVKLDITYLHCSLCDKKIPHLDELMAHL
KTEHRKDLHTDIKSSIVPFSFDTPQLQCAVCKIEYSNFKLLQEHMNSHFGNHICAMCGGG
FVTERLLTTHMKRHKTGEYKCEDCDKVFDNEEKMKEHQKRSHLGHNKRNKCLICEERFVD
YWKKVEHMVEIHGAPPVVLKCQACDRTFRNQRALSRHTKKDHLMERKNKCPECDMRFFSK
SSLQRHMAKHTGLRQFSCDVCFKSYGRKNTLREHMRIHADDRRFACIHCGQAFVQKCSWR
SHMRSKHGDDV