New model in OGS2.0 | DPOGS207138  |
---|---|
Genomic Position | scaffold1:+ 3423149-3426267 |
See gene structure | |
CDS Length | 1218 |
Paired RNAseq reads   | 1073 |
Single RNAseq reads   | 3169 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012757 (3e-39) |
Best Drosophila hit   | crooked legs, isoform A (2e-09) |
Best Human hit | zinc finger protein 799 (3e-11) |
Best NR hit (blastp)   | hypothetical protein BRAFLDRAFT_125176 [Branchiostoma floridae] (8e-17) |
Best NR hit (blastx)   | hypothetical protein BRAFLDRAFT_125176 [Branchiostoma floridae] (4e-17) |
GeneOntology terms    | GO:0005622 intracellular GO:0008270 zinc ion binding GO:0003676 nucleic acid binding |
InterPro families    | IPR015880 Zinc finger, C2H2-like IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding IPR007087 Zinc finger, C2H2-type |
Orthology group | MCL40783 |
Nucleotide sequence:
ATGAATACAGCGGTATTGAAAGTGGAAGCAGAAGCGGAAACATGTTCGGTGTGCGGTGGT
GGCGGAGATCTATTCACGCCAGAAGAACACGATGCCGGCGCTCCGCCAATGCAAGTGTCT
TTACGAACGATGCTATTGCAAATTAATAATTATAAGGTTGTTCCCGAAGGGAGGCTTTGT
ATTAGTTGCATACGTCGCGCCATTGAAGCGTATGAGTTTAGTTCTGCCTTAGGATCCAGA
ACAGCTCCTCCATTAAGTGAGAAAATCCGTACTTTGAGGAGAAGACTACATGATTTAACG
CAGAAGGTAGATGTTTTCATCGTGGTCGGTGGACCGGGTGTGAACTCTGGAGGTGCATAT
AGTGAGGACGATATTATAATGGTAGAACGAGACGCCTTAGCTGCAGCTGCTGCTGCTGAT
GCTGATGATGAGGATCTTGAAAAAGCTAGAAATGCCTGTGGGGACTCGGTGTACCAATGC
TCTATATGTCCCATGTCGTTCCAGCATGCAGCTGAGTATCGATCGCATGTAGCAAGTCAC
CCAGGCGGTGCTCGACACTCGTGCTGGACTTGTGGCGCACAGTTTGCACAAAAAGAGGCA
CTTCGAGACCATGCAGCTGAACATTCTACTCCTGGACTCATCTGCCAACTCTGTCGGAAC
AGATTCCAGAATGCCTCCGAGTTGCGTCGTCACGAGTCGGATGAGTCGTGCCCGTCTCGC
TGTCCGATCCCAGGGTGTGGTGTGCGGTGTATGTCGCGCAGCTCGCTATCATCACACGCC
TCGGCATCCCACTCGAGGGACCCGCCCTTGCTCTGCTCGCAATGTTTCGTGCAATGCTCC
ACTCGAGCTCAGATGGCGGCCCATGCTCTGTCTCACCGCTGTGCCGAGCGGTTCGTCTGC
GGTTATGATGCTTGCATATTGCGCTTCGCAAATAGAGGTGACTTGTTGTCTCACATCCGC
AAGCAGCATGCGGGTTCCGTGCCTGACCCAACCTCGGAACAGCCACCATCCACCACTTGT
CACTGTGGACGCATTTTCGGGTCAGTGGCGGCTCTAAAGCGTCATGCTCGTGTTCATCGC
CGCGAAACGCAGCAGGAGGAACCAGAATGGACCATCACAATGGAGACTGGGGAGGGGGCG
GAGGAAGGGGAGGGAGCGCTCGACGGAGACGTGGAGTACCTCGAGCTGGAGGCGCTCGAC
GAATACGACGAAAATTAG
Protein sequence:
MNTAVLKVEAEAETCSVCGGGGDLFTPEEHDAGAPPMQVSLRTMLLQINNYKVVPEGRLC
ISCIRRAIEAYEFSSALGSRTAPPLSEKIRTLRRRLHDLTQKVDVFIVVGGPGVNSGGAY
SEDDIIMVERDALAAAAAADADDEDLEKARNACGDSVYQCSICPMSFQHAAEYRSHVASH
PGGARHSCWTCGAQFAQKEALRDHAAEHSTPGLICQLCRNRFQNASELRRHESDESCPSR
CPIPGCGVRCMSRSSLSSHASASHSRDPPLLCSQCFVQCSTRAQMAAHALSHRCAERFVC
GYDACILRFANRGDLLSHIRKQHAGSVPDPTSEQPPSTTCHCGRIFGSVAALKRHARVHR
RETQQEEPEWTITMETGEGAEEGEGALDGDVEYLELEALDEYDEN