Genomic Position | scaffold1389:+ 49335-53608 |
---|---|
See gene structure | |
CDS Length | 1758 |
Paired RNAseq reads   | 425 |
Single RNAseq reads   | 975 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008782 (0.0) |
Best Drosophila hit   | CG12299 (3e-27) |
Best Human hit | PREDICTED: zinc finger protein 208 (1e-28) |
Best NR hit (blastp)   | hypothetical protein BRAFLDRAFT_87563 [Branchiostoma floridae] (1e-31) |
Best NR hit (blastx)   | hypothetical protein BRAFLDRAFT_87563 [Branchiostoma floridae] (2e-39) |
GeneOntology terms    | GO:0003676 nucleic acid binding GO:0008270 zinc ion binding GO:0005622 intracellular GO:0008150 biological_process |
InterPro families    | IPR007087 Zinc finger, C2H2-type IPR022453 Zinc finger, YgiT-type IPR012934 Zinc finger, AD-type IPR015880 Zinc finger, C2H2-like IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding |
Orthology group | MCL40331 |
Nucleotide sequence:
ATGGCACTAATGAAAAACAACGGACCCATAATAGACCCAGCTTTATGCCGTTGCTGTAGA
GCTATAAAAAAATGTAGAGTTCTCACAGTGGAGTACACCTGGATGGAGCAGAAAGAGGTT
TACGCGGATATGATCATGGACTGTTTCGGTATACTCCTGTCTCATGTAGATGAAGATGTA
AAAGATAGTGGAGTTTGTGCTACGTGTGTGGTCAGACTGAGAGATGCTTGTGCCTTCAGA
CAACAGGTGCTGCAATGCGAGGAGTTGTTCCTTGGAGCCAAGTTAGGGGATAAGGATGAA
ACCAAAAAGACTTCCGAAATGGAACTGAAAACGGAACCAAAGGATGACTGCAGTGATCAC
AACTCGGTGAACGAGGATCCCAACGACAGCATGTCCTGCGATATTGAGGCTGAAGTGAAA
GACGTGAAGCTTCCAAAGAAAGAGGAAGACGAGAAAGCAGAAGATGGAAACGATGTGAAA
GAAGATTTGGAGGATGTTATGGACGATGAAGATTACAACGACTATATGGATTCCAGTTCG
GACTCGGACAAACCGATCAAGAGGAAAGCTAAGGTGAAGAAGCTAAAGGCGAATAGTACG
AAGAAGAAGAAGAATCTGAAGGCCAAAACTAAGTTGAAGCAGAAACCAGGCTCCTCAACA
GAAAAGAAACTCAAGAAACCGCCAGTTGAAAAGAGGAACACATACGACAGAGACTTGGAT
TGTATGTCAGAGGAAAATCTTTTGACAATCATACAATATTCATATGTATGTCCTTTCAAG
AATAGAAGGAATAATTACTACTGTTTCTACTGCAAAGATTATTATCCAAAACCGGAAGAT
TTACGTGAACATACGATATCCCACGACACGAAACCTTTCCAATTGTTGATGGGCTATAAA
AAAATGCCGAAAATCGACATAACGAGAATCGATTGTAAGCTCTGTCCGATGAAAATCGAT
GATTTGGACACATTCAAACGTCACATCGACGAGGTACATAACAAAAAGATATACTTCGAG
GCCCCGGATAAGATGTTGCTGTACAAACTGACGTGGAACGATCTAGTGTGTGTTATGTGC
AGTGATGTGTTTGAGGACTTTAATACGTTGAACACTCATATGGTGGAGCATTTCAGTAAC
TATACGTGTGACATATGCGGCGTGTGCTTCCTAGAGAAACCGCGTCTGGACGCTCATCTG
AAGCGTCACAAAGACGACGAGCGTCACACGTGCGAGGTTTGCGGTAAAGTGTTCAAATCC
AACCACTACAAGGACATGCACGTTGATATAGTGCACAAGAAGAAAGCTATCATACGTTGC
CCGAGATGCGACGAGTGTTTCATGTCGTACGCGCTTAAAAACAAGCATTTAACGGAAGCT
CACGGTCAAAAACGGACGTATCCGTGCAATTTGTGCGACAAGGTATACAACAGACAGAAG
ACATTGACCGAACACCAGAGACGTAATCATCAGAAAGTCTTGAAGCACCAGTGTGAGTAT
TGCGACCAGAGGTTCTACTTACCATCCCGTCTTAAAGAACATATAGCGACGCACACAGGC
GAAAGGAACTTCAGATGCGAGTACTGCGATAAAAGCTATCCTCGGCTAAAGTCTCTACAG
TATCACATCAGAACTCACACCAACGACAGGAGATACAGGTGCCATATATGCGGCCAGGCC
TTCATACAGAACGCCAGCCTCAAGTCGCATATTAAGAGCCATCATCCCGAATGTGATATC
GAGGGATGTTATTTCTGA
Protein sequence:
MALMKNNGPIIDPALCRCCRAIKKCRVLTVEYTWMEQKEVYADMIMDCFGILLSHVDEDV
KDSGVCATCVVRLRDACAFRQQVLQCEELFLGAKLGDKDETKKTSEMELKTEPKDDCSDH
NSVNEDPNDSMSCDIEAEVKDVKLPKKEEDEKAEDGNDVKEDLEDVMDDEDYNDYMDSSS
DSDKPIKRKAKVKKLKANSTKKKKNLKAKTKLKQKPGSSTEKKLKKPPVEKRNTYDRDLD
CMSEENLLTIIQYSYVCPFKNRRNNYYCFYCKDYYPKPEDLREHTISHDTKPFQLLMGYK
KMPKIDITRIDCKLCPMKIDDLDTFKRHIDEVHNKKIYFEAPDKMLLYKLTWNDLVCVMC
SDVFEDFNTLNTHMVEHFSNYTCDICGVCFLEKPRLDAHLKRHKDDERHTCEVCGKVFKS
NHYKDMHVDIVHKKKAIIRCPRCDECFMSYALKNKHLTEAHGQKRTYPCNLCDKVYNRQK
TLTEHQRRNHQKVLKHQCEYCDQRFYLPSRLKEHIATHTGERNFRCEYCDKSYPRLKSLQ
YHIRTHTNDRRYRCHICGQAFIQNASLKSHIKSHHPECDIEGCYF