New model in OGS2.0 | DPOGS209864  |
---|---|
Genomic Position | scaffold5089:- 6615-13667 |
CDS Length | 2553 |
Paired RNAseq reads   | 696 |
Single RNAseq reads   | 1791 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA001744 (9e-73) |
Best Drosophila hit   | crooked legs, isoform A (6e-23) |
Best Human hit | zinc finger protein 658 (1e-44) |
Best NR hit (blastp)   | PREDICTED: zinc finger protein 228 [Pan troglodytes] (4e-52) |
Best NR hit (blastx)   | novel protein [Xenopus (Silurana) tropicalis] (2e-60) |
GeneOntology terms | GO:0046872 metal ion binding; GO:0008150 biological_process; GO:0005634 nucleus; GO:0005575 cellular_component; GO:0003674 molecular_function |
InterPro families | IPR007087 Zinc finger, C2H2-type; IPR012934 Zinc finger, AD-type; IPR015880 Zinc finger, C2H2-like; IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding |
Orthology group | MCL39581 |
Nucleotide sequence:
ATGAAGACGTCTTGGTTGAGCTTGTTAGGGTTACCGCAATGGGTTCCTCGTAGTTTGGTG
TGCTCTGACCACTTTAAGGACGACGATATTTGTGAAGTCGAAGGTGGCGAGAGGAAACTC
CTACCAGGAGCTGTACCTAAAGCTGTTACTACACCAGTGAACAGTACTGTTACACAGGTA
TGTCGTATATGCTTGGGGGCAGAGATGAGCGTCTACCCGATACTAAATCCCGATCTTCAG
CAGACATTTATATTACTCACAGGCATGAACATATACAAAGATGACTACCTGCCTCAGAAT
CTTTGTATCGAATGCATGCAAAGATTAAAAAATTGCTATAAATTTAGAACGCAGACGTTA
CAGGCGCAGACTATTATTGTGGATCTCATTAGAATAGGCAATTTCACAACAGAAGCTATA
AAATCATTAAATCACAATCCCAAATGGAATCTGCAAATGCAGAGGTTTGATGCGAACCAC
TGCGATGCATACATTAACAAAGAGGACGTGAACCCCATACAAGAGATCGTTATAGAGGAA
CAGGTTAAAATTGAAAAGGAATTTATTAACGAGCAAGACACAGTCGCTATGGGCTACACA
ACAGACGACAGTCTGCCACTAGAGACGCAGAGAACCAAAGCCAAGAAGACAAAGAAGAAG
AAAGAGAAGAAGATACAAGAACCGAAGGTAGATAGGAGGAGAAAGCCGTTCCTTAACGAT
GATCTGAATGAGAGTCTGTTCACTATCACCGATCTGACCTTGGAGGAACAGATAGCTGAT
ATCCAGAAGAGACAGGAGAGTTCTAACTTCAAGAATTCAGTGTACAAGTGTATGGAGTGC
TTTAAGGGTTTCCTTGATGAAGGAGCGTACAACGGACATATGACAAGGCATACTACTCAA
TGCGGTGAATATTGTTGTGAAATTTGTAAGACACATTTCAAACACTCGCACGCATTGAGG
AAACACACGACGGCGCATCACGCGCAGAGGTTCAATTGTAACCGGTGTGCTTTCGTTACT
ACACACAGACAAACAGCACGACTCCACGAGCGATGGCACAAGGGCACCAAATACGAGTGT
CCGCACTGTAATGAAGTGTTCCTTAAATTCACAACCTACATGGGACACATTCGCATCAAA
CACCCGTCGGATTTCGTGTGCGCTCTGTGCGGTTATTGCTTCGTCAGTCAGAAGGGGATA
GATTTGCACAAGAAACTGAAACACAGATTACATCTCGGACAGATCCCGGAGGACGGGCCG
CTCTGTGAGCTATGTGACGTGCGGTTCATATCACAAGAGGCTTACAAACGACACCTCAGC
GTCTCCGCGAGACACGCTGGCGACGAAATATCCAAGGATCCCAGCAAACCGAAACGCGGA
AGAAAATCAAGGGACGCCCTCGACAAAGACGACAGCGAAAAAAACGACTTAAACGACAAG
AAAGTGTATCCGAGTCAAGTCAGAAAAGCAGAAGGTCCTATACCGTGCGAGCAATGCGGT
ATGCAGTTAGAGGACTCGCGCGCCTACCACGCTCACTTCAGACGGAACCATCCCGACAAG
AACAGGACCAACTACCCCAGCATGAAGTCGCCCTGCATGTGCGAGGTGTGCGGCAGGATG
TTCCAGAGTCACGCATTGTTGAAAGACCATCGCTGGGTGCATACAAACGAGAGGCCATTC
GCCTGTGAGTGTGGCAAGAGGTTCCGTATGAAACAACGCCTGGTGGCGCACAGAAGGGTA
CACAGACAGACCAGGCACTACACATGTGCTCTGTGCGGGAAAGGATTCAGCACACACAGC
AACAGGCAGAGACATATGATTGTACATTCACACCGGGCTAAAACCATTAGCAAACCGAAA
CGCGGAAGAAAATCAAGGGACGCCCTCGACAAAGACGACAGCGAAAAAAACGACTTAAAC
GACAAGAAAGTGTATCCGAGTCAAGTCAGAAAAGCAGAAGGTCCTATACCGTGCGAGCAA
TGCGGTATGCAGTTAGAGGACTCGCGCGCCTACCACGCTCACTTCAGACGGAACCATCCC
GACAAGAACAGGACCAACTACCCCAGCATGAAGTCGCCCTGCATGTGCGAGGTGTGCGGC
AGGATGTTCCAGAGTCACGCATTGTTGAAAGACCATCGCTGGGTGCATACAAACGAGAGG
CCATTCGCCTGTGAGTGTGGCAAGAGGTTCCGTATGAAACAACGCCTGGTGGCGCACAGA
AGGGTACACAGACAGACCAGGCACTACACATGTGCTCTGTGCGGGAAAGGATTCAGCACA
CACAGCAACAGGCAGAGACATATGATTATTCACACCGGGCTAAAACCATTTAAGTGTGAG
ATGTGCGGCAAATGTTTTAAGCATGCCAGCGAGAAACGGGCTCACATAACATATGTACAT
CTCAAGAAGCCCTGGCCGAAGAGATCACGGGCGAAGAGACAAGGACAGAATATAACAGGT
ATGGCGAGCGCGAGCGAGATAGACATACAAGGATGGAACGATCCCAAGATAGATCTGATG
ATGGATAAACAGTATTTCAATGTCAAGATGTAG
Protein sequence:
MKTSWLSLLGLPQWVPRSLVCSDHFKDDDICEVEGGERKLLPGAVPKAVTTPVNSTVTQV
CRICLGAEMSVYPILNPDLQQTFILLTGMNIYKDDYLPQNLCIECMQRLKNCYKFRTQTL
QAQTIIVDLIRIGNFTTEAIKSLNHNPKWNLQMQRFDANHCDAYINKEDVNPIQEIVIEE
QVKIEKEFINEQDTVAMGYTTDDSLPLETQRTKAKKTKKKKEKKIQEPKVDRRRKPFLND
DLNESLFTITDLTLEEQIADIQKRQESSNFKNSVYKCMECFKGFLDEGAYNGHMTRHTTQ
CGEYCCEICKTHFKHSHALRKHTTAHHAQRFNCNRCAFVTTHRQTARLHERWHKGTKYEC
PHCNEVFLKFTTYMGHIRIKHPSDFVCALCGYCFVSQKGIDLHKKLKHRLHLGQIPEDGP
LCELCDVRFISQEAYKRHLSVSARHAGDEISKDPSKPKRGRKSRDALDKDDSEKNDLNDK
KVYPSQVRKAEGPIPCEQCGMQLEDSRAYHAHFRRNHPDKNRTNYPSMKSPCMCEVCGRM
FQSHALLKDHRWVHTNERPFACECGKRFRMKQRLVAHRRVHRQTRHYTCALCGKGFSTHS
NRQRHMIVHSHRAKTISKPKRGRKSRDALDKDDSEKNDLNDKKVYPSQVRKAEGPIPCEQ
CGMQLEDSRAYHAHFRRNHPDKNRTNYPSMKSPCMCEVCGRMFQSHALLKDHRWVHTNER
PFACECGKRFRMKQRLVAHRRVHRQTRHYTCALCGKGFSTHSNRQRHMIIHTGLKPFKCE
MCGKCFKHASEKRAHITYVHLKKPWPKRSRAKRQGQNITGMASASEIDIQGWNDPKIDLM
MDKQYFNVKM