New model in OGS2.0 | DPOGS203346  |
---|---|
Genomic Position | scaffold1196:- 60268-65399 |
See gene structure | |
CDS Length | 4815 |
Paired RNAseq reads   | 2979 |
Single RNAseq reads   | 6883 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA011997 (7e-89) |
Best Drosophila hit   | CG1832, isoform A (3e-06) |
Best Human hit | zinc finger protein 141 (3e-09) |
Best NR hit (blastp)   | mCG141045 [Mus musculus] (2e-09) |
Best NR hit (blastx)   | PREDICTED: ZNF41 protein [Danio rerio] (2e-27) |
GeneOntology terms    | GO:0003676 nucleic acid binding GO:0005622 intracellular GO:0005634 nucleus GO:0006355 regulation of transcription, DNA-dependent GO:0008270 zinc ion binding GO:0046872 metal ion binding |
InterPro families    | IPR007087 Zinc finger, C2H2-type IPR015880 Zinc finger, C2H2-like |
Orthology group | MCL40702 |
Nucleotide sequence:
ATGGCTGACGATATCGATATCGAGGAGCACGATGTCCTGGACAATCCGCTGCTTAAGAAC
ATATTTCCGGACATCACAAGTATAAAGACGGAGGTGATCGACTACGAAGATGAAGAGAAT
GATGAGGAGAATATGTTGTTTAATGGGGACGAAGACTTCAGTTATGAGATTCAACAGCAA
AAAAACGATCCATCAATCCACGAGAGCAAACCGAGCTCGTCATCAACATCTGGTGTCAAC
ACACCGGAACCACATGCAACGATAAGCAGAGGAGAATCTCATTCGGACGGCACATCTCAT
CATTGCAACGAATGTGACTCGATGTTTCCGACCGAAGAAGCTCTGGATGAACATAAAACG
ATAACCCATTCATATCTGGTGGCTGTCAGGAAGAACAAAGCGTACAAGAGCACAACAAAG
GGCTCGATATTCAATCGCAAGATCAAAATGGAACCTGATCCGGTTGAGGACAAGTCAGTT
TTGTGCAGCTGTTGTAACGAGGTCTTCCCAGATGAGCTGGCATTCATGAAACATTCGTAC
AGCGTGATGCCCGTGTGTTTCCAGTGCGACCTGTGCGACATGGAGTGTGACAGCGAGGCC
GCTTTGAGGAACCACAAAGCGACCCATTCTATGAACGACGAGTACATATGCTGTCCGGTC
TGCTCCTGCCACTTCCGGAACAGAGTGAAGCTCTGCAACCACATGCGCATCTTCCACGGC
TTCAACGACGACGTGCCGGAGCCGACGACCGCGGTGGACTTCGATTGCGAGGCCTGCGGT
CACTCGCTGCCGGATTGCAAGCGCTACAACCACCACATACAGCACAAACATCCGGAACTG
TACAACAAAGTGGTGGATACCCGCAAATATCACTGTCCGCCCTGTAATCTGACTTTCCCG
TCGTCGTACTCCGAGAAGATACATCGCGCGGCCAAACACGCGGTGCCAGATGACGTGAGT
TACGAACAGCAGTCCACGAGCTCGCAGAACTACCTGCCGATGCCGATCGCTACACTGTTC
AAGTGCACCAAGTGCCACGTGCATTTCCTGTCGTTCATGAAAGCGGTGGAACACTTCAAA
ACCTGTGAGGCTGATGCTGGTGATTACAAATGCAAAATATGCAGGCGTTTCCTCAACAAA
CCGGACAAAATTGGTCATTTGAAACAGCACGAAATGGTTGAGAAATTGAAGGGCATTAAG
ATAAGAGACGTCGAGATACAGAATAAGATAGTTTGCAATTGCCGCAAATGCCAGATATGC
TTCGACGAAGCCGGCTTCCAGAAATGCCACACGGACGTCTGTTTGCCAGCTAAAAGCGTG
AAGTGCTCGCTGTGCCGTCTGATAATACACGAGGATTATATGGTACAGCATTCAAAGGCT
CATGCGACCGGCGTCAAAACTACGGACTTCATTGTGGTTGATTATATGTGCTTTGACGAG
AACGAGCAGAAAGAACAGATACAAGAACAGCCGAAGGTCAAGATGGATCGTACAGCTAAG
CCGAAACACCTGTTCTACTGTCCCACCTGCAAATGCTACCTCAAAATCAACAGAACGATC
CACGGGCACAGTGTCGGCAAATGCAACAGGTCTCTCAACAAATACCTGTGCAAGCTCTGC
GGTTTGTGCTTCACTGTTAAAGGAATGAAGACTCACAAGCGGGAGCACAAACTCTACCCC
AATCTGAAGCTCCACGAATTCAAATTCGTTTCGACGCAAACCGGAAGGCAGATAGAACCG
AAGTTTCCGGAATTCAAGAAATGCAAAGCCTGCAGCGTCCACTTCTTCAGCGAGCAGGCC
AGGAGACGGCATTCCTGTTTCTCCGAAGTCCACAAAACTTGCCAATACTGCCGCGAGAAC
TTCAGTGATCTGGCATTCAAACTCCACGTGCCGTTCCACAAATATGGCGATTGGGACAGC
CAAAAAGTCAACATCCCGGATATACTGAAAACCTACGAGTCTCTCCAGACCATGTGGAAT
ATACTCTACCTGTGCGAGACCTGCGACACTATAATAGACACATACGACTCCGTGGTAGAG
CACTCCCAAGACCACTTCTGCAACATGGAGAGCTACAACAAGACGATAAACAATTGTGAT
ATATGCGATCTCAAATTCGTTGGCAATTCAATAGATAGACACAAAGAATTGCATCTCGGG
AACAGCCTCAGGAAGGATTCGTTCATAATATTAAACTATGATTACGAGAAGCTTCTATCC
AACGAGTGGTTGAAAATGTTCGCGTGCCTGGCCAAGGAGCAGGTCAATCAGATACTGTCT
AGGAGCATTTACAAAGTCACAAGGAGCATCAGAATGGAAATCGCCGTGGACGGTCCCTTA
CACACGACGCTGTACCGGTGTGGGGGGTGCAGCAATATCATAGACACGGACTGCGTCGAG
GCGCACGCCCAAAATAATAATTGCTCCAACAACGATTTCAAATACAATTGCGTCACTTGC
CGGCTCAGCTTCGCTACGCGGAACTCGAAGGCGGATCACGATTATCTTCACAAGACATGC
AAATTGGACAGCAACTGTTTGAGAATTATAGATTTCAATTTGCAAAAAGACTTCTACGTG
AACGATGTCCTGCGATCCAAGTTGCGTCCCGAGGGTTTGAGCTTCAAGAGATGCCAGCAA
TGCGGTAAACTGATACAGAAAGACAAATTCCAGAAACACAGCGCCTTCCACGTAGAACAA
AAACACAAGCTCTTCAAGAACCAACCCAAGTTTGTGTCGAAGTCGTCCGCTAAGAAAATG
GTCCACACATTCTACACTTGTAGGAAATGTAAAGTTAGCGTCGTCTTCAAACAGACGATA
AACATACACACCTGCAAGACTCTGGTCAGACTCGTAAAGTGTAACAAGTGCGGGCTCATG
ATCAGAGCGTCGAGTTTGCAGAGGCACAGCCAATACCACAGAAAATTCCCGAAAATGACG
GCAGCGGACATAAAAATAGTGTACTTCCAAAATAAATGCATCGAGGATCCTGTAAGACAA
AACGAGGAGCAGTTGGTCTTCTACCAATGTACGGACTGCGCTTTGACTGTACACAGAGAG
TCCACAACCAAGAAACATGCCTGCAACGGCTCACTTAACAAAAAATATTGCAATATATGC
GAGCTGTATTTCCACGCGTCAAATTTTATGCACCACGAAAAAATACACGAGCAGATGGCA
TTCGGCAAAGACGACATCACCCTCTTGCAGTTCAGGAACGGCGTAGTTTACGGAGACGTC
AACCCGCAAACGAAGATCGTAAGCTATAAGCCGAAGCGGGAGAATATATACATAAAGAAA
GTGCGGAAACGTCGTCTGTTCGACAGCCAGCTGAACCACGAGTTGACCGACAGGGAATTC
TACCGGTCTATATCGGCTAGACTCTACAAATGTGATACCTGCAAATTGCTCTTCGTAACC
GGCTCATGTCTGGCTCAACATCAAAAAATATGCAGCGAGAACAACACAGGCGTTGAATGT
AAAAATTGCGGACTGATTTTCCACGAAACAGCGATAAAACATCACATCACACTCCAGAAA
TGCAATCTCAAACCAAACATCAATTTTATTTCGCTTAAATTGAACTGTGAAACCTACAGC
GACAGGCGGGTCGTGTACCTCTGCCAGCAATGCAACGTGTACAACATATCCCTGAGGGGC
AACATGTATCACGTGGAAGCTAACCACAGGATAGGCAAAATGACGGTTAAGTGCGTCACG
TGCAATATAACCTTCTCGTCGGTGAGTTACAGGAACCACATGAAGCTGCATCACCACAAG
AAGAGAACCGGCTTCAAAGACTTGGCAGTCGCCACGGTCACTATTATGACGTTAGCGGAC
GCGCTCAAAGATTTACCTGACAGAAGTCAAATCAGATTGATAGATTTTGATTCCGTCGAA
GAATCGGGTGTCGATGGAAGAGCAGTGAAACGAAAACTGTCGCTGGATGATGAGGATTCG
CAAGACGAAACAAACAAACTGCCTAGAATTGAACCCAGGGCCACTACTAGCGATGAGATT
CTGTCACCGAAATCTAAGATACAGTTCAATAAAAACAGCCTCTACTCCTGTGGGGTCTGC
GACCTGAACTTCCTCCATCCAAAAACCCTCAAACGTCACATGGACATAGGACGTCACGAC
GAACAAAGATACGTCTGTCCCGAATGCAATCTTATGTTCACAAGAATATCTCTAACAAGG
CACATGTACACCCACGAGACGGTGGAGGAGACTAACGACTACAGACCGAAATACAGAAAC
GAGTCTAGGACTAGGCGGAGCCAAGGAGACAGCAGCGAGGAGAATTCACAGAGCTCCAGT
ACATACAAAGTGGAAATAGAGCCGAGTATATCAACGGAAGAGGCGAGTAGTCAGGGCGAT
CCAGAGGTCAAGCTATACAAGTGCGCCGCCTGCGACGTCTATTACCTGAAAGAAGATATA
TGTGTGGAGCATGTTACGGAGCACGCGGCGTTGGATCCCACAGAGTATATAGCGTGCAAG
ATGTGCGACCTGCAGTTCCTCTGTGAGTATCTCGGCTCGCACATGAAGACTCACCGCGAC
AAGTCCTTCAATATCGATAAATTGATAGTCCTCGAATACCAGATAGTCGATAACAGCGTG
AAAATCGATACATACTCAGCGGCCGACAGGTTGAAATCCAAATTGGTCAGCACCACGACC
CACTCCGATACAGAGGACGAGAAGAACGATAACATAGACTCGACGACTGATGAAAACAAT
TTCAACCAATCAGCGCCCTCCGTGTCCGACCAATCAGCGGACAGACCGACGCCTCAAATG
GAGAGCGCCAATTGA
Protein sequence:
MADDIDIEEHDVLDNPLLKNIFPDITSIKTEVIDYEDEENDEENMLFNGDEDFSYEIQQQ
KNDPSIHESKPSSSSTSGVNTPEPHATISRGESHSDGTSHHCNECDSMFPTEEALDEHKT
ITHSYLVAVRKNKAYKSTTKGSIFNRKIKMEPDPVEDKSVLCSCCNEVFPDELAFMKHSY
SVMPVCFQCDLCDMECDSEAALRNHKATHSMNDEYICCPVCSCHFRNRVKLCNHMRIFHG
FNDDVPEPTTAVDFDCEACGHSLPDCKRYNHHIQHKHPELYNKVVDTRKYHCPPCNLTFP
SSYSEKIHRAAKHAVPDDVSYEQQSTSSQNYLPMPIATLFKCTKCHVHFLSFMKAVEHFK
TCEADAGDYKCKICRRFLNKPDKIGHLKQHEMVEKLKGIKIRDVEIQNKIVCNCRKCQIC
FDEAGFQKCHTDVCLPAKSVKCSLCRLIIHEDYMVQHSKAHATGVKTTDFIVVDYMCFDE
NEQKEQIQEQPKVKMDRTAKPKHLFYCPTCKCYLKINRTIHGHSVGKCNRSLNKYLCKLC
GLCFTVKGMKTHKREHKLYPNLKLHEFKFVSTQTGRQIEPKFPEFKKCKACSVHFFSEQA
RRRHSCFSEVHKTCQYCRENFSDLAFKLHVPFHKYGDWDSQKVNIPDILKTYESLQTMWN
ILYLCETCDTIIDTYDSVVEHSQDHFCNMESYNKTINNCDICDLKFVGNSIDRHKELHLG
NSLRKDSFIILNYDYEKLLSNEWLKMFACLAKEQVNQILSRSIYKVTRSIRMEIAVDGPL
HTTLYRCGGCSNIIDTDCVEAHAQNNNCSNNDFKYNCVTCRLSFATRNSKADHDYLHKTC
KLDSNCLRIIDFNLQKDFYVNDVLRSKLRPEGLSFKRCQQCGKLIQKDKFQKHSAFHVEQ
KHKLFKNQPKFVSKSSAKKMVHTFYTCRKCKVSVVFKQTINIHTCKTLVRLVKCNKCGLM
IRASSLQRHSQYHRKFPKMTAADIKIVYFQNKCIEDPVRQNEEQLVFYQCTDCALTVHRE
STTKKHACNGSLNKKYCNICELYFHASNFMHHEKIHEQMAFGKDDITLLQFRNGVVYGDV
NPQTKIVSYKPKRENIYIKKVRKRRLFDSQLNHELTDREFYRSISARLYKCDTCKLLFVT
GSCLAQHQKICSENNTGVECKNCGLIFHETAIKHHITLQKCNLKPNINFISLKLNCETYS
DRRVVYLCQQCNVYNISLRGNMYHVEANHRIGKMTVKCVTCNITFSSVSYRNHMKLHHHK
KRTGFKDLAVATVTIMTLADALKDLPDRSQIRLIDFDSVEESGVDGRAVKRKLSLDDEDS
QDETNKLPRIEPRATTSDEILSPKSKIQFNKNSLYSCGVCDLNFLHPKTLKRHMDIGRHD
EQRYVCPECNLMFTRISLTRHMYTHETVEETNDYRPKYRNESRTRRSQGDSSEENSQSSS
TYKVEIEPSISTEEASSQGDPEVKLYKCAACDVYYLKEDICVEHVTEHAALDPTEYIACK
MCDLQFLCEYLGSHMKTHRDKSFNIDKLIVLEYQIVDNSVKIDTYSAADRLKSKLVSTTT
HSDTEDEKNDNIDSTTDENNFNQSAPSVSDQSADRPTPQMESAN