New model in OGS2.0 | DPOGS202167  |
---|---|
Genomic Position | scaffold981:- 41989-47954 |
See gene structure | |
CDS Length | 4890 |
Paired RNAseq reads   | 3050 |
Single RNAseq reads   | 7351 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003310 (0.0) |
Best Drosophila hit   | crooked legs, isoform A (5e-30) |
Best Human hit | zinc finger protein 624 (3e-28) |
Best NR hit (blastp)   | PREDICTED: similar to mCG7830 [Acyrthosiphon pisum] (4e-51) |
Best NR hit (blastx)   | PREDICTED: zinc finger protein 225 isoform 1 [Pan troglodytes] (2e-77) |
GeneOntology terms    | GO:0005634 nucleus GO:0046872 metal ion binding GO:0003677 DNA binding GO:0006355 regulation of transcription, DNA-dependent GO:0008270 zinc ion binding GO:0003676 nucleic acid binding GO:0005622 intracellular |
InterPro families    | IPR007087 Zinc finger, C2H2-type IPR012934 Zinc finger, AD-type IPR015880 Zinc finger, C2H2-like |
Orthology group | MCL39739 |
Nucleotide sequence:
ATGGCACTCAAGCTAGGAAAATGCAGATTGTGCCTGAAACTGGGTGACTTCTATTCGATC
TTCACTATGGATAACAATTTGCAGTTAGCGGAGATGGTAATGGAATGTGCTCGAGTTAAA
ATATATGAAGGTGACGGACTTCCGGACAAAATTTGTAGTGAATGTATACAGAAACTCAGC
AGCGCTCACATTTTTAAACAGCAATGTGAGAGATCAGATCAAGAGTTGAGACGCAATTAT
ATTCCACCACCAGCTTTCAGTTCTACACCTCCACCGCCAAATAGACAAAGCAGTGACTCC
GCAATTTCAACTCACCTGGAAGTTTCAAAGCCTTCGTCTTCAAATGATAGTAAAATGTCG
ATCGAGAGCAAACTCACTCCCATCAGTCGTAACAGAAAACGCAGTAAGGATAGTATGGGT
GACGCATCAACCAGTAGTGACTGTAGACCGGGCAGTTCCAAAAGAGTCGAAGAATTAAGA
AGGACACGCAAGAAGCCAAAAATGCTCTCGAATTACGATTCCGACTTCGATGACAGTGGC
TCATTTTACTCTCAGGAAACCGATTCAGACGATCCGCTACTGTACAAATGTGATATATGT
TCCAAAGCATTCAAGTCAAAGAACAGTCTCTCGGGACATTCTAAATGTCATAAAAGAAAA
AATGCATTAAAATATGACTCGGTGACCAAAGAGGATTCAATGTTGAATGCATATGTGCCA
GAATTATCGAATGTGAGAGACGCTCACGACGATGACGATAAACTGAAATGCGAAAAATGC
GGGAAAGAATTCAAATTGAAGATAATGCTCAAAAGACACAACGAGATCTGTAGCAGACCA
CCGATGAAAGAGCTTTTGATATCTCTAGAGCCGATCAACATTACACGTAAAAGAAACAGA
CTGGATTGCGAGCTGTGTTCCTTGAAATCTGGAACTGTGGAGGGCTTGCAGGAGCACATG
AAGCTCGAACACGCCATGGAACTCGACAAGGACAAGGTGTGCATGAGGGATCGCGACGGA
AAAATATGTGTTCCGTGCTGCTACTGTGAAGAGAATATCGACGACTTCTATAAGTACACC
GCTCATATCGGCGAATGTACCAAGAAGGGCAATGCCGCGGACATCGTGTGCCCCGTGTGC
AAACAGACGACGACGAAATCCAATTACTTGGTACACGTTAAGCTGCATTTCTTTCCGACT
CGGACGATTGAATCTGGTGCTACTAAAGAAAATTTCCAGTGCAGAATGTGCAACAAAGAG
CTGCCGAGCCAGGAGTTACTGATCAAACACCTGGCTGCTCACATGTCCAATATAGATGAC
GCCGATGAGGGGGGCGACGAGGAATCTCGAGCGAGTACAGTTGAAGACTGTGGGTCGATA
CATTCTGAATACAGCATAAATACTCCAAAAACAACCTTGCAGTGTCAACATTGTGATAAG
ACGTTTAAATATAAAAAAGCCTTACAATCTCACGAGGAGAAGCATAGGCGTGAAGTGAAA
ATAGAAGGTCCCGAAACACATCAGTCAGCAGATAGCATCAACGTAGTGGACCCATCATTC
GCTCAGTACGACTCTGACACTAGTCAAGAAGACGGCGAAGACGATAACACGTGTGATATT
TGTGAAAAACAATTTTCCTACAAGAGACAGCTGTTGCAGCACAAGAGAACCAAGCACCAT
ATGACGTCCGGCACCAAGAGGGCGAAGATTAACCTGAAGGACTGTTCGGTCCGATGCTTG
ATATGCGATATAGAGATGAAGGTGAGCGCGATCAACGAGCACAACCAGACGCACATCTCA
GTGAACATCAAGCCCAAGAACCAGTACACGTGTATACAATGCACTGAACAGTTCAAGAGC
TGCAGCAATCTGGCCAATCACATCAAGCTGATTCACAGACTGAAACAGCAGCCGATGGAT
TCGAAAATGAGAGCCGATTTGGCGGATTTTTGTGAAGTCGTTGTGACCAAGGCGGAACCC
CTGGACGAGCTCCAGAATCACAACGGCGTCAATGAAAATTCCGCCACCGATGTTAAACCT
TTAGTCAACATGAGCGGATTCAGCTGTCCCACTTGCAACAAAACTCTGCCCACTCTGATA
TCACTTAAGCGGCATATCAACTGGCACAATAATGTTGGTAAGAACATGGAAAAGAAATTG
GAATGCTTTGTATGTAAAGAGACCTTCCGATTCCAATGTCATTACAAACTGCACATGCGC
GATCACTACAAGGACACGAATCTAGACCCGGCCCTACTGACCTGCAACATCTGCAACAGG
AAAAGCAAGCACCTCCGGGCCGCTCAGGCACACATGAATTTCCATAAACAGACTCGCTTC
CAGAGCAAGGATTACGAATGTTCGATATGCAAGAGAGTGTTCCAGCATCGGAAGGTGTAC
CTCTCGCATATGGCGATACACTACAAACGCGGCGAGAGCACCAGCAACACTGTAGTCGGA
GCCGAGTTGCCCAATACGGTGGATAAAAACGTCTTTGACGGAACCTACAGCTGCCACCTC
TGCGGGAAGGTCTGCGATTCGGAAACCTCGTTGAAACACCACGTGATCTGGCACAGCTCG
AAGACGTCCCTGTACGGCGCTCGCCATCAGTGCGATATCTGCAATTTGCAGTTCACTAAC
AAGAAACGTCTCGAGCTCCATACTAGATCGCATTTCGAAGACGACAACGGACCTTTCAAG
TGCCACATCTGTGGGAAAGGATATCTAGTCGAAGATTACTTCAAGAGACACGTGAAGGGG
CATAACTTCGATCATCAGTCGCATAAAAAGAGGATAGAGAGGCTCAGGAAAGACAAAGTG
AAATGTCCGATTTGCTCGCGATACTATCCGGACCTGGGGAAACTGATCCGGCACCTGCGG
CGCACTCACCCGGAGAGCAAAATGATCAAACAGGACCCAGACGCCCCAACGCCTCGCTAT
TATTCTTGCAAGCTATGTGCGAAGGTCTTCTTGGACGAGCGGAGGTTGCAATACCACGAG
GAAGCCCATCTCAGAAAACCAGAGTTTTTCAAATGCAAGTTCTGTGGAAAGAAAACAATC
TCCCTGAAAAATCATAGGGTTCACATAAAGGGTCACTTGACACAGAAGTACATCGATAAT
CCTCTGAAATGTAGCCACTGCGAAGAAACATTTACACGCGGCTACGACCTCCAATACCAC
CTTCGAGACGCTCACGGCGTCAACGAGACGTGGATAGCGGAACGCGGCGTGCAGACTCCC
GACGGACCGCTCAAGGAGTTCCAATGCTCCATATGCTTTAAAATATTGGCCAGTAAAGGA
AACTTCGAACGACACATCGACTATCACAATTCGCTCCGATGCAATTACTGTTTCGAGTAC
TTCGGTAGTTCCAGGTTTCTGGAGGGGCATCTCACCTTCAGCTGCGATAAGAAGAAACTC
CTCGGCGACACCGAGATCTACCCCAAGAAGGTCAAGTGCCATATATGTTACAAGGCTTTC
CATTTGCAAGTCAAGTTAGACTGTCATTTGCGAACCCAGCACGACATAAGAACGTTCAAA
GAGGCGTTCGAAGGGAAAAAGGAAATCGTATGCGATTACTGTTTCAAAGTGTTCGAAAAC
GAATACGCTCTCAGCACGCACAAGATCTACCACCGCACTGTCGGGTACTACGGCTGTATC
TACTGCAACAGGAAATTCAATACCATGACCCTGTACAGGAAACATAAGAATCACCACTTC
TCCCAACTCAATGTGGACAACCCGACCAAGTGTGAACACTGCGATGAAACTTTCGTGGCC
TTCAGGAAGATGATCTACCATATGAGAGACGTCCACGGCGACCACAAGGAGTGGATCGTG
TTGCCAAAGGAATCCAAACAAGAGAAATGCAACATTTGTAACAAAACGTTCTTCAACCTT
CATAGACATCTGGATTATCACGAGGAGAACAAGTGTCAGAAGTGCGGGGAGTACTTCTAC
TCGCGGGCGGACTTCGACAATCATCTCTGTGCTATAGACAGCGAGGAGGAAGTCGCCGAC
ACTAACACTACCGGCGATCGCTGCCAGTACGAGGAGTGCGAGTTCTGCTTTAAACCAGTC
ACAAAGAAAAACTCAAAGAAAATGCATCTCCAAATCCATAGAGGCTCCGGTTCTATATCG
TGTCGATTCTGCGACCTCAAGTTCAAGACGATGGACGCGTTCAATATACACGCGTTTTCG
CACAGGAGCAGGAAATACAAAAAGAGACCCATCAAGTGTAGAAAGTGCGGTGAGCAGTTC
GTCAAGTACGGCCCCTTCATCCGACACATGAAGTTTGTTCACAAATCACTCAAGAAGCTG
CACTACAGAGCCACCGTGATGCCAGAGCAGTGCGTGGTCTGCAAGCAAGACTTCCCCAAC
CTGCACAACCACTATCGAGCTCATCTACAGAACCAGTGCCATCTGTGTCTCAAGTACTTC
ACATCTTCAAAGTTATTTTCGTTGCATCAATGCGACAAGGAGGAGTCTGATCCGACCAAA
GTGTTCACATCCGACGCCAACTTGACGGAGCTGATCAACTCCTATGTGCCGAGAGACGAA
AAAGACGACGAGAAATATTACGGATACGAAGACGAAGGCGAGAACTTGGACGAGAAAGCG
AATGAGAAAACGGAAGTGACGTCAAACGTGCCATCGCAGGACGAGGACAGTCAGGGCTCT
CTAAATGTAGAGGAAAAGAAAGTACACTCGTTGGTGCACGCGCCCATTATATCAGACGTT
CTGTCGCTGTATAAAAATAAATGTAGCAAAAACAGCATCCGGACTAAAGGTGACCAGAAC
AGCGTCGGTGGGAGCGTTGTGGTGCTCACGGACGAAGAGTCCGCGGACTACGAGTCTAAC
GAAGCCTCCGTCATCACAATAGACGACTAG
Protein sequence:
MALKLGKCRLCLKLGDFYSIFTMDNNLQLAEMVMECARVKIYEGDGLPDKICSECIQKLS
SAHIFKQQCERSDQELRRNYIPPPAFSSTPPPPNRQSSDSAISTHLEVSKPSSSNDSKMS
IESKLTPISRNRKRSKDSMGDASTSSDCRPGSSKRVEELRRTRKKPKMLSNYDSDFDDSG
SFYSQETDSDDPLLYKCDICSKAFKSKNSLSGHSKCHKRKNALKYDSVTKEDSMLNAYVP
ELSNVRDAHDDDDKLKCEKCGKEFKLKIMLKRHNEICSRPPMKELLISLEPINITRKRNR
LDCELCSLKSGTVEGLQEHMKLEHAMELDKDKVCMRDRDGKICVPCCYCEENIDDFYKYT
AHIGECTKKGNAADIVCPVCKQTTTKSNYLVHVKLHFFPTRTIESGATKENFQCRMCNKE
LPSQELLIKHLAAHMSNIDDADEGGDEESRASTVEDCGSIHSEYSINTPKTTLQCQHCDK
TFKYKKALQSHEEKHRREVKIEGPETHQSADSINVVDPSFAQYDSDTSQEDGEDDNTCDI
CEKQFSYKRQLLQHKRTKHHMTSGTKRAKINLKDCSVRCLICDIEMKVSAINEHNQTHIS
VNIKPKNQYTCIQCTEQFKSCSNLANHIKLIHRLKQQPMDSKMRADLADFCEVVVTKAEP
LDELQNHNGVNENSATDVKPLVNMSGFSCPTCNKTLPTLISLKRHINWHNNVGKNMEKKL
ECFVCKETFRFQCHYKLHMRDHYKDTNLDPALLTCNICNRKSKHLRAAQAHMNFHKQTRF
QSKDYECSICKRVFQHRKVYLSHMAIHYKRGESTSNTVVGAELPNTVDKNVFDGTYSCHL
CGKVCDSETSLKHHVIWHSSKTSLYGARHQCDICNLQFTNKKRLELHTRSHFEDDNGPFK
CHICGKGYLVEDYFKRHVKGHNFDHQSHKKRIERLRKDKVKCPICSRYYPDLGKLIRHLR
RTHPESKMIKQDPDAPTPRYYSCKLCAKVFLDERRLQYHEEAHLRKPEFFKCKFCGKKTI
SLKNHRVHIKGHLTQKYIDNPLKCSHCEETFTRGYDLQYHLRDAHGVNETWIAERGVQTP
DGPLKEFQCSICFKILASKGNFERHIDYHNSLRCNYCFEYFGSSRFLEGHLTFSCDKKKL
LGDTEIYPKKVKCHICYKAFHLQVKLDCHLRTQHDIRTFKEAFEGKKEIVCDYCFKVFEN
EYALSTHKIYHRTVGYYGCIYCNRKFNTMTLYRKHKNHHFSQLNVDNPTKCEHCDETFVA
FRKMIYHMRDVHGDHKEWIVLPKESKQEKCNICNKTFFNLHRHLDYHEENKCQKCGEYFY
SRADFDNHLCAIDSEEEVADTNTTGDRCQYEECEFCFKPVTKKNSKKMHLQIHRGSGSIS
CRFCDLKFKTMDAFNIHAFSHRSRKYKKRPIKCRKCGEQFVKYGPFIRHMKFVHKSLKKL
HYRATVMPEQCVVCKQDFPNLHNHYRAHLQNQCHLCLKYFTSSKLFSLHQCDKEESDPTK
VFTSDANLTELINSYVPRDEKDDEKYYGYEDEGENLDEKANEKTEVTSNVPSQDEDSQGS
LNVEEKKVHSLVHAPIISDVLSLYKNKCSKNSIRTKGDQNSVGGSVVVLTDEESADYESN
EASVITIDD