DPGLEAN09471 in OGS1.0

New model in OGS2.0DPOGS202114 
Genomic Positionscaffold1100:- 33199-41041
See gene structure
CDS Length4389
Paired RNAseq reads  3419
Single RNAseq reads  8286
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006957 (3e-08)
Best Drosophila hit  crooked legs, isoform A (6e-19)
Best Human hitzinc finger protein 197 isoform 1 (4e-28)
Best NR hit (blastp)  PREDICTED: zinc finger protein 624 [Pan troglodytes] (1e-38)
Best NR hit (blastx)  PREDICTED: zinc finger protein 624 [Pan troglodytes] (6e-51)
GeneOntology terms





  
GO:0045449 regulation of transcription
GO:0005634 nucleus
GO:0046872 metal ion binding
GO:0006350 transcription
GO:0008270 zinc ion binding
GO:0003676 nucleic acid binding
GO:0005622 intracellular
InterPro families


  
IPR007087 Zinc finger, C2H2-type
IPR012934 Zinc finger, AD-type
IPR015880 Zinc finger, C2H2-like
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL23949

Nucleotide sequence:

ATGAATGTAAATTATGACCGTGTTTGTAGACTGTGCTTGTCATCTACATGTGAATTACTA
CCGATTTTTCCTACCACCAGTTCGGATGACTCGGAACCTCCTGTCCTCGCTTCGAAGATT
AAAGATTGCGTGTCAGTACAGATAAGGGAAAATGACGACCTGCCAACCAATGTCTGCAGG
AAATGTATGGATAATGTCAATAACTGGCACATATTTAAGGCAGTATGTGAAAGAACACAA
AACAGATTGCTTTCGTTTTTAAATAAGGAGAGTGACCAACTAGAAGAGGTGAAAATAAAA
AATGAACCTTTATCTGATGAAGCCTATGATGATGGAGTAGTCATTGATGGTTCATATCCT
GAAAGTGAAAATGCAGCCCCATCTAGTAAGGTCCAACCGGAGGGTCCACCTATCTTGGCT
TCATTGGGGCTCACACCAAGAAGTGATAAGGGAATGGATAGCGAAAAGGATGAGGACTAT
GAAGAGGAGGAAGAAATTGACCAACAGCAATTCACACACGTCCCAAACATGCCAGAAGTT
TCCATTACGGTCATGAGACCAACCGGTGAAACTCTTCACGCTCGTCAGGGTATTCAAGAA
ATAGCGTCCAAGGATTGTCTAGTTTGTGGAAGGTCGTACAGATATTCTCACAATGCCAGA
AGACATGAATTGAACGCGCACAGCTTTGACAGATATACAAATAAAATAACGAATAAGAAA
CAGCAATCACACATGCAACCAAAGTTCAGGCCGAATCCATTCAACCCCAAAGCCCGCCTC
ATGCCGAATCCCATAAGCCACAAAATGCAATTTCATTCAAAATCAATTCCGGGTAGAATG
CCACAGAGGATTGTCGCACCGCCGAAGCCGATACCAATTAAAACCGCTAAAGCTTCGCAG
AATAACTTGCCATATCCTCTTCGTATAAAAGCATTAAAAGATCTGCAGATCAAAAAGAAG
GAACCACAAATTTTAAAGACACTATTGACTTCCAAACCGGAAGTTTTGGTGTCTGAGCCT
GAAATAATTAATTCAGGTCCAGAAAGTCCTGAGACGTTAATTTCCGAACCGGAAATAGCG
TCTTTCCAAGTGGAGACAATTTTATCAGAACCAGATGGATATGTCAATCAACAACAAGAT
GATGACGAAGGTGTTAACGATAATAACCAAGGCCAACACTATGACACCGTTGATATGGAA
TCTGATAATGAAATTGAAATTGCACGGCAAAATGAAAATGAGATTGATGTAGATGAGAAT
CCCATGAATGATCCCGAAGAAAACCAGAATGATGAAGAGAATAATATGGATGATGGGCAA
GAGGACAGTGATAGAGGTATTGATGATAGTGATGACAGACAGGAGGATTTAGTAAATAAC
GTGGATGGTGAAGGTGGTGATGCCAAAGAAAGTCAGGATGATGAGGCGGAAAAAGATAAT
TACCAAAATATTAAGAATGAAGATAATGAAGATGACGAAATGCCAGCATTGAACATTGCA
CCCGTCGTCGAGATAAATGAAGACATGCAAACCAACTCTTACAACAGTGACGGTAACGAG
GAAGAAGAAGCTGACGAAACGGTTGATCCAAATGACACGGTTGATGGTGATGAAGCCGAA
AAAGATTTGGACCCAGATAAGGTGTACATCACAAAAACTCAAAGAGACTTCATCTTGAAG
TACCGTGATATCATCGAACAGATCAATACCAAGCGGTGTCTTTGTTGCAATAAAGAACAT
CCACGCAGGAAAGCGGTTATACAGCATTTGCAGAAAAACGGACACAAAGTCCCAAAGCAC
ACTTGTTATAATTGCGTAGTGACCTATGGTCATATCGGTGCTTTGTTGAGTCACATGAGA
TCGAACACTTGTACTGATCTGTGGAAAATAATTTATAACGAAAATGGTATAACCGATGAT
ATCGTGTTAGAAGATAAACCCGATAACAAAATTCCATACAAAGATGTTGTTAACGCTAGG
TCCTATGCTTGTAAATTATGTCCAGCGAAATTCCAGCTGAAACAATTTATTATGAAACAC
GTTTTAGATGTCCACGAGGATGGCCAATCACGTGTTCCGTTTTCTTGCGTCCATTGTGGA
ATGAGGTTTAAAGACAAAAATATTGGTAAAAGACATATTCGCAATGGAGATTGCACTGTG
TATATAGCTTGTGAGTTATGTTCTGACAAGTTTTTGAACATGCAAGATTTCAATGATCAT
GCTGTTTCCGTGCATGCTGGCAATTTAGATCCCGAAAACCAAAACAAATGTGTGGACGGT
CGACCGACTGATTGTCCTATCTGTGGTAAAAAGAACAGTAGTTATCCCAATTTAGTGAAG
CATTTGAAAGCTGTTCATAACGAAGAAAAGCCTCATCACTGCCAACATTGTGACTCTAAA
TTCGAGCAAACTACTGATCTTAACAAGCACATATACATGGAACATTCTGATAGAAGTTTG
GGTATGCAGTCTATTGAGCCCGACATGTCCATTGTGAAGGAGGAAGCCGAAGAATATCAT
TATTCATGTACGGAATGCAACGCCATCTTTGAAACTGTCGACGCGTGGACGGATCATCAA
GTTGCCGAACACAACCAGGTCGCACATCACTGCGACCAGTGTGAAAAGAAATTCTTACGA
CCATCAGAACTTGCTGAACATAAAAACACACATTTGCGAGTTAAATTTTATCCATGTAAC
GTATGTTCGAATTCCTACAGCACACCACAAAAGCTGACTGAGCATATGCAGCAGATGCAT
CCTGGGTCCAACGCACGTGGAGGTGATAGCGATTTCTATTGCGATATATGTGTTAGATCG
AATGACAGCGAGGAAAATACCATGCATGCTTCTACAGAAGAGGATGAATATAATGTTGAT
GAAAACGGTGCCATTTTAACGACGCCACAATTTAATAGTTTTAATTTCACTAACGACGCT
TTGCCATATTCATGTGAGTTATGCTATAAACGTTTTCCGCTGCGGACGTATTTGTGGAAA
CACAAACGCGCCAAGCACGGCATCACCAAACCTAACGCCGGTGAAGCTTCAGAAACGCAA
ACACAGCCGTCATCGGCTGAAGGGAGATCTAGTTGTACGATATGTAAAATAACATTCTCC
GATAAGAAATCATACTATCGTCATAGGAAAAACGTTCACAAGTCGACTGTCCAGATGTGT
AAGATATGTGGAAAGCCTTTGAATTCAACTTTGGAATTGTACGAGCATCTTAAAGCTGCG
CATGCTCGCGAGTTACTTGGATACAACGCCAACCAAGGTCCGAGCAAATCGCAGGAGGTT
GTTCAGGAAATGGAGGTGGAATATGACGAGGATCAAGATTCCGCTGACCCGAGCGTCGAT
TACCAAGCCAGATACCCGTGTGATACTTGTGGGAAGCAGTTCGTTGGACTTCTTGCTCTG
CAGAATCATCAGTGTATAAACCAGATTCAGACACAGCCACAAACATTCGAATGCGAAATT
TGTCACAAGAGCTACACGTCAATTGCCGCACTGAAGAGCCATCGGGGATGGCATTTACGG
TCTCCAGATGGAAAAGCGGCTGCCAATAACACTGGTTTATGGATGCCCCAAAACAAAGTG
ACGACCAAAGTCAGCAAACACGAAGTTGTAGATCCGTCTCAACTCGCGCGTGTTCAGCAT
TCAACACCAGCCAACATTGCTAAAAGGAGATTACCGCCGGAAGTCGAAATAACTGTGGTC
AATCCGAACAAAAAGCTTCGGTCCGATGATTCTATTGAATTAGATCATCAGAACAATTCA
TCTGGCGGTCCCGAAGACAAATACTGTAACATTTGCGATAAAGAATTCACGAAGCGAGCG
GCCTACCAGCGTCATATGGACGAAGTTCACCAACCGAACTCCGTGTTCTGTCCCGTATGC
GACAAGAGCTTCACCAGGAAATCGACATTAATAGTTCACATGAAGAAGCACTACGAAAGC
GGAGAAGGTACATCAGGGTCCACGCAAATGGATGAAGATTCGCACACGTGTGACGTATGC
GGCAGTGTGTTCGACAGCTCGAAGTCTCTGATGGCCCACAAGAACATGCATCATGGAGAG
GATGAATCCGACCAGTCTGAAGACGACGGCGGTGCGACTATACAGCCCCCAGGCGAGTTC
ACGTGCGCTCAGTGCGGCGACGGCGTCGCTACACCACGCGACTTAATAGCACATCGAGCT
ATGCACGCCACTCCGACGAAGTTCTTCTGTAATATTTGCAAGGTCTACTTTGCTAGAGCG
TTGGACCTCTCCTCCCACACTCGAGCCAGACATTCTGACAACGAAAAAGTATTCTTCCCT
TGCGCGATGTGCGACCGTTTCTATATGAACAAGAAGAGTTTGCAACGCCACATAGAAATG
GCTCACTGA

Protein sequence:

MNVNYDRVCRLCLSSTCELLPIFPTTSSDDSEPPVLASKIKDCVSVQIRENDDLPTNVCR
KCMDNVNNWHIFKAVCERTQNRLLSFLNKESDQLEEVKIKNEPLSDEAYDDGVVIDGSYP
ESENAAPSSKVQPEGPPILASLGLTPRSDKGMDSEKDEDYEEEEEIDQQQFTHVPNMPEV
SITVMRPTGETLHARQGIQEIASKDCLVCGRSYRYSHNARRHELNAHSFDRYTNKITNKK
QQSHMQPKFRPNPFNPKARLMPNPISHKMQFHSKSIPGRMPQRIVAPPKPIPIKTAKASQ
NNLPYPLRIKALKDLQIKKKEPQILKTLLTSKPEVLVSEPEIINSGPESPETLISEPEIA
SFQVETILSEPDGYVNQQQDDDEGVNDNNQGQHYDTVDMESDNEIEIARQNENEIDVDEN
PMNDPEENQNDEENNMDDGQEDSDRGIDDSDDRQEDLVNNVDGEGGDAKESQDDEAEKDN
YQNIKNEDNEDDEMPALNIAPVVEINEDMQTNSYNSDGNEEEEADETVDPNDTVDGDEAE
KDLDPDKVYITKTQRDFILKYRDIIEQINTKRCLCCNKEHPRRKAVIQHLQKNGHKVPKH
TCYNCVVTYGHIGALLSHMRSNTCTDLWKIIYNENGITDDIVLEDKPDNKIPYKDVVNAR
SYACKLCPAKFQLKQFIMKHVLDVHEDGQSRVPFSCVHCGMRFKDKNIGKRHIRNGDCTV
YIACELCSDKFLNMQDFNDHAVSVHAGNLDPENQNKCVDGRPTDCPICGKKNSSYPNLVK
HLKAVHNEEKPHHCQHCDSKFEQTTDLNKHIYMEHSDRSLGMQSIEPDMSIVKEEAEEYH
YSCTECNAIFETVDAWTDHQVAEHNQVAHHCDQCEKKFLRPSELAEHKNTHLRVKFYPCN
VCSNSYSTPQKLTEHMQQMHPGSNARGGDSDFYCDICVRSNDSEENTMHASTEEDEYNVD
ENGAILTTPQFNSFNFTNDALPYSCELCYKRFPLRTYLWKHKRAKHGITKPNAGEASETQ
TQPSSAEGRSSCTICKITFSDKKSYYRHRKNVHKSTVQMCKICGKPLNSTLELYEHLKAA
HARELLGYNANQGPSKSQEVVQEMEVEYDEDQDSADPSVDYQARYPCDTCGKQFVGLLAL
QNHQCINQIQTQPQTFECEICHKSYTSIAALKSHRGWHLRSPDGKAAANNTGLWMPQNKV
TTKVSKHEVVDPSQLARVQHSTPANIAKRRLPPEVEITVVNPNKKLRSDDSIELDHQNNS
SGGPEDKYCNICDKEFTKRAAYQRHMDEVHQPNSVFCPVCDKSFTRKSTLIVHMKKHYES
GEGTSGSTQMDEDSHTCDVCGSVFDSSKSLMAHKNMHHGEDESDQSEDDGGATIQPPGEF
TCAQCGDGVATPRDLIAHRAMHATPTKFFCNICKVYFARALDLSSHTRARHSDNEKVFFP
CAMCDRFYMNKKSLQRHIEMAH