DPGLEAN13435 in OGS1.0

New model in OGS2.0DPOGS204622 
Genomic Positionscaffold827:+ 45370-50167
See gene structure
CDS Length3909
Paired RNAseq reads  1388
Single RNAseq reads  3434
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012183 (2e-84)
Best Drosophila hit  CG3281 (1e-19)
Best Human hitzinc finger protein 91 (2e-53)
Best NR hit (blastp)  PREDICTED: similar to zinc finger protein 91 (HPF7, HTF10) isoform 4 [Canis familiaris] (1e-70)
Best NR hit (blastx)  PREDICTED: zinc finger protein 91 isoform 2 [Pan troglodytes] (2e-81)
GeneOntology terms




  
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0008270 zinc ion binding
GO:0005622 intracellular
GO:0046872 metal ion binding
GO:0005634 nucleus
InterPro families



  
IPR011011 Zinc finger, FYVE/PHD-type
IPR012934 Zinc finger, AD-type
IPR015880 Zinc finger, C2H2-like
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR007087 Zinc finger, C2H2-type
Orthology groupMCL40724

Nucleotide sequence:

ATGAGCAAACAAGTGGATGTCAAGGCGTTAGTGTCACACGTAGTCCGCGGCGACGGTGTG
CAAAGATGTAGAATATGTATGGGAGATACATCCGAAGGGCAAGTTCATTTAGAAGATACG
GTGATGATGGACGGAGATAAACCCATCACATTATCAGAATTACTAGAAACAGTCACCGGA
GTTCAGGTGGTGCTCGAGGGGGACCTGCCTCCTGGAGTGTGTCCGGCCTGTCTCATGTGT
GCTCTGAGTGCTGCAGAGTTCCGATCCCTGTGCCAGCAGGCCGCCAGCCAATGGGAACTG
ACCGTGGAACTCCTGAATGGCCTCACCTCACAATGTGATGGAGAGAAGACTATATTTGCA
TTATTAGAAAAAAGCCAAATGATTGTCATCAAAGATAACACTTGCATCAAAACTATGAAT
ATAGTCGACAGACTGAATGAGCACTTGACTGAACCAAAGAAGGCCGAGTCTCCCATATCC
TGCTCTTGCCCAAACTGCGGCAAGCAGTTCCAATATGCCCATCAATTGAGCCACCATTTG
AAAGAATCCATGGATATGCAACGGGCTTGTTATATTTGTGCAAAAATAATGTCCCGAAGT
GAATTAATCCAACACATGCATCAGGTACATGACAAGGAATCATTTGATTGTAAGAAGTGC
CCGGCAGTTCTGTGCTCCTACAGCCAGTATAAACTACATTTGGCCAAATCACATTTCCAT
ACAGCGTGTATGTGTGTCGAATGTGGACGCAGCTTCCAAACTCAGAACGCGTTCCACGCT
CACGTGTCCGTCCACCGACCTCAAACCTGCCCGAGCTGTAACAAGCTGTTCCGCAACCAA
ACCTGTTACGTTCATCACGTGAAATGGTGCTGCAATCTGGACAAGAATAGAGAGGACACC
TTCAAAACCAAGACCAAAGTGACCGTGGAAGTGAAAAACGAAGTCAGCAAGCGGAAGATC
AAAGTTGGTCTACGGGGGAGCGCAAACCAGCAGTGCATATGCGACTACTGCGGCAAGAAG
TTCGCTGGGAAGAAGTTCGTCGCTGCTCACATCCAGATAGTTCACATGAAGAACACACAC
CGACCTTGTGTTTATTGCGGCAAGTTGCTGGCGGCTGCACACATGTCGGAGCACGTGAAG
TATCACGAGGAGGACAGGTCGTTCACCTGTCAGCATTGCGGCGTGGTGCTTAAAACTAGG
CTGGGCTACACGCAGCACATACGCCTGCATACCGGTGAAAGGCCGTACGCCTGCAAGTTC
TGCGGCCAAACTTTCTCAGCGTCCTCGCGAAGATCGGAACATATACGTAAAGTGCACAAG
AGTTCTGACATCGTGTTGAAACACGCCTGTCAATACTGTCCAGCGAGGTTCCGTCTCCCG
TACCGACTGAAGAAGCACCTGGTAGCTGTTCACAACAACGAAGACCAAAGCTTGGATTAC
GAGTGCGCTGAGTGCCACGAAAAGTTCGGATCCTGTCGTGGTCTCTTGCATCACAGCCGG
AAACACCAAAACGTGAAATTCCTGCCAAAGAGAAAGGAGAGATGTGTCTTTAAAGTCTTC
AGCGATGTGCTGGAGGAATGGGAACTGCCACGAGGGTTGTGCTGCGATTGCTCCGAAACC
GCCCTCACAGCATATCAGTTCAGGCAGCTTTGCAAACAGTCCGACAACCATTGGAGGAGG
TCCATGGAAGTCATAAACAAAGCCACACTGATACCGCGCGACCATAAAACCTTGTATATA
TCGTACGCAGACGAAGTTGTATATAAAGATGATGGGATAACGAATTCAGCGATGGCGGCC
GCCAGACTTAATGCCATCAGGAAAGGTGCTAACCGGCCGAGATACAAGAAGCACGTGAAA
ATTAACGGTGAATCGAAATGCCGCGATTGTGGCAAAAAATTCCCTCTACCATACTACCTG
AACAGACATCTGAAAAGCACACCAAAGCGGGCCTGCACTCAGTGTGGCGAGGTGATGCTT
AAGGAGAAACTGGCTAGGCATTTGGAAACTATACACCAGATACATGTTTTTTACTGCGAC
ATCTGCTACCAGCTGTTCAATGATTTATGCGGTTTGGGGCGCCACAGGGAGGTTTATCAC
AGAGATAGCGCATTGGAATGCAAGGTGTGCAGGAACGGCTTCACCAGCGACAGATCGCTG
GCGGCTCACATGTACTCGCACACATTATTTAACTGCTCGTCCTGCAACAGAGCGTTCGAG
AATCGCAAGTGCTATATTTACCATAAAAGCGAATGCACCGGAAGGAAGTCCTACACCCAG
AACCTTTATGAATGTCATGATTGTGGTAGCAAATACACCAAGAAACCCTCGCTGAGGATA
CATATAGTGCAGAAACATTTGAATGTCCTGCCGTTTGTCTGCCAGACTTGCGGGAAGAGG
TGTTCCACTAGGAACCACTTGAGGTCTCATGAGCTGGTTCACAAGACCGAGAGACAGGTC
TACGAGTGTTACTGCGGTGCAAAAATGCGATCGGCTCTCGGCTTTGAGATGCACCAGAGG
ATTCATTCGGGGGAGAAGCGATATGTTTGCGAGGAGTGTGGAGACAGGTTCCTCTCGGCC
TCGAGGAGGTTGGACCACGTCAAGAGGAAGCACAGATCTAAAGAGTTAGCCCACGGCTGC
GACAAATGCGATGCAAGATTCCTGAGGGAGGAGCTTTATGTACCCAAGGGTATCTGCATT
GGGTGCTCCAGTGTGGCGATCTCGGCCTTCGAGTTCCGGATATTCACCAGGAACTCGGTC
AGTCTGTGGAGGAACTGCATCAACACCATAGACACGTTGCCGAAAATGTCATGCAATAAG
TCCGTATACGCCATCCTGAGGGGAAATATGTCGCTGCAGGCGGTGAACAGCTTCAACGGC
GGGAAGACGGAGCTGGTGGAACATCTCTCGAACCGTCTGACCAAGAAGAAACCTTTAGTA
GTGGAGAAGAAGCCGAGACATCCCAGAACGGGGCCGGCCTGCACTTGTGTCGACTGCGGA
AAGGCGTTCCTAAGTCCGTACTACTTGAATCTACATTTGCGCAACAGCGGACAGAAGGAG
GCCTGCTGGCTGTGCGGGGCCATGGTGGTCAGGGGCAGAGAGATGAAGGAACATTTGTCA
ACAATACACAAGACTGATATGATTCTATGTCCGGATTGTCCCACGTTGTTGAAAAGCGAA
CAGGAATGCAAGAGGCATCTCAAGAAGTGCCACGGCCCGGGGAATTTGACGTGTGCGGAC
TGCGGCAGGACCTTCCAGAGGCAGACCTCCTTCGAGGTGCACACGCAGATGCATACCGTC
AGGACCTGCAGGGCCTGTGGCGCGCAGTTCACCAATCGCGGCTGTTACAGAGAGCACAGA
TCCAAATGCGAACCGGACGCGAAGCCAGACAGGAAGTCGGTCCCGAGGAGTCGGAGGTCA
AACGTCCGCGACCCGGCCACATTCACGTGCGACTACTGCTCGAAGACGTACCACTCGCGG
CCGCAGCTGAAGAATCACATAATGTGGATACACATGGACGTGAGGCCGCATCAGTGCCAG
TGGTGCGGGAAGAGGTTCTACACGCCGGCGCGTCTGGCCGAGCACATGGTGGTTCATACG
AGGGTCAGAAACTTCGAGTGTGACATCTGCGGTGCGAAACTGGTCTCCAAGATGGCGGCC
GTGTATCACAGGCGGAGGCATACGGGGGAGAGACCCTACGAGTGTCCGGATTGTGGGGAG
AGGTTCATATCGTCGTCCAGGAGGTCGGAGCACGCCAAGAGGAGGCACAACAGAGGTCTC
AGGTTCCAATGCCCCCACTGTCACGCCTCGTTCGTCAGGAGCCATGAGCTGAAGAAGCAC
ACAGACAAAGTCCACAACACGGACGAGCTGAGAGTTAAGAAAGACAAGAACTGTTTAGAG
AAAGTTTGA

Protein sequence:

MSKQVDVKALVSHVVRGDGVQRCRICMGDTSEGQVHLEDTVMMDGDKPITLSELLETVTG
VQVVLEGDLPPGVCPACLMCALSAAEFRSLCQQAASQWELTVELLNGLTSQCDGEKTIFA
LLEKSQMIVIKDNTCIKTMNIVDRLNEHLTEPKKAESPISCSCPNCGKQFQYAHQLSHHL
KESMDMQRACYICAKIMSRSELIQHMHQVHDKESFDCKKCPAVLCSYSQYKLHLAKSHFH
TACMCVECGRSFQTQNAFHAHVSVHRPQTCPSCNKLFRNQTCYVHHVKWCCNLDKNREDT
FKTKTKVTVEVKNEVSKRKIKVGLRGSANQQCICDYCGKKFAGKKFVAAHIQIVHMKNTH
RPCVYCGKLLAAAHMSEHVKYHEEDRSFTCQHCGVVLKTRLGYTQHIRLHTGERPYACKF
CGQTFSASSRRSEHIRKVHKSSDIVLKHACQYCPARFRLPYRLKKHLVAVHNNEDQSLDY
ECAECHEKFGSCRGLLHHSRKHQNVKFLPKRKERCVFKVFSDVLEEWELPRGLCCDCSET
ALTAYQFRQLCKQSDNHWRRSMEVINKATLIPRDHKTLYISYADEVVYKDDGITNSAMAA
ARLNAIRKGANRPRYKKHVKINGESKCRDCGKKFPLPYYLNRHLKSTPKRACTQCGEVML
KEKLARHLETIHQIHVFYCDICYQLFNDLCGLGRHREVYHRDSALECKVCRNGFTSDRSL
AAHMYSHTLFNCSSCNRAFENRKCYIYHKSECTGRKSYTQNLYECHDCGSKYTKKPSLRI
HIVQKHLNVLPFVCQTCGKRCSTRNHLRSHELVHKTERQVYECYCGAKMRSALGFEMHQR
IHSGEKRYVCEECGDRFLSASRRLDHVKRKHRSKELAHGCDKCDARFLREELYVPKGICI
GCSSVAISAFEFRIFTRNSVSLWRNCINTIDTLPKMSCNKSVYAILRGNMSLQAVNSFNG
GKTELVEHLSNRLTKKKPLVVEKKPRHPRTGPACTCVDCGKAFLSPYYLNLHLRNSGQKE
ACWLCGAMVVRGREMKEHLSTIHKTDMILCPDCPTLLKSEQECKRHLKKCHGPGNLTCAD
CGRTFQRQTSFEVHTQMHTVRTCRACGAQFTNRGCYREHRSKCEPDAKPDRKSVPRSRRS
NVRDPATFTCDYCSKTYHSRPQLKNHIMWIHMDVRPHQCQWCGKRFYTPARLAEHMVVHT
RVRNFECDICGAKLVSKMAAVYHRRRHTGERPYECPDCGERFISSSRRSEHAKRRHNRGL
RFQCPHCHASFVRSHELKKHTDKVHNTDELRVKKDKNCLEKV