DPGLEAN12663 in OGS1.0

New model in OGS2.0: DPOGS210903
Genomic Position: scaffold853:- 129721-139103
CDS Length: 3312
Paired RNAseq reads: 1330
Single RNAseq reads: 3222
Migratory profiles: Query via corresponding ESTs
Best Bombyx hit: BGIBMGA003087 (3e-92)
Best Drosophila hit: CG6654 (8e-38)
Best Human hit: endothelial zinc finger protein induced by tumor necrosis factor alpha (5e-47)
Best NR hit (blastp): PREDICTED: similar to zinc finger protein 585A, partial [Apis mellifera] (2e-87)
Best NR hit (blastx): PREDICTED: similar to zinc finger protein 585A, partial [Apis mellifera] (1e-92)
GeneOntology terms:
GO:0005634 nucleus
GO:0006355 regulation of transcription, DNA-dependent
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005730 nucleolus
GO:0046872 metal ion binding
GO:0008270 zinc ion binding
GO:0005622 intracellular
InterPro families:
IPR007087 Zinc finger, C2H2-type
IPR012934 Zinc finger, AD-type
IPR015880 Zinc finger, C2H2-like
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL18525

Nucleotide sequence:

ATGGAAGTGTTTCTTTATAACTCTACAGTTTGTAGATTATGTGGCGAAGAAAATGATAAT
GGAACATTACTATATTCATGTGAAGAAAATAATCAAAGCTTATGTGAAATAATTAATACC
TATTTGCCAATAAAGGTATCTGATGATGGAGAACTACCACGGACTATTTGCCCTGGATGT
ACAATTCAATTGGAAGCAACAGTTGAATTTTTAAATCTAATTATAAATGGTCAAAAAATT
TTGCGTGAACTTTACCAACGAGAGAAGGAATACAAAAAGACTGTTCTTAATAATTCCAAT
AAAGGAACTCCGGAAGTTATATCAGAAAAAATCATTTACGAAATAAATACAAGCAATGGG
GTGTATCAAGTTGAGCATCCAATATCACTGCAGGTCAGCGGGCTTGATAAACCAAAGAGA
AAAAGAGGCCGTCCACCAAAGAAACAGAAGACTGCCGAGGAGATCGCCCAGGAAACTCCC
AAAACAGTGGAAATTGAGGATAAGACGGAGAAAGATGATGACGAACGTTCAGGGAAGAGG
AGGAGAAAAACACCTACCAGGTTCAAGGAAGCCGTTCAGGGCAAGGAGCTGGAAAGAATA
TTCATTGAAGAAGGCGTCATAGATGGCAATGAGAGCGACCACAACACAAAGGCTGATACG
ACACAGGAAAATAAATTACCGGTGAACAAGGAACCACAAGTTATAGGGCATTTGGAGGCG
TCCGGAGAGCTTGTTGTGGTGGTGAAGGGCAAGGGAAGGGGTAGACCTAAAGGTCGCACG
CGTCAAACCCGCGAGGAATGCGCCATATGTGGGCTTGAGTTTGCTGCGACTGGTCGCTAC
ATGTCCCACATCGCTCAGCATGGACCTGTTCTTTACAAGTGTGACTGCGGTCAAACATTC
ACTACTAAGCTACTGTTCTCCGAACATCAGAACACAAGCGGTCACAGCGGGCGGACGGTG
GTGCCCTGTAGAAACGAAGTCGAGTCTCAGAAAGAGTCCGAAAAGAATGAAACGCCTTTG
ATCGAATTGATACCCGAGGCCGTAGAGGATGTTGTCAAAGGAGATATACAAATACCTCAA
GCATTACCTGATTTGAGTGATCTCGACCCGCTGAAGTGTGATGACCATGTCAAGACTGAG
ACGGTGAAAAACGAACAAGAGAGAGAGGAGAATGACCCTCTGCAAGATGAGTGCGAGACA
GCTGACGGAACTCGTGAGGAAGTACAGGACAGCAAGAAGGAGAAGGTCAAGATTAAGTGC
AACCACTGCGATAAACTGTTCGGCACCCGGCAGAGCAAGTCGCTGCACATAAAGAGTACC
AACCGGGGGTTCCAGGAGGACTACGAAACTTACTTAAGTCGCACGCGTCAAACCCGCGAG
GAATGCGCTATATGTGGGCTGGAGTTTGCTGCGACGGGTCGCTACATGTCCCACATCGCT
CAGCACGGACCTGTTCTTTACAAGTGTGACTGCGGTCAAACATTCACCACTAAGCTACTG
TTCTCCGAACATCAGAACACAAGCGGTCACAGCGGGCGGACCGTGGTGCCCTGTAGAAAC
GAAGTCGAGTCTCAGAAAGAGTCCGAAAAGAATGAAACGCCTTTGATCGAATTGATACCC
GAGGCCGTAGAGGATGTTGTCAAAGGAGATATACAAATACCTCAAGCATTACCTGATTTG
AGTGATCTCGACCCGCTGAAGTGTGATGACCATGTCAAGACTGAGACGGTGAAAAACGAA
CAAGAGAGAGAGGAGAATGACCCTCTGCAAGATGAGTGCGAGACAGCTGACGGAACTCGT
GAGGAAGTACAGGACAGCAAGAAGGAGAAGGTCAAGATTAAGTGCAACCACTGCGATAAA
CTGTTCGGCACCCGGCAGAGCAAGTCGCTGCACATAAAGGCGGTACATCTCGGCGAGAAG
TCGTACGTGTGCCCGGAGTGCGGCGCGCGGTTTGCGTACCCCCGCTCGCTGGCCGTACAC
CGACAAGCTCACCGCAGGGCGAGGCCCTCCGCGGGCTACGCCTGCGATCTCTGCGGGAAG
GTGTTGAACCACCCGTCGTCGGTGGTGTATCACAAGCAGGCGGAGCACGCGGACCAGCGC
TACGTGTGCGGCGCGTGCGGCAAACAGTTCCGACACAAGCAACTGCTGCAACGACACCAG
CTGGTACACTCGCAGGCCAGGCCCTTCTCGTGTAAGGTGTGTAACGCCACGTTCAAGACG
AAAGCCAATCTTCTCAACCACCAGCTGCTGCACTCCGGCGTTAAGAAATTCTCGTGCGAA
ATTTGCAAACATAAATTCGCACACAAGACCAGCCTCACGCTGCACATGAGATGGCACACA
GGGGTCAAACCGTTTACTTGTGGCGTGTGCGGTAAGAGCTTCAGTCAGAAAGGGAACCTC
TCGGAACACGAACGCATCCACACTGGAGAGAAGCCGTATCAGTGTGCGCTGTGTCCTCGA
AGATTCACAACCTCGTCCCAGCACCGCCTGCACGCCAGGAGACACGCCGAACGAACACAC
TGCTGTGGAAAATGCGGGAAGCGCATGTCGTCCCGCAGCGTGTGGGCGGCGCACGTCCGG
CGCGATGACTGCACGACGCGGCGGTTGGCGCGACAAAAGGTCACAAAACAAATAAGTTTA
TTGGTAAACGACAAGAACCATCAGCCGGTGCAGCTGGAAGATCCCAAGCTGTCCGACGAC
AACACCGAGGAGAGGGTCATATACGTGGCCTACGACACCGAAGACTCCGAGTCCACCGCC
TTCCATATATTAGACCCAGAACAGGTGCAGACTGCTGATATAGAACAGAACAAAGTACTG
ACGACCTGCGAGCTTTATACACGACCGTCGCTGCTGGTGTCGCAACAACTACAGCAGTTA
CAGCTGGAGACGGCGGAACAGCAGGTGGTGGAACACGAGCAGCTGGAAATAGACGAACAC
CTGGAGCTGGAACACGAGGAACTCGGCCTGGACGACGAGCAAATTAAGATCGAGAACCAG
ATGGAGATTGAAGAAATTGAGGAAATAGAAACGAGTCCTGTAGTGGTCGGCGGGCAGAGC
ATACCCGTGACGGACGAGCGCGGTAACCCACTACACTTCACCATGGCTGACGGAACCAAG
CTGGCTATCACCTCCGTGGACGGCAAGTCGCTGCAGGTGATAACACAAGACGGCCAGACG
ATACCGGTGGAGATCAACGGATACGACAACCAAGACCAGGTGCCGCCGAGCCCCAACGCG
GTGGTTCACCAGCTCCACCTGCAGAAGACTCCGCCGCCCGCTCCCGTCACTCACTACTTC
ACTATCGTCTGA

Protein sequence:

MEVFLYNSTVCRLCGEENDNGTLLYSCEENNQSLCEIINTYLPIKVSDDGELPRTICPGC
TIQLEATVEFLNLIINGQKILRELYQREKEYKKTVLNNSNKGTPEVISEKIIYEINTSNG
VYQVEHPISLQVSGLDKPKRKRGRPPKKQKTAEEIAQETPKTVEIEDKTEKDDDERSGKR
RRKTPTRFKEAVQGKELERIFIEEGVIDGNESDHNTKADTTQENKLPVNKEPQVIGHLEA
SGELVVVVKGKGRGRPKGRTRQTREECAICGLEFAATGRYMSHIAQHGPVLYKCDCGQTF
TTKLLFSEHQNTSGHSGRTVVPCRNEVESQKESEKNETPLIELIPEAVEDVVKGDIQIPQ
ALPDLSDLDPLKCDDHVKTETVKNEQEREENDPLQDECETADGTREEVQDSKKEKVKIKC
NHCDKLFGTRQSKSLHIKSTNRGFQEDYETYLSRTRQTREECAICGLEFAATGRYMSHIA
QHGPVLYKCDCGQTFTTKLLFSEHQNTSGHSGRTVVPCRNEVESQKESEKNETPLIELIP
EAVEDVVKGDIQIPQALPDLSDLDPLKCDDHVKTETVKNEQEREENDPLQDECETADGTR
EEVQDSKKEKVKIKCNHCDKLFGTRQSKSLHIKAVHLGEKSYVCPECGARFAYPRSLAVH
RQAHRRARPSAGYACDLCGKVLNHPSSVVYHKQAEHADQRYVCGACGKQFRHKQLLQRHQ
LVHSQARPFSCKVCNATFKTKANLLNHQLLHSGVKKFSCEICKHKFAHKTSLTLHMRWHT
GVKPFTCGVCGKSFSQKGNLSEHERIHTGEKPYQCALCPRRFTTSSQHRLHARRHAERTH
CCGKCGKRMSSRSVWAAHVRRDDCTTRRLARQKVTKQISLLVNDKNHQPVQLEDPKLSDD
NTEERVIYVAYDTEDSESTAFHILDPEQVQTADIEQNKVLTTCELYTRPSLLVSQQLQQL
QLETAEQQVVEHEQLEIDEHLELEHEELGLDDEQIKIENQMEIEEIEEIETSPVVVGGQS
IPVTDERGNPLHFTMADGTKLAITSVDGKSLQVITQDGQTIPVEINGYDNQDQVPPSPNA
VVHQLHLQKTPPPAPVTHYFTIV
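
Consistency check (illustrative): the listed CDS length and the two sequences above can be cross-checked with a short script. The sketch below is not part of the original record. It assumes the standard genetic code (NCBI translation table 1), and the function names `translate` and `expected_protein_length` are hypothetical helpers introduced only for this example. It translates the first and last lines of the nucleotide sequence and derives the expected residue count from the CDS length of 3312.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a CDS length that includes one stop codon."""
    codons, remainder = divmod(cds_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGAAGTGTTTCTTTATAACTCTACAGTTTGTAGATTATGTGGCGAAGAAAATGATAAT"
last_line = "ACTATCGTCTGA"

print(translate(first_line))          # MEVFLYNSTVCRLCGEENDN
print(translate(last_line))           # TIV*
print(expected_protein_length(3312))  # 1103
```

The translated fragments match the start (MEVFLYNSTVCRLCGEENDN) and end (TIV) of the protein sequence, and 3312 nucleotides correspond to 1104 codons, that is, 1103 residues plus one stop codon.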