DPGLEAN07907 in OGS1.0

New model in OGS2.0DPOGS214691 
Genomic Positionscaffold15:- 43731-49436
See gene structure
CDS Length1545
Paired RNAseq reads  275
Single RNAseq reads  1497
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005595 (9e-81)
Best Drosophila hit  GATAd (4e-16)
Best Human hittranscription factor GATA-4 (8e-13)
Best NR hit (blastp)  PREDICTED: similar to GATAd CG5034-PA [Tribolium castaneum] (1e-42)
Best NR hit (blastx)  GATAd [Tribolium castaneum] (8e-28)
GeneOntology terms




  
GO:0005634 nucleus
GO:0016251 general RNA polymerase II transcription factor activity
GO:0008270 zinc ion binding
GO:0043565 sequence-specific DNA binding
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0006355 regulation of transcription, DNA-dependent
InterPro families
  
IPR000679 Zinc finger, GATA-type
IPR013088 Zinc finger, NHR/GATA-type
Orthology groupMCL17939

Nucleotide sequence:

ATGAAGAACCTGGAGAGTCGGCAGGCGGCTGTGACACTGCTGCAAATAAAAAACTACGAT
CCTACTAAATACGCCGTCAAGACTGAGGAAAGTCCTCACATAATGTTCAATAGCGTTCCG
AGTTTACCGCCCGCCGACAGAGCGAGAGAGGTTATGCATTGTAATGCCGTCATAGATATA
ATAAGCAAGGCTGTCGCGGTGGCACAGCGCGAGAACGTTGAATCTCAGAACTACCAGCCG
AATTACACGGGCGTCATAGACAGGACGTCCTCTACGGCCGCTGAGCTAGCGTACACGCAG
GACTACCCTGAGGAGCAGTACGCTGGGTACAATGCCGCTCATGACGTCACCAGCCCCGGC
AGCGATGACGACAACAAGGAAATGGACCTTGAACAGAACGAGCAGTGCGAGCGTGATACG
AGCGGCTTCTTACAGAGGAACCACGCCAGTAACAAATCCAGCTTCGTCGAGGAATACAAG
CAGCACGTGTTCGGGCAAGCTAAGAAGCATAATGACAGCTCGCCCGTGTACGAGGAGTGC
AGTCAGAGCAGCAGCGGCTCAGACCCTGATAGACTACAGATGGATATCTCTGAAGTATCG
CAGGACGACCCCGAAGAGACGCAATCGGTGCCATCAGCTCAGTCTTCCCCCAAGCCGCCT
CACGACAACGATACGGACAAGGAGTCCCTGTGGCAGGCGCTCCACAAACAGAACGGTCGT
GGCGGCGAGGCGACTCACCTGCTGCGGAGGCTCATCAACAGCAAACACCTGGGCATGACG
GTGTCCCCGCTCCGGGCCGCGCCCTCACCACTACCACAGACACACCCGCACACACACAAC
GGCACCGTGTCACCGAACGGTGAGTGGTCGAGTCCCACTCGCGGCGGGTCGGGGGCGGGC
GCCGGCACAGCACGCAGGAAGCAGAGCTGCCCGGCTCGAGCACAACCAGCCCTGGACACC
ACCGGCTGGACCAGCGACCAGCAGGAGAGTCCAGAGAGCGCGTCTAATACAACATCAGGC
GTGGTGTCGGGTGGGGCAAGGGGTCCTCGTGTGGAACTGTCCTGCAGTAACTGCGGCACT
CACACCACCACCATCTGGCGGAGGGACGCCCGCGGGGAGATGGTGTGCAACGCGTGCGGT
CTGTACTACAAGCTGCACGGTGTACCGCGGCCCAGCGCCATGAGGAGGGACACGATACAC
ACACGGCGCAGGCGGCCCAGACACGACGGGAAACATACTAGGAACACCTCGCCAGGCGGC
GGTGAGGGAGGGGGGACAGTGGTCAGTACTGAGGGGGAGGTGTCACGCGGTGGAGGCTCT
GGAGGGGGAGGGGGAGCCGCCGCAGGGGGGCCGTCTGACGGGGCCGAGGAGGCCGTGCTC
GCAGCGCTCAGGAGACAGCTACAACCTCACTTGCTGGCAGCACTACACGCACACACACCC
AGGGAACACACGCACACACGTACACAGGGCCGCAGCGTGTCGGAGTACGATGAGGCGCCC
CTGAACCTGGTGGCGAGTCACGTGGCCGCCGAGGAGACGCGCTGA

Protein sequence:

MKNLESRQAAVTLLQIKNYDPTKYAVKTEESPHIMFNSVPSLPPADRAREVMHCNAVIDI
ISKAVAVAQRENVESQNYQPNYTGVIDRTSSTAAELAYTQDYPEEQYAGYNAAHDVTSPG
SDDDNKEMDLEQNEQCERDTSGFLQRNHASNKSSFVEEYKQHVFGQAKKHNDSSPVYEEC
SQSSSGSDPDRLQMDISEVSQDDPEETQSVPSAQSSPKPPHDNDTDKESLWQALHKQNGR
GGEATHLLRRLINSKHLGMTVSPLRAAPSPLPQTHPHTHNGTVSPNGEWSSPTRGGSGAG
AGTARRKQSCPARAQPALDTTGWTSDQQESPESASNTTSGVVSGGARGPRVELSCSNCGT
HTTTIWRRDARGEMVCNACGLYYKLHGVPRPSAMRRDTIHTRRRRPRHDGKHTRNTSPGG
GEGGGTVVSTEGEVSRGGGSGGGGGAAAGGPSDGAEEAVLAALRRQLQPHLLAALHAHTP
REHTHTRTQGRSVSEYDEAPLNLVASHVAAEETR