New model in OGS2.0 | DPOGS214691  |
---|---|
Genomic Position | scaffold15:43731-49436 (- strand) |
CDS Length | 1545 |
Paired RNAseq reads   | 275 |
Single RNAseq reads   | 1497 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA005595 (9e-81) |
Best Drosophila hit   | GATAd (4e-16) |
Best Human hit | transcription factor GATA-4 (8e-13) |
Best NR hit (blastp)   | PREDICTED: similar to GATAd CG5034-PA [Tribolium castaneum] (1e-42) |
Best NR hit (blastx)   | GATAd [Tribolium castaneum] (8e-28) |
GeneOntology terms | GO:0005634 nucleus; GO:0016251 general RNA polymerase II transcription factor activity; GO:0008270 zinc ion binding; GO:0043565 sequence-specific DNA binding; GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0006355 regulation of transcription, DNA-dependent |
InterPro families | IPR000679 Zinc finger, GATA-type; IPR013088 Zinc finger, NHR/GATA-type |
Orthology group | MCL17939 |
Nucleotide sequence:
ATGAAGAACCTGGAGAGTCGGCAGGCGGCTGTGACACTGCTGCAAATAAAAAACTACGAT
CCTACTAAATACGCCGTCAAGACTGAGGAAAGTCCTCACATAATGTTCAATAGCGTTCCG
AGTTTACCGCCCGCCGACAGAGCGAGAGAGGTTATGCATTGTAATGCCGTCATAGATATA
ATAAGCAAGGCTGTCGCGGTGGCACAGCGCGAGAACGTTGAATCTCAGAACTACCAGCCG
AATTACACGGGCGTCATAGACAGGACGTCCTCTACGGCCGCTGAGCTAGCGTACACGCAG
GACTACCCTGAGGAGCAGTACGCTGGGTACAATGCCGCTCATGACGTCACCAGCCCCGGC
AGCGATGACGACAACAAGGAAATGGACCTTGAACAGAACGAGCAGTGCGAGCGTGATACG
AGCGGCTTCTTACAGAGGAACCACGCCAGTAACAAATCCAGCTTCGTCGAGGAATACAAG
CAGCACGTGTTCGGGCAAGCTAAGAAGCATAATGACAGCTCGCCCGTGTACGAGGAGTGC
AGTCAGAGCAGCAGCGGCTCAGACCCTGATAGACTACAGATGGATATCTCTGAAGTATCG
CAGGACGACCCCGAAGAGACGCAATCGGTGCCATCAGCTCAGTCTTCCCCCAAGCCGCCT
CACGACAACGATACGGACAAGGAGTCCCTGTGGCAGGCGCTCCACAAACAGAACGGTCGT
GGCGGCGAGGCGACTCACCTGCTGCGGAGGCTCATCAACAGCAAACACCTGGGCATGACG
GTGTCCCCGCTCCGGGCCGCGCCCTCACCACTACCACAGACACACCCGCACACACACAAC
GGCACCGTGTCACCGAACGGTGAGTGGTCGAGTCCCACTCGCGGCGGGTCGGGGGCGGGC
GCCGGCACAGCACGCAGGAAGCAGAGCTGCCCGGCTCGAGCACAACCAGCCCTGGACACC
ACCGGCTGGACCAGCGACCAGCAGGAGAGTCCAGAGAGCGCGTCTAATACAACATCAGGC
GTGGTGTCGGGTGGGGCAAGGGGTCCTCGTGTGGAACTGTCCTGCAGTAACTGCGGCACT
CACACCACCACCATCTGGCGGAGGGACGCCCGCGGGGAGATGGTGTGCAACGCGTGCGGT
CTGTACTACAAGCTGCACGGTGTACCGCGGCCCAGCGCCATGAGGAGGGACACGATACAC
ACACGGCGCAGGCGGCCCAGACACGACGGGAAACATACTAGGAACACCTCGCCAGGCGGC
GGTGAGGGAGGGGGGACAGTGGTCAGTACTGAGGGGGAGGTGTCACGCGGTGGAGGCTCT
GGAGGGGGAGGGGGAGCCGCCGCAGGGGGGCCGTCTGACGGGGCCGAGGAGGCCGTGCTC
GCAGCGCTCAGGAGACAGCTACAACCTCACTTGCTGGCAGCACTACACGCACACACACCC
AGGGAACACACGCACACACGTACACAGGGCCGCAGCGTGTCGGAGTACGATGAGGCGCCC
CTGAACCTGGTGGCGAGTCACGTGGCCGCCGAGGAGACGCGCTGA
Protein sequence:
MKNLESRQAAVTLLQIKNYDPTKYAVKTEESPHIMFNSVPSLPPADRAREVMHCNAVIDI
ISKAVAVAQRENVESQNYQPNYTGVIDRTSSTAAELAYTQDYPEEQYAGYNAAHDVTSPG
SDDDNKEMDLEQNEQCERDTSGFLQRNHASNKSSFVEEYKQHVFGQAKKHNDSSPVYEEC
SQSSSGSDPDRLQMDISEVSQDDPEETQSVPSAQSSPKPPHDNDTDKESLWQALHKQNGR
GGEATHLLRRLINSKHLGMTVSPLRAAPSPLPQTHPHTHNGTVSPNGEWSSPTRGGSGAG
AGTARRKQSCPARAQPALDTTGWTSDQQESPESASNTTSGVVSGGARGPRVELSCSNCGT
HTTTIWRRDARGEMVCNACGLYYKLHGVPRPSAMRRDTIHTRRRRPRHDGKHTRNTSPGG
GEGGGTVVSTEGEVSRGGGSGGGGGAAAGGPSDGAEEAVLAALRRQLQPHLLAALHAHTP
REHTHTRTQGRSVSEYDEAPLNLVASHVAAEETR
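
The record's CDS length (1545 nt) should agree with the protein sequence: 514 residues plus one stop codon gives 3 × 515 = 1545. The following is a minimal sketch of that consistency check, assuming the standard genetic code applies to this nuclear gene. The helper name `translate` and the constants are illustrative and not part of the database. It translates only the final line of the nucleotide sequence and compares it with the end of the protein sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
# Assumption: this nuclear CDS uses the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Values copied from the record above.
CDS_LENGTH = 1545
PROTEIN_LENGTH = 514  # 8 full lines of 60 residues plus a final line of 34
LAST_CDS_LINE = "CTGAACCTGGTGGCGAGTCACGTGGCCGCCGAGGAGACGCGCTGA"
PROTEIN_TAIL = "LNLVASHVAAEETR"

# Length check: every residue needs one codon, plus one stop codon.
lengths_agree = CDS_LENGTH == 3 * (PROTEIN_LENGTH + 1)

# Sequence check: the last CDS line is in frame because each preceding
# line holds 60 nt (20 codons).
tail_agrees = translate(LAST_CDS_LINE) == PROTEIN_TAIL + "*"

print(lengths_agree, tail_agrees)
```

The same `translate` helper can be applied to the full nucleotide sequence to compare it against the complete protein sequence.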