New model in OGS2.0 | DPOGS210903  |
---|---|
Genomic Position | scaffold853:- 129721-139103 |
See gene structure | |
CDS Length | 3312 |
Paired RNAseq reads   | 1330 |
Single RNAseq reads   | 3222 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003087 (3e-92) |
Best Drosophila hit   | CG6654 (8e-38) |
Best Human hit | endothelial zinc finger protein induced by tumor necrosis factor alpha (5e-47) |
Best NR hit (blastp)   | PREDICTED: similar to zinc finger protein 585A, partial [Apis mellifera] (2e-87) |
Best NR hit (blastx)   | PREDICTED: similar to zinc finger protein 585A, partial [Apis mellifera] (1e-92) |
GeneOntology terms    | GO:0005634 nucleus GO:0006355 regulation of transcription, DNA-dependent GO:0003700 sequence-specific DNA binding transcription factor activity GO:0005730 nucleolus GO:0046872 metal ion binding GO:0008270 zinc ion binding GO:0005622 intracellular |
InterPro families    | IPR007087 Zinc finger, C2H2-type IPR012934 Zinc finger, AD-type IPR015880 Zinc finger, C2H2-like IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding |
Orthology group | MCL18525 |
Nucleotide sequence:
ATGGAAGTGTTTCTTTATAACTCTACAGTTTGTAGATTATGTGGCGAAGAAAATGATAAT
GGAACATTACTATATTCATGTGAAGAAAATAATCAAAGCTTATGTGAAATAATTAATACC
TATTTGCCAATAAAGGTATCTGATGATGGAGAACTACCACGGACTATTTGCCCTGGATGT
ACAATTCAATTGGAAGCAACAGTTGAATTTTTAAATCTAATTATAAATGGTCAAAAAATT
TTGCGTGAACTTTACCAACGAGAGAAGGAATACAAAAAGACTGTTCTTAATAATTCCAAT
AAAGGAACTCCGGAAGTTATATCAGAAAAAATCATTTACGAAATAAATACAAGCAATGGG
GTGTATCAAGTTGAGCATCCAATATCACTGCAGGTCAGCGGGCTTGATAAACCAAAGAGA
AAAAGAGGCCGTCCACCAAAGAAACAGAAGACTGCCGAGGAGATCGCCCAGGAAACTCCC
AAAACAGTGGAAATTGAGGATAAGACGGAGAAAGATGATGACGAACGTTCAGGGAAGAGG
AGGAGAAAAACACCTACCAGGTTCAAGGAAGCCGTTCAGGGCAAGGAGCTGGAAAGAATA
TTCATTGAAGAAGGCGTCATAGATGGCAATGAGAGCGACCACAACACAAAGGCTGATACG
ACACAGGAAAATAAATTACCGGTGAACAAGGAACCACAAGTTATAGGGCATTTGGAGGCG
TCCGGAGAGCTTGTTGTGGTGGTGAAGGGCAAGGGAAGGGGTAGACCTAAAGGTCGCACG
CGTCAAACCCGCGAGGAATGCGCCATATGTGGGCTTGAGTTTGCTGCGACTGGTCGCTAC
ATGTCCCACATCGCTCAGCATGGACCTGTTCTTTACAAGTGTGACTGCGGTCAAACATTC
ACTACTAAGCTACTGTTCTCCGAACATCAGAACACAAGCGGTCACAGCGGGCGGACGGTG
GTGCCCTGTAGAAACGAAGTCGAGTCTCAGAAAGAGTCCGAAAAGAATGAAACGCCTTTG
ATCGAATTGATACCCGAGGCCGTAGAGGATGTTGTCAAAGGAGATATACAAATACCTCAA
GCATTACCTGATTTGAGTGATCTCGACCCGCTGAAGTGTGATGACCATGTCAAGACTGAG
ACGGTGAAAAACGAACAAGAGAGAGAGGAGAATGACCCTCTGCAAGATGAGTGCGAGACA
GCTGACGGAACTCGTGAGGAAGTACAGGACAGCAAGAAGGAGAAGGTCAAGATTAAGTGC
AACCACTGCGATAAACTGTTCGGCACCCGGCAGAGCAAGTCGCTGCACATAAAGAGTACC
AACCGGGGGTTCCAGGAGGACTACGAAACTTACTTAAGTCGCACGCGTCAAACCCGCGAG
GAATGCGCTATATGTGGGCTGGAGTTTGCTGCGACGGGTCGCTACATGTCCCACATCGCT
CAGCACGGACCTGTTCTTTACAAGTGTGACTGCGGTCAAACATTCACCACTAAGCTACTG
TTCTCCGAACATCAGAACACAAGCGGTCACAGCGGGCGGACCGTGGTGCCCTGTAGAAAC
GAAGTCGAGTCTCAGAAAGAGTCCGAAAAGAATGAAACGCCTTTGATCGAATTGATACCC
GAGGCCGTAGAGGATGTTGTCAAAGGAGATATACAAATACCTCAAGCATTACCTGATTTG
AGTGATCTCGACCCGCTGAAGTGTGATGACCATGTCAAGACTGAGACGGTGAAAAACGAA
CAAGAGAGAGAGGAGAATGACCCTCTGCAAGATGAGTGCGAGACAGCTGACGGAACTCGT
GAGGAAGTACAGGACAGCAAGAAGGAGAAGGTCAAGATTAAGTGCAACCACTGCGATAAA
CTGTTCGGCACCCGGCAGAGCAAGTCGCTGCACATAAAGGCGGTACATCTCGGCGAGAAG
TCGTACGTGTGCCCGGAGTGCGGCGCGCGGTTTGCGTACCCCCGCTCGCTGGCCGTACAC
CGACAAGCTCACCGCAGGGCGAGGCCCTCCGCGGGCTACGCCTGCGATCTCTGCGGGAAG
GTGTTGAACCACCCGTCGTCGGTGGTGTATCACAAGCAGGCGGAGCACGCGGACCAGCGC
TACGTGTGCGGCGCGTGCGGCAAACAGTTCCGACACAAGCAACTGCTGCAACGACACCAG
CTGGTACACTCGCAGGCCAGGCCCTTCTCGTGTAAGGTGTGTAACGCCACGTTCAAGACG
AAAGCCAATCTTCTCAACCACCAGCTGCTGCACTCCGGCGTTAAGAAATTCTCGTGCGAA
ATTTGCAAACATAAATTCGCACACAAGACCAGCCTCACGCTGCACATGAGATGGCACACA
GGGGTCAAACCGTTTACTTGTGGCGTGTGCGGTAAGAGCTTCAGTCAGAAAGGGAACCTC
TCGGAACACGAACGCATCCACACTGGAGAGAAGCCGTATCAGTGTGCGCTGTGTCCTCGA
AGATTCACAACCTCGTCCCAGCACCGCCTGCACGCCAGGAGACACGCCGAACGAACACAC
TGCTGTGGAAAATGCGGGAAGCGCATGTCGTCCCGCAGCGTGTGGGCGGCGCACGTCCGG
CGCGATGACTGCACGACGCGGCGGTTGGCGCGACAAAAGGTCACAAAACAAATAAGTTTA
TTGGTAAACGACAAGAACCATCAGCCGGTGCAGCTGGAAGATCCCAAGCTGTCCGACGAC
AACACCGAGGAGAGGGTCATATACGTGGCCTACGACACCGAAGACTCCGAGTCCACCGCC
TTCCATATATTAGACCCAGAACAGGTGCAGACTGCTGATATAGAACAGAACAAAGTACTG
ACGACCTGCGAGCTTTATACACGACCGTCGCTGCTGGTGTCGCAACAACTACAGCAGTTA
CAGCTGGAGACGGCGGAACAGCAGGTGGTGGAACACGAGCAGCTGGAAATAGACGAACAC
CTGGAGCTGGAACACGAGGAACTCGGCCTGGACGACGAGCAAATTAAGATCGAGAACCAG
ATGGAGATTGAAGAAATTGAGGAAATAGAAACGAGTCCTGTAGTGGTCGGCGGGCAGAGC
ATACCCGTGACGGACGAGCGCGGTAACCCACTACACTTCACCATGGCTGACGGAACCAAG
CTGGCTATCACCTCCGTGGACGGCAAGTCGCTGCAGGTGATAACACAAGACGGCCAGACG
ATACCGGTGGAGATCAACGGATACGACAACCAAGACCAGGTGCCGCCGAGCCCCAACGCG
GTGGTTCACCAGCTCCACCTGCAGAAGACTCCGCCGCCCGCTCCCGTCACTCACTACTTC
ACTATCGTCTGA
Protein sequence:
MEVFLYNSTVCRLCGEENDNGTLLYSCEENNQSLCEIINTYLPIKVSDDGELPRTICPGC
TIQLEATVEFLNLIINGQKILRELYQREKEYKKTVLNNSNKGTPEVISEKIIYEINTSNG
VYQVEHPISLQVSGLDKPKRKRGRPPKKQKTAEEIAQETPKTVEIEDKTEKDDDERSGKR
RRKTPTRFKEAVQGKELERIFIEEGVIDGNESDHNTKADTTQENKLPVNKEPQVIGHLEA
SGELVVVVKGKGRGRPKGRTRQTREECAICGLEFAATGRYMSHIAQHGPVLYKCDCGQTF
TTKLLFSEHQNTSGHSGRTVVPCRNEVESQKESEKNETPLIELIPEAVEDVVKGDIQIPQ
ALPDLSDLDPLKCDDHVKTETVKNEQEREENDPLQDECETADGTREEVQDSKKEKVKIKC
NHCDKLFGTRQSKSLHIKSTNRGFQEDYETYLSRTRQTREECAICGLEFAATGRYMSHIA
QHGPVLYKCDCGQTFTTKLLFSEHQNTSGHSGRTVVPCRNEVESQKESEKNETPLIELIP
EAVEDVVKGDIQIPQALPDLSDLDPLKCDDHVKTETVKNEQEREENDPLQDECETADGTR
EEVQDSKKEKVKIKCNHCDKLFGTRQSKSLHIKAVHLGEKSYVCPECGARFAYPRSLAVH
RQAHRRARPSAGYACDLCGKVLNHPSSVVYHKQAEHADQRYVCGACGKQFRHKQLLQRHQ
LVHSQARPFSCKVCNATFKTKANLLNHQLLHSGVKKFSCEICKHKFAHKTSLTLHMRWHT
GVKPFTCGVCGKSFSQKGNLSEHERIHTGEKPYQCALCPRRFTTSSQHRLHARRHAERTH
CCGKCGKRMSSRSVWAAHVRRDDCTTRRLARQKVTKQISLLVNDKNHQPVQLEDPKLSDD
NTEERVIYVAYDTEDSESTAFHILDPEQVQTADIEQNKVLTTCELYTRPSLLVSQQLQQL
QLETAEQQVVEHEQLEIDEHLELEHEELGLDDEQIKIENQMEIEEIEEIETSPVVVGGQS
IPVTDERGNPLHFTMADGTKLAITSVDGKSLQVITQDGQTIPVEINGYDNQDQVPPSPNA
VVHQLHLQKTPPPAPVTHYFTIV