DPGLEAN18538 in OGS1.0

New model in OGS2.0  DPOGS213479
Genomic Position  scaffold1949:+ 25677-27161
CDS Length  1338
Paired RNAseq reads  50
Single RNAseq reads  189
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA004361 (1e-133)
Best Drosophila hit  pita, isoform A (8e-18)
Best Human hit  zinc finger protein PLAGL2 (3e-32)
Best NR hit (blastp)  PREDICTED: similar to lost on transformation protein 1 [Tribolium castaneum] (2e-51)
Best NR hit (blastx)  PREDICTED: similar to lost on transformation protein 1 [Tribolium castaneum] (4e-52)
GeneOntology terms
GO:0005622 intracellular
GO:0008270 zinc ion binding
GO:0003677 DNA binding
GO:0045449 regulation of transcription
GO:0046872 metal ion binding
GO:0005634 nucleus
InterPro families
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR015880 Zinc finger, C2H2-like
IPR007087 Zinc finger, C2H2-type
Orthology group  MCL24418
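
The genomic position listed above can be compared against the CDS length with a few lines of Python. This is a minimal sketch, not part of the original record: the function name parse_position is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re

def parse_position(text):
    """Split a 'scaffold:strand start-end' string into its four parts.

    Hypothetical helper; the layout is inferred from the record's
    'Genomic Position' field.
    """
    match = re.fullmatch(r"([^:\s]+):([+-])\s*(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)

scaffold, strand, start, end = parse_position("scaffold1949:+ 25677-27161")

# Assumption: coordinates are 1-based and inclusive.
span = end - start + 1
cds_length = 1338  # value of the 'CDS Length' field

# A positive difference means the genomic span is longer than the CDS.
print(scaffold, strand, span, span - cds_length)
```

Under that coordinate assumption the genomic span is longer than the CDS, which would be consistent with a gene model containing non-coding sequence between its first and last coding bases.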

Nucleotide sequence:

ATGGCCAGTCCGCCACCGAACCCTAGAGCAGGTGCGGAGAGCGGCTCCCAGTTCACGGAG
GAAACGGGCGGCCCACGCGCGGCCGCCCCGCAGAAGAAGTTCATAGCACCGCCTGCCTCG
CAGCTGCCAACCAAGTTGGAGTACGTCGCTCGCGGAGGAACGGGCCGCAGCGTCAAGCAC
ACCTTCCGAACAGTCAAGTTCGCTCGCAGAGTGCCCACGCTGCCTCCCAAGAAGCCCGAA
GGTGGGGCGGGCGGCGCGGGTGAGGCGAGCGGCAGCGGCGCCTCGGCCGAGCGCCGCCCG
CGTCGCGACCGCAGCAAACGGCATGTCTGCACCACTTGCGACAAACGCTTCTCCAGCCCC
GGCAAGCTGAGTCAACACGTTCTCTCTCATACTGGTGAGCTCCCGTTTTCGTGTGATTTA
TGTGACAAGCGCTTCAATTCTAAATTTAAACTAGTGCGTCACAGTCTCATCCACAGTGAG
TCTAGAGCCTTCGCGTGTACCGTCTGCGCGGAGGAAGGCAATTTACAGTGCAAGATTTGT
GATGAAGTGTTCAATTCGAGACAAGAAATTGTTAACCATCTTAAAGTTCATACAGGGAGT
CGAGCGCCCAAAAGTGACACTGATAAGAAGTTTACCTGTGATCATTGTGACAGAAGGTTT
TTTACAGCAAAAGATGTTAGACGACATCTTGTGGTTCACACAGGAAGGAGAGATTTTTTA
TGCCCTCACTGCCCACAAAAGTTTGGCCGTAAAGATCACCTGGTCCGTCATGTTAAGAAT
GCACATCCCGAGGAGTCATGGAAATCAGCAGCTGTGGGTACATCCAAAGACCCACCGCCA
GAAGCTACCTCATTTGAAGAAACTTATACAGAATATAACATTGAAGAAACAGATTTCAAC
CTTTGGAAGACCATAACTCCTAAAGAAGAAGTACCAGAGGGACGGACAGAGGTCCCGGCT
TCTGATATAATAGTAGAAATACCAGATTTAGGGATCAAAGTGGAGCCTTTAGATATAAAG
TTAGAAAATCCTCAATCTCCCAGTGAAATACTCGAATATCCAGTAGTTTATATGACCGAC
TTGCCATATATAACACAACCAATTCGAGATCCTATTGATGTGCATTTACTTAGTTCTGGC
AACGTTCAATCAATATTGTTGGACCCTGGCGAGGGTCCTTCGGGATTGTCGAGTCAGATG
TTGGGGCTGTTGGAGGAGGGTGAACCTTCTTATCCTAGCGACGAGGGACGGGTGCAGCAG
AGGCTGCCGGCTTTCACGCAGGCTTTTCAGACCGCCCAGAGCCCTAAGCCTCCCCCGCCC
CCGCCCCCGCCGCACTAA

Protein sequence:

MASPPPNPRAGAESGSQFTEETGGPRAAAPQKKFIAPPASQLPTKLEYVARGGTGRSVKH
TFRTVKFARRVPTLPPKKPEGGAGGAGEASGSGASAERRPRRDRSKRHVCTTCDKRFSSP
GKLSQHVLSHTGELPFSCDLCDKRFNSKFKLVRHSLIHSESRAFACTVCAEEGNLQCKIC
DEVFNSRQEIVNHLKVHTGSRAPKSDTDKKFTCDHCDRRFFTAKDVRRHLVVHTGRRDFL
CPHCPQKFGRKDHLVRHVKNAHPEESWKSAAVGTSKDPPPEATSFEETYTEYNIEETDFN
LWKTITPKEEVPEGRTEVPASDIIVEIPDLGIKVEPLDIKLENPQSPSEILEYPVVYMTD
LPYITQPIRDPIDVHLLSSGNVQSILLDPGEGPSGLSSQMLGLLEEGEPSYPSDEGRVQQ
RLPAFTQAFQTAQSPKPPPPPPPPH
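
The two sequences above can be checked against each other by translating the nucleotide sequence and comparing the result with the protein sequence. The sketch below is illustrative only: it assumes the standard genetic code applies and that the nucleotide sequence is the spliced CDS in frame from its first base. The names CODON_TABLE and translate are hypothetical, not taken from the record.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame CDS, dropping one trailing stop codon if present."""
    cds = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)

# First 30 bases and final 18 bases of the nucleotide sequence above.
print(translate("ATGGCCAGTCCGCCACCGAACCCTAGAGCA"))  # MASPPPNPRA
print(translate("CCGCCCCCGCCGCACTAA"))              # PPPPH
```

Pasting the full nucleotide sequence into translate and comparing the result with the protein sequence gives an end-to-end check. The listed CDS length of 1338 corresponds to 446 codons, that is, 445 residues plus the terminal stop codon.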