DPGLEAN11423 in OGS1.0

New model in OGS2.0  DPOGS204925
Genomic Position  scaffold2954:+ 1635-9103
CDS Length  921
Paired RNAseq reads  126
Single RNAseq reads  367
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA011419 (2e-46)
Best Drosophila hit  CG9895 (1e-34)
Best Human hit  Krueppel-like factor 5 (2e-35)
Best NR hit (blastp)  PREDICTED: similar to Krueppel-like factor 5 (Intestinal-enriched krueppel-like factor) (Colon krueppel-like factor) (Transcription factor BTEB2) (Basic transcription element-binding protein 2) (BTE-binding protein 2) (GC-box-binding protein 2) [Tribolium castaneum] (3e-41)
Best NR hit (blastx)  PREDICTED: similar to Krueppel-like factor 5 (Intestinal-enriched krueppel-like factor) (Colon krueppel-like factor) (Transcription factor BTEB2) (Basic transcription element-binding protein 2) (BTE-binding protein 2) (GC-box-binding protein 2) [Tribolium castaneum] (5e-39)
GeneOntology terms
GO:0006355 regulation of transcription, DNA-dependent
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0045941 positive regulation of transcription
GO:0005515 protein binding
GO:0030033 microvillus assembly
GO:0001525 angiogenesis
GO:0005622 intracellular
GO:0008270 zinc ion binding
InterPro families
IPR015880 Zinc finger, C2H2-like
IPR007087 Zinc finger, C2H2-type
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group  MCL17392

Nucleotide sequence:

ATGAGACGAAACCTCTCATCATTCCATGGATTCACCGTGCTGGTGCCGGGTCCATCGTTG
CAGGGAGAGGAGCTTCAACAGCGGCAAACGAGCATAGATAGGGCCTCTGACGGAGGTGGT
ACCGTCGCCCATAACATACACAACGTAAGGGATGTTGCGGATACGTTGCTGACGTTGATT
GCGGGAGAGGGAGAGGAAAGGGTGGGAAAAGGAAAGGGCAACCGTCTCCCTCACTCATCG
GTCGAAAGGCCATTAAAGGCTACTTCATGCCGATGCTCTGTGAGAGGGTGGTACTTCCCC
GGTGGAGCCAGCCCATCGTCTTCAGCGACATCTTCTAAGGTCAGAATGGAGGAGGCAGAT
TCTCTGGATTGGAGCGAGTCAAGCCCTGAATCAGTCAGGATTAAGAGAGGTCCGGGTCTG
AGGAGCGCCAGAAAAGTTAAGAATGCCATCAGATTCAAGCAGCAGGACACGCCAACACAG
CTGTCACAGTACACAAACAAACAGATACTCACGCCCGTCATGAATTTCAGCTACCAGCAG
ATAAAGCCAGCGGGGGATATGAGTTTTTTTCTAAACACAGCCCTACAGAACGCTCTCACA
GATTCACTGACAGAACTAAATAAGAAGTACAGAGAAGAGTCAGACGCCAATCAGAATAGG
AAGAGGCTTCATAAGTGCGATGTGGCGGGCTGTCACAAGGTTTACACGAAGAGCTCCCAT
CTGAAGGCGCACAAACGGACCCACACAGGAGAGAAGCCTTATAGCTGCGGCTGGGCCGGA
TGCAACTGGCGCTTCGCCAGATCTGACGAGCTGACTCGGCACACTCGCAAGCACACGGGA
CATAGGCCCTTCTCATGCCCTTTGTGCCGACGGGCCTTCGCTAGATCTGATCATCTAGGA
CTCCATATGCGGAGGCACTGA

Protein sequence:

MRRNLSSFHGFTVLVPGPSLQGEELQQRQTSIDRASDGGGTVAHNIHNVRDVADTLLTLI
AGEGEERVGKGKGNRLPHSSVERPLKATSCRCSVRGWYFPGGASPSSSATSSKVRMEEAD
SLDWSESSPESVRIKRGPGLRSARKVKNAIRFKQQDTPTQLSQYTNKQILTPVMNFSYQQ
IKPAGDMSFFLNTALQNALTDSLTELNKKYREESDANQNRKRLHKCDVAGCHKVYTKSSH
LKAHKRTHTGEKPYSCGWAGCNWRFARSDELTRHTRKHTGHRPFSCPLCRRAFARSDHLG
LHMRRH