DPGLEAN19875 in OGS1.0

New model in OGS2.0  DPOGS214467
Genomic Position  scaffold690:+ 9375-17749
CDS Length  1098
Paired RNAseq reads  80
Single RNAseq reads  216
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA013353 (1e-104)
Best Drosophila hit  onecut (9e-72)
Best Human hit  hepatocyte nuclear factor 6 (3e-71)
Best NR hit (blastp)  PREDICTED: similar to One cut domain family member 2 (Transcription factor ONECUT-2) (OC-2) [Apis mellifera] (4e-95)
Best NR hit (blastx)  PREDICTED: similar to One cut domain family member 2 (Transcription factor ONECUT-2) (OC-2) [Apis mellifera] (4e-83)
GeneOntology terms
GO:0003702 RNA polymerase II transcription factor activity
GO:0006350 transcription
GO:0005634 nucleus
GO:0030528 transcription regulator activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0003677 DNA binding
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0043565 sequence-specific DNA binding
InterPro families
IPR003350 Homeodomain protein CUT
IPR001356 Homeobox
IPR010982 Lambda repressor-like, DNA-binding
IPR009057 Homeodomain-like
IPR012287 Homeodomain-related
Orthology group  MCL11814

Nucleotide sequence:

ATGGACGATACACGGCTCAGGGAGCGAGCGCCTCTCACGGTCATAGTGGCGCCCAGTAAC
GTGTCTCCACCCCGGCTGTCCCCCGCGGACCTGCTGCCCGACGGAGACGCTGCCTTCCAC
CCGCTGTCCGCCGTCAACGGCCGCCTCACACCGCCCGGGCTCGAGCCCGCGTCCTACGCT
ACTCTGACGCCCCTGCTCCCTCTGCCCCCCATCAGCACCGTGTCCGACAAGTTCGCGTAC
CACGCGGGCGGCACTTTCACGGTCATCCAGCAGCAGCAGTCCTACGCGTCCTTGTCTCCG
ACCGCCTACAATGAGCCGCTGTCGCCGCAGTCCGCGTACAGTCGGCGGAGCGCGTCACCC
GGCTCGTACGAGCGCCGTTCCCCCTCGCCGCCGCTGCCCAGCCCGGGGCTGGACCTGAAC
GCGGCGCTTCTGGCCAGAGAGACGAGAGACGAGCAGGCGCAGCAGACGCAGCAGAACGAC
ACGGAGGAGATAAACACCAAGGAGCTCGCGCAGAGGATAAGCGGAGAACTGAAGAGGTAC
TCCATACCTCAGGCGATATTCGCTCAGAGGGTGCTGTGTCGGTCGCAGGGTACGCTCAGC
GACCTACTCAGGAACCCCAAGCCGTGGTCCAAGTTGAAGTCGGGCCGAGAAACCTTCAGG
CGGATGTGGAAATGGCTACAGGAACCCGAGTTTCAAAGGATGTCGGCCTTGAGACTTGCA
GCGGCCCAGATACCACAGAGAGGCAGCTGCAAGCGCAAGGAGGATATGGCCTCGGATACC
CTTCCTTCCCCGAAGAAGCCGCGTTTGGTTTTCACAGATCTGCAGCGGCGTACTCTGCAG
GCTATTTTTAAGGAAACAAAGAGACCATCAAAAGAGATGCAAGTAACGATAGCGCGTCAG
CTCGGTCTAGAGCCGACCACTGTTGGCAACTTCTTCATGAACGCTCGCAGACGATCCATG
GACAAATGGAAGGATGACGACGCACCATCCACAGACTTGGACCAGCAATGCGACGGGCAG
GATCTAGACCACGTCCCCAGTCTAGACTCGGCGGAGTCTGAGGAGGATCACGACGACCAA
GACGATCATCTATTGTGA

Protein sequence:

MDDTRLRERAPLTVIVAPSNVSPPRLSPADLLPDGDAAFHPLSAVNGRLTPPGLEPASYA
TLTPLLPLPPISTVSDKFAYHAGGTFTVIQQQQSYASLSPTAYNEPLSPQSAYSRRSASP
GSYERRSPSPPLPSPGLDLNAALLARETRDEQAQQTQQNDTEEINTKELAQRISGELKRY
SIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
AAQIPQRGSCKRKEDMASDTLPSPKKPRLVFTDLQRRTLQAIFKETKRPSKEMQVTIARQ
LGLEPTTVGNFFMNARRRSMDKWKDDDAPSTDLDQQCDGQDLDHVPSLDSAESEEDHDDQ
DDHLL
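
Consistency check (illustrative sketch, not part of the original record): the stated CDS length, the nucleotide sequence, and the protein sequence can be cross-checked by translating the CDS with the standard genetic code. The helper names `clean`, `translate`, and `check_record` are hypothetical, and the sketch assumes the standard nuclear codon table applies to this gene model.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove whitespace and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(cds):
    """Translate a CDS codon by codon; unknown codons become 'X', stops become '*'."""
    cds = clean(cds)
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


def check_record(cds, protein, expected_length):
    """Return True if the CDS has the stated length and translates to the protein.

    A single trailing stop codon in the translation is ignored, since the
    protein sequence in the record is listed without it.
    """
    cds = clean(cds)
    protein = clean(protein)
    translated = translate(cds)
    if translated.endswith("*"):
        translated = translated[:-1]
    return len(cds) == expected_length and translated == protein
```

Pasting the full nucleotide and protein sequences from this record into `check_record(cds, protein, 1098)` would perform the check for the whole model. As a spot check, the first seven codons, `ATGGACGATACACGGCTCAGG`, translate to `MDDTRLR`, which matches the start of the listed protein.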