DPGLEAN05986 in OGS1.0

New model in OGS2.0DPOGS210964 
Genomic Positionscaffold1784:+ 10065-28230
See gene structure
CDS Length1011
Paired RNAseq reads  244
Single RNAseq reads  615
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006394 (3e-75)
Best Drosophila hit  Sex combs reduced, isoform A (2e-62)
Best Human hithomeobox protein Hox-B5 (1e-35)
Best NR hit (blastp)  sex combs reduced homolog [Bombyx mori] (5e-151)
Best NR hit (blastx)  sex combs reduced homolog [Bombyx mori] (3e-148)
GeneOntology terms









  
GO:0003704 specific RNA polymerase II transcription factor activity
GO:0006357 regulation of transcription from RNA polymerase II promoter
GO:0005634 nucleus
GO:0007494 midgut development
GO:0007379 segment specification
GO:0007381 specification of segmental identity, labial segment
GO:0045498 sex comb development
GO:0007548 sex differentiation
GO:0007432 salivary gland boundary specification
GO:0043565 sequence-specific DNA binding
GO:0003700 sequence-specific DNA binding transcription factor activity
InterPro families





  
IPR001356 Homeobox
IPR012287 Homeodomain-related
IPR017970 Homeobox, conserved site
IPR001827 Homeobox protein, antennapedia type, conserved site
IPR020479 Homeobox, eukaryotic
IPR017995 Homeobox protein, antennapedia type
IPR009057 Homeodomain-like
Orthology groupMCL16519

Nucleotide sequence:

ATGAGTTCCTATCAATTCGTCAATTCCCTGGCCTCGTGCTATGGGAATCAGGTCCCGGGA
CGGACTGGGACACCCGTGGATCAGAGCGGTCACCCGGGCCTGCCGACACCCGGGACGGAT
TACTACAATCCGAACGCGGCAGCGTCCTATCCGAACACGTGTTATTCACCTCCACAAGTT
AGCCATCATTATCCCCAACATCCATACGCCACACCGGCCGCTGGTGCACACATGCAGCCA
CAAGCCATGATAGATTACACACAACTCCATCCCCAAAGACTCGCGAGCAGCGCGACTCAC
ATCCACCAACACACGAACCCAAGCCCTGGTGCGTTATCACCGAACCTAATGACACCAACG
AGTCAGACGGCTAGTGCTTGTAAATTCGCCGATTCAACTTCGACGACGGGAGTGGCATCC
CCACAGGATCTATCCACATCGTCAGGCCCTGGGAGGACTTCACCTGGATTTAATGTTAAT
CCAGCCGGAACGAGTACCAAATTAGGCTTGACGACACCAATAGCGTCTCCAGTGGAACAC
AAGGCTAATATCAACCAGAATATATCGAGCCCAGCTTCAAGCACCTCGAGCAATGAGAGT
GCCGAAGCTAACAGCACCAGTACAAAGAACACCAAATCATCAGCCAGCGCTCAAGCGAAT
CCACCGCAGATATATCCATGGATGAAACGAGTGCATCTAGGACAGAGTACAGTGAACGCT
AACGGTGAGACGAAACGTCAGAGAACTTCCTACACCCGGTACCAGACGCTGGAACTGGAG
AAGGAGTTCCACTTCAACCGCTACCTGACGAGGAGGAGACGCATAGAGATCGCACACGCG
CTCTGTCTCACCGAGAGACAGATCAAGATATGGTTCCAGAATCGACGTATGAAGTGGAAG
AAGGAGCACAAAATGGCGTCCATGAACATAGTCCCCTACCACATGAACCCGTACGGACAC
CCCTACCAGTTCGACCTCCACCCGAGCCAATTCGCGCATCTATCAGCATAG

Protein sequence:

MSSYQFVNSLASCYGNQVPGRTGTPVDQSGHPGLPTPGTDYYNPNAAASYPNTCYSPPQV
SHHYPQHPYATPAAGAHMQPQAMIDYTQLHPQRLASSATHIHQHTNPSPGALSPNLMTPT
SQTASACKFADSTSTTGVASPQDLSTSSGPGRTSPGFNVNPAGTSTKLGLTTPIASPVEH
KANINQNISSPASSTSSNESAEANSTSTKNTKSSASAQANPPQIYPWMKRVHLGQSTVNA
NGETKRQRTSYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLTERQIKIWFQNRRMKWK
KEHKMASMNIVPYHMNPYGHPYQFDLHPSQFAHLSA