DPGLEAN21796 in OGS1.0

New model in OGS2.0DPOGS210962 
Genomic Positionscaffold14:- 54849-64464
See gene structure
CDS Length912
Paired RNAseq reads  259
Single RNAseq reads  687
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006391 (3e-83)
Best Drosophila hit  antennapedia, isoform K (3e-48)
Best Human hithomeobox protein Hox-B7 (6e-34)
Best NR hit (blastp)  antennapedia homologue protein [Bombyx mori] (1e-132)
Best NR hit (blastx)  antennapedia homologue protein [Bombyx mori] (5e-113)
GeneOntology terms









  
GO:0005634 nucleus
GO:0003704 specific RNA polymerase II transcription factor activity
GO:0007494 midgut development
GO:0006357 regulation of transcription from RNA polymerase II promoter
GO:0007379 segment specification
GO:0007507 heart development
GO:0043565 sequence-specific DNA binding
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0007383 specification of segmental identity, antennal segment
GO:0048542 lymph gland development
InterPro families





  
IPR009057 Homeodomain-like
IPR001356 Homeobox
IPR017970 Homeobox, conserved site
IPR001827 Homeobox protein, antennapedia type, conserved site
IPR020479 Homeobox, eukaryotic
IPR017995 Homeobox protein, antennapedia type
IPR012287 Homeodomain-related
Orthology groupMCL11855

Nucleotide sequence:

ATGAGCGCCAACAATTGCGATAGCATGACATATTTCTCTAACGCATACGTCCCAGACATG
AGGAACGGGGGCCATGATCACCAGCAGGCGCACGCTCATTACGGTGCTGTGCCGCAGCAG
GGACACGAAATGGACGGTTGTGATCAACAACTCAGGCCTGCACAACATCACTACTCGGCG
CAGACGGCGCCTGGGATGCCCTATCCAAGATTCCCACCATACGACAGATTGGGTTACTAT
CAACAAATGGAACAGAATGGATATCGGCCCGATAGTCCATCACAGATGGGTCATATGGGT
CCGAAATCGGATGGATATGGCCCTAATGGTCACCAACCACCGGCTCCAGCGGTCTACCCT
TCGTCGTGCAAAGTACAAGCAGCAGCCGCGATGGCGGGTGGTGTCCCTGGGAGTCCTCCA
CTGGAACAGGCCCAGCAAATGCCTCACCACATGCATCCCCAGCAGCACATGGCCCAGCAC
GGGATGCCGTCGCATCAGCAGCACCTCATGTATCCCGTGGACGACATGCAGCACCAGACG
CAGATGCCTCCCATGCATCAGCAGTCGATGCACGCACAGCAGGCACCTCCTCAACAACCA
CCGCCAAATACCAATGCGTCGTTACCAAGTCCGCTCTACCCTTGGATGAGAAGTCAATTT
GAGCGGAAGCGTGGTCGGCAAACGTACACCCGGTACCAGACCCTGGAACTGGAGAAGGAG
TTCCACTTCAACCGCTACCTGACGAGGAGGAGACGCATAGAGATCGCACACGCGCTCTGT
CTCACCGAGAGACAGATCAAGATATGGTTCCAGAATCGACGTATGAAGTGGAAGAAGGAG
AATAAAACCAAAGGCGAGCCAGGGTCGGGAGACGAACCTGACAACATGAGTCCACCGACA
TCGCCACAATAA

Protein sequence:

MSANNCDSMTYFSNAYVPDMRNGGHDHQQAHAHYGAVPQQGHEMDGCDQQLRPAQHHYSA
QTAPGMPYPRFPPYDRLGYYQQMEQNGYRPDSPSQMGHMGPKSDGYGPNGHQPPAPAVYP
SSCKVQAAAAMAGGVPGSPPLEQAQQMPHHMHPQQHMAQHGMPSHQQHLMYPVDDMQHQT
QMPPMHQQSMHAQQAPPQQPPPNTNASLPSPLYPWMRSQFERKRGRQTYTRYQTLELEKE
FHFNRYLTRRRRIEIAHALCLTERQIKIWFQNRRMKWKKENKTKGEPGSGDEPDNMSPPT
SPQ