DPGLEAN21797 in OGS1.0

New model in OGS2.0  DPOGS210963
Genomic Position  scaffold14:- 10655-12038
CDS Length  1302
Paired RNAseq reads  16
Single RNAseq reads  40
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006393 (1e-85)
Best Drosophila hit  antennapedia, isoform F (4e-27)
Best Human hit  homeobox protein Hox-B7 (1e-28)
Best NR hit (blastp)  PREDICTED: similar to fushi-tarazu-like protein [Nasonia vitripennis] (3e-32)
Best NR hit (blastx)  PREDICTED: similar to fushi-tarazu-like protein [Nasonia vitripennis] (2e-30)
GeneOntology terms
GO:0030099 myeloid cell differentiation
GO:0005634 nucleus
GO:0048704 embryonic skeletal system morphogenesis
GO:0005515 protein binding
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0007275 multicellular organismal development
GO:0006355 regulation of transcription, DNA-dependent
GO:0043565 sequence-specific DNA binding
GO:0009952 anterior/posterior pattern formation
InterPro families
IPR009057 Homeodomain-like
IPR001356 Homeobox
IPR017970 Homeobox, conserved site
IPR020479 Homeobox, eukaryotic
IPR000047 Helix-turn-helix motif, lambda-like repressor
IPR012287 Homeodomain-related
Orthology group  MCL40067
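
The genomic position and CDS length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the record itself: the helper name `parse_position` is hypothetical, and it assumes the position string reads as scaffold, strand sign, then 1-based inclusive start and end coordinates. That reading is a guess from the field's appearance and is not confirmed by the record.

```python
import re


def parse_position(text):
    """Split a position string such as 'scaffold14:- 10655-12038'.

    Assumed layout (not confirmed by the record): scaffold name, ':',
    strand sign, optional space, then 'start-end'.
    """
    match = re.fullmatch(r"(\S+?):([+-])\s*(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)


scaffold, strand, start, end = parse_position("scaffold14:- 10655-12038")

# Assumption: coordinates are 1-based and inclusive at both ends.
genomic_span = end - start + 1
cds_length = 1302  # value of the "CDS Length" field

print(scaffold, strand, genomic_span)   # scaffold14 - 1384
print(genomic_span - cds_length)        # 82
```

Under these assumptions the genomic span is 82 bp longer than the CDS, which would be consistent with non-coding sequence (for example an intron) lying inside the annotated span. The record does not state this directly.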

Nucleotide sequence:

ATGTCCTCTGCGGCGACTACGAACCGCGCCAGTGATAGTTGGAACAGTTCCCAGCCCAAC
CAAAGTGAACAGCAAATGGAATACGGCTATCAATATCCTGTGAACCCCAACTTCTATGGA
TATCATAACCATCCATACCCTCAGGGCTATGACTACAATAACGAACAATACAGATCCTAT
GTGAATAATACCGACAAGATTGTTAAAACGGAACCGACAAATTGGCCAAATTACCAAACA
GGATATAATGGTCATCAGGATAATATAAATACGATTAATAGATGGCGGGAGATGAACATG
TATTCCCAACAGCTCCCGGACAATTATTATGACCAAGGCTTGCATGGTGTTAAAAGTGCG
TTAGACTTGAAAGGTGAAGACTCGCGATCCATCCACTCTCCTAGTCAATGCAGCATTCCA
GAAACAAGTTACGGCTCCCCATTAAGCGCATCCAGCGCTGTCAAATCAAATCATTTAGAC
GATGAAGACTCACCCAATTTAAGAGGAATTTTAAACAAAACATCCGCTAAGCGATCACAC
GCCTACGTTGATAAATATGGCGAATCCTACACACAAGAAACGTTACAGCAAATGATGTAT
CGAAACGAAACACAGAGCTGGAAGAAAAATGATGAAAAAGCTATCGAAAAGGAATCCAAT
TTACAGAGATTTCATGGTGAATATGAGAGCTTGGATGACGCGAAGGTTAAAATCAAGCGA
TCAGTGGGCGGGGTATCTGACACGAAGACATCGGACGAGCTCTCAGCTAGCTGTCACGAT
GTGACGAGGGTTGAGTCAGGCGGTGACGAAGATTATAACGACAGCAAGATGGCCACCGCG
CCGGATGTGCAAGGATTTTATCCATGGATGAAAAGCATTGGAGGTGATGACAAAAAAGAA
GGCTCGAAGCGGACCCGACAAACTTACACGAGGTTTCAAACATTGGAGCTGGAGAAAGAA
TTTCACTTCAACAAATATTTATCAAGACGGCGCCGGATAGAGGTTTCACACGCACTGGGT
CTGACGGAGAGACAAATAAAGATATGGTTCCAGAACAGGAGAATGAAGGCAAAGAAAGAT
GGGAAGTTGACGAATTCACCAGATCCATTTGAAGATCTGAACAGTAAAGCAGCCAACGTT
AATGACTTCGCTGATCCAAGGCAGCAAATACCAAAATTCATGGATTTCCCGAATCCCAAC
TATCACCTGACCGGCAATCCCATCGCAAACATGAGTCATTTAACAGGAAACCTAAATCAA
ACCTGCAACATACCTTACGGCGGTGTCATACCAAAAATGTGA

Protein sequence:

MSSAATTNRASDSWNSSQPNQSEQQMEYGYQYPVNPNFYGYHNHPYPQGYDYNNEQYRSY
VNNTDKIVKTEPTNWPNYQTGYNGHQDNINTINRWREMNMYSQQLPDNYYDQGLHGVKSA
LDLKGEDSRSIHSPSQCSIPETSYGSPLSASSAVKSNHLDDEDSPNLRGILNKTSAKRSH
AYVDKYGESYTQETLQQMMYRNETQSWKKNDEKAIEKESNLQRFHGEYESLDDAKVKIKR
SVGGVSDTKTSDELSASCHDVTRVESGGDEDYNDSKMATAPDVQGFYPWMKSIGGDDKKE
GSKRTRQTYTRFQTLELEKEFHFNKYLSRRRRIEVSHALGLTERQIKIWFQNRRMKAKKD
GKLTNSPDPFEDLNSKAANVNDFADPRQQIPKFMDFPNPNYHLTGNPIANMSHLTGNLNQ
TCNIPYGGVIPKM
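
The relationship between the two sequences above can be checked by translating the nucleotide sequence with the standard genetic code. The following is a minimal sketch using only the Python standard library. The names `translate`, `FIRST_LINE`, and `LAST_LINE` are illustrative, and the use of the standard code is an assumption, since the record does not say which translation table was applied. Only the first and last lines of the nucleotide sequence are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a coding sequence in frame 0; whitespace is ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon: end of the protein
        protein.append(aa)
    return "".join(protein)


# First and last lines of the nucleotide sequence. Every full line holds
# 60 nt (a multiple of 3), so each line starts in frame.
FIRST_LINE = "ATGTCCTCTGCGGCGACTACGAACCGCGCCAGTGATAGTTGGAACAGTTCCCAGCCCAAC"
LAST_LINE = "ACCTGCAACATACCTTACGGCGGTGTCATACCAAAAATGTGA"

print(translate(FIRST_LINE))  # MSSAATTNRASDSWNSSQPN
print(translate(LAST_LINE))   # TCNIPYGGVIPKM
```

Both results match the start and end of the listed protein. A 1302 nt CDS holds 434 codons including the stop codon, which gives the 433 residues shown in the protein sequence.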