DPGLEAN20665 in OGS1.0

New model in OGS2.0  DPOGS205542
Genomic Position  scaffold1981:- 26315-30646
CDS Length  1146
Paired RNAseq reads  14
Single RNAseq reads  72
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA000091 (1e-141)
Best Drosophila hit  Six4, isoform B (3e-82)
Best Human hit  homeobox protein SIX4 (9e-72)
Best NR hit (blastp)  AGAP011065-PA [Anopheles gambiae str. PEST] (2e-90)
Best NR hit (blastx)  six/sine homebox transcription factors [Culex quinquefasciatus] (2e-89)
GeneOntology terms
GO:0003702 RNA polymerase II transcription factor activity
GO:0005634 nucleus
GO:0045449 regulation of transcription
GO:0008406 gonad development
GO:0007520 myoblast fusion
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0043565 sequence-specific DNA binding
GO:0006355 regulation of transcription, DNA-dependent
GO:0007498 mesoderm development
GO:0007503 fat body development
InterPro families
IPR009057 Homeodomain-like
IPR001356 Homeobox
IPR012287 Homeodomain-related
IPR017970 Homeobox, conserved site
Orthology group  MCL13219

Nucleotide sequence:

ATGGAGTCGTGTTCTGACCGGTTGAGTCCATCCAGCGATATGACAAACGAATCCGAGACA
TCGCTGCCATCATTTCCATACGAACAGCAGAGTGGGAACTTCTATCAGCCGGAGATAAAC
GAAAAACAATACTTTTCCTGCAAACAGAAGTCACCGCGCGACGACAGGAAAGAGGTTTAC
TTACAGGAGAGCAATAAAAACTCCTTAGAGAACTACTTGCCGGCGAGGAGCAAAAAGTAT
TTAAATTTCGAACTCAAATTGCCGCCGATCAATGACAACTTTTTCTCTCAAATTGACAGC
GATGACAGGACGATGCGTTCAGCGTACTTCAGTGACATGAACAAATGCGTTCAGAATGAG
AACAATCCGAACATGCAAGAATTGGATATAGATTTAAACAATCAGTTTGAAAACGTTAAC
AGTGACAGAGAGAGGGTCCAACAATCTTATTTTGCGGATAATGAAAGTGGAAATATGCGA
AGATGTTTAAACTTCAACTCCGAACAGGTTCAATGCGTGTGTGAGGCCCTCCAGCAAAAA
GGCGATATAGAAAAATTGGCGGCATTCCTATGGAGTCTACCACCGAGTGAATTATTAAGA
GGAAATGAAACCGTTCTCAGAGCCCGCGCTTTGGTGGCGTATCATCGCGGCGTATTTCAG
GAGTTGTACGCCATATTGGAGACGCACACATTCTCACCTCGTCACCACACCGATCTCCAG
AACCTTTGGTTTAAAGCGCACTATAAGGAAGCCCAGAAAGTCAGAGGAAGACCGCTTGGA
GCTGTTGATAAATACCGTCTTCGCAAGAAGTATCCCTTGCCAAAGACGATCTGGGATGGT
GAAGAGACGGTGTACTGCTTTAAGGAAAAGTCGAGAAATGCGCTGAAAGACTGTTACTAT
AGAAACCGTTACCCAACTCCAGACGAAAAACGTGCGCTCGCACAAAAAACAGGCTTGACA
TTAACACAAGTGTCAAATTGGTTCAAGAACCGACGCCAGAGGGATAGGACACCGCAGCAA
CCGAATAGACCTGAAATGATGGTTCCGGCTCAATATGTTGGTTCGCAGCCGGGTTTGGCG
CAATCTTTTCTTCCCAATGCCTACTATAAGCTTCAAGAATCTCACTATTTACACGGAAAT
CCTTGA

Protein sequence:

MESCSDRLSPSSDMTNESETSLPSFPYEQQSGNFYQPEINEKQYFSCKQKSPRDDRKEVY
LQESNKNSLENYLPARSKKYLNFELKLPPINDNFFSQIDSDDRTMRSAYFSDMNKCVQNE
NNPNMQELDIDLNNQFENVNSDRERVQQSYFADNESGNMRRCLNFNSEQVQCVCEALQQK
GDIEKLAAFLWSLPPSELLRGNETVLRARALVAYHRGVFQELYAILETHTFSPRHHTDLQ
NLWFKAHYKEAQKVRGRPLGAVDKYRLRKKYPLPKTIWDGEETVYCFKEKSRNALKDCYY
RNRYPTPDEKRALAQKTGLTLTQVSNWFKNRRQRDRTPQQPNRPEMMVPAQYVGSQPGLA
QSFLPNAYYKLQESHYLHGNP
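
The nucleotide and protein sequences above can be cross-checked against each other. The stated CDS length of 1146 corresponds to 382 codons, that is, 381 residues plus a stop codon. The following is a minimal sketch of such a check, assuming the standard genetic code applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and not part of the database record.

```python
from itertools import product

# Standard genetic code (assumed here), laid out in the conventional
# T, C, A, G base order with the third codon position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the sequence block can be
    pasted in as it appears in the record.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the record's CDS.
print(translate("ATGGAGTCGTGTTCTGACCGGTTGAGTCCA"))  # MESCSDRLSP

# Protein length implied by the stated CDS length (stop codon excluded).
print(1146 // 3 - 1)  # 381
```

Passing the full nucleotide block to `translate` and comparing the result with the protein block would confirm that the two listings describe the same gene model.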