DPGLEAN11699 in OGS1.0

New model in OGS2.0DPOGS216048 
Genomic Positionscaffold665:+ 14523-28815
See gene structure
CDS Length1416
Paired RNAseq reads  32
Single RNAseq reads  124
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009025 (0.0)
Best Drosophila hit  gooseberry (2e-101)
Best Human hitpaired box protein Pax-3 isoform PAX3i (2e-89)
Best NR hit (blastp)  gooseberry [Tribolium castaneum] (2e-147)
Best NR hit (blastx)  gooseberry [Tribolium castaneum] (3e-125)
GeneOntology terms








  
GO:0045449 regulation of transcription
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005667 transcription factor complex
GO:0007367 segment polarity determination
GO:0005634 nucleus
GO:0007419 ventral cord development
GO:0043565 sequence-specific DNA binding
GO:0006355 regulation of transcription, DNA-dependent
GO:0007435 salivary gland morphogenesis
GO:0005737 cytoplasm
InterPro families




  
IPR017970 Homeobox, conserved site
IPR001523 Paired box protein, N-terminal
IPR009057 Homeodomain-like
IPR001356 Homeobox
IPR011991 Winged helix-turn-helix transcription repressor DNA-binding
IPR012287 Homeodomain-related
Orthology groupMCL11701

Nucleotide sequence:

ATGGCTGTATCCACATTAAACATGACTCCCTATTTCACTGGATACTCTTTCCAAGGACAA
GGACGTATGAATCAGTTAGGCGGAGTCTTCATTAACGGACGTCCTCTGCCGAACCACATC
CGTCTAAAAATCGTGGAGATGGCAGCGGCGGGTGTTCGGCCTTGTGTCATCTCAAGACAG
TTGAGGGTCTCCCACGGCTGTGTCTCTAAAATACTCAATAGATACCAGGAAACTGGATCT
ATTCGCCCTGGTGTGATAGGGGGATCGAAGCCGAGAGTAGCCACGCCCGAAGTTGAAAAC
AGGATCGAAGAATTGAAGAGACAAAACCCAGGTATATTTTCCTGGGAAATTAGAGATAAA
TTAATAAAAGAAGGCATATGTGATAAAAACACCGCGCCATCAGTAAGTTCGATTTCGCGA
CTCATAAGAGGAGGCAAAAGGGACGAATCAGATCCTAGAAGGAACCACAGTATAGATGGT
ATTCTTGGACCATCCTCATCGTGTGAGGATAGTGATACGGAGTCTGAGCCTGGTATAACG
TTAAAAAGAAAACAACGAAGATCTAGAACAACCTTTTCTGGAGATCAACTTGAAGCTCTC
GAGCGCGCTTTCACGCGAACCCAATACCCAGACGTTTACACTCGTGAAGAATTAGCTCAA
AAGACGAAGTTGACCGAAGCACGTGTTCAGGTATGGTTCTCAAACCGAAGAGCACGTCTT
CGCAAACAACTGAACTCACAACAATTGAGTGCTTTTAATACAATGTCTTTACAATCTGCA
TTTCCCTCCGTTCATCAACAATACGAACCACCAACAACATTTAATGCACAGTGCGCGTCG
TGGCAACAATCATATTCTGCTTTGGGTAGCAGTTCTGTTCTGAATTCTGCTTTGGCACCT
TCTTTACATCAGTCATCATTATCAGCTCCTTCTGTTTGTCAATCTGCTCTTACAGCACCA
TCCCTCCATCCTCCCACATCGAGTTCATATTCATCAGGAAACTTGACGCCATTGTCACAT
TCATCTGAACTGCCTACACCTATACAAGCCTCTACTGATGCTACACCACCAAGTTCAAGC
CCAATCACTTCCAGCCCTGCTGGAAATCAAAGCGGTGGCATCACTTACCAACATCCAACT
TACACTAATACTAGTGATGTAGTTTCACACCCATACGGCTATGGTGACTACGCAAAACAA
GAACATATGTCAGCACATAACCACTGGACTTCAAGACAACTGAGTGGACATTCACAAAAT
AAATTAGCAGAAGTCAGCGCTTGGCCAGAAAATTATAGTTCATTCTTTGGCGCGAATACT
CACTATGCGTCGCACGCGCATTCACCTAGTGAAGCAAAGTCTGGTTATCCCTATATAGGA
CAGCTTGGCGGAATGGATATGGGGAGAGTTCATTGA

Protein sequence:

MAVSTLNMTPYFTGYSFQGQGRMNQLGGVFINGRPLPNHIRLKIVEMAAAGVRPCVISRQ
LRVSHGCVSKILNRYQETGSIRPGVIGGSKPRVATPEVENRIEELKRQNPGIFSWEIRDK
LIKEGICDKNTAPSVSSISRLIRGGKRDESDPRRNHSIDGILGPSSSCEDSDTESEPGIT
LKRKQRRSRTTFSGDQLEALERAFTRTQYPDVYTREELAQKTKLTEARVQVWFSNRRARL
RKQLNSQQLSAFNTMSLQSAFPSVHQQYEPPTTFNAQCASWQQSYSALGSSSVLNSALAP
SLHQSSLSAPSVCQSALTAPSLHPPTSSSYSSGNLTPLSHSSELPTPIQASTDATPPSSS
PITSSPAGNQSGGITYQHPTYTNTSDVVSHPYGYGDYAKQEHMSAHNHWTSRQLSGHSQN
KLAEVSAWPENYSSFFGANTHYASHAHSPSEAKSGYPYIGQLGGMDMGRVH