DPGLEAN16865 in OGS1.0

New model in OGS2.0  DPOGS213657
Genomic Position  scaffold5407:- 2005-3090
CDS Length  1086
Paired RNAseq reads  1098
Single RNAseq reads  3816
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010868 (5e-144)
Best Drosophila hit  ventral veins lacking, isoform B (1e-92)
Best Human hit  POU domain, class 3, transcription factor 3 (3e-80)
Best NR hit (blastp)  silk gland factor 3 [Bombyx mori] (6e-164)
Best NR hit (blastx)  POU-domain transcription factor [Helicoverpa armigera] (9e-174)
GeneOntology terms
GO:0007422 peripheral nervous system development
GO:0005634 nucleus
GO:0003702 RNA polymerase II transcription factor activity
GO:0003677 DNA binding
GO:0006355 regulation of transcription, DNA-dependent
GO:0007425 epithelial cell fate determination, open tracheal system
GO:0005515 protein binding
GO:0048813 dendrite morphogenesis
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0043565 sequence-specific DNA binding
GO:0008045 motor axon guidance
GO:0007420 brain development
GO:0035284 brain segmentation
InterPro families
IPR017970 Homeobox, conserved site
IPR000327 POU-specific
IPR001356 Homeobox
IPR010982 Lambda repressor-like, DNA-binding
IPR009057 Homeodomain-like
IPR012287 Homeodomain-related
IPR013847 POU
Orthology group  MCL11971

Nucleotide sequence:

ATGGCGGCGACCACGTATATGCCGGGAGAGCTGGATCTCGGCAGCATCAGCGGCTATCAC
GCTGCGTCGCCGCGCAGCGCTGAGCCGGCTGACATGAAGTACCAGCACCACCTGCACGCA
AGCGGGTCTCCTTCGCCAGGCGCGCCCGTGGGTCTAAACCCGTGGGCTTCGCTACCGCCT
GGTGATCCCTGGTCCATGCATCAGCACCACACTCATGTGCATCAACCTGACGTTAAGCCA
CCGGCTGCTCCACACGATCACCGTCACTTGCAAGGCCATGGCTGGCATGCACCGGTTGTA
TCAGCACATTACGGCGCTGTTTCGCCCTATCCGGTGCCCATGCACCAGCATCACATGCTG
CGGGATGTGCAGCCCTCGCCGCATCCCATGCATCATCATCACGCGCTCGAACGTGACCAG
CCAGAAGAGGACACACCGACCAGTGATGACCTAGAAGCGTTTGCCAAACAATTTAAGCAA
CGCCGTATCAAACTCGGCTTCACGCAGGCTGACGTAGGTCTCGCATTAGGAACCCTTTAT
GGCAATGTGTTTTCTCAAACAACGATATGCCGATTCGAGGCTCTACAGCTCAGTTTCAAA
AACATGTGTAAATTGAAACCACTTTTGCAAAAATGGCTAGAGGAAGCCGACTCTACGACG
GGTAGTCCGACTAGTATCGATAAAATAGCCGCCCAGGGTCGCAAGAGAAAGAAGCGAACA
TCCATCGAAGTGTCAGTGAAGGGTGCGCTAGAACAACACTTTCATAAGCAGCCAAAACCC
TCCGCCCAAGAGATCACGAGTCTCGCAGATAGTTTACAACTAGAGAAGGAGGTAGTCAGA
GTATGGTTTTGTAATCGTCGGCAAAAGGAAAAGAGGATGACGCCGCCGAACACGTTAGGC
GGTGAGATGTTGGACGGAATGGGCCACGGGCACTACGGGCACGACGTGCATGGCTCACCA
CCGCTGCATGCTCACTCGCCGGCGCTGTCACCACACGCGCAGCACGGCCAGCACGCACCG
CACGCGCCGCACTCGCAGCACCCGCAACACGGCCTCCAGAGCGCGCACACCCTAGCCGCA
CACTAA

Protein sequence:

MAATTYMPGELDLGSISGYHAASPRSAEPADMKYQHHLHASGSPSPGAPVGLNPWASLPP
GDPWSMHQHHTHVHQPDVKPPAAPHDHRHLQGHGWHAPVVSAHYGAVSPYPVPMHQHHML
RDVQPSPHPMHHHHALERDQPEEDTPTSDDLEAFAKQFKQRRIKLGFTQADVGLALGTLY
GNVFSQTTICRFEALQLSFKNMCKLKPLLQKWLEEADSTTGSPTSIDKIAAQGRKRKKRT
SIEVSVKGALEQHFHKQPKPSAQEITSLADSLQLEKEVVRVWFCNRRQKEKRMTPPNTLG
GEMLDGMGHGHYGHDVHGSPPLHAHSPALSPHAQHGQHAPHAPHSQHPQHGLQSAHTLAA
H
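
The record can be spot-checked for internal consistency. The following is a minimal Python sketch, not part of the database record: it assumes the standard genetic code (NCBI translation table 1), and the names `translate` and `span_length` are illustrative only. It translates the first and last codons of the nucleotide sequence above and compares the genomic span with the stated CDS length.

```python
# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate span."""
    return end - start + 1


# First 21 and last 9 nucleotides copied from the sequence in this record.
first_codons = "ATGGCGGCGACCACGTATATG"
last_codons = "GCACACTAA"

print(translate(first_codons))   # MAATTYM
print(translate(last_codons))    # AH*
print(span_length(2005, 3090))   # 1086
print(1086 // 3 - 1)             # 361 residues once the stop codon is excluded
```

Under these assumptions the translated ends match the start (MAATTYM) and end (AH) of the protein sequence, the 361-residue length agrees with the protein as listed, and the genomic span equals the CDS length of 1086, which would be consistent with a model that has no introns.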