DPGLEAN16069 in OGS1.0

New model in OGS2.0  DPOGS204692
Genomic Position  scaffold533:+ 59905-97699
CDS Length  1299
Paired RNAseq reads  326
Single RNAseq reads  736
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA007471 (3e-30)
Best Drosophila hit  CG34340 (3e-63)
Best Human hit  dorsal root ganglia homeobox protein (4e-30)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC005600 [Tribolium castaneum] (5e-68)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC005600 [Tribolium castaneum] (6e-66)
GeneOntology terms
GO:0005634 nucleus
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0043565 sequence-specific DNA binding
GO:0007517 muscle organ development
GO:0048813 dendrite morphogenesis
InterPro families
IPR001356 Homeobox
IPR003654 Paired-like homeodomain protein, OAR
IPR012287 Homeodomain-related
IPR017970 Homeobox, conserved site
IPR009057 Homeodomain-like
Orthology group  MCL17330

Nucleotide sequence:

ATGCAAATCGTGTCGGAGAGACCCGTCCGCGACGAGACTCCGTCTCGTTCATGCATCTAT
CTCGAGATCGTCGCCTCTCTAGCGCCGGATGCTGGCATCTCGGCGTGTGGGAATGAGATG
GTCTATGTTGTAGCGCGTACTAAGCTTCTGTCCGCAGTCGGTCTCGCGCCGTGGTCGCGG
GTGCGTGTGTCGGGCTATGTCAGGGTGAGCGCGCTTCTTGTCACGAGTAAGGCGGGAAAA
CTTCCCGCCCACAACGCAGCAACAATATTTTCAGACGCTAATAAGCTGGCGAGTGGGCCG
GGCGGTGGGCGCGGGTTGTTCTGCTATCATTGCCCGCCGAGCCTGCCCCCGCACCAGCAC
CGTCTTCCAACCCTGGAGTACCCCTTCACAGCATCACATCCCTACACCAGCTATTCCTAC
CACCCCGCCATCCACGATGACACTTTCGTTAGACGCAAACAGAGACGAAACAGAACCACC
TTTACATTACAGCAGCTGGAAGAGCTGGAGACGGCGTTTGCACAGACGCATTACCCGGAT
GTGTTCACTAGAGAGGATCTAGCACTCAAGATAAACCTCACCGAAGCTAGAGTTCAGGTT
TGGTTTCAAAACAGACGGGCTAAGTGGCGGAAAGCGGAAAGACTAAAAGAGGAACAGCGC
AAACGAGAGGGAGCTGAAGTTTTGGCTAAGAGGGATCCAGCGGATGATAAGGGTTCTTCG
GAGTGCGGAATGTCTCGAGGGTCTGGGGATGCATCTCCAATGTCAACTGGCGTGTCCCCT
CGCGCGTCTCCCCCGGTAACGCCAGGGTCACCTCGTCGTTCCCCCCACCGTTCACCCAAT
AGGTCACCAAGATTGGAAAGATCTGAGACCTGTGCCTCTCCCGCTCCTTCGGTTGGCAGC
GCAGGTTCCCGCGAGCCAGACCCTCGCCCGCCGCACAACATCTTCTCTCCTTTCGATCAT
GGAGCGTTCCGTTCATCAGCCCCAGGCGGTGCTGACCCGCCTCCGCTGTTCCTGCCTCCC
CATCTCTCTCATCTCTCGCAGCATCTCAACCATCTATCGCAGCCTTTCTTCCCGTTAAAA
GGTTGGGGAGCACCTTGCCCGTGTTGTCCCAAAGAAGAAGCTCGCTCAACCAGCGTGGCT
GAGTTGAGACGTAAAGCTCACGAACATTCCGCTGCGTTACTGCAATCGCTAGCAAATTTC
CAGTCGCGAGCGTTCCCGCTTCCGCTCCCGCTGCCGCCTCTGCCGCTCCCGCTGTTACAC
GAGCCACCGCCGTCGGAACCTCCCAAACATCTCGAATAA

Protein sequence:

MQIVSERPVRDETPSRSCIYLEIVASLAPDAGISACGNEMVYVVARTKLLSAVGLAPWSR
VRVSGYVRVSALLVTSKAGKLPAHNAATIFSDANKLASGPGGGRGLFCYHCPPSLPPHQH
RLPTLEYPFTASHPYTSYSYHPAIHDDTFVRRKQRRNRTTFTLQQLEELETAFAQTHYPD
VFTREDLALKINLTEARVQVWFQNRRAKWRKAERLKEEQRKREGAEVLAKRDPADDKGSS
ECGMSRGSGDASPMSTGVSPRASPPVTPGSPRRSPHRSPNRSPRLERSETCASPAPSVGS
AGSREPDPRPPHNIFSPFDHGAFRSSAPGGADPPPLFLPPHLSHLSQHLNHLSQPFFPLK
GWGAPCPCCPKEEARSTSVAELRRKAHEHSAALLQSLANFQSRAFPLPLPLPPLPLPLLH
EPPPSEPPKHLE
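
The protein sequence above can be checked against the nucleotide sequence by translating the CDS in frame. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical and is not part of the source database. Only the first 30 bases and the last line of the nucleotide sequence are used here, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 bases and the final line of the nucleotide sequence listed above.
first_codons = "ATGCAAATCGTGTCGGAGAGACCCGTCCGC"
last_line = "GAGCCACCGCCGTCGGAACCTCCCAAACATCTCGAATAA"

print(translate(first_codons))  # start of the protein
print(translate(last_line))     # end of the protein, including the stop
print(1299 // 3)                # number of codons implied by the CDS length
```

The two fragments translate to MQIVSERPVR and EPPPSEPPKHLE*, which agree with the start and end of the listed protein sequence. A CDS length of 1299 corresponds to 433 codons, that is, 432 residues plus the terminal stop codon.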