DPGLEAN12980 in OGS1.0

Genomic Positionscaffold2167:- 1982-13961
See gene structure
CDS Length1239
Paired RNAseq reads  39
Single RNAseq reads  117
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003265 (2e-53)
Best Drosophila hit  extra-extra (8e-43)
Best Human hitmotor neuron and pancreas homeobox protein 1 isoform 1 (5e-31)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC009461 [Tribolium castaneum] (5e-51)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC009461 [Tribolium castaneum] (3e-46)
GeneOntology terms






  
GO:0045449 regulation of transcription
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005634 nucleus
GO:0007417 central nervous system development
GO:0007412 axon target recognition
GO:0007399 nervous system development
GO:0043565 sequence-specific DNA binding
GO:0006355 regulation of transcription, DNA-dependent
InterPro families



  
IPR009057 Homeodomain-like
IPR017970 Homeobox, conserved site
IPR020479 Homeobox, eukaryotic
IPR012287 Homeodomain-related
IPR001356 Homeobox
Orthology groupMCL15124

Nucleotide sequence:

ATGCTGTCATCGACGATGTTCCGACGGGATTCACAGGCGTCGGAGAAACCGCTGGACATA
GACCAGGTCTCCAACCCCGACCGCAGCAGGTCAAACAGTCCAAAGAACATGAGGTACTAT
AAAGAAGATGACAAGACGAACGTCAGTGCCAAGAAATCCTTCTGCATAGACGCGCTGCTA
TCAAAACATGTTGAACCAGAGCAGACTATACAGGAGATGGCGCAGCAGAAGTATCTAGCC
CTGATACAGAACAAGAATTACATCACCAGATACCAGAACGAGAATTACGACACGCAAATC
GAGCAGAATACTATAAGCAAGTGTCAGAATAGACCTCCCAGCCAGAACTTCGATAGGGCC
AGCAGTCCCAGGTCTCTGGCGAGCAGCCCCAGGAGCGGCAGTCCGGGGTCTGAAGACGGG
AACCGTTCAGATAACTACAGTCCACCGATTTCGCCTGGAGTGGAAGGCGGCACTGACGAA
TATGATAACTATAGCGATAAAATCATCAAAAAATATAGTTATTCTCGTTCTGAAGACTTA
TGTTTTCGTTTTTCTGTAGGCATAATCCCGAAACCCGGTCTACTGCAGCCTCCGATGTCC
CAACCCCTGCTGTATCCGCCTCAGATGATGCAGTCGGCTTTCCATCACCCCGGAGACAGA
GTCCTGCATCAGATGCAGCTGGAATGGCTGGCTCGGACTGGAATGTTCTACCACCGGATA
CCAGAGCTGGGAGGTGCCCCTCACGCTCTTCTCGGTAAGACAAGAAGACCACGGACCGCG
TTCACGTCACAGCAGTTGTTGGAACTTGAGAAGCAGTTCCGCATGAACAAATACCTGTCC
AGACCAAAGAGGTTCGAAGTAGCCACCAGTTTGATGCTCACAGAGACACAGGTGAAAATA
TGGTTCCAAAACCGTAGAATGAAGTGGAAGCGGTCGAAAAAAGCACAACAGGATACGAAG
ATCAAAGAACCGCAGAGCCACGACGACAAAAACAAAACTAAGGACGTACCAGTGGCTGAA
CACGACAAACAACCTTCACAGCACATAGCGGCAGATTTGACATCTGTCAAGCCGGCCCCG
CTGTTAGACAGAGACAGAATCATCGCGCTAGAGAGGGAGAGAGCGATGGCCGCCGCCAAT
TTCAATTCGAATTTGGAAAACAATAGACGCGGATTGGTCGTCATGAACCAAGATGGCGGC
CGGCCCGGCATTGACATGTTCAGGCCTTACGTAGTATGA

Protein sequence:

MLSSTMFRRDSQASEKPLDIDQVSNPDRSRSNSPKNMRYYKEDDKTNVSAKKSFCIDALL
SKHVEPEQTIQEMAQQKYLALIQNKNYITRYQNENYDTQIEQNTISKCQNRPPSQNFDRA
SSPRSLASSPRSGSPGSEDGNRSDNYSPPISPGVEGGTDEYDNYSDKIIKKYSYSRSEDL
CFRFSVGIIPKPGLLQPPMSQPLLYPPQMMQSAFHHPGDRVLHQMQLEWLARTGMFYHRI
PELGGAPHALLGKTRRPRTAFTSQQLLELEKQFRMNKYLSRPKRFEVATSLMLTETQVKI
WFQNRRMKWKRSKKAQQDTKIKEPQSHDDKNKTKDVPVAEHDKQPSQHIAADLTSVKPAP
LLDRDRIIALERERAMAAANFNSNLENNRRGLVVMNQDGGRPGIDMFRPYVV