DPGLEAN21596 in OGS1.0

New model in OGS2.0  DPOGS202998
Genomic Position  scaffold8:- 99619-191605
CDS Length  1221
Paired RNAseq reads  7
Single RNAseq reads  50
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA003888 (2e-88)
Best Drosophila hit  arrowhead, isoform B (2e-92)
Best Human hit  LIM/homeobox protein Lhx8 (2e-51)
Best NR hit (blastp)  PREDICTED: similar to GA10520-PA [Tribolium castaneum] (4e-116)
Best NR hit (blastx)  PREDICTED: similar to GA10520-PA [Tribolium castaneum] (1e-104)
GeneOntology terms
GO:0007444 imaginal disc development
GO:0005634 nucleus
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0043565 sequence-specific DNA binding
GO:0008270 zinc ion binding
GO:0005515 protein binding
InterPro families
IPR001781 Zinc finger, LIM-type
IPR001356 Homeobox
IPR009057 Homeodomain-like
IPR012287 Homeodomain-related
Orthology group  MCL10541

Nucleotide sequence:

ATGGAACGGGAGTGCGTGCAGCGACATGTGACTCGAGAGATACAACTAAGCACAAACCCT
TTACAGAAACAAAACAATGCACAAAATCGCAATGCAACAACAGCTATGCTGATAGACGGA
TCTCACAACGTGGTCCCTGTTCATATGTCTGTGTTTGTGTCTGTGCTGTGTCCGTCTGTG
ACTGTGCTGTGTCGACACTGTGACGTATGTGCAAACAGCGGTGACGTCAGAAGCCACGCC
CAGATCCGCCCGCTGTCGGCGCCCGTCTACCCATCTTACACCGGGCCACTAGAAAATACG
CCTTACCGTGATGATAGAGAGCTCAAAATTGATGCGACGTATGTCGTTGAACACTTGAAC
CACTGCGATGTTTCTGAAGAATTCTCAACATCGGGGACGGAGCATCGCACGTGTTGTGCC
TGTGGGGAGCCCATAGCTGATCGCTTCCTACTGGAAGTGGGCGGCGGCGCATGGCACACG
GGCTGCCTGAGATGCTGTGTCTGTGCTGTGCAATTAGACAGACATCCCTCCTGCTTTCTC
AGAGACAGACAGGTGTACTGCAAGCAAGACTACGCCAAGAGTTTTGGGGCAAAGTGCTCC
AAGTGCTGCCGAGGCATCTCTTCATCAGACTGGGTTCGCAAGGCTCGTGAGCAGGTATAC
CATCTGGCCTGCTTCGCCTGCGACGCCTGCGGCCGTCAGCTATCCACGGGGGAACAGTTC
GCTTTGCACGAGGACAGAGTACTCTGCAAGCCGCACTACTTGGAAACATTGGATGGAGGA
TCTATTTCCTCAGATGGCAAGTGCAATGGCTGTGACTCAGAAGGTTACCACAAAAGTAAA
GCGAAACGCGTTCGTACGACTTTCACGGAAGAACAATTGCAAGTTCTCCAAGCTAACTTC
CAGCTGGACTCGAACCCTGACGGCCAAGACCTGGAAAGGATCGCTCAAGTCACCGGTCTC
AGCAAACGGGTCACACAAGTCTGGTTTCAAAATAGCCGTGCCAGGCAGAAGAAACATCAA
CATACGGGAAAAGGGAAACAGAACCAAGCGATGTCCCGCGAGGATGCGGTGGGGTTTGGT
CGGCCCATCAACCTCCACCTCACGTATTCCTTCCAGAACAAACCGCCGTTCGTTCCGATA
GATGGAACTTCTTTCACTGACTCATCAATGGACGAACTGTCCGAGGACTCCTCGATACAC
TGCATGCAGAGCGAGGTCTAG

Protein sequence:

MERECVQRHVTREIQLSTNPLQKQNNAQNRNATTAMLIDGSHNVVPVHMSVFVSVLCPSV
TVLCRHCDVCANSGDVRSHAQIRPLSAPVYPSYTGPLENTPYRDDRELKIDATYVVEHLN
HCDVSEEFSTSGTEHRTCCACGEPIADRFLLEVGGGAWHTGCLRCCVCAVQLDRHPSCFL
RDRQVYCKQDYAKSFGAKCSKCCRGISSSDWVRKAREQVYHLACFACDACGRQLSTGEQF
ALHEDRVLCKPHYLETLDGGSISSDGKCNGCDSEGYHKSKAKRVRTTFTEEQLQVLQANF
QLDSNPDGQDLERIAQVTGLSKRVTQVWFQNSRARQKKHQHTGKGKQNQAMSREDAVGFG
RPINLHLTYSFQNKPPFVPIDGTSFTDSSMDELSEDSSIHCMQSEV
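
The nucleotide and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the CDS is translated with the standard genetic code (NCBI translation table 1), and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration. Only the first and last lines of the nucleotide sequence are used; both start on a codon boundary because the 20 preceding lines hold 60 bases each.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; stop codons are rendered as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGAACGGGAGTGCGTGCAGCGACATGTGACTCGAGAGATACAACTAAGCACAAACCCT"
last_line = "TGCATGCAGAGCGAGGTCTAG"

print(translate(first_line))  # MERECVQRHVTREIQLSTNP
print(translate(last_line))   # CMQSEV*

# The stated CDS length of 1221 is a whole number of codons:
# 407 codons, i.e. 406 residues plus the stop codon.
cds_length = 1221
print(cds_length % 3, cds_length // 3 - 1)  # 0 406
```

The two translated fragments match the start (`MERECVQRHVTREIQLSTNP`) and end (`CMQSEV`) of the protein sequence listed above, and the residue count of 406 agrees with six full lines of 60 residues plus a final line of 46.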