DPGLEAN13125 in OGS1.0

New model in OGS2.0DPOGS210970 
Genomic Positionscaffold307:- 13368-18622
See gene structure
CDS Length1647
Paired RNAseq reads  20
Single RNAseq reads  71
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006400 (2e-171)
Best Drosophila hit  proboscipedia, isoform C (9e-43)
Best Human hithomeobox protein Hox-B2 (5e-28)
Best NR hit (blastp)  maxillopedia [Tribolium castaneum] (2e-75)
Best NR hit (blastx)  maxillopedia [Tribolium castaneum] (6e-69)
GeneOntology terms






  
GO:0003704 specific RNA polymerase II transcription factor activity
GO:0006357 regulation of transcription from RNA polymerase II promoter
GO:0005634 nucleus
GO:0007382 specification of segmental identity, maxillary segment
GO:0007381 specification of segmental identity, labial segment
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0043565 sequence-specific DNA binding
GO:0048728 proboscis development
InterPro families



  
IPR001356 Homeobox
IPR020479 Homeobox, eukaryotic
IPR012287 Homeodomain-related
IPR009057 Homeodomain-like
IPR017970 Homeobox, conserved site
Orthology groupMCL16205

Nucleotide sequence:

ATGAGTGAGAGAGCCGGTTACCCTTTCCCTTTTCCCACCTTTTCCTACCCCTTCCCTTTT
CCCTCCCTTTCCCACTTTTTCCTCGCCCTCTCCCCTTATCGAGGGTGGCAACACATCCGC
AATTCTATGTTACGGATGTCCATGGGCGACGAAAATGGCTTGCCTCGTAGGCTGCGCACA
GCTTACACAAATACTCAGCTGTTGGAATTGGAGAAGGAATTTCACTTCAACAAGTACCTG
TGCAGGCCGCGAAGAATTGAGATTGCTGCGTCCCTTGACCTTACTGAGAGACAGGTTAAA
GTATGGTTTCAAAATCGACGAATGAAACACAAACGACAAACGTTAAGCAAAAGCGAAGAC
GGCGATGATAAGGACTCCACTACTTCCGAGGGTGGCAAGAGTTCTAAAACAGGTTTGGAA
AAATTCCTTGATGACGACGGTCCTCTATCGGGCAAGAAGAGTTGTCAAGGATGTGAACTG
CCACCAGGTGCTCTATGCTCTCCTACTGAAGATCTTCCGGAGCTGACATCCCGAACAAGA
AATAATAATACTCCAAGCGCAACCAATAACAATAGCTTTGGGAGTGACGGTGCTTCTAGT
GTTGCTTCATCTTCTTCGCTTGATAAACTCGCAGAAGAAGACTCCCGAGAAGGTCCACCT
CCGGTTAACGCTGTTAATGTACCTAGGAATCTAGCGAAAAGAATTAAGCAGGAATCAAGG
AAGCGATCTCCATCATTAGATGCCACAGGATGTAAAGTGTCTCCATCGTCTTCTAAAGAG
GGCCTTATAGGAGTTACGGGTTTACCAGATGGCAAATTCTCATCAGTAAACTTAACACCA
TCATCTACCCCCGGCACACCGTCTAGTATGCATCAAAGTCCACTCGGAGCATATCCCAGG
CCCTCACCCCCACACGCACCTGGAGCTCCTTTGACACAAACCGTACCTAATGCGATACCA
CCATATGTAATTAGAGGCAATGCACCTCCAGGTCAATTTGTACCTCACCCCGACTTTCGC
ATGGAAACAAAACAATTCGTCGGTAAGCTCGCCCAGTACCCGCAAAATAACAGATCGTAT
GAAGCTTACTCAGCGCTACAAGGTAGCGACCATCATATATATCCAGGACGTAATCCGACT
TCAAGAGCAACTAACGGCATTGGTTCTAGACAATCTTATCCTCACGAAATGTATCAGAAC
TATGCTTACACTGGATACGGAAAAGATCAGACCGGGTATGGTCATCCCGGTTACGATCAA
GGTCAAAGTTACCCAGCCGAAATGGGCTACCCCAATAGCCATTACGGATACCACTACCAC
GAAAGTGGACAGCACGACCACGGCCACGGATACTTCAGTAGCGAAGGACAGAAGAACATG
CACGGCCACGAATATTCAAAGAATTATTACGACACAAACTCTTACAATCAACAAGGGAGT
ACACAACCCAGTTATGGTCCCAACAACCCGCAAGGCGAAGGTTACACAGGAACAGCTGAG
TGCGGGGAGGGTTACGGATCTTTTCAACAATTCTATGAAGCGACTCACGCTACACCCGCG
ACCGGAGAAAACTCTAATTCCTCGTCAGACTTCCACTTTCTAAGCAATCTGGCTAACGAC
TTTGCTCCTGAATATTACACCATTTGA

Protein sequence:

MSERAGYPFPFPTFSYPFPFPSLSHFFLALSPYRGWQHIRNSMLRMSMGDENGLPRRLRT
AYTNTQLLELEKEFHFNKYLCRPRRIEIAASLDLTERQVKVWFQNRRMKHKRQTLSKSED
GDDKDSTTSEGGKSSKTGLEKFLDDDGPLSGKKSCQGCELPPGALCSPTEDLPELTSRTR
NNNTPSATNNNSFGSDGASSVASSSSLDKLAEEDSREGPPPVNAVNVPRNLAKRIKQESR
KRSPSLDATGCKVSPSSSKEGLIGVTGLPDGKFSSVNLTPSSTPGTPSSMHQSPLGAYPR
PSPPHAPGAPLTQTVPNAIPPYVIRGNAPPGQFVPHPDFRMETKQFVGKLAQYPQNNRSY
EAYSALQGSDHHIYPGRNPTSRATNGIGSRQSYPHEMYQNYAYTGYGKDQTGYGHPGYDQ
GQSYPAEMGYPNSHYGYHYHESGQHDHGHGYFSSEGQKNMHGHEYSKNYYDTNSYNQQGS
TQPSYGPNNPQGEGYTGTAECGEGYGSFQQFYEATHATPATGENSNSSSDFHFLSNLAND
FAPEYYTI