DPGLEAN18821 in OGS1.0

New model in OGS2.0  DPOGS214084
Genomic Position  scaffold380:- 85062-106326
CDS Length  1815
Paired RNAseq reads  927
Single RNAseq reads  2741
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA005253 (4e-71)
Best Drosophila hit  pou domain motif 3, isoform A (1e-94)
Best Human hit  POU domain, class 6, transcription factor 2 isoform 2 (3e-69)
Best NR hit (blastp)  PREDICTED: similar to CG11641-PA [Apis mellifera] (9e-142)
Best NR hit (blastx)  pou domain motif 3, isoform B [Drosophila melanogaster] (5e-90)
GeneOntology terms
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005634 nucleus
GO:0006355 regulation of transcription, DNA-dependent
GO:0043565 sequence-specific DNA binding
InterPro families
IPR013847 POU
IPR010982 Lambda repressor-like, DNA-binding
IPR009057 Homeodomain-like
IPR000327 POU-specific
IPR001356 Homeobox
IPR012287 Homeodomain-related
Orthology group  MCL12932
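
The Genomic Position field packs scaffold, strand, and coordinates into one string. A minimal Python sketch for splitting it apart follows. It assumes the character after the colon is the strand and that the coordinates are 1-based and inclusive; neither convention is stated in this record. The name parse_position is hypothetical.

```python
import re

# Assumed layout: <scaffold>:<strand> <start>-<end>, e.g. "scaffold380:- 85062-106326".
POSITION_RE = re.compile(r"^(\S+?):([+-])\s*(\d+)-(\d+)$")

def parse_position(text: str) -> dict:
    """Split a position string into scaffold, strand, start, end, and span."""
    match = POSITION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    start, end = int(start), int(end)
    return {
        "scaffold": scaffold,
        "strand": strand,
        "start": start,
        "end": end,
        # Span on the scaffold under the inclusive-coordinate assumption.
        "span": end - start + 1,
    }

position = parse_position("scaffold380:- 85062-106326")
```

Under those assumptions the locus spans far more of the scaffold than the 1815 nt CDS, which would be consistent with a gene model containing introns.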

Nucleotide sequence:

ATGGGAATAAGGAAATGGCAGGAAAGAGTGGTAATAAGGAAAGGGCAACTGCCTCTCTCA
CTCATCGCACAAAACGCAGCCACTAAAGGCTACTTCACGCTGATAGTCTGTGAGAGGGTT
GTACAAGCGGAGTCAGACCGTGAGAGCGGCGCTAGTTCGCCGGAGGGTGCGCGGGCGCAC
GCCGACCAGCGCTCGCCGTCGCCGCGCTCGCGAACGCCGCACCAGCACATGAATGGCTCC
ATGGCATCGATGTTCCAAAATCTGCAAAATTTGGCGAACATGCAACAGAGCATGCCGCTG
TCGCAGCAACAAATGTCGCAACAAATGTCACAACAAATGTCGCAGCTGGCTGCCAACCTG
CAGGGTCTCACCTCAATGCCATCCAACCCCGTCATCAACTCGCCTCTCAACCTGAGTGTC
AGTGCCCCAGGCATGGGATCTCCTACCCCAGTTAACAGCAGTATGCTGCCGCCGGCTATG
CCATCACCTATGCCGCAGCTCATCCTGGCCTCTGGACAGCTAGTACAGGGCATACAGGGT
GCACAACTGCTGATACCTACTTCTCAAGGTATAGCGACACAAACAATTCTCACCATACCC
GTAAATCACGTGAACTCCAACGATCAAATGGTAAATCTCGCTCTGAACAATGGCCAAGTG
GTATCCACATCTCTGGCCAATTTACAAGCGATGGCCCAACCCCACCAACTACTAAACTCC
AACCCGCAACAAACGTCCAACATTCGGCCGAACATGCTAAATCCAACACTATCGAACGCG
CTCCTCAATCCGGGACTGCCAAACTTTTTATCCAACGGAGCGACAAATGCGCAAGAACTG
CTGCAAGCATTACAACAGCCGCAAGGGAATCACAATCTCCTACAGACAGTTCAACAAAAC
AATATGCCACAACAAATGCAAGGCCGAAGATCGTCCTCCCCACGACCTGACAGACATTAC
AAGGAGAGGGAAAGCTTCGAGCGGTTCGCAGGAGGATCGAGGGAAAGGAACGAGAGAGAA
AACTCGGGAGCGGCAGCCCTGAATAGTATTAATAGGCTCGCAGCATCCAACGGCGAGATT
ACCATAACAACGTCCCATTCAACAGCGGGTACAACAAGCAGTGCGGGTAGTGTTGGCAGT
GCGCCTACAGCATCACCTCACGCTCCCGTCAAGCTCTCACCAAGCTCTGTCAAGTCACCA
GCACATGACGAGGACCTGTTGGCCGATTCACCTAATCAGCCAACTATAAGTCAGTCGACG
GGCAACGTTGTTGATGGAATCAATCTAGAAGACATCAAGGAGTTCGCAAAGGCATTCAAA
TTACGACGACTAGGCCTAGGGCTGACGCAGACCCAGGTCGGACAAGCGCTTTCCGTCACC
GAAGGGCCCGCTTACAGTCAGAGCGCCATTTGCAGTGCCCTGGCTTCGCAGATGCTAGCA
GCTCAGCTGTCTTCACAGCAACAAAACATATTTGAGAAATTGGATATAACTCCAAAAAGT
GCGCAGAAAATCAAACCGGTGCTTGAACGTTGGATGAAGGAAGCTGAAGAGAGGTACGCG
TCCGGTCAGAACCATCTAACGGATTTCATAGGCATGGAGCCGAGCAAGAAACGCAAACGA
CGGACGTCCTTCACGCCGCAGGCTCTCGAACTACTCAACGCTCACTTCGAACGAAACACG
CACCCATCTGGAACAGAAATAACCGGTCTGGCTCACCAGCTCGGCTACGAGCGGGAGGTC
ATCAGAATATGGTTCTGCAACAAACGACAGGCTTTAAAAAACACCGTGCGAATGATGTCC
AAAGGGATGGTCTAA
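
The CDS Length field can be cross-checked against the sequence itself. The sketch below is one possible sanity check, not part of the original annotation pipeline. It strips line breaks, then reports the length, whether it is a multiple of three, and whether the sequence starts with ATG and ends with a stop codon. The function name check_cds and the toy input are illustrative only.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def check_cds(seq: str) -> dict:
    """Basic structural checks on a coding sequence given as plain text."""
    # Drop whitespace and line breaks so a 60-column wrapped sequence can be pasted in.
    cds = "".join(seq.split()).upper()
    # Split into complete codons; any trailing partial codon is ignored here.
    codons = [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]
    return {
        "length": len(cds),
        "multiple_of_three": len(cds) % 3 == 0,
        "starts_with_atg": cds.startswith("ATG"),
        "ends_with_stop": cds[-3:] in STOP_CODONS,
        # Stop codons found before the final codon.
        "internal_stops": sum(c in STOP_CODONS for c in codons[:-1]),
    }

# Toy input: the first four codons and the last line of the record,
# joined for illustration only (this is not a real transcript).
example = """ATGGGAATAAGG
AAAGGGATGGTCTAA"""
report = check_cds(example)
```

Pasting the full nucleotide sequence above in place of the toy input should give a length that matches the CDS Length field.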

Protein sequence:

MGIRKWQERVVIRKGQLPLSLIAQNAATKGYFTLIVCERVVQAESDRESGASSPEGARAH
ADQRSPSPRSRTPHQHMNGSMASMFQNLQNLANMQQSMPLSQQQMSQQMSQQMSQLAANL
QGLTSMPSNPVINSPLNLSVSAPGMGSPTPVNSSMLPPAMPSPMPQLILASGQLVQGIQG
AQLLIPTSQGIATQTILTIPVNHVNSNDQMVNLALNNGQVVSTSLANLQAMAQPHQLLNS
NPQQTSNIRPNMLNPTLSNALLNPGLPNFLSNGATNAQELLQALQQPQGNHNLLQTVQQN
NMPQQMQGRRSSSPRPDRHYKERESFERFAGGSRERNERENSGAAALNSINRLAASNGEI
TITTSHSTAGTTSSAGSVGSAPTASPHAPVKLSPSSVKSPAHDEDLLADSPNQPTISQST
GNVVDGINLEDIKEFAKAFKLRRLGLGLTQTQVGQALSVTEGPAYSQSAICSALASQMLA
AQLSSQQQNIFEKLDITPKSAQKIKPVLERWMKEAEERYASGQNHLTDFIGMEPSKKRKR
RTSFTPQALELLNAHFERNTHPSGTEITGLAHQLGYEREVIRIWFCNKRQALKNTVRMMS
KGMV
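
The protein sequence corresponds to a codon-by-codon translation of the nucleotide sequence. The sketch below illustrates that relationship, assuming the standard nuclear genetic code applies (the record does not name a translation table). The helper translate is hypothetical, and unknown codons are reported as X.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGGAATAAGGAAATGGCAGGAAAGAGTGGTAATAAGGAAAGGGCAACTGCCTCTCTCA"
last_line = "AAAGGGATGGTCTAA"
n_terminus = translate(first_line)
c_terminus = translate(last_line)
```

The two fragments translate to the first 20 residues and the final 4 residues of the protein sequence listed above.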