New model in OGS2.0 | DPOGS214084  |
---|---|
Genomic Position | scaffold380:- 85062-106326 |
CDS Length | 1815 |
Paired RNAseq reads   | 927 |
Single RNAseq reads   | 2741 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA005253 (4e-71) |
Best Drosophila hit   | pou domain motif 3, isoform A (1e-94) |
Best Human hit | POU domain, class 6, transcription factor 2 isoform 2 (3e-69) |
Best NR hit (blastp)   | PREDICTED: similar to CG11641-PA [Apis mellifera] (9e-142) |
Best NR hit (blastx)   | pou domain motif 3, isoform B [Drosophila melanogaster] (5e-90) |
GeneOntology terms    | GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0005634 nucleus; GO:0006355 regulation of transcription, DNA-dependent; GO:0043565 sequence-specific DNA binding |
InterPro families    | IPR013847 POU; IPR010982 Lambda repressor-like, DNA-binding; IPR009057 Homeodomain-like; IPR000327 POU-specific; IPR001356 Homeobox; IPR012287 Homeodomain-related |
Orthology group | MCL12932 |
Nucleotide sequence:
ATGGGAATAAGGAAATGGCAGGAAAGAGTGGTAATAAGGAAAGGGCAACTGCCTCTCTCA
CTCATCGCACAAAACGCAGCCACTAAAGGCTACTTCACGCTGATAGTCTGTGAGAGGGTT
GTACAAGCGGAGTCAGACCGTGAGAGCGGCGCTAGTTCGCCGGAGGGTGCGCGGGCGCAC
GCCGACCAGCGCTCGCCGTCGCCGCGCTCGCGAACGCCGCACCAGCACATGAATGGCTCC
ATGGCATCGATGTTCCAAAATCTGCAAAATTTGGCGAACATGCAACAGAGCATGCCGCTG
TCGCAGCAACAAATGTCGCAACAAATGTCACAACAAATGTCGCAGCTGGCTGCCAACCTG
CAGGGTCTCACCTCAATGCCATCCAACCCCGTCATCAACTCGCCTCTCAACCTGAGTGTC
AGTGCCCCAGGCATGGGATCTCCTACCCCAGTTAACAGCAGTATGCTGCCGCCGGCTATG
CCATCACCTATGCCGCAGCTCATCCTGGCCTCTGGACAGCTAGTACAGGGCATACAGGGT
GCACAACTGCTGATACCTACTTCTCAAGGTATAGCGACACAAACAATTCTCACCATACCC
GTAAATCACGTGAACTCCAACGATCAAATGGTAAATCTCGCTCTGAACAATGGCCAAGTG
GTATCCACATCTCTGGCCAATTTACAAGCGATGGCCCAACCCCACCAACTACTAAACTCC
AACCCGCAACAAACGTCCAACATTCGGCCGAACATGCTAAATCCAACACTATCGAACGCG
CTCCTCAATCCGGGACTGCCAAACTTTTTATCCAACGGAGCGACAAATGCGCAAGAACTG
CTGCAAGCATTACAACAGCCGCAAGGGAATCACAATCTCCTACAGACAGTTCAACAAAAC
AATATGCCACAACAAATGCAAGGCCGAAGATCGTCCTCCCCACGACCTGACAGACATTAC
AAGGAGAGGGAAAGCTTCGAGCGGTTCGCAGGAGGATCGAGGGAAAGGAACGAGAGAGAA
AACTCGGGAGCGGCAGCCCTGAATAGTATTAATAGGCTCGCAGCATCCAACGGCGAGATT
ACCATAACAACGTCCCATTCAACAGCGGGTACAACAAGCAGTGCGGGTAGTGTTGGCAGT
GCGCCTACAGCATCACCTCACGCTCCCGTCAAGCTCTCACCAAGCTCTGTCAAGTCACCA
GCACATGACGAGGACCTGTTGGCCGATTCACCTAATCAGCCAACTATAAGTCAGTCGACG
GGCAACGTTGTTGATGGAATCAATCTAGAAGACATCAAGGAGTTCGCAAAGGCATTCAAA
TTACGACGACTAGGCCTAGGGCTGACGCAGACCCAGGTCGGACAAGCGCTTTCCGTCACC
GAAGGGCCCGCTTACAGTCAGAGCGCCATTTGCAGTGCCCTGGCTTCGCAGATGCTAGCA
GCTCAGCTGTCTTCACAGCAACAAAACATATTTGAGAAATTGGATATAACTCCAAAAAGT
GCGCAGAAAATCAAACCGGTGCTTGAACGTTGGATGAAGGAAGCTGAAGAGAGGTACGCG
TCCGGTCAGAACCATCTAACGGATTTCATAGGCATGGAGCCGAGCAAGAAACGCAAACGA
CGGACGTCCTTCACGCCGCAGGCTCTCGAACTACTCAACGCTCACTTCGAACGAAACACG
CACCCATCTGGAACAGAAATAACCGGTCTGGCTCACCAGCTCGGCTACGAGCGGGAGGTC
ATCAGAATATGGTTCTGCAACAAACGACAGGCTTTAAAAAACACCGTGCGAATGATGTCC
AAAGGGATGGTCTAA
Protein sequence:
MGIRKWQERVVIRKGQLPLSLIAQNAATKGYFTLIVCERVVQAESDRESGASSPEGARAH
ADQRSPSPRSRTPHQHMNGSMASMFQNLQNLANMQQSMPLSQQQMSQQMSQQMSQLAANL
QGLTSMPSNPVINSPLNLSVSAPGMGSPTPVNSSMLPPAMPSPMPQLILASGQLVQGIQG
AQLLIPTSQGIATQTILTIPVNHVNSNDQMVNLALNNGQVVSTSLANLQAMAQPHQLLNS
NPQQTSNIRPNMLNPTLSNALLNPGLPNFLSNGATNAQELLQALQQPQGNHNLLQTVQQN
NMPQQMQGRRSSSPRPDRHYKERESFERFAGGSRERNERENSGAAALNSINRLAASNGEI
TITTSHSTAGTTSSAGSVGSAPTASPHAPVKLSPSSVKSPAHDEDLLADSPNQPTISQST
GNVVDGINLEDIKEFAKAFKLRRLGLGLTQTQVGQALSVTEGPAYSQSAICSALASQMLA
AQLSSQQQNIFEKLDITPKSAQKIKPVLERWMKEAEERYASGQNHLTDFIGMEPSKKRKR
RTSFTPQALELLNAHFERNTHPSGTEITGLAHQLGYEREVIRIWFCNKRQALKNTVRMMS
KGMV
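
The CDS length and the protein sequence above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a CDS that ends with a stop codon. The names `translate` and `expected_protein_length` are hypothetical helpers introduced for illustration and are not part of the database.

```python
# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose final codon is a stop (assumption)."""
    return cds_length // 3 - 1


# First 18 nt and last 15 nt of the nucleotide sequence listed above.
print(translate("ATGGGAATAAGGAAATGG"))   # MGIRKW
print(translate("AAAGGGATGGTCTAA"))      # KGMV
print(expected_protein_length(1815))     # 604
```

Under these assumptions, the listed CDS length of 1815 nt corresponds to 604 residues plus one stop codon, which agrees with the protein sequence shown (ten lines of 60 residues plus `KGMV`).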