New model in OGS2.0 | DPOGS202866  |
---|---|
Genomic Position | scaffold30:+ 268617-275571 |
CDS Length | 1377 |
Paired RNAseq reads   | 144 |
Single RNAseq reads   | 449 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA010868 (2e-47) |
Best Drosophila hit   | nubbin, isoform D (1e-55) |
Best Human hit | POU domain, class 2, transcription factor 1 isoform 1 (7e-60) |
Best NR hit (blastp)   | nubbin [Cupiennius salei] (2e-78) |
Best NR hit (blastx)   | nubbin [Culex quinquefasciatus] (3e-62) |
GeneOntology terms    | GO:0006355 regulation of transcription, DNA-dependent; GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0043565 sequence-specific DNA binding; GO:0005634 nucleus |
InterPro families    | IPR000327 POU-specific; IPR001356 Homeobox; IPR010982 Lambda repressor-like, DNA-binding; IPR009057 Homeodomain-like; IPR012287 Homeodomain-related; IPR013847 POU; IPR017970 Homeobox, conserved site |
Orthology group | MCL13084 |
Nucleotide sequence:
ATGGTGTTGTCGGCGTGCGTGCGCGCCACCGCCCGGACCTACTGCACCTCCTATCAAGAA
TCTGGCGACCTCATTAAATCTCCATCGCCGCCACCGAGCCACGATGGGTCCGCGAGCGGG
GAAGGCGAGGGCGAGGGCGAGGGTAGCGGAGACGAGAGCGTCCCAGCTGGCGGAGGAGCC
CGCGCCCCGCACGCTCCTTCCTCCGCCGCCTCCGCTGCCTCTGCCGCCACGGCGGCCGTG
CTGGTGAGTCTGCTTCGAGACCACCACTACCACCTCTCCCCGAGTGCCGCTGGTGCCGGC
AATGCTCTAGCCCAACTGCAACATTTGTTGCTTACACAGCATGGTGCTCATTCTCTACTT
TTACATACGCAAGTGCAGCAGGCCGTTGCCCAGGCAGCAGCACAACAGCTTCAGCAACTT
CAAGCGAGAGCCAACGCTGGAGCACCAACTGTTGACACTCGTTCTCCGTCTCCGCGCGGC
TCTCCTCCTGGCGCAGCCGCCTTCTTGACTCCGCTCACGCCAGGTCCGGGACGGGCACGC
TCCCCCCATGCGCATCACGCGCACGCACATGCGCACGCGCACTCGCTGCATGCGCACTCG
CCAGTTGCGCACGCTCAGTCCCAGGCAGGACACTCGCCGCTGCACGCACACGCGCACAAA
CCTCGCGCGCTCGACCCAGCCGACGACACAGCCGACCTGGAAGAGCTCGAACACTTCGCC
AAAACATTCAAACAACGAAGAATTAAACTCGGTTTCACTCAGGGCGATGTGGGGCTCGCC
ATGGGCAAATTGTACGGAAATGATTTCTCCCAGACGACAATATCGCGCTTCGAAGCGTTG
AATCTTAGTTTCAAGAACATGTGCAAACTGAAGCCACTACTACAGAAGTGGTTGGAGGAC
GCGGACTCCTCGCTGAGCGGCAGCGGTGGCGGAGCGTCTCTAGGCGCCGGCCTGGCTGAG
GCTGTGGGGAGACGCCGCAAGAAGCGCACGTCCATAGAATCAGGAGTCAGGGTAGCACTC
GAGAAGGCTTTTCTCCACAACCCGAAACCGACCAGCGAGGAAATATCGGCGTTAGCCGAC
AGTCTCTGTATGGAGAAGGAAGTAGTACGCGTTTGGTTTTGCAATCGCAGACAGAAGGAG
AAGCGTATAAACCCACCCGCAGGCGAGGCGGGCGGAGCGTCATCTCCGGGTGGCGGTGGG
TCGTTGCTGCCCCTGTCTCACCCCCTAGCACATGCTTTGCCTGCGCATGCTCACGGACAC
GCCCACGGACATGGTCTGCAGCACGCGCACGCACACCCCGCTGCCGCAGCGCACGCCGCC
CTGCAGGCTGCGGCGCTGCAGCCGCTGGCTCTCCTCGCGCGCCCCCCACGCGACTGA
Protein sequence:
MVLSACVRATARTYCTSYQESGDLIKSPSPPPSHDGSASGEGEGEGEGSGDESVPAGGGA
RAPHAPSSAASAASAATAAVLVSLLRDHHYHLSPSAAGAGNALAQLQHLLLTQHGAHSLL
LHTQVQQAVAQAAAQQLQQLQARANAGAPTVDTRSPSPRGSPPGAAAFLTPLTPGPGRAR
SPHAHHAHAHAHAHSLHAHSPVAHAQSQAGHSPLHAHAHKPRALDPADDTADLEELEHFA
KTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFEALNLSFKNMCKLKPLLQKWLED
ADSSLSGSGGGASLGAGLAEAVGRRRKKRTSIESGVRVALEKAFLHNPKPTSEEISALAD
SLCMEKEVVRVWFCNRRQKEKRINPPAGEAGGASSPGGGGSLLPLSHPLAHALPAHAHGH
AHGHGLQHAHAHPAAAAHAALQAAALQPLALLARPPRD
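
The nucleotide and protein sequences above can be cross-checked against each other and against the listed CDS length. The following is a minimal sketch, assuming the CDS uses the standard genetic code and contains only unambiguous bases (A, C, G, T); the helper names `clean`, `translate`, and `cds_matches_protein` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. Assumption: this record uses it.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove whitespace and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = clean(cds)
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def cds_matches_protein(cds, protein):
    """Return True if the translated CDS equals the given protein sequence."""
    return translate(cds) == clean(protein)


# First four and last three codons of the record above.
print(translate("ATGGTGTTGTCG"))
print(translate("CGCGACTGA"))
```

Pasting the full nucleotide block and the full protein block into `cds_matches_protein` checks the whole record. A CDS length of 1377 corresponds to 459 codons, which should equal the number of residues in the protein sequence plus one stop codon.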