DPGLEAN09061 in OGS1.0

New model in OGS2.0DPOGS202866 
Genomic Positionscaffold30:+ 268617-275571
See gene structure
CDS Length1377
Paired RNAseq reads  144
Single RNAseq reads  449
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010868 (2e-47)
Best Drosophila hit  nubbin, isoform D (1e-55)
Best Human hitPOU domain, class 2, transcription factor 1 isoform 1 (7e-60)
Best NR hit (blastp)  nubbin [Cupiennius salei] (2e-78)
Best NR hit (blastx)  nubbin [Culex quinquefasciatus] (3e-62)
GeneOntology terms


  
GO:0006355 regulation of transcription, DNA-dependent
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0043565 sequence-specific DNA binding
GO:0005634 nucleus
InterPro families





  
IPR000327 POU-specific
IPR001356 Homeobox
IPR010982 Lambda repressor-like, DNA-binding
IPR009057 Homeodomain-like
IPR012287 Homeodomain-related
IPR013847 POU
IPR017970 Homeobox, conserved site
Orthology groupMCL13084

Nucleotide sequence:

ATGGTGTTGTCGGCGTGCGTGCGCGCCACCGCCCGGACCTACTGCACCTCCTATCAAGAA
TCTGGCGACCTCATTAAATCTCCATCGCCGCCACCGAGCCACGATGGGTCCGCGAGCGGG
GAAGGCGAGGGCGAGGGCGAGGGTAGCGGAGACGAGAGCGTCCCAGCTGGCGGAGGAGCC
CGCGCCCCGCACGCTCCTTCCTCCGCCGCCTCCGCTGCCTCTGCCGCCACGGCGGCCGTG
CTGGTGAGTCTGCTTCGAGACCACCACTACCACCTCTCCCCGAGTGCCGCTGGTGCCGGC
AATGCTCTAGCCCAACTGCAACATTTGTTGCTTACACAGCATGGTGCTCATTCTCTACTT
TTACATACGCAAGTGCAGCAGGCCGTTGCCCAGGCAGCAGCACAACAGCTTCAGCAACTT
CAAGCGAGAGCCAACGCTGGAGCACCAACTGTTGACACTCGTTCTCCGTCTCCGCGCGGC
TCTCCTCCTGGCGCAGCCGCCTTCTTGACTCCGCTCACGCCAGGTCCGGGACGGGCACGC
TCCCCCCATGCGCATCACGCGCACGCACATGCGCACGCGCACTCGCTGCATGCGCACTCG
CCAGTTGCGCACGCTCAGTCCCAGGCAGGACACTCGCCGCTGCACGCACACGCGCACAAA
CCTCGCGCGCTCGACCCAGCCGACGACACAGCCGACCTGGAAGAGCTCGAACACTTCGCC
AAAACATTCAAACAACGAAGAATTAAACTCGGTTTCACTCAGGGCGATGTGGGGCTCGCC
ATGGGCAAATTGTACGGAAATGATTTCTCCCAGACGACAATATCGCGCTTCGAAGCGTTG
AATCTTAGTTTCAAGAACATGTGCAAACTGAAGCCACTACTACAGAAGTGGTTGGAGGAC
GCGGACTCCTCGCTGAGCGGCAGCGGTGGCGGAGCGTCTCTAGGCGCCGGCCTGGCTGAG
GCTGTGGGGAGACGCCGCAAGAAGCGCACGTCCATAGAATCAGGAGTCAGGGTAGCACTC
GAGAAGGCTTTTCTCCACAACCCGAAACCGACCAGCGAGGAAATATCGGCGTTAGCCGAC
AGTCTCTGTATGGAGAAGGAAGTAGTACGCGTTTGGTTTTGCAATCGCAGACAGAAGGAG
AAGCGTATAAACCCACCCGCAGGCGAGGCGGGCGGAGCGTCATCTCCGGGTGGCGGTGGG
TCGTTGCTGCCCCTGTCTCACCCCCTAGCACATGCTTTGCCTGCGCATGCTCACGGACAC
GCCCACGGACATGGTCTGCAGCACGCGCACGCACACCCCGCTGCCGCAGCGCACGCCGCC
CTGCAGGCTGCGGCGCTGCAGCCGCTGGCTCTCCTCGCGCGCCCCCCACGCGACTGA

Protein sequence:

MVLSACVRATARTYCTSYQESGDLIKSPSPPPSHDGSASGEGEGEGEGSGDESVPAGGGA
RAPHAPSSAASAASAATAAVLVSLLRDHHYHLSPSAAGAGNALAQLQHLLLTQHGAHSLL
LHTQVQQAVAQAAAQQLQQLQARANAGAPTVDTRSPSPRGSPPGAAAFLTPLTPGPGRAR
SPHAHHAHAHAHAHSLHAHSPVAHAQSQAGHSPLHAHAHKPRALDPADDTADLEELEHFA
KTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFEALNLSFKNMCKLKPLLQKWLED
ADSSLSGSGGGASLGAGLAEAVGRRRKKRTSIESGVRVALEKAFLHNPKPTSEEISALAD
SLCMEKEVVRVWFCNRRQKEKRINPPAGEAGGASSPGGGGSLLPLSHPLAHALPAHAHGH
AHGHGLQHAHAHPAAAAHAALQAAALQPLALLARPPRD