DPGLEAN08622 in OGS1.0

New model in OGS2.0DPOGS212701 
Genomic Positionscaffold3:- 439318-459018
See gene structure
CDS Length1095
Paired RNAseq reads  402
Single RNAseq reads  1126
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013132 (1e-109)
Best Drosophila hit  optomotor-blind-related-gene-1 (1e-77)
Best Human hitT-box transcription factor TBX1 isoform A (1e-74)
Best NR hit (blastp)  T-box protein, putative [Pediculus humanus corporis] (2e-92)
Best NR hit (blastx)  T-box protein, putative [Pediculus humanus corporis] (3e-83)
GeneOntology terms


  
GO:0005634 nucleus
GO:0045944 positive regulation of transcription from RNA polymerase II promoter
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0016563 transcription activator activity
InterPro families

  
IPR001699 Transcription factor, T-box
IPR008967 p53-like transcription factor, DNA-binding
IPR018186 Transcription factor, T-box, conserved site
Orthology groupMCL16076

Nucleotide sequence:

ATGGAGAGCCAAGAGTGGCGCGAGGACTGGCACCAGCCGAGACAGATGGACGGCGTTGTG
TTCAATCAGTTCCCGCGAGGTGGTTTCCAACTGCAGGCTTTAGCTGAGCGAGTCAGCCGC
GACCAGGAACACCTGCCCGTACTGCCGCCGCTTTACAACGTAGTACGAGACACTGCTAGC
TGTTCGCGGAGTACATACTCTCCGCTACAGGCGCTCAGTGAGAGCACGTGCCGAGACCAC
GCTGCGCCCGCGCCGCCCCCGCCCCCGCCACGTGTTCAGGAGGCCCACTCGCAGGTCCCG
AATCAAAGCGTGACCCTCCACCCAGCTGTAGCTCGTTGCAGCGCTTCCTTGGAGCTATCA
GCGTTATGGCGGAGCTTCCACGAGCTCGGGACGGAGATGATAGTGACGAAGGCCGGCAGA
CGGATGTTCCCAGCGCTCCAGGCGAGGCTCTCCGGTCTACTGCCCAATGCTGATTATCTA
CTGCTGGTGGATTTCGTACCGCTGGACGACAAAAGATACAGATACGCCTTCCACAGTTCG
AGCTGGGTCGTGGCTGGCAAGGCCGACCCAGTGTCTCCGCCTCGTATCCACGTACACCCT
GACTCGCCAGCGGCCGGAGCACACTGGATGAGACAGCTCGTCTCTTTCGACAAACTTAAA
TTGACAAACAATCAGTTGGACGACAATGGACACATAATCCTGAACTCGATGCACCGCTAC
CAGCCCCGGCTGCACGTGGTGTTCCTACCCGGAGACGGGCAGAGCGCCCCGGGGACGGTC
CCCTACAGGACCTTCATCTTCCCGGAGACAGGGTTCACAGCGGTCACCGCCTATCAGAAT
CATCGCATAACTCAATTGAAGATAGCCAGCAATCCGTTCGCTAAAGGCTTCAGAGACTGC
GATCCCGACGACTGTCCACCAGAGCCTGGCGGACAACGGGCCCCTCGGAGGCGCGAGGAG
GGTCCGCTAGCGCAGCCCTACGCCGCTGAACCCTCGCGGCCGCCCGGCAACATGCCGCCC
CACGCGCACACCGTAAGATACCAACCTCACTCAAGTCACAACAGCTCGTACACAGCGTAT
TACGCTCACAGATAA

Protein sequence:

MESQEWREDWHQPRQMDGVVFNQFPRGGFQLQALAERVSRDQEHLPVLPPLYNVVRDTAS
CSRSTYSPLQALSESTCRDHAAPAPPPPPPRVQEAHSQVPNQSVTLHPAVARCSASLELS
ALWRSFHELGTEMIVTKAGRRMFPALQARLSGLLPNADYLLLVDFVPLDDKRYRYAFHSS
SWVVAGKADPVSPPRIHVHPDSPAAGAHWMRQLVSFDKLKLTNNQLDDNGHIILNSMHRY
QPRLHVVFLPGDGQSAPGTVPYRTFIFPETGFTAVTAYQNHRITQLKIASNPFAKGFRDC
DPDDCPPEPGGQRAPRRREEGPLAQPYAAEPSRPPGNMPPHAHTVRYQPHSSHNSSYTAY
YAHR