DPGLEAN21075 in OGS1.0

New model in OGS2.0DPOGS200612 
Genomic Positionscaffold173:+ 123317-142478
See gene structure
CDS Length1377
Paired RNAseq reads  15
Single RNAseq reads  43
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008971 (3e-128)
Best Drosophila hit  RunxA (4e-104)
Best Human hitrunt-related transcription factor 1 isoform AML1b (1e-53)
Best NR hit (blastp)  PREDICTED: similar to CG34145 CG34145-PA [Tribolium castaneum] (1e-161)
Best NR hit (blastx)  PREDICTED: similar to CG34145 CG34145-PA [Tribolium castaneum] (2e-150)
GeneOntology terms


  
GO:0005634 nucleus
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0005524 ATP binding
InterPro families


  
IPR000040 Acute myeloid leukemia 1 protein (AML 1)/Runt
IPR013524 Acute myeloid leukemia 1 (AML 1)/Runt
IPR008967 p53-like transcription factor, DNA-binding
IPR012346 p53/RUNT-type transcription factor, DNA-binding domain
Orthology groupMCL15325

Nucleotide sequence:

ATGCACTTGACGAGTGCCTCCAGCGGGGCCGTGTCGCCGGACACCAGCTCCGCACTCCTC
CACGAGACATACACCAAGATGACATCAGATATACTCGCGGAGAGGACGCTGGGTGATTTT
CTATCGGAGCACCCGGGGGAATTGGTGAGGACTGGTAGTCCACACTTCGTATGCACAGTG
CTACCTCCTCACTGGCGATCCAATAAAACGCTGCCGGTGGCGTTTAAGGTGGTAGCGCTC
GGTGACGTTGGAGACGGAACACTGGTGACTGTCAGGGCTGGTAACGATGAGAACTGCTGC
GCTGAACTCCGTAACAGCTCGGCGGTCATGAAGAACCAAGTGGCAAAGTTCAACGATTTG
AGATTCGTCGGCCGTAGTGGTCGCGGGAAATCATTCACATTGACAATAACGATATCGACG
ACGCCTCCGCAAGTCACAACCTACAATAAGGCTATCAAGGTCACCGTGGACGGACCCAGG
GAACCGCGGTCGAAAACCAGGCAGCAGCAGCAATTTCATTTCGCATTCGGTCAACGGCCG
TTTCCTTTCCCACCAGATCCTCTGGGAGGATTCCGGATGCCGCCGATTACTACATGTCAG
AATATGAGTCAATTTGGTTTGAGTTCGAGTAACTCTCATTGGGGCTATGGTGGTGCCGGC
GCTTACCCAGCATACCTTCCATCCTGTGCGGCTCCAGCGACACAGTTCAATACACCGACA
TTAGGCTTCGCTGGTTCCGTCCCTGAACAAACCCCCACTCAGGATTTCACTAATAACACC
GTTCTACCGGATACGACGGGAGTGGATCTGGACCAACAGCTGTCCGGTCTAGTGGGATCG
TCTCCATCACACCACGGCAGCTTGCTACCTAGATACAACAACAACACAGACTACACGCTA
TCCACCGGCCCACGCTCCCTCAGCGACAATAGCTCGCAACCGGAATCCCCGGTCCAAGAC
GACCTTTTAACTTCAAACACAACAACCAACATCGGTCACAACCATTCGAACTCCAACTTC
TCGCTAATGAGTACTCAGAATGCATCATACGGAAGCAGCAACTGCAACAATTCCCTCTAC
CCCGTTCTACCGGCCAGCCTGCTATACAGTCAATTATACACAGCAGCTAATCAAAGTCAC
AATTTCCATCCGCTCCATTCGAACTCCATCCATTCAACGCAGAATCATCACAACGAACTA
CAAACTATGATGGACCAGATATCATCAACCACGAACCATAGACAGGGTCACGGGCAAGAC
TTGTTGGGTGGGAACTCGTGTGCTGCTGCGGCCGCGAGGGGGGAAGATGGAAGGGTTAGT
TTGGGACAGCGGGGAAATCCCCAACCAGACAGCAACACCGTTTGGCGGCCCTATTGA

Protein sequence:

MHLTSASSGAVSPDTSSALLHETYTKMTSDILAERTLGDFLSEHPGELVRTGSPHFVCTV
LPPHWRSNKTLPVAFKVVALGDVGDGTLVTVRAGNDENCCAELRNSSAVMKNQVAKFNDL
RFVGRSGRGKSFTLTITISTTPPQVTTYNKAIKVTVDGPREPRSKTRQQQQFHFAFGQRP
FPFPPDPLGGFRMPPITTCQNMSQFGLSSSNSHWGYGGAGAYPAYLPSCAAPATQFNTPT
LGFAGSVPEQTPTQDFTNNTVLPDTTGVDLDQQLSGLVGSSPSHHGSLLPRYNNNTDYTL
STGPRSLSDNSSQPESPVQDDLLTSNTTTNIGHNHSNSNFSLMSTQNASYGSSNCNNSLY
PVLPASLLYSQLYTAANQSHNFHPLHSNSIHSTQNHHNELQTMMDQISSTTNHRQGHGQD
LLGGNSCAAAAARGEDGRVSLGQRGNPQPDSNTVWRPY