DPGLEAN11481 in OGS1.0

New model in OGS2.0  DPOGS214939
Genomic Position  scaffold2575:+ 13537-28278
CDS Length  2316
Paired RNAseq reads  5874
Single RNAseq reads  13444
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA004821 (0.0)
Best Drosophila hit  MTA1-like, isoform A (3e-166)
Best Human hit  metastasis-associated protein MTA1 (3e-127)
Best NR hit (blastp)  PREDICTED: similar to MTA1-like CG2244-PB [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to MTA1-like CG2244-PB [Tribolium castaneum] (0.0)
GeneOntology terms
GO:0000118 histone deacetylase complex
GO:0043565 sequence-specific DNA binding
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0008270 zinc ion binding
GO:0006355 regulation of transcription, DNA-dependent
GO:0006911 phagocytosis, engulfment
InterPro families
IPR009057 Homeodomain-like
IPR001005 SANT domain, DNA binding
IPR000679 Zinc finger, GATA-type
IPR007087 Zinc finger, C2H2-type
IPR001025 Bromo adjacent homology (BAH) domain
IPR000949 ELM2 domain
IPR017884 SANT, eukarya
Orthology group  MCL10920

Nucleotide sequence:

ATGGAGTCCCGTCACAAAGACCGTAGATGTCGCTGCCGCTCGAGTCCCGTGTCCGTTGTT
TTAAACGCTATACATGTCTCTACGACGCCAACTCCAAATTTTACATCGACGAAGCAGCGG
GCAAAAACTATGGCAATGGAGGAGGAATCTACGGAGCTGCCTGGTTCTGACGGGCTCGCG
CCCAAACAGCGGCACCAGGCGAAGCAGCGCGAGCTGTTCCTGTCGCGGCACGTGGAGACC
CTCCCAGCCACGCACATCCGCGGCAAGTGTACCGTCACGCTGCTCAACGAGACGGAGTCG
CTGCTCAGCTATCTCAATAAGGATGACGCATTTTTTTATTGTTTAGTATTTGATCCTTCA
CAAAAGACTTTATTAGCAGATAAGGGAGAAATCAGAGTTGGAAGTAAATATCAGACTGAA
GTAACTAATTTATTAAAAGAAGGTGAGATGATTTCTTTAACTAGTTATGATGAAAGTAAC
AAGATCGACCAATTCCTGGTGGTGGCTCGGTCTGTGGGCACCTTCGCCAGGGCATTGGAC
TGCAGCTCCAGTGTTAAACAGCCCTCGCTACACATGTCCGCGGCGGCCGCCAGCAGGGAC
ATAACTCTTTTCCACGCCATGGACACGCTGCACAAGTCCGGGTACAGCATAGAGGCTGCT
CTGTCGTCGCTGGTGCCGGCCTCCGGGCCTGTGCTGTGTCGCGACGAGATGGAGGAGTGG
TCGGCCTCAGAGGCCAACCTGTTCGAGGAGGCGCTCGAGAAATACGGCAAGGACTTCGCT
GATGTACGGAAGGACTTTCTGCCGTGGAAGACGCTGAAGAATCTGGTGGAGTACTACTAC
ATGTGGAAGACGACGGATCGCTACGTGCAACAGAAACGGGTGAAGGCTGTGGAGGCGGAG
TCCAAGCTGAAGCAAGTGTACATTCCCAATTATAACAAACCGAATCCAGCGTTGTTGTCG
AGCGGCGCGGCGGCTATCACGAGCGCGGCGGCCCCCCCGCCTCCGGGGCCCCGCCCGGCC
GGCGTCGCCAACAAGGGAGCCGTGCTGAACGGAGGAACCAACGGCACAGCGGCCGCACCC
ACCATGTGCGCCTCGTGTCAAGTGACAAATTCAAACCAGTGGTACGCCTGGGGACCACAG
CATTTACAGTACAGATTATGTGGCGCTTGCTGGCAGTACTGGAAAAAATATGGAGGACTT
AAGACGGCGGGAGTGTTCGGCGAGAGCGAGGCGGAGGCGGGGCGCGGGGTGCGGGCGGAG
GCCGACGACACAGCACTGTCCGTGTCGCACAGACCGCACCGGTGCTCCGTGGTTAACTGC
GCCAAGGAATTTAAACTGCGCGCTCACCTGGCCCGCCACATGGCGACTGCTCACGGCGGC
GCGGGCGAGGGCGCTCGGCCCGTCATGAAGACCCGAGCCGCCTTCTATCTCCGCGCCTCG
CCCTTCACGAGACTCGCGCGCCGCCTCGCCCGCGCCCTCCGCAGACCCAGGCACTACGCG
CGCTCACCCTTCTCACCGATCAACCTGCACCAGGTCAAACACGAGTGTACGATAGCGATG
GCGGGCGGCGTCGGTGGTGTGGGCGGTGTGGGCGGCGTGGTCCCGGCGGAGGTCCGAGGC
GTCGCTCGTGCCCGCGGGCCCGTGGGCGCGGTAGCGGCCCGACTCGCCGCCGCTCTGGGC
ACGTCCGCGCCTCGAGCCCAGGACTGGCTCACCCTCACCCCGCGCGAACGTCTGCCCACA
CCCAACCACGTCGCCTTCCCCAAGCCGCCCAAGGCCCCAGATGGCAGCCTCATGTACGAG
CGTGTGGTGTCCCGCGCGGAGCTGGAGGCGCGCCGCAGCGAGGCGGCCGCGCCGGCTCTC
AAGCGGCGCGCCTACGACGACATCAACGGCCTCGACAGAGGTTGTGGTGGTAGCGCGCCT
CCCGCCAAGCGACCCAACAAGCATCCGGCGCCCATGCAACGTCCATCACGCGAACAGTAC
GCGGCCATGTGCGCGCGAGCCCAGGCCACGGGACAACCTCTGCCCGCACACGTTTTTGCA
CACGTGAACGGCAAACCGACGAACCTGACCGGCCGCGGCGGTCGTCGCCACGTGATCTCG
TGGATGGACGCTCCGGACGACCTCTACTTCAGAGCCACCGAGACCGCCAAAGCCGCCCGA
CGGACGCTGAGCTGCGGCGAGCTGAGACGCGGCGCCCGCGCTCCGTGGCGCGTGATGCGC
GGGGCGGTGGCCGGCGTGGTGCTGGGCGCGGCGGCGGCGGCGGGCGGCAAGGCGGGCGCC
GCCTCCGCCCCGCTGCAGCTGGTGATCCTCGACTGA

Protein sequence:

MESRHKDRRCRCRSSPVSVVLNAIHVSTTPTPNFTSTKQRAKTMAMEEESTELPGSDGLA
PKQRHQAKQRELFLSRHVETLPATHIRGKCTVTLLNETESLLSYLNKDDAFFYCLVFDPS
QKTLLADKGEIRVGSKYQTEVTNLLKEGEMISLTSYDESNKIDQFLVVARSVGTFARALD
CSSSVKQPSLHMSAAAASRDITLFHAMDTLHKSGYSIEAALSSLVPASGPVLCRDEMEEW
SASEANLFEEALEKYGKDFADVRKDFLPWKTLKNLVEYYYMWKTTDRYVQQKRVKAVEAE
SKLKQVYIPNYNKPNPALLSSGAAAITSAAAPPPPGPRPAGVANKGAVLNGGTNGTAAAP
TMCASCQVTNSNQWYAWGPQHLQYRLCGACWQYWKKYGGLKTAGVFGESEAEAGRGVRAE
ADDTALSVSHRPHRCSVVNCAKEFKLRAHLARHMATAHGGAGEGARPVMKTRAAFYLRAS
PFTRLARRLARALRRPRHYARSPFSPINLHQVKHECTIAMAGGVGGVGGVGGVVPAEVRG
VARARGPVGAVAARLAAALGTSAPRAQDWLTLTPRERLPTPNHVAFPKPPKAPDGSLMYE
RVVSRAELEARRSEAAAPALKRRAYDDINGLDRGCGGSAPPAKRPNKHPAPMQRPSREQY
AAMCARAQATGQPLPAHVFAHVNGKPTNLTGRGGRRHVISWMDAPDDLYFRATETAKAAR
RTLSCGELRRGARAPWRVMRGAVAGVVLGAAAAAGGKAGAASAPLQLVILD