DPGLEAN21536 in OGS1.0

New model in OGS2.0DPOGS202670 
Genomic Positionscaffold1142:- 20978-28269
See gene structure
CDS Length1395
Paired RNAseq reads  297
Single RNAseq reads  919
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001296 (0.0)
Best Drosophila hit  Ets at 98B (7e-71)
Best Human hitSAM pointed domain-containing Ets transcription factor (1e-50)
Best NR hit (blastp)  PREDICTED: similar to Ets at 98B CG5583-PA [Tribolium castaneum] (1e-90)
Best NR hit (blastx)  PREDICTED: similar to Ets at 98B CG5583-PA [Apis mellifera] (3e-89)
GeneOntology terms




  
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005634 nucleus
GO:0003677 DNA binding
GO:0008354 germ cell migration
GO:0006355 regulation of transcription, DNA-dependent
GO:0043565 sequence-specific DNA binding
InterPro families



  
IPR000418 Ets
IPR013761 Sterile alpha motif-type
IPR011991 Winged helix-turn-helix transcription repressor DNA-binding
IPR003118 Sterile alpha motif/pointed
IPR010993 Sterile alpha motif homology
Orthology groupMCL12822

Nucleotide sequence:

ATGCCACAGACGGTGCCTCGGTCCGGGTGCAGCCCACCATCGGCATCCCAAGGAGAGAGG
GCAGTGCCAACATCGCCAGCCGACATTCATCACCTCCTGCGTCTTTTAGGCGCCGAATCA
CCACCAGAACCGATGTTGCACTCTCCACCAACACACACTAAGCCACCACCTCCATATCCA
GAGGACAATAACTATCTTGAACGCATATACGATTTTGAAGGATTCCCCACACCTTCCCCT
TCGTCGGACGAGGGCTCAATACCAACAGTAGCACTACAACCGTCTTCCCCTTATAACTCG
TTTCAATATTCTCCAGTTTTCATCAAAGAGGAACCAAACAGGCTAACAGTGCCAGGGTTC
GCGTCACCTTATCCACTGTCACCATCGGGCTCCTGCGTTTCTTACAGCAGTAACAATCAA
TATTCCTCACCAGTTCCACAACAAGAAGAATATATTAACATCGAAGATCTCCTTAAAGAA
AATCAAATATTACAAGACAGTATCCAACAAAATTATATTACACCTAAAATTGAAGTCGAA
GAACCCAGGGATCATATTCTTTTAAGATCAGCTCTTGAAGATACAACATTCCAAAAAAGA
TTAAACTTAAGGCCCTTTGAATTAGGAAGTGTCAAAATGGAAGAAAGCAGCGGTGGTCCC
GGCGAGGAGGCCCTAGTTGCCCCCGACATTGATCGCGTTCTTTCTATGGCCATAGAACAG
TCAAAGCGAGATGTCGATAACACGTGCACAGTACTGGGTATATCACCAGACCCAATGCAA
TGGAGCTCTAGTGACGTTAAAGCTTGGGTGATGTTCACACTGAGACACTTCAACCTGCCG
ATGGTACCATCCGAGTATTTCGCAATGGACGGAACAGCTCTTGTTGCGCTCACTGAAGAG
GAATTTAATCAAAGGGCTCCACAAGCGGGTAGCACGCTGTACGCGCAACTGGAGATCTGG
AAAGCTGCACGACATGAGGGTTGGAGGAGCCAGTGGACTGAGCAACGGCCGCCTACACCA
GCACCGCCCGCACCTGCCACTGAGGACATGAGCGATGATGATGCAGAATCCATTGTAGCA
AATGTCAGCCAAGGTGGTGGCAAAGTGAAGACTGGTAGCACCCACATCCATCTTTGGCAG
TTCCTCAAAGAACTTTTGGCTTCACCACATATACACGGATCAGCGATACGGTGGTTAGAC
AGAAGTAACGGTGTATTCAAAATTGAAGATTCAGTTCGTGTCGCCAGACTGTGGGGCAAA
AGAAAAAACAGGCCCGCAATGAACTACGACAAACTATCGCGTAGCATTAGACAATACTAC
AAGAAAGGCATCATGAAGAAAACGGAAAGAAGTCAGAGGCTCGTCTATCAGTTCTGTCAT
CCCTACTGTTTATAA

Protein sequence:

MPQTVPRSGCSPPSASQGERAVPTSPADIHHLLRLLGAESPPEPMLHSPPTHTKPPPPYP
EDNNYLERIYDFEGFPTPSPSSDEGSIPTVALQPSSPYNSFQYSPVFIKEEPNRLTVPGF
ASPYPLSPSGSCVSYSSNNQYSSPVPQQEEYINIEDLLKENQILQDSIQQNYITPKIEVE
EPRDHILLRSALEDTTFQKRLNLRPFELGSVKMEESSGGPGEEALVAPDIDRVLSMAIEQ
SKRDVDNTCTVLGISPDPMQWSSSDVKAWVMFTLRHFNLPMVPSEYFAMDGTALVALTEE
EFNQRAPQAGSTLYAQLEIWKAARHEGWRSQWTEQRPPTPAPPAPATEDMSDDDAESIVA
NVSQGGGKVKTGSTHIHLWQFLKELLASPHIHGSAIRWLDRSNGVFKIEDSVRVARLWGK
RKNRPAMNYDKLSRSIRQYYKKGIMKKTERSQRLVYQFCHPYCL