DPGLEAN10188 in OGS1.0

New model in OGS2.0  DPOGS213896
Genomic Position  scaffold1130:- 35786-48198
CDS Length  1419
Paired RNAseq reads  1966
Single RNAseq reads  5918
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA004632 (2e-16)
Best Drosophila hit  mitf, isoform B (2e-45)
Best Human hit  transcription factor E3 (8e-38)
Best NR hit (blastp)  PREDICTED: similar to CG17469-PA.3 [Apis mellifera] (6e-75)
Best NR hit (blastx)  PREDICTED: similar to CG17469-PA.3 [Apis mellifera] (8e-72)
GeneOntology terms
GO:0005634 nucleus
GO:0001745 compound eye morphogenesis
GO:0030528 transcription regulator activity
GO:0045449 regulation of transcription
InterPro families
IPR001092 Helix-loop-helix DNA-binding domain
IPR011598 Helix-loop-helix DNA-binding
Orthology group  MCL12826

Nucleotide sequence:

ATGACCAAGGGCCCTCGGAAAGTGAAGCTAGTCATAGTAGTCAACAATAAAAAAGATCCG
CCCACATTCAAAACCTTAACGCCCACATCCCGCACGCAGCTTAAACAACAGTTGATGAGA
GAGCATGCCCAGGAGCAACTACGGAGGGAATCGTTACAGGTGCGAACAGTTTTGGAGAAT
CCCACCAGGTATCACGTGATCCAGAAGCAGAAGAGCCAGGTGCGCCAGTACCTCAGCGAG
TCATTCACACCACAAACGCAGGTGTCAGCTGTCCGTGGTCCGGTGCAGAGCGCCCCGGAG
CTAAGGTCGTCATCACCAGAACGTGGAACTGTCCTCAGTCCAGGACTATGCTCGGCAGGA
AACTCAGAAACGGATGAATTTCTGGATGACATCCTATCCCTGGATAGCGGGGCTGGTCCC
CTGTCGTCTTCGGAGCCCCCCTCTACAGCCAGCTCCGTGGCCGGGGACTGCGCCCTCTCA
GACGCAGACATGCACGCGCTCGCTAAGGATAGACAGAAGAAAGACAACCATAATATGATC
GAACGCCGCCGTCGTTTCAATATAAACGATAGAATTAAAGAGTTGGGTACCTTACTGCCC
AAAACGAACGATCCCTTCTACGAGGTGATACGGGACGTGCGACCTAACAAGGGGACCATC
CTCAAGAGCAGCGTCGACTACATCAAGTGTCTGCGGGACGAAGTCAACAGGCTCAAGCAG
AGCGAACAGAGGCGGAAACAGATTGAGCTGCACAACCGGAAACTCATGCTGAGGATACAG
GAGTTGGAACGTCTGGCGAGAGTTCATGGACTTCCGGTCAATGAAAGCTGGTCGGCATCA
CAGGAGGACTCGGGGGTCGAAGCCTCCCCGGAATGTTACACTGACAAGAACCCAGTACAC
CAAGAGCCTCCAGCTGTGCAGCCCAAGAGTGAACCAGCGCCGATGGAACTGTCCGATGGA
AGGGACGCCCTTGCAGCACTCACAGCGCTTGACGGTTTGAAGCTGGGCTCATGTTCTCCC
CTGGACCGCGGAGCATCTCTGTCCTTGGACTGCCTGGAACCAGACCTCTGTCTCGACACA
CCTGGAGACCTCTTCCACAAAGATATCAAGCAGATGCGTTTGTCACCCACGGCTGGTCTC
CTTGATGATGAAGCGGTGATGAACCTGGCTCAGATAGAAGACCTCATGGATGACGACTCA
CACAATCCCGTCACACAGGGTGACCCGATGTTGTGTTCGTCGCCGAGCGCGATGGGGCCG
GCGGGAGATTCGTCCTGCGCCATGCTGCACATAGACCTCGCGCTGCACAACACAGACTAC
GGCTCACGATCTCTCCTGTCCGAGCTGAGTGACGGCCTGCCTCTGTTGATGGGTGCTCCG
CCCCCCCGGGCCTGCTTCGACATGGATCTAGGGGCGTAG

Protein sequence:

MTKGPRKVKLVIVVNNKKDPPTFKTLTPTSRTQLKQQLMREHAQEQLRRESLQVRTVLEN
PTRYHVIQKQKSQVRQYLSESFTPQTQVSAVRGPVQSAPELRSSSPERGTVLSPGLCSAG
NSETDEFLDDILSLDSGAGPLSSSEPPSTASSVAGDCALSDADMHALAKDRQKKDNHNMI
ERRRRFNINDRIKELGTLLPKTNDPFYEVIRDVRPNKGTILKSSVDYIKCLRDEVNRLKQ
SEQRRKQIELHNRKLMLRIQELERLARVHGLPVNESWSASQEDSGVEASPECYTDKNPVH
QEPPAVQPKSEPAPMELSDGRDALAALTALDGLKLGSCSPLDRGASLSLDCLEPDLCLDT
PGDLFHKDIKQMRLSPTAGLLDDEAVMNLAQIEDLMDDDSHNPVTQGDPMLCSSPSAMGP
AGDSSCAMLHIDLALHNTDYGSRSLLSELSDGLPLLMGAPPPRACFDMDLGA