DPGLEAN05037 in OGS1.0

New model in OGS2.0DPOGS201662 
Genomic Positionscaffold1555:+ 9414-10970
See gene structure
CDS Length1557
Paired RNAseq reads  647
Single RNAseq reads  2025
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005416 (4e-178)
Best Drosophila hit  germ cell-expressed bHLH-PAS (1e-78)
Best Human hitneuronal PAS domain-containing protein 2 (3e-16)
Best NR hit (blastp)  methoprene-tolerant protein 1 [Bombyx mori] (0.0)
Best NR hit (blastx)  methoprene-tolerant protein 1 [Bombyx mori] (0.0)
GeneOntology terms




  
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0045449 regulation of transcription
GO:0005634 nucleus
GO:0006355 regulation of transcription, DNA-dependent
GO:0004871 signal transducer activity
GO:0007165 signal transduction
InterPro families


  
IPR011598 Helix-loop-helix DNA-binding
IPR000014 PAS
IPR001092 Helix-loop-helix DNA-binding domain
IPR013655 PAS fold-3
Orthology groupMCL16514

Nucleotide sequence:

ATGGCATCTTCTGGAGCAGCTAGTACTCGTCTCGGAGTGCATTCCGGGATGACAAACCAG
GTTGGTCTTTCTAATCCAGATTTTGGAATGGCCGTAGCCGTACCACAAACATTTCAATTA
CAAACTGTAAGTTGCGGCGACTCTTCGTCGGATGCTAATTCGCCTGACATACCAATAGTC
AAAGCAAGTGATAGACAGTCTCGTATAATAGCTGAGAAACACCGCAGGAGTCAGTTTAAC
TCTCAGATAGCTCAAATGATGTCTCTGTTGTCTGACATTGTTCATTCCCAACGAAAAGTT
GATAAGACCAGTGTTTTGCGTCTTGCAGCGACGAAACTGAGGAATGAACATGTTTTCGGT
GATTCGATCAAATGCTGTCACCTCGAGACATGGTCCTCTGCCATATTAAAATTTTTTGAT
CTCATTGGGGGCTTCATGATTGCTATAACTTGCAAAGGTCGTATTTGCAATGTTTCACCA
AATGTCCAAGATAAGTTAGGATATTGTCACATAGATCTCCTTAGTCAAGATATTTATAAC
TATGTTCACAATGATGACAAAGAAATTTTAAAATTACATATATTTCCTCCTGAGTTACAC
ACAGGCTGTGATCCAAAACTTTTAGAACAACATCATACCTTTCACATTCGCATCATGAGA
GCAGGGGCCCGGTCAGATCCTCCGAGATATGAGCGTTGTAGAATTGATGGAGTGTTACGT
CGATCTGATCATGCTACATCTGAATGTGTTCAAGATGAACAAACTATAAGAAGACAAAGA
GTAAGAAATCATCGTACATTTTCATCAAGCGGGAATGATTATGTTTTCATTGCTATGGTT
CATGTTTTATCAAACAACTTACCAGCAAGGATGCTGCCTCCCACGGCATATTCTGAGTAT
TGGACGAGGCATTTGATAGATGGAAGAATTGTGCAATGTGACCAGGGTATCTCACTGGCT
GCAGGTTACATGACTGAAGAGGTAACTGGAGCATCTGCATTTGTTTTCATGCACAAAGAG
GATGTCCGTTGGGTTGTCTGTATATTACGCCAAATGTATGACCAAAGTAGGGAGTTTGGA
GAGTCTTATTACAGACTTATGTCCAGATCGGGGCATTTTTTATATATGAGATCAAGAGGT
TACCTTGAAATAGATAAAAAAACAAAAAAAGTACAAAGTTTTGTATGTGTTAATAGTGTT
ATAGGGGAAGATTATGGTAGAAAGATGATGGAAGAAATGAAGAGGAAATTCTCTGTCATG
GTGAACACTGAGAGAACAGAAAATGAAGTAGTGGCAGCTATAGATGAAGCTCCTGTTGAA
CATCCTAAGCGTTTGGAGAGAATTGTAATGCATTTAGTTGAACCACCAACAAGCGAAAGT
GTAGAGGAATTCAAATTAGTACCACCATCGAGAGAAAACATAATCTCGGCGATTAAAAAT
AGTGAGAAAGTTGTACAAGAAACCGGTGTAAGGTTTGACAATCGCAAAAGGAAAAACTCA
GACAGTGATGACAACAGTGAACAATTGAAACGACACAGTGGAATATCAGAATGTTAG

Protein sequence:

MASSGAASTRLGVHSGMTNQVGLSNPDFGMAVAVPQTFQLQTVSCGDSSSDANSPDIPIV
KASDRQSRIIAEKHRRSQFNSQIAQMMSLLSDIVHSQRKVDKTSVLRLAATKLRNEHVFG
DSIKCCHLETWSSAILKFFDLIGGFMIAITCKGRICNVSPNVQDKLGYCHIDLLSQDIYN
YVHNDDKEILKLHIFPPELHTGCDPKLLEQHHTFHIRIMRAGARSDPPRYERCRIDGVLR
RSDHATSECVQDEQTIRRQRVRNHRTFSSSGNDYVFIAMVHVLSNNLPARMLPPTAYSEY
WTRHLIDGRIVQCDQGISLAAGYMTEEVTGASAFVFMHKEDVRWVVCILRQMYDQSREFG
ESYYRLMSRSGHFLYMRSRGYLEIDKKTKKVQSFVCVNSVIGEDYGRKMMEEMKRKFSVM
VNTERTENEVVAAIDEAPVEHPKRLERIVMHLVEPPTSESVEEFKLVPPSRENIISAIKN
SEKVVQETGVRFDNRKRKNSDSDDNSEQLKRHSGISEC