DPGLEAN04621 in OGS1.0

New model in OGS2.0DPOGS203676 
Genomic Positionscaffold120:- 312422-318945
See gene structure
CDS Length1926
Paired RNAseq reads  2717
Single RNAseq reads  7047
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003472 (4e-09)
Best Drosophila hit  tango (1e-169)
Best Human hitaryl hydrocarbon receptor nuclear translocator isoform 4 (3e-145)
Best NR hit (blastp)  PREDICTED: similar to arylhydrocarbon receptor nuclear translocator homolog b [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to arylhydrocarbon receptor nuclear translocator homolog b [Tribolium castaneum] (0.0)
GeneOntology terms




















  
GO:0045449 regulation of transcription
GO:0005634 nucleus
GO:0003702 RNA polymerase II transcription factor activity
GO:0007424 open tracheal system development
GO:0007425 epithelial cell fate determination, open tracheal system
GO:0006357 regulation of transcription from RNA polymerase II promoter
GO:0003677 DNA binding
GO:0006800 oxygen and reactive oxygen species metabolic process
GO:0046982 protein heterodimerization activity
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0004871 signal transducer activity
GO:0007165 signal transduction
GO:0007420 brain development
GO:0048813 dendrite morphogenesis
GO:0007517 muscle organ development
GO:0048666 neuron development
GO:0043565 sequence-specific DNA binding
GO:0005515 protein binding
GO:0010552 positive regulation of gene-specific transcription from RNA polymerase II promoter
GO:0043234 protein complex
GO:0010551 regulation of gene-specific transcription from RNA polymerase II promoter
GO:0016563 transcription activator activity
InterPro families





  
IPR000014 PAS
IPR001092 Helix-loop-helix DNA-binding domain
IPR001067 Nuclear translocator
IPR011598 Helix-loop-helix DNA-binding
IPR013655 PAS fold-3
IPR013767 PAS fold
IPR001610 PAC motif
Orthology groupMCL11664

Nucleotide sequence:

ATGTCTGCCGTGGCTCCCACTATTCCCGGGGCTGACCCAACAAAGGACATACAAAAGCGT
CGAGCTGGTAGTATTGGATCAGATGAAGACGACGCAAGTGGTGGGAAATATACAAGGATG
GAGGAGGACAATATTCAAGACAAGGAGAGGTTTGCCAGTCGTGAAAACCACTGCGAGATC
GAACGTCGTCGCAGGAACAAGATGACGGCGTATATTACGGAACTATCTGACATGGTTCCA
ACATGCTCCGCTCTCGCGAGGAAACCAGACAAACTAACCATACTGCGTATGGCAGTAGCG
CATATGAAAGCTTTAAGAGGTACCGGCAACACGTCTACAGACGGCACATACAAACCATCG
TTTCTAACGGATCAGGAGTTGAAACACCTCATACTGGAAGCGGCCGATGGATTTTTATTC
GTCGTCAGTTGTGACACTGGCCGCATTATATACGTTAGTGATAGTATAGCGCCAGTGCTA
AATTATTCTCAGGGTGAATGGTACTCATCATGTTTCTACGACCAAGTACATCCCGACGAT
TTGGAAAAAGTTCGGGAACAGCTAAGCACACAGGAGCCTCAGAACACGGGACGTATTCTG
GATCTCAAAACGGGAACCGTTAAGAAGGAGGGACACCAATCTTCAATGCGTCAAGTGATG
GGTTCTCGTCGCGGGTTCATATGCCGCATGCGTGTTGGCGGGACGGCGGAGAGCGCTCAC
CTGGGCAGGCTGCGCGCGCGCAACTCGCTCGGCCCCTCACACGACGGACACAACTACGCG
GTCGTACACTGCACCGGTTACATCAAGAACTGGCCGCCGACAGATCTGTTTCCAGGCATG
CAGATGGACCGACCGGTCGAGGACGAGTTGCACGCCTCTCATTGTTGCCTCGTCGCTATT
GGTAGATTACAGGTGACGTCAACTCCGAGTAGCGCTGAGGGCAGCGCGTGCGGTGGCGTG
GAGTTCGTGTCGCGTCACTCTGTGGAGGGCCGCTTCACGTTCGCTGACCAGCGTGCGGCT
CAAGTACTAGGGTATGCGCCCGCGGACCTCCTCGGGAAACTCTGCTACGACTTTTACCAT
CCGGAAGACCAGCAGCACATGAGAGATAACTTTGATCAAGTTCTTAAGTTGAAAGGGCAG
ATAATTTCGCTCATGTATCGGTTTCGCACAAAGAACAGAGAGTGGATTTGGTTGAGGACA
TCCGCCTTTGCATTCCTCAATCCTTACAACGATGACGTGGAATATATTGTGTGCACAAAT
ACATTGGCCAACCGTTCATTGGGCAGCACGGGAGGCGAGCCGGTTGCAGATGAAAACTAC
GATTACCACCTCCGACAGCGTGATGTGTACCAAGCACCTCCTCCGCCTATACATCAGCAA
CACCATGCTCCACCAGGTGGTGGAGTGGGTGCTCGTTCGCCGGGTGAGGCGGGTGGTGCG
GGCGCTGCGGCTGCCTACGCTCCACATGCGCCTCACTATGCGCCTGACTACTCACCTCAC
CGACCCGCCAACACACCGCCACATACTACTTGGACAACGCTGCGACCGAGCGGGGCCAGT
GGAGCCGGGAGTGGTGGTGAAAGTTACGCGTACAGCGGCGACGCGGCTGGAGCGGGTGCG
GGTAACGGCAGCCCGGCCCGATCACCGCCCGCGCCCGCCTACCTACCACCAGCACACTAC
CACCACAACCACCACCCCCCACACCCTACACATCCAACGCACCCTACACATCCCACGCAT
CCACCGCATGCGGGTATATGGGCATGGCAGGGCGGTGCTGGGGGTGCAGGGGGCCCAGCT
GCTGGCGCTCCTGAGGGAGGACACGCGCCCCACGAGCTCTCCGAAATGCTGCAAATACTG
GACCAAGGCGGTGCCGCTACCTTCGAAGACCTCAACATAAACATGTTTAATTCCAACTTT
GAATAG

Protein sequence:

MSAVAPTIPGADPTKDIQKRRAGSIGSDEDDASGGKYTRMEEDNIQDKERFASRENHCEI
ERRRRNKMTAYITELSDMVPTCSALARKPDKLTILRMAVAHMKALRGTGNTSTDGTYKPS
FLTDQELKHLILEAADGFLFVVSCDTGRIIYVSDSIAPVLNYSQGEWYSSCFYDQVHPDD
LEKVREQLSTQEPQNTGRILDLKTGTVKKEGHQSSMRQVMGSRRGFICRMRVGGTAESAH
LGRLRARNSLGPSHDGHNYAVVHCTGYIKNWPPTDLFPGMQMDRPVEDELHASHCCLVAI
GRLQVTSTPSSAEGSACGGVEFVSRHSVEGRFTFADQRAAQVLGYAPADLLGKLCYDFYH
PEDQQHMRDNFDQVLKLKGQIISLMYRFRTKNREWIWLRTSAFAFLNPYNDDVEYIVCTN
TLANRSLGSTGGEPVADENYDYHLRQRDVYQAPPPPIHQQHHAPPGGGVGARSPGEAGGA
GAAAAYAPHAPHYAPDYSPHRPANTPPHTTWTTLRPSGASGAGSGGESYAYSGDAAGAGA
GNGSPARSPPAPAYLPPAHYHHNHHPPHPTHPTHPTHPTHPPHAGIWAWQGGAGGAGGPA
AGAPEGGHAPHELSEMLQILDQGGAATFEDLNINMFNSNFE