DPGLEAN22355 in OGS1.0

New model in OGS2.0  DPOGS200963
Genomic Position  scaffold1929:- 20369-24430
CDS Length  1527
Paired RNAseq reads  26
Single RNAseq reads  84
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010167 (0.0)
Best Drosophila hit  dysfusion, isoform B (2e-124)
Best Human hit  neuronal PAS domain-containing protein 4 (4e-39)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC013566 [Tribolium castaneum] (1e-137)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC013566 [Tribolium castaneum] (2e-129)
GeneOntology terms
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005634 nucleus
GO:0045449 regulation of transcription
GO:0007424 open tracheal system development
GO:0035147 branch fusion, open tracheal system
GO:0046982 protein heterodimerization activity
GO:0007165 signal transduction
GO:0004871 signal transducer activity
GO:0007427 epithelial cell migration, open tracheal system
GO:0043565 sequence-specific DNA binding
GO:0016563 transcription activator activity
GO:0043234 protein complex
GO:0010552 positive regulation of gene-specific transcription from RNA polymerase II promoter
GO:0010551 regulation of gene-specific transcription from RNA polymerase II promoter
InterPro families  IPR000014 PAS
Orthology group  MCL14486
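
The genomic position and CDS length listed in this record can be cross-checked with a few lines of Python. This is only a minimal sketch: the helper `parse_position` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_position(text):
    """Split a 'scaffold:strand start-end' string into its parts (hypothetical helper)."""
    match = re.fullmatch(r"\s*(\S+?):([+-])\s*(\d+)-(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return {"scaffold": scaffold, "strand": strand,
            "start": int(start), "end": int(end)}

pos = parse_position("scaffold1929:- 20369-24430")

# Assumption: coordinates are 1-based and inclusive.
genomic_span = pos["end"] - pos["start"] + 1
cds_length = 1527                      # value reported in the record
non_coding = genomic_span - cds_length

print(pos["scaffold"], pos["strand"], genomic_span, non_coding)
```

Under that assumption the model spans 4062 bp on the minus strand, of which 1527 bp are coding; the remaining 2535 bp would presumably be non-coding sequence such as introns.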

Nucleotide sequence:

ATGCGTCGCGATCTCATCAACGCTGAGATATCCAACCTCCGCGATCTGCTACCCCTACCA
CCGTCCACAAGACAAAGACTGTCGCAACTGCAACTAATGGCGCTGGTCTGTGTGTACGTC
AGGAAAATGAATTACTTCCAACAAGTGTTCAAGAGTCACGACTTTAGTTATCAGTACCAA
GAGCAACCTACTCCTACACCAAATATCGGATTTTCAAAGGCAATGAATGGTTTTATGATG
ATGATGACACAGAACGGAAAACTGTTGTATATATCAGAGAACGCTGCGGAATATTTAGGA
CATTCTATGGAGGATCTTCTGATTCATGGCGATAGCGTTTACGATATCATTGACAGACAA
GATCACCAGATGGTTCAACTTGAATTGAACAGAACTAGCAACAGTGAGACTGAAAATGTA
CCAAAGAATAGACATTTCTTCTGCAGAATGAACGTCTCAAGAAATGCGAGACGCCAAATG
AGATTTGGTGACCAAAAAGTAGTTTTAGTTCAAGGGCATTATGTGTCATATTTGCCGCTT
TGTAGCCGCAATGAGCCGGTGTTCCTAGCATCATGTACGCCTCTGGCTATGCCTGAAACA
AGGGAATGTATAGTTCATGGAGCCACGAATGTGTTCACTACCATACATTCGATGGACATG
AAGATATTACACATAGATACCAACGGTGAATGGTACTTAGGTTGGAAGAAAACTGATCTG
ATAGACGTGTCGTGGTATCAAATACTACATTGGGATATTACCCAATCGGAGCAAGATAAA
TGTTGCATCCTCTTAGTAAAGCTACAGCAAAGAAGCGGTTCGTTCCTATGGATACATATG
GTGCTGCAAGTCAAAGAGGCTGCGGATGCTCCCAGGCAGTTCATTGTAGCTACTAACCAA
GTCTTAAGTGAAGAAGAAGCATCTATAATGGTGTCCAATTCATGGTTATATCAATATTAT
GGATACCAGAACCAGAATTGTGGGATGCTCGACCCCAGATGTCAAAAGTTCTTTAGAAGA
GAACCATATTACCCTGATCCCTATCCAGAATACCAATATGTAGAAAACGAGATCGATTAC
ACTGGTTACCACATAACGCCGTATGTTTGTCAAAAAGCTGACGATTATGGCTGTGAAAGA
ATATACACTGATAGAGGTCCAGTAGATTATTCCACACATTCTCCACAGTCTACAATAAGT
GAAGAAAGATCTCCTTTACATTACGAAACGGGTGATGTCGTTGTGAACAGTAACATGTAT
ATGTGCAGTAAGAGAGAATATTATGACCAATACGCGCAAGTTCATTACACACCAGAAGCT
TGTGGCAGTGGAAACATTGAAGGAATCGAATACCCGGCCGCTAAAAGAATGAGATTAACG
ACGCCACTGAGCATAGAGGGCACTGATGGCATGGAGAGATGGAACCCCAGCCCGCCCTGG
TCTGATACACTCAAACTGACGGATTACACACAGAGATTTACTTATAATATGCTACCGGCG
CCTACGGAAAGAACTATTGTCACTTAA
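
A few basic properties of a coding sequence can be checked mechanically: the length should be a multiple of three, the first codon should be ATG, and the only in-frame stop codon should be the last one. The sketch below uses a hypothetical helper, `check_cds`, and assumes the standard genetic code. For brevity it is run on just the first and last lines of the sequence above joined together, not on the full CDS.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of the standard genetic code

def check_cds(seq):
    """Report simple sanity checks for a coding sequence (hypothetical helper)."""
    seq = "".join(seq.split()).upper()          # drop whitespace and line breaks
    usable = len(seq) - len(seq) % 3            # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    in_frame = len(seq) % 3 == 0
    return {
        "length": len(seq),
        "in_frame": in_frame,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": bool(codons) and in_frame and codons[-1] in STOP_CODONS,
        "internal_stops": sum(c in STOP_CODONS for c in codons[:-1]),
    }

# Excerpt only: first and last lines of the nucleotide sequence in this record.
first_line = "ATGCGTCGCGATCTCATCAACGCTGAGATATCCAACCTCCGCGATCTGCTACCCCTACCA"
last_line = "CCTACGGAAAGAACTATTGTCACTTAA"
report = check_cds(first_line + last_line)
print(report)

# Arithmetic on the reported CDS length: 1527 nt = 509 codons,
# i.e. 508 residues if exactly one codon is the terminal stop.
cds_length = 1527
expected_residues = cds_length // 3 - 1
print(expected_residues)
```

Passing the full 1527-nt sequence to the same function would extend the check to every codon of the model.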

Protein sequence:

MRRDLINAEISNLRDLLPLPPSTRQRLSQLQLMALVCVYVRKMNYFQQVFKSHDFSYQYQ
EQPTPTPNIGFSKAMNGFMMMMTQNGKLLYISENAAEYLGHSMEDLLIHGDSVYDIIDRQ
DHQMVQLELNRTSNSETENVPKNRHFFCRMNVSRNARRQMRFGDQKVVLVQGHYVSYLPL
CSRNEPVFLASCTPLAMPETRECIVHGATNVFTTIHSMDMKILHIDTNGEWYLGWKKTDL
IDVSWYQILHWDITQSEQDKCCILLVKLQQRSGSFLWIHMVLQVKEAADAPRQFIVATNQ
VLSEEEASIMVSNSWLYQYYGYQNQNCGMLDPRCQKFFRREPYYPDPYPEYQYVENEIDY
TGYHITPYVCQKADDYGCERIYTDRGPVDYSTHSPQSTISEERSPLHYETGDVVVNSNMY
MCSKREYYDQYAQVHYTPEACGSGNIEGIEYPAAKRMRLTTPLSIEGTDGMERWNPSPPW
SDTLKLTDYTQRFTYNMLPAPTERTIVT
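
The protein sequence above should be the conceptual translation of the CDS. As a small illustration, the sketch below translates the first line of the nucleotide sequence and compares it with the first 20 residues of the protein. It assumes the standard genetic code, since the record does not name a translation table, and `translate` is an illustrative helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(seq):
    """Translate a nucleotide string codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the nucleotide sequence and first 20 residues of the protein,
# both copied from this record.
cds_start = "ATGCGTCGCGATCTCATCAACGCTGAGATATCCAACCTCCGCGATCTGCTACCCCTACCA"
protein_start = "MRRDLINAEISNLRDLLPLP"

translated = translate(cds_start)
print(translated)
print(translated == protein_start)
```

The same function can be applied to the full CDS to confirm that it reproduces the whole protein sequence.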