New model in OGS2.0 | DPOGS216059  |
---|---|
Genomic Position | scaffold6830:1131-2790 (- strand) |
CDS Length | 966 |
Paired RNAseq reads   | 20 |
Single RNAseq reads   | 75 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA009025 (3e-29) |
Best Drosophila hit   | gooseberry-neuro (5e-32) |
Best Human hit | paired box protein Pax-7 isoform 1 (4e-28) |
Best NR hit (blastp)   | conserved hypothetical protein [Culex quinquefasciatus] (2e-37) |
Best NR hit (blastx)   | PREDICTED: similar to gooseberry-neuro CG2692-PA [Tribolium castaneum] (2e-36) |
GeneOntology terms    | GO:0003677 DNA binding; GO:0007367 segment polarity determination; GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0005667 transcription factor complex; GO:0005634 nucleus; GO:0043565 sequence-specific DNA binding; GO:0006355 regulation of transcription, DNA-dependent |
InterPro families    | IPR017970 Homeobox, conserved site; IPR001356 Homeobox; IPR009057 Homeodomain-like; IPR000047 Helix-turn-helix motif, lambda-like repressor; IPR012287 Homeodomain-related |
Orthology group | ND |
Nucleotide sequence:
ATGTCACCCGAACCCAAAGATGCCGACAAAAGGACCGCCACGTTGGGCCGTGGATCAGAC
TCTTCAGATATAGAATCGGAGCCAGGCCTGACCCTTAAAAGAAAACAGCGTAGATCTCGC
ACCACGTTTACAGGAGAACAGCTTGACGCGCTTGAGAGAGCTTTTCATAGAACACAATAT
CCTGATGTATACACTAGAGAAGAACTTGCTTTGCAGACTGGCCTTACCGAGGCCAGAATA
CAAGTTTGGTTCTCCAATAGAAGAGCTCGACTACGAAAGCATACTGGTTCAAATCCGACT
CCTTCACTCGCTAGTTATTCGACGATACCAATGCCGCAGATACCGTGCCCGTATCCTGCC
GGAGAAATACCTTCACTATCTCAACATCACCCGCAACATCCGGATGCCTGGCATCATCAA
AAGTATGCCAATTATAACCAGCTAATGGCTCAGTCTCAACATCTTAACCAAGCTTTTCAA
ACTGCAGCCTTCCCCAGCACCTCTGGGACTACTTTCAGCCATTTAGTGACCGGTGCTAGC
GCACCAACTCACAGTCAGCTTCTTGATAGCACTCCAAGAACTGATTATCCTCGATATCCC
ACTGATGTCTACAACAAACCCATCAGTTATATGCCTAAAGATACGGAAGCGGAAGATAAG
GGAGTGGGAGAAGAAATTATAGAGCAACGTGAAGAAGCTTACATAAAAACAGGTGGAAAT
GAATACAAAGAATTAGCGACCAGTGATTATCCTAAAGTTCCTACTGATTATTCTAAGCTT
TCTGTTGATCCTTCCTCCACCAACTGGACTGCATCTAATAACTCCTTGAATATGAGTCTA
TCTGGATTATCTAGTGACTATAAATATATGAGTGACCCTTATGCTTTTCCTGCTATCGCG
TCGGATACCCTAAATCAACATACCTACACCAATCCAGGAAATGCAGCCAATAAATACTGG
ATTTGA
Protein sequence:
MSPEPKDADKRTATLGRGSDSSDIESEPGLTLKRKQRRSRTTFTGEQLDALERAFHRTQY
PDVYTREELALQTGLTEARIQVWFSNRRARLRKHTGSNPTPSLASYSTIPMPQIPCPYPA
GEIPSLSQHHPQHPDAWHHQKYANYNQLMAQSQHLNQAFQTAAFPSTSGTTFSHLVTGAS
APTHSQLLDSTPRTDYPRYPTDVYNKPISYMPKDTEAEDKGVGEEIIEQREEAYIKTGGN
EYKELATSDYPKVPTDYSKLSVDPSSTNWTASNNSLNMSLSGLSSDYKYMSDPYAFPAIA
SDTLNQHTYTNPGNAANKYWI