New model in OGS2.0 | DPOGS205593  |
---|---|
Genomic Position | scaffold3082:+ 5418-10856 |
See gene structure | |
CDS Length | 1923 |
Paired RNAseq reads   | 53 |
Single RNAseq reads   | 126 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004183 (6e-128) |
Best Drosophila hit   | CG18130, isoform A (3e-18) |
Best Human hit | thioredoxin domain-containing protein 3 (9e-07) |
Best NR hit (blastp)   | GG19570 [Drosophila erecta] (3e-31) |
Best NR hit (blastx)   | PREDICTED: similar to CG14221 CG14221-PA [Tribolium castaneum] (1e-29) |
GeneOntology terms   | GO:0045454 cell redox homeostasis |
InterPro families    | IPR005746 Thioredoxin IPR012335 Thioredoxin fold IPR012336 Thioredoxin-like fold |
Orthology group | MCL39838 |
Nucleotide sequence:
ATGGCTAGAAAGGGACAAGTAGCCATACAAGATAATATTGAAACCAATGAAGAATTTGAG
GAAAATATGGCTAGAAAGGGACAAGTAGCCATACAAGATAATATAGAAACCAATGAAGAA
TTTGAGGAAACATTAATGTCAAATTTTGATCGACTCCTATGTTTGGAGGTGTATTCTGAA
TTCTGTGGTCATTGTTTAGCTACTGGAAATGCCATAAGAAAGGGTAAACTAGAAATTGGT
CAAGATCGTATTGCTATGGTCAGAGCTTTAGCAGATAACATCGACGTTTTATCGAGATTT
AGAAATCGGAGCGAGCCGATTTTTCTTTTCATATCGAAAGGTAAATTAATAAGAGCTATG
TTTGGTGCAAATGGTTTAGAATTATGTCGCATAATGGAGGAAGAATTGGAAAATGTGAAA
ATTGAGGCTGAAACCGGAATTGAAAGACCCAAACAAGAAATTGAAGAGCTTTTACCGGAG
GAAGCTGCAAAGATTGAAGAAGATTTAAAAATGGAAGAAGAAGCTCGAGAAAAAACTGAG
AGACTTCGAGTTTTAACTACTGCTGCTCGAAAAAAAAGAGTTTGCGAGCGTTTGGCACGC
CACGTACGAGGGTTGAATTTTATTTTGTACTGGCCACACTGTCACAAAGCCCATTTAGAC
CTTTATGAAAAATGGGATCTTATAAATGTCCAAGTGGCGGCTAAGGAGACGATCCAAATG
ACTGAGGAATTAGTGAAAGAGGCTTTATATATGAGTGACGTAGACCCTAATGAAGCTTGT
ATCCATGCTCTAATGAACGGAGAAGCATTGGTTGTTCTTTTTAAAATGTTCGACACGGAT
GATAGGGATTTTGTTAAACTTATGCGTCATACTTTATACGAAGAAATACCAGTTCCAAAG
GAGGATTTGCCACCGGAAAAGCAGCTTCCTCCAATACCGGCGTTTGAAAGGTATGCGACC
ATCAGTAAAACGGCTAGAGAGGTTCGGAGGGAGAGGTACGAAGCCCGAATGGAAAAACTA
CGTCAAGAAAAAGAAGATCGGGATAGATTAGCAGCGGAACAAGCGAGACTTGCAAGAGAA
GAAGAGGAAGAAAGACAAAGACTAGAGAAACAAAGACAGGAAGAAGAAAGAATGGCTAGG
ATTCAAGCTGGATTGCCAGCAGATCCCGAACCAGAGCCAGCACAAGAAGCTGGGGAAGAG
GGTGGTGAGGAAGCAGTTGAAGGAGAGGAAACAGAGGTGGCCGAAACTGAAGACATTGAG
GAACCGGAACAAAAAGAAGAAGAACAAGTAGAGGTTGAAGAGGAATTCCACTCAGATGTA
TCTGTTGAGGACGAGGAGTATATTCCTCCTGGTGGTCTATTTGTGCCAGGACTATATACT
CCACCTAACGATTTAGCCAAGGCTAATGCATTGGCCTACTTTTATCCCAAGATCGTGTCT
CAAATTACACCAATTGAGTCGGAGTTTCTCCCTCCGCACGTGTTGGTAATGTTTACTATT
GAAAAGCGACATGATGTTAAAGATATAATGGATCAATTTCCTGATGAAATTCTTAATTAT
GGCATTTTTATCGGAGATGACCCTACCACAGCTCAACACCTCGCTTATACTATAAAGCAG
TATAATCATATGAGCAGAATAAGGAAGCACAACGATAGACTGGCGTTGATGGTTTCTCGG
AAGCGCAGTCTACCAATGTTGCAGTTGGCGGGAGTCAATCCTTGTTACATCAGTCATGAT
GTGGAGAGCGGGGAAAAGGATTGCCTTATTATGTTTCCTGTGGGTTACGGAGATGACTAT
GAGGAAGAAGAAAGTGTCCATGAGGAGGCTGAGGAGGCTGTAGAAGAACAAGCACCTGAA
CAGGAGGTTGTCGAAGTTGTTAACCAAGAAGAACAGGAAGAAGATGAAGAAGAGGACGAC
TAA
Protein sequence:
MARKGQVAIQDNIETNEEFEENMARKGQVAIQDNIETNEEFEETLMSNFDRLLCLEVYSE
FCGHCLATGNAIRKGKLEIGQDRIAMVRALADNIDVLSRFRNRSEPIFLFISKGKLIRAM
FGANGLELCRIMEEELENVKIEAETGIERPKQEIEELLPEEAAKIEEDLKMEEEAREKTE
RLRVLTTAARKKRVCERLARHVRGLNFILYWPHCHKAHLDLYEKWDLINVQVAAKETIQM
TEELVKEALYMSDVDPNEACIHALMNGEALVVLFKMFDTDDRDFVKLMRHTLYEEIPVPK
EDLPPEKQLPPIPAFERYATISKTAREVRRERYEARMEKLRQEKEDRDRLAAEQARLARE
EEEERQRLEKQRQEEERMARIQAGLPADPEPEPAQEAGEEGGEEAVEGEETEVAETEDIE
EPEQKEEEQVEVEEEFHSDVSVEDEEYIPPGGLFVPGLYTPPNDLAKANALAYFYPKIVS
QITPIESEFLPPHVLVMFTIEKRHDVKDIMDQFPDEILNYGIFIGDDPTTAQHLAYTIKQ
YNHMSRIRKHNDRLALMVSRKRSLPMLQLAGVNPCYISHDVESGEKDCLIMFPVGYGDDY
EEEESVHEEAEEAVEEQAPEQEVVEVVNQEEQEEDEEEDD