DPGLEAN15945 in OGS1.0

New model in OGS2.0  DPOGS201897
Genomic Position  scaffold22:+ 128013-132249
CDS Length  1671
Paired RNAseq reads  15
Single RNAseq reads  43
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006070 (2e-77)
Best Drosophila hit  CG18130, isoform A (2e-33)
Best Human hit  thioredoxin domain-containing protein 3 (7e-19)
Best NR hit (blastp)  hypothetical protein AaeL_AAEL007253 [Aedes aegypti] (4e-62)
Best NR hit (blastx)  hypothetical protein AaeL_AAEL007253 [Aedes aegypti] (1e-46)
GeneOntology terms  GO:0045454 cell redox homeostasis
InterPro families
IPR012335 Thioredoxin fold
IPR012336 Thioredoxin-like fold
IPR013766 Thioredoxin domain
IPR005746 Thioredoxin
Orthology group  MCL12852

Nucleotide sequence:

ATGTCCGTAACGAGCGCGGTCGCGAACGCCGCCGCCGCCGCCGCCGCGGCCGGCACAGGC
AAGAAGGCCGCTCAGGTACAGTTACAGGCGGAGCTGAACAACGACGATGAATGGAACAAG
TTTCTTCTACGAGACGGACTCCTCGTGATCGACGTCTACACGGAGTGGTGCGGCCCGTGC
ATAGGAATGGTGGGGAATCTGAAGAAAATCAAAGTTGAGATCGGAGGAGATAATTTACAT
CTGGCGGTGGCGAAGGCGGACACCATTGGATGTCTGTCTAGATTCAGGAACCGCAGCGAA
CCAACTTGGATGTTTATTTCTGGTGGTCAATTAATTAATGTGGTGTTCGGCGCGGACGCT
CCTCGCCTCGCTCGCACGATCGTGGAAGAGCTGAAGAATGAAGAGCTGGTGAAGAAAGGG
GAGAGAGAGAGACCGACACGAGCTCCACACGAACTCACTCCACCGGAACAGGAGGTCGCC
TTGGCCCAAGCAAAGCTTCTCCAGCTACGCAAAGAAAAAGAGGCGGCGGCTGCGGCAGCG
GAACGACTTGAAAGAAGAGAAGCGCGAGCAGTCGCCCTAGAGGTACACTTCAATGACGTG
TGTCCCGCGCTTATGATGCCCCACTCACAGAAAAATATACGAAAAGTCTCGGACGCGCTG
GAGCCTTACGGAGTAATTGTCGCTGACAAATGCCCATTAGTGCTGGGAAAAGATGGAGCG
AAAGTTCTTGGCGTGGAAGATCCTGAATTTGCAAAACCAGAAACCGCGATGGCTTTACTC
GAAAGACCAGCACTTGTACTGCTGTTGAAGAAACTACCTGACAAGGAAGGTAGTGTCATC
GAACTGGTCCGTCGCGCGATTTATAACGAAGGTATAGAATCAGATGAGGACGACACGAAG
AAATCTCTGGCAGAGGAACTGAGGGCTAACGGTATTCCCGGCGTGTTCGTACCGACCGAC
CGTCATCAGAGAGCTTCCGCACTGGACCTATTCTTCCCGAAGATGGTGTCGGCGGTGGCG
GCTCCGCTGACGGCCCCGGAGCCTCCGCACGTGGCGATGATACTGGGAGCGTGGCAGAGA
CGAGCCGTGCTCAATATCATCGCCAGCAAGCTGCCCTCCAGGCTCCTGAGATATGGCTTC
TTTAAAGACGCCGACGTCGAGCAACCGACACTACTGTGCAAGACCATCGACCAGTATGAG
GAACGACCGGAGAAAGACTTTTCGGAGACTATCGTGCTGATGATATCGGTGGGTGTGACG
GATCCTGGCGCTGAGGGGGCGCCGGTGACGGAGGGAGTTCCACACGAGCTCCTCTCACTA
GGACCTCTGTGGGTCAGCGAGGATGCGGTGCTGGGGAAAGAGGAATGCGCGAGGTTCTTC
CCCCCGGGGTACAGCGAGCCGGAGAAGAAACCCGGCCCCAAACCCAAGAAGAAGAAGAAG
AAGCGTCACGACACTAGAGAAGAAACCGCGGACAACGTGGATGCGCCCCCCGGGACTGCG
CCGGACACGGAGGCGGGAGATGGATCCGTGGAGGGAGACCCTGACCCGGAGGGGGAGGAG
GAGGGGGAAGAGAGGGAGGAGGAGGGGCAGGGAGACGGAGACGGGGAAGCGGAGGAGGGG
GAGGAGTTACTCCTGGACAAGGGAACCTCGCCGCCACCAGTCAACGATTAG

Protein sequence:

MSVTSAVANAAAAAAAAGTGKKAAQVQLQAELNNDDEWNKFLLRDGLLVIDVYTEWCGPC
IGMVGNLKKIKVEIGGDNLHLAVAKADTIGCLSRFRNRSEPTWMFISGGQLINVVFGADA
PRLARTIVEELKNEELVKKGERERPTRAPHELTPPEQEVALAQAKLLQLRKEKEAAAAAA
ERLERREARAVALEVHFNDVCPALMMPHSQKNIRKVSDALEPYGVIVADKCPLVLGKDGA
KVLGVEDPEFAKPETAMALLERPALVLLLKKLPDKEGSVIELVRRAIYNEGIESDEDDTK
KSLAEELRANGIPGVFVPTDRHQRASALDLFFPKMVSAVAAPLTAPEPPHVAMILGAWQR
RAVLNIIASKLPSRLLRYGFFKDADVEQPTLLCKTIDQYEERPEKDFSETIVLMISVGVT
DPGAEGAPVTEGVPHELLSLGPLWVSEDAVLGKEECARFFPPGYSEPEKKPGPKPKKKKK
KRHDTREETADNVDAPPGTAPDTEAGDGSVEGDPDPEGEEEGEEREEEGQGDGDGEAEEG
EELLLDKGTSPPPVND
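
Consistency check (sketch):

The CDS length of 1671 nt corresponds to 557 codons: 556 residues plus a terminal stop codon, which agrees with the 556-residue protein sequence above. The snippet below is a minimal sketch of how that relationship could be checked. It assumes the standard genetic code (NCBI translation table 1), and the helper name `translate_cds` is hypothetical, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the first base varying slowest. Assumed here; the
# record itself does not state which translation table was used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence; hypothetical helper for illustration.

    Whitespace and line breaks are ignored, so the sequence can be pasted
    as printed. A single trailing stop codon is dropped from the result.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


# First four codons and last three codons of the CDS above.
print(translate_cds("ATGTCCGTAACG"))
print(translate_cds("AACGATTAG"))
```

Passing the full nucleotide sequence (line breaks included) to `translate_cds` and comparing the result with the protein sequence, after removing its line breaks, would check the whole record.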