Monarch geneset OGS2.0

DPOGS212419
TranscriptDPOGS212419-TA1560 bp
ProteinDPOGS212419-PA519 aa
Genomic positionDPSCF300258 - 35000-42182
RNAseq coverage3225x (Rank: top 4%)
Annotation
HeliconiusHMEL0096180.075.34% 
BombyxBGIBMGA002818-TA0.079.75% 
DrosophilaTrxr-1-PB0.063.34% 
EBI UniRef50UniRef50_P919380.063.34%Thioredoxin reductase 1, mitochondrial n=32 Tax=Endopterygota RepID=TRXR1_DROME
NCBI RefSeqXP_975772.10.067.44%PREDICTED: similar to thioredoxin reductase isoform 2 [Tribolium castaneum]
NCBI nr blastpgi|18482940.066.02%glutathione reductase family member [Musca domestica]
NCBI nr blastxgi|910794220.067.44%PREDICTED: similar to thioredoxin reductase isoform 2 [Tribolium castaneum]
Group
Gene OntologyGO:00506601.3e-236flavin adenine dinucleotide binding
GO:00551141.3e-236oxidation-reduction process
GO:00454541.3e-236cell redox homeostasis
GO:00047911.3e-236thioredoxin-disulfide reductase activity
GO:00506611.3e-236NADP binding
GO:00166681.3e-236oxidoreductase activity, acting on a sulfur group of donors, NAD or NADP as acceptor
GO:00164913.5e-36oxidoreductase activity
GO:00057375.7e-34cytoplasm
KEGG pathwaytca:6560620.0 
 K00384 (E1.8.1.9, trxB)maps-> Pyrimidine metabolism
InterPro domain[5-518] IPR0063381.3e-236Thioredoxin/glutathione reductase selenoprotein
[11-357] IPR0237533.5e-36Pyridine nucleotide-disulphide oxidoreductase, FAD/NAD(P)-binding domain
[378-503] IPR0040995.7e-34Pyridine nucleotide-disulphide oxidoreductase, dimerisation
[387-519] IPR0161568.5e-33FAD/NAD-linked reductase, dimerisation
[12-31] IPR0130271.8e-24FAD-dependent pyridine nucleotide-disulphide oxidoreductase
Orthology groupMCL10877 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212419-TA
ATGGCACCTATAGATGGAACATATGACTATGATCTAGCGGTAATCGGTGGTGGTTCAGGGGGTCTTGCTTGTGCAAAGGAAGCTGTGAACCTGGGTGCCAAAGTGGCAGTATTAGATTATGTGACCCCATCACCTCAAGGAACAAAGTGGGGCCTCGGTGGGACATGTGTCAATGTTGGTTGCATACCAAAGAAGTTGATGCACCAGGCTGCGATACTCGGTGAAAGCATACATGGAATGGTAAGTGAATGCTTGAATCCTTGGACCCTTCCATTGTTCCCGATCCCCGAAATCCCTTGGAATCCTAAAGAAGCCGTCGCCTACGGATGGGAAGTGCCTTCTATAAACCAGGTCAAAATAAACTGGTCGGCGTTGACCGAAGCCGTTCAGAATCATATCAAATCGGTCAATTGGGTCACCAGAGTGGACTTAAGGGAGAAGAAGATCGAATATATCAATGGTTTGGGAGAGTTCAAGGATCCGCACACGCTGGTCGCCACGCTGAAGAACGGGAACAAGAAGGAACTCACGGCTAAGAACATTCTGATAGCCGTGGGCGGACGACCACACTACCCTGATATCCCTGGGGCCAAGGAGTACTGCATCACTAGCGATGACATCTTCAGCTTGAGCCATCCCCCCGGCAAGACGCTAGTCGTGGGCGCTGGGTATATCGGTCTAGAATGCGCGGGTTTCCTCAACTCGCTGGGCTTCCCGGCGACGGTGCTGGTGAGGTCGGTGCCGCTGCGAGGCTTCGACCAGCAGATGGCGGGTCTGGTCACCAGCGAGATGCAGGAGAAGGGCGTGGTGTTCCAGCACAAGTGCGTGCCGCTCTCCGTGGAGCGGCTGGAGTCCGGCCAGCTCAAGGCTCGCTGGATGAACACGGACACGCAGCAACAGAGCGAAGACGTGTTCGACACAGTGTTACTGGCCACAGGAAGATATGCGCTCACCGAACAGCTCAACCTGAAGGCCGCCGGGGTCACGACGTTGGACGATCACGGCAAGGTGGTCTCCAGCGACGAGTCCACCAACGTCCCTCACATCTTCGCGGTCGGCGACGTGTTGTCGTCGCGGCCGGAGCTGACCCCGGTGGCGATCCACGCGGGCCGGCTCCTCGCGCGCCGCATGCTCGGCGGCGGGAAGCAACACATGGACTACGACAACGTCGCCACCACCGTCTTCACTCCGCTGGAGTACGGCTGCGTGGGCCTCAGCGAGGAGACGGCGCTCGAGAGATACGGCGCTGACAACGTGGAGGTGTACCACGCCTACTACAAGCCGACCGAGTTCTTCATACCTCAGAAGAACATTCGGAACTGTTACTTGAAGGCGGTGGTCCGGCGCGAGGCTCCCTACCAGGTGCTGGGGCTTCACTTCGTGGGCCCCGCGGCCGGGGAGGTCATCCAAGGCTTCGCCGCCGCTATCAAGTGCGGTTTGACCATGGAACAGCTGATGAACACGGTGGGCATCCACCCCACGGTGGCCGAGGAGTTCACGAGACTGAACATCACCAAGAGGTCCGGCAAGGATCCCAACCCCGCCTCCTGCTGCAGCTAG

Protein sequence:

>DPOGS212419-PA
MAPIDGTYDYDLAVIGGGSGGLACAKEAVNLGAKVAVLDYVTPSPQGTKWGLGGTCVNVGCIPKKLMHQAAILGESIHGMVSECLNPWTLPLFPIPEIPWNPKEAVAYGWEVPSINQVKINWSALTEAVQNHIKSVNWVTRVDLREKKIEYINGLGEFKDPHTLVATLKNGNKKELTAKNILIAVGGRPHYPDIPGAKEYCITSDDIFSLSHPPGKTLVVGAGYIGLECAGFLNSLGFPATVLVRSVPLRGFDQQMAGLVTSEMQEKGVVFQHKCVPLSVERLESGQLKARWMNTDTQQQSEDVFDTVLLATGRYALTEQLNLKAAGVTTLDDHGKVVSSDESTNVPHIFAVGDVLSSRPELTPVAIHAGRLLARRMLGGGKQHMDYDNVATTVFTPLEYGCVGLSEETALERYGADNVEVYHAYYKPTEFFIPQKNIRNCYLKAVVRREAPYQVLGLHFVGPAAGEVIQGFAAAIKCGLTMEQLMNTVGIHPTVAEEFTRLNITKRSGKDPNPASCCS-