Monarch geneset OGS2.0

DPOGS213145
TranscriptDPOGS213145-TA861 bp
ProteinDPOGS213145-PA286 aa
Genomic positionDPSCF300016 + 1059426-1060748
RNAseq coverage895x (Rank: top 14%)
Annotation
HeliconiusHMEL0103378e-15790.21% 
BombyxBGIBMGA007904-TA1e-14683.97% 
DrosophilaTxl-PA7e-10663.60% 
EBI UniRef50UniRef50_Q9VRP31e-10363.60%AT08565p n=50 Tax=Eumetazoa RepID=Q9VRP3_DROME
NCBI RefSeqNP_001040348.13e-14583.97%thioredoxin [Bombyx mori]
NCBI nr blastpgi|1140520586e-14483.97%thioredoxin [Bombyx mori]
NCBI nr blastxgi|1140520581e-13883.97%thioredoxin [Bombyx mori]
Group
Gene OntologyGO:00150355.2e-119protein disulfide oxidoreductase activity
GO:00090555.2e-119electron carrier activity
GO:00066625.2e-119glycerol ether metabolic process
GO:00454545.2e-119cell redox homeostasis
KEGG pathway 
InterPro domain[6-236] IPR0057465.2e-119Thioredoxin
[115-281] IPR0104003.7e-56Proteasome-interacting thioredoxin-like domain, C-terminal
[110-266] IPR0089798.8e-45Galactose-binding domain-like
[1-108] IPR0123361.2e-31Thioredoxin-like fold
[8-104] IPR0137661e-24Thioredoxin domain
Orthology groupMCL13771 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213145-TA
ATGGGTTTAGCAACTGTGATAGAAAATGAAGCTCATTTCCAATCTGAAATGGCTAATGCTGGAACTAAGCTGGTTGTAGTAGATTTTACAGCAACCTGGTGTCCACCGTGCCAGCGTATTGCTCCATTCTTTGAGCAGCTCCCAGCCAAGTTCCCACGTGCTGTATTTCTTAAGGTCGACGTAGATAGATGCGCCGAAACAGCCAGTGCACAGGGTATCAGTGCGATGCCTACATTCATATTTTACAGAAATCGTGCAAGAATTGACCGGCTGCAGGGCGCCGATCCTTCATCCTTAGAAAATAAAGTAAGACAGTACTATGGCACTGAAGACAGCGGTGATGATGACAATTCTGTTGCTGGACATATGGATTTAAATACATTCATTGTAAAGAATGAATGTGAATGTCTGAATGAAGCTGATGATCATCCACTGTCTCATGCGCTCACAAGTGGTGATGGGCACTTGGCTAGTGACTGTGATGAACAACTCATCATAAATATTTCATTCAATCAGCTAGTTAAAATTCATTCAATCAAAATGAAAGCACCAAGTGACAAGGGTCCAAAATCTGTTAAGGTGTTTATTAATCAGCCGAGGACACTTGATTTTGATCAAGCTGCAGGAAATGCATCGATTCAGGATTTGGAGATATCTTCGAGTGATTTAGAAGGTAATCCCGTACCATTGAAGTTTGTCAAGTTCCAGAGCGTTCAGAACATTCAACTATTTATCAAAGACAACCAGTCGGGGGATGAAGTAACACAAATTGACCACTTAGCTTTCTATGGCTCACCAATTTCAACAACCAACATGGGTGAATTCAAGCGAGTTGCCGGTAAAATAGGTGAAAGTCACTAG

Protein sequence:

>DPOGS213145-PA
MGLATVIENEAHFQSEMANAGTKLVVVDFTATWCPPCQRIAPFFEQLPAKFPRAVFLKVDVDRCAETASAQGISAMPTFIFYRNRARIDRLQGADPSSLENKVRQYYGTEDSGDDDNSVAGHMDLNTFIVKNECECLNEADDHPLSHALTSGDGHLASDCDEQLIINISFNQLVKIHSIKMKAPSDKGPKSVKVFINQPRTLDFDQAAGNASIQDLEISSSDLEGNPVPLKFVKFQSVQNIQLFIKDNQSGDEVTQIDHLAFYGSPISTTNMGEFKRVAGKIGESH-