Monarch geneset OGS2.0

DPOGS205593
TranscriptDPOGS205593-TA1923 bp
ProteinDPOGS205593-PA640 aa
Genomic positionDPSCF300445 - 64074-69512
RNAseq coverage17x (Rank: top 81%)
Annotation
HeliconiusHMEL0156730.077.30% 
BombyxBGIBMGA004183-TA9e-17063.07% 
DrosophilaCG18130-PA8e-3326.03% 
EBI UniRef50UniRef50_Q9VYR51e-3026.03%CG18130, isoform A n=10 Tax=Drosophila RepID=Q9VYR5_DROME
NCBI RefSeqXP_001978404.13e-3226.15%GG19570 [Drosophila erecta]
NCBI nr blastpgi|1948960675e-3126.15%GG19570 [Drosophila erecta]
NCBI nr blastxgi|1948960675e-4626.24%GG19570 [Drosophila erecta]
Group
Gene OntologyGO:00150351.2e-21protein disulfide oxidoreductase activity
GO:00090551.2e-21electron carrier activity
GO:00066621.2e-21glycerol ether metabolic process
GO:00454541.2e-21cell redox homeostasis
KEGG pathway 
InterPro domain[9-183] IPR0057461.2e-21Thioredoxin
[26-136] IPR0123364.5e-08Thioredoxin-like fold
Orthology groupMCL24996 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205593-TA
ATGGCTAGAAAGGGACAAGTAGCCATACAAGATAATATTGAAACCAATGAAGAATTTGAGGAAAATATGGCTAGAAAGGGACAAGTAGCCATACAAGATAATATAGAAACCAATGAAGAATTTGAGGAAACATTAATGTCAAATTTTGATCGACTCCTATGTTTGGAGGTGTATTCTGAATTCTGTGGTCATTGTTTAGCTACTGGAAATGCCATAAGAAAGGGTAAACTAGAAATTGGTCAAGATCGTATTGCTATGGTCAGAGCTTTAGCAGATAACATCGACGTTTTATCGAGATTTAGAAATCGGAGCGAGCCGATTTTTCTTTTCATATCGAAAGGTAAATTAATAAGAGCTATGTTTGGTGCAAATGGTTTAGAATTATGTCGCATAATGGAGGAAGAATTGGAAAATGTGAAAATTGAGGCTGAAACCGGAATTGAAAGACCCAAACAAGAAATTGAAGAGCTTTTACCGGAGGAAGCTGCAAAGATTGAAGAAGATTTAAAAATGGAAGAAGAAGCTCGAGAAAAAACTGAGAGACTTCGAGTTTTAACTACTGCTGCTCGAAAAAAAAGAGTTTGCGAGCGTTTGGCACGCCACGTACGAGGGTTGAATTTTATTTTGTACTGGCCACACTGTCACAAAGCCCATTTAGACCTTTATGAAAAATGGGATCTTATAAATGTCCAAGTGGCGGCTAAGGAGACGATCCAAATGACTGAGGAATTAGTGAAAGAGGCTTTATATATGAGTGACGTAGACCCTAATGAAGCTTGTATCCATGCTCTAATGAACGGAGAAGCATTGGTTGTTCTTTTTAAAATGTTCGACACGGATGATAGGGATTTTGTTAAACTTATGCGTCATACTTTATACGAAGAAATACCAGTTCCAAAGGAGGATTTGCCACCGGAAAAGCAGCTTCCTCCAATACCGGCGTTTGAAAGGTATGCGACCATCAGTAAAACGGCTAGAGAGGTTCGGAGGGAGAGGTACGAAGCCCGAATGGAAAAACTACGTCAAGAAAAAGAAGATCGGGATAGATTAGCAGCGGAACAAGCGAGACTTGCAAGAGAAGAAGAGGAAGAAAGACAAAGACTAGAGAAACAAAGACAGGAAGAAGAAAGAATGGCTAGGATTCAAGCTGGATTGCCAGCAGATCCCGAACCAGAGCCAGCACAAGAAGCTGGGGAAGAGGGTGGTGAGGAAGCAGTTGAAGGAGAGGAAACAGAGGTGGCCGAAACTGAAGACATTGAGGAACCGGAACAAAAAGAAGAAGAACAAGTAGAGGTTGAAGAGGAATTCCACTCAGATGTATCTGTTGAGGACGAGGAGTATATTCCTCCTGGTGGTCTATTTGTGCCAGGACTATATACTCCACCTAACGATTTAGCCAAGGCTAATGCATTGGCCTACTTTTATCCCAAGATCGTGTCTCAAATTACACCAATTGAGTCGGAGTTTCTCCCTCCGCACGTGTTGGTAATGTTTACTATTGAAAAGCGACATGATGTTAAAGATATAATGGATCAATTTCCTGATGAAATTCTTAATTATGGCATTTTTATCGGAGATGACCCTACCACAGCTCAACACCTCGCTTATACTATAAAGCAGTATAATCATATGAGCAGAATAAGGAAGCACAACGATAGACTGGCGTTGATGGTTTCTCGGAAGCGCAGTCTACCAATGTTGCAGTTGGCGGGAGTCAATCCTTGTTACATCAGTCATGATGTGGAGAGCGGGGAAAAGGATTGCCTTATTATGTTTCCTGTGGGTTACGGAGATGACTATGAGGAAGAAGAAAGTGTCCATGAGGAGGCTGAGGAGGCTGTAGAAGAACAAGCACCTGAACAGGAGGTTGTCGAAGTTGTTAACCAAGAAGAACAGGAAGAAGATGAAGAAGAGGACGACTAA

Protein sequence:

>DPOGS205593-PA
MARKGQVAIQDNIETNEEFEENMARKGQVAIQDNIETNEEFEETLMSNFDRLLCLEVYSEFCGHCLATGNAIRKGKLEIGQDRIAMVRALADNIDVLSRFRNRSEPIFLFISKGKLIRAMFGANGLELCRIMEEELENVKIEAETGIERPKQEIEELLPEEAAKIEEDLKMEEEAREKTERLRVLTTAARKKRVCERLARHVRGLNFILYWPHCHKAHLDLYEKWDLINVQVAAKETIQMTEELVKEALYMSDVDPNEACIHALMNGEALVVLFKMFDTDDRDFVKLMRHTLYEEIPVPKEDLPPEKQLPPIPAFERYATISKTAREVRRERYEARMEKLRQEKEDRDRLAAEQARLAREEEEERQRLEKQRQEEERMARIQAGLPADPEPEPAQEAGEEGGEEAVEGEETEVAETEDIEEPEQKEEEQVEVEEEFHSDVSVEDEEYIPPGGLFVPGLYTPPNDLAKANALAYFYPKIVSQITPIESEFLPPHVLVMFTIEKRHDVKDIMDQFPDEILNYGIFIGDDPTTAQHLAYTIKQYNHMSRIRKHNDRLALMVSRKRSLPMLQLAGVNPCYISHDVESGEKDCLIMFPVGYGDDYEEEESVHEEAEEAVEEQAPEQEVVEVVNQEEQEEDEEEDD-