Monarch geneset OGS2.0

DPOGS213590
TranscriptDPOGS213590-TA765 bp
ProteinDPOGS213590-PA254 aa
Genomic positionDPSCF300033 + 486543-487729
RNAseq coverage1808x (Rank: top 7%)
Annotation
HeliconiusHMEL0054746e-11675.40% 
BombyxBGIBMGA011819-TA8e-12379.13% 
DrosophilaCG6776-PA1e-4940.33% 
EBI UniRef50UniRef50_Q2F6897e-12078.35%Glutathione S-transferase omega 1 n=3 Tax=Obtectomera RepID=Q2F689_BOMMO
NCBI RefSeqNP_001040131.11e-12078.35%glutathione S-transferase omega 1 [Bombyx mori]
NCBI nr blastpgi|1140522422e-11978.35%glutathione S-transferase omega 1 [Bombyx mori]
NCBI nr blastxgi|1140522422e-11678.35%glutathione S-transferase omega 1 [Bombyx mori]
Group
Gene OntologyGO:00043642.7e-15glutathione transferase activity
GO:00081522.7e-15metabolic process
GO:00057372.7e-15cytoplasm
GO:00055154e-08protein binding
KEGG pathwaydme:Dmel_CG67819e-47 
 K00310 (E1.5.4.1)maps-> Glutathione metabolism
InterPro domain[4-108] IPR0123364.4e-30Thioredoxin-like fold
[97-231] IPR0109872e-22Glutathione S-transferase, C-terminal-like
[17-32] IPR0054422.7e-15Glutathione S-transferase, omega-class
[18-90] IPR0040454e-08Glutathione S-transferase, N-terminal
Orthology groupMCL10627 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213590-TA
ATGTCGGAAAAACATTTACAAAGCGGTGATTCACTGCCACCATTTACCGGAAAATTGAGGTTATTCGCAATGAGATTTTGCCCCTACGCCGAAAGAAGCATTCTTTGTTTAAATGCAAAAAAACTTCAATATGATTTAGTTTTTATAAATTTAGATCACAAGCCGGAATGGATTTTCCAATTCAACCCAAAAGGAGCAGTACCAGCCTTGGAGTATGAGGAAGGTAAAGCCATTTTTGACAGTAATGTTATCAATGTCTATCTTGACGAGAAGTATCCTGAAATACCACTCCAAAATTCAGACCCATTAAGAAGAGCTCAAGATAAATTGCTTGTTGAAATGTTTGCTGGGGCACAATCCGCATACTACACTGCCGCATTCAATCCTCAAGCTGTTGAACCAAGCATGGTTGAAAACTTTCACAAAGGACTAGATCTCCTGCAAAAGGAGATTGAGTCTCGGGGAACTAAATTCTTAAATGGAGATGAACCTGGGCTCGTTGATTATACCATTTGGCCATTCTTGGAGAGGTTTGAAGCTCTTCCAATTCTAGGACAACAGGAATTTGCCATTGATAAATCAAAATATGAGATTCTTATAACATACATGGCAGCCATGAGAGATTCACCTGCTGTTAAAGCATATGCCTTAGCCCCTGACACCCATGCCAAGTTCACAGAGTCTCGTATTAAAGGAGACGCCAATTACAATATGTTGGACACAAGCGCTGTATGTTGCATGAGACCAAGAAAGAAGAAGGAATAA

Protein sequence:

>DPOGS213590-PA
MSEKHLQSGDSLPPFTGKLRLFAMRFCPYAERSILCLNAKKLQYDLVFINLDHKPEWIFQFNPKGAVPALEYEEGKAIFDSNVINVYLDEKYPEIPLQNSDPLRRAQDKLLVEMFAGAQSAYYTAAFNPQAVEPSMVENFHKGLDLLQKEIESRGTKFLNGDEPGLVDYTIWPFLERFEALPILGQQEFAIDKSKYEILITYMAAMRDSPAVKAYALAPDTHAKFTESRIKGDANYNMLDTSAVCCMRPRKKKE-