Monarch geneset OGS2.0

DPOGS213591
Transcript: DPOGS213591-TA, 759 bp
Protein: DPOGS213591-PA, 252 aa
Genomic position: DPSCF300033 + 493370-495627
RNAseq coverage: 71x (Rank: top 66%)
Annotation
Heliconius: HMEL005474, 8e-93, 61.90%
Bombyx: BGIBMGA011658-TA, 1e-96, 65.87%
Drosophila: se-PA, 2e-48, 42.28%
EBI UniRef50: UniRef50_Q2F5R1, 3e-94, 65.48%, Glutathione S-transferase omega 2 n=6 Tax=Obtectomera RepID=Q2F5R1_BOMMO
NCBI RefSeq: NP_001037406.1, 2e-94, 65.48%, glutathione S-transferase omega 2 [Bombyx mori]
NCBI nr blastp: gi|115605361, 7e-94, 65.87%, glutathione S-transferase omega 1 [Bombyx mandarina]
NCBI nr blastx: gi|115605361, 5e-92, 65.87%, glutathione S-transferase omega 1 [Bombyx mandarina]
Group
Gene Ontology: GO:0004364, 3.8e-21, glutathione transferase activity
  GO:0008152, 3.8e-21, metabolic process
  GO:0005737, 3.8e-21, cytoplasm
  GO:0005515, 1e-11, protein binding
KEGG pathway: dme:Dmel_CG6781, 1e-46
  K00310 (E1.5.4.1) maps-> Glutathione metabolism
InterPro domain: [15-116] IPR012336, 7.8e-32, Thioredoxin-like fold
  [28-43] IPR005442, 3.8e-21, Glutathione S-transferase, omega-class
  [108-241] IPR010987, 3.8e-21, Glutathione S-transferase, C-terminal-like
  [29-101] IPR004045, 1e-11, Glutathione S-transferase, N-terminal
Orthology group: MCL30864, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213591-TA
ATGTCCTCCAAGGCTATCACTGGTAAAATAAACTTTAATACAAAGCACTTGAAACGAGGAGATCCGTTACCGCCCTATAATGGAAAATTACGAGTGTACAACATGCGTTATTGTCCTTTCGCGCAACGAACCATTTTGGCGCTAAATGCGAAACAAATGGACTATGAAGTTGTAAATATCAACCTTATGGACAAACCCGAATGGCTGACGAGAAAAAGTGCATTTGGCAAAGTGCCAGCAATAGAAATTAATGAAGACGTATGTATATTCGAAAGCTTGGTGACAGTTGAGTATCTTGATGAAGCGTACCCACAAAGACCCCTGTTGCCAAAAGATCCTCTTAGGAAAGCTTTGGATAAAATTTTAATAGAGGCTTCAGGACCTATCCATACAATGATGTTCAAAACAGTCAAAATGCCAGATTCAATAACCGAAGACAACTTAAAGGCGTACGAGAGCTCTTTACAATACATACAGAACGAACTTATAAATCGAAAAACAAAATTCTTAAGTGGCAACGAACCGGGCTACGTGGATTACATGATATGGCCGTGGTTTGAAAGAATTGGGGCTCTCAAGAAATTCGATGAACGTGCCGGGATAGATTCTAGCAAATTTGGTTTACTGTTGGAATACTGCAGTAATATGGCCAAGGACCCAGCAGTTAGTGATTACCTACTGCCAGATGACATCTTGTTTAAATATTTTGAAGGTTACAAGGCAGGAGCACCCAATTATGAGCTTATCACTGAAGAGTGA

Protein sequence:

>DPOGS213591-PA
MSSKAITGKINFNTKHLKRGDPLPPYNGKLRVYNMRYCPFAQRTILALNAKQMDYEVVNINLMDKPEWLTRKSAFGKVPAIEINEDVCIFESLVTVEYLDEAYPQRPLLPKDPLRKALDKILIEASGPIHTMMFKTVKMPDSITEDNLKAYESSLQYIQNELINRKTKFLSGNEPGYVDYMIWPWFERIGALKKFDERAGIDSSKFGLLLEYCSNMAKDPAVSDYLLPDDILFKYFEGYKAGAPNYELITEE-
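
The transcript and protein lengths in this record are consistent with each other: 759 bp is 253 codons, that is, 252 amino acids plus a stop codon. The sketch below shows one way to check that relationship and to translate a coding sequence. It assumes the standard genetic code and an in-frame sequence of A/C/G/T only. The names `translate` and `expected_protein_length` are illustrative and not part of the database. The stop codon is written as `*` here, whereas the record above writes it as `-`.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # A codon containing anything other than A/C/G/T raises KeyError.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(transcript_bp: int) -> int:
    """Amino-acid count for a CDS that ends in a single stop codon."""
    return transcript_bp // 3 - 1


# First nine codons of DPOGS213591-TA
print(translate("ATGTCCTCCAAGGCTATCACTGGTAAA"))  # MSSKAITGK
print(expected_protein_length(759))              # 252
```

Passing the full DPOGS213591-TA sequence to `translate` and comparing the result with DPOGS213591-PA, after replacing the trailing `-` with `*`, is a quick way to confirm that the two FASTA entries match.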