Monarch geneset OGS2.0

DPOGS208645
Transcript: DPOGS208645-TA, 1341 bp
Protein: DPOGS208645-PA, 446 aa
Genomic position: DPSCF300281 - 114180-117495
RNAseq coverage: 639x (Rank: top 20%)
Annotation
Heliconius  HMEL011757  3e-131  56.33%
Bombyx  BGIBMGA007779-TA  3e-95  59.18%
Drosophila  Mgstl-PA  2e-33  48.28%
EBI UniRef50  UniRef50_D7RX05  1e-54  67.33%  Microsomal glutathione transferase n=2 Tax=Endopterygota RepID=D7RX05_HELVI
NCBI RefSeq  XP_002428068.1  6e-39  59.85%  Microsomal glutathione S-transferase, putative [Pediculus humanus corporis]
NCBI nr blastp  gi|296427863  4e-54  67.33%  microsomal glutathione transferase [Heliothis virescens]
NCBI nr blastx  gi|296427863  5e-52  67.33%  microsomal glutathione transferase [Heliothis virescens]
Group
KEGG pathway: dpo:Dpse_GA14506  1e-32
  K00799 (E2.5.1.18, gst) maps to:
    Drug metabolism - cytochrome P450
    Glutathione metabolism
    Metabolism of xenobiotics by cytochrome P450
InterPro domain:
  [304-441]  IPR023352  2.2e-45  Membrane associated eicosanoid/glutathione metabolism-like domain
  [311-439]  IPR001129  4e-23  Membrane-associated, eicosanoid/glutathione metabolism (MAPEG) protein
Orthology group: MCL11613  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS208645-TA
ATGACATCATTTCAGTTCAGTCTGAGGAGTGAAGTGAGCAACACGTATGCTGTTTACAATCAAATATTATTAGACATGGTGCCCGTCACGATCCTGGAACCGGCTGTCAAGTCGTACATAGCATGCTCCGGAATACTAGCACTCAAAGTATTAGGAATGTCATTGTTGACGGGGCGGATGAGATACAAAAAGAAAGTCTTCGCTAACGAGGAAGACACAAAATTAAAGGATTCAGTTGTGAAATACGACGACCCTGACGTGGAGAGAATCCGCCGCGCTCACTTGAACGACCTAGAGAACATTCCAGTGTTCTGGGTTGTAGGTGCGTTGTATCTAACGACCGGACCGTCTGACGAGGTCGCCGTCAATCTGTTTAGAGTTTACACAGCGGGGAGGATCCTTCACACTCTTGTGTACGCCGTGAAGCCTTTACCACAACCCGCCCGAGGCATATGTTTTGCTATACTAGCACTGAAGCTGTTGTCAATGAGCACACTGACGTCTTTAGTGCGTTTATCGAGTGGTATTTTCTCAAACCCAGAGGACGCTAAGGCTTTCAAAGGGAAGGTGAAATACGACGACCCGATTGTTGAGCGAACCCGTCGGGCTCACCTGAATGATTTGGAGAACATTCCAGCGTTCTGGGTGATAGCAGCTCTGTACCTGACGACGGGGCCGGTAGCGGTGGTGGCAACACTGCTCTTCAGAGTTTACACAGCCAGCCGCATCATCCACACACTGGTGTACGCTGTGGTACCATTACCTCAGCCAACCCGCGCCATAGCGTACATGATACCCTACCTCATCAAATGGTACATGGGCTTCCAGTTCAGTCTGCGGAGTGAAGTGTGCAACACGTATGCTGTTTACAATCAAATATTATTAGACATGGTGCCCGTCACGATCCTGGAACCGGCTGTCAAGTCGTACATAGCATGCTCCGGAATACTAGCACTCAAAGTATTAGGAATGTCATTGTTGACGGGGCGGATGAGATACAAAAAGAAAGTCTTCGCTAACGAGGAAGACACAAAATTAAAGGATTCAGTTGTGAAATACGACGACCCTGACGTGGAGAGAATCCGCCGCGCTCACTTGAACGACCTAGAGAACATTCCAGTGTTCTGGGTTGTAGGTGCGTTGTATCTAACGACCGGACCGTCTGACGAGGTCGCCGTCAATCTCTTTAGAGTTTACACAGCGGGAAGGATCCTTCACACTCTTGTGTACGCCGTGAAGCCTTTTCCACAACCCGCCCGAGGCATATGTTTTGTTATTCCGTTTTGGATCACTATTTATATGGGGGTGAAAATTATTTCTCACTATATGATCGCCTTGTAA

Protein sequence:

>DPOGS208645-PA
MTSFQFSLRSEVSNTYAVYNQILLDMVPVTILEPAVKSYIACSGILALKVLGMSLLTGRMRYKKKVFANEEDTKLKDSVVKYDDPDVERIRRAHLNDLENIPVFWVVGALYLTTGPSDEVAVNLFRVYTAGRILHTLVYAVKPLPQPARGICFAILALKLLSMSTLTSLVRLSSGIFSNPEDAKAFKGKVKYDDPIVERTRRAHLNDLENIPAFWVIAALYLTTGPVAVVATLLFRVYTASRIIHTLVYAVVPLPQPTRAIAYMIPYLIKWYMGFQFSLRSEVCNTYAVYNQILLDMVPVTILEPAVKSYIACSGILALKVLGMSLLTGRMRYKKKVFANEEDTKLKDSVVKYDDPDVERIRRAHLNDLENIPVFWVVGALYLTTGPSDEVAVNLFRVYTAGRILHTLVYAVKPFPQPARGICFVIPFWITIYMGVKIISHYMIAL-