Monarch geneset OGS2.0

DPOGS215524
TranscriptDPOGS215524-TA1605 bp
ProteinDPOGS215524-PA534 aa
Genomic positionDPSCF300467 + 15231-23603
RNAseq coverage511x (Rank: top 24%)
Annotation
HeliconiusHMEL0096250.078.13% 
BombyxBGIBMGA014063-TA3e-16087.42% 
DrosophilaGS-PI1e-16451.02% 
EBI UniRef50UniRef50_D6WQX31e-16854.51%Putative uncharacterized protein n=2 Tax=Endopterygota RepID=D6WQX3_TRICA
NCBI RefSeqXP_001653705.13e-17255.77%glutathione synthetase [Aedes aegypti]
NCBI nr blastpgi|1571208786e-17155.77%glutathione synthetase [Aedes aegypti]
NCBI nr blastxgi|1950459913e-16256.14%GH24557 [Drosophila grimshawi]
Group
Gene OntologyGO:00055241.9e-223ATP binding
GO:00043631.9e-223glutathione synthase activity
GO:00067501.9e-223glutathione biosynthetic process
GO:00038242.4e-35catalytic activity
GO:00168742.1e-26ligase activity
KEGG pathwayaag:AaeL_AAEL0091548e-172 
 K01920 (E6.3.2.3, gshB)maps-> Glutathione metabolism
InterPro domain[1-533] IPR0056151.9e-223Glutathione synthase, eukaryotic
[236-360] IPR0048875.5e-42Glutathione synthase, substrate-binding, eukaryotic
[256-361] IPR0161852.4e-35PreATP-grasp-like fold
[86-167] IPR0140422.1e-26Glutathione synthase, alpha-helical, eukaryotic
[425-531] IPR0138168.7e-26ATP-grasp fold, subdomain 2
[11-85] IPR0140493.5e-24Glutathione synthase, N-terminal, eukaryotic
Orthology groupMCL10646 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215524-TA
ATGTCACAATCTCGCCTAAAATCTTGTATTCCTCTGCCAATTGACCACAAGATGCTGGTGTCTGTGATAGAAAAAGCGAAGGATTGGGCCCTGATGCACGGAGTTGGGATGCGAGATAAAAAACATTTCAACAAAGATGTCATTCAGATTGCGCCATTTGTTCTTCTGCCATCACCATTCCCTCGGACAGAGTTTAACAAAGCAGTTGAGCTACAGCCAATATTAAATGAATTAATGCACAAGGTTGCACATGACGATGAATTTCTGGAAAAGACACTACAAAATGCACTCCAGGTCGATGAGTTCACAGCAAATCTGTTTGATATATGGGTTAAAGTAAGGGAAGAGGGCATAACACAGCCAATATCACTAGGGATGCTGCGATCTGACATCATGTTGGAGTCGAGATGTCCTCACACAGAGAACCAATGCGCTAAACACACCCCATATTGCTCATGGAAGCAAGTGGAGATAAATAGTATAGCATCTGGTTTTGGACATCTTGGGCCAGTTTCTAAGGATATACAGAGCTTCGGGCTCTTCAGATCGGACTACTTGATGCAAAGGGACGGTAACCGGATCAAACAAGTGGAGTTTAACACTATAGCGTCGAGTTTTGGCGCCCTTACCTCACACCTGCCCGCTATGTCTCGTTATATCCTTCGTCAATTGGGACATGGTGATCTTATAAAAAATATGCCAGAGAATCGAGCTCTGTCAGGTCTATGCACTGGCATCACGGATGCCTACGACCTGTTCGGAGTACCATCGGCTGTTGTTCTGTTTGTGGTTGAGGAGGTGTCCTACAACATTTGCGATCAAAGGTTCCACGAATTTGAGATATCGGAGAAGAGACCCGACATTATGATATACAGGAAAACTCTGAACGAAATATACGAGGAGACGCGACTCAATGAAAAGAAACAACTCATATTGGAGGATCGTCCCGTGGCTGTGGTCTACTACCGCTCTGGTTACGAACCAGCTCAGTATCCGACCACCAAGGAATGGGACGCCAGGCTAAGAGTGGAAAAATCATCGGCGATAAAATGTCCGTCAATACATTATCAGCTCGCTGGCACCAAGAAAGTCCAGCAGGCGTTGGCTGCTCCGGGGGTTTTGGAGAAGTTCATGGGCGCCGGCGCCACTACAGGTCGCGTTAGGGATATATTCGCTGGACTGTACTCATTGGACTTCGACGAGAACGGCGAGAGGGCTGTGGATATGGCCCTAGCTGATGCTGAGAGGTTTGTGTTAAAACCTCAGCGGGAGGGCGGTGGTAATAATGTCTACGGAGCTGATATAAGGGACGCCCTGCTGAGGATGAGGCATAGCAGGGAACGGGCGGCCTACATACTCATGGAGAGAATACTGCCTCCGTTAGTGGCTGGTTACGTCGTTCGTCCAGGTGCAGCTGTTCCACCGCCCATCACAGACCTGGTGTCAGAGCTCGGTATCTTCGGTGTTATTATAGGTACGAAAGACAAAATCTACTGCAACAAACAAGTCGGTCACATGTTACGTACGAAACTAGCGGACGCCAACGAAGGAGGAGTCGCTGCCGGACTGGGGGCGCTCGACTCGCCGTACTTATTAGACATGTAG

Protein sequence:

>DPOGS215524-PA
MSQSRLKSCIPLPIDHKMLVSVIEKAKDWALMHGVGMRDKKHFNKDVIQIAPFVLLPSPFPRTEFNKAVELQPILNELMHKVAHDDEFLEKTLQNALQVDEFTANLFDIWVKVREEGITQPISLGMLRSDIMLESRCPHTENQCAKHTPYCSWKQVEINSIASGFGHLGPVSKDIQSFGLFRSDYLMQRDGNRIKQVEFNTIASSFGALTSHLPAMSRYILRQLGHGDLIKNMPENRALSGLCTGITDAYDLFGVPSAVVLFVVEEVSYNICDQRFHEFEISEKRPDIMIYRKTLNEIYEETRLNEKKQLILEDRPVAVVYYRSGYEPAQYPTTKEWDARLRVEKSSAIKCPSIHYQLAGTKKVQQALAAPGVLEKFMGAGATTGRVRDIFAGLYSLDFDENGERAVDMALADAERFVLKPQREGGGNNVYGADIRDALLRMRHSRERAAYILMERILPPLVAGYVVRPGAAVPPPITDLVSELGIFGVIIGTKDKIYCNKQVGHMLRTKLADANEGGVAAGLGALDSPYLLDM-