Monarch geneset OGS2.0

DPOGS204308
TranscriptDPOGS204308-TA1875 bp
ProteinDPOGS204308-PA624 aa
Genomic positionDPSCF300046 + 647312-655387
RNAseq coverage492x (Rank: top 25%)
Annotation
HeliconiusHMEL0151550.080.51% 
BombyxBGIBMGA007582-TA0.083.23% 
DrosophilaGclc-PA0.071.60% 
EBI UniRef50UniRef50_P485060.059.39%Glutamate--cysteine ligase catalytic subunit n=86 Tax=Eukaryota RepID=GSH1_HUMAN
NCBI RefSeqXP_966349.10.069.81%PREDICTED: similar to glutamate cysteine ligase isoform 1 [Tribolium castaneum]
NCBI nr blastpgi|910936590.069.81%PREDICTED: similar to glutamate cysteine ligase isoform 1 [Tribolium castaneum]
NCBI nr blastxgi|910936590.069.81%PREDICTED: similar to glutamate cysteine ligase isoform 1 [Tribolium castaneum]
Group
Gene OntologyGO:00043573.8e-171glutamate-cysteine ligase activity
GO:00067503.8e-171glutathione biosynthetic process
KEGG pathwaytca:6565670.0 
 K11204 (GCLC)maps-> Glutathione metabolism
InterPro domain[2-622] IPR0043080Glutamate-cysteine ligase catalytic subunit
Orthology groupMCL11255 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204308-TA
ATGGGTTTATTGACGGAAGGTAGCCCTCTTTCTTGGGAAGAAACCAAAGCATTGGCGGAACATGTGCGTCGACATGGTGTTGAACAGTTTATTAATCTCTACAGCAAGCTTAGAGACCGTACTGGAGATGTTCTCAAGTGGGGTGATGAGGTGGAGTATATTATAGTTAAATTTGATGATGTTAATCAGCGAGCCACTGTAAGTCTAAGAGCTGAGGAGGTTCTGCCTAAACTGCAAGAAAAAGAATTGAAAGATCCTCAAAATGTAAAAAGTCTCTGGCGGCCAGAATACGGAGCTTATATGATTGAAGGTACTCCAGGGAAGCCCTACGGAGGGCTGCTGGCCCATTTCAACATAGTTGAAGCAAATATGCGCTATCGTAGGGCCGAAGCAAGTGCACTTTTAAAAGACGGTGAAGTCATTATGAGTATTACGAATTTTCCTAGATTAGGCAGTCCAAACTTTACATCTCCACCATACAAACCAACTCCAGACAGGGGTGTGTCATTATCATACTTCTTTCCTGATCAGGCAACATTCCCTGGACACCCACGTTATAAGACACTGGCCGAAAACATTCGGAAGAGAAGAGGAAATAGAATGGCCATAAATATTCCAGTTTTCCGCGATGTGAACACTAAGATTCCTATCGACGATTACCACAAGATACTGCCAGATTTGGCCAAACCAGACTCCGTGTATTTGGATGCTATGGGTTTGGGTATGGGGTGCTGCTGTCTTCAGGTCACATTCCAGGCTTGTTGTATAACTGAAGCTCGCACATTGTACGATCAACTTGCCCCTTTATGTCCAATCATGCTAGCGTTGTCTGCCGCTTCTCCCGTGTACCGAGGTTACCTGACGGATGTTGACTGCCGCTGGAATGTCGTCTCGGCCTCGGTGGATTGCCGTACTAGAGAGGAGTTGGGCTTAGAGCCGCTGAAGAATGACAAGTTCCGCATACACAAATCACGTTACGACTCCATCGATTCCTACCTATCACCTGAACATGAGAAGTACAACGATATCGAGGTGGTGCACGATCCCGCAGTGTACCGCCGTCTTCGCGAGGGTGGTATAGATCACCCTCTGGCGATCCACGTGGCACATCTCTTCATACGAGACACCGTGTCTCTGTTCAGTGAAAAGGTTCACCAGGATGACGAGAATGATACTGATCATTTTGAAAACATTCAATCTACCAACTGGCAGACCATGCGTTTCAAGCCGCCTCCTCCGAACTCGCCGATCGGTTGGCGCGTCGAGTTCCGTCCATGTGACGCTCAACTCACAGACTTTGAGAACGCCGCCTATGTATGTTTCGTGGTGCTCCTCACGCGCGTCATATTAACATACAACCTCAAATTCGTGATGCCCATCAGTAAGGTGGACGAAAACATGCAACGCGCTCAGCGTCGTGGCGCGTGCGCTTCACAGCGCTTCTGGTGGCGTCGCGACGTACGCTCACAAGACGCCGACACGTATCTGGAGATGACCGTACACGAGATCATTAACGGAAAGGAGGGCGTGTTCCCTGGTCTTATCCCTCTCATAGAGTCCTACCTGTCCGGTATGGACGTAGACGCGGACACTCACTGCTCCGTGCAACAGTACCTGAAGCTGATACAACGCCGCGCCTCCGGAGAAATACTCACCATGGCCTCCTGGATGAGAGAATTCATTGACAAACACCCGCAATACAAAAAAGATTCCATCGTCACCGAAAAGATCAACTACGACCTTCTAAAGACAGCGTACGGTATTCAGTCTGGTACGATCCCAGCTCCCACACTCCTCGGCAGTTCCAATGTGTCCAAGACCAACGACGACATCCCAAAAGCCTTCAGCAAGATGATGAGCAAGGACTGTCCTTAG

Protein sequence:

>DPOGS204308-PA
MGLLTEGSPLSWEETKALAEHVRRHGVEQFINLYSKLRDRTGDVLKWGDEVEYIIVKFDDVNQRATVSLRAEEVLPKLQEKELKDPQNVKSLWRPEYGAYMIEGTPGKPYGGLLAHFNIVEANMRYRRAEASALLKDGEVIMSITNFPRLGSPNFTSPPYKPTPDRGVSLSYFFPDQATFPGHPRYKTLAENIRKRRGNRMAINIPVFRDVNTKIPIDDYHKILPDLAKPDSVYLDAMGLGMGCCCLQVTFQACCITEARTLYDQLAPLCPIMLALSAASPVYRGYLTDVDCRWNVVSASVDCRTREELGLEPLKNDKFRIHKSRYDSIDSYLSPEHEKYNDIEVVHDPAVYRRLREGGIDHPLAIHVAHLFIRDTVSLFSEKVHQDDENDTDHFENIQSTNWQTMRFKPPPPNSPIGWRVEFRPCDAQLTDFENAAYVCFVVLLTRVILTYNLKFVMPISKVDENMQRAQRRGACASQRFWWRRDVRSQDADTYLEMTVHEIINGKEGVFPGLIPLIESYLSGMDVDADTHCSVQQYLKLIQRRASGEILTMASWMREFIDKHPQYKKDSIVTEKINYDLLKTAYGIQSGTIPAPTLLGSSNVSKTNDDIPKAFSKMMSKDCP-