Monarch geneset OGS2.0

DPOGS214557
TranscriptDPOGS214557-TA1572 bp
ProteinDPOGS214557-PA523 aa
Genomic positionDPSCF300266 - 11615-35114
RNAseq coverage52x (Rank: top 70%)
Annotation
HeliconiusHMEL0082581e-10740.12% 
BombyxBGIBMGA003219-TA2e-10488.44% 
DrosophilaGad1-PA0.066.47% 
EBI UniRef50UniRef50_P202280.066.47%Glutamate decarboxylase n=16 Tax=Drosophila RepID=DCE_DROME
NCBI RefSeqXP_974463.10.067.25%PREDICTED: similar to AGAP005866-PA [Tribolium castaneum]
NCBI nr blastpgi|3407233670.067.51%PREDICTED: glutamate decarboxylase-like [Bombus terrestris]
NCBI nr blastxgi|3504269760.067.51%PREDICTED: glutamate decarboxylase-like isoform 2 [Bombus impatiens]
Group
Gene OntologyGO:00197523.4e-237carboxylic acid metabolic process
GO:00168313.4e-237carboxy-lyase activity
GO:00301703.4e-237pyridoxal phosphate binding
GO:00038242.4e-74catalytic activity
KEGG pathwaytca:6633150.0 
 K01580 (E4.1.1.15, gadB)maps-> Type I diabetes mellitus
    Alanine, aspartate and glutamate metabolism
    Taurine and hypotaurine metabolism
    Butanoate metabolism
    beta-Alanine metabolism
InterPro domain[24-520] IPR0021293.4e-237Pyridoxal phosphate-dependent decarboxylase
[55-520] IPR0154241.6e-97Pyridoxal phosphate-dependent transferase, major domain
[137-395] IPR0154212.4e-74Pyridoxal phosphate-dependent transferase, major region, subdomain 1
[396-519] IPR0154225.1e-15Pyridoxal phosphate-dependent transferase, major region, subdomain 2
Orthology groupMCL10976 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214557-TA
ATGTCAAAGCTGACTTTGGACGGCTTTTTTTCCTACTGCAAGCAAATAGCTGAACACATCCAGTTGGGCGAAAGAATGGCTTCCTTGTCAGGACAAAGTACCCCCAAACTGACGTCTGGTGTTGGCAACCTGTATTACGATTCGATCCTGCCATACAGAGAGGATGCCGGCCCCCAAACACGTGAGTTCCTCGCCAGGGTTGTGGACGTGTTGCTGGACTTCATACAGAAAGTCAATGACAGGGATGAAAAGATCTTGGAGTTCAAGATGCCTGAGGAGATGCAGAAAGTGCTGGACCTGCATCTGCCCGACGAACCGCTGCCGTTGAAACAACTTCTAGAAGACTGCAAGGTCACATTGAAACACCAGGTCAAGACCGGTCATCCCCACTTCTTCAACCAACTATCCTGTGGCCTGGACATCATTTCTTTAGCTGGAGAGTGGCTTACTGCTGCTGCCAATACAAACATGTTCACGTATGAGATCGCTCCCGTGTTCATTCTGATGGAGAATGTGGTTCTGGAGAAGATGAGGGCGATGATTGGTTGGAAAACAGGAGATTCTATCTTGGCACCAGGTAAATACAGCCTGGTGGCTCTGTGTCCAATTTATATGCTTTTCTTGGCGGCTCGTCATCATAAGTTCCCGCAGTACAAGGAAAAGGGCCTGACCAGTATTCCTGGACACCTGGTCATGTTTACCTCTGATCAGTGCCATTACTCTGTGAAATCCTGTGCATCTGTGTGCGGATTGGGCACAGACTACTGTGTGTGCGTGCCCAGTGACGAGCGTGGCAGAATGATACCAACTGAGTTGGAGCGTCTCGTCAGATATCACAAGGACAGAGGACATGTGCCGTTCTTCGTGAACGCCACTTCTGGTACCACGGTGCTGGGAGCATTCGATCCGCTCATGGAGATAGCTGACATCTGTGAAAAATATGATATGTGGATGCATGTTGATGTAACGCATTCAGTCACATGGAATCCCCACAAGCTGATGGGAACTCTGCTGCAGTGTTCAACTGTTCACTTCAGATATGAGGGTATACTTCTGAGCTGTAACGCTATGTCAGCGGAGTATCTTTTCATGACGGACAAGATTTACGATCCTAGATACGACACAGGCGACAAAGTCATCCAGTGTGGTCGCCACAACGATATATTCAAACTGTGGCTTCAGTGGCGTGGCAAGGGTACAACCGGCTTCGAGCGTCACATGGACCGTCTGATGGAACTGTCCGAGTACATGGTGCGTCGCATCAAGGAACAGTCTGACAAGTTCCACCTCATCCTCGAACCGGAGATGGTGAACGTTAGCTTCTGGTACCTGCCCATCCAGCTCAGGGGACAACACCACGATAAGAACAAGGAGATCAAACTTGGAAAGGTATGCGCCAAGCTGAAGGGTCGTATGATGCAAGCCGGCACTATCATGGTCGGCTACCAGCCAGATGACCGTCGACCAAATTTCTTCAGGAACATTATCTCCTCAGCTGCTGTCACTGAACGAGATGTGGACTTCCTCCTCAGCGAGATGGACCGCCTCGGACACGACATCGTCGTCGACTAA

Protein sequence:

>DPOGS214557-PA
MSKLTLDGFFSYCKQIAEHIQLGERMASLSGQSTPKLTSGVGNLYYDSILPYREDAGPQTREFLARVVDVLLDFIQKVNDRDEKILEFKMPEEMQKVLDLHLPDEPLPLKQLLEDCKVTLKHQVKTGHPHFFNQLSCGLDIISLAGEWLTAAANTNMFTYEIAPVFILMENVVLEKMRAMIGWKTGDSILAPGKYSLVALCPIYMLFLAARHHKFPQYKEKGLTSIPGHLVMFTSDQCHYSVKSCASVCGLGTDYCVCVPSDERGRMIPTELERLVRYHKDRGHVPFFVNATSGTTVLGAFDPLMEIADICEKYDMWMHVDVTHSVTWNPHKLMGTLLQCSTVHFRYEGILLSCNAMSAEYLFMTDKIYDPRYDTGDKVIQCGRHNDIFKLWLQWRGKGTTGFERHMDRLMELSEYMVRRIKEQSDKFHLILEPEMVNVSFWYLPIQLRGQHHDKNKEIKLGKVCAKLKGRMMQAGTIMVGYQPDDRRPNFFRNIISSAAVTERDVDFLLSEMDRLGHDIVVD-