Monarch geneset OGS2.0

DPOGS207306
TranscriptDPOGS207306-TA1275 bp
ProteinDPOGS207306-PA424 aa
Genomic positionDPSCF300008 + 962275-965098
RNAseq coverage674x (Rank: top 19%)
Annotation
HeliconiusHMEL0082580.086.32% 
BombyxBGIBMGA012088-TA0.083.73% 
Drosophilab-PA2e-18067.94% 
EBI UniRef50UniRef50_Q240622e-17867.94%Black n=12 Tax=Endopterygota RepID=Q24062_DROME
NCBI RefSeqNP_001096055.10.071.56%aspartate 1-decarboxylase [Tribolium castaneum]
NCBI nr blastpgi|2914867650.088.21%black [Papilio xuthus]
NCBI nr blastxgi|2914867650.088.21%black [Papilio xuthus]
Group
Gene OntologyGO:00197521.3e-246carboxylic acid metabolic process
GO:00168311.3e-246carboxy-lyase activity
GO:00301701.3e-246pyridoxal phosphate binding
GO:00038241.8e-106catalytic activity
KEGG pathwayaga:AgaP_AGAP0089040.0 
 K01580 (E4.1.1.15, gadB)maps-> Type I diabetes mellitus
    Alanine, aspartate and glutamate metabolism
    Taurine and hypotaurine metabolism
    Butanoate metabolism
    beta-Alanine metabolism
InterPro domain[3-422] IPR0021291.3e-246Pyridoxal phosphate-dependent decarboxylase
[4-422] IPR0154243e-112Pyridoxal phosphate-dependent transferase, major domain
[24-297] IPR0154211.8e-106Pyridoxal phosphate-dependent transferase, major region, subdomain 1
[298-421] IPR0154227.2e-21Pyridoxal phosphate-dependent transferase, major region, subdomain 2
Orthology groupMCL12072 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207306-TA
ATGGCTAACGTGGCTCGTTACTCTGTGAACACAGGTCACCCATATTTCGTGAATCAACTGTTCTCATCTGTTGACCCATACGGCCTGGTCGGACAGTGGTTGACCGATGCGCTGAATCCAAGTGTCTACACCTTTGAGGTTGCACCAGTTTTTACTTTGATGGAAGAGGAGGTGTTACGTGAGATGCGTAAGATCGTAGGATGGCCGGAAGGGGAGGGCGATGGAATATTTTGCCCGGGAGGCTCTATAGCCAATGGATACGCGATCAGCTGCGCTCGTCACCACTTTTATCCGGAAGTTAAGTATAAAGGTGTACATGCAGTTCCAAAGTTAGTGTTATTTACATCCGAGCTTGCTCATTATTCTACAAAGAAAATGGCTGCTTTCATGGGGATCGGTAGCGACAACTGCGTCAATATTAAGACGGACGATGTTGGAAAGATGAATATAGTGGATTTAGAAATGAAAATTAAGATCGCTATTGATAATAAATGCACTCCATTTATGGTCACAGCTACTTCGGGAACCACAGTTTTCGGTGCTTTTGATCCATTAGTAGCTATATCCGATTTATGCAAGAAATACAATCTTTGGCTACATGTTGATGCAGCTTGGGGTGGAGGTGCACTCATGTCGAAGAAACATAGACATCTCTTAAACGGAATTGAACTAGCCGATTCCGTTACATGGAATCCACATAAACTTCTGGCAGCTCCACAACAATGCTCTACTTTTTTAACCAGGCACAAAAAGGTGCTCAGTGAAGGGCACTCCTCGAATGCCAAGTATCTTTTCCAAAAAGATAAATTCTACGATACGTCATACGATACCGGTGACAAACATATTCAATGCGGTAGGCGAGCAGATGTCTTGAAGTTTTGGTTCATGTGGAAGGCAAAAGGGACAGAAGGTTTCGAAAAACACGTTGACAAGTTATTTGATAATGCAAAATATTTTCTTGATCACATCAAACAAAGAGAAGGCTTCCAGCTCGTTATAGCAGAACCGCAATGCACTAACATTATGTTCTGGTACATTCCTAAATGTCTGCGCGGATGCGAGAACGATGCTGATTATTACGAAAGATTGCATAAGGTGGCACCTAAAATAAAAGAAAGGATGATAAAAGAGGGAAGTATGATGGTCACGTATCAGCCACAAGGTGATCTCGTGAACTTTTTCCGTATTGTTTTTCAAAACTCTGCTCTTGACCACAAGGATATGGTTTACTTTGCTAATGAATTTGAAAGGCTCGGATCGGACATGATTGTCTAA

Protein sequence:

>DPOGS207306-PA
MANVARYSVNTGHPYFVNQLFSSVDPYGLVGQWLTDALNPSVYTFEVAPVFTLMEEEVLREMRKIVGWPEGEGDGIFCPGGSIANGYAISCARHHFYPEVKYKGVHAVPKLVLFTSELAHYSTKKMAAFMGIGSDNCVNIKTDDVGKMNIVDLEMKIKIAIDNKCTPFMVTATSGTTVFGAFDPLVAISDLCKKYNLWLHVDAAWGGGALMSKKHRHLLNGIELADSVTWNPHKLLAAPQQCSTFLTRHKKVLSEGHSSNAKYLFQKDKFYDTSYDTGDKHIQCGRRADVLKFWFMWKAKGTEGFEKHVDKLFDNAKYFLDHIKQREGFQLVIAEPQCTNIMFWYIPKCLRGCENDADYYERLHKVAPKIKERMIKEGSMMVTYQPQGDLVNFFRIVFQNSALDHKDMVYFANEFERLGSDMIV-