Monarch geneset OGS2.0

DPOGS210350
TranscriptDPOGS210350-TA1434 bp
ProteinDPOGS210350-PA477 aa
Genomic positionDPSCF300025 + 88509-92606
RNAseq coverage125x (Rank: top 57%)
Annotation
HeliconiusHMEL0137630.067.57% 
BombyxBGIBMGA011983-TA1e-17062.72% 
DrosophilaCG1640-PB4e-16053.68% 
EBI UniRef50UniRef50_Q8TD302e-15053.15%Alanine aminotransferase 2 n=74 Tax=Metazoa RepID=ALAT2_HUMAN
NCBI RefSeqXP_001948711.12e-17259.62%PREDICTED: similar to alanine aminotransferase [Acyrthosiphon pisum]
NCBI nr blastpgi|3287045347e-17259.83%PREDICTED: alanine aminotransferase 2-like [Acyrthosiphon pisum]
NCBI nr blastxgi|3287045342e-16659.83%PREDICTED: alanine aminotransferase 2-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00038241e-52catalytic activity
GO:00301701e-52pyridoxal phosphate binding
GO:00167691.4e-39transferase activity, transferring nitrogenous groups
GO:00090581.4e-39biosynthetic process
GO:00422181e-051-aminocyclopropane-1-carboxylate biosynthetic process
GO:00168471e-051-aminocyclopropane-1-carboxylate synthase activity
KEGG pathwayapi:1001648995e-172 
 K00814 (E2.6.1.2, GPT)maps-> Alanine, aspartate and glutamate metabolism
    Carbon fixation in photosynthetic organisms
InterPro domain[1-473] IPR0154242.2e-78Pyridoxal phosphate-dependent transferase, major domain
[84-339] IPR0154211e-52Pyridoxal phosphate-dependent transferase, major region, subdomain 1
[84-459] IPR0048391.4e-39Aminotransferase, class I/classII
[361-471] IPR0154222.8e-19Pyridoxal phosphate-dependent transferase, major region, subdomain 2
[135-155] IPR0011761e-051-aminocyclopropane-1-carboxylate synthase
Orthology groupMCL10420 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210350-TA
ATGGCATCGATGACATTGAAAAATATCAACCCGAACATACTCAAGGTGGAGTACGCTGTTCAAGGTCCCTTAGTGGCTCGAGCTGGTGAAATAGAAGAAGAATTAAAAAGGGGAGTAGAAAAACCATTCAAGAGTGTAATAAGAGCGAACATTGGAGACGCACAAGCCATGGGACAGGTTCCAATCACATTTATAAGACAGGTACTAGCTTGTATTTCGTATCCAGAGCTCATTGGAGAAGGAAATTTTCCTGAAGATGTCAAAGAAAGAGCTCGAGAAATCTTAAACAGTTGCAGTGGGGGCTCGATAGGATCTTATTCAGCACCGTACGGTATTGATCACATCCGACGCCACGTCGCCGAGTACATCGAAAGACGAGATGCCCTGCCGGCTAACTGGCAAAATGTGTTTCTCTCTGCCGGCGCGTCCACAGCGATAAAGTATTGTTTGCAGCTATTTAGTAACGATACGCATGGGAAAAAGAATGGTGTTTTGACTCCCATTCCTCAATACCCATTATATTCTGCTTGCTACGCGAAGTACGGCCTTACCCAAGTGGGTTATTATTTGGACGAAACCACTAACTGGAGTCTCAGTATTGAAGAGTTAGAGCGAAGATTAAACGAGGCTGAGAAGAGGTGCAATGTGAGGGCACTTGTGGTTATTAATCCAGGAAATCCAACGGGACAAGTTTTAAGTCGTAATAATATGGAAGATATTATCAAATTCGTCTACAAGCACAATTTACTTCTTATATCTGATGAGGTCTACCAACACAATATATATGCTGAAGGTATCCAATTCTTCTCATTCAGAAAGGTTATGATGGAACTAGGCGCACCGTACTCATCGATCGAGCTGGCTTCGATCATGTCGGCAAGTAAGGGCTACATGGGAGAATGCGGTCTGCGTGGAGGCTGGGTGGAGCTCACCAACTTCCATCCCGATGTACAAGCGCAACTCTATAAGTTCATGTCGGCCATGGTGTCACCGAATATATTGGGGCAGGCCGCCATTGACTGCGTGGCTAAGCCACCTTTACCTGGAGAGCCATCATATGATTTATGGCTGCGCGAAAAGGAATCAGTGCTTGCGTCCCTTAGAGAACGCGCCAAGATGATAACTGATGGCCTTAACGCAGGCCAGGGATTCAAATGTAACATAGTGCAGGGAGCAATGTACGCTTTTCCTACTGTTATTCTGCCTCCGAAGGCCGTTGCAGCAGCTGAAGATGCCAAACAACCACCTGACGTTTTCTATGCTTACAGGCTCTTAGAAGAAACTGGTATTTGCGTGGTTCCTGGCAGCGGATTCGGCCAGGCTCCGGGAACTCATCACATTCGAATGACCATTCTGCCGAAGACGGACGATCTTCGCACTATGATTCACAGTATTAAAACCTTTCACCAGAGCTTTATCGAGCGATATACCTAA

Protein sequence:

>DPOGS210350-PA
MASMTLKNINPNILKVEYAVQGPLVARAGEIEEELKRGVEKPFKSVIRANIGDAQAMGQVPITFIRQVLACISYPELIGEGNFPEDVKERAREILNSCSGGSIGSYSAPYGIDHIRRHVAEYIERRDALPANWQNVFLSAGASTAIKYCLQLFSNDTHGKKNGVLTPIPQYPLYSACYAKYGLTQVGYYLDETTNWSLSIEELERRLNEAEKRCNVRALVVINPGNPTGQVLSRNNMEDIIKFVYKHNLLLISDEVYQHNIYAEGIQFFSFRKVMMELGAPYSSIELASIMSASKGYMGECGLRGGWVELTNFHPDVQAQLYKFMSAMVSPNILGQAAIDCVAKPPLPGEPSYDLWLREKESVLASLRERAKMITDGLNAGQGFKCNIVQGAMYAFPTVILPPKAVAAAEDAKQPPDVFYAYRLLEETGICVVPGSGFGQAPGTHHIRMTILPKTDDLRTMIHSIKTFHQSFIERYT-