Monarch geneset OGS2.0

DPOGS210346
TranscriptDPOGS210346-TA1440 bp
ProteinDPOGS210346-PA479 aa
Genomic positionDPSCF300025 - 200186-204394
RNAseq coverage1888x (Rank: top 7%)
Annotation
HeliconiusHMEL0137630.077.27% 
BombyxBGIBMGA011983-TA0.070.97% 
DrosophilaCG1640-PB1e-16259.95% 
EBI UniRef50UniRef50_Q8QZR51e-14458.37%Alanine aminotransferase 1 n=41 Tax=Eumetazoa RepID=ALAT1_MOUSE
NCBI RefSeqXP_001948711.11e-17464.92%PREDICTED: similar to alanine aminotransferase [Acyrthosiphon pisum]
NCBI nr blastpgi|3287045345e-17465.15%PREDICTED: alanine aminotransferase 2-like [Acyrthosiphon pisum]
NCBI nr blastxgi|3287045345e-16965.15%PREDICTED: alanine aminotransferase 2-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00038241.7e-53catalytic activity
GO:00301701.7e-53pyridoxal phosphate binding
GO:00167692.9e-37transferase activity, transferring nitrogenous groups
GO:00090582.9e-37biosynthetic process
GO:00422185.1e-061-aminocyclopropane-1-carboxylate biosynthetic process
GO:00168475.1e-061-aminocyclopropane-1-carboxylate synthase activity
KEGG pathwayapi:1001648994e-174 
 K00814 (E2.6.1.2, GPT)maps-> Alanine, aspartate and glutamate metabolism
    Carbon fixation in photosynthetic organisms
InterPro domain[59-475] IPR0154244.6e-72Pyridoxal phosphate-dependent transferase, major domain
[86-335] IPR0154211.7e-53Pyridoxal phosphate-dependent transferase, major region, subdomain 1
[102-453] IPR0048392.9e-37Aminotransferase, class I/classII
[336-474] IPR0154228.8e-19Pyridoxal phosphate-dependent transferase, major region, subdomain 2
[163-184] IPR0011765.1e-061-aminocyclopropane-1-carboxylate synthase
Orthology groupMCL10420 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210346-TA
ATGACATCACCTTACAGCTTAGATTACACCGTCGCCCTCTGTGTGGTGATAACACTACCAGCACACAGCTCGTGGGGCGACTGGTGCTGGCTGGCCTCGGGTTCGTCGGGTGAGGGGGGAGCGTCCAAACCCTTCCAGAAAGTCATCAGGGCCAACATCGGCGACGCCCACGCTATGGGCCAGCGACCCATCACCTTCATCAGACAGGTGTTGTCATGTATAACGAATACTGACTTATTAGAAAAAGGCGACTTCCCCGACGACGTGAAGGACAGGGCCAGGGAGATCCTCTGTGGCTGCGGCGGCGGCTCCGTGGGGTCGTACTCGGCGTCTCAGGGTATAGAGCACATCAGGCGACACGTTGCGGAGTACATCGCCAAGAGAGACGGCCACCCCGCCCGCTGGCAGGACATCTGCCTCTCTGCCGGCGCCTCCACCGCCATCAAGAATTGTCTGCAGCTGTTCTGTAAGGAAATCGACGGCAAGAAGAGCGGTGTAATGATCCCCATCCCTCAGTATCCTCTGTACTCGGCGTCGCTGGCGGAGTACGGCCTGGAGCAGGTCGGCTACTACCTGGACGAGGAGTGCAATTGGGGTCTCAGCACACAGGAGCTGGAGAGGAGCTTGCAGGAGGCGCAGCAGACCTGCAACGTGCGGGCGCTCGTGGTCATCAACCCCGGGAACCCCACGGGACAGGTGTTGACGCGAGAGAACATCGAGCAGGTGATAAAGTTCGCGCACAAACACAAGCTGTTCATCTTCGCCGACGAGGTGTACCAGGACAACGTGTACGCCGAAGGTAGCGCCTTCCACTCCTTCAAGAAGGTGCTGGTGGAGCTGGGCGCGGAGTACAGCTCGCAGGAGCTGGCGTCGTTCATGACTATCAGCAAGGGCTACATGGGCGAGTGCGGTCTCCGCGGCGGCTGGGTGGAGTTGGTGAACATGCTGCCGGAGGTGCAGGCCCAGCTATACAAGTGCATGTCAGCCATGCTGTGCCCGTCCGTGCTGGGCCAGGCCGTGGTGGATTGCGTCGCCAAGCCGCCCGCCCCCGGCGAGCCGTCCTATGACCTCTGGATCAAAGAGAAGACGGACATACTGACCTCCCTCAAGGACCGCGCTAAGCTCATCGCCGAGACCTTCAACACCTTCGAGGGGTTCAAGTGTAACGTGGTGCAGGGCGCGATGTACGCCTTCCCCAGGATCACGCTTCCCCCGAAGGCGATCCAGGCAGCCAAGGAGAAGAACATGGCGCCGGACGTGTTCTATGCCTTCAGACTGCTGGAGGAGACAGGCGTGTGTATAGTCCCCGGGTCTGGGTTCGGGCAGGCGCCGGGCACGTTCCACTTCCGCACCACAATCCTGCCGCAGCCGGACCTCCTCAACACCATGCTGGAGAACTTCAGGAACTTCCACAAGAAGTTCACTCAGGAGTACGCCTAG

Protein sequence:

>DPOGS210346-PA
MTSPYSLDYTVALCVVITLPAHSSWGDWCWLASGSSGEGGASKPFQKVIRANIGDAHAMGQRPITFIRQVLSCITNTDLLEKGDFPDDVKDRAREILCGCGGGSVGSYSASQGIEHIRRHVAEYIAKRDGHPARWQDICLSAGASTAIKNCLQLFCKEIDGKKSGVMIPIPQYPLYSASLAEYGLEQVGYYLDEECNWGLSTQELERSLQEAQQTCNVRALVVINPGNPTGQVLTRENIEQVIKFAHKHKLFIFADEVYQDNVYAEGSAFHSFKKVLVELGAEYSSQELASFMTISKGYMGECGLRGGWVELVNMLPEVQAQLYKCMSAMLCPSVLGQAVVDCVAKPPAPGEPSYDLWIKEKTDILTSLKDRAKLIAETFNTFEGFKCNVVQGAMYAFPRITLPPKAIQAAKEKNMAPDVFYAFRLLEETGVCIVPGSGFGQAPGTFHFRTTILPQPDLLNTMLENFRNFHKKFTQEYA-