Monarch geneset OGS2.0

DPOGS210347
TranscriptDPOGS210347-TA1314 bp
ProteinDPOGS210347-PA437 aa
Genomic positionDPSCF300025 - 53562-56017
RNAseq coverage8x (Rank: top 86%)
Annotation
HeliconiusHMEL0137633e-11748.98% 
BombyxBGIBMGA011984-TA6e-15757.11% 
DrosophilaCG1640-PB2e-12745.80% 
EBI UniRef50UniRef50_P242982e-11546.23%Alanine aminotransferase 1 n=72 Tax=Eukaryota RepID=ALAT1_HUMAN
NCBI RefSeqXP_001948711.16e-13649.37%PREDICTED: similar to alanine aminotransferase [Acyrthosiphon pisum]
NCBI nr blastpgi|3287045344e-13549.58%PREDICTED: alanine aminotransferase 2-like [Acyrthosiphon pisum]
NCBI nr blastxgi|3287045341e-13049.58%PREDICTED: alanine aminotransferase 2-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00038244.3e-39catalytic activity
GO:00301704.3e-39pyridoxal phosphate binding
GO:00167695.3e-11transferase activity, transferring nitrogenous groups
GO:00090585.3e-11biosynthetic process
KEGG pathwayapi:1001648992e-135 
 K00814 (E2.6.1.2, GPT)maps-> Alanine, aspartate and glutamate metabolism
    Carbon fixation in photosynthetic organisms
InterPro domain[1-429] IPR0154246.8e-53Pyridoxal phosphate-dependent transferase, major domain
[248-432] IPR0154224.3e-39Pyridoxal phosphate-dependent transferase, major region, subdomain 2
[87-236] IPR0154218.8e-27Pyridoxal phosphate-dependent transferase, major region, subdomain 1
[85-235] IPR0048395.3e-11Aminotransferase, class I/classII
Orthology groupMCL30891 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210347-TA
ATGAAGATAGAGAAATCTCTCAACGAAGACAACATGAACCAGCGTCTCCTGAAGATACATTATGCGGTTCGCGGACCCATCCTGGAGCGAGCCCTGGCCATACAGACGGACCTCAACGAGGGTGTACTGAAGCCATTTAAGCGTGTAATCCGAGCAAACATCGGTGACTGTCACGCTCTGGGTCAAAGACCGATAACTTTCATCAGACAAGTGTTGGCATTGGCGACCTGTCCTGGTCTCCGGAGTTTACAAAACATACCCGAGGATGTAAAGGAACGAGTCAGAGAAATCCTAGGAGAGTGCGTGAGTGGATCAGTGGGATCGTATTCACCAGCTCCAGGACTCCTGTTAATTCGTAAGCACGTTGCTCAATACCTGACCGCCCGTGATGGTGTGGCCGCCAACTTTAATAACATATACCTCGGCTCGGGAGCTTCCGATCTCATCAAAAGTGTTCTCACCCTTTTTGTGGAAAAAGTCGATGGAAAACCACCAGGTGTTATGATACCCATTCCCCAGTATCCATTATTTTCGGGGACCCTCTCGGAGTTGGGACTGCAGCAGGTGGACTACTATTTAGACGAAGATGACGGCTGGGTTCTGAAGTACGAAGAGTTGGAGCGGAGCTGGCGAGCAGCGAGCGAACACTGTAGCGTGCGAGCAATAGTCGTCATTAATCCTGGGAACCCAACGGGACAGGTGATGCATGAGATGGGCGATCCATTTAAGAAGCTGCAGTTGTCTTCGTTCATGACGTGTTCCAAGGGCTGGGCTGCGGAGTGCGGTCTACGAGCCGGCGTTTTGGAGCTCGTGTCTCTAGAGCCTCGAGTGATCTCCGCCCTGGAGGCGGCGCGGTCGACTCAGCAGTGTGCCAGTGTGCTCGGACAGTGTGTTGTGGACTGCGTGATGCGTCCTCCGACTCCAGGTTCTCCGTCGTTTTCTCTCTTCTCTTCTGAACGTGATCGCCTGAGGCGCGCTCTCAGTGAACGAGCCTTCGCGGCTCACACAGCTTTCAACTCCATTCCGGGTTACTTCTGTAACCCTATCGAAGGTGCAATGTTTGCATTTCCACGCATTGAAATACCGGGAAAAGCAAAACAGGAGGCCGCTGAGAGGGGTTTGGTTCCTGATGAATTTTACTGTCTACGACTGCTCGAAGAAACAGGAGTATGCGTGGTGCCCGGATCAGGGTTCGGTCAGCGCCCCGGCTCTTACCATTTTCGAACCACCATTCTTCACGACAAAGACGAATTTAGCTACATGTTAGCGTGCATACGACGGTTCCATTTAAACTTTATTGAGGAATATTCATAA

Protein sequence:

>DPOGS210347-PA
MKIEKSLNEDNMNQRLLKIHYAVRGPILERALAIQTDLNEGVLKPFKRVIRANIGDCHALGQRPITFIRQVLALATCPGLRSLQNIPEDVKERVREILGECVSGSVGSYSPAPGLLLIRKHVAQYLTARDGVAANFNNIYLGSGASDLIKSVLTLFVEKVDGKPPGVMIPIPQYPLFSGTLSELGLQQVDYYLDEDDGWVLKYEELERSWRAASEHCSVRAIVVINPGNPTGQVMHEMGDPFKKLQLSSFMTCSKGWAAECGLRAGVLELVSLEPRVISALEAARSTQQCASVLGQCVVDCVMRPPTPGSPSFSLFSSERDRLRRALSERAFAAHTAFNSIPGYFCNPIEGAMFAFPRIEIPGKAKQEAAERGLVPDEFYCLRLLEETGVCVVPGSGFGQRPGSYHFRTTILHDKDEFSYMLACIRRFHLNFIEEYS-