Monarch geneset OGS2.0

DPOGS201634
TranscriptDPOGS201634-TA882 bp
ProteinDPOGS201634-PA293 aa
Genomic positionDPSCF300639 + 10632-14592
RNAseq coverage2x (Rank: top 91%)
Annotation
HeliconiusHMEL0137632e-7154.96% 
BombyxBGIBMGA011984-TA1e-7553.60% 
DrosophilaCG1640-PB1e-6346.31% 
EBI UniRef50UniRef50_C3ZV833e-5743.80%Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZV83_BRAFL
NCBI RefSeqXP_392720.28e-7352.44%PREDICTED: similar to CG1640-PA, isoform A isoform 1 [Apis mellifera]
NCBI nr blastpgi|3838539748e-7252.44%PREDICTED: alanine aminotransferase 2-like [Megachile rotundata]
NCBI nr blastxgi|3838539742e-6852.44%PREDICTED: alanine aminotransferase 2-like [Megachile rotundata]
Group
Gene OntologyGO:00038242.6e-42catalytic activity
GO:00301702.6e-42pyridoxal phosphate binding
GO:00167698.2e-21transferase activity, transferring nitrogenous groups
GO:00090588.2e-21biosynthetic process
KEGG pathwayame:4091962e-72 
 K00814 (E2.6.1.2, GPT)maps-> Alanine, aspartate and glutamate metabolism
    Carbon fixation in photosynthetic organisms
InterPro domain[61-291] IPR0154212.6e-42Pyridoxal phosphate-dependent transferase, major region, subdomain 1
[13-293] IPR0154244.2e-40Pyridoxal phosphate-dependent transferase, major domain
[51-226] IPR0048398.2e-21Aminotransferase, class I/classII
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201634-TA
ATGGACCCATTTGGACCTACGGTGTCCCATTCTCGGGTCTCGCCAGCAAGATCAATATATTGTGCCTACATCGAGTACGAAATAACATCCTACGAACCATCGCAAATGCTCCGTTGGTTCTCCAGAAATGACGATATACATAAACATCTCAACGTCCCAACAGTTCAATCAGAGATAGACAAAAACAAAAAAAAGAGTGGATCAGTGGGATCGTATTCACCAGCTCCAGGACTCCTGTTAATTCGTAAGCACGTTGCTCAATACCTGACCGCCCGTGATGGTGTGGCCGCCAACTTTAATAACATATACCTCGGCTCGGGAGCTTCCGATCTCATCAAAAGTGTTCTCACCCTTTTTGTGGAAAAAGTCGATGGAAAACCACCAGGTGTTATGATACCCATTCCCCAGTATCCATTATTTTCGGGGACCCTCTCGGAGTTGGGACTGCAGCAGGTGGACTACTATTTAGACGAAGATGACGGCTGGGTTCTGAAGTACGAAGAGTTGGAGCGGAGCTGGCGAGCAGCGAGCGAACACTGTAGCGTGCGAGCAATAGTCGTCATTAATCCTGGGAACCCAACGGGACAGGTTTTGTCGAGGGATAACATCGAAGATATCATTAGATTTGCATACAAACACAACCTCTTTATCGTGGCAGATGAGGTGATGCATGAGATGGGCGATCCATTTAAGAAGCTGCAGTTGTCTTCGTTCATGACGTGTTCCAAGGGCTGGGCTGCGGAGTGCGGTCTACGAGCCGGCGTCTTGGAGCTCGTGTCTCTAGAGCCTCGAGTGATCTCCGCCCTGGAGGCGGCGCGGTCGACTCAGCAGTGTGCCAGTGTGCTCGGACAGTGTGTTGTGGACTGCGTGGTCAGTCAATAG

Protein sequence:

>DPOGS201634-PA
MDPFGPTVSHSRVSPARSIYCAYIEYEITSYEPSQMLRWFSRNDDIHKHLNVPTVQSEIDKNKKKSGSVGSYSPAPGLLLIRKHVAQYLTARDGVAANFNNIYLGSGASDLIKSVLTLFVEKVDGKPPGVMIPIPQYPLFSGTLSELGLQQVDYYLDEDDGWVLKYEELERSWRAASEHCSVRAIVVINPGNPTGQVLSRDNIEDIIRFAYKHNLFIVADEVMHEMGDPFKKLQLSSFMTCSKGWAAECGLRAGVLELVSLEPRVISALEAARSTQQCASVLGQCVVDCVVSQ-