Monarch geneset OGS2.0

DPOGS202372
Transcript        DPOGS202372-TA   1425 bp
Protein           DPOGS202372-PA   474 aa
Genomic position  DPSCF300104 + 153006-158352
RNAseq coverage   657x (Rank: top 19%)
Annotation
Heliconius        HMEL002887         5e-106   82.46%
Bombyx            BGIBMGA013895-TA   0.0      79.11%
Drosophila        CG4802-PA          3e-88    60.56%
EBI UniRef50      UniRef50_Q7ZV22    9e-89    57.25%   S-methyl-5'-thioadenosine phosphorylase n=28 Tax=cellular organisms RepID=MTAP_DANRE
NCBI RefSeq       NP_001040514.1     5e-125   79.85%   5'-methylthioadenosine phosphorylase [Bombyx mori]
NCBI nr blastp    gi|114052284       1e-123   79.85%   5'-methylthioadenosine phosphorylase [Bombyx mori]
NCBI nr blastx    gi|114052284       9e-126   80.44%   5'-methylthioadenosine phosphorylase [Bombyx mori]
Group
Gene Ontology     GO:0016763   1.2e-123   transferase activity, transferring pentosyl groups
                  GO:0009116   1.3e-43    nucleoside metabolic process
                  GO:0003824   1.3e-43    catalytic activity
                  GO:0005488   2.5e-15    binding
KEGG pathway      tgu:100228406   4e-89
                  K00772 (E2.4.2.28, mtaP)   maps-> Cysteine and methionine metabolism
InterPro domain   [202-472]   IPR001369   1.2e-123   Purine phosphorylase, family 2
                  [204-447]   IPR010044   5.3e-90    Methylthioadenosine phosphorylase
                  [204-443]   IPR000845   1.3e-43    Nucleoside phosphorylase domain
                  [97-204]    IPR011990   2.5e-15    Tetratricopeptide-like helical
Orthology group   MCL14520   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202372-TA
ATGACTTACGACCTCAAAAGGGAAGAGGAAGTGAAAGAATACGTTGAAAACCTCGGTATTGAATATAGGTTTGGTTGTTACAAAGAGAAAAAACCGGAGGTCTGTCACCTTTTGGGGGATTATTTAGAAGCTATAAAAAAGGATTTTAGCAAAGCGGCGGCGGTTTTCAAGACCAATTGCGACGATTATAATTACGGGAAATCATGTTTAAAGTACGGAAATTACGCGTTGTTGGGAAAAGGCAGGGAAAAGAGTGACACACAGGAGGCATTAAAGTATTTCGAGAAGGGTTGTGAATTGAACGATCCTACGGCATGTTTACATGCTGGGGTGATTTTAACAGCTACTGGACCCGCTGTTACTGTACAACGAGATGTTCCAAAAGGTTACAACTACTTAAAGAAAAGCTGTGATCAAAATGATGCAATGGCTTGTCACTATCTAGCTGGCATGTACTTAACGGGAGTCCCGAAGAATCCGACAGAGTATAACCCACACAATCCAGAGAAGAACAAAAATTTAGACTACCTCATAAAACCTGATCCAATACAGGCTTTTGGTTTTGCCAAAAAGGGTTGCGAGAATGGTAACATATTTGCCTGTGCGAACATCGGTATAATCGGAGGCTCTGGGTTCGATGACCCAGATCTATTTGAAAATCCAATACCCCGTGATGTCGACACTCCATTCGGGAAGCCCTCGGACGTTCTTCTAGAAGGATCTATTAAAGGAGTGTCTTGCGTCCTATTAGCGAGACACGGAAGAAAACACCAGTATCAACCGAGCGATGTCAATTACCGAGCTAATATTTGGGCGTTGAAACAAATTGGCTGTACACATATATTGGCCACTACCGCCACTGGATCGCTTGTAGAAAATTACCGGCCCGGGGATCTTGTTATACTGGACGATTTCATTGACAGAACATGGGGTCGTAAGTGTACGTTCTTCGACGGCACATCGGGTGGACCCCGCGGGGTGTGCCATTTACCCATGAGGCCGGCGTTCTGTGAGCGGGCGAGGGGGGCTCTCGTGACGGCCGCGAACGAGGCGGGGCTACGCTGTCACGAGCGAGGGACTGCGGTCACTATACAGGGACCGAGATTCTCAAGTCGAGCTGAGAGTCTGATGCATCGTCAGTGGGGAGCGCACGTAGTCAACATGACCACCGTACCGGAGGTGGTGTTGGCTAAGGAAGCTGGGTTGAGCTACGCCGCGGTGGCTCTGGTCACCGACTATGACTGCTGGAGAGACAACGAGCAGTCGGTGTCAGTGAGCGAGGTGCTGGAGATGTTCGCGAGGAACATTAAGAAGGCGATCCAGGTGATCGTGGAGGCGGTGGTGCTCCTCGCCGCGGAAGACGACCTGACGTACCTGGACTCACACACGGACCTGGTGTCGTCGGCTGTGATGCTGAAGGACTAG

Protein sequence:

>DPOGS202372-PA
MTYDLKREEEVKEYVENLGIEYRFGCYKEKKPEVCHLLGDYLEAIKKDFSKAAAVFKTNCDDYNYGKSCLKYGNYALLGKGREKSDTQEALKYFEKGCELNDPTACLHAGVILTATGPAVTVQRDVPKGYNYLKKSCDQNDAMACHYLAGMYLTGVPKNPTEYNPHNPEKNKNLDYLIKPDPIQAFGFAKKGCENGNIFACANIGIIGGSGFDDPDLFENPIPRDVDTPFGKPSDVLLEGSIKGVSCVLLARHGRKHQYQPSDVNYRANIWALKQIGCTHILATTATGSLVENYRPGDLVILDDFIDRTWGRKCTFFDGTSGGPRGVCHLPMRPAFCERARGALVTAANEAGLRCHERGTAVTIQGPRFSSRAESLMHRQWGAHVVNMTTVPEVVLAKEAGLSYAAVALVTDYDCWRDNEQSVSVSEVLEMFARNIKKAIQVIVEAVVLLAAEDDLTYLDSHTDLVSSAVMLKD-
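
Consistency check (sketch):

The two sequences above can be cross-checked against the listed lengths: 1425 bp is 475 codons, which matches 474 residues plus one stop codon, and the trailing "-" in the protein record appears to mark that stop. The following is a minimal sketch of such a check. It assumes the standard genetic code (the record does not say which table was used), and `translate` and `stop_symbol` are hypothetical names introduced here, not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), written in the usual
# compact form: codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 0, codon by codon.

    Stop codons are written as `stop_symbol`, so the output can be compared
    directly with a protein record that ends in "-".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT characters
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First seven and last six codons of DPOGS202372-TA, copied from the record.
print(translate("ATGACTTACGACCTCAAAAGG"))
print(translate("GTGATGCTGAAGGACTAG"))
```

Running `translate` on the full DPOGS202372-TA sequence and comparing the result with DPOGS202372-PA would extend the same check to the whole record.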