Monarch geneset OGS2.0

DPOGS208787
Transcript: DPOGS208787-TA (1707 bp)
Protein: DPOGS208787-PA (568 aa)
Genomic position: DPSCF300036 - 684521-690122
RNAseq coverage: 1001x (Rank: top 13%)
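
The lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the transcript length covers only the coding sequence including its stop codon, and that the scaffold coordinates are 1-based and inclusive; neither convention is stated in the record itself.

```python
# Values copied from the record above.
transcript_bp = 1707
protein_aa = 568
scaffold_start, scaffold_end = 684521, 690122

# A coding sequence should be a whole number of codons.
assert transcript_bp % 3 == 0
codons = transcript_bp // 3

# Assumption: the last codon is a stop codon and is not counted as a residue.
coding_codons = codons - 1

# Assumption: coordinates are 1-based and inclusive.
genomic_span = scaffold_end - scaffold_start + 1

print(codons, coding_codons, genomic_span)
```

Under these assumptions the 569 codons correspond to 568 residues plus one stop codon, and the genomic span is longer than the transcript, which would be consistent with introns in the gene model.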
Annotation
Heliconius      HMEL004190        4e-179  86.43%
Bombyx          BGIBMGA007647-TA  0.0     80.53%
Drosophila      Prat2-PB          0.0     75.96%
EBI UniRef50    UniRef50_P28173   0.0     64.09%  Amidophosphoribosyltransferase n=65 Tax=Bilateria RepID=PUR1_CHICK
NCBI RefSeq     XP_001657045.1    0.0     79.51%  amidophosphoribosyltransferase [Aedes aegypti]
NCBI nr blastp  gi|157137373      0.0     79.51%  amidophosphoribosyltransferase [Aedes aegypti]
NCBI nr blastx  gi|157137373      0.0     78.65%  amidophosphoribosyltransferase [Aedes aegypti]
Group
Gene Ontology    GO:0009113  2.9e-300  purine base biosynthetic process
                 GO:0004044  2.9e-300  amidophosphoribosyltransferase activity
                 GO:0008152  5.1e-17   metabolic process
                 GO:0009116  4.8e-13   nucleoside metabolic process
KEGG pathway     aag:AaeL_AAEL003581  0.0
                 K00764 (E2.4.2.14, purF) maps-> Alanine, aspartate and glutamate metabolism; Purine metabolism
InterPro domain  [54-568]   IPR005854  2.9e-300  Amidophosphoribosyl transferase
                 [119-258]  IPR000583  5.1e-17   Glutamine amidotransferase, class-II
                 [371-482]  IPR000836  4.8e-13   Phosphoribosyltransferase
Orthology group  MCL10883  Multiple-copy universal gene
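
The InterPro ranges listed above can be checked against the 568 aa protein length. This is a minimal sketch that assumes the bracketed ranges are 1-based, inclusive residue positions; the dictionary and helper names are illustrative only.

```python
# Protein length and InterPro ranges copied from the record above.
protein_length = 568
domains = {
    "IPR005854": (54, 568),   # Amidophosphoribosyl transferase
    "IPR000583": (119, 258),  # Glutamine amidotransferase, class-II
    "IPR000836": (371, 482),  # Phosphoribosyltransferase
}

def domain_length(start, end):
    """Length of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1

lengths = {acc: domain_length(*span) for acc, span in domains.items()}

# Every domain should fall within the protein.
in_bounds = all(1 <= s <= e <= protein_length for s, e in domains.values())

# The two shorter domains should sit inside the family-level match.
outer_start, outer_end = domains["IPR005854"]
nested = all(
    outer_start <= s and e <= outer_end
    for acc, (s, e) in domains.items()
    if acc != "IPR005854"
)

print(lengths, in_bounds, nested)
```

With these coordinates the two shorter domains lie inside the longer IPR005854 match and do not overlap each other.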
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208787-TA
ATGGAGGGTCCCGCAGGACCATCATGTCGCCTGTTAAGCGATGACGTGGCAGGGCACAGTCAAGACACAAATGACAACTGTCAGAGTGACAGCTGTCACTGTGACAGGACACAGGCCGAGGTAAAGATAAAAAGCAAGAGGTTCGGAAGAGGAGCAGTTGAGTCAGGTCTGACACACGAATGCGGCGTGTTCGGTGCTATAGGGACGGGAGAGTGGCCGACACAAGTCGATGTGGCACAGGTTATATGTCTTGGCCTGGTGGCGCTGCAACATCGAGGTCAAGAATCCGCGGGAATAGTGACGTCAGAGGGCAAGAGCGCTCGCACCTTCAACTCGCACAAGGGCATGGGACTCATAAACAACATATTCAATGATGATGCCATGAGGAAACTGAAGGGGAACCTGGGGATAGGACACACGAGGTATTCGACGTCGGCCGCTAGTGAGGAGGTGAACTGTCAGCCGTTCGTGGTGCACACGGCGCACGGCGCGCTCGCGGTCGCCCACAACGGGGAGCTCGTCAACTGCAGCAGTCTGAGGAAGATGGTGTTGGGTCGCGGTGTGGGTCTCTCGACTCACTCGGACTCGGAGTTGATCACACAGGCGCTGTGTCTGAACCCGCCCGAGGGAGAGACGGACGGCCCCGACTGGCCCGCCAGGATCAACCACCTCATGAGGCTGGCTCCGCTGAGCTACTCGCTGGTGATCATGCTGAAAGATAAGATATACGCCGTCCGCGATCCTTACGGGAACAGGCCGCTGTGCCTCGGGAAGATATTACCGCTCGGATCATCCTATGCCTACAAATCCTCATCATCGAAGCATGCGGCGGTTTTGTTGAACGGCTGTGCTAAGAACGGTATGGACGACAAGCCCGAGGGCTGGGTGGTGTCGTCCGAGTCGTGCGGGTTCCTGTCTATTGGCGCTCGTTACGTACGCGAGGTTCTTCCCGGGGAGATCATCGAGATGTCCCGCCGCGGCATCAGGACAGTGGACGTGGTGGAGCGACCGGCCATGAAACAACAAGCCTTCTGCATCTTCGAATATGTCTACTTCGCACGGGCTGATAGTATTTTTGAAGGTCAGATGGTGTATTCCGCTCGCTTGCAATGTGGCCGCATGTTAGCGCGCGAGTCTCCTGTAGACGCAGACATCGTGTCTTCCGTGCCGGAGTCAGGAACCGCCGCCGCTCACGGATACGCCAGACAGTCCGGGATCCCGTTCATGGAGGTGTTGTGTAAGAACCGTTACGTGGGCCGCACCTTCATCCAGCCCTCGACTCGTCTCCGACAGCTCGGCGTGGCCAAGAAGTTCGGCGCCCTGTCCGAGAACGTGCGCGGGAAGCGGATCGTCCTCATAGACGACTCCATCGTCAGAGGCAACACCATAGGACCCATCATAAAACTCCTGAGGGACGCCGGCGCCGCTGAGGTTCATATCCGGATAGCCTCTCCGCCGCTCAAGTACCCCTGCTATATGGGGATCAACATTCCCACAAGAGAGGAACTTATCGCTAACAAGATGGACCCCTTCAAGCTCGCTGAGCACGTCGGCGCCGACAGTCTGGAGTATCTGAGCGTGGAGGGTCTGGTGAGCGCGGTGCACTACAACATGAAGACGACGCCCAGCGACGGCGTGGGCGGCCACTGCACGGCCTGCCTCACGGGCGACTACCCCGGCGGGCTGCCCGACGACGCCGACTGGTGA
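
The sequence above is given as a FASTA record: a header line starting with `>` followed by the sequence. A minimal sketch of reading such a record is shown below; `parse_fasta` is a hypothetical helper, not part of any library, and the example input is only the first 30 nucleotides of the transcript, wrapped over two lines for illustration.

```python
def parse_fasta(text):
    """Parse FASTA text into a dict mapping record ID to sequence.

    The ID is taken as the first whitespace-delimited token after '>'.
    Sequence lines belonging to one record are concatenated.
    """
    records = {}
    current_id = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:].strip()
            current_id = header.split()[0] if header else ""
            records[current_id] = []
        elif current_id is not None:
            records[current_id].append(line)
    return {rec_id: "".join(parts) for rec_id, parts in records.items()}


# Truncated excerpt of the record above (first 30 nt only).
excerpt = ">DPOGS208787-TA\nATGGAGGGTCCCGCA\nGGACCATCATGTCGC\n"
records = parse_fasta(excerpt)
print(records["DPOGS208787-TA"], len(records["DPOGS208787-TA"]))
```

Applied to the full record, the length of the parsed sequence can be compared with the 1707 bp stated for the transcript.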

Protein sequence:

>DPOGS208787-PA
MEGPAGPSCRLLSDDVAGHSQDTNDNCQSDSCHCDRTQAEVKIKSKRFGRGAVESGLTHECGVFGAIGTGEWPTQVDVAQVICLGLVALQHRGQESAGIVTSEGKSARTFNSHKGMGLINNIFNDDAMRKLKGNLGIGHTRYSTSAASEEVNCQPFVVHTAHGALAVAHNGELVNCSSLRKMVLGRGVGLSTHSDSELITQALCLNPPEGETDGPDWPARINHLMRLAPLSYSLVIMLKDKIYAVRDPYGNRPLCLGKILPLGSSYAYKSSSSKHAAVLLNGCAKNGMDDKPEGWVVSSESCGFLSIGARYVREVLPGEIIEMSRRGIRTVDVVERPAMKQQAFCIFEYVYFARADSIFEGQMVYSARLQCGRMLARESPVDADIVSSVPESGTAAAHGYARQSGIPFMEVLCKNRYVGRTFIQPSTRLRQLGVAKKFGALSENVRGKRIVLIDDSIVRGNTIGPIIKLLRDAGAAEVHIRIASPPLKYPCYMGINIPTREELIANKMDPFKLAEHVGADSLEYLSVEGLVSAVHYNMKTTPSDGVGGHCTACLTGDYPGGLPDDADW-
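
The protein record can be related to the transcript by translating codons. The sketch below assumes the standard genetic code and writes a stop codon as `-`, following the trailing `-` in the protein sequence above; `translate` is an illustrative helper, and the inputs are only the first ten and last six codons of the transcript.

```python
from itertools import product

# Standard genetic code in TCAG order; '-' marks a stop codon,
# matching the convention used in the protein record above.
BASES = "TCAG"
AMINO = "FFLLSSSSYY--CC-WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): AMINO[i]
    for i, codon in enumerate(product(BASES, repeat=3))
}

def translate(seq):
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    seq = seq.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )

# First ten codons and last six codons of DPOGS208787-TA.
start = translate("ATGGAGGGTCCCGCAGGACCATCATGTCGC")
end = translate("GACGACGCCGACTGGTGA")
print(start, end)
```

The translated fragments match the start (`MEGPAGPSCR`) and the end (`DDADW-`) of the protein sequence above.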