Monarch geneset OGS2.0

DPOGS201017
TranscriptDPOGS201017-TA801 bp
ProteinDPOGS201017-PA266 aa
Genomic positionDPSCF300147 + 291770-292570
RNAseq coverage703x (Rank: top 18%)
Annotation
HeliconiusHMEL0130552e-10775.41% 
BombyxBGIBMGA009104-TA2e-12274.81% 
DrosophilaSptr-PA2e-4337.69% 
EBI UniRef50UniRef50_C0STP81e-11772.36%Sepiapterin reductase n=4 Tax=Obtectomera RepID=C0STP8_BOMMO
NCBI RefSeqXP_001654165.19e-5340.53%short-chain dehydrogenase [Aedes aegypti]
NCBI nr blastpgi|2266931309e-12074.81%sepiapterin reductase [Bombyx mori]
NCBI nr blastxgi|2266931306e-11374.81%sepiapterin reductase [Bombyx mori]
Group
Gene OntologyGO:00047571.3e-69sepiapterin reductase activity
GO:00551141.3e-69oxidation-reduction process
GO:00067291.3e-69tetrahydrobiopterin biosynthetic process
GO:00054883e-33binding
GO:00081522.8e-12metabolic process
GO:00164912.8e-12oxidoreductase activity
KEGG pathwayaag:AaeL_AAEL0018453e-52 
 K00072 (SPR)maps-> Folate biosynthesis
InterPro domain[14-264] IPR0063931.3e-69Sepiapterin reductase
[14-260] IPR0160403e-33NAD(P)-binding domain
[15-191] IPR0021982.8e-12Short-chain dehydrogenase/reductase SDR
[14-31] IPR0023472.6e-09Glucose/ribitol dehydrogenase
Orthology groupMCL12241 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201017-TA
ATGGCCGCATCTACTAACATTGATTTATCTGGGCCATCTTTCTGCGTCGTGTCCGGCGCATCTCAGGGAATAGGTAGGGCTCTAGCCATCGAAGTATCGAAATGCCTGAAACCAAAATCTGTCATGGTGCTACTGGCTCGCAATAAACAGCAACTTGCGATCACAGCCAGCCTCTGTGAAAATGATGGATTGAAAGTTCTCATTAACTCTATAGATCTGTCGATAGCATCAGAGAAAGAAATGACTGATGTTATCATGCAAGCTCTGGGTGGACAGAAGGTTACCGATTTTGCCAACTGCATAATATTCCATAACGTTGGCTCACTTGGTAACTTGGCTGTGGAGACGACTCGAATGGAAAATATCGAGGAACTGAGAGGATACTATGATTTGAATGTGTTCAAAGTAATATCACTTAACACACAGCTCCTTAAAATATTTGAAGAGGTGGAGGATAGGGTTATCATTGTCAACATCACATCACTGTGTGCCATCAAGCCAATGGGCGGTATGGCTTACTACTGCAGCGGGAAGGCCGCAAGGGAAATGTACTTCCGAGTACTCGCTGAAGAGAAACGGCACATTAGGGTCTTGAACTATTCCCCCGGCCCTGTCGAAACTGCCATGATAGATTTTGTTCTCCAAGAAGCTGTTAATGAAAACCTCCGAGATGTGTTCACATCATTCAAGAACCAGGGAACATTGCTGACACCTGAGATAACAGCTAAAAAATGTATCAAGGTTCTGCTAGCAGGAAAGTTCAGTCCCGGGGAACACATCGACTACTTCGATGACGAATAA

Protein sequence:

>DPOGS201017-PA
MAASTNIDLSGPSFCVVSGASQGIGRALAIEVSKCLKPKSVMVLLARNKQQLAITASLCENDGLKVLINSIDLSIASEKEMTDVIMQALGGQKVTDFANCIIFHNVGSLGNLAVETTRMENIEELRGYYDLNVFKVISLNTQLLKIFEEVEDRVIIVNITSLCAIKPMGGMAYYCSGKAAREMYFRVLAEEKRHIRVLNYSPGPVETAMIDFVLQEAVNENLRDVFTSFKNQGTLLTPEITAKKCIKVLLAGKFSPGEHIDYFDDE-