Monarch geneset OGS2.0

DPOGS211586
TranscriptDPOGS211586-TA786 bp
ProteinDPOGS211586-PA261 aa
Genomic positionDPSCF300084 + 285348-287176
RNAseq coverage451x (Rank: top 27%)
Annotation
HeliconiusHMEL0169961e-10166.67% 
BombyxBGIBMGA006364-TA3e-6465.36% 
DrosophilaP5cr-PA7e-5543.02% 
EBI UniRef50UniRef50_F4WFL51e-5744.44%Pyrroline-5-carboxylate reductase n=6 Tax=Acromyrmex echinatior RepID=F4WFL5_ACREC
NCBI RefSeqXP_974776.14e-6345.42%PREDICTED: similar to GA19292-PA [Tribolium castaneum]
NCBI nr blastpgi|3407133111e-6247.31%PREDICTED: pyrroline-5-carboxylate reductase 2-like [Bombus terrestris]
NCBI nr blastxgi|3407133112e-6147.31%PREDICTED: pyrroline-5-carboxylate reductase 2-like [Bombus terrestris]
Group
Gene OntologyGO:00065612e-93proline biosynthetic process
GO:00551142e-93oxidation-reduction process
GO:00047352e-93pyrroline-5-carboxylate reductase activity
GO:00166161.7e-34oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor
GO:00164911.7e-34oxidoreductase activity
GO:00054881.1e-28binding
KEGG pathwaytca:6636471e-62 
 K00286 (E1.5.1.2, proC)maps-> Arginine and proline metabolism
InterPro domain[1-260] IPR0003042e-93Pyrroline-5-carboxylate reductase
[153-260] IPR0089271.7e-346-phosphogluconate dehydrogenase, C-terminal-like
[1-151] IPR0160401.1e-28NAD(P)-binding domain
[2-88] IPR0044551.6e-07NADP oxidoreductase, coenzyme F420-dependent
Orthology groupMCL12250 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211586-TA
ATGTCAACTGCGATTGTACAAGGGATATGCAAAAATGAAATCCGCGATTCCTTAAATATCTGGGTCTCAGGACCCCATAAGGAAAATCTTGAACATTGGAAACAATATGGTGCGAATGTAACAACTAGCAATGGAGAAGTTATTTGTAATTGTGATATAGTATTCATTGGAGTGAAACCTGCAATGCTGGATAATGCTTTATGTAATTGCTACTTACCTACAAAAGATCCTAAAAATATTCTGTTCATATCTATGCTTGCTGGAGTGAACATACAAAAATTGAAACAGGTGTTGAAGCAGTTGCCTTTTAATTCCAATGTGATCCGTATTTTTCCAAACACACCTATGTCGGTGGGCGCTGGTTCATGTTTATATGCCATAGATGAAAATGTAACTCTAGAACAGTGTGCTGTTTTGGAGAAATTACTAGCCGGCTGTGGACTGTGTGAAAAAGTTTCTGAACCATTGATGGACTCCTTGGGAATTCTGACAGGATGTGGACCTGCATTTATGTACATGATAATAGAAGCCCTGGCTGATGGTGCAGTAAAACAGGGTGTTCCCCGAGCTATGGCCCTCCGACACTCAGCTCAAATGATGGCCGGCAGTGCAACCATGGTGTTACAAAGCAATAAGCATCCGGGACAGCTAAAGGATGAAGTCTGCTCACCGGGAGGATGTACCATCGCTGGAGTTACCGCCCTTGAGAATGGGAAGCTAAGGGCTACAATGATCAATGCTATAGAAGCAGCGACATTAAGAGTCAAGGAGATGAGCAAGAATTAA

Protein sequence:

>DPOGS211586-PA
MSTAIVQGICKNEIRDSLNIWVSGPHKENLEHWKQYGANVTTSNGEVICNCDIVFIGVKPAMLDNALCNCYLPTKDPKNILFISMLAGVNIQKLKQVLKQLPFNSNVIRIFPNTPMSVGAGSCLYAIDENVTLEQCAVLEKLLAGCGLCEKVSEPLMDSLGILTGCGPAFMYMIIEALADGAVKQGVPRAMALRHSAQMMAGSATMVLQSNKHPGQLKDEVCSPGGCTIAGVTALENGKLRATMINAIEAATLRVKEMSKN-