Monarch geneset OGS2.0

DPOGS215437
Transcript: DPOGS215437-TA, 822 bp
Protein: DPOGS215437-PA, 273 aa
Genomic position: DPSCF300298 - 129982-136424
RNAseq coverage: 1211x (Rank: top 10%)
Annotation
Heliconius       HMEL016321        2e-120  78.52%
Bombyx           BGIBMGA005727-TA  3e-134  82.66%
Drosophila       CG5840-PA         3e-81   57.14%
EBI UniRef50     UniRef50_Q9VEJ3   4e-79   57.14%  Pyrroline-5-carboxylate reductase n=24 Tax=Endopterygota RepID=Q9VEJ3_DROME
NCBI RefSeq      XP_975446.1       2e-93   62.83%  PREDICTED: similar to pyrroline-5-carboxylate reductase [Tribolium castaneum]
NCBI nr blastp   gi|91079134       4e-92   62.83%  PREDICTED: similar to pyrroline-5-carboxylate reductase [Tribolium castaneum]
NCBI nr blastx   gi|91079134       2e-88   63.30%  PREDICTED: similar to pyrroline-5-carboxylate reductase [Tribolium castaneum]
Group
Gene Ontology
GO:0006561  2.4e-113  proline biosynthetic process
GO:0055114  2.4e-113  oxidation-reduction process
GO:0004735  2.4e-113  pyrroline-5-carboxylate reductase activity
GO:0005488  2.3e-43   binding
GO:0016616  6.4e-35   oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor
GO:0016491  6.4e-35   oxidoreductase activity
KEGG pathway
tca:664343  6e-93
K00286 (E1.5.1.2, proC) maps-> Arginine and proline metabolism
InterPro domain
[1-263]    IPR000304  2.4e-113  Pyrroline-5-carboxylate reductase
[1-160]    IPR016040  2.3e-43   NAD(P)-binding domain
[161-270]  IPR008927  6.4e-35   6-phosphogluconate dehydrogenase, C-terminal-like
[2-98]     IPR004455  3.9e-18   NADP oxidoreductase, coenzyme F420-dependent
Orthology group: MCL12466, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215437-TA
ATGAAGATAGGTTTCATCGGAGGCGGTAAACTTGCTTACGCCTTGGCCAATGGCTTTGTTTCTGCAGGATTGGCTAAGCCCGACGAGATTACTGCTAGCTGTCACCCAGCAGATAAGGCTAGCGCCGAAGCATTCAAAAGCTTAGGAGCTACAGCACTATTTGAAAACAAGTCTGTTGTGGAACGCTCAGAAGTAGTTATAGTCTCGGTCAAGCCAGATGTAGTGGTGCCAGCTCTCAAGGATGTCAAAGATCTTGCTGCGTCCAAAAATAAATTATTCATATCAGTGGCCATGGGCGTTTCGACAAGCACCATAGAGAAGGCATTACCGTCAGAAGCACGCGTTATTCGCGTGATGCCGAACACTCCAGCTCTAGTCAAAGAAGGTGCTGCAGCGTTAAGCAGAGGTTCAAAAGCTACAGCAGAGGACGCCAAGTTGGCAGCTGAACTTTTCCGAGCTGTCGGCACGTGTGACGAGGTCCCAGAGTATCAAATGGATGCAGTCACAGCTTTGAGCGGCAGTGGTCCAGCTTATGTGTACATGCTGATAGAGTCCCTAGCGGATGGTGGTGTCCGATGTGGACTCCCCCGGGACCTGGCACTTCGGCTCGCCACCCAGACCACGAGAGGCGCTGCCAGTATGCTGAGTACCGGAAGCCATCCAGCTGTGTTGAAGGACAATGTGACTTCTCCAGCTGGTTCCACAGCCGAGGGGACCTATCACCTGGAACAGAATGGATTCAGATCAGCTGTTATTGGAGCAGTAATGGCTGCGACGAATAGATGTAAGGTCGTTAATGAACAATTAAACCAAATGAATTAA

Protein sequence:

>DPOGS215437-PA
MKIGFIGGGKLAYALANGFVSAGLAKPDEITASCHPADKASAEAFKSLGATALFENKSVVERSEVVIVSVKPDVVVPALKDVKDLAASKNKLFISVAMGVSTSTIEKALPSEARVIRVMPNTPALVKEGAAALSRGSKATAEDAKLAAELFRAVGTCDEVPEYQMDAVTALSGSGPAYVYMLIESLADGGVRCGLPRDLALRLATQTTRGAASMLSTGSHPAVLKDNVTSPAGSTAEGTYHLEQNGFRSAVIGAVMAATNRCKVVNEQLNQMN-
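
The lengths reported for this record are mutually consistent: 273 residues plus one stop codon correspond to 3 x 274 = 822 nucleotides. The sketch below shows one way such a check might be scripted for records in this FASTA layout. It assumes that the trailing "-" in the protein sequence marks the stop codon; the helper names read_fasta and cds_matches_protein are hypothetical and not part of any database tooling, and the input shown is a toy record rather than the sequences above.

```python
def read_fasta(text):
    """Parse FASTA-formatted text into a dict of {record id: sequence}."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            # The record id is the first whitespace-delimited token after ">"
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def cds_matches_protein(cds, protein, stop_marker="-"):
    """Return True if the CDS length is 3 nt per residue plus one stop codon.

    Assumption: a trailing stop_marker in the protein string stands for
    the stop codon and is not counted as a residue.
    """
    residues = protein.rstrip(stop_marker)
    return len(cds) == 3 * (len(residues) + 1)


# Toy record in the same layout as the sequences above (not real data).
example = """>toy-TA
ATGAAGATATAA

>toy-PA
MKI-
"""

seqs = read_fasta(example)
print(cds_matches_protein(seqs["toy-TA"], seqs["toy-PA"]))

# Expected transcript length for a 273-residue protein plus a stop codon.
print(3 * (273 + 1))
```

This only compares lengths; it does not translate the nucleotide sequence or confirm that the residues match the codons.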