Monarch geneset OGS2.0

DPOGS215291
TranscriptDPOGS215291-TA1452 bp
ProteinDPOGS215291-PA483 aa
Genomic positionDPSCF300120 - 496657-498108
RNAseq coverage1291x (Rank: top 10%)
Annotation
HeliconiusHMEL0086600.089.65% 
BombyxBGIBMGA007601-TA0.083.64% 
DrosophilaCG7145-PA0.071.52% 
EBI UniRef50UniRef50_Q7QA890.075.78%AGAP004366-PA n=6 Tax=Eumetazoa RepID=Q7QA89_ANOGA
NCBI RefSeqXP_313649.30.075.78%AGAP004366-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479717700.075.78%AGAP004366-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479717700.075.78%AGAP004366-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00065619.7e-214proline biosynthetic process
GO:00057599.7e-214mitochondrial matrix
GO:00551149.7e-214oxidation-reduction process
GO:00038429.7e-2141-pyrroline-5-carboxylate dehydrogenase activity
GO:00081526e-116metabolic process
GO:00164916e-116oxidoreductase activity
GO:00166204.4e-51oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
KEGG pathwayaga:AgaP_AGAP0043660.0 
 K00294 (E1.5.1.12)maps-> Arginine and proline metabolism
    Alanine, aspartate and glutamate metabolism
InterPro domain[1-481] IPR0059319.7e-214Delta-1-pyrroline-5-carboxylate dehydrogenase 1
[2-470] IPR0161616e-116Aldehyde/histidinol dehydrogenase
[5-465] IPR0155901.3e-113Aldehyde dehydrogenase domain
[4-247] IPR0161624.3e-57Aldehyde dehydrogenase, N-terminal
[248-440] IPR0161634.4e-51Aldehyde dehydrogenase, C-terminal
Orthology groupMCL12094 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215291-TA
ATGCCACATGACCATGGACGGAAATTGGCGAAGTTTTATTATGCCAGCGAGAAAACTATTCAAAAGGCGATCCAAGCGTCCGCCGATGCTCAACGGCGTTGGGATAAAACGCCACTCGAAGAGAGAATACGCATTTGGCAGAACGCTGCCGACCTCATGGCGGGCGCCCATCGCCAACGACTGAATGCAGCCACCATGCTCGGCCAGTCAAAGTCGGTTGTGCAAGCGGAAATTGATTCAGCCGCCGAACTTATTGACTTCTTCCGTTTTAATGTTTTTTTCTTAAAAGAAAATGCAAAATATCAACCTATCTCTGAAAACTTGTCAGTTACAAGAAACTCGCTTAGATTTAGAGGTTTGGACGGATTTGTTGCTGCTATAAGTCCATTCAACTTTACTGCTATTGGAGGGAATCTGGCCTACACCCCAGCCCTCATGGGGAATGGTGTCTTATGGAAACCCTCTGATACAGCTCTGCTGTCAAATTGGCGCATATTTAACATCATGAGAGAGGCTGGTCTTCCACCTGGTATTGTTAACTTTGTCCCAGCTGATGGACCTACTTTTGGTAAAACTATCACAGCTTCTTCACGGCTGGCTGGAATTAACTTCACTGGATCTGTGCCAACTTTCAATTGGTTATGGAATGAGGTAGGAAAGAATCTCAACAAATACCAGAACTACCCCAGACTCATTGGAGAATGTGGAGGAAAGAACTATCACTTTATACATCCATCTGCAGACATACAGTCTGTTGTAACAGGCACAATTCGGTCAGCTTTTGAATATTGTGGTCAAAAGTGCTCCGCCTGCTCCAGAATGTATGTACCCAGATCCTTATCTGAACCAATTAAACAAGGACTGCTTGAAGAAAGGTCAAAATTGAAAATTGGTGACCCCACAGACTTTAAGATATTTACAGGGGCAGTTATTGACGATAAAGCCTTTGCCAGGATTACAGGCTACATCAAGAATGCTAAAAATAATCCTAAAAACAAAATTTTGGGAGGTGGAGAATTTGATGGAAGCAAAGGATACTTTGTGCAACCCACAATCATCGAGACAGTGGACCCATTTGATAAGCTGATGACAGAAGAGATCTTCGGGCCTGTCCTCACCATGTATGTTTATGAAGACAGGGACCTGGACCAGGCGCTGGGCCTTGTAGGCTCTTCTACTAAATTTGCCTTAACAGGTGCAGTATTTGCTACAGACCAGAAGTTCCTAGAAAGTGCTTTTGAAGAACTTAAAATGACAGCGGGTAATTTCTACCTGAACGACAAGTCAACCGGTTCCGTCGTGGGGCAGCAGCCATTCGGAGGAGGTCGCATGTCCGGCACCAACGACAAAGCGGGCGGACCCAACTATGTAATGCGTTGGACTACTCCACAGTCCATCAAGGAGACCTTCGTTCCTCTAAAAGATATTGACTACCCTTACATGAGGGACTAA

Protein sequence:

>DPOGS215291-PA
MPHDHGRKLAKFYYASEKTIQKAIQASADAQRRWDKTPLEERIRIWQNAADLMAGAHRQRLNAATMLGQSKSVVQAEIDSAAELIDFFRFNVFFLKENAKYQPISENLSVTRNSLRFRGLDGFVAAISPFNFTAIGGNLAYTPALMGNGVLWKPSDTALLSNWRIFNIMREAGLPPGIVNFVPADGPTFGKTITASSRLAGINFTGSVPTFNWLWNEVGKNLNKYQNYPRLIGECGGKNYHFIHPSADIQSVVTGTIRSAFEYCGQKCSACSRMYVPRSLSEPIKQGLLEERSKLKIGDPTDFKIFTGAVIDDKAFARITGYIKNAKNNPKNKILGGGEFDGSKGYFVQPTIIETVDPFDKLMTEEIFGPVLTMYVYEDRDLDQALGLVGSSTKFALTGAVFATDQKFLESAFEELKMTAGNFYLNDKSTGSVVGQQPFGGGRMSGTNDKAGGPNYVMRWTTPQSIKETFVPLKDIDYPYMRD-