Monarch geneset OGS2.0

DPOGS213064
Transcript        DPOGS213064-TA   1248 bp
Protein           DPOGS213064-PA   415 aa
Genomic position  DPSCF300016 - 808588-814013
RNAseq coverage   1753x (Rank: top 7%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL010361        0.0      93.98%
Bombyx          BGIBMGA007681-TA  0.0      91.37%
Drosophila      Pgk-PA            0.0      77.59%
EBI UniRef50    UniRef50_E5EVW6   0.0      81.73%    Phosphoglycerate kinase n=1 Tax=Bombyx mori RepID=E5EVW6_BOMMO
NCBI RefSeq     XP_968140.1       0.0      82.65%    PREDICTED: similar to putative phosphoglycerate kinase [Tribolium castaneum]
NCBI nr blastp  gi|91084675       0.0      82.65%    PREDICTED: similar to putative phosphoglycerate kinase [Tribolium castaneum]
NCBI nr blastx  gi|91084675       0.0      82.65%    PREDICTED: similar to putative phosphoglycerate kinase [Tribolium castaneum]
Group
Gene Ontology    GO:0006096  3.4e-297  glycolysis
                 GO:0004618  3.4e-297  phosphoglycerate kinase activity
KEGG pathway     tca:656521  0.0
                 K00927 (PGK, pgk) maps-> Glycolysis / Gluconeogenesis
                                          Carbon fixation in photosynthetic organisms
InterPro domain  [1-415]    IPR001576  3.4e-297  Phosphoglycerate kinase
                 [198-413]  IPR015901  7.8e-87   Phosphoglycerate kinase, C-terminal
                 [6-197]    IPR015824  1.1e-64   Phosphoglycerate kinase, N-terminal
Orthology group  MCL11394  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213064-TA
ATGGCTCTGAACAAACTCAGTATCGACGCTTTAAATTTAAGTGGAAAACGAGTTTTAATGAGGGTGGATTTTAATGTTCCCTTAAAGGATGGTGTTATTACTAATAATCAGCGTATTGTGGCTGCGTTGGATTCCATTAAGTATGCCTTAGACAAGGGAGCTAAATCAGTAGTACTTATGTCACATCTTGGAAGACCTGATGGACAAGCAAACCTCAAGTACACATTGAAACCAGTAGCCGAAGAGCTTAAAAAACTTCTTAATAGGGACATAAATTTCTTGTCTGACTGTGTAGGGTCAGAAGTTGAAGGGGCTTGTGCTGATCCCCCAGTAGGTTCAGTAATATTGCTAGAGAATCTTCGCTTCCATGTTGAAGAGGAGGGCAAAGGTGTAGATGCCAGTGGAGCAAAAGTAAAAGCTGACCCAGCAAAAGTTAAAGAGTTCAGGTCTAGTTTACGCAAACTTGGAGATGTTTACATCAATGATGCTTTTGGTACTGCTCACAGAGCACACAGCTCTATGATGGGAGAAGGTTTTGATCAGAGAGCCAGTGGTTTCTTACTGAAGAAAGAGCTCCAGTACTTTGCTAAAGCTCTTCATGAGCCAGAAAGACCATTCCTTGCCATTCTGGGAGGTGCAAAAGTAGCAGACAAAATTTTACTTATAGAAAATCTGCTTGACAAAGTTAACGAGATGATCATCGGTGGAGGCATGGCATACACCTTCCTCAAGGAAACCCAGGGCATGGCTATTGGCAACTCTCTCTATGATGCTGATGGCGCTAAAATTGTTGCAAAACTTCTTGAAAAGGCAGCCAAAAACAATGTCAAAGTACATCTACCTGTAGACTTTGTGACAGCTGATAAATTTGATGAGAATGCACAGGTTGGCTCAGCCGATGTTGCGTCTGGTATACCTGATGGATGGATGGGTCTTGATGTTGGACCCAAGTCAAGAGAATTATTTGCTGAACCAATCGCCAGAGCCAAAGTCATTGTATGGAATGGACCAGCAGGAGTATTTGAATTTGAGAAATTTGCTGGCGGAACTCGTGCTCTTATGGACGGAGTAGTGAAGGCTACTACTAATGGATGCGTCACAATTATTGGTGGAGGTGACACCGCTACCTGCTGTGCCAAGTGGGGCACCGAGGACAAGGTTTCTCATGTGTCCACCGGTGGGGGCGCTTCACTCGAGCTCCTTGAAGGTAAAGTGCTTCCAGGTGTCGCCGCGCTATCTGATGCATAA

Protein sequence:

>DPOGS213064-PA
MALNKLSIDALNLSGKRVLMRVDFNVPLKDGVITNNQRIVAALDSIKYALDKGAKSVVLMSHLGRPDGQANLKYTLKPVAEELKKLLNRDINFLSDCVGSEVEGACADPPVGSVILLENLRFHVEEEGKGVDASGAKVKADPAKVKEFRSSLRKLGDVYINDAFGTAHRAHSSMMGEGFDQRASGFLLKKELQYFAKALHEPERPFLAILGGAKVADKILLIENLLDKVNEMIIGGGMAYTFLKETQGMAIGNSLYDADGAKIVAKLLEKAAKNNVKVHLPVDFVTADKFDENAQVGSADVASGIPDGWMGLDVGPKSRELFAEPIARAKVIVWNGPAGVFEFEKFAGGTRALMDGVVKATTNGCVTIIGGGDTATCCAKWGTEDKVSHVSTGGGASLELLEGKVLPGVAALSDA-
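
Consistency check (illustrative sketch, not part of the original record):

The transcript length (1248 bp) and protein length (415 aa) agree if the coding sequence is 415 codons plus one stop codon, and the protein record above appears to mark the stop with a trailing "-". The Python sketch below assumes the standard genetic code; the function name translate and the stop_symbol parameter are hypothetical helpers introduced only for this illustration. It translates the first and last few codons of DPOGS213064-TA so the result can be compared with the two ends of DPOGS213064-PA.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence; stop codons become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six codons and last three codons of DPOGS213064-TA, copied from the record.
print(translate("ATGGCTCTGAACAAACTC"))  # compare with the start of DPOGS213064-PA
print(translate("GATGCATAA"))           # compare with the end of DPOGS213064-PA

# Length bookkeeping: 415 residues plus one stop codon, three bases each.
print(415 * 3 + 3)
```

Passing the full nucleotide sequence to translate should allow the same comparison over the whole protein; only the two ends are shown here to keep the example short.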