Monarch geneset OGS2.0

DPOGS208874
TranscriptDPOGS208874-TA876 bp
ProteinDPOGS208874-PA291 aa
Genomic positionDPSCF300009 - 1359852-1362472
RNAseq coverage12x (Rank: top 83%)
Annotation
HeliconiusHMEL0047573e-1430.00% 
BombyxBGIBMGA002508-TA1e-1129.55% 
DrosophilaCG2964-PA4e-1229.55% 
EBI UniRef50UniRef50_P524891e-1131.82%Pyruvate kinase 2 n=25 Tax=Opisthokonta RepID=KPYK2_YEAST
NCBI RefSeqXP_315228.45e-1429.10%AGAP004596-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3853048212e-1431.25%cdc19 [Dekkera bruxellensis AWRI1499]
NCBI nr blastxgi|3205826122e-1434.65%Pyruvate kinase [Ogataea parapolymorpha DL-1]
Group
Gene OntologyGO:00309551.9e-17potassium ion binding
GO:00002871.9e-17magnesium ion binding
GO:00047431.9e-17pyruvate kinase activity
GO:00060961.9e-17glycolysis
KEGG pathwaycnb:CNBC41309e-15 
 K00873 (PK, pyk)maps-> Purine metabolism
    Glycolysis / Gluconeogenesis
    Type II diabetes mellitus
    Pyruvate metabolism
    Carbon fixation in photosynthetic organisms
InterPro domain[152-285] IPR0157956.5e-21Pyruvate kinase, C-terminal-like
[164-285] IPR0157941.9e-17Pyruvate kinase, alpha/beta
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208874-TA
ATGGGCTGGTGGATAAATAAGTTTACACGACCTTCTAACTCTTGCAGTTGTGGTGCTCTGTGCTATCATATGGAGGATGATGAAAACGATATATTTTATCAATCTTCCAATATAATATTTACCTTCAAGGGAAGGAAATTAATTTTATCATTACCAGACCATATAATATACGACGTAGATGACATCTACAGTCTCACTTGTTGCGGCGTCAACATCTTCAACGTTGATATGACGGCTGAAGATGAATCGTTCTTAAAAAACGTCACAGATGTTATTACTGAATCCGAAAATCTACCAAGAATGTTTCCATTTACATACTACAGACCAACGGGTTTATCAGTTACTATGGACATCAACAAATCTGTACAAATTAACGAAAGATTTTCAGATGGAGTTGTGTTAGATCTTTTCGACGATAATCCAAGTTGGGATGAATTTTGTAACTGTCCTCTGGACTTTTGTAAAGAGACGAAAATACCTCTGCTTGCACCTTTATCTACAGCTTTTGCGGCATCCCTGGCGGCTCTTACAAGTGGCGCCAGAGTAATACTAGTACTATCAGTCACCGGTGTTTCATCTCAATTGATATCTTTTACCTCACCCCCGTGTCATATCATCTGTATTATATCTAGAAAATCAATGGCTCGTCGACTACATATGTACCGTAAAGTCATCCCATTATTTTGTAAACCGAATCGCTCCACAAACTGGCATCAGAAATGTTGGAGTCGCATACATTTTGGCACGACTTTCGCTTTGAAGATCGGTTTGCTGGAGCTTGGAGCCAAATTAGTGGTGGTACAGCCCTCGGAAGAAGCAAATGGATATTGTGATTCTTTGCGAATTTTGTCCATTCCACTGCTGTGTGATAAGTGA

Protein sequence:

>DPOGS208874-PA
MGWWINKFTRPSNSCSCGALCYHMEDDENDIFYQSSNIIFTFKGRKLILSLPDHIIYDVDDIYSLTCCGVNIFNVDMTAEDESFLKNVTDVITESENLPRMFPFTYYRPTGLSVTMDINKSVQINERFSDGVVLDLFDDNPSWDEFCNCPLDFCKETKIPLLAPLSTAFAASLAALTSGARVILVLSVTGVSSQLISFTSPPCHIICIISRKSMARRLHMYRKVIPLFCKPNRSTNWHQKCWSRIHFGTTFALKIGLLELGAKLVVVQPSEEANGYCDSLRILSIPLLCDK-