Monarch geneset OGS2.0

DPOGS210295
TranscriptDPOGS210295-TA1671 bp
ProteinDPOGS210295-PA556 aa
Genomic positionDPSCF300551 - 8501-14216
RNAseq coverage1812x (Rank: top 7%)
Annotation
HeliconiusHMEL0055440.087.05% 
BombyxBGIBMGA004221-TA0.086.74% 
DrosophilaPgi-PB0.073.82% 
EBI UniRef50UniRef50_P067440.070.13%Glucose-6-phosphate isomerase n=2015 Tax=root RepID=G6PI_HUMAN
NCBI RefSeqNP_001091761.10.085.07%glucose-6-phosphate isomerase [Bombyx mori]
NCBI nr blastpgi|2811905730.087.77%glucose-6-phosphate isomerase [Euphydryas aurinia]
NCBI nr blastxgi|2811905730.087.77%glucose-6-phosphate isomerase [Euphydryas aurinia]
Group
Gene OntologyGO:00043472.3e-233glucose-6-phosphate isomerase activity
GO:00060942.3e-233gluconeogenesis
GO:00060962.3e-233glycolysis
KEGG pathwaytca:6588080.0 
 K01810 (GPI, pgi)maps-> Starch and sucrose metabolism
    Pentose phosphate pathway
    Glycolysis / Gluconeogenesis
    Amino sugar and nucleotide sugar metabolism
InterPro domain[9-555] IPR0016720Phosphoglucose isomerase (PGI)
[510-554] IPR0230965.8e-20Phosphoglucose isomerase, C-terminal
Orthology groupMCL11150 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210295-TA
ATGGAACCCAAAGTGAATTTGCTAGAGGACCCCGTGTATAAAAAACTGAAAGATTATTACTCTGCGAATGATGAGAAGATTAATATTTATCAACTGTTCCAACAAGACCCGGATCGCTTTAAGAAGTTTAGCCTCTGCATCCCAACACCCAACGATGGTGAGATCCTGTTAGACTATTCAAAGAACAGAGTTAATGATGCAGTGTTGGGGCTTCTGATGCAGCTGGCAAGGAGTAGGAATGTGGAACAGGCTAGAGATGCCATGTTCTCAGGTCAAAAGATCAACTTCACAGAAGACCGAGCTGTACTGCATATAGCACTCCGTAATAGAGAAAATCGCCCCATACTAGTGAATGGAAAGGATGTAATGCCTGATGTGAACAGTGTTTTACAGCACATGAAGGAATTCACCAATCAGGTCATCAGCGGATCATGGAAAGGTTATACCGGCAAAAGTATTACCGATGTCATCAATATTGGCATCGGAGGTTCAGACCTTGGACCTCTCATGGTGACGGAGGCGTTAAAACCATACGCCAAGCATCTCAAGGTCCACTTCGTGTCGAATATAGATGGTACCCACTTGGCGGAGGTCCTTAAACGCTTGGATCCGGAGACCAGCCTCTTCATCATAGCGTCGAAAACCTTCACCACACAGGAAACTATCACCAATGCCACCTCCGCTAAAGACTGGTTCCTGAACTCGGCCAAAGATCCGTCAGCAGTATCAAAGCATTTTGTAGCGCTCTCAACCAATGGAGAAAAGGTCACTGCTTTCGGCATTGACCCCAAAAACATGTTTGGTTTCTGGGACTGGGTCGGAGGAAGATATTCCTTGTGGTCTGCGATTGGTCTGTCAATATCTCTATACATTGGCTTCGACAACTTCGAGAAACTTCTAGAAGGCGCCCACTTTATGGATAAGCATTTCATGAGCGCACCTTTGGAAGAAAATGCTCCAGTAATATTAGCTCTTCTTGGAGTGTGGTATGGAAACTTCTATGGAGCCGAGACACACGCGTTGTTACCTTACGACCAATACTTGCACAGGTTCGCGGCGTACTTCCAGCAGGGTGACATGGAGTCCAACGGCAAGTCAGTGACGAGGGGCGGTCTGCGGGCCGAGTACAGCACAGGGCCGGTAGTGTGGGGCGAGCCGGGGACTAACGGACAGCACGCCTTCTACCAGCTCGTGCACCAGGGTACCAGACTGATCCCCTGTGACTTCATCGCTCCAGCGCAGAGCCACAACCCGATCTCATCAGGCGTCCACCATAAGATCCTGTTAGCCAACTTCCTGGCGCAGACGGAGGCTCTCATGAAGGGAAAGACACCGGATGAGGCTAAAGCGGAGCTCGTCAAGTCCGGCATGGCACCGGAAGCGATCTCCAAGATTCTTCCTCATAAGGTCTTCACCGGAAACAGGCCAACGAACTCTATCGTAGTTAAGAAAGTGACACCCTTCACACTGGGAGCGCTTATTGCTATGTACGAGCACAAGATCTTCACCCAGGGCGTGATCTGGGACATCAACTCGTTCGACCAGTGGGGAGTGGAGCTCGGCAAACAGCTCGCCAAGATCATAGAGCCGGAACTCGCAGAGGGCGCCGCCGTCACCTCACACGACACCTCCACCAACGGACTGATCCACTTCCTGAAGAAAAACTTCTAA

Protein sequence:

>DPOGS210295-PA
MEPKVNLLEDPVYKKLKDYYSANDEKINIYQLFQQDPDRFKKFSLCIPTPNDGEILLDYSKNRVNDAVLGLLMQLARSRNVEQARDAMFSGQKINFTEDRAVLHIALRNRENRPILVNGKDVMPDVNSVLQHMKEFTNQVISGSWKGYTGKSITDVINIGIGGSDLGPLMVTEALKPYAKHLKVHFVSNIDGTHLAEVLKRLDPETSLFIIASKTFTTQETITNATSAKDWFLNSAKDPSAVSKHFVALSTNGEKVTAFGIDPKNMFGFWDWVGGRYSLWSAIGLSISLYIGFDNFEKLLEGAHFMDKHFMSAPLEENAPVILALLGVWYGNFYGAETHALLPYDQYLHRFAAYFQQGDMESNGKSVTRGGLRAEYSTGPVVWGEPGTNGQHAFYQLVHQGTRLIPCDFIAPAQSHNPISSGVHHKILLANFLAQTEALMKGKTPDEAKAELVKSGMAPEAISKILPHKVFTGNRPTNSIVVKKVTPFTLGALIAMYEHKIFTQGVIWDINSFDQWGVELGKQLAKIIEPELAEGAAVTSHDTSTNGLIHFLKKNF-