Monarch geneset OGS2.0

DPOGS206961
TranscriptDPOGS206961-TA1086 bp
ProteinDPOGS206961-PA361 aa
Genomic positionDPSCF300001 + 57328-64451
RNAseq coverage155x (Rank: top 53%)
Annotation
HeliconiusHMEL0140283e-7644.90% 
BombyxBGIBMGA012827-TA2e-14573.82% 
DrosophilaAld-PE2e-13063.74% 
EBI UniRef50UniRef50_E9IA804e-11965.03%Fructose-bisphosphate aldolase (Fragment) n=4 Tax=Solenopsis invicta RepID=E9IA80_SOLIN
NCBI RefSeqNP_001091766.18e-14067.31%fructose 1,6-bisphosphate aldolase [Bombyx mori]
NCBI nr blastpgi|453308181e-13968.13%fructose 1,6-bisphosphate aldolase [Antheraea yamamai]
NCBI nr blastxgi|453308181e-13368.13%fructose 1,6-bisphosphate aldolase [Antheraea yamamai]
Group
Gene OntologyGO:00043323.9e-217fructose-bisphosphate aldolase activity
GO:00060963.9e-217glycolysis
GO:00081527e-155metabolic process
GO:00038247e-155catalytic activity
KEGG pathwaydgr:Dgri_GH181789e-136 
 K01623 (ALDO, fbaB)maps-> Pentose phosphate pathway
    Glycolysis / Gluconeogenesis
    Fructose and mannose metabolism
    Carbon fixation in photosynthetic organisms
InterPro domain[1-361] IPR0007413.9e-217Fructose-bisphosphate aldolase, class-I
[11-342] IPR0137857e-155Aldolase-type TIM barrel
Orthology groupMCL30960 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206961-TA
ATGCCAACCTTCTTCTCTTACCCTAATCTTGAGCTTCAAGAGGAGCTCAAACGGACCGCCGAGGCCATTGTCGCACCGGGAAAAGGAATCCTAGCTGTTGACGAAAACAATGAGGGAATTGGCAAGTTGCTTGCTGGTGTGGATTTGGAAAACACCGAGGAAAATAGGCGCCGGTATAGACAAATGTTATTCACAAGCGACGAGATGCTATCAAATAACATCTCAGCCGTGATTCTATTCGAAGAGACTTTATACCAAAAGACGGATGACGGCATCTTATTTATGGATTTATTGAGGAAGAAAAACATAATACCCGGCGTCAAAGTAGATAAGGATGTTGTTCCATTGCATCTGACAGAAGAATACACAACCCAGGGTTTGGACAACCTGGCCGAGCGGTGCGCGGCTTACAAGAAGCTGGGATGTAGATTTGCAAAGTGGAGGTGCCCATTGAAGATTGGGGAGCGCTCACCGTCTGTGCAGGCCATAGAAGACGCGTCGCATGTTCTTGCAAGATATGCTTCGATATGTCAGAGTGAGGGTCTGGTGCCCATCGTGGAACCTGATGTGCTTTTAGACGGAAACCATGACATCGTAAGAAGCCAGAAGGTGACTGAAGTGGTGTTGGCATCTGTTTATAAAGCTCTGAGTGATCATCACGTCTTTCTAGAAGGGACTCTGCTTAAACCAAATATGGTGACTCCAGGTCAACAATGCGTGAAGAGGAGCACACCAGAGGAGATTGCTTCAGCCACTGTCACAGCATTGCTAAGAACCGTACCAGTAGCGGTACCCGGTATCACATTCCTCTCCGGTGGTCTCTCTGAGGAAGACGCTACATTAAATCTGAACGCGATCAATCAAATATCAAAAGCACCTTGGAGACTAACATTCAGCTATGGCAGAGCATTGCAGGCGTCGGTATGGAAAAGCTGGGCGGGGAAAGATGAGAATATCGGGAAGGCCCAGAAGGAGTTGCTGCAAAGAGCTCAGGCGAACGGTCTTGCCTCTTTGGGAAAATATGTAACAGCCTGTATTAGCAGTGCCGGTGACATGTCAAACTACAGGGAAAACCACATATACTAG

Protein sequence:

>DPOGS206961-PA
MPTFFSYPNLELQEELKRTAEAIVAPGKGILAVDENNEGIGKLLAGVDLENTEENRRRYRQMLFTSDEMLSNNISAVILFEETLYQKTDDGILFMDLLRKKNIIPGVKVDKDVVPLHLTEEYTTQGLDNLAERCAAYKKLGCRFAKWRCPLKIGERSPSVQAIEDASHVLARYASICQSEGLVPIVEPDVLLDGNHDIVRSQKVTEVVLASVYKALSDHHVFLEGTLLKPNMVTPGQQCVKRSTPEEIASATVTALLRTVPVAVPGITFLSGGLSEEDATLNLNAINQISKAPWRLTFSYGRALQASVWKSWAGKDENIGKAQKELLQRAQANGLASLGKYVTACISSAGDMSNYRENHIY-