Monarch geneset OGS2.0

DPOGS206959
TranscriptDPOGS206959-TA1362 bp
ProteinDPOGS206959-PA453 aa
Genomic positionDPSCF300001 - 45894-54859
RNAseq coverage9724x (Rank: top 1%)
Annotation
HeliconiusHMEL0140282e-9560.00% 
BombyxBGIBMGA013021-TA0.094.29% 
DrosophilaAld-PE8e-16684.68% 
EBI UniRef50UniRef50_E9IA808e-14776.49%Fructose-bisphosphate aldolase (Fragment) n=4 Tax=Solenopsis invicta RepID=E9IA80_SOLIN
NCBI RefSeqNP_001091766.10.094.29%fructose 1,6-bisphosphate aldolase [Bombyx mori]
NCBI nr blastpgi|453308180.095.80%fructose 1,6-bisphosphate aldolase [Antheraea yamamai]
NCBI nr blastxgi|453308180.095.80%fructose 1,6-bisphosphate aldolase [Antheraea yamamai]
Group
Gene OntologyGO:00043321.7e-267fructose-bisphosphate aldolase activity
GO:00060961.7e-267glycolysis
GO:00081521.3e-169metabolic process
GO:00038241.3e-169catalytic activity
KEGG pathwaydpo:Dpse_GA193293e-169 
 K01623 (ALDO, fbaB)maps-> Pentose phosphate pathway
    Glycolysis / Gluconeogenesis
    Fructose and mannose metabolism
    Carbon fixation in photosynthetic organisms
InterPro domain[27-359] IPR0007411.7e-267Fructose-bisphosphate aldolase, class-I
[35-360] IPR0137851.3e-169Aldolase-type TIM barrel
Orthology groupMCL10752 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206959-TA
ATGGGACCTAAAGGTAAAAAGGGTAAAAAGGGCCCACCAGAGTCAGGATGGATGCCTGGCTTTGGAACCAAAGCAACAATGAGCACCTACTTCCAATACCCCACCCCCGAGGTTCAGGAGGAGCTGAAGAAGATCGCACAGGCGATCGTCGCCCCCGGCAAGGGTATCCTCGCCGCTGACGAGTCCACTGGTACTATGGGCAAGCGTCTGCAGGACATCGGTGTAGAGAATACTGAGGAGAACCGCCGCAAGTACCGCCAGCTCCTCTTCAGCAGTGACCCGGCGCTATCGGAGAACATCTCCGGCGTGATTCTGTTCCACGAGACGCTGTACCAGAAGGCTGATGACGGAACACCCCTGGTGTCCTTGTTGGAGAAGCGAGGCATCATCCCCGGGATCAAGGTCGACAAGGGTGTGGTGCCACTGTTCGGTTCAGAGGACGAGTGCACTACACAGGGTTTGGATGATCTCGCCCAGCGTTGTGCTCAATACAAGAAGGATGGCTGTCACTTCGCCAAGTGGCGTTGCGTCCTTAAGATCGGCCGATTCACCCCGTCCTATCAGGCCATTATGGAGAACGCTAACGTATTAGCTCGTTATGCTTCCATCTGCCAGAGCCAGCGCATCGTACCAATCGTTGAACCCGAAGTACTCCCTGATGGTGAGCACGACCTTGACCGCGCTCAGAAGGTGACTGAGGTGGTGCTGGCGGCCGTGTACAAGGCGCTCAGCGACCACCACGTGTACCTTGAAGGAACTCTACTGAAACCAAACATGGTGACAGCTGGTCAATCCTGCAAGAAGACTTACACTCCAATGGACATCGGCCGCGCCACCGTTACAGCTCTGTTGAGAACCGTACCAGCCGCAGTTCCCGGAGTGACTTTCCTTTCCGGCGGTCAGTCTGAGGAGGAGGCGTCTGTCAATCTGAACGCCATCAACACCGTGGACCTGAAGCGGCCTTGGGCCTTGACCTTCAGCTACGGCCGGGCCCTTCAGGCCTCTGTACTGCGCGCTTGGGCCGGCAAGAACGAGAACCTACTCGCCGGACAGCAGGAACTCCTGAAACGTGCTAAGGAATTTTTGACACCGAGGTCCACATATGACATCATGGTAATATCATTGCCAGATGTCGATACGAGGTATTGTATATGTATATTGGAATCTAGGGACAGTTTACAATCCCTGGACCATACGGATAATGTAGCATCACCCGGTAACCCAATTGTACAACAGAGTATAAAAGTTTTACCAAAACGGGCACAGGCAAACGGTCAAGCCAGCCAGGGCAAATACGTCGCTGGATCCGTAACAGGTGTGGGCGCTGAGGCCGGTCTCTTTGTAGCTAACCACGCGTACTAA

Protein sequence:

>DPOGS206959-PA
MGPKGKKGKKGPPESGWMPGFGTKATMSTYFQYPTPEVQEELKKIAQAIVAPGKGILAADESTGTMGKRLQDIGVENTEENRRKYRQLLFSSDPALSENISGVILFHETLYQKADDGTPLVSLLEKRGIIPGIKVDKGVVPLFGSEDECTTQGLDDLAQRCAQYKKDGCHFAKWRCVLKIGRFTPSYQAIMENANVLARYASICQSQRIVPIVEPEVLPDGEHDLDRAQKVTEVVLAAVYKALSDHHVYLEGTLLKPNMVTAGQSCKKTYTPMDIGRATVTALLRTVPAAVPGVTFLSGGQSEEEASVNLNAINTVDLKRPWALTFSYGRALQASVLRAWAGKNENLLAGQQELLKRAKEFLTPRSTYDIMVISLPDVDTRYCICILESRDSLQSLDHTDNVASPGNPIVQQSIKVLPKRAQANGQASQGKYVAGSVTGVGAEAGLFVANHAY-