Monarch geneset OGS2.0

DPOGS201756
TranscriptDPOGS201756-TA1050 bp
ProteinDPOGS201756-PA349 aa
Genomic positionDPSCF300279 + 27461-30580
RNAseq coverage1320x (Rank: top 10%)
Annotation
HeliconiusHMEL0067130.092.55% 
BombyxBGIBMGA002651-TA0.091.98% 
DrosophilaSas-PA2e-7943.73% 
EBI UniRef50UniRef50_Q9NR451e-9850.14%Sialic acid synthase n=70 Tax=Coelomata RepID=SIAS_HUMAN
NCBI RefSeqXP_973182.24e-10353.78%PREDICTED: similar to CG17754 CG17754-PC [Tribolium castaneum]
NCBI nr blastpgi|455929386e-10353.13%sialic acid synthase [Danio rerio]
NCBI nr blastxgi|1892349572e-10353.78%PREDICTED: similar to CG17754 CG17754-PC [Tribolium castaneum]
Group
Gene OntologyGO:00081522.9e-102metabolic process
GO:00038242.9e-102catalytic activity
GO:00160518.1e-75carbohydrate biosynthetic process
KEGG pathwaydre:3227809e-104 
 K05304 (NANS, SAS)maps-> Amino sugar and nucleotide sugar metabolism
InterPro domain[7-295] IPR0137852.9e-102Aldolase-type TIM barrel
[38-275] IPR0131328.1e-75N-acetylneuraminic acid synthase, N-terminal
[296-348] IPR0061901.2e-08Antifreeze-like/N-acetylneuraminic acid synthase C-terminal
Orthology groupMCL17079 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201756-TA
ATGCTCGAAGTTAAGATAACAGAGGACATTAGAATAGGCGGTAAAAATCCTTGCTTCATTATAGCTGAAGTTGGACAAAATCACCAAGGTGACATTGAAGTAGCGAAAAAATTGATCAAAGCAGCAAAGGACGCGGGGGCTAACTGCGTTAAATTTCAAAAGACTTGTCTGAATGAAAAATTTACGAAAAAGTATTTGGAGAAGCCTTACGATAGCCCGAACTCTTGGGGGAAAACTTACGGTGAACATAAGAGACATTTAGAATTTTCGGAAAGTCAATACAGAGAATTGTTTAAATATGCTCAAGAGGTCGGAATACTCTTCACAGCTTCAGCAATGGACATGGTATCTTTCGACTTTCTGGTGAACATAAAAGTGCCTTTCATAAAAATCGGATCCGGTGACTCCAACAATTTATTATTCTTGAAATATGCCGCATCCAAAAAGATCCCTCTTATAATATCCACGGGCATGGTGGACAAGCAGGCAGTGAAAACTATATACGACATTATTGCTGCTCAACACAAACAATTCTGCTTGTTACATTGTATATCAGCGTACCCTGTGCCCTTCGAGGACTGTAATCTGACCGTCCTACAAGACTACAAGAACACTTTTGACATCCCGGTCGGATATTCTGGTCAAGAAGTTGGCACCGCCGTTGCTTTAGGTGCAATAGCACTGGGAGCTAAGGTCCTGGAGAAGCATATAACATTAGACAAAGGTCTCCGGGGCACCGACCACGTGTGTTCTTTAACACCGACAGAGTTCCAACAGCTGGTGCGCGATGTGCGAGTCATTGAGGCCTCGCTTGGTACACCCATTAAAAAAGTTGTTACTTCAGAAATTCCTTGCATCGATAAATTGCAAAAGTCGCTGGTGATGGGCTGCACTAAAAATAAAGGCGAGATTCTTTATCCGGGAGATGTAAAGATCAAAGTCGCTGAACCGAAAGGTCTGAACGCGTTGCACTTCGAGGACGTTATATATAAAACACTTGTCTACGATAAGAAGGAAGATGAACCACTCAACGAGGGAGATTTCTGTTGA

Protein sequence:

>DPOGS201756-PA
MLEVKITEDIRIGGKNPCFIIAEVGQNHQGDIEVAKKLIKAAKDAGANCVKFQKTCLNEKFTKKYLEKPYDSPNSWGKTYGEHKRHLEFSESQYRELFKYAQEVGILFTASAMDMVSFDFLVNIKVPFIKIGSGDSNNLLFLKYAASKKIPLIISTGMVDKQAVKTIYDIIAAQHKQFCLLHCISAYPVPFEDCNLTVLQDYKNTFDIPVGYSGQEVGTAVALGAIALGAKVLEKHITLDKGLRGTDHVCSLTPTEFQQLVRDVRVIEASLGTPIKKVVTSEIPCIDKLQKSLVMGCTKNKGEILYPGDVKIKVAEPKGLNALHFEDVIYKTLVYDKKEDEPLNEGDFC-