Monarch geneset OGS2.0

DPOGS204461
Transcript: DPOGS204461-TA | 1401 bp
Protein: DPOGS204461-PA | 466 aa
Genomic position: DPSCF300002 + 408780-416455
RNAseq coverage: 7x (Rank: top 86%)
Annotation
Heliconius: HMEL006247 | 0.0 | 70.87%
Bombyx: no hit listed
Drosophila: ST6Gal-PB | 6e-68 | 38.26%
EBI UniRef50: UniRef50_D6WKX7 | 6e-106 | 42.89% | Putative uncharacterized protein n=2 Tax=Endopterygota RepID=D6WKX7_TRICA
NCBI RefSeq: XP_968750.1 | 1e-106 | 42.89% | PREDICTED: similar to Sialyltransferase CG4871-PB [Tribolium castaneum]
NCBI nr blastp: gi|91082815 | 2e-105 | 42.89% | PREDICTED: similar to Sialyltransferase CG4871-PB [Tribolium castaneum]
NCBI nr blastx: gi|91082815 | 2e-104 | 42.68% | PREDICTED: similar to Sialyltransferase CG4871-PB [Tribolium castaneum]
Group
Gene Ontology: GO:0006486 | 2.2e-52 | protein glycosylation
               GO:0030173 | 2.2e-52 | integral to Golgi membrane
               GO:0008373 | 2.2e-52 | sialyltransferase activity
KEGG pathway: none listed
InterPro domain: [230-453] IPR001675 | 2.2e-52 | Glycosyl transferase, family 29
Orthology group: MCL16766 | Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204461-TA
ATGAAAACGGCGGCTATGTCTGTTTGGATTTTCATTAATCTTTTGTGTTTTGGAATGTGTGGATACTTATACTTAATATGGTCTCAATACTGGATGAGTATAGAAAGACAAAGATTGTTTAGTAAGACGCCAGGACCTCCGCACAAAAACAGCATAGCTGGAATCAGTTTTCAAGAGACATTTTCAGAAAATATACAGACAAATATAGCTATACATGGTAACAGTTCCATACTAAACGCAGATTCCATCCGCAAAATAAGAAGATCGGTTGCATTAAAAGCTAGCCAAATATCGCAAGGGTCCAATAGATCTAATGGTAGCCAAATGCGTTTTGCAAATAACATAGTCTTAAAAAGTCACGGAAGTCCCCGCTTTCCCAACATTCATACACCAGTTTTGGAATTCGATAGTGATAAGTATAATTGCGAGGATTATACCACGCCAGAATGTGAGTCAAAAACAATAGAATTTAAAGAGCTTTTATTGAAGGAGTTTCATAGAGTTTTAATGAGTGAAAGTAAAGTATTTACGTCAGGGCTTGAATCGCAGAATCCTTATAATGTTAAATACCAGAGAAGTGCTGTGAAACAATATAGTAGGGAAGAAATATTATGTGCACTTTTGAAAGTTCGGGTAAAGACAGTAACGTCAAAAGATCAACCCTTCGCCAGATTGGGTTTCCAAATACCGAAATATCCGATATTAAAAAATAAAAGATTTAATAGTTGTGCCGTTGTAACAAGTGCCGGGGCTCTTCTTGGATCGCGTTTAGGAGAATTTATTGATTCCCACGATATGATATTAAGATTCAACAACGCACCGACAGAGAACTACACAGAGGACGTGGGCTCTAGAACTACAATGCGTGTTCTTAATTCGCAGGTAGCTACAAAACCAGAATATCGATTTTTGGATGATCCTTTGTATAAAAATATTTCTATCCTGATTTGGGATCCATCAAATTATTCTGCGTCCTTAGAAGACTGGTACCGTGATCCAGATTTTCCCGTATTCTCCGTTTATAAAAAATTATTAGAACGTAATAATTCTGTAGATGTGCATCTCCTGAATCCTCAAGTCTTATGGGATTTATGGGAAGTTCTTCAAGATTCAACTCCATACAGATTAAGGAGGAATCCTCCGTCATCAGGATTTTTAGGACTGTGGTTCTCAATTCACAATTGTAAGCGAGTCCGCGTGTTTGAATACGTCCCATCCGTGCGTGCGACCCGCCGCTGTCACTATCACTCGCCGGTCGAAGACTCTGGTTGTACGATGGGTGTGTGGCACCCGCTCGCCCAGGAGAAGTGGCTAGCGCAGCGCTTGAGCGATAACTCCGATGTGAATGTGTTCCAGAGAGGTTTTATCGATGTACCTGGAGTAACTAATGTGCAGTGTTAA

Protein sequence:

>DPOGS204461-PA
MKTAAMSVWIFINLLCFGMCGYLYLIWSQYWMSIERQRLFSKTPGPPHKNSIAGISFQETFSENIQTNIAIHGNSSILNADSIRKIRRSVALKASQISQGSNRSNGSQMRFANNIVLKSHGSPRFPNIHTPVLEFDSDKYNCEDYTTPECESKTIEFKELLLKEFHRVLMSESKVFTSGLESQNPYNVKYQRSAVKQYSREEILCALLKVRVKTVTSKDQPFARLGFQIPKYPILKNKRFNSCAVVTSAGALLGSRLGEFIDSHDMILRFNNAPTENYTEDVGSRTTMRVLNSQVATKPEYRFLDDPLYKNISILIWDPSNYSASLEDWYRDPDFPVFSVYKKLLERNNSVDVHLLNPQVLWDLWEVLQDSTPYRLRRNPPSSGFLGLWFSIHNCKRVRVFEYVPSVRATRRCHYHSPVEDSGCTMGVWHPLAQEKWLAQRLSDNSDVNVFQRGFIDVPGVTNVQC-
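
The transcript and protein lengths given in this record (1401 bp and 466 aa) fit a coding sequence of 466 codons plus one stop codon. A minimal Python sketch of that check is given here, assuming the standard genetic code and this record's habit of writing the stop as "-". The function names `translate` and `lengths_consistent` are hypothetical and are not part of any database tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" matches the protein
    sequence shown in this record (an assumption about its convention).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.replace("*", stop_symbol)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript is exactly protein_aa codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First ten codons of DPOGS204461-TA, copied from the record above.
print(translate("ATGAAAACGGCGGCTATGTCTGTTTGGATT"))  # MKTAAMSVWI
# Lengths reported for the transcript and protein.
print(lengths_consistent(1401, 466))  # True
```

Passing the full DPOGS204461-TA sequence to `translate` and comparing the result with DPOGS204461-PA would extend the same check to the whole record.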