Monarch geneset OGS2.0

DPOGS208058
TranscriptDPOGS208058-TA1383 bp
ProteinDPOGS208058-PA460 aa
Genomic positionDPSCF300203 + 435709-443736
RNAseq coverage12x (Rank: top 83%)
Annotation
HeliconiusHMEL0208481e-6938.95% 
BombyxBGIBMGA009243-TA6e-7242.55% 
DrosophilaFucTC-PB5e-4437.55% 
EBI UniRef50UniRef50_Q05GU13e-4538.75%Alpha1,3-fucosyltransferase C n=2 Tax=Apis mellifera carnica RepID=Q05GU1_APICA
NCBI RefSeqXP_001120699.16e-4638.75%PREDICTED: similar to Alpha-(1,3)-fucosyltransferase C (Galactoside 3-L-fucosyltransferase) [Apis mellifera]
NCBI nr blastpgi|3800235322e-4539.45%PREDICTED: alpha-(1,3)-fucosyltransferase C-like [Apis florea]
NCBI nr blastxgi|3800235327e-4839.66%PREDICTED: alpha-(1,3)-fucosyltransferase C-like [Apis florea]
Group
Gene OntologyGO:00160203.5e-62membrane
GO:00084173.5e-62fucosyltransferase activity
GO:00064863.5e-62protein glycosylation
KEGG pathwayapi:1001617264e-24 
 K00753 (E2.4.1.214)maps-> N-Glycan biosynthesis
InterPro domain[3-319] IPR0015033.5e-62Glycosyl transferase, family 10
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208058-TA
ATGAAATCAAAGTATATTTTACAATGGACGAAACGACGAGGAGTCGGTAAAAGGGTATCCCCAACTTTAACGTGCTCCGAACTGTGTATTTTCACAGAAGACAAAGGACACTTTGACGGCGATTACACGAAATTCGATGCCATAATTTTTAATGAAGACATCCTCTCCGCCAGCGAGCGTCCGATCAAACGAGATCCCTCCCAAATGTATATATTCAATACCCTGGAATCGTCTCACACGGCTCCGGCGTGCGATGTACACAACGACGGTTATTTCAATTGGACTTTTACGTACCGTCTCGACTCGGATATAGTGTGGAGTTACTTTCAAGTGAGAAGTCTTAAGGGACAGCTCGTCGCTCCGAGCGTTGCCGTCGTCTGGAAACATAGCAGCCACCCTGTGAAAAAGAAAATAAGGACAATTTTGAAACGTAAACGTAAAGCGGCTGCGTGGCTCGTAAGCCATTGTAGAGCCGACAGTCTCAGAGACGACTACCTAACCAGACTTCAAGAGCACCTATTCCATTTCTCTCTTAATATCGACGTTTACGGCGACTGTTCGAAACGGAAATGTCCTAACGACGCCTGCGACTATATGATACGCAAAGATTATTACTTTTACATGGCCTTTGAGAATTCGTTCGCTGATGATTACGTTACGGAGAAAATTTTGCACGGTTACAAGAATTACGCCGTACCTATAGTTTACGGGGGAGCGAACTACAGCAGGTTTCTGCCGCTCGGCTCCTACATCAACGCACGCGGAATGCACCCATACAACTTAGCGTACAAAATGTACCAGGCGATAAAAAATCGTGACATTTACCTTAAATACTTCAAATGGACGAATCTCTATAAGATAACATCTGAATTAAAACCTCATCCTCTCTGCGAGGTCTGCAAGCGTCTCCACCACATGGACAGAGAATATCCGGCGAGCAAATACTTCAGACTGTGGTGGAATCGGCCCAACGGAATGAGGTGGTGTCTCTCTGACCAGTTCTGGAACGAAACGTCTAATGTCAATCTCGACGGCAGACATATCTTTAATATGTTTCTGCCGCCCGGCTCCTACATCAACGCACACGGAATGCACCCATACAACTTAGCGTACAAAATGTACCAGGCGATAAAAAATCGTGACATTTACCTTAAATACTTCAAATGGACGAATCTCTATAAGATTACATCTGAATTAAAACCTCATCCTCTCTGCGAGGTCTGCAAGCGTCTCCACCACATGGACAGAGAATATCCGGCGAGCAAATACTTCAGACTGTGGTGGAATCGGCCCAACGGAATGAGGTGGTGTCTCTCTGACCAGTTCTGGAACGAAACATCCAATGTCAATCTCGACGGCAGACATATTTTTAATATGTATTAA

Protein sequence:

>DPOGS208058-PA
MKSKYILQWTKRRGVGKRVSPTLTCSELCIFTEDKGHFDGDYTKFDAIIFNEDILSASERPIKRDPSQMYIFNTLESSHTAPACDVHNDGYFNWTFTYRLDSDIVWSYFQVRSLKGQLVAPSVAVVWKHSSHPVKKKIRTILKRKRKAAAWLVSHCRADSLRDDYLTRLQEHLFHFSLNIDVYGDCSKRKCPNDACDYMIRKDYYFYMAFENSFADDYVTEKILHGYKNYAVPIVYGGANYSRFLPLGSYINARGMHPYNLAYKMYQAIKNRDIYLKYFKWTNLYKITSELKPHPLCEVCKRLHHMDREYPASKYFRLWWNRPNGMRWCLSDQFWNETSNVNLDGRHIFNMFLPPGSYINAHGMHPYNLAYKMYQAIKNRDIYLKYFKWTNLYKITSELKPHPLCEVCKRLHHMDREYPASKYFRLWWNRPNGMRWCLSDQFWNETSNVNLDGRHIFNMY-