Monarch geneset OGS2.0

DPOGS213771
Transcript: DPOGS213771-TA (1023 bp)
Protein: DPOGS213771-PA (340 aa)
Genomic position: DPSCF300212 + 172745-174120
RNAseq coverage: 0x (Rank: top 99%)
Annotation
Heliconius: HMEL011445 | 4e-102 | 50.26%
Bombyx: BGIBMGA009243-TA | 2e-68 | 38.05%
Drosophila: FucTC-PB | 1e-33 | 27.51%
EBI UniRef50: UniRef50_UPI00015B5684 | 2e-37 | 32.69% | UPI00015B5684 related cluster n=1 Tax=unknown RepID=UPI00015B5684
NCBI RefSeq: XP_001607468.1 | 4e-38 | 32.69% | PREDICTED: similar to alpha1,3-fucosyltransferase C [Nasonia vitripennis]
NCBI nr blastp: gi|156547873 | 7e-37 | 32.69% | PREDICTED: alpha-(1,3)-fucosyltransferase C-like [Nasonia vitripennis]
NCBI nr blastx: gi|91087697 | 1e-38 | 30.70% | PREDICTED: similar to FucTC CG40305-PA [Tribolium castaneum]
Group:
Gene Ontology: GO:0016020 | 1.5e-51 | membrane
               GO:0008417 | 1.5e-51 | fucosyltransferase activity
               GO:0006486 | 1.5e-51 | protein glycosylation
KEGG pathway: api:100161726 | 2e-25
              K00753 (E2.4.1.214) maps to N-Glycan biosynthesis
InterPro domain: [2-325] | IPR001503 | 1.5e-51 | Glycosyl transferase, family 10
Orthology group: MCL34991 | Lepidoptera specific
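The genomic position field of this record packs a scaffold name, a strand and a coordinate range into one string. A minimal sketch of how it could be unpacked and compared with the transcript length follows. The helper name parse_position is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the record itself does not state.

```python
def parse_position(text):
    """Split a 'SCAFFOLD STRAND START-END' string into its four parts."""
    scaffold, strand, span = text.split()
    start, end = (int(value) for value in span.split("-"))
    return scaffold, strand, start, end


scaffold, strand, start, end = parse_position("DPSCF300212 + 172745-174120")

# Assumption: coordinates are 1-based and inclusive.
span_length = end - start + 1

# Transcript length as listed in the record.
transcript_length = 1023

# Genomic bases within the span that are not part of the transcript.
extra_bases = span_length - transcript_length

print(scaffold, strand, span_length, extra_bases)
```

Under that assumption the genomic span is longer than the 1023 bp transcript, which would be consistent with one or more introns, although the record does not list exon boundaries.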

Nucleotide sequence:

>DPOGS213771-TA
ATGAAGTATATTTTAGTTTGGACTAACATCTATGGAATAGAAGAAGAAGGTCAGAAATATTTTATTAGTAAAAACTGCAAACATATAAACTGTTACATTACCAAAAATGATAGTCTCTTCAATGACGTGAGATATTTCGACGCCATCTTATTTGATGGTCAAGACGTATCTAGGGATATTATTGCATTGCCTCAGCTTAGAAGCACAATGCAGAAATATATTTTTGTGGCTAAAGAATCATCAGATAATTTTCCAGTATGCAATAAAATTTATGACAACTTTTTCAATTGGACTTGGACATATCGCTACGATTCAACAATATCGTACCATTTCATAACGGTCTTCAATTATCAATATATAGAACTTGGCAACCGTTTCCTTTGGGAATCTTATATGAAACCAATAGACAAGACATTAAAATCTCAATTCGTAACGAAATCTAAGGCCGCTGTTATATTCTTGGACAAATGCAAGAGTCGAAGCAAAAGAGAGGACGTCATAGAGAAATTAAAAGGTTATTTGTCTAAATATAATCTAACTATTGATATTTTCGGCCCATGCAGCGATAAAAAATGCAAGAGGAAGAACATGAAACCATGTTTGTGGAGATTGAAGAAAACATACTATTTCTATCTAGCGTTCGAGGATTCTATCTCTTTAGACTATATAACTGACATAGTTTTATACGCGTATAACAATAACGCTATACCGATTGTCTATGGCGGTGCCCAGTACGACAAATATCTTCCACCCCGATCATATCTGAATGCGCGGAACAAAACGTTAAAGTCATTAGCTCATACAATGCATAAAATTATTTCAAACCAAGAAATGTACTATGATTTCTTTCGATGGAAAAATTACTACACGCTAGCTAAATCACCCATCTTGGATGGCTGTGTGTTGTGCGAAGCGTTGAATCATCAAGATCGACTCACTGCAAGAGTCGTCTATAGCAAGTTCAGGAAATGGTGGAACCACTTGTATGAAAAGAGATGCTTAGAACATAGTGTTGATATATAA

Protein sequence:

>DPOGS213771-PA
MKYILVWTNIYGIEEEGQKYFISKNCKHINCYITKNDSLFNDVRYFDAILFDGQDVSRDIIALPQLRSTMQKYIFVAKESSDNFPVCNKIYDNFFNWTWTYRYDSTISYHFITVFNYQYIELGNRFLWESYMKPIDKTLKSQFVTKSKAAVIFLDKCKSRSKREDVIEKLKGYLSKYNLTIDIFGPCSDKKCKRKNMKPCLWRLKKTYYFYLAFEDSISLDYITDIVLYAYNNNAIPIVYGGAQYDKYLPPRSYLNARNKTLKSLAHTMHKIISNQEMYYDFFRWKNYYTLAKSPILDGCVLCEALNHQDRLTARVVYSKFRKWWNHLYEKRCLEHSVDI-
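The two sequences above are related by translation: 1023 nt is 341 codons, that is 340 amino acids plus one stop codon, which this record writes as a trailing "-". The sketch below shows one way to check that relationship with the standard genetic code. The function name translate and the stop_symbol parameter are illustrative choices, and only short fragments copied from the start and end of the transcript are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon, marking stops with stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


# First 30 nt and last 12 nt of DPOGS213771-TA, copied from the record.
first_fragment = translate("ATGAAGTATATTTTAGTTTGGACTAACATC")
last_fragment = translate("GTTGATATATAA")

print(first_fragment)
print(last_fragment)
```

Passing the full nucleotide sequence to the same function should reproduce the protein sequence listed above, including the trailing stop symbol.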