Monarch geneset OGS2.0

DPOGS205740
TranscriptDPOGS205740-TA996 bp
ProteinDPOGS205740-PA331 aa
Genomic positionDPSCF300342 + 117817-119454
RNAseq coverage179x (Rank: top 49%)
Annotation
HeliconiusHMEL0208486e-12760.12% 
BombyxBGIBMGA009243-TA1e-9451.06% 
DrosophilaFucTC-PB1e-5135.53% 
EBI UniRef50UniRef50_F4X2Z64e-5035.84%Alpha-(1,3)-fucosyltransferase C n=1 Tax=Acromyrmex echinatior RepID=F4X2Z6_ACREC
NCBI RefSeqXP_002102047.13e-5135.74%GE15255 [Drosophila yakuba]
NCBI nr blastpgi|1954824395e-5035.74%GE15255 [Drosophila yakuba]
NCBI nr blastxgi|3214777474e-5640.68%hypothetical protein DAPPUDRAFT_41601 [Daphnia pulex]
Group
Gene OntologyGO:00160204.3e-63membrane
GO:00084174.3e-63fucosyltransferase activity
GO:00064864.3e-63protein glycosylation
KEGG pathwaycqu:CpipJ_CPIJ0132027e-35 
 K00753 (E2.4.1.214)maps-> N-Glycan biosynthesis
InterPro domain[12-325] IPR0015034.3e-63Glycosyl transferase, family 10
Orthology groupMCL31095 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205740-TA
ATGGGCGAAGGTCAAGAAGGTTTTATATCCAGGAACTGTTCCTATACAAACTGTTTTGTGACAAGCAACAGAACATATCTCGGTGATTATACAAAGTTTGATGTTATAGCATTTGCTGGTCCCGAAGTGAGATTCATGCAAACTCCTGGTTACCCCGATCGTCTCCCAGAAAAAAGGTCACAGCATCAGCGATATGTATTCGCTAGTATCGAGTCAGCTGAGAACTATCCTGTGTGTTCAGATAAATTTAACGGGTTCTTTAATTGGACGTGGACCTACAGATTAGAGTCTGAAGCAAAATGGGGATACATTGTGATACGCGATGCACAAAATAATATAATAGGTCCAAAAACCAATATGAATTGGTTAAAAACAGATCAAATGGATGTGGTGGGTGATGATATCAAAGAGAAGTTAAGAAAAAAAACTAAAGCAGTTGCTTGGTTTGTTTCCAATTGTGTTTCTAGAAGTCGACGTGAGAAATTTGCGAGTGTGTTAGGTATTTGGTTAGCAAAATACGATCTAGAAATAGATATATACGGTGAATGCGGAAACTTGAAATGTTCTCGTGATAATGAAGAAGAGTGCGACAAAATGATTGAAAGAGACTACTATTTTTATCTTTCTTTCGAGAACTCGTTTGCTGAGGATTATGTGACGGAAAAATTATTGCACCCACTAAAATACTTGGCGGTGCCTATTGTATATGGCGGTGCAAATTATTCAAGATTCATGCCGGAAAATATTTACTTAGATGCAAGAGAACTGGGACCACAGAAGTTGGCAAACAAGATAAATGAACTTATTGAAAATCCTGATTTGTACGCTGAATATTTCAGGTGGAAAAAATATTACTCTTATCACAGAAGATCAGAAAATATTGAAACAGATGATTACTGTGGCTTTTGTTCATTGTTAAATGATGAAAAATTTGTTAAGAAAACTTCTATTTATGAGGACTTTAGGAAGTGGTGGGATCCACCTTATCGTTGTTAA

Protein sequence:

>DPOGS205740-PA
MGEGQEGFISRNCSYTNCFVTSNRTYLGDYTKFDVIAFAGPEVRFMQTPGYPDRLPEKRSQHQRYVFASIESAENYPVCSDKFNGFFNWTWTYRLESEAKWGYIVIRDAQNNIIGPKTNMNWLKTDQMDVVGDDIKEKLRKKTKAVAWFVSNCVSRSRREKFASVLGIWLAKYDLEIDIYGECGNLKCSRDNEEECDKMIERDYYFYLSFENSFAEDYVTEKLLHPLKYLAVPIVYGGANYSRFMPENIYLDARELGPQKLANKINELIENPDLYAEYFRWKKYYSYHRRSENIETDDYCGFCSLLNDEKFVKKTSIYEDFRKWWDPPYRC-