Monarch geneset OGS2.0

DPOGS205454
Transcript: DPOGS205454-TA, 1722 bp
Protein: DPOGS205454-PA, 573 aa
Genomic position: DPSCF300166 - 326470-341142
RNAseq coverage: 770x (Rank: top 17%)
Annotation
Heliconius: HMEL011801, E-value 0.0, 76.10%
Bombyx: BGIBMGA008394-TA, E-value 0.0, 78.06%
Drosophila: FucT6-PA, E-value 1e-148, 43.54%
EBI UniRef50: UniRef50_UPI0001CBA86F, E-value 2e-147, 49.06%, UPI0001CBA86F related cluster n=4 Tax=unknown RepID=UPI0001CBA86F
NCBI RefSeq: XP_969111.2, E-value 2e-161, 49.91%, PREDICTED: similar to alpha-(1,6)-fucosyltransferase [Tribolium castaneum]
NCBI nr blastp: gi|189236914, E-value 4e-160, 49.91%, PREDICTED: similar to alpha-(1,6)-fucosyltransferase [Tribolium castaneum]
NCBI nr blastx: gi|347966757, E-value 1e-157, 48.71%, AGAP001888-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway: tca:657565, E-value 5e-161
    K00717 (FUT8) maps to:
        Glycosaminoglycan biosynthesis - keratan sulfate
        N-Glycan biosynthesis
Orthology group: MCL13804, Single-copy universal gene

Nucleotide sequence:

>DPOGS205454-TA
ATGAAGCTCCACCCCGGCTGGCACACGGGTCTTGATGGCTACTTTATGATGTACCTGACGAAATGGAAGAGGGCGGCTGTGCTCTTACTGTTCATCTGGATCGCTGTCACCTATCTAGTCATATCGCCTCTAAGATGCGACAGCAGTTCAGAGGAGTCGATTGATTTCCAGGAGAGGCTGAAGACGGCCTCTCTACAGCTGGAGCTTCTAAAACAACAGCACAGTAACCTAATATCACAAATAAAGAAATCCTCTGGATTGAACGTTAATCTAAACGAAATAGATGCAGCCGAGTTCCACAACGGAGGCGGTCCATCCGAGGAGTACGAGAATCTCAGGAGGAGAATATACTCTAATACCAAGGAAATATGGTACTTCATCAATCACGAGCTGACGAAGCTCGTCAACGACGACGTCCAGCCGGAGAAGGTTCAAGCGATTCTCGACCAAGTTGCAGACAGAAAGAGATCACTGCTATCAGATCAAGAGAAACTCCCGAAACTCGATGGTTACGAGGATTGGAGGCGGTCTGAGGCGAGTGAGGTCAGTGACCTTGTTCAGAAGAGGCTGAAATACCTACAGAACCCACCAGACTGTAGGGACGCTAGGAAGGTCATATGCAACTTGAATAAGGGTTGCGGGTTCGGCTGTCAGCTTCATCACATAGTATACTGTCTAATATTCGCGTACGCTACCGAGAGGACTCTCATACTGAACTCAAAGGGCTGGAGGTATAATAACAAGGGCTGGGAATACGTGTTCCATCCCATATCAGACACCTGCACCACGGCCTATGACGATAAGGTGGTGCCCTGGCCAGCGTCTTACGACGCGAAGGTGGTGTCTCTGCCGTTCATCGACTCTGTGTCTCAGAAGCCCAAGTTCCTGCCACTGGCAGTACCCTCGGACCTGGCGCACAGGATAGTCCGTTTCAACGGCGACCCGTCGTCATGGTGGATCGGTCAGATGCTCAAGTACGTGCTGAAGCCCCGCGCCGCCATGCAGAAAGCTATCAACGAGACCATAGCCAAGATGAACTTCAAGAATCCCATAGTCGGTGTTCACATCCGTCGCACAGACAAAGTCGGCACAGAGGCAGCCTTCCACCACATAGACGAGTACATGGTACATGTGAAGGAGTACTACAGGACCTTGGAGATGACCAAACATGTGGACAAGAAGAGGGTCTACCTCGCCACTGATGACGCTAATGTGCTACAAGACGCGAGGAACAAGTACAAGGAGTACGAGTTTCTAGGCGACCCGTCCATAGCCAAGACCGCGGCCACTCACCGCCGCTACACCCCGCTATCCCTCACCGGGCTGCTGGTAGACCTGCATATGCTGGCTATGTGCGACTATATAGTGTGCACTTTCAGCAGTCAGGTGGGTCGCGTCGCCTACGAGATGATGCAGGCCAACAGACCTGATGCATCCGACAGTTTCCACTCACTGGACGACATCTACTACTTCGGGGGTCAGAACGCTCACGACAGGAGGGCCCTCATGAATCACGAAGCCGGGGGGCAGGAGATAAGCTTTCAGGCTGGTGACTTGATCGGCATCGCCGGCAACCACTGGAACGGCTTCGGGAGGGGGACAAACAAACGGACTAATTTGAACGGCCTCATACCATGGTACAAGACCGCGGACCACCTCGTTCTGTACCCATTCCCCGAATACAAACACCTGCAGACGGAAACGCGACAGAAAGACTTATAA

Protein sequence:

>DPOGS205454-PA
MKLHPGWHTGLDGYFMMYLTKWKRAAVLLLFIWIAVTYLVISPLRCDSSSEESIDFQERLKTASLQLELLKQQHSNLISQIKKSSGLNVNLNEIDAAEFHNGGGPSEEYENLRRRIYSNTKEIWYFINHELTKLVNDDVQPEKVQAILDQVADRKRSLLSDQEKLPKLDGYEDWRRSEASEVSDLVQKRLKYLQNPPDCRDARKVICNLNKGCGFGCQLHHIVYCLIFAYATERTLILNSKGWRYNNKGWEYVFHPISDTCTTAYDDKVVPWPASYDAKVVSLPFIDSVSQKPKFLPLAVPSDLAHRIVRFNGDPSSWWIGQMLKYVLKPRAAMQKAINETIAKMNFKNPIVGVHIRRTDKVGTEAAFHHIDEYMVHVKEYYRTLEMTKHVDKKRVYLATDDANVLQDARNKYKEYEFLGDPSIAKTAATHRRYTPLSLTGLLVDLHMLAMCDYIVCTFSSQVGRVAYEMMQANRPDASDSFHSLDDIYYFGGQNAHDRRALMNHEAGGQEISFQAGDLIGIAGNHWNGFGRGTNKRTNLNGLIPWYKTADHLVLYPFPEYKHLQTETRQKDL-
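
The lengths listed for this record (1722 bp transcript, 573 aa protein) can be cross-checked against the two FASTA records: a complete coding sequence should contain three nucleotides per residue plus one stop codon. The following is a minimal Python sketch of that check, not part of the database itself. It assumes the transcript is coding sequence only and that a trailing `-` in the protein marks the stop codon, as it appears to here. The function names `parse_fasta` and `lengths_consistent`, and the shortened `toy-TA`/`toy-PA` records, are hypothetical and used only for illustration.

```python
def parse_fasta(text):
    """Return {record_id: sequence} parsed from FASTA-formatted text."""
    records = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The record ID is the first whitespace-delimited token of the header.
            current = line[1:].split()[0]
            records[current] = []
        elif current is not None:
            records[current].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def lengths_consistent(transcript, protein):
    """True if len(transcript) == 3 * (number of residues + 1 stop codon).

    Assumption: the transcript is a complete coding sequence and a trailing
    '-' (or '*') in the protein stands for the stop codon.
    """
    residues = protein.rstrip("-*")
    return len(transcript) == 3 * (len(residues) + 1)


# Hypothetical shortened records: the first five codons of the transcript
# above followed by a stop codon, and the matching five residues.
example = """>toy-TA
ATGAAGCTCCACCCCTAA
>toy-PA
MKLHP-
"""

seqs = parse_fasta(example)
print(lengths_consistent(seqs["toy-TA"], seqs["toy-PA"]))
```

Applied to the full record, the same arithmetic gives 3 × (573 + 1) = 1722, which matches the transcript length listed above.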