Monarch geneset OGS2.0

DPOGS204351
TranscriptDPOGS204351-TA1140 bp
ProteinDPOGS204351-PA379 aa
Genomic positionDPSCF300142 + 292253-295410
RNAseq coverage16x (Rank: top 81%)
Annotation
HeliconiusHMEL0070155e-12763.96% 
BombyxBGIBMGA010226-TA6e-3027.80% 
DrosophilaFucTA-PA7e-2626.42% 
EBI UniRef50UniRef50_UPI00015B64151e-7943.85%UPI00015B6415 related cluster n=1 Tax=unknown RepID=UPI00015B6415
NCBI RefSeqXP_001604314.12e-8043.85%PREDICTED: similar to alpha1,3-fucosyltransferase B homologue [Nasonia vitripennis]
NCBI nr blastpgi|3323753321e-8044.14%unknown [Dendroctonus ponderosae]
NCBI nr blastxgi|3323753321e-8244.14%unknown [Dendroctonus ponderosae]
Group
Gene OntologyGO:00160202.1e-70membrane
GO:00084172.1e-70fucosyltransferase activity
GO:00064862.1e-70protein glycosylation
KEGG pathwaygga:4280922e-33 
 K07632 (FUT4)maps-> Glycosphingolipid biosynthesis - lacto and neolacto series
InterPro domain[39-363] IPR0015032.1e-70Glycosyl transferase, family 10
Orthology groupMCL17845 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204351-TA
ATGTATGTTTTAGTTAAACGCGGTACGCCTACCTGGTCGACATCAAATTTAGGATCATATTTATATAAAAATTATCCAATCGATGCCATATCTCCTGCAGGTTCTCAACGGAATACGACTTTTTTAGTTCTTATATGGAAACACTGGGAATGGCTTAAAAATCGTCATATTTACAGTTTTGATAAAAATCGGCCACTTGATCCGTTAGAAGATTGCAGCGTAAAAAACTGTAAATTTACAGGCGACGATGAGAAGTTGTTATTAGCCGACGCTGTTATAGTCCACGTACTAAAGGGTCTGTTTCCAAATACGACAACAAGAAATTTAACTCAAAGGTGGATCTTTTTGAATGATGAGTCTCCACAAAACGCATTTTACGCTGCGGTTAATAAACCTAAATTAAAAGACCTATCAAATATGTTCAATTGGTCAATGACTTACAGGAGCGATTCCGACGTGCCTGTTCCTTACGGCCGAACGGTACCCTTGAAAAAAGCGATTCTGAATCAAATAACATATGAATCTCTGGCATCATTGGTTCCGTACTGGGAAAATAAACGCAAGGACGTGTTAGCTTCTATCCTTATGTCCCACTGTGGGGTGCCCCGTAGAACAGAATATTTACAGAAATTACAGGAATATTTGACCGTTGATGTTTATGGAAAATGTTCTAAGAATAACAAAAATAGCTGTCCTGGTCACTTTCGGTCTGACTGTAATCTTGTATCGAAATATCTTTTTTACTTAGTGTTCGAAAACACACAGTGTCACGAGTACATGACAGAAAAATTATTTTACAATGCTTATAGCAAAGGTGCTATACCTGTCATTATGGGGCCTTCCATAGACTGCTGTGAGGGGCTTCTACCGCCTGATTCGTTTTTACACATCGACAATTATGATAATCCGCAACAATTAGCGGAGCATATGGTTGAAATAAGTGAAGATCTTAAAAAAATTCTAAGATTTCATCGATGGAGAAATGACTTTGAGGTAAAAAATGAGCATGGATATTTTGGAACTAGATCGTATCATTTATGTAGAATATGCGAAGCTTTGAATTACAATGACCAAGCAGTGAAATATTACGACGAAGAAGATCTAAGGATATTTTTTGATCCAACCTTGTCTTGTCGCTAA

Protein sequence:

>DPOGS204351-PA
MYVLVKRGTPTWSTSNLGSYLYKNYPIDAISPAGSQRNTTFLVLIWKHWEWLKNRHIYSFDKNRPLDPLEDCSVKNCKFTGDDEKLLLADAVIVHVLKGLFPNTTTRNLTQRWIFLNDESPQNAFYAAVNKPKLKDLSNMFNWSMTYRSDSDVPVPYGRTVPLKKAILNQITYESLASLVPYWENKRKDVLASILMSHCGVPRRTEYLQKLQEYLTVDVYGKCSKNNKNSCPGHFRSDCNLVSKYLFYLVFENTQCHEYMTEKLFYNAYSKGAIPVIMGPSIDCCEGLLPPDSFLHIDNYDNPQQLAEHMVEISEDLKKILRFHRWRNDFEVKNEHGYFGTRSYHLCRICEALNYNDQAVKYYDEEDLRIFFDPTLSCR-