Monarch geneset OGS2.0

DPOGS204991
TranscriptDPOGS204991-TA1308 bp
ProteinDPOGS204991-PA435 aa
Genomic positionDPSCF300123 + 123580-128672
RNAseq coverage364x (Rank: top 33%)
Annotation
HeliconiusHMEL0094980.082.06% 
BombyxBGIBMGA010226-TA0.076.48% 
DrosophilaFucTA-PA4e-12353.68% 
EBI UniRef50UniRef50_D6WLF91e-12360.22%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WLF9_TRICA
NCBI RefSeqXP_967512.12e-12460.22%PREDICTED: similar to alpha(1,3)fucosyltransferase [Tribolium castaneum]
NCBI nr blastpgi|910830674e-12360.22%PREDICTED: similar to alpha(1,3)fucosyltransferase [Tribolium castaneum]
NCBI nr blastxgi|3323728973e-12757.72%unknown [Dendroctonus ponderosae]
Group
Gene OntologyGO:00160202.5e-122membrane
GO:00084172.5e-122fucosyltransferase activity
GO:00064862.5e-122protein glycosylation
KEGG pathwaytca:6558576e-124 
 K00753 (E2.4.1.214)maps-> N-Glycan biosynthesis
InterPro domain[143-434] IPR0015032.5e-122Glycosyl transferase, family 10
Orthology groupMCL10963 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204991-TA
ATGTGGGCGCGAGCGGCGCGGTTTCTGAGGCGAGTGGCGCTTCTAGTGCCGCTGGCGTTGACGGCACTAGCGCTACTGCTGCTGCCGCGCCCGCAACCATTCAGCGCACCGAGACTGCCCTCGCCGCCGCCGCCGAAAGAAGACTTGCGTCAGAACTTGATAACAGAAAATGAAATCAATTTAAATCAAGACGTGGATGACGGTTCTCAGAAGCAATGGTTTATGAGCGGCGGCGAACACCGACCTTGGAAACACGACCCACATGCCAAACTCTTCCCCGAAGATGCACCTGACAATGACAGAATCGTCGAGCAGTTGATGTATGCATTGCCAAATGAAGAAGATATACCCTTGAAGAAGATTCTGCTAGCTAATGGTTTAGGTACGTGGGGGGTGACTGGGGGTCGCACTGAGTTCAAACGTAATAAATGTCCGGTCGATCGATGCACGCTGACAGCTGACACCAGAGAAGCTGAAACAGCAGATGCCATACTCTTCAAAGACCATCACACGCCATTCAATGTGAAACGACCTCACAGTCAGATATGGATCCTCTATTACTTGGAGTGTCCTTACCACACGGCATCGTTACGACCTACGTCGCTCGATGTGTTCAACTGGACAGCGACCTACAGACGCGACTCTGACATAGTGGCGCCGTACGAGAGATGGGTCTATTACGACCCACTGGTCACCGAGAGAGAACTTGATAGAAACTATGCAGCCAATAAGACCAAAAAGGTAGCGTGGTTTGTATCTAACTGCCACGCTCGTAACAGTAGGCTTCAGTACGCTAGACAGCTAGCTAAGTTCATCCCTGTAGACATATACGGTGCTTGCGGCTCTCACCACTGTCCACGAGCTGACCCCAACTGTTTGGAGATGCTCGATAAAGAGTACAAATTCTATTTGGCGTTCGAGAACTCAAATTGCAGGGATTACGTCACCGAGAAGTTTTTCGTCAATGGATTACAACACGACGTGTTACCGATAGTGATGGGCGCGAGACCGTCTGAATACGCTGCAGTAGCTCCTCACAACTCCTACTTACACGTAGAAGAGTTCGCTGGGCCCGAGGAGCTCGCGAACTACTTGCGGAGATTGGACGAGGACGACAATATGTACAATTCGTACTTCAAATGGAAGGGTACTGGAGAATTCATTAATACCTATTTTTTCTGCCGAGTGTGTGCTATGGTTCATGCAAATGCTCGCAGACAGCGCAATGCACATTATACCGATGTGCAAGCTTGGTGGCGAGAAGGCGCTTGCACGCGCACCGACTGGCGGCCACGCGACCCACCATAG

Protein sequence:

>DPOGS204991-PA
MWARAARFLRRVALLVPLALTALALLLLPRPQPFSAPRLPSPPPPKEDLRQNLITENEINLNQDVDDGSQKQWFMSGGEHRPWKHDPHAKLFPEDAPDNDRIVEQLMYALPNEEDIPLKKILLANGLGTWGVTGGRTEFKRNKCPVDRCTLTADTREAETADAILFKDHHTPFNVKRPHSQIWILYYLECPYHTASLRPTSLDVFNWTATYRRDSDIVAPYERWVYYDPLVTERELDRNYAANKTKKVAWFVSNCHARNSRLQYARQLAKFIPVDIYGACGSHHCPRADPNCLEMLDKEYKFYLAFENSNCRDYVTEKFFVNGLQHDVLPIVMGARPSEYAAVAPHNSYLHVEEFAGPEELANYLRRLDEDDNMYNSYFKWKGTGEFINTYFFCRVCAMVHANARRQRNAHYTDVQAWWREGACTRTDWRPRDPP-