Monarch geneset OGS2.0

DPOGS213364
Transcript | DPOGS213364-TA | 1035 bp
Protein | DPOGS213364-PA | 344 aa
Genomic position | DPSCF300109 - 91245-94276
RNAseq coverage | 16x (Rank: top 81%)
Annotation
Heliconius | HMEL005881 | 1e-21 | 32.87%
Bombyx | BGIBMGA009167-TA | 1e-75 | 43.96%
Drosophila | CG8673-PA | 3e-18 | 30.39%
EBI UniRef50 | UniRef50_B7PGF2 | 2e-31 | 30.36% | Galactosyltransferase, putative n=1 Tax=Ixodes scapularis RepID=B7PGF2_IXOSC
NCBI RefSeq | XP_002434274.1 | 4e-32 | 30.36% | galactosyltransferase, putative [Ixodes scapularis]
NCBI nr blastp | gi|241999262 | 7e-31 | 30.36% | galactosyltransferase, putative [Ixodes scapularis]
NCBI nr blastx | gi|241999262 | 4e-31 | 30.36% | galactosyltransferase, putative [Ixodes scapularis]
Group
Gene Ontology | GO:0016020 | 2.5e-41 | membrane
              | GO:0006486 | 2.5e-41 | protein glycosylation
              | GO:0008378 | 2.5e-41 | galactosyltransferase activity
KEGG pathway | isc:IscW_ISCW003730 | 1e-31
             | K07819 (B3GALT1) maps to Glycosphingolipid biosynthesis - lacto and neolacto series
InterPro domain | [85-311] | IPR002659 | 2.5e-41 | Glycosyl transferase, family 31
Orthology group | MCL30637 | Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213364-TA
ATGATGCAGAAGATTTTATGCGTAGTCCCTTGCAAGAGACGGATCAAGATTCGATTGGTTCTGGTCACCCTCACCATCGTCTCTCTGGTTGTTGGTTGGCATCTGTCATATAGATGTAGCAGTGCACTGTTACCTCCAAAACCTAACCTGGCCCTCGATCACTTCAGGACAGACAGAAGTTTGAAGAGCTATCTCGAAAAAATCAATATCCTGATAGAGCCTTCGAAGGTGGTATGTCCCACGACAACGCAATTACTGGTCCTTGTGACATCAGCCCCTGATAGGTTCGAACACCGTGACGCGGTCAGAAATACCTGGGCTTCGCACTTCCCGACTTACTTCATCATGGGACTCCATGGGAACACCGTCGAGGATCTGATGGTAGAAAACTACGTGGAAGCAAAAATGTACAGTGACGTCATCATATACAAGTTCAAAGATCACTACCAGAATCTGACTCTGAAGACGGCTCTCATGCTGGAGTGGACGGCTACCAGGTGCCCCACAGACTTGGTGCTGTTCAAGACTGATGATGATGTTCTGGTGAACCCGTGGGTGATGAAGCAGCTGGTGAAGGAACACGCTGGCCGCGACCTTGTCGGCTACAAGCTACTAAACAAAAAGTTCCATCGCGACGTGTACAACAAGTGGTTCGTGCCGAGGTGGATGTTGAATGAGGATCACATCGAGGAATATCTCTCGGGGACAGGATATCTCATCAATGGTTATCACTTGAGGGACATCTTGGCGACGGCGTACAAGACGCCAATGATCAACTTGGAAGACGTGTACTTCACGTACTTGGTGTCGAAACGGAAACTGGGTTTAAACCTGACGCACGACAGAAGGCTGAGTCCCTTCAAGCCGTGGTTGCCGGGCGCCTGTATGTACTTCAAGTTGGCGTCATCGCATAGTCTATCTCCAGCGGAGATGACGCAGCACTGGCGGGGCGTCCAGCGGCTGGGTCGCGAGTATGACATGGGAAATGACGTCTGCGGCGATGACGTCACCTGGAGCGAAATGTTCCTGTATTGA

Protein sequence:

>DPOGS213364-PA
MMQKILCVVPCKRRIKIRLVLVTLTIVSLVVGWHLSYRCSSALLPPKPNLALDHFRTDRSLKSYLEKINILIEPSKVVCPTTTQLLVLVTSAPDRFEHRDAVRNTWASHFPTYFIMGLHGNTVEDLMVENYVEAKMYSDVIIYKFKDHYQNLTLKTALMLEWTATRCPTDLVLFKTDDDVLVNPWVMKQLVKEHAGRDLVGYKLLNKKFHRDVYNKWFVPRWMLNEDHIEEYLSGTGYLINGYHLRDILATAYKTPMINLEDVYFTYLVSKRKLGLNLTHDRRLSPFKPWLPGACMYFKLASSHSLSPAEMTQHWRGVQRLGREYDMGNDVCGDDVTWSEMFLY-
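
Consistency check (illustrative sketch, not part of the original record): the 1035 bp transcript corresponds to 345 codons, that is 344 amino acids plus one stop codon, which matches the 344 aa protein above, where the trailing "-" marks the stop. The Python below is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) and assuming the transcript shown is the coding sequence in frame from its first base. The names `translate` and `CODON_TABLE` are hypothetical helpers introduced here for illustration only.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence into one-letter amino acids.

    Stop codons are written as `stop_symbol`; "-" is used by default to
    mirror the trailing "-" in the protein record above (an assumption
    about that record's convention).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(protein)


# Length arithmetic from the record: 1035 bp -> 345 codons -> 344 aa + stop.
codon_count = 1035 // 3

# First 7 codons and last 6 codons of DPOGS213364-TA, copied from the record.
print(translate("ATGATGCAGAAGATTTTATGC"))  # MMQKILC
print(translate("GAAATGTTCCTGTATTGA"))     # EMFLY-
```

Passing the full DPOGS213364-TA sequence to `translate` and comparing the result with the DPOGS213364-PA string would extend the same check to the whole record.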