Monarch geneset OGS2.0

DPOGS203630
Transcript: DPOGS203630-TA (1509 bp)
Protein: DPOGS203630-PA (502 aa)
Genomic position: DPSCF300063 + 1076971-1081095
RNAseq coverage: 1152x (Rank: top 11%)
Annotation
Heliconius      HMEL015870        5e-155  71.69%
Bombyx          BGIBMGA001379-TA  1e-140  56.20%
Drosophila      CG33145-PB        2e-33   39.68%
EBI UniRef50    UniRef50_E2A2S5   9e-45   56.14%  Beta-1,3-galactosyltransferase 1 n=3 Tax=Formicidae RepID=E2A2S5_CAMFO
NCBI RefSeq     XP_624773.1       2e-47   54.12%  PREDICTED: similar to CG33145-PB, isoform B [Apis mellifera]
NCBI nr blastp  gi|307207986      9e-47   55.29%  Beta-1,3-galactosyltransferase 1 [Harpegnathos saltator]
NCBI nr blastx  gi|307207986      4e-46   54.71%  Beta-1,3-galactosyltransferase 1 [Harpegnathos saltator]
Group
Gene Ontology    GO:0016020  1.4e-70  membrane
                 GO:0006486  1.4e-70  protein glycosylation
                 GO:0008378  1.4e-70  galactosyltransferase activity
KEGG pathway     ame:552398  7e-47
                 K07819 (B3GALT1) maps-> Glycosphingolipid biosynthesis - lacto and neolacto series
InterPro domain  [96-477]  IPR002659  1.4e-70  Glycosyl transferase, family 31
Orthology group  MCL11591  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS203630-TA
ATGGCACGAGATGTCAACGGAGGGTGGGACGAGGACGACACCTGCACGGAGAGGGACAGCGACGATAGCCCCATCCTCTCATACAGGTATGACAAACGTCGCTACCCGTACAGAGAGCGGCCGGCGGAGCGCCGCTGGGCGGTGACCACGGTCGCCAAGTTCTCCTTCTACTGCACAGCCATGATTCTGTTCTGCGTGTTAATGTACATACCAGTTTATAACAGAGCCAACGAACAGATACCAAAGGTGGCTGTAGCTGGATGGTCGATTCACACGAACAGAGACACCAAAATATACGTACAACCAGATAACGTCACAACAATACATGAACCGAAATATGTCTGTCCCAAGAGTAACAGAGACAAAAAGAACAGTCTGCTCCTCCTCATAGTGGTCTGTTCATCGACCAGCAACTTCGACCGTCGAGTGGCCATACGAGAGACGTGGGGCAGTTATAAGAACTACTTAGAGTCGGGAAGAATATTCAAAGCAGTACAAGAAAAATACAAGAATTACAACTACACGTATGATTTATATGAGGAGGAAAAAAATCATTTAAATTTTAGTCTTAACTTAGATAGGAAAAAACGAGACATATCTGGTCTTGGGCGGTTCCTTCCGGAACTGGCGAGGGCGCTGCAGAAGAACCTCATCACTGTGACTGAAGAGGTTGAAGAAAAGAGGTTCGAAGACGAGAAGGGAGAAGAAATGTTGACCGGCTTCGACATGAACAAAGAACTTAACGAACAAGACGAGCTCGACTTAAACTATGATGACGAGTCTAACATCATGAAGATACCACCGAAAGGATACGAGGACCAGAGTCCAGACTTGGATGACGTTATTGATATGTTGAAGAAGTCGAAAGATTTTCCCAAAGAAGACGTCTCAGAACCAGTTTCGGACAAAGAAGTCGATTTCAAGCTCGTGTTCCTCCTGGGGCTGCCGTCACAAGATAACGACACGGACGTCCAGGAGAAGATCGAAGAGGAAGTTGACAAGTATGGTGACGTCATTCAAGAAGGATTCATAGACTCGTATAACAACCTCACCCTCAAGTCGATCATGATGCTGAAGTGGGTCACCAACAACTGCAACGAGAGTGTTCGCTACATCCTCAAGACTGATGACGACATGTATGTGAACGTCCCCAACCTGGTTCAGAACCTGAAGAACAGGTCCAAGGTCCACGACAGCACCAAGGGCCAGGAGAAAGAGTACATGCTGATCGGCGACCTGATATGTGGGGCGCGACCCGTCCAAGACGTTAGCAATAAGTGGTACAGCCCGCGGTACATGTACGGGGGCCGCGTGTACCCCCGCTACCTGTCGGGCACGGGGTACGCTCTGTCAGCGCCGGCCGCCAGCTCCCTCTACCGCGCCGCGCTACGAACCTCATACTTCCACCTTGAGGATATCTATATCACGGGTGACTATCACAAACTTCACACGTTGACTGACGAAGTTCTTTATCGACGGATCGACGTTTGTACGGCGTGGAAAGTGTAA

Protein sequence:

>DPOGS203630-PA
MARDVNGGWDEDDTCTERDSDDSPILSYRYDKRRYPYRERPAERRWAVTTVAKFSFYCTAMILFCVLMYIPVYNRANEQIPKVAVAGWSIHTNRDTKIYVQPDNVTTIHEPKYVCPKSNRDKKNSLLLLIVVCSSTSNFDRRVAIRETWGSYKNYLESGRIFKAVQEKYKNYNYTYDLYEEEKNHLNFSLNLDRKKRDISGLGRFLPELARALQKNLITVTEEVEEKRFEDEKGEEMLTGFDMNKELNEQDELDLNYDDESNIMKIPPKGYEDQSPDLDDVIDMLKKSKDFPKEDVSEPVSDKEVDFKLVFLLGLPSQDNDTDVQEKIEEEVDKYGDVIQEGFIDSYNNLTLKSIMMLKWVTNNCNESVRYILKTDDDMYVNVPNLVQNLKNRSKVHDSTKGQEKEYMLIGDLICGARPVQDVSNKWYSPRYMYGGRVYPRYLSGTGYALSAPAASSLYRAALRTSYFHLEDIYITGDYHKLHTLTDEVLYRRIDVCTAWKV-