Monarch geneset OGS2.0

DPOGS203077
TranscriptDPOGS203077-TA891 bp
ProteinDPOGS203077-PA296 aa
Genomic positionDPSCF300294 + 26641-27869
RNAseq coverage128x (Rank: top 56%)
Annotation
HeliconiusHMEL0117308e-6763.04% 
BombyxBGIBMGA007765-TA9e-11362.59% 
Drosophilabeta3GalTII-PA5e-4038.43% 
EBI UniRef50UniRef50_E9HTR42e-5642.86%Putative uncharacterized protein n=1 Tax=Daphnia pulex RepID=E9HTR4_DAPPU
NCBI RefSeqXP_968057.14e-5738.98%PREDICTED: similar to UDP-Gal:betaGal beta 1,3-galactosyltransferase polypeptide 6 [Tribolium castaneum]
NCBI nr blastpgi|3227963536e-5841.89%hypothetical protein SINV_00253 [Solenopsis invicta]
NCBI nr blastxgi|3227963532e-5841.89%hypothetical protein SINV_00253 [Solenopsis invicta]
Group
Gene OntologyGO:00160201.7e-85membrane
GO:00064861.7e-85protein glycosylation
GO:00083781.7e-85galactosyltransferase activity
KEGG pathwaygga:4281854e-58 
 K00734 (B3GALT6)maps-> Glycosaminoglycan biosynthesis - heparan sulfate
    Glycosaminoglycan biosynthesis - chondroitin sulfate
InterPro domain[32-295] IPR0026591.7e-85Glycosyl transferase, family 31
Orthology groupMCL13758 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203077-TA
ATGAAAAGAGACGCTATAAGGGCGACATGGGCTAATTTTATTAATAATATATTCATTGAAAACGGTGAAACGTTATTCAAGTGGGATAACTCCTGGCTCAGAACAAATACAAAAACAGATTTAATAAAAATATTTTTTGTAATTGGAACACAAAATCTAGAAAAGGATAAGCTCATAAAAATTAATAATGAACTTAGCAGAAGTAATGACTTATTACTGTTAAATAAGTTCGAAGACTCTTATGAGAATCTAACATTAAAGTTACTTTATTCTCTAGATTTTCTTAGCAATAATTTAAAGAAATTAAAATATGTTATCAAATGTGACGATGATTCATTTGTTAGGGTTGATTTAATAGTAAAAGATTTGGAAGCTTTTGGACCGAAAATGGATGATCCATCTATTAGTTCTTATGTTACTTATAAGGAGACTGAACAAAACCAGAAAGGACTATATTGGGGATATTTTAATGGCAGGGCTCAAGTATTTTTAAATGGGAAGTGGCAGGAAAAAAAATGGTTTCTTTGTGACACCTATCTTCCTTATGCTTTAGGAGGTGGCTATGTTATATCCCACAATATAGTTGATTATATTTCAAGAAACTTGGAGTACCTAAGTGTTTATAATTCTGAAGATGTATCTATGGGTGTATGGACGGCAGCACTAAACGGAATAAATAGAGTGCACGACATAAGATTTGACACACAGTGGAAATCTCGCGGCTGTGAAGACAACATGTTGATACGGCACAAGCAAAGTCCGAGTGACATGTTGAAAATGTATAAAAACTTGATAGAATCCAAAGGCCTAGCGCTCTGTAAGTCGCAGTCCGTACTACGTAAGTCTTATAAATATAATTGGAATGTGCTCCCGAGTATGTGTTGTAAATAA

Protein sequence:

>DPOGS203077-PA
MKRDAIRATWANFINNIFIENGETLFKWDNSWLRTNTKTDLIKIFFVIGTQNLEKDKLIKINNELSRSNDLLLLNKFEDSYENLTLKLLYSLDFLSNNLKKLKYVIKCDDDSFVRVDLIVKDLEAFGPKMDDPSISSYVTYKETEQNQKGLYWGYFNGRAQVFLNGKWQEKKWFLCDTYLPYALGGGYVISHNIVDYISRNLEYLSVYNSEDVSMGVWTAALNGINRVHDIRFDTQWKSRGCEDNMLIRHKQSPSDMLKMYKNLIESKGLALCKSQSVLRKSYKYNWNVLPSMCCK-