Monarch geneset OGS2.0

DPOGS215053
Transcript: DPOGS215053-TA (1008 bp)
Protein: DPOGS215053-PA (335 aa)
Genomic position: DPSCF300208 + 4465-5505
RNAseq coverage: 0x (Rank: top 99%)
Annotation
Database         Hit                E-value   Identity   Description
Heliconius       HMEL002413         3e-127    63.10%
Bombyx           BGIBMGA005534-TA   1e-112    56.40%
Drosophila       brn-PA             9e-60     38.49%
EBI UniRef50     UniRef50_E2D889    1e-114    58.72%     Beta-1,3-galactosyltransferase n=5 Tax=Obtectomera RepID=E2D889_HELAM
NCBI RefSeq      NP_001040545.1     3e-111    56.40%     beta-1,3-galactosyltransferase [Bombyx mori]
NCBI nr blastp   gi|301072331       4e-114    58.72%     beta-1,3-galactosyltransferase [Helicoverpa armigera]
NCBI nr blastx   gi|301072331       2e-115    59.17%     beta-1,3-galactosyltransferase [Helicoverpa armigera]
Group
Gene Ontology:
  GO:0016020   3.2e-86   membrane
  GO:0006486   3.2e-86   protein glycosylation
  GO:0008378   3.2e-86   galactosyltransferase activity
KEGG pathway:
  api:100164641   7e-70
  K02175 (BRN) maps-> Dorso-ventral axis formation
InterPro domain:
  [7-322]   IPR002659   3.2e-86   Glycosyl transferase, family 31
Orthology group: MCL16373 (Insect specific)

Nucleotide sequence:

>DPOGS215053-TA
ATGAGGAAGAAGAAGTGTATTTATATAGTGGTCGTGGTTGTGTTTCTTTACTACGTGTTTGGTGTCGATGACTATATACATGCCAGGAGTTACGACAAGGAGTTCGACTACCCCCTCAGCATCGACATACGGCCTCTGGTAGACGAAGTCCTAGCTGGGAAGAAGCCCTCACTGGCTCCTATTAATTTCTACCCATACAGATTTTTGAGCAATTCTGGGAAATGTACGTTAATAGAAAAAATCGATTTGTTCATCATCGTAAAATCTGCTATGAATAATTTTGAAAGACGTGACGCTATACGACAAACGTACGGAATGGAAACATTCAATCAGGGAATCGTTATGAGCACGATGTTCTTCGTCGGTGTCGATGAACCGAAGTCTGCCACCCAAAGGAGGCTCGAGCACGAGATGGCGGACTTCAAGGACATCATCCAAGTAGACTTCCAGGACACGTACGACAACAACACCATCAAGACCATGATGTCCTTCAGGTGGCTGTACGAGCACTGCCCCATCGCAGACTTCTACTTCTTCACCGACGACGACATGTACGTCTCCGTCAAGAATCTTCTAGAATACCTTAAAGAACAAACTAAGACCAAAGAAAGGGACCCCTTATTTTACGCTGGCTACATGTTCCATTCGAGTCCTCAGAGATTCAGATCGAGTAAGTGGCGGATCACCCTGGAGGAGTATCCTTTCGACCGCTGGCCGCCGTACATCACGGCCGGAGCGTACGTCGTCTCCAACCGCGCCATGAAGGTCATGTATGCGGCGAGCTTGTTCGTTAAGAACTTCCGCTTCGACGACATATACCTGGGAATAGTCGCCAAGAAGGCGAACATACCTATGACGCATTGTCCTAGAATTTACTTCTACAAGAAGAGCTCTTCCGTCGATGGGTACAAGGATGTGATCGCGTCCCACGGGTTCCACGACCCCGAGGTTCTCATGGCGACTTGGAGACACCAGCATCTTCAGAGCCCCCACACCAAGGTCGGATGA

Protein sequence:

>DPOGS215053-PA
MRKKKCIYIVVVVVFLYYVFGVDDYIHARSYDKEFDYPLSIDIRPLVDEVLAGKKPSLAPINFYPYRFLSNSGKCTLIEKIDLFIIVKSAMNNFERRDAIRQTYGMETFNQGIVMSTMFFVGVDEPKSATQRRLEHEMADFKDIIQVDFQDTYDNNTIKTMMSFRWLYEHCPIADFYFFTDDDMYVSVKNLLEYLKEQTKTKERDPLFYAGYMFHSSPQRFRSSKWRITLEEYPFDRWPPYITAGAYVVSNRAMKVMYAASLFVKNFRFDDIYLGIVAKKANIPMTHCPRIYFYKKSSSVDGYKDVIASHGFHDPEVLMATWRHQHLQSPHTKVG-
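
Consistency check:

The lengths reported in this record can be cross-checked with a little arithmetic. The sketch below is not part of the original record. It assumes the transcript is a pure coding sequence that includes its stop codon (the nucleotide sequence starts with ATG and ends with TGA), and the function names are hypothetical.

```python
def cds_lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if the transcript length equals three nucleotides per
    residue plus one stop codon (assumes a CDS-only transcript)."""
    return transcript_bp == 3 * (protein_aa + 1)


def genomic_span(start: int, end: int) -> int:
    """Length of an inclusive, 1-based genomic interval."""
    return end - start + 1


# Values taken from the record above.
transcript_bp = 1008          # DPOGS215053-TA
protein_aa = 335              # DPOGS215053-PA
span = genomic_span(4465, 5505)

print(cds_lengths_consistent(transcript_bp, protein_aa))
print(span, span - transcript_bp)
```

Under these assumptions the transcript and protein lengths agree (3 x 335 + 3 = 1008). The genomic interval is longer than the transcript, which would be consistent with non-coding sequence such as an intron inside the gene model, although the record does not state the exon structure.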