Monarch geneset OGS2.0

DPOGS204706
Transcript: DPOGS204706-TA (621 bp)
Protein: DPOGS204706-PA (206 aa)
Genomic position: DPSCF300170 + 616109-617930
RNAseq coverage: 28x (Rank: top 76%)
Annotation
Source           Best hit           E-value   Identity   Description
Heliconius       HMEL008363         2e-49     50.52%
Bombyx           BGIBMGA007484-TA   1e-50     57.86%
Drosophila       beta4GalNAcTA-PA   2e-45     48.85%
EBI UniRef50     UniRef50_Q6J4T9    5e-53     50.77%     Beta 1,4-N-acetylgalactosaminyltransferase n=6 Tax=Arthropoda RepID=Q6J4T9_TRINI
NCBI RefSeq      XP_318033.4        2e-48     51.56%     AGAP004781-PA [Anopheles gambiae str. PEST]
NCBI nr blastp   gi|47156063        2e-52     50.77%     beta 1,4-N-acetylgalactosaminyltransferase [Trichoplusia ni]
NCBI nr blastx   gi|47156063        3e-51     50.77%     beta 1,4-N-acetylgalactosaminyltransferase [Trichoplusia ni]
Group
Gene Ontology: GO:0016757 (8.8e-78) transferase activity, transferring glycosyl groups
               GO:0005975 (8.8e-78) carbohydrate metabolic process
KEGG pathway: cel:Y73E7A.7 (1e-42)
    K07968 (B4GALT3) maps to:
        Glycosphingolipid biosynthesis - lacto and neolacto series
        Glycosaminoglycan biosynthesis - keratan sulfate
        N-Glycan biosynthesis
InterPro domain: [1-206] IPR003859 (8.8e-78) Galactosyltransferase, metazoa
Orthology group: MCL30504 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204706-TA
ATGAAACAGAAGTTGGAGTACCGGATATTCATTGTTGAACAGAAAGGTACAGATTTCTTCAATAGAGGGCGCCTCTTCAATGCTGGTTATTTGGAGGTACGGAAGTTTGGCAATTGGAAATGTGTAGTGTTCCACGACGTAGACCTGCTTCCTCTGGATGATAGAATACTCTACTCCTGTCCAATGTGGCCAAGACACATGTGTGGCACAGTCGTGGAGGTTAAGAATCCGAGTTTCCGAACACTATTCGGCGGAGTTTCCGCAATGATTCCACAGCATTTCGAGAAAGTTAACGGTTTCTCAAACGTATATTGGGGTTGGGGCGGCGAGGATAACGACTTATTTTGGAGGATTCGTGCCGTCGGTCTACCAATAGTTAGATACAACAAGCTTATAGCAAAATATACGTCACTTCAACATGACAAGTCGAAACCCAATACACTCAGATATAACCTTCTCAAAACTTTTGCGACACGTTTTTTACGAGACGGTCTTACAACTTTGGAATATGTCGTCGATAAAGTCACATTGCACCATCTCTACACGCACTTGATGCTGGATATAAACCCCAAGAAGAAAAACATTACAAAGATAATGTTAGAAGCATCTAGATGGATATAA

Protein sequence:

>DPOGS204706-PA
MKQKLEYRIFIVEQKGTDFFNRGRLFNAGYLEVRKFGNWKCVVFHDVDLLPLDDRILYSCPMWPRHMCGTVVEVKNPSFRTLFGGVSAMIPQHFEKVNGFSNVYWGWGGEDNDLFWRIRAVGLPIVRYNKLIAKYTSLQHDKSKPNTLRYNLLKTFATRFLRDGLTTLEYVVDKVTLHHLYTHLMLDINPKKKNITKIMLEASRWI-
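
The record lists a 621 bp transcript and a 206 aa protein, which is consistent with 207 codons where the last one is a stop codon (shown as "-" at the end of the protein sequence). The following is a minimal Python sketch of how that relationship could be checked. It assumes the standard genetic code and that the transcript sequence is coding sequence only; the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
# Minimal sketch: translate a coding sequence with the standard genetic code.
# Assumption: the transcript is CDS only and starts in frame at position 1.

BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG x TCAG x TCAG order ("*" = stop).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(nucleotides, stop_char="-"):
    """Translate a DNA string codon by codon; stops are written as stop_char.

    Trailing bases that do not fill a codon are ignored, and codons with
    unexpected characters are written as "X".
    """
    seq = nucleotides.upper()
    usable = len(seq) - len(seq) % 3
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        residues.append(stop_char if aa == "*" else aa)
    return "".join(residues)


# First seven codons of DPOGS204706-TA, copied from the record above.
print(translate("ATGAAACAGAAGTTGGAGTAC"))  # MKQKLEY

# Length check: 621 bp is 207 codons, i.e. 206 residues plus one stop codon.
print(621 // 3 - 1)  # 206
```

Passing the full DPOGS204706-TA sequence to `translate` and comparing the result with DPOGS204706-PA would show whether the two sequences in the record agree under these assumptions.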