Monarch geneset OGS2.0

DPOGS206283
Transcript: DPOGS206283-TA (888 bp)
Protein: DPOGS206283-PA (295 aa)
Genomic position: DPSCF300290 + 186883-190149
RNAseq coverage: 406x (Rank: top 30%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL016897        9e-145   80.68%
Bombyx          BGIBMGA010795-TA  9e-119   74.09%
Drosophila      beta4GalT7-PA     1e-84    60.43%
EBI UniRef50    UniRef50_E2BNK8   6e-100   59.59%    Beta-1,4-galactosyltransferase 7 n=1 Tax=Harpegnathos saltator RepID=E2BNK8_HARSA
NCBI RefSeq     XP_001603688.1    2e-97    56.99%    PREDICTED: similar to beta-1,4-galactosyltransferase [Nasonia vitripennis]
NCBI nr blastp  gi|307203851      2e-99    59.59%    Beta-1,4-galactosyltransferase 7 [Harpegnathos saltator]
NCBI nr blastx  gi|307203851      2e-98    60.00%    Beta-1,4-galactosyltransferase 7 [Harpegnathos saltator]
Group
Gene Ontology: GO:0016757  2.9e-129  transferase activity, transferring glycosyl groups
               GO:0005975  2.9e-129  carbohydrate metabolic process
KEGG pathway: nvi:100120002  4e-97
    K00733 (B4GALT7) maps to: Glycosaminoglycan biosynthesis - heparan sulfate
                              Glycosaminoglycan biosynthesis - chondroitin sulfate
InterPro domain: [53-287]  IPR003859  2.9e-129  Galactosyltransferase, metazoa
Orthology group: MCL12677  Single-copy universal gene

Nucleotide sequence:

>DPOGS206283-TA
ATGTACAGAGGCTTGAATTTGATGGGTTACCGGATCAGCGGTTCAAAATGCCTCACTGTATGCTTAGGTCTTACTTTTCTTATGGGCTGTTTTATAGCGTCCCTTCCCATCGTGCCTTCAGAATCAGCTAACCTTCAGTTATCAACATCACAAAGTAAAAAGCTGGCCATAATCGTCCCATTCAGAGACCGTTTCGAAGAACTACTAGAATTTGTCCCTCACATGTATAAGTTTCTAAACAAACAGAAGATTCCGTTCCATATATTTGTTGTCCAACAAAAAGACAACAACAGATTTAACCGAGCATCACTAATTAATGTTGGTTTTATTTATACAAGGAACAACTATGAGTATATTGCGATGCATGATGTAGATTTGCTACCTTTGAACGATAAACTGAGCTATGAGTATCCCAAAAATGGACCAATCCACATCTCATCACCGCAAACACACCCAAAATATCACTACGATACTTTCATCGGTGGGATTTTGTTGATAAAACGTGAAGATTTCGAATTAGTCAATGGTTTATCAAATAATTATTGGGGTTGGGGTCTCGAAGATGATGAATTCTATGTAAGATTGAAAGATGCCGGACTGAAAGTAAGTCGTCCAGAAGGTATAACCACAGGACCTGAAAACACTTTTAAGCACATCCATGACAAGTCTTATCGCAAACGAGACATGCGGAAATGCTATAACCAGCGTGAAGTAACTCGGAGACGTGATCGGAGAACCGGTGTTCATGATGTGGCATACAACTTACACAGCTCTCATAACGTGTCCATAGATTCATTGCCCATAACGGTGATTAATGTGGAATTGATATGCAACAAAGATCTTACGCCATGGTGCCAATGTCCAGAACCTAAAGTAAAAAAGAACTAA

Protein sequence:

>DPOGS206283-PA
MYRGLNLMGYRISGSKCLTVCLGLTFLMGCFIASLPIVPSESANLQLSTSQSKKLAIIVPFRDRFEELLEFVPHMYKFLNKQKIPFHIFVVQQKDNNRFNRASLINVGFIYTRNNYEYIAMHDVDLLPLNDKLSYEYPKNGPIHISSPQTHPKYHYDTFIGGILLIKREDFELVNGLSNNYWGWGLEDDEFYVRLKDAGLKVSRPEGITTGPENTFKHIHDKSYRKRDMRKCYNQREVTRRRDRRTGVHDVAYNLHSSHNVSIDSLPITVINVELICNKDLTPWCQCPEPKVKKN-
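
The protein record appears to be the direct translation of the transcript: 888 bp is 296 codons, that is 295 amino acids plus a stop codon, which this record writes as a trailing "-". A minimal Python sketch of that check follows. It assumes the standard genetic code and that the transcript is a complete coding sequence in frame 1. The names `translate` and `expected_protein_length` are illustrative and not part of the geneset.

```python
# Standard genetic code (NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, codon by codon.

    Stop codons are written as stop_symbol ("-" matches this record).
    Trailing bases that do not fill a codon are ignored, and codons with
    ambiguous bases are written as "X".
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        protein.append(stop_symbol if residue == "*" else residue)
    return "".join(protein)


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# First eight and last nine codons of DPOGS206283-TA, copied from the record.
print(translate("ATGTACAGAGGCTTGAATTTGATG"))     # MYRGLNLM
print(translate("CCAGAACCTAAAGTAAAAAAGAACTAA"))  # PEPKVKKN-
print(expected_protein_length(888))              # 295
```

Passing the full DPOGS206283-TA sequence to `translate` and comparing the result with DPOGS206283-PA is a quick way to confirm that the two records are in sync.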