Monarch geneset OGS2.0

DPOGS215377
Transcript: DPOGS215377-TA, 1218 bp
Protein: DPOGS215377-PA, 405 aa
Genomic position: DPSCF300088 - 689804-711357
RNAseq coverage: 179x (Rank: top 50%)
Annotation
Heliconius: HMEL017438, E-value 6e-138, identity 86.04%
Bombyx: BGIBMGA012390-TA, E-value 1e-132, identity 83.98%
Drosophila: CG9220-PC, E-value 6e-87, identity 60.00%
EBI UniRef50: UniRef50_Q7QHY5, E-value 5e-86, identity 46.78%, AGAP001010-PA n=1 Tax=Anopheles gambiae RepID=Q7QHY5_ANOGA
NCBI RefSeq: XP_309203.4, E-value 1e-86, identity 46.78%, AGAP001010-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|347964938, E-value 2e-85, identity 46.78%, AGAP001010-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|195478895, E-value 2e-81, identity 60.25%, GE16036 [Drosophila yakuba]
Group
Gene Ontology: GO:0032580, E-value 7e-87, Golgi cisterna membrane
 GO:0016758, E-value 7e-87, transferase activity, transferring hexosyl groups
 GO:0016020, E-value 2e-14, membrane
 GO:0016757, E-value 2e-14, transferase activity, transferring glycosyl groups
KEGG pathway: aga:AgaP_AGAP001010, E-value 3e-86
 K13499 (CHSY) maps-> Glycosaminoglycan biosynthesis - chondroitin sulfate
InterPro domain: [67-253] IPR008428, E-value 7e-87, Chondroitin N-acetylgalactosaminyltransferase
 [69-252] IPR003378, E-value 2e-14, Fringe-like
Orthology group: MCL25963, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215377-TA
ATGCCGCGAAGGAGCAAGTGGATGCATCTGCTGGCCGGCATCGTCGCCGGAGTCACGTTGGGTGTTTTTTTGATACTGTGCTTGAGGACTTCTTCGCGGTCTGTGGAAACTTGTCCGGTTGCGGCTAAGACGTATGTGGCTCCAGACCCCCTGGTGTTAATAGATTTAAAACCGGACGAAAATATACAAAATTCTAACAGAACTTTAGTGTTTGTCGGTGTGATGACCGCGGAACAGTATCTGACTACTAGGGCTAGAGCGGTTTATGAAACCTGGGCTCAAGATCTACCAGGCAGATTAGCGTTCTTTAGCTCGGAAGTATCGCGGGCCCCGGGTCTGCCTCTGGTGTCGTTGCGGAACGTCGACGACAGTTATCCTCCGCAGAAGAAGTCATTCATGATGCTGTTGTATATGTACGAGAATTACGGTGACAAGTTCGAGTGGTTCATGCGTGCGGACGACGACGTTTACGTGAGAGGTGACAAGTTGGGGCGGTTCTTGAGATCCGTGGACAGTAGGAAGCCGCAGTTTATAGGGCAAGCTGGGAGAGGGACCAACTCAGAGAGGGACGCTTTGGCCTTAGATTACAACGAGAACTTCTGTATGGGAGGGCCTGGAGTGCTGATGTCGCGGGAGACTCTCCGGCGAGTGGCTCCTCACGTGAAGTACTGTCTGAAGCATCTTTACACGACACACGAGGACGTCGAGATCGGCCGCTGTGTCGCCAAATTCGCTGGAGTATCCTGCACCTGGAGCTACGATCTCCTCGGAACTATTGTAAGTTTCTTAGCACGAAAAACCGATTTTTTTATCGGTGGTAAAGAAGGAGAATGGGAAAAGGTAGTAAAGGATACATTTTATGTTCATGCTAAAAAAGCTAAAGAGGAAGCTGATAAAATTAAAAAAGAAAAAGAAGAAGCTGATAGAAGGTTAAAAGAAATTCAACAGAAGAAGGAACAAGAACGTCTAGCTCAGGACTTTGTACCAGCGACTGTTACAGAACTAACAGATCAAGAAGCTAATAAGATGCTTGAAGACATAGAAAAGGAGAAACAGAGCAAAAGCGTCAACACTAAGAGCCGGCCCAGTCGGTCCTGCGGTGTCGGCCGCTCGTCGCTCGAGTATGATAAAGCGTCTCGTCTAACCTCAATATTTCCTAAGACAAGGCTGGCTGGATGTCGCTTAGATTGGCGTTTCCTCGCGTGCGATGTTCAGTAA

Protein sequence:

>DPOGS215377-PA
MPRRSKWMHLLAGIVAGVTLGVFLILCLRTSSRSVETCPVAAKTYVAPDPLVLIDLKPDENIQNSNRTLVFVGVMTAEQYLTTRARAVYETWAQDLPGRLAFFSSEVSRAPGLPLVSLRNVDDSYPPQKKSFMMLLYMYENYGDKFEWFMRADDDVYVRGDKLGRFLRSVDSRKPQFIGQAGRGTNSERDALALDYNENFCMGGPGVLMSRETLRRVAPHVKYCLKHLYTTHEDVEIGRCVAKFAGVSCTWSYDLLGTIVSFLARKTDFFIGGKEGEWEKVVKDTFYVHAKKAKEEADKIKKEKEEADRRLKEIQQKKEQERLAQDFVPATVTELTDQEANKMLEDIEKEKQSKSVNTKSRPSRSCGVGRSSLEYDKASRLTSIFPKTRLAGCRLDWRFLACDVQ-
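
Length check:

The transcript and protein lengths in this record (1218 bp and 405 aa) fit a coding sequence of 405 codons plus one stop codon. The following is a minimal Python sketch of that check, assuming the two FASTA records are supplied as plain text and that a trailing "-" (or "*") in the protein marks the stop codon. The function names `read_fasta` and `cds_matches_protein` are illustrative only and are not part of any geneset tooling.

```python
def read_fasta(text):
    """Parse FASTA-formatted text into a {header: sequence} dict."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the full header text (minus ">") as the record name.
            header = line[1:].strip()
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def cds_matches_protein(cds, protein):
    """Return True if the CDS length equals 3 * (residues + 1 stop codon).

    Assumption: a trailing "-" or "*" on the protein is a stop marker,
    not a residue, so it is removed before counting.
    """
    residues = protein.rstrip("-*")
    return len(cds) % 3 == 0 and len(cds) == 3 * (len(residues) + 1)


if __name__ == "__main__":
    # Toy records for illustration; paste the real FASTA text in their place.
    toy = ">toy-TA\nATGCCG\nTAA\n\n>toy-PA\nMP-\n"
    seqs = read_fasta(toy)
    print(cds_matches_protein(seqs["toy-TA"], seqs["toy-PA"]))
```

Applied to this record, the check compares 1218 with 3 * (405 + 1). It confirms only that the lengths agree, not that the translation itself is correct.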