Monarch geneset OGS2.0

DPOGS215378
TranscriptDPOGS215378-TA1833 bp
ProteinDPOGS215378-PA610 aa
Genomic positionDPSCF300088 - 678380-683577
RNAseq coverage298x (Rank: top 37%)
Annotation
HeliconiusHMEL0174370.074.03% 
BombyxBGIBMGA012391-TA0.071.24% 
DrosophilaCG9220-PC5e-10236.55% 
EBI UniRef50UniRef50_UPI0002063A3B8e-12740.20%UPI0002063A3B related cluster n=3 Tax=unknown RepID=UPI0002063A3B
NCBI RefSeqXP_396991.33e-12540.20%PREDICTED: similar to CG9220-PC isoform 1 [Apis mellifera]
NCBI nr blastpgi|3838574277e-13041.76%PREDICTED: chondroitin sulfate synthase 1-like [Megachile rotundata]
NCBI nr blastxgi|3407233934e-12640.60%PREDICTED: chondroitin sulfate synthase 1-like isoform 1 [Bombus terrestris]
Group
Gene OntologyGO:00325804.5e-128Golgi cisterna membrane
GO:00167584.5e-128transferase activity, transferring hexosyl groups
KEGG pathwayame:4135491e-124 
 K13499 (CHSY)maps-> Glycosaminoglycan biosynthesis - chondroitin sulfate
InterPro domain[1-604] IPR0084284.5e-128Chondroitin N-acetylgalactosaminyltransferase
Orthology groupMCL11425 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215378-TA
ATGCAGACCATACTTCACCATAACAGCAGCGGCGCGGCGGCTTACACCGGCCGTCTGAAGAAACGAGAGATCCACCGAGCGATCACGCTACATCCGGTAAAAGATCATCGTCAGATGTACCGACTACACACATACTTTAAGAATCTTCACATCCAAAGCCTCCGAGAGCAGTCGTTGGACCTCCACAGAGACATCGCCTCTTCAGTTCGCGAGCTGGGCGTAAAGATGGATCAAGTGTCGGATTACGAGCTCGAGCCCGGGGTGCCGCTGTTCCCCGCCAAGACGGGGGAGCCGGGGTACTTAGGAAACAATCGGATATTAGGCGCGCCCGTGGACTTCAACAGGTTCAAGGCGGACAACCCGGAAGATATAGTGCCTTTCGATTTCATCAGCAAGTCCATCTACTCCATCAGTCACTCCAACCCCAAGAGGAGAATAGAAACTCCTCTCAAGGAGGCTATAGACGATATTATTAGAGAGGTGATGGAGATCATCAACGGCCCGTCTCGTCAGCGCGGGCGCGTCATCGACTTCAACGAGCTTCTGTACGGGTACGTCCGCCTGCAGCCGCTCCACGGCGTGGACACGGTGCTGGATCTGCTGCTGCACTACAGGAGGTACCGCGGCAGGAAGATGACCGCCGCCGTCCGCAGGCACGCCTACCTACAGCAGACCTTCGCCGCTACGGAAATCCGCGAGTTACCAATGGGCGAACCTCCGCCAGCCGAGGACCCGCGGGTACTGGAGAGACTTGATCTGGACGGGAGGAACTCTCGGCACGAATACGAGGACCTGGAAGATACGGAGAGAGACAGCCCCAGCAGTCTGCTGGACAGGATGAAGGTCAGGGAAGCCTTCCAAAACGGTCTGTCGCATCTGAGGAACGGCCTCCCGAAAATATTGAAGTGGACCGACGACTTCGAGTACCCGATTTACGACCGACGGATACACTTCATCCTGCCGCTGATGGGGCGAGCCGAGATATTCCACAGGTTCATGAGGAACTACGAGGACATAGTGCTGAGGAGCGACGAGACTGTCAGTCTCATAGTGGTCTTGTACATGGACAGCAAGAAGCCACTGGACTACGAGAACTCGGAGGCGTTGATCACGACTTACAAAGATAGGTACGGCAAGGACATCCGCTTAGTGTCCATGGGTTCCAGAACTTTCTCCCGGGGAGCCGCGCTGAGCGCCGGGCTCGAGCTGATAGAGGACGACCAGCTCGCCTTCTTCGTCGACGTGGACATGATGTTCAACCACGTGTCGCTCAGGAGGATCAGAATAAACACCATCAAGTACCACCAGGTCTACTTCCCCATCGTGTTCAGCGAGTACAATCCCGAGGTGGTCTACGGCGACGACTACAACAAGTTCAAAGGCGAGACGCTGGTCATGAAGAGCGACCTCGGGAAATACGTCGCCAAAGACGACCAGTTGACGGCGGCCCAGCTGGCGGACCTCAAGTACAGCAAACAGATCACCGACGACTTCGGATACTTCAGGCAGTACGGCTTCGGGATACTCGGGATATACAAGTGTGACTTCATGAGAGTGGGGGGCTTCGACCTCAACATCAAGGGCTGGGGCTTGGAAGACGTCCAGCTTTTCGAGACCTTGATCAAATCCAACCTTACGGTTTTCCGGATCGCCGACGACACCCTCGTGCACATCTTCCATTCGGTGGACTGCGATAAGAATTTAGAGAAGAGCCAGTTCCTCATGTGTCTGGGGACCAAAGCCTCCACGTACGGGAGTGACAAACATATGTACTACTACATGTTGAACCATCCGGAACTACTGTGGCCTCGAGACGAGAAGGCAGCGAGCTAG

Protein sequence:

>DPOGS215378-PA
MQTILHHNSSGAAAYTGRLKKREIHRAITLHPVKDHRQMYRLHTYFKNLHIQSLREQSLDLHRDIASSVRELGVKMDQVSDYELEPGVPLFPAKTGEPGYLGNNRILGAPVDFNRFKADNPEDIVPFDFISKSIYSISHSNPKRRIETPLKEAIDDIIREVMEIINGPSRQRGRVIDFNELLYGYVRLQPLHGVDTVLDLLLHYRRYRGRKMTAAVRRHAYLQQTFAATEIRELPMGEPPPAEDPRVLERLDLDGRNSRHEYEDLEDTERDSPSSLLDRMKVREAFQNGLSHLRNGLPKILKWTDDFEYPIYDRRIHFILPLMGRAEIFHRFMRNYEDIVLRSDETVSLIVVLYMDSKKPLDYENSEALITTYKDRYGKDIRLVSMGSRTFSRGAALSAGLELIEDDQLAFFVDVDMMFNHVSLRRIRINTIKYHQVYFPIVFSEYNPEVVYGDDYNKFKGETLVMKSDLGKYVAKDDQLTAAQLADLKYSKQITDDFGYFRQYGFGILGIYKCDFMRVGGFDLNIKGWGLEDVQLFETLIKSNLTVFRIADDTLVHIFHSVDCDKNLEKSQFLMCLGTKASTYGSDKHMYYYMLNHPELLWPRDEKAAS-