DPGLEAN00788 in OGS1.0

New model in OGS2.0  DPOGS215378
Genomic Position  scaffold2874:- 15248-20445
CDS Length  1833
Paired RNAseq reads  704
Single RNAseq reads  2035
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012391 (0.0)
Best Drosophila hit  CG9220 (2e-93)
Best Human hit  chondroitin sulfate synthase 1 (5e-83)
Best NR hit (blastp)  PREDICTED: similar to CG9220-PC isoform 1 [Apis mellifera] (4e-124)
Best NR hit (blastx)  chondroitin synthase [Aedes aegypti] (5e-118)
GeneOntology terms
GO:0030206 chondroitin sulfate biosynthetic process
GO:0008375 acetylglucosaminyltransferase activity
GO:0015020 glucuronosyltransferase activity
GO:0032580 Golgi cisterna membrane
GO:0016758 transferase activity, transferring hexosyl groups
InterPro families  IPR008428 Chondroitin N-acetylgalactosaminyltransferase
Orthology group  MCL10993

Nucleotide sequence:

ATGCAGACCATACTTCACCATAACAGCAGCGGCGCGGCGGCTTACACCGGCCGTCTGAAG
AAACGAGAGATCCACCGAGCGATCACGCTACATCCGGTAAAAGATCATCGTCAGATGTAC
CGACTACACACATACTTTAAGAATCTTCACATCCAAAGCCTCCGAGAGCAGTCGTTGGAC
CTCCACAGAGACATCGCCTCTTCAGTTCGCGAGCTGGGCGTAAAGATGGATCAAGTGTCG
GATTACGAGCTCGAGCCCGGGGTGCCGCTGTTCCCCGCCAAGACGGGGGAGCCGGGGTAC
TTAGGAAACAATCGGATATTAGGCGCGCCCGTGGACTTCAACAGGTTCAAGGCGGACAAC
CCGGAAGATATAGTGCCTTTCGATTTCATCAGCAAGTCCATCTACTCCATCAGTCACTCC
AACCCCAAGAGGAGAATAGAAACTCCTCTCAAGGAGGCTATAGACGATATTATTAGAGAG
GTGATGGAGATCATCAACGGCCCGTCTCGTCAGCGCGGGCGCGTCATCGACTTCAACGAG
CTTCTGTACGGGTACGTCCGCCTGCAGCCGCTCCACGGCGTGGACACGGTGCTGGATCTG
CTGCTGCACTACAGGAGGTACCGCGGCAGGAAGATGACCGCCGCCGTCCGCAGGCACGCC
TACCTACAGCAGACCTTCGCCGCTACGGAAATCCGCGAGTTACCAATGGGCGAACCTCCG
CCAGCCGAGGACCCGCGGGTACTGGAGAGACTTGATCTGGACGGGAGGAACTCTCGGCAC
GAATACGAGGACCTGGAAGATACGGAGAGAGACAGCCCCAGCAGTCTGCTGGACAGGATG
AAGGTCAGGGAAGCCTTCCAAAACGGTCTGTCGCATCTGAGGAACGGCCTCCCGAAAATA
TTGAAGTGGACCGACGACTTCGAGTACCCGATTTACGACCGACGGATACACTTCATCCTG
CCGCTGATGGGGCGAGCCGAGATATTCCACAGGTTCATGAGGAACTACGAGGACATAGTG
CTGAGGAGCGACGAGACTGTCAGTCTCATAGTGGTCTTGTACATGGACAGCAAGAAGCCA
CTGGACTACGAGAACTCGGAGGCGTTGATCACGACTTACAAAGATAGGTACGGCAAGGAC
ATCCGCTTAGTGTCCATGGGTTCCAGAACTTTCTCCCGGGGAGCCGCGCTGAGCGCCGGG
CTCGAGCTGATAGAGGACGACCAGCTCGCCTTCTTCGTCGACGTGGACATGATGTTCAAC
CACGTGTCGCTCAGGAGGATCAGAATAAACACCATCAAGTACCACCAGGTCTACTTCCCC
ATCGTGTTCAGCGAGTACAATCCCGAGGTGGTCTACGGCGACGACTACAACAAGTTCAAA
GGCGAGACGCTGGTCATGAAGAGCGACCTCGGGAAATACGTCGCCAAAGACGACCAGTTG
ACGGCGGCCCAGCTGGCGGACCTCAAGTACAGCAAACAGATCACCGACGACTTCGGATAC
TTCAGGCAGTACGGCTTCGGGATACTCGGGATATACAAGTGTGACTTCATGAGAGTGGGG
GGCTTCGACCTCAACATCAAGGGCTGGGGCTTGGAAGACGTCCAGCTTTTCGAGACCTTG
ATCAAATCCAACCTTACGGTTTTCCGGATCGCCGACGACACCCTCGTGCACATCTTCCAT
TCGGTGGACTGCGATAAGAATTTAGAGAAGAGCCAGTTCCTCATGTGTCTGGGGACCAAA
GCCTCCACGTACGGGAGTGACAAACATATGTACTACTACATGTTGAACCATCCGGAACTA
CTGTGGCCTCGAGACGAGAAGGCAGCGAGCTAG

Protein sequence:

MQTILHHNSSGAAAYTGRLKKREIHRAITLHPVKDHRQMYRLHTYFKNLHIQSLREQSLD
LHRDIASSVRELGVKMDQVSDYELEPGVPLFPAKTGEPGYLGNNRILGAPVDFNRFKADN
PEDIVPFDFISKSIYSISHSNPKRRIETPLKEAIDDIIREVMEIINGPSRQRGRVIDFNE
LLYGYVRLQPLHGVDTVLDLLLHYRRYRGRKMTAAVRRHAYLQQTFAATEIRELPMGEPP
PAEDPRVLERLDLDGRNSRHEYEDLEDTERDSPSSLLDRMKVREAFQNGLSHLRNGLPKI
LKWTDDFEYPIYDRRIHFILPLMGRAEIFHRFMRNYEDIVLRSDETVSLIVVLYMDSKKP
LDYENSEALITTYKDRYGKDIRLVSMGSRTFSRGAALSAGLELIEDDQLAFFVDVDMMFN
HVSLRRIRINTIKYHQVYFPIVFSEYNPEVVYGDDYNKFKGETLVMKSDLGKYVAKDDQL
TAAQLADLKYSKQITDDFGYFRQYGFGILGIYKCDFMRVGGFDLNIKGWGLEDVQLFETL
IKSNLTVFRIADDTLVHIFHSVDCDKNLEKSQFLMCLGTKASTYGSDKHMYYYMLNHPEL
LWPRDEKAAS
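
Consistency check (illustrative sketch, not part of the original record):

The nucleotide and protein sequences above can be cross-checked against each other and against the stated CDS length. The sketch below assumes the standard genetic code (NCBI translation table 1) and translates only the first and last lines of the nucleotide sequence. The helper name `translate` and the constant names are hypothetical, chosen for illustration.

```python
# Minimal sketch: translate CDS fragments with the standard genetic code
# (assumed here to be the correct table for this gene model).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"   # codons starting with T
         "LLLLPPPPHHQQRRRR"   # codons starting with C
         "IIIMTTTTNNKKSSRR"   # codons starting with A
         "VVVVAAAADDEEGGGG")  # codons starting with G

# Build the codon -> amino acid lookup in TCAG order.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First and last lines of the nucleotide sequence, copied from the record.
FIRST_LINE = "ATGCAGACCATACTTCACCATAACAGCAGCGGCGCGGCGGCTTACACCGGCCGTCTGAAG"
LAST_LINE = "CTGTGGCCTCGAGACGAGAAGGCAGCGAGCTAG"

CDS_LENGTH = 1833  # value of the "CDS Length" field

print(translate(FIRST_LINE))  # MQTILHHNSSGAAAYTGRLK
print(translate(LAST_LINE))   # LWPRDEKAAS*

# If the final codon is a stop, the protein should have CDS_LENGTH / 3 - 1 residues.
expected_residues = CDS_LENGTH // 3 - 1
print(expected_residues)      # 610
```

The translated fragments match the start and end of the protein sequence, and the final TAG codon is the stop that accounts for the three nucleotides not represented in the protein.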