New model in OGS2.0 | DPOGS215378  |
---|---|
Genomic Position | scaffold2874:- 15248-20445 |
See gene structure | |
CDS Length | 1833 |
Paired RNAseq reads   | 704 |
Single RNAseq reads   | 2035 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012391 (0.0) |
Best Drosophila hit   | CG9220 (2e-93) |
Best Human hit | chondroitin sulfate synthase 1 (5e-83) |
Best NR hit (blastp)   | PREDICTED: similar to CG9220-PC isoform 1 [Apis mellifera] (4e-124) |
Best NR hit (blastx)   | chondroitin synthase [Aedes aegypti] (5e-118) |
GeneOntology terms    | GO:0030206 chondroitin sulfate biosynthetic process GO:0008375 acetylglucosaminyltransferase activity GO:0015020 glucuronosyltransferase activity GO:0032580 Golgi cisterna membrane GO:0016758 transferase activity, transferring hexosyl groups |
InterPro families   | IPR008428 Chondroitin N-acetylgalactosaminyltransferase |
Orthology group | MCL10993 |
Nucleotide sequence:
ATGCAGACCATACTTCACCATAACAGCAGCGGCGCGGCGGCTTACACCGGCCGTCTGAAG
AAACGAGAGATCCACCGAGCGATCACGCTACATCCGGTAAAAGATCATCGTCAGATGTAC
CGACTACACACATACTTTAAGAATCTTCACATCCAAAGCCTCCGAGAGCAGTCGTTGGAC
CTCCACAGAGACATCGCCTCTTCAGTTCGCGAGCTGGGCGTAAAGATGGATCAAGTGTCG
GATTACGAGCTCGAGCCCGGGGTGCCGCTGTTCCCCGCCAAGACGGGGGAGCCGGGGTAC
TTAGGAAACAATCGGATATTAGGCGCGCCCGTGGACTTCAACAGGTTCAAGGCGGACAAC
CCGGAAGATATAGTGCCTTTCGATTTCATCAGCAAGTCCATCTACTCCATCAGTCACTCC
AACCCCAAGAGGAGAATAGAAACTCCTCTCAAGGAGGCTATAGACGATATTATTAGAGAG
GTGATGGAGATCATCAACGGCCCGTCTCGTCAGCGCGGGCGCGTCATCGACTTCAACGAG
CTTCTGTACGGGTACGTCCGCCTGCAGCCGCTCCACGGCGTGGACACGGTGCTGGATCTG
CTGCTGCACTACAGGAGGTACCGCGGCAGGAAGATGACCGCCGCCGTCCGCAGGCACGCC
TACCTACAGCAGACCTTCGCCGCTACGGAAATCCGCGAGTTACCAATGGGCGAACCTCCG
CCAGCCGAGGACCCGCGGGTACTGGAGAGACTTGATCTGGACGGGAGGAACTCTCGGCAC
GAATACGAGGACCTGGAAGATACGGAGAGAGACAGCCCCAGCAGTCTGCTGGACAGGATG
AAGGTCAGGGAAGCCTTCCAAAACGGTCTGTCGCATCTGAGGAACGGCCTCCCGAAAATA
TTGAAGTGGACCGACGACTTCGAGTACCCGATTTACGACCGACGGATACACTTCATCCTG
CCGCTGATGGGGCGAGCCGAGATATTCCACAGGTTCATGAGGAACTACGAGGACATAGTG
CTGAGGAGCGACGAGACTGTCAGTCTCATAGTGGTCTTGTACATGGACAGCAAGAAGCCA
CTGGACTACGAGAACTCGGAGGCGTTGATCACGACTTACAAAGATAGGTACGGCAAGGAC
ATCCGCTTAGTGTCCATGGGTTCCAGAACTTTCTCCCGGGGAGCCGCGCTGAGCGCCGGG
CTCGAGCTGATAGAGGACGACCAGCTCGCCTTCTTCGTCGACGTGGACATGATGTTCAAC
CACGTGTCGCTCAGGAGGATCAGAATAAACACCATCAAGTACCACCAGGTCTACTTCCCC
ATCGTGTTCAGCGAGTACAATCCCGAGGTGGTCTACGGCGACGACTACAACAAGTTCAAA
GGCGAGACGCTGGTCATGAAGAGCGACCTCGGGAAATACGTCGCCAAAGACGACCAGTTG
ACGGCGGCCCAGCTGGCGGACCTCAAGTACAGCAAACAGATCACCGACGACTTCGGATAC
TTCAGGCAGTACGGCTTCGGGATACTCGGGATATACAAGTGTGACTTCATGAGAGTGGGG
GGCTTCGACCTCAACATCAAGGGCTGGGGCTTGGAAGACGTCCAGCTTTTCGAGACCTTG
ATCAAATCCAACCTTACGGTTTTCCGGATCGCCGACGACACCCTCGTGCACATCTTCCAT
TCGGTGGACTGCGATAAGAATTTAGAGAAGAGCCAGTTCCTCATGTGTCTGGGGACCAAA
GCCTCCACGTACGGGAGTGACAAACATATGTACTACTACATGTTGAACCATCCGGAACTA
CTGTGGCCTCGAGACGAGAAGGCAGCGAGCTAG
Protein sequence:
MQTILHHNSSGAAAYTGRLKKREIHRAITLHPVKDHRQMYRLHTYFKNLHIQSLREQSLD
LHRDIASSVRELGVKMDQVSDYELEPGVPLFPAKTGEPGYLGNNRILGAPVDFNRFKADN
PEDIVPFDFISKSIYSISHSNPKRRIETPLKEAIDDIIREVMEIINGPSRQRGRVIDFNE
LLYGYVRLQPLHGVDTVLDLLLHYRRYRGRKMTAAVRRHAYLQQTFAATEIRELPMGEPP
PAEDPRVLERLDLDGRNSRHEYEDLEDTERDSPSSLLDRMKVREAFQNGLSHLRNGLPKI
LKWTDDFEYPIYDRRIHFILPLMGRAEIFHRFMRNYEDIVLRSDETVSLIVVLYMDSKKP
LDYENSEALITTYKDRYGKDIRLVSMGSRTFSRGAALSAGLELIEDDQLAFFVDVDMMFN
HVSLRRIRINTIKYHQVYFPIVFSEYNPEVVYGDDYNKFKGETLVMKSDLGKYVAKDDQL
TAAQLADLKYSKQITDDFGYFRQYGFGILGIYKCDFMRVGGFDLNIKGWGLEDVQLFETL
IKSNLTVFRIADDTLVHIFHSVDCDKNLEKSQFLMCLGTKASTYGSDKHMYYYMLNHPEL
LWPRDEKAAS