New model in OGS2.0 | DPOGS202474  |
---|---|
Genomic Position | scaffold386:- 10759-13809 |
CDS Length | 2268 |
Paired RNAseq reads   | 2094 |
Single RNAseq reads   | 4940 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | ND |
Best Drosophila hit   | CG4351 (3e-52) |
Best Human hit | chondroitin sulfate synthase 2 isoform 1 (8e-26) |
Best NR hit (blastp)   | PREDICTED: similar to chondroitin synthase [Nasonia vitripennis] (2e-94) |
Best NR hit (blastx)   | PREDICTED: similar to chondroitin synthase [Nasonia vitripennis] (8e-67) |
GeneOntology terms | GO:0032580 Golgi cisterna membrane; GO:0016758 transferase activity, transferring hexosyl groups |
InterPro families   | IPR008428 Chondroitin N-acetylgalactosaminyltransferase |
Orthology group | MCL12460 |
Nucleotide sequence:
ATGTTATCACGCTACGTGGTATCGCAAGTGAAACATAACTCCTACTTCCTGGTGGGGTTG
GGTATCGGTCTATGGCTCGCACTGGCGACGGTGCCACTTGAAGAGGATGTGGTGTCTTGC
GAGGACACCACTGCCGCGGCCCTGGACCCAGGCCTGGACGAATTCCAGCCGCAGCGTGAG
GAGCGACCTCCGGGCGCGGTGGGACCCGCGGGTCGCACGGTCACTAGACCACGATACTAC
AGTACAGAACTGGGCATGCGAGCCGCCCTACTGGCAGGTGTCCTGAGTTCCGAGGCAGCC
CTAGAGTCTCGCGCCGCCGCCTTCAACCAGACGGCCGCAGATCTTAAGCCCGCCCTGCGC
TTCTTCATCACGGCGAGCGCTCTACAAGGCGCCCCGGGCCGAGCCAACGTGGTGGGCTTC
ACAGACACACGCGAGATGCTGAAGCCGTTCCACGCGCTCAAGTACCTCGCCGATAACTTC
CTCGAGGAGTACGACTTCTTCTTCCTCGTGTCGGACTCCACTTTCGTGAACGCGCGTCGT
CTGAACCGGCTCGTGGCCAGCCTCAGCGTGAGCCAGGACCTGTACATGGGAGCCGTCTCC
GGCGACGACACTCACTACTGCACGCTGGAGGCCGGCATCCTCATGTCCAACTCTGTGCTG
CGAGCCGTGCACGAGGAGCTGGACTGGTGCGTCAGGAACTCCTACTCCCCGCACCACCAC
GAGAACCTGGGCCGCTGCGTGCTGCACGCGGCCGGCCTCCGGTGCGTCGCCGGCCTCCAG
GCCGTCTCTTACGACACGGCCCACCTCCGCCCCGCTCACCCGGACGGCCCCGCCAGTTTG
CACCCCGCCTTGGCGGACGCAGTAACCGTCCACCCGGCGCTGACCCCCGAGGACTTCTAC
CGCCTGCACGCCTACGTGTCCAGGGTGAACCTGGAGCGTGTTGGGGAGGACGAGGCGCGG
ACTCGAGCGGAGGCGGCGCTCAGCTCCCGTCACCATCCCCGGGGGTACAGGAACGTGTCG
TGGCCAACCGCCCTACGAGCGGACGCAGGTCTAGCGCCGCCACCCCCACCCACCAGGTTC
GACCTCCTCCGCTGGACGCGGTTTAATCTCACACACGCCCTCCAGCTGGACGACCACCGC
GCCGTCTCCAAGCTGAGCGCATCCTACAAGCAAGCCGTGGCCCTGATCGTAGAGGAGGCA
CGGGCGTGGGTGGAGCGGAGATGGGGCGGCGAGGAAGGCGGGGCGCTTTCGGTGAGCCTC
GAGGAAGGAGCGTGGTGCTGGGAGCCGCCCCGGGCGCTCCGGTACCGCCTCTTGCTGAGA
GTGACCGCGGAGGGAGGCGGGCGTCTGCTGCAAGTGGAGGCGGCGCGAGCGCTGGGAGCG
GCCCGCCTCGCACCCGCAGCCTACGTCACGGAGAGCGCCCGCGTCCACCTCGTGCTGCCA
GCCCCCGACCAGCGCTCACACCTCACCGCTTTCCTGGAGCGGTACGAGACGGTCTGCCTC
CAGAGAGACGACAACACGGCTCTGTATGTGGTCGTGATACCGGCCAGTGACGGAGGACAT
CTGACAGCAGAAGAGCGAGCTCATCTGGAGGAGGTCAAGGAGATGGTGAGGGCGGTCGGA
GAGAAACACCGCGCGGGACAACACATGGACGTTATCGTGTCCAGCATCGGGCGCGGCGCG
GGGACGGGGGGTGGTGTCTCCGGGAGTGGGGAGAGAGCGCGGGAGGACGTACGGCTCGCC
CTGAGGGCGGCACTCGTTCGGGCCGCGAAGGATGCGTTGCTGTTGGTGGCCGACCATAGC
ATGGAGTTCACCGAAGACTTCCTCAACAGGGTCCGCATGAACACGATCGCGGGCTCGCAG
TGGTTCAGTCCGCTGGCCTTCGCTCGCTTCGCGCAGTACGCTCACCCTCGCTTCGTGGAG
GCGGACGGGTCGCGGCCGACTCTCCACACGGGCCGCTTCTCTCACACCGAGCTGCTCTCC
GTGTACAAGGGCGACTACTCGGACGCTCTCCGCAGCTGGCTGGAGGCGGGAGGCTCCGAG
GAGGCGTCACCGTCCGCGGTCCTCGCCGCTAGCCCCCTACGCGTGCTGCGCGCCCCTGAG
CCGGCCCTGCTACTCCCGCCCCGGCCCCGCCCCTGCACACCCTCCTCCCCCTCCGAGGAG
AGGGCGTGTCTGGTCCGTGAGCGCGAGCGTGGTTTCTCTGACCTGTTGCTGGGCGCTCGT
CAGTCGCTCGCCAAGTTGCTGCTGCAGACTCAGGCGGAGCTCGAGTGA
Protein sequence:
MLSRYVVSQVKHNSYFLVGLGIGLWLALATVPLEEDVVSCEDTTAAALDPGLDEFQPQRE
ERPPGAVGPAGRTVTRPRYYSTELGMRAALLAGVLSSEAALESRAAAFNQTAADLKPALR
FFITASALQGAPGRANVVGFTDTREMLKPFHALKYLADNFLEEYDFFFLVSDSTFVNARR
LNRLVASLSVSQDLYMGAVSGDDTHYCTLEAGILMSNSVLRAVHEELDWCVRNSYSPHHH
ENLGRCVLHAAGLRCVAGLQAVSYDTAHLRPAHPDGPASLHPALADAVTVHPALTPEDFY
RLHAYVSRVNLERVGEDEARTRAEAALSSRHHPRGYRNVSWPTALRADAGLAPPPPPTRF
DLLRWTRFNLTHALQLDDHRAVSKLSASYKQAVALIVEEARAWVERRWGGEEGGALSVSL
EEGAWCWEPPRALRYRLLLRVTAEGGGRLLQVEAARALGAARLAPAAYVTESARVHLVLP
APDQRSHLTAFLERYETVCLQRDDNTALYVVVIPASDGGHLTAEERAHLEEVKEMVRAVG
EKHRAGQHMDVIVSSIGRGAGTGGGVSGSGERAREDVRLALRAALVRAAKDALLLVADHS
MEFTEDFLNRVRMNTIAGSQWFSPLAFARFAQYAHPRFVEADGSRPTLHTGRFSHTELLS
VYKGDYSDALRSWLEAGGSEEASPSAVLAASPLRVLRAPEPALLLPPRPRPCTPSSPSEE
RACLVRERERGFSDLLLGARQSLAKLLLQTQAELE
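
The nucleotide and protein sequences above can be cross-checked against each other: a CDS length of 2268 corresponds to 756 codons, i.e. 755 residues plus a stop codon. The following is a minimal sketch of such a check, assuming the standard genetic code applies to this model; the `translate` helper and the variable names are illustrative and not part of the database. Only the first and last lines of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (an assumption: the record does not state which code table was used).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; stop codons are shown as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence listed in this record.
# Each full line holds 60 nt (20 codons), so line breaks preserve the frame.
first_line = "ATGTTATCACGCTACGTGGTATCGCAAGTGAAACATAACTCCTACTTCCTGGTGGGGTTG"
last_line = "CAGTCGCTCGCCAAGTTGCTGCTGCAGACTCAGGCGGAGCTCGAGTGA"

print(translate(first_line))
print(translate(last_line))
```

The two printed strings should match the start and the end of the protein sequence listed above, with a trailing `*` marking the TGA stop codon. Passing the full concatenated CDS to `translate` would allow the same comparison over the whole model.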