New model in OGS2.0 | DPOGS208641 |
---|---|
Genomic Position | scaffold594:+ 30099-42553 |
See gene structure | |
CDS Length | 2793 |
Paired RNAseq reads | 671 |
Single RNAseq reads | 1624 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007776 (0.0) |
Best Drosophila hit | brother of tout-velu (0.0) |
Best Human hit | exostosin-like 3 (0.0) |
Best NR hit (blastp) | PREDICTED: similar to brother of tout-velu CG15110-PA [Apis mellifera] (0.0) |
Best NR hit (blastx) | exostosin-2 [Aedes aegypti] (0.0) |
GeneOntology terms | GO:0008375 acetylglucosaminyltransferase activity GO:0007367 segment polarity determination GO:0015014 heparan sulfate proteoglycan biosynthetic process, polysaccharide chain biosynthetic process GO:0042328 heparan sulfate N-acetylglucosaminyltransferase activity GO:0016021 integral to membrane GO:0015012 heparan sulfate proteoglycan biosynthetic process GO:0007224 smoothened signaling pathway GO:0008101 decapentaplegic receptor signaling pathway GO:0016055 Wnt receptor signaling pathway GO:0031227 intrinsic to endoplasmic reticulum membrane GO:0006024 glycosaminoglycan biosynthetic process GO:0005783 endoplasmic reticulum GO:0005794 Golgi apparatus GO:0001888 glucuronyl-galactosyl-proteoglycan 4-alpha-N-acetylglucosaminyltransferase activity GO:0006044 N-acetylglucosamine metabolic process |
InterPro families | IPR015338 EXTL2, alpha-1,4-N-acetylhexosaminyltransferase IPR004263 Exostosin-like |
Orthology group | MCL11532 |
Nucleotide sequence:
ATGGCTGATGAAATCGAAATTGAAGAACATGATGTTTTAGAAAACCCTCGTATCAAACGG
ATTTTTCCTGATTTATCAATACTTAAAAAAGAAATAGGTGATAATGATGTCTTGATTCTG
TTTGTTGTGCCGCTCTTTACACATTATTATTTGTCTAAGTATGAATCATCATCAATGACA
TTGGGTTCCAATAACATGCGACACACACTAGAAGCCCTCGGAGACTTATCAGCCGTCAAT
ATCGGAGACCTTAAGATAAGGATAGAAGAAATGCTTAGAATTAAGGCGTCAGTGTCCACG
GAGTTGCGTGAATTAGAAGAACGACGAGGGAAACTGCAAAAAGAAGCGGCAGCTGCTAGT
GCAAACGCAGACAGTGTTAAGGCTGAGTATGCACGCGCGACTGCTGAATTGCAGAGATTG
AGGGTATCCGCAGACCAAGCTCGACTAGCCCAGTTGGAGGCTATACGACGGGATTCCCCT
GAACTCGCCCCACCACTGCCAATCTTACCTTCGTCCCCACCACCCATTCTCCCACCTGCT
ACATCAACATCTGAACTACACTGTCGAATGCATTCATGCTTTGATCACTCCCGCTGTTCA
CTAACTTCCGGTTTCCCAGTGTACTTCTATGATCCTGATGTATTCTCTCCTCTCATTGGG
GCGGAGGTGGATGGTTTTCTCAAAACCACATTACGACAAACGTTAAGCTACAATTCACAC
CTTACTCAAAACCCTAATGAAGCATGCGTCTATCTCGTGCTGGTCGGCGAAGGATTTCCT
TCTGACAAGACTCAAACTTCCACGAAAAAGCTGTTGTTGAATGAGACAGCAATCAAAAGT
CTGCCATATTGGGGCGGAGACGGGCGTAACCATGTGTTACTAAATCTGGCTCGTCGCGAT
CTATCCGTCGGTTCCGGAGACGCGTTTCTGGATTCGTCGACTGGTAGAGCGATGATAGCG
CAGTCTACGTTTACATTGCAACAGTTCCGGCCAGGATTTGACCTGGTGACACCTCCAGCC
CTCGGACCTCCTGGAGGAGACGTATGGTCAGACTGCGCGCCTATGGCACCGGCAAGACGT
CTATACATACTTAGTTTTCAGGGTTCACAGACTCCAGCGGCAGGGTCCCACGTAGATGAC
GATCAGTCACTCATCGAGTCTCTGAGGAAGATGGTCAGCCAGGCTCCTTCTTCTGATGTG
TTTCTATTGCAATTCGACTGCGACCCGCCTATCGACAAGCGTGCGGTCCTTCCGATCGGT
GACTGGGGACTCTGCGGCACCGATCGGTCGAGACGAGCCGTTCTTAGAGATTCCACTTTC
GTATTAATATTGGCACCGGCTGACGGAGATTATGCTTCAACAGCTCTCCTGCAAGCGAGG
CTATATGAAGCGCTACGCTCCGGAGCTATACCCGTCATACTTGGGGGTGATCGTATACAG
CTGCCGTATAGCGAAGTTTTAGACTGGCGAAGGGCTACATTATCCCTCCCGAAAGCTCGC
GTCACTGAGTTACATTTTCTGCTGAGAGCTCTATCGGATGCAGATTTACTAGCGTTCCGT
AGACAGGGACGTTTGTTATGGGAGAGATATTTAAGTTCGGTACAAGCTAGTATGGACTCG
CTCCTGGCTACTATACGGACTCGTTTGAACATTCCTCCACATTCAGCGGCACCGACTATG
GGTGTGCCGGCGTTCAATGACACCTTCTATCCACCGAAAATTGAACCGCCGGCCGTGGAC
ACTGAGCCCGAAGAGACCCTCGGGCCTTTAGAAGCTCCTTATCCGAGTCCGGCCTATAGA
CGTAATTACTCGGTGTCTCTATTAAACGGTTACGAACTATGGAATGACTGGGGAGAGCCG
TTCGCACTGTTTCCTCAATTGCCTTGGGATCCGCCGGTAACATCGGAAGCCCGGTTCATG
GGTTCCGCAGCAGGTTTCCGACCAATCGGAGCAGGAGCCGGGGGTTCTGGGAAGGAGTTC
AGCGAAGCTCTAGGAGGTGACCGGCCGAGGGAACAGTTCACTATTGTCATCCTCACGTAT
GAGAGGGAAGCCGTTCTGGCAGCGGCACTGGCGAGGCTCCGGGGTCTACCGTACTTGAAT
AAGGTGGTGGTTGTATGGAACGGAGTGAACCCACCACTCTCGTCCCAGTCGTGGCCGGAG
TCGGGCGCGCCGGTGGCGGTGGTGCGGGCTCCTCGCAACTCATTGAACAACCGCTTCCTA
CCATACAACGTGATCGACACTGAGGCCGTTCTCTGCGTAGACGATGACGCGCATTTGAGA
CACGATGAGATAGTCTTCGCGTTTAGAGTCTGGCGTGAACATCGCGATCGTATAGTGGGC
TTCCCTGGGAGGTACCACGCGTGGGATCTCAACTTCAATAATGGATTCCTTTACAACTCT
AACTACAGTTGTGAGCTGAGTATGGTGTTAACCGGGGCGGCGTTCGTGCACCGCTACTAT
TTGTGGTCGTACTGGCGTCTGCTGCCCGCCGCTGTCCGGGACTACGTCGACCAGTACATG
AACTGCGAGGACATCGCTATGAACTTCCTAGTGGCTCACATCACGAGGAAACCGCCGGTC
AAGGTGACATCTCGTTGGACGTTCCGTTGTCCTGGTTGCCCTGTGACGCTGTCAGCGGAC
GAGACCCATTTCCACGAGCGACACAAATGCATTCAGTTCTTCTCCCAGGTGTTTGGTTAC
ACTCCACTTCTGTCGACACAGTACAGAGCTGATTCCGTACTTTTTAAGACGAGGATATCA
CACGACAAGCAGAAGTGCTTTAAATTCATTTAA
Protein sequence:
MADEIEIEEHDVLENPRIKRIFPDLSILKKEIGDNDVLILFVVPLFTHYYLSKYESSSMT
LGSNNMRHTLEALGDLSAVNIGDLKIRIEEMLRIKASVSTELRELEERRGKLQKEAAAAS
ANADSVKAEYARATAELQRLRVSADQARLAQLEAIRRDSPELAPPLPILPSSPPPILPPA
TSTSELHCRMHSCFDHSRCSLTSGFPVYFYDPDVFSPLIGAEVDGFLKTTLRQTLSYNSH
LTQNPNEACVYLVLVGEGFPSDKTQTSTKKLLLNETAIKSLPYWGGDGRNHVLLNLARRD
LSVGSGDAFLDSSTGRAMIAQSTFTLQQFRPGFDLVTPPALGPPGGDVWSDCAPMAPARR
LYILSFQGSQTPAAGSHVDDDQSLIESLRKMVSQAPSSDVFLLQFDCDPPIDKRAVLPIG
DWGLCGTDRSRRAVLRDSTFVLILAPADGDYASTALLQARLYEALRSGAIPVILGGDRIQ
LPYSEVLDWRRATLSLPKARVTELHFLLRALSDADLLAFRRQGRLLWERYLSSVQASMDS
LLATIRTRLNIPPHSAAPTMGVPAFNDTFYPPKIEPPAVDTEPEETLGPLEAPYPSPAYR
RNYSVSLLNGYELWNDWGEPFALFPQLPWDPPVTSEARFMGSAAGFRPIGAGAGGSGKEF
SEALGGDRPREQFTIVILTYEREAVLAAALARLRGLPYLNKVVVVWNGVNPPLSSQSWPE
SGAPVAVVRAPRNSLNNRFLPYNVIDTEAVLCVDDDAHLRHDEIVFAFRVWREHRDRIVG
FPGRYHAWDLNFNNGFLYNSNYSCELSMVLTGAAFVHRYYLWSYWRLLPAAVRDYVDQYM
NCEDIAMNFLVAHITRKPPVKVTSRWTFRCPGCPVTLSADETHFHERHKCIQFFSQVFGY
TPLLSTQYRADSVLFKTRISHDKQKCFKFI