DPGLEAN16737 in OGS1.0

New model in OGS2.0  DPOGS208641
Genomic Position  scaffold594:+ 30099-42553
CDS Length  2793
Paired RNAseq reads  671
Single RNAseq reads  1624
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA007776 (0.0)
Best Drosophila hit  brother of tout-velu (0.0)
Best Human hit  exostosin-like 3 (0.0)
Best NR hit (blastp)  PREDICTED: similar to brother of tout-velu CG15110-PA [Apis mellifera] (0.0)
Best NR hit (blastx)  exostosin-2 [Aedes aegypti] (0.0)
GeneOntology terms
GO:0008375 acetylglucosaminyltransferase activity
GO:0007367 segment polarity determination
GO:0015014 heparan sulfate proteoglycan biosynthetic process, polysaccharide chain biosynthetic process
GO:0042328 heparan sulfate N-acetylglucosaminyltransferase activity
GO:0016021 integral to membrane
GO:0015012 heparan sulfate proteoglycan biosynthetic process
GO:0007224 smoothened signaling pathway
GO:0008101 decapentaplegic receptor signaling pathway
GO:0016055 Wnt receptor signaling pathway
GO:0031227 intrinsic to endoplasmic reticulum membrane
GO:0006024 glycosaminoglycan biosynthetic process
GO:0005783 endoplasmic reticulum
GO:0005794 Golgi apparatus
GO:0001888 glucuronyl-galactosyl-proteoglycan 4-alpha-N-acetylglucosaminyltransferase activity
GO:0006044 N-acetylglucosamine metabolic process
InterPro families
IPR015338 EXTL2, alpha-1,4-N-acetylhexosaminyltransferase
IPR004263 Exostosin-like
Orthology group  MCL11532

Nucleotide sequence:

ATGGCTGATGAAATCGAAATTGAAGAACATGATGTTTTAGAAAACCCTCGTATCAAACGG
ATTTTTCCTGATTTATCAATACTTAAAAAAGAAATAGGTGATAATGATGTCTTGATTCTG
TTTGTTGTGCCGCTCTTTACACATTATTATTTGTCTAAGTATGAATCATCATCAATGACA
TTGGGTTCCAATAACATGCGACACACACTAGAAGCCCTCGGAGACTTATCAGCCGTCAAT
ATCGGAGACCTTAAGATAAGGATAGAAGAAATGCTTAGAATTAAGGCGTCAGTGTCCACG
GAGTTGCGTGAATTAGAAGAACGACGAGGGAAACTGCAAAAAGAAGCGGCAGCTGCTAGT
GCAAACGCAGACAGTGTTAAGGCTGAGTATGCACGCGCGACTGCTGAATTGCAGAGATTG
AGGGTATCCGCAGACCAAGCTCGACTAGCCCAGTTGGAGGCTATACGACGGGATTCCCCT
GAACTCGCCCCACCACTGCCAATCTTACCTTCGTCCCCACCACCCATTCTCCCACCTGCT
ACATCAACATCTGAACTACACTGTCGAATGCATTCATGCTTTGATCACTCCCGCTGTTCA
CTAACTTCCGGTTTCCCAGTGTACTTCTATGATCCTGATGTATTCTCTCCTCTCATTGGG
GCGGAGGTGGATGGTTTTCTCAAAACCACATTACGACAAACGTTAAGCTACAATTCACAC
CTTACTCAAAACCCTAATGAAGCATGCGTCTATCTCGTGCTGGTCGGCGAAGGATTTCCT
TCTGACAAGACTCAAACTTCCACGAAAAAGCTGTTGTTGAATGAGACAGCAATCAAAAGT
CTGCCATATTGGGGCGGAGACGGGCGTAACCATGTGTTACTAAATCTGGCTCGTCGCGAT
CTATCCGTCGGTTCCGGAGACGCGTTTCTGGATTCGTCGACTGGTAGAGCGATGATAGCG
CAGTCTACGTTTACATTGCAACAGTTCCGGCCAGGATTTGACCTGGTGACACCTCCAGCC
CTCGGACCTCCTGGAGGAGACGTATGGTCAGACTGCGCGCCTATGGCACCGGCAAGACGT
CTATACATACTTAGTTTTCAGGGTTCACAGACTCCAGCGGCAGGGTCCCACGTAGATGAC
GATCAGTCACTCATCGAGTCTCTGAGGAAGATGGTCAGCCAGGCTCCTTCTTCTGATGTG
TTTCTATTGCAATTCGACTGCGACCCGCCTATCGACAAGCGTGCGGTCCTTCCGATCGGT
GACTGGGGACTCTGCGGCACCGATCGGTCGAGACGAGCCGTTCTTAGAGATTCCACTTTC
GTATTAATATTGGCACCGGCTGACGGAGATTATGCTTCAACAGCTCTCCTGCAAGCGAGG
CTATATGAAGCGCTACGCTCCGGAGCTATACCCGTCATACTTGGGGGTGATCGTATACAG
CTGCCGTATAGCGAAGTTTTAGACTGGCGAAGGGCTACATTATCCCTCCCGAAAGCTCGC
GTCACTGAGTTACATTTTCTGCTGAGAGCTCTATCGGATGCAGATTTACTAGCGTTCCGT
AGACAGGGACGTTTGTTATGGGAGAGATATTTAAGTTCGGTACAAGCTAGTATGGACTCG
CTCCTGGCTACTATACGGACTCGTTTGAACATTCCTCCACATTCAGCGGCACCGACTATG
GGTGTGCCGGCGTTCAATGACACCTTCTATCCACCGAAAATTGAACCGCCGGCCGTGGAC
ACTGAGCCCGAAGAGACCCTCGGGCCTTTAGAAGCTCCTTATCCGAGTCCGGCCTATAGA
CGTAATTACTCGGTGTCTCTATTAAACGGTTACGAACTATGGAATGACTGGGGAGAGCCG
TTCGCACTGTTTCCTCAATTGCCTTGGGATCCGCCGGTAACATCGGAAGCCCGGTTCATG
GGTTCCGCAGCAGGTTTCCGACCAATCGGAGCAGGAGCCGGGGGTTCTGGGAAGGAGTTC
AGCGAAGCTCTAGGAGGTGACCGGCCGAGGGAACAGTTCACTATTGTCATCCTCACGTAT
GAGAGGGAAGCCGTTCTGGCAGCGGCACTGGCGAGGCTCCGGGGTCTACCGTACTTGAAT
AAGGTGGTGGTTGTATGGAACGGAGTGAACCCACCACTCTCGTCCCAGTCGTGGCCGGAG
TCGGGCGCGCCGGTGGCGGTGGTGCGGGCTCCTCGCAACTCATTGAACAACCGCTTCCTA
CCATACAACGTGATCGACACTGAGGCCGTTCTCTGCGTAGACGATGACGCGCATTTGAGA
CACGATGAGATAGTCTTCGCGTTTAGAGTCTGGCGTGAACATCGCGATCGTATAGTGGGC
TTCCCTGGGAGGTACCACGCGTGGGATCTCAACTTCAATAATGGATTCCTTTACAACTCT
AACTACAGTTGTGAGCTGAGTATGGTGTTAACCGGGGCGGCGTTCGTGCACCGCTACTAT
TTGTGGTCGTACTGGCGTCTGCTGCCCGCCGCTGTCCGGGACTACGTCGACCAGTACATG
AACTGCGAGGACATCGCTATGAACTTCCTAGTGGCTCACATCACGAGGAAACCGCCGGTC
AAGGTGACATCTCGTTGGACGTTCCGTTGTCCTGGTTGCCCTGTGACGCTGTCAGCGGAC
GAGACCCATTTCCACGAGCGACACAAATGCATTCAGTTCTTCTCCCAGGTGTTTGGTTAC
ACTCCACTTCTGTCGACACAGTACAGAGCTGATTCCGTACTTTTTAAGACGAGGATATCA
CACGACAAGCAGAAGTGCTTTAAATTCATTTAA

Protein sequence:

MADEIEIEEHDVLENPRIKRIFPDLSILKKEIGDNDVLILFVVPLFTHYYLSKYESSSMT
LGSNNMRHTLEALGDLSAVNIGDLKIRIEEMLRIKASVSTELRELEERRGKLQKEAAAAS
ANADSVKAEYARATAELQRLRVSADQARLAQLEAIRRDSPELAPPLPILPSSPPPILPPA
TSTSELHCRMHSCFDHSRCSLTSGFPVYFYDPDVFSPLIGAEVDGFLKTTLRQTLSYNSH
LTQNPNEACVYLVLVGEGFPSDKTQTSTKKLLLNETAIKSLPYWGGDGRNHVLLNLARRD
LSVGSGDAFLDSSTGRAMIAQSTFTLQQFRPGFDLVTPPALGPPGGDVWSDCAPMAPARR
LYILSFQGSQTPAAGSHVDDDQSLIESLRKMVSQAPSSDVFLLQFDCDPPIDKRAVLPIG
DWGLCGTDRSRRAVLRDSTFVLILAPADGDYASTALLQARLYEALRSGAIPVILGGDRIQ
LPYSEVLDWRRATLSLPKARVTELHFLLRALSDADLLAFRRQGRLLWERYLSSVQASMDS
LLATIRTRLNIPPHSAAPTMGVPAFNDTFYPPKIEPPAVDTEPEETLGPLEAPYPSPAYR
RNYSVSLLNGYELWNDWGEPFALFPQLPWDPPVTSEARFMGSAAGFRPIGAGAGGSGKEF
SEALGGDRPREQFTIVILTYEREAVLAAALARLRGLPYLNKVVVVWNGVNPPLSSQSWPE
SGAPVAVVRAPRNSLNNRFLPYNVIDTEAVLCVDDDAHLRHDEIVFAFRVWREHRDRIVG
FPGRYHAWDLNFNNGFLYNSNYSCELSMVLTGAAFVHRYYLWSYWRLLPAAVRDYVDQYM
NCEDIAMNFLVAHITRKPPVKVTSRWTFRCPGCPVTLSADETHFHERHKCIQFFSQVFGY
TPLLSTQYRADSVLFKTRISHDKQKCFKFI
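
Consistency check (illustrative sketch):

The nucleotide and protein sequences above can be cross-checked against the reported CDS length. The following is a minimal sketch, not part of the original record: it assumes the standard genetic code, and the names CODON_TABLE and translate are hypothetical helpers introduced here for illustration. It translates two short fragments copied from the first and last lines of the nucleotide sequence and confirms that 2793 nt corresponds to 930 residues plus one stop codon.

```python
# Standard genetic code in TCAG order (assumed; no alternative
# translation table is mentioned in the record).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds):
    """Translate a CDS string codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 nt and last 12 nt of the nucleotide sequence listed above.
cds_start = "ATGGCTGATGAAATCGAAATT"
cds_end = "AAATTCATTTAA"

print(translate(cds_start))  # MADEIEI (matches the start of the protein)
print(translate(cds_end))    # KFI (matches the end of the protein)

# 930 residues plus one stop codon account for the reported CDS length.
print(3 * (930 + 1))         # 2793
```

Passing the full nucleotide sequence to translate should reproduce the 930-residue protein sequence, provided the standard-code assumption holds for this gene model.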