DPGLEAN05471 in OGS1.0

New model in OGS2.0  DPOGS205975
Genomic Position  scaffold176:+ 57309-66314
CDS Length  2079
Paired RNAseq reads  496
Single RNAseq reads  1254
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA009403 (2e-26)
Best Drosophila hit  CG3253 (2e-13)
Best Human hit  glycosyltransferase-like protein LARGE1 (2e-158)
Best NR hit (blastp)  conserved hypothetical protein [Pediculus humanus corporis] (3e-176)
Best NR hit (blastx)  conserved hypothetical protein [Pediculus humanus corporis] (3e-176)
GeneOntology terms
GO:0016757 transferase activity, transferring glycosyl groups
GO:0005794 Golgi apparatus
GO:0016740 transferase activity
GO:0016020 membrane
GO:0016021 integral to membrane
GO:0006486 protein amino acid glycosylation
InterPro families  IPR002495 Glycosyl transferase, family 8
Orthology group  MCL15958

Nucleotide sequence:

ATGAATTTCACATCAAAAAATGTCTTCACTTGTCTATTTTTTCGACCCTGGCTAGGCGTG
CCGTTGGGTATATGGGTGGCTGTGTCTATTGTGTATTATTGGTGGTTAATATCCAGAATT
GATGTGTTGGAAACTGAAAATAAAGTGATCAAAAAGCAATTATCATTGGGTAGTGTTCAC
GAAGTCATTGACGGAGCCACCAGTTATACCGTGCCTGAGATCCCAGATGTTGTTGGTACT
CTGTGTGAAACGGTTCACATAGCTTTGGTGTGTATGGGGAAATGCACCCGTAACATAACA
CCGATGTTGAAGTCTCTCCTACATCATCGACAGAATCCCATACATTTCCACTTTGTAGTC
GACCCGGAATCAATGCGGACTTTAAATAAACTGTTTGAAACGTGGGATTTGCCTGATGTG
AAATACAGCTGCTATGACGCCCAAGACCGTCTGTCTGAGGTCAAATGGATCCCCAACAAC
CACTACTCTGGTGTATATGCACTCGTCAAACTTCTCTTCCCCAGCATATTGCCGGATAAA
CTGGAACAGGTTATAGTACTGGACTCTGATCTAACATTCCTCTCCGATGTTTCTCAGCTG
TGGCACATGTTCAGAAACATGAGCAGTCTGCAGTTCATAGGACTAGTTGAAAACGAAAGC
AACTGGTACACGAATCAGAAAACAAGATGGCCGGCGTTACGACGCGGATACAATACGGGT
GTCATGTTGTTAGATTTATATAAAATAAGATCACTCATAAGCTGGACTAGTGTTTGGCAG
AAGACTGTCAATGAAAATCTTAACAGGCTCAAAACCACCGCTCTGGCTGATCAAGACGTT
ATAAACGCGATCATAAAGAACCATCCCCACATTGTCTATGACATCAGCTGTCATTATAAC
GTGCAAATGTCAACTCAAACGTTAGCCAAGAACTGTTACGGTGAAGACGTTAAAAATATT
AAGATATTACATTGGAATTCTCCCAGCAAATACAACATCCGTATCCGGGACGCTGATTAT
TTTAAAAACATTCATCAATCATATGTTAATTTCGATGGCAATTTGTTGAGGGAGAAGTTG
CATCGGTGCTCTCAAAACGAAGTTGTGACATACAAGATGAACCATTCAGACCTTTGCCTA
AGCTTCCGAGCTGCTCAAAGAGTAAAGTTAAGAACGCACATATACTACATGGACTATAGC
TATATGAATGTTGACAATTTTGATGTAACTCTAGTTCTACAATTGTCTATGGACAGATTA
CAGTTTTTGGAGAGGATTGTCAAATATTGGGAAGGTCCGTTGAGTGCTGCCATCTACCTG
TCCGACTGTGAAGTTACTAAACTAGAAAGTATCCTAAGGGATTGGTCGTCAACACTTAAC
ATAAAAAATAATATTGGATACCACTTGGTGTTTAAGCATGATTCGGTACATTATCCAGTG
AACTATCTTCGTAATGTAGCCTTGGAGAACGTGAACACTCCATATGTGTTCCTTATGGAC
GCCGATTTTGTACCAATGGCCGGTCTGTACAGTTATCTGAGGGAATCAATAAAGTTGATC
AACCCGTATCCACAGAAGAAGTGTCTGGTGGTGCCGGCGTTTGAAACTCAAAGGTATAGG
GCTTCACCACCCCGGTATAAGGAGGAATTGTTATCCAGGCTCTCCATCAAACACCTTGGG
GATGTTGCTCCGTTCAGGGCCAGGGAGTGGCCCAGGGGACACAGGGCTACCAACTACATA
AGGTGGTCGACTGCCACTGCCCCGTATGAGGTGGATTGGCAATCGGACTACGAACCGTAC
CTAGTGGCACATCGCAGCATTCCTAAATACGACACGAGATTCTCTGGCTTCGGATGGAAT
AAAGTGTCTCACAGCGTTGAACTCCGTGCCCAAGGATACAAACTGTCGGTGCTGCCGGGC
GCGTTCGTGGTACACACGCCGCACGCCCCCTCCCCTGACATCACAGCCTTCAGAGCCGAT
CCGCACTATCGCGTATGTCTATCACTTTTGAAACAAGAGTTCATGGACGATCTCAAGAGG
AAGTACAACGTCACACTCGATGATAACCAAAGAAAATAG

Protein sequence:

MNFTSKNVFTCLFFRPWLGVPLGIWVAVSIVYYWWLISRIDVLETENKVIKKQLSLGSVH
EVIDGATSYTVPEIPDVVGTLCETVHIALVCMGKCTRNITPMLKSLLHHRQNPIHFHFVV
DPESMRTLNKLFETWDLPDVKYSCYDAQDRLSEVKWIPNNHYSGVYALVKLLFPSILPDK
LEQVIVLDSDLTFLSDVSQLWHMFRNMSSLQFIGLVENESNWYTNQKTRWPALRRGYNTG
VMLLDLYKIRSLISWTSVWQKTVNENLNRLKTTALADQDVINAIIKNHPHIVYDISCHYN
VQMSTQTLAKNCYGEDVKNIKILHWNSPSKYNIRIRDADYFKNIHQSYVNFDGNLLREKL
HRCSQNEVVTYKMNHSDLCLSFRAAQRVKLRTHIYYMDYSYMNVDNFDVTLVLQLSMDRL
QFLERIVKYWEGPLSAAIYLSDCEVTKLESILRDWSSTLNIKNNIGYHLVFKHDSVHYPV
NYLRNVALENVNTPYVFLMDADFVPMAGLYSYLRESIKLINPYPQKKCLVVPAFETQRYR
ASPPRYKEELLSRLSIKHLGDVAPFRAREWPRGHRATNYIRWSTATAPYEVDWQSDYEPY
LVAHRSIPKYDTRFSGFGWNKVSHSVELRAQGYKLSVLPGAFVVHTPHAPSPDITAFRAD
PHYRVCLSLLKQEFMDDLKRKYNVTLDDNQRK
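
Consistency check (illustrative sketch, not part of the original record):

The CDS length and the two sequences above can be cross-checked against each other. The sketch below assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame and ends with a single stop codon. The function names `translate` and `cds_length_for` are hypothetical helpers introduced here for illustration only.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; stop codons are written as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def cds_length_for(protein_length: int) -> int:
    """Expected CDS length for a protein, assuming one terminal stop codon."""
    return 3 * (protein_length + 1)


if __name__ == "__main__":
    # First and last codons of the nucleotide sequence listed above.
    print(translate("ATGAATTTCACATCAAAA"))
    print(translate("CAAAGAAAATAG"))
    # The protein sequence above has 692 residues.
    print(cds_length_for(692))
```

Under these assumptions the first six codons translate to the opening residues of the protein sequence (MNFTSK), the final codons give QRK followed by a stop, and a 692-residue protein corresponds to the listed CDS length of 2079 nucleotides.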