DPGLEAN17358 in OGS1.0

New model in OGS2.0DPOGS200445 
Genomic Positionscaffold1611:+ 36205-51547
See gene structure
CDS Length1881
Paired RNAseq reads  2439
Single RNAseq reads  6154
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008904 (3e-122)
Best Drosophila hit  polypeptide GalNAc transferase 5, isoform A (0.0)
Best Human hitpolypeptide N-acetylgalactosaminyltransferase 13 (0.0)
Best NR hit (blastp)  PREDICTED: similar to polypeptide GalNAc transferase 5 CG31651-PA [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to polypeptide GalNAc transferase 5 CG31651-PA [Tribolium castaneum] (0.0)
GeneOntology terms

  
GO:0004653 polypeptide N-acetylgalactosaminyltransferase activity
GO:0009312 oligosaccharide biosynthetic process
GO:0005795 Golgi stack
InterPro families

  
IPR008997 Ricin B-related lectin
IPR000772 Ricin B lectin
IPR001173 Glycosyl transferase, family 2
Orthology groupMCL10863

Nucleotide sequence:

ATGTTTAGAAGCAAAATAAGGATACACACATGTCGCATAATTTTACTCACATCATTAGTG
TGGTTATTAGTTGATGTTGCCTTGTTAGCACTCTATTCAGATTGTTTTGGTGATGGATGG
GATTGCAATAAAAATAAAAATCTAAATAATGACTACACAATCAAGACAAATGATGAGTAC
AAAGGAAAGAAAGCAGCCATAGCAGCAGCTTTACAGAAGGATGAAGATTTTGATGAGACA
AATTTAGAAGAGAATGAAGTGGAACATGACATGGGTGATGATGGTTTAATTTTACCACCC
TATCCAAAATCACAGCTTAGGAGATGGGCCCCAGCACCATTTGTTAAACCCCAGGAAGAG
ACTCCTGGTGAAATGGGTAAGGCAGTTAACATACCGATTGAACAAGAAAAAGTGATGTTG
GAAAAGTTCCAAGAGAATCAGTTCAATTTACTCGCAAGTGACATGATATCACTGAACAGA
TCACTCACTGATGTTAGATTTGAAAAATGTAAAGCCAAACGCTATCCGACACTTTTGCCG
ACGACGAGTGTAGTTATAGTTTTCCATAATGAAGCGTGGACTACACTACTTAGGACAATA
TGGAGTACAATCAATCGGTCTCCCAGACCGCTGTTGAAGGAGATCATTCTCGTCGACGAT
GCCAGCGAAAAAGAACATCTAGGTAAGAAATTGGAAGAATATATAAAGACCCTGCCAGTT
TCTACCCGGTTGTTCCGTACAGAGAGTCGATCGGGTTTAATAAGAGCCAGATTGCTTGGA
GCCAAACACGTTAAAGGGGATGTCATAACGTTTTTGGACGCTCATTGTGAATGTACCGAG
GGATGGTTGGAGCCGTTGTTATCACGGATCGTTGAGGACAGGAGTACGGTGGTGTGTCCT
ATTATAGATGTTATATCGGACACAACCTTCGAATATATACAGGCGTCTGATATGACCTGG
GGCGGATTCAACTGGAAACTGAACTTTAGATGGTATCGTGTCCCAGAACGCGAGATGCAG
CGCCGTGGTGGTGACCGCACCGCTCCTCTGCGTACACCCACCATGGCTGGCGGCTTGTTC
GCCATCGATCGTGAATACTTCTACAAGATAGGATCCTATGATGAGGGCATGGATATATGG
GGTGGGGAGAACTTGGAGATGAGCTTCAGGGTATGGCAGTGCGGCGGCGTGCTGGAGATC
GTTCCGTGCTCTCACGTGGGCCACGTGTTCAGGGACAAGTCCCCCTACTCCTTCCCCGGG
GGGGTACAGGCCGTGGTGCTGAAGAACGCGGCCAGGGTCGCAGAAGTTTGGATGGACGAA
TGGGGGGAATTCTATTACGCCATGAACCCAGGCGCTCTCAACGTACCCGTGGGCGACGTG
AGCGAGCGGAAGGCGCTCCGTGAGCGTCTCAAGTGTAAAAGCTTCAGGTGGTACCTCGAA
AACATATATCCAGAAAGTCAAATGCCATTGGATTATTACTATTTGGGAGAGATACGGAAC
GCGGAAACATCGAACTGTTTGGATACATTGGGTGGGAAGGCCGGGCAGCCGCTGGGTATG
GGATACTGTCACGGGATGGGGGGAAACCAGGTGTTCGCGTATACTAAACGCAAGCAGATC
ATGTCGGATGACAATTGTTTGGACGCAGCTCACCCTCGCGGACCAATCAAGCTGATACGA
TGTCATGGGATGAGGGGAAATCAAGAGTGGACGTATGATACTAAGAGCCGTACAATAAAG
CACACCAACACTGGCATGTGTCTCGACAAGCCAGAGTCTACAGACGTTTGGAAGCCGGTG
TTGAGGTCCTGCGACAGGTCCAGAGGTCAACAGTGGCTGATGCAGGTCGACTTCAAGTGG
CAAGCGAGGCATTCCAGCTAG

Protein sequence:

MFRSKIRIHTCRIILLTSLVWLLVDVALLALYSDCFGDGWDCNKNKNLNNDYTIKTNDEY
KGKKAAIAAALQKDEDFDETNLEENEVEHDMGDDGLILPPYPKSQLRRWAPAPFVKPQEE
TPGEMGKAVNIPIEQEKVMLEKFQENQFNLLASDMISLNRSLTDVRFEKCKAKRYPTLLP
TTSVVIVFHNEAWTTLLRTIWSTINRSPRPLLKEIILVDDASEKEHLGKKLEEYIKTLPV
STRLFRTESRSGLIRARLLGAKHVKGDVITFLDAHCECTEGWLEPLLSRIVEDRSTVVCP
IIDVISDTTFEYIQASDMTWGGFNWKLNFRWYRVPEREMQRRGGDRTAPLRTPTMAGGLF
AIDREYFYKIGSYDEGMDIWGGENLEMSFRVWQCGGVLEIVPCSHVGHVFRDKSPYSFPG
GVQAVVLKNAARVAEVWMDEWGEFYYAMNPGALNVPVGDVSERKALRERLKCKSFRWYLE
NIYPESQMPLDYYYLGEIRNAETSNCLDTLGGKAGQPLGMGYCHGMGGNQVFAYTKRKQI
MSDDNCLDAAHPRGPIKLIRCHGMRGNQEWTYDTKSRTIKHTNTGMCLDKPESTDVWKPV
LRSCDRSRGQQWLMQVDFKWQARHSS