DPGLEAN13871 in OGS1.0

New model in OGS2.0DPOGS200434 
Genomic Positionscaffold2511:+ 11451-16847
See gene structure
CDS Length1803
Paired RNAseq reads  435
Single RNAseq reads  966
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008895 (0.0)
Best Drosophila hit  polypeptide GalNAc transferase 3 (2e-112)
Best Human hitpolypeptide N-acetylgalactosaminyltransferase 13 (7e-96)
Best NR hit (blastp)  PREDICTED: similar to AGAP008229-PA [Acyrthosiphon pisum] (1e-146)
Best NR hit (blastx)  PREDICTED: similar to AGAP008229-PA [Acyrthosiphon pisum] (9e-138)
GeneOntology terms





  
GO:0004653 polypeptide N-acetylgalactosaminyltransferase activity
GO:0005795 Golgi stack
GO:0009312 oligosaccharide biosynthetic process
GO:0016266 O-glycan processing
GO:0006493 protein amino acid O-linked glycosylation
GO:0008376 acetylgalactosaminyltransferase activity
GO:0033628 regulation of cell adhesion mediated by integrin
InterPro families

  
IPR001173 Glycosyl transferase, family 2
IPR000772 Ricin B lectin
IPR008997 Ricin B-related lectin
Orthology groupMCL16166

Nucleotide sequence:

ATGTACATCACAAATAAGAAAATTCAGTCTTTCAAATGCAGAGGGCGTAAGAAAAAAATT
GTTGGTTTAGTTATAGTATTATTATTTTTGATAAATGCAATGTTTTATGTGAGTTTAGAG
TTATTAAATACAATCAAAAGGAGAAACAAGATTGCCTGGTATAATAATTTATCATATGAC
CAAGATTCTTATATGGATAGAAGTGGTATGAGGGTGATTGTTGGTCATTATGTTGGAGGT
CATGGAGGGGGTAATTTATCTGAAGATGTCATAAACACAAATCACTATTCTCCGGTACAA
GGAGCTGGCGAGGGTGGCCGACCAGTCCAGCTTCTCCCCAAGGAGATCATACCGGCTAGG
GAGCTTTACAGCTTACATTCCTATAACATATTTGTTAGTGATAGGATATCTATAAATAGA
CATCTTCCGGATATGAGGAGTGAAAGTTGCAGAAATGTGAAATACGATATAGAAAATCTC
CCTACAGCAAGTGTCATAATAGTCTTCCACAACGAAGCTTGGTCCACTCTCATGAGGACT
GTCATGTCTGTTATATTGAGGTCACCAGATATGTTATTGAAGGAGATAATCCTAGTAGAT
GATGCTAGTGAAAGAAAATATTTAGGTAAGGAGCTGGACGATGCTGTTGCCAATTTAGAT
AAAGTGGTTATATTGAGAAGTTTGAATAGGACTGGTTTAGTTGGTGCTAGGCTCATGGGC
GCCAAAACAGCCACGGGGAACGTATTAGTTTTTTTAGATGCACATTGTGAGGTAACAAAA
GGTTGGTTGGAACCGCTTTTGGATAGGGCTGGGAGTGATGACGTTTTTATATGTCCTCAT
ATCGATCTGTTGTCCGATGATACATTGGCTTACACAAAGAGTATTGACGCTCACTGGGGC
GCTTTTAGCTGGCGTCTACACTTCAGATGGCTGATGCCAAGTAATGAGATAATGATGAAT
AAATCCAGGTATCCTTCCAAACCGTTTCCAACACCGGCCATGGCTGGCGGATTATTTGCT
GTAAGAAAAAGTTTGTTCTGGCGTCTGGGTGGCTATGACGAGGAAATGTCGATTTGGGGT
GGAGAAAATCTGGAACTTTCATGGCGAGCGTGGCAATGTGGTGCCAGAGTAGAGATAACG
CATTGCTCAAGAGTTGGTCATATATTTAGACGCCATAGCCCATACAAATACCCTGGGGGA
GTATTTAAAGTGCTCAATACAAATTTAGCGCGAGCAGCAACAGTTTGGATGGATGAATGG
GCAGATTTCTTCTTCAAATTTAACCCATCTGTGGCCGCCATACGCGATACGCAAAATGTA
GCAAATCGTATCGAGCTGCGGAAAAATTTGAAATGTAAGAGCTTTAAGTGGTATCTGGAA
AATGTGTGGCCCAAAAATTTCTTTCCCAGTGATGAAAGGTGGTTTGGCAGAATACGGAAT
GATAAAGAAGGTTGTATAGGCGTCGTTGGTGGAACACCAGGCTTGGGAGGTCCCGCATCC
GGTGTACATTGCGGTAGTGATCTTGATCTTGACAGACTGTTAGTCTACACCCCCGATGGT
AATATAATGGCGGACGAAGGTCTATGCCTCCAGCAAGGAAATGGGAGATCTGTATGGAAA
AGCTGTAGTGAAAATAAGAAACAAATATGGAAGCAGAAAGGTCCAAGGTTGGTAACTCTA
GATGGACTATGCTTGACGATGCTTAAAGCGGATGACAAACAGCCCTTTGGTGCTTTAACA
GCTAAGAGATGTCTCAACGATAGTCGACAAATATGGCATTTCGAAAGAGTGCCCTGGCGA
TGA

Protein sequence:

MYITNKKIQSFKCRGRKKKIVGLVIVLLFLINAMFYVSLELLNTIKRRNKIAWYNNLSYD
QDSYMDRSGMRVIVGHYVGGHGGGNLSEDVINTNHYSPVQGAGEGGRPVQLLPKEIIPAR
ELYSLHSYNIFVSDRISINRHLPDMRSESCRNVKYDIENLPTASVIIVFHNEAWSTLMRT
VMSVILRSPDMLLKEIILVDDASERKYLGKELDDAVANLDKVVILRSLNRTGLVGARLMG
AKTATGNVLVFLDAHCEVTKGWLEPLLDRAGSDDVFICPHIDLLSDDTLAYTKSIDAHWG
AFSWRLHFRWLMPSNEIMMNKSRYPSKPFPTPAMAGGLFAVRKSLFWRLGGYDEEMSIWG
GENLELSWRAWQCGARVEITHCSRVGHIFRRHSPYKYPGGVFKVLNTNLARAATVWMDEW
ADFFFKFNPSVAAIRDTQNVANRIELRKNLKCKSFKWYLENVWPKNFFPSDERWFGRIRN
DKEGCIGVVGGTPGLGGPASGVHCGSDLDLDRLLVYTPDGNIMADEGLCLQQGNGRSVWK
SCSENKKQIWKQKGPRLVTLDGLCLTMLKADDKQPFGALTAKRCLNDSRQIWHFERVPWR