New model in OGS2.0 | DPOGS200434  |
---|---|
Genomic Position | scaffold2511:+ 11451-16847 |
See gene structure | |
CDS Length | 1803 |
Paired RNAseq reads   | 435 |
Single RNAseq reads   | 966 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008895 (0.0) |
Best Drosophila hit   | polypeptide GalNAc transferase 3 (2e-112) |
Best Human hit | polypeptide N-acetylgalactosaminyltransferase 13 (7e-96) |
Best NR hit (blastp)   | PREDICTED: similar to AGAP008229-PA [Acyrthosiphon pisum] (1e-146) |
Best NR hit (blastx)   | PREDICTED: similar to AGAP008229-PA [Acyrthosiphon pisum] (9e-138) |
GeneOntology terms    | GO:0004653 polypeptide N-acetylgalactosaminyltransferase activity GO:0005795 Golgi stack GO:0009312 oligosaccharide biosynthetic process GO:0016266 O-glycan processing GO:0006493 protein amino acid O-linked glycosylation GO:0008376 acetylgalactosaminyltransferase activity GO:0033628 regulation of cell adhesion mediated by integrin |
InterPro families    | IPR001173 Glycosyl transferase, family 2 IPR000772 Ricin B lectin IPR008997 Ricin B-related lectin |
Orthology group | MCL16166 |
Nucleotide sequence:
ATGTACATCACAAATAAGAAAATTCAGTCTTTCAAATGCAGAGGGCGTAAGAAAAAAATT
GTTGGTTTAGTTATAGTATTATTATTTTTGATAAATGCAATGTTTTATGTGAGTTTAGAG
TTATTAAATACAATCAAAAGGAGAAACAAGATTGCCTGGTATAATAATTTATCATATGAC
CAAGATTCTTATATGGATAGAAGTGGTATGAGGGTGATTGTTGGTCATTATGTTGGAGGT
CATGGAGGGGGTAATTTATCTGAAGATGTCATAAACACAAATCACTATTCTCCGGTACAA
GGAGCTGGCGAGGGTGGCCGACCAGTCCAGCTTCTCCCCAAGGAGATCATACCGGCTAGG
GAGCTTTACAGCTTACATTCCTATAACATATTTGTTAGTGATAGGATATCTATAAATAGA
CATCTTCCGGATATGAGGAGTGAAAGTTGCAGAAATGTGAAATACGATATAGAAAATCTC
CCTACAGCAAGTGTCATAATAGTCTTCCACAACGAAGCTTGGTCCACTCTCATGAGGACT
GTCATGTCTGTTATATTGAGGTCACCAGATATGTTATTGAAGGAGATAATCCTAGTAGAT
GATGCTAGTGAAAGAAAATATTTAGGTAAGGAGCTGGACGATGCTGTTGCCAATTTAGAT
AAAGTGGTTATATTGAGAAGTTTGAATAGGACTGGTTTAGTTGGTGCTAGGCTCATGGGC
GCCAAAACAGCCACGGGGAACGTATTAGTTTTTTTAGATGCACATTGTGAGGTAACAAAA
GGTTGGTTGGAACCGCTTTTGGATAGGGCTGGGAGTGATGACGTTTTTATATGTCCTCAT
ATCGATCTGTTGTCCGATGATACATTGGCTTACACAAAGAGTATTGACGCTCACTGGGGC
GCTTTTAGCTGGCGTCTACACTTCAGATGGCTGATGCCAAGTAATGAGATAATGATGAAT
AAATCCAGGTATCCTTCCAAACCGTTTCCAACACCGGCCATGGCTGGCGGATTATTTGCT
GTAAGAAAAAGTTTGTTCTGGCGTCTGGGTGGCTATGACGAGGAAATGTCGATTTGGGGT
GGAGAAAATCTGGAACTTTCATGGCGAGCGTGGCAATGTGGTGCCAGAGTAGAGATAACG
CATTGCTCAAGAGTTGGTCATATATTTAGACGCCATAGCCCATACAAATACCCTGGGGGA
GTATTTAAAGTGCTCAATACAAATTTAGCGCGAGCAGCAACAGTTTGGATGGATGAATGG
GCAGATTTCTTCTTCAAATTTAACCCATCTGTGGCCGCCATACGCGATACGCAAAATGTA
GCAAATCGTATCGAGCTGCGGAAAAATTTGAAATGTAAGAGCTTTAAGTGGTATCTGGAA
AATGTGTGGCCCAAAAATTTCTTTCCCAGTGATGAAAGGTGGTTTGGCAGAATACGGAAT
GATAAAGAAGGTTGTATAGGCGTCGTTGGTGGAACACCAGGCTTGGGAGGTCCCGCATCC
GGTGTACATTGCGGTAGTGATCTTGATCTTGACAGACTGTTAGTCTACACCCCCGATGGT
AATATAATGGCGGACGAAGGTCTATGCCTCCAGCAAGGAAATGGGAGATCTGTATGGAAA
AGCTGTAGTGAAAATAAGAAACAAATATGGAAGCAGAAAGGTCCAAGGTTGGTAACTCTA
GATGGACTATGCTTGACGATGCTTAAAGCGGATGACAAACAGCCCTTTGGTGCTTTAACA
GCTAAGAGATGTCTCAACGATAGTCGACAAATATGGCATTTCGAAAGAGTGCCCTGGCGA
TGA
Protein sequence:
MYITNKKIQSFKCRGRKKKIVGLVIVLLFLINAMFYVSLELLNTIKRRNKIAWYNNLSYD
QDSYMDRSGMRVIVGHYVGGHGGGNLSEDVINTNHYSPVQGAGEGGRPVQLLPKEIIPAR
ELYSLHSYNIFVSDRISINRHLPDMRSESCRNVKYDIENLPTASVIIVFHNEAWSTLMRT
VMSVILRSPDMLLKEIILVDDASERKYLGKELDDAVANLDKVVILRSLNRTGLVGARLMG
AKTATGNVLVFLDAHCEVTKGWLEPLLDRAGSDDVFICPHIDLLSDDTLAYTKSIDAHWG
AFSWRLHFRWLMPSNEIMMNKSRYPSKPFPTPAMAGGLFAVRKSLFWRLGGYDEEMSIWG
GENLELSWRAWQCGARVEITHCSRVGHIFRRHSPYKYPGGVFKVLNTNLARAATVWMDEW
ADFFFKFNPSVAAIRDTQNVANRIELRKNLKCKSFKWYLENVWPKNFFPSDERWFGRIRN
DKEGCIGVVGGTPGLGGPASGVHCGSDLDLDRLLVYTPDGNIMADEGLCLQQGNGRSVWK
SCSENKKQIWKQKGPRLVTLDGLCLTMLKADDKQPFGALTAKRCLNDSRQIWHFERVPWR