DPGLEAN09899 in OGS1.0

New model in OGS2.0  DPOGS207662
Genomic Position  scaffold975:- 35477-38199
CDS Length  1500
Paired RNAseq reads  1546
Single RNAseq reads  3719
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010523 (0.0)
Best Drosophila hit  UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 2, isoform A (0.0)
Best Human hit  N-acetylgalactosaminyltransferase 7 (2e-116)
Best NR hit (blastp)  PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium castaneum] (0.0)
Gene Ontology terms
GO:0004653 polypeptide N-acetylgalactosaminyltransferase activity
GO:0009312 oligosaccharide biosynthetic process
GO:0005795 Golgi stack
InterPro families
IPR000772 Ricin B lectin
IPR008997 Ricin B-related lectin
IPR001173 Glycosyl transferase, family 2
Orthology group  MCL14117

Nucleotide sequence:

ATGCCTCAGGACAGAGCCAATGATATAGCTGAGTCGGAAAGCGAATACGGCATGAACATA
GCTGCATCTAATGATATTGCTATGAACAGATCAATTCCGGACACTCGTCTGGATGAATGT
AAATATTGGCACTATCCTGAAGAACTGCCGAGTACATCAGTAATTATTGTGTTCCACAAC
GAAGGTTTCTCGGTGCTCATGAGGACCGTGCACACTGTCATAGATCGTTCACCGCCTAAC
ATATTGAAGGAGGTTGTTATGGTTGACGATTTTTCAGATAAAGACGATTTAAAAGAAAAC
TTAGACAACTATGTTAAACGTTGGAAAGGCAAAGTGAGAATAATAAGAAACAGTGAAAGA
CAGGGTCTGATACGTACCAGATCGAGAGGGGCTATGGAAGCGACGGGGGAGGTCATAGTA
TTTTTGGACGCTCACTGCGAGGTCAACGTCAACTGGTTACCGCCACTACTCGCTCCCATA
TACAGGGACTACAAGATCATGACCGTACCAGTTATAGATGGTATCGACCACAAAACCTTC
GAATACAGACCGGTTTACTCGCATGGTATTAATTATAGAGGTATATTCGAATGGGGTATG
CTTTACAAAGAAAACGAAGTACCTGACAGGGAAGCCAGTTTGCACAAACATAAATCTGAA
CCATACAAAAGTCCTACCCACGCTGGTGGTCTTTTCGCTATAAACAGGAATTATTTCCTT
GAAATCGGTGCATACGATCCCGGTCTTTTGGTATGGGGTGGAGAGAATTTCGAATTAAGC
TTCAAGATTTGGCAATGCGGCGGTAGTATTGAATGGGTGCCATGCTCCAGGGTCGGTCAC
GTGTATAGAGCCTTCATGCCGTACTCGTTCGGAAATCTAGCTAAAAACCGGAAAGGATCT
CTCATCACAATTAATTACAAACGGGTCATTGAAACTTGGTTCGATGAGGAGCATAAGGAA
TTTTTCTATACAAGGGAACCCATGGCCAGGTTTCTGGATATGGGCGACATCAGTGAACAA
GTAGCCCTGAGGGACAAATTGAACTGCAAGAGCTTCAGTTGGTACATGGAGAATGTCGCT
TATGACGTATATGATAAATTCCCCAAATTACCCAAAAATGTTCATTGGGGTATGGTGAAG
AATAAAGCAATCGGCCTGTGTCTAGATACTATGGGAAAAGCAGCTCCTTCATATATTGGT
ATACAGTCCTGTCATGGGGCTGGGAACAATCAGCTGTACAGATTGAATGAGGCGGGACAG
TTGGGTGTTGGCGAGAGATGTCTGGAAGCCGATACGGACAGCCTCAAACAGACGATCTGC
CGGCTAGGGACTGTTGACGGACCTTGGAGGTACGACAAAGAGCGCAGCCATCTGATACAC
AGGTTGCACAGCTATTGTCTGACCCTGCAGCCCAATTCCAGAACACTTGGTCTGGCTCCT
TGCGACCCCAACAATACTTATCAACAGTGGACCATAACGCAGAAGAACCCCAAGTGGTGA

Protein sequence:

MPQDRANDIAESESEYGMNIAASNDIAMNRSIPDTRLDECKYWHYPEELPSTSVIIVFHN
EGFSVLMRTVHTVIDRSPPNILKEVVMVDDFSDKDDLKENLDNYVKRWKGKVRIIRNSER
QGLIRTRSRGAMEATGEVIVFLDAHCEVNVNWLPPLLAPIYRDYKIMTVPVIDGIDHKTF
EYRPVYSHGINYRGIFEWGMLYKENEVPDREASLHKHKSEPYKSPTHAGGLFAINRNYFL
EIGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPCSRVGHVYRAFMPYSFGNLAKNRKGS
LITINYKRVIETWFDEEHKEFFYTREPMARFLDMGDISEQVALRDKLNCKSFSWYMENVA
YDVYDKFPKLPKNVHWGMVKNKAIGLCLDTMGKAAPSYIGIQSCHGAGNNQLYRLNEAGQ
LGVGERCLEADTDSLKQTICRLGTVDGPWRYDKERSHLIHRLHSYCLTLQPNSRTLGLAP
CDPNNTYQQWTITQKNPKW
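
Consistency check (sketch):

The CDS length and the protein sequence above can be cross-checked with simple arithmetic. The snippet below is a minimal sketch, not part of the original record. It assumes that the listed CDS length of 1500 includes the terminal TGA stop codon and that the protein sequence shown has 499 residues. The function name `expected_protein_length` is hypothetical and was chosen for illustration only.

```python
def expected_protein_length(cds_length: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of the given length.

    Assumption: the CDS is in frame, so its length is a multiple of 3, and
    the final codon is a stop codon unless has_stop_codon is False.
    """
    if cds_length <= 0 or cds_length % 3 != 0:
        raise ValueError("CDS length must be a positive multiple of 3")
    codons = cds_length // 3
    # The stop codon occupies one codon but adds no residue to the protein.
    return codons - 1 if has_stop_codon else codons


# Values taken from this record: CDS Length 1500, protein of 499 residues.
cds_length = 1500
protein_length = 499
is_consistent = expected_protein_length(cds_length) == protein_length
print(is_consistent)
```

Under these assumptions, 1500 nt corresponds to 500 codons, that is, 499 residues plus one stop codon, which matches the protein sequence listed above.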