New model in OGS2.0 | DPOGS207662  |
---|---|
Genomic Position | scaffold975:- 35477-38199 |
See gene structure | |
CDS Length | 1500 |
Paired RNAseq reads   | 1546 |
Single RNAseq reads   | 3719 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010523 (0.0) |
Best Drosophila hit   | UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 2, isoform A (0.0) |
Best Human hit | N-acetylgalactosaminyltransferase 7 (2e-116) |
Best NR hit (blastp)   | PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0004653 polypeptide N-acetylgalactosaminyltransferase activity GO:0009312 oligosaccharide biosynthetic process GO:0005795 Golgi stack |
InterPro families    | IPR000772 Ricin B lectin IPR008997 Ricin B-related lectin IPR001173 Glycosyl transferase, family 2 |
Orthology group | MCL14117 |
Nucleotide sequence:
ATGCCTCAGGACAGAGCCAATGATATAGCTGAGTCGGAAAGCGAATACGGCATGAACATA
GCTGCATCTAATGATATTGCTATGAACAGATCAATTCCGGACACTCGTCTGGATGAATGT
AAATATTGGCACTATCCTGAAGAACTGCCGAGTACATCAGTAATTATTGTGTTCCACAAC
GAAGGTTTCTCGGTGCTCATGAGGACCGTGCACACTGTCATAGATCGTTCACCGCCTAAC
ATATTGAAGGAGGTTGTTATGGTTGACGATTTTTCAGATAAAGACGATTTAAAAGAAAAC
TTAGACAACTATGTTAAACGTTGGAAAGGCAAAGTGAGAATAATAAGAAACAGTGAAAGA
CAGGGTCTGATACGTACCAGATCGAGAGGGGCTATGGAAGCGACGGGGGAGGTCATAGTA
TTTTTGGACGCTCACTGCGAGGTCAACGTCAACTGGTTACCGCCACTACTCGCTCCCATA
TACAGGGACTACAAGATCATGACCGTACCAGTTATAGATGGTATCGACCACAAAACCTTC
GAATACAGACCGGTTTACTCGCATGGTATTAATTATAGAGGTATATTCGAATGGGGTATG
CTTTACAAAGAAAACGAAGTACCTGACAGGGAAGCCAGTTTGCACAAACATAAATCTGAA
CCATACAAAAGTCCTACCCACGCTGGTGGTCTTTTCGCTATAAACAGGAATTATTTCCTT
GAAATCGGTGCATACGATCCCGGTCTTTTGGTATGGGGTGGAGAGAATTTCGAATTAAGC
TTCAAGATTTGGCAATGCGGCGGTAGTATTGAATGGGTGCCATGCTCCAGGGTCGGTCAC
GTGTATAGAGCCTTCATGCCGTACTCGTTCGGAAATCTAGCTAAAAACCGGAAAGGATCT
CTCATCACAATTAATTACAAACGGGTCATTGAAACTTGGTTCGATGAGGAGCATAAGGAA
TTTTTCTATACAAGGGAACCCATGGCCAGGTTTCTGGATATGGGCGACATCAGTGAACAA
GTAGCCCTGAGGGACAAATTGAACTGCAAGAGCTTCAGTTGGTACATGGAGAATGTCGCT
TATGACGTATATGATAAATTCCCCAAATTACCCAAAAATGTTCATTGGGGTATGGTGAAG
AATAAAGCAATCGGCCTGTGTCTAGATACTATGGGAAAAGCAGCTCCTTCATATATTGGT
ATACAGTCCTGTCATGGGGCTGGGAACAATCAGCTGTACAGATTGAATGAGGCGGGACAG
TTGGGTGTTGGCGAGAGATGTCTGGAAGCCGATACGGACAGCCTCAAACAGACGATCTGC
CGGCTAGGGACTGTTGACGGACCTTGGAGGTACGACAAAGAGCGCAGCCATCTGATACAC
AGGTTGCACAGCTATTGTCTGACCCTGCAGCCCAATTCCAGAACACTTGGTCTGGCTCCT
TGCGACCCCAACAATACTTATCAACAGTGGACCATAACGCAGAAGAACCCCAAGTGGTGA
Protein sequence:
MPQDRANDIAESESEYGMNIAASNDIAMNRSIPDTRLDECKYWHYPEELPSTSVIIVFHN
EGFSVLMRTVHTVIDRSPPNILKEVVMVDDFSDKDDLKENLDNYVKRWKGKVRIIRNSER
QGLIRTRSRGAMEATGEVIVFLDAHCEVNVNWLPPLLAPIYRDYKIMTVPVIDGIDHKTF
EYRPVYSHGINYRGIFEWGMLYKENEVPDREASLHKHKSEPYKSPTHAGGLFAINRNYFL
EIGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPCSRVGHVYRAFMPYSFGNLAKNRKGS
LITINYKRVIETWFDEEHKEFFYTREPMARFLDMGDISEQVALRDKLNCKSFSWYMENVA
YDVYDKFPKLPKNVHWGMVKNKAIGLCLDTMGKAAPSYIGIQSCHGAGNNQLYRLNEAGQ
LGVGERCLEADTDSLKQTICRLGTVDGPWRYDKERSHLIHRLHSYCLTLQPNSRTLGLAP
CDPNNTYQQWTITQKNPKW