New model in OGS2.0 | DPOGS200445  |
---|---|
Genomic Position | scaffold1611:+ 36205-51547 |
See gene structure | |
CDS Length | 1881 |
Paired RNAseq reads   | 2439 |
Single RNAseq reads   | 6154 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008904 (3e-122) |
Best Drosophila hit   | polypeptide GalNAc transferase 5, isoform A (0.0) |
Best Human hit | polypeptide N-acetylgalactosaminyltransferase 13 (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to polypeptide GalNAc transferase 5 CG31651-PA [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to polypeptide GalNAc transferase 5 CG31651-PA [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0004653 polypeptide N-acetylgalactosaminyltransferase activity GO:0009312 oligosaccharide biosynthetic process GO:0005795 Golgi stack |
InterPro families    | IPR008997 Ricin B-related lectin IPR000772 Ricin B lectin IPR001173 Glycosyl transferase, family 2 |
Orthology group | MCL10863 |
Nucleotide sequence:
ATGTTTAGAAGCAAAATAAGGATACACACATGTCGCATAATTTTACTCACATCATTAGTG
TGGTTATTAGTTGATGTTGCCTTGTTAGCACTCTATTCAGATTGTTTTGGTGATGGATGG
GATTGCAATAAAAATAAAAATCTAAATAATGACTACACAATCAAGACAAATGATGAGTAC
AAAGGAAAGAAAGCAGCCATAGCAGCAGCTTTACAGAAGGATGAAGATTTTGATGAGACA
AATTTAGAAGAGAATGAAGTGGAACATGACATGGGTGATGATGGTTTAATTTTACCACCC
TATCCAAAATCACAGCTTAGGAGATGGGCCCCAGCACCATTTGTTAAACCCCAGGAAGAG
ACTCCTGGTGAAATGGGTAAGGCAGTTAACATACCGATTGAACAAGAAAAAGTGATGTTG
GAAAAGTTCCAAGAGAATCAGTTCAATTTACTCGCAAGTGACATGATATCACTGAACAGA
TCACTCACTGATGTTAGATTTGAAAAATGTAAAGCCAAACGCTATCCGACACTTTTGCCG
ACGACGAGTGTAGTTATAGTTTTCCATAATGAAGCGTGGACTACACTACTTAGGACAATA
TGGAGTACAATCAATCGGTCTCCCAGACCGCTGTTGAAGGAGATCATTCTCGTCGACGAT
GCCAGCGAAAAAGAACATCTAGGTAAGAAATTGGAAGAATATATAAAGACCCTGCCAGTT
TCTACCCGGTTGTTCCGTACAGAGAGTCGATCGGGTTTAATAAGAGCCAGATTGCTTGGA
GCCAAACACGTTAAAGGGGATGTCATAACGTTTTTGGACGCTCATTGTGAATGTACCGAG
GGATGGTTGGAGCCGTTGTTATCACGGATCGTTGAGGACAGGAGTACGGTGGTGTGTCCT
ATTATAGATGTTATATCGGACACAACCTTCGAATATATACAGGCGTCTGATATGACCTGG
GGCGGATTCAACTGGAAACTGAACTTTAGATGGTATCGTGTCCCAGAACGCGAGATGCAG
CGCCGTGGTGGTGACCGCACCGCTCCTCTGCGTACACCCACCATGGCTGGCGGCTTGTTC
GCCATCGATCGTGAATACTTCTACAAGATAGGATCCTATGATGAGGGCATGGATATATGG
GGTGGGGAGAACTTGGAGATGAGCTTCAGGGTATGGCAGTGCGGCGGCGTGCTGGAGATC
GTTCCGTGCTCTCACGTGGGCCACGTGTTCAGGGACAAGTCCCCCTACTCCTTCCCCGGG
GGGGTACAGGCCGTGGTGCTGAAGAACGCGGCCAGGGTCGCAGAAGTTTGGATGGACGAA
TGGGGGGAATTCTATTACGCCATGAACCCAGGCGCTCTCAACGTACCCGTGGGCGACGTG
AGCGAGCGGAAGGCGCTCCGTGAGCGTCTCAAGTGTAAAAGCTTCAGGTGGTACCTCGAA
AACATATATCCAGAAAGTCAAATGCCATTGGATTATTACTATTTGGGAGAGATACGGAAC
GCGGAAACATCGAACTGTTTGGATACATTGGGTGGGAAGGCCGGGCAGCCGCTGGGTATG
GGATACTGTCACGGGATGGGGGGAAACCAGGTGTTCGCGTATACTAAACGCAAGCAGATC
ATGTCGGATGACAATTGTTTGGACGCAGCTCACCCTCGCGGACCAATCAAGCTGATACGA
TGTCATGGGATGAGGGGAAATCAAGAGTGGACGTATGATACTAAGAGCCGTACAATAAAG
CACACCAACACTGGCATGTGTCTCGACAAGCCAGAGTCTACAGACGTTTGGAAGCCGGTG
TTGAGGTCCTGCGACAGGTCCAGAGGTCAACAGTGGCTGATGCAGGTCGACTTCAAGTGG
CAAGCGAGGCATTCCAGCTAG
Protein sequence:
MFRSKIRIHTCRIILLTSLVWLLVDVALLALYSDCFGDGWDCNKNKNLNNDYTIKTNDEY
KGKKAAIAAALQKDEDFDETNLEENEVEHDMGDDGLILPPYPKSQLRRWAPAPFVKPQEE
TPGEMGKAVNIPIEQEKVMLEKFQENQFNLLASDMISLNRSLTDVRFEKCKAKRYPTLLP
TTSVVIVFHNEAWTTLLRTIWSTINRSPRPLLKEIILVDDASEKEHLGKKLEEYIKTLPV
STRLFRTESRSGLIRARLLGAKHVKGDVITFLDAHCECTEGWLEPLLSRIVEDRSTVVCP
IIDVISDTTFEYIQASDMTWGGFNWKLNFRWYRVPEREMQRRGGDRTAPLRTPTMAGGLF
AIDREYFYKIGSYDEGMDIWGGENLEMSFRVWQCGGVLEIVPCSHVGHVFRDKSPYSFPG
GVQAVVLKNAARVAEVWMDEWGEFYYAMNPGALNVPVGDVSERKALRERLKCKSFRWYLE
NIYPESQMPLDYYYLGEIRNAETSNCLDTLGGKAGQPLGMGYCHGMGGNQVFAYTKRKQI
MSDDNCLDAAHPRGPIKLIRCHGMRGNQEWTYDTKSRTIKHTNTGMCLDKPESTDVWKPV
LRSCDRSRGQQWLMQVDFKWQARHSS