New model in OGS2.0 | DPOGS205676  |
---|---|
Genomic Position | scaffold5485:- 2277-6409 |
See gene structure | |
CDS Length | 2004 |
Paired RNAseq reads   | 926 |
Single RNAseq reads   | 2203 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001024 (0.0) |
Best Drosophila hit   | CG4050, isoform B (5e-178) |
Best Human hit | transmembrane and TPR repeat-containing protein 3 (2e-156) |
Best NR hit (blastp)   | PREDICTED: similar to GA17918-PA [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to CG4050-PA, isoform A isoform 1 [Apis mellifera] (0.0) |
GeneOntology terms    | GO:0016262 protein N-acetylglucosaminyltransferase activity GO:0017122 protein N-acetylglucosaminyltransferase complex GO:0008150 biological_process GO:0005488 binding |
InterPro families    | IPR011990 Tetratricopeptide-like helical IPR019734 Tetratricopeptide repeat IPR001440 Tetratricopeptide TPR-1 IPR013618 Domain of unknown function DUF1736 IPR013026 Tetratricopeptide repeat-containing |
Orthology group | MCL13655 |
Nucleotide sequence:
ATGCTGTGTAAAGAACAAGGCATCACGGTCACCGCCGTCTGCGTGGTCTACGAATTATTC
GTAGCGCAAAAGCGACGTGGCAGCCCTGGTCGTTCTTGGCTGGAGTATTGGGGTAAGGGA
GGGTGGGGATGTGCCTGTGGGGAAGCAGCCCGCAGGGTTGCCACCGTCTGCTGTGCTACA
CTGGCGCTTCTGGCTGCCAGACTACACGTTATGGGAGCACAGCTGCCAGTTTTCACACGA
TTCGACAACCCTGCTGGAGCCTCGCCCCCACCTGCGAGACATCTAACCTTCGCCTATCTT
CCCGCGTTGAACGCCTGGCTGCTGACTCTGCCGGAGGCGTTGTGCTGTGACTGGACCATG
GGTACTGTGGCTTTGTTACGATCGTGGAGCGACCCACGGAATATGGCCACGGCGGGTCTC
GCAGTCATGCTTGTCGCTGGAACCATACATGCCTTGAGGACCAGATCTTCGGCATTATCG
ATGGGTCTTGCGCTACTTGTTTTGCCGTTTCTTCCCGCATCAAACCTCTTCTTCCCCGTG
GGTTTCGTTGTGGCTGAGCGTGTTTTATACATGCCGTCTATGGGCTGGTGTCTGTTAGTG
GCACACGGTTGGAGGCTTGTGGCCAGGAAACGAGCGAAACTGGCCGCAGCTTCACTCGTG
TTCCTCCTGCTGGCTTTTAGTGCCAAAACATACGTCAGAAACTGGGACTGGAAGACTGAA
TATACAATATTTGCATCGGGACTGAAGGTGAATCGTAATAACGCCAAGCTATACAACAAC
GTGGGTCACGCTTTAGAAGCTGAAGGGAAATACGGAGAAGCTTTGGAATTCTTCAAAATT
GCTGTGAACGTCCAACCAGACGACGTTGGAGCCCATATCAACGTTGGAAGAACTTTCAAT
CATTTAGGCAAATATCAGGAGGCAGAAGCCGCTTACGTGAAAGCCAAATCTCTCCTACCG
AAAGCCAAACCCGGGGAATCTTACCAAGCTAGAATAGCCCCCAATCACTTGAACGTCTTC
CTTAATTTGGCTAATTTGATATCCAAAAACGCGACACGATTAGAAGAAGCTGACATGTTG
TATCGGCAAGCGATCAGTATGAGAGCTGATTACACACAGGCCTATATAAACAGGGGTGAT
ATTTTAATTAAATTAAACAGGACCAAGGAGGCCCAGGAGGTCTACGAACGGGCGCTGTTG
TATGACAGTGGGAACCCTGACATTTATTACAATCTGGGGGTAGTGCTACTTGAACAAGGC
AAGGCGTCCCAGGCGCTGGCGTATTTGGACAAAGCTCTGGAACTCGAACCGGAACATGAA
CAGGCATTACTGAACTCTGCCATACTTCTGCAAGAGCTGGGAGCTGCAGACTTGAGACAC
CTTGCCAGACAAAGATTACTCAAATTGTTGGACAAAGATGCCACTAACGAGCGCGTCCAC
TTTAACCTCGGCATGGTGTGTATGGATGAGGGAGACGCGGAGTGCGCTGAACGCTGGTTC
AGGGCCGCGGTTCATCTTAAACCGGACTTCCGCTCCGCTCTCTTCAACCTGGCTTTACTA
CTAGCTGACAGACGAAGACCCCTGGAGGCCGCGCCTTTCCTAAAACAATTGGTCAGACAT
CACCCCGATCATGTGAAAGCCCTAGTACTGTTGGGAGACATTTACATCAATTCGGTCAAG
GATTTGGATGCTGCTGAAAGTTGCTATCGACGCATCCTCGAACTAGAACCAGACAACGTG
CAAGCTCTCCACAATTTATGCGTTGTTGCTGTAGAAAGAGGGAAGTTAGCCGTTGCTGAA
GAGTGTCTTACAAGAGCCGCGGTTTTGGCGCCACACGAACATTACATACAGCGCCATCTA
GCGGTAGTACGCGCGAGACTGGCAGCTGTCTCGCTCACACACCCCAACACCCGACCATCC
GACGCATCCGACGCACAAGTGCGAGCGAGATGGAACTACATCCCCCAACAACCCCCCGAC
CCACACGAGTCGGATCCCTCATAG
Protein sequence:
MLCKEQGITVTAVCVVYELFVAQKRRGSPGRSWLEYWGKGGWGCACGEAARRVATVCCAT
LALLAARLHVMGAQLPVFTRFDNPAGASPPPARHLTFAYLPALNAWLLTLPEALCCDWTM
GTVALLRSWSDPRNMATAGLAVMLVAGTIHALRTRSSALSMGLALLVLPFLPASNLFFPV
GFVVAERVLYMPSMGWCLLVAHGWRLVARKRAKLAAASLVFLLLAFSAKTYVRNWDWKTE
YTIFASGLKVNRNNAKLYNNVGHALEAEGKYGEALEFFKIAVNVQPDDVGAHINVGRTFN
HLGKYQEAEAAYVKAKSLLPKAKPGESYQARIAPNHLNVFLNLANLISKNATRLEEADML
YRQAISMRADYTQAYINRGDILIKLNRTKEAQEVYERALLYDSGNPDIYYNLGVVLLEQG
KASQALAYLDKALELEPEHEQALLNSAILLQELGAADLRHLARQRLLKLLDKDATNERVH
FNLGMVCMDEGDAECAERWFRAAVHLKPDFRSALFNLALLLADRRRPLEAAPFLKQLVRH
HPDHVKALVLLGDIYINSVKDLDAAESCYRRILELEPDNVQALHNLCVVAVERGKLAVAE
ECLTRAAVLAPHEHYIQRHLAVVRARLAAVSLTHPNTRPSDASDAQVRARWNYIPQQPPD
PHESDPS