New model in OGS2.0 | DPOGS205291  |
---|---|
Genomic Position | scaffold501:+ 20062-31005 |
See gene structure | |
CDS Length | 3165 |
Paired RNAseq reads   | 3007 |
Single RNAseq reads   | 7413 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA011033 (0.0) |
Best Drosophila hit   | super sex combs, isoform C (0.0) |
Best Human hit | UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 110 kDa subunit isoform 2 (0.0) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC003916 [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC003916 [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0016757 transferase activity, transferring glycosyl groups GO:0005488 binding GO:0006486 protein amino acid glycosylation GO:0035220 wing disc development GO:0008375 acetylglucosaminyltransferase activity |
InterPro families    | IPR001440 Tetratricopeptide TPR-1 IPR019734 Tetratricopeptide repeat IPR013026 Tetratricopeptide repeat-containing IPR011990 Tetratricopeptide-like helical |
Orthology group | MCL12341 |
Nucleotide sequence:
ATGCAACCTCAAGCGAATGTTGCCGTGCCTCAATCTGTCACGACGCAGCCTCAACAAATT
GTCAGCGTTCCTGCAAATGCTGTGATCTTGAAAATGTCGGAAATTCAACAGATATCTACA
GTGGGACTCCTGGAGCTTGCACACCGGGAATATCAAGCTGGAGACTATGATAGTGCGGAA
CTGCATTGTATGCAGCTATGGCGTCAAGATGGCACAAATACGGGTGTTCTTTTGCTGCTG
TCCTCCATACATTTTCAATGTCGGCGTTTAGACAAATCAGCACATTTTTCAACGCTTGCT
ATAAAACAGAATCCTCTCCTGGCGGAGGCGTACAGTAATCTCGGAAATGTATACAAGGAG
CGTGGGCAGTTGCAAGAGGCTTTGGAAAACTATCGTCACGCTGTCCGTCTAAAGCCAGAT
TTCATTGATGGGTACATCAACTTGGCAGCTGCCTTGGTGGCTGCAGGAGACATGGAACAG
GCTGTACAGGCTTATGTTACAGCATTGCAGTATAATCCTGAACTTTACTGCGTTAGAAGT
GACCTGGGCAATTTGCTCAAGGCCCTTGGACGTTTGGACGAAGCGAAGGCTTGTTACTTG
AAGGCCATCGAAACGAGGCCAGACTTTGCAGTGGCATGGAGTAACCTAGGATGCGTTTTT
AACGCACAAAGTGAAATCTGGTTGGCCATACATCATTTTGAAAAGGCCGTGGCATTGGAT
CCGAATTTCTTGGATGCTTATATCAATCTAGGAAATGTTCTCAAAGAAGCGAGAATTTTT
GACAGGGCGGTGGCTGCATATTTACGAGCTCTTAATTTATCGCCGAACAATGCAGTTGTT
CATGGTAATTTAGCGTGCGTGTATTATGAACAAGGACTTATTGATTTAGCGATCGACACT
TATCGGCGAGCTATAGAACTTCAACCGAATTTCCCAGATGCCTACTGTAATTTGGCTAAT
GCATTAAAGGAAAAGGGTCAAGTGGTTGATGCGGAGGAATGTTATAATACTGCTCTAAGG
TTGTGCCCATCACACGCTGATTCATTAAATAACTTAGCGAACATCAAACGCGAGCAAGGA
TACATAGAGGAAGCGACTCGTTTATATTTAAAAGCTTTGGAAGTATTTCCCGAGTTTGCA
GCAGCTCATAGTAACTTGGCGTCAGTTTTGCAACAACAAGGCAAACTAAACGAAGCACTC
ATGCATTATAAAGAGGCTATACGTATACAGCCAACGTTTGCTGATGCTTATAGTAATATG
GGCAATACTCTTAAAGAAATGCAAGACGTCGCTGGAGCATTGCAGTGTTATACCCGAGCT
ATACAAATTAATCCAGCGTTTGCCGATGCTCATAGCAATCTTGCCAGTATCCACAAAGAT
TCGGGAAATATACCGGAAGCTATACAGTCCTATAGAACAGCGTTGAAGTTGAAACCGGAC
TTCCCTGACGCGTATTGTAACTTGGCGCACTGTTTGCAAATCGTTTGCGATTGGACCGAC
TACGAGGCCCGTATGAAGAAATTAGTCAGTATTGTGGCAGAACAGCTTGAAAAGAATAGA
CTACCCTCGGTTCATCCTCATCATTCTATGCTTTACCCATTGACGCATGAATTCAGAAAG
GCTATTGCGGCCCGACATGCGAATTTATGTCTGGAGAAGGTTCAAGTTCTCCACAAGCCG
GCTTACAAATTTCCAAGAGAGCTGCAAAGCCGCCTGCGTATCGGTTATGTAAGCAGTGAT
TTTGGCAATCACCCAACATCACATTTGATGCAATCTGTGCCCGGATTACACGATCGTACT
AAGGTCGAGATCTTTTGTTACGCTCTTAGTCCAGATGATGGTACAACATTCCGTTCTAAA
ATAGCTAGAGAAGCCGAGCACTTTATTGATCTATCACAGATTCCATGCAACGGCAAAGCT
GCCGATAAAATATATTCTGATGGTATTAATATTCTGGTAAACATGAACGGATACACAAAG
GGTGCCAGGAATGAAATATTTGCTCTACGTCCGGCTCCTGTGCAAGTAATGTGGCTCGGA
TATCCAGGCACAAGTGGTGCAAGTTATATGGACTACTTAGTAACTGATGCTGTAACATCT
CCAGTCGAATTGGCAAGTCAGTACAGCGAGAAGCTCGCATACATGCCTCATACATATTTC
GTCGGCGACCACAAGCAGATGTTCCCCCACTTACAGGAGAGATTGATAGTTAGTGACAAA
ATCAAATCCCATAATAACATGGGCAGTCTAGCTGATAATGTCGCCGTCATTAATGCAACT
GATTTGTCTCCACTTGTCGAAAACACTGATATCAAAGAAATTAAAGAAGTTGTAAGAGCA
GCGAGGCCGGTTGAAATATCATTGAAGGTCGCAGAGTTACCTACTACTACGCCTATAGAA
AACATGATTGCTTCGGGACAAGTACAGACATCTGTAAATGGTGTCATCCTTCAAAACGGT
CTGGCCACAACACAAACAAACAACAAAGCGGCTACAGGAGAGGAAGTGCCACAGTCTATT
GTAATCACAACAAGACAACAGTACGGTCTACCGGATGATGCAGTGGTCTACTGTAATTTC
AATCAACTGTATAAGATAGATCCGCTAACTCTACACATGTGGGTATACATATTGAAACAC
GTCCCTAACAGCGTGTTGTGGCTTTTGAGATTCCCGGCTGTCGGTGAACCTAATTTACAA
GCAACGGCGCAGCAGTTGGGATTACCTCCCGGCCGTATAATCTTCTCAAACGTGGCTGCT
AAAGAGGAGCACGTGAGGCGCGGTCAACTGGCGGACGTATGTCTAGACACACCCTTATGT
AACGGTCACACTACTAGTATGGATATTTTGTGGACAGGCACCCCCGTTGTTACATTACCA
GGAGAGACATTAGCCTCACGGGTGGCTGCATCACAACTCAATACACTTGGTTGTCCTGAA
CTGATTGCGAGAACGAGACAGGAATATCAAGACATAGCTGTACGATTAGGAACGGACAGG
GAATATCTTAAAGCAATCCGAGTGAAAGTATGGACAGCTCGCACGGAGAGTCCACTATTC
GACTGCAAAGCATACGCCACCGGTTTGGAGATGTTGTACAACAAAATGTGGTCGAGGTAC
GCTCGCAACGAGCGACCCGACCACATACAGGCCATAGACAAATAG
Protein sequence:
MQPQANVAVPQSVTTQPQQIVSVPANAVILKMSEIQQISTVGLLELAHREYQAGDYDSAE
LHCMQLWRQDGTNTGVLLLLSSIHFQCRRLDKSAHFSTLAIKQNPLLAEAYSNLGNVYKE
RGQLQEALENYRHAVRLKPDFIDGYINLAAALVAAGDMEQAVQAYVTALQYNPELYCVRS
DLGNLLKALGRLDEAKACYLKAIETRPDFAVAWSNLGCVFNAQSEIWLAIHHFEKAVALD
PNFLDAYINLGNVLKEARIFDRAVAAYLRALNLSPNNAVVHGNLACVYYEQGLIDLAIDT
YRRAIELQPNFPDAYCNLANALKEKGQVVDAEECYNTALRLCPSHADSLNNLANIKREQG
YIEEATRLYLKALEVFPEFAAAHSNLASVLQQQGKLNEALMHYKEAIRIQPTFADAYSNM
GNTLKEMQDVAGALQCYTRAIQINPAFADAHSNLASIHKDSGNIPEAIQSYRTALKLKPD
FPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQLEKNRLPSVHPHHSMLYPLTHEFRK
AIAARHANLCLEKVQVLHKPAYKFPRELQSRLRIGYVSSDFGNHPTSHLMQSVPGLHDRT
KVEIFCYALSPDDGTTFRSKIAREAEHFIDLSQIPCNGKAADKIYSDGINILVNMNGYTK
GARNEIFALRPAPVQVMWLGYPGTSGASYMDYLVTDAVTSPVELASQYSEKLAYMPHTYF
VGDHKQMFPHLQERLIVSDKIKSHNNMGSLADNVAVINATDLSPLVENTDIKEIKEVVRA
ARPVEISLKVAELPTTTPIENMIASGQVQTSVNGVILQNGLATTQTNNKAATGEEVPQSI
VITTRQQYGLPDDAVVYCNFNQLYKIDPLTLHMWVYILKHVPNSVLWLLRFPAVGEPNLQ
ATAQQLGLPPGRIIFSNVAAKEEHVRRGQLADVCLDTPLCNGHTTSMDILWTGTPVVTLP
GETLASRVAASQLNTLGCPELIARTRQEYQDIAVRLGTDREYLKAIRVKVWTARTESPLF
DCKAYATGLEMLYNKMWSRYARNERPDHIQAIDK