New model in OGS2.0 | DPOGS207165 |
---|---|
Genomic Position | scaffold7:- 977725-982597 |
See gene structure | |
CDS Length | 1551 |
Paired RNAseq reads | 513 |
Single RNAseq reads | 1249 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000621 (2e-66) |
Best Drosophila hit | CG31915 (2e-111) |
Best Human hit | procollagen galactosyltransferase 2 precursor (6e-91) |
Best NR hit (blastp) | PREDICTED: similar to CG31915-PA [Apis mellifera] (1e-126) |
Best NR hit (blastx) | PREDICTED: similar to Glycosyltransferase 25 family member [Tribolium castaneum] (2e-129) |
GeneOntology terms | GO:0008475 procollagen-lysine 5-dioxygenase activity GO:0005575 cellular_component GO:0009103 lipopolysaccharide biosynthetic process |
InterPro families | IPR002654 Glycosyl transferase, family 25 |
Orthology group | MCL10759 |
Nucleotide sequence:
ATGAATTTGGACTATCCGAAAGATAGGATATTCTTATGGTTCCGCAGTGACTACAACAGC
GATCATTCAGTTGATGTTTTGCGTGATTTTGTTAATAAATTCGGAACCTTATACAACAGA
GTACATCTTTCATATAACACGTCTAAACAAAAGTTCGACGATGAATTGTCACCGACCCAT
TGGAGCCATAGTCGATTTATGCATTTGATTAAGTGGAGGGAAATGGGAATTAAATTTGCG
AAGCGACAATGGGCCGATTATGTATTTATGCTGGACGCAGATGTGTTTCTCACGAACCCC
CAGACGCTACGGCATCTCATCCAAAAACAACTCCGTGTGGTGGCGCCAATGCTCGTCTCA
GATCGATATTACTCCAACTTCTGGTTGTCCGTTGACGACGACTTCAACTATCGTCTAAAT
CACGAGGATGAATTCTATCCATTATATGAGTACAACGAATTGTACATGGGATGTCATATA
GTTCCAGTGATATACGGGGCGGTACTAATGGATCTTCGATCAAAGAAATCGGACTATATA
ACCTATGATCCCTACAAAATAGTCGATTACTTGGGCCCGCTGCAGGACCACATTATATTT
GCCGTGAACGCCATGAGGAACAATATATCGCTACACATTTGCAACGACGATTTCTTCGGT
TACATCACCCGGCCAATAAAAGAAGGCGAACCACTTGAAAGGGATGTATTGCATCTCACC
AACCTGAAGCTGTCAGCGATTGCACGCAGCAAGCCGCTTCAATACCACTACAAGCTACAG
CGTTTTGTGTACTACCCTCCTTCTCTGGATTACCAAGTCGACAAGATTTATATGATCAAC
CTCGAACGAAGACCCGATAAAAGGAAGCTGATGGAACAGAGCTTCAAGGAATTGGGCATG
AATGTTACACGTGTTGAAGCTGTCGACGGCAAGAGCCTGGATCCGAAAAAACTTCAAAAT
ATGAACGTTACCTTGATGCCTGGATATGAAGATGCCTACTATAAACGCCCTATGACCTAC
GGCGAGATCGGATGTTTCTTGAGCCATTACAAGATTTGGGTCGAGGTTGCGGAGAGAAAC
TACAACAGAGTGTTGATTTTGGAAGACGACGTTAATTTCTTGCCTTATTTCAAGGAAAAC
TATGATACGATTATATGGGAGTCTTCTGTTCTGAAACATGATTTTATCTACCTCGGCCGC
AAAATTATGATGGATAAGGTTGAGATTAGAATGACGACGCATCTCACTAAGCCGCTGTAC
TCTTACTGGACCATTGGTTACATTATAACAAAATTGGGTGCTGAGAAACTGATCGAGGCT
AAACCTTTGAGCAAATTGTTACCAGTCGATGAGTTCCTACCTATTATGTTCGATCAACAT
CCAGATAAAAAATATAAAGAATTCTTCCCCAATCGGAATCTGAACGCGCTGGCTGCGAGT
CCGTCTATCATTTCACCGACACACTACACGGGCATGCCGGGATACATCAGTGACACTGAA
GATTCTATCCCTCTGTCTACTGAGCCCTGCACTGATAATCCAGAACTTTAA
Protein sequence:
MNLDYPKDRIFLWFRSDYNSDHSVDVLRDFVNKFGTLYNRVHLSYNTSKQKFDDELSPTH
WSHSRFMHLIKWREMGIKFAKRQWADYVFMLDADVFLTNPQTLRHLIQKQLRVVAPMLVS
DRYYSNFWLSVDDDFNYRLNHEDEFYPLYEYNELYMGCHIVPVIYGAVLMDLRSKKSDYI
TYDPYKIVDYLGPLQDHIIFAVNAMRNNISLHICNDDFFGYITRPIKEGEPLERDVLHLT
NLKLSAIARSKPLQYHYKLQRFVYYPPSLDYQVDKIYMINLERRPDKRKLMEQSFKELGM
NVTRVEAVDGKSLDPKKLQNMNVTLMPGYEDAYYKRPMTYGEIGCFLSHYKIWVEVAERN
YNRVLILEDDVNFLPYFKENYDTIIWESSVLKHDFIYLGRKIMMDKVEIRMTTHLTKPLY
SYWTIGYIITKLGAEKLIEAKPLSKLLPVDEFLPIMFDQHPDKKYKEFFPNRNLNALAAS
PSIISPTHYTGMPGYISDTEDSIPLSTEPCTDNPEL