New model in OGS2.0 | DPOGS207165  |
---|---|
Genomic Position | scaffold7:- 977725-982597 |
CDS Length | 1551 |
Paired RNAseq reads   | 513 |
Single RNAseq reads   | 1249 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA000621 (2e-66) |
Best Drosophila hit   | CG31915 (2e-111) |
Best Human hit | procollagen galactosyltransferase 2 precursor (6e-91) |
Best NR hit (blastp)   | PREDICTED: similar to CG31915-PA [Apis mellifera] (1e-126) |
Best NR hit (blastx)   | PREDICTED: similar to Glycosyltransferase 25 family member [Tribolium castaneum] (2e-129) |
GeneOntology terms    | GO:0008475 procollagen-lysine 5-dioxygenase activity; GO:0005575 cellular_component; GO:0009103 lipopolysaccharide biosynthetic process |
InterPro families   | IPR002654 Glycosyl transferase, family 25 |
Orthology group | MCL10759 |
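
The genomic position and CDS length above can be compared with a short script. This is a minimal sketch, not part of the database record: the helper name `parse_position` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re

def parse_position(text):
    """Split a 'scaffold:strand start-end' string into its parts.

    Hypothetical helper; assumes the layout used in this record.
    """
    m = re.fullmatch(r"\s*(\S+):([+-])\s*(\d+)-(\d+)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    return m.group(1), m.group(2), int(m.group(3)), int(m.group(4))

scaffold, strand, start, end = parse_position("scaffold7:- 977725-982597")

# Assumption: coordinates are 1-based and inclusive.
span = end - start + 1
cds_length = 1551  # "CDS Length" field of the record

print(scaffold, strand, span)   # scaffold7 - 4873
print(span - cds_length)        # 3322 bases of the locus are not coding sequence
```

Under that assumption the locus spans 4873 bases on the minus strand, so roughly two thirds of it lies outside the 1551-base CDS, which is consistent with a gene model that contains introns.
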
Nucleotide sequence:
ATGAATTTGGACTATCCGAAAGATAGGATATTCTTATGGTTCCGCAGTGACTACAACAGC
GATCATTCAGTTGATGTTTTGCGTGATTTTGTTAATAAATTCGGAACCTTATACAACAGA
GTACATCTTTCATATAACACGTCTAAACAAAAGTTCGACGATGAATTGTCACCGACCCAT
TGGAGCCATAGTCGATTTATGCATTTGATTAAGTGGAGGGAAATGGGAATTAAATTTGCG
AAGCGACAATGGGCCGATTATGTATTTATGCTGGACGCAGATGTGTTTCTCACGAACCCC
CAGACGCTACGGCATCTCATCCAAAAACAACTCCGTGTGGTGGCGCCAATGCTCGTCTCA
GATCGATATTACTCCAACTTCTGGTTGTCCGTTGACGACGACTTCAACTATCGTCTAAAT
CACGAGGATGAATTCTATCCATTATATGAGTACAACGAATTGTACATGGGATGTCATATA
GTTCCAGTGATATACGGGGCGGTACTAATGGATCTTCGATCAAAGAAATCGGACTATATA
ACCTATGATCCCTACAAAATAGTCGATTACTTGGGCCCGCTGCAGGACCACATTATATTT
GCCGTGAACGCCATGAGGAACAATATATCGCTACACATTTGCAACGACGATTTCTTCGGT
TACATCACCCGGCCAATAAAAGAAGGCGAACCACTTGAAAGGGATGTATTGCATCTCACC
AACCTGAAGCTGTCAGCGATTGCACGCAGCAAGCCGCTTCAATACCACTACAAGCTACAG
CGTTTTGTGTACTACCCTCCTTCTCTGGATTACCAAGTCGACAAGATTTATATGATCAAC
CTCGAACGAAGACCCGATAAAAGGAAGCTGATGGAACAGAGCTTCAAGGAATTGGGCATG
AATGTTACACGTGTTGAAGCTGTCGACGGCAAGAGCCTGGATCCGAAAAAACTTCAAAAT
ATGAACGTTACCTTGATGCCTGGATATGAAGATGCCTACTATAAACGCCCTATGACCTAC
GGCGAGATCGGATGTTTCTTGAGCCATTACAAGATTTGGGTCGAGGTTGCGGAGAGAAAC
TACAACAGAGTGTTGATTTTGGAAGACGACGTTAATTTCTTGCCTTATTTCAAGGAAAAC
TATGATACGATTATATGGGAGTCTTCTGTTCTGAAACATGATTTTATCTACCTCGGCCGC
AAAATTATGATGGATAAGGTTGAGATTAGAATGACGACGCATCTCACTAAGCCGCTGTAC
TCTTACTGGACCATTGGTTACATTATAACAAAATTGGGTGCTGAGAAACTGATCGAGGCT
AAACCTTTGAGCAAATTGTTACCAGTCGATGAGTTCCTACCTATTATGTTCGATCAACAT
CCAGATAAAAAATATAAAGAATTCTTCCCCAATCGGAATCTGAACGCGCTGGCTGCGAGT
CCGTCTATCATTTCACCGACACACTACACGGGCATGCCGGGATACATCAGTGACACTGAA
GATTCTATCCCTCTGTCTACTGAGCCCTGCACTGATAATCCAGAACTTTAA
Protein sequence:
MNLDYPKDRIFLWFRSDYNSDHSVDVLRDFVNKFGTLYNRVHLSYNTSKQKFDDELSPTH
WSHSRFMHLIKWREMGIKFAKRQWADYVFMLDADVFLTNPQTLRHLIQKQLRVVAPMLVS
DRYYSNFWLSVDDDFNYRLNHEDEFYPLYEYNELYMGCHIVPVIYGAVLMDLRSKKSDYI
TYDPYKIVDYLGPLQDHIIFAVNAMRNNISLHICNDDFFGYITRPIKEGEPLERDVLHLT
NLKLSAIARSKPLQYHYKLQRFVYYPPSLDYQVDKIYMINLERRPDKRKLMEQSFKELGM
NVTRVEAVDGKSLDPKKLQNMNVTLMPGYEDAYYKRPMTYGEIGCFLSHYKIWVEVAERN
YNRVLILEDDVNFLPYFKENYDTIIWESSVLKHDFIYLGRKIMMDKVEIRMTTHLTKPLY
SYWTIGYIITKLGAEKLIEAKPLSKLLPVDEFLPIMFDQHPDKKYKEFFPNRNLNALAAS
PSIISPTHYTGMPGYISDTEDSIPLSTEPCTDNPEL
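
The nucleotide and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code; `CODON_TABLE` and `translate` are illustrative names, and only the first and last lines of the CDS are used to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame CDS; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First and last lines of the nucleotide sequence in this record.
first_line = "ATGAATTTGGACTATCCGAAAGATAGGATATTCTTATGGTTCCGCAGTGACTACAACAGC"
last_line = "GATTCTATCCCTCTGTCTACTGAGCCCTGCACTGATAATCCAGAACTTTAA"

print(translate(first_line))  # MNLDYPKDRIFLWFRSDYNS
print(translate(last_line))   # DSIPLSTEPCTDNPEL*
```

Both fragments match the start and end of the protein sequence. The 1551-base CDS corresponds to 517 codons, that is 516 residues plus the terminal TAA stop codon, which agrees with the length of the protein listed here.
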