New model in OGS2.0 | DPOGS209672  |
---|---|
Genomic Position | scaffold27:- 25598-32752 |
See gene structure | |
CDS Length | 1983 |
Paired RNAseq reads   | 3705 |
Single RNAseq reads   | 8863 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000703 (6e-140) |
Best Drosophila hit   | CG6199, isoform B (0.0) |
Best Human hit | procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor (7e-166) |
Best NR hit (blastp)   | PREDICTED: similar to procollagen-lysine,2-oxoglutarate 5-dioxygenase [Nasonia vitripennis] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to procollagen-lysine,2-oxoglutarate 5-dioxygenase [Nasonia vitripennis] (0.0) |
GeneOntology terms    | GO:0008475 procollagen-lysine 5-dioxygenase activity GO:0055114 oxidation reduction GO:0005506 iron ion binding GO:0031418 L-ascorbic acid binding |
InterPro families    | IPR005123 Oxoglutarate/iron-dependent oxygenase IPR006620 Prolyl 4-hydroxylase, alpha subunit |
Orthology group | MCL10741 |
Nucleotide sequence:
ATGAAGCATGAGGGCGGTGGTCACAAAGTTAACCTGCTCAAAGATAAGCTGAGTTCAATG
AAAATACCTGAAGATAGAGATCAGATTATTCTATTCACTGACAGCTACGATGTGATGTTC
CTGGGATCGCTCGATGAGATAGTACAGAAGTTTCTCGCAATGTCCGTACGCGTGTTGTTC
TCTGCGGAACCTTTCTGTTGGCCAGATTCCTCTCTCGCGTCGCAGTATCCCGACAGCCAG
CAATTGAACCCGTTCCTTAATTCGGGCGGATTTATAGGATATTTACCGGAACTTTTGAAG
ATCTTGAACTATGAGACAGTGGGCAATAAAGATGACGATCAGCTCTTCTACACCAAAGTG
TACTTGGATGAGGATTATAGAGAAAGTCTAAGAATTTCCCTTGACCACAAATCGGCCATA
TTCCAAAACCTCCATGGTGCCTTGTCGGATGTGCAACTCGTCGCTAACTCCACTGATGAA
TGGCCGTATCTTGTCAACGTGGTGACCAAGCAGAGGCCTCTGATCGTTCACGGAAACGGG
CCCGCAAAATTGACCTTGAACAATTTGTCCAACTATTTGGCCAAGTCCTGGTCTGTCAGT
GAGGGATGCGTTCTGTGCGATGAGAAGAGGATTGTGCTGGATGAGGACAAGCTGCCGAAG
GTGATGCTCTCCGTATTCATAGAAGTCGCGACACCGTTTATAGAGGAATTTTTCCAAAGT
ATTCTGGCCATTGATTATCCCAAGCAGAAAATACATCTCTTTATCCGCAACGGTGTCGAG
TATCATGAGTCGGAGGTGGAGAATTTCTATCAGGCTCACAGTAGCGAATATTTTACCGCC
AAACGGATCAAATCCACTGACCTTGTGGGGGAGGCTGAAGCGAGGAACATTGCTAAGGAC
CGCTGTATCGGCAGCGACTGTGATTATCTCTTCTGCCTGGACAGCCACGCCCGTGTTGAA
CCTGATACACTTCATTACTTGCTCTCTACCGGATATGACGTCGTCGCCCCCTTACTAGTA
CGCAGTGGACAAGCTTGGTCAAACTTTTGGGGTGCTATTAACTCTGTTGGTTTCTATTCC
CGTTCAGCTGATTATATGGATATTGTCAACCGCAGCATTGAAGGTATCTGGAACGTCCCG
TTCATCAACAACTGCTACCTTATGAACATTTCCCTGTTCCGCAAACCGTCTGCCAAACAT
GTTAGCTATTTGAAAGAGGACACCGACCCTGATATGGCTTTCTGCGCTTCACTCAGATCT
GCTGGTATCATGATGTACGTGAGCAATGAAAAGGAATTCGGTCATCTCGTTAATTCTGAA
ACGTTTGACGTGAGCCGCACTAACCCTGACATTTACCAAGTGATTGATAACAAGCTTGAT
TGGGAACAACGTTACCTCCACCCCAAGTACCATGAAATCTTCGCCAACAAAGAAAAGCAA
CTCATGCCCTGCCCCGACGTCTATTGGTTCCCACTGATGTCGATGCGCTTCTGTAAGGAA
TGGATCGAAGTCATGGAGGCCTTTGGACAATGGAGCGATGGATCTAACAATGACAAACGT
CTAGAGAGTGGTTACGAAGCTGTTCCAACTCGTGATATTCACATGAACCAGGTCGGACTT
GACATTCAATGGCTCCGAATCCTCAAGGATTACGTTCGTCCGCTGCAGGAGTTAGTTTTC
ACTGGATACTACCATAACCCCCCCGTGTCCGTCATGAACTTCGTGGTCCGTTATCGTCCT
GATGAACAGCCCTCTCTACGGCCGCACCACGACTCCTCAACTTACACCATCAACCTGGCT
CTGAATACTCCCCACTTGGATTACGAGGGTGGTGGTTGTCGGTTCATCCGCTACAACTGT
TCGGTGAAGGACACCAAGCCCGGTTGGCTTTTGATGCACCCTGGCCGTCTGACCCACTTC
CACGAGGGTCTCCTCGTCACCAAGGGCACACGTTACATTATGATCTCATTCGTGGACCCG
TAA
Protein sequence:
MKHEGGGHKVNLLKDKLSSMKIPEDRDQIILFTDSYDVMFLGSLDEIVQKFLAMSVRVLF
SAEPFCWPDSSLASQYPDSQQLNPFLNSGGFIGYLPELLKILNYETVGNKDDDQLFYTKV
YLDEDYRESLRISLDHKSAIFQNLHGALSDVQLVANSTDEWPYLVNVVTKQRPLIVHGNG
PAKLTLNNLSNYLAKSWSVSEGCVLCDEKRIVLDEDKLPKVMLSVFIEVATPFIEEFFQS
ILAIDYPKQKIHLFIRNGVEYHESEVENFYQAHSSEYFTAKRIKSTDLVGEAEARNIAKD
RCIGSDCDYLFCLDSHARVEPDTLHYLLSTGYDVVAPLLVRSGQAWSNFWGAINSVGFYS
RSADYMDIVNRSIEGIWNVPFINNCYLMNISLFRKPSAKHVSYLKEDTDPDMAFCASLRS
AGIMMYVSNEKEFGHLVNSETFDVSRTNPDIYQVIDNKLDWEQRYLHPKYHEIFANKEKQ
LMPCPDVYWFPLMSMRFCKEWIEVMEAFGQWSDGSNNDKRLESGYEAVPTRDIHMNQVGL
DIQWLRILKDYVRPLQELVFTGYYHNPPVSVMNFVVRYRPDEQPSLRPHHDSSTYTINLA
LNTPHLDYEGGGCRFIRYNCSVKDTKPGWLLMHPGRLTHFHEGLLVTKGTRYIMISFVDP