DPGLEAN01285 in OGS1.0

New model in OGS2.0  DPOGS209672
Genomic Position  scaffold27:- 25598-32752
CDS Length  1983
Paired RNAseq reads  3705
Single RNAseq reads  8863
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA000703 (6e-140)
Best Drosophila hit  CG6199, isoform B (0.0)
Best Human hit  procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor (7e-166)
Best NR hit (blastp)  PREDICTED: similar to procollagen-lysine,2-oxoglutarate 5-dioxygenase [Nasonia vitripennis] (0.0)
Best NR hit (blastx)  PREDICTED: similar to procollagen-lysine,2-oxoglutarate 5-dioxygenase [Nasonia vitripennis] (0.0)
GeneOntology terms
GO:0008475 procollagen-lysine 5-dioxygenase activity
GO:0055114 oxidation reduction
GO:0005506 iron ion binding
GO:0031418 L-ascorbic acid binding
InterPro families
IPR005123 Oxoglutarate/iron-dependent oxygenase
IPR006620 Prolyl 4-hydroxylase, alpha subunit
Orthology group  MCL10741

Nucleotide sequence:

ATGAAGCATGAGGGCGGTGGTCACAAAGTTAACCTGCTCAAAGATAAGCTGAGTTCAATG
AAAATACCTGAAGATAGAGATCAGATTATTCTATTCACTGACAGCTACGATGTGATGTTC
CTGGGATCGCTCGATGAGATAGTACAGAAGTTTCTCGCAATGTCCGTACGCGTGTTGTTC
TCTGCGGAACCTTTCTGTTGGCCAGATTCCTCTCTCGCGTCGCAGTATCCCGACAGCCAG
CAATTGAACCCGTTCCTTAATTCGGGCGGATTTATAGGATATTTACCGGAACTTTTGAAG
ATCTTGAACTATGAGACAGTGGGCAATAAAGATGACGATCAGCTCTTCTACACCAAAGTG
TACTTGGATGAGGATTATAGAGAAAGTCTAAGAATTTCCCTTGACCACAAATCGGCCATA
TTCCAAAACCTCCATGGTGCCTTGTCGGATGTGCAACTCGTCGCTAACTCCACTGATGAA
TGGCCGTATCTTGTCAACGTGGTGACCAAGCAGAGGCCTCTGATCGTTCACGGAAACGGG
CCCGCAAAATTGACCTTGAACAATTTGTCCAACTATTTGGCCAAGTCCTGGTCTGTCAGT
GAGGGATGCGTTCTGTGCGATGAGAAGAGGATTGTGCTGGATGAGGACAAGCTGCCGAAG
GTGATGCTCTCCGTATTCATAGAAGTCGCGACACCGTTTATAGAGGAATTTTTCCAAAGT
ATTCTGGCCATTGATTATCCCAAGCAGAAAATACATCTCTTTATCCGCAACGGTGTCGAG
TATCATGAGTCGGAGGTGGAGAATTTCTATCAGGCTCACAGTAGCGAATATTTTACCGCC
AAACGGATCAAATCCACTGACCTTGTGGGGGAGGCTGAAGCGAGGAACATTGCTAAGGAC
CGCTGTATCGGCAGCGACTGTGATTATCTCTTCTGCCTGGACAGCCACGCCCGTGTTGAA
CCTGATACACTTCATTACTTGCTCTCTACCGGATATGACGTCGTCGCCCCCTTACTAGTA
CGCAGTGGACAAGCTTGGTCAAACTTTTGGGGTGCTATTAACTCTGTTGGTTTCTATTCC
CGTTCAGCTGATTATATGGATATTGTCAACCGCAGCATTGAAGGTATCTGGAACGTCCCG
TTCATCAACAACTGCTACCTTATGAACATTTCCCTGTTCCGCAAACCGTCTGCCAAACAT
GTTAGCTATTTGAAAGAGGACACCGACCCTGATATGGCTTTCTGCGCTTCACTCAGATCT
GCTGGTATCATGATGTACGTGAGCAATGAAAAGGAATTCGGTCATCTCGTTAATTCTGAA
ACGTTTGACGTGAGCCGCACTAACCCTGACATTTACCAAGTGATTGATAACAAGCTTGAT
TGGGAACAACGTTACCTCCACCCCAAGTACCATGAAATCTTCGCCAACAAAGAAAAGCAA
CTCATGCCCTGCCCCGACGTCTATTGGTTCCCACTGATGTCGATGCGCTTCTGTAAGGAA
TGGATCGAAGTCATGGAGGCCTTTGGACAATGGAGCGATGGATCTAACAATGACAAACGT
CTAGAGAGTGGTTACGAAGCTGTTCCAACTCGTGATATTCACATGAACCAGGTCGGACTT
GACATTCAATGGCTCCGAATCCTCAAGGATTACGTTCGTCCGCTGCAGGAGTTAGTTTTC
ACTGGATACTACCATAACCCCCCCGTGTCCGTCATGAACTTCGTGGTCCGTTATCGTCCT
GATGAACAGCCCTCTCTACGGCCGCACCACGACTCCTCAACTTACACCATCAACCTGGCT
CTGAATACTCCCCACTTGGATTACGAGGGTGGTGGTTGTCGGTTCATCCGCTACAACTGT
TCGGTGAAGGACACCAAGCCCGGTTGGCTTTTGATGCACCCTGGCCGTCTGACCCACTTC
CACGAGGGTCTCCTCGTCACCAAGGGCACACGTTACATTATGATCTCATTCGTGGACCCG
TAA

Protein sequence:

MKHEGGGHKVNLLKDKLSSMKIPEDRDQIILFTDSYDVMFLGSLDEIVQKFLAMSVRVLF
SAEPFCWPDSSLASQYPDSQQLNPFLNSGGFIGYLPELLKILNYETVGNKDDDQLFYTKV
YLDEDYRESLRISLDHKSAIFQNLHGALSDVQLVANSTDEWPYLVNVVTKQRPLIVHGNG
PAKLTLNNLSNYLAKSWSVSEGCVLCDEKRIVLDEDKLPKVMLSVFIEVATPFIEEFFQS
ILAIDYPKQKIHLFIRNGVEYHESEVENFYQAHSSEYFTAKRIKSTDLVGEAEARNIAKD
RCIGSDCDYLFCLDSHARVEPDTLHYLLSTGYDVVAPLLVRSGQAWSNFWGAINSVGFYS
RSADYMDIVNRSIEGIWNVPFINNCYLMNISLFRKPSAKHVSYLKEDTDPDMAFCASLRS
AGIMMYVSNEKEFGHLVNSETFDVSRTNPDIYQVIDNKLDWEQRYLHPKYHEIFANKEKQ
LMPCPDVYWFPLMSMRFCKEWIEVMEAFGQWSDGSNNDKRLESGYEAVPTRDIHMNQVGL
DIQWLRILKDYVRPLQELVFTGYYHNPPVSVMNFVVRYRPDEQPSLRPHHDSSTYTINLA
LNTPHLDYEGGGCRFIRYNCSVKDTKPGWLLMHPGRLTHFHEGLLVTKGTRYIMISFVDP
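
The nucleotide and protein sequences above can be cross-checked against each other and against the stated CDS length. The following is a minimal sketch in Python, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the helper names `translate` and `lengths_agree` are hypothetical and not part of the record or the database. It translates a coding sequence codon by codon up to the first stop codon, and checks that a CDS length equals three times the protein length plus one stop codon.

```python
from itertools import product

# Standard genetic code (NCBI table 1), written in the conventional TCAG order.
# Assumption: this table is appropriate for the gene model shown above.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    # Remove line breaks and spaces so a multi-line sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def lengths_agree(cds_length, protein_length):
    """Return True if the CDS length equals 3 * (residues + one stop codon)."""
    return cds_length == 3 * (protein_length + 1)


# First 30 bases of the nucleotide sequence listed above.
print(translate("ATGAAGCATGAGGGCGGTGGTCACAAAGTT"))  # MKHEGGGHKV
# Stated CDS length versus the 660 residues counted in the protein listing.
print(lengths_agree(1983, 660))  # True
```

Pasting the full nucleotide block into `translate` should reproduce the protein listing if the record is internally consistent; the residue count of 660 used here comes from counting the eleven 60-character protein lines rather than from a field in the record.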