DPGLEAN01689 in OGS1.0

New model in OGS2.0  DPOGS204629
Genomic Position  scaffold483:- 52322-57169
CDS Length  1863
Paired RNAseq reads  884
Single RNAseq reads  2036
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA014459 (7e-10)
Best Drosophila hit  CG31120, isoform A (5e-56)
Best Human hit  2-oxoglutarate and iron-dependent oxygenase domain-containing protein 1 (2e-41)
Best NR hit (blastp)  AGAP002934-PA [Anopheles gambiae str. PEST] (7e-65)
Best NR hit (blastx)  AGAP002934-PA [Anopheles gambiae str. PEST] (1e-62)
GeneOntology terms
GO:0005506 iron ion binding
GO:0016706 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors
GO:0031418 L-ascorbic acid binding
GO:0055114 oxidation reduction
InterPro families
IPR006620 Prolyl 4-hydroxylase, alpha subunit
IPR019601 Oxoglutarate/iron-dependent oxygenase, C-terminal degradation domain
IPR005123 Oxoglutarate/iron-dependent oxygenase
Orthology group  MCL16666

Nucleotide sequence:

ATGAGTTCTCCGACTAAAGAAACTGAAGATCCATCATCTAGCAATGCTGAGAGTGAGGAA
TACACAGGCGGAAATAGCGATGCGAATACAGAGCAGAGACCGCCCGCCAAGCGGCCTATG
TCTACAGCAGTTATTGAAATTTCAGACACTGAAAGTGATGATTCCGATGTCTGCGCTGTT
AATTCTTACCAGGCCTCAGCTGATGAGGTGAAAAGAATACGAAGAGATTATTCATCTTCG
TCATCATCTTCCTCATCATCAAACTACAGCTCTGATTCTGACTCACCATGGGAAGATGAC
TCTGTAGTAATAGATGATAAAGCAATGGGAAGGCCTGTGATTGCTAAGATGTTAGTTAGA
GCTAATAGAATGGATGACCCTAAATTTAATCCTGAACTGAAGTCTCAAGAGATCATAAGT
AAAATTAAATCTCACTGGGATGAGAAGACAGACCACAGTAGTGACCAAGTGACCTTAACA
TGTAAACCGTTCAGACTCTGTCGGATTCATGGCTTGTTAGAGAACTCGGAGATAATAAAT
AATATAGTGGACGACATGAACACATTGGACTGGTCGAGGAAGAAGATGGATCTGTACGAG
TTTCACCAGACCTCTGACTTAGCAAACTTAACTTGGCAGCGTAGTATAAGAGGTATTTAC
GAATTATTGAAGACTGAAGTAATGACTTGGGTGTCGCAAGTAACGGGCATAGAGTTGACA
TCAGTGTCGGCGTCATGTTCGCTGTATGGCCCCGGAGACCATCTCTTGGTTCACGATGAT
CGACTCGGGGACAGGAGGGTGGCCTTCATCCTGTACCTAGCACCCTGGACGCCACGATCA
CCACCACACATGCAGAACGGAGCTGAAAGTCAAGATAAGTGTTGGAGCGGTCCGGGCTGG
AGGCCGCATATGGGTGGAGCGTTGGAGTTGGTCGAGGATGGACAGGTTGTGTTCCGTGCC
TTCCCCGCTAATAATACATTAGCATTCTTCGCAGTCGGCCCGACGTCCTTTCATCAGGTG
GGCGAAGTCCTATCTATGGAGCTTCCTCGGCTGTCTATTAACGGTTGGTTTCACGGTCCG
GCGCCGGAGTCCGAGGAGCCGCACGCGGAGCTCCCAGTGCCACTCACACCGCACAACCAA
GTGGTGGTGTTGAAGTCGTGGGTAGAGGCTGGGTACTTGTGTCCCCGAGCTCGAGCCCAG
GTCCAGGCGCAGATGGAGCGTGCCAGCGAGGTCTGCCTGCATGACCTGCTGCTGCCATCG
CGATGCCAGCAACTGCTGGAAGCGCTGGAGAAGAATGACATAGAATGGGAGCAGTGCGGT
CCAGCACATCAGCGACGGTATCAGCGAGTGACGGAGAAATGGCTCTCAGCCAGCGAACTC
TCTGAGGCAACAGAGGAAGAAGCCATCCAGGGCGAAGAGCCCGACGACTGCGGGGTACAG
GGGGAGACGCATGTCGTACGAGCACTGCTAAGGCTCCTCAGTAGTACAGCATTCATGAGG
CTGGTGGCGGACTGTACAGATCTACCGCTGACTTTGTACAGGAAACTAGAAATGCAACGC
TGGCGGGCTGGAGATTTCACTCTTCTCCCGCCCCGGGAACATTATCAGCAGCCTCGTCTA
GAGGCAGTCCTGTATCTGGGTGTGCCGAAACATCCTATCTGTGGAGGTCAAACGTTATAT
GTGGCCCCAGAAGAGGGGTCGCTTGCGGAGGCCGAGGCATTGGTGACTCTGCCCCCCAGA
CACAACGCGTTAGGGCTGGTGTACTGCGACGCTGGCGCAGCCTCCTTCACCAAATATCTC
AGCAAGATGACCATGTCGGAGAACGAGTGCTTCTATATAGTGACCTGTACTTATACCGAG
TGA

Protein sequence:

MSSPTKETEDPSSSNAESEEYTGGNSDANTEQRPPAKRPMSTAVIEISDTESDDSDVCAV
NSYQASADEVKRIRRDYSSSSSSSSSSNYSSDSDSPWEDDSVVIDDKAMGRPVIAKMLVR
ANRMDDPKFNPELKSQEIISKIKSHWDEKTDHSSDQVTLTCKPFRLCRIHGLLENSEIIN
NIVDDMNTLDWSRKKMDLYEFHQTSDLANLTWQRSIRGIYELLKTEVMTWVSQVTGIELT
SVSASCSLYGPGDHLLVHDDRLGDRRVAFILYLAPWTPRSPPHMQNGAESQDKCWSGPGW
RPHMGGALELVEDGQVVFRAFPANNTLAFFAVGPTSFHQVGEVLSMELPRLSINGWFHGP
APESEEPHAELPVPLTPHNQVVVLKSWVEAGYLCPRARAQVQAQMERASEVCLHDLLLPS
RCQQLLEALEKNDIEWEQCGPAHQRRYQRVTEKWLSASELSEATEEEAIQGEEPDDCGVQ
GETHVVRALLRLLSSTAFMRLVADCTDLPLTLYRKLEMQRWRAGDFTLLPPREHYQQPRL
EAVLYLGVPKHPICGGQTLYVAPEEGSLAEAEALVTLPPRHNALGLVYCDAGAASFTKYL
SKMTMSENECFYIVTCTYTE
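
Consistency check (illustrative sketch, not part of the original record):

The figures in this record can be cross-checked against each other. The sketch below assumes the standard genetic code (NCBI translation table 1) and inclusive genomic coordinates; the names `translate`, `cds_start`, and `cds_end` are hypothetical helpers introduced only for this example. It translates the first 60 and last 63 nucleotides of the CDS, compares the result with the ends of the listed protein, and derives the expected protein length from the stated CDS length.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds) - 2, 3))


# First 60 nt and last 63 nt of the nucleotide sequence, copied from the record.
cds_start = "ATGAGTTCTCCGACTAAAGAAACTGAAGATCCATCATCTAGCAATGCTGAGAGTGAGGAA"
cds_end = "AGCAAGATGACCATGTCGGAGAACGAGTGCTTCTATATAGTGACCTGTACTTATACCGAG" + "TGA"

# Values stated in the record.
CDS_LENGTH = 1863
GENOMIC_START, GENOMIC_END = 52322, 57169

protein_start = translate(cds_start)
protein_end = translate(cds_end)

# A CDS of N nucleotides encodes N/3 codons, one of which is the stop codon.
expected_residues = CDS_LENGTH // 3 - 1

# Assumption: coordinates are inclusive on both ends.
genomic_span = GENOMIC_END - GENOMIC_START + 1

print(protein_start)       # MSSPTKETEDPSSSNAESEE
print(protein_end)         # SKMTMSENECFYIVTCTYTE*
print(expected_residues)   # 620
print(genomic_span)        # 4848
```

The translated ends match the first and last 20 residues of the protein sequence above, and the expected length of 620 residues agrees with the listed protein (ten lines of 60 plus a final line of 20). The genomic span of 4848 bp exceeds the 1863 bp CDS, which would be consistent with the locus containing introns or other non-coding sequence, although the record itself does not list exon boundaries.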