New model in OGS2.0 | DPOGS204629  |
---|---|
Genomic Position | scaffold483:- 52322-57169 |
See gene structure | |
CDS Length | 1863 |
Paired RNAseq reads   | 884 |
Single RNAseq reads   | 2036 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA014459 (7e-10) |
Best Drosophila hit   | CG31120, isoform A (5e-56) |
Best Human hit | 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 1 (2e-41) |
Best NR hit (blastp)   | AGAP002934-PA [Anopheles gambiae str. PEST] (7e-65) |
Best NR hit (blastx)   | AGAP002934-PA [Anopheles gambiae str. PEST] (1e-62) |
GeneOntology terms    | GO:0005506 iron ion binding GO:0016706 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors GO:0031418 L-ascorbic acid binding GO:0055114 oxidation reduction |
InterPro families    | IPR006620 Prolyl 4-hydroxylase, alpha subunit IPR019601 Oxoglutarate/iron-dependent oxygenase, C-terminal degradation domain IPR005123 Oxoglutarate/iron-dependent oxygenase |
Orthology group | MCL16666 |
Nucleotide sequence:
ATGAGTTCTCCGACTAAAGAAACTGAAGATCCATCATCTAGCAATGCTGAGAGTGAGGAA
TACACAGGCGGAAATAGCGATGCGAATACAGAGCAGAGACCGCCCGCCAAGCGGCCTATG
TCTACAGCAGTTATTGAAATTTCAGACACTGAAAGTGATGATTCCGATGTCTGCGCTGTT
AATTCTTACCAGGCCTCAGCTGATGAGGTGAAAAGAATACGAAGAGATTATTCATCTTCG
TCATCATCTTCCTCATCATCAAACTACAGCTCTGATTCTGACTCACCATGGGAAGATGAC
TCTGTAGTAATAGATGATAAAGCAATGGGAAGGCCTGTGATTGCTAAGATGTTAGTTAGA
GCTAATAGAATGGATGACCCTAAATTTAATCCTGAACTGAAGTCTCAAGAGATCATAAGT
AAAATTAAATCTCACTGGGATGAGAAGACAGACCACAGTAGTGACCAAGTGACCTTAACA
TGTAAACCGTTCAGACTCTGTCGGATTCATGGCTTGTTAGAGAACTCGGAGATAATAAAT
AATATAGTGGACGACATGAACACATTGGACTGGTCGAGGAAGAAGATGGATCTGTACGAG
TTTCACCAGACCTCTGACTTAGCAAACTTAACTTGGCAGCGTAGTATAAGAGGTATTTAC
GAATTATTGAAGACTGAAGTAATGACTTGGGTGTCGCAAGTAACGGGCATAGAGTTGACA
TCAGTGTCGGCGTCATGTTCGCTGTATGGCCCCGGAGACCATCTCTTGGTTCACGATGAT
CGACTCGGGGACAGGAGGGTGGCCTTCATCCTGTACCTAGCACCCTGGACGCCACGATCA
CCACCACACATGCAGAACGGAGCTGAAAGTCAAGATAAGTGTTGGAGCGGTCCGGGCTGG
AGGCCGCATATGGGTGGAGCGTTGGAGTTGGTCGAGGATGGACAGGTTGTGTTCCGTGCC
TTCCCCGCTAATAATACATTAGCATTCTTCGCAGTCGGCCCGACGTCCTTTCATCAGGTG
GGCGAAGTCCTATCTATGGAGCTTCCTCGGCTGTCTATTAACGGTTGGTTTCACGGTCCG
GCGCCGGAGTCCGAGGAGCCGCACGCGGAGCTCCCAGTGCCACTCACACCGCACAACCAA
GTGGTGGTGTTGAAGTCGTGGGTAGAGGCTGGGTACTTGTGTCCCCGAGCTCGAGCCCAG
GTCCAGGCGCAGATGGAGCGTGCCAGCGAGGTCTGCCTGCATGACCTGCTGCTGCCATCG
CGATGCCAGCAACTGCTGGAAGCGCTGGAGAAGAATGACATAGAATGGGAGCAGTGCGGT
CCAGCACATCAGCGACGGTATCAGCGAGTGACGGAGAAATGGCTCTCAGCCAGCGAACTC
TCTGAGGCAACAGAGGAAGAAGCCATCCAGGGCGAAGAGCCCGACGACTGCGGGGTACAG
GGGGAGACGCATGTCGTACGAGCACTGCTAAGGCTCCTCAGTAGTACAGCATTCATGAGG
CTGGTGGCGGACTGTACAGATCTACCGCTGACTTTGTACAGGAAACTAGAAATGCAACGC
TGGCGGGCTGGAGATTTCACTCTTCTCCCGCCCCGGGAACATTATCAGCAGCCTCGTCTA
GAGGCAGTCCTGTATCTGGGTGTGCCGAAACATCCTATCTGTGGAGGTCAAACGTTATAT
GTGGCCCCAGAAGAGGGGTCGCTTGCGGAGGCCGAGGCATTGGTGACTCTGCCCCCCAGA
CACAACGCGTTAGGGCTGGTGTACTGCGACGCTGGCGCAGCCTCCTTCACCAAATATCTC
AGCAAGATGACCATGTCGGAGAACGAGTGCTTCTATATAGTGACCTGTACTTATACCGAG
TGA
Protein sequence:
MSSPTKETEDPSSSNAESEEYTGGNSDANTEQRPPAKRPMSTAVIEISDTESDDSDVCAV
NSYQASADEVKRIRRDYSSSSSSSSSSNYSSDSDSPWEDDSVVIDDKAMGRPVIAKMLVR
ANRMDDPKFNPELKSQEIISKIKSHWDEKTDHSSDQVTLTCKPFRLCRIHGLLENSEIIN
NIVDDMNTLDWSRKKMDLYEFHQTSDLANLTWQRSIRGIYELLKTEVMTWVSQVTGIELT
SVSASCSLYGPGDHLLVHDDRLGDRRVAFILYLAPWTPRSPPHMQNGAESQDKCWSGPGW
RPHMGGALELVEDGQVVFRAFPANNTLAFFAVGPTSFHQVGEVLSMELPRLSINGWFHGP
APESEEPHAELPVPLTPHNQVVVLKSWVEAGYLCPRARAQVQAQMERASEVCLHDLLLPS
RCQQLLEALEKNDIEWEQCGPAHQRRYQRVTEKWLSASELSEATEEEAIQGEEPDDCGVQ
GETHVVRALLRLLSSTAFMRLVADCTDLPLTLYRKLEMQRWRAGDFTLLPPREHYQQPRL
EAVLYLGVPKHPICGGQTLYVAPEEGSLAEAEALVTLPPRHNALGLVYCDAGAASFTKYL
SKMTMSENECFYIVTCTYTE