Monarch geneset OGS2.0

DPOGS214589
Transcript: DPOGS214589-TA  1416 bp
Protein: DPOGS214589-PA  471 aa
Genomic position: DPSCF300050 - 351568-362122
RNAseq coverage: 908x (Rank: top 14%)
Annotation
Heliconius: HMEL012180  3e-114  71.69%
Bombyx: BGIBMGA005120-TA  5e-120  63.78%
Drosophila: Hph-PC  5e-76  54.78%
EBI UniRef50: UniRef50_E2AT40  3e-115  48.73%  Egl nine-like protein 1 n=2 Tax=Formicidae RepID=E2AT40_CAMFO
NCBI RefSeq: XP_397368.3  7e-118  47.71%  PREDICTED: similar to Egl nine homolog 1 (Hypoxia-inducible factor prolyl hydroxylase 2) (HIF-prolyl hydroxylase 2) (HIF-PH2) (HPH-2) (SM-20) [Apis mellifera]
NCBI nr blastp: gi|328778894  1e-116  47.71%  PREDICTED: egl nine homolog 1-like [Apis mellifera]
NCBI nr blastx: gi|328778894  3e-115  47.71%  PREDICTED: egl nine homolog 1-like [Apis mellifera]
Group
Gene Ontology:
GO:0016705  2.1e-33  oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:0005506  2.1e-33  iron ion binding
GO:0055114  2.1e-33  oxidation-reduction process
GO:0031418  2.1e-33  L-ascorbic acid binding
GO:0008270  5.5e-10  zinc ion binding
GO:0016706  1.3e-08  oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors
GO:0016491  1.3e-08  oxidoreductase activity
KEGG pathway: ame:413929  2e-117
K09592 (EGLN, HPH) maps-> Pathways in cancer
K09592 (EGLN, HPH) maps-> Renal cell carcinoma
InterPro domain:
[260-452]  IPR006620  2.1e-33  Prolyl 4-hydroxylase, alpha subunit
[10-46]  IPR002893  5.5e-10  Zinc finger, MYND-type
[359-452]  IPR005123  1.3e-08  Oxoglutarate/iron-dependent oxygenase
Orthology group: MCL13521  Single-copy universal gene

Nucleotide sequence:

>DPOGS214589-TA
ATGAATCAAGAAGGCGTACTCGCGAGTTGCGCCGTTTGCAATCAGCAAACCCAAAGGAGATGTGGTCGTTGTTTTAGTGTTTATTATTGTAACACAGAACATCAAAGACAAGATTGGAAAAGGCATAAAATCAATTGTGCACCTAAGTTACAGGAACAGGGTTCGCGAATTGAAAAAAATATTAAATCATCTATCCACAAAGAAGAGAAAAAGAAAGAAACCAAAAAAGAACAATTAATAACCTCATCTGTAGTGAATAATAATAAAGAAGTGTTACAAGTGGGTAGTGAAACTAGACGTTTGAAAAAATCTAAAAAGAAAACTGCCAAGAAAGAGGGTAGTGACACTATTAGTGATAACAAAAATATCTCTCAGATTATAGAAAATAATAATGAAAGCACAGCAAGTGTTAAAGTCGATTTAAAGGACAAAAGTGTATTTTCAAGCGTAGTTTACTCTAATAAAGACGTAAATAAGATAAGTGCCATAACTTACGAAGGTTCGTCGGAACAGGAGATATTAAATGAAAAGGCTCAGCAGCTGAGCAGCGTTGATTTCGCGAGTGCAAGCACTTCAAATGTTTTAAAAACAATTAACAGAGCTGATGTGAAAATGCCACCTGTACCTATTGAACAGTCAACTAGAATGAAGGAGTACCCCGAGGCCTCACTAAAAGGTAGCGGGGCTCCATTTAATACCACAATGAACAGTTACTTTATGGATCCCAGTGATCCATCCTATGAGATCTGCCAGAGAGTTATCAGAGACATGACACAATATGGTGTGTGTGTTATAAACAATTTCCTTGGTAAAGAACGGGGACATCTTGTATTGAATGAAGTGCTGGAAATGTACAGATCGGGGATATTTACGGCAGGTCAATTGGTTTCTAGCCCTGGAAGCACAGAAGCACAGACAATTAGATCGGACAGAATAACATGGATCGACGGCAAGGAGCCTCAATGCCATCACATAGGACAATTAATATCACAAGTGGACAGTATAATACTGAAAGCTAACAAAATGTCCAACAACGGGAAGATGGGGAACTACATCATCAATGGCAGGACGAAGGCGATGGTAGCATGTTATCCTGGTTCCGGAAGTCACTACGTCAAACACGTGGACAATCCGAATAAAGATGGCCGCTGCATCACAGCCATTTACTACTTGAATCTCGACTGGGATGTCAAGAGATGTGGGGGTCTGCTCAGGGTATTCCCCGAAGGAACCAACCAGGTGGCTGACATCGCGCCCATCTTCGATAGGATGCTGTTCTTCTGGTCAGATCGAAGAAATCCTCACGAAGTGCAACCTGCTTACTCAACGAGATATGCGATCACATTGTGGTATTTCGACTCTCAAGAACGTGAAGAAGCCCTTAGGAATTTCAGTAAGTTTTTTTTGATAAAATAA
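
The transcript and protein records can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record: it assumes the standard nuclear genetic code (NCBI translation table 1), and the names `CODON_TABLE` and `translate` are hypothetical helpers. It translates the first 30 nt of DPOGS214589-TA and checks that the 1416 bp transcript length equals 471 codons plus one stop codon.

```python
# Standard genetic code (assumed here), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    x + y + z: aa
    for (x, y, z), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO_ACIDS
    )
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt of the DPOGS214589-TA sequence listed above.
first_30_nt = "ATGAATCAAGAAGGCGTACTCGCGAGTTGC"
peptide = translate(first_30_nt)
print(peptide)  # MNQEGVLASC

# Length consistency: 471 residues plus one stop codon account for 1416 bp.
transcript_bp = 1416
protein_aa = 471
lengths_consistent = transcript_bp == (protein_aa + 1) * 3
print(lengths_consistent)  # True
```

The translated peptide matches the first ten residues of DPOGS214589-PA, and the trailing `-` in the protein record corresponds to the terminal stop codon.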

Protein sequence:

>DPOGS214589-PA
MNQEGVLASCAVCNQQTQRRCGRCFSVYYCNTEHQRQDWKRHKINCAPKLQEQGSRIEKNIKSSIHKEEKKKETKKEQLITSSVVNNNKEVLQVGSETRRLKKSKKKTAKKEGSDTISDNKNISQIIENNNESTASVKVDLKDKSVFSSVVYSNKDVNKISAITYEGSSEQEILNEKAQQLSSVDFASASTSNVLKTINRADVKMPPVPIEQSTRMKEYPEASLKGSGAPFNTTMNSYFMDPSDPSYEICQRVIRDMTQYGVCVINNFLGKERGHLVLNEVLEMYRSGIFTAGQLVSSPGSTEAQTIRSDRITWIDGKEPQCHHIGQLISQVDSIILKANKMSNNGKMGNYIINGRTKAMVACYPGSGSHYVKHVDNPNKDGRCITAIYYLNLDWDVKRCGGLLRVFPEGTNQVADIAPIFDRMLFFWSDRRNPHEVQPAYSTRYAITLWYFDSQEREEALRNFSKFFLIK-
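
The InterPro coordinates in the annotation can be mapped back onto this protein sequence. The following is a minimal sketch, assuming the bracketed ranges are 1-based and inclusive (the record does not state this explicitly); `domain_slice` is a hypothetical helper, and only the first 50 residues are used to keep the example short.

```python
def domain_slice(protein: str, start: int, end: int) -> str:
    """Return residues start..end, treating coordinates as 1-based inclusive."""
    return protein[start - 1:end]


# First 50 residues of DPOGS214589-PA, copied from the record above.
protein_prefix = "MNQEGVLASCAVCNQQTQRRCGRCFSVYYCNTEHQRQDWKRHKINCAPKL"

# IPR002893 (Zinc finger, MYND-type) is annotated at [10-46].
mynd = domain_slice(protein_prefix, 10, 46)
print(mynd)       # CAVCNQQTQRRCGRCFSVYYCNTEHQRQDWKRHKINC
print(len(mynd))  # 37
```

Under that coordinate assumption, the extracted segment starts and ends on a cysteine. The same helper applied to the full sequence would extract the [260-452] and [359-452] regions.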