Monarch geneset OGS2.0

DPOGS204227
Transcript        DPOGS204227-TA  1116 bp
Protein           DPOGS204227-PA  371 aa
Genomic position  DPSCF300046 - 664759-669159
RNAseq coverage   77x (Rank: top 65%)
Annotation
Heliconius      HMEL015151        9e-98   58.73%
Bombyx          BGIBMGA007508-TA  5e-160  74.16%
Drosophila      CG18003-PC        3e-114  54.62%
EBI UniRef50    UniRef50_Q9UJM8   1e-108  54.52%  Hydroxyacid oxidase 1 n=67 Tax=Eukaryota RepID=HAOX1_HUMAN
NCBI RefSeq     XP_970519.1       4e-134  62.97%  PREDICTED: similar to AGAP010885-PA [Tribolium castaneum]
NCBI nr blastp  gi|91083635       8e-133  62.97%  PREDICTED: similar to AGAP010885-PA [Tribolium castaneum]
NCBI nr blastx  gi|156544032      7e-130  61.73%  PREDICTED: hydroxyacid oxidase 1-like [Nasonia vitripennis]
Group
Gene Ontology    GO:0010181  1.2e-163  FMN binding
                 GO:0055114  1.2e-163  oxidation-reduction process
                 GO:0016491  1.2e-163  oxidoreductase activity
                 GO:0008152  3.1e-142  metabolic process
                 GO:0003824  3.1e-142  catalytic activity
KEGG pathway     tca:659094  1e-133
                 K11517 (HAO) maps-> Peroxisome
                                     Glyoxylate and dicarboxylate metabolism
InterPro domain  [1-367]   IPR012133  1.2e-163  Alpha-hydroxy acid dehydrogenase, FMN-dependent
                 [5-367]   IPR013785  3.1e-142  Aldolase-type TIM barrel
                 [16-363]  IPR000262  1.9e-128  FMN-dependent dehydrogenase
Orthology group  MCL13730  Single-copy universal gene

Nucleotide sequence:

>DPOGS204227-TA
ATGGAGAAATATATTAGTGTTAAAGATTTTGAAGACGCCGCGCTGGCAGCGCTGCCGAAGACTGTCAGGGACTATTATAAGAGTGGAGCGACAGACGAATACACTTTGGCGGAAAATAGACGAGCATTTCAGAGATTAAGAATACGTCCAAAATGTCTGGTGGGTATCAAAGGATGCGATACTTCGACTACCATACTAGGTGAGAAGGTCTCGATGCCGGTCGGAATCTCCCCCACGGCCATGCAGAGAATGGCGCACCCTGATGGCGAAACCGCTACCGCACGAGCTGCTCAAGCTGAACGTGTGATATACACTTTGAGTACAATCTCCACCAGCTCCATAGAAGAGGTAGCTCAAGCAGCCCCAAACGCTGTGAAGTGGTTTCAACTCTATATCTATAATGACAGGGAAATTACAAAGAATCTGGTTTTAAGAGCTGAAAAAGCAGGCTTTAAAGCAATAGCGTTGACCGTGGATACACCTCTGTTTGGTCTAAGAAGAGCAGATATCCGAAATAAGTTTACACTTCCCAAACACTTGACGTTGGCCAATTTTGAAGGACATTTATCTAATAAAATACATAGTTCCGGCGAAGGCAGCGGTTTGAGTCATTACGTTAACAATTTGTTCGATCCATCATTAACTTGGGACGAAATAAGATGGCTGAAGAGTATAACCAAGCTGCCGATCATAGCGAAGGGCATCCTCCGCGGTGATGACGCGGCTCGTGCTGCGCGGGCGGGGTGCTCCGCCGTGCTCGTCTCTAACCACGGGGCGAGACAACTGGACGGAGTACCCGCCACGATCGAGGTTCTTCCAGAAATCATAGCAGCGGTTGAGCAATACAATGTCGAAGTGTACTTGGACGGAGGAGTCACCACAGGGACAGACGTTTACAAAGCCTTAGCCCTGGGAGCGAAAATGGCAAGTATTCTTGTGTTTGTTGGTCGTCCGGCCCTTTGGGGACTGGCGGTGGCCGGACAAGAAGGTGTCCAGAGAATGTTGAACATTATTCGTAAAGAATTAGAGTACACTTTACAAATTGCAGGAACTCAAACTGTACCGGAAATAACAAAAGACATGGTGCGACACGAGTCTACTTACAGTAGACTGTGA

Protein sequence:

>DPOGS204227-PA
MEKYISVKDFEDAALAALPKTVRDYYKSGATDEYTLAENRRAFQRLRIRPKCLVGIKGCDTSTTILGEKVSMPVGISPTAMQRMAHPDGETATARAAQAERVIYTLSTISTSSIEEVAQAAPNAVKWFQLYIYNDREITKNLVLRAEKAGFKAIALTVDTPLFGLRRADIRNKFTLPKHLTLANFEGHLSNKIHSSGEGSGLSHYVNNLFDPSLTWDEIRWLKSITKLPIIAKGILRGDDAARAARAGCSAVLVSNHGARQLDGVPATIEVLPEIIAAVEQYNVEVYLDGGVTTGTDVYKALALGAKMASILVFVGRPALWGLAVAGQEGVQRMLNIIRKELEYTLQIAGTQTVPEITKDMVRHESTYSRL-
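
The lengths reported in this record can be cross-checked with a little arithmetic: a 1116 bp coding transcript holds 372 codons, which is 371 amino acids plus one stop codon (shown as the trailing "-" in the protein sequence). The Python sketch below illustrates that check. The function names are hypothetical and not part of any database tooling, and the span calculation assumes the genomic coordinates are 1-based and inclusive, which the record does not state.

```python
def cds_matches_protein(cds_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Return True if a coding sequence of cds_bp nucleotides can encode
    protein_aa residues, optionally counting one terminal stop codon."""
    if cds_bp % 3 != 0:
        # A complete coding sequence must be a whole number of codons.
        return False
    codons = cds_bp // 3
    expected = protein_aa + (1 if has_stop else 0)
    return codons == expected


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Values taken from the record above.
print(cds_matches_protein(1116, 371))   # True: 1116 / 3 = 372 = 371 aa + stop
print(genomic_span(664759, 669159))     # 4401 under the inclusive-coordinate assumption
```

If the coordinates turn out to be 0-based or half-open, the span would differ by one; the codon check does not depend on that assumption.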