Monarch geneset OGS2.0

DPOGS209367
Transcript: DPOGS209367-TA (1044 bp)
Protein: DPOGS209367-PA (347 aa)
Genomic position: DPSCF300118, 234438-235481
RNAseq coverage: 170x (Rank: top 51%)
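
The lengths in this record can be cross-checked against each other. The sketch below assumes the genomic coordinates are 1-based and inclusive, which the record does not state; `span_length` is a hypothetical helper, not part of any database tool.

```python
def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Genomic span on scaffold DPSCF300118, coordinates taken from the record.
genomic_len = span_length(234438, 235481)

# Coding length implied by the protein: 347 codons plus one stop codon.
cds_len = 347 * 3 + 3

print(genomic_len, cds_len)
```

Under that assumption both values equal the stated transcript length of 1044 bp, which would be consistent with a gene model that has no introns.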
Annotation
Heliconius: HMEL013116  1e-77  62.93%
Bombyx: BGIBMGA005692-TA  6e-55  48.28%
Drosophila: CG9519-PA  1e-26  39.38%
EBI UniRef50: UniRef50_Q95NZ0  2e-51  48.47%  Ecdysone oxidase n=1 Tax=Spodoptera littoralis RepID=Q95NZ0_SPOLI
NCBI RefSeq: XP_972484.1  7e-29  33.65%  PREDICTED: similar to CG9518 CG9518-PA [Tribolium castaneum]
NCBI nr blastp: gi|379699044  3e-52  48.28%  ecdysone oxidase [Bombyx mori]
NCBI nr blastx: gi|379699044  1e-49  47.80%  ecdysone oxidase [Bombyx mori]
Group
Gene Ontology: GO:0016614  3.5e-34  oxidoreductase activity, acting on CH-OH group of donors
    GO:0055114  3.5e-34  oxidation-reduction process
KEGG pathway: hse:Hsero_4768  3e-26
    K00119 (E1.1.99.-) maps-> Benzoate degradation via hydroxylation
    Limonene and pinene degradation
    Phosphonate and phosphinate metabolism
InterPro domain: [63-199] IPR007867  3.5e-34  Glucose-methanol-choline oxidoreductase, C-terminal
Orthology group: MCL30361  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209367-TA
ATGTCATTCAACGCGTCGAACAAAAATGTTCCCGATTTCGCCATTTACACCAGCTGCATGCCGGTGGACACGCGGTACTACGAAAGTTGTAGGAGCGTTTTAAATTTAAGTCCACATATGTGCTCGAAAATTCAAGAAGTGAATAAAAGATATGAGGTTTTCACTTTGAGCGTCGTGAATCTGAAGCCAAACTCACGAGGAAGGGTTCAACTGAAGTCAGCGGATCCTTTGGAGCCGCCTCGCATCTATTCGGGGACGTTTAGTGACCCCAGTGACTTGACGTACTATCCGGACGCGATTCGCAAAGCTTTATCTATAATCAGAACTTCATATTTCCGATCTAAGAACGCTTTCCCGTTAGACTTCAACTTGAAGAATTGTGTTTCACTATCCGACGACGAACGTTTCAAGTGCATAGCAAAGAATTTGGCCATGACGGCTTGGCATTCCGTCGGAACGGCGCCGATGGGAACAGTTTTGGATTCAAAATTAAGAGTCAAAGGTGTTTCCGGTTTAAGGGTGGCCGACGCTAGCTCGATGCCCAAAGTGATTCGAGGGAATACGAATTCCCCCGTGGTTATGATAGCCGAGAGAGCAGCAGATTTTATCAAAGAAGCTGTTGGAAAACCATACCACATGCCGACATCCGACAGCAAGCCGACGAGCTGGAAATATAAATACCCTAAACCGAACCAGAACACGCGGCCGAACATCAACAATCATCCCGAAGCAGACACCGTCGTGTACAACGACCAGAACTCATATACAAACTCCTTAAACAGTCCCAACCAATACAACAACTACCCGAACAGGTATCCCGCCTCCGGCACGAATCCCAACGGCTTGACGGGCTGGGCGGCGGTCGCTACCACGGCCATCGAGACGGTCGGGTCTGTTGTAGATTCATTCATCAAAGTGAGAATACCACAACTGGGCGGACTCATAGCGGGCAGCTACAGCAACAATCCCAAGGACGCCGGAAACATATCAAGTACTGAGAAGTACGATAGGGCGGAAACGACGACACCATCAGGGCAGCCTTAA

Protein sequence:

>DPOGS209367-PA
MSFNASNKNVPDFAIYTSCMPVDTRYYESCRSVLNLSPHMCSKIQEVNKRYEVFTLSVVNLKPNSRGRVQLKSADPLEPPRIYSGTFSDPSDLTYYPDAIRKALSIIRTSYFRSKNAFPLDFNLKNCVSLSDDERFKCIAKNLAMTAWHSVGTAPMGTVLDSKLRVKGVSGLRVADASSMPKVIRGNTNSPVVMIAERAADFIKEAVGKPYHMPTSDSKPTSWKYKYPKPNQNTRPNINNHPEADTVVYNDQNSYTNSLNSPNQYNNYPNRYPASGTNPNGLTGWAAVATTAIETVGSVVDSFIKVRIPQLGGLIAGSYSNNPKDAGNISSTEKYDRAETTTPSGQP-
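
The protein sequence can be checked against the nucleotide sequence by translating codon by codon. This is a minimal sketch using the standard genetic code; the `translate` function and its `stop_symbol` parameter are illustrative names, and the trailing `-` in the protein record is assumed to mark the stop codon.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, stop_symbol: str = "*") -> str:
    """Translate a DNA coding sequence in frame 1 using the standard code."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # KeyError on non-ACGT characters
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First ten codons and last three codons of DPOGS209367-TA.
print(translate("ATGTCATTCAACGCGTCGAACAAAAATGTT"))
print(translate("CAGCCTTAA", stop_symbol="-"))
```

The two fragments translate to `MSFNASNKNV` and `QP-`, matching the start and end of DPOGS209367-PA. Passing the full nucleotide sequence in the same way would check the whole record.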