Monarch geneset OGS2.0

DPOGS215781
TranscriptDPOGS215781-TA834 bp
ProteinDPOGS215781-PA277 aa
Genomic positionDPSCF300041 + 1856640-1858096
RNAseq coverage418x (Rank: top 29%)
Annotation
HeliconiusHMEL0065354e-11669.68% 
BombyxBGIBMGA003658-TA1e-10564.62% 
DrosophilaCG3609-PB2e-5842.55% 
EBI UniRef50UniRef50_D6W7631e-5643.17%Putative uncharacterized protein (Fragment) n=3 Tax=Tribolium castaneum RepID=D6W763_TRICA
NCBI RefSeqXP_320004.42e-5943.49%AGAP009225-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582999923e-5843.49%AGAP009225-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582999926e-5643.35%AGAP009225-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00054889.1e-24binding
GO:00164912.6e-11oxidoreductase activity
GO:00081524e-08metabolic process
GO:00551144e-08oxidation-reduction process
KEGG pathwayxla:4445392e-51 
 K00078 (DHDH)maps-> Pentose and glucuronate interconversions
    Metabolism of xenobiotics by cytochrome P450
InterPro domain[4-85] IPR0160409.1e-24NAD(P)-binding domain
[5-60] IPR0006832.6e-11Oxidoreductase, N-terminal
[77-165] IPR0041044e-08Oxidoreductase, C-terminal
Orthology groupMCL11088 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215781-TA
ATGGCAATAAGCAATGATATTGATGTGGCTTACATTGGAGCACTCAATCACGACCACTACGCACTATCGAAATTATTCCTTGAATCTGGCAAACATGTGTTATGTGAAAAGCCGTTTTGTCTAAATGCCAAACAAGTGGAAAGCTTGGTGAAAATTGCTAAAAACCAAAACCTTTTCTTAATGGAGGCCCTATGGTCTCGTTTTGCACCATTTTACGTTAATTTGGAAGAACAACTAGAATCAGGAATTATCGGTCGACCTCAATTTGTAGAAGTAAATTTTGGATTGCCGATTGAGAACGTGGAGAGGCTAAGAAAAAGGGATATGGGAGGCGGCGCGCTGATGGACATTGGTATTTATACAGTACATTTTGCACAACTGGTGTTCAAAGAGGATCCTATCAAGATAACAGCAGTTGGTGAATTAAATGATGATGGGGTGGATTGTGTGGAAACCGTTATCTTGGAATATTCTGAAGGAAGACGGGCTGTTTTAAACAATCACGCCAAAGTAAAACTCTGGAACAAAGCTACTGTTGTAGGCGACTACGGAAAAAGAATAACATTTGAGGATCCTTTCAACATGCCAGACACTATGATCCACGCTGATGGTCGCGTCGAAAAATTTGAGTTCCATCCTTCGAAGATTCCCTATAATTTCATGAACAGCGCCGGTTTGGTCTACGAAGCTTTGGAAACGGTTCGTTGCATTAAGGAAGGCTTAAAGGAGTCACCAATCATGAGTCACAGTAGAAGTCTCCTATTGATTAAAATCTTGGATACTGTGCGAAAACAACTCGGCGTACATTATGATGTTGATGATCAAGATTTTTAA

Protein sequence:

>DPOGS215781-PA
MAISNDIDVAYIGALNHDHYALSKLFLESGKHVLCEKPFCLNAKQVESLVKIAKNQNLFLMEALWSRFAPFYVNLEEQLESGIIGRPQFVEVNFGLPIENVERLRKRDMGGGALMDIGIYTVHFAQLVFKEDPIKITAVGELNDDGVDCVETVILEYSEGRRAVLNNHAKVKLWNKATVVGDYGKRITFEDPFNMPDTMIHADGRVEKFEFHPSKIPYNFMNSAGLVYEALETVRCIKEGLKESPIMSHSRSLLLIKILDTVRKQLGVHYDVDDQDF-