Monarch geneset OGS2.0

DPOGS208720
Transcript: DPOGS208720-TA (948 bp)
Protein: DPOGS208720-PA (315 aa)
Genomic position: DPSCF300043 + 11337-14016
RNAseq coverage: 8x (Rank: top 86%)
Annotation
Heliconius       HMEL015271        2e-119  65.81%
Bombyx           BGIBMGA000813-TA  5e-58   44.82%
Drosophila       CG30491-PA        4e-59   42.86%
EBI UniRef50     UniRef50_C4WWA5   1e-90   54.02%  ACYPI007265 protein n=3 Tax=Acyrthosiphon pisum RepID=C4WWA5_ACYPI
NCBI RefSeq      XP_973517.2       1e-92   56.19%  PREDICTED: similar to Retinol dehydrogenase 13 [Tribolium castaneum]
NCBI nr blastp   gi|270014891      3e-94   56.51%  hypothetical protein TcasGA2_TC010879 [Tribolium castaneum]
NCBI nr blastx   gi|270014891      9e-96   56.51%  hypothetical protein TcasGA2_TC010879 [Tribolium castaneum]
Group
Gene Ontology    GO:0005488  9.9e-63  binding
                 GO:0008152  3.9e-17  metabolic process
                 GO:0016491  3.9e-17  oxidoreductase activity
KEGG pathway     cfa:490744  4e-65
                 K11153 (RDH12), maps to Retinol metabolism
InterPro domain  [10-304]  IPR016040  9.9e-63  NAD(P)-binding domain
                 [17-159]  IPR002198  3.9e-17  Short-chain dehydrogenase/reductase SDR
                 [18-35]   IPR002347  1.1e-15  Glucose/ribitol dehydrogenase
Orthology group 

Nucleotide sequence:

>DPOGS208720-TA
ATGGGTTGGTTTAGTGGACGCTGTTATAGTGATGCAAAATTAAACGGTAAAACAATAATTGTCACCGGATGCAACACCGGAATCGGCAAGGTTACCGTGGAGGAATTTTATAAAAGAGGAGCAAAAGTTATCATGGCCTGTAGAGACGTTGGAAAGGCGGAGGAAGCTAAGATTGATATAAAAGAAACTTGTAAAAATTCACCAAACAAAGGCGAACTCATCGTTGAAGAATGTGACTTATCCTCTTTTAAATCGATTAGAAATTTTAGTCAAAAGGTTCTCAAAAGTAAAACTGAAATTAACGTTTTGGTGAACAACGCTGGAGTGATGATGGCCCCTCGAGGAGAGACGGAAGATGGTTTTGAAACTCATTTCGGTACAAATCACCTCGGACATTTCCTCCTTACAATGCTTTTACTGCCCAGGATAATTAAAAGCACCCCAGCAAGAATAGTTACTGTATCCTCTAAAGCACATTCGTTGTTTAATTTACATTTGGAGGACCTGAACTATACACTAAGACCATATAATTCTGCTGAGGCATATGCACAAAGTAAAATAGCAAATATTTTATTTTCGAGAGAACTATCCAAAAAACTCAAGAGTTACAATATCCAAGGCATAAACACTTACAGCCTCCATCCGGGTTTAATTAAAACTGATTTATATCGACATTTGAATAGCCCGATCAGAAGTTTAATAAGAACCATTGTCGTGGATTACATTTTCTATCCATTTTCGAAAACTATAGAAATGGGAGCTCAAACTACTATTTACTGCGCTATAGATGAAAAATGTTCCAATGAGACCGGTCTTTATTACACTGACTGCACAGTGACGTCACCAAGTACACATGCTCTAAATGATGAGAATGCAAAAAAATTGTGGGATATGTCGATGGAAATGGTGGGATTAAAGGATTGTAATCCATTCACCTCCGTTTATTGA

Protein sequence:

>DPOGS208720-PA
MGWFSGRCYSDAKLNGKTIIVTGCNTGIGKVTVEEFYKRGAKVIMACRDVGKAEEAKIDIKETCKNSPNKGELIVEECDLSSFKSIRNFSQKVLKSKTEINVLVNNAGVMMAPRGETEDGFETHFGTNHLGHFLLTMLLLPRIIKSTPARIVTVSSKAHSLFNLHLEDLNYTLRPYNSAEAYAQSKIANILFSRELSKKLKSYNIQGINTYSLHPGLIKTDLYRHLNSPIRSLIRTIVVDYIFYPFSKTIEMGAQTTIYCAIDEKCSNETGLYYTDCTVTSPSTHALNDENAKKLWDMSMEMVGLKDCNPFTSVY-
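
The transcript and protein records above appear to be consistent with each other: 948 bp corresponds to 316 codons, that is, 315 amino acids plus one stop codon, shown as the trailing "-" in the protein sequence. The following is a minimal sketch of how that relationship could be checked, assuming the standard genetic code applies to this nuclear transcript. The function name translate and the stop_symbol parameter are illustrative choices, not part of the geneset or of any library.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as stop_symbol to match the record's notation.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


if __name__ == "__main__":
    # Length check taken from the record: 315 aa plus one stop codon.
    print(3 * (315 + 1))                       # 948
    # First seven and last four codons of DPOGS208720-TA.
    print(translate("ATGGGTTGGTTTAGTGGACGC"))  # MGWFSGR
    print(translate("TCCGTTTATTGA"))           # SVY-
```

Passing the full DPOGS208720-TA sequence to translate and comparing the result with DPOGS208720-PA would extend the same check to the whole record.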