Monarch geneset OGS2.0

DPOGS215703
TranscriptDPOGS215703-TA912 bp
ProteinDPOGS215703-PA303 aa
Genomic positionDPSCF300041 - 145108-147442
RNAseq coverage5325x (Rank: top 2%)
Annotation
HeliconiusHMEL0096392e-12768.75% 
BombyxBGIBMGA005763-TA9e-12270.11% 
DrosophilaCG14946-PC3e-8547.70% 
EBI UniRef50UniRef50_Q7PRW25e-9553.42%AGAP000275-PA n=2 Tax=Culicidae RepID=Q7PRW2_ANOGA
NCBI RefSeqXP_310845.45e-9553.47%AGAP000275-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479635142e-9453.42%AGAP000275-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3454978371e-9053.75%PREDICTED: epidermal retinol dehydrogenase 2-like [Nasonia vitripennis]
Group
Gene OntologyGO:00054883.2e-60binding
GO:00081523.1e-29metabolic process
GO:00164913.1e-29oxidoreductase activity
KEGG pathwaycfa:4869984e-49 
 K11151 (RDH10)maps-> Retinol metabolism
InterPro domain[42-269] IPR0160403.2e-60NAD(P)-binding domain
[44-208] IPR0021983.1e-29Short-chain dehydrogenase/reductase SDR
[44-61] IPR0023474.5e-18Glucose/ribitol dehydrogenase
Orthology groupMCL12872 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215703-TA
ATGAAGATCTACGAAGGCGTGGTAGTGACTCTGGAGGTGATCGTTCTCATTATCAAGATGTACGCCACTTGGTTCTACCGAATGTACAAGTTCTTTGTGCCTAATGAACCTAAGAGTGTCAAGGGAGAGATTGTACTGATCACTGGCGCTGGTCATGGTATGGGCCGGGAGATGGCACTCAGCTTCGCGAAGCTCGGCAGTATAGTGGTATGTGTGGACATCAATCCCACCAGCAACGAGGAGACAGTGAATATGGTCAAGGAAAACAAGGGGCAAGCACATAGCTATCTATGTGACGTAACCAACCGGTCAGCCATCAATGAAATGGCAGAAAAAATCCGCAAGGAGGTAGGCGAGGTGTCGATCCTCGTGAACAACGCTGGCATCATGCCTTGCAAGCCGCTCCTCAAACAGACGGAGAAAGAAATTAGGGCTACCTTTGAGGTCAACTGTCTCGCCCACATCTGGCTCTTCCAAGCTTTTCTTCCATCGATGATGGAGAGGAACCACGGTCATATAGTGGCGATGTCCTCTATGGCGGGGGTGCTTGGACTCAGGAATCTGGTGCCCTACTGCGGTACAAAGTTCGCCGTGAGAGGAATGATGGAGGCCGTGTATGAGGAATTAAGGGAGGATCCCAGAGACTTCAGCGGAATTAAACTGACTTGCATCTGTCCTTACATTGTGGACACGGGCTTATGCAAGAATCCCAAGATTAAATTCCCCACCCTGATGAAGATCCTGTCGCCCAAGGAGGCGGTGCAGGACATCATAGACGCCGTCAGGAGAGAATATAACGAGATCACTATACCCAGCTCGTTGTATTACACCAATCAGTTCCTGCGGATGTTCCCCCGCGAGGTGCCGCTACACTTCAAAGACTTCCTGGACTCTGGCTTGGAGGCTGACTAA

Protein sequence:

>DPOGS215703-PA
MKIYEGVVVTLEVIVLIIKMYATWFYRMYKFFVPNEPKSVKGEIVLITGAGHGMGREMALSFAKLGSIVVCVDINPTSNEETVNMVKENKGQAHSYLCDVTNRSAINEMAEKIRKEVGEVSILVNNAGIMPCKPLLKQTEKEIRATFEVNCLAHIWLFQAFLPSMMERNHGHIVAMSSMAGVLGLRNLVPYCGTKFAVRGMMEAVYEELREDPRDFSGIKLTCICPYIVDTGLCKNPKIKFPTLMKILSPKEAVQDIIDAVRREYNEITIPSSLYYTNQFLRMFPREVPLHFKDFLDSGLEAD-