Monarch geneset OGS2.0

DPOGS203541
TranscriptDPOGS203541-TA999 bp
ProteinDPOGS203541-PA332 aa
Genomic positionDPSCF300055 + 268906-269904
RNAseq coverage777x (Rank: top 17%)
Annotation
HeliconiusHMEL0132031e-9162.02% 
BombyxBGIBMGA004341-TA2e-7249.47% 
DrosophilaCG7675-PB5e-5638.59% 
EBI UniRef50UniRef50_G6D2794e-5645.65%Putative RDH13 n=2 Tax=Eumetazoa RepID=G6D279_DANPL
NCBI RefSeqXP_002115352.11e-6443.00%hypothetical protein TRIADDRAFT_28989 [Trichoplax adhaerens]
NCBI nr blastpgi|1960109762e-6343.00%hypothetical protein TRIADDRAFT_28989 [Trichoplax adhaerens]
NCBI nr blastxgi|1959970538e-6344.79%hypothetical protein TRIADDRAFT_18543 [Trichoplax adhaerens]
Group
Gene OntologyGO:00054881.4e-69binding
GO:00081524.7e-25metabolic process
GO:00164914.7e-25oxidoreductase activity
KEGG pathwaydre:4365971e-52 
 K11153 (RDH12)maps-> Retinol metabolism
InterPro domain[46-326] IPR0160401.4e-69NAD(P)-binding domain
[54-191] IPR0021984.7e-25Short-chain dehydrogenase/reductase SDR
[55-72] IPR0023473.4e-15Glucose/ribitol dehydrogenase
Orthology groupMCL25010 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203541-TA
ATGTTGTACTACATTGCATATATTTTGCTCACGATATTACTTTTAGCGGTCAAAGTCGTGGTTGGTTTCTTCTTATTTGTATTTTTGTGTTTTGCCATCGCAAGGCTGTGGTTTGAACCAATAAAGGGTGTGTGTAGAGCCAAAACCAAGCTTCATGGAAAAGTGGCACTGATCACCGGCGGGAATTCGGGAATAGGGCTGGAAACGGCGAAGGATTTGGCGCAGAGAGGCGCTAGGGTCGTCATAGCCAGCAGAAATGATAAAAAATCAGCGGAAGCCGTCGAAGAAATCAAACGGATCACTGGAAACGAGAAAGTGGAATATAGACATTTAAATCTTAGAGACATGGACAGCGTCAGGGAGTTCGCAAAGAAATTCAACGAAGAGTTCGACCGTTTAGACCTTCTGGTAAACAACGCTGGCATCGGAGCAGCGAAGAACGCGCTGACAGCTGACAATATAGACATCCTGATGGCCATCAACTACGTGGGTCCGTTCCTCCTCACGCACTTACTACTAGATAAAATTAAAGCCACTAAAACAAGTAGAATCGTCATAGTGTCGTCATACCTCCACTTCCACGCCAACTTTGAGCTGGACGACCTCACGAGGGTTACAACAAAAAATACATTGATCAAGTACTGTAATGCAAAACTCTGCGATGTTCTGTGGACGAAGGAGCTCTCCAGAAGATTGCCAGCAGGTGTAACGGTGAACGTACTCCATCCAGGTCTAGTGAAGACCAACATTTTTGATACCTTACACAAATGTTTAAAGAATCCGCTGTATGTTATTATCGATCTGCTTTTCAAAACGGCGAAAGAAGGTGCACAGACTGTTATATACCTGTGTGTAGATCCAGCAGTCGAGAACATGACAGGAGGCTACTACATGGACTGTAAGAAAATACCCTCGTCGAAACTATCGGAAGATGAAGACCTCGCGAAAGCATTGTGGGACAAAACATTAGAGTTGGTTTGCGTCAAACCCGTCATATAA

Protein sequence:

>DPOGS203541-PA
MLYYIAYILLTILLLAVKVVVGFFLFVFLCFAIARLWFEPIKGVCRAKTKLHGKVALITGGNSGIGLETAKDLAQRGARVVIASRNDKKSAEAVEEIKRITGNEKVEYRHLNLRDMDSVREFAKKFNEEFDRLDLLVNNAGIGAAKNALTADNIDILMAINYVGPFLLTHLLLDKIKATKTSRIVIVSSYLHFHANFELDDLTRVTTKNTLIKYCNAKLCDVLWTKELSRRLPAGVTVNVLHPGLVKTNIFDTLHKCLKNPLYVIIDLLFKTAKEGAQTVIYLCVDPAVENMTGGYYMDCKKIPSSKLSEDEDLAKALWDKTLELVCVKPVI-