Monarch geneset OGS2.0

DPOGS205840
Transcript: DPOGS205840-TA (987 bp)
Protein: DPOGS205840-PA (328 aa)
Genomic position: DPSCF300081 - 28860-29846
RNAseq coverage: 20x (Rank: top 79%)
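The transcript length, protein length, and genomic coordinates listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the transcript is a single uninterrupted coding sequence that includes its stop codon; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record for DPOGS205840.
start, end = 28860, 29846   # genomic position on scaffold DPSCF300081
transcript_bp = 987         # reported transcript length
protein_aa = 328            # reported protein length

# Assumption: coordinates are 1-based and inclusive.
span_bp = end - start + 1

# Assumption: the transcript is coding sequence only, stop codon included.
codons = transcript_bp // 3
remainder = transcript_bp % 3

print(span_bp)      # 987, equal to the transcript length
print(codons - 1)   # 328, the codon count minus one stop codon
print(remainder)    # 0, the length is a whole number of codons
```

Under these assumptions the genomic span equals the transcript length, which is what a single-exon gene model would look like.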
Annotation
Heliconius      HMEL017330        3e-79  49.16%
Bombyx          BGIBMGA012527-TA  2e-55  39.56%
Drosophila      CG2070-PA         1e-44  37.72%
EBI UniRef50    UniRef50_G6D279   1e-51  42.65%  Putative RDH13 n=2 Tax=Eumetazoa RepID=G6D279_DANPL
NCBI RefSeq     XP_002115352.1    3e-53  40.55%  hypothetical protein TRIADDRAFT_28989 [Trichoplax adhaerens]
NCBI nr blastp  gi|196010976      5e-52  40.55%  hypothetical protein TRIADDRAFT_28989 [Trichoplax adhaerens]
NCBI nr blastx  gi|321468379      2e-53  41.05%  hypothetical protein DAPPUDRAFT_319680 [Daphnia pulex]
Group
Gene Ontology    GO:0005488  1e-59    binding
                 GO:0008152  3.7e-22  metabolic process
                 GO:0016491  3.7e-22  oxidoreductase activity
KEGG pathway
InterPro domain  [28-312]  IPR016040  1e-59    NAD(P)-binding domain
                 [38-192]  IPR002198  3.7e-22  Short-chain dehydrogenase/reductase SDR
                 [38-55]   IPR002347  4.4e-16  Glucose/ribitol dehydrogenase
Orthology group  MCL34631  Lepidoptera specific

Nucleotide sequence:

>DPOGS205840-TA
ATGTTTCTACTTGTTTTATTTTTAATAATTGTCACTATAATTATTTTATATAGAGCGGTATGCAGGAAACACAACTCCATCTGTACATCCAGAAAAAAACTGAAGGAACGGACAGCTGTCATAACGGGCGGTACTGCTGGTATCGGACAACAAATAGCATTGGACTTTGCTGCGAGAGGTGCAAGAGTTATAATCGCCAGTCCCTTTGAAGATGAAGGAGTTAAAGCACGCAATGAAATTATCAGAAGATCTGGAAATGAGGATGTTATCTATAAATTCCTGGACCTTGCTTCATTGAAGTCCATCCGTAAATTCGCAGCAGATATAAATGAATCAGAGAAACGACTACATATTTTAGTGAATAATGCCGGCATCGGAATACCCGGAGAATTTAAGACTGACGACGGAATGAATTTAGTTATGCAAGTTAATTATTATGGTCACTTCCTCCTAACACTCCTCCTTTTACCACTTTTGATAAAATCTGGGACAAAATTGGAACCGAGCAGAATAATAAACATGTCCTCGATTACACGCTTTATTGGAAGTTTTGATATACATAATTACAACAGAACTAAGTATTGGAATAGTTTTAAAACATATTGTAACAGTAAACTAAGTTTAGTGTTGTTCTCCCATGAATTGACAAAAAAAATTGCTGGTAAAAATGTTGTTATAAACTGTGCCGATCCTGGATGCGTATCCACAAAAATTTTTAAAAGTTATTTTCCTTACGCCGGTAAAATGTGTCAAGTTGTTATAAAAATATTCTTTAAAAATCCATGGGAAGGAGCTCAAACGGCAGTTTATATGGCAGTGGACCAAAAAGCTGGCGAAGTTAGTGGACAAGTATTCAATAATTGCAAATTAAACTCTGCTCATACGTGGAATGATATTGATAAATATTCGGAAATGTTGTGGAAACAATCATCGGAACTAGTGTTTAAAGACGAAAAAGAAATATCAAGCCTTCTCCCAAAGGATTGA

Protein sequence:

>DPOGS205840-PA
MFLLVLFLIIVTIIILYRAVCRKHNSICTSRKKLKERTAVITGGTAGIGQQIALDFAARGARVIIASPFEDEGVKARNEIIRRSGNEDVIYKFLDLASLKSIRKFAADINESEKRLHILVNNAGIGIPGEFKTDDGMNLVMQVNYYGHFLLTLLLLPLLIKSGTKLEPSRIINMSSITRFIGSFDIHNYNRTKYWNSFKTYCNSKLSLVLFSHELTKKIAGKNVVINCADPGCVSTKIFKSYFPYAGKMCQVVIKIFFKNPWEGAQTAVYMAVDQKAGEVSGQVFNNCKLNSAHTWNDIDKYSEMLWKQSSELVFKDEKEISSLLPKD-
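The protein sequence above appears to be a direct translation of the nucleotide sequence, with the trailing `-` marking the stop codon. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1) and reading frame 1. The `translate` helper and its `stop` parameter are hypothetical names introduced here, not part of any database tool. Only the first 30 and last 9 nucleotides of DPOGS205840-TA are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the first base varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna, stop="-"):
    """Translate a coding sequence in frame 1; stop codons print as `stop`."""
    dna = dna.upper()
    protein = []
    # Step through complete codons only; trailing 1-2 bases are ignored.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino = CODON_TABLE[dna[i:i + 3]]
        protein.append(stop if amino == "*" else amino)
    return "".join(protein)


# First 30 nt and last 9 nt of DPOGS205840-TA, copied from the record.
head = "ATGTTTCTACTTGTTTTATTTTTAATAATT"
tail = "AAGGATTGA"

print(translate(head))  # MFLLVLFLII
print(translate(tail))  # KD-
```

Both fragments agree with the start (`MFLLVLFLII`) and end (`KD-`) of DPOGS205840-PA. Passing the full 987 nt sequence to the same function would be the way to check the whole protein.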