Monarch geneset OGS2.0

DPOGS215714
TranscriptDPOGS215714-TA918 bp
ProteinDPOGS215714-PA305 aa
Genomic positionDPSCF300041 + 164989-175240
RNAseq coverage432x (Rank: top 28%)
Annotation
HeliconiusHMEL0096391e-8448.74% 
BombyxBGIBMGA005808-TA1e-11464.77% 
DrosophilaCG2254-PA1e-8153.19% 
EBI UniRef50UniRef50_Q7PRW26e-8952.54%AGAP000275-PA n=2 Tax=Culicidae RepID=Q7PRW2_ANOGA
NCBI RefSeqXP_001862645.15e-9052.88%short-chain dehydrogenase [Culex quinquefasciatus]
NCBI nr blastpgi|1700533781e-8852.88%short-chain dehydrogenase [Culex quinquefasciatus]
NCBI nr blastxgi|1700533785e-8552.88%short-chain dehydrogenase [Culex quinquefasciatus]
Group
Gene OntologyGO:00054884.9e-61binding
GO:00081523e-31metabolic process
GO:00164913e-31oxidoreductase activity
KEGG pathwaycfa:4869986e-46 
 K11151 (RDH10)maps-> Retinol metabolism
InterPro domain[42-265] IPR0160404.9e-61NAD(P)-binding domain
[44-208] IPR0021983e-31Short-chain dehydrogenase/reductase SDR
[44-61] IPR0023476.3e-21Glucose/ribitol dehydrogenase
Orthology groupMCL25172 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215714-TA
ATGGATTTTCAAGGAGCTCTACAAGTCGTGTTCGACTTACTATGGACTTTGATAAAGGTCAATTATTTGACGGTGGTCGGCATGTGGCGAACCATAATGCCACCAGATCCAAAGGATGTACGGGATGAAGTCGTTCTCATCACGGGTACCGGTCATGGCATCGGTCGTGAGATGGCGTTGCGGTTCGCAAGACTCGGAGCTACCCTGGTGTGTGTCGACATCAACGCCTCCACCAACGAGGAAACCGTCAGGATCATCAAACAGGAGAAGAATAAGGCATTCAGTTATCAGTGTGACGTCACCGATCGCGCTGCAGTGATGCAAATGGCTGAAAAAATCCGTCATGAAGTTGGCGAGGTGTCGATCCTCGTGAACAACGCTGGGATCATGCCCTGCAAGCCGCTCCTTAACCAGACGGAGAAAGAAATAAGACTGATGAACGACATCAACGTCAACGCTAACCTATGGATGATCCAAGCTTTTCTTCCATCAATGATGGAGAGGAACCACGGTCATATAGTGGCGATGTCCTCTATGGCGGGGCTGATGGGACTTCGCAACCTGGTTCCGTACTGTGGGTCAAAGTACGCAGTCAGAGGCATCATGGAGGCATTGGCGATTGAACTCAAAGAGGATCCAAGAGAGTTTAGCGGGATTAAATTCACAACGATATGTCCATACATGGTTGACACTGGGTTATGCAAGAAACCTCGCATCCGGTTTCCTAGTCTGATGAAGGTGGTGTCTGCTAGTGAGACCGCTGACTTGATCGTTGACGCTGTAAGGAGAGACATCTTGGAGATCACCGTCCCACAGGAACTGCACTTCATGAACAGATACATTTATCGTTTCCTACCGTTCCCCGCTGCCGTCGCTTGGAACGAGTTTTTCAACACGGGTGTCGACTCCCACGAATGA

Protein sequence:

>DPOGS215714-PA
MDFQGALQVVFDLLWTLIKVNYLTVVGMWRTIMPPDPKDVRDEVVLITGTGHGIGREMALRFARLGATLVCVDINASTNEETVRIIKQEKNKAFSYQCDVTDRAAVMQMAEKIRHEVGEVSILVNNAGIMPCKPLLNQTEKEIRLMNDINVNANLWMIQAFLPSMMERNHGHIVAMSSMAGLMGLRNLVPYCGSKYAVRGIMEALAIELKEDPREFSGIKFTTICPYMVDTGLCKKPRIRFPSLMKVVSASETADLIVDAVRRDILEITVPQELHFMNRYIYRFLPFPAAVAWNEFFNTGVDSHE-