Monarch geneset OGS2.0

DPOGS210802
TranscriptDPOGS210802-TA1365 bp
ProteinDPOGS210802-PA454 aa
Genomic positionDPSCF300027 - 870284-874677
RNAseq coverage1889x (Rank: top 7%)
Annotation
HeliconiusHMEL0056572e-14590.26% 
BombyxBGIBMGA007118-TA0.077.04% 
DrosophilaCG5590-PA2e-15560.31% 
EBI UniRef50UniRef50_A4FUZ65e-13154.63%Hydroxysteroid dehydrogenase-like protein 2 n=212 Tax=root RepID=HSDL2_BOVIN
NCBI RefSeqNP_001040436.10.075.94%hydroxysteroid dehydrogenase [Bombyx mori]
NCBI nr blastpgi|1140518680.075.94%hydroxysteroid dehydrogenase [Bombyx mori]
NCBI nr blastxgi|1140518680.075.94%hydroxysteroid dehydrogenase [Bombyx mori]
Group
Gene OntologyGO:00054884e-55binding
GO:00329341.6e-29sterol binding
GO:00081524.5e-20metabolic process
GO:00164914.5e-20oxidoreductase activity
KEGG pathwaymxa:MXAN_73106e-99 
 K13775 (atuG)maps-> Geraniol degradation
InterPro domain[5-243] IPR0160404e-55NAD(P)-binding domain
[344-453] IPR0030331.6e-29SCP2 sterol-binding domain
[14-186] IPR0021984.5e-20Short-chain dehydrogenase/reductase SDR
[14-31] IPR0023472.4e-09Glucose/ribitol dehydrogenase
Orthology groupMCL12490 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210802-TA
ATGAGTTTAGTTGCGAATACAGGGAAGTTGGCTGGTCGTACCCTATTCATAACGGGAGCATCCCGTGGTATTGGTAAAGCTATCGCACTCAAAGCGGCCAAAGATGGAGCTAACGTTGTTGTTGCCGCTAAAACCGCAGAACCTCATCCCAAATTACCGGGAACTATTTACACGGCCGCAGAAGAGATTGAGGCTCTTGGTGGAAAAGCTTTGCCTTGCATTGTTGATGTGAGAGATGAGAAACAGATTCAGAAAGCCATCGATGAAGCTGTTAAAAAGTTCAATGGCATTGATATACTCATAAACAATGCTTCGGCCATATCACTCACTGGGACTGCGGAGACCGATATGAAGAGATACGACCTCATGCACAATATTAACACCAGGGGCACATTTTTGGCATCAAAACTATGCCTGCCGTTGTTGAAAGAAAGCAACCACGCTCACATCCTAAACCTGTCGCCACCACTTAACATGAATCCTTATTGGTTTTCACTGCACGTTGCTTACACAATGGCTAAATATGGGATGTCTATGTGTGTGCTGGGGATGAGTGAAGAATTTAGACAATTCAATATTGGGGTTAATGCACTTTGGCCAAAAACTGCTATCGCGACAGCCGCCATTGAAATGTTGACTGGCGACACTTCGTCCAGTCGCAAACCGGAAATAGTCTCAGATGCTGCCTACGTCATGTTGAGCAAAGACCCTAAATCATACACGGGCAAGTTTGAGATAGATGAAGATGTAGTTAAATCAGTCGGAATCAAAGACCTCGCGCCCTATGCTTGTGATCCAAAGAACGCAAATAATCTGCTGTTGGATGGTTTCTTGGATGATCCAGCGTCATTACTCCATCATCAGCAGACAGTCTCCTCCGCTTCCATACGACGTTATCATACAACTTCCGCCAATTATAAGGATAAGGTAACACTGTTGGTACTGGGTAATGTTAACAATTTGTTACCTGACTTCTTTTTGGATTTACCCGGGCACCAAACGCAAGAGGTCAAAAAGAGTGAGCCGGCAGGACAGATCCCGGAACTATTTTCAGTTATCAACAAGACGATAACACCTGAATTAGTTAAAAAAACACAGGCCGTGTTCCAGTTTAATGTGAAAGGTAAAGAGGAAGGTATATGGCACCTCGATCTCAAGAACGGTGACGGAGCCTGCGGTCAGGGGGAACCAAAACATGCACCCGATGCCACCCTCACCATGGACAGCACCAACTTCGCTGATATGTTCGCTGGGAAATTGAAGCCGACCACAGCCTTTATGATGGGCAAGCTGAAAATAAAGGGGGACATGCAGAAGGCGATGAAACTCGAGAAAATGATGAAATCACTCAAAGCTAAAGTGTAA

Protein sequence:

>DPOGS210802-PA
MSLVANTGKLAGRTLFITGASRGIGKAIALKAAKDGANVVVAAKTAEPHPKLPGTIYTAAEEIEALGGKALPCIVDVRDEKQIQKAIDEAVKKFNGIDILINNASAISLTGTAETDMKRYDLMHNINTRGTFLASKLCLPLLKESNHAHILNLSPPLNMNPYWFSLHVAYTMAKYGMSMCVLGMSEEFRQFNIGVNALWPKTAIATAAIEMLTGDTSSSRKPEIVSDAAYVMLSKDPKSYTGKFEIDEDVVKSVGIKDLAPYACDPKNANNLLLDGFLDDPASLLHHQQTVSSASIRRYHTTSANYKDKVTLLVLGNVNNLLPDFFLDLPGHQTQEVKKSEPAGQIPELFSVINKTITPELVKKTQAVFQFNVKGKEEGIWHLDLKNGDGACGQGEPKHAPDATLTMDSTNFADMFAGKLKPTTAFMMGKLKIKGDMQKAMKLEKMMKSLKAKV-