Monarch geneset OGS2.0

DPOGS213161
TranscriptDPOGS213161-TA762 bp
ProteinDPOGS213161-PA253 aa
Genomic positionDPSCF300016 + 1312023-1313735
RNAseq coverage390x (Rank: top 31%)
Annotation
HeliconiusHMEL0150803e-10477.25% 
BombyxBGIBMGA007919-TA1e-11680.24% 
DrosophilaCG3603-PA3e-7855.60% 
EBI UniRef50UniRef50_Q9W4U24e-7655.60%CG3603 n=24 Tax=Coelomata RepID=Q9W4U2_DROME
NCBI RefSeqXP_001991088.17e-8157.48%GH12482 [Drosophila grimshawi]
NCBI nr blastpgi|2700011081e-8057.26%hypothetical protein TcasGA2_TC011405 [Tribolium castaneum]
NCBI nr blastxgi|2700011082e-7757.26%hypothetical protein TcasGA2_TC011405 [Tribolium castaneum]
Group
Gene OntologyGO:00054881.2e-80binding
GO:00081523.2e-32metabolic process
GO:00164913.2e-32oxidoreductase activity
KEGG pathwayecb:1000614142e-72 
 K13370 (HSD17B8)maps-> Steroid hormone biosynthesis
InterPro domain[6-253] IPR0160401.2e-80NAD(P)-binding domain
[10-27] IPR0023471.1e-39Glucose/ribitol dehydrogenase
[10-181] IPR0021983.2e-32Short-chain dehydrogenase/reductase SDR
Orthology groupMCL10554 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213161-TA
ATGGTTAGCCAGCTCGTTTCAGGAAAATTAGCATTGGTTACTGGTGCGGGGTCTGGTATAGGACGAGCTGCTTGCCAGGTGCTATCAAGGGAAGGCGCAACAGTCATAGCAGCTGACAGAAATTATGAAGCTGCAATGGAAACTATTAAGAAACATGCCGCTTTAGCTTCAGGCACAACCGGTGACCATACCGCAGTAGAATTAGATGTATCGGATTCCAAATCAGTTCAGAAACTTTTACAATCTACATTAAATATTTATAAAACTCCACCGAACATTATTGTCAACTGTGCTGGTATTACAAAAGATAATTGGCTCTTAAAACTATCCGAACAGGACTATGACAGTGTTCTCGATGTTAACTTGAAGGGCACCTTCTTAGTGATGCAGACATTTGCCAAGGCATTGACGGAAGCATCACTTCCTGGATCAATTATCAATATTTCTAGTATAGTAGGAAAATATGGAAATATGGGGCAAACAAACTATTCAGCGAGTAAAGCTGGTGTTGTTGCTATGACACAGACAGCGGCCAAAGAGCTCGGAAAGTTTAATATTAGGGTCAATGCCATTTTGCCGGGGTTTATTAAAACTCCTATAATTAGCACAGTTCCTGACAAAGTAAAGGAAAATTTGTTAAAACTAGTTCCTCTAGGCAGACTCGGTGAACCTTCGGAGATTGCCGAAGTCATCACTTTTTTGAGTTCAGAAAAGAGCTCCTTTATAACTGGTGCTGCGATTGATGTCACGGGAGGCTTTTGA

Protein sequence:

>DPOGS213161-PA
MVSQLVSGKLALVTGAGSGIGRAACQVLSREGATVIAADRNYEAAMETIKKHAALASGTTGDHTAVELDVSDSKSVQKLLQSTLNIYKTPPNIIVNCAGITKDNWLLKLSEQDYDSVLDVNLKGTFLVMQTFAKALTEASLPGSIINISSIVGKYGNMGQTNYSASKAGVVAMTQTAAKELGKFNIRVNAILPGFIKTPIISTVPDKVKENLLKLVPLGRLGEPSEIAEVITFLSSEKSSFITGAAIDVTGGF-