Monarch geneset OGS2.0

DPOGS212868
TranscriptDPOGS212868-TA939 bp
ProteinDPOGS212868-PA312 aa
Genomic positionDPSCF300086 + 454433-457316
RNAseq coverage349x (Rank: top 33%)
Annotation
HeliconiusHMEL0152718e-4636.05% 
BombyxBGIBMGA000813-TA2e-11464.80% 
DrosophilaCG30491-PA1e-7445.36% 
EBI UniRef50UniRef50_E0VUN21e-7044.26%Restnol dehydrogenase, putative n=1 Tax=Pediculus humanus corporis RepID=E0VUN2_PEDHC
NCBI RefSeqXP_001655347.12e-8049.03%short-chain dehydrogenase [Aedes aegypti]
NCBI nr blastpgi|1571292803e-7949.03%short-chain dehydrogenase [Aedes aegypti]
NCBI nr blastxgi|1571292802e-7547.80%short-chain dehydrogenase [Aedes aegypti]
Group
Gene OntologyGO:00054886.9e-48binding
GO:00081529.5e-16metabolic process
GO:00164919.5e-16oxidoreductase activity
KEGG pathwaydme:Dmel_CG20702e-63 
 K00100 (E1.1.1.-)maps-> Linoleic acid metabolism
    Bisphenol A degradation
    Fructose and mannose metabolism
    Butanoate metabolism
    Tetrachloroethene degradation
InterPro domain[39-300] IPR0160406.9e-48NAD(P)-binding domain
[42-146] IPR0021989.5e-16Short-chain dehydrogenase/reductase SDR
[43-60] IPR0023474.2e-13Glucose/ribitol dehydrogenase
Orthology groupMCL14315 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212868-TA
ATGTGGGTGCCCAACTTGCCGGTGACGGTGCTCACCGGGCTCGCAGCGGGTGCGGGCGTCATCTGTATATTTAAGGACATCTACGGTGGCCCACCTTTCGATAAAAAAGTACTAGCTGATGGCAAGACGGTGATATTGACGGGCGCCACCAGCGGCATCGGCAGTAAAGCTGCTTGGGACTTTGCAAAACGAGGGGCCAAAGTGTTCATGGCGTGTCGTGACATGAAGAAGTGCGAGGAGGTACGACGGGAGATAGTGCTGGACACTGGAAACAAATTCGTATACTGTCGGCCGTGTGACCTCGCCAGCACTGACTCCATACGAGCCTTCGTAGAAAGGTTCAAGAAAGAAGAACCATACGTAGACATCCTGGTGAACAATGCGGGGGTCATGGAGGCGCCGGCGAGGGTCACACTGGACGGGTTCGAGACACATTTAGGACAATCGGCGCCGAGCCGCGTCATATTGGTGACCTGCAGCGCACACAGTAAGGGTCAGATTCACAAAGAGGATCTCAACATGACCGCCAAATACGATCCCGTGGCCGCCTACAACCAGAGCAAACTGGCTAACGTGCTGTTCGCGAGAGAACTCGGGAGGCGGATGCTTAACACCGGCGTGTCTGTGATAGCCGTGGACCCGGGGTTCTCGGATACGGACCTCACTCGTAACATGGCCATGATGAAGAGCGTCACGAGGTTCCTCGTGTACCCGCTGTTCTGGCCCGTCATGAAGAGAGCCATGACCGGCGCTCAGGTCATCCTGCACGCGGCCCTGGACCCAGCCCTGGACGGCTCGGCGGGGGACTACTATGTGGACATGAAAAAGACCAACCCATCAGAATTAGCTCAGGATTACGAGCTGGCGCTGTGGATGTGGAAGGTCAGTCAGAAGTGGACCAAAGTCGCCGAGCACGCGTCGGCGCTGGCGGCCACTTAG

Protein sequence:

>DPOGS212868-PA
MWVPNLPVTVLTGLAAGAGVICIFKDIYGGPPFDKKVLADGKTVILTGATSGIGSKAAWDFAKRGAKVFMACRDMKKCEEVRREIVLDTGNKFVYCRPCDLASTDSIRAFVERFKKEEPYVDILVNNAGVMEAPARVTLDGFETHLGQSAPSRVILVTCSAHSKGQIHKEDLNMTAKYDPVAAYNQSKLANVLFARELGRRMLNTGVSVIAVDPGFSDTDLTRNMAMMKSVTRFLVYPLFWPVMKRAMTGAQVILHAALDPALDGSAGDYYVDMKKTNPSELAQDYELALWMWKVSQKWTKVAEHASALAAT-