Monarch geneset OGS2.0

DPOGS203035
TranscriptDPOGS203035-TA981 bp
ProteinDPOGS203035-PA326 aa
Genomic positionDPSCF300068 + 686818-688769
RNAseq coverage25x (Rank: top 77%)
Annotation
HeliconiusHMEL0140795e-11061.64% 
BombyxBGIBMGA002976-TA3e-10157.19% 
DrosophilaWwox-PA2e-5238.00% 
EBI UniRef50UniRef50_Q9NZC77e-4637.37%WW domain-containing oxidoreductase n=82 Tax=Coelomata RepID=WWOX_HUMAN
NCBI RefSeqXP_001962175.12e-5339.00%GF15334 [Drosophila ananassae]
NCBI nr blastpgi|3071882232e-5539.81%WW domain-containing oxidoreductase [Camponotus floridanus]
NCBI nr blastxgi|3320189466e-5541.31%WW domain-containing oxidoreductase [Acromyrmex echinatior]
Group
Gene OntologyGO:00054881.1e-40binding
GO:00081522.2e-15metabolic process
GO:00164912.2e-15oxidoreductase activity
KEGG pathwaydre:3938877e-40 
 K00100 (E1.1.1.-)maps-> Linoleic acid metabolism
    Bisphenol A degradation
    Fructose and mannose metabolism
    Butanoate metabolism
    Tetrachloroethene degradation
InterPro domain[27-282] IPR0160401.1e-40NAD(P)-binding domain
[31-164] IPR0021982.2e-15Short-chain dehydrogenase/reductase SDR
[32-49] IPR0023473.1e-06Glucose/ribitol dehydrogenase
Orthology groupMCL10674 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203035-TA
ATGCTAAAAGCACTCCAGAAGAGCTTCGAATTCAAAAGTATATTTGGACAAACTGCCGAAGAGGTCGTGAATAACGTAGATTTGTCAAATAAAACATGCCTCATTACGGGGGCCAGCAGCGGCATCGGTCTAGAGATTGCTCGATGTCTTAATAGTCGCGACTGTAACTTACTAATGGCTTCTCGAAATGTTTACAAAGCGAATCTTCTTGCCAACAAAACCTGTCTAAACAACCAGAGAATTCGGCACTACCAAATAAATCTTGCTTCCTTGGCTTCTGTGAGACAATGTGCCCAAGAAATCATTGAAAATGAAAGACAAATAGATATAGTGATTCTTAATGCCGCCACCTTTGGTATACCATGGACTGTTACTAAAGATGGCCTGGAAACAACCTTCCAAGTTAATTTCCTAAGTCAATACTATTTGTTGCTGTGCCTGGGTAAGATGCTGGCTCCTGACGCCAGGGTGGTATTTACCTCCTCCGAATCTCATAGAAACATAAAATGGCCAGAAAAAAATAGATTCAATCCGGTGTTCGAGAACCTTTCACTCCTCAAACACGAATACACGTCCATCAAGTCGTATAATATATCGAAGCTGTGTTGTTTATTACTCATGCACTATTTGAGCTACCAGTGGTCTAATAGTGAGAGGAGCTTCTTGTGTGCACACCCGGGTTCTTTCATCAAAACTGGTCTCTGTCGCAACTGGTGGCCTTACGAGGCACTGTACACAATTATGTTACCATTCTCAAAGTCTATTATGCAAGGTGCTAGTACCATACTTTATTGCGCAACTTCGCCAAATTTAAAAGGTGCTACAGGTATGTACTTCAGCAACTGTAACCACTGTAATGAAAGTGACCTCGCCAAGGATATATACTTCTCATTTAGGATTCACGACTTGATTCTCGACATACTGCGAGACCGTGTACAAGATTTAGATAAATTGACTAAGGAACTGCGAGTGAATAAATAA

Protein sequence:

>DPOGS203035-PA
MLKALQKSFEFKSIFGQTAEEVVNNVDLSNKTCLITGASSGIGLEIARCLNSRDCNLLMASRNVYKANLLANKTCLNNQRIRHYQINLASLASVRQCAQEIIENERQIDIVILNAATFGIPWTVTKDGLETTFQVNFLSQYYLLLCLGKMLAPDARVVFTSSESHRNIKWPEKNRFNPVFENLSLLKHEYTSIKSYNISKLCCLLLMHYLSYQWSNSERSFLCAHPGSFIKTGLCRNWWPYEALYTIMLPFSKSIMQGASTILYCATSPNLKGATGMYFSNCNHCNESDLAKDIYFSFRIHDLILDILRDRVQDLDKLTKELRVNK-