Monarch geneset OGS2.0

DPOGS200495
TranscriptDPOGS200495-TA1266 bp
ProteinDPOGS200495-PA421 aa
Genomic positionDPSCF300158 + 116748-119484
RNAseq coverage412x (Rank: top 29%)
Annotation
HeliconiusHMEL0140583e-16971.57% 
BombyxBGIBMGA002976-TA3e-5938.76% 
DrosophilaWwox-PA1e-12353.71% 
EBI UniRef50UniRef50_E0VY391e-10448.75%WW domain-containing oxidoreductase, putative n=1 Tax=Pediculus humanus corporis RepID=E0VY39_PEDHC
NCBI RefSeqXP_001846460.12e-12554.68%WW domain-containing oxidoreductase [Culex quinquefasciatus]
NCBI nr blastpgi|1700372264e-12454.68%WW domain-containing oxidoreductase [Culex quinquefasciatus]
NCBI nr blastxgi|910883071e-12356.57%PREDICTED: similar to WW domain-containing oxidoreductase [Tribolium castaneum]
Group
Gene OntologyGO:00054884.2e-53binding
GO:00081522.2e-14metabolic process
GO:00164912.2e-14oxidoreductase activity
GO:00055157.7e-14protein binding
KEGG pathwaydre:3938872e-89 
 K00100 (E1.1.1.-)maps-> Linoleic acid metabolism
    Bisphenol A degradation
    Fructose and mannose metabolism
    Butanoate metabolism
    Tetrachloroethene degradation
InterPro domain[114-396] IPR0160404.2e-53NAD(P)-binding domain
[120-258] IPR0021982.2e-14Short-chain dehydrogenase/reductase SDR
[7-43] IPR0012027.7e-14WW/Rsp5/WWP
[120-137] IPR0023472.7e-10Glucose/ribitol dehydrogenase
Orthology groupMCL10674 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200495-TA
ATGCACAATTTAGATTCAGACAGTGAGGATGAACTACCTCCAGGTTGGGAGGAAAAGTCTACTGAGGATGGAAATGTATACTTTGTCAATTCGTGCAATAAGAAAACACAATGGACACATCCTCGAACCGGTTGTAAAAAAGTAATCCCAAAAGATTTGCCATTCGGTTGGAGTAAGACGGTTGATGAAAGTGGTAAGACACTATATGTTAATCAGGAAACCGGTAACAAGACATATGTGGACCCACGACTGGCCTTCTCGAAGGAAGAAAAAAAACATGTAAATGACTTTAGGCAGCGGTTTGATGGTTCTACCACAGCATTTCAGGTTCTACATGGAGTAGATTTATCAGGCAAATATGCTGTTATAACAGGATGTAATGCCGGTATAGGCTATGAAACTGCAAAGTCCCTAGCTAGACACGGCTGTAATATTTTATTTGCAAATCGTAATTTAGAAGCCACTCAAAAAGCCATTGAAGATATTGTAAAAGAAACAAATGTTTCCGAAGAAAATTTGAAGTCAATTCAATTGGATTTAGGCTCATTGAAAAGCGTGAAGAAGTGTGCCTTAGCTGTTAAAACCGTTTTTTCTGATCACATTGATATGCTCATATTAAATGCTGGCGTCTTTGGTTTGCCATATGAAGAGACAGAAGACCAGCTAGATCGTACATTCCAAGTGAATCATCTCTCACACATGTACTTTGCTATGTTGTTGGAACCGTTGCTGAGAAAAGGCTCAAGGGTTATCTTTGTATCATCCGAGTCACATAGAACAGCCAGCCTTAAAAATGTATTTGTAAAACAAAATTTAGCACATCCCAAAGAATCCTACAGCGCTATGACAGCGTACGGAAATTCAAAACTTTATAATATTATAACTGCCAAGATGTTAAGTGAAGAGTGGAAACAAAAGGGCATTGCTGTGAATTCTTTACATCCCGGCAACATGGTCTCAACGAATCTGCCAAAGAGTTGGTGGCTGTACCAAGTCTTATTTTTTATTGTTCGACCGTTTACCAAATCATTGCAACAAGCAGCGGCAACAACAGTATATGTAGCTACAGCCTCCGAATTGGAAGGAGTCACAGGACTCTATTTCAACAACTGCTTCTATTGCGAAGAGTCGACTCTAGCCAGAGATAGAGATATATCTCACGAGGTCTTCTCTATATCCCTGAAGATGATCCAAGAGAGAATGGGTACTGAATATATAGAAGGGTTTGTACAGAAATATTGTACAGTCAAGAAGAATAGTGATTAA

Protein sequence:

>DPOGS200495-PA
MHNLDSDSEDELPPGWEEKSTEDGNVYFVNSCNKKTQWTHPRTGCKKVIPKDLPFGWSKTVDESGKTLYVNQETGNKTYVDPRLAFSKEEKKHVNDFRQRFDGSTTAFQVLHGVDLSGKYAVITGCNAGIGYETAKSLARHGCNILFANRNLEATQKAIEDIVKETNVSEENLKSIQLDLGSLKSVKKCALAVKTVFSDHIDMLILNAGVFGLPYEETEDQLDRTFQVNHLSHMYFAMLLEPLLRKGSRVIFVSSESHRTASLKNVFVKQNLAHPKESYSAMTAYGNSKLYNIITAKMLSEEWKQKGIAVNSLHPGNMVSTNLPKSWWLYQVLFFIVRPFTKSLQQAAATTVYVATASELEGVTGLYFNNCFYCEESTLARDRDISHEVFSISLKMIQERMGTEYIEGFVQKYCTVKKNSD-