Monarch geneset OGS2.0

DPOGS210515
TranscriptDPOGS210515-TA1818 bp
ProteinDPOGS210515-PA605 aa
Genomic positionDPSCF300186 + 128156-130452
RNAseq coverage3x (Rank: top 90%)
Annotation
HeliconiusHMEL0161090.080.83% 
BombyxBGIBMGA012618-TA0.075.34% 
DrosophilaCG9517-PA1e-9034.48% 
EBI UniRef50UniRef50_E2BJK22e-11137.82%Glucose dehydrogenase [acceptor] n=9 Tax=Endopterygota RepID=E2BJK2_HARSA
NCBI RefSeqXP_001602062.13e-11640.95%PREDICTED: similar to RE28171p [Nasonia vitripennis]
NCBI nr blastpgi|3838604645e-11640.10%PREDICTED: glucose dehydrogenase [acceptor]-like [Megachile rotundata]
NCBI nr blastxgi|3838604643e-11139.93%PREDICTED: glucose dehydrogenase [acceptor]-like [Megachile rotundata]
Group
Gene OntologyGO:00166142.8e-93oxidoreductase activity, acting on CH-OH group of donors
GO:00088122.8e-93choline dehydrogenase activity
GO:00506602.8e-93flavin adenine dinucleotide binding
GO:00551142.8e-93oxidation-reduction process
GO:00060662.8e-93alcohol metabolic process
KEGG pathwaydpo:Dpse_GA218492e-87 
 K00108 (E1.1.99.1, betA, CHDH)maps-> Glycine, serine and threonine metabolism
InterPro domain[2-601] IPR0121322.8e-93Glucose-methanol-choline oxidoreductase
[44-343] IPR0001722.2e-60Glucose-methanol-choline oxidoreductase, N-terminal
[447-588] IPR0078676.5e-30Glucose-methanol-choline oxidoreductase, C-terminal
Orthology groupMCL25990 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210515-TA
ATGGATTTACATTATATAAGCATCATATACCTCACATACATAGTGTACGCGACATCCGATGCTAGAGCTGACCTGTTCTCGAAGGCTACCAAAACTAAAACAAACGAAAAAGACCTGAGGAACAGTGTCTTCGACTTCATCGTAGTGGGGGGCGGCACAGCTGGCTGCGTTCTGGCTAACCGACTGTCAGCCAACCCTGAGTGGAAGGTTTTAGTTCTGGAGGCTGGCGACGAAGAGAACATAGATTATGACATTCCGGCTGTTCCAACAAATGAATACCGACCGATCCTCTGGAATTTTAGGACTGAGAGAAATGGCTTCTCTTGTTTGTCGAGACCTGGAGGGAGCTGTGAGGTGAAAACGGGGAAGGTTTTAGGAGGGTCCAGCGTAACCAATGATATGAAGTACACCAGAGGTTCAAAGAAAGATTATGACGCTTGGCACATCACTGGAGATATGGGATCTCTGAACTGGAAATTTGAGAATTTAATAGAACATTTCAAGGCTTCGGAAGATAATGGCGACTATGATGTACTGATGAATTCTTACTATCATTCCCGCGGAGGCGAAATGCATGTACAAAGATTCAAACACATCGACAAACACATGGAGCTGTTCCTTGGCGCATTCGCTGAGATGGGATTCAGGTCTGTGGACATAAATACTAACGCTCCAGAAGCGGCTCTCAACAACCAGTTCGTTGTAGCTAACAATACACGACTCAGTACAAACAGCGCTTTCCTGAAACCGGTCAGACAAAGAAAGAACTTAGTTGTCAAAACCGGAGCAACGGTTACGAAATTAATTATTGATGCCCAAGAGAATAAAGTAGACGGAGTATTTTATGAATTGGAAAAGGGAAAGGAACAACTGGCTTACGCAAAAAAGGAAACGATACTGACTACCGGTGCCATCAACAATGTAAAAATACTCCATCTGTCAGGAATAGGACCGGCAGCTGAATTAGAGAAGCAGAATATTTCAGTTATAATCGATCTTCCTGTTGGTTCCAATTATCAAGATCAAGTGACCATAGGTGGCCTCGCATTTTCTCTTAGTGATGTAACTTCAGTTTCTGAACAACAAATTGTAGAAGATTTTAAAACGTGGTTTCAAAACAGAGGGGGCCCTCTGGCTTCGCGGGGAATCAATCAAGTATCAGCATTTATACAGTTGTTTGAAAATAAAAATGGTCCGGATATAGAGTTGGCATTACACGGAAACTATATAAGAAGTGACAAGTTCATGTTAACAAATGTTAGTGTGCATACAGCCGAGGAGGTCAACTTACCCATAGCTTATTACAATTTAGTCAATATAAACCCAGTATTACTAAAACCAAAGAGCAAAGGTAAAGTAACGTTGAATAAAAGAAACCCAAAATACGGTAGACCAGTGATTCAAGCTAATTTGTTGAAAGAGCAGGAAGATTTAGACGCCATTATCGACAGTGTAGACATAGCACTGCAGCTACTAAATACAAAGGAACTGAAAAAGGCTGAAATAAACATGGCTCCATTAGATATATTTCCCTGTGATCGATTAAACAACAGAGATCAATGGAATTGTATAGCAAGGCACTATACGAAGGCTATGAGTAATCCTATCGGCACGTGTCGAATGGGTCAAAATAGTACAGATTCTGTTGTAAATTATGAATTCAAAGTCCATAACGTTGAGTCATTAAGAGTGATCGATGCGTCGGTAATGCCATCACATGTACGAGGCAATATATTCGCTCCCACCGTTATGGTCGCAGAGAAAGGAACAAAATTAATTATCAAAGACTGGAAGGAAAACAAAGGAAATTTCTTATAA

Protein sequence:

>DPOGS210515-PA
MDLHYISIIYLTYIVYATSDARADLFSKATKTKTNEKDLRNSVFDFIVVGGGTAGCVLANRLSANPEWKVLVLEAGDEENIDYDIPAVPTNEYRPILWNFRTERNGFSCLSRPGGSCEVKTGKVLGGSSVTNDMKYTRGSKKDYDAWHITGDMGSLNWKFENLIEHFKASEDNGDYDVLMNSYYHSRGGEMHVQRFKHIDKHMELFLGAFAEMGFRSVDINTNAPEAALNNQFVVANNTRLSTNSAFLKPVRQRKNLVVKTGATVTKLIIDAQENKVDGVFYELEKGKEQLAYAKKETILTTGAINNVKILHLSGIGPAAELEKQNISVIIDLPVGSNYQDQVTIGGLAFSLSDVTSVSEQQIVEDFKTWFQNRGGPLASRGINQVSAFIQLFENKNGPDIELALHGNYIRSDKFMLTNVSVHTAEEVNLPIAYYNLVNINPVLLKPKSKGKVTLNKRNPKYGRPVIQANLLKEQEDLDAIIDSVDIALQLLNTKELKKAEINMAPLDIFPCDRLNNRDQWNCIARHYTKAMSNPIGTCRMGQNSTDSVVNYEFKVHNVESLRVIDASVMPSHVRGNIFAPTVMVAEKGTKLIIKDWKENKGNFL-