Monarch geneset OGS2.0

DPOGS205716
Transcript: DPOGS205716-TA (1860 bp)
Protein: DPOGS205716-PA (619 aa)
Genomic position: DPSCF300250 + 191203-195375
RNAseq coverage: 0x (Rank: top 97%)
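The lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that the transcript length covers the coding sequence including the stop codon, and that the genomic coordinates are 1-based and inclusive; the record states neither. The function names are hypothetical.

```python
def codon_count(transcript_bp: int) -> int:
    """Number of complete codons in a coding sequence of the given length."""
    if transcript_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return transcript_bp // 3


def genomic_span(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


codons = codon_count(1860)            # codons in the 1860 bp transcript
residues = codons - 1                 # drop the stop codon (assumed present)
span = genomic_span(191203, 195375)   # bases covered on scaffold DPSCF300250

print(codons, residues, span)
```

Under these assumptions the 1860 bp transcript gives 619 residues plus one stop codon, which agrees with the listed protein length. The genomic span is longer than the transcript, which would be consistent with introns.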
Annotation
Heliconius:        HMEL007297          0.0       67.51%
Bombyx:            BGIBMGA009925-TA    7e-178    53.14%
Drosophila:        CG9509-PA           3e-86     33.39%
EBI UniRef50:      UniRef50_E0VUB1     3e-86     33.77%    Alcohol oxidase, putative n=1 Tax=Pediculus humanus corporis RepID=E0VUB1_PEDHC
NCBI RefSeq:       XP_559558.5         3e-88     34.86%    AGAP003782-PA [Anopheles gambiae str. PEST]
NCBI nr blastp:    gi|332023079        2e-88     35.98%    Glucose dehydrogenase [Acromyrmex echinatior]
NCBI nr blastx:    gi|322796401        3e-86     35.70%    hypothetical protein SINV_00375 [Solenopsis invicta]
Group
Gene Ontology:
    GO:0016614    3.2e-144    oxidoreductase activity, acting on CH-OH group of donors
    GO:0008812    3.2e-144    choline dehydrogenase activity
    GO:0050660    3.2e-144    flavin adenine dinucleotide binding
    GO:0055114    3.2e-144    oxidation-reduction process
    GO:0006066    3.2e-144    alcohol metabolic process
KEGG pathway:
    dme:Dmel_CG9509    2e-84
    K00108 (E1.1.99.1, betA, CHDH) maps -> Glycine, serine and threonine metabolism
InterPro domain:
    [3-620]      IPR012132    3.2e-144    Glucose-methanol-choline oxidoreductase
    [50-345]     IPR000172    3.3e-53     Glucose-methanol-choline oxidoreductase, N-terminal
    [472-607]    IPR007867    1.4e-35     Glucose-methanol-choline oxidoreductase, C-terminal
Orthology group: MCL19844 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205716-TA
ATGTTTACCCGAGTCGACAAGTATAGAGACGCTAGGACACTGAGGATCCAGCACTTGTACCGCTGGTTCACACACAGCGCTAAGGAACAAGATCTCAAACCAAGGCAGCTCCTGCGATGGGATCTGGTCTTTTATGACGGCGACGAGTTTGACTACGTGGTGATTGGCGCCGGCGCAGCTGGGAGTGCGGTGGCAGCGAGACTGGCGCTGGCTGGACATAGCGTGCTGTTGGTTGAAGCAGGCGGAGATCCCAACATCCTCACAAGAATACCTGGAGCAACTTTGGCTTTGACTGGTTCAAATCTGGATTGGTACTATGATACGATACCGAATAACAAGTCGTGTCTTTCTTCTAAAGGAGGGAAATGTCGTTTAAGTCGAGGTCGATGTCTAGGAGGATCGACTAGCCTTAACTACATGATGTATACTCGAGGAAATAAGCAGGATTACGACTTTAATGTTACCGGCTGGAATTGGGAAGACATTAAACCGTATTTTCTTAGATTTGAAGGACTACAGGAACCTTCTAGACTTCCAAAATCGTCTGGAGCGTATCATAATACTTCTGGTATAACGCCGATAGGATACTTTGGTGATTCCGGCAATCCATGGCACCAGAGGATTGTCGAGGGCCTGACTTCCGTGAATTTTCCATATAATCCAGACGTAAATTCCAAGTCTCAGATAGGTGTTTCTAAAATTCTGGGTTTTACTTCCGGCGGAGAACGAGTTAGCACTGCAACTGCTTATTTAGGTACAAAAAATGTGAAGGAATCCTTAAAAATTATTAAAAATACAAAGTGTACAGGAGTAATTATTGATACTGAAAATATAGCTAGAGGGGTAACTATAGCGAGAGGTTTTAATGATACTATAAATATATTTACAAAAAAAGAAGTAATTTTAAGTGCTGGAGCTTTTAACACTCCTCAATTACTAATGCTGTCAGGAATTGGACCAAAAGAACATTTAGAGGAATTTAACATTCCTGTCAAAGCAAATTTGCCCGTAGGTCACGGAATGTCTGACCATGTTTTGCCCATAATAAACGTAAGAGTCGATCATGATTCTATGCCATCATCAAATATTTTATCTATTGGATCCAAGCTCTGGCAGGGTCTCAGTTGGCTACTAATGCGTAGCGGACCATTAGCGTCCAATAGTATAACTGACCTGACTGCTTTTGCGAACACCGAATGCTACGACTTTAAACTTAGGCGATTACTGAATGATAGGCCTGAATGTGAATTGCCAAATTTACAATTAATTTATGCTTACATTGACAAGGGGTTACTTAGTATGGTTAAATCGTTATATGAAATTGCCGCTCCGCACTCTCCTGAAGTTATGAATCAAGTGGTGTCAGCCAACGAAGAAAGCTCTTTCATTGTGGTGTCACCGGTAGTGCTAAAGCCAAAGTCTCGGGGCTGGGTGAAGCTAGCTAGTTCCGATCCATTCGAACAACCGGCAATTATTCCCAACTACTTGAGTGACAAAAGAGATGTCGAAGAAATGGTGCGTGCAATAAAATTACTGGAGCAAGTGGTTGAGACGCCTGCATTTAAAAACTTTAATGCATCCATTTTGAAGCTTCATATTTCCGAATGTCCTGCCTTTGATGAAGAAGGTTACTGGGAATGTTATTCAAGACATATGACGCATTCAGTACAACACGCGGTCGGAACAGCCGCACTCGGGCAAGTGGTTGACGAAAGATTAAGAGTTAAGGGTGTTAAAAATCTTCGCATTGCCGACGCCTCGGTACTTCCACACTTGCCACGTGGCAATACGGCCGCTGCTATAATCGCTATTGGGGAACGTTTATCAGATTTCCTTTTACAAGATCGAGGATTAGAATGA

Protein sequence:

>DPOGS205716-PA
MFTRVDKYRDARTLRIQHLYRWFTHSAKEQDLKPRQLLRWDLVFYDGDEFDYVVIGAGAAGSAVAARLALAGHSVLLVEAGGDPNILTRIPGATLALTGSNLDWYYDTIPNNKSCLSSKGGKCRLSRGRCLGGSTSLNYMMYTRGNKQDYDFNVTGWNWEDIKPYFLRFEGLQEPSRLPKSSGAYHNTSGITPIGYFGDSGNPWHQRIVEGLTSVNFPYNPDVNSKSQIGVSKILGFTSGGERVSTATAYLGTKNVKESLKIIKNTKCTGVIIDTENIARGVTIARGFNDTINIFTKKEVILSAGAFNTPQLLMLSGIGPKEHLEEFNIPVKANLPVGHGMSDHVLPIINVRVDHDSMPSSNILSIGSKLWQGLSWLLMRSGPLASNSITDLTAFANTECYDFKLRRLLNDRPECELPNLQLIYAYIDKGLLSMVKSLYEIAAPHSPEVMNQVVSANEESSFIVVSPVVLKPKSRGWVKLASSDPFEQPAIIPNYLSDKRDVEEMVRAIKLLEQVVETPAFKNFNASILKLHISECPAFDEEGYWECYSRHMTHSVQHAVGTAALGQVVDERLRVKGVKNLRIADASVLPHLPRGNTAAAIIAIGERLSDFLLQDRGLE-
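As a quick consistency check between the two sequences above, the following sketch translates the first and last codons of the transcript and compares them with the ends of the protein. It assumes the standard genetic code and reading frame 1, and writes stop codons as "-" to match the record. `translate` is a hypothetical helper written for this illustration, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
# Stop codons are written as "-" to match the notation used in this record.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY--CC-W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(nt: str) -> str:
    """Translate a nucleotide string in frame 1; unknown codons become 'X'."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3  # ignore any trailing partial codon
    return "".join(
        CODON_TABLE.get(nt[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 and last 12 nucleotides of DPOGS205716-TA, copied from the record.
transcript_start = "ATGTTTACCCGAGTCGACAAGTATAGAGAC"
transcript_end = "GGATTAGAATGA"

print(translate(transcript_start))
print(translate(transcript_end))
```

Running the same function over the full transcript and comparing the result with the full protein string would extend the check to the whole record.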