Monarch geneset OGS2.0

DPOGS205715
TranscriptDPOGS205715-TA1860 bp
ProteinDPOGS205715-PA619 aa
Genomic positionDPSCF300250 + 183369-187532
RNAseq coverage0x (Rank: top 97%)
Annotation
HeliconiusHMEL0072970.067.51% 
BombyxBGIBMGA009925-TA3e-17853.14% 
DrosophilaCG9509-PA5e-8633.39% 
EBI UniRef50UniRef50_E0VUB12e-8633.77%Alcohol oxidase, putative n=1 Tax=Pediculus humanus corporis RepID=E0VUB1_PEDHC
NCBI RefSeqXP_559558.53e-8834.86%AGAP003782-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3320230792e-8835.98%Glucose dehydrogenase [Acromyrmex echinatior]
NCBI nr blastxgi|3227964013e-8635.70%hypothetical protein SINV_00375 [Solenopsis invicta]
Group
Gene OntologyGO:00166142.8e-144oxidoreductase activity, acting on CH-OH group of donors
GO:00088122.8e-144choline dehydrogenase activity
GO:00506602.8e-144flavin adenine dinucleotide binding
GO:00551142.8e-144oxidation-reduction process
GO:00060662.8e-144alcohol metabolic process
KEGG pathwaydme:Dmel_CG95094e-84 
 K00108 (E1.1.99.1, betA, CHDH)maps-> Glycine, serine and threonine metabolism
InterPro domain[3-620] IPR0121322.8e-144Glucose-methanol-choline oxidoreductase
[50-345] IPR0001722.2e-53Glucose-methanol-choline oxidoreductase, N-terminal
[472-607] IPR0078671.4e-35Glucose-methanol-choline oxidoreductase, C-terminal
Orthology groupMCL19844 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205715-TA
ATGTTTACCCGAGTCGACAAGTATAGAGACGCTAGGACACTGAGGATCCAGCACTTGTATCGCTGGTTCACACACAGCGCTAAGGAACAAGATCTCAAACCGAGGCAGCTCCTGCGATGGGATCTGGTCTTTTATGACGGCGACGAGTTTGACTACGTGGTGATTGGCGCCGGCGCAGCTGGGAGTGCGGTGGCAGCGAGACTGGCGCTGGCTGGACATAGCGTGCTGTTGGTTGAAGCAGGCGGAGATCCCAACATCCTCACAAGAATACCTGGAGCAACTTTGGCTTTGACTGGTTCAAATCTGGATTGGTACTATGATACGATACCCAATAACAAGTCGTGTCTTTCTTCTAAAGGAGGGAAATGTCGTTTAAGTCGAGGTCGATGTCTAGGAGGATCGACTAGCCTTAACTACATGATGTACACTCGAGGAAATAAGCAGGATTACGACTTTAATGTTACCGGCTGGAATTGGGAAGACATTAAACCGTATTTTCTTAGATTTGAAGGACTACAGGAACCTTCTAGACTTCCAAAATCGTCTGGAGCGTATCATAATACTTCTGGTATAACGCCGATAGGATACTTTGGTGATTCCGGCAATCCATGGCACCAGAGGATTGTCGAGGGCCTGACTTCCGTGAATTTTCCATATAATCCAGACGTAAATTCCAAGTCTCAGATAGGTGTTTCTAAAATTCTGGGTTTTACTTCCGGCGGAGAACGAGTTAGCACTGCAACTGCTTATTTAGGTAAAAAAAATGTGAAGGAATCATTAAAAATTATTAAAAATACAAAGTGTACAGGAGTAATTATTGATACTGAAAATATAGCTAGAGGGGTAACTATAGCGAGAGGTTTTAATGATACTATAAATATATTTACAAAAAAAGAAGTAATTTTAAGTGCTGGAGCTTTTAACACTCCTCAATTACTAATGCTGTCAGGAATTGGACCAAAAGAACATTTAGAGGAATTTAACATTCCTGTCAAAGCAAATTTGCCCGTAGGTCACGGAATGTCTGACCATGTTTTGCCCATAATAAACGTAAGAGTCGATCATGATTCTATGCCATCATCAAATATTTTATCTATTGGATCCAAGCTCTGGCAGGGTCTCAGTTGGCTACTAATGCGTAGCGGACCATTAGCGTCCAATAGTATAACTGACCTGACTGCTTTTGCGAACACCGAATGCTACGACTTTAAACTTAGGCGATTACTGAATGATAGGCCTGAATGTGAATTGCCAAATTTACAATTAATTTATGCTTACATTGACAAGGGGTTACTTAGTATGGTTAAATCGTTATATGAAATTGCCGCTCCGCACTCTCCTGAAGTTATGAATCAAGTGGTGTCAGCCAACGAAGAAAGCTCTTTCATTGTGGTGTCACCGGTAGTGCTAAAGCCAAAGTCTCGGGGCTGGGTGAAGCTAGCTAGTTCCGATCCATTCGAACAACCGGCAATTATTCCCAACTACTTGAGTGACAAAAGAGATGTCGAAGAAATGGTGCGTGCAATAAAATTACTGGAGCAAGTGGTTGAGACGCCTGCATTTAAAAACTTTAATGCATCCATTTTGAAGCTTCATATTTCCGAATGTCCTGCCTTTGATGAAGAAGGTTACTGGGAATGTTATTCAAGACATATGACGCATTCAGTACAACACGCGGTCGGAACAGCCGCACTCGGGCAAGTGGTTGACGAAAGATTAAGAGTTAAGGGTGTTAAAAATCTTCGCATTGCCGACGCCTCGGTACTTCCACACTTGCCACGTGGCAATACGGCCGCTGCTATAATCGCTATTGGGGAACGTTTATCAGATTTCCTTTTACAAGATCGAGGATTAGAATGA

Protein sequence:

>DPOGS205715-PA
MFTRVDKYRDARTLRIQHLYRWFTHSAKEQDLKPRQLLRWDLVFYDGDEFDYVVIGAGAAGSAVAARLALAGHSVLLVEAGGDPNILTRIPGATLALTGSNLDWYYDTIPNNKSCLSSKGGKCRLSRGRCLGGSTSLNYMMYTRGNKQDYDFNVTGWNWEDIKPYFLRFEGLQEPSRLPKSSGAYHNTSGITPIGYFGDSGNPWHQRIVEGLTSVNFPYNPDVNSKSQIGVSKILGFTSGGERVSTATAYLGKKNVKESLKIIKNTKCTGVIIDTENIARGVTIARGFNDTINIFTKKEVILSAGAFNTPQLLMLSGIGPKEHLEEFNIPVKANLPVGHGMSDHVLPIINVRVDHDSMPSSNILSIGSKLWQGLSWLLMRSGPLASNSITDLTAFANTECYDFKLRRLLNDRPECELPNLQLIYAYIDKGLLSMVKSLYEIAAPHSPEVMNQVVSANEESSFIVVSPVVLKPKSRGWVKLASSDPFEQPAIIPNYLSDKRDVEEMVRAIKLLEQVVETPAFKNFNASILKLHISECPAFDEEGYWECYSRHMTHSVQHAVGTAALGQVVDERLRVKGVKNLRIADASVLPHLPRGNTAAAIIAIGERLSDFLLQDRGLE-