Monarch geneset OGS2.0

DPOGS207057
Transcript: DPOGS207057-TA (1851 bp)
Protein: DPOGS207057-PA (616 aa)
Genomic position: DPSCF300001 + 2193412-2195531
RNAseq coverage: 287x (Rank: top 38%)
Annotation
Heliconius: HMEL010189  0.0  94.16%
Bombyx: BGIBMGA013002-TA  0.0  91.23%
Drosophila: CG9517-PA  0.0  68.39%
EBI UniRef50: UniRef50_Q9VY07  0.0  68.39%  CG9517, isoform A n=76 Tax=Pancrustacea RepID=Q9VY07_DROME
NCBI RefSeq: XP_001602133.1  0.0  70.61%  PREDICTED: similar to ENSANGP00000024305 [Nasonia vitripennis]
NCBI nr blastp: gi|383860468  0.0  69.66%  PREDICTED: glucose dehydrogenase [acceptor]-like [Megachile rotundata]
NCBI nr blastx: gi|383860468  0.0  69.66%  PREDICTED: glucose dehydrogenase [acceptor]-like [Megachile rotundata]
Group:
Gene Ontology:
  GO:0016614  1.4e-175  oxidoreductase activity, acting on CH-OH group of donors
  GO:0008812  1.4e-175  choline dehydrogenase activity
  GO:0050660  1.4e-175  flavin adenine dinucleotide binding
  GO:0055114  1.4e-175  oxidation-reduction process
  GO:0006066  1.4e-175  alcohol metabolic process
KEGG pathway: dme:Dmel_CG9518  0.0
  K00108 (E1.1.99.1, betA, CHDH)  maps-> Glycine, serine and threonine metabolism
InterPro domain:
  [10-617]  IPR012132  1.4e-175  Glucose-methanol-choline oxidoreductase
  [52-349]  IPR000172  8.7e-83  Glucose-methanol-choline oxidoreductase, N-terminal
  [461-605]  IPR007867  2e-41  Glucose-methanol-choline oxidoreductase, C-terminal
Orthology group: MCL10024  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207057-TA
ATGGAGGCGGCTGGCGCATTGGCGAGTCTAGCTCCATCGCCGATTACCGTGCTGGGACTGATACCACTCTTAGCACTTGGGATCACCTACTTCAGATATCAGCAATATGATCCGGAATCTTATATCACAGACACAAACATTATATTACCAATCTATGACTTCGTGGTGGTCGGGGGAGGCTCTGCAGGTGCGGTTATGGCATCAAGACTCTCTGAGATTGGTAATTGGACTGTCCTGCTCCTTGAAGCCGGTCAAGATGAAAACGAGATTTCTGATATCCCTGCATTAGCCGGATACACCCAATTGTCGGATATGGATTGGAAGTTCCAAACAACGCCATCTAAAAACCGTTCTTATTGCCTCGCTATGAACGGTGACCGATGCAATTGGCCTAGAGGAAAAGTCCTTGGCGGAAGCAGTGTCCTGAACGCAATGGTTTACGTCAGAGGCAATCGCAACGACTACGATTTGTGGGAGGCTCTAGGCAATCCGGGCTGGTCGTACGATCAAGTGTTACCCTACTTTTTGAAATCTGAAGATAATCGAAATCCTTATTTGGCCTCAACACCGTATCATTCAGCAGGTGGATATTTGACGGTTCAAGAAGCGCCGTGGCGGACACCGTTATCCATTACATTTTTAAAAGGCGGAATGGAACTAGGTTATGATTTTCGCGATATAAACGGCGAGAAACAAACGGGTTTTATGTTGACCCAAGCAACTATGCGTCGTGGGAGCAGATGTAGTACGGCCAAAGCTTTTCTTAGACCAATACGTAATAGAGATAATTTGCACATAGCTCTGGGAGCGCAAGTCACTCGTATATTAATAAACTCGGTCAAGAAACAAGCCTATGGTGTAGAATTTTATCGTAACGGCCAAAGACACAAAGTCAGAATAAAACGAGAAGTAATCATGTCCGCAGGAGCATTAGCAACGCCCCAAATAATGATGTTGAGTGGAATTGGACCCGCAGATCATCTCAGAGAGCACGGTATACCACTTGTTGCAAATCTTAAAGTCGGTCACAACTTGCAAGATCACGTGGGCCTAGGTGGTCTTACATTTGTCGTTAACAAACCGGTCACATTTAAAAAGGACCGGTTCCAATCATTCTCAGTTGCAATGAACTACATTTTATATGAGAATGGACCGATGACGACACAAGGCGTTGAAGGTTTGGCATTTGTCAACACTAAATACGCTCCCACTTCTGGTAACTGGCCCGATATTCAATTTCACTTTGCACCTAGTTCAGTAAATTCTGATGGAGGGGAGCAGATACGAAAAATTTTGAATTTACGTGACAGAGTTTACAATACCGTATACAAACCTATGGAAAACGCTGAAACTTGGACCATACTACCTTTGTTATTGCGACCCAAAAGTTCCGGCTGGATAAAATTAAAAAGTCGAAACCCGTTCCAAGCGCCATCGATTGAGCCCAATTACTTCGCATACAAAGAGGACATTAAAGTGCTAACGGAAGGTATAAAGATCGCTTTCGCCTTATCAAATACCACTGCGTTCCAGAGATACGGGTCGAGACCTCTTAACATTCCATTGCCAGGTTGCCAGCAGCATGTACTATTCAGTGATGAATATTGGGAATGCAGCCTTAAACACTTCACGTTCACAATTTACCATCCAACTGGCACATGTAAGATGGGTCCCAATCATGACCAGGATGCTGTTGTCGATCCAAGATTACGAGTTCACGGGGTTGCCAACCTTCGAGTTGTGGATGCAAGCATCATGCCCACGATCATCAGCGGCAACCCTAATGCTCCAGTAATTATGATAGCCGAAAAAGCCGCCGACATGATCAAAGAAGACTGGCTCGTATTATGA

Protein sequence:

>DPOGS207057-PA
MEAAGALASLAPSPITVLGLIPLLALGITYFRYQQYDPESYITDTNIILPIYDFVVVGGGSAGAVMASRLSEIGNWTVLLLEAGQDENEISDIPALAGYTQLSDMDWKFQTTPSKNRSYCLAMNGDRCNWPRGKVLGGSSVLNAMVYVRGNRNDYDLWEALGNPGWSYDQVLPYFLKSEDNRNPYLASTPYHSAGGYLTVQEAPWRTPLSITFLKGGMELGYDFRDINGEKQTGFMLTQATMRRGSRCSTAKAFLRPIRNRDNLHIALGAQVTRILINSVKKQAYGVEFYRNGQRHKVRIKREVIMSAGALATPQIMMLSGIGPADHLREHGIPLVANLKVGHNLQDHVGLGGLTFVVNKPVTFKKDRFQSFSVAMNYILYENGPMTTQGVEGLAFVNTKYAPTSGNWPDIQFHFAPSSVNSDGGEQIRKILNLRDRVYNTVYKPMENAETWTILPLLLRPKSSGWIKLKSRNPFQAPSIEPNYFAYKEDIKVLTEGIKIAFALSNTTAFQRYGSRPLNIPLPGCQQHVLFSDEYWECSLKHFTFTIYHPTGTCKMGPNHDQDAVVDPRLRVHGVANLRVVDASIMPTIISGNPNAPVIMIAEKAADMIKEDWLVL-
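
The transcript and protein lengths listed for this gene are consistent with a complete coding sequence: 1851 bp is 617 codons, that is 616 amino acids plus one stop codon. The sketch below is one way to check that relationship. It assumes the standard genetic code and assumes that the trailing "-" in the protein record marks the stop codon; the function name `translate` and the `stop_symbol` parameter are illustrative and not part of the database. Only the first and last few codons of DPOGS207057-TA are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as `stop_symbol`, which is assumed here to match
    the trailing "-" convention of the protein record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First five codons and last four codons of DPOGS207057-TA.
head = translate("ATGGAGGCGGCTGGC")
tail = translate("CTCGTATTATGA")

# Length bookkeeping: 616 residues plus one stop codon, three bases each.
expected_bp = 3 * (616 + 1)
```

Passing the full nucleotide record to `translate` and comparing the result with the DPOGS207057-PA record would extend this check to the whole sequence.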