Monarch geneset OGS2.0

DPOGS213756
TranscriptDPOGS213756-TA1851 bp
ProteinDPOGS213756-PA616 aa
Genomic positionDPSCF300212 - 401987-404964
RNAseq coverage77x (Rank: top 65%)
Annotation
HeliconiusHMEL0097350.062.52% 
BombyxBGIBMGA009242-TA0.057.14% 
DrosophilaCG9503-PA3e-9034.87% 
EBI UniRef50UniRef50_Q7QFX93e-9436.29%AGAP003785-PA n=2 Tax=Culicidae RepID=Q7QFX9_ANOGA
NCBI RefSeqXP_394224.17e-9337.80%PREDICTED: similar to CG9503-PA [Apis mellifera]
NCBI nr blastpgi|3407273774e-10139.17%PREDICTED: glucose dehydrogenase [acceptor]-like [Bombus terrestris]
NCBI nr blastxgi|3407273778e-9938.35%PREDICTED: glucose dehydrogenase [acceptor]-like [Bombus terrestris]
Group
Gene OntologyGO:00166144.1e-150oxidoreductase activity, acting on CH-OH group of donors
GO:00088124.1e-150choline dehydrogenase activity
GO:00506604.1e-150flavin adenine dinucleotide binding
GO:00551144.1e-150oxidation-reduction process
GO:00060664.1e-150alcohol metabolic process
KEGG pathwaydme:Dmel_CG95181e-83 
 K00108 (E1.1.99.1, betA, CHDH)maps-> Glycine, serine and threonine metabolism
InterPro domain[7-615] IPR0121324.1e-150Glucose-methanol-choline oxidoreductase
[54-356] IPR0001721.2e-60Glucose-methanol-choline oxidoreductase, N-terminal
[463-603] IPR0078672.4e-35Glucose-methanol-choline oxidoreductase, C-terminal
Orthology groupMCL25592 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213756-TA
ATGGAAATCGTTTACACATTCGCGAACACGTTCCTTAGTTTTATACAAAGTGTTAGAGATTTGCCGTGGCTGAGCTGGCTGTTGAGATATTTATCCATATACCAAGCTCTGTCTCCCGTGGAATGGCCGGCGTCTTATGATTTGAAAGATGGTGACACCTTTGATTTCATCGTGGTGGGCGCTGGGTCGGCCGGAGCGATCGTGGCTTCCCGTCTCAGTGAAATTTACAATTGGAAGGTTCTTCTGCTGGAAGCAGGTGGTAATCCACCGCCTGCGAGCGTGTTACCAAGTACTTTCGCGATTCTCTCTCACACGGAGTACGACTGGAACTATAAAGCCGATCTGGACAATGGTACCGGTCAGAGTCACGTTGCAGGAAGTATCTACATGTCTAGGGGGAAAATGCTAGGAGGATGCTCATCAAACAACTACGAAATATACGCTCGAGGAGCGCCCCAAGACTTTGATGACTGGAGTAAAGTAGCACCAGGCTGGGATTGGAACTCAGTTCTATATTACTACAAAAAATTAGAAAATATGACAGATCATACCGTTTTGGAAGATCCTAACTCCTCATACCTTTATTCAACCCACGGACCCGTGGCTATATCGAGGCCTAAGCAAAATCAATACTTTGAAAAAGTTGATGAAACGGTCTTAGCTTCATATGAAGAAATGGGGTTGAAAAGACTTTTATCTACAAACGGTCCAGAAATCCTTGGCGTTTCTCGCCCACACGTTACATTTGCTAATGGTAGACGATCCAGTACAGCAGAGGCATATTTGCGACCACTTAGAGATAGACGAAATCTTTTGGTCACGAAATACGCTAGAGTGATAAAAATTTTAATAAAATCAAACAGAAGAAAGGCTTATGGCGTTCAAGTTCAATTAAAGACTGGGCAATTTATTAATGTTTTTGCCAAACTGGAGGTTATTGTATCAGCTGGAACTATTGATACTCCCAAATTACTAATGCTTTCGGGAATAGGCCCTAAAGAAATATTGCAGAAGCATAACATAAAAATGGTCGCAGATCTTCCGGTGGGGAAGAACTTGCAGGACCATAATTTAACCCCGCTCATATTTACGGGCAAAAAGGGTTTTCACACAGCTATTCAGAACGTACTTATCACAGCTGAATTAGATTCATATCCCGTCCCTATTCAAACCGGATTTTTTAGGCTGAACTGTTCTATTTGTCAGAACATTGCCGTAGGAAAGCCTCACATACAAATATTCAATATTCACGCTGGGGCTACCGTTGCTCCGGGGGTGTTATTTGGTTGCCGTACCGTTACCAACTACAACAAAAACTATTGTTATTCATTCAGCAGAGCCAATGTACTGCACGAAATTGACGTCACTTCACTCGTTCTCCTCCATCCTTTATCGAGAGGTCAAGTGAAAATAAGGAGCACCAATCCGTTCGACGATCCTATAATAGAATTAGGTTATTTCAGGAACAAACAGGATGTCATGATAGCTGTGGAAGCAGTTCAATTCATGATGAAATTTACCGAAACTTCTTATTACAAGAAAGTTGGTGGAAGACTTGTTAAACTAGACGTAGATGGTTGTCAAGGGATTCCTTACAATACATATGAGTATTGGTACTGCTATGTCATAAGCTCAGCGACCTCCATATTACATCCGGTAGGAACCTGCGCGATGGGCAGAAACGGTGTGGTGAATGAACGATTAAAAGTACATAATATTGACGGTTTGAGGGTAGTTGACGCTTCAGTCATGCCTCTGATAACCAGTGGAAATACCAACGCACCCACTATGATGATAGGAGAAAAAGCTGCGGATATGATCAAAGAAGATTACAAAGTATTTTACGGGTAA

Protein sequence:

>DPOGS213756-PA
MEIVYTFANTFLSFIQSVRDLPWLSWLLRYLSIYQALSPVEWPASYDLKDGDTFDFIVVGAGSAGAIVASRLSEIYNWKVLLLEAGGNPPPASVLPSTFAILSHTEYDWNYKADLDNGTGQSHVAGSIYMSRGKMLGGCSSNNYEIYARGAPQDFDDWSKVAPGWDWNSVLYYYKKLENMTDHTVLEDPNSSYLYSTHGPVAISRPKQNQYFEKVDETVLASYEEMGLKRLLSTNGPEILGVSRPHVTFANGRRSSTAEAYLRPLRDRRNLLVTKYARVIKILIKSNRRKAYGVQVQLKTGQFINVFAKLEVIVSAGTIDTPKLLMLSGIGPKEILQKHNIKMVADLPVGKNLQDHNLTPLIFTGKKGFHTAIQNVLITAELDSYPVPIQTGFFRLNCSICQNIAVGKPHIQIFNIHAGATVAPGVLFGCRTVTNYNKNYCYSFSRANVLHEIDVTSLVLLHPLSRGQVKIRSTNPFDDPIIELGYFRNKQDVMIAVEAVQFMMKFTETSYYKKVGGRLVKLDVDGCQGIPYNTYEYWYCYVISSATSILHPVGTCAMGRNGVVNERLKVHNIDGLRVVDASVMPLITSGNTNAPTMMIGEKAADMIKEDYKVFYG-