Monarch geneset OGS2.0

DPOGS207060
Transcript: DPOGS207060-TA; 2031 bp
Protein: DPOGS207060-PA; 676 aa
Genomic position: DPSCF300001 + 2243593-2248463
RNAseq coverage: 723x (Rank: top 18%)
Annotation
Heliconius: HMEL010460; 0.0; 52.42%
Bombyx: BGIBMGA013007-TA; 0.0; 63.29%
Drosophila: CG9503-PA; 3e-128; 39.28%
EBI UniRef50: UniRef50_E0VUA4; 1e-142; 42.60%; Glucose dehydrogenase, putative n=1 Tax=Pediculus humanus corporis RepID=E0VUA4_PEDHC
NCBI RefSeq: XP_972632.2; 1e-144; 43.94%; PREDICTED: similar to AGAP003781-PA [Tribolium castaneum]
NCBI nr blastp: gi|307185097; 2e-146; 42.42%; Glucose dehydrogenase [acceptor] [Camponotus floridanus]
NCBI nr blastx: gi|270009089; 2e-143; 44.85%; hypothetical protein TcasGA2_TC015724 [Tribolium castaneum]
Group
Gene Ontology: GO:0016614; 1.3e-141; oxidoreductase activity, acting on CH-OH group of donors
Gene Ontology: GO:0008812; 1.3e-141; choline dehydrogenase activity
Gene Ontology: GO:0050660; 1.3e-141; flavin adenine dinucleotide binding
Gene Ontology: GO:0055114; 1.3e-141; oxidation-reduction process
Gene Ontology: GO:0006066; 1.3e-141; alcohol metabolic process
KEGG pathway: dme:Dmel_CG9518; 2e-122
KEGG pathway: K00108 (E1.1.99.1, betA, CHDH) maps to Glycine, serine and threonine metabolism
InterPro domain: [36-677] IPR012132; 1.3e-141; Glucose-methanol-choline oxidoreductase
InterPro domain: [263-404] IPR000172; 2.8e-36; Glucose-methanol-choline oxidoreductase, N-terminal
InterPro domain: [522-666] IPR007867; 6.6e-35; Glucose-methanol-choline oxidoreductase, C-terminal
Orthology group: MCL10024; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207060-TA
ATGATAATGACAAAAATTATTCTATTAATAATATCGTGCATTTCGGTTGCATTAAGTGATATTAAAGAAAATGATGTCAGATCTGGAAAACTCTTCTGGCCGGAGTACAGAAACCCTGTTGTTGAAAAGATTATGAGTGCAGTAGGAGCGAATCCTTACGCGACGGGAGATTTTTTCGATTTCCTCAGGGACTCGTATCCTTTGCCGAGAGGCTTAAAAGAACCATATTCAGAATATGATTTCGTCATCGTGGGAGCTGGTTCTGCTGGCAGCGCTTTAGCGTCACGACTCACGAGAAACAGAAATACGACTGTGTTACTCATAGAAGCTGGAAAACCGGAGATGCTTTTAACAGATGTTCCAGTGGTGGCACCATATTTTCAAGACACACCATATGTTTGGCATTACTACATGGAACCTCAACCAGGAGTTTGCATGGGTATGAAAAATCAACGCTGCTTTTGGCCTCGAGGCAGAGCAGTCGGCGGAACTAGCGTCATCAACTACATGATCTACACCAGAGGCAGACCACAGGATTGGAACAGAATAGCTGCAGATGGAAATTACGGATGGGCTTACAATGATGTTCTCAAGTATTATATCGAAATGGAAAAATCTGATTTGAAGGGCTACGAGAAGGCGGCGCACAGAGGTCGTGATGGTGACCTGCCTGTGGAATTTCCACCCATAAAGCAAGTTTACGAAAGTTTTCTTACTAACGATATATTTTACTATATTTTCACTCATAAAGTTTTAGAACGCTTTATTTTAATAATGACGACGAGGTTAGTTGAAGCATTTCTTAAAGCTGGTGAAATTCTTGGATATCCGACCGTCGATTACAATGCACCAGACAAAATTGGTTTCGGGCGTGTACAAGCCACAATAAGCAGAGGTCATAGATTCAGTGCTGCCAAATCTTTTCTTCATGGTCATAAAAATAGACCAAATTTGCATATCTTACCCGAGAGTAGAGCAACAAAAATATTAATAGACCCTGTAACAAAAACAGCGTATGGTGTAGAATATATAAGAAATGACCTTCTCCACACAGTTTTCGCACGCAAAGAAGTTATATTGTCCGCCGGACCCATAGCCTCGCCACAATTGCTTATGTTATCTGGTATTGGACCCGAAGAACATCTAAAATCCGTGGGAATACCAGTCATACAAGACCTTCAAGTAGGACAAAGACTCTATGACCACATTTGTTTCCCTGGTTTGATATTCACATTGAACACGACAGAAATCAGTTTCATTGAGAATAGAGATGTATCTCTGAAGGTCATATTAGACTGGCTGCAACATGGGGATAATTTACTTTCAACGCCTGGCGCAGTAGAAGGTATTGGGTATATCCGAACTCCAGTTTCCAACGATCCCGACCCGACAGTTCCAGATATTGAACTTATAAATATCGGTGGCTCTATAATATCTGACGGTGGTATTGGTGCAAGTAGGGCTGTAAGGAGGGGTATGAGAATATCTGAGACTCTCTTTGATGAAGCATATGGACCTATTGATGGGCAAGATTCATGGTCCGTTTTTCCACTCCTGATACATCCTAAATCATTTGGTCATATTAAACTAAGAGATAATAATCCCTTAAGTCATCCGAAAATGTATGGTAACTATTTGACCGATCCGAGTGACGTTGCTACCTTCCTAGCATCATTCCGGTATATCCAATCATTAGCAGCAACGCCTGCTCTTCAAAAATATGGAGCCAAAACGTACCTGCCCAAATTTAAAACCTGTATACAGCACGTACCTGATACAGACGAATACTGGGAATGTGCGTTACGTACATTGACTGCGACCCTACATCATCAAATAGCCACAACGCGCATGGGTCCAGATGGAGACCCGGATGCTGTAGTTGACCCCGAATTAAGGGTTCGAGGTATTAAAAACTTAAGAGTTGTTGACTCAGGTATCATACCTCGTACTATATCAGCACATACAAATGGTCCAGCCATTATGATAGGGTATAAGGCGGC
GGATATGATAAGAAAGACATGGAATATATAA

Protein sequence:

>DPOGS207060-PA
MIMTKIILLIISCISVALSDIKENDVRSGKLFWPEYRNPVVEKIMSAVGANPYATGDFFDFLRDSYPLPRGLKEPYSEYDFVIVGAGSAGSALASRLTRNRNTTVLLIEAGKPEMLLTDVPVVAPYFQDTPYVWHYYMEPQPGVCMGMKNQRCFWPRGRAVGGTSVINYMIYTRGRPQDWNRIAADGNYGWAYNDVLKYYIEMEKSDLKGYEKAAHRGRDGDLPVEFPPIKQVYESFLTNDIFYYIFTHKVLERFILIMTTRLVEAFLKAGEILGYPTVDYNAPDKIGFGRVQATISRGHRFSAAKSFLHGHKNRPNLHILPESRATKILIDPVTKTAYGVEYIRNDLLHTVFARKEVILSAGPIASPQLLMLSGIGPEEHLKSVGIPVIQDLQVGQRLYDHICFPGLIFTLNTTEISFIENRDVSLKVILDWLQHGDNLLSTPGAVEGIGYIRTPVSNDPDPTVPDIELINIGGSIISDGGIGASRAVRRGMRISETLFDEAYGPIDGQDSWSVFPLLIHPKSFGHIKLRDNNPLSHPKMYGNYLTDPSDVATFLASFRYIQSLAATPALQKYGAKTYLPKFKTCIQHVPDTDEYWECALRTLTATLHHQIATTRMGPDGDPDAVVDPELRVRGIKNLRVVDSGIIPRTISAHTNGPAIMIGYKAADMIRKTWNI-
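
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, assuming the standard genetic code applies and that the trailing "-" in the protein record marks the stop codon. The function name translate and the short head and tail fragments are illustrative choices; the fragments are copied from the first 30 and last 45 nucleotides of DPOGS207060-TA.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed to apply here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence; stop codons become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 45 nucleotides of DPOGS207060-TA, copied from the record.
head = "ATGATAATGACAAAAATTATTCTATTAATA"
tail = "GGGTATAAGGCGGCGGATATGATAAGAAAGACATGGAATATATAA"

print(translate(head))  # MIMTKIILLI
print(translate(tail))  # GYKAADMIRKTWNI-

# Length check: 2031 bp = 677 codons = 676 residues plus one stop codon.
print(2031 // 3 - 1)  # 676
```

Both fragments agree with the start and end of DPOGS207060-PA, and the 2031 bp transcript length matches 676 amino acids plus a stop codon.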