Monarch geneset OGS2.0

DPOGS207055
TranscriptDPOGS207055-TA1875 bp
ProteinDPOGS207055-PA624 aa
Genomic positionDPSCF300001 + 2169799-2172051
RNAseq coverage149x (Rank: top 53%)
Annotation
HeliconiusHMEL0225370.082.85% 
BombyxBGIBMGA013001-TA0.075.58% 
DrosophilaCG9514-PA0.076.48% 
EBI UniRef50UniRef50_Q7PS750.074.25%AGAP003784-PA n=18 Tax=cellular organisms RepID=Q7PS75_ANOGA
NCBI RefSeqXP_001602085.10.077.06%PREDICTED: similar to ENSANGP00000015188 [Nasonia vitripennis]
NCBI nr blastpgi|1565517500.077.06%PREDICTED: glucose dehydrogenase [acceptor]-like [Nasonia vitripennis]
NCBI nr blastxgi|1565517500.077.45%PREDICTED: glucose dehydrogenase [acceptor]-like [Nasonia vitripennis]
Group
Gene OntologyGO:00166142.1e-181oxidoreductase activity, acting on CH-OH group of donors
GO:00088122.1e-181choline dehydrogenase activity
GO:00506602.1e-181flavin adenine dinucleotide binding
GO:00551142.1e-181oxidation-reduction process
GO:00060662.1e-181alcohol metabolic process
KEGG pathwaydme:Dmel_CG95140.0 
 K00108 (E1.1.99.1, betA, CHDH)maps-> Glycine, serine and threonine metabolism
InterPro domain[1-595] IPR0121322.1e-181Glucose-methanol-choline oxidoreductase
[29-326] IPR0001723.1e-86Glucose-methanol-choline oxidoreductase, N-terminal
[439-582] IPR0078671.8e-42Glucose-methanol-choline oxidoreductase, C-terminal
Orthology groupMCL10024 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207055-TA
ATGTTAGCAGCCCTCGCATATTTCCATTACGACTTGCTGGACCCGGAAAACAGACCATTCAATCAAAAATACCTCAGAGAAGAATACGACTTTGTAATAATCGGAGGCGGTTCTGCTGGAGCAGTGCTAGCGAATAGATTAACAGAAGTAGAGGGCTGGAATGTACTACTACTGGAAGCTGGGGGTCACGAAACGGATATTAGTGACGTGCCATTATTATCTTTGTATCTGCACAAAAGCAAATTAGATTGGAAATATCGAACTCAACCACAAGATTCGGCGTGTCAGGCAATGATTGATAAGAGGTGCAGTTGGACGAAGGGGAAGGTTCTCGGCGGTTCATCAGTACTCAACACAATGCTTTACATCCGAGGAAACAAACGAGACTTTGACCAATGGGAATCATTCGGTAACCCGGGCTGGGGTTACGAAGACGTCTTACCTTACTTTAAGAAATCGGAAGATCAACGGAACCCCTATTTGGCGAAAGATACAAAATATCACTCCACGGGTGGATATCTCACGGTTCAAGACGCACCATATAATACGCCCATCGGAGCAGCGTTTCTTCAGGCTGGTGAAGAAATGGGTTATGACATATTGGATATTAATGGTGCCCAGCAAACAGGTTATGCTTGGTACCAATTTACTATGAGACGAGGAACAAGATGTTCTACGGCTAAAGCTTTCTTGAGACCTGTGCGAGTACGACAAAATCTTCACATTGCTCTTTTTTCTCATGTTACAAAAGTTTTGATAGACAAAGATAAGAAAAGAGCTTATGGCGTAGAGTTCTTTAGAGATGGAATCAAACAGGTCGTTTATGCAAAACGAGAGGTAATTCTGGCTGCAGGAGCAATTGGATCTCCACAATTACTCATGCTTTCTGGTATTGGACCAGCTCAACATTTGGAAGAAGTGGGTATTGATGTTGTCTACAATTCCGCTGGAGTGGGAAGAAATTTACAAGATCATATCGCCGTAGGAGGTATAGTTTTTCAAATCGATTATCCTATAAGCATCGTTATGAATAGACTTGTGAACATTAATTCAGCTTTACGCTACGCTGTTACGGAAGATGGACCATTAACTTCAAGCATTGGCTTAGAAGTTGTAGCCTTTATTAATACTAAATATGCTAATGAAACTGAAGATTGGCCAGACATTGAGTTCATGATGACATCTGCATCTATACCTTCGGATGGGGGGACACAAGTTAAAGTTGCTCACGGCATAACTGATGAGTTCTATGAAGAAGTTTTTGGTCATCTAACTAGTAAGGACGTCTGTGGAATATTTCCCATGATGTTGAGACCAAAAAGTCGGGGCTTTATAAAATTAAGATCTAAAAATCCCTTAGATTATCCATTGATGTACCATAATTATTTGACGCACCCCGATGATGTTGGAGTAATGAGAGAAGGTGTAAAAGCCGCCGTAGCTGTAGCTGAAACAGCAGCCATGAAGCGTTTGGGTGCTAGATATAACAGTAAACCTGTCCCAAATTGCAAACATTTACCTCTATACACGGATGAATACTGGGAATGCTATATACGCCAGTATACCATGACAATTTATCATTTGTCTGGCACAGCTAAGATGGGCCCATCAAGTGATCCTATGGCCGTGGTGGATCCTGAATTACGTGTTTATGGTGTCGAAGGCTTGCGTGTCATTGATGCAAGTATAATGCCAGCTGTTACCAATGGTAATATTAATGCTCCAGTAATTATGATTGCAGAAAAAGGCTCAGATCTAATTAAAAATACTTGGAAGCCAAAACAAAAAAGTAGAAGCCGTAGATCTTCGAAATGTTCCAAATTGGAGCAAATTCTGAAGATAGAACGAACTTCACAATGCCAGAAAGAAAGATGA

Protein sequence:

>DPOGS207055-PA
MLAALAYFHYDLLDPENRPFNQKYLREEYDFVIIGGGSAGAVLANRLTEVEGWNVLLLEAGGHETDISDVPLLSLYLHKSKLDWKYRTQPQDSACQAMIDKRCSWTKGKVLGGSSVLNTMLYIRGNKRDFDQWESFGNPGWGYEDVLPYFKKSEDQRNPYLAKDTKYHSTGGYLTVQDAPYNTPIGAAFLQAGEEMGYDILDINGAQQTGYAWYQFTMRRGTRCSTAKAFLRPVRVRQNLHIALFSHVTKVLIDKDKKRAYGVEFFRDGIKQVVYAKREVILAAGAIGSPQLLMLSGIGPAQHLEEVGIDVVYNSAGVGRNLQDHIAVGGIVFQIDYPISIVMNRLVNINSALRYAVTEDGPLTSSIGLEVVAFINTKYANETEDWPDIEFMMTSASIPSDGGTQVKVAHGITDEFYEEVFGHLTSKDVCGIFPMMLRPKSRGFIKLRSKNPLDYPLMYHNYLTHPDDVGVMREGVKAAVAVAETAAMKRLGARYNSKPVPNCKHLPLYTDEYWECYIRQYTMTIYHLSGTAKMGPSSDPMAVVDPELRVYGVEGLRVIDASIMPAVTNGNINAPVIMIAEKGSDLIKNTWKPKQKSRSRRSSKCSKLEQILKIERTSQCQKER-