Monarch geneset OGS2.0

DPOGS212710
TranscriptDPOGS212710-TA1884 bp
ProteinDPOGS212710-PA627 aa
Genomic positionDPSCF300012 - 679553-685507
RNAseq coverage388x (Rank: top 31%)
Annotation
HeliconiusHMEL0141540.086.60% 
BombyxBGIBMGA013215-TA0.077.85% 
DrosophilaGld-PA0.065.71% 
EBI UniRef50UniRef50_P181730.065.71%Glucose dehydrogenase [acceptor] n=44 Tax=Neoptera RepID=DHGL_DROME
NCBI RefSeqXP_967340.10.070.07%PREDICTED: similar to AGAP002557-PA [Tribolium castaneum]
NCBI nr blastpgi|910841910.070.07%PREDICTED: similar to AGAP002557-PA [Tribolium castaneum]
NCBI nr blastxgi|910841910.070.07%PREDICTED: similar to AGAP002557-PA [Tribolium castaneum]
Group
Gene OntologyGO:00166144.6e-212oxidoreductase activity, acting on CH-OH group of donors
GO:00088124.6e-212choline dehydrogenase activity
GO:00506604.6e-212flavin adenine dinucleotide binding
GO:00551144.6e-212oxidation-reduction process
GO:00060664.6e-212alcohol metabolic process
KEGG pathwaydme:Dmel_CG11520.0 
 K00108 (E1.1.99.1, betA, CHDH)maps-> Glycine, serine and threonine metabolism
InterPro domain[8-599] IPR0121324.6e-212Glucose-methanol-choline oxidoreductase
[51-348] IPR0001725.4e-91Glucose-methanol-choline oxidoreductase, N-terminal
[443-587] IPR0078672e-38Glucose-methanol-choline oxidoreductase, C-terminal
Orthology groupMCL10306 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212710-TA
ATGTCTCCTCCGGGGCCGACGCTGGCAGCGGCGTGTGGAGGGGGAGCTTTCATGTTGTTCATGGGTCTACTGGAAGTATTCTTGCGCAGCCAATGTGACCTCGAAGACCCATGTGGACGAGCACAGTTTCGTCGCCACATGGACTCAGTATACGACTTCATCGTAGTGGGTGGTGGGTCCGCTGGTTCCGTGATGGCAGCAAGACTGTCCGAAGTGCCTGAGTGGCGAGTGCTGCTCCTAGAAGCTGGCTTCGACGAACCCACTGGTGCTCAAGTACCTTCAATGTTCTTGAACTTCATCGGCTCAAGCATCGACTGGGGCTACCATACGGAACCTGAGCCAGCAGCTTGTCTTGGAGAGAAGGATAGAAAGTGTTACTGGCCCAGGGGAAAAGTTTTAGGAGGAACGTCTGTCATGAACGGTATGATGTACATTCGGGGTTCGCGAAAAGACTTCGACAGTTGGGCAGCTGCTGGCAATGAAGGGTGGTCCTACGATGAAGTGCTACCGTATTTCCTAAAATCTGAAGATAATAAGCAAATCGAGGAGATGGATAAAGGGTATCACGCTACAGGGGGTCCCTTGACCGTCTCTCAATTTCCATACCACCCACCTCTGAGCCATAGCATCGTTAAAGCTGCTGAAGAACTAGGCTATGAAATAAGAGATCTGAACGGCGAAAAACATACTGGATTCTCAATAGCTCAAACCACAAACAGAAATGGTTCCCGTTTGAGCGCAGCTCGAGCCTTTCTCCGTCCAGCCAAAAACCGACCGAACCTGCACATCATGTTGAACGCCACCGTCTCCAAGATCCTCATCAACCAGACAACCAGACAGGCCTACGCGGTTGAAGTAAGGAACAGTTTTGGCGGCACGGAAGTAATTTTCGCCAACCATGAAATTATTTTAAGTGCGGGCGCGGTGGCGTCGCCTCAGATTCTTCAGCTCAGCGGTGTAGGGGACCCCAAAGTGTTGAACCGCGCTGGCGTGCGACCATTACACGTGTTGCCAGCTGTCGGACGAAATCTCCACAACCACGTGGCGCACTTCCTAAACTTCCACGTGAACGACAATAACACAGTACCACTGAACTGGGCCACTGCCATGGAGTACCTTCTGTTCAGAGACGGACTCATGTCCGGAACCGGTATATCCGAAGTGACTGGTTTTATCAACACGAGGTACTCCGACCCCTCCGAGGACAACCCTGACATTCAACTCTTCTTCGGCGGCTTCCTCGCTGACTGCGCCAAAACCGGCATGGTCGGGGAGAAACTTGGCGAGGGATTCAGAAGCGTCCAGATGTTCCCGGCTGTATTACGTCCCAAAAGCAGAGGAAGATTGGAGATCGCTAGTGCAGATCCGTTCGAATACCCCAAGATTTATGCTAACTACCTCACCCATCCCGACGACGTGAAGACCCTAGTCGAGGGCATCAAATTTGCCATCCGACTCTCTGAAACGAAGGCGCTCAAGAAATACGGAATGAGGTTGGATAAAACACCGGTGAAAGGTTGCGAGAAGATAAAATTTGGTTGTGATGCATACTGGGAATGTGCGGTCAGGGTGCAAACGGCCCCAGAAAATCATCAGGCGGGGTCCTGCAAGATGGGACCCAGAGGAGACCCCACCGCTGTCGTTGATAATTTATTACAGGTCCAAGGTCTAGACCGTCTCCGCGTGGTCGACGCGAGTGTGATGCCCTCCGTGACATCAGGTAACACCAACGCACCTGTAATCATGATAGCCGAGCGCGCCGCTGACTTCATCAAACAGCGCTGGCTTGGAACTACTCCTGTCTCGTACACCAATACCGATGGCGGTGTCTCCAACGTTTTGTCCGCCAGCGAAAGTCACCACCCCGGCTGGCTGTGGCGCTAA

Protein sequence:

>DPOGS212710-PA
MSPPGPTLAAACGGGAFMLFMGLLEVFLRSQCDLEDPCGRAQFRRHMDSVYDFIVVGGGSAGSVMAARLSEVPEWRVLLLEAGFDEPTGAQVPSMFLNFIGSSIDWGYHTEPEPAACLGEKDRKCYWPRGKVLGGTSVMNGMMYIRGSRKDFDSWAAAGNEGWSYDEVLPYFLKSEDNKQIEEMDKGYHATGGPLTVSQFPYHPPLSHSIVKAAEELGYEIRDLNGEKHTGFSIAQTTNRNGSRLSAARAFLRPAKNRPNLHIMLNATVSKILINQTTRQAYAVEVRNSFGGTEVIFANHEIILSAGAVASPQILQLSGVGDPKVLNRAGVRPLHVLPAVGRNLHNHVAHFLNFHVNDNNTVPLNWATAMEYLLFRDGLMSGTGISEVTGFINTRYSDPSEDNPDIQLFFGGFLADCAKTGMVGEKLGEGFRSVQMFPAVLRPKSRGRLEIASADPFEYPKIYANYLTHPDDVKTLVEGIKFAIRLSETKALKKYGMRLDKTPVKGCEKIKFGCDAYWECAVRVQTAPENHQAGSCKMGPRGDPTAVVDNLLQVQGLDRLRVVDASVMPSVTSGNTNAPVIMIAERAADFIKQRWLGTTPVSYTNTDGGVSNVLSASESHHPGWLWR-