Monarch geneset OGS2.0

DPOGS207054
Transcript         DPOGS207054-TA   2256 bp
Protein            DPOGS207054-PA   751 aa
Genomic position   DPSCF300001 + 2158224-2161343
RNAseq coverage    1x (Rank: top 94%)
Annotation
Heliconius       HMEL010504         0.0      77.05%
Bombyx           BGIBMGA012996-TA   0.0      71.73%
Drosophila       CG12398-PA         8e-178   47.85%
EBI UniRef50     UniRef50_E2BJJ8    4e-177   51.00%   Glucose dehydrogenase [acceptor] n=14 Tax=cellular organisms RepID=E2BJJ8_HARSA
NCBI RefSeq      XP_001601971.1     0.0      57.28%   PREDICTED: similar to CG12398-PA [Nasonia vitripennis]
NCBI nr blastp   gi|156551742       0.0      57.28%   PREDICTED: glucose dehydrogenase [acceptor]-like [Nasonia vitripennis]
NCBI nr blastx   gi|156551742       0.0      57.28%   PREDICTED: glucose dehydrogenase [acceptor]-like [Nasonia vitripennis]
Group
Gene Ontology     GO:0016614   1.9e-82   oxidoreductase activity, acting on CH-OH group of donors
                  GO:0050660   1.9e-82   flavin adenine dinucleotide binding
                  GO:0055114   1.9e-82   oxidation-reduction process
KEGG pathway      dme:Dmel_CG12398   6e-176
                  K00115 (E1.1.99.10) maps-> Pentose phosphate pathway
InterPro domain   [62-357] IPR000172    1.9e-82   Glucose-methanol-choline oxidoreductase, N-terminal
                  [476-620] IPR007867   8.4e-44   Glucose-methanol-choline oxidoreductase, C-terminal
Orthology group   MCL10024   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207054-TA
ATGGCAAGACACAATCCTGCGTCCATTAAACCTTTAGTTTCTAGTACTAGAATTATGTTCGGGCTCTTACCGGGACTTGGAGCCATTATTTTATTACGTTTAGTGATACATTTATATCGTCCTGATATAGAAGATGCAGAAAACAGAGTAAAGGATTGCGAGCCGGAGGACTTGTATGAGTGGTATGACTTTATCGTTATTGGAGGTGGATCAGCGGGTTCTGTTGTTGCAAGCCGTTTATCAGAAAATCCTGGATGGAATATTCTTCTCTTGGAGGCGGGACCAGACGAAAATGTATTATCAGATGTACCGGTAATGTTTCCTGCACTACAAACATCGAATGTGGATTGGCAATTCTTAACTGAACCTAGTGATAAATATTGCTTAAGTATGGATAATACAATGTGCAAGTGGCCTCGAGGGAAGGTATTGGGCGGCTCTAGCACACTAAATGCTATGCTTTATATAAGAGGAAATAAACGAGATTACGATAATTGGGCTGACATGGGCAACGAAGGTTGGTCTTATAATGATGTTTTGAAGTATTTTCTTAAGGCAGAGGATATGAAAATACCAGAGTACCAGAATAGTCCTTATCACTCAACTGGTGGTCCGATAACAGTTGAATACTTTCGTTACCAGCAACCAATTACAAGTAAAATTTTGGAAGCTGGAGTACAACTTGGCTACAACATTTTAGATGTAAATGGCGAAACGCAAACTGGATTTACTAGATCTCATGCTACCATTCGTGATGGTCTTCGTTGTAGTACAGCAAAAGGCTATCTGAGACCGGCGAGTAAAAGACCCAATCTTCATGTGAGCATGCATTCATTTGTAGAGAAAGTATTAATAGATGAACTAAAAGTTGCATATGGTATAAAGTTTACCAAACACAAAAAATCGTATGTTATAAGGGCTAGTGGGGAAATAATTATATCAGCGGGGGCGATACAATCCCCACAAATATTGATGTTATCTGGGGTTGGAGACAGTGAACAGTTGGAGGAACTCGGTATACATCCAATAATAAATTCGCCTGGTGTTGGCCAGAATCTCCAGGATCACGTCGCCATGGGCGGTCATTCATTTTTATTTGATAATCCCTATACTAATGGAACCGATTATTGCTTCAATTTGAACACAGTAGTTTCATTAGCAAGCCTCATCGACTTCACCATTAATAAAAACGGACCATTATACAGCATGATGGAGGCGGAAGCTATGGCTTTTGTAAACACAAAGTACCAGGATCCAACGGAAGACTATCCGGATATACAGTTTTTCATTGCTCCGACAGCAGACAATATGGATGGTGGATTGTTTGGGAAACGTGCTAATGGAATATCGGATGAAACTTATGCTGAATTGTATGAGGACATCCTTTACGACTCCTCATTTTCTATAGTTCCCTTGCTTTTAAGACCTAAGAGTCGCGGCTACATAAAGTTAAGAGACGCAAGCCCCTTTTCCGCTCCTCTGATCTATCCAAACTATTTCACAGAGCCAGAGGATGTCAAAATATTGACCGAAGGCGCTAGAATAGCACTAAAGTTGGTTCAACAACCAGCGCTGCAAGAATTAAATGCAAGACCTAACCCGAATCGTAACCCTGGATGTGCGGAACATCCATTGATGTCAGATGAACATTTGGAATGTCAAGCACGCCACCATACTTTAACAATATATCATCCAGTGGGAACCTGCGCTATGGGTCCCCGCGGAGACCCAAATGCTGTGGTAGATCCTAGACTAAGGGTGTACGGAGTAAGTAACTTAAGAGTGGTTGATGGAAGTATAATGCCGAAAATAGTAAGCGGAAATACGAATGCTCCAATAATCATGATAGCAGAAAAGGCATCAGACATGATTAAAGACGATTATGAGCAAGCAGACTTTGACGCTATGACACCTTACAATAACTATGAACGGACATTCAATCATTACCTCGAACCAACTAACTATATATTGCCTTACTCCTTCGATTTTACAATACCAAA
TATCTTATATGAGCCGTATCCATACTACAATTATAATCCAAGTGAGTTTACTAATGATTATTCAAAATTTTCAAAAAATATTATAAAACATGATCAAAGAAATGTACACATTCCAGATATTGCATACGTGCCTCCGTTTATTGATCAAACACAACAAATATCTCCAAAATTCTACAGTAATAGCCGAAGAGAGCAAAATAAAAAATGTCGTCACTGGTTATTCTATAACGGGAATAAAATTGAAATAGATATTTAA

Protein sequence:

>DPOGS207054-PA
MARHNPASIKPLVSSTRIMFGLLPGLGAIILLRLVIHLYRPDIEDAENRVKDCEPEDLYEWYDFIVIGGGSAGSVVASRLSENPGWNILLLEAGPDENVLSDVPVMFPALQTSNVDWQFLTEPSDKYCLSMDNTMCKWPRGKVLGGSSTLNAMLYIRGNKRDYDNWADMGNEGWSYNDVLKYFLKAEDMKIPEYQNSPYHSTGGPITVEYFRYQQPITSKILEAGVQLGYNILDVNGETQTGFTRSHATIRDGLRCSTAKGYLRPASKRPNLHVSMHSFVEKVLIDELKVAYGIKFTKHKKSYVIRASGEIIISAGAIQSPQILMLSGVGDSEQLEELGIHPIINSPGVGQNLQDHVAMGGHSFLFDNPYTNGTDYCFNLNTVVSLASLIDFTINKNGPLYSMMEAEAMAFVNTKYQDPTEDYPDIQFFIAPTADNMDGGLFGKRANGISDETYAELYEDILYDSSFSIVPLLLRPKSRGYIKLRDASPFSAPLIYPNYFTEPEDVKILTEGARIALKLVQQPALQELNARPNPNRNPGCAEHPLMSDEHLECQARHHTLTIYHPVGTCAMGPRGDPNAVVDPRLRVYGVSNLRVVDGSIMPKIVSGNTNAPIIMIAEKASDMIKDDYEQADFDAMTPYNNYERTFNHYLEPTNYILPYSFDFTIPNILYEPYPYYNYNPSEFTNDYSKFSKNIIKHDQRNVHIPDIAYVPPFIDQTQQISPKFYSNSRREQNKKCRHWLFYNGNKIEIDI-
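
Consistency check (sketch):

The lengths reported in this record can be cross-checked with simple arithmetic. The minimal Python sketch below assumes that the transcript consists of the coding sequence plus one stop codon, and that the genomic coordinates are 1-based and inclusive; neither convention is stated in the record. The helper names `coding_length_bp` and `span_length_bp` are hypothetical and chosen for illustration only.

```python
def coding_length_bp(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues, plus an optional stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def span_length_bp(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 2256
protein_aa = 751
genomic_start, genomic_end = 2158224, 2161343

# 751 codons plus one stop codon should account for the whole transcript.
expected_bp = coding_length_bp(protein_aa)
lengths_agree = expected_bp == transcript_bp

# Under the inclusive-coordinate assumption, the genomic span is longer than
# the transcript; the difference is sequence absent from the transcript.
genomic_span = span_length_bp(genomic_start, genomic_end)
non_transcript_bp = genomic_span - transcript_bp

print(lengths_agree)
print(genomic_span, non_transcript_bp)
```

Under these assumptions the 751 aa protein and the 2256 bp transcript agree, and the genomic span exceeds the transcript length, which would be consistent with intronic sequence in the gene model.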