Monarch geneset OGS2.0

DPOGS207053
TranscriptDPOGS207053-TA3612 bp
ProteinDPOGS207053-PA1203 aa
Genomic positionDPSCF300001 + 2150416-2157116
RNAseq coverage123x (Rank: top 57%)
Annotation
HeliconiusHMEL0105040.039.81% 
BombyxBGIBMGA012996-TA0.042.03% 
DrosophilaCG9503-PA0.063.14% 
EBI UniRef50UniRef50_E2BJJ80.039.31%Glucose dehydrogenase [acceptor] n=14 Tax=cellular organisms RepID=E2BJJ8_HARSA
NCBI RefSeqXP_394224.10.065.32%PREDICTED: similar to CG9503-PA [Apis mellifera]
NCBI nr blastpgi|3072060620.039.31%Glucose dehydrogenase [acceptor] [Harpegnathos saltator]
NCBI nr blastxgi|3072060620.038.73%Glucose dehydrogenase [acceptor] [Harpegnathos saltator]
Group
Gene OntologyGO:00166143.2e-79oxidoreductase activity, acting on CH-OH group of donors
GO:00506603.2e-79flavin adenine dinucleotide binding
GO:00551143.2e-79oxidation-reduction process
KEGG pathwaydpo:Dpse_GA218490.0 
 K00108 (E1.1.99.1, betA, CHDH)maps-> Glycine, serine and threonine metabolism
InterPro domain[640-935] IPR0001723.2e-79Glucose-methanol-choline oxidoreductase, N-terminal
[1048-1192] IPR0078671.7e-42Glucose-methanol-choline oxidoreductase, C-terminal
Orthology groupMCL10024 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207053-TA
ATGGAATCTCTGGCGGCAAATATAACTGCAACATGCCCGTTATCGTTTGGTGGCACAGCTGGAGAACTTTTTTTAAAAGCAGTTACAACGGTGATTACCGCACATTGTGGAATCATGGATGACTATAAATGGCCCCCAGACGATGCTTATGATATCATCAATAAAGGATCTGGAATATCTTTTGATTTCATAGTCGTTGGCGCAGGAACTGCTGGATCTTTAATTGCCAGCAGACTTTCAAAGCAATATCCGTCTTGGAATATACTTCTGATTGAAGCTGGTGATGATCCCGGAATTGATAGTGAGATCCCAGCATTTTTATTTTTAAATCAAAACTCAAGCAATGACTGGTCATATACAACAGAGGGACGTGGGGAGAGTTGTTTGGGTTTCAATAATGAAAGATGCATTTGGAGTAAAGGAAAAGGACTCGGCGGATCAAGTTCTATTAATGCGATGATTTATTTAAGAGGGCACCCTAAAGACTATAACACATGGGAAAAGTTAGGCAACCCGGGATGGGGATACAAGGAAATGTCTAAATATTTCGATAAAATAGAAAATATTTTTAATATTACTGACCCTCACTTCAGCGGATACGAAAACCAATGGTATAAAATTTTAGATAATGCATGGAAAGAATTATCTTTTGCAAATTATAATTACGAAAATCATGAAGCCCTAACCGGGACCAAGAAAACGAGACTGCTAACAAGAAATGGGAAACGTATGAACACAGCTAAAGCATTTTTTAACCAGGCAGGAAAAATGACTGTAATGAAAAATACGCAGGTAGAGAAGGTTATAATTAACCCAAAAACTAAACGAGCTACTGGTGTCAAAATACACCACAAAGATGGAACCATCATGGAAATTGATGTTAGCAAAGAGATATTATTGGCAGCTGGTTCGATTGCAACTCCACAAATTCTTATGCTATCAGGAATCGGACCTAAAGATCACCTTAAAGTTATGGGCATCGATATCATCTTAAATTCACCCGTAGGAAAAAACTTACAAGATCATATTATTCTTCCATTATTTCTTAAAACCAATATAAAAATGGAACTGCCTTCTTCTGTTATTCAAATGTTTTTGTTACAGTACATGTTAACGAAATCGGGACCAATATCAAACATCGGTCTAACAGATTACATGGGTTTTATAGATACGAAAAACGTATCAGATTATCCAGATATACAATTTCACTACACATATTTCACTAAGAACGACAATTTTGTTTTAAGGCCATACCTAGAAGGCATTGGTTATAAAAGAAAAATCATTGAAGCCATAGAGGCGTTGAACTACAAAAACGATATTCTAGGCATTTATCCGACATTATTGCATCCTAAGGCTAGGGGTGAGATATTTCTTTCAGAACGTGATTTATCAAAACCTATTATAAATGCTAATTATTTTCAACATTCTGACGACATGCTAGCAATGATAGAGGCTATTGATTTTATTCACACACTCGAAAAAACCTCCACGTTCGAGAAATACAATATAAAATTGTTACATATTAATATTTCTGAATGCGATATATATCCATTTGACACTGAGAAATATTGGGAATGTTATATAAAATATATGGCGACGACGATTTATCATCCCGTCGGTACTACCAAGATGGGACCACCAGAAGATGCGTCTGCTGTTGTAAATTCTGAATTAATTGTTCATGGAACACCAAACATCAGAGTTGTTGACGCTAGCATAATGCCTAACATACCGGGAGGTAACACTATGGCAGCGACTTTGGCGATCGCCGAAAAAGCATTCGACATTATAATTTGTTTAAAGTACCTATTAATACATTTGAGACTGGTGATTACAGAGGCATTTGGGATTAAAGACATAAAATTAGACACAGTGTTTGTAGATATTTTACAAGAAAGTCGACTAATGTCTGAATATGATTTCATAGTAGTTGGGGCTGGATCTGCTGGAGCTGTGGTGGCAAATCGTCTTTCGGAAATAAAAGATTGGAACATATTGCTTTTAGAAGCTGGAAGCGACAGGAACATTTTGACCGATATTCCAATTTTGGCTGCTGAATTTCAATTAGGACATCAAGATTGGCAATATAAGACAAGCCCCCAAGGAACAACATGTCTTGCAATGAACAACGGCAGTTGCAATTGGCCTCGAGGAAAAGTTTTAGGCGGTAGCTCTGTTCTAAACTACATGTTATACTTAAGAGGTAACAGTAGAGATTACGACGGTTGGGAGTCATTAGGTAATAAAGGCTGGGGATTTAAAGAAGTACTGCCCTATTTTAAAAAATCTGAAGATAACAAAAATCCCAACTATGCTCACACTAAATATCACGGAACGGGAGGATATTTGACTGTTTCTGACGTACCATATCATACTCGTCTCGCAACAAGTTTTATTGAAGCTGGATTAGAATTAGGTTATAAAAATAGAGATATCAATGGAAAATACCAAACTGGATTTACTCTCGCTCAGGGGACTACTCGACGCGGAGCAAGATGTTCTACAGCAAAAGCCTTTTTAGATACGGCTAAAAATAGAAAAAATTTACATATATCAAAACAATCATTTGTAACTAAGATTTTGATAGATCCTAAAACTAAAACTGTATCCGGTGTTTCATTTGAAAAAAGAGGCAAAAAATATGAAATTAGAGCTAAAAAAGAGGTTATTTTGTCAACGGGAACAATAAATACGCCTCAGTTATTAATGTTATCTGGTATAGGTCCAAGAGACGAATTGTTAAAACATCAAATACCTATAATTCAAAATCTTCAAGTTGGAAAAAATCTCCAAGACCACGTCAGTGTGGGCGGCTTAGCATTCACTATAAACAAACCGGTTTCTATTGTAGAAACAAGGATGCTGAAACCAAAATATTTTTTCCAATATTTGATATCGCGTAATGGACCATTTACCATTTTAGGCGGAGTTGAAGGACTAGCATTTATTAATACAAAGTACGCGAATGCGTCACACGACTATCCCGACATACAGTTTCATTTTATACCCGGCGCAACTAACTCCGACGGTGGAAGAAATTTAAAAAAAGTCCATGGTCTAACAAATGAATTTTATGATGCTGTGTTTAAGCCAATAAACTATAAAGACACATGGAGTGTTATGCCGATATTATTGCGTCCACAAAGTAGAGGATACATTGAATTAAAGAGCTCAAATCCTCATGACTATCCCATTATCCATCCAAATTACCTAGCCGAAGATATAGACTTGAAAACCTTAATCGAAGGAGTCAAGGCTGGATATAAATTGTCCAAAACAACAGCTTTTAAGAAATATAATTCAGAATTCAATAAAAATATATTTCCCGCATGTAAAGCAATTAAAAAATTCACTGATGAGTTCTGGGAGTGTATGATTAGACAGTACACATTTACCTTCTATCATCCGGTCGGAACGGCAAAAATGGGTCCTAATTCCGATCCTAATGCTGTTGTTGACCCCGAACTTAAAGTATATGGCGTAAAAGGTCTACGAGTTGTAGATGGTAGTATAATGCCGAATATTGTGAGCGGTAATACAAATGCACCTATCATTATGATTGCAGAGAAAGCAAGTGATATGATTAAAAAATTCTGGAAAAAAAAATGA

Protein sequence:

>DPOGS207053-PA
MESLAANITATCPLSFGGTAGELFLKAVTTVITAHCGIMDDYKWPPDDAYDIINKGSGISFDFIVVGAGTAGSLIASRLSKQYPSWNILLIEAGDDPGIDSEIPAFLFLNQNSSNDWSYTTEGRGESCLGFNNERCIWSKGKGLGGSSSINAMIYLRGHPKDYNTWEKLGNPGWGYKEMSKYFDKIENIFNITDPHFSGYENQWYKILDNAWKELSFANYNYENHEALTGTKKTRLLTRNGKRMNTAKAFFNQAGKMTVMKNTQVEKVIINPKTKRATGVKIHHKDGTIMEIDVSKEILLAAGSIATPQILMLSGIGPKDHLKVMGIDIILNSPVGKNLQDHIILPLFLKTNIKMELPSSVIQMFLLQYMLTKSGPISNIGLTDYMGFIDTKNVSDYPDIQFHYTYFTKNDNFVLRPYLEGIGYKRKIIEAIEALNYKNDILGIYPTLLHPKARGEIFLSERDLSKPIINANYFQHSDDMLAMIEAIDFIHTLEKTSTFEKYNIKLLHINISECDIYPFDTEKYWECYIKYMATTIYHPVGTTKMGPPEDASAVVNSELIVHGTPNIRVVDASIMPNIPGGNTMAATLAIAEKAFDIIICLKYLLIHLRLVITEAFGIKDIKLDTVFVDILQESRLMSEYDFIVVGAGSAGAVVANRLSEIKDWNILLLEAGSDRNILTDIPILAAEFQLGHQDWQYKTSPQGTTCLAMNNGSCNWPRGKVLGGSSVLNYMLYLRGNSRDYDGWESLGNKGWGFKEVLPYFKKSEDNKNPNYAHTKYHGTGGYLTVSDVPYHTRLATSFIEAGLELGYKNRDINGKYQTGFTLAQGTTRRGARCSTAKAFLDTAKNRKNLHISKQSFVTKILIDPKTKTVSGVSFEKRGKKYEIRAKKEVILSTGTINTPQLLMLSGIGPRDELLKHQIPIIQNLQVGKNLQDHVSVGGLAFTINKPVSIVETRMLKPKYFFQYLISRNGPFTILGGVEGLAFINTKYANASHDYPDIQFHFIPGATNSDGGRNLKKVHGLTNEFYDAVFKPINYKDTWSVMPILLRPQSRGYIELKSSNPHDYPIIHPNYLAEDIDLKTLIEGVKAGYKLSKTTAFKKYNSEFNKNIFPACKAIKKFTDEFWECMIRQYTFTFYHPVGTAKMGPNSDPNAVVDPELKVYGVKGLRVVDGSIMPNIVSGNTNAPIIMIAEKASDMIKKFWKKK-