Monarch geneset OGS2.0

DPOGS207058
TranscriptDPOGS207058-TA3684 bp
ProteinDPOGS207058-PA1227 aa
Genomic positionDPSCF300001 + 2208086-2227203
RNAseq coverage35x (Rank: top 74%)
Annotation
HeliconiusHMEL0101900.077.40% 
BombyxBGIBMGA013003-TA0.087.95% 
DrosophilaCG9518-PA0.069.98% 
EBI UniRef50UniRef50_Q6NR100.069.98%RE11240p n=48 Tax=Pancrustacea RepID=Q6NR10_DROME
NCBI RefSeqXP_001648302.10.072.93%glucose dehydrogenase [Aedes aegypti]
NCBI nr blastpgi|1571042100.072.93%glucose dehydrogenase [Aedes aegypti]
NCBI nr blastxgi|1700307790.072.77%glucose dehydrogenase [Culex quinquefasciatus]
Group
Gene OntologyGO:00166141.3e-83oxidoreductase activity, acting on CH-OH group of donors
GO:00506601.3e-83flavin adenine dinucleotide binding
GO:00551141.3e-83oxidation-reduction process
KEGG pathwaydme:Dmel_CG95180.0 
 K00108 (E1.1.99.1, betA, CHDH)maps-> Glycine, serine and threonine metabolism
InterPro domain[53-348] IPR0001721.3e-83Glucose-methanol-choline oxidoreductase, N-terminal
[461-604] IPR0078674.1e-41Glucose-methanol-choline oxidoreductase, C-terminal
Orthology groupMCL10024 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207058-TA
ATGGCTATCCAAGTCCTTTTAGCATCGACAGCTTTAAAATCAGTCAGCGTTACTGGTCTGTGGTTAATACCACTTCTGCTTGGGGCCTTCACGTATCATAATTATAACTCCTACGATCCAGAATCGAAGGTACTAGAAAAAGAACCTAAGAGGGAGTACGATTTCGTTGTAGTTGGCGGAGGCTCTGCTGGTGCAGTCGTTGCAAATCGTCTAACCGAAATCAAAGATTGGAATTTACTTTTATTAGAAAGTGGACCAGACGAGAACGAGATTACTGATGTCCCCTCTTTAGCCGCTTATTTGCAACTAACGAAGTTGGATTGGCAATACAAGACTGAACCGACACCTTACGCTTGTTTGGGTTTTAAGAACAACAGGTGCAGCTGGCCGAGAGGAAAGCTTCTCGGCGGTTCCAGCGTTTTAAACTATATGATTTACGTAAGAGGTAATAAATACGACTACGACCAATGGGAATCTTTTGGCAATCCAGGATGGGGATATCGAGATGTTCTTAAATATTTTATTAAATCCGAAGATAACAGAAACCCTTATTTGGCCAAAAATCAGTATCATGGTCAAGGCGGTTATTTGACTGTGCAGGAAGCACCATGGAAAACACCCCTTGTAGCAGCTTTCGTTGAAGCTGGGGTCGAAATTGGCTATGACAACAGAGATATAAATGGTGCCATCCAAACCGGGTTCATGATGGCCCAAGGGACGATAAGACGTGGTTCTAGATGCAGCACAGCTAAAGCATTTTTAAGACCAGTGAGAACCCGTAAAAATTTAGATATTTCACTGCATTCACACGTTACTAAAATACTCATTAATCCTATGACAATGAAAGCTTACGGAGTAGAATATGTAAAACATGGTATTAAGAAAGTGGTTTATGCTAGAAAGGAAGTTATATTGTCGGCAGGAGCCATTAACAGTCCACAATTATTAATGCTTTCTGGTATTGGTCCAAAAGATCACTTACAGAGCGTTGGCATAAAAGTCCTAAAAGATTTACCAGTAGGAGAAAATTTAATGGATCATGTGGGAGTAGGAGGACTGACATTTCTAGTCGATAAACCAGTCGGAATTGTCCAAAATAGACTTCAGGCATTTCCTGTTACAATGAATTACGTATTAAACGAAAGAGGCCCCATGACCACATTAGGAGGACTTGAAGGTATTGCTTTTGTAAATACAAAATATGCTAATAGCTCCGGATTATGGCCTGATATTCAATTCCATATGGCTCCTGCAACATTTGCTTCAGATAATGGACAAACTGTGAAAAAAGTGTTAGGTCTGAAAGATGAAATTTATGACACTGTTTTTAAACCTATAGCAAATAAAGATGGGTGGACTATTATGCCACTGTTGTTACGGCCTAATACTAGAGGTTACGTTCGATTAAAAAGTTCCAATCCTTTTGAGTATCCTATAATGAATCCACGCTATCATGAAGATCCTCTAGATGTAAGTCGCCTTGTTGAAGGGATAAAAATTGCCTTAAAAGTTGCGAACGCTTCCCCATTTAAGCAATTTGGATCAAGATTATATATGAAACCATTACCAAACTGTAAACAACACAAATTTATGTCCGATGAATATATTGAATGTCAAGTTAGATCAATAAGTATGACCATATATCACCAATGTGGGACGGCTAAAATGGGACCATCTTGGGATAAGGGTGCTGTTGTTGACCCTAGATTGAGGGTGTTTGGTATTGAAGGACTAAGAGTTATAGACGCTAGCATAATGCCGACTATTGTGAGTGGAAACACAAATGCACCAGTAATCATGATAGGAGAAAAGGGTTCTGACATGATAAAAGAAGATTGGTTGAATAGCCTCTGCTACTTCCCTCTCGCAACTTTTGGAAGGGATACTATCCTCGATGGGATAGCCGGCTTTCTCCGTGACGCAGCGGAGATACATAACGGTGAGCCAGCCGAGACTGACTTCATCTTACCCAAGTACGACTTCATCATCGTCGGTGCTGGCACAGCTGGTTGTATACTTAGCAACAGATTGACCGAAGTCGATAAGTTTAAGGTCCTTTTAATAGAGGCAGGTGGAGCAGAGCAAGTATTTATGGACATCCCCGTTCTGGCTACAATGCTGCAATTCACTGAAGCAAATTGGAAGTATCGCACAGAACCTCAAAAGGCCGGATGTATGGGTATGCGTGATAAACGGTGCGCATGGCCAAGAGGAAAAGTCGTAGGAGGGTCTTCCGTGCTCCATTCAATGATGCACACGAGGGGAAATAAACGAGATTATGATACATGGGCAGCTAGTGGAAATCCAGGTTGGGATTATGATAGCGTATTGAAATATTTTAAAAAATCAGAAAATATTGAAATTCCACATTTGGTAAATGACAAAAAATATCATTCAACTCAGGGGCCGATGACAATACAAGAGCCAAGATGGCGAACTCCACTATCAGATGCCTTCCTTGATGCCGGAGTCGAAATCGGTGGAAATATTAATGATTATAATGGTAAAACACAGATTGGATATTCCATTATTCAATTTACTATGAAGAATGGAACTAGAATGAGTGTCAGTCGAGCTTTCTTACATCCTATAAAAAAACGACGTAATTTTCATATCATTAAGAATGCTTTAGTGACCAAAGTTCTCATAGATCACAAAAAAAAACGCGCTTATGGCGTACAATTTGAAAAAGATGGTAAACAAATTGTAGTAAGAGCAAAACGAGAAGTGATTTTATCCGCCGGATCTGTGAACTCTCCACAGTTATTGATGCTGAGCGGAATAGGACCAAGGGACGATCTCATAAAAATAAATATTACAACAGTGTCAGACTTACCGGTAGGATACAATTTGCAAGATCACTATGCGTTGGGTGGTCTAACTTTCATAATCAATACAACAGACTCTCTTAGATTTGAAAGAATTGCAACCTTGAATAACATCATTGAATACTTTTGTCATCACACCGGTCCTCTAACAGTTCCGACCGGTGCGGAAGCACTTGCTTTCATTGATACCAAAAATCCAAATAATAGAGATGGTTATCCTGATTTAGAACTATTATTTGTGGGCGGTTCAATTGTTTCCCAAAATGCTTACCGGTACGCATTTGACATCGATGACATTTTGTATGACACAGTTTATAGACCAATTGCCAATAGTGATACCTGGATGGTATTTCCGATGCTGTTACTCCCTAAATCGAGAGGCTACATAAAACTAAGGAGTAATAAACCACACGACAAACCAATTATCAATCCAAACTATTTTACTGACGGAGGACACGACGATCATGTTATCTTGTATGGTATTAGGAAAGTGTTACAGTTATCCCAAACAAAAGCTTTTCAAAAATATGGGAGTAAACTTCACGATATTCCTATTCCTAATTGCGCTCAACACAAATTCGATTCAGATAGTTATTGGTTATGCGCTATGAGGGCACTAACGAATACTATATACCATCCTTGCTGCACAGCAAAAATGGGACCAAGTAATGACCCTGAAGCAGTCGTCGATTCACGTTTGAAAGTCCACGGTATGGAAGGTCTAAGGGTTGTGGATGCTAGTATAATGCCAAATATTCCTGCGGCCCACACAAATGCACCCACAATGATGATCGCTGAAAAGGCCGCCGACATGATAAAAGAAGACTGGGGTATACCCATACCAATATCAAACTGA

Protein sequence:

>DPOGS207058-PA
MAIQVLLASTALKSVSVTGLWLIPLLLGAFTYHNYNSYDPESKVLEKEPKREYDFVVVGGGSAGAVVANRLTEIKDWNLLLLESGPDENEITDVPSLAAYLQLTKLDWQYKTEPTPYACLGFKNNRCSWPRGKLLGGSSVLNYMIYVRGNKYDYDQWESFGNPGWGYRDVLKYFIKSEDNRNPYLAKNQYHGQGGYLTVQEAPWKTPLVAAFVEAGVEIGYDNRDINGAIQTGFMMAQGTIRRGSRCSTAKAFLRPVRTRKNLDISLHSHVTKILINPMTMKAYGVEYVKHGIKKVVYARKEVILSAGAINSPQLLMLSGIGPKDHLQSVGIKVLKDLPVGENLMDHVGVGGLTFLVDKPVGIVQNRLQAFPVTMNYVLNERGPMTTLGGLEGIAFVNTKYANSSGLWPDIQFHMAPATFASDNGQTVKKVLGLKDEIYDTVFKPIANKDGWTIMPLLLRPNTRGYVRLKSSNPFEYPIMNPRYHEDPLDVSRLVEGIKIALKVANASPFKQFGSRLYMKPLPNCKQHKFMSDEYIECQVRSISMTIYHQCGTAKMGPSWDKGAVVDPRLRVFGIEGLRVIDASIMPTIVSGNTNAPVIMIGEKGSDMIKEDWLNSLCYFPLATFGRDTILDGIAGFLRDAAEIHNGEPAETDFILPKYDFIIVGAGTAGCILSNRLTEVDKFKVLLIEAGGAEQVFMDIPVLATMLQFTEANWKYRTEPQKAGCMGMRDKRCAWPRGKVVGGSSVLHSMMHTRGNKRDYDTWAASGNPGWDYDSVLKYFKKSENIEIPHLVNDKKYHSTQGPMTIQEPRWRTPLSDAFLDAGVEIGGNINDYNGKTQIGYSIIQFTMKNGTRMSVSRAFLHPIKKRRNFHIIKNALVTKVLIDHKKKRAYGVQFEKDGKQIVVRAKREVILSAGSVNSPQLLMLSGIGPRDDLIKINITTVSDLPVGYNLQDHYALGGLTFIINTTDSLRFERIATLNNIIEYFCHHTGPLTVPTGAEALAFIDTKNPNNRDGYPDLELLFVGGSIVSQNAYRYAFDIDDILYDTVYRPIANSDTWMVFPMLLLPKSRGYIKLRSNKPHDKPIINPNYFTDGGHDDHVILYGIRKVLQLSQTKAFQKYGSKLHDIPIPNCAQHKFDSDSYWLCAMRALTNTIYHPCCTAKMGPSNDPEAVVDSRLKVHGMEGLRVVDASIMPNIPAAHTNAPTMMIAEKAADMIKEDWGIPIPISN-