Monarch geneset OGS2.0

DPOGS209371
TranscriptDPOGS209371-TA1860 bp
ProteinDPOGS209371-PA619 aa
Genomic positionDPSCF300118 - 151251-153353
RNAseq coverage64x (Rank: top 68%)
Annotation
HeliconiusHMEL0131161e-11138.53% 
BombyxBGIBMGA005703-TA0.066.67% 
DrosophilaCG9509-PA8e-10234.24% 
EBI UniRef50UniRef50_Q7QFX91e-11939.54%AGAP003785-PA n=2 Tax=Culicidae RepID=Q7QFX9_ANOGA
NCBI RefSeqXP_310335.32e-12039.54%AGAP003785-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582884685e-11939.54%AGAP003785-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582884681e-11639.44%AGAP003785-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00166146e-156oxidoreductase activity, acting on CH-OH group of donors
GO:00088126e-156choline dehydrogenase activity
GO:00506606e-156flavin adenine dinucleotide binding
GO:00551146e-156oxidation-reduction process
GO:00060666e-156alcohol metabolic process
KEGG pathwaydme:Dmel_CG95096e-100 
 K00108 (E1.1.99.1, betA, CHDH)maps-> Glycine, serine and threonine metabolism
InterPro domain[8-614] IPR0121326e-156Glucose-methanol-choline oxidoreductase
[50-348] IPR0001721.5e-64Glucose-methanol-choline oxidoreductase, N-terminal
[458-602] IPR0078671.4e-41Glucose-methanol-choline oxidoreductase, C-terminal
Orthology groupMCL25159 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209371-TA
ATGACGAGTCTAAGTCCATGTGTGCCTGCCACGTCACCGGCGGGAGCTGCTTTCACTGCTTTAATATCTTATATATCGACCCTCCAGTGTCTCATCACGGAACCCTGGCCGGAAGACCATAGCCATCGCGTTAAAGACGGTGATCAATTCGATTTCATTATTATCGGTTCCGGGACAGCTGGATCAATCTTAGCGAATCGTTTGACACAAGCTGATGATTGGAAGGTTTTACTCCTTGAGGCCGGCGACAATCCGCCTTTGGAGAGTATTATCCCGAATTTCTCCGGAGCGACACATAGGAGTGACCAGGTGTGGCAATATTATACGGAGAGAGATGAGATGTCGAATAGGGCCTGCGTTGATGGACGGTCTTTCTGGCCTCGAGGCAGGATGCTGGGTGGCACGGGATCAATCAATGGAATGCTGCACATGACGGGCAGTCCCGGGGACTATCAATCTTGGAACGTCGATGACGGTTGGGACTATCTTACCATAAAGAAATATTTTAGGAAAAGTGAAAAAATTATCGATCCCTATATTCTTAATAATCCAGAACTTTTAAATAATCACGGCACGAATGGGGAGTTTGTAGTTGATCAATTGAATTTCACACATACGGATATAGCTGATAAACTGACGGAGGCCTACTTGGAAATTGGTCTCGATTACTTGGATGACCTGAATGGACCAACTCAAATGGGTGTTGGTAAGATAAGGGGCGGTCATCACAAAGGGAAACGAGTGAGCACTGCAACTGCTTTTTTAAACGTAATCAAAGAACGTAAAAATTTATACATTCTCAAAAATACATTTGCTACAAAAATTATTTTTCAAGACTCTAAAGCAATTGGCGTAAAGGTTTCTTTGCCAGACAAGAAAACAGCGCAGTATTATACAACAAAAGAGATAATTGTGAGTGCTGGAACAATAAACACTCCAGTTTTACTCATGTCCTCTGGTATAGGACCAAAAGAACATTTGGAGAGTTTGGACATCAAAGTCGTTTCTGACTTACCAGTCGGCAAAAATCTGCAGGATCATGTTAGAATTCCAATACCGGTGAGGATTAATACAGGAGCGAAGGCAAAATCTCAAGATTATTGGCAAAAAGCCACACTGCAATACTTACTAGAGCAGTCAGGTCCACACTCAACTAACTATGATCAACCTAATATTAATGCTTTTCTATCAGTCACAGATCATAAGCAACTCCCGGATATACAAATCGATCATAATTATTTTGTTCCAAATACTTCCTACATATATTCTATGTGTAAAAATGTCATGAACTACAAGGATGAGATTTGCGAACAATTTGCTAAAATGAACGTTGAGAGTGAAATGATAATATTTTTTGTATCTCTATGCCGACCATTTTCAAAGGGTGAGATTTTATTGCGTTCAACTAATCCCTTCGATCATCCACGTATATATCCAAAATATTTCAGTGATCGACGAGACATGGATACATTCATAAAGGGTTTAAAAAAAGTTACGGAAATTGTGAACACAGAAGCATTAAGAAATGTAGACGCGAAGGTTGAAAGAATCTATTTTAAGGACTGTGATGATTTTAAATTTAAATCTGATGATTATTGGGAGTGTATGGCCAGGGCTTTGACGTACAATGTATATCATCCTGTGGGCACCTCGAAGATGGGCAAGCCTGGAGACGCTAGCAGTGTAGTGGATAGTAGGTTGAGGGTGTTAGGAGTGAAAAACTTGAGAGTCGTCGACGCTAGTATAATGCCAACTATAACAAGCGTTAATACTAACGCTCCGACCATGATGATCGCAGAAAGAGCTTCTGCGTTCATAAAACTGCAATATAAAAGCAAATACGCGAATGACGAGTTATAA

Protein sequence:

>DPOGS209371-PA
MTSLSPCVPATSPAGAAFTALISYISTLQCLITEPWPEDHSHRVKDGDQFDFIIIGSGTAGSILANRLTQADDWKVLLLEAGDNPPLESIIPNFSGATHRSDQVWQYYTERDEMSNRACVDGRSFWPRGRMLGGTGSINGMLHMTGSPGDYQSWNVDDGWDYLTIKKYFRKSEKIIDPYILNNPELLNNHGTNGEFVVDQLNFTHTDIADKLTEAYLEIGLDYLDDLNGPTQMGVGKIRGGHHKGKRVSTATAFLNVIKERKNLYILKNTFATKIIFQDSKAIGVKVSLPDKKTAQYYTTKEIIVSAGTINTPVLLMSSGIGPKEHLESLDIKVVSDLPVGKNLQDHVRIPIPVRINTGAKAKSQDYWQKATLQYLLEQSGPHSTNYDQPNINAFLSVTDHKQLPDIQIDHNYFVPNTSYIYSMCKNVMNYKDEICEQFAKMNVESEMIIFFVSLCRPFSKGEILLRSTNPFDHPRIYPKYFSDRRDMDTFIKGLKKVTEIVNTEALRNVDAKVERIYFKDCDDFKFKSDDYWECMARALTYNVYHPVGTSKMGKPGDASSVVDSRLRVLGVKNLRVVDASIMPTITSVNTNAPTMMIAERASAFIKLQYKSKYANDEL-