Monarch geneset OGS2.0

DPOGS207052
Transcript: DPOGS207052-TA (1518 bp)
Protein: DPOGS207052-PA (505 aa)
Genomic position: DPSCF300001 + 2144973-2148105
RNAseq coverage: 28x (Rank: top 76%)
Annotation
Heliconius       HMEL010504        0.0      92.15%
Bombyx           BGIBMGA012996-TA  1e-171   69.59%
Drosophila       CG9503-PA         0.0      71.82%
EBI UniRef50     UniRef50_B3MXK3   2e-157   61.35%  GF19432 n=3 Tax=Sophophora RepID=B3MXK3_DROAN
NCBI RefSeq      XP_002009482.1    0.0      71.82%  GI15372 [Drosophila mojavensis]
NCBI nr blastp   gi|195130080      0.0      71.82%  GI15372 [Drosophila mojavensis]
NCBI nr blastx   gi|194767934      0.0      72.06%  GF19422 [Drosophila ananassae]
Group
Gene Ontology    GO:0016614  8.1e-108  oxidoreductase activity, acting on CH-OH group of donors
                 GO:0008812  8.1e-108  choline dehydrogenase activity
                 GO:0050660  8.1e-108  flavin adenine dinucleotide binding
                 GO:0055114  8.1e-108  oxidation-reduction process
                 GO:0006066  8.1e-108  alcohol metabolic process
KEGG pathway     dpo:Dpse_GA21849  2e-158
                 K00108 (E1.1.99.1, betA, CHDH)  maps-> Glycine, serine and threonine metabolism
InterPro domain  [15-499]   IPR012132  8.1e-108  Glucose-methanol-choline oxidoreductase
                 [60-355]   IPR000172  6.5e-80   Glucose-methanol-choline oxidoreductase, N-terminal
                 [431-486]  IPR007867  2.5e-19   Glucose-methanol-choline oxidoreductase, C-terminal
Orthology group  MCL10024 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207052-TA
ATGAGCTCGATAGTGACAGCAGCTGCGGACGTGCTGTCCGGCACGGCCATCGCTGCTGGCACCCAGGTGGCATGGTTTATTCCAATGTTGGTAGCAGCGATAGCCTACTACCAGTTTGATCAAACGGATCCTGAAGGTCGTCCTGCTGATATTCCAAACTCAAGGTTGCTGCTCGAGTATGACTTCATAATTGTAGGAGCTGGATCAGCTGGAGCAGTAGTAGCAAACCGATTATCCGAAATTGGTCATTGGAAAGTGCTACTTTTGGAAGCAGGTGGCGATGAAACAGAAATATCTGATGTGCCTCTGCTAGCGGGATATTTACAACTTAGCAAACTGGACTGGAAGTACAAGACCGAGCCTCAAGGAACCAGTTGCTTGGCTATGGAGGGTGGTCGCTGCAATTGGCCGAGGGGCAAAGTTCTAGGAGGAAGCTCTGTGCTAAATTATATGCTTTATCTGAGAGGAAATAAAAAAGACTATGACACTTGGGAATCTCTAGGAAACAAAGGTTGGAGTTATAACGATGTCCTTTATTATTTTAAAAAGTCTGAAGATAACCAGAATCCTTATTTGGCCAAAACACCATATCATAGCACCGGAGGGTACCTAACGATATCGGAAGCGCCGTATCATACACCTCTCGTATCCAGTTTTATAGATGCTGGTCTGGAAATGGGTTATCTAAATAGAGACATAAACGGTGAAAACCAAACTGGTTTCATGGTAGCCCAAGGAACATTGAGAAGAGGCAGCCGGTGTTCAACCTCCAAGGCATTTTTACGGCCAGCTAAAGATCGAACAAATCTACATATATCGATAAATTCCTTCGTTACGAAAGTTATGATAGATCCCCGGACTAAAATCGCATTTGGCGTTGAATTTGTTAAAAATAAAATGGTTTATCGGATAAGAGCTCGAAAGGAAGTCATTCTTTCAGGCGGAACAATAAACTCTGCGCAGTTACTACTCTTATCAGGAATAGGTCCAGCAGATGAACTAGCTAAACATAGAATACCCTTGATACAAAACCTTCAAGTTGGAAAGAATCTTCAAGATCATATAGGTCTAGGAGGATTAGCATTTATGATAAATAAACCGATATCGATTGTTGAAAATAGACTACATACTGTCAGTACATTAATGGAATATGCTGTACTTGGAGAAGGACCACTAACTATAATGGGCGGTGTTGAAGGTCTAGCTTTTGTTAACACAAAATATGTGAACGCGTCAGATGACTTTCCTGATATCGAATTTCATTTTATATCAGGAGCTACAAATTCTGATGGAGGAGTGGGTACCGCAAAGATGGGCCCTTATTGGGATCCCGAAGCTGTAGTTGACCCCGAACTGAAAGTATACGGAGTCAAAGGTCTAAGGGTTATCGATGGAAGCATAATGCCTAATCTGGTTAGCGGCAACACTAACGCACCTATAATTATGATTGGAGAAAAAGGCAGTGATATGATCAAAAACTTCTGGCTGAAACGACGAATTTCTAGATATTATGCATGA

Protein sequence:

>DPOGS207052-PA
MSSIVTAAADVLSGTAIAAGTQVAWFIPMLVAAIAYYQFDQTDPEGRPADIPNSRLLLEYDFIIVGAGSAGAVVANRLSEIGHWKVLLLEAGGDETEISDVPLLAGYLQLSKLDWKYKTEPQGTSCLAMEGGRCNWPRGKVLGGSSVLNYMLYLRGNKKDYDTWESLGNKGWSYNDVLYYFKKSEDNQNPYLAKTPYHSTGGYLTISEAPYHTPLVSSFIDAGLEMGYLNRDINGENQTGFMVAQGTLRRGSRCSTSKAFLRPAKDRTNLHISINSFVTKVMIDPRTKIAFGVEFVKNKMVYRIRARKEVILSGGTINSAQLLLLSGIGPADELAKHRIPLIQNLQVGKNLQDHIGLGGLAFMINKPISIVENRLHTVSTLMEYAVLGEGPLTIMGGVEGLAFVNTKYVNASDDFPDIEFHFISGATNSDGGVGTAKMGPYWDPEAVVDPELKVYGVKGLRVIDGSIMPNLVSGNTNAPIIMIGEKGSDMIKNFWLKRRISRYYA-
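
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard nuclear genetic code and assuming that the trailing "-" in the protein record marks the stop codon; the function name translate and its stop_symbol parameter are illustrative and not part of the database. Under those assumptions, 1518 bp corresponds to 505 codons plus one stop codon.

```python
# Standard genetic code in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    stop_symbol is written in place of a stop codon; "-" is used here
    only to mirror the protein record above (an assumption).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six codons and last four codons of DPOGS207052-TA.
start = translate("ATGAGCTCGATAGTGACA")
end = translate("TATTATGCATGA")

# Length consistency: 505 residues plus one stop codon, three bases each.
expected_bp = 3 * (505 + 1)
```

Passing the full DPOGS207052-TA sequence to translate gives a string that can be compared directly with the DPOGS207052-PA record.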