Monarch geneset OGS2.0

DPOGS204388
Transcript: DPOGS204388-TA, 1557 bp
Protein: DPOGS204388-PA, 518 aa
Genomic position: DPSCF300002 - 1528403-1535438
RNAseq coverage: 119x (Rank: top 58%)
Annotation
  Heliconius      HMEL007815        0.0     64.65%
  Bombyx          BGIBMGA013003-TA  2e-135  46.97%
  Drosophila      CG6142-PA         4e-159  49.42%
  EBI UniRef50    UniRef50_Q9VBG8   6e-157  49.42%  CG6142 n=22 Tax=Neoptera RepID=Q9VBG8_DROME
  NCBI RefSeq     XP_002016883.1    3e-161  51.76%  GL21830 [Drosophila persimilis]
  NCBI nr blastp  gi|195151913      5e-160  51.76%  GL21830 [Drosophila persimilis]
  NCBI nr blastx  gi|195151913      2e-157  51.76%  GL21830 [Drosophila persimilis]
Group
Gene Ontology
  GO:0016614  3e-102  oxidoreductase activity, acting on CH-OH group of donors
  GO:0008812  3e-102  choline dehydrogenase activity
  GO:0050660  3e-102  flavin adenine dinucleotide binding
  GO:0055114  3e-102  oxidation-reduction process
  GO:0006066  3e-102  alcohol metabolic process
KEGG pathway
  dme:Dmel_CG9518  6e-131
  K00108 (E1.1.99.1, betA, CHDH) maps-> Glycine, serine and threonine metabolism
InterPro domain
  [1-515]    IPR012132  3e-102   Glucose-methanol-choline oxidoreductase
  [5-243]    IPR000172  2.5e-62  Glucose-methanol-choline oxidoreductase, N-terminal
  [359-503]  IPR007867  1.6e-38  Glucose-methanol-choline oxidoreductase, C-terminal
Orthology group: MCL26249, Insect specific

Nucleotide sequence:

>DPOGS204388-TA
ATGAATTGGGGTTACGTATCTGAACCACAACAAAAGGCCTGTCGTAATTTGAGAGATCATGTTTGTTATATGCCTCGAGGCAAAGTTCTCGGGGGCAGCAGCGTACTTAACTTTTTAATATACCAAAGAGGTCATCCCGAAGATTATAACGATTGGGTCAGGATGGGTAACGAGGGTTGGAGCTACAATGAAGTCTTGCCATACTTTAAGAAATCCGAAAATATACATATAAAAGAACTTCTAAACTCCACTTATCATGGCAAAGGAGGTTATTTAGACATTGATTATTCTTCATTTTCAACGCCACTCAATGATGCATTCAAGAACGCTGGTCACGAACTCGGGTACGAATGGAATGACCCCAATGGAGAAAATGTAATCGGTTTTTCAAAACCTCAAGCGACAATAAGAAAAGGAAGACGGTGTAGCTCATCAAAAGCATTTTTAGAACCCGTAAGGTATAGAAGAAACCTAAAAGTATCTAAATTTTCCACAGCAACGAAAATATTAATCGACCCTCTTACGAAAAGAGCTAATGGAGTGGAATTTATAAAAAATAATAAAATAAAACGTATATACGCCCGTCGTGAGGTTGTACTTGCTGGTGGCACAATAGGGTCTGCCCAATTATTAATGCTATCGGGAGTGGGCCCTAAAGAACACTTAAGCGAACTTGGAATACAAACTATAGTCGACCTGCCTGTAGGCTACAACCTTCAAGACCATGTAACCTTTTCGGGTAATGCGTTTATTGTCAATACCACTGGACTCTGTGTAAACGATATGATAGCTGCATCTCCAGCATCAGCTGTGTCATATATGTTGGGAGGCGGTCCATTAACCATACCGGGGGGTGCCACCGGACTTGCCTTCATACAAACGGACTACGCAAAAGACATGAATGGAAGGCCCGATATAGAAATGGTGATGGGCGCTGGATCCCTGGCAGGAGATCTGCTCGGAATTATACGGTCTATGCTTGGTGTAACTGATGAATGGTACCGAGAAGTTTACGGTTCTCTCCCACTCAATGAGAGACAGCAGTCGTTCGCTTTGAACCCGGTTTTAATTCGACCTAGAAGCGTCGGCCGTATGAAACTTAGTTCATCAAACTTCACAGATCAACCAAGAATACAACCGAACTATTTTGAACATCCCGACGATTTACAAGCCATTAAGGAGGGAGTGAGATTTGCACAAAAAATTATACAAACAAAAGCGTTCCAACGATACGGGACGCGACTCCACAATACACCATTCCCAAATTGCCGACACTTGACCTTCGACTCGGACGAGTATTGGGAGTGCGCCATCGAACAGACCTCCATCACGCTAGATCACCTGGCCGGGACCTGCAAAATGGGGTCACAAGGAGACCCATCAGCGGTGGTGTCTCCGCGTTTACTGGTTCATGGAATTCATGGTCTGAGGATAGCTGACGCCTCCATAATGCCTCGCATACCAGCGTCCCATACACATGCACCCGTCGTCATGATAGCCGAAAAAGCTGCCGATATCATTAAGCAGGATTGGAAGCAACCAATTCAACAATTATGA

Protein sequence:

>DPOGS204388-PA
MNWGYVSEPQQKACRNLRDHVCYMPRGKVLGGSSVLNFLIYQRGHPEDYNDWVRMGNEGWSYNEVLPYFKKSENIHIKELLNSTYHGKGGYLDIDYSSFSTPLNDAFKNAGHELGYEWNDPNGENVIGFSKPQATIRKGRRCSSSKAFLEPVRYRRNLKVSKFSTATKILIDPLTKRANGVEFIKNNKIKRIYARREVVLAGGTIGSAQLLMLSGVGPKEHLSELGIQTIVDLPVGYNLQDHVTFSGNAFIVNTTGLCVNDMIAASPASAVSYMLGGGPLTIPGGATGLAFIQTDYAKDMNGRPDIEMVMGAGSLAGDLLGIIRSMLGVTDEWYREVYGSLPLNERQQSFALNPVLIRPRSVGRMKLSSSNFTDQPRIQPNYFEHPDDLQAIKEGVRFAQKIIQTKAFQRYGTRLHNTPFPNCRHLTFDSDEYWECAIEQTSITLDHLAGTCKMGSQGDPSAVVSPRLLVHGIHGLRIADASIMPRIPASHTHAPVVMIAEKAADIIKQDWKQPIQQL-
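
The transcript and protein lengths in this record can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the transcript is a complete coding sequence with no untranslated regions and a single terminal stop codon (the nucleotide sequence above starts with ATG and ends with TGA, and the trailing "-" in the protein sequence presumably marks that stop). The function names `expected_protein_length` and `looks_like_complete_cds` are hypothetical helpers introduced for illustration only.

```python
# Standard stop codons in the universal genetic code.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def expected_protein_length(cds_length_bp: int) -> int:
    """Return the number of residues encoded by a complete CDS.

    Assumes the given length includes the terminal stop codon,
    which does not contribute an amino acid.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def looks_like_complete_cds(seq: str) -> bool:
    """Check that a sequence is in frame, starts with ATG and ends with a stop codon."""
    seq = seq.upper()
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# 1557 bp is 519 codons; dropping the stop codon leaves 518 residues,
# which matches the protein length listed in the record.
print(expected_protein_length(1557))  # 518
```

The same check could be applied to the full sequences by passing the nucleotide string under `>DPOGS204388-TA` to `looks_like_complete_cds`.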