Monarch geneset OGS2.0

DPOGS215413
TranscriptDPOGS215413-TA1830 bp
ProteinDPOGS215413-PA609 aa
Genomic positionDPSCF300088 + 638865-642675
RNAseq coverage175x (Rank: top 50%)
Annotation
HeliconiusHMEL0174337e-15163.73% 
BombyxBGIBMGA012374-TA0.065.45% 
DrosophilaninaG-PA8e-9135.49% 
EBI UniRef50UniRef50_Q380J01e-12340.61%AGAP001546-PA n=1 Tax=Anopheles gambiae RepID=Q380J0_ANOGA
NCBI RefSeqXP_001604519.12e-12440.26%PREDICTED: similar to ENSANGP00000029571 [Nasonia vitripennis]
NCBI nr blastpgi|3071686531e-12340.94%Neither inactivation nor afterpotential protein G [Camponotus floridanus]
NCBI nr blastxgi|3071686532e-12041.08%Neither inactivation nor afterpotential protein G [Camponotus floridanus]
Group
Gene OntologyGO:00166146.7e-83oxidoreductase activity, acting on CH-OH group of donors
GO:00088126.7e-83choline dehydrogenase activity
GO:00506606.7e-83flavin adenine dinucleotide binding
GO:00551146.7e-83oxidation-reduction process
GO:00060666.7e-83alcohol metabolic process
KEGG pathwaydme:Dmel_CG95197e-63 
 K00108 (E1.1.99.1, betA, CHDH)maps-> Glycine, serine and threonine metabolism
InterPro domain[1-610] IPR0121326.7e-83Glucose-methanol-choline oxidoreductase
[38-329] IPR0001723.5e-44Glucose-methanol-choline oxidoreductase, N-terminal
[429-600] IPR0078673.2e-28Glucose-methanol-choline oxidoreductase, C-terminal
Orthology groupMCL17034 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215413-TA
ATGACACTGTATACATTATTTGTTAGTGTTTTAGTTGTATGCGTGTCTTATCTCGTGTATCAAAAATACTCCAACGTAAATGTCAGCTTTGTACAGAAACCCGAAAAGCGATATGATTACATCATTGTCGGCGCCGGCACCGCTGGTTGCGTTCTGGCTTCCCGACTATCTGAAGATCCAAACGTCAAAGTGTTACTTATTGAAGCTGGAGACCACATGGGTTATTTCACTAAAATACCGCTGACCTCAACTGCCGCACAGCTAGGACCTAATGATTGGTCTGTACGAACTACGCCACAGAAATATTCCTCCTTTGGGCTCATTGATCGGACCCAAATAATCCCACGCGGGAGAGGTCCAGGAGGATCTGGACAGATAAATTTCCTGCTTCACGGATTTGGTTTACCCGAGGATTACAATCGTTGGTCCCGCTTGGGCTTTAAGGGGTGGACGTTGGAAAACTTAAAACCATATTTCATTAAAGCTTTCGGGACCTACAAAAGCGAATTCGACTCAGATATGTGTCCACCCAAAGGACAATGTTCGGAGGCTCCGATGAAGCTGAAATTGATCGATGACAGTAACGAACTGATGACCATCTTCAAACAAGCCTCCTTGAGTCTCTCTGACAAAACAACACTGTTTAGAAGAGCCACGGCGTCCATTAAGGGAGGATCAAGATATCAGACATACAGCGCCTATTTAAAACCAGCACTCAAAAGACCAAATCTACATGTCCTACTTAAAACACAGGCAATATCAATACGTTTTGAGGATCAGAAAGTTTCGTCTCTTTACATTTTGGAAGACCATCGAAATCTGGACAATATATTTGTAAACAAAGAAATAATACTTAGCGCCGGTTCCATTAAAACACCACAAATCTTAATGCTGTCTGGTATAGGACCAAGGAATTTAATGAGAAGGCTAAGGCTAGAATTGGTATCAGACAATGAGTTCGTGGGTCGCAACCTGCACGATCATCTTAATGTGCCTATCTATGTCAGCATAAAAAAACCTATCAGTATTACTTTAGCCAAGGTATTTAGTGCCAGTACCCTTTGGAATTACTTTTGGAACGGAGACGGTTATTTAGCTTTTCCTCCTGTAGCTGGTGTGGAATATCGTAACTCTTCAGCTCTGATGCTGTTCTCAATGGGATCTTCCAGTGAGAGATTGTTACGAGACTTGTCCAACTATAAACCTCAGGTTTTCCGTGAAACATTTCCGTTCCACAACGACACCAGTAAAGAAGGCTTCATGTGGCTGGCGTCCTGTGTGGCACCTCGGAGTCGTGGTCAAGTTACAGTCACGGACCCGAGCACTAGTGTTCCGCCGGAAGTTGACCCCAACTACCTTCATCGAGAATATGATGTACGATGCATTATTAGAGCGATCAGACGAGCGGAAAGATTGGTATCGACAAAATCATTCCGAAGTATCGGAGCGAAAATCCACTGGCCGAGATCAGAGCGCTGCTTACCGTTGTGGACCTACACGAAGTTGGAGCAACTCGGACTGAGGCGAAGAAAGAAGATAAAAGTACCAGGAGAGAAGCCACGCAAGGAGATACCAAAGCAGAAGCCTCGGAGTCCCCCTGACGCATACCTGGAGTGTATCATCCGGGAGGTAGGAGTGACGGGACACCACGCGGGGGGAACGTGCGCCGGGGGAAGGGTTGTCGACGATCTACTCCGTGTCAAAGACGTCGAAGGTCTCCGTATAATGGACGCGAGTGTTTTTCCCGCGCCAACATCGCTGTATCCAAACTCAGTAATAGTCGCGATGGCGGAAAAAGCAGCAGACTTAATAAGGAATACTGTAACGTAG

Protein sequence:

>DPOGS215413-PA
MTLYTLFVSVLVVCVSYLVYQKYSNVNVSFVQKPEKRYDYIIVGAGTAGCVLASRLSEDPNVKVLLIEAGDHMGYFTKIPLTSTAAQLGPNDWSVRTTPQKYSSFGLIDRTQIIPRGRGPGGSGQINFLLHGFGLPEDYNRWSRLGFKGWTLENLKPYFIKAFGTYKSEFDSDMCPPKGQCSEAPMKLKLIDDSNELMTIFKQASLSLSDKTTLFRRATASIKGGSRYQTYSAYLKPALKRPNLHVLLKTQAISIRFEDQKVSSLYILEDHRNLDNIFVNKEIILSAGSIKTPQILMLSGIGPRNLMRRLRLELVSDNEFVGRNLHDHLNVPIYVSIKKPISITLAKVFSASTLWNYFWNGDGYLAFPPVAGVEYRNSSALMLFSMGSSSERLLRDLSNYKPQVFRETFPFHNDTSKEGFMWLASCVAPRSRGQVTVTDPSTSVPPEVDPNYLHREYDVRCIIRAIRRAERLVSTKSFRSIGAKIHWPRSERCLPLWTYTKLEQLGLRRRKKIKVPGEKPRKEIPKQKPRSPPDAYLECIIREVGVTGHHAGGTCAGGRVVDDLLRVKDVEGLRIMDASVFPAPTSLYPNSVIVAMAEKAADLIRNTVT-