Monarch geneset OGS2.0

DPOGS205789
TranscriptDPOGS205789-TA2142 bp
ProteinDPOGS205789-PA713 aa
Genomic positionDPSCF300144 - 165532-172391
RNAseq coverage66x (Rank: top 67%)
Annotation
HeliconiusHMEL0104600.074.75% 
BombyxBGIBMGA013006-TA0.058.61% 
DrosophilaCG9503-PA7e-13140.96% 
EBI UniRef50UniRef50_D2A3N81e-13745.11%Putative uncharacterized protein GLEAN_15725 n=4 Tax=Endopterygota RepID=D2A3N8_TRICA
NCBI RefSeqXP_394224.11e-14244.70%PREDICTED: similar to CG9503-PA [Apis mellifera]
NCBI nr blastpgi|3227964063e-14545.60%hypothetical protein SINV_06973 [Solenopsis invicta]
NCBI nr blastxgi|3227964062e-14145.60%hypothetical protein SINV_06973 [Solenopsis invicta]
Group
Gene OntologyGO:00166141.2e-130oxidoreductase activity, acting on CH-OH group of donors
GO:00088121.2e-130choline dehydrogenase activity
GO:00506601.2e-130flavin adenine dinucleotide binding
GO:00551141.2e-130oxidation-reduction process
GO:00060661.2e-130alcohol metabolic process
KEGG pathwaydme:Dmel_CG95184e-128 
 K00108 (E1.1.99.1, betA, CHDH)maps-> Glycine, serine and threonine metabolism
InterPro domain[102-713] IPR0121321.2e-130Glucose-methanol-choline oxidoreductase
[147-443] IPR0001721.8e-67Glucose-methanol-choline oxidoreductase, N-terminal
[557-701] IPR0078673.2e-35Glucose-methanol-choline oxidoreductase, C-terminal
Orthology groupMCL10024 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205789-TA
ATGCGACGTCTAATTTATACAGCCTTGCTGCTAAACACATTATTATATACAAAAGCAATAGACATCTACGAAGAAGATGATTCGGTCGACGTAGAAAATGGACCTTATGATATACCTAATAACATAGAAAATAATGCAAACGTTCGAAGTTTTAGAAACTCTCGTATACTCTGGCCGTATCCTTTAAACAACAAAAGTGATCCCATCAAGGGTAATGAGGAGCCTAGTTCAGATGAAAATGTACTGCATCTGTCCAAACTTAAGAATGGAAATAGGAAAGCAAGGTTCGGATATTGGTCCTACCCTCAGAGTCCTATAGTGGATATTATGATGCAAACCGTAGCCTCGAATTACAATCCCACGAATGCAAACGATCCATTTGATTTTTTGAGAGATTCATATCCATTGCCCAAAGGTTACAACGAGCCCTTAAATGAATATGACTATGTAATAGTCGGCGCTGGTTCTTCTGGCTCAGTTCTAGCAGCTCGTCTTACTGAAGATAAACCTCGGGCCTCCGTCCTGCTAATAGAGGCTGGCAAGCCGGAGATGTTACTCTCAGATATTCCCGCTTTGACCCAGTACCTGCAACAGACAGATTATGTCTGGCCTTATACAATGGAGCACCAACCTGGTGTTTGTATGGGCAGTGAAGAGCAGAGATGCTATGCACCTAGAGGCAAGGCAATTGGTGGGACGAGTGTCACAAATAGTATGTTCTACACGCGAGGCAGACCACAGGATTGGGATAGGATTGCAGCTGACGGAAACTTTGGATGGTCATACGAAGAAGTATTGAAATATTACATGAAATCTGAAAGATCCGAGCTCAAAAAATACAGAGACCAACCCTATCGCGGCCGTGATGGAGAACTTACAGTAGAGAACGTCCCATTCAAAACTGGTTTAGTTGAAGCGTTTCTTGCAGCTGGGAGGATGTTGGGCCATCCAACAATCGATTACAATGCTCCAGATCAATTAGGTTTTGGATACGTTCAAACGATAACTAACAGAGGTCATCGTCTAAGTGCTGCAAAGGCGTTCCTGCATAGACACAAAGGGCGCAAAAATTTACATATATTGAGTGAAGCTAAAGCTACAAAAGTCATAATTGATCCGCAAACAAAGAAAGTTTCAGGAGTTGAATATATAAAGAATAACATTAAACACAGAGTAAACTGTAGACGAGAGGTTATTCTGTCAGCTGGACCTATTGGTTCACCCCAATTACTTATGCTCTCGGGAATAGGTCCCAAAGAACACTTACAGACTCTAGGGATACCTGTTGTTATGGACCTAAAAGTTGGAAGAACTCTTTACGATCACATCGGTTTTCCTGGTGTAATATTTAAGCTGAAAAGTACTAACGCTAGTTTATTGGAACCCAAGGTTGCCACATTACCAAATCTAATGCAGTGGCTTCAGTTTGGCGATGGATTACTCGCTTCACCTGGTGGAGTTGAGGCGATTGGATATCTAAAAACAGCATTATCAGAAGATCCTGAGTTGGTTCCCGATATTGAACTTTTAAGTATGGGTGGTTCAATCACTCAAGATTCAGGAGGTGCGATAAGAAGGAGTATGAGGATATCTGAAAATACATATGCTCGAGCATTTCACACATTAAATGGTATGGATACTTGGCAGGCTATACCAACACTTCTTTATCCCCGATCCAAAGGATATATGGAACTGCGAGATACCAGTCCATTTTCACATCCAAAACTATACGGAAATTATTTAACTGATCCAAAAGATTTAGCGACGTTAAAAGAAGCAGTAAAGCATATAATACAATTGGGAGAATCTCAACCATTCAAAAAATACGACGCAACTTTACATTTACCGCAATATCCCACTTGCTCAACATATCCCTTAGGTTCTGATGCTTACTGGGAATGTGCTATTAGAACCTTGATCGTATCTTTCCACGAGCCAATCGGTACGTGCAAAATGGGACCATCAAATGATTTTGAAGCCGTTGTAGATAATAACCTAAGGGTGTACGGTATTGAAGGTTTAAGGGTTGCCGATGCTAGCGTTATTCCTCGACCCATTGGTGCCAGAACAAACGTTCCTGAAATTATGATTGGAGAAAAGGCAGCTGATTTGATAAGGAATACGTGGTCAAATAACGTCTAA

Protein sequence:

>DPOGS205789-PA
MRRLIYTALLLNTLLYTKAIDIYEEDDSVDVENGPYDIPNNIENNANVRSFRNSRILWPYPLNNKSDPIKGNEEPSSDENVLHLSKLKNGNRKARFGYWSYPQSPIVDIMMQTVASNYNPTNANDPFDFLRDSYPLPKGYNEPLNEYDYVIVGAGSSGSVLAARLTEDKPRASVLLIEAGKPEMLLSDIPALTQYLQQTDYVWPYTMEHQPGVCMGSEEQRCYAPRGKAIGGTSVTNSMFYTRGRPQDWDRIAADGNFGWSYEEVLKYYMKSERSELKKYRDQPYRGRDGELTVENVPFKTGLVEAFLAAGRMLGHPTIDYNAPDQLGFGYVQTITNRGHRLSAAKAFLHRHKGRKNLHILSEAKATKVIIDPQTKKVSGVEYIKNNIKHRVNCRREVILSAGPIGSPQLLMLSGIGPKEHLQTLGIPVVMDLKVGRTLYDHIGFPGVIFKLKSTNASLLEPKVATLPNLMQWLQFGDGLLASPGGVEAIGYLKTALSEDPELVPDIELLSMGGSITQDSGGAIRRSMRISENTYARAFHTLNGMDTWQAIPTLLYPRSKGYMELRDTSPFSHPKLYGNYLTDPKDLATLKEAVKHIIQLGESQPFKKYDATLHLPQYPTCSTYPLGSDAYWECAIRTLIVSFHEPIGTCKMGPSNDFEAVVDNNLRVYGIEGLRVADASVIPRPIGARTNVPEIMIGEKAADLIRNTWSNNV-