Monarch geneset OGS2.0

DPOGS202782
Transcript: DPOGS202782-TA (1614 bp)
Protein: DPOGS202782-PA (537 aa)
Genomic position: DPSCF300018, 910980-913314
RNAseq coverage: 193x (Rank: top 48%)
Annotation
Heliconius      HMEL006290        1e-139  47.66%
Bombyx          BGIBMGA010461-TA  2e-175  52.87%
Drosophila      CG9517-PA         2e-82   34.58%
EBI UniRef50    UniRef50_D0ABA6   3e-106  40.07%  Putative ecdysone oxidase n=2 Tax=Nymphalidae RepID=D0ABA6_9NEOP
NCBI RefSeq     XP_310335.3       1e-95   38.99%  AGAP003785-PA [Anopheles gambiae str. PEST]
NCBI nr blastp  gi|261335921      1e-105  40.07%  putative ecdysone oxidase [Heliconius melpomene]
NCBI nr blastx  gi|261335921      6e-103  39.92%  putative ecdysone oxidase [Heliconius melpomene]
Group
Gene Ontology
GO:0016614  4.7e-119  oxidoreductase activity, acting on CH-OH group of donors
GO:0008812  4.7e-119  choline dehydrogenase activity
GO:0050660  4.7e-119  flavin adenine dinucleotide binding
GO:0055114  4.7e-119  oxidation-reduction process
GO:0006066  4.7e-119  alcohol metabolic process
KEGG pathway
ava:Ava_C0102  1e-79
K00108 (E1.1.99.1, betA, CHDH) maps-> Glycine, serine and threonine metabolism
InterPro domain
[2-534]    IPR012132  4.7e-119  Glucose-methanol-choline oxidoreductase
[2-284]    IPR000172  1.4e-69   Glucose-methanol-choline oxidoreductase, N-terminal
[382-522]  IPR007867  1.3e-38   Glucose-methanol-choline oxidoreductase, C-terminal
Orthology group: MCL25741 Lepidoptera specific
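
The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that the transcript consists of coding sequence from the start codon through the stop codon and that the genomic coordinates are 1-based and inclusive; the record states neither. The helper name `codon_count` is hypothetical.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1614
PROTEIN_AA = 537
GENOMIC_START, GENOMIC_END = 910980, 913314


def codon_count(transcript_bp: int) -> int:
    """Return the number of whole codons, or raise if the length is not a multiple of 3."""
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    return transcript_bp // 3


codons = codon_count(TRANSCRIPT_BP)

# Assumption: the final codon is a stop codon and encodes no residue.
implied_protein_aa = codons - 1

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
genomic_span_bp = GENOMIC_END - GENOMIC_START + 1

# Genomic bases not represented in the transcript (for example introns;
# the record does not say what this sequence is).
non_transcript_bp = genomic_span_bp - TRANSCRIPT_BP

print(codons, implied_protein_aa, genomic_span_bp, non_transcript_bp)
```

Under these assumptions the implied protein length agrees with the 537 aa listed for DPOGS202782-PA, and the genomic span is longer than the transcript.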
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202782-TA
ATGGCCCACAGGCTAACTGAAGTGAAGAACTGGTCCGTGTTGCTGGTAGAAGCCGGGAACGATCCGCCCTACGTCTCGGAAGTTCCCGGTCTCGGAATACTACTAGGAGCTTCGTTCCCAGATTGGAATTACTACACAAACGATGACACTGATGATGACACGAGACTACGGAGTGTTCACATGATACAAGGAAAACTCGTAGGCGGGTCCAGCAGCGTGAATTATATGTATTACGTCAGAGGCAATCCAGCCGACTACGACGACTGGGCGGCTCAGGGTAATGAAGGCTGGGCATGGAGTGATGTCTTAAAGTATTTTAAAAAGAGTGAGAGACTAAATGACGATGAGATACTAAGCAGCAACTCCAACGACCTGCACGGTGTCGACGGAAACATAGGAGTGACACGGTCGGTATGGGACAAACAAACCAAGAGGTATTTCGAAGCTTTCAGAGAAAATGGTCACGAGATTTTGTCGGACACAAATGGCCACCAACAACTCGGATATTCGGTACCCAGTTTTACTATGGACAAATCGCGACGTCAGAGTGCCGCTGTTGCTTATTTAAGGCCAATCCTGAACCGTCCTAATATAAAAATACTCAAAGAAACTCTGGCGCGAAAATTAACTTTTGACGAGGATAGAAGAGTCACTGGAGTAGAAATCAGGGATTCAGAAGGCTTAATTAAGACGGTGATAGCAAAGAAGGAAGTTATTCTATCAGCCGGAGCTGTTAAAAGTCCACAATTATTAATGATGTCTGGTATAGGACCACAAGCATATTTAGAGGAAATGGGAATCAATGTGGTTGTAAACAATCCTCACGTCGGGTCCAATCTACAAGATCACATGCTAGTACCAGTGGTGATATCTCTTGATAATGAAGAATCATCCATAACAGAAAATTTCAGCTTTATAAGTAAACTAGGAACATTTCCCGCCCCAAATATTATGGGCCACGTTGCTCTAGACAAGAACCAAACTTTTCCAGACTACCAAGTCACCTCGATGCCACTCCCAGTTGGAACCATGCTGCCATCTCTGGTCTGTAATAGTATATTTCAATGGAATAAAGAAGTGTGCACGGCCCTAGCTGCCGCGGCGAGCCGAGACATGTTATTTGCATTAATTTCCTATCTTCACCCCGAATCTAGAGGATATATTAAGTTGAAGAGCAACGACCCCGATCAACCGCCATTAATTTATCCTAAATACTTGTCAAAGAGAAACGACTTAAAGAAATTCTCTCGAAGCTTACAACACTTTACGAGCTTGATTAATACCACGTCTTGCAAAAAGCTTAATTCTGATATCGTCGACTTAAATGTTGGTAAATGTAAAGACAAACCTTTCGGCAGTCTGGAATACTGGGAATGCTATATTTATAATCTAGTGACAACTCAGTACCACCCCGTCGGAACGTGTAGGATGGGACCAGATGGAGTCGTGGATGAGAGACTTCGAGTAAGAGGAGTCGAGGGTTTGAGGGTCGTGGACGCGAGCATCATGCCTTCCATAACGAGCGGCAACACGTACGCCCCGACCGTCATGATCGCTGAAAAGGCAGCTGACATGTTGAAAGTTGACAACGGTATTTGTGAAGACGTTCTGTAG

Protein sequence:

>DPOGS202782-PA
MAHRLTEVKNWSVLLVEAGNDPPYVSEVPGLGILLGASFPDWNYYTNDDTDDDTRLRSVHMIQGKLVGGSSSVNYMYYVRGNPADYDDWAAQGNEGWAWSDVLKYFKKSERLNDDEILSSNSNDLHGVDGNIGVTRSVWDKQTKRYFEAFRENGHEILSDTNGHQQLGYSVPSFTMDKSRRQSAAVAYLRPILNRPNIKILKETLARKLTFDEDRRVTGVEIRDSEGLIKTVIAKKEVILSAGAVKSPQLLMMSGIGPQAYLEEMGINVVVNNPHVGSNLQDHMLVPVVISLDNEESSITENFSFISKLGTFPAPNIMGHVALDKNQTFPDYQVTSMPLPVGTMLPSLVCNSIFQWNKEVCTALAAAASRDMLFALISYLHPESRGYIKLKSNDPDQPPLIYPKYLSKRNDLKKFSRSLQHFTSLINTTSCKKLNSDIVDLNVGKCKDKPFGSLEYWECYIYNLVTTQYHPVGTCRMGPDGVVDERLRVRGVEGLRVVDASIMPSITSGNTYAPTVMIAEKAADMLKVDNGICEDVL-
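
The relationship between the two sequences above can be spot-checked by translating the transcript. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies and that the transcript is read in frame from its first base. The function name `translate` is hypothetical, and the stop codon is written as `-` to follow the convention used in the protein record. Only the first 30 and last 15 bases of DPOGS202782-TA are used here, copied from the nucleotide sequence above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Excerpts copied from the DPOGS202782-TA record above.
first_10_codons = "ATGGCCCACAGGCTAACTGAAGTGAAGAAC"
last_5_codons = "GAAGACGTTCTGTAG"

n_term = translate(first_10_codons)
c_term = translate(last_5_codons)
print(n_term, c_term)
```

The two results can be compared with the start and end of DPOGS202782-PA. Passing the full 1614 bp sequence to `translate` would extend the same check to the whole protein.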