Monarch geneset OGS2.0

DPOGS204714
TranscriptDPOGS204714-TA3810 bp
ProteinDPOGS204714-PA1269 aa
Genomic positionDPSCF300257 - 52713-62914
RNAseq coverage1x (Rank: top 93%)
Annotation
HeliconiusHMEL0030189e-17351.40% 
BombyxBGIBMGA008244-TA2e-17853.27% 
DrosophilaCG18516-PA6e-10138.43% 
EBI UniRef50UniRef50_A8TUC02e-12741.84%Aldehyde oxidase 2 n=5 Tax=Obtectomera RepID=A8TUC0_BOMMO
NCBI RefSeqNP_001103811.13e-12841.84%aldehyde oxidase 2 [Bombyx mori]
NCBI nr blastpgi|1603332476e-12741.84%aldehyde oxidase 2 [Bombyx mori]
NCBI nr blastxgi|1603332472e-12441.91%aldehyde oxidase 2 [Bombyx mori]
Group
Gene OntologyGO:00551141e-90oxidation-reduction process
GO:00164911e-90oxidoreductase activity
GO:00166148.7e-33oxidoreductase activity, acting on CH-OH group of donors
GO:00506608.7e-33flavin adenine dinucleotide binding
GO:00038248.7e-33catalytic activity
GO:00468726.2e-29metal ion binding
GO:00090559.6e-21electron carrier activity
GO:00515369.6e-21iron-sulfur cluster binding
KEGG pathwayppp:PHYPADRAFT_1625148e-65 
 K00106 (XDH)maps-> Peroxisome
    Purine metabolism
    Caffeine metabolism
    Drug metabolism - other enzymes
InterPro domain[844-1258] IPR0082741e-90Aldehyde oxidase/xanthine dehydrogenase, molybdopterin binding
[461-641] IPR0161668.7e-33FAD-binding, type 2
[82-158] IPR0028886.2e-29[2Fe-2S]-binding
[467-639] IPR0023464.7e-26Molybdopterin dehydrogenase, FAD-binding
[643-754] IPR0051071.5e-24CO dehydrogenase flavoprotein, C-terminal
[1-87] IPR0010419.6e-21Ferredoxin
[3-83] IPR0126756.6e-20Beta-grasp fold, ferredoxin-type
[777-843] IPR0006741.7e-15Aldehyde oxidase/xanthine dehydrogenase, a/b hammerhead
[526-635] IPR0161697.6e-12CO dehydrogenase flavoprotein-like, FAD-binding, subdomain 2
Orthology groupMCL10023 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204714-TA
ATGTCCAGAATTCAGTTTCGAGTCAACAACTTACAATGTTCAGTGGGCAGTGAGGTGAGTTCCTCTACAACCCTGCTGGAGTATCTCCGGAGACACCTGGAGCTGCGCGGTACCAAGTACATGTGTCTGGAGGGAGGATGTGGCGCCTGTATCGTGAACGTCGTCAAAAGTCCTGGTGAAGCTTCTTTGGGTGTCAATTCATGCATGGTGCCAATAACTTCTTGCAATGGCTGGGATATAACAACTATAGAGGGAATTGGTAACCGTTTGAAAGGTTATCACCCGATACAAGTGACACTCGCTGAGAACAACGGCTCTCAATGCGGTTACTGCAGTCCAGGATGGGTCATGGCATTGTACAGCATACTCAGAAACCGACGGCCGACGATGCTAGAAATAGAGCAATCATTTGGGAGTAACATATGCAGATGTACAGGATATAGGCCGATTTTAGAGGCGTTCAAGAAATTCGCAATAGATTCTCCTGATGTTAAGGTTATACCAGATATTGAAGATTTAAGATTATGTGAGAAGTCTAGAGAACAGTGTTCGAAAAATAGTTGTAGCGAGTGGGATTGGTGTGTAATAAACAAAAGCGATTTGACTGATGACATTCCTCACATTCAATTACGTGATCACCGAGATTGGTTTAAAGCGACCACTATTGATAATATTTTCTCACTTTGGCAGCAATTTGGTACAGAATCCTATATGCTGGTTGGAGGAAATACGGGAAAAGGCGTTGTTCCAATATTGGAATATCCCAAACTTCTTATTGATATAAATCATATTCCCGAACTACATGGTTATTACGTGGATCAAAATTTGGTGATCGGAGCATCAACGACTCTGACTGATTTAATGACAATATTTGAATTCAAAGGTGCTACTAGAGAATTTAATTATTTGAATATCCTCAATGATCATTTGAGAATGGTAGCTCATATAGCTATTAGAAATTGCATGGTGCCAATAACTTCTTGCAATGGCTGGGATATAACAACTATAGAGGGAATTGGTAACCGTTTGAAAGGTTATCACCCGATACAAGTGACACTCGCTGAGAACAACGGCTCTCAATGCGGTTACTGCAGTCCAGGATGGGTCATGGCATTATACAGCATACTCAGAAACCGACGGCCGACGATGCTAGAAATAGAGCAATCATTTGGGAGTAACATATGCAGATGTACAGGATATAGGCCGATTTTAGAGGCGTTCAAGAAATTCGCAATAGATTCTCCTGATGTTAAGGTTATACCAGATATTGAAGATTTAAGATTATGTGAGAAGTCTAGAGAACAGTGTTCGAAAAATAGTTGTAGCGAGTGGGATTGGTGTGTAATAAACAAAAGCGATTTGACTGATGACATTCCTCACATTCAATTACGTGATCACCGAGATTGGTTTAAAGCGACCACTATTGATAATATTTTCTCACTTTGGCAGCAATTTGGTACAGAATCCTATATGCTGGTTGGAGGAAATACGGGAAAAGGCGTTATTCCAATATTGGAATATCCGAAACTTCTTATTGATATAAATCATATTTCCGAACTACATGGTTATTACGTGGATCAAAATTTGGTGATCGGAGCATCAACGACTCTGACTGATTTGATGACAATATTTGAATTCAAAGGTGCTACTAGAGAATTTAATTATTTGAATATCCTTAATGATCATTTGAGAATGGTAGCTCATATAGCTATTAGAAATTCTGCAACTATTGGAGGAAACTTAGCATTGAAAAATTTACATCCTGGATTTCAATCTGACATTTACATAATTTTGGAGACTGCAGGAGCCCAATTGACCATATCGACAGATATCGACAGCCGCAAAGTTGTAACTATGCAAGAATTTCTCAAAATGGATATGAAAGGAAAAATTATTAAAAACGTCATTTTACCTCCTCTTAATGATAAGCACAAAATTGTGACGTTTAAGGTTGCGCCTCGAAGCCAAAATGCACACGCTTGGGTCCACGCCGGTTTTCACTATATAGTAGACTCGTATGATATAGTTCTAGATTGTATTATCGTGTATGGTGGACTTTCGCCTAATTATACCAGGTCGTGGAAGACTGAGCAATATCTAGTTGGTAAACATTTATGGAATAATAAAACCCTTCAGGGAGCACTAAATGTTCTGAGTGAAGAGTTGCAAGTGTCTGAGAGTCTACCGGATCCCCCGGTACAGTTTAGGCGACTGTCTGCTTTGGGACTATTTTACAAGGGTCTCCTCTCACTTTGTCCTCAAGAGATACTAAATCCTCGTTATATTTCTGGAATGACTAAAATTCATAACACTCGTCCTCTTTCACAAGGACGGCAAAATTTTGAAACTAATCCTGCCCTTTGGCCAATTAATTTGCCAATACCAAAACTGGAAGCCTTAATACAATGTGCTGGAGAAGCTGAATATACAGAGGATTTACCAACACTTCCGAGAGAAGTGTATGCAGCTTTCGTACTTACTACAGTTCCTCTTGGAACCATCACTAAAATCGATGCTAGTAAAGCTTTGGTTGCCATAAACCACGACGGAGTAATTCAGTACGTAGATTATGACCTTTACAGTGATAACGGTTATATTGTTAATGAAACTCTTTTGGTTGCTGGTACCGAAGACTATAACAATGCTTATAGAAGTGACAAGTGGAAATATAGATCTTTTACTGTTACAACCGACACTCCATCAAATAGCTGGTGTCGGGCGCCAGGTTCGTTGGAACATGTTGCAATGGCCGAAACTATCTTAGAAAGAATAGCGTATGAAATGAATATAGATCCATTTGAAATTCGTCTAACAAACTTCGATTCAATTAAATATGATGAGATGATAGGTATGATAACAAGAATAAGAACAGAGTCTCAATACGATGAAAGAAGAGTTCTGGTTGAAAAATTTAACATAGACAATCGCTGGAAAAAGCGTGGTCTAAAAGTCTCATTTCTTAGATGGAATACGTCTCGACGTAAATATTTAGATGTTAACTTAGATGTTTATAAAGATGATGGGACAGTGGCTATAACGCATGGCGGTATTGAAATGGGTCAAGGAATGAATACAAGAGCTGTGCAAATATGCGCCAGTTTTTTGGATATACCTTTGGATAAAATTCAAATAAAATGCACTAACACTATTAACGCACCGAACAATAGCGATACAGAGTCGAGTGTGACATCTCAAAATGTAGGTAGAGGCGTACGTAGATGCTGTGAAGAACTCTTGACACGATTAGAGCCAGCAAAAAAACAGCTGCATGACCCTACTTGGGTGGAACTTGTTCAAAAGGCTTATGCCATGAATATTGATTTGCAAGTACACGGTTTTGTAAGTCCTAATGATGAAGTAGACCAAACTATTTATGGTGTTACAGTCGCCGAAGTTGAAATAGATGTTTTAACAGGGGAATGGGAAATTTTGAGGGTCGATTTAATTGAAGATGTTGGAAGAAGTATTAATCCGGATCTTGATGTTGGTCAAATTGAAGGTGCCTTTGTGATGGGCCTTGGCTACTGGACGACAGAAAATATAGTGTATGAACCTGAAAGTGGAGAAATTCTCTCGGACCGTACTTGGCAATACTGGGTGCCTGGAGCCAGAGACATCCCACAGGATTTTAGGATCTACTTTAGAAAACGGTCGTTTAGTAATGATGCTTTTCTTGGATCTAAAGCAACGGGTGAGCCAGCGACATGTATGGCAATTGTTATACCATTTGCAATGAGGGGGGCCATAGCATCAGCTCGCGAGGAAACTGGCATACCTAAAACTGAATGGTTCCAAATAGGTGATTTTTAG

Protein sequence:

>DPOGS204714-PA
MSRIQFRVNNLQCSVGSEVSSSTTLLEYLRRHLELRGTKYMCLEGGCGACIVNVVKSPGEASLGVNSCMVPITSCNGWDITTIEGIGNRLKGYHPIQVTLAENNGSQCGYCSPGWVMALYSILRNRRPTMLEIEQSFGSNICRCTGYRPILEAFKKFAIDSPDVKVIPDIEDLRLCEKSREQCSKNSCSEWDWCVINKSDLTDDIPHIQLRDHRDWFKATTIDNIFSLWQQFGTESYMLVGGNTGKGVVPILEYPKLLIDINHIPELHGYYVDQNLVIGASTTLTDLMTIFEFKGATREFNYLNILNDHLRMVAHIAIRNCMVPITSCNGWDITTIEGIGNRLKGYHPIQVTLAENNGSQCGYCSPGWVMALYSILRNRRPTMLEIEQSFGSNICRCTGYRPILEAFKKFAIDSPDVKVIPDIEDLRLCEKSREQCSKNSCSEWDWCVINKSDLTDDIPHIQLRDHRDWFKATTIDNIFSLWQQFGTESYMLVGGNTGKGVIPILEYPKLLIDINHISELHGYYVDQNLVIGASTTLTDLMTIFEFKGATREFNYLNILNDHLRMVAHIAIRNSATIGGNLALKNLHPGFQSDIYIILETAGAQLTISTDIDSRKVVTMQEFLKMDMKGKIIKNVILPPLNDKHKIVTFKVAPRSQNAHAWVHAGFHYIVDSYDIVLDCIIVYGGLSPNYTRSWKTEQYLVGKHLWNNKTLQGALNVLSEELQVSESLPDPPVQFRRLSALGLFYKGLLSLCPQEILNPRYISGMTKIHNTRPLSQGRQNFETNPALWPINLPIPKLEALIQCAGEAEYTEDLPTLPREVYAAFVLTTVPLGTITKIDASKALVAINHDGVIQYVDYDLYSDNGYIVNETLLVAGTEDYNNAYRSDKWKYRSFTVTTDTPSNSWCRAPGSLEHVAMAETILERIAYEMNIDPFEIRLTNFDSIKYDEMIGMITRIRTESQYDERRVLVEKFNIDNRWKKRGLKVSFLRWNTSRRKYLDVNLDVYKDDGTVAITHGGIEMGQGMNTRAVQICASFLDIPLDKIQIKCTNTINAPNNSDTESSVTSQNVGRGVRRCCEELLTRLEPAKKQLHDPTWVELVQKAYAMNIDLQVHGFVSPNDEVDQTIYGVTVAEVEIDVLTGEWEILRVDLIEDVGRSINPDLDVGQIEGAFVMGLGYWTTENIVYEPESGEILSDRTWQYWVPGARDIPQDFRIYFRKRSFSNDAFLGSKATGEPATCMAIVIPFAMRGAIASAREETGIPKTEWFQIGDF-