Monarch geneset OGS2.0

DPOGS208007
TranscriptDPOGS208007-TA4098 bp
ProteinDPOGS208007-PA851 aa
Genomic positionDPSCF300270 + 220915-230465
RNAseq coverage9x (Rank: top 85%)
Annotation
HeliconiusHMEL0030222e-16755.41% 
BombyxBGIBMGA008244-TA9e-17957.31% 
DrosophilaCG18522-PA5e-8534.79% 
EBI UniRef50UniRef50_B0X3W18e-12632.83%Xanthine dehydrogenase/oxidase n=3 Tax=Culicinae RepID=B0X3W1_CULQU
NCBI RefSeqXP_001864333.11e-12632.83%xanthine dehydrogenase/oxidase [Culex quinquefasciatus]
NCBI nr blastpgi|1603332493e-12245.42%aldehyde oxidase 1 [Bombyx mori]
NCBI nr blastxgi|1603332494e-13445.42%aldehyde oxidase 1 [Bombyx mori]
Group
Gene OntologyGO:00551142.3e-37oxidation-reduction process
GO:00164912.3e-37oxidoreductase activity
GO:00468728.8e-30metal ion binding
GO:00166141.7e-22oxidoreductase activity, acting on CH-OH group of donors
GO:00506601.7e-22flavin adenine dinucleotide binding
GO:00038241.7e-22catalytic activity
GO:00090556.7e-20electron carrier activity
GO:00515366.7e-20iron-sulfur cluster binding
KEGG pathwayphu:Phum_PHUM2990905e-69 
 K00106 (XDH)maps-> Peroxisome
    Purine metabolism
    Caffeine metabolism
    Drug metabolism - other enzymes
InterPro domain[623-786] IPR0006742.3e-37Aldehyde oxidase/xanthine dehydrogenase, a/b hammerhead
[82-198] IPR0028888.8e-30[2Fe-2S]-binding
[490-598] IPR0051072.7e-24CO dehydrogenase flavoprotein, C-terminal
[205-346] IPR0161661.7e-22FAD-binding, type 2
[3-83] IPR0126753.1e-20Beta-grasp fold, ferredoxin-type
[1-87] IPR0010416.7e-20Ferredoxin
[352-485] IPR0023467.8e-18Molybdopterin dehydrogenase, FAD-binding
[372-479] IPR0161691.3e-09CO dehydrogenase flavoprotein-like, FAD-binding, subdomain 2
[739-851] IPR0082746.4e-07Aldehyde oxidase/xanthine dehydrogenase, molybdopterin binding
Orthology groupMCL10023 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208007-TA
ATGGATAGAGTTCACTTCAGAGTCAACGGAGTGCATTGCTCGGTTGGTAACGAGGTGAGTTCCTCTATAACCCTGCTGGAGTATCTCCGGAGACACTTGGAGCTGCGCGGTACCAAATACATGTGTTTGGAGGGAGGATGTGGAGCCTGTATCGTGAACGTCACAAAACATCCTGGAGGAGAATCCCAAGGAGTCAACTCTTGTATGGTACCTATAACATCATGCAACGAGTGGGATATAACAACAATAGAGGGCATCGGGAATCGTCTGCACGGCTACCATCCAATTCAGGTGACACTGGCTGAAAACAATGGCACACAGTGTGGCTACTGCAGTCCAGGATGGGTCATGGCTATGTACAGTATTTTAAAAAATAAAAAACCGACGATGTTGGAAGTAGAGCAGTCATTTGGAAGCAACATCTGCCGGTGTACTGGATACAGACCCATCTTAGACGCGTTCAAGAAGTTCGCTTCAGATGCTCACGATGTATTAGATATCGAGGACCTAGAAATATGTAAAAAGTCTGGTCGACCGTGTGCGAAGAATAGTTGTGACGAATCTGATTGGTGTTTATTATCTAAAAACGAACTTAATGGAAAACTACTGCACATTATATTAAATGATAACAGGGACTGGTTCAAAGCGACGTGTATATCTGACATATTTGAAATTTTCCAAAAATGGGGAACTGAGTCTTATATGCTTCTGGCTGGAAATACAGGGAAAGGCGTTTTTCCAATATTAGAGTATCCAAGAGTATTGATAAATGTAAATGATGTCAAGGAATTGAGGCAACAATATATTGACCAAAACTTAGTGATTGGAGGAGCGACGACCCTCACAGAACTGATAAATATATTCGATACAGTGGGTCACACTGAATACTTTGGATATCTGCTGGTATTGAAAGACCACTTAGAGGAAGTGGCACACATTACTATAAGAAATAATGCAACAGTTGCTGGTAATCTTATGCTGAAGAATTTTCACCTCGATTTTAAATCCGACATTTTCATACTTTTTGAGACTGTTGGTGTGTATCCAATATTGGAATATCCAAGAGTTTTGATAAATGTAAATGATGTCAAGGAGCTGAGAGAACACTATATAGACCAAAACTTAGTGATTGGAGGAGCGACGACCCTTACAGAACTGATAAACATATTCGATACAGTAGGCCGGGTCAATTTCTTTGGATATCTCAAGATATTAAACGAACACTTACAAGAGGTTGCCCATATTCCTATTAGAAACAATGCAACAATCGCTGGTAATCTTATGCTAAAAAATTTGCATCCTGATTTCAAATCCGACATTTTCATACTTTTCGAAACAATCGGAGCTCAGTTAACTATACAGACTGGTCGCAACCAACTAAAGATCATCACAATGCAATCTTTCCTTTCAGAAAATATGCATGGAAAAATATTATTAAATGTTTTACTTCCACCGCTGAGTACTGAACATAAGATAGTAACTTTCAAAATAACGCCGCGGTCCCAAAACGCCCATGCTCTTATCCATGCTGGGTTTCTTTATAAAGTAGATCATAATGAAAGAGTTCTAGAAAGCCGAATTGTCTACGGAGGACTCTCACCATCATATACCAGATCTTGGAAAACAGAAAGATATCTGATCGGTAAACAACTTTTACGGAATGAGACGTTGCAAGGAGCCTTAAAAGTTCTTAACACAGAACTGGTAGTTACGGAAAGTCTGCCAGATCCCTCTGTACAGTACAGGCGACAAGTAGCTTTAGCACTTTTCTACAAGGGACTTCTTTCTCTATGCGCACAAAACAGATTAAATCCTCGTTACGTATCTGGATCCAGCAAAATTCATAAAACAAGACCAGTGTCTGAAGGAACTCAGATATTCGATACGAATCCAAGTCTGTGGCCTCTAAACAAACCAATACCCAAACTGGATGGTTTGATTCAATGTGCCGGTGAAGCAAAATATTCTGAAGACGTTCCAAGACTTCCGGGAGAAGTGTTCGCCGCATTTGTTTTAACAACTGTGGCTCTGGGAAAAATTAATCATATTGACGCTAGTCGTGCTTTGGAGGAGCCTGGAGTATTGGCATTTTATACAGCAGCAGATATCCTAGGCAGAAATAGTTTTATACCTGCTGTTAATTTGTTTAACAGAGCTGATGAAGAATTCTTGTGCAACGGAGAAGTTAAATATTTTAATCAGCCCCTTGGAATAATTGTTGCCGAATGTCAAAGCATTGCAGACAAAGCAGTACATCTTGTACAAGTTATTTATTCTGATATAAAGAATCCGGTCCTCGACATCAGGGTTGCCAAACATGATCCCTCAAAACTGAAATTGTTTCAAACGATAAACGCAACTTCTGCTGGTACAGATATCGCTAAAGTAATAAAAGGTGAACAAAGTATCTATACACAATATCCCTTCACTATGGAAACTTTGGTTACTGTGACACATCCTACAGAAGAAGGTTTAAGAATATACGCAGCAACACAATGGATGGATTCAGTTCATGTAGTGATTTCAAGAGCTCTTCTCCTAGATCAAAATAGGTAATAGTATTAATTTTTATTCAAATAACTCTTGAGTTCCGTCAAAACACTTTTTGTATTTAAATAAATTATTTAATCATATTCTGACAACAATTTTTCTAATACTCTTAAATAATTGAAATGAAATATTTCAGAATAGATATTCTTGTCCGTCGTTTGGGTGGTGGGTATGGCTACAAGTTATCAAGAGTTACACAAGTGTCTCTAGGAAGTGCTTTGGTTGCATATAAACTCAATCGACCTTGTCGTTTCATACAAAGCCTTAGTACTAATATGAGAGCTACCGGGAAACGATTTCCATGTTCTACAAGTTTTGAGGTAATATTTAAAAGTATTTGTACAATAAATTTTTAAATAAATTCTACGAATCGAGTAAGGTCCATCAATGTTTCTATTTCTTTGTAGTATCCAATTTTGAGTAGGCAATCCAACACGGCAAACATATTTTTTTTTTTCGGTAAATATAATACGTAAGAATGATGGTGATGATAAATATGTTAGAAAAGTATGATAAATTAAAAACGTAGTAAGTGTTCATAGTAGCAATAGCAAATGTGTGCCTTGTACGATAACTGGAAACAAGATAAAAAGGTACAGAGTAGCTTACCAAATCCAATGTCACCTCTCATATTGCAAGTAATTGAGATGGGATCCCATTGGAACTCAATACTTAGAGGTGAATATGTGTGTTTATTACGATGATGGCACTGTTGCTTTAACTCATGCAGGCATAGAAATGGGACAGGGAATTAATACTAAAGCTATACAAATAGCAGCTTATTTTCTTAAAATCCCCATAGAGAAAATTCAAGTCAAACCTAATGATACTGTTATTGCACCTAATTGTTTTGGATCGGGGGGAAGTATAACGTCTCAAAATATAGGAATAGGTGTACAGAGATGTTGTGAAGAATTACTTAGAAGACTTGAACCAGTTAGAAACCAGTTGAATAACCCATCTTGGGAGGAATTGGTGAAAAAAGCTTATGAAATGAATGTAGATTTACAAGTACATGATTTGGTAAGTGCTAAAGATGAACAGAAATATAATATCTATGGTGTAACCCTAGCCGAAGTTGAAATAGATGTTCTGACTGGTGAATGGGAAATAATGAGAGTTGATCTAATTGAAGACGTAGGTAGAAGTGTTAACCCTGAATTGGATCTCGGTCAAATTGAAGGTGCTTTTATAATGGGCGTTGGCTATTGGACTACTGAAAATATTGTGTATGGTCCTGAAAATGGGGAAATTCTCACGGACCGTACATGGGAATACTGGGTGCCTGGTCCTAGGGACATTCCCCAGGACTTTCGGGTCTATTTCAGAAAAAGATCTTTCAGTACTGAGAAAATTTTAGGAGCTAAAGCATCTGGTGAACCTGCAACATGTATGGGAATATCAGTGCCATTTGCTATGAGAGCAGCTATAGCTTCAACAAGAAAAGAGTCTGGAATGCCTGAATGGTTTCAAATAGATGGTCCTTTCACCGTTGATAAAATTTATCTTGCATGTGCTACAAAGTTTGAAGATTTTAAGTTTTACTAA

Protein sequence:

>DPOGS208007-PA
MDRVHFRVNGVHCSVGNEVSSSITLLEYLRRHLELRGTKYMCLEGGCGACIVNVTKHPGGESQGVNSCMVPITSCNEWDITTIEGIGNRLHGYHPIQVTLAENNGTQCGYCSPGWVMAMYSILKNKKPTMLEVEQSFGSNICRCTGYRPILDAFKKFASDAHDVLDIEDLEICKKSGRPCAKNSCDESDWCLLSKNELNGKLLHIILNDNRDWFKATCISDIFEIFQKWGTESYMLLAGNTGKGVFPILEYPRVLINVNDVKELRQQYIDQNLVIGGATTLTELINIFDTVGHTEYFGYLLVLKDHLEEVAHITIRNNATVAGNLMLKNFHLDFKSDIFILFETVGVYPILEYPRVLINVNDVKELREHYIDQNLVIGGATTLTELINIFDTVGRVNFFGYLKILNEHLQEVAHIPIRNNATIAGNLMLKNLHPDFKSDIFILFETIGAQLTIQTGRNQLKIITMQSFLSENMHGKILLNVLLPPLSTEHKIVTFKITPRSQNAHALIHAGFLYKVDHNERVLESRIVYGGLSPSYTRSWKTERYLIGKQLLRNETLQGALKVLNTELVVTESLPDPSVQYRRQVALALFYKGLLSLCAQNRLNPRYVSGSSKIHKTRPVSEGTQIFDTNPSLWPLNKPIPKLDGLIQCAGEAKYSEDVPRLPGEVFAAFVLTTVALGKINHIDASRALEEPGVLAFYTAADILGRNSFIPAVNLFNRADEEFLCNGEVKYFNQPLGIIVAECQSIADKAVHLVQVIYSDIKNPVLDIRVAKHDPSKLKLFQTINATSAGTDIAKVIKGEQSIYTQYPFTMETLVTVTHPTEEGLRIYAATQWMDSVHVVISRALLLDQNR-