Monarch geneset OGS2.0

DPOGS208000
Transcript        DPOGS208000-TA  1935 bp
Protein           DPOGS208000-PA  644 aa
Genomic position  DPSCF300270 - 171735-179583
RNAseq coverage   41x (Rank: top 72%)
Annotation
Heliconius      HMEL011718        2e-130  49.06%
Bombyx          BGIBMGA008244-TA  7e-118  56.29%
Drosophila      CG18516-PA        6e-72   40.40%
EBI UniRef50    UniRef50_A7UU58   2e-81   41.27%  AGAP006220-PA n=1 Tax=Anopheles gambiae RepID=A7UU58_ANOGA
NCBI RefSeq     XP_001864334.1    1e-82   45.22%  aldehyde oxidase [Culex quinquefasciatus]
NCBI nr blastp  gi|170057106      2e-81   45.22%  aldehyde oxidase [Culex quinquefasciatus]
NCBI nr blastx  gi|160333247      6e-80   44.41%  aldehyde oxidase 2 [Bombyx mori]
Group
Gene Ontology  GO:0055114  1e-27    oxidation-reduction process
               GO:0046872  1e-27    metal ion binding
               GO:0016491  1e-27    oxidoreductase activity
               GO:0016614  5e-25    oxidoreductase activity, acting on CH-OH group of donors
               GO:0050660  5e-25    flavin adenine dinucleotide binding
               GO:0003824  5e-25    catalytic activity
               GO:0009055  1.2e-17  electron carrier activity
               GO:0051536  1.2e-17  iron-sulfur cluster binding
KEGG pathway  aag:AaeL_AAEL002683  2e-38
              K00106 (XDH) maps-> Peroxisome
                                  Purine metabolism
                                  Caffeine metabolism
                                  Drug metabolism - other enzymes
InterPro domain  [116-195]  IPR002888  1e-27    [2Fe-2S]-binding
                 [513-637]  IPR008274  4.6e-25  Aldehyde oxidase/xanthine dehydrogenase, molybdopterin binding
                 [242-388]  IPR016166  5e-25    FAD-binding, type 2
                 [248-389]  IPR002346  1.4e-20  Molybdopterin dehydrogenase, FAD-binding
                 [35-119]   IPR001041  1.2e-17  Ferredoxin
                 [50-115]   IPR012675  1.6e-17  Beta-grasp fold, ferredoxin-type
                 [430-519]  IPR000674  7e-15    Aldehyde oxidase/xanthine dehydrogenase, a/b hammerhead
                 [307-390]  IPR016169  3.4e-08  CO dehydrogenase flavoprotein-like, FAD-binding, subdomain 2
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208000-TA
ATGAACACGATGTGTTCGCCAGCAAACACAATAAGTTTTACAGTTAATGGTGAAAAATATTCAGGTATTATATTTCTCATTGTATATGTTATGAAAATAATACCAAAAAACTATTATATCGTTATGTCGTTGTCCGCAGTGGGCGGTGAGGTGAGTTCATGCACGACCTTACTGGACTATCTCCGACGACATCTCGAGTTACGCGGTACTAAATATATGTGTTTGGAGGGAGGATGTGGTGCCTGTATTGTCAATGTCACTAAGAAGCCGGGTGAACCTTCACAGAGTATCAATTCGTGTATGGTTCTTATAACATCATGCGCCGATTGGGATATCAATACAATAGAGAAAGTAGGTAACCGAAAAGACGGTTATCATGTGCTGCAAAAAGCTCTTGCCGAAAACAATGGAACGCAGTGTGGTTACTGTTCACCGGGATGGATCATGGCAATGTACAGTATTTTAAAAAACAGAAGACCTACTATGCTTCAAATAGAACAATCCTTCGGTAGCAACATTTGCAGATGCACAGGATATAGACCGATATTGGAAACTTTTAAGCGATTTTCATCGGATTCTGAAAATCCAATCAATATTTTGGATATTGAAGACCTAAAATTAGAAACATGTTCCAAGAGTGGATCGATTTGTTCTCAACATCGCTGCGACGAGTTTGAATGGTGTATGGTGTCAAAGAATCATATATATAATGAAATTCTGCATATTAAATTAAGGGATAACAGGGATTGGTATAACGCGGGAGATACAGATGACATTTTTTCTATATGGAATAAAAAAGGAACAGATTCTTATATGTTGATAGCCGGTAACACAGGAAAAGGTGTATACCCAATAATCGAGTATCCAAAAGTACTTATAAATATTAATGAAATATCCGAACTGAGAAAATACTATCTAGACCAGAATTTAGTGATCGGGAGTTCAACGACCCTTACCGAATTTATGAATATAATTGAAGTTGAATCTAGCACTGATAATTTTTCCTATCTTAAGATTTTATACGACCATTTAAAATTAGTAGCCAACATTAGTATAAGAAATATAGCAACTATTGGTGGTAACCTTATATTGAAAAATCGCCATCCAGAATTCCAGTCAGATATCTACTTACTTCTGGAGACAGTTGGTGCTCAGATAACCATTTGTGCTCTGGCAATACTGAGTAATGAGATTGAGGTGACAGAAAACTTACCGATGCCACCGGTGGCGTATCGACGTCAAACAGCTCTGGCTTTATTTTATAAGGGTCTCTTATCATTGTGTCCACAAAGCAAATTAAAATCACGCTATGCATCCGGCTCTATCAAGATCCACGAAACTCGAAAAGTTTCAGAAGCTCAATTCTTTTATGAAACCGATCCTTCGTTATGGCCTCTGACCAAACCTATACCAAGACTTAATGGATTGGTACAATGTGCTGGTGAAACTAAGTATGTAGACGATTTGGTACAGCAACCGGGTGAAGTGTTTGCAGCTTTCGTGTTGTCTACTGTAGCCCTCGGAACTATTGTTAATATAGACGCCAGCAAAGCATTAGTAGAAGGTGCGTTTACGCTTGGTGTTGGGTACAATACTTGTGAGCAAATCGTAAATGACCCTCACACGGGTGAAGTTCTTACCAATCGCACTTGGAATTATTGGGTTCCAGGTGCCACCGATATACCCCAAGATATGAGGATATATTTTAGAAAACGATCATTTAGTTATGAAGCTATTCTTGGATCCAAGGCGACTGGTGAACCGGCAACATGCATGGGTGTTGCTGTGCCTTTTGCGATGAGAGCTGCCATTGTAGCTTCTCGCCAAGAGTCGGGAAAACCCTACAACGAGTGGTTTCAAATAGATGGCGCTTGTACTGTGGATAAAATCGCTATTGCTTGCTCTACCAAAGTTGAAGAGTTTCAGTTTTTGTAA

Protein sequence:

>DPOGS208000-PA
MNTMCSPANTISFTVNGEKYSGIIFLIVYVMKIIPKNYYIVMSLSAVGGEVSSCTTLLDYLRRHLELRGTKYMCLEGGCGACIVNVTKKPGEPSQSINSCMVLITSCADWDINTIEKVGNRKDGYHVLQKALAENNGTQCGYCSPGWIMAMYSILKNRRPTMLQIEQSFGSNICRCTGYRPILETFKRFSSDSENPINILDIEDLKLETCSKSGSICSQHRCDEFEWCMVSKNHIYNEILHIKLRDNRDWYNAGDTDDIFSIWNKKGTDSYMLIAGNTGKGVYPIIEYPKVLININEISELRKYYLDQNLVIGSSTTLTEFMNIIEVESSTDNFSYLKILYDHLKLVANISIRNIATIGGNLILKNRHPEFQSDIYLLLETVGAQITICALAILSNEIEVTENLPMPPVAYRRQTALALFYKGLLSLCPQSKLKSRYASGSIKIHETRKVSEAQFFYETDPSLWPLTKPIPRLNGLVQCAGETKYVDDLVQQPGEVFAAFVLSTVALGTIVNIDASKALVEGAFTLGVGYNTCEQIVNDPHTGEVLTNRTWNYWVPGATDIPQDMRIYFRKRSFSYEAILGSKATGEPATCMGVAVPFAMRAAIVASRQESGKPYNEWFQIDGACTVDKIAIACSTKVEEFQFL-