Monarch geneset OGS2.0

DPOGS210796
TranscriptDPOGS210796-TA3084 bp
ProteinDPOGS210796-PA1027 aa
Genomic positionDPSCF300027 - 1120442-1130020
RNAseq coverage632x (Rank: top 20%)
Annotation
HeliconiusHMEL0141800.085.30% 
BombyxBGIBMGA007111-TA0.080.85% 
Drosophilasu(r)-PB0.068.76% 
EBI UniRef50UniRef50_Q128820.063.87%Dihydropyrimidine dehydrogenase [NADP+] n=142 Tax=root RepID=DPYD_HUMAN
NCBI RefSeqXP_001648193.10.070.53%dihydropyrimidine dehydrogenase [Aedes aegypti]
NCBI nr blastpgi|1571039450.070.53%dihydropyrimidine dehydrogenase [Aedes aegypti]
NCBI nr blastxgi|1571039450.070.53%dihydropyrimidine dehydrogenase [Aedes aegypti]
Group
Gene OntologyGO:00081527.1e-80metabolic process
GO:00038247.1e-80catalytic activity
GO:00041589e-53dihydroorotate oxidase activity
GO:00551149e-53oxidation-reduction process
GO:00057379e-53cytoplasm
GO:00062079e-53'de novo' pyrimidine base biosynthetic process
GO:00041527.8e-37dihydroorotate dehydrogenase activity
GO:00062227.8e-37UMP biosynthetic process
GO:00515363.9e-35iron-sulfur cluster binding
GO:00164917.8e-35oxidoreductase activity
KEGG pathwayame:4102070.0 
 K00207 (E1.3.1.2, DPYD)maps-> Pantothenate and CoA biosynthesis
    Drug metabolism - other enzymes
    Pyrimidine metabolism
    beta-Alanine metabolism
InterPro domain[494-838] IPR0137857.1e-80Aldolase-type TIM barrel
[529-834] IPR0057209e-53Dihydroorotate dehydrogenase, class 1
[529-834] IPR0121357.8e-37Dihydroorotate dehydrogenase, class 1/ 2
[3-197] IPR0090513.9e-35Alpha-helical ferredoxin
[36-188] IPR0122857.8e-35Fumarate reductase, C-terminal
[188-484] IPR0237533.4e-08Pyridine nucleotide-disulphide oxidoreductase, FAD/NAD(P)-binding domain
Orthology groupMCL12118 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210796-TA
ATGTCGAAGGTATTACTGAGTCGAGATCTGCCGGATATTGAGAACCTTCTTAAGCTTAACCCGACTGTTAAGCCATATACCAACCTGGTGCCATCAGCACAGACGAAGAAGAATAAACAGCACTGGAAAAGGAATGCCGATAGAAAATGCACTAGTTGTCCAAGTCTCGAGAGGAATTTCGATGACATCAAACACACGACATTGTCTGAGCGTGGTGCTTTGAAAGAGGCAGCTCGTTGTCTGAAATGTGCTGACGCGCCCTGTCAGAAGTCCTGCCCGACGCAGATCGACGTCAAAAGCTTTATAACAAGCATAGCTAATAAGGCAATGAACTATTATGGCGCCGCCAAAGAAATACTCTCAGACAACCCCTTGGGTCTCACTTGTGGAATGGTTTGCCCCACCAGTGACTTGTGCGTCGGCGGATGTAACTTACACGCGAGCGAAGAGGGCGCCATTAATATCGGTGGATTACAGCATTTCGCTGTTGAAACATTCATGAAAATGGGTATCTCCCAGACATTGGATCCAAAAACGAAACCCCTGCCTCGAGGTGATAAAAAAATTGCTCTTATAGGCGGCGGACCGGCAAGCTTGAGTTGTGCTTGTTTTTTGGCAAGGATGGGTTATAAGAATATTACGGTGTTTGAGAAAGAAAAATATCTTGGAGGACTAAGTTCATCAGAGATCCCACAGTACCGCCTTCCGTATGACGTGGTTCAATTCGAGATCGATCTCATCAGGGACCTTGGAGTGAAGTTCGTCACAGGAAGGAAACTTTCAACCAGTGACATAACTGTTAATGGACTTTTTAAAGATGGTTATGATGCTGTGTTTCTTGGCATCGGTCTGCCCGAGCCAAAGAGCATTCCTATCTTCGAAAACCTAACACCAGAAATGGGATTTTACACTAGCAAACAGTTTTTACCCCTGGTGTCCAGGGGCAGCAAAAGAGGCTTATGTAGCTGTTCGTCGTTGCCGGTTCTGTCTGGTACTGTGTTGGTTCTTGGGGCGGGAGACACGGCCTTCGACTGCGCCACATCAGCACTCCGCTGCGGCGCCAAGAAAGTGTTTGTCGTGTTCAGGAAAGGCATCACTCACATAAGAGCTGTTCCCGAGGAGGTTGATCTGGCTAAGGAAGAAAAATGTGAGTTCATACCATTTATGTCACCGCGAGAAGTAATCGTCAGAAACGGAAAGATAACTGCCTTAAAAATGTTCAGAACTGAACAATTAGATGATGGCGAATGGTTCGAAGACCCAGATCAAGTGTTGCAGCTGAAGGCCGACTTCATCATATCCGCGTTCGGATCTGGACTTTATGATGAAGACGTGAAACAGGCGATGGATGGCGTGAAATTAAATAGCTGGGGTCTTCCCGAAATAGATAATACTTGTATGCAGAGTTCGAGCAATCCGAAGGTGTTTGTTGGCGGGGACTTGGCCGGCGTCGCTGACACCACCGTGGAGTCCGTCAACGACGGAAAAACTGCCGCTTGGTACATGCATTGCTACCTACAGGGCATACCATTCTCAGCGGCCATTGAGCTGCCGAAATTCCATACAGACATCGATGAGGTGGATCTGTCAGTGGAGGTCTGCGGAATCCGGTTCGAGAATCCTTTCGGCCTAGCGAGCGCTCCGCCCACCACCAGCTCCGCCATGATAAGGAGAGCCTTCACCCAGGGCTGGGGCTTCGTGGTTACTAAGACCTTCGGACTTGATAAGGACATCGTTACGAACGTGTCACCGCGTATAGTACGCGGTGTGACCTCGGGAGAGAACTACGGACCCGGCCAAGGATCCTTCTTGAATATTGAGCTGATCTCCGAAAAATGTGAAGCTTACTGGTGTCAGAGTATCACGGAGCTAAAACGGGACTTCCCCACTAAGGTGATAATAGCGTCTATAATGTGTTCATACAACGAGGACGACTGGACGGAGCTGGCGAGGAAAGCCGAGGCGAGCGGAGCGGACGCGCTGGAGCTCAACCTGTCCTGCCCGCACGGCATGGGAGAGAGCGGGATGGGGCTAGCGTGCGGACAGGACCCGGTACTGGTGAAAGGAATATCCCAGTGGGTGCGGAAGGCTATAAACATACCGTTTTTCGTGAAGCTGACTCCCAACATCACAGACATAGTATCAATAGCGACGGCCGCTTATGAAGGCGGCGCTAGTGGTGTGTCTGCCATCAACACCGTGTCCGGTCTGATGACAGTTAGGGCGGATGCTACTCCGTGGCCTCAAGTCGGTCGCGAGAAAAGCACAACATACGGCGGTGTATCCGGCAACGCGACGCGGCCGATGGGATTGCGAGCTGTGTCCGCCATCGCTAACAAGTTACCGGGATTCCCCATACTCGGCATAGGAGGCATTGACTCCGCGGACTCAGCCTTGCAGTTCATGCTGTGCGGCGCACCAGTCGTTCAGATCTGTAGTGCGGTTCAAAACCAGGACTTCACCGTTGTTGAGGACTATGTGACAGGTCTTAAGGCGCTTTTATATTTACGATCGAGGGGTCTCCAAGGCTGGACGGGACAATCGCCGCCCACCAACAAACATCAGAAAGGGAAACCGGTGCAGACGATATGTGATGAAAACGGCAAGGTGTTAGCACACTTCGGTCCTTATATTAAGAAACGCGAAGGAGTACTTCACGAGCAACGTTTAAATACAAATATTTTGGCTGACAACACCACGGAATGTCCACGGAGGAATGTCAACGGCCATAATAAGGTGCCAAGGATCAGAGACGTCGTCGGTGAAGCGCTGAACAGGATCAGTTCATACAAAAAACTGGACAATACAAAACAAGTTGTAGCCTTGATAGACGATGATATGTGTATTAACTGTGGTAAATGTTACATGGCGTGCGCGGATTCAGGATACCAGGCTATTGAATTCGATGAACAAACGCACATACCTCGCGTCACTGAAGACTGTACCGGATGTACTTTGTGCCTTTCCGTTTGTCCCATCATCGACTGTATATCAATGGTTCCCAAGAAAATACCACACGTTATAAAACGGGGACTTCACTACGAGATACACCCTGTATCACCGTTGGACGGAGTTTGTCAATAA

Protein sequence:

>DPOGS210796-PA
MSKVLLSRDLPDIENLLKLNPTVKPYTNLVPSAQTKKNKQHWKRNADRKCTSCPSLERNFDDIKHTTLSERGALKEAARCLKCADAPCQKSCPTQIDVKSFITSIANKAMNYYGAAKEILSDNPLGLTCGMVCPTSDLCVGGCNLHASEEGAINIGGLQHFAVETFMKMGISQTLDPKTKPLPRGDKKIALIGGGPASLSCACFLARMGYKNITVFEKEKYLGGLSSSEIPQYRLPYDVVQFEIDLIRDLGVKFVTGRKLSTSDITVNGLFKDGYDAVFLGIGLPEPKSIPIFENLTPEMGFYTSKQFLPLVSRGSKRGLCSCSSLPVLSGTVLVLGAGDTAFDCATSALRCGAKKVFVVFRKGITHIRAVPEEVDLAKEEKCEFIPFMSPREVIVRNGKITALKMFRTEQLDDGEWFEDPDQVLQLKADFIISAFGSGLYDEDVKQAMDGVKLNSWGLPEIDNTCMQSSSNPKVFVGGDLAGVADTTVESVNDGKTAAWYMHCYLQGIPFSAAIELPKFHTDIDEVDLSVEVCGIRFENPFGLASAPPTTSSAMIRRAFTQGWGFVVTKTFGLDKDIVTNVSPRIVRGVTSGENYGPGQGSFLNIELISEKCEAYWCQSITELKRDFPTKVIIASIMCSYNEDDWTELARKAEASGADALELNLSCPHGMGESGMGLACGQDPVLVKGISQWVRKAINIPFFVKLTPNITDIVSIATAAYEGGASGVSAINTVSGLMTVRADATPWPQVGREKSTTYGGVSGNATRPMGLRAVSAIANKLPGFPILGIGGIDSADSALQFMLCGAPVVQICSAVQNQDFTVVEDYVTGLKALLYLRSRGLQGWTGQSPPTNKHQKGKPVQTICDENGKVLAHFGPYIKKREGVLHEQRLNTNILADNTTECPRRNVNGHNKVPRIRDVVGEALNRISSYKKLDNTKQVVALIDDDMCINCGKCYMACADSGYQAIEFDEQTHIPRVTEDCTGCTLCLSVCPIIDCISMVPKKIPHVIKRGLHYEIHPVSPLDGVCQ-