Monarch geneset OGS2.0

DPOGS208592
TranscriptDPOGS208592-TA2574 bp
ProteinDPOGS208592-PA857 aa
Genomic positionDPSCF300052 - 571518-588497
RNAseq coverage701x (Rank: top 18%)
Annotation
HeliconiusHMEL0165866e-10867.34% 
BombyxBGIBMGA005717-TA0.077.24% 
DrosophilaCRMP-PE7e-14559.39% 
EBI UniRef50UniRef50_Q8IPQ21e-14259.39%Collapsin response mediator protein n=21 Tax=Bilateria RepID=Q8IPQ2_DROME
NCBI RefSeqXP_973416.21e-15961.56%PREDICTED: similar to dihydropyrimidinase [Tribolium castaneum]
NCBI nr blastpgi|3800148893e-16565.33%PREDICTED: dihydropyrimidinase-like [Apis florea]
NCBI nr blastxgi|3800148892e-16065.33%PREDICTED: dihydropyrimidinase-like [Apis florea]
Group
Gene OntologyGO:00168123.8e-151hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in cyclic amides
GO:00057373.8e-151cytoplasm
GO:00062083.8e-151pyrimidine base catabolic process
GO:00168103.7e-38hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds
GO:00167873.6e-17hydrolase activity
KEGG pathwaytca:6622094e-159 
 K01464 (E3.5.2.2, DPYS)maps-> Pantothenate and CoA biosynthesis
    Drug metabolism - other enzymes
    Pyrimidine metabolism
    beta-Alanine metabolism
InterPro domain[18-429] IPR0117783.8e-151Hydantoinase/dihydropyrimidinase
[17-721] IPR0110593.7e-38Metal-dependent hydrolase, composite domain
[66-414] IPR0066803.6e-17Amidohydrolase 1
Orthology groupMCL10338 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208592-TA
ATGGCCTCACAATCAGTATCCGGTATTAATATTAAAAGCTCTCAAAACCGTCTCCTCATCAAAAATGGCTGTATCGTGAATGCTGATGGGATGGAAGACGCTGATGTTTACATCGAAGATGGTGTCATCAAGCAAGTGGGTAGGAATTTAATAATTCCTGGTGGGACTCGCACCATAGACGCCACCGGCAAGCTGGTCATGCCTGGTGGTATCGATCCCCATACACACTTCGAGTTAGAGATGATGGGCGCCAAGACCGCTGACGACTTCTATAAAGGCACGCGAGCGGCCGTGGCTGGTGGCACCACCACTATCATTGACTTTGTGCTGCCTCAGAAAGGACAGTCGTTGATAGAAGCCTACGGGAATTGGAGGCAGAAGGCTGACAATAAGGCGTGTTGCGATTACGCCTTGCACGTGGGTGTGACTTGGTGGTCAGCTTCCGTTAAGAAGGAGATCTCCCAGTTGGTGCACGATCACGGCGTGAACTCCTTCAAGATGTTCATGGCGTACAAAGACGTTTGGATGCTGGATGACTATAACATGTGCCTGGCGATGGAGGCGTGCGCCGAGCTGAAGGCACTACCCATGGTGCACGCGGAAAATGGTGGGATTATAGCACGTAACTCGGAAAAGTTGCTGGAAGCTGGGGTCAAAGGTCCCGAGGGTCATGCAGCGTCCAGAGACGACCAGGTCGAGGCCGAGGCCGTCAACCGCGCCTGTGTCATCGCTAACCAGATGGACGCTCCGCTGTATATAGTACACATGATGTCTGCCGCCGCCGTGCAGTCACTGCGTAACGCGCGTCGCGTCGCCAAACATCCAATATTCGGTGAAACTTTGGCCGCGACGGTTGGCACTGATGGTTCGCACTACAAGAACGCGTGTTTCCGCCACGCCGCCGCCCACGTCCTCTCCCCGCCACTCCGCGACCCCAGCACACCGGAAGCCATCATCGACGCCCTCGCACAGTCGCTGCTGGTCCAAGTGATAGCCAGCGACAACTGCACCTTCAATGAAAAAGATAAGGAATTGGGGAAAAACGACTTCACCAAGATACCTAACGGCGTGAACGGGGTCGAGGACCGCATGGCTATACTGTGGCAGAAAGCGGTCAACACTGGTGTCATGGACCCTTGTCGTTTCGTGGCCGTGACTAGTACCAACGCTGCGAATATCTTCAACCTACCATCCAAGGGCCGCGTGGCGGTGGGCGCGGACGGTGACGTCATCGTTTGGGACCCTCGCCTCGAGAAGACCATTTCCGCCGCGACCCACCACCACGCCGTGCACGCGGAAAATGGTGGGATTATAGCACGTAACTCGGAAAAGTTGCTGGAAGCTGGGGTCAAAGGTCCCGAGGGTCATGCAGCGTCCAGAGACGACCAGGTGGAGGCCGAAGCCGTCAACCGCGCCTGTGTCATCGCTAACCAGATGGACGCTCCGCTGTATATAGTACACATGATGTCTGCCGCCGCCGTGCAGTCACTGCGTAACGCGCGTCGCGTCGCCAAACATCCAATATTCGGTGAAACTTTGGCCGCGACGGTTGGCACTGATGGCCAGGCAATGTTGGTCTCATTAGTGTATAACCCGCCAGGTTCGCACTACAAGAACGCGTGTTTCCGCCACGCCGCCGCCCACGTCCTCTCCCCGCCACTCCGCGACCCCAGCACACCGGAAGCCATCATCGACGCCCTCGCACACGACGACCTCCAAGTGATAGCCAGCGACAACTGCACCTTCAATGAAAAAGATAAGGAATTGGGGAAAAACGACTTCACCAAGATACCTAACGGCGTGAACGGGGTCGAGGACCGCATGGCTATACTGTGGCAGAAAGCGGTCAACACTGGTGTCATGGACCCTTGTCGTTTCGTGGCCGTGACGAGTACCAACGCTGCGAATATCTTCAACCTACCGTCCAAGGGCCGCGTGGCGGTGGGCGCGGACGGTGACGTCATCGTTTGGGACCCTCGCCTCGAGAAGACCATTTCCGCCGCGACCCACCACCACGCCGTAGATTTTAATATATTTGAGGGTCAGCGCGTGGTCGGTGGACCTCAATACGTTATTGTGAACGGTCGAGTGTGTCTCGATGACGGTGACCTTAGGGTCGCTGAAGGTTACGGTAAATTCTTACCCACACCACCAAATTCTCCGTACGTGTACGGTGAAGTACCCACCACGCCGCAACCGGAAAGGGTTGAATACTTGCCCTCACCCGCCAGGGTCACTAACGGGACTCCCACAGAACTGCAGATATCTCACAAACTAGAAGCTACTTCCGTATCCGGCTGCAGCACGCCCACCGGCCGGAAGATGAGGGAGCCCGGACAGAGAAACCTTCAGAATTCCACCTTCTCCATCAGCCAACTGCAGATATCTCACAAACTAGAAGCTACTTCCGTATCCGGCTGCAGCACGCCCACCGGCCGGAAGATGAGGGAGCCCGGACAGAGAAACCTTCAGAATTCCACCTTCTCCATCAGCCAGGAAATGGAGGGACTCGACACGAAGACGTCAGTGCGCGTACGGAACCCACCCGGCGGGAAGTCATCCGGTTTGTGGTAA

Protein sequence:

>DPOGS208592-PA
MASQSVSGINIKSSQNRLLIKNGCIVNADGMEDADVYIEDGVIKQVGRNLIIPGGTRTIDATGKLVMPGGIDPHTHFELEMMGAKTADDFYKGTRAAVAGGTTTIIDFVLPQKGQSLIEAYGNWRQKADNKACCDYALHVGVTWWSASVKKEISQLVHDHGVNSFKMFMAYKDVWMLDDYNMCLAMEACAELKALPMVHAENGGIIARNSEKLLEAGVKGPEGHAASRDDQVEAEAVNRACVIANQMDAPLYIVHMMSAAAVQSLRNARRVAKHPIFGETLAATVGTDGSHYKNACFRHAAAHVLSPPLRDPSTPEAIIDALAQSLLVQVIASDNCTFNEKDKELGKNDFTKIPNGVNGVEDRMAILWQKAVNTGVMDPCRFVAVTSTNAANIFNLPSKGRVAVGADGDVIVWDPRLEKTISAATHHHAVHAENGGIIARNSEKLLEAGVKGPEGHAASRDDQVEAEAVNRACVIANQMDAPLYIVHMMSAAAVQSLRNARRVAKHPIFGETLAATVGTDGQAMLVSLVYNPPGSHYKNACFRHAAAHVLSPPLRDPSTPEAIIDALAHDDLQVIASDNCTFNEKDKELGKNDFTKIPNGVNGVEDRMAILWQKAVNTGVMDPCRFVAVTSTNAANIFNLPSKGRVAVGADGDVIVWDPRLEKTISAATHHHAVDFNIFEGQRVVGGPQYVIVNGRVCLDDGDLRVAEGYGKFLPTPPNSPYVYGEVPTTPQPERVEYLPSPARVTNGTPTELQISHKLEATSVSGCSTPTGRKMREPGQRNLQNSTFSISQLQISHKLEATSVSGCSTPTGRKMREPGQRNLQNSTFSISQEMEGLDTKTSVRVRNPPGGKSSGLW-