Monarch geneset OGS2.0

DPOGS200734
Transcript: DPOGS200734-TA, 1032 bp
Protein: DPOGS200734-PA, 343 aa
Genomic position: DPSCF300030 + 106157-107188
RNAseq coverage: 513x (Rank: top 24%)
Annotation
Heliconius        HMEL008955         0.0      92.60%
Bombyx            BGIBMGA001035-TA   0.0      90.42%
Drosophila        ras-PC             1e-157   77.15%
EBI UniRef50      UniRef50_P20839    2e-133   70.00%   Inosine-5'-monophosphate dehydrogenase 1 n=379 Tax=root RepID=IMDH1_HUMAN
NCBI RefSeq       XP_001861658.1     2e-159   79.82%   inosine-5'-monophosphate dehydrogenase [Culex quinquefasciatus]
NCBI nr blastp    gi|170051202       3e-158   79.82%   inosine-5'-monophosphate dehydrogenase [Culex quinquefasciatus]
NCBI nr blastx    gi|157121246       3e-153   79.06%   inosine-5-monophosphate dehydrogenase [Aedes aegypti]
Group
Gene Ontology     GO:0003938   3.2e-190   IMP dehydrogenase activity
                  GO:0055114   3.2e-190   oxidation-reduction process
                  GO:0008152   6.5e-127   metabolic process
                  GO:0003824   6.5e-127   catalytic activity
                  GO:0005515   6.5e-07    protein binding
KEGG pathway      cqu:CpipJ_CPIJ011687   5e-159
                  K00088 (E1.1.1.205, guaB) maps to:
                      Purine metabolism
                      Drug metabolism - other enzymes
InterPro domain   [1-338]     IPR018529   3.2e-190   IMP dehydrogenase-related
                  [18-338]    IPR013785   6.5e-127   Aldolase-type TIM barrel
                  [23-338]    IPR001093   1.8e-82    IMP dehydrogenase/GMP reductase
                  [110-157]   IPR000644   6.5e-07    Cystathionine beta-synthase, core
Orthology group   MCL10989   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200734-TA
ATGTCAAATATAGAAGGTGATTTGAGGGATGGTCTGTCTGCAAAAGAAATATTCGCGAATAGCGAAGGACTTACCTACAATGATTTCCTGTTGCTTCCGGGATATATAGATTTCACTGCTGAAGAGGTAGATTTGACTTCTCCTCTCACTAAGAAGATCAACATCAAGGCTCCATTGGTTTCGACGCCAATGGACACTGTGACTGAATCTGATATGGCCATAGCAATGGCCCTTTGTGGTGGTATCGGAATTATTCATCATAACTGTACTCCCGAATATCAAGCAAATGAAGTGCATAAGGTAAAGAAATACAAACATGGCTTTATCAGAGACCCCGTGTGCATGGGACCGAACAATACAGTTGCTGATGTGTTAGATGCGAAAAAGAAACATGGTTTCACTGGTTTTCCCATAACAGAAAATGGAAAGCTTGGTGGTTTTCTCATTGGTATAGTTACTTCTAGAGATATTGATTTCAGGGAAGGTTCACCAGAACTGAGTCTGAAAGAAGTTATGACACCCATTAACGAAATGATAACAGCTCAGTCTGGTGTAACATTGCAAGATGCTAATTACATACTTGAGAAAAGTAAAAAAGGAAAACTGCCCATCATAAATGGACACGGTGAACTTGTGGCGTTGATTGCGAGAACGGATCTTAAAAAGGCACGGAGTTACCCCAACGCTTCCAAAGATTCCAACAAACAATTGCTAGTAGGAGCAGCCATTGGTACACGGGAGTCGGATAAAGAACGCCTCGATCTGCTTGTGAATAATGGAGTAGATGTTATTGTTCTTGATTCCTCACAAGGTAACTCGAGTTTCCAAATAGATATGATCAAGCATATTAAAAAGAGTTATCCTGAAATTCAAGTAATTGGTGGTAATGTGGTTACAAGGATGCAGGCAAAGAACCTCATTGAAGCTGGTGCTGATGCACTGAGAGTGGGAATGGGAAGTGGTTCTATCTGTATAACCCAAGAGGTTATGGCATGTGGTTGTCCGCAAGCTACAGAAGGGCTCCGTTCTTAG

Protein sequence:

>DPOGS200734-PA
MSNIEGDLRDGLSAKEIFANSEGLTYNDFLLLPGYIDFTAEEVDLTSPLTKKINIKAPLVSTPMDTVTESDMAIAMALCGGIGIIHHNCTPEYQANEVHKVKKYKHGFIRDPVCMGPNNTVADVLDAKKKHGFTGFPITENGKLGGFLIGIVTSRDIDFREGSPELSLKEVMTPINEMITAQSGVTLQDANYILEKSKKGKLPIINGHGELVALIARTDLKKARSYPNASKDSNKQLLVGAAIGTRESDKERLDLLVNNGVDVIVLDSSQGNSSFQIDMIKHIKKSYPEIQVIGGNVVTRMQAKNLIEAGADALRVGMGSGSICITQEVMACGCPQATEGLRS-
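
Consistency check (illustrative sketch, not part of the original record):

The transcript length, protein length, and genomic coordinates listed in this record can be cross-checked with simple arithmetic. The sketch below assumes that the genomic coordinates are 1-based and inclusive and that the transcript consists of the coding sequence plus one stop codon. Both are assumptions, not statements from the record. The function and variable names are hypothetical.

```python
# Values copied from the record; names are illustrative only.
transcript_bp = 1032                      # Transcript DPOGS200734-TA
protein_aa = 343                          # Protein DPOGS200734-PA
genomic_start, genomic_end = 106157, 107188  # on DPSCF300030, + strand


def coding_length(n_residues: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode n_residues, optionally plus a stop codon."""
    return 3 * (n_residues + (1 if include_stop else 0))


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


cds_bp = coding_length(protein_aa)        # 3 * (343 + 1)
locus_bp = span_length(genomic_start, genomic_end)

print("CDS length matches transcript:", cds_bp == transcript_bp)
print("Genomic span matches transcript:", locus_bp == transcript_bp)
```

If both comparisons hold, the genomic span equals the transcript length, which would be consistent with a gene model that has a single uninterrupted coding exon under the assumptions stated above.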