Monarch geneset OGS2.0

DPOGS208890
TranscriptDPOGS208890-TA1785 bp
ProteinDPOGS208890-PA594 aa
Genomic positionDPSCF300009 - 943928-948522
RNAseq coverage869x (Rank: top 15%)
Annotation
HeliconiusHMEL0038900.084.40% 
BombyxBGIBMGA002462-TA0.082.11% 
DrosophilaCG11089-PE0.071.79% 
EBI UniRef50UniRef50_Q9VC180.071.79%CG11089 n=23 Tax=Eumetazoa RepID=Q9VC18_DROME
NCBI RefSeqXP_967875.10.071.21%PREDICTED: similar to 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase/IMP cyclohydrolase [Tribolium castaneum]
NCBI nr blastpgi|910765820.071.21%PREDICTED: similar to 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase/IMP cyclohydrolase [Tribolium castaneum]
NCBI nr blastxgi|910765820.071.21%PREDICTED: similar to 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase/IMP cyclohydrolase [Tribolium castaneum]
Group
Gene OntologyGO:00046437.3e-177phosphoribosylaminoimidazolecarboxamide formyltransferase activity
GO:00061647.3e-177purine nucleotide biosynthetic process
GO:00039377.3e-177IMP cyclohydrolase activity
GO:00038249.1e-145catalytic activity
KEGG pathwaytca:6562410.0 
 K00602 (purH)maps-> Purine metabolism
    One carbon pool by folate
InterPro domain[2-594] IPR0026950AICARFT/IMPCHase bienzyme
[201-594] IPR0161939.1e-145Cytidine deaminase-like
[6-190] IPR0116071.6e-75Methylglyoxal synthase-like domain
[523-583] IPR0240513.2e-49AICAR transformylase domain
[479-517] IPR0240504.7e-12AICAR transformylase, insert domain
Orthology groupMCL14268 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208890-TA
ATGGCTGCTAATGGAAAATTGGCTCTGCTAAGTGTGTCAGACAAGACCGGTTTGATACCACTGGCCAAATGCCTATCAGATATAGGTCTAGGATTGGTAGGAAGTGGTGGTACTGCTACAGCTCTCCGTAATGCTGGCCTTAAAGTTTTGGATGTGTCTGATATAACTGGAGCTCCAGAGATGCTCGGAGGAAGAGTGAAGTCACTCCACCCAGCAGTACACGCTGGTATACTATCAAGGCTCACAGATTCCGATCAAGCAGACATGAAACGCCAAAAGTTTGAAATGATAAGTGTGGTAGTGTGCAATCTCTATCCATTTGTGGAGACAATATCTAAACCTAACGTCACAATACCCGATGCGGTAGAAAACATCGATATTGGTGGTGTGACGCTACTAAGAGCAGCCGCTAAAAATCACGATAGAGTCACCGTGGTGTGCGATCCATCGGACTATGACGTAGTTATGAAAGAATTGAAGGAGAGTAATGACCACCAAACCTCCTCAGAGACCAGACAAAAGCTGGCCTTGAAAGCTTTCACCCACACTTCACAATACGACACTCACATTTCGGACTATTTCCGTAAGCAATATTCTGCGGGACAGTCACAGATGACCTTGAGATATGGTATGAACCCTCATCAGAAACCAGCTCAAGTATTTGTTACCCGCGACCAACTACCGTTGACAACCCTGAATGGTTCACCTGGCTTCATCAACCTCTGTGACGCCCTGAACGCCTGGCAATTGGTCATTGAACTCAAGGAGGCGCTAGGGCTGCCAGCAGCTACAAGTTTCAAACACGTTTCACCAGCTGGCGCTGCAATTGGGCTGCCCTTAAACGAGGAGGAAGCATCAGTATGTATGGTGTCAGATCTGCTTCTAAAACTGTCTCCTCTCGCATGTGCGTACGCCAGGGCTCGTGGGGCCGACAGAATGAGTTCGTTTGGAGACTTTATAGCGATATCGGATGAATGCGACGAGATCACAGCCAGGATAATATCTCGGGAAGTATCTGATGGCATCATCGCCCCCGGATATTCACCAGCCGCATTGGAAATCCTGAAGAAGAAGAAGGCTGGCAATTACTGCGTCCTTCAGATGGATCCCAAATACACGCCGGATTTAATGGAACAAAAGACAATATTCGGTTTGACCCTGGAACAAAGACGTAACGACGCCAAGATAACCGCTGATCTGTTTAAGAATGTTGTCACCAACGAAAAGTGCCTGCCGCCTAACGCGATTAGGGATCTCATAGTAGCAACCATCGCTCTCAAATACACACAAAGCAACTCTGTGTGCTTCGCTAGAGATGGACAAGTTATAGGTATCGGTGCGGGTCAACAGTCGAGGATCCACTGCACCCGTCTCGCTGGAGGCAAGGCCGGCTTGTGGTGGACAAGAAGACATCCCAGGGTTAGCGACATGAGGTTCAAGAAAAACGTCACCAGAGCTGTCATATCTAACGCCATTGACAACTACGTCAATGGTACCATAGGTACCGACTTGCCTCTGGACCAGTGGAACAGTCTTTTCGAAGGTGAACCTCCTGCGTTGCTAACTCCCGAGGAAAGAGACGCTTGGATCAAAAAAATGGATAAAGTAGCTCTCGCATCCGACGCGTTCTTCCCATTCAGAGATAACATTGATAGAGCAGTCCAGTGCGGTGTGGAATACATCGGCAGTCCATCCGGTTCAAACAACGACAAGGAAGTCATAGATGCGTGCAACGAACACAAAATAATTCTTGCTCACACAAATTTGAGACTTTTCCATCACTAA

Protein sequence:

>DPOGS208890-PA
MAANGKLALLSVSDKTGLIPLAKCLSDIGLGLVGSGGTATALRNAGLKVLDVSDITGAPEMLGGRVKSLHPAVHAGILSRLTDSDQADMKRQKFEMISVVVCNLYPFVETISKPNVTIPDAVENIDIGGVTLLRAAAKNHDRVTVVCDPSDYDVVMKELKESNDHQTSSETRQKLALKAFTHTSQYDTHISDYFRKQYSAGQSQMTLRYGMNPHQKPAQVFVTRDQLPLTTLNGSPGFINLCDALNAWQLVIELKEALGLPAATSFKHVSPAGAAIGLPLNEEEASVCMVSDLLLKLSPLACAYARARGADRMSSFGDFIAISDECDEITARIISREVSDGIIAPGYSPAALEILKKKKAGNYCVLQMDPKYTPDLMEQKTIFGLTLEQRRNDAKITADLFKNVVTNEKCLPPNAIRDLIVATIALKYTQSNSVCFARDGQVIGIGAGQQSRIHCTRLAGGKAGLWWTRRHPRVSDMRFKKNVTRAVISNAIDNYVNGTIGTDLPLDQWNSLFEGEPPALLTPEERDAWIKKMDKVALASDAFFPFRDNIDRAVQCGVEYIGSPSGSNNDKEVIDACNEHKIILAHTNLRLFHH-