Monarch geneset OGS2.0

DPOGS208829
TranscriptDPOGS208829-TA1362 bp
ProteinDPOGS208829-PA453 aa
Genomic positionDPSCF300036 + 697837-701540
RNAseq coverage1537x (Rank: top 8%)
Annotation
HeliconiusHMEL0041890.074.53% 
BombyxBGIBMGA007935-TA0.084.29% 
Drosophilaade5-PA6e-17463.90% 
EBI UniRef50UniRef50_P380249e-13554.02%Multifunctional protein ADE2 n=8 Tax=Amniota RepID=PUR6_CHICK
NCBI RefSeqNP_001040376.10.084.29%phosphoribosylaminoimidazole carboxylase, phosphoribosylaminoimidazole succinocarboxamide synthetase [Bombyx mori]
NCBI nr blastpgi|1140513250.084.29%phosphoribosylaminoimidazole carboxylase, phosphoribosylaminoimidazole succinocarboxamide synthetase [Bombyx mori]
NCBI nr blastxgi|1140513250.084.29%phosphoribosylaminoimidazole carboxylase, phosphoribosylaminoimidazole succinocarboxamide synthetase [Bombyx mori]
Group
Gene OntologyGO:00046395.8e-246phosphoribosylaminoimidazolesuccinocarboxamide synthase activity
GO:00061645.8e-246purine nucleotide biosynthetic process
GO:00061891e-37'de novo' IMP biosynthetic process
GO:00046381e-37phosphoribosylaminoimidazole carboxylase activity
GO:00055247.2e-25ATP binding
GO:00168747.2e-25ligase activity
KEGG pathwayaag:AaeL_AAEL0036067e-179 
 K01587 (PAICS)maps-> Purine metabolism
InterPro domain[10-453] IPR0016365.8e-246SAICAR synthetase
[297-453] IPR0000311e-37Phosphoribosylaminoimidazole carboxylase, core
[164-265] IPR0138167.2e-25ATP-grasp fold, subdomain 2
Orthology groupMCL13106 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208829-TA
ATGGCTCTCCCAAAAGAAGCTGGTGGCTATAAGCTCGGGAAATTGCTGATCGAAGGAAAGACTAAACAAGTTTTTGATCTTCCTGGGGAGCCGGGCTTCTGTCTTCTGCTGAACAAGGACAGGATCACAGCTGGAGATGGTGTCAAGGCTCACGATATGGAGGGCAAAGCCGCCATATCAAATTTGACAAACGCCAAAGTTTTCGAAATACTTAAGGCCGCCGGAATCAAGACCGCTTTCGTAAAGCTTGCCTCAGAGACGGCGTTTGTATCGAAGAAATGCGATATGGTACCCATCGAATGGGTCACCAGGCGACTGGCGACTGGTTCCTTCCTCAAGCGCAACCCCGGTGTTCCCGAAGGTTTCAGATTCACGCCGCCCAAGCAGGAAACCTTCTTCAAAGACGACGCCAACCACGACCCTCAGTGGTCTGAGGAGCAGATCGTGTCTGCTAATTTTAAAGTCAACGGCCTCGTCATCGAACATAAAAGACAAGAAGCCAACCACGACCCTCAGTGGTCCGAGGAGCAGATCGTGTCTGCTAATTTTAAAGTCAACGGCCTCGTCATAGGTCAGGACGAGGTGGACTACATGCGGAAGGTAACAATCCTCGTGTTTGAAGTCTTGGAGAAGGCGTGGGCGTTACGTGACTGTGCCCTTATCGATATGAAAATTGAGTTTGGAGTTGATGCTGAAGGTAACATTCTCTTGGCTGATGTCATTGATTCCGACTCCTGGAGACTTTGGCCGTCAGGTGACAAGAGACTGATGGTAGACAAACAGGTGTACAGAAACCTGTCGAATGTAACAGCCGCGGACCTCGACACTGTGAAGCGTAACTTCTCATGGGTCAAAGACCAGCTCGACCACCTGAAGCCTTCGATTCATCACAAGGTCGTTATCTTCATGGGCTCGCCAGCTGACCAGGAACATTGTCAGAAGATTGCTAAAGCCGCCCGGGAGTTCGGCCTGGACGTGGACCTCCGGGTGACGTCAGCCCACAAGGCGACGGAAGAGACTCTGCGTATCATGCAGCGCTACGAGGACACGCATGGAGCGCTCGTCTTCATAGCGGTGGCTGGCCGTTCTAACGGTCTGGGACCGGTTCTATCTGGCAACACTTCCTATCCTGTCATCAACTGCCCGCCGCCGTCTGATAAACTTGTCCAGGACATCTGGTCTTCCCTGTCTGTGCCTTCCGGTCTGGGCTGTGCGACGGTCATCTACCCCGACAGCGCTGCCCTTATGGCCGCTCAAATAATCGGTCTTCAGGACTACCTCGTGTGGGCGCGTCTTAGGGCGAAGCAGCTCGATATGGCAACATCTCTTAGGGCTGCCGACAAGAAGATCCGTAACCTCTGA

Protein sequence:

>DPOGS208829-PA
MALPKEAGGYKLGKLLIEGKTKQVFDLPGEPGFCLLLNKDRITAGDGVKAHDMEGKAAISNLTNAKVFEILKAAGIKTAFVKLASETAFVSKKCDMVPIEWVTRRLATGSFLKRNPGVPEGFRFTPPKQETFFKDDANHDPQWSEEQIVSANFKVNGLVIEHKRQEANHDPQWSEEQIVSANFKVNGLVIGQDEVDYMRKVTILVFEVLEKAWALRDCALIDMKIEFGVDAEGNILLADVIDSDSWRLWPSGDKRLMVDKQVYRNLSNVTAADLDTVKRNFSWVKDQLDHLKPSIHHKVVIFMGSPADQEHCQKIAKAAREFGLDVDLRVTSAHKATEETLRIMQRYEDTHGALVFIAVAGRSNGLGPVLSGNTSYPVINCPPPSDKLVQDIWSSLSVPSGLGCATVIYPDSAALMAAQIIGLQDYLVWARLRAKQLDMATSLRAADKKIRNL-