New model in OGS2.0 | DPOGS206359  |
---|---|
Genomic Position | scaffold232:- 66308-79619 |
See gene structure | |
CDS Length | 4095 |
Paired RNAseq reads   | 7475 |
Single RNAseq reads   | 16858 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA014119 (0.0) |
Best Drosophila hit   | adenosine 3, isoform A (0.0) |
Best Human hit | trifunctional purine biosynthetic protein adenosine-3 isoform 1 (0.0) |
Best NR hit (blastp)   | AGAP009786-PA [Anopheles gambiae str. PEST] (0.0) |
Best NR hit (blastx)   | AGAP009786-PA [Anopheles gambiae str. PEST] (0.0) |
GeneOntology terms    | GO:0004641 phosphoribosylformylglycinamidine cyclo-ligase activity GO:0006189 'de novo' IMP biosynthetic process GO:0004637 phosphoribosylamine-glycine ligase activity GO:0004644 phosphoribosylglycinamide formyltransferase activity GO:0005737 cytoplasm GO:0009113 purine base biosynthetic process GO:0005524 ATP binding |
InterPro families    | IPR000115 Phosphoribosylglycinamide synthetase IPR004733 Phosphoribosylformylglycinamidine cyclo-ligase IPR004607 Phosphoribosylglycinamide formyltransferase IPR020561 Phosphoribosylglycinamide synthetase, ATP-grasp (A) domain IPR002376 Formyl transferase, N-terminal IPR010918 AIR synthase-related protein, C-terminal IPR020562 Phosphoribosylglycinamide synthetase, N-domain IPR000728 AIR synthase-related protein IPR020560 Phosphoribosylglycinamide synthetase, C-domain IPR013817 Pre-ATP-grasp fold IPR013815 ATP-grasp fold, subdomain 1 IPR013816 ATP-grasp fold, subdomain 2 IPR020559 Phosphoribosylglycinamide synthetase, conserved site IPR001555 Phosphoribosylglycinamide formyltransferase, active site IPR011761 ATP-grasp fold IPR016188 PurM, N-terminal-like IPR016185 PreATP-grasp-like fold IPR011054 Rudiment single hybrid motif |
Orthology group | MCL13595 |
Nucleotide sequence:
ATGTCGGCAAACGTGCTCGTTATCGGTGGTGGAGGCAGGGAACATGCAATTTGTTGGAAA
TTAGCAGATTCTCCATTAATTTGTAAAATATTCTGTGCGCCCGGTAGTGTCGGTATATCG
GCGACGAAGGAAAATGTTGAATGTGTGGATTTGAATATTAAGGATTTTCCGGGTCTGGCG
AAGTGGTGCAAAGACAAGTCAATTGATCTCGTCATCATTGGACCCGAAGACCCTCTTGCT
AATGGCATTGTGGATGCCCTCGAACCAGCTGGCATCAAATGCTTCGGACCCACAAAAGCC
GGTGCACAGATCGAGGCCAATAAAGACTGGTCCAAGAAATTTATGAACAAATATCAAATA
CCAACGGCAAGATACCAATCATTCACAGATGCTGAAGCTGCTAAGCAATTCATTAAGAGT
GCACCGTTTAGAGCGTTAGTCGTGAAAGCATCCGGTTTAGCGGCTGGGAAAGGAGTTGTG
GTAGCGAGCAATGTAGATGAGGCCTGTGCAGCGGTGGATGAGATCTTAACCGAAGCCAAG
TACGGAACTGCCGGGCAGGTGGTCGTGGTTGAAGAACTGTTGGAAGGGGAAGAGGTTTCG
GTGCTGGCATTCACGGATGGGAACACGGTGTCCATGATGCCCCCCGCCCAGGATCACAAG
CGCATCGGCGAGGGGGACACGGGACCCAACACGGGGGGTATGGGGGCATATTGTCCCTGT
CCGCTTATCACCCCAGAACAGCTGGCCGATGTCAAGGATCAAGTGTTACAGAGAGCTGTG
GACGGGCTCAGGGCTGAGGGCATCAAATATGTCGGGGTCCTCTACGCTGGTCTTATGGTG
ACCAAGTCTGGTCCAATGACCCTCGAGTTTAACTGCCGCTTCGGAGACCCAGAGACACAG
GTCCTCATGATGTTGCTGGAATCGGATCTCTACTCCGTCATTAAGGCGTGTGTGAGCGGT
ACATTAAAGGAGACTCCGGTCAAATGGAACACCTCAATGTCGGCCGTGGGAGTGGTGATC
GCTTCCAAAGGTTACCCTGAGAGCTCCACCAAGGGCTGTGTCATAAGCGGTCTATCCCAG
GTGTCCAGGGAAGACGTGGTCGTGTTCATGAGCGGAGTGTCCCGTGGAGCAAACGACTCG
CTGGTCACTGCCGGCGGCAGAGTGCTGCTGCTGGCGGCGAGGAGAGCCGACCTCCGGACT
GCCGCCGCCGCTGCGACACGAGCCGCCGCCGCCGTGGACTTCCCCGGGAAACAGTACAGG
AAGGACATCGCCAGGCGGGCCTTCTGCAAAATGAACGGCCTGTCATACCTGGAGAGCGGT
GTGGACATTGAAGCGGCGGCGGCCTTGGTCCGTCTGATGGAGCCCCTGGCCACGGGGACC
CACAGACCCGGGGTACTTGGAAGGCTGGGCTGCTACAGCGGACTGTTCCAACTGGCGGCC
GTGGACCCCGGCCTCACAGACCCCGTGCTGGTCCAGGGAACGGACGGCGTTGGAACCAAG
GTCAAGATAGCAGAGATGATGCAGAAGTACGACACCATCGGCCAGGACCTGGTGGCCATG
TGCGTCAACGACATCCTGTGTGCGGGCGCCGAGCCCTTCGCCTTCCTGGACTACCTGGCG
TGCGGCCGGCTGCAGCTGCACACCTCCACCACCATCGTCAAGGGAATCGCCGACGCCTGC
GTCATGGCCGGCTGTGCTTTGTTGGGGGGAGAGACGGCGGAGATGCCGAGTATGTATGAC
GTGGGGAAGTACGATTTGGCGGGGTTCGCGGTGGGCGTGGTGGACAACCTCAAGCAGCTG
CCCCGCTACAAGGAGATACGACCCGGGGACGTGGTGCTCGCGCTGCCCTCCACCGGCGTG
CACAGCAACGGGTACAGCCTCGTGCAGAGGATCATGGCTGAAAGTGGACATAGTTTCTAC
GAAAAGGCTCCGTTCAGTAAATCCAACAAGAACTTCGGCGAGGAGTTCCTGGAGCCGACC
GGTATCTACGTGAAGGCCCTCCTGCCGGCCATCAAGAAAGGCCTCGTGAAGGGCCTGGCA
CACATCACCGGGGGAGGCCTCCTGGAGAATATCCCCAGGATACTACCGCCCGGCGTCAGG
GTCAGGCTCGACGCCACTAAGTTCCAGATAAACCCTATCTTCGGCTGGCTGCAAGCTAAA
GGGATGGTGTCGGACTTCGAGATGCTGCGCACGTTCAACTGCGGTGTCGGCATGGTGGTG
GTGGCGGACCCCGTGCTGGTGAGCGAGCTGGTCGCCGCCGTGGACGGCACCATCAGTGTG
GTCGGGCAGCTGGAGGACATGAGAACGGAAGGAGGTCAGCAGGTCATAGTGGAGAACTTC
CAGCAGGCCATGTCCCCTCTGACGTCACCCTACTCGTCCGCCAGTCCGTGTCAGAAGTCA
CTCTCCTACAAGGACAGCGGGGTCGACATCGAGGCCGGGGACTCGCTGGTGTCACTCATA
AAGCCTTTGGCTAGATCCACGTCTCGATCCGGGGTCCTGGGAGGTCTTGGAGGCTTCGGC
GGGTGTTTCCAGCTGAAGGCTGTGGAGCAGGAGTATAAGGACCCGGTGCTGGTGGTGGCG
GCGGACGGTGTGGGCACCAAGCTGCGCGTGGCTCAGAAGATGAACCGACACGCCACCATC
GGCGTTGACCTGGTGGCCATGTGCGTCAACGACATCCTGTGCAACGGCGCCGCGCCGCTC
ACCTTCCTGGACTACTTCGCCTGCGGAGCCCTGGACGTGACCGTGGCCAGGGACGTCGTG
GCCGGGGTCGCGGACGGCTGCAAGCAGTCCTCAGCGGCTCTCATCGGCGGAGAGACGGCG
GAGATGCCCGGCATGTACGAGGCCGGCGTGTACGACATCGCAGGGTTCGCGCTGGGAGTG
GTGGAGAGGGACAACATACTGCCGAAGATCAACGACATCAATGTTGGCGACACGATAATA
GGTCTGCCATCGAACGGCGTCCACAGCAACGGGTTCAGTCTCATCCACAGCCTCATGAAG
AAGGCCGGTCTGAGTCTCAACGACAAGGCGCCCTTCAGCGAGGAAGGACTCACTCTCGGC
CAGGAGCTGATCAAGCCGACCCGCATCTACGTCCGCAGCGTGCTGCCGGCGCTGCAGCGC
GGCGTGGTGAAGGCGGTGGCGCACATCACGGGCGGCGGCCTCATGGAGAACATCCCGCGC
ATCATGCCGGACTCCGTGCGGGCCCGCCTCAACGCGCACTGGTGGAAAGTTCACCCTGTG
TTCGCGTGGATCGCGGAGACCGGCGAGGTCAAGAACGACGAGATGCTGAGGACATTCAAC
TGCGGCATCGGCTTGGTGCTGATAGTGTCTCCGGAACACCAGGCAGAGGTGATGAACATC
ACTCGCTCGCACGGCGCGATGGTGATCGGCTCCATACAAGCCCGGCCCCCGGGCGGCGCT
CGCGTGCTCGTCGACAACTTCACCTCCGCGCTGGACTTCACGAGGCGGATGCCGCACCTC
ACTAAGAAGAGGGTGGCGGTGCTGGTGTCGGGTAGCGGCAGCAACCTGCAGGCGCTCATG
GACAGTGCGTCGGACCCCGCCCAGTGCATGTGTGCGGAGGTGGCGCTCGTCGTCAGCAAC
AAACCCGACGCCTTCGCCCTCAAACGGGCCCGGGACGCCGGCGTCAACACGCTGGTGCTG
AGTCACAAGGACTACTCCAGCCGCGAGGAGTACGACCGCGCCCTCAGCGCCGCCCTGGAC
GCGCACCGGATCGACCTCGTGTGTCTGGCCGGCTTCATGAGGATACTCACGCCGGGCTTC
GTTAAGAAGTGGAAGGGTCGCCTCATCAACATCCACCCGTCCCTGCTGCCGGCCCACCCT
GGACTCCACGCTCAGAGACAGTGTCTACAGGCGGGAGACAAGGAGTCGGGCTGCACCGTA
CACTTCGTCGACGAGGGCATGGACACGGGTCCGATCATTCTCCAGGAGCGCGTGCCGGTG
ATGCCGGGAGACACGGAGCAGGTTCTCAGTGACAGGATCCTGTCCGCGGAACACCGCGCC
TACCCTCAGGCGCTCAGACTGCTCGCTACGGGCCGGGTCCGGCTACATGAGGACACTATC
ATATGGCATTCATGA
Protein sequence:
MSANVLVIGGGGREHAICWKLADSPLICKIFCAPGSVGISATKENVECVDLNIKDFPGLA
KWCKDKSIDLVIIGPEDPLANGIVDALEPAGIKCFGPTKAGAQIEANKDWSKKFMNKYQI
PTARYQSFTDAEAAKQFIKSAPFRALVVKASGLAAGKGVVVASNVDEACAAVDEILTEAK
YGTAGQVVVVEELLEGEEVSVLAFTDGNTVSMMPPAQDHKRIGEGDTGPNTGGMGAYCPC
PLITPEQLADVKDQVLQRAVDGLRAEGIKYVGVLYAGLMVTKSGPMTLEFNCRFGDPETQ
VLMMLLESDLYSVIKACVSGTLKETPVKWNTSMSAVGVVIASKGYPESSTKGCVISGLSQ
VSREDVVVFMSGVSRGANDSLVTAGGRVLLLAARRADLRTAAAAATRAAAAVDFPGKQYR
KDIARRAFCKMNGLSYLESGVDIEAAAALVRLMEPLATGTHRPGVLGRLGCYSGLFQLAA
VDPGLTDPVLVQGTDGVGTKVKIAEMMQKYDTIGQDLVAMCVNDILCAGAEPFAFLDYLA
CGRLQLHTSTTIVKGIADACVMAGCALLGGETAEMPSMYDVGKYDLAGFAVGVVDNLKQL
PRYKEIRPGDVVLALPSTGVHSNGYSLVQRIMAESGHSFYEKAPFSKSNKNFGEEFLEPT
GIYVKALLPAIKKGLVKGLAHITGGGLLENIPRILPPGVRVRLDATKFQINPIFGWLQAK
GMVSDFEMLRTFNCGVGMVVVADPVLVSELVAAVDGTISVVGQLEDMRTEGGQQVIVENF
QQAMSPLTSPYSSASPCQKSLSYKDSGVDIEAGDSLVSLIKPLARSTSRSGVLGGLGGFG
GCFQLKAVEQEYKDPVLVVAADGVGTKLRVAQKMNRHATIGVDLVAMCVNDILCNGAAPL
TFLDYFACGALDVTVARDVVAGVADGCKQSSAALIGGETAEMPGMYEAGVYDIAGFALGV
VERDNILPKINDINVGDTIIGLPSNGVHSNGFSLIHSLMKKAGLSLNDKAPFSEEGLTLG
QELIKPTRIYVRSVLPALQRGVVKAVAHITGGGLMENIPRIMPDSVRARLNAHWWKVHPV
FAWIAETGEVKNDEMLRTFNCGIGLVLIVSPEHQAEVMNITRSHGAMVIGSIQARPPGGA
RVLVDNFTSALDFTRRMPHLTKKRVAVLVSGSGSNLQALMDSASDPAQCMCAEVALVVSN
KPDAFALKRARDAGVNTLVLSHKDYSSREEYDRALSAALDAHRIDLVCLAGFMRILTPGF
VKKWKGRLINIHPSLLPAHPGLHAQRQCLQAGDKESGCTVHFVDEGMDTGPIILQERVPV
MPGDTEQVLSDRILSAEHRAYPQALRLLATGRVRLHEDTIIWHS