New model in OGS2.0 | DPOGS211321  |
---|---|
Genomic Position | scaffold2292:- 26314-36155 |
CDS Length | 2805 |
Paired RNAseq reads   | 7443 |
Single RNAseq reads   | 17076 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA004950 (0.0) |
Best Drosophila hit   | pugilist, isoform C (0.0) |
Best Human hit | C-1-tetrahydrofolate synthase, cytoplasmic (0.0) |
Best NR hit (blastp)   | RecName: Full=C-1-tetrahydrofolate synthase, cytoplasmic; Short=C1-THF synthase; Includes: RecName: Full=Methylenetetrahydrofolate dehydrogenase; Includes: RecName: Full=Methenyltetrahydrofolate cyclohydrolase; Includes: RecName: Full=Formyltetrahydrofolate synthetase (0.0) |
Best NR hit (blastx)   | RecName: Full=C-1-tetrahydrofolate synthase, cytoplasmic; Short=C1-THF synthase; Includes: RecName: Full=Methylenetetrahydrofolate dehydrogenase; Includes: RecName: Full=Methenyltetrahydrofolate cyclohydrolase; Includes: RecName: Full=Formyltetrahydrofolate synthetase (0.0) |
GeneOntology terms    | GO:0004488 methylenetetrahydrofolate dehydrogenase (NADP+) activity; GO:0004477 methenyltetrahydrofolate cyclohydrolase activity; GO:0004329 formate-tetrahydrofolate ligase activity; GO:0005524 ATP binding; GO:0009396 folic acid and derivative biosynthetic process; GO:0005811 lipid particle |
InterPro families    | IPR020628 Formate-tetrahydrofolate ligase, FTHFS, conserved site; IPR020867 Tetrahydrofolate dehydrogenase/cyclohydrolase, conserved site; IPR000672 Tetrahydrofolate dehydrogenase/cyclohydrolase; IPR000559 Formate-tetrahydrofolate ligase, FTHFS; IPR020631 Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain; IPR020630 Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain; IPR016040 NAD(P)-binding domain |
Orthology group | MCL10876 |
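
The genomic position and CDS length in the table can be compared with a short script. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive and that `-` after the colon marks the minus strand. Neither assumption is stated in the record.

```python
import re

def parse_location(text):
    """Split a 'scaffold:strand start-end' string into its parts.

    Assumes the layout used in this record, e.g. 'scaffold2292:- 26314-36155'.
    """
    match = re.match(r"^([^:\s]+):([+-])\s*(\d+)-(\d+)$", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)

scaffold, strand, start, end = parse_location("scaffold2292:- 26314-36155")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
cds_length = 2805  # "CDS Length" value from the table

print(scaffold, strand, span)  # scaffold2292 - 9842
print(span - cds_length)       # bases in the span that are not coding
```

Under these assumptions the gene model spans 9842 bp on the scaffold, of which 2805 bp are coding. The difference would be accounted for by introns and any untranslated regions included in the model.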
Nucleotide sequence:
ATGACATGGAAAAATCAGGAGGACATTGACGCTGAAATAATATCTATAGAAAATGACCTC
CGTCAGCAAGTGGCCGTCATGGGTCAACAGCACCCTGGCTTCCAACCAAAGCTGGCTATC
GTGCAGGTCGGAGGACGAGAAGACTCCAACGTTTACATCCGAGCCAAGCTGAAAGCAGCA
GAAAACATCGGCATAGCTGCTGAACACATCAAACTTCCCAGAGAAATCTCGCAGGCTGAG
CTACTTACAAAGTTGACAGCTCTAAATGATTCGCCGTTAGTACATGGCATAATAGTTCAG
ATGCCACTGGATTCGGTCGAGAAAATCGACTCGCATCTCATCACCGACGCTGTCTCCTCG
CAGAAAGATGTTGACGGATTGAATACTGAAAACGAAGGACGTGTGGCCCTCGGTGATATG
TCAGGCTTCGTTTCTTGCACCCCAGCTGGTTGTATAGAACTCATCAAACGTACTGGAATC
TCCATCGAAGGCAAACAGGCGGTGGTTATCGGACGCAGTAGGATCGTTGGAACACCAGTA
GCTGAACTTCTCAAGTGGGAAAACGCCACTGTTACCGTTTGCCACTCGAAGACTAAGAAC
TTAAGTGAAATTACCAAAACTGCTGATATTTTAGTGGTAGCGATTGGTAAAGCAGAAATG
GTTCGTGGCTCTTGGATTAAACCGGGGGCGGTGGTAATAGACTGCGGTATTAATCCCATC
CCAGATACATCAAAACCCAGCGGCCGGAGGTTGGTAGGTGACGTGGCATATTCCGAGGCG
GTACAGGTCGCGTCGCATGTAACCCCTGTACCCGGCGGTGTGGGTCCCATGACTGTGGCT
ATGTTGATGAAAAACACCGTGTTGGCTGCTAGCAGACAACTCCAACGGATCTCTACACCC
GTGTGGCCGCTGCAGCCGCTTAGACTTAGCACGGTTTCGCCACCTCCAAGCGACATTGTT
ATAGCGCGTTCTCAAAAACCTAAATATATTAGTAAGTTGGCGGAGGAGATAGGATTGTTC
CCCAGTGAGGTGTCACAATATGGTAATACCAAGGCGAAAATATCTTTGTCTGTGCTGGAT
CGTCTCCGAGATCAGCGTGGCGGAAAATACATCGTCGTGGCTGGCATAACCCCCACTCCT
CTCGGTGAGGGTAAGAGTACGACGTTGATCGGTCTGGTGCAGGCTCTGGGTGCTCATCGC
GGAAGGAACGCCTTCGCCGTCATGCGTCAGCCCAGTCAGGGACCAACCTTCGGAGTCAAG
GGCGGAGCCGCTGGCGGAGGATACTCACAGGTCATTCCTATGGAAGATTTCAACCTTCAT
CTCACTGGTGACATTCACGCCGTTTCTGCAGCCAACAATCTCCTCGCAGCTCACATGGAT
GCCAGGATCTTCCATGAGCTAACACAAAAAGACGGTCCTCTGTATGATCGTTTGGTGCCA
GAAATTAAAGGAGTCAGAAAATTCTCCCCCATTCAGTTGAGAAGATTAAAGAGATTGGGA
ATCGAAAAGACCGATCCGAACGCCTTAACACCAGAAGAAAGAGTTAAATTTGCACGACTT
AACATTGACCCCAAAAAAGTTATGTGGAATAGAGTCGTGGATTTGAACGATAGATATTTA
CGTAAAATTACTATCGGACAATCACCCACTGAGAAAGGTTTTACCCGCGAGACTAGTTTT
GACATCGCCGTAGCGTCTGAAATTATGGCTGTGTTGGCTCTGGGCAAGGATGTGAATGAT
ATTAAGGAGAGACTCGCGAATATGGTGGTAGCTCTGGACACAAACGGCAAACCAGTAATA
GCTGATGATCTTGGCATTACAGGGGCTTTAATGGTGTTGCTTAAGGACGCATTTGAGCCC
ACATTGATGCAGACTTTGGAAGGTACTCCTGTATTGGTCCACACGGGACCGTTCGCCAAC
ATAGCTCATGGATGCTCCTCTATACTTGCCGATAAGATAGCCATGAAACTGGCCCGAGAA
AATGGCTATGTGGCAACTGAAGCCGGCTTTGGATCTGACATCGGTATGGAAAAGTTCTTT
GATATAAAGTGTCGTTCGAGCGGCGACACCCCTCACTGCGCTGTCATCGTGAGTACAGTC
CGCGCGCTCAAGATGCACGGCGGAGGACCTACCGTCAGCCCTGGACAACCGCTCCACTCA
GTATATGTCCAAGAAAACTTGGAACTGCTTAGCAAAGGACTGTGCAATTTAGGAAAACAC
ATCAGCAACGGCAATAAGTTTGGCGTTCCTGTCGTTATTGCTGTTAACAAACACGGAAAC
GACACAGAAGCAGAACTGAACATGGTTAGAGAATATGCCTTGAAAAATGGAGCATTCCGT
GCTGTTATTTGCGATCACTGGGCTAAGGGAGGCGCTGGCGCCTTGGAACTAGCGGACGCG
GTCGTAGAAGCCTGCGACCGTCCCTCGAACTTCCAATATCTCTATCCATTGGAAATGACG
ATCCAAGATAAAATTAAGAAGATCGCTGTAGAGATGTACGGAGCTGGGACAGTGGAATAC
ACAGATGTGGTTTTGGAGAAAATTAAAGTTTTGAATGATAGGGGCTACGATAAGCTGGCG
ATATGTATGGCCAAGACTTCTAATTCGCTGACCGGCGACCCCAGTATCAAGGGTGCTCCT
ACCGGATTCACTCTTCGTATCAATGATATTTTCGTGTCTGCGGGCGCTGGTTTTATTGTT
CCTATGGTTGGCGAGATATCCAAAATGCCTGGCCTTCCTACAAGACCCAGCATCTACGAT
ATAGATCTGAACACCGAGACCGGTGAAATCGATGGCCTTTTTTAA
Protein sequence:
MTWKNQEDIDAEIISIENDLRQQVAVMGQQHPGFQPKLAIVQVGGREDSNVYIRAKLKAA
ENIGIAAEHIKLPREISQAELLTKLTALNDSPLVHGIIVQMPLDSVEKIDSHLITDAVSS
QKDVDGLNTENEGRVALGDMSGFVSCTPAGCIELIKRTGISIEGKQAVVIGRSRIVGTPV
AELLKWENATVTVCHSKTKNLSEITKTADILVVAIGKAEMVRGSWIKPGAVVIDCGINPI
PDTSKPSGRRLVGDVAYSEAVQVASHVTPVPGGVGPMTVAMLMKNTVLAASRQLQRISTP
VWPLQPLRLSTVSPPPSDIVIARSQKPKYISKLAEEIGLFPSEVSQYGNTKAKISLSVLD
RLRDQRGGKYIVVAGITPTPLGEGKSTTLIGLVQALGAHRGRNAFAVMRQPSQGPTFGVK
GGAAGGGYSQVIPMEDFNLHLTGDIHAVSAANNLLAAHMDARIFHELTQKDGPLYDRLVP
EIKGVRKFSPIQLRRLKRLGIEKTDPNALTPEERVKFARLNIDPKKVMWNRVVDLNDRYL
RKITIGQSPTEKGFTRETSFDIAVASEIMAVLALGKDVNDIKERLANMVVALDTNGKPVI
ADDLGITGALMVLLKDAFEPTLMQTLEGTPVLVHTGPFANIAHGCSSILADKIAMKLARE
NGYVATEAGFGSDIGMEKFFDIKCRSSGDTPHCAVIVSTVRALKMHGGGPTVSPGQPLHS
VYVQENLELLSKGLCNLGKHISNGNKFGVPVVIAVNKHGNDTEAELNMVREYALKNGAFR
AVICDHWAKGGAGALELADAVVEACDRPSNFQYLYPLEMTIQDKIKKIAVEMYGAGTVEY
TDVVLEKIKVLNDRGYDKLAICMAKTSNSLTGDPSIKGAPTGFTLRINDIFVSAGAGFIV
PMVGEISKMPGLPTRPSIYDIDLNTETGEIDGLF
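
The nucleotide and protein sequences above can be checked against each other. The sketch below is a minimal illustration, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical. It translates only the first and last lines of the nucleotide sequence, which are in frame because every preceding line holds 60 bases.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)

# First and last lines of the nucleotide sequence in this record.
first_line = "ATGACATGGAAAAATCAGGAGGACATTGACGCTGAAATAATATCTATAGAAAATGACCTC"
last_line = "ATAGATCTGAACACCGAGACCGGTGAAATCGATGGCCTTTTTTAA"

print(translate(first_line))  # MTWKNQEDIDAEIISIENDL
print(translate(last_line))   # IDLNTETGEIDGLF

# Length check: 934 residues plus one stop codon should equal the CDS length.
cds_length = 2805
protein_length = 934
print(3 * (protein_length + 1) == cds_length)  # True
```

Both translated fragments match the start and end of the protein sequence, and the stated CDS length of 2805 equals three times the 934 residues plus the terminal TAA stop codon.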