DPGLEAN07138 in OGS1.0

New model in OGS2.0DPOGS211321 
Genomic Positionscaffold2292:- 26314-36155
See gene structure
CDS Length2805
Paired RNAseq reads  7443
Single RNAseq reads  17076
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004950 (0.0)
Best Drosophila hit  pugilist, isoform C (0.0)
Best Human hitC-1-tetrahydrofolate synthase, cytoplasmic (0.0)
Best NR hit (blastp)  RecName: Full=C-1-tetrahydrofolate synthase, cytoplasmic; Short=C1-THF synthase; Includes: RecName: Full=Methylenetetrahydrofolate dehydrogenase; Includes: RecName: Full=Methenyltetrahydrofolate cyclohydrolase; Includes: RecName: Full=Formyltetrahydrofolate synthetase (0.0)
Best NR hit (blastx)  RecName: Full=C-1-tetrahydrofolate synthase, cytoplasmic; Short=C1-THF synthase; Includes: RecName: Full=Methylenetetrahydrofolate dehydrogenase; Includes: RecName: Full=Methenyltetrahydrofolate cyclohydrolase; Includes: RecName: Full=Formyltetrahydrofolate synthetase (0.0)
GeneOntology terms




  
GO:0004488 methylenetetrahydrofolate dehydrogenase (NADP+) activity
GO:0004477 methenyltetrahydrofolate cyclohydrolase activity
GO:0004329 formate-tetrahydrofolate ligase activity
GO:0005524 ATP binding
GO:0009396 folic acid and derivative biosynthetic process
GO:0005811 lipid particle
InterPro families





  
IPR020628 Formate-tetrahydrofolate ligase, FTHFS, conserved site
IPR020867 Tetrahydrofolate dehydrogenase/cyclohydrolase, conserved site
IPR000672 Tetrahydrofolate dehydrogenase/cyclohydrolase
IPR000559 Formate-tetrahydrofolate ligase, FTHFS
IPR020631 Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain
IPR020630 Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain
IPR016040 NAD(P)-binding domain
Orthology groupMCL10876

Nucleotide sequence:

ATGACATGGAAAAATCAGGAGGACATTGACGCTGAAATAATATCTATAGAAAATGACCTC
CGTCAGCAAGTGGCCGTCATGGGTCAACAGCACCCTGGCTTCCAACCAAAGCTGGCTATC
GTGCAGGTCGGAGGACGAGAAGACTCCAACGTTTACATCCGAGCCAAGCTGAAAGCAGCA
GAAAACATCGGCATAGCTGCTGAACACATCAAACTTCCCAGAGAAATCTCGCAGGCTGAG
CTACTTACAAAGTTGACAGCTCTAAATGATTCGCCGTTAGTACATGGCATAATAGTTCAG
ATGCCACTGGATTCGGTCGAGAAAATCGACTCGCATCTCATCACCGACGCTGTCTCCTCG
CAGAAAGATGTTGACGGATTGAATACTGAAAACGAAGGACGTGTGGCCCTCGGTGATATG
TCAGGCTTCGTTTCTTGCACCCCAGCTGGTTGTATAGAACTCATCAAACGTACTGGAATC
TCCATCGAAGGCAAACAGGCGGTGGTTATCGGACGCAGTAGGATCGTTGGAACACCAGTA
GCTGAACTTCTCAAGTGGGAAAACGCCACTGTTACCGTTTGCCACTCGAAGACTAAGAAC
TTAAGTGAAATTACCAAAACTGCTGATATTTTAGTGGTAGCGATTGGTAAAGCAGAAATG
GTTCGTGGCTCTTGGATTAAACCGGGGGCGGTGGTAATAGACTGCGGTATTAATCCCATC
CCAGATACATCAAAACCCAGCGGCCGGAGGTTGGTAGGTGACGTGGCATATTCCGAGGCG
GTACAGGTCGCGTCGCATGTAACCCCTGTACCCGGCGGTGTGGGTCCCATGACTGTGGCT
ATGTTGATGAAAAACACCGTGTTGGCTGCTAGCAGACAACTCCAACGGATCTCTACACCC
GTGTGGCCGCTGCAGCCGCTTAGACTTAGCACGGTTTCGCCACCTCCAAGCGACATTGTT
ATAGCGCGTTCTCAAAAACCTAAATATATTAGTAAGTTGGCGGAGGAGATAGGATTGTTC
CCCAGTGAGGTGTCACAATATGGTAATACCAAGGCGAAAATATCTTTGTCTGTGCTGGAT
CGTCTCCGAGATCAGCGTGGCGGAAAATACATCGTCGTGGCTGGCATAACCCCCACTCCT
CTCGGTGAGGGTAAGAGTACGACGTTGATCGGTCTGGTGCAGGCTCTGGGTGCTCATCGC
GGAAGGAACGCCTTCGCCGTCATGCGTCAGCCCAGTCAGGGACCAACCTTCGGAGTCAAG
GGCGGAGCCGCTGGCGGAGGATACTCACAGGTCATTCCTATGGAAGATTTCAACCTTCAT
CTCACTGGTGACATTCACGCCGTTTCTGCAGCCAACAATCTCCTCGCAGCTCACATGGAT
GCCAGGATCTTCCATGAGCTAACACAAAAAGACGGTCCTCTGTATGATCGTTTGGTGCCA
GAAATTAAAGGAGTCAGAAAATTCTCCCCCATTCAGTTGAGAAGATTAAAGAGATTGGGA
ATCGAAAAGACCGATCCGAACGCCTTAACACCAGAAGAAAGAGTTAAATTTGCACGACTT
AACATTGACCCCAAAAAAGTTATGTGGAATAGAGTCGTGGATTTGAACGATAGATATTTA
CGTAAAATTACTATCGGACAATCACCCACTGAGAAAGGTTTTACCCGCGAGACTAGTTTT
GACATCGCCGTAGCGTCTGAAATTATGGCTGTGTTGGCTCTGGGCAAGGATGTGAATGAT
ATTAAGGAGAGACTCGCGAATATGGTGGTAGCTCTGGACACAAACGGCAAACCAGTAATA
GCTGATGATCTTGGCATTACAGGGGCTTTAATGGTGTTGCTTAAGGACGCATTTGAGCCC
ACATTGATGCAGACTTTGGAAGGTACTCCTGTATTGGTCCACACGGGACCGTTCGCCAAC
ATAGCTCATGGATGCTCCTCTATACTTGCCGATAAGATAGCCATGAAACTGGCCCGAGAA
AATGGCTATGTGGCAACTGAAGCCGGCTTTGGATCTGACATCGGTATGGAAAAGTTCTTT
GATATAAAGTGTCGTTCGAGCGGCGACACCCCTCACTGCGCTGTCATCGTGAGTACAGTC
CGCGCGCTCAAGATGCACGGCGGAGGACCTACCGTCAGCCCTGGACAACCGCTCCACTCA
GTATATGTCCAAGAAAACTTGGAACTGCTTAGCAAAGGACTGTGCAATTTAGGAAAACAC
ATCAGCAACGGCAATAAGTTTGGCGTTCCTGTCGTTATTGCTGTTAACAAACACGGAAAC
GACACAGAAGCAGAACTGAACATGGTTAGAGAATATGCCTTGAAAAATGGAGCATTCCGT
GCTGTTATTTGCGATCACTGGGCTAAGGGAGGCGCTGGCGCCTTGGAACTAGCGGACGCG
GTCGTAGAAGCCTGCGACCGTCCCTCGAACTTCCAATATCTCTATCCATTGGAAATGACG
ATCCAAGATAAAATTAAGAAGATCGCTGTAGAGATGTACGGAGCTGGGACAGTGGAATAC
ACAGATGTGGTTTTGGAGAAAATTAAAGTTTTGAATGATAGGGGCTACGATAAGCTGGCG
ATATGTATGGCCAAGACTTCTAATTCGCTGACCGGCGACCCCAGTATCAAGGGTGCTCCT
ACCGGATTCACTCTTCGTATCAATGATATTTTCGTGTCTGCGGGCGCTGGTTTTATTGTT
CCTATGGTTGGCGAGATATCCAAAATGCCTGGCCTTCCTACAAGACCCAGCATCTACGAT
ATAGATCTGAACACCGAGACCGGTGAAATCGATGGCCTTTTTTAA

Protein sequence:

MTWKNQEDIDAEIISIENDLRQQVAVMGQQHPGFQPKLAIVQVGGREDSNVYIRAKLKAA
ENIGIAAEHIKLPREISQAELLTKLTALNDSPLVHGIIVQMPLDSVEKIDSHLITDAVSS
QKDVDGLNTENEGRVALGDMSGFVSCTPAGCIELIKRTGISIEGKQAVVIGRSRIVGTPV
AELLKWENATVTVCHSKTKNLSEITKTADILVVAIGKAEMVRGSWIKPGAVVIDCGINPI
PDTSKPSGRRLVGDVAYSEAVQVASHVTPVPGGVGPMTVAMLMKNTVLAASRQLQRISTP
VWPLQPLRLSTVSPPPSDIVIARSQKPKYISKLAEEIGLFPSEVSQYGNTKAKISLSVLD
RLRDQRGGKYIVVAGITPTPLGEGKSTTLIGLVQALGAHRGRNAFAVMRQPSQGPTFGVK
GGAAGGGYSQVIPMEDFNLHLTGDIHAVSAANNLLAAHMDARIFHELTQKDGPLYDRLVP
EIKGVRKFSPIQLRRLKRLGIEKTDPNALTPEERVKFARLNIDPKKVMWNRVVDLNDRYL
RKITIGQSPTEKGFTRETSFDIAVASEIMAVLALGKDVNDIKERLANMVVALDTNGKPVI
ADDLGITGALMVLLKDAFEPTLMQTLEGTPVLVHTGPFANIAHGCSSILADKIAMKLARE
NGYVATEAGFGSDIGMEKFFDIKCRSSGDTPHCAVIVSTVRALKMHGGGPTVSPGQPLHS
VYVQENLELLSKGLCNLGKHISNGNKFGVPVVIAVNKHGNDTEAELNMVREYALKNGAFR
AVICDHWAKGGAGALELADAVVEACDRPSNFQYLYPLEMTIQDKIKKIAVEMYGAGTVEY
TDVVLEKIKVLNDRGYDKLAICMAKTSNSLTGDPSIKGAPTGFTLRINDIFVSAGAGFIV
PMVGEISKMPGLPTRPSIYDIDLNTETGEIDGLF