DPGLEAN07248 in OGS1.0

New model in OGS2.0  DPOGS215487
Genomic Position  scaffold841:+ 32464-36511
CDS Length  2409
Paired RNAseq reads  217
Single RNAseq reads  521
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA007490 (1e-90)
Best Drosophila hit  glyceraldehyde 3 phosphate dehydrogenase 1, isoform B (4e-87)
Best Human hit  glyceraldehyde-3-phosphate dehydrogenase (7e-82)
Best NR hit (blastp)  multivalent antigen sj97-GAPDH [synthetic construct] (4e-97)
Best NR hit (blastx)  glyceraldehyde 3-phosphate dehydrogenase [Schistosoma japonicum] (9e-91)
GeneOntology terms
GO:0009986 cell surface
GO:0051701 interaction with host
GO:0009277 fungal-type cell wall
GO:0005515 protein binding
GO:0044416 induction by symbiont of host defense response
GO:0006096 glycolysis
GO:0004365 glyceraldehyde-3-phosphate dehydrogenase (phosphorylating) activity
GO:0001968 fibronectin binding
GO:0005737 cytoplasm
GO:0050840 extracellular matrix binding
GO:0043236 laminin binding
GO:0007160 cell-matrix adhesion
GO:0044406 adhesion to host
GO:0030445 yeast-form cell wall
GO:0030446 hyphal cell wall
GO:0008943 glyceraldehyde-3-phosphate dehydrogenase activity
GO:0005625 soluble fraction
GO:0006800 oxygen and reactive oxygen species metabolic process
GO:0006915 apoptosis
GO:0001950 plasma membrane enriched fraction
GO:0005739 mitochondrion
GO:0005811 lipid particle
GO:0005886 plasma membrane
InterPro families
IPR020828 Glyceraldehyde 3-phosphate dehydrogenase, NAD(P) binding domain
IPR020831 Glyceraldehyde/Erythrose phosphate dehydrogenase family
IPR016040 NAD(P)-binding domain
IPR020829 Glyceraldehyde 3-phosphate dehydrogenase, catalytic domain
Orthology group  MCL40164

Nucleotide sequence:

ATGGTTCTCAAAGTTGGCATTAACGGCTTTGGACGTATAGGCAGGGTGATTTTTAGGACT
TGCTTAGAAAACACTGATTTGGAGGTAAAGGCTATTAATGATCCGGCGGTAGACATTGAA
TATATTAGTTATCTTATAAAGTTCGACTCGACCCACGGCAAATTAAAATGGGATGTTACA
TTCAAAGACAACGAGATTACAATTAATGGAAATAATATAAAAGTTTTTCGCGAGAAAAAC
CCGTCGAATATACCGTGGCAGTCAGCATCTATACAATACGTTGTCGAAGCGTCCGGAATT
TTCACTAATTACGATAAGGCGTCGGGCCATTTATCGAGTAAATGTGTCAAACGAGTATTG
GTAACGGCCCCCAGCATCGACGTTCCAATGTTGATAATAGGCGTTAATGAAGACAGGATA
AATAAAGATCAGAAAGTAATATCTTGCGCGTCGAGCACTCTATACTGTTTGGCTCCAATA
ATTAAGGTTTTGGAAGACAACTACACAGTTGAAGAGGGTTTCATAACGAGCATCCATGCT
ATGACGCCATCTTTGAAACCGTTGGATGGATTGTGTTTAAAGGGAAAGCACTGGAGGGAC
CATAGGAGCATCCACCAGAACATAATACCGGCTGCGACGGGCGCTTGCAAGGCCCTCGGT
AAGATCATACCAGGAGTTAAAGACAAACTCACGGGCTTAGCGTTCAGGGTACCCGTTGTT
AACGTCTCCGTACTCGACATAACCATACGATTAAACAAAAGCACATCAATCGGGCACATT
GTTAAAATTATTGAAGACCAGATCAGATCTCCATTGAAGAATATAATCAAAGCTAGTAGT
GATGAGGCAGTGTCCTCAGATTTTCTTGGCGAAGAGAAATCTTGCATCCTTGATGTAAAC
TCCTGTCTCCAATTGACACCAAATTTCTTTAAATTGATTTGTTGGTACGAAAACGAATAT
TCCTATGCATGCAGAGTAGTAGATTCAATAAAATTTTTGGAAAAAAATTGCCAATTGGAA
CTGGTACCGAAGAATATTTTCAGGAGGCTTTCAGTGACACCCAGAATATCGAAGGAAAGG
CCGATACCGTTTTACAATAGAGATTTATTTTGTAATACTTTAATACAAAACAGATCCAAG
TTCTACATGGAACATGGCTTCAATGAAAGACCACCAACCCCAATGGGTTCCAGGCGACAT
GTAGATATGCACAATTCAGGCTCCATACAATCTATCTACGAACAGAAGCACAATTACCCT
GCACAAAATAGAACATCGCTATTTCAAAGCTGCGCACCATTCGAAACGCCAGACAAAGAA
CGTAAAAATTATCAAAAAACTCAAGAACGATTGGAGGAAGTGAAAAAGGAGTTCTCGAAA
ATGGTAGATATGACAGAGAAACTTTTGAAAAAATCGAAGGAGAGTAATTCACATCTTGAA
TATAGTACACCGAAAGAACAATACAAAGAAGAGAAAGATATTAAAAACGATTTATTACGT
GAAAGAGTAACGCCTTCTGGCATTAAAGATATGGTGATTGATAATAAAGATGAAAATCTC
AAAGATCAAGATGATGCCCAAGTGCAGGAGACCAAAGACGATGAATTTAACGACTGTTCG
AGATTAAGCGACCTGGGTACGTCAATCGATGAAAATGACAGTTCAAATGACATTAGATCT
GTCAAAAATTTGAATATTTTATCAGTTAATGTAGAGTACATTGCAAAGCCGAATACAAAT
AACGAAAGAAATGATGATACAATAAACGATACATTTAATTATGAGAAAAATTATCATCGT
GGCACACAGAATAAAACCAACTTGAATGAAAGTTGTACTAAAGAGCACAGCGTGAATTTG
GTCACCAGCGTCAAAAACAAAACCGAATTGGAAATCAGTGAATCGTGCTCAACCTCAGCG
GAAAAACAACAGAATAAAATTAAAAACGCCATCCCCAATAAAACTAACGCCATTGAAATT
AAAAAGAAGATTATAGGAGAAATTATAAAAAAGATGTCAAAAGATACTACAGAAGAAACT
AAAGACGTCAGTCTCATTTATAAAAAATCATCGATAACAAATGCCGTGGGTAATAAATTG
AACGATATTTACGATAAATTGGACAGCACTAGTGCAACGGACTCTGAGAATTCCTTCCAA
ATACATGAAAGAAAGTCACAAATCATCGAACTAACAGACCTCACTAACTCCATCGAAGAT
CTCGCCCGCTTGGACAAAATATGTAGAATTATAGAAATATCCGACGAGTTATCAGACAAA
TTGTTATCGGCGTTGGATAAAAAAGAACCGAATTATAAGAGGAAGAAATGGTCCCTCAAG
GACTTGTGTGAAACCATAAAGTTGGATGAATTTTGTAATAAAGTGTTCGGTCCCCAGACA
GGCAAGTAA

Protein sequence:

MVLKVGINGFGRIGRVIFRTCLENTDLEVKAINDPAVDIEYISYLIKFDSTHGKLKWDVT
FKDNEITINGNNIKVFREKNPSNIPWQSASIQYVVEASGIFTNYDKASGHLSSKCVKRVL
VTAPSIDVPMLIIGVNEDRINKDQKVISCASSTLYCLAPIIKVLEDNYTVEEGFITSIHA
MTPSLKPLDGLCLKGKHWRDHRSIHQNIIPAATGACKALGKIIPGVKDKLTGLAFRVPVV
NVSVLDITIRLNKSTSIGHIVKIIEDQIRSPLKNIIKASSDEAVSSDFLGEEKSCILDVN
SCLQLTPNFFKLICWYENEYSYACRVVDSIKFLEKNCQLELVPKNIFRRLSVTPRISKER
PIPFYNRDLFCNTLIQNRSKFYMEHGFNERPPTPMGSRRHVDMHNSGSIQSIYEQKHNYP
AQNRTSLFQSCAPFETPDKERKNYQKTQERLEEVKKEFSKMVDMTEKLLKKSKESNSHLE
YSTPKEQYKEEKDIKNDLLRERVTPSGIKDMVIDNKDENLKDQDDAQVQETKDDEFNDCS
RLSDLGTSIDENDSSNDIRSVKNLNILSVNVEYIAKPNTNNERNDDTINDTFNYEKNYHR
GTQNKTNLNESCTKEHSVNLVTSVKNKTELEISESCSTSAEKQQNKIKNAIPNKTNAIEI
KKKIIGEIIKKMSKDTTEETKDVSLIYKKSSITNAVGNKLNDIYDKLDSTSATDSENSFQ
IHERKSQIIELTDLTNSIEDLARLDKICRIIEISDELSDKLLSALDKKEPNYKRKKWSLK
DLCETIKLDEFCNKVFGPQTGK
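
The nucleotide and protein sequences above can be cross-checked by translating the CDS. A CDS length of 2409 nt corresponds to 803 codons, i.e. 802 residues plus a terminal stop codon. The following is a minimal Python sketch of such a check, assuming the standard genetic code (NCBI translation table 1); `CODON_TABLE` and `translate` are illustrative names and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, dropping one trailing stop codon.

    Whitespace and line breaks in the input are ignored, so the sequence
    can be pasted directly from the record.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)
    )
    # Remove the terminal stop ('*') if present; internal stops are kept
    # so that a broken gene model remains visible in the output.
    return protein[:-1] if protein.endswith("*") else protein


# First nine codons and last three codons of the CDS shown above.
print(translate("ATGGTTCTCAAAGTTGGCATTAACGGC"))  # MVLKVGING
print(translate("GGCAAGTAA"))                    # GK
```

Passing the full nucleotide sequence to `translate` and comparing the result with the protein sequence (line breaks removed) would confirm that the two blocks describe the same gene model.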