DPGLEAN18004 in OGS1.0

New model in OGS2.0DPOGS204924 
Genomic Positionscaffold1529:- 24503-29282
See gene structure
CDS Length1515
Paired RNAseq reads  296
Single RNAseq reads  621
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011417 (4e-10)
Best Drosophila hit  CG3045 (5e-98)
Best Human hittRNA pseudouridine synthase 3 (2e-78)
Best NR hit (blastp)  PREDICTED: similar to pseudouridylate synthase [Nasonia vitripennis] (4e-112)
Best NR hit (blastx)  PREDICTED: similar to pseudouridylate synthase [Nasonia vitripennis] (1e-108)
GeneOntology terms


  
GO:0004730 pseudouridylate synthase activity
GO:0009982 pseudouridine synthase activity
GO:0003723 RNA binding
GO:0001522 pseudouridine synthesis
InterPro families



  
IPR020097 Pseudouridine synthase I, TruA, alpha/beta domain
IPR020103 Pseudouridine synthase, catalytic domain
IPR020094 Pseudouridine synthase I, TruA, N-terminal
IPR020095 Pseudouridine synthase I, TruA, C-terminal
IPR001406 Pseudouridine synthase I, TruA
Orthology groupMCL14853

Nucleotide sequence:

ATGTCGAAACAAATTAATAAATTACCACCAAAACAGAGAAAAACTAAAGGCTTATCTAGA
GAGGAGCTTATGAATATGGATAAAAATGAATTAGTTGATAGAATAATACAGTTGGAAGCT
CACACTACGCAACTTAAAAATATAATAAGCAAAAGTGAACCAGTTACAGAGAATATACAG
GGTTACAATAATCAAAGAAAATTTGATTTCACGAAGTGTACCTTCCGACGGGTTCTGCTA
CATATAATATACTTCGGCTGGGATTACCACGGGCTGGCCGTCCAGGAGGATTCAACGCAC
ACAATTGAGCATTACCTCTTCAATGCCCTCGTCAAGTCATGTCTCATTGAGAGTAGAGAA
CAGTCACAGTACCATCGATGTGGAAGGACAGATAAAGGCGTCAGTGCCTTCGGACAGATA
ATATCTATATCATTACGGAGCAAACTGGAACCGTCTTCAACCGACTACTCATCCGAGATC
CAATACTGCAAGATCCTTAACAGATTGTTTCCGAGGGATATTAAAGCGGTAGCCTGGATG
CCCATCCCTGATGATAGACCAGATTTCAGTGCAAGATTCGACTGTAAGGGCCGGCAGTAC
AAGTACTATTTCCCGAAATCTAATCTCAATATAACCGCTATGAGGGAGGCCTGTCGCCAG
CTCATCGGTTCACACGACTTCCGCCACCTCTGCAAGATGGACGTGGGGAACGGCGTCACT
GAGTTCACAAGGCGTGTCGTATCAGCTGACATTATAGCTCTGGATAAGGATTGCGAACAG
ACAACATCGATGTACGCATTAGTGATAGAAGGTAATGCATTTCTGTGGCATCAGATCAGG
TGTATAATGGGCGTGTTGTTGCTCGTGGGCCAAGGACACGAGAGCCCGGGTATCATAGCC
GAATTACTGGACGTCGAAGCAAATCCACGCAAACCTCAATACAATATGGCTCTGGATTTG
CCGTTGAACCTGTTCTGCTGCAGATATGATGTGAAGAGCCGCTGGGTTTATGACGACGAG
GAGCTCAAATACATCATCACCAACTTACAGGCGGACTGGACCTTGTATAATGTCAAATCC
ACCATGATAAAAGATGCTCTGGAACATCTGGAAGGTGTCCTATATGACTTGAGCAAGGAG
GGGAAAAAGTGTGACAGAGACGGAGAATATAACGACGTGGGAAGGAGAGAGATGCAAGAT
AAGGAAGATGTAGCGTTAGAGGGAGACCAAGATAAAATATGTGACATAGACAGAAATTTA
GAAGAGTTGGGAGAGAAAGAGAAAGAAGATGATAACAATAAGTGCGAGAGAGACAGGGGA
TTAAAAGAGTTGGAAGGGAAAGAGACGGGAGATAGAATAATATCGCACGCAGAATGCCTG
CTACAAGGAGTCAAACCAAAAATATACACACCGCTGTTGAAAAGACAAACCTGCTCGAGT
CTGCAGGAACGATTGCAATACTACAGGAAGAAAAGGAAAGTGGAGAGCGGTTCTGATGAT
GAAGAAATAAAATAA

Protein sequence:

MSKQINKLPPKQRKTKGLSREELMNMDKNELVDRIIQLEAHTTQLKNIISKSEPVTENIQ
GYNNQRKFDFTKCTFRRVLLHIIYFGWDYHGLAVQEDSTHTIEHYLFNALVKSCLIESRE
QSQYHRCGRTDKGVSAFGQIISISLRSKLEPSSTDYSSEIQYCKILNRLFPRDIKAVAWM
PIPDDRPDFSARFDCKGRQYKYYFPKSNLNITAMREACRQLIGSHDFRHLCKMDVGNGVT
EFTRRVVSADIIALDKDCEQTTSMYALVIEGNAFLWHQIRCIMGVLLLVGQGHESPGIIA
ELLDVEANPRKPQYNMALDLPLNLFCCRYDVKSRWVYDDEELKYIITNLQADWTLYNVKS
TMIKDALEHLEGVLYDLSKEGKKCDRDGEYNDVGRREMQDKEDVALEGDQDKICDIDRNL
EELGEKEKEDDNNKCERDRGLKELEGKETGDRIISHAECLLQGVKPKIYTPLLKRQTCSS
LQERLQYYRKKRKVESGSDDEEIK