DPGLEAN20199 in OGS1.0

New model in OGS2.0DPOGS202219 
Genomic Positionscaffold121:+ 33694-35656
See gene structure
CDS Length1377
Paired RNAseq reads  388
Single RNAseq reads  1075
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013494 (1e-162)
Best Drosophila hit  CG7265 (9e-61)
Best Human hitdiphthamide biosynthesis protein 2 isoform a (4e-31)
Best NR hit (blastp)  diptheria toxin resistance protein [Bombyx mori] (2e-169)
Best NR hit (blastx)  diptheria toxin resistance protein [Bombyx mori] (4e-165)
GeneOntology terms  GO:0017183 peptidyl-diphthamide biosynthetic process from peptidyl-histidine
InterPro families
  
IPR010014 Diphthamide synthesis, DHP2
IPR002728 Diphthamide synthesis, DPH1/DHP2
Orthology groupMCL14278

Nucleotide sequence:

ATGTCTAATTTTACAACAGATGGTAAAATCTGTATAGAAAGAGAGTTGGAAGTGGCTAAA
TCCGAAATAGAATTTGACAACTTGGAACAACGTTACAGTGTTACAGAAATTTGTAATTGG
CTTAAGGAACACAACTTTTCTAAAGTGTGCTTACAATTTCCTGATGAACTAATTGGGGTC
AGTGCTGCTATTTATCAAGAAATTAAAAAAAATATAAATGTAGACTTATACATTCTCGGT
GACACATCCTATGCTAGTTGTTGTGTTGACTCCGTGGCTGCTATGCATGTTCAGAGTGAT
GCAGTGATTCACTTTGGACATTCGTGTTTCACAAAGACAAACATCCCCGTGTTCACAGTA
CTGCCAAAACGGAACTTATCCACTGATGCCATAGAATCAGTATTACGAGATCATTTCAAA
TCAGATGACACTAAACTCTGTTTATTTTACGATGCAGAATATGAACACTGTAAAGCTCTA
AGATCAAAGACATTTATTTCTTCTTTATTACAACAGTTTTGTTTATTCTTTGATTTAGAT
AACATCCAGACGATTATATTGGGGAGACTTGTTAAAAATGAAACCGGCGTTGAATACCCA
ATGGAAAGTTTAAGAGATTGTATTTGTATATACATTGGATCAAAAGGACAGACAGTGTTT
AATTTTAGTGTTTCAGTTCCAGCTTTGAAATGGTTCCTACTGGATCCAGAGGACAAAAAA
CTTGAACATCTAGAAGAAACAATTTGGTTTAAAAGGAGAAGATTTTTGATAGAGAAATGC
AAAGATGCAAATGTCATCGGCATACTGGTGTGTAAACTTGCCGGTGAGCAGACGAAACAA
ATAGTAAAAAGAATGAAACAAATATGTAAAGCTAATGGGAAAAAAAGTTACATAGTGTCA
GTGGGGAAGCCGAATGTTGCCAAGTTGGCGAATTTCCCAGAGATTGATATATATGTTATG
ATAGCTTGTCCGGAAAATGACTTGTATAACAATCGAGACTTCTACAGACCAATAATATAC
CCTTTTGAATTGGAAGTGGCGCTCAATTCTAATAGGGAGCAATACTACAACTATCATGTT
ACAGACTATGACGATCTCTTGCCAGGGAAACGCCATCACTTAGAAATAGATCATACCAAG
CAAGCAACTGACGTCAGTCTTGTGACTGGCAAAATAAGGGAAAATAAAATACATAGCAAT
GAAGAGGGTGGCATGGAGGTGGCTGAAAAACAAAATTGGGCATTAGAGAGCATCGGTCAG
AACCTTCAAGAGAGGTCATGGAAAGGATTGGAACAAAAATTTGGAGAGACTGATGTCAAA
AATGTTGAAGAAGGTCGAAAAGGAATCCCCTTACAGTACAGCAATGAACCAGAATAG

Protein sequence:

MSNFTTDGKICIERELEVAKSEIEFDNLEQRYSVTEICNWLKEHNFSKVCLQFPDELIGV
SAAIYQEIKKNINVDLYILGDTSYASCCVDSVAAMHVQSDAVIHFGHSCFTKTNIPVFTV
LPKRNLSTDAIESVLRDHFKSDDTKLCLFYDAEYEHCKALRSKTFISSLLQQFCLFFDLD
NIQTIILGRLVKNETGVEYPMESLRDCICIYIGSKGQTVFNFSVSVPALKWFLLDPEDKK
LEHLEETIWFKRRRFLIEKCKDANVIGILVCKLAGEQTKQIVKRMKQICKANGKKSYIVS
VGKPNVAKLANFPEIDIYVMIACPENDLYNNRDFYRPIIYPFELEVALNSNREQYYNYHV
TDYDDLLPGKRHHLEIDHTKQATDVSLVTGKIRENKIHSNEEGGMEVAEKQNWALESIGQ
NLQERSWKGLEQKFGETDVKNVEEGRKGIPLQYSNEPE