New model in OGS2.0 | DPOGS202219  |
---|---|
Genomic Position | scaffold121:+ 33694-35656 |
See gene structure | |
CDS Length | 1377 |
Paired RNAseq reads   | 388 |
Single RNAseq reads   | 1075 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013494 (1e-162) |
Best Drosophila hit   | CG7265 (9e-61) |
Best Human hit | diphthamide biosynthesis protein 2 isoform a (4e-31) |
Best NR hit (blastp)   | diptheria toxin resistance protein [Bombyx mori] (2e-169) |
Best NR hit (blastx)   | diptheria toxin resistance protein [Bombyx mori] (4e-165) |
GeneOntology terms   | GO:0017183 peptidyl-diphthamide biosynthetic process from peptidyl-histidine |
InterPro families    | IPR010014 Diphthamide synthesis, DHP2 IPR002728 Diphthamide synthesis, DPH1/DHP2 |
Orthology group | MCL14278 |
Nucleotide sequence:
ATGTCTAATTTTACAACAGATGGTAAAATCTGTATAGAAAGAGAGTTGGAAGTGGCTAAA
TCCGAAATAGAATTTGACAACTTGGAACAACGTTACAGTGTTACAGAAATTTGTAATTGG
CTTAAGGAACACAACTTTTCTAAAGTGTGCTTACAATTTCCTGATGAACTAATTGGGGTC
AGTGCTGCTATTTATCAAGAAATTAAAAAAAATATAAATGTAGACTTATACATTCTCGGT
GACACATCCTATGCTAGTTGTTGTGTTGACTCCGTGGCTGCTATGCATGTTCAGAGTGAT
GCAGTGATTCACTTTGGACATTCGTGTTTCACAAAGACAAACATCCCCGTGTTCACAGTA
CTGCCAAAACGGAACTTATCCACTGATGCCATAGAATCAGTATTACGAGATCATTTCAAA
TCAGATGACACTAAACTCTGTTTATTTTACGATGCAGAATATGAACACTGTAAAGCTCTA
AGATCAAAGACATTTATTTCTTCTTTATTACAACAGTTTTGTTTATTCTTTGATTTAGAT
AACATCCAGACGATTATATTGGGGAGACTTGTTAAAAATGAAACCGGCGTTGAATACCCA
ATGGAAAGTTTAAGAGATTGTATTTGTATATACATTGGATCAAAAGGACAGACAGTGTTT
AATTTTAGTGTTTCAGTTCCAGCTTTGAAATGGTTCCTACTGGATCCAGAGGACAAAAAA
CTTGAACATCTAGAAGAAACAATTTGGTTTAAAAGGAGAAGATTTTTGATAGAGAAATGC
AAAGATGCAAATGTCATCGGCATACTGGTGTGTAAACTTGCCGGTGAGCAGACGAAACAA
ATAGTAAAAAGAATGAAACAAATATGTAAAGCTAATGGGAAAAAAAGTTACATAGTGTCA
GTGGGGAAGCCGAATGTTGCCAAGTTGGCGAATTTCCCAGAGATTGATATATATGTTATG
ATAGCTTGTCCGGAAAATGACTTGTATAACAATCGAGACTTCTACAGACCAATAATATAC
CCTTTTGAATTGGAAGTGGCGCTCAATTCTAATAGGGAGCAATACTACAACTATCATGTT
ACAGACTATGACGATCTCTTGCCAGGGAAACGCCATCACTTAGAAATAGATCATACCAAG
CAAGCAACTGACGTCAGTCTTGTGACTGGCAAAATAAGGGAAAATAAAATACATAGCAAT
GAAGAGGGTGGCATGGAGGTGGCTGAAAAACAAAATTGGGCATTAGAGAGCATCGGTCAG
AACCTTCAAGAGAGGTCATGGAAAGGATTGGAACAAAAATTTGGAGAGACTGATGTCAAA
AATGTTGAAGAAGGTCGAAAAGGAATCCCCTTACAGTACAGCAATGAACCAGAATAG
Protein sequence:
MSNFTTDGKICIERELEVAKSEIEFDNLEQRYSVTEICNWLKEHNFSKVCLQFPDELIGV
SAAIYQEIKKNINVDLYILGDTSYASCCVDSVAAMHVQSDAVIHFGHSCFTKTNIPVFTV
LPKRNLSTDAIESVLRDHFKSDDTKLCLFYDAEYEHCKALRSKTFISSLLQQFCLFFDLD
NIQTIILGRLVKNETGVEYPMESLRDCICIYIGSKGQTVFNFSVSVPALKWFLLDPEDKK
LEHLEETIWFKRRRFLIEKCKDANVIGILVCKLAGEQTKQIVKRMKQICKANGKKSYIVS
VGKPNVAKLANFPEIDIYVMIACPENDLYNNRDFYRPIIYPFELEVALNSNREQYYNYHV
TDYDDLLPGKRHHLEIDHTKQATDVSLVTGKIRENKIHSNEEGGMEVAEKQNWALESIGQ
NLQERSWKGLEQKFGETDVKNVEEGRKGIPLQYSNEPE