DPGLEAN07794 in OGS1.0

New model in OGS2.0  DPOGS212408
Genomic Position  scaffold871:- 7858-10016
CDS Length  1374
Paired RNAseq reads  180
Single RNAseq reads  605
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA002808 (4e-17)
Best Drosophila hit  CG8078 (4e-143)
Best Human hit  cytoplasmic tRNA 2-thiolation protein 1 (5e-76)
Best NR hit (blastp)  GF12710 [Drosophila ananassae] (3e-151)
Best NR hit (blastx)  PREDICTED: similar to CG8078 CG8078-PA [Tribolium castaneum] (2e-147)
GeneOntology terms
GO:0000049 tRNA binding
GO:0002098 tRNA wobble uridine modification
GO:0005829 cytosol
GO:0034227 tRNA thio-modification
InterPro families
IPR011063 tRNA(Ile)-lysidine/2-thiocytidine synthase
IPR014729 Rossmann-like alpha/beta/alpha sandwich fold
IPR000541 tRNA 2-thiolation protein
IPR020554 Uncharacterised protein family UPF0021, conserved site
Orthology group  MCL12816

Nucleotide sequence:

ATGCCTGTACTATGCAAAGCAGGATGTGGAAAAAATGCTATGTTAAAGCGTCCTAAAACG
GGGGATACTCTATGTAAAGAATGCTTTTACGAAGCCTTTGAAACAGAAATCCATTTTACA
ATAACAAAAGCAGAATTATTTAATAGAGGAGATTCTGTCGCCATCGCCGCCTCTGGTGGC
AAGGATTCAACCGTATTGGCACATGTACTTAAAACATTAAATCAAAGATATGACTATGGA
CTTAATCTTATGTTGCTGTCCATAGACGAGGGCATAACCGGCTACAGAGATGACAGCTTG
GAAACAGTCAAACAAAACAGAGATGATTACGAGATGAATCTCAAAATATTATCATATAAA
GATTTATATGGTTGGACCATGGACGAAATTGTAGCCCAAATCGGAAGAAAGAATAATTGT
ACATTCTGTGGAGTATTCAGGAGGCAAGCTTTGGATAGAGGTGCCGCCATGCTTAATGTG
AAATGTATAGCAACAGGACACAATGCTGATGACATAGCAGAGACGGTGCTGATGAATGTG
CTGAGAGGAGACATAGCTCGGCTCAAGAGATGCACTGCTATATCTACTGGCAGCGAGGGC
ACAATTCCGAGAGTGAAGCCGTTGAAGTATACGTATGAAAAGGAGATTGTTATGTACGCT
CATTACAAGAAGCTGGTGTACTTCTCAACAGAATGTGTGTTTGCTCCAAACGCCTACAGA
GGTCATGCTAGGGCTCTGTTAAAAGATCTGGAAAAAATTAGACCTACTTGCATTATGGAT
ATCATATACTCAGGCGAAACAATGGCTGTGAAAGAGGAAGTGTCACTGCCCACACAGAGA
ATTTGCACGAGATGCAAATTTGTCTCCTCTCAAGAGGTATGCAAGGCTTGCGTTCTTCTG
GAAGGATTGAACAAGGGTTTACCAAAACTTGGCATTGGAAAGAGTTCCAAAGCCAAGAAG
ATGCTAGAAGAATACAACGCAAACCAAAATAGTACGAATAAAGCTATCGATGAAATTAAT
GTCGACTGCCAGAAAAATAATTGTGTCTCTAGAGGAAAAGCGTGCAGGTCGAATCGAAAT
AAAACAAATGATAACGAAGTCAACAGCCGAAACGGAGAAAAGTGCTGTAGTACACAGGAA
AAGACACATGACAGTGCTAATATAAGCAATACTAAATTAAACACACTTTTACAAGACTAC
GGCATACCAGAAAATGATTGTGGACACGATAAAAACTCATCAATAGGGGAAAGTTCGGAA
AATCATAATTTCGAAGTCGATTTACACAACGAAGATGTTACATCCTTAGCAGAGGAGACT
GACGCGTGTGGCGGAGCATGCGGCAAGATGGACTCCATGCATATAGGTTTCTGA

Protein sequence:

MPVLCKAGCGKNAMLKRPKTGDTLCKECFYEAFETEIHFTITKAELFNRGDSVAIAASGG
KDSTVLAHVLKTLNQRYDYGLNLMLLSIDEGITGYRDDSLETVKQNRDDYEMNLKILSYK
DLYGWTMDEIVAQIGRKNNCTFCGVFRRQALDRGAAMLNVKCIATGHNADDIAETVLMNV
LRGDIARLKRCTAISTGSEGTIPRVKPLKYTYEKEIVMYAHYKKLVYFSTECVFAPNAYR
GHARALLKDLEKIRPTCIMDIIYSGETMAVKEEVSLPTQRICTRCKFVSSQEVCKACVLL
EGLNKGLPKLGIGKSSKAKKMLEEYNANQNSTNKAIDEINVDCQKNNCVSRGKACRSNRN
KTNDNEVNSRNGEKCCSTQEKTHDSANISNTKLNTLLQDYGIPENDCGHDKNSSIGESSE
NHNFEVDLHNEDVTSLAEETDACGGACGKMDSMHIGF
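
Consistency check (illustrative):

The CDS length and the two sequences in this record can be cross-checked against each other. The following is a minimal Python sketch, assuming the standard genetic code applies to this nuclear gene. The function `translate` and all variable names are hypothetical helpers introduced for illustration and are not part of the database. It translates the first and last lines of the nucleotide sequence and compares them with the start and end of the protein sequence.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
# Assumption: the standard code is appropriate for this nuclear gene.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence from this record.
first_line = "ATGCCTGTACTATGCAAAGCAGGATGTGGAAAAAATGCTATGTTAAAGCGTCCTAAAACG"
last_line = "GACGCGTGTGGCGGAGCATGCGGCAAGATGGACTCCATGCATATAGGTTTCTGA"

# Lengths taken from the record: 1374 nt of CDS, 457 residues of protein.
cds_length = 1374
protein_length = 457

start_peptide = translate(first_line)
end_peptide = translate(last_line)

# 457 residues plus one stop codon account for all 1374 nucleotides.
lengths_agree = cds_length == 3 * (protein_length + 1)
```

Here `start_peptide` should match the first 20 residues of the protein sequence, and `end_peptide` should match its final 17 residues followed by `*` for the terminal TGA stop codon.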