DPGLEAN07478 in OGS1.0

New model in OGS2.0  DPOGS201344
Genomic Position  scaffold1837:+ 1677-4533
CDS Length  1299
Paired RNAseq reads  194
Single RNAseq reads  605
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA003127 (6e-77)
Best Drosophila hit  lethal (2) 37Cd (1e-35)
Best Human hit  general transcription factor 3C polypeptide 5 isoform 2 (1e-36)
Best NR hit (blastp)  hypothetical protein AaeL_AAEL009331 [Aedes aegypti] (3e-53)
Best NR hit (blastx)  conserved hypothetical protein [Pediculus humanus corporis] (8e-53)
GeneOntology terms
GO:0003709 RNA polymerase III transcription factor activity
GO:0042797 tRNA transcription from RNA polymerase III promoter
GO:0042791 5S class rRNA transcription from RNA polymerase III type 1 promoter
GO:0005634 nucleus
GO:0000127 transcription factor TFIIIC complex
GO:0006351 transcription, DNA-dependent
GO:0005515 protein binding
GO:0003677 DNA binding
InterPro families  IPR019136 Transcription factor IIIC, subunit 5
Orthology group  MCL14670

Nucleotide sequence:

ATGGGAGGTATAAAGGCTCTTTCACAGCATTACACCCAAGCTAATAAAAAGCGACTGGGA
TTTAGCTTTCAACCAGATAATCCTTTTATGAAGAAAATATATGCTGACGCAAAACCCACC
GCCGGTGTCCTCTTTAAACTAAAGGTTAAGAAAACTAAATCGGGTAATGAGGTTAAGAAA
GAAGTAATTTCAACATCTATTGTTGGAACTGTTAAGAAAATTAATAGGTTTGAATCAATG
TGTGACTTCCAATACTTACCACTCAGTACACCACACATAGAAGGTGACAAACCACAATGT
CTTATAGAACAAATCATACCCTCCGGCCTAGATGAGCTGAATTCCATATTGGAACCCACG
CCTCTCTTTATAACACCATCAAATTTTACGAGGTCAGACAAACCGATAACATACTGCTAC
ACAGAGAAACGCTATGTGACAAAGGATATGATGAAGGGCGAGTCCACAAATGACGAAGTA
CATAAGACAAGGATGGAGAGGTCTTTGCATTTACCGAGATTTATATTTTCACTGAATGAA
GAGTTACCAACTGAACCCAATGAATATTATATTAAATTGAGAAATGCAAGACAAGCTCTG
AATCCATCTTTAGAAGAGGAATACAATACAGTGGCAAAGCTCTTCGAAGAGAGACCGATA
TGGTCATTGAATCTAGTCAAGTTTCATACAAAGATAAAGCTGTCATCTCTTAAGGTGATA
ATGCCGTGTCTTGCATTGTACATGAGAGAAGGTCCGTGGAGGATGCTCTGGACAAGATTT
GGATACGATCCAAGAAAAGAACCCGGCTCAAGGATTTACCAGACCCTGGACTTTAGGATG
AGACATGCAGCCGGTGTTCACTCTATGGTGTCAACTCGTGATGAGTTCGTTCATTGCAAG
AAGAAAGATAGAATTAAGAATTTAAGCAAAAGTGCTATAGATGACCTCTCAATAGAGGAC
ACGGTTTACGAAGGCGCTGTTTACTTTAGACCCGGGATGGCGCCAACTCAGAGACAAATA
TACTACCAGTACTGTGACGTGTACCTGCCGGAGGTCCAGGAGCTCGTGTCTCTGTCCCCG
CCAGCCGGGTACACGTGTCACGAGCGCCGTGGCTGGCTGCCTCCCGACACTGACCAGCTC
TGTCGGGACCACATCTTCAGATACGTCATGCAGACTCTACTAGCGAATCGCGTTAAGTAT
GAGGACGGGGTTGGTACAGGCGGCGAGAGTAGCTCTGATGACGCGGACGAAGCAGCGAAT
GCTTCTGTCGCTGAAGTTGATGAATCCATTAATACATGA

Protein sequence:

MGGIKALSQHYTQANKKRLGFSFQPDNPFMKKIYADAKPTAGVLFKLKVKKTKSGNEVKK
EVISTSIVGTVKKINRFESMCDFQYLPLSTPHIEGDKPQCLIEQIIPSGLDELNSILEPT
PLFITPSNFTRSDKPITYCYTEKRYVTKDMMKGESTNDEVHKTRMERSLHLPRFIFSLNE
ELPTEPNEYYIKLRNARQALNPSLEEEYNTVAKLFEERPIWSLNLVKFHTKIKLSSLKVI
MPCLALYMREGPWRMLWTRFGYDPRKEPGSRIYQTLDFRMRHAAGVHSMVSTRDEFVHCK
KKDRIKNLSKSAIDDLSIEDTVYEGAVYFRPGMAPTQRQIYYQYCDVYLPEVQELVSLSP
PAGYTCHERRGWLPPDTDQLCRDHIFRYVMQTLLANRVKYEDGVGTGGESSSDDADEAAN
ASVAEVDESINT
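
Consistency check (illustrative sketch, not part of the original record): the CDS length of 1299 nt corresponds to the 432 residues listed above plus one stop codon (433 x 3 = 1299). The Python below assumes the standard genetic code (NCBI translation table 1) and uses a hypothetical helper named translate; it translates the first and last lines of the nucleotide sequence and compares them with the matching ends of the protein sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
# The last line starts at position 1261, which is a codon boundary.
first_line = "ATGGGAGGTATAAAGGCTCTTTCACAGCATTACACCCAAGCTAATAAAAAGCGACTGGGA"
last_line = "GCTTCTGTCGCTGAAGTTGATGAATCCATTAATACATGA"

print(translate(first_line))  # MGGIKALSQHYTQANKKRLG
print(translate(last_line))   # ASVAEVDESINT*

# CDS length versus protein length: 432 residues plus one stop codon.
print((432 + 1) * 3)          # 1299
```

Concatenating all lines of the nucleotide sequence and passing the result to translate should, under the same assumption, give the full protein sequence followed by '*'.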