New model in OGS2.0 | DPOGS201344  |
---|---|
Genomic Position | scaffold1837:+ 1677-4533 |
CDS Length | 1299 |
Paired RNAseq reads   | 194 |
Single RNAseq reads   | 605 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA003127 (6e-77) |
Best Drosophila hit   | lethal (2) 37Cd (1e-35) |
Best Human hit | general transcription factor 3C polypeptide 5 isoform 2 (1e-36) |
Best NR hit (blastp)   | hypothetical protein AaeL_AAEL009331 [Aedes aegypti] (3e-53) |
Best NR hit (blastx)   | conserved hypothetical protein [Pediculus humanus corporis] (8e-53) |
GeneOntology terms    | GO:0003709 RNA polymerase III transcription factor activity; GO:0042797 tRNA transcription from RNA polymerase III promoter; GO:0042791 5S class rRNA transcription from RNA polymerase III type 1 promoter; GO:0005634 nucleus; GO:0000127 transcription factor TFIIIC complex; GO:0006351 transcription, DNA-dependent; GO:0005515 protein binding; GO:0003677 DNA binding |
InterPro families   | IPR019136 Transcription factor IIIC, subunit 5 |
Orthology group | MCL14670 |
Nucleotide sequence:
ATGGGAGGTATAAAGGCTCTTTCACAGCATTACACCCAAGCTAATAAAAAGCGACTGGGA
TTTAGCTTTCAACCAGATAATCCTTTTATGAAGAAAATATATGCTGACGCAAAACCCACC
GCCGGTGTCCTCTTTAAACTAAAGGTTAAGAAAACTAAATCGGGTAATGAGGTTAAGAAA
GAAGTAATTTCAACATCTATTGTTGGAACTGTTAAGAAAATTAATAGGTTTGAATCAATG
TGTGACTTCCAATACTTACCACTCAGTACACCACACATAGAAGGTGACAAACCACAATGT
CTTATAGAACAAATCATACCCTCCGGCCTAGATGAGCTGAATTCCATATTGGAACCCACG
CCTCTCTTTATAACACCATCAAATTTTACGAGGTCAGACAAACCGATAACATACTGCTAC
ACAGAGAAACGCTATGTGACAAAGGATATGATGAAGGGCGAGTCCACAAATGACGAAGTA
CATAAGACAAGGATGGAGAGGTCTTTGCATTTACCGAGATTTATATTTTCACTGAATGAA
GAGTTACCAACTGAACCCAATGAATATTATATTAAATTGAGAAATGCAAGACAAGCTCTG
AATCCATCTTTAGAAGAGGAATACAATACAGTGGCAAAGCTCTTCGAAGAGAGACCGATA
TGGTCATTGAATCTAGTCAAGTTTCATACAAAGATAAAGCTGTCATCTCTTAAGGTGATA
ATGCCGTGTCTTGCATTGTACATGAGAGAAGGTCCGTGGAGGATGCTCTGGACAAGATTT
GGATACGATCCAAGAAAAGAACCCGGCTCAAGGATTTACCAGACCCTGGACTTTAGGATG
AGACATGCAGCCGGTGTTCACTCTATGGTGTCAACTCGTGATGAGTTCGTTCATTGCAAG
AAGAAAGATAGAATTAAGAATTTAAGCAAAAGTGCTATAGATGACCTCTCAATAGAGGAC
ACGGTTTACGAAGGCGCTGTTTACTTTAGACCCGGGATGGCGCCAACTCAGAGACAAATA
TACTACCAGTACTGTGACGTGTACCTGCCGGAGGTCCAGGAGCTCGTGTCTCTGTCCCCG
CCAGCCGGGTACACGTGTCACGAGCGCCGTGGCTGGCTGCCTCCCGACACTGACCAGCTC
TGTCGGGACCACATCTTCAGATACGTCATGCAGACTCTACTAGCGAATCGCGTTAAGTAT
GAGGACGGGGTTGGTACAGGCGGCGAGAGTAGCTCTGATGACGCGGACGAAGCAGCGAAT
GCTTCTGTCGCTGAAGTTGATGAATCCATTAATACATGA
Protein sequence:
MGGIKALSQHYTQANKKRLGFSFQPDNPFMKKIYADAKPTAGVLFKLKVKKTKSGNEVKK
EVISTSIVGTVKKINRFESMCDFQYLPLSTPHIEGDKPQCLIEQIIPSGLDELNSILEPT
PLFITPSNFTRSDKPITYCYTEKRYVTKDMMKGESTNDEVHKTRMERSLHLPRFIFSLNE
ELPTEPNEYYIKLRNARQALNPSLEEEYNTVAKLFEERPIWSLNLVKFHTKIKLSSLKVI
MPCLALYMREGPWRMLWTRFGYDPRKEPGSRIYQTLDFRMRHAAGVHSMVSTRDEFVHCK
KKDRIKNLSKSAIDDLSIEDTVYEGAVYFRPGMAPTQRQIYYQYCDVYLPEVQELVSLSP
PAGYTCHERRGWLPPDTDQLCRDHIFRYVMQTLLANRVKYEDGVGTGGESSSDDADEAAN
ASVAEVDESINT
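
The record can be checked for internal consistency: a CDS of 1299 nt is 433 codons, which with one stop codon gives the 432 residues listed above. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1); the function names `translate` and `expected_protein_length` are illustrative and not part of any database tooling. The two test strings are the first 18 nt of the CDS and its final line.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered T, C, A, G at each position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip(("".join(c) for c in product("TCAG", repeat=3)), AMINO))


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the record
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon: end of translation
            break
        protein.append(residue)
    return "".join(protein)


def expected_protein_length(cds_length: int) -> int:
    """Residues expected from a complete CDS that ends in one stop codon."""
    return cds_length // 3 - 1


# Start of the CDS (first 18 nt) and its last line, copied from the record.
print(translate("ATGGGAGGTATAAAGGCT"))
print(translate("GCTTCTGTCGCTGAAGTTGATGAATCCATTAATACATGA"))
print(expected_protein_length(1299))
```

Passing the full nucleotide block to `translate` and comparing the result with the protein block would extend the same check to the whole record.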