New model in OGS2.0 | DPOGS214227  |
---|---|
Genomic Position | scaffold323:- 33858-44889 |
See gene structure | |
CDS Length | 5406 |
Paired RNAseq reads   | 1018 |
Single RNAseq reads   | 2657 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005950 (8e-06) |
Best Drosophila hit   | ND |
Best Human hit | general transcription factor 3C polypeptide 2 (2e-16) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC014717 [Tribolium castaneum] (2e-42) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC014717 [Tribolium castaneum] (8e-24) |
GeneOntology terms    | GO:0003677 DNA binding GO:0000127 transcription factor TFIIIC complex GO:0006351 transcription, DNA-dependent GO:0005634 nucleus GO:0003709 RNA polymerase III transcription factor activity GO:0042797 tRNA transcription from RNA polymerase III promoter GO:0042791 5S class rRNA transcription from RNA polymerase III type 1 promoter |
InterPro families   | IPR015880 Zinc finger, C2H2-like |
Orthology group | MCL17175 |
Nucleotide sequence:
ATGGAATCTACAAAGTTACTTCAGAATGACCCTCCAGAAGTTATGGTATTACCACCTAAC
ATTTTAGAAAAATTGGGCATTGATTTAGGAAATATTCAGCCCTGTAGTAATGGAACAAAT
GCAAAATCAGGTACCAGACAACCTGAAGACATATCTGTATTGGATATCAATGCACCCAAT
TATGTACCTGGTAAAAATAATGATGAAGAATTTGATGGATCTTCGATTTTAGAAACAGCT
AATTTGATGAAGGCATCAGTTTTCTTAAGTCCCACATTTACTAATTTGAAGGCAACCAAC
GGTTTAAATGATCTCGAGTTCAATAATTTTGAAAACGCTATAGCCACCATAGAGTGCAAT
GTTTCTAAACCTTTAAAAAGAAAACTACAAAAGTCAAATAATAAAATAAACATAATATCT
GAAGAAATTATTGAGTTGGAGAAAATTAAAGATTTAATAACATCTGAGATACTGGTTAGC
AATAATCTTGCTTCTGTGCCTATTAACACAATAATTACAGAAAAAACTATTAATATCAAA
AATGGTGCAGCTTCAGTACCATCAAAGGAGAAAAATAATAATGATTGTAAGGATAACAAT
GATAAACCATTAAATGATTGTAAAGCAGTAGAAAATAAAGTTGCCAACATAAATAACGGA
ATTCAAGATCAATTGGACAAATCAATTCAGTTCCAAATAGATAGCGATGGTTTAAAGTTA
TCAAAGGAAATAAATTTGATAAATAATACAAATGGAGATATAATTATGTACAAGAATCAT
GAAGAAAAATGTTCAGATGAAAACCAGGATCTTGTGAGAAATTGTGACAAAGATTTAAAG
ATCACGTCTGAGCAGCAAGATGAACATGCTACTTCTAGTTCAGAAGAAATTGTAACTATT
ATTCATGTTATAAATGAAAATGTTGATGAGGCAGATGTGAGTAAAGATTTAAATTTCTCC
ATTCCTGATAGTGTATGTCACTTCACTGTCACAGACACAATTGATATAGTAGTTCCCGAA
CATTCCGAACTTAGCATTAACTATGATCAGGAAAATAAATCTTGTGATGTAAAAACTTCA
GCAGACGAAAAAATTTTAGGCGACAGAAAAGATGCCAAAATTAATAACAAGAACCTAGAA
ATAAGTACTAGAAATAAAATTATAAATGTCAATCAAAACAAAGTTAAATCAAAGCTGTGT
ATACAGGGTGTTGAGAAACCCGGTGGAGATGATGATAATGTAAACAAAGGTATTTATGAT
AAAGATTTAAAGACTTATTTTAAAAGAAAACAAACAAAGTGCTTGAAACAGTTGTTTCTA
GATAATCAAGACTGCATAAAAAATAATTTAGATGTTGAAGTAAACATAAACTTAGATGTA
AATAATATAAATAGTATATATGAAAAGAAATCACAAAAATTTTGTGTATGTAAAGATTTT
GGTTATTTTTTCTACTACGACAAAGATGATGATTATCAACATATACATTATTGGGAGCAA
AAAGGTGCTGAATGTTCTCTAGATACATTACTTAGTATTTATGAAGATAACAAAGTATAT
GATGTAAATATAAACGAGCCTGAGTATAATGAAATAGAAACAGTCAAAGAAGACGACCAA
GAACAGAACAGTACAGATGTTTGCAATATATGTTGGATGAGTGTAAATAGTGATATTGAA
AATGAAGATTTAGTCGAAATTAATAAAGAAGCTGAAACCCGTCAAATAACTCAATCTATT
GAACCACAGGTTAAAAATACTGATGAAGGAAAAAGAAACAAACCATTACATTCTCAAGAT
GTTAATAGAGAATTGCAAGATAAGAGAAAAAATTCTTTATCATTAAACGATGTTAGTCCA
GAAGAAACTAAGAAGAAAAAGATGGAAGATGTTATAGAAAATAGAAATATTTCGTGTGGT
TTATGTAAAGCTAAGGTGGTAACTAGTGAATGGGATAATCATGTTAGGGATCATTGTTTT
ATAGCCTGGGAAGAGGGTCAGAAATTTGGTCACGTTGAACAATCGGATGATTTAAAAAAT
GTTATGTCGCGTCAATCGAAACACACAGTATGTGGAGTTTGCAAGACACAAGTCCCCAAA
AGCTCATGGATAGAACATATAGCTAAAGAGCATAATTATATAGCTTGGAAAGATGGAGAT
AAAGTATTGGATGTTGGAGATGAAGAAGCAGTTAAAAGGCATTTGAATAAGTTGGCTAGT
GATGTTAAAAGATTCGAGTGCAAATCCTGCGGCACAAAACGCAAGTGTTCGGACTCTTTC
TTTAAACATATTCAAAAATGTGGAAAATTAGGCGAAGGAATGGATACAGCAGACAATTCG
GTAGGAGCTGAAGACGATACGTCAGTAACATGTGGAGTGTGTCAGAACAAGATGTCAGCT
AACGAATGGCAAAACCACCAATTCAAAGAACACAAATATCTAGCGTGGAAAGCTGGGGAA
CAAGAGTTGGATCTCAATGACACTGAGCATGTGTATTCACATTTATATAATCTGTCCAAA
GATCTCGGCGGACTTCTATGCAGTAAGTGCGGGTGTCGACGCAAGTATGTTAATTCATAT
TTGAAGCACATTGAGAAATGTGATGGGGAAGAGAATCCAAACTGTACATTGGATTCCACT
ATGAACGAAAGTAATGTGTCAATAAACAAAACCTTGGGAGACGATGTGACTGGAGACTTA
GAGGGAATTGTTAGATGTGGAGTGTGCTCAAAGGAAATTGATAGAAAGCAATGGGAGAAT
CATATAAAAAAAGAACACTTCTATAAGGCGTGGCAGGAAGGACAGAAGCCAGTGGTAATT
ATATTATATATGATGCGACGAAACCTCTCAGCATTCCATGGATTCACCGTGCGGGTGCCA
AGCTATCGCGTTGCAGGGAGAGGAGCTGTTATGGAAACAGTTTCCAACAGTCCTTGGGAC
TTGCGGCCCAGGTGTTCTGATAATTTAGATAACGAAGAGGATGTGTATAACCATCTGTAT
GCGATGAGTAAGAAATATAAGGGTCTGGTTTGCAACAACTGCGGCACCAACAGGAAATAT
GTTAAGACATTCCTCCAACATATAGAGTCCTGTAACTCCCAGGACTCGTTTATCACAGAT
GAAGTTCTTAAACAAGAAACCTGTAAATGCGGTGTTTGCGGCGAAGAAGTTCCAAGTAAA
ATGTGGAAGACCCATGCTATGAAGACACATTACAACGTGGCGTGGCTGGATCAACAGACA
CCTATTGATACCAACAATGGAACAGCGGTTGAGAAATGTTTGAAGGAATACAAACAGGCT
TATAATAAATTCGTTTGCAATGTGTGCGGTATAACGCGGGTTTCCGCCGTGGGGTTCTTC
GCCCACGTGTTGCAGTGTGGGAAAACTGAAGAGGAAATCGATAAACATAGAGGCGTCTGC
GACATATGCAATAATAAATATTTACTGATATATAAGAATCAACATATATCGATGCACAGA
GATCAGGAGTACGCGAAACAGAGAAAGCTGGAGCTACAAGTGGAGAAGGAGGAGAAACAA
AAACAACAGAAACACTCTGACGCGTTACCAGAGAAGAGACAAGCTGCTGAGAGAGCCCGT
CATGTTATTGAAAAGTATAAAAAGCAATTTAAGCATAACTGTCCCACTTGCGACTTTGGT
GGCGATTCCGAAGAAGATTTGAAGAAGCACACGTGCTCGAAGACGAAGTATAACTTTAGC
GAGTCCGAAGATTCTCTGCAATTCAGTTCGGAACAAGAATCTGAGGACAGCGACGCTAAC
TGTGAACTGTTAGAAGAAGAACTAGAAAAGCCTCAGAAGAATAATACTAAGAAAAAAAAA
CATTCTGACTCAAATCCTGCTACTGCATTCTTACCGTTTCCCGTCAAAAACACGCAGACG
TATTTAGCTGAGAGTGCAGAGGACTTTCGTGAGAAGTTCTTAACGAGCGACATACTGTAC
CCACAGTGGAGGACATGCGAGTACGAGGTCGTGTCAGATGACCTGCTGACAAATTACTTG
CCGACCTTGGAGGAATCGTGCAAATTACAGTTACAGAAAGACGAATGGATAGCGTTGAAG
AAGTTTGAGTCTGTTAATGATCACAAATGGGTGAGTGCATCTTTCACGGGCGGTTGTATC
CAGTGCGTGTCGTGGTGTCCGCCGCACGTGTCGGACGCGGAGGACGAGCTGGGTCACGTG
TTGAGCGCGGCCGTGCACGTGTCCCGGGACGCGCCACGCCTCCCCGCCGACACGTGTCAC
ACACACCACGCCATGCTGCAGATATGGGACTACGGGGACATGCGCACGAAGCCAAAATTT
GCTCTGGGTATAGCTCTTGATTTTGGGACAATTTGGGCGAAAGATTGGTGTCCATCGGGC
ACGCGTGACATGTTGAACGGAGAGCCGACAACTTTTAAAAGACTTGGTCTTTTATCTATA
GCATGCTCAAACGGTTCAGCGTACATATTATCAGTACCGTATCCTTCAAGCATAACGGAC
GGGGGGAAAAAGATTTTCAACCTAAAGCCAGTCGCGGAGCTGAGATTGACTCGTGGTGAT
CGGCGGAAGTATCAAGCTACAGCTATCAATTGGCCAGCGCAAAAAGGGCATTCCACTATA
GTAGTCGGATATTCTGATGGAACAACCGCCTCGTATAATCTGTCGTGCGATTCTCCTCTC
TTGACCGAAACAGAGGACGGCGTTAAGATATTCTATCCTTATCAAGACGAACGAACACAC
AACACATGTGTCACCGCGGTGACGTCATTTCCTAGTAGCGGCGTGTCGTGCCCGGCGGGT
TCTTCGTCGGCTACAGGCGGCTCGCGGTCCGTGTGTCGCGGAGTCGGCCGCGGCTCTCGC
TCCGCAGTCACCGCTACCTCGGCCTGTTTCATGCCGCACTGGCCCGACCTACTGTTGGCT
GGGAACGACGCTATCGTATATCAAGCTCCGAACGTGTTGTCGTGGGTGGGGAACGGGCGA
CGCCTGGGCTCGCAGCAGGCGTGTGCTGGATGCAACACCTGCGGACGGGTGGCGCTAGTG
GCGCCGCCCGCGGTGCGACTCGTCACTACACACCCCGTGCATAACGACCTTAATAAAATT
ACAGTGGCGCTGTTACAAATGAAACCGCTCGTGGATAAGAAATCCAAGCAGAAGAATGAC
GACCTCGCCACGAGACTTGAGCCGGTGACTTATGAAGACGCCGTCAAGAAATATGGAGTA
GAGTTGAAACTAGCTGAAGATTGTGACAAGAGCTACCTCCAACAGTCAAACAAGCCGAGA
GACCACTACCCAGAGAGGTTCCCGCTCTCAGACGTGCCAGCTATGGCCTTCAACCTGTCT
CCGAAGCAACACAAGAAACTGGCGATCGCCACACATTCTGGATTTATTTTCGTTTTGACT
GTATGA
Protein sequence:
MESTKLLQNDPPEVMVLPPNILEKLGIDLGNIQPCSNGTNAKSGTRQPEDISVLDINAPN
YVPGKNNDEEFDGSSILETANLMKASVFLSPTFTNLKATNGLNDLEFNNFENAIATIECN
VSKPLKRKLQKSNNKINIISEEIIELEKIKDLITSEILVSNNLASVPINTIITEKTINIK
NGAASVPSKEKNNNDCKDNNDKPLNDCKAVENKVANINNGIQDQLDKSIQFQIDSDGLKL
SKEINLINNTNGDIIMYKNHEEKCSDENQDLVRNCDKDLKITSEQQDEHATSSSEEIVTI
IHVINENVDEADVSKDLNFSIPDSVCHFTVTDTIDIVVPEHSELSINYDQENKSCDVKTS
ADEKILGDRKDAKINNKNLEISTRNKIINVNQNKVKSKLCIQGVEKPGGDDDNVNKGIYD
KDLKTYFKRKQTKCLKQLFLDNQDCIKNNLDVEVNINLDVNNINSIYEKKSQKFCVCKDF
GYFFYYDKDDDYQHIHYWEQKGAECSLDTLLSIYEDNKVYDVNINEPEYNEIETVKEDDQ
EQNSTDVCNICWMSVNSDIENEDLVEINKEAETRQITQSIEPQVKNTDEGKRNKPLHSQD
VNRELQDKRKNSLSLNDVSPEETKKKKMEDVIENRNISCGLCKAKVVTSEWDNHVRDHCF
IAWEEGQKFGHVEQSDDLKNVMSRQSKHTVCGVCKTQVPKSSWIEHIAKEHNYIAWKDGD
KVLDVGDEEAVKRHLNKLASDVKRFECKSCGTKRKCSDSFFKHIQKCGKLGEGMDTADNS
VGAEDDTSVTCGVCQNKMSANEWQNHQFKEHKYLAWKAGEQELDLNDTEHVYSHLYNLSK
DLGGLLCSKCGCRRKYVNSYLKHIEKCDGEENPNCTLDSTMNESNVSINKTLGDDVTGDL
EGIVRCGVCSKEIDRKQWENHIKKEHFYKAWQEGQKPVVIILYMMRRNLSAFHGFTVRVP
SYRVAGRGAVMETVSNSPWDLRPRCSDNLDNEEDVYNHLYAMSKKYKGLVCNNCGTNRKY
VKTFLQHIESCNSQDSFITDEVLKQETCKCGVCGEEVPSKMWKTHAMKTHYNVAWLDQQT
PIDTNNGTAVEKCLKEYKQAYNKFVCNVCGITRVSAVGFFAHVLQCGKTEEEIDKHRGVC
DICNNKYLLIYKNQHISMHRDQEYAKQRKLELQVEKEEKQKQQKHSDALPEKRQAAERAR
HVIEKYKKQFKHNCPTCDFGGDSEEDLKKHTCSKTKYNFSESEDSLQFSSEQESEDSDAN
CELLEEELEKPQKNNTKKKKHSDSNPATAFLPFPVKNTQTYLAESAEDFREKFLTSDILY
PQWRTCEYEVVSDDLLTNYLPTLEESCKLQLQKDEWIALKKFESVNDHKWVSASFTGGCI
QCVSWCPPHVSDAEDELGHVLSAAVHVSRDAPRLPADTCHTHHAMLQIWDYGDMRTKPKF
ALGIALDFGTIWAKDWCPSGTRDMLNGEPTTFKRLGLLSIACSNGSAYILSVPYPSSITD
GGKKIFNLKPVAELRLTRGDRRKYQATAINWPAQKGHSTIVVGYSDGTTASYNLSCDSPL
LTETEDGVKIFYPYQDERTHNTCVTAVTSFPSSGVSCPAGSSSATGGSRSVCRGVGRGSR
SAVTATSACFMPHWPDLLLAGNDAIVYQAPNVLSWVGNGRRLGSQQACAGCNTCGRVALV
APPAVRLVTTHPVHNDLNKITVALLQMKPLVDKKSKQKNDDLATRLEPVTYEDAVKKYGV
ELKLAEDCDKSYLQQSNKPRDHYPERFPLSDVPAMAFNLSPKQHKKLAIATHSGFIFVLT
V