New model in OGS2.0 | DPOGS201595  |
---|---|
Genomic Position | scaffold609:19653-33975 (- strand) |
CDS Length (bp) | 3084 |
Paired RNAseq reads   | 5238 |
Single RNAseq reads   | 13446 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA010670 (8e-06) |
Best Drosophila hit   | CG33097, isoform A (6e-33) |
Best Human hit | transcription elongation regulator 1 isoform 2 (1e-63) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC004501 [Tribolium castaneum] (3e-135) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC004501 [Tribolium castaneum] (2e-89) |
GeneOntology terms    | GO:0016363 nuclear matrix; GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0006350 transcription; GO:0045449 regulation of transcription; GO:0005634 nucleus; GO:0005515 protein binding |
InterPro families    | IPR002713 FF domain; IPR001202 WW/Rsp5/WWP |
Orthology group | MCL12933 |
Nucleotide sequence:
ATGGAAGACGAATACAGTATAGACCCTCAGGACTCTACTATGAGTGAATTAAACCGTAGC
GATGTTAGTGAAATTACGGATTCTTCTAAGGATTTTTGTGAGCCGGGGGGTTATGATGAA
AATTACGAAGATGAGGAAGGCGCGAACGGCTTCAGAGGCGGTAGAGGCGGTCGCGGGACG
TTCAGGGGTCGCGGGCGAGGTCCGCCGCCTGGTTGGATGAATGGGCCTCCGCCCATGCGC
TTCAGGGGCCGCGGCTACGGTCCCGGCGGGCCTCCTATGAGAGGTCGCGGATTCTTCCGT
GGGCGCGGTGGTCCGAGAGGATATGGTCCCAATAATGGTCCTAATTACGAAAATAACTGG
GGACCCATGGGTCCCCCGCCGCCTGGTATGATGGGAGGTCCTCCACCTTATGGACCACCT
CCTGGTATGATGCCGCCAGGTGGACCGATGGGCCCCCCGCCCAACATGATGGGGCAGCCG
CCGCCCTTCGGACCTCCAGGGATGCCACCACCAAATATGCCCGCTCCGGAGCTGTGGGTT
GAGACCAAGTCAGATGAGGGCAAGTCGTATTACTACCACGCCAGGACTAGAGAGACGACC
TGGACCAGACCCCAGGAGAGTCCCACGTGCAAGGTCATCACGCAGGCTGAGGTGGATGTC
ATGACAGCTGCTGGCCAGTATCCCGGCATGAATCAGTCGATGCCCATGAACGGTCCCATG
GGCGGGATGGGCGGTCCCATGGGCGCGCCAATGAACGGCCCCATGGGCATGATGGGCATG
ATGCCGCCAGGAGTCGGCCACGGGCCAACACCGGGGTCCGTCCCGCCCTTCATGAACCAA
CCGCCACCCTGGGTCAAGGATAATAACCAGATGCAGTCGAAGCTGGACAAACAGGACAGC
TCACCTGATGATGAAGCGCCCCCGGGCGAGGCACCCTCACAGGCTACACAACCACCCGGC
ACGGGCCCCCTAGGCCCAGCGCCGGGTGCCGGCGGGCCGTGGGGTTGGGGCTGGGCCCCG
CCGCTGGTGGCGCAGCCCCCTGGCCTGGCGGCCGCCTCAGCAGCCGTACCTGACACTGGC
GCAGCAACTCAGACACAGCCCATCGCGGTCATGGGAAATGACGCACAACCGGACAGCACA
GTGACGCCCAAGAAAGAGGAAACGGTGATACCTCCCGAACTGTCTCTACGTGCTGGGGAG
TGGACCACACACAGGGCTCCGGACGGCAGGCCATACTACTATCACGCTGGCACCAGGCAG
AGTGTGTGGGAGAAACCGCAGCCCCTCAAGGAGTTTGAAGAACTACAAAACAAAATAGCC
AAAGAGAAAGGCGAAAAGCTGGACGTCAAGAAGGACAGCAGGGTTATTGACGACGGCAAA
ATAGAAGTTATAGATGTAGAAGCTCACGCTGAGGCCGCGGCAGCTGCAGAGGCTGCTGAG
AGGGAGAGATTAGAGAGAGAGCGGCTGGAGAGAGAGAGAATAGAGAAGGAGAGGTTGGAG
AAAGAAAGGTTAGAGAAGGAGAAACAGGAGAAGGAGAAGGCTAAGACGGATAAGAGTAGA
CCGGTTTCAAGCACTCCCATATCTGGAACACCTTGGTGCGTTGTATGGACGGGTGACGGC
AGAGTGTTCTTCTACAATCCGACGGCGCGTCTGTCAGTGTGGGAGCGCCCGGCACAGCTG
GCGGGGAGAGCGGACGTGGATCAAGCGGTGTCTCACCCGCCCCACCAGAGGGATCAGCAG
CGGAAGGAACCGCCAGCGACCACGACCGTCACGCCGGCCAAGAACGCTAACGGGGAACTG
AAGAGGGGAGCGTCCGACTCATCCGACTCGGAGACTGAACCGGCCAAGAAGGCGAAATCC
GAAGAAACCAAGAAGAAGTCTGGCGTGTCAGCCGGCGTGATAGATATGGGCAAGGAGGCG
GCCAGGGAGGCGATGGCCAGGGCGGAACGCGAGAGGGCGCTGGTGCCGTTCGAACAGCGC
GTGAGGGCCTTCCTTCAGATGCTGCACGAGAGCGACGTGTCAGCGTTCTCGCCGTGGGAG
AAGGAACTGCATAAGATTGTGTTCGACAGCAGATATCTGCTGCTAGAGTCGAAGGAGAGG
AAACAGGTGTTCGATAAGTACGTGAGGGAGCGAGCTGAGGAGGAGCGTAAAGAGAAGAAG
AACAGGATCCAGCAGAAGAAGCAGGCCTTCAGGGCGCTCATGGACGAAGCCAAGCTGCAC
TCCAAATCTTCCTTCACCGAGTTCTCCGGCAAGTACAGCAGGGATGAGAGGTTCAAAAAT
ATTGAGAAGATGAGGGATAGGGAGACTTACTTCAACGAGTACATCGCTGAGGTCCGGAAG
AAGGAGAAGGATGACAAGGACAGGAAGAGGGAACAGGCCAAAACGGAGTACTTAGCGCTT
TTGAAAGAAAAGAGTGTTGACAGGCACTCTAGATGGTTGGACGTTAAGAAGAAGATAGAC
TCGGACGCTAGGTACAAGGCCGTGGAGAGTAGCTCGCTGAGGGAGGACTACTTCAGGGAG
TACTGCAAGATGGTTAAGGAGGAGAAGAAGAAGGAGAAGGACGGCAAGGAGAAGGAACGT
GAGAGGGGCAATAAGAAGGACAAGAAGGACAAAGAGAGGGAGAAAGAGAAGGACCGCGAG
AAGGAGACGAAGAAGGAGAAGAAGAAAGAGAAGCCTGCTGACAAACAGTTGGACCAGTCC
ACTGAGGATGAAAAGAAGCAGCCCACCCCCCCACCGGACCAGTGGGCCGAGATCCTGGGG
ATACCGGGGGAAGAGAAGGAGAAAGAGAAGGAAAAGGAGAGGGAGAGGGAGAAGAACGCC
AAAGAAGCTAAGACGAAGGACAAGAAGGAGTCTGAGAATAGCGAGATGGAACCACCGTCG
TCCGAAAAGGAGATGCTGTCACCGAAACAGCTGAGATCCTCGAAGAAAGATCCGCAGAAT
CAGGAGAAGAAATCCCCAATAAAGTCACCGCCCAAGAAGAGAAAGCAGGAGTTCAAGTCG
CCGGAGCCGGAGGCGGAGAAGAAGGGCAAGAAGAAGACTGAGAAGACGGAGAAGAGGCGG
CGGAAGAAGTCCGAGAAAGAATAG
Protein sequence:
MEDEYSIDPQDSTMSELNRSDVSEITDSSKDFCEPGGYDENYEDEEGANGFRGGRGGRGT
FRGRGRGPPPGWMNGPPPMRFRGRGYGPGGPPMRGRGFFRGRGGPRGYGPNNGPNYENNW
GPMGPPPPGMMGGPPPYGPPPGMMPPGGPMGPPPNMMGQPPPFGPPGMPPPNMPAPELWV
ETKSDEGKSYYYHARTRETTWTRPQESPTCKVITQAEVDVMTAAGQYPGMNQSMPMNGPM
GGMGGPMGAPMNGPMGMMGMMPPGVGHGPTPGSVPPFMNQPPPWVKDNNQMQSKLDKQDS
SPDDEAPPGEAPSQATQPPGTGPLGPAPGAGGPWGWGWAPPLVAQPPGLAAASAAVPDTG
AATQTQPIAVMGNDAQPDSTVTPKKEETVIPPELSLRAGEWTTHRAPDGRPYYYHAGTRQ
SVWEKPQPLKEFEELQNKIAKEKGEKLDVKKDSRVIDDGKIEVIDVEAHAEAAAAAEAAE
RERLERERLERERIEKERLEKERLEKEKQEKEKAKTDKSRPVSSTPISGTPWCVVWTGDG
RVFFYNPTARLSVWERPAQLAGRADVDQAVSHPPHQRDQQRKEPPATTTVTPAKNANGEL
KRGASDSSDSETEPAKKAKSEETKKKSGVSAGVIDMGKEAAREAMARAERERALVPFEQR
VRAFLQMLHESDVSAFSPWEKELHKIVFDSRYLLLESKERKQVFDKYVRERAEEERKEKK
NRIQQKKQAFRALMDEAKLHSKSSFTEFSGKYSRDERFKNIEKMRDRETYFNEYIAEVRK
KEKDDKDRKREQAKTEYLALLKEKSVDRHSRWLDVKKKIDSDARYKAVESSSLREDYFRE
YCKMVKEEKKKEKDGKEKERERGNKKDKKDKEREKEKDREKETKKEKKKEKPADKQLDQS
TEDEKKQPTPPPDQWAEILGIPGEEKEKEKEKEREREKNAKEAKTKDKKESENSEMEPPS
SEKEMLSPKQLRSSKKDPQNQEKKSPIKSPPKKRKQEFKSPEPEAEKKGKKKTEKTEKRR
RKKSEKE
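
The CDS and protein listed above can be checked against each other: a 3084-nucleotide CDS holds 1028 codons, which is 1027 residues plus the terminal stop codon, and this matches the length of the protein sequence. The sketch below is a minimal illustration of that check, assuming the standard genetic code; the helper name `translate` is hypothetical and is not part of the source record. It is applied here only to the first and last stretches of the nucleotide sequence.

```python
# Minimal sketch: translate a CDS with the standard genetic code and
# compare the result with the record's protein sequence.
# `translate` is a hypothetical helper written for this illustration.

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nucleotides of the CDS and the final 24 nucleotides.
start = translate("ATGGAAGACGAATACAGTATAGACCCTCAG")
end = translate("CGGAAGAAGTCCGAGAAAGAATAG")
print(start)  # MEDEYSIDPQ
print(end)    # RKKSEKE*

# Expected protein length for a 3084 bp CDS ending in a stop codon.
print(3084 // 3 - 1)  # 1027
```

Passing the full nucleotide sequence (line breaks are stripped by the helper) should reproduce the whole protein followed by `*`, under the same standard-code assumption.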