New model in OGS2.0 | DPOGS205304  |
---|---|
Genomic Position | scaffold606:+ 65378-68953 |
CDS Length | 2919 |
Paired RNAseq reads   | 1144 |
Single RNAseq reads   | 2781 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA011044 (0.0) |
Best Drosophila hit   | Mms19 (4e-57) |
Best Human hit | MMS19 nucleotide excision repair protein homolog (1e-68) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC011355 [Tribolium castaneum] (7e-111) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC011355 [Tribolium castaneum] (1e-102) |
GeneOntology terms    | GO:0005515 protein binding; GO:0006289 nucleotide-excision repair; GO:0003713 transcription coactivator activity; GO:0045893 positive regulation of transcription, DNA-dependent; GO:0030331 estrogen receptor binding; GO:0045449 regulation of transcription; GO:0000160 two-component signal transduction system (phosphorelay); GO:0030159 receptor signaling complex scaffold activity; GO:0005634 nucleus; GO:0006350 transcription; GO:0005675 holo TFIIH complex; GO:0030674 protein binding, bridging; GO:0009725 response to hormone stimulus |
InterPro families    | IPR011989 Armadillo-like helical; IPR016024 Armadillo-type fold |
Orthology group | MCL15415 |
Nucleotide sequence:
ATGGAGGCTATTTGGTATAAATCGGAGCTAGCCGATGAAATAATTAAAAATAATGATATA
TTTAATCAAACCCAAAGCATAATAGCAGATGTCGTTTCTGGAAAATTTGATATCACTGTC
CTGGTAGAAAATATGGCTGGTGTGCTAACCAGCAAAGAAGTAGAAGACAGAGAACGAGGC
ATGAGGTTTTTCACAAAAATCTTAAAGGAAATTCCTGCTGATTATTTGACAGAGATGCAA
ATTAAATTCATATCAAAATTCTATGTAGATCGTCTTAAAGATAACCACAGAGTGATACCG
GCAGTTTTAGAAGGCTATTTGTCTGTTATTGATATGAAACATTATAACGTGCAAAGTAGT
GGTGAATATCTCACTGCTTTATTCCGGGAGGTGGCATGCCAGTCACAAGTAAGGCAAGAT
AGGTACAACATATACCTAACTATTCAAAAATTGTTTGCCAAGAATGTAGAATACATGAAA
ACATTAGGCCCGGATTTCGTATATGGATTCATATCGTCTATGGATGGAGAGAGGGATCCA
CGAAACTTGCTTTTCTTGTTTAATTTCTTACCGAACTTCCTAAGTAACATTCCATTGGGA
CATTTAGTTGAAGAAATGTTTGAAGTTATATCATGTTATTATCCCATCGACTTTCATCCG
TCTCCTGACGACCCAGCAGCTGTGTCGAGAAACGATCTAGCTGCTGTATTATGTCCTTGT
TTATGCGCGATTCCAGAGTTTGCTGAACATTGTCTTGTATTATTAATAGAGAAACTGGAT
TCTAACCTTCGAGTAGCTAAAATTGATTCTCTTATGCTATTGGCAGCTAGTTGCAAGACC
TTTAAATATGAAAGTTATGGACCATTTTTGAAAGCATTGTGGTCTTCACTTCAAAGAGAA
CTTACACATAAAACTGACGACGAGTTAAAGTTAGCTGCTCATGAAGCTCTATCAGCTTTA
GTATCAAAATTGTCCACAAAAGCAGAAACCGATCAATCATTCGAGAACTTCATTAAAGGA
ATACTTATATCCATGCAAAGTGCTATAGCCGAGTCTTCAACTGTTATCCAGTTTATGGAA
GCAGTGAAAATATTATTAACGGCCGCCAATGCTTCAAAACAATCATGTGTTCTGATAATA
AAAGCGATGATACCAGCCATTTTAGCATACTATGAATTCAAACCTTTGGTGAAATTACAA
ATTTCTTGTTTAGATGTTTTAGGTGATATGTATGAATTGGCTGATCATTGGGGGGTTTTG
GATGAAATGGAAAAAGAGGTAAATGAAATACCACAGTTATGCCTTACAGCTGTCAGTGAG
AGGGTGAAAGACTATCAAGTATCTGGCTTTAAAACTCTAATAAGAGTTAAAAATGTTCTT
CACATAGATTTGGTGCTACCATTTGTGGAAATTCTGATCTATAACATTCAACATTCTCAA
GATAGCGAAATATTGAGTGTTTGTGTTGAAACAGTACATGCTATAGCAAGGAAATATCCA
GAGCTTATAATGACTTTGGTCATAAAAGGGAAGTGTGACTTGGAAAACCTAACACAGGAC
AAAACAGCGCTACAAAAAAGACTAGATCTGCTCTCAAATCTTGCCAGTATAGATGATTTT
ACCAAAATTATTATAGAGGAAATGTTGAAAGTAATAACAACAAATGATGAGGAAGCTTCT
AAAGTTGTCAAAGCACTCAGTGGATCTATATCCAATGTAAGTTTGTATACAGAAGAAAAA
GTCGCACAGATAGAAAGTGATCATGGCCTTATAAGTTCCATTATGGCCTGGTTAACGAAA
TCAATTTTGGATGAATCTCATGAATCGTTGAATCACGGCTGTACATTGATATCGAACACA
ATATGCAGTTTGCCGCCTGAAAAGCAAGCTAATATTTTATCCAAACATTCAAAGGCTATT
TTAGAGAAGTGTGATTCAAATGAGATGTACTTCTTAATATTAGAATGTTTGTATCGTTCT
ATAAGTCCAACCATCTATGACACAAATTTCAAGGACATTATGGGTTTGGCCTTAAAACTA
GCTTTAAACTGTGAGAACCAGTTACTTAGAACGAAAGCTTGTTGTATGGTAGCCCATTTT
CTTAATAAGGCTCAGAGTGGTCCAAATTTTGAGATCTTAAATGAGGTTTTGAAATCTTAT
TTAACATCATGTAGCAGAGATAATGTTAATATATTACCAAGACTAATAGAATTGTATGGC
TGGATAACAAAGGCGCTTATTATGAGAGGCAATGACCTCTTCCAATTTTGGCTTTCCAAG
ATATTGATTTCTATTTCAACAAGCGAATGCAGTGTTGAGGCATCAGAAGCTATCAAAATA
ATAATGACGGATTCTGAGAATTGTCTTAACGCTAGACATCATTGTAGGACAAGTTTGTTA
TACAGGCAGAGGATGTTCCAGACATTCGTCAATTTGACTGAAAAACTTGGACCACCAAAT
TCTGATTCCGAAGAGGCCTTTTACTTAAGTTGGGGTTATGTTTTAGAGAAAACGCCGAAA
AGCATACTTAATAGTCAAATAAATAAGGTTACACCTTTGGTTATAGATGCTTTAGTGTAT
GACAATAAAGAATTGTTGAAGGTGATGTTAGAAGTCCTAATACATTTTGTGCAATCAAAA
AACATAACAGTGGGACACAGTTTACAAACAATTTTACCCAGGCTAATAAATTTAACTACA
TATGTTAAATGTATGGATGTCAGAATAAAAAGTCTGCAATGTCTGTACGAAATCGCAAAT
TCTTACCAGACAAGATTGCTTTTACCTCATAAGCAGGATATTTTAATCGATTTAGCGCCA
TCTCTTGACGATAAGAAGCGACTTGTGAGGAATATGGCGGTTAAGGCCCGAACAAGATGG
TACTTAGTTGGAGCTCCAGGCGAAAGTAAAGAAGATTAA
Protein sequence:
MEAIWYKSELADEIIKNNDIFNQTQSIIADVVSGKFDITVLVENMAGVLTSKEVEDRERG
MRFFTKILKEIPADYLTEMQIKFISKFYVDRLKDNHRVIPAVLEGYLSVIDMKHYNVQSS
GEYLTALFREVACQSQVRQDRYNIYLTIQKLFAKNVEYMKTLGPDFVYGFISSMDGERDP
RNLLFLFNFLPNFLSNIPLGHLVEEMFEVISCYYPIDFHPSPDDPAAVSRNDLAAVLCPC
LCAIPEFAEHCLVLLIEKLDSNLRVAKIDSLMLLAASCKTFKYESYGPFLKALWSSLQRE
LTHKTDDELKLAAHEALSALVSKLSTKAETDQSFENFIKGILISMQSAIAESSTVIQFME
AVKILLTAANASKQSCVLIIKAMIPAILAYYEFKPLVKLQISCLDVLGDMYELADHWGVL
DEMEKEVNEIPQLCLTAVSERVKDYQVSGFKTLIRVKNVLHIDLVLPFVEILIYNIQHSQ
DSEILSVCVETVHAIARKYPELIMTLVIKGKCDLENLTQDKTALQKRLDLLSNLASIDDF
TKIIIEEMLKVITTNDEEASKVVKALSGSISNVSLYTEEKVAQIESDHGLISSIMAWLTK
SILDESHESLNHGCTLISNTICSLPPEKQANILSKHSKAILEKCDSNEMYFLILECLYRS
ISPTIYDTNFKDIMGLALKLALNCENQLLRTKACCMVAHFLNKAQSGPNFEILNEVLKSY
LTSCSRDNVNILPRLIELYGWITKALIMRGNDLFQFWLSKILISISTSECSVEASEAIKI
IMTDSENCLNARHHCRTSLLYRQRMFQTFVNLTEKLGPPNSDSEEAFYLSWGYVLEKTPK
SILNSQINKVTPLVIDALVYDNKELLKVMLEVLIHFVQSKNITVGHSLQTILPRLINLTT
YVKCMDVRIKSLQCLYEIANSYQTRLLLPHKQDILIDLAPSLDDKKRLVRNMAVKARTRW
YLVGAPGESKED
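
Consistency check (illustrative sketch, not part of the original record): the fields above can be cross-checked with simple arithmetic. The function name `cds_summary` and its dictionary keys are hypothetical, and the protein length of 972 residues is counted from the protein sequence listed above. Reading the difference between the genomic span and the CDS length as intronic sequence is an assumption, since the record does not list exon boundaries.

```python
def cds_summary(start, end, cds_length, protein_length):
    """Cross-check genomic span, CDS length, and protein length.

    Coordinates are treated as 1-based and inclusive (an assumption).
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    span = end - start + 1          # bases covered on the scaffold
    codons = cds_length // 3        # codons in the CDS, stop included if present
    return {
        "genomic_span": span,
        # Bases inside the span that are not coding (presumably introns).
        "non_coding_within_span": span - cds_length,
        "codons": codons,
        # True when the CDS holds exactly one codon more than the protein,
        # which is what a terminal stop codon would account for.
        "stop_codon_included": codons == protein_length + 1,
    }


# Values taken from this record: scaffold606:+ 65378-68953, CDS Length 2919,
# and 972 residues counted in the protein sequence.
summary = cds_summary(65378, 68953, 2919, 972)
print(summary)
```

With these values the span is 3576 bases, 657 of which lie outside the CDS, and the 973 codons match 972 residues plus the terminal TAA stop codon.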