DPGLEAN13348 in OGS1.0

New model in OGS2.0: DPOGS205304
Genomic Position: scaffold606:+ 65378-68953
CDS Length: 2919
Paired RNAseq reads: 1144
Single RNAseq reads: 2781
Migratory profiles: Query via corresponding ESTs
Best Bombyx hit: BGIBMGA011044 (0.0)
Best Drosophila hit: Mms19 (4e-57)
Best Human hit: MMS19 nucleotide excision repair protein homolog (1e-68)
Best NR hit (blastp): hypothetical protein TcasGA2_TC011355 [Tribolium castaneum] (7e-111)
Best NR hit (blastx): hypothetical protein TcasGA2_TC011355 [Tribolium castaneum] (1e-102)
Gene Ontology terms
GO:0005515 protein binding
GO:0006289 nucleotide-excision repair
GO:0003713 transcription coactivator activity
GO:0045893 positive regulation of transcription, DNA-dependent
GO:0030331 estrogen receptor binding
GO:0045449 regulation of transcription
GO:0000160 two-component signal transduction system (phosphorelay)
GO:0030159 receptor signaling complex scaffold activity
GO:0005634 nucleus
GO:0006350 transcription
GO:0005675 holo TFIIH complex
GO:0030674 protein binding, bridging
GO:0009725 response to hormone stimulus
InterPro families
IPR011989 Armadillo-like helical
IPR016024 Armadillo-type fold
Orthology group: MCL15415

Nucleotide sequence:

ATGGAGGCTATTTGGTATAAATCGGAGCTAGCCGATGAAATAATTAAAAATAATGATATA
TTTAATCAAACCCAAAGCATAATAGCAGATGTCGTTTCTGGAAAATTTGATATCACTGTC
CTGGTAGAAAATATGGCTGGTGTGCTAACCAGCAAAGAAGTAGAAGACAGAGAACGAGGC
ATGAGGTTTTTCACAAAAATCTTAAAGGAAATTCCTGCTGATTATTTGACAGAGATGCAA
ATTAAATTCATATCAAAATTCTATGTAGATCGTCTTAAAGATAACCACAGAGTGATACCG
GCAGTTTTAGAAGGCTATTTGTCTGTTATTGATATGAAACATTATAACGTGCAAAGTAGT
GGTGAATATCTCACTGCTTTATTCCGGGAGGTGGCATGCCAGTCACAAGTAAGGCAAGAT
AGGTACAACATATACCTAACTATTCAAAAATTGTTTGCCAAGAATGTAGAATACATGAAA
ACATTAGGCCCGGATTTCGTATATGGATTCATATCGTCTATGGATGGAGAGAGGGATCCA
CGAAACTTGCTTTTCTTGTTTAATTTCTTACCGAACTTCCTAAGTAACATTCCATTGGGA
CATTTAGTTGAAGAAATGTTTGAAGTTATATCATGTTATTATCCCATCGACTTTCATCCG
TCTCCTGACGACCCAGCAGCTGTGTCGAGAAACGATCTAGCTGCTGTATTATGTCCTTGT
TTATGCGCGATTCCAGAGTTTGCTGAACATTGTCTTGTATTATTAATAGAGAAACTGGAT
TCTAACCTTCGAGTAGCTAAAATTGATTCTCTTATGCTATTGGCAGCTAGTTGCAAGACC
TTTAAATATGAAAGTTATGGACCATTTTTGAAAGCATTGTGGTCTTCACTTCAAAGAGAA
CTTACACATAAAACTGACGACGAGTTAAAGTTAGCTGCTCATGAAGCTCTATCAGCTTTA
GTATCAAAATTGTCCACAAAAGCAGAAACCGATCAATCATTCGAGAACTTCATTAAAGGA
ATACTTATATCCATGCAAAGTGCTATAGCCGAGTCTTCAACTGTTATCCAGTTTATGGAA
GCAGTGAAAATATTATTAACGGCCGCCAATGCTTCAAAACAATCATGTGTTCTGATAATA
AAAGCGATGATACCAGCCATTTTAGCATACTATGAATTCAAACCTTTGGTGAAATTACAA
ATTTCTTGTTTAGATGTTTTAGGTGATATGTATGAATTGGCTGATCATTGGGGGGTTTTG
GATGAAATGGAAAAAGAGGTAAATGAAATACCACAGTTATGCCTTACAGCTGTCAGTGAG
AGGGTGAAAGACTATCAAGTATCTGGCTTTAAAACTCTAATAAGAGTTAAAAATGTTCTT
CACATAGATTTGGTGCTACCATTTGTGGAAATTCTGATCTATAACATTCAACATTCTCAA
GATAGCGAAATATTGAGTGTTTGTGTTGAAACAGTACATGCTATAGCAAGGAAATATCCA
GAGCTTATAATGACTTTGGTCATAAAAGGGAAGTGTGACTTGGAAAACCTAACACAGGAC
AAAACAGCGCTACAAAAAAGACTAGATCTGCTCTCAAATCTTGCCAGTATAGATGATTTT
ACCAAAATTATTATAGAGGAAATGTTGAAAGTAATAACAACAAATGATGAGGAAGCTTCT
AAAGTTGTCAAAGCACTCAGTGGATCTATATCCAATGTAAGTTTGTATACAGAAGAAAAA
GTCGCACAGATAGAAAGTGATCATGGCCTTATAAGTTCCATTATGGCCTGGTTAACGAAA
TCAATTTTGGATGAATCTCATGAATCGTTGAATCACGGCTGTACATTGATATCGAACACA
ATATGCAGTTTGCCGCCTGAAAAGCAAGCTAATATTTTATCCAAACATTCAAAGGCTATT
TTAGAGAAGTGTGATTCAAATGAGATGTACTTCTTAATATTAGAATGTTTGTATCGTTCT
ATAAGTCCAACCATCTATGACACAAATTTCAAGGACATTATGGGTTTGGCCTTAAAACTA
GCTTTAAACTGTGAGAACCAGTTACTTAGAACGAAAGCTTGTTGTATGGTAGCCCATTTT
CTTAATAAGGCTCAGAGTGGTCCAAATTTTGAGATCTTAAATGAGGTTTTGAAATCTTAT
TTAACATCATGTAGCAGAGATAATGTTAATATATTACCAAGACTAATAGAATTGTATGGC
TGGATAACAAAGGCGCTTATTATGAGAGGCAATGACCTCTTCCAATTTTGGCTTTCCAAG
ATATTGATTTCTATTTCAACAAGCGAATGCAGTGTTGAGGCATCAGAAGCTATCAAAATA
ATAATGACGGATTCTGAGAATTGTCTTAACGCTAGACATCATTGTAGGACAAGTTTGTTA
TACAGGCAGAGGATGTTCCAGACATTCGTCAATTTGACTGAAAAACTTGGACCACCAAAT
TCTGATTCCGAAGAGGCCTTTTACTTAAGTTGGGGTTATGTTTTAGAGAAAACGCCGAAA
AGCATACTTAATAGTCAAATAAATAAGGTTACACCTTTGGTTATAGATGCTTTAGTGTAT
GACAATAAAGAATTGTTGAAGGTGATGTTAGAAGTCCTAATACATTTTGTGCAATCAAAA
AACATAACAGTGGGACACAGTTTACAAACAATTTTACCCAGGCTAATAAATTTAACTACA
TATGTTAAATGTATGGATGTCAGAATAAAAAGTCTGCAATGTCTGTACGAAATCGCAAAT
TCTTACCAGACAAGATTGCTTTTACCTCATAAGCAGGATATTTTAATCGATTTAGCGCCA
TCTCTTGACGATAAGAAGCGACTTGTGAGGAATATGGCGGTTAAGGCCCGAACAAGATGG
TACTTAGTTGGAGCTCCAGGCGAAAGTAAAGAAGATTAA

Protein sequence:

MEAIWYKSELADEIIKNNDIFNQTQSIIADVVSGKFDITVLVENMAGVLTSKEVEDRERG
MRFFTKILKEIPADYLTEMQIKFISKFYVDRLKDNHRVIPAVLEGYLSVIDMKHYNVQSS
GEYLTALFREVACQSQVRQDRYNIYLTIQKLFAKNVEYMKTLGPDFVYGFISSMDGERDP
RNLLFLFNFLPNFLSNIPLGHLVEEMFEVISCYYPIDFHPSPDDPAAVSRNDLAAVLCPC
LCAIPEFAEHCLVLLIEKLDSNLRVAKIDSLMLLAASCKTFKYESYGPFLKALWSSLQRE
LTHKTDDELKLAAHEALSALVSKLSTKAETDQSFENFIKGILISMQSAIAESSTVIQFME
AVKILLTAANASKQSCVLIIKAMIPAILAYYEFKPLVKLQISCLDVLGDMYELADHWGVL
DEMEKEVNEIPQLCLTAVSERVKDYQVSGFKTLIRVKNVLHIDLVLPFVEILIYNIQHSQ
DSEILSVCVETVHAIARKYPELIMTLVIKGKCDLENLTQDKTALQKRLDLLSNLASIDDF
TKIIIEEMLKVITTNDEEASKVVKALSGSISNVSLYTEEKVAQIESDHGLISSIMAWLTK
SILDESHESLNHGCTLISNTICSLPPEKQANILSKHSKAILEKCDSNEMYFLILECLYRS
ISPTIYDTNFKDIMGLALKLALNCENQLLRTKACCMVAHFLNKAQSGPNFEILNEVLKSY
LTSCSRDNVNILPRLIELYGWITKALIMRGNDLFQFWLSKILISISTSECSVEASEAIKI
IMTDSENCLNARHHCRTSLLYRQRMFQTFVNLTEKLGPPNSDSEEAFYLSWGYVLEKTPK
SILNSQINKVTPLVIDALVYDNKELLKVMLEVLIHFVQSKNITVGHSLQTILPRLINLTT
YVKCMDVRIKSLQCLYEIANSYQTRLLLPHKQDILIDLAPSLDDKKRLVRNMAVKARTRW
YLVGAPGESKED
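
The record's numbers can be cross-checked against each other: a CDS of 2919 nt is 973 codons, i.e. 972 residues plus a stop codon, which matches the length of the protein sequence above. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1). The function names `translate` and `expected_protein_length` are hypothetical helpers introduced here for illustration; they are not part of the database or of any library.

```python
# Standard genetic code (NCBI table 1), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    # First and last lines of the nucleotide sequence in this record.
    first_line = "ATGGAGGCTATTTGGTATAAATCGGAGCTAGCCGATGAAATAATTAAAAATAATGATATA"
    last_line = "TACTTAGTTGGAGCTCCAGGCGAAAGTAAAGAAGATTAA"
    print(translate(first_line))          # MEAIWYKSELADEIIKNNDI
    print(translate(last_line))           # YLVGAPGESKED*
    print(expected_protein_length(2919))  # 972
```

Passing the full nucleotide sequence to `translate` and stripping the trailing `*` should reproduce the protein sequence listed in this record. The genomic interval (65378-68953) spans 3576 bp, more than the 2919 nt CDS, so the gene model presumably contains introns that are not part of the sequence shown here.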