DPGLEAN16939 in OGS1.0

New model in OGS2.0  DPOGS208174
Genomic Position  scaffold472:- 52232-57995
CDS Length  1869
Paired RNAseq reads  411
Single RNAseq reads  1113
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010261 (1e-40)
Best Drosophila hit  Msh6 (2e-137)
Best Human hit  DNA mismatch repair protein Msh6 (5e-120)
Best NR hit (blastp)  PREDICTED: similar to DNA mismatch repair protein muts [Nasonia vitripennis] (7e-156)
Best NR hit (blastx)  PREDICTED: similar to DNA mismatch repair protein muts [Nasonia vitripennis] (2e-149)
GeneOntology terms
GO:0006281 DNA repair
GO:0006301 postreplication repair
GO:0005524 ATP binding
GO:0030983 mismatched DNA binding
GO:0000710 meiotic mismatch repair
InterPro families
IPR000432 DNA mismatch repair protein MutS, C-terminal domain
IPR007696 DNA mismatch repair protein MutS, core
IPR007861 DNA mismatch repair protein MutS, clamp
IPR007860 DNA mismatch repair protein MutS, connector
IPR015536 DNA mismatch repair protein MutS-homologue MSH6
Orthology group  MCL14375

Nucleotide sequence:

ATGGGTTTCATAACAAAATATAATGAATACAATGGCGTTAGCAAGTACGGCGTTTGTTTC
GTAGATACGACGACAGGACAGTTCTACATCGGTCAGTTTGAGGATGACAAACATTCATCT
CGTCTCCTCACCACCGTTGCACATTATCCGCCAGCTTTAATTGTATTCGATCGTAAAACA
ACAAGTGCTCGTACAAGTAGACTGCTGTCAACGCATTGTCACAGCGCGAGACGTGAACCC
ACTACACTGTGGGCTCCCGAAAAGACTTTGAAGATTCTAGCTGAGAAATATTATAAAACT
GACGGCGACGGAAAATGGCCTACCGGGATTACGCCTTTCCTACACGAGGAGCAAAAATGT
CATCCGGACTCCAGAGCTATATTTTATGAAGAAAAAACTTATTCGAAGAGAAAAGTATTG
GATTTCATACTATTGTTGAACGGGTTCACGTCTATATTGAAGCTGGTTGACTTATTCTCC
GATGTGGATGCAGAGTTACTGAAGAAATTAACCCAATTTGCTCCGGAAGGCAGATTTCCT
GATTATAGAGATACTTTGAAATTTTTCAAGGAGGGTTTCAACCAACAAGAGGCGGAGAAA
GAAGGTCGTATACTACCTGGTAGCGGTGTTGACGCAGACTACGACAACACTATACAACTC
ATACAGAACATACAGGATGAATTGAAGGAATACTTGAGTGAGCAGGAGAGATACTTCAAA
TGTCGGTTAACGTATGTTGGAAGTGATAAGAAACGTTATCAAATAGAAGTTCCACAGAGC
GCAGCGGGGAAGGCAAATTCTGATTATCATCTAGAAGGTGCTAGGAAAGGATTCAAGAGA
TATTCAACAGTTGAAACAAAGGATCTGCTGGCGCGAATGATAGCCGCCGAGGAAAAGAAA
AGTAACGTACTGAAAGATCTTAGCAGACGGATGTTCGAGAAGTTCTCATCGCATCAGCAC
CAGTGGGAAATGGCCACCAAATGTGTCGCCACTATCGATATATTGTTAGCATTCACAGAG
TTCGCTAGGCAACAGACTGGGGATATCTGTCTACCGGAAATCACGTACAATAAGGACCAA
GAGCCCTACATAGACATAGTGGAGGGTCGCCACCCGTGTATTTCTATACCAGAGTTCATT
CCTAATGATACGAGGCTGGGTGTTGACAACCCTCGCCTGCTGCTGCTGACTGGTCCCAAC
ATGGGCGGCAAGTCTACACTCATGAGACAAGTCGGACTCCTCACCGTGTTAGCGCATCTG
GGCTGCCACGTACCAGCTTCAGAATGTCGTCTGAGTGTGTGTGACCGTATCTTCACCAGA
CTGGGGGCCTCGGATGATATTCTGTCCGGTCAGTCGACGTTTTTGGTTGAAATGAATGAG
ACAGCGGCCATAGTGAAGCACGCGACCAAACACTCGCTGGTACTACTGGATGAATTAGGT
CGCGGTACATCTACATACGATGGTACGTGCATCGCGTGGTCAGTATGCTGGTGGCTGGCT
GGCCGGTCGTGTCGCACGCTGTTCTCAACTCACTATCACTCGCTAGTCCATCACCTGGCT
GATCATCCCGCCGTACTTTTAGGACATATGGCGTGCATGGTAGAGACCGACGAATCTGCC
CCGGATGGTGACCATATACCGGAGGAAACGATAACCTTTTTGTACAAACTCTCCCCCGGT
GCCTGTCCGAAGTCATACGGCTTCAACGCGGCGCGGCTAGCGGGGATCCCCCGGGAAATA
ACGCAACGCGCACACACGATATCACGCAACCTGGAGAGCGAGGCGACGTGTGTACGCGCC
TTTAGAGATGTCATCAAAACGGACAACGCGGCTGAGTTGAGGAAAATATTGTCAGCCCTG
ACCATATAA

Protein sequence:

MGFITKYNEYNGVSKYGVCFVDTTTGQFYIGQFEDDKHSSRLLTTVAHYPPALIVFDRKT
TSARTSRLLSTHCHSARREPTTLWAPEKTLKILAEKYYKTDGDGKWPTGITPFLHEEQKC
HPDSRAIFYEEKTYSKRKVLDFILLLNGFTSILKLVDLFSDVDAELLKKLTQFAPEGRFP
DYRDTLKFFKEGFNQQEAEKEGRILPGSGVDADYDNTIQLIQNIQDELKEYLSEQERYFK
CRLTYVGSDKKRYQIEVPQSAAGKANSDYHLEGARKGFKRYSTVETKDLLARMIAAEEKK
SNVLKDLSRRMFEKFSSHQHQWEMATKCVATIDILLAFTEFARQQTGDICLPEITYNKDQ
EPYIDIVEGRHPCISIPEFIPNDTRLGVDNPRLLLLTGPNMGGKSTLMRQVGLLTVLAHL
GCHVPASECRLSVCDRIFTRLGASDDILSGQSTFLVEMNETAAIVKHATKHSLVLLDELG
RGTSTYDGTCIAWSVCWWLAGRSCRTLFSTHYHSLVHHLADHPAVLLGHMACMVETDESA
PDGDHIPEETITFLYKLSPGACPKSYGFNAARLAGIPREITQRAHTISRNLESEATCVRA
FRDVIKTDNAAELRKILSALTI
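
The figures in this record can be cross-checked against each other: a CDS of 1869 nt is 623 codons, which matches the 622 residues listed above plus the terminal TAA stop codon. The following is a minimal sketch of such a consistency check. The function name `check_cds` is hypothetical and not part of any database tooling, and the short sequence in the usage lines is a toy built from the first 12 and last 9 bases of the nucleotide sequence, not the full CDS.

```python
# Sketch of a CDS sanity check; names here are illustrative assumptions.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code stop codons


def check_cds(cds: str, expected_len: int) -> dict:
    """Return basic consistency checks for a coding sequence.

    Whitespace and line breaks are stripped so the sequence can be
    pasted directly from a 60-column record.
    """
    seq = "".join(cds.split()).upper()
    has_stop = seq[-3:] in STOP_CODONS
    codons = len(seq) // 3
    return {
        "length_matches": len(seq) == expected_len,
        "multiple_of_three": len(seq) % 3 == 0,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": has_stop,
        # Residue count excludes the stop codon when one is present.
        "protein_length": codons - 1 if has_stop else codons,
    }


# Toy input: first 12 and last 9 bases of the record's nucleotide sequence.
toy = "ATGGGTTTCATA" + "ACCATATAA"
result = check_cds(toy, expected_len=21)
print(result)
```

To check the real record, the full nucleotide sequence would be passed with `expected_len=1869`; under the assumptions above, the reported protein length should then be 622.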