New model in OGS2.0 | DPOGS203760  |
---|---|
Genomic Position | scaffold1432:- 44859-55927 |
CDS Length | 2634 |
Paired RNAseq reads   | 799 |
Single RNAseq reads   | 2168 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA011490 (0.0) |
Best Drosophila hit   | spellchecker1, isoform A (5e-145) |
Best Human hit | DNA mismatch repair protein Msh2 (4e-150) |
Best NR hit (blastp)   | PREDICTED: similar to mutS homolog 2 [Apis mellifera] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to mutS homolog 2 [Apis mellifera] (0.0) |
GeneOntology terms    | GO:0000400 four-way junction DNA binding; GO:0032301 MutSalpha complex; GO:0006302 double-strand break repair; GO:0007281 germ cell development; GO:0008584 male gonad development; GO:0032137 guanine/thymine mispair binding; GO:0042803 protein homodimerization activity; GO:0003697 single-stranded DNA binding; GO:0005524 ATP binding; GO:0007050 cell cycle arrest; GO:0008022 protein C-terminus binding; GO:0008340 determination of adult lifespan; GO:0030183 B cell differentiation; GO:0032142 single guanine insertion binding; GO:0051096 positive regulation of helicase activity; GO:0006119 oxidative phosphorylation; GO:0016887 ATPase activity; GO:0019724 B cell mediated immunity; GO:0043531 ADP binding; GO:0043570 maintenance of DNA repeat elements; GO:0000287 magnesium ion binding; GO:0005634 nucleus; GO:0006298 mismatch repair; GO:0010224 response to UV-B; GO:0016446 somatic hypermutation of immunoglobulin genes; GO:0019237 centromeric DNA binding; GO:0032405 MutLalpha complex binding; GO:0045910 negative regulation of DNA recombination; GO:0031573 intra-S DNA damage checkpoint; GO:0043524 negative regulation of neuron apoptosis; GO:0045190 isotype switching; GO:0001701 in utero embryonic development; GO:0006301 postreplication repair; GO:0032181 dinucleotide repeat insertion binding; GO:0032357 oxidized purine DNA binding; GO:0032302 MutSbeta complex; GO:0010165 response to X-ray; GO:0032143 single thymine insertion binding; GO:0003684 damaged DNA binding |
InterPro families    | IPR000432 DNA mismatch repair protein MutS, C-terminal domain; IPR007696 DNA mismatch repair protein MutS, core; IPR007860 DNA mismatch repair protein MutS, connector; IPR007861 DNA mismatch repair protein MutS, clamp |
Orthology group | MCL13580 |
Nucleotide sequence:
ATGGGCATCGAGCCTAACAAACTAGACTATTTGGTCCTATCGAAGGGAAACTTTGAGATA
CTCATCAGGAAATTACTATTGGTACGGAGATACAGAGTCGAGATATTTGTGTCGGAGGGA
TCAGTGAAGTCCTGTGATTGGTCGCTCAGGTACAAAGGTTCTCCTGGATACCTGTCCCAA
TTGGAGGAAATTGTCGGGGACGGTTTAGGATCCGCCAATGAGCAATCTACATGCTTGATG
GCCGTCAATGTCAAGAGTGACGCCATCAGTAAGGGCCGCCTAGTGGGCATAGCGTGCGTG
TATCAGAACGATTACACTTTATCAGTGTCGGAGTTCACTGATGATGTTGACTTCACCCAG
CTAGAGTCGATCGTCGTACAAGTGGCGCCCTCTGAGTGCGTTGCGGCGCCGGCTGATAAC
GATTATAAAGCCTTAAAGAAGGTTATGGACAGAGCGAGTGTGACGGTGACGAAGGTCAAG
AAGTCGGAGTTCACGACGGAAGGTCTCATCCAGGATCTGAACAGACTTCTCAAGTTCAAA
GAGGATCAGCAAAAAGATGCCAATGGATTCCAGGAAACCAAACTACCGGTGGCCATGAGC
GCTCTGGCAGCCGCCGTTAGATATACGTCGCTGTTAAACGATGACACGAACTTTGGAAGG
TTCCGCATATCGTCAGTGAAGGCCGACTACCTTCAGCTGGACTCCTCGGCCCTGTCGGCA
CTGAATGTGTTCCCTGAACTCGGTGATACGAACACTTCGCCAACCAGGAGCATCTACGGA
CTACTCGACAGATGTAGAACACAGCATGGAAAACGACTTCTGTGCCAGTTGCTTCGTCAG
CCTCTTAGAGACATCAACCTGATCAACGAGCGCCTGGACATTATCCAGCTGTTGGTTTCC
AGTTCACAGATGAGGTTGCAGTTGCATGAAGATCATCTTAGGCGGATGCCGGACCTGCAA
GCTTTGGCCCGGAGACTGGCTAGGAAGAAAGCTGGCTTACAGGACTGTTACAGAATATAC
CAGGCTATCAACCGCATTCCCGTCCTATTGAAGTGTCTGTCTGAGTTCAACGACCCCACG
ATACATTCGGTGCTCTGTGAACCGATAGCTGAACTTAACAACGACCTGGAAAAGTTCCAG
CAGATGATTGAAACTACCATCGACCTAGAAGCTGTTGACAGAGATAGGGCCTTAAACCTA
CACCTGGGTTGCAAGTCCCAGGCACTGTTGGAGCTCCTCTCCCTGCAACGATGGACCCGG
CGCCCGAACGGTGATTTTCTCGTGAAGCCATCTTTCGATGAAGAGTTACAGGTACTAGCG
AATGATCTGGAAAAATTACAAAACTCAGCTGAGAAAGAATTAAACAAAGCGGCCAGGGAT
CTTGACATGGAAGCGGGGAAAACTATTAAATTAGAAAATAATCCACAGCACGGTTTTGTA
TTTAGGTATTATATCCTCGGGGTCGAAGGGTTTTTAAAAAAAGATTTAAAATACACGATA
GTGGATGCCATTAAAGGTGGGGTCAGATTCAGGAACAGTTGCTTAGGAGACATCACAGAG
AACTACCTCCAGGCGAAGGCTGCGTACGAGAAGGAGCAAGATAAAGTAGTCGCCGAAATC
ATTAATATAGCTTCCACTTATTCGGAGTGTCTGTATTGCCTGTCCAATATAATATCTAAG
TTGGATGTATTGGTGTCACTGTCTGTGGTGGCGAGTACCTCTTCATCCAAGTACACTCGA
CCAGTTCTCACTACCAGTATCCAGGATCTGGTGCTGAAGGATGTACGGCATCCGTGCCTC
GAACTACAGGAAGGCGTCTCGTATATACCCAATGATGTTGTTCTCGAACGAGATTCGAGT
CTGATGCATATAGTGACGGGCGCCAATATGGGTGGTAAATCCACGTGGATGAGGTCGTGT
GGGGTGGCTGTGATCCTCGCTCACGTGGGGTCCTTCGTGCCAGCCGAATACGCCAAAATA
CCCATCCTAAGGTCTCTATGCGCTAGAATCGGTGCCAGCGATAGAGAGGAGAAAGGCCAG
AGTACTTTCATGCTAGAGATGCTAGAGACGGCTGGGATATTGAGGAACGCTACGGCCGAT
TCTCTGGTCCTGATCGACGAACTCGGTCGTGGAACATCTACGTACGAGGGTTGCGGCATC
GCTTGGGCTATCGCTGAAAAACTTTCAAAGGAGATCCAATGCTTCTGTCTGTTCGCGACC
CACTACCACGAGCTGACCCGGCTGGCGTCGTGTGGTTCTCGCGTCGTCAACTCGCAGGCG
CTGGCGGATGTCGTCGACGGCCGGCTCGTGTTGCTGCATCGCGTGGTACAGGGGCCAGCC
GCCAAGTCTCTGGGGCTGCACGTCGCTAAGATCGCTGACTTACCGGAAGATATACTGCAG
TTCGCAGAAGAGAAGCAGGCGGAGTTAGAAACGGATCTTTGCGAGGTCGAATCCGAAGTT
AGATCTGAAGATACATCCGAAGGGCAGGCGTTCATCAAAGAGTTTCTCATAAAATGCAAG
CAAATACAGGAAAAGAACGAGTCGGATGAAAAAATGATGGCTGAAATAAAGAAGCTGAAA
CAAGAAATGTTGCAGACGGATAACAAATATGTGGCCGCGTTGCTCAGCCGCTGA
Protein sequence:
MGIEPNKLDYLVLSKGNFEILIRKLLLVRRYRVEIFVSEGSVKSCDWSLRYKGSPGYLSQ
LEEIVGDGLGSANEQSTCLMAVNVKSDAISKGRLVGIACVYQNDYTLSVSEFTDDVDFTQ
LESIVVQVAPSECVAAPADNDYKALKKVMDRASVTVTKVKKSEFTTEGLIQDLNRLLKFK
EDQQKDANGFQETKLPVAMSALAAAVRYTSLLNDDTNFGRFRISSVKADYLQLDSSALSA
LNVFPELGDTNTSPTRSIYGLLDRCRTQHGKRLLCQLLRQPLRDINLINERLDIIQLLVS
SSQMRLQLHEDHLRRMPDLQALARRLARKKAGLQDCYRIYQAINRIPVLLKCLSEFNDPT
IHSVLCEPIAELNNDLEKFQQMIETTIDLEAVDRDRALNLHLGCKSQALLELLSLQRWTR
RPNGDFLVKPSFDEELQVLANDLEKLQNSAEKELNKAARDLDMEAGKTIKLENNPQHGFV
FRYYILGVEGFLKKDLKYTIVDAIKGGVRFRNSCLGDITENYLQAKAAYEKEQDKVVAEI
INIASTYSECLYCLSNIISKLDVLVSLSVVASTSSSKYTRPVLTTSIQDLVLKDVRHPCL
ELQEGVSYIPNDVVLERDSSLMHIVTGANMGGKSTWMRSCGVAVILAHVGSFVPAEYAKI
PILRSLCARIGASDREEKGQSTFMLEMLETAGILRNATADSLVLIDELGRGTSTYEGCGI
AWAIAEKLSKEIQCFCLFATHYHELTRLASCGSRVVNSQALADVVDGRLVLLHRVVQGPA
AKSLGLHVAKIADLPEDILQFAEEKQAELETDLCEVESEVRSEDTSEGQAFIKEFLIKCK
QIQEKNESDEKMMAEIKKLKQEMLQTDNKYVAALLSR
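
Consistency check (illustrative sketch):
The figures in this record can be cross-checked with a little arithmetic. The snippet below is a minimal sketch, not part of the original record. It assumes the genomic position uses inclusive coordinates in the layout `scaffold:strand start-end`, and that the CDS length counts the terminal stop codon. The helper name `parse_position` is hypothetical, and the protein length of 877 residues was counted from the protein sequence listed above.

```python
import re

# Values copied from the record above.
genomic_position = "scaffold1432:- 44859-55927"
cds_length = 2634
protein_length = 877  # residues counted in the listed protein sequence


def parse_position(text):
    """Split 'scaffold:strand start-end' into its parts (assumed layout)."""
    match = re.fullmatch(r"(\S+):([+-])\s*(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)


scaffold, strand, start, end = parse_position(genomic_position)

# Length of the locus on the scaffold, assuming inclusive coordinates.
locus_span = end - start + 1

# Number of codons in the CDS; the last one is the stop codon (TGA).
codons = cds_length // 3

# Part of the locus not covered by coding sequence (introns, and UTRs if
# the annotated span includes them).
non_coding = locus_span - cds_length

print(scaffold, strand, locus_span, codons, non_coding)
```

Under these assumptions the CDS length is a multiple of three and yields one more codon than the protein has residues, which is what a complete coding sequence ending in a stop codon should give.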