DPGLEAN03942 in OGS1.0

New model in OGS2.0  DPOGS203760
Genomic Position  scaffold1432:- 44859-55927
CDS Length  2634
Paired RNAseq reads  799
Single RNAseq reads  2168
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA011490 (0.0)
Best Drosophila hit  spellchecker1, isoform A (5e-145)
Best Human hit  DNA mismatch repair protein Msh2 (4e-150)
Best NR hit (blastp)  PREDICTED: similar to mutS homolog 2 [Apis mellifera] (0.0)
Best NR hit (blastx)  PREDICTED: similar to mutS homolog 2 [Apis mellifera] (0.0)
GeneOntology terms
GO:0000400 four-way junction DNA binding
GO:0032301 MutSalpha complex
GO:0006302 double-strand break repair
GO:0007281 germ cell development
GO:0008584 male gonad development
GO:0032137 guanine/thymine mispair binding
GO:0042803 protein homodimerization activity
GO:0003697 single-stranded DNA binding
GO:0005524 ATP binding
GO:0007050 cell cycle arrest
GO:0008022 protein C-terminus binding
GO:0008340 determination of adult lifespan
GO:0030183 B cell differentiation
GO:0032142 single guanine insertion binding
GO:0051096 positive regulation of helicase activity
GO:0006119 oxidative phosphorylation
GO:0016887 ATPase activity
GO:0019724 B cell mediated immunity
GO:0043531 ADP binding
GO:0043570 maintenance of DNA repeat elements
GO:0000287 magnesium ion binding
GO:0005634 nucleus
GO:0006298 mismatch repair
GO:0010224 response to UV-B
GO:0016446 somatic hypermutation of immunoglobulin genes
GO:0019237 centromeric DNA binding
GO:0032405 MutLalpha complex binding
GO:0045910 negative regulation of DNA recombination
GO:0031573 intra-S DNA damage checkpoint
GO:0043524 negative regulation of neuron apoptosis
GO:0045190 isotype switching
GO:0001701 in utero embryonic development
GO:0006301 postreplication repair
GO:0032181 dinucleotide repeat insertion binding
GO:0032357 oxidized purine DNA binding
GO:0032302 MutSbeta complex
GO:0010165 response to X-ray
GO:0032143 single thymine insertion binding
GO:0003684 damaged DNA binding
InterPro families
IPR000432 DNA mismatch repair protein MutS, C-terminal domain
IPR007696 DNA mismatch repair protein MutS, core
IPR007860 DNA mismatch repair protein MutS, connector
IPR007861 DNA mismatch repair protein MutS, clamp
Orthology group  MCL13580

Nucleotide sequence:

ATGGGCATCGAGCCTAACAAACTAGACTATTTGGTCCTATCGAAGGGAAACTTTGAGATA
CTCATCAGGAAATTACTATTGGTACGGAGATACAGAGTCGAGATATTTGTGTCGGAGGGA
TCAGTGAAGTCCTGTGATTGGTCGCTCAGGTACAAAGGTTCTCCTGGATACCTGTCCCAA
TTGGAGGAAATTGTCGGGGACGGTTTAGGATCCGCCAATGAGCAATCTACATGCTTGATG
GCCGTCAATGTCAAGAGTGACGCCATCAGTAAGGGCCGCCTAGTGGGCATAGCGTGCGTG
TATCAGAACGATTACACTTTATCAGTGTCGGAGTTCACTGATGATGTTGACTTCACCCAG
CTAGAGTCGATCGTCGTACAAGTGGCGCCCTCTGAGTGCGTTGCGGCGCCGGCTGATAAC
GATTATAAAGCCTTAAAGAAGGTTATGGACAGAGCGAGTGTGACGGTGACGAAGGTCAAG
AAGTCGGAGTTCACGACGGAAGGTCTCATCCAGGATCTGAACAGACTTCTCAAGTTCAAA
GAGGATCAGCAAAAAGATGCCAATGGATTCCAGGAAACCAAACTACCGGTGGCCATGAGC
GCTCTGGCAGCCGCCGTTAGATATACGTCGCTGTTAAACGATGACACGAACTTTGGAAGG
TTCCGCATATCGTCAGTGAAGGCCGACTACCTTCAGCTGGACTCCTCGGCCCTGTCGGCA
CTGAATGTGTTCCCTGAACTCGGTGATACGAACACTTCGCCAACCAGGAGCATCTACGGA
CTACTCGACAGATGTAGAACACAGCATGGAAAACGACTTCTGTGCCAGTTGCTTCGTCAG
CCTCTTAGAGACATCAACCTGATCAACGAGCGCCTGGACATTATCCAGCTGTTGGTTTCC
AGTTCACAGATGAGGTTGCAGTTGCATGAAGATCATCTTAGGCGGATGCCGGACCTGCAA
GCTTTGGCCCGGAGACTGGCTAGGAAGAAAGCTGGCTTACAGGACTGTTACAGAATATAC
CAGGCTATCAACCGCATTCCCGTCCTATTGAAGTGTCTGTCTGAGTTCAACGACCCCACG
ATACATTCGGTGCTCTGTGAACCGATAGCTGAACTTAACAACGACCTGGAAAAGTTCCAG
CAGATGATTGAAACTACCATCGACCTAGAAGCTGTTGACAGAGATAGGGCCTTAAACCTA
CACCTGGGTTGCAAGTCCCAGGCACTGTTGGAGCTCCTCTCCCTGCAACGATGGACCCGG
CGCCCGAACGGTGATTTTCTCGTGAAGCCATCTTTCGATGAAGAGTTACAGGTACTAGCG
AATGATCTGGAAAAATTACAAAACTCAGCTGAGAAAGAATTAAACAAAGCGGCCAGGGAT
CTTGACATGGAAGCGGGGAAAACTATTAAATTAGAAAATAATCCACAGCACGGTTTTGTA
TTTAGGTATTATATCCTCGGGGTCGAAGGGTTTTTAAAAAAAGATTTAAAATACACGATA
GTGGATGCCATTAAAGGTGGGGTCAGATTCAGGAACAGTTGCTTAGGAGACATCACAGAG
AACTACCTCCAGGCGAAGGCTGCGTACGAGAAGGAGCAAGATAAAGTAGTCGCCGAAATC
ATTAATATAGCTTCCACTTATTCGGAGTGTCTGTATTGCCTGTCCAATATAATATCTAAG
TTGGATGTATTGGTGTCACTGTCTGTGGTGGCGAGTACCTCTTCATCCAAGTACACTCGA
CCAGTTCTCACTACCAGTATCCAGGATCTGGTGCTGAAGGATGTACGGCATCCGTGCCTC
GAACTACAGGAAGGCGTCTCGTATATACCCAATGATGTTGTTCTCGAACGAGATTCGAGT
CTGATGCATATAGTGACGGGCGCCAATATGGGTGGTAAATCCACGTGGATGAGGTCGTGT
GGGGTGGCTGTGATCCTCGCTCACGTGGGGTCCTTCGTGCCAGCCGAATACGCCAAAATA
CCCATCCTAAGGTCTCTATGCGCTAGAATCGGTGCCAGCGATAGAGAGGAGAAAGGCCAG
AGTACTTTCATGCTAGAGATGCTAGAGACGGCTGGGATATTGAGGAACGCTACGGCCGAT
TCTCTGGTCCTGATCGACGAACTCGGTCGTGGAACATCTACGTACGAGGGTTGCGGCATC
GCTTGGGCTATCGCTGAAAAACTTTCAAAGGAGATCCAATGCTTCTGTCTGTTCGCGACC
CACTACCACGAGCTGACCCGGCTGGCGTCGTGTGGTTCTCGCGTCGTCAACTCGCAGGCG
CTGGCGGATGTCGTCGACGGCCGGCTCGTGTTGCTGCATCGCGTGGTACAGGGGCCAGCC
GCCAAGTCTCTGGGGCTGCACGTCGCTAAGATCGCTGACTTACCGGAAGATATACTGCAG
TTCGCAGAAGAGAAGCAGGCGGAGTTAGAAACGGATCTTTGCGAGGTCGAATCCGAAGTT
AGATCTGAAGATACATCCGAAGGGCAGGCGTTCATCAAAGAGTTTCTCATAAAATGCAAG
CAAATACAGGAAAAGAACGAGTCGGATGAAAAAATGATGGCTGAAATAAAGAAGCTGAAA
CAAGAAATGTTGCAGACGGATAACAAATATGTGGCCGCGTTGCTCAGCCGCTGA

Protein sequence:

MGIEPNKLDYLVLSKGNFEILIRKLLLVRRYRVEIFVSEGSVKSCDWSLRYKGSPGYLSQ
LEEIVGDGLGSANEQSTCLMAVNVKSDAISKGRLVGIACVYQNDYTLSVSEFTDDVDFTQ
LESIVVQVAPSECVAAPADNDYKALKKVMDRASVTVTKVKKSEFTTEGLIQDLNRLLKFK
EDQQKDANGFQETKLPVAMSALAAAVRYTSLLNDDTNFGRFRISSVKADYLQLDSSALSA
LNVFPELGDTNTSPTRSIYGLLDRCRTQHGKRLLCQLLRQPLRDINLINERLDIIQLLVS
SSQMRLQLHEDHLRRMPDLQALARRLARKKAGLQDCYRIYQAINRIPVLLKCLSEFNDPT
IHSVLCEPIAELNNDLEKFQQMIETTIDLEAVDRDRALNLHLGCKSQALLELLSLQRWTR
RPNGDFLVKPSFDEELQVLANDLEKLQNSAEKELNKAARDLDMEAGKTIKLENNPQHGFV
FRYYILGVEGFLKKDLKYTIVDAIKGGVRFRNSCLGDITENYLQAKAAYEKEQDKVVAEI
INIASTYSECLYCLSNIISKLDVLVSLSVVASTSSSKYTRPVLTTSIQDLVLKDVRHPCL
ELQEGVSYIPNDVVLERDSSLMHIVTGANMGGKSTWMRSCGVAVILAHVGSFVPAEYAKI
PILRSLCARIGASDREEKGQSTFMLEMLETAGILRNATADSLVLIDELGRGTSTYEGCGI
AWAIAEKLSKEIQCFCLFATHYHELTRLASCGSRVVNSQALADVVDGRLVLLHRVVQGPA
AKSLGLHVAKIADLPEDILQFAEEKQAELETDLCEVESEVRSEDTSEGQAFIKEFLIKCK
QIQEKNESDEKMMAEIKKLKQEMLQTDNKYVAALLSR