DPGLEAN19641 in OGS1.0

New model in OGS2.0DPOGS207296 
Genomic Positionscaffold76:- 81936-85517
See gene structure
CDS Length2463
Paired RNAseq reads  177
Single RNAseq reads  534
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012027 (2e-117)
Best Drosophila hit  Pms2 (4e-98)
Best Human hitmismatch repair endonuclease PMS2 isoform a (4e-75)
Best NR hit (blastp)  PREDICTED: similar to DNA mismatch repair protein pms2 [Acyrthosiphon pisum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to DNA mismatch repair protein pms2 [Tribolium castaneum] (0.0)
GeneOntology terms









  
GO:0003677 DNA binding
GO:0003697 single-stranded DNA binding
GO:0005515 protein binding
GO:0005524 ATP binding
GO:0006298 mismatch repair
GO:0016446 somatic hypermutation of immunoglobulin genes
GO:0016447 somatic recombination of immunoglobulin gene segments
GO:0030983 mismatched DNA binding
GO:0032138 single base insertion or deletion binding
GO:0032389 MutLalpha complex
GO:0032407 MutSalpha complex binding
InterPro families






  
IPR014790 MutL, C-terminal, dimerisation
IPR013507 DNA mismatch repair protein, C-terminal
IPR003594 ATPase-like, ATP-binding domain
IPR014763 DNA mismatch repair protein, N-terminal
IPR014721 Ribosomal protein S5 domain 2-type fold, subgroup
IPR002099 DNA mismatch repair protein
IPR014762 DNA mismatch repair, conserved site
IPR020568 Ribosomal protein S5 domain 2-type fold
Orthology groupMCL13055

Nucleotide sequence:

ATGGAAGAAACAAATATACAAAACAAGCAAATCAATACTATTAAACCAATTAACAAGGAT
GCTGTTCACAAAATATGTTCTGATCAAGTGGTTCTTAGTTTAGCAGTAGCAGTGAAGGAA
CTAGTTGAAAACTCTTTAGATGCTGGGGCTACTAATATTGAAGTCAGACTTAAAAACTAT
GGCACAGAATTAATAGAAGTTTCAGATAATGGATCTGGTGTAACTGAGGATAATTTTGAA
GCCTTGACCTTAAAATATCATACATCAAAATTAAACGATTACTCGGATTTGCTTGGAGTA
TCTAGCTTTGGCTTCAGGGGAGAGGCTTTAAGTTCACTTTGCTCTTTAGCCAACCTTACA
GTGACAACCAGACATGAAACAAGTAAACATGCTACCAAAATTGAATATGATCAGAAAGGC
CATATATCAAGCAAAACACCTTGCTCCCGTCAAGTGGGAACAACAGTGACTTTAACTAAC
CTTTTCTATACATTGCCAGTAAGACAAAAAGAGTTTCATAAAAATGCAAAACGGGAGTTC
AATAAAATGACCAGTCTTTTATATGCATATTGTTTAATTTCTAAAGGGGTAAAAATAACA
TGTAGTAATCAAACAAACTCAAATTCTAAGTCACTAGTTGTTGCAACCCAAGGCTCTAAT
TCCTATAAGGATAATATTGCAAGTGTGTTTGGAGTTAAGCAATTACAAAGTATTCTAGAT
GTTAAAACTGAGCTTGTTTCTAATATCAAAGATAATATATTCAGAGGACTATCGGGAGAA
GCAAAGGTAAATGAAGAAAGTATTAATATAGAAGACATTGAAATTGATTTATCTGAAGAT
TCCAATGATGCCCAAACTGATGAAAATAATTCAAGTCAAATACCATTACCTCAAAGATCA
CAGGGTTACAAAAATATACCAAATCCTGTTGAGCTCACAGGATATATCTCTTCATGTGCT
CATGGTAGTGGAAGATCAAGTACAGACAGACAATTCTTCTATATCAACTCCAGGCCTTGT
GAGCCGGTGAAAATTATTAAACTGATCAATGAAATATATCGACAATACAACCCACATCAG
TATCCATTTGTATTTTTAAATGTTAATATTGAAAGAACATCAGTTGATGTAAATGTGACT
CCTGATAAGAGGAAAGTATTTTTAACCAAAGAGAAAGCTATATTGGATGTTGTTAAATGC
TCTCTTTTGAAAATGTTTGAGGATATTCCTAGATCTGTTAAAGTCGAGGCTCCGTCCATT
GTCGCTGCGGTGAAAACTGAGCCTGAACTTTCTCAGCCCAGGATATTTCAGTCATTTCTC
AAACAATTTAGCAACAAATCAAGTTCTATCAAACCTAGTGAGTCTAACAATCCTGATAAA
TGTGAATTAAAAAGGAAATCATCTTCAGTTTTGGACAATTTTATTCAGATAAAAAAGACT
CTTGTTACAAAACAAGAAGATAATTTAATTGAAGAGGAAACTGAAGAAAAATGTATGTTA
GATGAAAATAAGGAACATAATATACTAAATGTATCTACTGAAGAAAATATTGAAAATGCT
AATGAGAGAAATATTAATAATTCTTTGGAAAATAGAGATACTACCATTGTTGAAGAATCT
CATACTATTTACTGCAATACTAAAAGTATAGAAACAACAAAACCAAAAGGAAATAAAGTG
ATAACAGATAAAGAACAGTTAGGCAAGACTGTGAGAATGGAAGTCACATTAAAGACGTCT
ATGGAACAAATAAAAAAACTTTCTGATACATACAAAAAAAATAAAGACAATTCAAAACCG
GATAGAATAAGATTCAAAACTAAAATAGATCCAGTATTTAATAAGAAATGTGAAGAAGAA
TTAAGTAGGGAAATAGAAAAACAATCTTTTAAAAAAATGAAAATTATTGGTCAGTTTAAT
CTAGGCTTTATAATAACTAGACTTGATGACGATCTTTTTATTATTGATCAACATGCTACG
GATGAGATATACAACTTTGAAACCTTACAGAAAACTACAGAACTTACGAGTCAAAAGTTA
GTTATCCCACAGCAACTTGAACTCACTGGGGTCAATGAACAAATATTAATGGACAATCTA
GATATTTTCAAAAAGAATGGCTTTACTTTTGCAATAGACGAAACTGCTGCTCCTACCAAA
AGAGTTAAACTTTTAACTCTTCCTATGTCCAAAAATTGGATATTTGGAAAAGAAGACATT
GAGGAACTCCTATTTATTCTGAAGGAAAATCACTCAGAATATTGTAGGCCCAGCAGAGTA
AGAGCAATGTTCGCGTCCCGAGCGTGCAGAAAGTCTGTTATGATCGGAACGGCGCTCAGT
AAGGGAGACATGAGAAAACTAGTTGACCACATGGCTGAAATAGACAAGCCTTGGAATTGC
CCTCACGGAAGACCAACAATACGGCATCTCATAAATCTAGCGATGGTACACACTGTTGAC
TAA

Protein sequence:

MEETNIQNKQINTIKPINKDAVHKICSDQVVLSLAVAVKELVENSLDAGATNIEVRLKNY
GTELIEVSDNGSGVTEDNFEALTLKYHTSKLNDYSDLLGVSSFGFRGEALSSLCSLANLT
VTTRHETSKHATKIEYDQKGHISSKTPCSRQVGTTVTLTNLFYTLPVRQKEFHKNAKREF
NKMTSLLYAYCLISKGVKITCSNQTNSNSKSLVVATQGSNSYKDNIASVFGVKQLQSILD
VKTELVSNIKDNIFRGLSGEAKVNEESINIEDIEIDLSEDSNDAQTDENNSSQIPLPQRS
QGYKNIPNPVELTGYISSCAHGSGRSSTDRQFFYINSRPCEPVKIIKLINEIYRQYNPHQ
YPFVFLNVNIERTSVDVNVTPDKRKVFLTKEKAILDVVKCSLLKMFEDIPRSVKVEAPSI
VAAVKTEPELSQPRIFQSFLKQFSNKSSSIKPSESNNPDKCELKRKSSSVLDNFIQIKKT
LVTKQEDNLIEEETEEKCMLDENKEHNILNVSTEENIENANERNINNSLENRDTTIVEES
HTIYCNTKSIETTKPKGNKVITDKEQLGKTVRMEVTLKTSMEQIKKLSDTYKKNKDNSKP
DRIRFKTKIDPVFNKKCEEELSREIEKQSFKKMKIIGQFNLGFIITRLDDDLFIIDQHAT
DEIYNFETLQKTTELTSQKLVIPQQLELTGVNEQILMDNLDIFKKNGFTFAIDETAAPTK
RVKLLTLPMSKNWIFGKEDIEELLFILKENHSEYCRPSRVRAMFASRACRKSVMIGTALS
KGDMRKLVDHMAEIDKPWNCPHGRPTIRHLINLAMVHTVD