New model in OGS2.0 | DPOGS207296  |
---|---|
Genomic Position | scaffold76:- 81936-85517 |
See gene structure | |
CDS Length | 2463 |
Paired RNAseq reads   | 177 |
Single RNAseq reads   | 534 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012027 (2e-117) |
Best Drosophila hit   | Pms2 (4e-98) |
Best Human hit | mismatch repair endonuclease PMS2 isoform a (4e-75) |
Best NR hit (blastp)   | PREDICTED: similar to DNA mismatch repair protein pms2 [Acyrthosiphon pisum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to DNA mismatch repair protein pms2 [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0003677 DNA binding GO:0003697 single-stranded DNA binding GO:0005515 protein binding GO:0005524 ATP binding GO:0006298 mismatch repair GO:0016446 somatic hypermutation of immunoglobulin genes GO:0016447 somatic recombination of immunoglobulin gene segments GO:0030983 mismatched DNA binding GO:0032138 single base insertion or deletion binding GO:0032389 MutLalpha complex GO:0032407 MutSalpha complex binding |
InterPro families    | IPR014790 MutL, C-terminal, dimerisation IPR013507 DNA mismatch repair protein, C-terminal IPR003594 ATPase-like, ATP-binding domain IPR014763 DNA mismatch repair protein, N-terminal IPR014721 Ribosomal protein S5 domain 2-type fold, subgroup IPR002099 DNA mismatch repair protein IPR014762 DNA mismatch repair, conserved site IPR020568 Ribosomal protein S5 domain 2-type fold |
Orthology group | MCL13055 |
Nucleotide sequence:
ATGGAAGAAACAAATATACAAAACAAGCAAATCAATACTATTAAACCAATTAACAAGGAT
GCTGTTCACAAAATATGTTCTGATCAAGTGGTTCTTAGTTTAGCAGTAGCAGTGAAGGAA
CTAGTTGAAAACTCTTTAGATGCTGGGGCTACTAATATTGAAGTCAGACTTAAAAACTAT
GGCACAGAATTAATAGAAGTTTCAGATAATGGATCTGGTGTAACTGAGGATAATTTTGAA
GCCTTGACCTTAAAATATCATACATCAAAATTAAACGATTACTCGGATTTGCTTGGAGTA
TCTAGCTTTGGCTTCAGGGGAGAGGCTTTAAGTTCACTTTGCTCTTTAGCCAACCTTACA
GTGACAACCAGACATGAAACAAGTAAACATGCTACCAAAATTGAATATGATCAGAAAGGC
CATATATCAAGCAAAACACCTTGCTCCCGTCAAGTGGGAACAACAGTGACTTTAACTAAC
CTTTTCTATACATTGCCAGTAAGACAAAAAGAGTTTCATAAAAATGCAAAACGGGAGTTC
AATAAAATGACCAGTCTTTTATATGCATATTGTTTAATTTCTAAAGGGGTAAAAATAACA
TGTAGTAATCAAACAAACTCAAATTCTAAGTCACTAGTTGTTGCAACCCAAGGCTCTAAT
TCCTATAAGGATAATATTGCAAGTGTGTTTGGAGTTAAGCAATTACAAAGTATTCTAGAT
GTTAAAACTGAGCTTGTTTCTAATATCAAAGATAATATATTCAGAGGACTATCGGGAGAA
GCAAAGGTAAATGAAGAAAGTATTAATATAGAAGACATTGAAATTGATTTATCTGAAGAT
TCCAATGATGCCCAAACTGATGAAAATAATTCAAGTCAAATACCATTACCTCAAAGATCA
CAGGGTTACAAAAATATACCAAATCCTGTTGAGCTCACAGGATATATCTCTTCATGTGCT
CATGGTAGTGGAAGATCAAGTACAGACAGACAATTCTTCTATATCAACTCCAGGCCTTGT
GAGCCGGTGAAAATTATTAAACTGATCAATGAAATATATCGACAATACAACCCACATCAG
TATCCATTTGTATTTTTAAATGTTAATATTGAAAGAACATCAGTTGATGTAAATGTGACT
CCTGATAAGAGGAAAGTATTTTTAACCAAAGAGAAAGCTATATTGGATGTTGTTAAATGC
TCTCTTTTGAAAATGTTTGAGGATATTCCTAGATCTGTTAAAGTCGAGGCTCCGTCCATT
GTCGCTGCGGTGAAAACTGAGCCTGAACTTTCTCAGCCCAGGATATTTCAGTCATTTCTC
AAACAATTTAGCAACAAATCAAGTTCTATCAAACCTAGTGAGTCTAACAATCCTGATAAA
TGTGAATTAAAAAGGAAATCATCTTCAGTTTTGGACAATTTTATTCAGATAAAAAAGACT
CTTGTTACAAAACAAGAAGATAATTTAATTGAAGAGGAAACTGAAGAAAAATGTATGTTA
GATGAAAATAAGGAACATAATATACTAAATGTATCTACTGAAGAAAATATTGAAAATGCT
AATGAGAGAAATATTAATAATTCTTTGGAAAATAGAGATACTACCATTGTTGAAGAATCT
CATACTATTTACTGCAATACTAAAAGTATAGAAACAACAAAACCAAAAGGAAATAAAGTG
ATAACAGATAAAGAACAGTTAGGCAAGACTGTGAGAATGGAAGTCACATTAAAGACGTCT
ATGGAACAAATAAAAAAACTTTCTGATACATACAAAAAAAATAAAGACAATTCAAAACCG
GATAGAATAAGATTCAAAACTAAAATAGATCCAGTATTTAATAAGAAATGTGAAGAAGAA
TTAAGTAGGGAAATAGAAAAACAATCTTTTAAAAAAATGAAAATTATTGGTCAGTTTAAT
CTAGGCTTTATAATAACTAGACTTGATGACGATCTTTTTATTATTGATCAACATGCTACG
GATGAGATATACAACTTTGAAACCTTACAGAAAACTACAGAACTTACGAGTCAAAAGTTA
GTTATCCCACAGCAACTTGAACTCACTGGGGTCAATGAACAAATATTAATGGACAATCTA
GATATTTTCAAAAAGAATGGCTTTACTTTTGCAATAGACGAAACTGCTGCTCCTACCAAA
AGAGTTAAACTTTTAACTCTTCCTATGTCCAAAAATTGGATATTTGGAAAAGAAGACATT
GAGGAACTCCTATTTATTCTGAAGGAAAATCACTCAGAATATTGTAGGCCCAGCAGAGTA
AGAGCAATGTTCGCGTCCCGAGCGTGCAGAAAGTCTGTTATGATCGGAACGGCGCTCAGT
AAGGGAGACATGAGAAAACTAGTTGACCACATGGCTGAAATAGACAAGCCTTGGAATTGC
CCTCACGGAAGACCAACAATACGGCATCTCATAAATCTAGCGATGGTACACACTGTTGAC
TAA
Protein sequence:
MEETNIQNKQINTIKPINKDAVHKICSDQVVLSLAVAVKELVENSLDAGATNIEVRLKNY
GTELIEVSDNGSGVTEDNFEALTLKYHTSKLNDYSDLLGVSSFGFRGEALSSLCSLANLT
VTTRHETSKHATKIEYDQKGHISSKTPCSRQVGTTVTLTNLFYTLPVRQKEFHKNAKREF
NKMTSLLYAYCLISKGVKITCSNQTNSNSKSLVVATQGSNSYKDNIASVFGVKQLQSILD
VKTELVSNIKDNIFRGLSGEAKVNEESINIEDIEIDLSEDSNDAQTDENNSSQIPLPQRS
QGYKNIPNPVELTGYISSCAHGSGRSSTDRQFFYINSRPCEPVKIIKLINEIYRQYNPHQ
YPFVFLNVNIERTSVDVNVTPDKRKVFLTKEKAILDVVKCSLLKMFEDIPRSVKVEAPSI
VAAVKTEPELSQPRIFQSFLKQFSNKSSSIKPSESNNPDKCELKRKSSSVLDNFIQIKKT
LVTKQEDNLIEEETEEKCMLDENKEHNILNVSTEENIENANERNINNSLENRDTTIVEES
HTIYCNTKSIETTKPKGNKVITDKEQLGKTVRMEVTLKTSMEQIKKLSDTYKKNKDNSKP
DRIRFKTKIDPVFNKKCEEELSREIEKQSFKKMKIIGQFNLGFIITRLDDDLFIIDQHAT
DEIYNFETLQKTTELTSQKLVIPQQLELTGVNEQILMDNLDIFKKNGFTFAIDETAAPTK
RVKLLTLPMSKNWIFGKEDIEELLFILKENHSEYCRPSRVRAMFASRACRKSVMIGTALS
KGDMRKLVDHMAEIDKPWNCPHGRPTIRHLINLAMVHTVD