New model in OGS2.0 | DPOGS211800  |
---|---|
Genomic Position | scaffold306:+ 15619-27652 |
See gene structure | |
CDS Length | 1665 |
Paired RNAseq reads   | 544 |
Single RNAseq reads   | 1601 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006123 (6e-153) |
Best Drosophila hit   | ND |
Best Human hit | endonuclease/exonuclease/phosphatase family domain-containing protein 1 (1e-50) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC014464 [Tribolium castaneum] (7e-110) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC014464 [Tribolium castaneum] (2e-103) |
GeneOntology terms    | GO:0006281 DNA repair GO:0003677 DNA binding |
InterPro families   | IPR010994 RuvA domain 2-like |
Orthology group | MCL20481 |
Nucleotide sequence:
ATGGGGCAAAGCCCGAGCTCTATTCGGAGTAGGAGCAGCCGAAGGTCCTTCAGGTCCTTT
GTACGGCGCTCCAAGTTGAATAAGTCTAATTTGAGCCACACATTCAGTTTGCCACCTTCC
GAGGAGTATCCTGAGTTGATGAACGTGAACTCCGCCACCGAGGAACAGCTGATGACATTG
CCGGGGGTCTCTCGCCAGCTCGCGCGGGAAATTGTCCGACACAGACAAATGATTGGCAGA
TTCAAGCGCGTCGACGACCTTGCCTTAGTGTCAGGTATTGGAGCTGAAAAGCTTGAGTTG
TTAAGACCAGAAATATGTACAAATTCCAAAAGAGAGATATCAAGGGCAAGTTCCTGTACT
CATTCCTTAGACAGTGTAAGAATTACAAATGAAAATAAACTATGTTCTGTGAATTCATCC
AGTGTATTCCAACTGCAATGTGTGCCGGGACTGAATCAAGAATTAGCTGCTAATATTGTA
GATTATAGAAATAAGAAAGGACCATTCAAATCATTGGATGACTTAATAAAAGTCAGAGGC
ATAGATATTGTCAGGCTGAGTACTGTTAAACCACATTTAAATTTGGAATTACGCAAGAGT
GAGAGTGTGCAACATTTAACTAACGGACATGTAAATGGTTGGAAAGAGACATCTCTCGAT
GACTCATATCTAAACAGGGAAACCAAATCACTAAGGAGTCCTCATAGAAAAAGTATGTCT
ATGCCTACAAAGTTCCCAATCACATTGCCAAATGGTTTTGCTACAGCGCCGGTGAATGAT
ATATTAGATTTGCTATCAGCCTACTCTCACCGTCCCATTGTGGAGGAAGTCTTCAGATAT
GAGAGGGATGGAGTGAGATGCTGTCGTCTCGCATCTTGGAACCTCCATCAGCTCAGTGTC
GATAAAGTCACAAATCCAGGTGTCAGAGAAGTGATATGCCGGACCATTTTAGAATATAGA
TTGTCAATTGTAGCTATACAGGATGTGTGTGAGGAGTCATCTCTACGTATGATATGTGAA
GAATTGAACTCACCGGCTCTAAGGAGAGTGACTGAGTGGAGGTGGAATAATAGGTCTTGG
AACTACTGCTTACCGAGTGATGGAAAAGGAAGCAGCCTCGGCTTCTTATACGAGAGATCC
AACAAACACGTGTCCGTGGAGGAAGTGACGCGCGCGAAACGAGACGTCATCTCGGAACAC
GCTGCGAGGATCCTGGAGACAGTTGACAGTATTAAGAGTGACAGATTGCTACTATTCCCA
CAGGTGTTCCTACTAAATGACAGGCCATTAATAATGTTGAACGTCCAATGTAAGGACCGT
CTGAGTGAAGAGGAGAGCAACAAACTGAGAGAGATAGCTGACATGGCGCTCACTTCAAAA
TTACAATTAGCTTTCTTCGGGGATTTTCTGAGTTGGAAAAATGTACAATGTTTACGTAAC
TGTGAATCAGTTTTGGACACGGCGATAGTGTCGTCCTTGGATCCGAGCGTCGCGGGTCAG
TGTGCTATATTGTGTGTAGGGGAGGTCGAGGGCAGCTCCTTCAACGGACACGCGGGCGTC
GTCAAGACAGGCCTCTGCCACCTGGCCATACCTCGCGGCTGGTCGTGGGGCGGACCCGCG
TCTCCATTCTGTCCGATATGGGCCGAACTGAATGTACCCGATTGA
Protein sequence:
MGQSPSSIRSRSSRRSFRSFVRRSKLNKSNLSHTFSLPPSEEYPELMNVNSATEEQLMTL
PGVSRQLAREIVRHRQMIGRFKRVDDLALVSGIGAEKLELLRPEICTNSKREISRASSCT
HSLDSVRITNENKLCSVNSSSVFQLQCVPGLNQELAANIVDYRNKKGPFKSLDDLIKVRG
IDIVRLSTVKPHLNLELRKSESVQHLTNGHVNGWKETSLDDSYLNRETKSLRSPHRKSMS
MPTKFPITLPNGFATAPVNDILDLLSAYSHRPIVEEVFRYERDGVRCCRLASWNLHQLSV
DKVTNPGVREVICRTILEYRLSIVAIQDVCEESSLRMICEELNSPALRRVTEWRWNNRSW
NYCLPSDGKGSSLGFLYERSNKHVSVEEVTRAKRDVISEHAARILETVDSIKSDRLLLFP
QVFLLNDRPLIMLNVQCKDRLSEEESNKLREIADMALTSKLQLAFFGDFLSWKNVQCLRN
CESVLDTAIVSSLDPSVAGQCAILCVGEVEGSSFNGHAGVVKTGLCHLAIPRGWSWGGPA
SPFCPIWAELNVPD