DPGLEAN00442 in OGS1.0

New model in OGS2.0  DPOGS205300
Genomic Position  scaffold766:+ 12037-14130
CDS Length  1839
Paired RNAseq reads  86
Single RNAseq reads  210
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA011042 (6e-31)
Best Drosophila hit  mus81 (2e-37)
Best Human hit  crossover junction endonuclease MUS81 (3e-43)
Best NR hit (blastp)  Mus81 protein [Danio rerio] (4e-60)
Best NR hit (blastx)  Mus81 protein [Danio rerio] (3e-63)
Gene Ontology terms
GO:0003677 DNA binding
GO:0005515 protein binding
GO:0003824 catalytic activity
GO:0006259 DNA metabolic process
GO:0004518 nuclease activity
GO:0006281 DNA repair
GO:0016787 hydrolase activity
GO:0004519 endonuclease activity
GO:0006310 DNA recombination
GO:0046872 metal ion binding
GO:0005634 nucleus
GO:0006974 response to DNA damage stimulus
InterPro families
IPR011335 Restriction endonuclease, type II-like
IPR010996 DNA-directed DNA polymerase, family X, beta-like, N-terminal
IPR020819 DNA repair nuclease, XPF-type/Helicase
IPR006166 ERCC4 domain
Orthology group  MCL13480

Nucleotide sequence:

ATGGCTGTTAATAGTAAAAGAATAACTTTAAAACGAACCAGACCAAATCCATTATTTCAA
CGATGGCTTCAGGAGCTACAGGATGAGGCAAAGCTTGAATTGAACAATTTAGAATATTCA
TTGGATGAAGCTATCAGTTCATTATCCAAGTACCCTTTACCGTTAGAAAGTGGCGCGGAG
TGCGCTATACTAAAAGGTTTCGATAATAAACTATGCTCTTTCCTCGATAAGCGTCTACAA
GCTTACACAAGCTCAAATAGATTGGATAATAATTCTTCCAAAGTTACTAGCACAACCCTA
CCTGAACTAGAGCCAATAGATATCAGTTCTAAAAAAGCCAAAAACAATTCCATTACCAAA
GATGACTTTGAGCAACAATCTGCAAGAAACATTTTAAAGTGCACCAAGCTTACTACTGAT
GTGCCTTGTGGAACAGATGACTTATCTGACAATGAAGTACAAGAAGTTCAAGAGGTACCA
TCCCAAAGTAATGTGCTAAAAGGATCTCGAAAAGATTACTGTTCTAATTTAATTCAAGCT
AACTACAGCTCTCTGAGTCCTGAATTAGAAAAATCACTTAGAGGAAGGGAAAGGAAATTA
AAATACAAACCTATCTACAAGTCTGGTAGTTATGCTATAGTGATGGGTCTCTGGGAACAT
TCAATAGTGAATTCAAAGCAGGGTATTAGTAAGATGGATTTGCTAGAACTAGCCCATAAA
TACATTGAGAGTTCTATGAAAAACGCTTCAGATGCTTTACATAATTTGTTATGGGCAAAT
ATGAATAACCTTGTATCAAAGGGCCTTGTAACGAGAAAGAATGGAGAAACTCCAGTATTT
AAATTGACAAAATTGGGTATTAAAACTGCAAAGGTACTGTATAAAGAATACAAAATTAGA
GAAAAACCTAAAGTTACAAATTCGAAACAACCCGAAGAAGAAGATTGTGACGGATCTAAA
AATAATGTTCCAATTAATTCCAGAAACCGCATTGACACTTGTAATACAGAGGTTGAAGAA
GTAGTGGAATTTGAAGCTGGTTCATATGATATTATTTTATTTGTAGATGTTAAAGAAACT
TCTGGCTTAGCTAAGAAGAATGATCCTCTGATGCTCCAAATGAAGAAATATCCTAATCTT
CAACACGAGTTTAGATCTTTGAGTGTCGGTGATTTTGCATGGATAGCAAGGCACAGGTTA
AGTAAAGAAGAATTAGTGCTGCCTTATATAGTTGAGAGGAAAAGAATGGACGATTTCGCT
AATAGTATAAAAGATGGTAGATATCATGAACAGAAATTCAGATTGAAGAAAAGTAAAGCA
AAAGTTGTTTACTTGGTTGAAAATTATGATAGTAAATATGTTGGTTTGCCCTATCAGACG
TTAATGCAAGGATTGGTCAATACGAGAATTAGGGATGAGATTCAGGTACATCGAACAGAT
TCATTGGCGGCTACTGTTAGATTCTTAGCCATTCTGACAATGAAAATAATTAACGAATAT
CAGAATTGTTCTGTTAAGGGTCACCACAAAATGGCGGAAGGCGACATGTTGATGACATTC
AATTATTTTAAAAAAGCTCTCGTAAAAAATAAACCTTTGTCTTTGAAATGTACCTTTATA
AAAATGCTTTTACAACTACGAGGATTAACAGCAGATAAAGCTGTGGCTATAACTAATGAA
TACGGTACGCCAAAATTATTAATGGATGCATATGAAAATTGTGATAAAAAAAAAGGTGAA
CTGTTGTTAGCTAATATTAAAGGCAAGAGTAAACGTAATGTAGGACCTCGTGTAAGTAAA
AAACTGTACAAATTGTTTACATTGAGAGAATTAACGTAA

Protein sequence:

MAVNSKRITLKRTRPNPLFQRWLQELQDEAKLELNNLEYSLDEAISSLSKYPLPLESGAE
CAILKGFDNKLCSFLDKRLQAYTSSNRLDNNSSKVTSTTLPELEPIDISSKKAKNNSITK
DDFEQQSARNILKCTKLTTDVPCGTDDLSDNEVQEVQEVPSQSNVLKGSRKDYCSNLIQA
NYSSLSPELEKSLRGRERKLKYKPIYKSGSYAIVMGLWEHSIVNSKQGISKMDLLELAHK
YIESSMKNASDALHNLLWANMNNLVSKGLVTRKNGETPVFKLTKLGIKTAKVLYKEYKIR
EKPKVTNSKQPEEEDCDGSKNNVPINSRNRIDTCNTEVEEVVEFEAGSYDIILFVDVKET
SGLAKKNDPLMLQMKKYPNLQHEFRSLSVGDFAWIARHRLSKEELVLPYIVERKRMDDFA
NSIKDGRYHEQKFRLKKSKAKVVYLVENYDSKYVGLPYQTLMQGLVNTRIRDEIQVHRTD
SLAATVRFLAILTMKIINEYQNCSVKGHHKMAEGDMLMTFNYFKKALVKNKPLSLKCTFI
KMLLQLRGLTADKAVAITNEYGTPKLLMDAYENCDKKKGELLLANIKGKSKRNVGPRVSK
KLYKLFTLRELT
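
Consistency check (illustrative):

The nucleotide and protein sequences above can be cross-checked against each other and against the stated CDS length. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are hypothetical helpers written for this example and are not part of the database. Only the first and last stretches of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed), bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 21 nt and the final 39 nt of the nucleotide sequence listed above.
cds_start = "ATGGCTGTTAATAGTAAAAGA"
cds_end = "AAACTGTACAAATTGTTTACATTGAGAGAATTAACGTAA"

print(translate(cds_start))  # MAVNSKR
print(translate(cds_end))    # KLYKLFTLRELT*

# A 1839 nt CDS holds 613 codons: 612 residues plus the terminal stop.
print(1839 // 3 - 1)         # 612
```

Both translated fragments agree with the start and end of the protein sequence, and 612 residues matches the length of the protein listed above. Passing the full nucleotide sequence to `translate` would allow the same comparison over the whole CDS.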