DPGLEAN04792 in OGS1.0

New model in OGS2.0  DPOGS215742
Genomic Position  scaffold442:- 66551-69164
CDS Length  1464
Paired RNAseq reads  486
Single RNAseq reads  1158
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA003618 (0.0)
Best Drosophila hit  photorepair, isoform A (1e-164)
Best Human hit  ND
Best NR hit (blastp)  DNA photolyase [Aedes aegypti] (0.0)
Best NR hit (blastx)  DNA photolyase [Aedes aegypti] (0.0)
GeneOntology terms
GO:0006281 DNA repair
GO:0003904 deoxyribodipyrimidine photo-lyase activity
InterPro families
IPR005101 DNA photolyase, FAD-binding/Cryptochrome, C-terminal
IPR006050 DNA photolyase, N-terminal
IPR008148 DNA photolyase, class 2
IPR014729 Rossmann-like alpha/beta/alpha sandwich fold
Orthology group  MCL18379

Nucleotide sequence:

ATGAAACAAATACATAAAAAGCGAGAAGAAACTGCTAAGTCCATTTTAGATTTTAACTTC
AATAAGTCTCGTTTAAGAATAATATCACAAGAGCAGATGGTATCTGATGATTGCGAAGGA
ATTGTATATTGGATGTCGAGAGACAGCAGAGTTCAAGACAATTGGGCTTTTCTATACGCA
CAGGAACTGGCGTTAAAAAATAAAGTACCGCTCCATGTATGTTTCTGTTTAATAGCAAAA
TATTTGGATGCATCTGTTAGACAATTTCACTTTCTTATCAAAGGTCTCGAAAAAGTTGCT
GCTGATTGTGACAAGCTTAACATTTCATTTCACTTACTGGAAGGCAATGGTGCAGAAGTT
TTACCTCAATGGGTTATCGATCACAGGATAGGGGCTGTGGTTTGTGATTTCAATCCTCTA
AGAGTGCCATTAGGCTGGGTCGAGGGGGCAAAGAAAAAATTAAAAAAGGATGTGCCATTA
ATTCAGGTTGATGCCCATAATGTTGTGCCGTGTTGGGTGGCATCTAACAAACAGGAGTAT
TCCGCTAGAACCATAAGAAATAAGATCAACTCAAAACTTGATGAATACCTGACCGAGTTT
CCTCCGGTTATTAAACATCCACATTCAAGCAGTTTTAAACCAGAGCCAATAGATTGGGAT
AAGGCGATAGAGACGAGAGAAGCAGACAAATCTGTCGGTCCAATAGGATGGGCGGGTCCT
GGCTATGACAATGCTGTCAAAACATTGAAGAGTTTTCTTGACAAACGTCTCAAAGTCTTT
GCAACCAAAAGGAATGATCCCACTCAGGATGCACTTAGCAATTTATCACCATGGTTTCAT
TTTGGTCAAATATCAGCACAACGGGTAGCCTTGTGTGTGAAGGAGTACAAAACCAAGTAT
ACAGAGAGCGTCAATTCTTATTTAGAAGAGGCTATAGTGCGAAGAGAATTGGCTGACAAT
TTTTGTTTTTACTGTGAACATTATGATAGCATCAAAGGTGCGAGCCAGTGGGCACAGAAG
ACTTTAGACGACCATAGAAATGACAAAAGAACACATATATATACACTTGAACAGTTCTGC
AAAGCAGAAACCCATGACGACCTGTGGAACTCGGCTCAAATACAAATGGTTAAAGAGGGG
AAGATGCATGGGTTTCTAAGAATGTACTGGTGTAAGAAGATCCTAGAGTGGACCTCGAGT
CCGGAAGAGGCATTGAAATATGCCATATATTTGAACGATCATTACAGTGTTGACGGCAGG
GACCCTAGCGGATATGTTGGTTGTATGTGGTCTATCTGCGGCGTCCACGACCAGGGCTGG
GCGGAGCGTGCTGTGTTTGGCAAAATCCGTTTCATGAACTATGACGGCTGCAAACGAAAG
TTTAACGTACCAGCCTTCGTATGCAGATACGGAGGGAAAGTCCACAAATATAACAACTTG
ACCGACAAACAGAAGAAAAAGTAG

Protein sequence:

MKQIHKKREETAKSILDFNFNKSRLRIISQEQMVSDDCEGIVYWMSRDSRVQDNWAFLYA
QELALKNKVPLHVCFCLIAKYLDASVRQFHFLIKGLEKVAADCDKLNISFHLLEGNGAEV
LPQWVIDHRIGAVVCDFNPLRVPLGWVEGAKKKLKKDVPLIQVDAHNVVPCWVASNKQEY
SARTIRNKINSKLDEYLTEFPPVIKHPHSSSFKPEPIDWDKAIETREADKSVGPIGWAGP
GYDNAVKTLKSFLDKRLKVFATKRNDPTQDALSNLSPWFHFGQISAQRVALCVKEYKTKY
TESVNSYLEEAIVRRELADNFCFYCEHYDSIKGASQWAQKTLDDHRNDKRTHIYTLEQFC
KAETHDDLWNSAQIQMVKEGKMHGFLRMYWCKKILEWTSSPEEALKYAIYLNDHYSVDGR
DPSGYVGCMWSICGVHDQGWAERAVFGKIRFMNYDGCKRKFNVPAFVCRYGGKVHKYNNL
TDKQKKK
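
The nucleotide and protein sequences above can be checked against each other. The following is a minimal sketch, assuming the CDS is translated with the standard genetic code (NCBI translation table 1); the names `translate`, `first_line` and `last_line` are illustrative and not part of the record. It translates the first and last lines of the nucleotide sequence and computes how many residues a 1464-nt CDS should encode.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence listed above.
# Both start on a codon boundary because every full line holds 60 nt.
first_line = "ATGAAACAAATACATAAAAAGCGAGAAGAAACTGCTAAGTCCATTTTAGATTTTAACTTC"
last_line = "ACCGACAAACAGAAGAAAAAGTAG"

n_terminus = translate(first_line)  # MKQIHKKREETAKSILDFNF
c_terminus = translate(last_line)   # TDKQKKK*

# A 1464-nt CDS holds 488 codons: 487 residues plus the stop codon.
expected_residues = 1464 // 3 - 1

print(n_terminus, c_terminus, expected_residues)
```

The translated ends match the start (MKQIHKKREETAKSILDFNF) and end (TDKQKKK) of the protein sequence above, and the terminal TAG is read as the stop codon. Passing the full nucleotide sequence to `translate` would extend the same check to every residue.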