DPGLEAN15514 in OGS1.0

New model in OGS2.0DPOGS206937 
Genomic Positionscaffold1:- 447975-477141
See gene structure
CDS Length2337
Paired RNAseq reads  396
Single RNAseq reads  976
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012879 (7e-14)
Best Drosophila hit  Snm1 (3e-66)
Best Human hitDNA cross-link repair 1A protein (2e-70)
Best NR hit (blastp)  predicted protein [Nematostella vectensis] (2e-85)
Best NR hit (blastx)  predicted protein [Nematostella vectensis] (2e-83)
GeneOntology terms
  
GO:0006289 nucleotide-excision repair
GO:0016787 hydrolase activity
InterPro families
  
IPR001279 Beta-lactamase-like
IPR011084 DNA repair metallo-beta-lactamase
Orthology groupMCL18418

Nucleotide sequence:

ATGGGTGGTCATAAGTCTAGGACTACATCTTATCATCCACGTTCAGAAGGAATGGTGGAG
CGATTCAACAAAATATTGGAAAGATATCTGTCAAAGATAGTTTCTTTGTCGCTAAGAACA
GGTAAAATTAAATTAAATCAACCACTAGGTGAGGTTCCCCAATTGGAAGAGACAGCTACA
ACCGAACCAGCTGCCAGCACCATGGAAGGTGTATCAAGTATAACATGTAATAATAATGGC
CAATTAGGAATTACTCCTATCGCTGAACTAAATAATGATACACCAGTTTCACTCAATTTT
AACAATATAGACATAGAAGTTCCTAAAACTAACACCGACGACAATGAGAATGACAATAAT
CAAAATTCTCTGATACTAGTACTGTTACAAGAACCCATTATGTGTAGTGATACAACTCTT
CTGTATGAAGACAATAAAATTCCCATCGCTTCAACGTCATCAGTTGGTAATTTTAACAAT
ACAATTATCAATTCAAATAATAATATGAAAATTGCTGATGATTGCAAAGAGGAAAATGAA
GCAATGGATAATGTTGATTGTGTTATCGCTGAAAGCTGTCAAAAATCTGATGAGTCAACA
TTAGATGTGAACAATTTGAGTCCTAAACTCAAGCCATCAAAGAGAAAATCTGTCTCTGTC
AACTTAGAAATTAAAAAACAAAAGTTGACACATACAGATGATGGAGAAGAGAAATTAAAA
GTTCTTATTTTAAACGCAAACGCCATTGAGAAACTATCCCCTGTCATTATACAACCACAA
AATTATAAATTAGTGAAACGAAAAATTCGTGATAGAAGTATATTGAAAAACAAAGTTAAC
AAACCGGTGTGTGATACATATAATGGGAAAAGTGTTAGTGGCAATTTAAAACAGAAACCA
ATAGATTGTTATTTCACTAGTGCTTACCATCTGAGGTCTTATACAAAGAGATTAAAGGAT
AATGTTGACTCAGTTAATCACGCACTAGCCGTGGTGCGTGATGAGGCTCAGGAAATGAGC
AAAAATTTGGGTATAGCCGACCTCAAATGCGATATAGGCCGGTCGTTAAAGGAGCGCAGA
AATGACAAATTACCGAAACTATTGCCGTATTCAGCACCAGCTAAACGAGATATCAAGGCG
TCCATGTCGGAGGCGCCCAGCGGTTTGTCGCCGAGAAATAGACAGGATAGCCTAGGCTCC
CATTCGCCGAAAAAAAACGCTGACTCCATTCAGCTCGGTTTGGGATTGAAGACGAAGTTA
AATAAAGCAGGCGTCAACCGAAACATTCCACATTATAAAATTGTCGCAGGGACGCATTTT
GCCGTCGATGCATTCTCATACGGCGATATACCCAACGTGAAACATTACTTCCTGACGCAT
TTCCATTCCGACCATTATTCGGGTTTAAAGAAGAATTTCAACAAATTGATTTTCTGTTCG
CAGATTACAGCGGATTTATGCATTTCGCGTTTGGGCGTTAATTTGAAATGCTTCCACGTT
ATAAACGTCGATGAAACTATAAAAATTGAGGGTGTCGAGGTCACAGCCGTTGACGCTAAT
CACTGCCCTGGCGCTTTGATGTTGGTATTCACTTTGCCCAATGGTAAGACGCTTCTGCAT
ACTGGAGATTTTAGGGCGTGTCCTCCTATGGAGTCATATCCTGTTTTTTGGAACAAAGAT
ATACACACAATATATCTTGATACGACCTATTGCAATCCTCGCTATGACTTTCCAACGCAA
GATCAAAGCTTGGAGATGGCTCTGTACATTTTGAGGCAGAAGAAAATTACTCTAGAAAAG
GCCGGGAAGCAGTTTTCATCTGTACTCATAGTGTGTGGAACTTACACCATTGGTAAGGAG
AAGTTCTTCCTTGGTCTGGCTCGTCGCGTGGGATGTTCAGTGTGGGCGTGTCCGGAGAAG
GACCGCGTGCTGCAGGCGGTGGAGGGTCGCAGCTTCAATCACTCGCAACCAGCCAGCTGT
CAGCTGCATGTTGTGCCCATGAGGGATCTGGTGCATGAGAAATTACAGACATATCTGGAA
AGTCTGAAAGGATCGTTCAGCGAAGTTGTTGCTTTCAAACCCAGTGGCTGGGAGAATGGT
AGAAATTCATCAGTGCAAAAGGACTCTGTTACAATACATGGTATACCATACAGTGAACAT
TCTAGTTTTTCAGAAATGATTAGATTTGTCAAATTCCTAAAGCCGAAACAAGTTGTGCCC
ATAGTTGATATTTCCGGTGGGATTAAAACTGTACAGAAGTTTTTTCCTTGTCCTTTGGTT
AATAGAGATGATCTGCAGTGTCAAAGTCGGGTCACAGACTACTTCACACACGGCTGA

Protein sequence:

MGGHKSRTTSYHPRSEGMVERFNKILERYLSKIVSLSLRTGKIKLNQPLGEVPQLEETAT
TEPAASTMEGVSSITCNNNGQLGITPIAELNNDTPVSLNFNNIDIEVPKTNTDDNENDNN
QNSLILVLLQEPIMCSDTTLLYEDNKIPIASTSSVGNFNNTIINSNNNMKIADDCKEENE
AMDNVDCVIAESCQKSDESTLDVNNLSPKLKPSKRKSVSVNLEIKKQKLTHTDDGEEKLK
VLILNANAIEKLSPVIIQPQNYKLVKRKIRDRSILKNKVNKPVCDTYNGKSVSGNLKQKP
IDCYFTSAYHLRSYTKRLKDNVDSVNHALAVVRDEAQEMSKNLGIADLKCDIGRSLKERR
NDKLPKLLPYSAPAKRDIKASMSEAPSGLSPRNRQDSLGSHSPKKNADSIQLGLGLKTKL
NKAGVNRNIPHYKIVAGTHFAVDAFSYGDIPNVKHYFLTHFHSDHYSGLKKNFNKLIFCS
QITADLCISRLGVNLKCFHVINVDETIKIEGVEVTAVDANHCPGALMLVFTLPNGKTLLH
TGDFRACPPMESYPVFWNKDIHTIYLDTTYCNPRYDFPTQDQSLEMALYILRQKKITLEK
AGKQFSSVLIVCGTYTIGKEKFFLGLARRVGCSVWACPEKDRVLQAVEGRSFNHSQPASC
QLHVVPMRDLVHEKLQTYLESLKGSFSEVVAFKPSGWENGRNSSVQKDSVTIHGIPYSEH
SSFSEMIRFVKFLKPKQVVPIVDISGGIKTVQKFFPCPLVNRDDLQCQSRVTDYFTHG