DPGLEAN20817 in OGS1.0

New model in OGS2.0DPOGS201081 
Genomic Positionscaffold358:+ 6090-13441
See gene structure
CDS Length3669
Paired RNAseq reads  1508
Single RNAseq reads  3499
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001390 (0.0)
Best Drosophila hit  ND
Best Human hittumor suppressor p53-binding protein 1 isoform 2 (1e-27)
Best NR hit (blastp)  AGAP001466-PA [Anopheles gambiae str. PEST] (6e-49)
Best NR hit (blastx)  AGAP001466-PA [Anopheles gambiae str. PEST] (1e-46)
GeneOntology terms













  
GO:0005634 nucleus
GO:0006974 response to DNA damage stimulus
GO:0006350 transcription
GO:0003677 DNA binding
GO:0005622 intracellular
GO:0042162 telomeric DNA binding
GO:0005694 chromosome
GO:0005657 replication fork
GO:0005515 protein binding
GO:0008134 transcription factor binding
GO:0003684 damaged DNA binding
GO:0006281 DNA repair
GO:0006355 regulation of transcription, DNA-dependent
GO:0000776 kinetochore
GO:0045449 regulation of transcription
InterPro families  IPR001357 BRCT
Orthology groupMCL21825

Nucleotide sequence:

ATGGAGGATTCCAGTGATAACATTATCACTTTTAATTCATCAGCAGGAGATTCAGAGACG
AAGACAGAGGTGCCCGTAAGTCCCGACAAACCCAAGGATATATCTCAGAACCTCTTCCCC
GAGAGTCCGGTGTATGAACCTGATGATGAAGTAAAATGCATTGAGGATGACAGCAATATG
AATGATCAGGTCGAAAAAGATGACAACCCCGTCCACGCTAAACGGAAACTCTCAGAGGAA
ACATGTGATAACAACGCTAAAATACTTAAACTGGATCTGACGGAAGACGAAGTTACTGTG
ACGACGCAGACCTTGGACGACGAGGTGTCGGAAATCAATGATGTGGGAAACCAGAATGAG
GAACTACCCTCGACATCGCGCGCGCGGACGCCAGTGCGGTCGAGTCCTGGGAGGAACAAG
AGAAGCAACAGCGTCGAAGTCACCCCAAAAACCCCAAAATCTCGATCCGCCAGTGTCGAA
GTCGTCGCCAAGAACCCAAGAACCCCAAAATCTAGATCCGCCAGTGTCGATGTCGCCCCT
AAGACCCCAAAAACCCCAAAATCCAGGTCCGCCAGTGTTGACTTATCTAAAGCTCAGACG
CCCCGCAGAGTACCGTTAGATTTATTAGCAGAAAAAGACGATGACGTCATTATACATTCC
GATGACAGTTCAAGCCGACTGAGCGTGGAGTTCGTTAGAGAACTACCAGCGCGGAAACTA
CCGGACACCATCCCAGAAGAAAGCGGCTCCGAACTCAACGATTCGCAGAACTTCCATTTA
ATTCTGTCGCCGGTCGAGAACTATGACCCGAATGATAACGAGAAATCGAATGCAGACACG
AACTACGATACTGAGAAAGTGACCGGCAAAGAGAGGGAAGTTATTGATGCTGGGAAGGTT
GCGACTAGAAGCGGCTACAACTACTCCGAGATGAGCAGCGTCACATCTCAGGCACCAAAC
GACTGTAAGGAACGTGCGACGGACCAGGAGAGCACGGATAGCATAGGGTCGATGGGCCTC
GACAGCCCCGAACCAGTTGCTAGCGTCGCTTCCAAGCTCATATCCAAGCTGTCTAACGGA
AACTCATCCACACCGACGGACGTCAACGATCCGGCTCAAGAAATAACCCCCGATATGTGC
AAGCTGAATGGGAACAAGGGGAAACGGGGGAAAAACGAATCGTTCAGGGTCAGCAACACG
ACGACGCCGTCCACCATCTCGCCTTTGCAGAACGGACATTCCTTGCCTATAAGCACACCC
CTGATACCTGTTTTCGACGTACATGTCAGTCACAACGAGGACTGCGAGTTCCTGTCGCTG
TACGTCGTTAGAACCGACAACGACGTGGGCATGGACATGTGCAGGGAGTACCAAAGAATA
TCGAAGAGGTTCACCATAGATCCTTACTTAGGCGACGTGTCGGTCAGTAACTCCCCGTCC
AGCGTCACGAGCGGCGGACTCATGAGTTTGCCGAACAGAACCTCCTTCGCCTCGACTATT
AGTTCAACGTCATCATCCAGCACTCGCACCAGTGACGGAGCCTTCGTGGTCCCCCCACCT
CCGAGGAAATCCGTATCCAACCCTACAACCACCGTAAAAGGTTACGAGGCACTGATGAAA
AAACTGCAAGACATATTTTCGCACATCAGAGACGCGTCCATAGAAGCGAACCGATCTCTC
AACGACGACAAAATATCCGTCGGCATTCAGGCTTCCATATCCGAAGCTACCTTCAGCAAC
GGCAACGCGAGTCCAGAAGAGGTCAGCAAGTGTGACAAAGCGACGCCGAAGAGCTCGCTC
AAAAAGACACGAGTGAGAGGACGAAGGCCGATAGCGGGGAAAACTAAGAGAGCCTTGCTG
CCCACGCAACACGAAGAAGCGGAGTACATGCAGGGAATGAACTCACCGGAAATGATTCCC
AGCAACGGAGACACCGGGAAGATATCACCGAAAGAGGAAAAGCCCGCTGTTGTCGGAACA
CCGAAGTCGGTTAGCAAGTTAAAACAAAAACGTAGACCTCCCTCCCCGCGGCCGGCGACT
CCGGTCGAGAAGGCGATCGCGAAGCCCGAGTACCCCGGCTTTGCACCGGACACGGTAGTC
CTGGCCAAATGGGTGGACAAGAGATACTATTCCGGAAAAGTACTAGAGATCACCGAACCC
AACAAGTATCTGATCAAGTTCGACGACGGTCAGAGCAAAGTCCTCCTGGACGACTTCATA
ATATTCGGCGACATGAAGAAGCTGCCGCTGCAAGGACAGTCGGTGTACGCGCTGGTCGAC
GAGGAGTTGAACTACGAACCGGGACTGGTGCTGGGGGTGGAGGAGAACGGTAGCGGCACG
GTCACCTACAGATGCACCACCGACGGGGACACGATAGTAGTGGTGACGGCGAGCGAGTTA
TATCTCACCGAAGACCAGGCCAGGTCGCTCAAGGAGTCCAGGGCCAGGTCACCAGCAACG
CCGACCACGCCCAGGCGGAGACATCACAGAGAGCTAGACCTCGATAATATTATACAGGGT
CCTCGCAGTGCAAGAAGTCGAGACAAAGGCAGCTCCAGTGCAAGAAAACGAGTGGCGTCA
CCCAAAAGTCCCAAAGCATCTACCTCAGGTGTTAAAACGAAGAGCATAGCTCGCAAGCGT
CTGGCTAGCGAAAGTAGCGAGTTGAGTGAGAACAGCAACTCGGCGCCGGCCAGGATCGAG
GAGGTCGCTGGGGTGGAGCCCGAGGTGCAGCGGACGCCGAGGAAGATAGACGGAGTTAAG
GCCGGACCCCTTCAGTTGAAGGGAGCGGCCAAACAGAACATTGGGAAGAAGAATTCTAAG
CTGACGAAGTTTGAAAACGATGAAGATACTATCTCAGCGCTGGGGCCCATCCCCACCGAC
AGCAAGATGTTCGCTGGCTATTGTTTCCTTCTAACATGTACGGAACCACCGAAGAAGAAT
AGAGTGACGGACAGGAAGGAGAAACAGATGAACCAGGACAGCCGGCATTACTCCTCGGAG
GAAGACGGCGAGAGCACAGCCGCTGGGACGGACACGGAGGACCTGGTGTTCTGTGAACGA
CCCTACAACAAGGAACGACTGCGGGAACAGCTGGAAACAGCTGGAGGAGTTGTTTACAGT
CATTTCGACGACGTGCCAAAGACGAAGTACCCGCAATGCTACCTGATATCGCCCCGTCCC
TGCCTCACCGCTAAGTACATCTCCTGCCTGGCCGCGGCGATAAAGGCCGTGTCCCACGAC
TGGGTGATACAATCTTGCATGGTGGGTCACCTGCTGGATGTGGACTCGTTCGTGCTGCCC
ACCGGCTGGAGCTTAAAGAAGTCATCATTCGTTAATTGGACGACATCATCTGGCAAAAGA
AACACGACCTTCAAGGACAAGATAATACTCCTGTGCGGAGATCAAGATACATTTGTTAAG
TTCTGGGAGCGCGTCTGCACGTTGGCCGGCGCTACGACAAGAATTGTCAATGAAGATAAC
TTAAATATGACCGGGGCCATTGCCCTGGTGACCGAGTGGGACTGTCCTCATGAAGTACAG
AATAAAGCGAACCAGGATAACATACCGCTGGTGTCGACGACCTGGGTGGTCCAGTGCCTG
ATTGAGGGCAAGGTCGTCGCCCCCACCGCCTTGGACAAGTTCTCATTTATGTACGCGGAG
CCCGAATGA

Protein sequence:

MEDSSDNIITFNSSAGDSETKTEVPVSPDKPKDISQNLFPESPVYEPDDEVKCIEDDSNM
NDQVEKDDNPVHAKRKLSEETCDNNAKILKLDLTEDEVTVTTQTLDDEVSEINDVGNQNE
ELPSTSRARTPVRSSPGRNKRSNSVEVTPKTPKSRSASVEVVAKNPRTPKSRSASVDVAP
KTPKTPKSRSASVDLSKAQTPRRVPLDLLAEKDDDVIIHSDDSSSRLSVEFVRELPARKL
PDTIPEESGSELNDSQNFHLILSPVENYDPNDNEKSNADTNYDTEKVTGKEREVIDAGKV
ATRSGYNYSEMSSVTSQAPNDCKERATDQESTDSIGSMGLDSPEPVASVASKLISKLSNG
NSSTPTDVNDPAQEITPDMCKLNGNKGKRGKNESFRVSNTTTPSTISPLQNGHSLPISTP
LIPVFDVHVSHNEDCEFLSLYVVRTDNDVGMDMCREYQRISKRFTIDPYLGDVSVSNSPS
SVTSGGLMSLPNRTSFASTISSTSSSSTRTSDGAFVVPPPPRKSVSNPTTTVKGYEALMK
KLQDIFSHIRDASIEANRSLNDDKISVGIQASISEATFSNGNASPEEVSKCDKATPKSSL
KKTRVRGRRPIAGKTKRALLPTQHEEAEYMQGMNSPEMIPSNGDTGKISPKEEKPAVVGT
PKSVSKLKQKRRPPSPRPATPVEKAIAKPEYPGFAPDTVVLAKWVDKRYYSGKVLEITEP
NKYLIKFDDGQSKVLLDDFIIFGDMKKLPLQGQSVYALVDEELNYEPGLVLGVEENGSGT
VTYRCTTDGDTIVVVTASELYLTEDQARSLKESRARSPATPTTPRRRHHRELDLDNIIQG
PRSARSRDKGSSSARKRVASPKSPKASTSGVKTKSIARKRLASESSELSENSNSAPARIE
EVAGVEPEVQRTPRKIDGVKAGPLQLKGAAKQNIGKKNSKLTKFENDEDTISALGPIPTD
SKMFAGYCFLLTCTEPPKKNRVTDRKEKQMNQDSRHYSSEEDGESTAAGTDTEDLVFCER
PYNKERLREQLETAGGVVYSHFDDVPKTKYPQCYLISPRPCLTAKYISCLAAAIKAVSHD
WVIQSCMVGHLLDVDSFVLPTGWSLKKSSFVNWTTSSGKRNTTFKDKIILLCGDQDTFVK
FWERVCTLAGATTRIVNEDNLNMTGAIALVTEWDCPHEVQNKANQDNIPLVSTTWVVQCL
IEGKVVAPTALDKFSFMYAEPE