DPGLEAN10797 in OGS1.0

New model in OGS2.0DPOGS215661 
Genomic Positionscaffold613:- 338-9249
See gene structure
CDS Length3897
Paired RNAseq reads  270
Single RNAseq reads  639
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003556 (0.0)
Best Drosophila hit  Ligase4 (2e-75)
Best Human hitDNA ligase 4 (2e-108)
Best NR hit (blastp)  PREDICTED: similar to DNA ligase IV [Acyrthosiphon pisum] (9e-128)
Best NR hit (blastx)  PREDICTED: similar to DNA ligase IV [Acyrthosiphon pisum] (5e-128)
GeneOntology terms


































  
GO:0000166 nucleotide binding
GO:0005634 nucleus
GO:0010165 response to X-ray
GO:0033077 T cell differentiation in the thymus
GO:0050769 positive regulation of neurogenesis
GO:0003677 DNA binding
GO:0045190 isotype switching
GO:0046872 metal ion binding
GO:0007417 central nervous system development
GO:0008283 cell proliferation
GO:0032807 DNA ligase IV complex
GO:0005524 ATP binding
GO:0043524 negative regulation of neuron apoptosis
GO:0035019 somatic stem cell maintenance
GO:0033152 immunoglobulin V(D)J recombination
GO:0006303 double-strand break repair via nonhomologous end joining
GO:0005622 intracellular
GO:0051276 chromosome organization
GO:0006297 nucleotide-excision repair, DNA gap filling
GO:0005958 DNA-dependent protein kinase-DNA ligase 4 complex
GO:0002328 pro-B cell differentiation
GO:0001701 in utero embryonic development
GO:0070419 nonhomologous end joining complex
GO:0033153 T cell receptor V(D)J recombination
GO:0007049 cell cycle
GO:0003910 DNA ligase (ATP) activity
GO:0000793 condensed chromosome
GO:0016874 ligase activity
GO:0000012 single strand break repair
GO:0008022 protein C-terminus binding
GO:0010332 response to gamma radiation
GO:0048146 positive regulation of fibroblast proliferation
GO:0051102 DNA ligation involved in DNA recombination
GO:0051103 DNA ligation involved in DNA repair
GO:0006260 DNA replication
GO:0051301 cell division
InterPro families









  
IPR016027 Nucleic acid-binding, OB-fold-like
IPR001357 BRCT
IPR019734 Tetratricopeptide repeat
IPR000977 DNA ligase, ATP-dependent
IPR012310 DNA ligase, ATP-dependent, central
IPR012308 DNA ligase, ATP-dependent, N-terminal
IPR012309 DNA ligase, ATP-dependent, C-terminal
IPR013026 Tetratricopeptide repeat-containing
IPR016059 DNA ligase, ATP-dependent, conserved site
IPR011990 Tetratricopeptide-like helical
IPR012340 Nucleic acid-binding, OB-fold
Orthology groupMCL11475

Nucleotide sequence:

ATGGATGTGGATAAAGATCTTACTAATAAATTTTTAAGTGGTGAGATGTCTTTCTCCCAA
TACTCTAGTGAATGGTATAGTGGAGAAGAGGATGAAGATGAAGATGAGCCAGAGGAATCC
AAAAAATATGAAGAAGAAGCTGAAATGTCTACCACAGTTTCAAAGAGAGGTCTTAAACGA
CAATCCAAGTTCCGTCGCCTCTTTCCTGCATTATCTGGTCTTATGGGAGAAGCAAATATA
AGGCTTGCCAGGGGTGATAGTGAAATGGCTGAACGTATGTGCCATGAAATAATCAAACAA
CAACCCACAGCGGCTGAACCATATCAAACCTTAGCACAAATATACGAACATGATCCCAAT
AAATCATTGCAGTTTTCTTTGCTTGCTGCACATTTGAGTTTTACAGACAAAAGTGAATGG
TGGAGACTCGCTGCATTATGTAGACAGAGAAGTGATTATAAACAGGAAATGGTCTGTTAC
ACTCAGGCTATAAAATCTGAGCCACAAAATTTAGAGACACACTTGAAAAGGCTAGAGTTG
TTGTCAGAATTAGAAAAACTACCGGACTTTCCCGTTAATTCACTGAAAGTATCTAAGGTG
AAATGTTATCACAAAATTGTACGTTCCTTAGGACCTAGTGATGCTGAAACAATTATGAAG
TATGCCAAAATGGCTGCAACTTTATATCACAACAGCACCGAAGTTGAACAAGCAGTTGAA
GTGATGGGTATTGCATATAAAAAATGCTTTTCATTATTTACATTGGAGGATATTAATATG
TATTTGGAGCTGTTAATTACTCAAAAGCAGTTCACCAAATGTATTGAAGTATTTGTTTCA
AGTATAGGTGTGGAAATTGAAGCTGAAATTCAAACAGTGAAAAATGCTAATGGTGATATT
GAAGAACAAACACACTACCTTAATTGTGTTATACCCAATAACTTAGCTATAGATTTGAAA
AGTAAACTATTGGTGTGCTTTATACATTTAGGAGCACTTAATTTGGTCCAATCATTGCTT
AATGATTTTTTGAGCAGTGATGTTGAAAAAGCTGGAGATCTCTATATGGATATAGAAGAA
GCATTTTCAGCTGTTGGTCATTATGAGATGGCTATAAAATTATTGGAGCCTCTAATTAAA
AATACTAGCTTTGATTTAGGAGCTGTATGGCTTAAATATGCAGATTGCCTGAACAAGTTG
GGAAGACATGATGATGCTATAGAATCATATTACAAAGTGTTAAAGCATGTGCCACAACAC
GCTGACGCGAGGCGAAAGCTGTTTACAATTCTAGAAAACAAAGGAAGAATTGATGACGCT
TTGAACATTCTACAGCAGGATTACAAATTTGTCGTCAGCGCTCATCTACTGTTTGATCAT
TGTCAATACTTAAAGAAATATAATAGAATGTTGAAATATTTGGAGAACTCTACTTTTTTT
CCCATATTGAGGTTGCTCTTGCCAAGTTGTGATCGGGAACGTGGTCCCTACAACCTTAAA
GAAACCAGACTAAGTACTTTATTGGTAAAAGTACTGTCTCTCAATAAAGAGTCGACAGAT
GCGAAACAACTGATACATTTTAGTTCTTCAAATAACTCAGTTCTAGATAGCGACTTCCCT
GGTGTCGCGTTTTACGTTATAAAGAAAAGAGTTGGTCAGAATAATTCAGTATTGACAGTC
AGAGAGATCAATGAGATACTTAACTCTGTTGCAACTGTAGATAATGTTCATAAAACTCCA
TTGGATGAAATTTTTAGTTATGCTTTAAAAAAACTGACTGCCATCGAATTCAAATGGCTT
CTGAGAATAATATTAAAGGATTTAAAATTAAGTATGAGTGCAGATCGAATCTTGGGGATT
TTCCATCCAGATGCCCCAGAGGTCTTCAAGAACTGCAGCAGTATTTTAAAGGTGTGCGAA
GAATTAGAAGATGGCGACACTCGACCATCAGAACTGGGCGTCAATTTGTTCTACGCTGTA
AGACCAATGCTGTCTGAGAGGTTGGACATCACACACATACACGTCTTGGATAAGACGAAG
ACCTACTGTATGGAGGAGAAGTTTGATGGTGAGAGATTCCAGATGCACATGGATAACAAC
GTATTTGAATACTTTTCACGGAAAGGTTTCAAGTACTCCAAAAACTATGGGCAAAGTTAC
GACTCCGGCATGTTAACGCCGTATTTGAAGGATATTTTTGCTCCTGAGGCGAGGAATTTC
ATTCTTGACGGTGAAATGATGGGTTGGCACAAAATAGATAATTATTTCGGATGCAAAGCG
ATGTCATACGATGTTAAGAAAATCACAGAGAACAGTTCGTTCCGCCCTTGCTTTTGCGTG
TTTGATATTCTATATTATAACGACAGACCACTCATCGGCTCGCCAGATAAGGGCGGTTTA
CCTTTACGGGAACGACTCAAAATACTCGACGATCTATTCATAGACAAGCGAGGTGTTATA
GAACATAGCAAGCGAAAAATTATCAAAGAAAGTTCAGAAGTTGTGGACGCCGTCAACGAT
GCCATAGACAATCAGGACGAGGGTATTGTAGTTAAAGATATAAATTCATACTACATCGCT
AACAAAAGAAACGCTGGCTGGTACAAAATAAAACCGGAGTATACGGACGACACCATGAAT
GACCTAGACCTGGTGGTGGTTGGTGCTGATGAAGCCACCAACAAAAGACAGGGGCGTGCC
AAAAGTTTCTATGTCGCGTGTGGGGATAACAATGATGGCGACCCTGTCTGGACCTGCATT
GGCCGCGTGTCTAACGGACTGAAGCACGAGGAGAAGGAACGCGTTTGTTCATTACTTGAA
CGGAACTGGTGTATGTATAGGAAAAAACCTCCGCCTCCCTGTCTGCGCTTCGGCAAAGAC
AAGCCGGACTTCTGGATACTTCCAGAACATTCTATCGTATTGCAGGTGCGTGCCACCGAG
CTGTTAAGCGTTGGGGACTCACACGTGCTGCGATTCCCGCGCGTGGAAGATATAAGATCA
GACAAGCCGGTCGATGACGTGTGCACAATACACGAACTTAGACAACTGGCTGTGAGCAGA
AGCCCGGTCAGTAAGCTAAGTACAAAGCGCGTAAACGAATCGCAAATAGATCAAAACTAT
ATTAAAACACGCAAGCGCGGTCTGTCTAAGACCGTCCAAGTAGCGGAAAAATTCCGCACA
AAGACGATTGGAGACGTGCAAGTTATATCACGAGCTTTGTTTGGGAAGAAACTTTGTGTG
TTGTCGGATGACGAGGATTGTAAGAAAACGGAATTGAAACGCGTCATAGAGTCCCACGGA
GGGAGACACGTTGAGAACCCAGGTTCAGATACTTGGTGCTGTGTAGTGGGAACTATAACA
CCGCGAGCCCGTAGACTCATAGAGACACAAGACCTAGACATCATTAGCACAGCCTGGCTC
AGAAGCCTACCAGCGACAGACGACCCGTGTCAACTGTCGCCATTGGACATGCTATCAATC
AAACCCGAAACGAAGCTCAAACTGAGCCTAGACTACGACCCCTTCGGTGATAGTTACAAG
GATGAAATAGATGAAAAAACATTGAAGAAACTGCTGGACAAAATGGATTCGGAGTTCCCG
TTGTATCCAACTTTAAAAGAAAAAGTCTGTCTGGATAAACAATTATTCGGCGCCAACAAT
CCTTACTCATTTTTGAGGAATTGTTTCATTCACGTTATTGACAATTCGCTTTACGAAACT
ATGGCGTCCTTTTTCGGAGCCAAAATCTGTTCTCTCGATGACGTCAGACTGACGCACGTC
GTTATGTCAAAAGACGCGAATGTCAAAATAGATAAAGGAATTCTAGTGTCGGATGGATGG
TTGGAAGAATGTTTTAACAAAAGGAGTTTTGTTCCTGTCGATGATTATCTAATTTAA

Protein sequence:

MDVDKDLTNKFLSGEMSFSQYSSEWYSGEEDEDEDEPEESKKYEEEAEMSTTVSKRGLKR
QSKFRRLFPALSGLMGEANIRLARGDSEMAERMCHEIIKQQPTAAEPYQTLAQIYEHDPN
KSLQFSLLAAHLSFTDKSEWWRLAALCRQRSDYKQEMVCYTQAIKSEPQNLETHLKRLEL
LSELEKLPDFPVNSLKVSKVKCYHKIVRSLGPSDAETIMKYAKMAATLYHNSTEVEQAVE
VMGIAYKKCFSLFTLEDINMYLELLITQKQFTKCIEVFVSSIGVEIEAEIQTVKNANGDI
EEQTHYLNCVIPNNLAIDLKSKLLVCFIHLGALNLVQSLLNDFLSSDVEKAGDLYMDIEE
AFSAVGHYEMAIKLLEPLIKNTSFDLGAVWLKYADCLNKLGRHDDAIESYYKVLKHVPQH
ADARRKLFTILENKGRIDDALNILQQDYKFVVSAHLLFDHCQYLKKYNRMLKYLENSTFF
PILRLLLPSCDRERGPYNLKETRLSTLLVKVLSLNKESTDAKQLIHFSSSNNSVLDSDFP
GVAFYVIKKRVGQNNSVLTVREINEILNSVATVDNVHKTPLDEIFSYALKKLTAIEFKWL
LRIILKDLKLSMSADRILGIFHPDAPEVFKNCSSILKVCEELEDGDTRPSELGVNLFYAV
RPMLSERLDITHIHVLDKTKTYCMEEKFDGERFQMHMDNNVFEYFSRKGFKYSKNYGQSY
DSGMLTPYLKDIFAPEARNFILDGEMMGWHKIDNYFGCKAMSYDVKKITENSSFRPCFCV
FDILYYNDRPLIGSPDKGGLPLRERLKILDDLFIDKRGVIEHSKRKIIKESSEVVDAVND
AIDNQDEGIVVKDINSYYIANKRNAGWYKIKPEYTDDTMNDLDLVVVGADEATNKRQGRA
KSFYVACGDNNDGDPVWTCIGRVSNGLKHEEKERVCSLLERNWCMYRKKPPPPCLRFGKD
KPDFWILPEHSIVLQVRATELLSVGDSHVLRFPRVEDIRSDKPVDDVCTIHELRQLAVSR
SPVSKLSTKRVNESQIDQNYIKTRKRGLSKTVQVAEKFRTKTIGDVQVISRALFGKKLCV
LSDDEDCKKTELKRVIESHGGRHVENPGSDTWCCVVGTITPRARRLIETQDLDIISTAWL
RSLPATDDPCQLSPLDMLSIKPETKLKLSLDYDPFGDSYKDEIDEKTLKKLLDKMDSEFP
LYPTLKEKVCLDKQLFGANNPYSFLRNCFIHVIDNSLYETMASFFGAKICSLDDVRLTHV
VMSKDANVKIDKGILVSDGWLEECFNKRSFVPVDDYLI