New model in OGS2.0 | DPOGS215661  |
---|---|
Genomic Position | scaffold613:- 338-9249 |
See gene structure | |
CDS Length | 3897 |
Paired RNAseq reads   | 270 |
Single RNAseq reads   | 639 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003556 (0.0) |
Best Drosophila hit   | Ligase4 (2e-75) |
Best Human hit | DNA ligase 4 (2e-108) |
Best NR hit (blastp)   | PREDICTED: similar to DNA ligase IV [Acyrthosiphon pisum] (9e-128) |
Best NR hit (blastx)   | PREDICTED: similar to DNA ligase IV [Acyrthosiphon pisum] (5e-128) |
GeneOntology terms    | GO:0000166 nucleotide binding GO:0005634 nucleus GO:0010165 response to X-ray GO:0033077 T cell differentiation in the thymus GO:0050769 positive regulation of neurogenesis GO:0003677 DNA binding GO:0045190 isotype switching GO:0046872 metal ion binding GO:0007417 central nervous system development GO:0008283 cell proliferation GO:0032807 DNA ligase IV complex GO:0005524 ATP binding GO:0043524 negative regulation of neuron apoptosis GO:0035019 somatic stem cell maintenance GO:0033152 immunoglobulin V(D)J recombination GO:0006303 double-strand break repair via nonhomologous end joining GO:0005622 intracellular GO:0051276 chromosome organization GO:0006297 nucleotide-excision repair, DNA gap filling GO:0005958 DNA-dependent protein kinase-DNA ligase 4 complex GO:0002328 pro-B cell differentiation GO:0001701 in utero embryonic development GO:0070419 nonhomologous end joining complex GO:0033153 T cell receptor V(D)J recombination GO:0007049 cell cycle GO:0003910 DNA ligase (ATP) activity GO:0000793 condensed chromosome GO:0016874 ligase activity GO:0000012 single strand break repair GO:0008022 protein C-terminus binding GO:0010332 response to gamma radiation GO:0048146 positive regulation of fibroblast proliferation GO:0051102 DNA ligation involved in DNA recombination GO:0051103 DNA ligation involved in DNA repair GO:0006260 DNA replication GO:0051301 cell division |
InterPro families    | IPR016027 Nucleic acid-binding, OB-fold-like IPR001357 BRCT IPR019734 Tetratricopeptide repeat IPR000977 DNA ligase, ATP-dependent IPR012310 DNA ligase, ATP-dependent, central IPR012308 DNA ligase, ATP-dependent, N-terminal IPR012309 DNA ligase, ATP-dependent, C-terminal IPR013026 Tetratricopeptide repeat-containing IPR016059 DNA ligase, ATP-dependent, conserved site IPR011990 Tetratricopeptide-like helical IPR012340 Nucleic acid-binding, OB-fold |
Orthology group | MCL11475 |
Nucleotide sequence:
ATGGATGTGGATAAAGATCTTACTAATAAATTTTTAAGTGGTGAGATGTCTTTCTCCCAA
TACTCTAGTGAATGGTATAGTGGAGAAGAGGATGAAGATGAAGATGAGCCAGAGGAATCC
AAAAAATATGAAGAAGAAGCTGAAATGTCTACCACAGTTTCAAAGAGAGGTCTTAAACGA
CAATCCAAGTTCCGTCGCCTCTTTCCTGCATTATCTGGTCTTATGGGAGAAGCAAATATA
AGGCTTGCCAGGGGTGATAGTGAAATGGCTGAACGTATGTGCCATGAAATAATCAAACAA
CAACCCACAGCGGCTGAACCATATCAAACCTTAGCACAAATATACGAACATGATCCCAAT
AAATCATTGCAGTTTTCTTTGCTTGCTGCACATTTGAGTTTTACAGACAAAAGTGAATGG
TGGAGACTCGCTGCATTATGTAGACAGAGAAGTGATTATAAACAGGAAATGGTCTGTTAC
ACTCAGGCTATAAAATCTGAGCCACAAAATTTAGAGACACACTTGAAAAGGCTAGAGTTG
TTGTCAGAATTAGAAAAACTACCGGACTTTCCCGTTAATTCACTGAAAGTATCTAAGGTG
AAATGTTATCACAAAATTGTACGTTCCTTAGGACCTAGTGATGCTGAAACAATTATGAAG
TATGCCAAAATGGCTGCAACTTTATATCACAACAGCACCGAAGTTGAACAAGCAGTTGAA
GTGATGGGTATTGCATATAAAAAATGCTTTTCATTATTTACATTGGAGGATATTAATATG
TATTTGGAGCTGTTAATTACTCAAAAGCAGTTCACCAAATGTATTGAAGTATTTGTTTCA
AGTATAGGTGTGGAAATTGAAGCTGAAATTCAAACAGTGAAAAATGCTAATGGTGATATT
GAAGAACAAACACACTACCTTAATTGTGTTATACCCAATAACTTAGCTATAGATTTGAAA
AGTAAACTATTGGTGTGCTTTATACATTTAGGAGCACTTAATTTGGTCCAATCATTGCTT
AATGATTTTTTGAGCAGTGATGTTGAAAAAGCTGGAGATCTCTATATGGATATAGAAGAA
GCATTTTCAGCTGTTGGTCATTATGAGATGGCTATAAAATTATTGGAGCCTCTAATTAAA
AATACTAGCTTTGATTTAGGAGCTGTATGGCTTAAATATGCAGATTGCCTGAACAAGTTG
GGAAGACATGATGATGCTATAGAATCATATTACAAAGTGTTAAAGCATGTGCCACAACAC
GCTGACGCGAGGCGAAAGCTGTTTACAATTCTAGAAAACAAAGGAAGAATTGATGACGCT
TTGAACATTCTACAGCAGGATTACAAATTTGTCGTCAGCGCTCATCTACTGTTTGATCAT
TGTCAATACTTAAAGAAATATAATAGAATGTTGAAATATTTGGAGAACTCTACTTTTTTT
CCCATATTGAGGTTGCTCTTGCCAAGTTGTGATCGGGAACGTGGTCCCTACAACCTTAAA
GAAACCAGACTAAGTACTTTATTGGTAAAAGTACTGTCTCTCAATAAAGAGTCGACAGAT
GCGAAACAACTGATACATTTTAGTTCTTCAAATAACTCAGTTCTAGATAGCGACTTCCCT
GGTGTCGCGTTTTACGTTATAAAGAAAAGAGTTGGTCAGAATAATTCAGTATTGACAGTC
AGAGAGATCAATGAGATACTTAACTCTGTTGCAACTGTAGATAATGTTCATAAAACTCCA
TTGGATGAAATTTTTAGTTATGCTTTAAAAAAACTGACTGCCATCGAATTCAAATGGCTT
CTGAGAATAATATTAAAGGATTTAAAATTAAGTATGAGTGCAGATCGAATCTTGGGGATT
TTCCATCCAGATGCCCCAGAGGTCTTCAAGAACTGCAGCAGTATTTTAAAGGTGTGCGAA
GAATTAGAAGATGGCGACACTCGACCATCAGAACTGGGCGTCAATTTGTTCTACGCTGTA
AGACCAATGCTGTCTGAGAGGTTGGACATCACACACATACACGTCTTGGATAAGACGAAG
ACCTACTGTATGGAGGAGAAGTTTGATGGTGAGAGATTCCAGATGCACATGGATAACAAC
GTATTTGAATACTTTTCACGGAAAGGTTTCAAGTACTCCAAAAACTATGGGCAAAGTTAC
GACTCCGGCATGTTAACGCCGTATTTGAAGGATATTTTTGCTCCTGAGGCGAGGAATTTC
ATTCTTGACGGTGAAATGATGGGTTGGCACAAAATAGATAATTATTTCGGATGCAAAGCG
ATGTCATACGATGTTAAGAAAATCACAGAGAACAGTTCGTTCCGCCCTTGCTTTTGCGTG
TTTGATATTCTATATTATAACGACAGACCACTCATCGGCTCGCCAGATAAGGGCGGTTTA
CCTTTACGGGAACGACTCAAAATACTCGACGATCTATTCATAGACAAGCGAGGTGTTATA
GAACATAGCAAGCGAAAAATTATCAAAGAAAGTTCAGAAGTTGTGGACGCCGTCAACGAT
GCCATAGACAATCAGGACGAGGGTATTGTAGTTAAAGATATAAATTCATACTACATCGCT
AACAAAAGAAACGCTGGCTGGTACAAAATAAAACCGGAGTATACGGACGACACCATGAAT
GACCTAGACCTGGTGGTGGTTGGTGCTGATGAAGCCACCAACAAAAGACAGGGGCGTGCC
AAAAGTTTCTATGTCGCGTGTGGGGATAACAATGATGGCGACCCTGTCTGGACCTGCATT
GGCCGCGTGTCTAACGGACTGAAGCACGAGGAGAAGGAACGCGTTTGTTCATTACTTGAA
CGGAACTGGTGTATGTATAGGAAAAAACCTCCGCCTCCCTGTCTGCGCTTCGGCAAAGAC
AAGCCGGACTTCTGGATACTTCCAGAACATTCTATCGTATTGCAGGTGCGTGCCACCGAG
CTGTTAAGCGTTGGGGACTCACACGTGCTGCGATTCCCGCGCGTGGAAGATATAAGATCA
GACAAGCCGGTCGATGACGTGTGCACAATACACGAACTTAGACAACTGGCTGTGAGCAGA
AGCCCGGTCAGTAAGCTAAGTACAAAGCGCGTAAACGAATCGCAAATAGATCAAAACTAT
ATTAAAACACGCAAGCGCGGTCTGTCTAAGACCGTCCAAGTAGCGGAAAAATTCCGCACA
AAGACGATTGGAGACGTGCAAGTTATATCACGAGCTTTGTTTGGGAAGAAACTTTGTGTG
TTGTCGGATGACGAGGATTGTAAGAAAACGGAATTGAAACGCGTCATAGAGTCCCACGGA
GGGAGACACGTTGAGAACCCAGGTTCAGATACTTGGTGCTGTGTAGTGGGAACTATAACA
CCGCGAGCCCGTAGACTCATAGAGACACAAGACCTAGACATCATTAGCACAGCCTGGCTC
AGAAGCCTACCAGCGACAGACGACCCGTGTCAACTGTCGCCATTGGACATGCTATCAATC
AAACCCGAAACGAAGCTCAAACTGAGCCTAGACTACGACCCCTTCGGTGATAGTTACAAG
GATGAAATAGATGAAAAAACATTGAAGAAACTGCTGGACAAAATGGATTCGGAGTTCCCG
TTGTATCCAACTTTAAAAGAAAAAGTCTGTCTGGATAAACAATTATTCGGCGCCAACAAT
CCTTACTCATTTTTGAGGAATTGTTTCATTCACGTTATTGACAATTCGCTTTACGAAACT
ATGGCGTCCTTTTTCGGAGCCAAAATCTGTTCTCTCGATGACGTCAGACTGACGCACGTC
GTTATGTCAAAAGACGCGAATGTCAAAATAGATAAAGGAATTCTAGTGTCGGATGGATGG
TTGGAAGAATGTTTTAACAAAAGGAGTTTTGTTCCTGTCGATGATTATCTAATTTAA
Protein sequence:
MDVDKDLTNKFLSGEMSFSQYSSEWYSGEEDEDEDEPEESKKYEEEAEMSTTVSKRGLKR
QSKFRRLFPALSGLMGEANIRLARGDSEMAERMCHEIIKQQPTAAEPYQTLAQIYEHDPN
KSLQFSLLAAHLSFTDKSEWWRLAALCRQRSDYKQEMVCYTQAIKSEPQNLETHLKRLEL
LSELEKLPDFPVNSLKVSKVKCYHKIVRSLGPSDAETIMKYAKMAATLYHNSTEVEQAVE
VMGIAYKKCFSLFTLEDINMYLELLITQKQFTKCIEVFVSSIGVEIEAEIQTVKNANGDI
EEQTHYLNCVIPNNLAIDLKSKLLVCFIHLGALNLVQSLLNDFLSSDVEKAGDLYMDIEE
AFSAVGHYEMAIKLLEPLIKNTSFDLGAVWLKYADCLNKLGRHDDAIESYYKVLKHVPQH
ADARRKLFTILENKGRIDDALNILQQDYKFVVSAHLLFDHCQYLKKYNRMLKYLENSTFF
PILRLLLPSCDRERGPYNLKETRLSTLLVKVLSLNKESTDAKQLIHFSSSNNSVLDSDFP
GVAFYVIKKRVGQNNSVLTVREINEILNSVATVDNVHKTPLDEIFSYALKKLTAIEFKWL
LRIILKDLKLSMSADRILGIFHPDAPEVFKNCSSILKVCEELEDGDTRPSELGVNLFYAV
RPMLSERLDITHIHVLDKTKTYCMEEKFDGERFQMHMDNNVFEYFSRKGFKYSKNYGQSY
DSGMLTPYLKDIFAPEARNFILDGEMMGWHKIDNYFGCKAMSYDVKKITENSSFRPCFCV
FDILYYNDRPLIGSPDKGGLPLRERLKILDDLFIDKRGVIEHSKRKIIKESSEVVDAVND
AIDNQDEGIVVKDINSYYIANKRNAGWYKIKPEYTDDTMNDLDLVVVGADEATNKRQGRA
KSFYVACGDNNDGDPVWTCIGRVSNGLKHEEKERVCSLLERNWCMYRKKPPPPCLRFGKD
KPDFWILPEHSIVLQVRATELLSVGDSHVLRFPRVEDIRSDKPVDDVCTIHELRQLAVSR
SPVSKLSTKRVNESQIDQNYIKTRKRGLSKTVQVAEKFRTKTIGDVQVISRALFGKKLCV
LSDDEDCKKTELKRVIESHGGRHVENPGSDTWCCVVGTITPRARRLIETQDLDIISTAWL
RSLPATDDPCQLSPLDMLSIKPETKLKLSLDYDPFGDSYKDEIDEKTLKKLLDKMDSEFP
LYPTLKEKVCLDKQLFGANNPYSFLRNCFIHVIDNSLYETMASFFGAKICSLDDVRLTHV
VMSKDANVKIDKGILVSDGWLEECFNKRSFVPVDDYLI