New model in OGS2.0 | DPOGS209112  |
---|---|
Genomic Position | scaffold3402:+ 9545-14012 |
See gene structure | |
CDS Length | 1314 |
Paired RNAseq reads   | 88 |
Single RNAseq reads   | 443 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001792 (1e-34) |
Best Drosophila hit   | Snm1 (4e-09) |
Best Human hit | protein artemis isoform a (3e-29) |
Best NR hit (blastp)   | PREDICTED: similar to Artemis protein (DNA cross-link repair 1C protein) (SNM1-like protein) (A-SCID protein) (hSNM1C) [Apis mellifera] (8e-45) |
Best NR hit (blastx)   | PREDICTED: similar to Artemis protein (DNA cross-link repair 1C protein) (SNM1-like protein) (A-SCID protein) (hSNM1C) [Apis mellifera] (5e-44) |
GeneOntology terms    | GO:0000014 single-stranded DNA specific endodeoxyribonuclease activity GO:0000723 telomere maintenance GO:0003674 molecular_function GO:0004519 endonuclease activity GO:0005575 cellular_component GO:0005634 nucleus GO:0006302 double-strand break repair GO:0006310 DNA recombination GO:0008150 biological_process GO:0008409 5'-3' exonuclease activity GO:0010212 response to ionizing radiation GO:0016787 hydrolase activity GO:0030183 B cell differentiation GO:0051276 chromosome organization |
InterPro families   | ND |
Orthology group | MCL17464 |
Nucleotide sequence:
ATGTTTAGGAAAGCACAAACCGCCTTCCACGGCGCTATAGAAGAATTACCGGGTATTTAC
GTTGATAATTTCGAAAACGCTGCTAAAGTAAATGCCAGAGCTTATTTTTTAAGCCACTGC
CACGCTGATCATATGCACGGATTGAGTTCTGAGGAGTTAATGGCTACGCTGAAAAAGAGT
GGAGCCAAGATTTACACAACTGAATTATCTGCAGCCATTATAAAGACCGATGTAAATAAA
GATATCGGTGATCATGTACAGAGTTTGAAAATGGGTGGTACACAAATATTAAGTTTTCCT
TCCATACCCGAACAGAATATTCCAGAACTACTTCTCACCGTGACTCTCATTCCGGCCGGC
CACAGCGCCGGTTCGACTATGTTTTTGTTCAGGACCACGACTAAAACTATTCTATTCACT
GGTGACTTCAGAATGAACCCAAACGATTTGCCCAAATATTCGGCACTTCATGACGACGGC
CACCCTATAAAGTTGACCAGTCTCTATGTAGACACAACCTTCTTAAGCTACAATTACGAC
AATTTTCCCAAACGTAGCGAGAGTATAGAAAAAATGTGCTCCGAAATCAAGAAGTGGTTG
AGTTACGAACAAAATGCCGTGTCCTTGCACACCTCAGCCAAGTATGGTTACGAGTTTGCA
TTCAATGAGATATATCGGAGGTTGGGTTTGAAGGTACATGTACCGACGGAGAGGTGGAGT
TTGTACAGTTCCATACAACATTTGGTGCCCGGTGTCACAAACGAGTCGACAAAGATACAT
TTATGCAAGAAACATGTCACCGACCAAAGTCATCAGCTTCCCAAACCCTTACAGGCCACT
CAAGAGGAAGATCCAGGCATACTGATTTATATAACGAGCGTGTTTTATGAAATATGGAAA
CCGATGGTTAGCGACAGCTGGAAGAACCGTTTCCGCCAACCATTTCATCCAGCGTGGATG
GATTACTGCGATCCTTATCATTGTAACGATTACCACAAAATAGCCTGCGGTTTGAACCGG
ATGACAATGAGGTTCAAGTGGTTTCAGAGCCAATGTCACATCATTCTGAACAATATGTGT
TCAAACTACAGGGGATCTCTGCAATACGACGTCGTAGATACGAAATACTGTTCGTATTAC
GTAATGTTCCTACGGACAGGTTGTCCGAATGTCTGTCCTGATGTATTGGAGCCAGTTTGT
TGTATGAGCACCGTCGATAGCCATGTGGTATTGTTTAAGAACAGTTGCGAAATGGAGAAA
GCTAATTGTAAGGGCGGAATGTTGGAAGGTAAGTTGCCCGTTATACGACAATAG
Protein sequence:
MFRKAQTAFHGAIEELPGIYVDNFENAAKVNARAYFLSHCHADHMHGLSSEELMATLKKS
GAKIYTTELSAAIIKTDVNKDIGDHVQSLKMGGTQILSFPSIPEQNIPELLLTVTLIPAG
HSAGSTMFLFRTTTKTILFTGDFRMNPNDLPKYSALHDDGHPIKLTSLYVDTTFLSYNYD
NFPKRSESIEKMCSEIKKWLSYEQNAVSLHTSAKYGYEFAFNEIYRRLGLKVHVPTERWS
LYSSIQHLVPGVTNESTKIHLCKKHVTDQSHQLPKPLQATQEEDPGILIYITSVFYEIWK
PMVSDSWKNRFRQPFHPAWMDYCDPYHCNDYHKIACGLNRMTMRFKWFQSQCHIILNNMC
SNYRGSLQYDVVDTKYCSYYVMFLRTGCPNVCPDVLEPVCCMSTVDSHVVLFKNSCEMEK
ANCKGGMLEGKLPVIRQ