DPGLEAN22382 in OGS1.0

New model in OGS2.0DPOGS209112 
Genomic Positionscaffold3402:+ 9545-14012
See gene structure
CDS Length1314
Paired RNAseq reads  88
Single RNAseq reads  443
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001792 (1e-34)
Best Drosophila hit  Snm1 (4e-09)
Best Human hitprotein artemis isoform a (3e-29)
Best NR hit (blastp)  PREDICTED: similar to Artemis protein (DNA cross-link repair 1C protein) (SNM1-like protein) (A-SCID protein) (hSNM1C) [Apis mellifera] (8e-45)
Best NR hit (blastx)  PREDICTED: similar to Artemis protein (DNA cross-link repair 1C protein) (SNM1-like protein) (A-SCID protein) (hSNM1C) [Apis mellifera] (5e-44)
GeneOntology terms












  
GO:0000014 single-stranded DNA specific endodeoxyribonuclease activity
GO:0000723 telomere maintenance
GO:0003674 molecular_function
GO:0004519 endonuclease activity
GO:0005575 cellular_component
GO:0005634 nucleus
GO:0006302 double-strand break repair
GO:0006310 DNA recombination
GO:0008150 biological_process
GO:0008409 5'-3' exonuclease activity
GO:0010212 response to ionizing radiation
GO:0016787 hydrolase activity
GO:0030183 B cell differentiation
GO:0051276 chromosome organization
InterPro families  ND
Orthology groupMCL17464

Nucleotide sequence:

ATGTTTAGGAAAGCACAAACCGCCTTCCACGGCGCTATAGAAGAATTACCGGGTATTTAC
GTTGATAATTTCGAAAACGCTGCTAAAGTAAATGCCAGAGCTTATTTTTTAAGCCACTGC
CACGCTGATCATATGCACGGATTGAGTTCTGAGGAGTTAATGGCTACGCTGAAAAAGAGT
GGAGCCAAGATTTACACAACTGAATTATCTGCAGCCATTATAAAGACCGATGTAAATAAA
GATATCGGTGATCATGTACAGAGTTTGAAAATGGGTGGTACACAAATATTAAGTTTTCCT
TCCATACCCGAACAGAATATTCCAGAACTACTTCTCACCGTGACTCTCATTCCGGCCGGC
CACAGCGCCGGTTCGACTATGTTTTTGTTCAGGACCACGACTAAAACTATTCTATTCACT
GGTGACTTCAGAATGAACCCAAACGATTTGCCCAAATATTCGGCACTTCATGACGACGGC
CACCCTATAAAGTTGACCAGTCTCTATGTAGACACAACCTTCTTAAGCTACAATTACGAC
AATTTTCCCAAACGTAGCGAGAGTATAGAAAAAATGTGCTCCGAAATCAAGAAGTGGTTG
AGTTACGAACAAAATGCCGTGTCCTTGCACACCTCAGCCAAGTATGGTTACGAGTTTGCA
TTCAATGAGATATATCGGAGGTTGGGTTTGAAGGTACATGTACCGACGGAGAGGTGGAGT
TTGTACAGTTCCATACAACATTTGGTGCCCGGTGTCACAAACGAGTCGACAAAGATACAT
TTATGCAAGAAACATGTCACCGACCAAAGTCATCAGCTTCCCAAACCCTTACAGGCCACT
CAAGAGGAAGATCCAGGCATACTGATTTATATAACGAGCGTGTTTTATGAAATATGGAAA
CCGATGGTTAGCGACAGCTGGAAGAACCGTTTCCGCCAACCATTTCATCCAGCGTGGATG
GATTACTGCGATCCTTATCATTGTAACGATTACCACAAAATAGCCTGCGGTTTGAACCGG
ATGACAATGAGGTTCAAGTGGTTTCAGAGCCAATGTCACATCATTCTGAACAATATGTGT
TCAAACTACAGGGGATCTCTGCAATACGACGTCGTAGATACGAAATACTGTTCGTATTAC
GTAATGTTCCTACGGACAGGTTGTCCGAATGTCTGTCCTGATGTATTGGAGCCAGTTTGT
TGTATGAGCACCGTCGATAGCCATGTGGTATTGTTTAAGAACAGTTGCGAAATGGAGAAA
GCTAATTGTAAGGGCGGAATGTTGGAAGGTAAGTTGCCCGTTATACGACAATAG

Protein sequence:

MFRKAQTAFHGAIEELPGIYVDNFENAAKVNARAYFLSHCHADHMHGLSSEELMATLKKS
GAKIYTTELSAAIIKTDVNKDIGDHVQSLKMGGTQILSFPSIPEQNIPELLLTVTLIPAG
HSAGSTMFLFRTTTKTILFTGDFRMNPNDLPKYSALHDDGHPIKLTSLYVDTTFLSYNYD
NFPKRSESIEKMCSEIKKWLSYEQNAVSLHTSAKYGYEFAFNEIYRRLGLKVHVPTERWS
LYSSIQHLVPGVTNESTKIHLCKKHVTDQSHQLPKPLQATQEEDPGILIYITSVFYEIWK
PMVSDSWKNRFRQPFHPAWMDYCDPYHCNDYHKIACGLNRMTMRFKWFQSQCHIILNNMC
SNYRGSLQYDVVDTKYCSYYVMFLRTGCPNVCPDVLEPVCCMSTVDSHVVLFKNSCEMEK
ANCKGGMLEGKLPVIRQ