New model in OGS2.0 | DPOGS201072  |
---|---|
Genomic Position | scaffold1249:+ 48285-51545 |
See gene structure | |
CDS Length | 1389 |
Paired RNAseq reads   | 444 |
Single RNAseq reads   | 1080 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007164 (8e-159) |
Best Drosophila hit   | ND |
Best Human hit | putative nuclease HARBI1 (3e-30) |
Best NR hit (blastp)   | PREDICTED: hypothetical protein [Acyrthosiphon pisum] (1e-45) |
Best NR hit (blastx)   | PREDICTED: hypothetical protein [Acyrthosiphon pisum] (7e-44) |
GeneOntology terms    | GO:0005737 cytoplasm GO:0016787 hydrolase activity GO:0005634 nucleus GO:0046872 metal ion binding GO:0004518 nuclease activity GO:0005575 cellular_component GO:0003674 molecular_function GO:0008150 biological_process |
InterPro families   | ND |
Orthology group | MCL10113 |
Nucleotide sequence:
ATGGCTGATAAGCCGCCAGAAGACAGCTACATATCTGTTGTTGACGAAGAACCTGTGTTT
TTTGAGTTACTCAAATGGGATTCGTCACAATCGCAAAAGCAACAAGAGCCGAGGAGTTTA
GAAAAAGCAAAAAGTCCGGAGAAAATGGTGACAAAATCTAAAGAAGAGTCAGATCCGTTT
GATTTGAGTGATGCCGCCTTTTTAGATATGTATCGACTTTCAAAAGATCTGGCGCGAAAT
CTTTGTGAGGAATTGAAACCTGTTATGCCCGATTCTATTAAATCGATTGAGTTTTCAGTC
GAAAGTAAAGTTTTAGCAGCTTTATCATTCTATGCTACTGGCAAGTATCAGAAATCAATA
GGGGGTAGATCGGACCCCAGTATAACTCAGTATTTTGTGGCAACAGCGGTGATGCAGGTC
ACTGAAGCTATGAATGACCCCAGTATTATTAAGAAATATATACACTTCCCACATTTGAGA
AATGAGAGGGAAGTCATCAAAAATGGTTTTTACATGAAGTATGGCATCCCTAATGTTGTT
GGCTGTGTGGACTGTGTGCATGTGCCCATCGCCCGGCCCGATGAAGATCAGAAGAAGCAC
TTCAACAAATCATACCACTCTAAGAAAGTACAAATAATAAGCGACAGTCGCCAGCGCATC
ATGAGCGTGTGTTCTGAGGGTGGAGGCTCATACTCCCACGACGCTCTGCTGGCCAGACAC
GCCGTCACCGTGGACCTGGTCAGTCTGAACAACTCACGGGATCTCTGCTGGCTGCTAGGC
GGGCCGCATTACTCACAGAAACCGTACCTGATGGCCCCAGTGCCGAAAATGACGAAGAAG
TCTTCCATGTCACCGGAAAAGTATTACACGAACCTGCACGCGCAGGCGCACTCGGCCGTC
ACGGAGACTATCAAACAGTTGAAGGCGCGCTGGAAGTGTCTGCAGGCCACCAGCAACAAG
CAGTTCGACCCGCCCACCGTCGCCAAGATGGTCCTCGCCTGCTGCGTGCTACACAACATA
TGCACGGAGCACGGCATTCCGCCCGTGGACATGACGCAGGCCGAGGAGCGTCTGGAGGCC
ATGAAGCAGAGGGTGGCCAACGCCCCGGCCTCCAGGAGACAGGAACACGACCAGCTCGGC
CTGCAAGCGCGGGCTGCGCTCATACAGAGGCTGTGGGCCGAGAGGAGCATCACGACCGAC
GCCTGCCCCGCCACCAAGAGGAGGCTGGCGAAGAAGGACCGGCCGCCGGAGACCCACCCT
GTACATCACCCAGAGGTGCATCAGCATCAGATGCACGACGACCCCAAGAGACCCAGAATA
CTCATGAACAACCCCTACAGCATCGGAGTGGGCATGCCGCCGGCCTGGGGTCACTACCCG
CAACACTGA
Protein sequence:
MADKPPEDSYISVVDEEPVFFELLKWDSSQSQKQQEPRSLEKAKSPEKMVTKSKEESDPF
DLSDAAFLDMYRLSKDLARNLCEELKPVMPDSIKSIEFSVESKVLAALSFYATGKYQKSI
GGRSDPSITQYFVATAVMQVTEAMNDPSIIKKYIHFPHLRNEREVIKNGFYMKYGIPNVV
GCVDCVHVPIARPDEDQKKHFNKSYHSKKVQIISDSRQRIMSVCSEGGGSYSHDALLARH
AVTVDLVSLNNSRDLCWLLGGPHYSQKPYLMAPVPKMTKKSSMSPEKYYTNLHAQAHSAV
TETIKQLKARWKCLQATSNKQFDPPTVAKMVLACCVLHNICTEHGIPPVDMTQAEERLEA
MKQRVANAPASRRQEHDQLGLQARAALIQRLWAERSITTDACPATKRRLAKKDRPPETHP
VHHPEVHQHQMHDDPKRPRILMNNPYSIGVGMPPAWGHYPQH