New model in OGS2.0 | DPOGS203374  |
---|---|
Genomic Position | scaffold6:+ 231471-237773 |
See gene structure | |
CDS Length | 2331 |
Paired RNAseq reads   | 265 |
Single RNAseq reads   | 619 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002109 (7e-101) |
Best Drosophila hit   | CG1868, isoform B (3e-24) |
Best Human hit | SET and MYND domain-containing protein 4 (1e-31) |
Best NR hit (blastp)   | PREDICTED: hypothetical protein [Nasonia vitripennis] (2e-103) |
Best NR hit (blastx)   | PREDICTED: hypothetical protein [Nasonia vitripennis] (6e-87) |
GeneOntology terms    | GO:0046872 metal ion binding GO:0005488 binding GO:0008270 zinc ion binding |
InterPro families    | IPR011990 Tetratricopeptide-like helical IPR001214 SET domain |
Orthology group | MCL12197 |
Nucleotide sequence:
ATGAACAGTGTCAATTTGCCGGAGGACTGTGCCAAACAATGGGAAATTTTGTTGTTGTTA
TTGTCGTCAGAGGAAAAGAAAATCGATCGTACTCACGAAACCGAGATCGATACTATGAGT
TATTTCTACAACAACAAGGGTGTTAGAAGGATTTTACTCTACTGGCTTCAGCAGATGAAG
GACACATACGCACGTAAAACATGTGATACAGATACAGCGCTGAAATGCGACGGAGTGTCG
CTGTGCTGGCGACAGAAGGGGAACGAGAAATTCAGGGCTAATTTGGTGGAAGAGAGCTAT
AAGTGTTACACTAATAGCGTATTGTACGCTAAACCTAATAGTCTTATGTACACACTGGCT
CTAGCGAACCGGTCGGCTGCTTTATTGAGGTTAAAGAGATTTCAGGAATGTTTGTCGGAT
GTGTCTCTGGCTATAGACAGCGGTTATCCCGCGGAACAGAGACACAAACTTTTACTGCGT
AGAGCAGATTGTTACATTGAATTGCAAAGTAAGGAAGCAAGGACCGCTCTCAACTCGGCT
ATACAGTATGCAGGGACACTCAACATGACAGTGGCGAACAAATTGGAATTCGAGCGTCAC
ATAAAGATTTTAGAGAAGAAACTAGAGTGTGTCAAACGGGATGTTGTTAAAGAGGAAGAG
GTGCTGCTACCTGATTGTTACCTGGGACCGAATCCTGACTTTGTATCGGCCTCCAAATCT
ATTGAACTAAGATATAAGGAGTCACGCGGTCGTCACGTGGTCGCGGTAGAGCCGGCAGGT
CGTGGGGACGTGCTCTTCTCAGAGGAGCCCTACGCATGGGTCGCTCTGCCCTCAGACGAC
GCCATTTGTGAGATGTGCTGCGACACAGACATCAATCCAGTGCCCGTGTACTGTGGGTGT
GAGTGTGCGTCTCGTGCCATATCCTTCCACCGCTGGGAGTGTGTGGGTGCACAGTGCTCA
CTCTTCCCCACTATAGGCATCGCGCATTTAGCTTTAAGGGTGCTACTAATAAGTACGAAC
AACGGATTCCCCCCGTCGCCGGTGTCGTTGCCGCAGGCGTGTACCGCCGGCGAGTTGTTC
AGGAGCTACGGGCTGGTAGACAACATCCAGATATACAAGACGGGCACGGATCCCTTCTAC
AGGATGTTCAATCTGGTGACCAACTTCAACAAGATGGACAACACAGATTACATACAATAC
GCCTTAACGGCCACGATGTTGACGTTGTATTTGGAGAATTTCACGAGTTTTTTTGATTAT
CTACCAAGCAAAATGCCGTGTAGTATGTCTGAGAGTCAATTGAAGTTGTTCGCTGCGGCC
GTCATACTGAGGAGTATGGGCCAGTTGGTTTGTAACGGACACGCAACCTTAAGCCTGGCA
GTCGTGGAAGAGGACGATGGCAGAAACGGCAAAACGATAACGGAAAAGGAAGTCCGTAGA
GCCACCGCCATCTATCCCTCGGCCGCCATGATGAATCACTCCTGCGATCCCAACATAATA
AACACTTTCTACAAGAGTCGTCTTATAGTCCGATGCCAGCGCGAGTTACCGGCAGGAGGC
GAAGTGTTCAACTGTTATGGTCCACACCGAGCCCGCGCGCCCGCCGCAGCACGACGGAAA
GCGCTCAAGGCTCAGTATATGTTCACGTGCCACTGTGCTGACTGCAACGACACGGAGAGG
AAAGACTTCGTGTCGTTGTTCAGCGCGTACCTGTGTCAGTCGTGTAAGGGTCCGGTGTGG
GCGCACTGTGTTCGTCCTCTGTGCACACAGTGTCGGTCAGCACTCCACCTGGAACGTGCA
CACACACTACTGGATCGAGCTGACGACCTCGCAACACAAGCGGAACAGGTCGTCAGTTTG
GAGGAGCGTTGCGAAAAGATGGCGGCCTCGTACCGGCTGAAACAACAGGTGTGGCATCGA
CACCACGCCTCGCTCAGAATGGCGGCGGATAGACTGGCAAGACTGTATGCTGACACAGGT
GATTTCGGTAAAAGTATGGAGCTCATCAAACAGAACATCCAGAGCCTCGAGTATCGCTTT
GGTTCTTTCAGTGTAGAGGTCGCCCATGAACTCCGTAAACTGTCGGATGTTATGTTAGAA
AGGATTTTGAATTCACCGCAGCATCTGGAATACAGAGAATGGTGTCTAGAAGCTCATAAG
GTGGTCAAAAAGGCTATACAATTGATGGAATTGAACTACGGCTCGTGGGAACCTCTTGTG
TCTAGACTGAAGCAACACGAGTGCTACCTCGCGGCGACGCTCGCCGAGAGCAGGACTCCC
GAGGCTGTGGACTGTGTTCATCACAACTTACATTACAATCTTAAAATATAA
Protein sequence:
MNSVNLPEDCAKQWEILLLLLSSEEKKIDRTHETEIDTMSYFYNNKGVRRILLYWLQQMK
DTYARKTCDTDTALKCDGVSLCWRQKGNEKFRANLVEESYKCYTNSVLYAKPNSLMYTLA
LANRSAALLRLKRFQECLSDVSLAIDSGYPAEQRHKLLLRRADCYIELQSKEARTALNSA
IQYAGTLNMTVANKLEFERHIKILEKKLECVKRDVVKEEEVLLPDCYLGPNPDFVSASKS
IELRYKESRGRHVVAVEPAGRGDVLFSEEPYAWVALPSDDAICEMCCDTDINPVPVYCGC
ECASRAISFHRWECVGAQCSLFPTIGIAHLALRVLLISTNNGFPPSPVSLPQACTAGELF
RSYGLVDNIQIYKTGTDPFYRMFNLVTNFNKMDNTDYIQYALTATMLTLYLENFTSFFDY
LPSKMPCSMSESQLKLFAAAVILRSMGQLVCNGHATLSLAVVEEDDGRNGKTITEKEVRR
ATAIYPSAAMMNHSCDPNIINTFYKSRLIVRCQRELPAGGEVFNCYGPHRARAPAAARRK
ALKAQYMFTCHCADCNDTERKDFVSLFSAYLCQSCKGPVWAHCVRPLCTQCRSALHLERA
HTLLDRADDLATQAEQVVSLEERCEKMAASYRLKQQVWHRHHASLRMAADRLARLYADTG
DFGKSMELIKQNIQSLEYRFGSFSVEVAHELRKLSDVMLERILNSPQHLEYREWCLEAHK
VVKKAIQLMELNYGSWEPLVSRLKQHECYLAATLAESRTPEAVDCVHHNLHYNLKI