New model in OGS2.0 | DPOGS205246  |
---|---|
Genomic Position | scaffold1389:+ 37175-40026 |
See gene structure | |
CDS Length | 1539 |
Paired RNAseq reads   | 34 |
Single RNAseq reads   | 119 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012158 (4e-27) |
Best Drosophila hit   | ND |
Best Human hit | tigger transposable element-derived protein 1 (8e-55) |
Best NR hit (blastp)   | PREDICTED: similar to Tigger transposable element-derived protein 2 [Hydra magnipapillata] (6e-90) |
Best NR hit (blastx)   | PREDICTED: similar to Tigger transposable element-derived protein 2 [Hydra magnipapillata] (4e-88) |
GeneOntology terms    | GO:0000775 chromosome, centromeric region GO:0003677 DNA binding GO:0005634 nucleus GO:0045449 regulation of transcription |
InterPro families    | IPR009057 Homeodomain-like IPR012287 Homeodomain-related IPR004875 DDE superfamily endonuclease, CENP-B-like |
Orthology group | ND |
Nucleotide sequence:
ATGGAGTTTAATACAGATACTTACGGTAAATGCAGATTTTGTAACACAACAGGACATCAC
AGAGATATTACGAAAGTTTACAATATCGGAGGTGTACGCGAAGTGTATTTCGATATTATA
TTGGATTGTTTTAATTTATGCTTACGAGATGCAAACAGCTTCAGAACTCTGGTGATGAAC
GCCGAGATTCATCTGCATGATGGCCTCGGAAATGAGAACACGGTTTTCATCAACACTGGT
AAAAACCCTTTGGATACCGAAGTAAAGTTGGAAAATGTGAAGGAAGACAATGAGAATGTT
GAAATAAATGACCATAATAATGAAGCCATGACAAGTGAAAAGGAAACAAAACCTAAGTTA
AAAAGGAAATTTATTTCTCTACAACAAAAAATTGATATTTTGGATCAGCTAAGTAATGGC
AAGAAATTAACAGCAATAGCAAAGGATCTGGAGCTAAACGAGTCGTCAATACGAACAGTT
AAACAAAATGAAAGTAAAATTAGGAGTGCTGTGATGTCTAGATCGTTGCAGACCTTAAGA
GAGCGCGCTTCAGCTGATTATGAAAGTGCTCGGAATTTTAAAGAAGAACTTCCGAAGATA
ATTGATGAAGGCGAATATACAGCTGACCAAGTGTATAATGCCGACGAAACTGGCTTATAT
TGGAAGAGAATGCCTAAGCGAACATATTTATCGGAAAACGAGAGATCTGCTGGTGGGCTG
AAGGCCTCTAAGGAGAGAATAACCTTGCTTGTTTGTAGTAATGCATCTGGCGACCATATA
ACAAAGCCGATGTTGATCAATCGTTTCTTAAGTCCACGGGCAATGAAAGGCATTGACAAG
ACTACACTTCCCGTTCACTGGAGAGCAAACGAAAATGCATGGGTCACAGCTGATATATTT
CACGACTGGTTTTACAACTGCTTTGTACCAGAGGTCGAAAATTACTCGAAAACTAAAAAT
GTCAGTTCCAAGGCTTTACTCCTGATAGACGATGCTCCACAACATCCAGTAGATCTAGTT
CATCCGAACGTAAAAGTACTTTTTTTACCAGCCAATACAACATCAATACTACAACCACTT
GACCAGGGTGTTATGAAAACAATTAAATCTCATTATATACGGAGAACACTTGAACTTATA
TCGGAGAAATTTGAATGCAAACCAGATATGAAATTAGCTGAGATGTGGAAAGATTTCTCA
ATTTTAAAATGTGTAGAATTAATATGTCTATCTGTTCGAGAGTTGAAGTCTTCGACATTA
AACGCCTGTTGGAAAAATGTTTGGCCTGAAGTCGTTTTGCAAGAAAACTTGTTAGATTCT
ACAAGTATAAATATCGAACCTATAGTGAATATTGCTAGATCAGTGGGCGGAGAAGGATTC
GACGACATGAACGAACGAGATATTTATGAATTAATAAACGACGCTGCAGATCTAGATGAG
GAAGAGCTCGTACAGTTAGCTGATACATCTGACGCGAATATGACAAATAGTGCTGAAGAG
AGCTCGATGCAGACTGTGGACGAAGGAGACGAACAATGA
Protein sequence:
MEFNTDTYGKCRFCNTTGHHRDITKVYNIGGVREVYFDIILDCFNLCLRDANSFRTLVMN
AEIHLHDGLGNENTVFINTGKNPLDTEVKLENVKEDNENVEINDHNNEAMTSEKETKPKL
KRKFISLQQKIDILDQLSNGKKLTAIAKDLELNESSIRTVKQNESKIRSAVMSRSLQTLR
ERASADYESARNFKEELPKIIDEGEYTADQVYNADETGLYWKRMPKRTYLSENERSAGGL
KASKERITLLVCSNASGDHITKPMLINRFLSPRAMKGIDKTTLPVHWRANENAWVTADIF
HDWFYNCFVPEVENYSKTKNVSSKALLLIDDAPQHPVDLVHPNVKVLFLPANTTSILQPL
DQGVMKTIKSHYIRRTLELISEKFECKPDMKLAEMWKDFSILKCVELICLSVRELKSSTL
NACWKNVWPEVVLQENLLDSTSINIEPIVNIARSVGGEGFDDMNERDIYELINDAADLDE
EELVQLADTSDANMTNSAEESSMQTVDEGDEQ