DPGLEAN16716 in OGS1.0

New model in OGS2.0DPOGS205246 
Genomic Positionscaffold1389:+ 37175-40026
See gene structure
CDS Length1539
Paired RNAseq reads  34
Single RNAseq reads  119
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012158 (4e-27)
Best Drosophila hit  ND
Best Human hittigger transposable element-derived protein 1 (8e-55)
Best NR hit (blastp)  PREDICTED: similar to Tigger transposable element-derived protein 2 [Hydra magnipapillata] (6e-90)
Best NR hit (blastx)  PREDICTED: similar to Tigger transposable element-derived protein 2 [Hydra magnipapillata] (4e-88)
GeneOntology terms


  
GO:0000775 chromosome, centromeric region
GO:0003677 DNA binding
GO:0005634 nucleus
GO:0045449 regulation of transcription
InterPro families

  
IPR009057 Homeodomain-like
IPR012287 Homeodomain-related
IPR004875 DDE superfamily endonuclease, CENP-B-like
Orthology groupND

Nucleotide sequence:

ATGGAGTTTAATACAGATACTTACGGTAAATGCAGATTTTGTAACACAACAGGACATCAC
AGAGATATTACGAAAGTTTACAATATCGGAGGTGTACGCGAAGTGTATTTCGATATTATA
TTGGATTGTTTTAATTTATGCTTACGAGATGCAAACAGCTTCAGAACTCTGGTGATGAAC
GCCGAGATTCATCTGCATGATGGCCTCGGAAATGAGAACACGGTTTTCATCAACACTGGT
AAAAACCCTTTGGATACCGAAGTAAAGTTGGAAAATGTGAAGGAAGACAATGAGAATGTT
GAAATAAATGACCATAATAATGAAGCCATGACAAGTGAAAAGGAAACAAAACCTAAGTTA
AAAAGGAAATTTATTTCTCTACAACAAAAAATTGATATTTTGGATCAGCTAAGTAATGGC
AAGAAATTAACAGCAATAGCAAAGGATCTGGAGCTAAACGAGTCGTCAATACGAACAGTT
AAACAAAATGAAAGTAAAATTAGGAGTGCTGTGATGTCTAGATCGTTGCAGACCTTAAGA
GAGCGCGCTTCAGCTGATTATGAAAGTGCTCGGAATTTTAAAGAAGAACTTCCGAAGATA
ATTGATGAAGGCGAATATACAGCTGACCAAGTGTATAATGCCGACGAAACTGGCTTATAT
TGGAAGAGAATGCCTAAGCGAACATATTTATCGGAAAACGAGAGATCTGCTGGTGGGCTG
AAGGCCTCTAAGGAGAGAATAACCTTGCTTGTTTGTAGTAATGCATCTGGCGACCATATA
ACAAAGCCGATGTTGATCAATCGTTTCTTAAGTCCACGGGCAATGAAAGGCATTGACAAG
ACTACACTTCCCGTTCACTGGAGAGCAAACGAAAATGCATGGGTCACAGCTGATATATTT
CACGACTGGTTTTACAACTGCTTTGTACCAGAGGTCGAAAATTACTCGAAAACTAAAAAT
GTCAGTTCCAAGGCTTTACTCCTGATAGACGATGCTCCACAACATCCAGTAGATCTAGTT
CATCCGAACGTAAAAGTACTTTTTTTACCAGCCAATACAACATCAATACTACAACCACTT
GACCAGGGTGTTATGAAAACAATTAAATCTCATTATATACGGAGAACACTTGAACTTATA
TCGGAGAAATTTGAATGCAAACCAGATATGAAATTAGCTGAGATGTGGAAAGATTTCTCA
ATTTTAAAATGTGTAGAATTAATATGTCTATCTGTTCGAGAGTTGAAGTCTTCGACATTA
AACGCCTGTTGGAAAAATGTTTGGCCTGAAGTCGTTTTGCAAGAAAACTTGTTAGATTCT
ACAAGTATAAATATCGAACCTATAGTGAATATTGCTAGATCAGTGGGCGGAGAAGGATTC
GACGACATGAACGAACGAGATATTTATGAATTAATAAACGACGCTGCAGATCTAGATGAG
GAAGAGCTCGTACAGTTAGCTGATACATCTGACGCGAATATGACAAATAGTGCTGAAGAG
AGCTCGATGCAGACTGTGGACGAAGGAGACGAACAATGA

Protein sequence:

MEFNTDTYGKCRFCNTTGHHRDITKVYNIGGVREVYFDIILDCFNLCLRDANSFRTLVMN
AEIHLHDGLGNENTVFINTGKNPLDTEVKLENVKEDNENVEINDHNNEAMTSEKETKPKL
KRKFISLQQKIDILDQLSNGKKLTAIAKDLELNESSIRTVKQNESKIRSAVMSRSLQTLR
ERASADYESARNFKEELPKIIDEGEYTADQVYNADETGLYWKRMPKRTYLSENERSAGGL
KASKERITLLVCSNASGDHITKPMLINRFLSPRAMKGIDKTTLPVHWRANENAWVTADIF
HDWFYNCFVPEVENYSKTKNVSSKALLLIDDAPQHPVDLVHPNVKVLFLPANTTSILQPL
DQGVMKTIKSHYIRRTLELISEKFECKPDMKLAEMWKDFSILKCVELICLSVRELKSSTL
NACWKNVWPEVVLQENLLDSTSINIEPIVNIARSVGGEGFDDMNERDIYELINDAADLDE
EELVQLADTSDANMTNSAEESSMQTVDEGDEQ