New model in OGS2.0 | DPOGS202772  |
---|---|
Genomic Position | scaffold30:- 11246-14008 |
See gene structure | |
CDS Length | 1764 |
Paired RNAseq reads   | 672 |
Single RNAseq reads   | 1633 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010495 (9e-175) |
Best Drosophila hit   | Caf1-105 (2e-92) |
Best Human hit | chromatin assembly factor 1 subunit B (1e-71) |
Best NR hit (blastp)   | AGAP007544-PA [Anopheles gambiae str. PEST] (2e-128) |
Best NR hit (blastx)   | AGAP007544-PA [Anopheles gambiae str. PEST] (1e-112) |
GeneOntology terms    | GO:0006333 chromatin assembly or disassembly GO:0005678 chromatin assembly complex GO:0006334 nucleosome assembly GO:0003677 DNA binding |
InterPro families    | IPR019781 WD40 repeat, subgroup IPR011046 WD40 repeat-like-containing domain IPR015943 WD40/YVTN repeat-like-containing domain IPR001632 G-protein, beta subunit IPR001680 WD40 repeat IPR019782 WD40 repeat 2 IPR017986 WD40-repeat-containing domain IPR019775 WD40 repeat, conserved site |
Orthology group | MCL14509 |
Nucleotide sequence:
ATGAAGTTTGCTATACCTGAAATATCATGGCATAACAGAGATCCAGTTTTAAGTGTAGAC
ATTCAGCCCAAAACAAATGCAAGTGAACCACTGCGGTTAGCTACCGGGGGCACAGATTCT
CATGTTGTGATATGGTATTTATCAAAAACAATAACCGGTTCAGTGAAATTAGAAGTCGCT
ACTGATCTCACCAGGCATCAAAAAGCCGTTAATGTAGTGAGATGGTCGCCCAATGGTGTC
TACTTAGCATCTGGAGATGATGAATCTATCATATTTATATGGAAGCAAAAGACGGAAGAG
CCAATAGCACCACCCTTAGAGGGAGAGGAGCAGTATAAAGAGACTTGGGTTATACATAAA
ACTTTAAGGGGTCACATGGAGGATGTTCTGGACATCAGTTGGAGTAGTTCATCACTACAT
TTGGCATCCGGCTCAGTAGACAACAAGCTGATTGTCTGGGATGTGGCGAGAGCTCGATCT
AGTGGTATTGTCTCTGATCATAAAGGCTTTGTCCAGGGAGTAGCATGGGACCCTCAAGGA
CAGCTGATAGCCACAGCTAGCTCGGATAGAGTTTTCCGAACATTTGATGTGGGGACTAAG
AAAGTGTTGTCTCGTAGCAGTAAGGCTATTCTACCGTTCCCTAAGGAGCATACCCTACAT
GAAGTGAAGGTCCGCCTCTACCATGACGACACTCTACAGACGTACTACAGGAGATTACAT
TTCAGTCCCGATGGAATGTTCATTGCTGTGCCGGCCGGAAGAATAGAACCAGAACAAGGC
AAACTGGACATTAAACCAATGAATGCTGTTTACATTTACACTAGACACTCTCTCAAAACT
CCTGCGTGTGTGGTTCCGTGTGGAGAGCCGGCGCTGGTGTGCCGCTGGTCGCCCGTGCGT
CGTGCGGCGCGGACTTCGCCCCCCGCGCCGTCTGCTTTGCAGCACGCCCCTCGGCTTCTG
CTGGCGGTGGCCACGCGGAGATCGCTGCTGTTGTACGACACGCACCAGAAAGCGCCCGTC
GCGCTCATCTCAAACATACACTACACCAGGATCACAGACCTTTCGTGGTCTTCCGACGGC
CTGACCCTAGTGGCCTCCAGCACTGACGGTTTCTGCTCCGTCGTCAGTTTCACCGAGGAA
GAGCTGGGCGAGGCGCTCACCACCGCGGACGCCGTTAGTGCAGAGCCGATGGAAACGGAG
GAACAGAAACATAACCAAGAAACTCCTAAACAGAGACACGCTGAGGCGAAACCCATAGAA
GTCAAGCGGAGGCCGTCCTCGAACAACACCAAAATAGACGCCTTCATTAAGTTTAAAACT
CCCGAAGATAAGTCTCCGAAGAAGAAGAAGATCGAAAACATTCAGCAGAAGACGCCCGTC
AAGATGGACGTCCTCATGGAGACCGCGCTGCCATCCTGGTCTGACAACTCCAGCAACGAC
CTCATCAGACCCAAGGACACGGAGACCGCGACCCTCGGCGACGAAAATGACGTCACCGTC
ATAGAGGACAGCGAGGACATCCAGCTGGTCTACGAGGAGACCAAGGACGGCCAGTCGCCC
AAGACGGAACCCTCGGAGGAAAAACCTGCTCCCAAGACGATGTCTCCCAAACAATGCGGC
ACGGCCGACAGCAACTTCCTAATGAAGGCAAAGATCACCGACATCAGGGAGCCGGCGCCG
CTCACCGCCGTGCCGAGTCCCAAGGCACCGCGGAGAGTCAGCTTCGTGACGCTGTCGAGT
CCTAAGAGCACGAAAAAAAAATAA
Protein sequence:
MKFAIPEISWHNRDPVLSVDIQPKTNASEPLRLATGGTDSHVVIWYLSKTITGSVKLEVA
TDLTRHQKAVNVVRWSPNGVYLASGDDESIIFIWKQKTEEPIAPPLEGEEQYKETWVIHK
TLRGHMEDVLDISWSSSSLHLASGSVDNKLIVWDVARARSSGIVSDHKGFVQGVAWDPQG
QLIATASSDRVFRTFDVGTKKVLSRSSKAILPFPKEHTLHEVKVRLYHDDTLQTYYRRLH
FSPDGMFIAVPAGRIEPEQGKLDIKPMNAVYIYTRHSLKTPACVVPCGEPALVCRWSPVR
RAARTSPPAPSALQHAPRLLLAVATRRSLLLYDTHQKAPVALISNIHYTRITDLSWSSDG
LTLVASSTDGFCSVVSFTEEELGEALTTADAVSAEPMETEEQKHNQETPKQRHAEAKPIE
VKRRPSSNNTKIDAFIKFKTPEDKSPKKKKIENIQQKTPVKMDVLMETALPSWSDNSSND
LIRPKDTETATLGDENDVTVIEDSEDIQLVYEETKDGQSPKTEPSEEKPAPKTMSPKQCG
TADSNFLMKAKITDIREPAPLTAVPSPKAPRRVSFVTLSSPKSTKKK