DPGLEAN09042 in OGS1.0

New model in OGS2.0DPOGS202772 
Genomic Positionscaffold30:- 11246-14008
See gene structure
CDS Length1764
Paired RNAseq reads  672
Single RNAseq reads  1633
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010495 (9e-175)
Best Drosophila hit  Caf1-105 (2e-92)
Best Human hitchromatin assembly factor 1 subunit B (1e-71)
Best NR hit (blastp)  AGAP007544-PA [Anopheles gambiae str. PEST] (2e-128)
Best NR hit (blastx)  AGAP007544-PA [Anopheles gambiae str. PEST] (1e-112)
GeneOntology terms


  
GO:0006333 chromatin assembly or disassembly
GO:0005678 chromatin assembly complex
GO:0006334 nucleosome assembly
GO:0003677 DNA binding
InterPro families






  
IPR019781 WD40 repeat, subgroup
IPR011046 WD40 repeat-like-containing domain
IPR015943 WD40/YVTN repeat-like-containing domain
IPR001632 G-protein, beta subunit
IPR001680 WD40 repeat
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR019775 WD40 repeat, conserved site
Orthology groupMCL14509

Nucleotide sequence:

ATGAAGTTTGCTATACCTGAAATATCATGGCATAACAGAGATCCAGTTTTAAGTGTAGAC
ATTCAGCCCAAAACAAATGCAAGTGAACCACTGCGGTTAGCTACCGGGGGCACAGATTCT
CATGTTGTGATATGGTATTTATCAAAAACAATAACCGGTTCAGTGAAATTAGAAGTCGCT
ACTGATCTCACCAGGCATCAAAAAGCCGTTAATGTAGTGAGATGGTCGCCCAATGGTGTC
TACTTAGCATCTGGAGATGATGAATCTATCATATTTATATGGAAGCAAAAGACGGAAGAG
CCAATAGCACCACCCTTAGAGGGAGAGGAGCAGTATAAAGAGACTTGGGTTATACATAAA
ACTTTAAGGGGTCACATGGAGGATGTTCTGGACATCAGTTGGAGTAGTTCATCACTACAT
TTGGCATCCGGCTCAGTAGACAACAAGCTGATTGTCTGGGATGTGGCGAGAGCTCGATCT
AGTGGTATTGTCTCTGATCATAAAGGCTTTGTCCAGGGAGTAGCATGGGACCCTCAAGGA
CAGCTGATAGCCACAGCTAGCTCGGATAGAGTTTTCCGAACATTTGATGTGGGGACTAAG
AAAGTGTTGTCTCGTAGCAGTAAGGCTATTCTACCGTTCCCTAAGGAGCATACCCTACAT
GAAGTGAAGGTCCGCCTCTACCATGACGACACTCTACAGACGTACTACAGGAGATTACAT
TTCAGTCCCGATGGAATGTTCATTGCTGTGCCGGCCGGAAGAATAGAACCAGAACAAGGC
AAACTGGACATTAAACCAATGAATGCTGTTTACATTTACACTAGACACTCTCTCAAAACT
CCTGCGTGTGTGGTTCCGTGTGGAGAGCCGGCGCTGGTGTGCCGCTGGTCGCCCGTGCGT
CGTGCGGCGCGGACTTCGCCCCCCGCGCCGTCTGCTTTGCAGCACGCCCCTCGGCTTCTG
CTGGCGGTGGCCACGCGGAGATCGCTGCTGTTGTACGACACGCACCAGAAAGCGCCCGTC
GCGCTCATCTCAAACATACACTACACCAGGATCACAGACCTTTCGTGGTCTTCCGACGGC
CTGACCCTAGTGGCCTCCAGCACTGACGGTTTCTGCTCCGTCGTCAGTTTCACCGAGGAA
GAGCTGGGCGAGGCGCTCACCACCGCGGACGCCGTTAGTGCAGAGCCGATGGAAACGGAG
GAACAGAAACATAACCAAGAAACTCCTAAACAGAGACACGCTGAGGCGAAACCCATAGAA
GTCAAGCGGAGGCCGTCCTCGAACAACACCAAAATAGACGCCTTCATTAAGTTTAAAACT
CCCGAAGATAAGTCTCCGAAGAAGAAGAAGATCGAAAACATTCAGCAGAAGACGCCCGTC
AAGATGGACGTCCTCATGGAGACCGCGCTGCCATCCTGGTCTGACAACTCCAGCAACGAC
CTCATCAGACCCAAGGACACGGAGACCGCGACCCTCGGCGACGAAAATGACGTCACCGTC
ATAGAGGACAGCGAGGACATCCAGCTGGTCTACGAGGAGACCAAGGACGGCCAGTCGCCC
AAGACGGAACCCTCGGAGGAAAAACCTGCTCCCAAGACGATGTCTCCCAAACAATGCGGC
ACGGCCGACAGCAACTTCCTAATGAAGGCAAAGATCACCGACATCAGGGAGCCGGCGCCG
CTCACCGCCGTGCCGAGTCCCAAGGCACCGCGGAGAGTCAGCTTCGTGACGCTGTCGAGT
CCTAAGAGCACGAAAAAAAAATAA

Protein sequence:

MKFAIPEISWHNRDPVLSVDIQPKTNASEPLRLATGGTDSHVVIWYLSKTITGSVKLEVA
TDLTRHQKAVNVVRWSPNGVYLASGDDESIIFIWKQKTEEPIAPPLEGEEQYKETWVIHK
TLRGHMEDVLDISWSSSSLHLASGSVDNKLIVWDVARARSSGIVSDHKGFVQGVAWDPQG
QLIATASSDRVFRTFDVGTKKVLSRSSKAILPFPKEHTLHEVKVRLYHDDTLQTYYRRLH
FSPDGMFIAVPAGRIEPEQGKLDIKPMNAVYIYTRHSLKTPACVVPCGEPALVCRWSPVR
RAARTSPPAPSALQHAPRLLLAVATRRSLLLYDTHQKAPVALISNIHYTRITDLSWSSDG
LTLVASSTDGFCSVVSFTEEELGEALTTADAVSAEPMETEEQKHNQETPKQRHAEAKPIE
VKRRPSSNNTKIDAFIKFKTPEDKSPKKKKIENIQQKTPVKMDVLMETALPSWSDNSSND
LIRPKDTETATLGDENDVTVIEDSEDIQLVYEETKDGQSPKTEPSEEKPAPKTMSPKQCG
TADSNFLMKAKITDIREPAPLTAVPSPKAPRRVSFVTLSSPKSTKKK