DPGLEAN06507 in OGS1.0

New model in OGS2.0  DPOGS204633
Genomic Position  scaffold3406:+ 1943-4402
CDS Length  2460
Paired RNAseq reads  4514
Single RNAseq reads  11200
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA009463 (0.0)
Best Drosophila hit  host cell factor, isoform D (5e-152)
Best Human hit  host cell factor 1 (2e-164)
Best NR hit (blastp)  AGAP004774-PA [Anopheles gambiae str. PEST] (0.0)
Best NR hit (blastx)  PREDICTED: host cell factor 1-like [Pongo abelii] (1e-164)
GeneOntology terms
GO:0043254 regulation of protein complex assembly
GO:0005634 nucleus
GO:0005515 protein binding
GO:0019046 reactivation of latent virus
GO:0005737 cytoplasm
GO:0071339 MLL1 complex
GO:0070688 MLL5-L complex
GO:0044419 interspecies interaction between organisms
GO:0006366 transcription from RNA polymerase II promoter
GO:0042802 identical protein binding
GO:0003713 transcription coactivator activity
GO:0045787 positive regulation of cell cycle
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0045449 regulation of transcription
InterPro families
IPR006652 Kelch repeat type 1
IPR015915 Kelch-type beta propeller
Orthology group  MCL15808

Nucleotide sequence:

ATGAAGGAAAACGCTGTACTTAAGTGGCAAAAAGTTTACAATCCTAGCGGCCCTCAACCA
CGACCTCGCCATGGCCATCGTGCAGTTGCCATTAAAGATTTGATGATAGTTTTCGGCGGC
GGAAATGAAGGGATAGTTCATGAGCTTCACGTCTTCAATACCACCACGAATCAATGGTTT
GTTCCTGTTACGAAAGGAGAGGTGCCTCCAGGATGCGCCGCCTACGGCTTCGTGGTCGAC
GGAACACGTCTCCTGGTTTTTGGCGGCATGGTAGAGTACGGCAAGTATTCCAACGATCTG
TACGAATTACAGGCCTCCCGGTGGGAATGGAAACGTCTAAAGCCTCTGCCACCCAAGCAA
GGCCTACCACCATGCCCAAGGTTAGGTCATAGCTTTACTCTCTTAAATGGTAAGGTTTAC
CTGTTCGGTGGACTGGCCAATGAGAGCGACGATCCTAAGAACAACATACCAAGATATCTC
AATGACCTGTATACATTGGAGCTGTATCCGAATTCATCTATGACGGTATGGGACATACCC
ATCACTTATGGTCAGTCACCACCTCCTCGCGAGTCACATAGTGGTGTGAGTTACACAGAT
AAGAATACTGGAAAATCCTCGTTGATCATATATGGAGGTATGAGCGGCTCACGTCTCGGT
GACCTGTGGGTGCTTGATGTGGACAGCATGACTTGGTCTCGGCCAGACCTGGGTGGCCCT
CCACCACTACCCCGCTCTCTTCACACAGCAACTGTCATTGGACATCACATGTATGTGTAT
GGCGGTTGGGTGCCATTGGTTCCAGATGAATCAAAACTTGCAACACATGAAAAGGAATGG
AAGTGCACCAACACACTTGCTTCTTTAAATTTGGATACCATGACTTGGGATTGCATTGCC
CTTGATAAGTTTGAAGAGTGTGTACCCAGAGCTCGAGCCGGGCACAGTGCGGTTGCTATT
CAAACAAGACTCTACATCTGGTCAGGCCGGGATGGCTATAGAAAAACTTGGAACAATCAG
ATTTGCTGCAAAGATCTGTGGTATCTTGAGGTTGGTGTGCCACCACAAGCGGGTCGTGTA
GCACTTGTAAGAGCAGGCACAACTTCACTTGAACTCTGCTGGCCCGCTATGAACACAGTC
ACTACATATCTTCTACAAGTTCAAAAATGTGGTAAAGTGACTCCCACAAGGTTCCCGACA
GCTACGGAACCGGTGGCAGCTCCTCCTCCTGCCTCACCTGTCTCTGGACAACCGGATGCT
GCCAAAGCGTTTGGACTCACATCACCTATTGGGACTGGACTTCCACCAACCCCAATTGAC
CTACCAATAAGACCTGCGGCAGCCGCGGCGTCTCCAGCTGCCAATCCCATTGTATCAACT
CCACAGAAAGTCGTATCCAGTGCTATTAAAATGCCTGGTCAAGCTGCCGTCAAAATATCT
CCTAACACTCCGAAACAAACCTACCACGGCAAAACAGTTGTAAAATCACCCGCTGCAGGC
TCTTCACAGCAGATAAAGGTCGCTGCGGTCACGCCACAAGGAGTGACTCGTATTGTAAGC
GGAGTAGCGACTCCTAACACAGTTCGTGTGTCCACTCCGCAATCAAACGCTCAGATAGTG
CTCGGCGCGGCGGCGGGCTCGGCTGGCTCGCCGAGGTTCGTTCAAGTTAAAACTGGCGGT
AACACTGGTGTCGTCAAATTAGGAAGCAGTAATGTGCCTGTGAAACTCGCGGGCAATGCC
GTTCCATTAAAAATTGGTGCAGCCAATCTCCAAGGCAAAATCACAGCAAACAGTGTACAG
AAGGTCGGCACGACTCCGTTGAAGCTAGGTGCTGGAAATGTAAAAATAAGTACGAGCAAT
GTGTTAGTGAAGACAGCTGGAGGAGTTCCCATACAAGCCGGAGTCGGCAGTGTGCAAGTC
AAGGCTGGAGTGCCGGTGAAGCTGAGCGCTGGTAATCTGCCGGTAATCGGAAGCTACGGT
GGCGCGATACAAATCGCAAGCGGCGCCATCGGCCAGCAAGTGAAAACTCCCGTTTATAAA
ATAGTGACGGCTAAATCTAGCGATCAAGGAACGGCTGTCACGAACGCCGTGTCAGGATCT
CCGGTCCTGAGGCAGGCCGGAGGTAACGTCATCATTAAGAAGACGCCGGGTGCTAGTTCA
CAATCAAGCACGTCGCCCCAGTACGTGACACTGGTGAAGACCTCTACAGGTATGACCGTG
GCGACTGTACCCAAGATGGCTGTGATGCAGAACCGACCGGCGACTCCGGCCAGTGCTGCA
CAGGGGATTACTCCTGGGGCGACGATCGTCAAGCTCGTATCGGCTAACTCAGTAGGCGGC
AACAAGATCATAACACTACCACCCAATAAGCTGCAGCTCGGCAAGACAGGTGTTGGAGGC
AAGCAGACCATAGTTATCACCAAGTCAGCGAGTCAGTCACAACAGGGGCAACCGCAGTGA

Protein sequence:

MKENAVLKWQKVYNPSGPQPRPRHGHRAVAIKDLMIVFGGGNEGIVHELHVFNTTTNQWF
VPVTKGEVPPGCAAYGFVVDGTRLLVFGGMVEYGKYSNDLYELQASRWEWKRLKPLPPKQ
GLPPCPRLGHSFTLLNGKVYLFGGLANESDDPKNNIPRYLNDLYTLELYPNSSMTVWDIP
ITYGQSPPPRESHSGVSYTDKNTGKSSLIIYGGMSGSRLGDLWVLDVDSMTWSRPDLGGP
PPLPRSLHTATVIGHHMYVYGGWVPLVPDESKLATHEKEWKCTNTLASLNLDTMTWDCIA
LDKFEECVPRARAGHSAVAIQTRLYIWSGRDGYRKTWNNQICCKDLWYLEVGVPPQAGRV
ALVRAGTTSLELCWPAMNTVTTYLLQVQKCGKVTPTRFPTATEPVAAPPPASPVSGQPDA
AKAFGLTSPIGTGLPPTPIDLPIRPAAAAASPAANPIVSTPQKVVSSAIKMPGQAAVKIS
PNTPKQTYHGKTVVKSPAAGSSQQIKVAAVTPQGVTRIVSGVATPNTVRVSTPQSNAQIV
LGAAAGSAGSPRFVQVKTGGNTGVVKLGSSNVPVKLAGNAVPLKIGAANLQGKITANSVQ
KVGTTPLKLGAGNVKISTSNVLVKTAGGVPIQAGVGSVQVKAGVPVKLSAGNLPVIGSYG
GAIQIASGAIGQQVKTPVYKIVTAKSSDQGTAVTNAVSGSPVLRQAGGNVIIKKTPGASS
QSSTSPQYVTLVKTSTGMTVATVPKMAVMQNRPATPASAAQGITPGATIVKLVSANSVGG
NKIITLPPNKLQLGKTGVGGKQTIVITKSASQSQQGQPQ
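
The fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record. It assumes the genomic coordinates are 1-based and inclusive, and that the CDS is translated with the standard genetic code. The helper names (translate, span_length) are hypothetical. It translates the first and last 60-base lines of the nucleotide sequence and compares the coordinate span with the stated CDS length.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# this gene model uses the standard nuclear code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS string in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGAAGGAAAACGCTGTACTTAAGTGGCAAAAAGTTTACAATCCTAGCGGCCCTCAACCA"
last_line = "AAGCAGACCATAGTTATCACCAAGTCAGCGAGTCAGTCACAACAGGGGCAACCGCAGTGA"

print(translate(first_line))      # start of the protein sequence
print(translate(last_line))       # end of the protein sequence, stop codon dropped
print(span_length(1943, 4402))    # compare with the CDS Length field
```

Under these assumptions the coordinate span equals the CDS length of 2460, which corresponds to 820 codons: 819 residues plus the terminal stop codon.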