New model in OGS2.0 | DPOGS204633  |
---|---|
Genomic Position | scaffold3406:+ 1943-4402 |
CDS Length | 2460 |
Paired RNAseq reads   | 4514 |
Single RNAseq reads   | 11200 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA009463 (0.0) |
Best Drosophila hit   | host cell factor, isoform D (5e-152) |
Best Human hit | host cell factor 1 (2e-164) |
Best NR hit (blastp)   | AGAP004774-PA [Anopheles gambiae str. PEST] (0.0) |
Best NR hit (blastx)   | PREDICTED: host cell factor 1-like [Pongo abelii] (1e-164) |
GeneOntology terms    | GO:0043254 regulation of protein complex assembly; GO:0005634 nucleus; GO:0005515 protein binding; GO:0019046 reactivation of latent virus; GO:0005737 cytoplasm; GO:0071339 MLL1 complex; GO:0070688 MLL5-L complex; GO:0044419 interspecies interaction between organisms; GO:0006366 transcription from RNA polymerase II promoter; GO:0042802 identical protein binding; GO:0003713 transcription coactivator activity; GO:0045787 positive regulation of cell cycle; GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0045449 regulation of transcription |
InterPro families    | IPR006652 Kelch repeat type 1; IPR015915 Kelch-type beta propeller |
Orthology group | MCL15808 |
Nucleotide sequence:
ATGAAGGAAAACGCTGTACTTAAGTGGCAAAAAGTTTACAATCCTAGCGGCCCTCAACCA
CGACCTCGCCATGGCCATCGTGCAGTTGCCATTAAAGATTTGATGATAGTTTTCGGCGGC
GGAAATGAAGGGATAGTTCATGAGCTTCACGTCTTCAATACCACCACGAATCAATGGTTT
GTTCCTGTTACGAAAGGAGAGGTGCCTCCAGGATGCGCCGCCTACGGCTTCGTGGTCGAC
GGAACACGTCTCCTGGTTTTTGGCGGCATGGTAGAGTACGGCAAGTATTCCAACGATCTG
TACGAATTACAGGCCTCCCGGTGGGAATGGAAACGTCTAAAGCCTCTGCCACCCAAGCAA
GGCCTACCACCATGCCCAAGGTTAGGTCATAGCTTTACTCTCTTAAATGGTAAGGTTTAC
CTGTTCGGTGGACTGGCCAATGAGAGCGACGATCCTAAGAACAACATACCAAGATATCTC
AATGACCTGTATACATTGGAGCTGTATCCGAATTCATCTATGACGGTATGGGACATACCC
ATCACTTATGGTCAGTCACCACCTCCTCGCGAGTCACATAGTGGTGTGAGTTACACAGAT
AAGAATACTGGAAAATCCTCGTTGATCATATATGGAGGTATGAGCGGCTCACGTCTCGGT
GACCTGTGGGTGCTTGATGTGGACAGCATGACTTGGTCTCGGCCAGACCTGGGTGGCCCT
CCACCACTACCCCGCTCTCTTCACACAGCAACTGTCATTGGACATCACATGTATGTGTAT
GGCGGTTGGGTGCCATTGGTTCCAGATGAATCAAAACTTGCAACACATGAAAAGGAATGG
AAGTGCACCAACACACTTGCTTCTTTAAATTTGGATACCATGACTTGGGATTGCATTGCC
CTTGATAAGTTTGAAGAGTGTGTACCCAGAGCTCGAGCCGGGCACAGTGCGGTTGCTATT
CAAACAAGACTCTACATCTGGTCAGGCCGGGATGGCTATAGAAAAACTTGGAACAATCAG
ATTTGCTGCAAAGATCTGTGGTATCTTGAGGTTGGTGTGCCACCACAAGCGGGTCGTGTA
GCACTTGTAAGAGCAGGCACAACTTCACTTGAACTCTGCTGGCCCGCTATGAACACAGTC
ACTACATATCTTCTACAAGTTCAAAAATGTGGTAAAGTGACTCCCACAAGGTTCCCGACA
GCTACGGAACCGGTGGCAGCTCCTCCTCCTGCCTCACCTGTCTCTGGACAACCGGATGCT
GCCAAAGCGTTTGGACTCACATCACCTATTGGGACTGGACTTCCACCAACCCCAATTGAC
CTACCAATAAGACCTGCGGCAGCCGCGGCGTCTCCAGCTGCCAATCCCATTGTATCAACT
CCACAGAAAGTCGTATCCAGTGCTATTAAAATGCCTGGTCAAGCTGCCGTCAAAATATCT
CCTAACACTCCGAAACAAACCTACCACGGCAAAACAGTTGTAAAATCACCCGCTGCAGGC
TCTTCACAGCAGATAAAGGTCGCTGCGGTCACGCCACAAGGAGTGACTCGTATTGTAAGC
GGAGTAGCGACTCCTAACACAGTTCGTGTGTCCACTCCGCAATCAAACGCTCAGATAGTG
CTCGGCGCGGCGGCGGGCTCGGCTGGCTCGCCGAGGTTCGTTCAAGTTAAAACTGGCGGT
AACACTGGTGTCGTCAAATTAGGAAGCAGTAATGTGCCTGTGAAACTCGCGGGCAATGCC
GTTCCATTAAAAATTGGTGCAGCCAATCTCCAAGGCAAAATCACAGCAAACAGTGTACAG
AAGGTCGGCACGACTCCGTTGAAGCTAGGTGCTGGAAATGTAAAAATAAGTACGAGCAAT
GTGTTAGTGAAGACAGCTGGAGGAGTTCCCATACAAGCCGGAGTCGGCAGTGTGCAAGTC
AAGGCTGGAGTGCCGGTGAAGCTGAGCGCTGGTAATCTGCCGGTAATCGGAAGCTACGGT
GGCGCGATACAAATCGCAAGCGGCGCCATCGGCCAGCAAGTGAAAACTCCCGTTTATAAA
ATAGTGACGGCTAAATCTAGCGATCAAGGAACGGCTGTCACGAACGCCGTGTCAGGATCT
CCGGTCCTGAGGCAGGCCGGAGGTAACGTCATCATTAAGAAGACGCCGGGTGCTAGTTCA
CAATCAAGCACGTCGCCCCAGTACGTGACACTGGTGAAGACCTCTACAGGTATGACCGTG
GCGACTGTACCCAAGATGGCTGTGATGCAGAACCGACCGGCGACTCCGGCCAGTGCTGCA
CAGGGGATTACTCCTGGGGCGACGATCGTCAAGCTCGTATCGGCTAACTCAGTAGGCGGC
AACAAGATCATAACACTACCACCCAATAAGCTGCAGCTCGGCAAGACAGGTGTTGGAGGC
AAGCAGACCATAGTTATCACCAAGTCAGCGAGTCAGTCACAACAGGGGCAACCGCAGTGA
Protein sequence:
MKENAVLKWQKVYNPSGPQPRPRHGHRAVAIKDLMIVFGGGNEGIVHELHVFNTTTNQWF
VPVTKGEVPPGCAAYGFVVDGTRLLVFGGMVEYGKYSNDLYELQASRWEWKRLKPLPPKQ
GLPPCPRLGHSFTLLNGKVYLFGGLANESDDPKNNIPRYLNDLYTLELYPNSSMTVWDIP
ITYGQSPPPRESHSGVSYTDKNTGKSSLIIYGGMSGSRLGDLWVLDVDSMTWSRPDLGGP
PPLPRSLHTATVIGHHMYVYGGWVPLVPDESKLATHEKEWKCTNTLASLNLDTMTWDCIA
LDKFEECVPRARAGHSAVAIQTRLYIWSGRDGYRKTWNNQICCKDLWYLEVGVPPQAGRV
ALVRAGTTSLELCWPAMNTVTTYLLQVQKCGKVTPTRFPTATEPVAAPPPASPVSGQPDA
AKAFGLTSPIGTGLPPTPIDLPIRPAAAAASPAANPIVSTPQKVVSSAIKMPGQAAVKIS
PNTPKQTYHGKTVVKSPAAGSSQQIKVAAVTPQGVTRIVSGVATPNTVRVSTPQSNAQIV
LGAAAGSAGSPRFVQVKTGGNTGVVKLGSSNVPVKLAGNAVPLKIGAANLQGKITANSVQ
KVGTTPLKLGAGNVKISTSNVLVKTAGGVPIQAGVGSVQVKAGVPVKLSAGNLPVIGSYG
GAIQIASGAIGQQVKTPVYKIVTAKSSDQGTAVTNAVSGSPVLRQAGGNVIIKKTPGASS
QSSTSPQYVTLVKTSTGMTVATVPKMAVMQNRPATPASAAQGITPGATIVKLVSANSVGG
NKIITLPPNKLQLGKTGVGGKQTIVITKSASQSQQGQPQ
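
The figures in this record can be cross-checked against each other. The sketch below is only an illustration, not part of the original annotation: the function name `cds_consistency` is hypothetical, the coordinates are assumed to be 1-based and inclusive, and the protein length of 819 residues is a count of the protein sequence listed above rather than a value given in the table.

```python
def cds_consistency(start, end, cds_length, protein_length):
    """Run simple sanity checks on a gene model record.

    Assumes 1-based inclusive genomic coordinates and a CDS that
    ends with a single stop codon (both are assumptions).
    """
    span = end - start + 1  # number of bases covered on the scaffold
    return {
        # True when the genomic span equals the CDS length,
        # which is what an intron-free model would look like
        "span_matches_cds": span == cds_length,
        # A complete CDS should be a whole number of codons
        "cds_in_frame": cds_length % 3 == 0,
        # Codons minus the stop codon should equal the residue count
        "protein_matches_cds": cds_length // 3 - 1 == protein_length,
    }


# Values from the record: scaffold3406:+ 1943-4402, CDS Length 2460,
# and 819 residues counted in the protein sequence.
checks = cds_consistency(1943, 4402, 2460, 819)
print(checks)
```

If all three checks hold, the genomic span, the CDS length, and the protein sequence are mutually consistent. This says nothing about whether the underlying gene prediction is correct.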