New model in OGS2.0 | DPOGS205508  |
---|---|
Genomic Position | scaffold699:+ 28668-43384 |
CDS Length | 2844 |
Paired RNAseq reads   | 1223 |
Single RNAseq reads   | 3679 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA000140 (6e-35) |
Best Drosophila hit   | CG7839 (1e-82) |
Best Human hit | CCAAT/enhancer-binding protein zeta (5e-112) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC002139 [Tribolium castaneum] (5e-179) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC002139 [Tribolium castaneum] (2e-167) |
GeneOntology terms    | GO:0006366 transcription from RNA polymerase II promoter; GO:0003677 DNA binding; GO:0005488 binding; GO:0045449 regulation of transcription; GO:0005634 nucleus |
InterPro families    | IPR016024 Armadillo-type fold; IPR011989 Armadillo-like helical; IPR005612 CCAAT-binding factor |
Orthology group | MCL12474 |
Nucleotide sequence:
ATGAAGCACAAAAGATTCGAAGAGAACATTTTAGTAGACGAAGCGTATACGAATTACGGT
AAAGTTGATAAAAATGCTGATGAAAAAAAAGGTTATGGTATAGCAAAACATTTCGCTGAT
ACTCTAGAATACGCGGACAAAAAGAAATGGTATCAGCAGCTACCAGAAGAGCCTTCGACA
CAAGGATCGATAACGCAAGAAAAAATTGAAGAACTCCGTAAAGAAGCGAGTAGTGCTCTA
CATGGTGACACATTAGCTTATGAAACAAAATCAAGTAAAAGTGGTTCATCCGACCAACAG
TGGGCTCGTACACTATTAACCAAAGGTACAATTGGAGACAGAGTGGCCGCAGCAACAATA
CTAATACAGGACAATCCGCTGTATAATTTGACGGCATTGAGGAATTTAATCAATAATGTG
AAGCCAGCTAAGAAAAAGGATGGAATAGTTATAATAGATGCTGTATCGGAACTGTTAGTA
TCAGAGCTGCTCATCCCTGACACCAAGCTGCGCACCTTCCAGCAGCATCCTCTACATGTC
ATTGATGAAATCACATCGGGCAACAAACAGGCAAGACGGAATGTTCTAAAACTATGGTAC
TATGAGGATCAACTTAAAGAGTTGTACGGAACTTATGTGGAAGCATTAAACAAGTTTGCT
CATGACACAGTGGACGCTAATAAGGAAAAGTCTATCAGTGCCATGTCTTACCTCTTGATG
CATCATCCGGAGAGAGAAAAGATGCTCCTAACAAATATAATAAATAAACTCGGTGATCCG
AGCCAGACGGTGGCATCGAAAGTGATCTACCATCTCTGTCAACTTCTATACAACCATCCG
AATATGAAATCTGTTGTTTTAGCTGAGATAGAGAAAATGTTGTTCAGATCTAACATATCC
CCACGCGCTCAATACTATGGAGTGTGTTTCCTGAACCAGTTCTTCCTTGGTAAGGATGAC
AGTCGGATAGCTGAGAACTTGATCAGAATATACTTCTCATTCTTCAAGGCTTCTATCAAG
AAGGGTGAAATAGATTCTCGTCTCATGTCAGCTATCCTGACGGGTGTGAAGCGAGCCTAT
CCCTTTGCTGACAGGGAGCGGTTGGTTGAGGCCTCCCAGCATGTAGACGCTGTACACCGA
CTGGTCCACCTGGCCAACATCAACGTGGCGATCCATGCACTGGCCTTGCTGTATCACATC
AGTGATGCTAACAAAGGGACATCCGACAGATACTACACAGCCTTGTACCGGAAACTGACA
AATTCCAATATATTCAATACTACCCACTCTGCATTGTTCTTCTCTCTCATATACAAGTCG
TTGAAGCAGGACAAGGATATAGACCGGGTGACGTCATTCATCAAGAGATTATTACAGCTG
TCCTGCTACATGAGCCCTGGCCAGGCTTGCGGAATGCTCTTCCTCATCTCGCAAGTATTG
AAGAGTGATGATAAGAGAGAGGCTGTAAAACTGGTCTTCAGTGAGATTAAAGAGGAAATT
AAAGAAGAAAATGAAACTAAAAATAATGATGAAAATCCAGAAGAATTAATGCATTCAGAA
GTTGAATTAGATGAGAGTAAGGAAGATGCTGAGGAAAATGTCAAACAGAAAAAAATTGAT
CTCTTAATAGGAGATAAGAAAGATTTATTAATGGATGATGAAGAAGAGACATATGTTGAC
CTCAAAATAGACGATGAAGGTAACATAAAGCCTAAGAAGAGGAATACGAACTCTGTGACT
GGGTGGTTTCATGCTAGAGTTGACAAGAAAGATGTACAAGAAAAAAACGTTGAGAAACAG
TTGAAGAAAGCTATTAATATTGGAAAGACGATAACCAGTTATAGTCCACTGTGCCGTGAC
CCTCGTTTCACCGGAGCACACCTGACGGCGATGGCTGAACTGACAATGCTGATGAAACAT
CATCATCCGAGTGTCAAGATGTTTGCTGAAAAATTACTGAATAATCAAATAATCCAATAT
GGCGGCGATCCTTTGAAGGACTTTTCCGGTATCCGTTTCCTGGATAGATTCGTGTTCAAG
AATCCAAAGAAACGTGCCGAGGTCACTGATGGGGAGGTCAAAAAGGTTAAGGGGTCACAT
CCGAAGTTCGCTGTTAGAAAGAACTATACAGCTAAAGGCATCAGAAGTATCGCTGTCAAT
TCATCGGCATATTTGAATGAGGATGTCAAGAAAATTCCTGTCGATGAAAGATTCCTATAT
GATTTCCTTCAAAAGCGCCGAGCGGCTGCTGATAGTGATGAGGAGAGTGACAACGACTCG
GTGACCAGCGAAGATTTTGAGACCTATTTGGATTCAGTCACTGGAACCAAAGCACAGGAA
TCCGATGAGGAGTTAGATTATTTGGGTGAATTGGAGTCGAGTAAACAGAAACGACCGAAG
GAAGTTGATGATGAGAAAGATGAGGTGATGAGCGATGATCAAGATGAAGACGATGATAGC
GATGGCGAACTCAATATATCCGGTGATGAAGACGAGCCAGTACTATCCGGAGACGAGGAC
GAACTAATGTTAGAAGACAGCGAAGAAGAAGACCAGATAGATATACCAGGAAAGAAGTCC
AAAAAGGATGCTATTAAATTAAAAGCTGCGAGTGACATTGACGTGGTGGAATCACGTAGG
CGCCCCCTCGGTTATACACAGTACAGTTATATTGGATTTTTAAATCGGTCGGATATTGTC
GCAGCGGCGCCCTCTGCAGACTGGAGTCACTTGAACTTTGCACAAGTCCACTGCAGGTTT
GAGTATGGTCGCGAATTATATTACGTGTCGTGTTTACTTTCGATACATTTGAAACAATTC
AAACTAAACAACGGTACTCGCTAG
Protein sequence:
MKHKRFEENILVDEAYTNYGKVDKNADEKKGYGIAKHFADTLEYADKKKWYQQLPEEPST
QGSITQEKIEELRKEASSALHGDTLAYETKSSKSGSSDQQWARTLLTKGTIGDRVAAATI
LIQDNPLYNLTALRNLINNVKPAKKKDGIVIIDAVSELLVSELLIPDTKLRTFQQHPLHV
IDEITSGNKQARRNVLKLWYYEDQLKELYGTYVEALNKFAHDTVDANKEKSISAMSYLLM
HHPEREKMLLTNIINKLGDPSQTVASKVIYHLCQLLYNHPNMKSVVLAEIEKMLFRSNIS
PRAQYYGVCFLNQFFLGKDDSRIAENLIRIYFSFFKASIKKGEIDSRLMSAILTGVKRAY
PFADRERLVEASQHVDAVHRLVHLANINVAIHALALLYHISDANKGTSDRYYTALYRKLT
NSNIFNTTHSALFFSLIYKSLKQDKDIDRVTSFIKRLLQLSCYMSPGQACGMLFLISQVL
KSDDKREAVKLVFSEIKEEIKEENETKNNDENPEELMHSEVELDESKEDAEENVKQKKID
LLIGDKKDLLMDDEEETYVDLKIDDEGNIKPKKRNTNSVTGWFHARVDKKDVQEKNVEKQ
LKKAINIGKTITSYSPLCRDPRFTGAHLTAMAELTMLMKHHHPSVKMFAEKLLNNQIIQY
GGDPLKDFSGIRFLDRFVFKNPKKRAEVTDGEVKKVKGSHPKFAVRKNYTAKGIRSIAVN
SSAYLNEDVKKIPVDERFLYDFLQKRRAAADSDEESDNDSVTSEDFETYLDSVTGTKAQE
SDEELDYLGELESSKQKRPKEVDDEKDEVMSDDQDEDDDSDGELNISGDEDEPVLSGDED
ELMLEDSEEEDQIDIPGKKSKKDAIKLKAASDIDVVESRRRPLGYTQYSYIGFLNRSDIV
AAAPSADWSHLNFAQVHCRFEYGRELYYVSCLLSIHLKQFKLNNGTR
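
The record lists a CDS length of 2844 nt alongside a nucleotide and a protein sequence, so the two sequences can be cross-checked by translating one into the other. The following is a minimal sketch of that check, assuming the standard genetic code applies to this nuclear gene. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration and are not part of the database.

```python
from itertools import product

# Standard genetic code (assumed here), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, dropping a single terminal stop codon.

    Whitespace and line breaks are ignored, so the sequence block can be
    pasted as-is. Ambiguous bases (e.g. N) are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First and last lines of the nucleotide sequence above.
first_line = "ATGAAGCACAAAAGATTCGAAGAGAACATTTTAGTAGACGAAGCGTATACGAATTACGGT"
last_line = "AAACTAAACAACGGTACTCGCTAG"
print(translate(first_line))  # MKHKRFEENILVDEAYTNYG
print(translate(last_line))   # KLNNGTR
```

Both outputs match the start and end of the protein sequence above. Passing the full nucleotide block to `translate` and comparing the result with the full protein block would extend the same check to the whole record.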