New model in OGS2.0 | DPOGS206420  |
---|---|
Genomic Position | scaffold341:+ 18246-21152 |
See gene structure | |
CDS Length | 927 |
Paired RNAseq reads   | 339 |
Single RNAseq reads   | 877 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013786 (3e-109) |
Best Drosophila hit   | Mat1 (9e-95) |
Best Human hit | CDK-activating kinase assembly factor MAT1 isoform 1 (4e-75) |
Best NR hit (blastp)   | PREDICTED: similar to Mat1 CG7614-PA [Apis mellifera] (6e-118) |
Best NR hit (blastx)   | PREDICTED: similar to Mat1 CG7614-PA [Apis mellifera] (2e-106) |
GeneOntology terms    | GO:0016251 general RNA polymerase II transcription factor activity GO:0005675 holo TFIIH complex GO:0006367 transcription initiation from RNA polymerase II promoter GO:0008270 zinc ion binding GO:0007049 cell cycle GO:0005515 protein binding |
InterPro families    | IPR001841 Zinc finger, RING-type IPR015877 Cdk-activating kinase assembly factor MAT1, centre IPR004575 Cdk-activating kinase assembly factor (MAT1) IPR013083 Zinc finger, RING/FYVE/PHD-type IPR017907 Zinc finger, RING-type, conserved site IPR016390 Cdk-activating kinase assembly factor (MAT1), metazoa |
Orthology group | MCL13644 |
Nucleotide sequence:
ATGGATGATCAAGCATGTCCCCGTTGTAAAACAACGAAATACAGAAATCCATCCCTAAAG
TTGATGGTAAACATTTGTGGCCATGCTTTGTGCGAGAGCTGTGTTGATTTATTGTTTTTA
AAAGGATCTGGTTCATGTCCTGATTGCAATGTTCCTTTGCGTCGTAGTAATTTTCGTGTA
CAGCTTTTCGAAGATTCCATGGTGGAAAAAGAAATGGATATAAGAAAACGTGTTCTCAAG
GACTTTAACAAAAAAGAAGAGGATTTCTCAACACTCAGAGAATATAACGATTATTTAGAA
GAAATAGAAGTAATAATATATAATTTAGTCAATAACATAGATGTGGTCGGAACAAACAAA
AGGATAGAACAATATAAAAGGGATAATAAAGAACTTATTATGAAAAACAAAGCCAAAATC
GGTAGGGAAGAAATAGAATTAGAGGAGATATTGGAAATTGAAAAGCAAATGGAGGAATTA
AGACGTCAGGAAATAGCTAAGATGGAGGATGAGGCGAAGAAACAGAAAATAAGAGCAAAG
GAAGCTTTGATTGATGAGTTAATGTTCGCCGACGGAGACGCTAAGGATATATTGAACACA
TTTGCACAAACTGTGGCTAATAAGCAAGAGGAAGTTGTGCCGCTGCTACCTAAAGTGACA
CAGTTCTCATCGGGTGTGAAATTTACTAGAGGTTCGAGTCAGGCAATACCTATAATAGAA
GAAGGGCCGCTTTACAAATATGAACCGTTAGAAATACCTGATAGATGTGGACCGGATCCA
CCGTCGTTGGAGGAGATTATGAATAACGGGTTTCTGCATCACGTTAGAGCAGAGAACGAG
ACAGAGAAAGCTGGTGGTTATACATCTACTCTACCGTGTCTGAGAGCACTCCAAGATGCA
CTCTCCGGCCTCTACCACGCCAGCTGA
Protein sequence:
MDDQACPRCKTTKYRNPSLKLMVNICGHALCESCVDLLFLKGSGSCPDCNVPLRRSNFRV
QLFEDSMVEKEMDIRKRVLKDFNKKEEDFSTLREYNDYLEEIEVIIYNLVNNIDVVGTNK
RIEQYKRDNKELIMKNKAKIGREEIELEEILEIEKQMEELRRQEIAKMEDEAKKQKIRAK
EALIDELMFADGDAKDILNTFAQTVANKQEEVVPLLPKVTQFSSGVKFTRGSSQAIPIIE
EGPLYKYEPLEIPDRCGPDPPSLEEIMNNGFLHHVRAENETEKAGGYTSTLPCLRALQDA
LSGLYHAS