DPGLEAN19668 in OGS1.0

New model in OGS2.0  DPOGS206420
Genomic Position  scaffold341:+ 18246-21152
CDS Length  927
Paired RNAseq reads  339
Single RNAseq reads  877
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA013786 (3e-109)
Best Drosophila hit  Mat1 (9e-95)
Best Human hit  CDK-activating kinase assembly factor MAT1 isoform 1 (4e-75)
Best NR hit (blastp)  PREDICTED: similar to Mat1 CG7614-PA [Apis mellifera] (6e-118)
Best NR hit (blastx)  PREDICTED: similar to Mat1 CG7614-PA [Apis mellifera] (2e-106)
GeneOntology terms
GO:0016251 general RNA polymerase II transcription factor activity
GO:0005675 holo TFIIH complex
GO:0006367 transcription initiation from RNA polymerase II promoter
GO:0008270 zinc ion binding
GO:0007049 cell cycle
GO:0005515 protein binding
InterPro families
IPR001841 Zinc finger, RING-type
IPR015877 Cdk-activating kinase assembly factor MAT1, centre
IPR004575 Cdk-activating kinase assembly factor (MAT1)
IPR013083 Zinc finger, RING/FYVE/PHD-type
IPR017907 Zinc finger, RING-type, conserved site
IPR016390 Cdk-activating kinase assembly factor (MAT1), metazoa
Orthology group  MCL13644

Nucleotide sequence:

ATGGATGATCAAGCATGTCCCCGTTGTAAAACAACGAAATACAGAAATCCATCCCTAAAG
TTGATGGTAAACATTTGTGGCCATGCTTTGTGCGAGAGCTGTGTTGATTTATTGTTTTTA
AAAGGATCTGGTTCATGTCCTGATTGCAATGTTCCTTTGCGTCGTAGTAATTTTCGTGTA
CAGCTTTTCGAAGATTCCATGGTGGAAAAAGAAATGGATATAAGAAAACGTGTTCTCAAG
GACTTTAACAAAAAAGAAGAGGATTTCTCAACACTCAGAGAATATAACGATTATTTAGAA
GAAATAGAAGTAATAATATATAATTTAGTCAATAACATAGATGTGGTCGGAACAAACAAA
AGGATAGAACAATATAAAAGGGATAATAAAGAACTTATTATGAAAAACAAAGCCAAAATC
GGTAGGGAAGAAATAGAATTAGAGGAGATATTGGAAATTGAAAAGCAAATGGAGGAATTA
AGACGTCAGGAAATAGCTAAGATGGAGGATGAGGCGAAGAAACAGAAAATAAGAGCAAAG
GAAGCTTTGATTGATGAGTTAATGTTCGCCGACGGAGACGCTAAGGATATATTGAACACA
TTTGCACAAACTGTGGCTAATAAGCAAGAGGAAGTTGTGCCGCTGCTACCTAAAGTGACA
CAGTTCTCATCGGGTGTGAAATTTACTAGAGGTTCGAGTCAGGCAATACCTATAATAGAA
GAAGGGCCGCTTTACAAATATGAACCGTTAGAAATACCTGATAGATGTGGACCGGATCCA
CCGTCGTTGGAGGAGATTATGAATAACGGGTTTCTGCATCACGTTAGAGCAGAGAACGAG
ACAGAGAAAGCTGGTGGTTATACATCTACTCTACCGTGTCTGAGAGCACTCCAAGATGCA
CTCTCCGGCCTCTACCACGCCAGCTGA

Protein sequence:

MDDQACPRCKTTKYRNPSLKLMVNICGHALCESCVDLLFLKGSGSCPDCNVPLRRSNFRV
QLFEDSMVEKEMDIRKRVLKDFNKKEEDFSTLREYNDYLEEIEVIIYNLVNNIDVVGTNK
RIEQYKRDNKELIMKNKAKIGREEIELEEILEIEKQMEELRRQEIAKMEDEAKKQKIRAK
EALIDELMFADGDAKDILNTFAQTVANKQEEVVPLLPKVTQFSSGVKFTRGSSQAIPIIE
EGPLYKYEPLEIPDRCGPDPPSLEEIMNNGFLHHVRAENETEKAGGYTSTLPCLRALQDA
LSGLYHAS
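
The relationship between the two sequence listings above can be checked with a short script. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this CDS; the names `CODON_TABLE`, `translate`, `first_block`, and `last_block` are illustrative and not part of the record. It translates the first and last lines of the nucleotide listing and prints the corresponding peptide fragments, which can be compared with the start and end of the protein listing.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), in TCAG order.
# Assumption: this table is appropriate for the nuclear CDS shown above.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Ambiguous bases such as N are not handled and raise KeyError.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide listing (both start in frame,
# because every full line holds 60 bases, i.e. 20 codons).
first_block = "ATGGATGATCAAGCATGTCCCCGTTGTAAAACAACGAAATACAGAAATCCATCCCTAAAG"
last_block = "CTCTCCGGCCTCTACCACGCCAGCTGA"

print(translate(first_block))  # MDDQACPRCKTTKYRNPSLK
print(translate(last_block))   # LSGLYHAS*
```

The reported CDS length of 927 is consistent with the listing: 15 full lines of 60 bases plus a final line of 27 bases, giving 309 codons, i.e. 308 residues plus the terminal stop codon.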