DPGLEAN01991 in OGS1.0

New model in OGS2.0  DPOGS215556
Genomic Position  scaffold860:+ 5273-7201
CDS Length  1929
Paired RNAseq reads  182
Single RNAseq reads  494
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010685 (7e-142)
Best Drosophila hit  PSEA-binding protein 95kD (5e-19)
Best Human hit  snRNA-activating protein complex subunit 4 (7e-27)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC004408 [Tribolium castaneum] (9e-40)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC004408 [Tribolium castaneum] (3e-40)
Gene Ontology terms
GO:0005634 nucleus
GO:0006350 transcription
GO:0045449 regulation of transcription
GO:0003677 DNA binding
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0008150 biological_process
InterPro families
IPR012287 Homeodomain-related
IPR017877 MYB-like
IPR017930 Transcription regulator HTH, Myb-type, DNA-binding
IPR014778 Myb, DNA-binding
IPR001005 SANT domain, DNA binding
IPR009057 Homeodomain-like
IPR015495 Myb transcription factor
Orthology group  MCL12085
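
The genomic position and CDS length listed above can be cross-checked with a little arithmetic. The sketch below assumes the scaffold coordinates are 1-based and inclusive (an assumption; the record does not state its coordinate convention), and the helper name `span_length` is hypothetical. Under that assumption the genomic span equals the CDS length, which would be consistent with a coding region that is not interrupted by introns, and the length divides evenly into codons.

```python
def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record: scaffold860:+ 5273-7201, CDS Length 1929.
start, end = 5273, 7201
cds_length = 1929

span = span_length(start, end)
print(span == cds_length)  # True under the inclusive-coordinate assumption

# A complete CDS should be a whole number of codons; one of them is the stop.
codons, remainder = divmod(cds_length, 3)
print(codons, remainder)   # 643 codons, remainder 0
print(codons - 1)          # 642 residues expected in the protein
```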

Nucleotide sequence:

ATGAAAAACGTAGACAAAGATCTATCTGATAGACAGGACAGACAAGAGGCGTTCCGTTAT
GTTAATTGCGGTAAACCATACTTCAAAGATAGATGCTTCTTCCCAGCTCCTGATAATGAG
GATACATTATTAATGGTTAAGTCTGGAATGTATGATTTTTCAAATATAGCTTCCATTGCT
GGTTGGACTGTTAAAGACAAAAGTCAATTTTTAATTGTATTGCAAAAACTGTCACAGACA
CTTAAAAAGAATGAGATAAAATCAAAAATAGCAGAGTTAAAACGCGAAAGTAAAGTTAAT
GAAACGAAGAAAAATTGCAGAATGATTGTCGCTTTGAACAGGGAAATGGATAGAATCAAT
AAAATGGCATTAAAGGATGTAGCTTTACCCATTGACCAAGAATATGATTGGGATTTTGTC
GCCAATAAATTGAATTTCAGACACAGTGCACAGGAATATCGATCACTTTGGAAATTGTTC
CTCCATCCAAGTATCAATAAGAGTTCATGGAATAAGACTGAACACACATTACTACAAAGA
ATAGTACTAGAAGAGAATCTTGTAAATTGGGACACAATAGCATTAAAATTAGGCACTAGA
CGCACTAGTTATCAGTGCTTTGTTTATTTCAGGACAAACATGAGCAATACATTCACAGGT
ATGAAATGGACTAAGGAGGAAGAGGAATATCTTAAAAGGTTAATTGATTACTACAAACAA
GATAACTACATCCCTTGGGGTAAAGTTGCAGCATCTATGGAGAATAGGACCAAGATTCAA
ATTTATAACAAATTTTTACGATTGGAGGAACACAGAAAAGGCAGGTTTATGCCCGAGGAA
GATGCTGTAATATTGACAGGTGTTGATAGTTTTGGGCAAGACTATAAGAAAATTTCAAAA
TACCTTCCGGGTAGATCCGCAGCCCAGTGCAGAGTTAGGTATCAAGTGTTAGCTAAGAAG
CGAATTTCAGCTGTTTGGACAGTGGACGAGGATAGAAAACTAGTGCAATTAATGGCAAAT
CAAGATTCAAACATTAATTATTCCACTTTGGTTCCTTATTTTCAAGGAAAAGATAGGTTT
CATTTGAGATCCCGATATTTGACCTTGACAAAGTGGATGAGACTGCATCCCAATATGGAT
ATAGCTTTAGCACCCAGACGAGGGGCTCGACGCTTAGGTCACGGCCAATCATCTGATGAC
CTGAATTCAGCCATTGAAAGTTTAAAAACGAAGATTCAATCAGAACTCACAGACAATAGG
AAGAAAAGGGTAACTAAAGATTCTCCTGAAAATGTTATTGAAGATGCAATTATTGCTACG
CTTGTTACGGAAAATGTCAGGTTGGAAGAAGCTAGGAAAGGTCAGACATCTTGCGATACG
CAGACTGGTATGGAACAAAGAAATAAAACGAGCAACGCCTGCAATCTATCAAGTTTGCAG
AAAGTTTTAATTCTATTACGATCAAAGTTAAATAAAAAGAAATTTATTCAAAATGGCGAT
CCGAAGTATAAAGGTTTGATTGAGACAGAAAATGATATTTATTCCGTAAGAGTTAAATCC
TATTCTAAGGAGAATATAAAAAAGAATAATGTTAACATTAATTCTAAACCCGATATTTGG
GGTGAAGTTTGTCTTGGTCCTTTGGAACACGTGTTTCCACCGCATTACGCGACGATAACT
GGTTGCAGGAAACTAATGTCCTATGTAAGCAGCAAACCCAATAGGGATGACACTGTTAAC
CTACAAACGTTACTAAGAAAAAACATCCTTCTCAAAGAACAACTGCTTCTTTTGATGGAA
CGATTTAATGCTTTATTTCTTTGGCCCCTTCTTCTATCCAACTCACCTCCAGAGCCTTTC
GCATCAATAGAAAATGATAAGTCATTAATTGATAAAGACATTAAAATCTTTAAGGATGAT
GAAAATTAG

Protein sequence:

MKNVDKDLSDRQDRQEAFRYVNCGKPYFKDRCFFPAPDNEDTLLMVKSGMYDFSNIASIA
GWTVKDKSQFLIVLQKLSQTLKKNEIKSKIAELKRESKVNETKKNCRMIVALNREMDRIN
KMALKDVALPIDQEYDWDFVANKLNFRHSAQEYRSLWKLFLHPSINKSSWNKTEHTLLQR
IVLEENLVNWDTIALKLGTRRTSYQCFVYFRTNMSNTFTGMKWTKEEEEYLKRLIDYYKQ
DNYIPWGKVAASMENRTKIQIYNKFLRLEEHRKGRFMPEEDAVILTGVDSFGQDYKKISK
YLPGRSAAQCRVRYQVLAKKRISAVWTVDEDRKLVQLMANQDSNINYSTLVPYFQGKDRF
HLRSRYLTLTKWMRLHPNMDIALAPRRGARRLGHGQSSDDLNSAIESLKTKIQSELTDNR
KKRVTKDSPENVIEDAIIATLVTENVRLEEARKGQTSCDTQTGMEQRNKTSNACNLSSLQ
KVLILLRSKLNKKKFIQNGDPKYKGLIETENDIYSVRVKSYSKENIKKNNVNINSKPDIW
GEVCLGPLEHVFPPHYATITGCRKLMSYVSSKPNRDDTVNLQTLLRKNILLKEQLLLLME
RFNALFLWPLLLSNSPPEPFASIENDKSLIDKDIKIFKDDEN
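
The protein sequence above should correspond to a translation of the nucleotide sequence. As a minimal sketch, assuming the standard genetic code applies, the snippet below translates the first 60 nucleotides and the final 9 nucleotides of the CDS and compares them with the start and end of the listed protein. The function name `translate` and the table construction are illustrative and not part of the original record; only two short excerpts are embedded here, but the same function can be applied to the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First line of the nucleotide sequence and the last three codons.
first_60_nt = "ATGAAAAACGTAGACAAAGATCTATCTGATAGACAGGACAGACAAGAGGCGTTCCGTTAT"
last_9_nt = "GAAAATTAG"

print(translate(first_60_nt))  # MKNVDKDLSDRQDRQEAFRY
print(translate(last_9_nt))    # EN
```

Both outputs match the listed protein, which begins with MKNVDKDLSDRQDRQEAFRY and ends with EN.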