DPGLEAN15963 in OGS1.0

New model in OGS2.0DPOGS210302 
Genomic Positionscaffold1133:+ 146-39170
See gene structure
CDS Length3396
Paired RNAseq reads  1779
Single RNAseq reads  5612
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013826 (4e-104)
Best Drosophila hit  Myb oncogene-like, isoform A (5e-62)
Best Human hittranscriptional activator Myb isoform 1 (2e-65)
Best NR hit (blastp)  PREDICTED: similar to Myb protein [Apis mellifera] (1e-104)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC030738 [Tribolium castaneum] (2e-102)
GeneOntology terms






  
GO:0048566 embryonic digestive tract development
GO:0005634 nucleus
GO:0010468 regulation of gene expression
GO:0006355 regulation of transcription, DNA-dependent
GO:0006816 calcium ion transport
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0000082 G1/S transition of mitotic cell cycle
GO:0001701 in utero embryonic development
InterPro families





  
IPR017930 Transcription regulator HTH, Myb-type, DNA-binding
IPR012287 Homeodomain-related
IPR014778 Myb, DNA-binding
IPR015395 C-myb, C-terminal
IPR009057 Homeodomain-like
IPR001005 SANT domain, DNA binding
IPR015495 Myb transcription factor
Orthology groupMCL12661

Nucleotide sequence:

ATGTCGGGGGCGGACGCTTGTTCCCTGGCAGGTGCATACCTCCAGCTATGCGAACACCAG
CACGTACCAGCGCATATACCAGATCACTGTGTCCAGTGTACAACACCGACAGGAGACATC
ATAGAAGAAGGATCGTTTCTCCAACTCCAGAATATTCCATCTTCAATGGATGTGGTATTC
GTGGTTGAAGCCCAGAACTGTAATAAAAATATACGTAAGGCCAAAAACATCGATTTGTTC
GTGGAGACATTGGACAGCAAACTACAAGGAAATGGGTTCTCTGATAACAGGGACCCCTTA
AACCTACACCTGGGTCGCAACTCCCAGGCGCTGTTGAAGCTCCTCTCCCTGCAACGCGAT
GGACCCGGCACCCGCACGGTGAATCTATTGAATGCTAAGAGATACGCTGTAGTTGTATAC
GGAGGTCGCGGCGTGTTCAGTCGCGCTCGAGCTCTGTACGTCAACAACAAGCCGTTCACA
GACGCGGTTGATATACCGAGATATTTTGAAGCATTCCAGATCGAAAAGTCAACAGCTGAC
CGTAACCGGTCCTCGGAGGCGTTGCGTGCTCTACAGACAGTGAGCACCCTGCCCCTAAGA
GCGGGCGTACCTCGAATTGTCATACTGTTCCCGTGCCGCTCGTGTGGCAGCGGAGAGGAG
CTGGACTACTCAACCATCTACCACAATTTAATGGAGAACTCGATCACGCTCCACATACTT
ATGGACGGCGATTTTTCACTGTCAAAGAAACGAGTCGCTAAATATTTGTTTGGAATGGAT
AATTCCGTCGCCTACACTAATAAAGACTACGAGCGGCTGACAGGCGACGCCGGGTTGAAG
AAACAAGTTAGATTACCGAAGGAAAAACTTGGACTGTGTAGCTCATTGGCTCTAGAAACT
AATGGTACAATCTGGGCGGGTTCCAAATTAGAGTCAGACCGCGCGGCCGCTCGTCGTTTC
TCGACGGTTTGCGGCGGTAGAGTTTCTCGTGTCACGCCGTGTGCCGCTCCACGCTGCGAG
TGCCGTAACGCAGCGTTACACTGCAGGCCGTGCGCCAATCACGATCCGTTGGAGCTATCT
TTTTGGAACTCCGATGACATCGATGAACTCATTGACCTCGCAATGGATCCACCAACATTA
CCCTCCATGGAAGAGCAAGGCACGAAATCATATCTACAGCACCAGATAGATAAGCTGAAG
GCATTGCAAGCGAAACTATCTACTGAAGTGGAAGAGGATTATGTTATTATTAAGGAGGAA
GCACGCAGCGGCTATGATTCAGAGTCGAGTGACTATTCAGAGGATGACACGTATGAAGAT
GTACCGCCTCCAACCAAAGGCTCTGGGCCGCGGAAGAATATCAATAGGGGAAGGTGGACC
AAAGATGAGGACAAACGTTTGAAGGTTTACGTTAAGATGTACAATGAGAATTGGGAGAAG
ATAGCGAGTCAGTTTCCTGATAGATCTGATGTTCAGTGTCAACAGAGGTGGACCAAGGTT
GTCAACCCTGAACTAGTCAAGGGTCCCTGGACGAAAGAGGAAGACGAAAAAGTTATGGAG
TTGGTAGCGAAGTACGGACCAAAAAAATGGACTCTCATAGCGCGACATCTCAAAGGCAGA
ATAGGGAAACAATGCAGAGAGCGTTGGCACAATCATCTCAATCCTTCAATAAAGAAAAGT
GCGTGGACGGAGCATGAAGACAGAGTCATATATCAAGCTCACAGACAGCTTGGAAACCAA
TGGGCGAAAATAGCGAAGCTCTTACCCGGAAGAACTGACAATGCTATAAAGAACCATTGG
AACTCAACAATGAGAAGAAAATACGAGCCGGAGTTACTTGACAGTTTTGAACACTTGAGG
AAGAAGAAACGAAAGGAAGAAGATACACAACACAATGATGTGAGTCAAACATTGAATATA
CTTACCACGGTGCTATTACCCGACTTCATCGACAGAAGGACATGGTCGGACACACTGAAC
GAGTCGAGTCAGTCATCAAGTGCACCGGCGGTTCATCTTCGACAGTTGTTAAGAGAGAGG
GCACGAGGCTCACTGGCACCAGCGGATAGCGTCGAGATAGTCGATTCGCCGTTTAGATTC
GTCAATTTAGAGTCACTGCCTTTGAATTCACCAGTGAAGAATTATTTGAGTCAAGCCTCC
ACAAGCGATAATAACAGTCAAAATACAATAACTTACTCAATACAATCTGAAGCTGATAAA
GAAATAGTGGTACCGTCAATATACTCACCTCGAGACTCCCCACCGCCTATATTAAGAAGG
GCCAAGCGGAAAACAGCTGACACCACAACACCTTCAAGACAACCATGGTCGGATCCTCTA
TCTCGTGTAATGGAGAGCGGTGCTCCTCTACAAGCATTACCGTTCAGCCCATCTCAGTTC
CTGGTAGCGCCGCTTGGGAAACAGGACGCGACCCCGCTTAGGACTAAGGAATTGGGTGTA
AGTCCTCTATTGCACACGCCGACACCGAACCTAACGCCTGGCGGAACACAGTTTGAACAG
AAACATACACCTAAAACCCCGACCCCGTTCAAGTTAGCGATGGCCGAAATTGGCAAGAAG
TCAGGTTTGAAATATGAACCGTCCAGCCCGGAGCTGCTGGTGGAGGACATCACTGAGATG
ATACAGCGAGAGAACTCCGACAGTCTACACGAGTGTCTACTGTCACACGACCAAGAAGCG
CAGATGAGTTCAGACTCCGGTATATCAAGCGTGCAGCGCGGTAAAGAGAACGTTCCTGGT
GTACGTCGTTCGCGTAAAGCGCTCGCACATACATGGGGAGCCGCTACTAGCACGCCGCGA
GCGAGACTGCACGTGCCCGATGTATGCTTCGGAGTTGAGACGCCGAGTAAGACCCTGGCC
GGCGACAGTTCCGTTCTATTCTCGCCGCCATCGATAGTGAAGCATTCCCTCCTAGAAGAA
TCTACAAGCATCATATCAGAGAACACGCCAGAAGCCTACGAAGAAATTAAGGTACAACAC
AGGTGTCATGATCCTAACCCTAACTCCTACTACGAGCCTGTGTTTAGGAATATCACTAAC
GAATCGCCTAGAACATTCGCTAGGTTAAACGCGGACAGACTAAAAACTATAACAAGCGCA
AATCTATGTGACAGCTCGTTGGCGCCAAAAGATGTTCTTACGAACGCGTTAGTTGATCAC
ATATACAAAGTGACCAATCAGAAGCCGTCTACGTCACGCATTGACGAAATATTGACCAAT
CAGAAACCTTCTACGTCACACATAAACGATATGTCTACCAATAAAATTAGTAAACCAAAT
GACACGGAAAAAGAGAACCAATGGTGGCAGGTTGAGAAAGATTCCGGTATAGATGATCTT
TACAACGACTATTACATTTTTACTAATAATATATAA

Protein sequence:

MSGADACSLAGAYLQLCEHQHVPAHIPDHCVQCTTPTGDIIEEGSFLQLQNIPSSMDVVF
VVEAQNCNKNIRKAKNIDLFVETLDSKLQGNGFSDNRDPLNLHLGRNSQALLKLLSLQRD
GPGTRTVNLLNAKRYAVVVYGGRGVFSRARALYVNNKPFTDAVDIPRYFEAFQIEKSTAD
RNRSSEALRALQTVSTLPLRAGVPRIVILFPCRSCGSGEELDYSTIYHNLMENSITLHIL
MDGDFSLSKKRVAKYLFGMDNSVAYTNKDYERLTGDAGLKKQVRLPKEKLGLCSSLALET
NGTIWAGSKLESDRAAARRFSTVCGGRVSRVTPCAAPRCECRNAALHCRPCANHDPLELS
FWNSDDIDELIDLAMDPPTLPSMEEQGTKSYLQHQIDKLKALQAKLSTEVEEDYVIIKEE
ARSGYDSESSDYSEDDTYEDVPPPTKGSGPRKNINRGRWTKDEDKRLKVYVKMYNENWEK
IASQFPDRSDVQCQQRWTKVVNPELVKGPWTKEEDEKVMELVAKYGPKKWTLIARHLKGR
IGKQCRERWHNHLNPSIKKSAWTEHEDRVIYQAHRQLGNQWAKIAKLLPGRTDNAIKNHW
NSTMRRKYEPELLDSFEHLRKKKRKEEDTQHNDVSQTLNILTTVLLPDFIDRRTWSDTLN
ESSQSSSAPAVHLRQLLRERARGSLAPADSVEIVDSPFRFVNLESLPLNSPVKNYLSQAS
TSDNNSQNTITYSIQSEADKEIVVPSIYSPRDSPPPILRRAKRKTADTTTPSRQPWSDPL
SRVMESGAPLQALPFSPSQFLVAPLGKQDATPLRTKELGVSPLLHTPTPNLTPGGTQFEQ
KHTPKTPTPFKLAMAEIGKKSGLKYEPSSPELLVEDITEMIQRENSDSLHECLLSHDQEA
QMSSDSGISSVQRGKENVPGVRRSRKALAHTWGAATSTPRARLHVPDVCFGVETPSKTLA
GDSSVLFSPPSIVKHSLLEESTSIISENTPEAYEEIKVQHRCHDPNPNSYYEPVFRNITN
ESPRTFARLNADRLKTITSANLCDSSLAPKDVLTNALVDHIYKVTNQKPSTSRIDEILTN
QKPSTSHINDMSTNKISKPNDTEKENQWWQVEKDSGIDDLYNDYYIFTNNI