New model in OGS2.0 | DPOGS210302  |
---|---|
Genomic Position | scaffold1133:+ 146-39170 |
See gene structure | |
CDS Length | 3396 |
Paired RNAseq reads   | 1779 |
Single RNAseq reads   | 5612 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013826 (4e-104) |
Best Drosophila hit   | Myb oncogene-like, isoform A (5e-62) |
Best Human hit | transcriptional activator Myb isoform 1 (2e-65) |
Best NR hit (blastp)   | PREDICTED: similar to Myb protein [Apis mellifera] (1e-104) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC030738 [Tribolium castaneum] (2e-102) |
GeneOntology terms    | GO:0048566 embryonic digestive tract development GO:0005634 nucleus GO:0010468 regulation of gene expression GO:0006355 regulation of transcription, DNA-dependent GO:0006816 calcium ion transport GO:0003700 sequence-specific DNA binding transcription factor activity GO:0000082 G1/S transition of mitotic cell cycle GO:0001701 in utero embryonic development |
InterPro families    | IPR017930 Transcription regulator HTH, Myb-type, DNA-binding IPR012287 Homeodomain-related IPR014778 Myb, DNA-binding IPR015395 C-myb, C-terminal IPR009057 Homeodomain-like IPR001005 SANT domain, DNA binding IPR015495 Myb transcription factor |
Orthology group | MCL12661 |
Nucleotide sequence:
ATGTCGGGGGCGGACGCTTGTTCCCTGGCAGGTGCATACCTCCAGCTATGCGAACACCAG
CACGTACCAGCGCATATACCAGATCACTGTGTCCAGTGTACAACACCGACAGGAGACATC
ATAGAAGAAGGATCGTTTCTCCAACTCCAGAATATTCCATCTTCAATGGATGTGGTATTC
GTGGTTGAAGCCCAGAACTGTAATAAAAATATACGTAAGGCCAAAAACATCGATTTGTTC
GTGGAGACATTGGACAGCAAACTACAAGGAAATGGGTTCTCTGATAACAGGGACCCCTTA
AACCTACACCTGGGTCGCAACTCCCAGGCGCTGTTGAAGCTCCTCTCCCTGCAACGCGAT
GGACCCGGCACCCGCACGGTGAATCTATTGAATGCTAAGAGATACGCTGTAGTTGTATAC
GGAGGTCGCGGCGTGTTCAGTCGCGCTCGAGCTCTGTACGTCAACAACAAGCCGTTCACA
GACGCGGTTGATATACCGAGATATTTTGAAGCATTCCAGATCGAAAAGTCAACAGCTGAC
CGTAACCGGTCCTCGGAGGCGTTGCGTGCTCTACAGACAGTGAGCACCCTGCCCCTAAGA
GCGGGCGTACCTCGAATTGTCATACTGTTCCCGTGCCGCTCGTGTGGCAGCGGAGAGGAG
CTGGACTACTCAACCATCTACCACAATTTAATGGAGAACTCGATCACGCTCCACATACTT
ATGGACGGCGATTTTTCACTGTCAAAGAAACGAGTCGCTAAATATTTGTTTGGAATGGAT
AATTCCGTCGCCTACACTAATAAAGACTACGAGCGGCTGACAGGCGACGCCGGGTTGAAG
AAACAAGTTAGATTACCGAAGGAAAAACTTGGACTGTGTAGCTCATTGGCTCTAGAAACT
AATGGTACAATCTGGGCGGGTTCCAAATTAGAGTCAGACCGCGCGGCCGCTCGTCGTTTC
TCGACGGTTTGCGGCGGTAGAGTTTCTCGTGTCACGCCGTGTGCCGCTCCACGCTGCGAG
TGCCGTAACGCAGCGTTACACTGCAGGCCGTGCGCCAATCACGATCCGTTGGAGCTATCT
TTTTGGAACTCCGATGACATCGATGAACTCATTGACCTCGCAATGGATCCACCAACATTA
CCCTCCATGGAAGAGCAAGGCACGAAATCATATCTACAGCACCAGATAGATAAGCTGAAG
GCATTGCAAGCGAAACTATCTACTGAAGTGGAAGAGGATTATGTTATTATTAAGGAGGAA
GCACGCAGCGGCTATGATTCAGAGTCGAGTGACTATTCAGAGGATGACACGTATGAAGAT
GTACCGCCTCCAACCAAAGGCTCTGGGCCGCGGAAGAATATCAATAGGGGAAGGTGGACC
AAAGATGAGGACAAACGTTTGAAGGTTTACGTTAAGATGTACAATGAGAATTGGGAGAAG
ATAGCGAGTCAGTTTCCTGATAGATCTGATGTTCAGTGTCAACAGAGGTGGACCAAGGTT
GTCAACCCTGAACTAGTCAAGGGTCCCTGGACGAAAGAGGAAGACGAAAAAGTTATGGAG
TTGGTAGCGAAGTACGGACCAAAAAAATGGACTCTCATAGCGCGACATCTCAAAGGCAGA
ATAGGGAAACAATGCAGAGAGCGTTGGCACAATCATCTCAATCCTTCAATAAAGAAAAGT
GCGTGGACGGAGCATGAAGACAGAGTCATATATCAAGCTCACAGACAGCTTGGAAACCAA
TGGGCGAAAATAGCGAAGCTCTTACCCGGAAGAACTGACAATGCTATAAAGAACCATTGG
AACTCAACAATGAGAAGAAAATACGAGCCGGAGTTACTTGACAGTTTTGAACACTTGAGG
AAGAAGAAACGAAAGGAAGAAGATACACAACACAATGATGTGAGTCAAACATTGAATATA
CTTACCACGGTGCTATTACCCGACTTCATCGACAGAAGGACATGGTCGGACACACTGAAC
GAGTCGAGTCAGTCATCAAGTGCACCGGCGGTTCATCTTCGACAGTTGTTAAGAGAGAGG
GCACGAGGCTCACTGGCACCAGCGGATAGCGTCGAGATAGTCGATTCGCCGTTTAGATTC
GTCAATTTAGAGTCACTGCCTTTGAATTCACCAGTGAAGAATTATTTGAGTCAAGCCTCC
ACAAGCGATAATAACAGTCAAAATACAATAACTTACTCAATACAATCTGAAGCTGATAAA
GAAATAGTGGTACCGTCAATATACTCACCTCGAGACTCCCCACCGCCTATATTAAGAAGG
GCCAAGCGGAAAACAGCTGACACCACAACACCTTCAAGACAACCATGGTCGGATCCTCTA
TCTCGTGTAATGGAGAGCGGTGCTCCTCTACAAGCATTACCGTTCAGCCCATCTCAGTTC
CTGGTAGCGCCGCTTGGGAAACAGGACGCGACCCCGCTTAGGACTAAGGAATTGGGTGTA
AGTCCTCTATTGCACACGCCGACACCGAACCTAACGCCTGGCGGAACACAGTTTGAACAG
AAACATACACCTAAAACCCCGACCCCGTTCAAGTTAGCGATGGCCGAAATTGGCAAGAAG
TCAGGTTTGAAATATGAACCGTCCAGCCCGGAGCTGCTGGTGGAGGACATCACTGAGATG
ATACAGCGAGAGAACTCCGACAGTCTACACGAGTGTCTACTGTCACACGACCAAGAAGCG
CAGATGAGTTCAGACTCCGGTATATCAAGCGTGCAGCGCGGTAAAGAGAACGTTCCTGGT
GTACGTCGTTCGCGTAAAGCGCTCGCACATACATGGGGAGCCGCTACTAGCACGCCGCGA
GCGAGACTGCACGTGCCCGATGTATGCTTCGGAGTTGAGACGCCGAGTAAGACCCTGGCC
GGCGACAGTTCCGTTCTATTCTCGCCGCCATCGATAGTGAAGCATTCCCTCCTAGAAGAA
TCTACAAGCATCATATCAGAGAACACGCCAGAAGCCTACGAAGAAATTAAGGTACAACAC
AGGTGTCATGATCCTAACCCTAACTCCTACTACGAGCCTGTGTTTAGGAATATCACTAAC
GAATCGCCTAGAACATTCGCTAGGTTAAACGCGGACAGACTAAAAACTATAACAAGCGCA
AATCTATGTGACAGCTCGTTGGCGCCAAAAGATGTTCTTACGAACGCGTTAGTTGATCAC
ATATACAAAGTGACCAATCAGAAGCCGTCTACGTCACGCATTGACGAAATATTGACCAAT
CAGAAACCTTCTACGTCACACATAAACGATATGTCTACCAATAAAATTAGTAAACCAAAT
GACACGGAAAAAGAGAACCAATGGTGGCAGGTTGAGAAAGATTCCGGTATAGATGATCTT
TACAACGACTATTACATTTTTACTAATAATATATAA
Protein sequence:
MSGADACSLAGAYLQLCEHQHVPAHIPDHCVQCTTPTGDIIEEGSFLQLQNIPSSMDVVF
VVEAQNCNKNIRKAKNIDLFVETLDSKLQGNGFSDNRDPLNLHLGRNSQALLKLLSLQRD
GPGTRTVNLLNAKRYAVVVYGGRGVFSRARALYVNNKPFTDAVDIPRYFEAFQIEKSTAD
RNRSSEALRALQTVSTLPLRAGVPRIVILFPCRSCGSGEELDYSTIYHNLMENSITLHIL
MDGDFSLSKKRVAKYLFGMDNSVAYTNKDYERLTGDAGLKKQVRLPKEKLGLCSSLALET
NGTIWAGSKLESDRAAARRFSTVCGGRVSRVTPCAAPRCECRNAALHCRPCANHDPLELS
FWNSDDIDELIDLAMDPPTLPSMEEQGTKSYLQHQIDKLKALQAKLSTEVEEDYVIIKEE
ARSGYDSESSDYSEDDTYEDVPPPTKGSGPRKNINRGRWTKDEDKRLKVYVKMYNENWEK
IASQFPDRSDVQCQQRWTKVVNPELVKGPWTKEEDEKVMELVAKYGPKKWTLIARHLKGR
IGKQCRERWHNHLNPSIKKSAWTEHEDRVIYQAHRQLGNQWAKIAKLLPGRTDNAIKNHW
NSTMRRKYEPELLDSFEHLRKKKRKEEDTQHNDVSQTLNILTTVLLPDFIDRRTWSDTLN
ESSQSSSAPAVHLRQLLRERARGSLAPADSVEIVDSPFRFVNLESLPLNSPVKNYLSQAS
TSDNNSQNTITYSIQSEADKEIVVPSIYSPRDSPPPILRRAKRKTADTTTPSRQPWSDPL
SRVMESGAPLQALPFSPSQFLVAPLGKQDATPLRTKELGVSPLLHTPTPNLTPGGTQFEQ
KHTPKTPTPFKLAMAEIGKKSGLKYEPSSPELLVEDITEMIQRENSDSLHECLLSHDQEA
QMSSDSGISSVQRGKENVPGVRRSRKALAHTWGAATSTPRARLHVPDVCFGVETPSKTLA
GDSSVLFSPPSIVKHSLLEESTSIISENTPEAYEEIKVQHRCHDPNPNSYYEPVFRNITN
ESPRTFARLNADRLKTITSANLCDSSLAPKDVLTNALVDHIYKVTNQKPSTSRIDEILTN
QKPSTSHINDMSTNKISKPNDTEKENQWWQVEKDSGIDDLYNDYYIFTNNI