New model in OGS2.0 | DPOGS211699  |
---|---|
Genomic Position | scaffold1817:+ 3350-25075 |
See gene structure | |
CDS Length | 3714 |
Paired RNAseq reads   | 1941 |
Single RNAseq reads   | 4301 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA011397 (0.0) |
Best Drosophila hit   | CG3328 (8e-128) |
Best Human hit | myelin gene regulatory factor isoform 2 (8e-120) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC015354 [Tribolium castaneum] (4e-145) |
Best NR hit (blastx)   | PREDICTED: similar to CG3328-PA [Apis mellifera] (2e-140) |
GeneOntology terms    | GO:0045449 regulation of transcription GO:0003700 sequence-specific DNA binding transcription factor activity |
InterPro families    | IPR008967 p53-like transcription factor, DNA-binding IPR007888 NDT80/PhoG-like DNA-binding |
Orthology group | MCL18068 |
Nucleotide sequence:
ATGAGGAACCTTCTGGAAACACAAGGACGCTTAGAGGAGATGAAGTCTCTCTCTGAAATA
CCGTCGGACGTACTGAAGGAAAGAGAGGAGCTTGGACGGTGGAGGCGTGACCTTAACGCG
TGTCCAATAGGCCCTTTAAACTTACACCTGGGTCGCAAGTCCCAGGCGCTGTTGAAGCTC
CTCTCCCTGCAAATATGGACCCGACACCCGCTACCTGCGAGCCCGCTGCTGAGCAAAGGC
CTCTCCTCACATGGAGAAGGTCCGGGTCTTATAGACGAATTGGAGGCTTTTATGGCCCCC
GTCGGCGAGGGTTGCTTCGGCCAGCCTCCAGACGCACGCCGTATAGGGGGCCACCAGCTG
CCTGAAAGTCCACCGGACTCTGGCTCCGAGAACCCCTACAGTCCTGAGACCCAGGTATCC
CACACCATAGCGGTGTCACAGACCGTGTTAGGCACGGACTATATGCTGGTGCCGGAACAC
ATGACCTCCCATGAAATTTTACAGCAAAACGGCGATTACATATACGAGGAATTGAAGAGT
GATAGCATAGATCATGAGGTTTTAAGGAATAACCTCAATGATGTAGTGGTCCTACCAGCT
GATCAGAACTTAGATTTGGGTATACGCTCTGTCAGGCACGATTTGGGTTTAACTGAACCC
ATAGCCTATAATAGATACGGCCAGATGAGGGTCGAGCTCCCGGAATTGGAGCAGGGTATA
TTAAACCCGCAGCTGGTGTCTTTGGGTCATGAAAACCTGACCCCGGTGTATACGAACCTT
CAGGAACCCAGCGCCAAGAAGAGGAAGCACTCTCAAGACGTGAACTCCCAAGTGAAATGT
GAACCAGCGGCCCTGTCTCCCGAGAGCGTAGCCCGCCCTCCCCCATCCGTGGACGGCTCC
GAGGCCGGCGATGACCCGCCACTCCAGTGCATCAGGTTCTCAGCCTTCCAGCAGAACGTT
TGGTGTCCGCTATACGACTGCAACTTGAAGCCTATTCTGAACACTTCGTACGTGGTCGGC
GCTGACAAAGGCTTCAATTTCTCTCAGATAGACGAGGCTTTTGTCTGCCAGAAGAAGAAC
CACTTCCAGGTGACCTGCCAGATACAGGTCCAGGGGGAGGCTCAGTACGTGAAGACTCCA
GACGGATTCAAGAAGATTAACAACTTCTGTCTGCATTTCTATGGCGTCAAAGCGGAAGAC
CCGAGCCAGGAGGTGAGGATCGAGCAGAGCCAATCAGACAGGACCAAGAAACCCTTCCAT
CCAGTGCCTGTGGACATCCGGCGCGAGGGCGCTAAGGTCACCGTGGGTCGTCTGCACTTC
GCTGAAACCACAAACAATAACATGAGGAAGAAGGGGCGCCCCAACCCCGACCAGAGGCAC
TTCCAGCTGGTGGTGGCGCTGAGGGCTCACGTCGCGCACAGCGACTACATAGTGGCAGCG
CAGGCTAGTGACAGGATCATTGTCAGGGCATCGAACCCGGGCCAGTTCGAGTCCGACTGC
ACTGAGAGCTGGTGGCAGAGAGGAGTCGCCGAGAACAGCGTCCATTACAGTGGGAGAGTC
GGCATCAACACCGACCGGCCGGACGAGGCCTGTGTTATCAATGGAAACCTTAAAGTGATG
GGACACATAGTACATCCGTCCGACGCCAGAGCTAAACACGACATTGAAGAGTTGGACACG
GCGCAGCAACTGAGGAACGTGCAGAGCATACGAGTTGTTAAATTTCACTACGACCCGTCG
TTCGCCCACCACACGGGCCTGGCGGGTCATGCAACGGTCCCCGACACGGGAGTGCTGGCA
CAGGAGGTGAGGGAGGTCATCCCGGACGCTGTCAAGGAGGCCGGTGACGTCACCCTCGCT
AACGGGGACAGGATACGGAAGTTCCTCGTCGTGAACAAGGATCGCATCTTCATGGAAAAT
CTTGGAGCTGTGAAGGAGCTGTGCAAGGTGACAGGAAACCTGGAGACGAGGATCGACCAG
CTCGAGAGACTCACCAGGAGGCGGACCACCAGACATGACAGCGCCATCAGTAACGATTCG
CGAGTGTCGATCACATCTTCGAGGTCCTTCTACAGTGATGGGAACATCTCTATCGATCAG
ATAAGAGACATCGCCCGCAGCATCCGCAGACACGAGTGTTGCCACAAACTCAGTCACAAA
TCCCCAAAATTCACCCGAAAACAGTGCAAAAACTGTCATACGAATTACACGAAATATGGG
AAATATTACAACTACAATAAGACATGTATCAAGAATAAAGAAATGAAGGACAGTGAACAT
CCGTATCCGGATTATGTTAATAGTGCGGAACAGAAGTCACCGGAAGATACTTATACGGTA
ACAAAAAAGAAGGACTCGTGTCTGTGGTTGCGGGACGACGAGAGCTATCAACATGGCAAC
ACTAAACTGCCGTTCTGCTGCAGGAGGAAGTACAGATACGGCGGCAGCGGGGAGCTAATA
TCAAATAAATTCCTGCAGATCGTTATCACTATACTGATATTCGTCATGGCTATCTGCTTA
GTTGCGATGTCAGCGCTGTACTTCCGGGAGCACCGCGAGCTGGTGTCGCTGAGGGAGAGG
CGGGTGACGCCGCGGACAGCGAAACATAACACTATACAGAATCTTAAAATATCACAGCAC
ATGGTGGTCAAGAAAACGGCAAAAGAGAAAGCCCCATACAAAGTGTCCCCGCAGTACGTG
ACGTCATCACCGGAGCCTCCGACGACCCACACGACAAGGAACTACGTGAAGACATTGAGT
ACTCCTGTAGATGCTCCGTCCTTGTTTCGCTCAGCGGCCACTATCGGAGCTGGCTGCGGG
TCCTCGAGCACTGACAATGAGCTGGATTCTGGTTGTCAGTCGTCATGCTCGGACCCGTCC
CAGGTGTTCAACAGCCAGCCCCTGGAGAGCATCAGGACCGACGAGGAGGAGAAACAGAAC
GACACACAGACGGAGAGAAAGATCCTCACGCCGGTGATGACGGAGAACAACTACCTGGAG
AAGAACGAGAGCAGAGTGAGGCGCGCCGCGGACAGCGACGAGGCGCCCAGGAGCCAGGAG
GAGATGGCGCTGGAGGGGGACGGCGAGGGGGGAGGGGAATGCGACACGCTCACCCTGGGT
GTCGTGAGCAAGAGCTACACCAACGCGAGCGTGTTCAGCGAGCGCGTGTGTGTCCGCGTG
CTACGGAATTATACGTACACGCTGCCGGTGTCCGCCTGCCTGCACCAGAAATACCTGGAC
GTGGTCTTCAGATCATCCAAGCTGAGGGAGGTCCGTCTGTGCGACCTTCAGTGCAAATCC
GAGTCCATGAAGACCTGCCAGGTGGAGCGCGAGTCCTCAAAACCTATACCGGCTGGTGAC
ATCTGGACAGCCAGGGTCGGACTCCGCTGCAGACTCGACCGCTGGATGAAGATAAGGGCT
GGATTCGTCCCTCTCAAGAGACGGGACGAGAGGAGCTTCAACAGCGCCTGGGACCTGCAA
CCCAGCTCACTTCACAACTACACCGATTGCCCTGGCAGTAGCCCACATCGGGGTCCAACA
GGGAGTACGCCTTCAGTATATTGTACAAGGCCAGCTGACCTGGACCCAGGGCTGATGCGA
AGTAGGAATGCTTTGGGAACGTCCCGCCTGATGGACCTGTGCTACCTGAACCCCGAGGAC
AAGATCCCATTCGTTGAGTTCAACATACACATATACAGGGACTGCAGGAACTGA
Protein sequence:
MRNLLETQGRLEEMKSLSEIPSDVLKEREELGRWRRDLNACPIGPLNLHLGRKSQALLKL
LSLQIWTRHPLPASPLLSKGLSSHGEGPGLIDELEAFMAPVGEGCFGQPPDARRIGGHQL
PESPPDSGSENPYSPETQVSHTIAVSQTVLGTDYMLVPEHMTSHEILQQNGDYIYEELKS
DSIDHEVLRNNLNDVVVLPADQNLDLGIRSVRHDLGLTEPIAYNRYGQMRVELPELEQGI
LNPQLVSLGHENLTPVYTNLQEPSAKKRKHSQDVNSQVKCEPAALSPESVARPPPSVDGS
EAGDDPPLQCIRFSAFQQNVWCPLYDCNLKPILNTSYVVGADKGFNFSQIDEAFVCQKKN
HFQVTCQIQVQGEAQYVKTPDGFKKINNFCLHFYGVKAEDPSQEVRIEQSQSDRTKKPFH
PVPVDIRREGAKVTVGRLHFAETTNNNMRKKGRPNPDQRHFQLVVALRAHVAHSDYIVAA
QASDRIIVRASNPGQFESDCTESWWQRGVAENSVHYSGRVGINTDRPDEACVINGNLKVM
GHIVHPSDARAKHDIEELDTAQQLRNVQSIRVVKFHYDPSFAHHTGLAGHATVPDTGVLA
QEVREVIPDAVKEAGDVTLANGDRIRKFLVVNKDRIFMENLGAVKELCKVTGNLETRIDQ
LERLTRRRTTRHDSAISNDSRVSITSSRSFYSDGNISIDQIRDIARSIRRHECCHKLSHK
SPKFTRKQCKNCHTNYTKYGKYYNYNKTCIKNKEMKDSEHPYPDYVNSAEQKSPEDTYTV
TKKKDSCLWLRDDESYQHGNTKLPFCCRRKYRYGGSGELISNKFLQIVITILIFVMAICL
VAMSALYFREHRELVSLRERRVTPRTAKHNTIQNLKISQHMVVKKTAKEKAPYKVSPQYV
TSSPEPPTTHTTRNYVKTLSTPVDAPSLFRSAATIGAGCGSSSTDNELDSGCQSSCSDPS
QVFNSQPLESIRTDEEEKQNDTQTERKILTPVMTENNYLEKNESRVRRAADSDEAPRSQE
EMALEGDGEGGGECDTLTLGVVSKSYTNASVFSERVCVRVLRNYTYTLPVSACLHQKYLD
VVFRSSKLREVRLCDLQCKSESMKTCQVERESSKPIPAGDIWTARVGLRCRLDRWMKIRA
GFVPLKRRDERSFNSAWDLQPSSLHNYTDCPGSSPHRGPTGSTPSVYCTRPADLDPGLMR
SRNALGTSRLMDLCYLNPEDKIPFVEFNIHIYRDCRN