DPGLEAN21862 in OGS1.0

New model in OGS2.0DPOGS211699 
Genomic Positionscaffold1817:+ 3350-25075
See gene structure
CDS Length3714
Paired RNAseq reads  1941
Single RNAseq reads  4301
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011397 (0.0)
Best Drosophila hit  CG3328 (8e-128)
Best Human hitmyelin gene regulatory factor isoform 2 (8e-120)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC015354 [Tribolium castaneum] (4e-145)
Best NR hit (blastx)  PREDICTED: similar to CG3328-PA [Apis mellifera] (2e-140)
GeneOntology terms
  
GO:0045449 regulation of transcription
GO:0003700 sequence-specific DNA binding transcription factor activity
InterPro families
  
IPR008967 p53-like transcription factor, DNA-binding
IPR007888 NDT80/PhoG-like DNA-binding
Orthology groupMCL18068

Nucleotide sequence:

ATGAGGAACCTTCTGGAAACACAAGGACGCTTAGAGGAGATGAAGTCTCTCTCTGAAATA
CCGTCGGACGTACTGAAGGAAAGAGAGGAGCTTGGACGGTGGAGGCGTGACCTTAACGCG
TGTCCAATAGGCCCTTTAAACTTACACCTGGGTCGCAAGTCCCAGGCGCTGTTGAAGCTC
CTCTCCCTGCAAATATGGACCCGACACCCGCTACCTGCGAGCCCGCTGCTGAGCAAAGGC
CTCTCCTCACATGGAGAAGGTCCGGGTCTTATAGACGAATTGGAGGCTTTTATGGCCCCC
GTCGGCGAGGGTTGCTTCGGCCAGCCTCCAGACGCACGCCGTATAGGGGGCCACCAGCTG
CCTGAAAGTCCACCGGACTCTGGCTCCGAGAACCCCTACAGTCCTGAGACCCAGGTATCC
CACACCATAGCGGTGTCACAGACCGTGTTAGGCACGGACTATATGCTGGTGCCGGAACAC
ATGACCTCCCATGAAATTTTACAGCAAAACGGCGATTACATATACGAGGAATTGAAGAGT
GATAGCATAGATCATGAGGTTTTAAGGAATAACCTCAATGATGTAGTGGTCCTACCAGCT
GATCAGAACTTAGATTTGGGTATACGCTCTGTCAGGCACGATTTGGGTTTAACTGAACCC
ATAGCCTATAATAGATACGGCCAGATGAGGGTCGAGCTCCCGGAATTGGAGCAGGGTATA
TTAAACCCGCAGCTGGTGTCTTTGGGTCATGAAAACCTGACCCCGGTGTATACGAACCTT
CAGGAACCCAGCGCCAAGAAGAGGAAGCACTCTCAAGACGTGAACTCCCAAGTGAAATGT
GAACCAGCGGCCCTGTCTCCCGAGAGCGTAGCCCGCCCTCCCCCATCCGTGGACGGCTCC
GAGGCCGGCGATGACCCGCCACTCCAGTGCATCAGGTTCTCAGCCTTCCAGCAGAACGTT
TGGTGTCCGCTATACGACTGCAACTTGAAGCCTATTCTGAACACTTCGTACGTGGTCGGC
GCTGACAAAGGCTTCAATTTCTCTCAGATAGACGAGGCTTTTGTCTGCCAGAAGAAGAAC
CACTTCCAGGTGACCTGCCAGATACAGGTCCAGGGGGAGGCTCAGTACGTGAAGACTCCA
GACGGATTCAAGAAGATTAACAACTTCTGTCTGCATTTCTATGGCGTCAAAGCGGAAGAC
CCGAGCCAGGAGGTGAGGATCGAGCAGAGCCAATCAGACAGGACCAAGAAACCCTTCCAT
CCAGTGCCTGTGGACATCCGGCGCGAGGGCGCTAAGGTCACCGTGGGTCGTCTGCACTTC
GCTGAAACCACAAACAATAACATGAGGAAGAAGGGGCGCCCCAACCCCGACCAGAGGCAC
TTCCAGCTGGTGGTGGCGCTGAGGGCTCACGTCGCGCACAGCGACTACATAGTGGCAGCG
CAGGCTAGTGACAGGATCATTGTCAGGGCATCGAACCCGGGCCAGTTCGAGTCCGACTGC
ACTGAGAGCTGGTGGCAGAGAGGAGTCGCCGAGAACAGCGTCCATTACAGTGGGAGAGTC
GGCATCAACACCGACCGGCCGGACGAGGCCTGTGTTATCAATGGAAACCTTAAAGTGATG
GGACACATAGTACATCCGTCCGACGCCAGAGCTAAACACGACATTGAAGAGTTGGACACG
GCGCAGCAACTGAGGAACGTGCAGAGCATACGAGTTGTTAAATTTCACTACGACCCGTCG
TTCGCCCACCACACGGGCCTGGCGGGTCATGCAACGGTCCCCGACACGGGAGTGCTGGCA
CAGGAGGTGAGGGAGGTCATCCCGGACGCTGTCAAGGAGGCCGGTGACGTCACCCTCGCT
AACGGGGACAGGATACGGAAGTTCCTCGTCGTGAACAAGGATCGCATCTTCATGGAAAAT
CTTGGAGCTGTGAAGGAGCTGTGCAAGGTGACAGGAAACCTGGAGACGAGGATCGACCAG
CTCGAGAGACTCACCAGGAGGCGGACCACCAGACATGACAGCGCCATCAGTAACGATTCG
CGAGTGTCGATCACATCTTCGAGGTCCTTCTACAGTGATGGGAACATCTCTATCGATCAG
ATAAGAGACATCGCCCGCAGCATCCGCAGACACGAGTGTTGCCACAAACTCAGTCACAAA
TCCCCAAAATTCACCCGAAAACAGTGCAAAAACTGTCATACGAATTACACGAAATATGGG
AAATATTACAACTACAATAAGACATGTATCAAGAATAAAGAAATGAAGGACAGTGAACAT
CCGTATCCGGATTATGTTAATAGTGCGGAACAGAAGTCACCGGAAGATACTTATACGGTA
ACAAAAAAGAAGGACTCGTGTCTGTGGTTGCGGGACGACGAGAGCTATCAACATGGCAAC
ACTAAACTGCCGTTCTGCTGCAGGAGGAAGTACAGATACGGCGGCAGCGGGGAGCTAATA
TCAAATAAATTCCTGCAGATCGTTATCACTATACTGATATTCGTCATGGCTATCTGCTTA
GTTGCGATGTCAGCGCTGTACTTCCGGGAGCACCGCGAGCTGGTGTCGCTGAGGGAGAGG
CGGGTGACGCCGCGGACAGCGAAACATAACACTATACAGAATCTTAAAATATCACAGCAC
ATGGTGGTCAAGAAAACGGCAAAAGAGAAAGCCCCATACAAAGTGTCCCCGCAGTACGTG
ACGTCATCACCGGAGCCTCCGACGACCCACACGACAAGGAACTACGTGAAGACATTGAGT
ACTCCTGTAGATGCTCCGTCCTTGTTTCGCTCAGCGGCCACTATCGGAGCTGGCTGCGGG
TCCTCGAGCACTGACAATGAGCTGGATTCTGGTTGTCAGTCGTCATGCTCGGACCCGTCC
CAGGTGTTCAACAGCCAGCCCCTGGAGAGCATCAGGACCGACGAGGAGGAGAAACAGAAC
GACACACAGACGGAGAGAAAGATCCTCACGCCGGTGATGACGGAGAACAACTACCTGGAG
AAGAACGAGAGCAGAGTGAGGCGCGCCGCGGACAGCGACGAGGCGCCCAGGAGCCAGGAG
GAGATGGCGCTGGAGGGGGACGGCGAGGGGGGAGGGGAATGCGACACGCTCACCCTGGGT
GTCGTGAGCAAGAGCTACACCAACGCGAGCGTGTTCAGCGAGCGCGTGTGTGTCCGCGTG
CTACGGAATTATACGTACACGCTGCCGGTGTCCGCCTGCCTGCACCAGAAATACCTGGAC
GTGGTCTTCAGATCATCCAAGCTGAGGGAGGTCCGTCTGTGCGACCTTCAGTGCAAATCC
GAGTCCATGAAGACCTGCCAGGTGGAGCGCGAGTCCTCAAAACCTATACCGGCTGGTGAC
ATCTGGACAGCCAGGGTCGGACTCCGCTGCAGACTCGACCGCTGGATGAAGATAAGGGCT
GGATTCGTCCCTCTCAAGAGACGGGACGAGAGGAGCTTCAACAGCGCCTGGGACCTGCAA
CCCAGCTCACTTCACAACTACACCGATTGCCCTGGCAGTAGCCCACATCGGGGTCCAACA
GGGAGTACGCCTTCAGTATATTGTACAAGGCCAGCTGACCTGGACCCAGGGCTGATGCGA
AGTAGGAATGCTTTGGGAACGTCCCGCCTGATGGACCTGTGCTACCTGAACCCCGAGGAC
AAGATCCCATTCGTTGAGTTCAACATACACATATACAGGGACTGCAGGAACTGA

Protein sequence:

MRNLLETQGRLEEMKSLSEIPSDVLKEREELGRWRRDLNACPIGPLNLHLGRKSQALLKL
LSLQIWTRHPLPASPLLSKGLSSHGEGPGLIDELEAFMAPVGEGCFGQPPDARRIGGHQL
PESPPDSGSENPYSPETQVSHTIAVSQTVLGTDYMLVPEHMTSHEILQQNGDYIYEELKS
DSIDHEVLRNNLNDVVVLPADQNLDLGIRSVRHDLGLTEPIAYNRYGQMRVELPELEQGI
LNPQLVSLGHENLTPVYTNLQEPSAKKRKHSQDVNSQVKCEPAALSPESVARPPPSVDGS
EAGDDPPLQCIRFSAFQQNVWCPLYDCNLKPILNTSYVVGADKGFNFSQIDEAFVCQKKN
HFQVTCQIQVQGEAQYVKTPDGFKKINNFCLHFYGVKAEDPSQEVRIEQSQSDRTKKPFH
PVPVDIRREGAKVTVGRLHFAETTNNNMRKKGRPNPDQRHFQLVVALRAHVAHSDYIVAA
QASDRIIVRASNPGQFESDCTESWWQRGVAENSVHYSGRVGINTDRPDEACVINGNLKVM
GHIVHPSDARAKHDIEELDTAQQLRNVQSIRVVKFHYDPSFAHHTGLAGHATVPDTGVLA
QEVREVIPDAVKEAGDVTLANGDRIRKFLVVNKDRIFMENLGAVKELCKVTGNLETRIDQ
LERLTRRRTTRHDSAISNDSRVSITSSRSFYSDGNISIDQIRDIARSIRRHECCHKLSHK
SPKFTRKQCKNCHTNYTKYGKYYNYNKTCIKNKEMKDSEHPYPDYVNSAEQKSPEDTYTV
TKKKDSCLWLRDDESYQHGNTKLPFCCRRKYRYGGSGELISNKFLQIVITILIFVMAICL
VAMSALYFREHRELVSLRERRVTPRTAKHNTIQNLKISQHMVVKKTAKEKAPYKVSPQYV
TSSPEPPTTHTTRNYVKTLSTPVDAPSLFRSAATIGAGCGSSSTDNELDSGCQSSCSDPS
QVFNSQPLESIRTDEEEKQNDTQTERKILTPVMTENNYLEKNESRVRRAADSDEAPRSQE
EMALEGDGEGGGECDTLTLGVVSKSYTNASVFSERVCVRVLRNYTYTLPVSACLHQKYLD
VVFRSSKLREVRLCDLQCKSESMKTCQVERESSKPIPAGDIWTARVGLRCRLDRWMKIRA
GFVPLKRRDERSFNSAWDLQPSSLHNYTDCPGSSPHRGPTGSTPSVYCTRPADLDPGLMR
SRNALGTSRLMDLCYLNPEDKIPFVEFNIHIYRDCRN