DPGLEAN04877 in OGS1.0

New model in OGS2.0  DPOGS212481
Genomic Position  scaffold1116:+ 34971-55397
CDS Length  1656
Paired RNAseq reads  2730
Single RNAseq reads  7584
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA009653 (5e-08)
Best Drosophila hit  gemini, isoform C (7e-82)
Best Human hit  upstream-binding protein 1 isoform b (6e-58)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC009932 [Tribolium castaneum] (2e-96)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC009932 [Tribolium castaneum] (5e-93)
GeneOntology terms
GO:0005634 nucleus
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0045449 regulation of transcription
InterPro families  IPR007604 CP2 transcription factor
Orthology group  MCL10687

Nucleotide sequence:

ATGCATGACGGTCGTAAGGAGGACTTCGCTTGGGAGTCTTCTGTGAAGTTGACACTAGTT
GGTTCTGAGGACTTGGGGATGTATTTAGAGGAAAGGGAAAGGGAGGGAAGGAAGCGCAAG
TGGCAGGAGGGGAAGATGCGCAGGGAGGAGTTGGAGGAGAAGGAGCAAAGGAAGATGCCG
CCCAAGAAGGTCATGGCTAACCTCGATTTGGGGGGTAGCGGCACCAGCTCCTCCTCGTCA
CACCTTTCCCCGGGATGGCAGGTCAATGACCTGGATCTTGATTTACCCGGGGAACTTTCC
ATGAATGAGGCCCTACTGTCGTTGCCGTCACTGGCCGTGTTCAAGCAAGAGGCGCCTTCT
CCAACTGGGAACGCACTGTCTCCGCCGCGCAGGACCTGGCCCGTGAGGCGTACTGATGAC
AGACAGATAACCAATATGGTTGTGGACAACCGGGATGCCATGGACGAGGGCTGCCAGCAA
CACGCTGGAGTCATGAACACGCAATCCAATAGCCCGGAAAGCATGCAGTGCCAAACCATG
CCAGTCATAATGCCAATTAATGGCTACCATTCTCCTAGTGGACAAGAAAATAAAAGTAAT
GAGGCCCTACTGTCGTTGCCGTCACTGGCCGTGTTCAAGCAAGAGGCGCCTTCTCCAACT
GGGAACGCACTGTCTCCGCCGCGCAGGACCTGGCCCGTGAGGCGTACTGATGACAGACAG
ATAACCAATATGGTTGTGGACAACCGGGATGCCATGGACGAGGGCTGCCAGCAACACGCT
GGAGTCATGAACACGCAATCCAATAGCCCGGAAAGCATGCAGTGCCAAACCATGCCAGTC
ATAATGCCAATTAATGGCTACCATTCTCCTAGTGGACAAGAAAATAAAAACGCTGGTCTT
CTGATGTGTTCTCCGGCCAGCTCTCTGGATGGCTTCCTGCACTCTCCGAGGCCCGACTCC
GGCTTCAAAGACGACAACAGATTCCAATACGTCCTAGCGGCGGCTACGTCCATAGCGACC
AAACAGAACGAAGAGACGTTGACTTACTTGAACCAGGGACAGCCTTACGAGGTCAAGCTG
AAGAAGCTCGGAGACCTCGCGCACTACAAGGGGAAGATACTGAAGAGTATAATAAAGATC
TGCTTCCACGAGCGCCGTCTGCAGTACATGGAGAGAGAACAAATAGCACAGTGGCACAAC
GACAGGCCGGGAGAGAGGATATTAGAGGTGGACGTACCGCTATCGTACGGTGTATCACGC
GTGGAGCAACCAGCTGCCCTCAATGAACTACATGTACACTGGGACCCGACCAAAGATGTC
GGGGTGTACGTTAAAGTGAACTGCATATCAACAGAATTCACAGCCAAGAAACATGGCGGT
GAAAAGGGAGTGCCGTTCCGTATCCAGGTGGAGACGATGTATGAGGACAGGCGACTGCAT
ACAGCTGCCTGCCAGATTAAGGTCTTCAAGTTGAAGGGCGCTGATCGTAAACACAAGCAG
GACCGGGAACGAGTACTGAGAAGGCCCAGGTCCGAGGTGGAGAGGTATCAACCTGGGTGT
GACGCAACTGTCCTGACGACGCTATCTAACGACGCTCTGATGCCACCGCCTTCCCTTGTG
ACAACATCACCCCCATACTCCCCAGAGATGTGGTAA

Protein sequence:

MHDGRKEDFAWESSVKLTLVGSEDLGMYLEEREREGRKRKWQEGKMRREELEEKEQRKMP
PKKVMANLDLGGSGTSSSSSHLSPGWQVNDLDLDLPGELSMNEALLSLPSLAVFKQEAPS
PTGNALSPPRRTWPVRRTDDRQITNMVVDNRDAMDEGCQQHAGVMNTQSNSPESMQCQTM
PVIMPINGYHSPSGQENKSNEALLSLPSLAVFKQEAPSPTGNALSPPRRTWPVRRTDDRQ
ITNMVVDNRDAMDEGCQQHAGVMNTQSNSPESMQCQTMPVIMPINGYHSPSGQENKNAGL
LMCSPASSLDGFLHSPRPDSGFKDDNRFQYVLAAATSIATKQNEETLTYLNQGQPYEVKL
KKLGDLAHYKGKILKSIIKICFHERRLQYMEREQIAQWHNDRPGERILEVDVPLSYGVSR
VEQPAALNELHVHWDPTKDVGVYVKVNCISTEFTAKKHGGEKGVPFRIQVETMYEDRRLH
TAACQIKVFKLKGADRKHKQDRERVLRRPRSEVERYQPGCDATVLTTLSNDALMPPPSLV
TTSPPYSPEMW
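
Consistency check (illustrative):

The CDS length of 1656 nt corresponds to 552 codons, i.e. 551 residues plus a stop codon, which matches the protein sequence above. A minimal Python sketch of how the nucleotide and protein sequences could be cross-checked is given below. It assumes the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGCATGACGGTCGTAAGGAGGACTTCGCTTGGGAGTCTTCTGTGAAGTTGACACTAGTT"
last_line = "ACAACATCACCCCCATACTCCCCAGAGATGTGGTAA"

print(translate(first_line))  # MHDGRKEDFAWESSVKLTLV
print(translate(last_line))   # TTSPPYSPEMW
print(1656 // 3 - 1)          # 551 residues expected
```

Passing the full nucleotide sequence to `translate` and comparing the result with the protein sequence extends the same check to the whole record.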