DPGLEAN01120 in OGS1.0

New model in OGS2.0DPOGS210231 
Genomic Positionscaffold42:- 32824-40012
See gene structure
CDS Length2673
Paired RNAseq reads  1536
Single RNAseq reads  3989
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002551 (0.0)
Best Drosophila hit  Zn finger homeodomain 1, isoform B (7e-93)
Best Human hitzinc finger E-box-binding homeobox 2 isoform 1 (3e-35)
Best NR hit (blastp)  zinc finger protein, putative [Pediculus humanus corporis] (7e-146)
Best NR hit (blastx)  zinc finger protein, putative [Pediculus humanus corporis] (7e-130)
GeneOntology terms

















  
GO:0007399 nervous system development
GO:0007498 mesoderm development
GO:0003677 DNA binding
GO:0008354 germ cell migration
GO:0008406 gonad development
GO:0007280 pole cell migration
GO:0003702 RNA polymerase II transcription factor activity
GO:0005634 nucleus
GO:0007507 heart development
GO:0007417 central nervous system development
GO:0007517 muscle organ development
GO:0006355 regulation of transcription, DNA-dependent
GO:0008270 zinc ion binding
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0043565 sequence-specific DNA binding
GO:0008045 motor axon guidance
GO:0019730 antimicrobial humoral response
GO:0007514 garland cell differentiation
GO:0048542 lymph gland development
InterPro families





  
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR012287 Homeodomain-related
IPR001356 Homeobox
IPR007087 Zinc finger, C2H2-type
IPR009057 Homeodomain-like
IPR015880 Zinc finger, C2H2-like
IPR017970 Homeobox, conserved site
Orthology groupMCL16518

Nucleotide sequence:

ATGACATGTTCGGCTGCTACGGTGACAGCTACTCGTATCGCCTGTGGAGCCTGGCCAACG
TGTGGGGCGCCTGTCGATGTGAGTAGGAGGCCGTGTCGTGGCCGCGGCCGAGTGTCTCTC
AGTGAGGTGATGATAGTGCACAGCTCTCTTATGGGAGTTGTCGGTTGGTCTGAACGTTCG
TCAGCGCCCTGGCTGGCCGACATGAGGAGTAAGCAGGTCCCCACGTCGACGACCTCCGCC
AGGCGGCGAGCTGCCTTCATCCGGCCTCTAGCTGGTGGAGGTTCCGAGATCAAGACGTGT
GAGGAGGAAGAGGATCGTGGGTCTTCGCCTCTGGGTCCCGGCGGGCCTTTCCCCTGCAGC
CACTGTCGCGGAGCCTACCCGACCCGCGAACAGCTCGAGCGACATGAGACGCTGCACGCG
CCGAGCACTCAGACGTGCAAAATATGCCACAAGAGTTTTCAAAATGTCTACAGACTTCAG
CGTCACATGATAAGTCATGATGACAGCGCGAAGTTACGCAAATATAAATGCAATGACTGC
GATAAAGCATTTAAATTTAAACATCATCTCAAAGAGCATCTCCGGATACACAGTGGAGAA
AAACCGTTCGAATGCGCTAACTGTGGAAAAAAGTTTTCACATTCCGGCTCTTATTCGTCT
CATATGACTTCCAAGAAATGTCTTGTTATGAATCTCAAAATGGGAAGAATAAAGCCAAAT
AATCCGGCATTGAATCCAGACCGGAGTCCATCTCGGAAACGTGCAAACGCCATGGCAGCT
AGTCAGTTGAATAATAATATCGCGCCAAACGGCAATTCGTTTTTGCCAATATTGCCAAAG
TATAACGAAGCTGCAGCATTTTTCGCATCTATGTCATCTCAAGAAAATAATTTCCTGAGG
CCCCCGTTGGGTCAACCTGGATTAAATCCTTTCTATATGCCTCCTGGTATGCCAATGAGC
CCAGCCAACGGTATTGCACCTTACACCTTCCCTACTTCATTAAGCCAATTATTTGAGCAA
CTGGCCTCTCAACATTATCAACAACGAAAAATAGAGATTCCTAGTCCAAAGCTTGTGAGT
CCACCCGCAAACCCTGAAGACTTAATCGAGGAAGTAGTGGATGAGGAAGATAAGCGCTCC
GAGGCCAGCGCAGAACTAGTGATGGATATAGACGACGATGACAACGTTACCGTAAAGAAA
GAACAAGAAGACCGGGAAACTGAGGCGAGTTCTCCTTCCCGTAATTACGAATCTATCTTA
AGTAGCAATGAACGCGGCGAGTCCGACATCAATCATTCGGATTTTAATACTGTTAAAGCA
TCCGATACTAAATATTATTTTAAGACGCACAATGATCAGCAGTCTCCTATATCTGGCCAG
GAATACCCCGGTGAAGCTTTACCATTGACACAGATAAATGTTAAAGAGGAACCTGATATT
GATACCCTACGTTGCTTAAAATGTAACGTATTGTTCAATGACAAAAATGATTTATTGGAA
CATGACAAAGCAGCGTGTGGTAATATTTTTAGAAAACATGAAGGACTAGCTGCTCAAGTG
GCTGAGACGGTGGCTCTGAATAGATTAGAAGCTGAAATGCGCGCATCTATACAAAGTGGG
GTAAGTGCGAGTGAAGATGAGGATTTCGGGAGAGAGGATCGGGAAGACAAAGCCTCTATA
AATGAAAATGACAGAAAAATCAGAGTTCGCACCGCACTTACCGAAGAACAACAGATGGTG
TTAAAAAGGCACTATTCGATCAACCCTCGACCGAATCGAGAGGAATTTAAGAAGATCGCA
CAGCAGATAGGCTTAGATAACCGAGTAGTACAAGTTTGGTTCCAAAATAATAGAGCCAGA
GTACGGAGGATGACTCAGGCGGTCGCGATATCTGATCAACCTCTAGATTTATCTACAAAA
AAATCGAATACCTCCGTTACTTCAAGCCCGTCACCTTCACCGACTTGCAGCATTTCAGTA
ACACATTCCGATTCCGAGGAAGCGGTTAATTTAAGTCAAAAATCTTCTCGCAGCACGACC
CCACATCGCGCTAACTACATAAATACGTATCCACATTCCAACTGCTCGTCCTCATCGTTC
ACGGATTTTCGGTTATCACCCTCACCAGGTGAAACTATGAACGGTTACAAAAGAATGTTG
CAACAGAAAATGCCCATCAATCCTATGATGCCGATGGACAAACTTCTTCATTACAACGAC
CTGAGTAACGGACGATCTCCAATTCTTAACATGCAAGTGCCCGAGAGGCAAGAATCCAGT
CCGTCTTACGACCGCCCAATATGGAACGAGGATCTCCAGACTCAAATCGAATTAGAAGAT
GAAACCACTGTACTTAAAAAGAGCAAAATAAAGGCTGGAAATGAGTTGAAAGAGGGAGAA
GGACAATTCGTTTGTGATCAATGTGATAAAACTTTTGTCAAGCAAAGTTCTCTCGCGAGG
CATAAATACGAGCACTCAGGCCAGCGACCTTACAAATGCTTGGAATGTCCTAAGGCTTTC
AAGCATAAGCACCACCTGACTGAACACAAGCGGTTGCACACCGGCGAGAAGCCGTTCCAG
TGCTGCAAGTGTCTCAAGAAGTTCTCTCACTCCGGCTCCTACAGCCAGCACATGAACCAC
AGGTTCGCGATCTGCAAGCCATACAGAGACTAG

Protein sequence:

MTCSAATVTATRIACGAWPTCGAPVDVSRRPCRGRGRVSLSEVMIVHSSLMGVVGWSERS
SAPWLADMRSKQVPTSTTSARRRAAFIRPLAGGGSEIKTCEEEEDRGSSPLGPGGPFPCS
HCRGAYPTREQLERHETLHAPSTQTCKICHKSFQNVYRLQRHMISHDDSAKLRKYKCNDC
DKAFKFKHHLKEHLRIHSGEKPFECANCGKKFSHSGSYSSHMTSKKCLVMNLKMGRIKPN
NPALNPDRSPSRKRANAMAASQLNNNIAPNGNSFLPILPKYNEAAAFFASMSSQENNFLR
PPLGQPGLNPFYMPPGMPMSPANGIAPYTFPTSLSQLFEQLASQHYQQRKIEIPSPKLVS
PPANPEDLIEEVVDEEDKRSEASAELVMDIDDDDNVTVKKEQEDRETEASSPSRNYESIL
SSNERGESDINHSDFNTVKASDTKYYFKTHNDQQSPISGQEYPGEALPLTQINVKEEPDI
DTLRCLKCNVLFNDKNDLLEHDKAACGNIFRKHEGLAAQVAETVALNRLEAEMRASIQSG
VSASEDEDFGREDREDKASINENDRKIRVRTALTEEQQMVLKRHYSINPRPNREEFKKIA
QQIGLDNRVVQVWFQNNRARVRRMTQAVAISDQPLDLSTKKSNTSVTSSPSPSPTCSISV
THSDSEEAVNLSQKSSRSTTPHRANYINTYPHSNCSSSSFTDFRLSPSPGETMNGYKRML
QQKMPINPMMPMDKLLHYNDLSNGRSPILNMQVPERQESSPSYDRPIWNEDLQTQIELED
ETTVLKKSKIKAGNELKEGEGQFVCDQCDKTFVKQSSLARHKYEHSGQRPYKCLECPKAF
KHKHHLTEHKRLHTGEKPFQCCKCLKKFSHSGSYSQHMNHRFAICKPYRD