DPGLEAN20032 in OGS1.0

New model in OGS2.0DPOGS205405 
Genomic Positionscaffold1443:- 13448-15511
See gene structure
CDS Length2064
Paired RNAseq reads  323
Single RNAseq reads  1000
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001408 (0.0)
Best Drosophila hit  disco-related (7e-39)
Best Human hitzinc finger protein basonuclin-2 (4e-21)
Best NR hit (blastp)  hypothetical protein Phum_PHUM243450 [Pediculus humanus corporis] (1e-68)
Best NR hit (blastx)  hypothetical protein Phum_PHUM243450 [Pediculus humanus corporis] (3e-68)
GeneOntology terms


  
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005634 nucleus
GO:0045449 regulation of transcription
GO:0008270 zinc ion binding
InterPro families
  
IPR007087 Zinc finger, C2H2-type
IPR015880 Zinc finger, C2H2-like
Orthology groupMCL39534

Nucleotide sequence:

ATGGAAGGCAAAGAGTTTGGTGCGAAGCTTCCCGCCGGGCTGGATTACTTCATGATGCCG
CGCGTAACGCAGACGGCGCCGCAGTTCGATTTTAGAAAATTAGGCGCCAGTTTCAGTGGC
CGCGATGATGAGAAACACGACGAGCCCGAGTATCCTCAGGAGCGTCGCGAGCCGCCTCCC
GCGCCCTGGCCGCTGGGACTCGGGGTGCAATTCGTCAACCCCGCGACTGGCAAGAAAAGA
GTGCAGTGCAACGTTTGCCTTAAGACTTTTTGCGATAAAGGCGCCCTAAAGATTCACTTC
TCGGCGGTTCACCTCAGGGAAATGCATAAGTGTACAGTGGAGGGGTGTACCATGATGTTC
AGTTCGCGTCGTTCTCGTAACCGCCACAGTGCTAACCCTAATCCTAAATTGCATTCGCCG
CATGTGAGACGTAAGATATCGGCGCATGACGGTAGATCGTCGCAGCCGTTCCCGCTCCTG
CCGGCTCTGGCGCGTCTCCCGCTACCACCGCCAGCGCTTCTCCCGCCAGAGCTGGCGTCA
CGTCTTCCACACCCACTGGCCCCGCCGCCTGGACACACACACCTGCCCCCGCGAGTGCCT
CTCGATGATCTCCGTAACTTCAGTGAAATAGAAAAAATGTATAGAAAAATACCATCGCCA
GAAAATTACACACGTCACCCGTTAGACCTCGCTCGATCGTCGGCTTCGCGCGAGGAGACC
TCGAATAATTACGTCGATAATAACGAGGACGTCTCGGAAGATGAGAACTGTCAAAAGCCT
AAAATTGAAAACGAGTGCCCTAAAGAGGAAACGTCTTGTTACAATGACGAACCCGAAGAT
CTGAGTGTAAATAAAAAGAAAAATATACAGACTGCACCTGTCGTGCTCACAGACTCTAGT
GGTCGTACAGAGCCAGCCACTTCGGATGATAAGTCGTCGCTGGTGCCTAACAAGCGGAAA
AGAAAGAGCGGCAATCCGACGAGATGCTCACAGAGTAACGAGTATAGTGCTTCCGACGAA
GAATACAATAGTGATTTATTTAGGAATCTATCTACTACTAGTCATGTAGAAGAAGAGCCC
TTGTCCTTAAAAAAACAGAAACCTGAGAAATGGGAGCCCTTTCAGAACTCCGAAGAACCG
GTTGTGAACGCGGAATCACTCAAAGTGAAGGCTGAGAGTGAATCGGACGACGAGTCCAGT
GCGCCCAGCGTCGTGCGTGAGGGTCTGAGGCTGAGATCAGATCTGTATACACCAAGTGAT
AGCGGGAGCGATTTACACACGATCGAGGAGAGACTCGCCCGAGTGAGGTCACCGTCCGCC
AGCAGTGACAGGACAGACGATAGGAACGACGTGAACGATCTCGAAATACCCATAGACGAA
GAGAACCCTGACAGATGCACCGCCTGCGGTAAAATATTTCAGAACCATTTCACACTACGC
ATGCACTACAGGAACGATCACCTGAAGCTACTCCACCCGTGCGATGTGAGCGGCTGCGAC
GCCGCCTTCCCCTCGAGGCGGAGCCGCGACAGGCACAGCTTGAATGTAGATCTCCACAGA
CGACTGCTGTCCACTGGGTCGCCTCAAAACCCAGACCAAAGGCCGGCGTTCGAGGTGAAC
GCTGAATTGTTGAATAAATTATACGCCGACATAAAAGGGCTCGCGTCAACCCTCGAGACC
TTGAGATACGGCGAAGAGTCTTCCCAAGTTCCTAGCTATGTCTCCGAGACAATGAAGTTT
TATAGTAGAAACTTGACGGCTCTGCAGGCGGGTCTGTTTCCGCAGCTTGATCGAGGTTTT
TTTCCGAGCCCCTTTCTGATGAACGGCGCCCCTCAGCCCGGGGGCCTGTATGCTCCGGGA
GCTCAGTCGACTCGAGAGTCCCTGTCGCCGCTGTCCGCCTCATCTCCCCCGGTAATATCT
CCGGGACGACTGGACGCAGCTCACGCCCGGCCCCGGGATGACGCCAAGCTGCTTTTTCGG
GAGACCTCGGAGTCCTTCAAGAAGATGACCTCGCTCTGTGAGCGCCAGGAACAGCTATAT
CAGCACCACGTCCCTGTATCATGA

Protein sequence:

MEGKEFGAKLPAGLDYFMMPRVTQTAPQFDFRKLGASFSGRDDEKHDEPEYPQERREPPP
APWPLGLGVQFVNPATGKKRVQCNVCLKTFCDKGALKIHFSAVHLREMHKCTVEGCTMMF
SSRRSRNRHSANPNPKLHSPHVRRKISAHDGRSSQPFPLLPALARLPLPPPALLPPELAS
RLPHPLAPPPGHTHLPPRVPLDDLRNFSEIEKMYRKIPSPENYTRHPLDLARSSASREET
SNNYVDNNEDVSEDENCQKPKIENECPKEETSCYNDEPEDLSVNKKKNIQTAPVVLTDSS
GRTEPATSDDKSSLVPNKRKRKSGNPTRCSQSNEYSASDEEYNSDLFRNLSTTSHVEEEP
LSLKKQKPEKWEPFQNSEEPVVNAESLKVKAESESDDESSAPSVVREGLRLRSDLYTPSD
SGSDLHTIEERLARVRSPSASSDRTDDRNDVNDLEIPIDEENPDRCTACGKIFQNHFTLR
MHYRNDHLKLLHPCDVSGCDAAFPSRRSRDRHSLNVDLHRRLLSTGSPQNPDQRPAFEVN
AELLNKLYADIKGLASTLETLRYGEESSQVPSYVSETMKFYSRNLTALQAGLFPQLDRGF
FPSPFLMNGAPQPGGLYAPGAQSTRESLSPLSASSPPVISPGRLDAAHARPRDDAKLLFR
ETSESFKKMTSLCERQEQLYQHHVPVS