DPGLEAN13819 in OGS1.0

New model in OGS2.0DPOGS202118 
Genomic Positionscaffold495:+ 63325-72850
See gene structure
CDS Length3243
Paired RNAseq reads  302
Single RNAseq reads  704
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006961 (3e-145)
Best Drosophila hit  CG32778 (2e-40)
Best Human hitmyelin transcription factor 1-like protein (6e-45)
Best NR hit (blastp)  PREDICTED: similar to CG32778-PA [Nasonia vitripennis] (3e-147)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC004325 [Tribolium castaneum] (2e-124)
GeneOntology terms








  
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0008270 zinc ion binding
GO:0030154 cell differentiation
GO:0005515 protein binding
GO:0007275 multicellular organismal development
GO:0016563 transcription activator activity
GO:0046872 metal ion binding
GO:0006355 regulation of transcription, DNA-dependent
GO:0005634 nucleus
GO:0007399 nervous system development
InterPro families  IPR002515 Zinc finger, C2HC-type
Orthology groupMCL11015

Nucleotide sequence:

AGTCACACAACGTTGTCGCAAATTAATAAAAGACGTTTAGAATTTGGAGTTGATATAGAA
ACGTTCAGAGTGAACGATCGAAGACAAGATCATAGTGACAGTGAAGCCTCAATGGACGAA
GAAGCAGCGAAACGGCGACGACGACTCCTTACAGAAGAAAGAAGAGAAGAAACAGATTCG
TACTATAGAAATTATAACACTCATCGAACTAACATCCTACGTCACTCCGACGAGGATGAC
GATTCACTTGTTCAAAAAACTCAAAACGCTCTAAAAACTTTAAGTTGCTGGGCACGAGAC
CACGCTGCTATCCCTAACGATCAAAGCGAAGACGCTAACGACTTTGATAACATTTTCAAC
GATAAACGTTCCGACAAAATGTTGCCATCATCACCATCTTTGTCTTCTGACAGTGCTGAG
AACTCACGCCGAAGCTTCTCTTTGCATTGTCAAAACGACGAACTAAATGCTGCCATGAAT
GAAAGGATGGATCCCAGGGAAAATACTGAACGGCCACCAAAAATGGAAAATGACAACAGT
ATGGACAATTACAAGATAGACAAAAGTCCTAATCCTTATGATGCTTCTAATTTCGATGAA
TTAGCGGATTCTTCATCAAACGAACTAGAAATAGACATGTCAGACAGAAACGAAGACAAA
TACGAAGAGGACAGAGAAGCGAAACGTAGGCAGGCTGAAATAATCTCAGTTAACAAGCAA
TCTCTTTACAACGCTTACAAAGCTGCCACAGCTTCATTGCCGTATTCGACCCAATCCGCC
TTCAAGCCACCAGCGGAAGTGAAACACAAGATCCACGGCTCCAGTTTTCCCAGCGAGCCA
TTCGGTGGCTATTCTAATGATCACGAGAAGAGTGCGATTCATAAGGGTTCGAAGCAGTAC
ACGGTGCTGCAGCCAGCCGCTGCTGGGTCAAGGGCTGCTACAGCGTTGCAGGAGGCTCGC
ACTGTACCATCAGCTCAACCGGCGCGAGACTTGCGACCCATCAACCCTCTCTCCCCACCA
GCCGTCCGAGAAGGCAACAAGTGTCCGACTCCTGGTTGCAATGGGCAGGGCCACGTTACA
GGCCTCTATACGCATCATCGAAGTATTTATACATTTCATCCAGTTCAATTGCCCGCACCG
GCTCCTGTGACTCAAGTGTCAGCCAGCACGTCAGATAGCCACACTTCAGACGGTTCCCGC
GATCGGGTTCCAGCGGCGTCGCCGCAAACTCCGGCCGTGAAGCGCGAAGCCCCCGAACTG
CTGGTGCCGAAGCGTGAGGCCGCCGAGCCCGAGCGCGACTCCCCCGGCATGGAGACGCGC
CACGCCGGCTACGGCGCGCCGCCCGATCAGCGCTCACCATACGAACGACCGCCTGACGAC
CACGTACGGTCATACAGTCAAATGAACGAAGCTCGGTACGGATACGAAGCTAGGTGCTAT
GAGGGCGCCCCAGCTTTTGAGAGATATGACCCAGCTCAATGCCCTCAGAGGCCTTACGGT
TGGGAAGAAGAACGATACCATGACCCTCATTTGCCAACGCCAATGAAAACGGACCAATCA
GAACAGGAAACTAATTCTGGACCTATATATCCTAGACCAATGTACCATTACGAAGCTGGC
GGTGTAGGCGCCGTGGGCGGGGTGGGCGCTATGGGCCCAGGCGTTCCCCCCGGCTTCTCC
GCTATCAATCTCTCAGTGAAGATAGCCGCAGCTCAGGCTCAACGTCCTCGAAGTCCCACA
CCCAGGGATCCTCGTGATCCGCGTCCGGCTATAGATCTATCCACATCTAGTGGCAGTCCA
CAGGGTCCATATGCGTCACCGGTATACACGAGCGCCGGTGGTGGCAGTGGGGGCGGTGCA
CGGGGAAGCCCGCAGCCGGGCGCTTCGCCCCAACTTACGGCAAGTCCCCAAGTTCCCAGT
CCACAAGGCCAGACCCTCGACCTTAGTGTGTCCCGTTTACCACATAGTAGAAGTTTTCCG
GGTGGTGTTTCATACAGTCGAGAATCAACGCCGGATAGCGGTGGAAGCCATCCATATCTT
GAAGCATACCATCGCGACACAGCCGGGTACGGTGGTGTAAGCCCTCACCCGGTAGCCGGA
TACGGTCTTGCGCAGCCGGATTACGCAGCTGCTGCTGCCGCTGCCGGATACGGTGGCTAT
CAGTACCAATGCGGGGCATACCCACCCCCGCCCGCGTACCCCCCGCACGCGCCGCCGTAT
TCACCACCGTGCTATATGCCGCCGCCGCACGCACCGCACGACAAGCCCAAGGATAGTCTG
TCCGGGTGTCCTCGTGCTGATCGCTCTCAACTGCAGCCACATTCTCAAGAACTGAAGTGT
CCCACACCAGGTTGCGACGGCTCCGGGCACGTTACCGGGAACTACTCCTCCCATCGATCA
CTATCAGGTTGCCCCAGGGCTAATAAACCGAAAAGCAAGCCCAGGGATGGCCAAGATTCT
GAACCGCTCAGCGCGTCGGGCTGCCCTATTGCAAATCGGAACAAAATGCGGGTTCTAGAA
AGCGGCGGCACAGTTGAGCAGCACAAAGCGGCAGTGGCGGCAGCGGCATCCGCTATTAAA
TTCGATGGCGTGAACTGTCCTACCCCGGGATGTGATGGATCGGGACATATAAACGGTTCG
TTTCTAACCCATCGTTCGCTATCCGGCTGTCCCGTAGCCGGTGCAACCACACCGACGCCT
CAACCAAAGAAACCGAAATATCCTGATGATATCACTCCGCTATACCCCAAGCCCTATTCA
GGTATGGATATTAACATGCAGACAGGAAACGGCGAAGATTTAATGACACTGGAGCAAGAA
ATTACTGAACTCCAGCGTGAAAATGCAAGAGTGGAATCACAGATGATGCGTCTGAAATCG
GACATAAACGCGATGGAGTCACACTTGAGCCATGGAGAAAGGGAGAATCAGCTCATCATT
CATCGCAACAGCAATCTGAATGAATATTACGAAAGCCTTCGGAACAATGTGATCACGTTG
CTGGAGCACGTTAAGATACCAGGAGGAGGTACGGTGCCCGTATCAACTGCCCCCGGAACT
CCCGGAGCTGCACCCCCGACTGGCCCTGGTGATAAACCCGCCCACGATAACTTCGACTCT
TATCTCACCAAGCTGCAGACCCTATGCTCCCCGGAAGGATACTGCCCCGATGAGAATCGA
CCGATCTATGAGACCGTTAAAAACGCGCTCCAAGACTTCACAGTGCTACCAACGCCGATA
TAA

Protein sequence:

SHTTLSQINKRRLEFGVDIETFRVNDRRQDHSDSEASMDEEAAKRRRRLLTEERREETDS
YYRNYNTHRTNILRHSDEDDDSLVQKTQNALKTLSCWARDHAAIPNDQSEDANDFDNIFN
DKRSDKMLPSSPSLSSDSAENSRRSFSLHCQNDELNAAMNERMDPRENTERPPKMENDNS
MDNYKIDKSPNPYDASNFDELADSSSNELEIDMSDRNEDKYEEDREAKRRQAEIISVNKQ
SLYNAYKAATASLPYSTQSAFKPPAEVKHKIHGSSFPSEPFGGYSNDHEKSAIHKGSKQY
TVLQPAAAGSRAATALQEARTVPSAQPARDLRPINPLSPPAVREGNKCPTPGCNGQGHVT
GLYTHHRSIYTFHPVQLPAPAPVTQVSASTSDSHTSDGSRDRVPAASPQTPAVKREAPEL
LVPKREAAEPERDSPGMETRHAGYGAPPDQRSPYERPPDDHVRSYSQMNEARYGYEARCY
EGAPAFERYDPAQCPQRPYGWEEERYHDPHLPTPMKTDQSEQETNSGPIYPRPMYHYEAG
GVGAVGGVGAMGPGVPPGFSAINLSVKIAAAQAQRPRSPTPRDPRDPRPAIDLSTSSGSP
QGPYASPVYTSAGGGSGGGARGSPQPGASPQLTASPQVPSPQGQTLDLSVSRLPHSRSFP
GGVSYSRESTPDSGGSHPYLEAYHRDTAGYGGVSPHPVAGYGLAQPDYAAAAAAAGYGGY
QYQCGAYPPPPAYPPHAPPYSPPCYMPPPHAPHDKPKDSLSGCPRADRSQLQPHSQELKC
PTPGCDGSGHVTGNYSSHRSLSGCPRANKPKSKPRDGQDSEPLSASGCPIANRNKMRVLE
SGGTVEQHKAAVAAAASAIKFDGVNCPTPGCDGSGHINGSFLTHRSLSGCPVAGATTPTP
QPKKPKYPDDITPLYPKPYSGMDINMQTGNGEDLMTLEQEITELQRENARVESQMMRLKS
DINAMESHLSHGERENQLIIHRNSNLNEYYESLRNNVITLLEHVKIPGGGTVPVSTAPGT
PGAAPPTGPGDKPAHDNFDSYLTKLQTLCSPEGYCPDENRPIYETVKNALQDFTVLPTPI