DPGLEAN20216 in OGS1.0

New model in OGS2.0  DPOGS213246
Genomic Position  scaffold3323:+ 11221-15353
CDS Length  1776
Paired RNAseq reads  31
Single RNAseq reads  85
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA009525 (4e-17)
Best Drosophila hit  CG31670 (2e-34)
Best Human hit  fez family zinc finger protein 1 isoform 2 (3e-33)
Best NR hit (blastp)  PREDICTED: similar to CG1402-PA [Apis mellifera] (2e-97)
Best NR hit (blastx)  PREDICTED: similar to CG1402-PA [Apis mellifera] (2e-95)
GeneOntology terms
GO:0005622 intracellular
GO:0003676 nucleic acid binding
GO:0008270 zinc ion binding
GO:0019827 stem cell maintenance
GO:0050767 regulation of neurogenesis
InterPro families
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR007087 Zinc finger, C2H2-type
IPR015880 Zinc finger, C2H2-like
Orthology group  MCL10884

Nucleotide sequence:

ATGCGGCCGGGAGATGAGATAATAAGTATACGGGAGGTATGCCTCGTCGACAAGTACGAT
AAAAAATATTTGGGCGGTGAAACATTGCTTTCTACTTCAGGTCGAAATGATTTGGAGGTT
CCTAGGTACCTCCGTGACAGTCTCTGTGAGGGTCTCGAGGCGAGATTTGCGTGGACCGAG
GAAGATGGGAATATCGCTTCCTTAGTTTCTGAACCAGGATTTGGAAGCGGAACCTTTGGC
AAGAGCCTTGATGCGACTTTGACTAGTGAAATTAAGGAAGCTGACACCAGTTTTCCATCT
ACCAGCAACACGTTTGATGGAAAGTCTTTTGTGGAACCATCTTTAGTGTCTTCTACGTCT
TCCGGATCCAGTCTGAACGCCTCGTCGGAACGTTCGAAACGCTCATACAAAGTTGACAAA
ACCGACCCACGTTCAAGATATCACTACGTAAAATACGTTAAAAGACACGGGCGCACCGTC
AAACTCTGGGAGTGCGGAATATGTTCTAGGGAGTTTCAACATCAGTATACCTTAATGCGT
CATCTCCCCACACACACGGACGAAAGGAATTTCCACTGTGACGCCTGCGCTAAGAGCTTC
CGTCAGCTGTCCACGCTCAGCCAACACAGAGCCATACACTCCGCTGAAAGACCATACGCC
TGTGAGGTATGCAACAAGACTTTCAATCGAGTATCAACCCTGATCTCGCACCGCAAAACC
CATTCCAACGAGAAACCATATAGATGCCACATTTGTCCAAAAGGCTTCCATCAAAAAGGA
AATCTCCGTAATCATTTATTCACTCATACCAATGAGCGACCCTACCGATGCAATATTTGC
ATGAAGGGCTTCAATCAGCAGTCGAATCTGGTGTGCCACAAAAATAAGGCCCATCCAGAA
GAAAATGGCAGCAGTAATGGAAGAAATGTGAATCAGCCGCGAAGAGTTACACAACCTCAA
CCAGAACCACAGACGGACAATAGATCATCACAGCAGTGTGAAGTGACGTCATCAGTAGGT
CCGTACATACCAGAACCGAAGTCGTGGTCAAATTCGAAACCAAGCTGGTTGTCGAAACCT
GATAATGATATATGGAATGAGGTTTCGTGGGGTAACAATGGGGTTATCGTGGATCCTATA
AACACTTATCACATGGGGGTTGCCATAGCAACCAGACAGACTCCTTTCGCGCTACTAAAG
TCTGATACGGGAACTCCTGTGTTGGTGAAAGTAGTCGATACAAAGCTTCCCGGCGGCAAA
CAGATGCTAGTACCGGCTACGGCAGAGGATTTGCGTGTTGGTAGTAAAATAATTTTGGAC
AATCAAGAAAGTCCTGCAGTGGATGTCCAATCTTCCGACGCTAACGCGGTTCAGATCAGG
GTGCCGGTTGTGGCGACTGTGGTCCCTAAAATGAAACCGGGTGGTAGACTCCAGTTATCA
GTAGAAGAACCCCATCATTCATACCATTCAGCTCTACCGACTGACATTGGTGAAGTGAAG
GTAGAGCCTTGTACAAGTCCGGCATCGAACCCGCTGCCTGATGCCAAACAAGTCGATGGA
ACATTCCAACCGAGCATAAAACCTGGTCGCTCATGGATCACACCGGCCTCCCCACCACTG
GACCTCATCCCCCTAGACCTGTTTGAGCCCATGGGGTGTATACCACTAGGGCCCCAAATA
ACATCAGTAGATATCGACCAGCCCCCACATTCTGATGATTCTGACATATTTATAGGAAAG
TTCGAGGAAAGTATCCCTTTAACTGATTCTGACTGA

Protein sequence:

MRPGDEIISIREVCLVDKYDKKYLGGETLLSTSGRNDLEVPRYLRDSLCEGLEARFAWTE
EDGNIASLVSEPGFGSGTFGKSLDATLTSEIKEADTSFPSTSNTFDGKSFVEPSLVSSTS
SGSSLNASSERSKRSYKVDKTDPRSRYHYVKYVKRHGRTVKLWECGICSREFQHQYTLMR
HLPTHTDERNFHCDACAKSFRQLSTLSQHRAIHSAERPYACEVCNKTFNRVSTLISHRKT
HSNEKPYRCHICPKGFHQKGNLRNHLFTHTNERPYRCNICMKGFNQQSNLVCHKNKAHPE
ENGSSNGRNVNQPRRVTQPQPEPQTDNRSSQQCEVTSSVGPYIPEPKSWSNSKPSWLSKP
DNDIWNEVSWGNNGVIVDPINTYHMGVAIATRQTPFALLKSDTGTPVLVKVVDTKLPGGK
QMLVPATAEDLRVGSKIILDNQESPAVDVQSSDANAVQIRVPVVATVVPKMKPGGRLQLS
VEEPHHSYHSALPTDIGEVKVEPCTSPASNPLPDAKQVDGTFQPSIKPGRSWITPASPPL
DLIPLDLFEPMGCIPLGPQITSVDIDQPPHSDDSDIFIGKFEESIPLTDSD
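
Consistency check (illustrative):

The listed CDS Length of 1776 corresponds to 1776 / 3 = 592 codons, which would give a protein of 591 residues plus a terminal stop codon. The sketch below shows one way to check that the nucleotide and protein sequences above agree. It assumes the standard genetic code (NCBI translation table 1); the names `translate`, `first_line`, and `last_line` are hypothetical helpers introduced here and are not part of the record. Only the first and last lines of the nucleotide sequence are used, to keep the example short.

```python
# Minimal sketch: translate a coding sequence with the standard genetic code
# (NCBI table 1). This is an assumption; the record does not name the table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon -> amino acid map; '*' marks a stop codon.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, ignoring whitespace."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence listed above.
first_line = "ATGCGGCCGGGAGATGAGATAATAAGTATACGGGAGGTATGCCTCGTCGACAAGTACGAT"
last_line = "TTCGAGGAAAGTATCCCTTTAACTGATTCTGACTGA"

print(translate(first_line))  # MRPGDEIISIREVCLVDKYD
print(translate(last_line))   # FEESIPLTDSD*
```

Both translated fragments match the start and end of the protein sequence above. Passing the full nucleotide sequence to `translate` would allow the same comparison over the whole record.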