DPGLEAN12696 in OGS1.0

New model in OGS2.0DPOGS209864 
Genomic Positionscaffold5089:- 6615-13667
See gene structure
CDS Length2553
Paired RNAseq reads  696
Single RNAseq reads  1791
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001744 (9e-73)
Best Drosophila hit  crooked legs, isoform A (6e-23)
Best Human hitzinc finger protein 658 (1e-44)
Best NR hit (blastp)  PREDICTED: zinc finger protein 228 [Pan troglodytes] (4e-52)
Best NR hit (blastx)  novel protein [Xenopus (Silurana) tropicalis] (2e-60)
GeneOntology terms



  
GO:0046872 metal ion binding
GO:0008150 biological_process
GO:0005634 nucleus
GO:0005575 cellular_component
GO:0003674 molecular_function
InterPro families


  
IPR007087 Zinc finger, C2H2-type
IPR012934 Zinc finger, AD-type
IPR015880 Zinc finger, C2H2-like
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL39581

Nucleotide sequence:

ATGAAGACGTCTTGGTTGAGCTTGTTAGGGTTACCGCAATGGGTTCCTCGTAGTTTGGTG
TGCTCTGACCACTTTAAGGACGACGATATTTGTGAAGTCGAAGGTGGCGAGAGGAAACTC
CTACCAGGAGCTGTACCTAAAGCTGTTACTACACCAGTGAACAGTACTGTTACACAGGTA
TGTCGTATATGCTTGGGGGCAGAGATGAGCGTCTACCCGATACTAAATCCCGATCTTCAG
CAGACATTTATATTACTCACAGGCATGAACATATACAAAGATGACTACCTGCCTCAGAAT
CTTTGTATCGAATGCATGCAAAGATTAAAAAATTGCTATAAATTTAGAACGCAGACGTTA
CAGGCGCAGACTATTATTGTGGATCTCATTAGAATAGGCAATTTCACAACAGAAGCTATA
AAATCATTAAATCACAATCCCAAATGGAATCTGCAAATGCAGAGGTTTGATGCGAACCAC
TGCGATGCATACATTAACAAAGAGGACGTGAACCCCATACAAGAGATCGTTATAGAGGAA
CAGGTTAAAATTGAAAAGGAATTTATTAACGAGCAAGACACAGTCGCTATGGGCTACACA
ACAGACGACAGTCTGCCACTAGAGACGCAGAGAACCAAAGCCAAGAAGACAAAGAAGAAG
AAAGAGAAGAAGATACAAGAACCGAAGGTAGATAGGAGGAGAAAGCCGTTCCTTAACGAT
GATCTGAATGAGAGTCTGTTCACTATCACCGATCTGACCTTGGAGGAACAGATAGCTGAT
ATCCAGAAGAGACAGGAGAGTTCTAACTTCAAGAATTCAGTGTACAAGTGTATGGAGTGC
TTTAAGGGTTTCCTTGATGAAGGAGCGTACAACGGACATATGACAAGGCATACTACTCAA
TGCGGTGAATATTGTTGTGAAATTTGTAAGACACATTTCAAACACTCGCACGCATTGAGG
AAACACACGACGGCGCATCACGCGCAGAGGTTCAATTGTAACCGGTGTGCTTTCGTTACT
ACACACAGACAAACAGCACGACTCCACGAGCGATGGCACAAGGGCACCAAATACGAGTGT
CCGCACTGTAATGAAGTGTTCCTTAAATTCACAACCTACATGGGACACATTCGCATCAAA
CACCCGTCGGATTTCGTGTGCGCTCTGTGCGGTTATTGCTTCGTCAGTCAGAAGGGGATA
GATTTGCACAAGAAACTGAAACACAGATTACATCTCGGACAGATCCCGGAGGACGGGCCG
CTCTGTGAGCTATGTGACGTGCGGTTCATATCACAAGAGGCTTACAAACGACACCTCAGC
GTCTCCGCGAGACACGCTGGCGACGAAATATCCAAGGATCCCAGCAAACCGAAACGCGGA
AGAAAATCAAGGGACGCCCTCGACAAAGACGACAGCGAAAAAAACGACTTAAACGACAAG
AAAGTGTATCCGAGTCAAGTCAGAAAAGCAGAAGGTCCTATACCGTGCGAGCAATGCGGT
ATGCAGTTAGAGGACTCGCGCGCCTACCACGCTCACTTCAGACGGAACCATCCCGACAAG
AACAGGACCAACTACCCCAGCATGAAGTCGCCCTGCATGTGCGAGGTGTGCGGCAGGATG
TTCCAGAGTCACGCATTGTTGAAAGACCATCGCTGGGTGCATACAAACGAGAGGCCATTC
GCCTGTGAGTGTGGCAAGAGGTTCCGTATGAAACAACGCCTGGTGGCGCACAGAAGGGTA
CACAGACAGACCAGGCACTACACATGTGCTCTGTGCGGGAAAGGATTCAGCACACACAGC
AACAGGCAGAGACATATGATTGTACATTCACACCGGGCTAAAACCATTAGCAAACCGAAA
CGCGGAAGAAAATCAAGGGACGCCCTCGACAAAGACGACAGCGAAAAAAACGACTTAAAC
GACAAGAAAGTGTATCCGAGTCAAGTCAGAAAAGCAGAAGGTCCTATACCGTGCGAGCAA
TGCGGTATGCAGTTAGAGGACTCGCGCGCCTACCACGCTCACTTCAGACGGAACCATCCC
GACAAGAACAGGACCAACTACCCCAGCATGAAGTCGCCCTGCATGTGCGAGGTGTGCGGC
AGGATGTTCCAGAGTCACGCATTGTTGAAAGACCATCGCTGGGTGCATACAAACGAGAGG
CCATTCGCCTGTGAGTGTGGCAAGAGGTTCCGTATGAAACAACGCCTGGTGGCGCACAGA
AGGGTACACAGACAGACCAGGCACTACACATGTGCTCTGTGCGGGAAAGGATTCAGCACA
CACAGCAACAGGCAGAGACATATGATTATTCACACCGGGCTAAAACCATTTAAGTGTGAG
ATGTGCGGCAAATGTTTTAAGCATGCCAGCGAGAAACGGGCTCACATAACATATGTACAT
CTCAAGAAGCCCTGGCCGAAGAGATCACGGGCGAAGAGACAAGGACAGAATATAACAGGT
ATGGCGAGCGCGAGCGAGATAGACATACAAGGATGGAACGATCCCAAGATAGATCTGATG
ATGGATAAACAGTATTTCAATGTCAAGATGTAG

Protein sequence:

MKTSWLSLLGLPQWVPRSLVCSDHFKDDDICEVEGGERKLLPGAVPKAVTTPVNSTVTQV
CRICLGAEMSVYPILNPDLQQTFILLTGMNIYKDDYLPQNLCIECMQRLKNCYKFRTQTL
QAQTIIVDLIRIGNFTTEAIKSLNHNPKWNLQMQRFDANHCDAYINKEDVNPIQEIVIEE
QVKIEKEFINEQDTVAMGYTTDDSLPLETQRTKAKKTKKKKEKKIQEPKVDRRRKPFLND
DLNESLFTITDLTLEEQIADIQKRQESSNFKNSVYKCMECFKGFLDEGAYNGHMTRHTTQ
CGEYCCEICKTHFKHSHALRKHTTAHHAQRFNCNRCAFVTTHRQTARLHERWHKGTKYEC
PHCNEVFLKFTTYMGHIRIKHPSDFVCALCGYCFVSQKGIDLHKKLKHRLHLGQIPEDGP
LCELCDVRFISQEAYKRHLSVSARHAGDEISKDPSKPKRGRKSRDALDKDDSEKNDLNDK
KVYPSQVRKAEGPIPCEQCGMQLEDSRAYHAHFRRNHPDKNRTNYPSMKSPCMCEVCGRM
FQSHALLKDHRWVHTNERPFACECGKRFRMKQRLVAHRRVHRQTRHYTCALCGKGFSTHS
NRQRHMIVHSHRAKTISKPKRGRKSRDALDKDDSEKNDLNDKKVYPSQVRKAEGPIPCEQ
CGMQLEDSRAYHAHFRRNHPDKNRTNYPSMKSPCMCEVCGRMFQSHALLKDHRWVHTNER
PFACECGKRFRMKQRLVAHRRVHRQTRHYTCALCGKGFSTHSNRQRHMIIHTGLKPFKCE
MCGKCFKHASEKRAHITYVHLKKPWPKRSRAKRQGQNITGMASASEIDIQGWNDPKIDLM
MDKQYFNVKM