DPGLEAN12646 in OGS1.0

New model in OGS2.0DPOGS203885 
Genomic Positionscaffold697:+ 27125-40848
See gene structure
CDS Length1233
Paired RNAseq reads  340
Single RNAseq reads  751
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003834 (4e-81)
Best Drosophila hit  crooked legs, isoform D (3e-26)
Best Human hitzinc finger protein 717 (2e-27)
Best NR hit (blastp)  hypothetical protein BRAFLDRAFT_208499 [Branchiostoma floridae] (4e-34)
Best NR hit (blastx)  hypothetical protein BRAFLDRAFT_57705 [Branchiostoma floridae] (1e-42)
GeneOntology terms


  
GO:0005622 intracellular
GO:0003676 nucleic acid binding
GO:0008270 zinc ion binding
GO:0008150 biological_process
InterPro families


  
IPR007087 Zinc finger, C2H2-type
IPR012934 Zinc finger, AD-type
IPR015880 Zinc finger, C2H2-like
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL39783

Nucleotide sequence:

ATGTCAGTAGATCTGTTACATTCAGATTCATATAATTTTAGTTACTTTCAACAAGACGAC
GTATGCAGACTTTGCTGGAGTAGGAACGCTTTCACACAAATTATAGAAATATCTCCGAAT
ATGTCGTGTGGAAATGAGAATTATTTTCAAAAAATAACAGAATGCTTGGATATAGATCTA
ACAAAATATGATCATCCTAACAAAGCATGCGACAGTTGCTTGGATCAAATTAACAAATTT
CATGACTTTAAAAAATTTTGCCAAGAAACAGATAGGAGGTTGAGAGAAATTTTTGAAAAT
CAACACAACATCATCAAAAAAGTCGGAAGACAAAGCACTATAGTGGAGATCTTTGATTGT
TTACAAACCGACAGCGAAAACGAAAAAAAAGAAATTAAAAAATCTTGGCGGTACAAACCG
AAGCGAACACCTACGTATTGCAATATATGTAGAATAGATTTTAAAACTTTAGAAAAATTC
AGCGAACACAGTTCTCAAGAGCACGGCATCGAAAGTGGGCTGTACAAATGTTTTGGTTGC
GAGAAGAGGTTCAAAAATCGAAAAACGAGACTTGGCCATGAGCTGAAAATTTGTAAAAAT
CTTAAAAATGGGTATAGATGTGGCATTTGTAATAGATATCTCCCGAGACGAGGCTTGTAC
GAGACACATATGAGAGACCACAGAGGGAATGTACCAATGAAGCTTCCGAATGAGCTATTC
AAGTGCAGAAAGTGTGACAAAGTGTTTGACACAAACGACAATCTCTCGAGACATGTCTCC
GAACATGACTTGAATGAGGACAATTATATATGTGAGAAATGTGGTCGCGTATTCACAAGG
AAGGACTACCTGCACAAGCACAAACTAACGCACACAGGCGAAAAACAGCACACATGTCCG
CACTGCGACTTCCGGACGATACAGAGGTCGTCGCTGATTGTTCATATAAGGAAGCACACC
GGCGAACGTCCCTACAAATGTAGCGTGTGTCCGCAACGGTGCATCTCCAGTTCAAACCTG
AGAGCACATCAGCAAAGACACTTGGGTCTCAAAGTTCATGAGTGTACAATCTGCAATAAA
AAATTCGGTTATAAAATAAGTTTAAAAGAGCACATGTCGACGCATGCTCCGTCGAGTTAC
TCTTGCGATCAGTGCAGCTCGACTTACTCGAGATTGAGAGGGTTAAGGCGACATGTGCTG
ACGAAACATGGAACCAGAAAGGAGGGACTATGA

Protein sequence:

MSVDLLHSDSYNFSYFQQDDVCRLCWSRNAFTQIIEISPNMSCGNENYFQKITECLDIDL
TKYDHPNKACDSCLDQINKFHDFKKFCQETDRRLREIFENQHNIIKKVGRQSTIVEIFDC
LQTDSENEKKEIKKSWRYKPKRTPTYCNICRIDFKTLEKFSEHSSQEHGIESGLYKCFGC
EKRFKNRKTRLGHELKICKNLKNGYRCGICNRYLPRRGLYETHMRDHRGNVPMKLPNELF
KCRKCDKVFDTNDNLSRHVSEHDLNEDNYICEKCGRVFTRKDYLHKHKLTHTGEKQHTCP
HCDFRTIQRSSLIVHIRKHTGERPYKCSVCPQRCISSSNLRAHQQRHLGLKVHECTICNK
KFGYKISLKEHMSTHAPSSYSCDQCSSTYSRLRGLRRHVLTKHGTRKEGL