DPGLEAN16212 in OGS1.0

New model in OGS2.0  DPOGS202570
Genomic Position  scaffold1759:+ 33629-36306
CDS Length  1215
Paired RNAseq reads  118
Single RNAseq reads  356
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA008429 (1e-09)
Best Drosophila hit  CG6654 (1e-51)
Best Human hit  zinc finger protein 135 isoform 2 (2e-57)
Best NR hit (blastp)  PREDICTED: similar to novel KRAB box and zinc finger, C2H2 type domain containing protein [Tribolium castaneum] (1e-69)
Best NR hit (blastx)  PREDICTED: similar to novel KRAB box and zinc finger, C2H2 type domain containing protein [Tribolium castaneum] (3e-76)
GeneOntology terms
GO:0005622 intracellular
GO:0003676 nucleic acid binding
GO:0008270 zinc ion binding
GO:0008150 biological_process
InterPro families
IPR007087 Zinc finger, C2H2-type
IPR015880 Zinc finger, C2H2-like
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group  MCL40274

Nucleotide sequence:

ATGAAAGAACATGTTGCATCTAGCGCAATATATGTGTGTGATATTTGCAAGAAAAATTTC
TATTTAGAGACAGTTTTCAAACATCATATGGAAGAACACAGTAAACCGCAGACAAATCCA
AATCAATGCATGAAATGTCTCGTTAATTTTGTTAGCACCGAAAGCCTTTTACAGCATATC
GACAGCTGCAATACTAACAGGGAAGTGAAGTTGGAGAATTACAATGAATACGAATATTTG
GACTCCGATGTGATATTCAAAGATAAATCATATTTAGATAATTTAGAAAATCAAGAGGAG
GATAAACGTAAGAAAATATTCAAATGCGACAATTGTAGTAAGTCATTCTCGTTGAAGACG
TTGCTGAGACGTCACATGAGGCTACATTCCACTAGCAAACCGTTCCAATGTACGAAGTGC
TCCAAGTGTTACACACGCCAAGACCAGCTGGCGGCACACATGAGAATTCATGACGGATAT
AAACCGTATGCCTGTCCACATTGTAGCAAAGCATTTTCCCAGCTGTGCAGTCTTAAAGAC
CATGTCCGTACTCACACAGGAGAGACGCCGTTTCTGTGTTCCCAATGCGGCAAGGGCTTC
GCTAACAGTTCCAATTTAAGACAGCATTTAAGAAGGCACACTGGTGTGAAACCGTTTGCT
TGTAGTCTATGCCCTAAGACATTCTCAACCAAAGGTCAAATGAAACAGCACATAGACACA
CACACAGGCGTACACCCGTACAAGTGTAGTGTTTGCGGCGCCTCCTTCACTAAACCTAAC
TCGTTAAAGAAACACAAATTAATACATCTCGGCGTGAGACCGTTTGCTTGCGACACTTGT
AATATGAGGTTTACATGCAAGGACCACCTGACTCGCCACAAAAGGATTCATACCGGAGAA
CGGCCGTACCGCTGTACACACTGCACTCGGACCTTCACACAGAGCAATGACCTCAATAAG
CATGTGCGGGCCCACCTCGGACAGAATATCTATCAATGCACCGTATGTCAAGCTAAATTC
AGATTAATGAGAGAATTAAAAAGCCACTACCCGGTGCATTACATCAACGACCAAGGGGAG
TCACAGAGCGAGCCGGTGAAGAAAGACAAACAGACAGACGGACAGATCACTATAACATTC
AATAGAAATGTATTAGATAAAGACAGTTTAGGAGATATCACCATAAACATAACACCAGAC
AAAATAACCAACTGA

Protein sequence:

MKEHVASSAIYVCDICKKNFYLETVFKHHMEEHSKPQTNPNQCMKCLVNFVSTESLLQHI
DSCNTNREVKLENYNEYEYLDSDVIFKDKSYLDNLENQEEDKRKKIFKCDNCSKSFSLKT
LLRRHMRLHSTSKPFQCTKCSKCYTRQDQLAAHMRIHDGYKPYACPHCSKAFSQLCSLKD
HVRTHTGETPFLCSQCGKGFANSSNLRQHLRRHTGVKPFACSLCPKTFSTKGQMKQHIDT
HTGVHPYKCSVCGASFTKPNSLKKHKLIHLGVRPFACDTCNMRFTCKDHLTRHKRIHTGE
RPYRCTHCTRTFTQSNDLNKHVRAHLGQNIYQCTVCQAKFRLMRELKSHYPVHYINDQGE
SQSEPVKKDKQTDGQITITFNRNVLDKDSLGDITINITPDKITN
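
Consistency check (sketch):

The nucleotide and protein sequences above can be cross-checked against the listed CDS length. A 1215-nt CDS corresponds to 405 codons, that is, 404 residues plus a stop codon. The following is a minimal sketch of such a check, not part of the database's own tooling. The function name `check_cds` is hypothetical, and the stop codon set assumes the standard genetic code.

```python
# Hypothetical helper: basic consistency checks between a CDS and its protein.
# Assumes the standard genetic code stop codons (TAA, TAG, TGA).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(cds, protein):
    """Return a list of problems found; an empty list means the pair is consistent."""
    # Remove line breaks and spaces so wrapped sequence blocks can be pasted in directly.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()

    problems = []
    if len(cds) % 3 != 0:
        problems.append("CDS length is not a multiple of 3")
    if not cds.startswith("ATG"):
        problems.append("CDS does not start with ATG")
    if cds[-3:] not in STOP_CODONS:
        problems.append("CDS does not end with a stop codon")
    # One codon per residue, plus the terminal stop codon.
    if len(cds) // 3 - 1 != len(protein):
        problems.append("protein length does not match CDS length")
    return problems


if __name__ == "__main__":
    # Toy input: Met, Lys, stop.
    print(check_cds("ATGAAATGA", "MK"))
    # Expected residue count for the CDS length listed in this record.
    print(1215 // 3 - 1)
```

To check this record, paste the full nucleotide and protein blocks as the two arguments; whitespace and line breaks are ignored.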