DPGLEAN18181 in OGS1.0

New model in OGS2.0  DPOGS205470
Genomic Position  scaffold3919:+ 8637-10809
CDS Length  1401
Paired RNAseq reads  148
Single RNAseq reads  389
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA008429 (1e-20)
Best Drosophila hit  CG6654 (2e-48)
Best Human hit  zinc finger protein 180 (4e-54)
Best NR hit (blastp)  PREDICTED: similar to novel KRAB box and zinc finger, C2H2 type domain containing protein [Tribolium castaneum] (1e-65)
Best NR hit (blastx)  PREDICTED: similar to novel KRAB box and zinc finger, C2H2 type domain containing protein [Tribolium castaneum] (2e-72)
GeneOntology terms
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0008150 biological_process
InterPro families
IPR007087 Zinc finger, C2H2-type
IPR012934 Zinc finger, AD-type
IPR015880 Zinc finger, C2H2-like
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group  ND

Nucleotide sequence:

ATGTTCGGTATATTTATGGAAATAAATTCTACCAGTGCTGCACATATTTTGGCTACTTGC
ACTAATATAAATATCACTAAAGACGATTTGTTACCGAAGCAGATATGTTTCCATTGCTAC
AATTGTCTTGTAAGTTTTTATAAGTTTAGAAAACTAGCAGAAAGTATTGATGAGAAACTT
CAGAGTGCTTTGTATAACAGATTATACAGCAGTTCAAGTGAGGATAATAAATTAGGTGAA
ATAAAAATAGAGCATGTTAATTACATTGATGGCGATATTCAACCAAAGGTAGAGGATAGA
AGTGAGACTGAATCAAATTATAATAATAATACTAATAAAGACATTAAATGTAATATAAAG
GCAAATGAAACAACAACACAGTCAAGTGAAGTCAATAAAAGATTATTACCGACGGTATCA
AAGACTGAAAGCAAGGATGCAGCACATGATACAGATAATGTTAAATTCAAGTGTGAGATC
TGTAGTAGAACATTCAAATCAATTAAATCCCTCTCCGCACATATGATCAAGCATACTAAG
AAAGGCAGAATATTATCATGTAGTATATGCGGCAAGGAATTCAAAAAAGTCAGTCATGTT
AAAAGACATGAAAAAATACATGAAATCAATCGGCCACACAAATGTGCTGTCTGTTCTAAA
TCATTTCCTAGCGAGGACATATTGAAAGAGCATTTAAACAAACACAATGGTGTAAAACCA
CATACATGCACATATTGTTCAAAGTCGTTTGCACATTTATTTACTCTGAAAGCACATATA
AGGGTGCACACAATCGACAAGGCCTTCTTGTGTCCGACATGCGGAAAAAGCTTCTATTCG
AGCACAAATTTTAAACAGCATATGAAAAGGCATGCTGGCTTGAAGACGTTTGCATGTGCA
ATGTGTCCAAAGATATTTATAAGTAAAGGTGAATTAAAATCCCATACCATAACACATACA
GGTGAGAGGAATTGTACGTGTGATCAGTGTGGGTCGTCCTTCACTAAGAACAGTTCTTTG
ACGAAACATATCAAGTTGAAGCATTTGGGATTGAAGCCTCATCAATGTGATAAGTGTTCT
ATGAAATTCACAACCAAGGATCATTTGAAACGGCACTATAGAAGTCACACGGGTGAAAAG
CCTTATAAATGTGATCTGTGCGAAAGAGCTTTCTCACAAAGCAACGACCTCGTCAAACAT
CGTCGTGTGCATTTGGGAGATAAAACTTATAAATGCATGGAGTGTACTCAAAGTTTTCGC
CTCAAATATGAACTACAGCAACATATTTCGGAGCATTTTATTAATTTAAAATTGTTCAAC
AATCCACCAATAGATGGCGCTAGTTCCGTGATGGTTCCAGCTAATATTACAGATGGCGCT
GATGTTAAAATTAATAAATGA

Protein sequence:

MFGIFMEINSTSAAHILATCTNINITKDDLLPKQICFHCYNCLVSFYKFRKLAESIDEKL
QSALYNRLYSSSSEDNKLGEIKIEHVNYIDGDIQPKVEDRSETESNYNNNTNKDIKCNIK
ANETTTQSSEVNKRLLPTVSKTESKDAAHDTDNVKFKCEICSRTFKSIKSLSAHMIKHTK
KGRILSCSICGKEFKKVSHVKRHEKIHEINRPHKCAVCSKSFPSEDILKEHLNKHNGVKP
HTCTYCSKSFAHLFTLKAHIRVHTIDKAFLCPTCGKSFYSSTNFKQHMKRHAGLKTFACA
MCPKIFISKGELKSHTITHTGERNCTCDQCGSSFTKNSSLTKHIKLKHLGLKPHQCDKCS
MKFTTKDHLKRHYRSHTGEKPYKCDLCERAFSQSNDLVKHRRVHLGDKTYKCMECTQSFR
LKYELQQHISEHFINLKLFNNPPIDGASSVMVPANITDGADVKINK
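
The CDS length and the two sequences above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the helper name `check_cds` is hypothetical, and the protein length of 466 is obtained by counting the residues listed above (seven lines of 60 plus a final line of 46). It assumes the standard convention that a complete CDS starts with ATG and ends with one stop codon that is not represented in the protein.

```python
# Consistency check between the CDS and the protein given in this record.
# `check_cds` is a hypothetical helper, not a database or library function.

STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(cds_length, protein_length, first_codon, last_codon):
    """Return True if the CDS is a whole number of codons, starts with ATG,
    ends with a stop codon, and codes for exactly `protein_length` residues."""
    return (
        cds_length % 3 == 0
        and first_codon == "ATG"
        and last_codon in STOP_CODONS
        and cds_length // 3 - 1 == protein_length
    )


# Values taken from the record: CDS Length 1401, nucleotide sequence
# beginning with ATG and ending with TGA, protein of 466 residues.
print(check_cds(1401, 466, "ATG", "TGA"))
```

The check only compares lengths and terminal codons; it does not translate the full sequence, so it would not detect an internal substitution.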