DPGLEAN07644 in OGS1.0

New model in OGS2.0  DPOGS210028
Genomic Position  scaffold365:+ 46627-53545
CDS Length  1452
Paired RNAseq reads  1687
Single RNAseq reads  4827
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010720 (5e-145)
Best Drosophila hit  CG9215 (2e-35)
Best Human hit  zinc finger protein 79 (6e-26)
Best NR hit (blastp)  hypothetical protein AaeL_AAEL013321 [Aedes aegypti] (6e-58)
Best NR hit (blastx)  hypothetical protein AaeL_AAEL013321 [Aedes aegypti] (3e-57)
Gene Ontology terms
GO:0008270 zinc ion binding
GO:0005634 nucleus
GO:0003676 nucleic acid binding
InterPro families
IPR007087 Zinc finger, C2H2-type
IPR012934 Zinc finger, AD-type
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR017956 AT hook, DNA-binding motif
IPR015880 Zinc finger, C2H2-like
IPR020478 AT hook-like
Orthology group  MCL21794

Nucleotide sequence:

ATGGATTGGCAATTAACGTGCCGTGTATGCTTAGAAACTGGAGATATGGTTTCCCTTTTC
GATTGGGATGAAAATAATGAACAACTCGGTGATAAATATACCTACTGTTGCGGCGTGGAG
GTAACTAAAAATGATACATTGCCGACACTAATATGCCTGAATTGCGTGGATCGTTTAACC
TACGCCTGCCAATTCAAACAACAATGTCTCTCATCGAATGAAACATTAAAAGAGTGTCTC
GAAGAATTTTTAAAAACATCGGCTGCAGTTTCAGCAAACAAACCAGACGAAGGTACTGAA
ACAGAAACTATAACAATCAAGCAAGAAAATGGTATGCTTTTACAATATGAGTTGCCAGTG
GAGCATGACCCACAGGTTCAATGGACGAGGATCATTAAGACTCCTAATAGTGCCGTTGTT
ACTAAAGCACCGGGTCGGAGGAAGAGAGGTCGGCCACGGAAATATCCTAATGAACAGGGG
CAGATTGAACTGGATGGTGGTGATGATCCGGACTTTGTGCCAGTCGGTGAGGACCCTGAT
TCTGATCATGATATCAAAGAGGAGATACCTGTACCAAAGAAACGTGGCCGTCCGCGGAAG
TCTATCCAGGAACAGCCTAAGCCGAAGGACGATGACGAGGACGTGCTCCTCAAGGAAACC
ATCATGGCCTTCTCGGAGCCCATACCTGAACATATACTGAACCCCAAGCCAAAGAAGAAA
CGACAACAGCCAAAAAGAAATATACATGTTTGTGAAACTTGCGGGGCCTCGTTCACTTCG
AATGCATCCTTGCAAGCTCATATACGTCGTCATTTGGGCATAAAACCGTTCGTGTGCAGT
GTGTGCGGCTACGCGTGTGTCCTGAACATGGAACTGCGTCGGCACATGATGCGGCACACC
GGCGTGAGGCCGTACAAGTGCAGGGTGTGCGACAGAAGATTCGGCGACTTCGGCAGCCGG
CAGAAACACGAACGATTACACATGGGTCTTCGTCCATATCAGTGTTCGTTGTGCGGCAAA
GCGTTTACATATTCATATGTGCTAGCCAACCACATGCTGACACACACGGGCGAGAAGAAA
TATTCTTGTACTCCGTGCAACAAGAAGTTCACAAAGGCGCATCACCTGAAGTACCACAAT
AAGGTTCATCACAAGGAGCTGTACATCCAACAGCAGTTGGAACAGGAGGCGAGGAAGATC
AGGCAGCAGTTGAATGTTACCAACCTGTCCGGAGTACTCACAGGACAGTTTGTGGACGGG
ACACTCCAACTCATACAGACTAACGAAGACGGCGGAGAAGAAACTCAATTACAAGTTGTC
GAGGAACACGAGGATAGCGAGGAGAGGGAGACGCACCAGGCCGTGGCCAACATGCAGGGG
GTGGTGTTAGAAACTGACTTCGCTATAGAGGAAGACGACGATGAAGACAGTAAAGATAAA
TACGACAATTGA

Protein sequence:

MDWQLTCRVCLETGDMVSLFDWDENNEQLGDKYTYCCGVEVTKNDTLPTLICLNCVDRLT
YACQFKQQCLSSNETLKECLEEFLKTSAAVSANKPDEGTETETITIKQENGMLLQYELPV
EHDPQVQWTRIIKTPNSAVVTKAPGRRKRGRPRKYPNEQGQIELDGGDDPDFVPVGEDPD
SDHDIKEEIPVPKKRGRPRKSIQEQPKPKDDDEDVLLKETIMAFSEPIPEHILNPKPKKK
RQQPKRNIHVCETCGASFTSNASLQAHIRRHLGIKPFVCSVCGYACVLNMELRRHMMRHT
GVRPYKCRVCDRRFGDFGSRQKHERLHMGLRPYQCSLCGKAFTYSYVLANHMLTHTGEKK
YSCTPCNKKFTKAHHLKYHNKVHHKELYIQQQLEQEARKIRQQLNVTNLSGVLTGQFVDG
TLQLIQTNEDGGEETQLQVVEEHEDSEERETHQAVANMQGVVLETDFAIEEDDDEDSKDK
YDN
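
Consistency check (illustrative sketch, not part of the original record):

The CDS length, nucleotide sequence, and protein sequence above can be cross-checked against each other. The sketch below assumes the standard genetic code (NCBI translation table 1); the record itself does not state which table was used. The names `translate` and `expected_protein_length` are hypothetical helpers introduced here for illustration. It translates the first and last lines of the nucleotide sequence and derives the residue count expected from a 1452 nt CDS that ends in a single stop codon.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1),
# with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string codon by codon; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS that ends in exactly one stop codon."""
    return cds_length // 3 - 1


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGATTGGCAATTAACGTGCCGTGTATGCTTAGAAACTGGAGATATGGTTTCCCTTTTC"
last_line = "TACGACAATTGA"

print(translate(first_line))          # MDWQLTCRVCLETGDMVSLF
print(translate(last_line))           # YDN*
print(expected_protein_length(1452))  # 483
```

The two translated fragments match the start (MDWQLTCRVCLETGDMVSLF) and end (YDN) of the protein sequence listed above, with the final TGA codon acting as the stop.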