DPGLEAN01992 in OGS1.0

New model in OGS2.0DPOGS215557 
Genomic Positionscaffold860:+ 9364-15308
See gene structure
CDS Length2088
Paired RNAseq reads  483
Single RNAseq reads  1105
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010686 (3e-152)
Best Drosophila hit  CG5245 (2e-18)
Best Human hitPREDICTED: zinc finger protein 729 (3e-24)
Best NR hit (blastp)  hypothetical protein BRAFLDRAFT_280890 [Branchiostoma floridae] (4e-27)
Best NR hit (blastx)  Predicted gene, EG630579 [Mus musculus] (6e-37)
GeneOntology terms

  
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0008150 biological_process
InterPro families

  
IPR007087 Zinc finger, C2H2-type
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR015880 Zinc finger, C2H2-like
Orthology groupMCL40551

Nucleotide sequence:

ATGTTCGACCTTAAGGCATGTGTCGTGTGTTTAAAAGCCGATGTAAAGTTATTTAGCATG
AATAATGGTCAACTTAGACAACAATTCTACTTAGTAGCTGGGCTTAAGATAACACAATCT
GTATTAAATGAATTGGATTATAGTATTCTAAACATTAAGCCAGGCTTAGCTTACATGGAC
TCTATGAAAGGCGGTCATCAGTATGAGGACATAAAATTCAAGTGGTTGAAACAGAATCGC
ACATGCATAGAACCTCAAGGCACTATTCCCGTTATGTTATGTAGTACGACAGAAAATATA
TTTGAATTGCCGTTCAAAACGAAATCGAATTGTGATGATGGGAAGAATACATATGATATG
GCTGTTGGATTAAGCAATAATGTTGAGAAGAATGAAGTAGAATTGTTTGATTCCAATATA
AATATAAGTAATAATGATTTAGAAGTATTAGAATTGCAGAAAGATGATGCCGACGGTTCC
ATCCTCAATGAGGAATATGGTAATGTAATCCCAATCAGTCTGAAAGAGGCTCAGGCTGTT
GTAGATATTAATAAAAAATTTGCACTCGGAAAGTTTCGTTGTGATATCTGTGATAAGGCG
TACTGTAATGAAAAAAATTTAGAATTACATAAGAGGATGCATGTCGAGAGCGTAAGTGGT
TCACATTACTGTGTGCTGTGTAAATATTATTATAAGACAGAATTTTTACTGAAAACCCAT
TATAAAGACAAACATATGTATAAATATTTATGTAGGAATTGTCCTGAAGTTAGTTTTGAC
AGATTTTCAGCAAAAAGACATTTCATGTTGATACATGGTCCGAAAGGTACAAAGAAAGAT
GGTGTCACCGACAAACTAAACAAGAAGAACATAGACAAGAAAAAACAAGGGATCTATGTT
CATAAGAAGATTAAACCCAAAGATCCGGAAGATTTCCTCATATATACACCGATAAAACAA
GCGGAACAATACTCTATGGTGCTAGACAGGCAGAAAACAAAGAATTATATAGAATCTCCG
TACAAATGTCAGTATTGCTTCAAAGGTTTCAGGGAAGTGGTCACATATGAGAAGCACATG
CAGAAACATGACCCTGTGTATTCCGGTAAATATCAGTGCGACATGTGTAAAATACACTGT
TCGAGCACGAGGAAGATGTACAAACATATGAACACTACGCATAACTTCAAATTTTCCTGT
CAAATGTGCAGTTTCGTGTGTTACAGCAGGAAGCAGTCGACTCGTTTGACCCACATTCGT
ATAAAACATCCGTCTACTTATATCTGTAATATATGCGGACACAGTTACGTGAGCGAGGCT
GGTTTATACTGTCACAAGAAGATAGCACACAGCGCTGAGGAGATAAAAGTCCAAGAGATG
CCGACTCCGTCCCTATCCCTGTACTGTTCTGAGTGTGAAGTACAGTTTACCAATCAAAAA
GCCTACGACACACATTTCGGATCATCGAACAAACACGCAGATACTAACGTATCAACTAAA
CCGTCTCGTAGTAATAAGTGCAGTCCGTCGCGACCTCGCGGCCGGCCTCGGTCGGGGTCC
GATGTCCTCAACACCGGGGTCACGACCGCTTCGCACTGCGAGATATGCCAGCAATACTTA
CCAAACGACGTCCAAGCGAAGCGACACTACGAATCCGAACATCCGGGGGCGACTTACCTC
AAGAGATACATGTGTGATATATGCGGACATACAACTAAGCAATACGCGAACCTGTTGGTA
CACATGCGGACGCACACACAGGAAAAGCCGTATTCGTGTCCTCACTGTCAAAGGAGATTT
AGTATGGTCAGCAACAGAGACAGACATCTGGTGGTACACACAGGTGAAAAGAGATATCAA
TGCCAGCATTGTAACCGTCGCTTCACACAGAGCAGTGCCGTCAAGCTTCACATACAGACT
GTCCATCTGAAGATACCTTATGCTCCGTGGAATAAGAAGAACCGGAAACGACGTCGCGAC
GAGCCCGCTCCCCCCTCACCTACACCGCCCCAACCTCCTCAGGCCCCCCACAAGCTGGTG
TTAGACGCTGGGAATTACCTCAGCGCCTATATAACATATAATGAATAG

Protein sequence:

MFDLKACVVCLKADVKLFSMNNGQLRQQFYLVAGLKITQSVLNELDYSILNIKPGLAYMD
SMKGGHQYEDIKFKWLKQNRTCIEPQGTIPVMLCSTTENIFELPFKTKSNCDDGKNTYDM
AVGLSNNVEKNEVELFDSNINISNNDLEVLELQKDDADGSILNEEYGNVIPISLKEAQAV
VDINKKFALGKFRCDICDKAYCNEKNLELHKRMHVESVSGSHYCVLCKYYYKTEFLLKTH
YKDKHMYKYLCRNCPEVSFDRFSAKRHFMLIHGPKGTKKDGVTDKLNKKNIDKKKQGIYV
HKKIKPKDPEDFLIYTPIKQAEQYSMVLDRQKTKNYIESPYKCQYCFKGFREVVTYEKHM
QKHDPVYSGKYQCDMCKIHCSSTRKMYKHMNTTHNFKFSCQMCSFVCYSRKQSTRLTHIR
IKHPSTYICNICGHSYVSEAGLYCHKKIAHSAEEIKVQEMPTPSLSLYCSECEVQFTNQK
AYDTHFGSSNKHADTNVSTKPSRSNKCSPSRPRGRPRSGSDVLNTGVTTASHCEICQQYL
PNDVQAKRHYESEHPGATYLKRYMCDICGHTTKQYANLLVHMRTHTQEKPYSCPHCQRRF
SMVSNRDRHLVVHTGEKRYQCQHCNRRFTQSSAVKLHIQTVHLKIPYAPWNKKNRKRRRD
EPAPPSPTPPQPPQAPHKLVLDAGNYLSAYITYNE