DPGLEAN18480 in OGS1.0

New model in OGS2.0DPOGS207385 
Genomic Positionscaffold604:+ 33573-48276
See gene structure
CDS Length3843
Paired RNAseq reads  2425
Single RNAseq reads  7044
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009009 (2e-29)
Best Drosophila hit  crooked legs, isoform A (5e-70)
Best Human hitzinc finger protein 182 isoform 1 (1e-91)
Best NR hit (blastp)  zinc finger protein [Aedes aegypti] (9e-116)
Best NR hit (blastx)  zinc finger protein [Aedes aegypti] (2e-116)
GeneOntology terms



  
GO:0008150 biological_process
GO:0003674 molecular_function
GO:0005634 nucleus
GO:0046872 metal ion binding
GO:0005575 cellular_component
InterPro families

  
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR015880 Zinc finger, C2H2-like
IPR007087 Zinc finger, C2H2-type
Orthology groupMCL10285

Nucleotide sequence:

ATGGGTGATAGAGAGTATGATGGTGGGGGGGGTGCCCGGGGGGGAATACCGGGTATTCCG
AGTGTAGGGAGTGTGGCGGCGGATGATTCGCTGTCTTCGCGTCCGGGATCGGAGTTAGGA
GCCAAGCGACTCTTTTCGGAGATGTCTGACTCCGATACCGAGACCGGGGGCGGGATAGTT
AGAGGGCTGGATCTGGCGGGAAGGCGAGGGCGGGGTAAAAATCTGGCTAGAGCCAGGAAA
TTCTTGCGCCAGCGGGAAGAGGAGGAGTCGGAGGCTGCCTTCGACTCCTGTATGGAAAGG
AGTCTTGCGAGAGAGAGGGACGGTGCGAAGATGAGGGCGAAGGGGAAGGAGGTCCCGCTT
GAGCTGGAGGCCATGGGGGCGGAAATGATAATGGTCGAGGCTGAGAAAAGCCTCGACCTT
ATCAAAAGTTTGGTAGGAAAAAGTACCAATTTGAAGGGAGGGTACGCCTCAAAAATAACG
AAGGCTTCTGTCTTCCTGAAGGAGGTCCTCGACGTGCTGGTGACGCGCACGGAAGCGGAA
GAGACCCGCCGTCTTAGAGCCGACATCGGTAGGCTCCAAAGAGAAAACGCGGGTCTGAAG
GAGGAAGTGCGCGCTCACCGGCAGCAGTTTGAGGAGATGCGACGGGAGAGGGTGACGGCG
GCTCAGGCGGCTTCAGACGGCCAAACCAGCCGGGACCAGTTCCAGGTGCTGGAGGCCAAC
ATCGCCAGGATGGTGGGTAACCTGGTGGATGGAAGGCTTGCAGCACTGGAGTCCCGGCTG
AAACGGGAAGAGGTCTCACGCCCCCCCCTGGCGGGCGACAACTCGGCCGTCGCCGTAGCT
GCCAGAGCGGCGATCCGCCGTGCGGGGATGCAAAGGGGGGCCGGTAAGGCGGCTCCTTCA
CCGGCGTCTTCGGCGCCGGTGCTGGTGGTGTCGGAGGACGAATTTCCCTCCCTCCCTCCA
CCACCCTCCAAGGGAAAGGGTAGGGGCAAGGACCCCAAGAGAGGAAAAGAGGTGGCCTGG
ACCGCGTTTGGCCCAGCCATCTCTAGTGGAAGCGTCGAACATGCTGCAGCTGGGGCCGGG
GCTGTTGCGGCGGGGTGGACGGAGGTGGTCCGCCGCAAGGCCCCCAAAAAGAAGGAAGTT
GTGCCGGTAACCAAAACACCGGCGCCACAGCCAAAAAAGAAGGCGGGCCCTAAGGAGGGC
CCCAAGAAAGTGGCGCTCCCACGCTCACAGGCCGTAATGTTGAAGTTACGGCCTGAAGCA
GCGGCTAAGGGGGCGACCTACTTGTCGGTCCTCTTAAGGGCCGAGAGGGAGGTAAATACG
AAGGAGCTGGGTATCGGGCCCCTGAAAATCCGTTCATCGGCAACAGGGGCCCGCATCATC
GAGGTGCCCGGCTCAGCCAGCGCGGACAAAGCCGACGCTTTGGCCGCTAAGCTGAAGTCT
GTGCTGGCGGAGGAGGCGGAGGTGTCGAGGCCCGTGAAATTCACGGACGTAAGAGTAACG
GGCCTCAACGACGCGACGACCGCGGACCGGCTGATAGCCGCGGTCGCGCAGGAGGGGGGC
TGCACCGAGGCCCAGGTCAGGGTTCGTAGCGTGCGACCTGGGCCTCGCGGCACAGGCTCC
GCCCTGGTGGAGGTGCCGGCTGCGGCGGCTAAAAAGCTGCTGCAGCTGGGCAGCCTGTCA
GTCGGGTGGAGCCAGGTGCGGCTCTCGCACATGGAGGCACGACCCAAGCACTGCTTCAAG
CAGTTCGCCATTAGTTTCCCAGCACACTGGGGCGTGGGCGTCGGGGGTGTGGCGCGGGGC
GTTGGTGTCCCCGTGCCACCCCCCGGCGACCTACTCGCCACGATATCCATGTTCGAACAA
CAAATCAAGGCAGAGCCCATGAGCTTTTACACACATTCACATATAAATACTGGACCCCCA
ACGATAATGCGCTCGGATTCCGGCCATGGCATAATCAGTATGAATCAGCACCACCCCCAG
GAGGACTCCAAGGACAGTCTTATACAACAGCAAGTACAACACCAACAAGAGCTGATGGAA
CAGCACCAACAGGACTTGCAGCACGACGATGATGTTGATAATTTAAGCTTCAAAGGCATG
GATGATGAAGGTGTTGAATTGGACATGGACGGCAGACAATGTTCTCAGGGCATGGTCGAC
ATGGGATCAGTTCAAACCAAAATGGAAGTCACAAACGGGGGTGGGATGCCAAGATCGAAA
CCACAAGCTTGTAAAGTGTGTGGTAAGGTGTTATCATCTGCCTCGTCATATTATGTTCAC
ATGAAGCTACATTCTGGCAACAAACCTTTTCAATGCACGGTATGCGACGCAGCGTTTTGT
CGCAAGCCTTATCTTGAGGTGCACATGCGCACGCACACTGGCGAGCGTCCCTTCCAGTGC
GATCTGTGCCTAAAACGCTTCACACAGAAGTCCAGCCTCAACACGCACAAGCGCGTACAT
ACCGATGAGCACATGCGAGCCTTGATGGTGAAGGATCGACCCTACCAGTGTGAGGTCTGT
CTGATGCGCTTCACTCAGAGCTCCAGCCTCAACAGACACAAGAAAATACACACGGAGGAG
CACAGACGAGCCCTGTTAGAAAAAGTGCGGCCGTACCAGTGCCACATCTGTTTTATGCGC
TTCACTCAGAAGTCCAGCCTGGGCCGACACGGAAAGATACACACCGAGGAGCACATCCAA
TCGCTGATCAACAAAGTGCGCCCCTATCAATGCGACATCTGTGACAAGCGGTTCACTCAG
AAGTCCAGCCTTGGCACTCATAAACGTATACACACCGTCCAGGGGAGACCGTTCCAGTGC
CTGTCGTGCCCGGCCGCCTTCACCTGCAAGCAATATCTGGAGATACACACGCGCACACAC
ACAGGCGAGCGGCCCTATCAGTGCGACATCTGCCTCAAGCGGTTCACACAGAAATCCAGT
TTGAACATCCACAAGCGGACGCACTCAGTTCAGGGCCGGCCGTTCCAGTGTCTCCAGTGC
CCGGCCGCCTTCACCTGCAAGCAGTACCTCGAGATACACAACCGCACGCACACCGGCGAG
CGCCCCTACCAGTGTGACGTCTGCCTCAAGAGATTCGCGCAAAAGTCTACACTCAACATA
CACAAAAGAACGCACACAGTGCAAGGGAGGCCGTATCAGTGCATGGAGTGCCCGGCGGCG
TTCACATGCAAGCCGTACTTGGAAATACACATGCGCACTCACACTGGCGAGCGTCCCTTC
GAGTGCGATGTCTGTTACAAACGCTTCACCCAGAAATCAACACTCAACATTCACAAGCGA
ATTCATACCGGCGAACGTCCATACGCATGTGATATTTGCCAAAAACGATTCGCAGTTAAA
AGCTACGTAACAGCGCACCGATGGTCCCACGTGGCGGACAAACCCCTGAACTGCGAGCGA
TGTTCTATGACGTTCACTTCCAAGTCCCAGTTCGCCCTCCACATCCGGACCCACGCCAGT
GGACCCTGCTACGAGTGTAGCGTCTGCGGCCGGACCTTCGTCAGGGACAGTTATCTCATA
CGTCACCACAACCGCGTACACCGTGAGAACCACAGCAACATATCAGCGAACAGCATCGGC
ACCATCAACAGCGTGGCCACCAACACCAACTCCAACAACGGCAACTACGACTCGCCAGGC
GTCTGTGACCTCAGCTTCGTGCCGATGGTGAATCGCTACATGACGTCTCAGGGCACGCAG
GTTTCCATGCAGGACACGAAAATGTCCGCCATGTCGCCGCAGTCGATCGCGTCAATATCG
TCGCCGCCGCCCCCGCACACCCCCACGCCCCAGCCGCAGATGTCCATGCGACTGTCGGAT
TGA

Protein sequence:

MGDREYDGGGGARGGIPGIPSVGSVAADDSLSSRPGSELGAKRLFSEMSDSDTETGGGIV
RGLDLAGRRGRGKNLARARKFLRQREEEESEAAFDSCMERSLARERDGAKMRAKGKEVPL
ELEAMGAEMIMVEAEKSLDLIKSLVGKSTNLKGGYASKITKASVFLKEVLDVLVTRTEAE
ETRRLRADIGRLQRENAGLKEEVRAHRQQFEEMRRERVTAAQAASDGQTSRDQFQVLEAN
IARMVGNLVDGRLAALESRLKREEVSRPPLAGDNSAVAVAARAAIRRAGMQRGAGKAAPS
PASSAPVLVVSEDEFPSLPPPPSKGKGRGKDPKRGKEVAWTAFGPAISSGSVEHAAAGAG
AVAAGWTEVVRRKAPKKKEVVPVTKTPAPQPKKKAGPKEGPKKVALPRSQAVMLKLRPEA
AAKGATYLSVLLRAEREVNTKELGIGPLKIRSSATGARIIEVPGSASADKADALAAKLKS
VLAEEAEVSRPVKFTDVRVTGLNDATTADRLIAAVAQEGGCTEAQVRVRSVRPGPRGTGS
ALVEVPAAAAKKLLQLGSLSVGWSQVRLSHMEARPKHCFKQFAISFPAHWGVGVGGVARG
VGVPVPPPGDLLATISMFEQQIKAEPMSFYTHSHINTGPPTIMRSDSGHGIISMNQHHPQ
EDSKDSLIQQQVQHQQELMEQHQQDLQHDDDVDNLSFKGMDDEGVELDMDGRQCSQGMVD
MGSVQTKMEVTNGGGMPRSKPQACKVCGKVLSSASSYYVHMKLHSGNKPFQCTVCDAAFC
RKPYLEVHMRTHTGERPFQCDLCLKRFTQKSSLNTHKRVHTDEHMRALMVKDRPYQCEVC
LMRFTQSSSLNRHKKIHTEEHRRALLEKVRPYQCHICFMRFTQKSSLGRHGKIHTEEHIQ
SLINKVRPYQCDICDKRFTQKSSLGTHKRIHTVQGRPFQCLSCPAAFTCKQYLEIHTRTH
TGERPYQCDICLKRFTQKSSLNIHKRTHSVQGRPFQCLQCPAAFTCKQYLEIHNRTHTGE
RPYQCDVCLKRFAQKSTLNIHKRTHTVQGRPYQCMECPAAFTCKPYLEIHMRTHTGERPF
ECDVCYKRFTQKSTLNIHKRIHTGERPYACDICQKRFAVKSYVTAHRWSHVADKPLNCER
CSMTFTSKSQFALHIRTHASGPCYECSVCGRTFVRDSYLIRHHNRVHRENHSNISANSIG
TINSVATNTNSNNGNYDSPGVCDLSFVPMVNRYMTSQGTQVSMQDTKMSAMSPQSIASIS
SPPPPHTPTPQPQMSMRLSD