DPGLEAN20963 in OGS1.0

Genomic Positionscaffold2326:- 14951-34387
See gene structure
CDS Length3375
Paired RNAseq reads  2183
Single RNAseq reads  5418
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008948 (3e-154)
Best Drosophila hit  CG5080, isoform A (6e-22)
Best Human hitzinc finger protein 155 (1e-21)
Best NR hit (blastp)  paramyosin, putative [Pediculus humanus corporis] (2e-96)
Best NR hit (blastx)  paramyosin, putative [Pediculus humanus corporis] (2e-93)
GeneOntology terms



  
GO:0046872 metal ion binding
GO:0008150 biological_process
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0005634 nucleus
InterPro families


  
IPR007087 Zinc finger, C2H2-type
IPR012934 Zinc finger, AD-type
IPR015880 Zinc finger, C2H2-like
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL15465

Nucleotide sequence:

ATGACATTTAATAGGAAGTACAGATTGGTAATGAGGAATAGTATAGATTCTGAGGATTGG
CCACTTTATGACCCCATGTTGGAGTATCACACTAAATATGAGCCAGAATATCTGGAACGC
CTCAGCAGTATGTCTGCTGGTGGTATGGCTGCTATAGAGTTCCGCTTAAAACATAGACCT
AGAGAGAAGAAAAAAAAAGATGAAGCAGATGATGAATTCCAATGGTCCAGAGATATCACG
GAATCATTCATTCAGATTAGAATGCAGAATGATTGGCTTTTTAGGGACAGGAAATGGGCG
TGGAGTAACCTGCGTCAGATTATGATAGAAGAGTACGGTTTCCCACATTGCCTGTCTAGC
AGAGACCTCAGCAGGAAGTGGGCTGCAATATATGCTGAGTACCAAAAAGCTAAAGCGACA
AACAATATCTCATGGATGTATTATTCTCTTTTTGAAGTTTATTTCGGAGAAAGCAGTATG
AGTCTCAACCCTTTGCTTGGCTGGCAAGAAGAGTGGGTGATTAATTTAATAAGTACCAGA
ACAGAATTAGAACAATTGTTTAAAATGTGGGAAAAGAAAAAGGAGACACCGTGGCGAGAA
GTGGAGAAAAAACTCAGGAAAATGGGAATTCCTTTGGATCATAGTCTTCTAGAAATAGAG
GAAATTTGGCGGCACTTATTGAAGACTTTTAAGTGGAAGCAGAAATTCGCTAGCAAAGGT
ATACTCAACGAGCAGTGGCCGTACTACGAACACGTGTCCAGATATGTCGACCAGCACGAA
GCAAAGGAGGCTAATGACGGAGATTTCGAAGACGACGTGAAGCTGTACGAGCTGAAGAAG
ATCGCCATGGAACCGAAGCATGAAGTGACCAATGTGTGCAGATCGTGCTCGAGCGACGAT
GGCTGTGTGAAAATATTTGAGGAAACAGACGACGAAGGTCTCGATGTGGCGTATAAGCTG
AAAGTCATCGGTGGCATAGAGATACAAAGATCAGATACCTTACCCACCCAAATATGTCTT
CAGTGTCTACAAGAGTTGGAGAACGCGTTCAAGTTCAGACGTCAGTGTCAAGAGGTGGAC
AAAAATCTCAGAAGCAGCTCCTCCTTCATCAAAGTGGAATTACAACTAGACGATAAACAT
CATACGAACGAAATCTGCGATGGAGAGAGACAGAACTATGAAATAGAGATGGATAGAGAC
GGCGTCACCATGGCAACGAAAAAAAAAACATCCCCGCAAATGAGACCCGCGAGGAAAGTT
ATAAGGAGGAAGAAGGTCCGCAAGTCCGAATACGAATATCTAAAGGTGTGCGAAGTGTGC
GGGAAACACACCAGAAACCTCAAGGCGCACATGGACGTACACTCGAAAGACAAATGTTAC
TCGTGTGAAATATGCGAGAAGAAATTTAAATTCAAAAGCGGGTTGATAGTCCACAAAGCC
ACCCACAATCCGACACCCAAAAAGACATGCGAAGTCTGCGGGAAGAGCTTCCATATATTG
TCTCAATACAGAAGACATTACGCCTACCACGCGAACGAAAGGAAATACGGTTGTGAGACA
TGCGGGAAAAGATTCAATTCTTTAGACATTTTAAAAGTCCACGCCAGAATCCACACGGAC
GAGAGACCGTTTAGCTGTTCCGAATGTGGTAAAACTTTCAGAACAGCCGGATGTGTGGGC
AGACACAAGAGGATAGTCCACAGGAATACAAAATTAGACAAACAGGACGAGCTACACTTC
AATATGAGAGGTTGGTGGATGATAGCTGTTGTTGTGCTTGTGGCATCAGAAACTCAAGGA
AGAGATGTCACACACGAGGACATCCGAGACGCCATGTTGTCTCTGGTTCATATCGTCCGC
GCCTCGGAGGACAAGTTGGAGCGACACGAACTACGAGAGAAAGCACTCGGCGATCAACTC
AAGAAGATGATGGCTGGTCTTGAGAAGAAACACAGGAACCTGGAGACATTGAAAGGCACG
ATATCGAGACTCGACGACAGATTATATAATGTAGAGAATATATTCCTGCAGAAGGAAGAG
AGAGAAAAGGAAACTCAGAAGAAAACAAATGAAGCTTTGGAAGAAATACAGAAATCACTA
AAATCACTTACGGAAATGGTATCAAGTAACTTAAAACCAATCAGCACAACTACCGAGATG
GACAATAGTTTAACTCCGAATGAGGATCCACTAACTAAGCGATTAGACGCGACCGACGCT
AAGTTGGATAATATTAAAGTCGAAATAGAAAAACTTAAAAACAGCATCAACAAGGATGCC
TTACAAGCAATGTGCGCAGAAGTGGCTATAGATTTAAATCAATTATCTGAGACGGAAAAG
CTCTTGAACAAGTATGAATTGAAGTTAAACGAGTACAATGGAACCGCTAGTAAAGTGCAG
ACGGACTTCGTGCCACTAAGTGAAGTATCGCTGGCTGATGAAGCATGGCACAGTAAAATG
ACTGAAGTAATGGAGCGTCAAGAGAAAGATATTATAAAGATACGACAGTTATTGTCTGAT
GCTGAGAGCATGTGGAAAGATTTACCGCATTTGGCTGACATCAAGCGTTCAACCAATGAC
ACACTAGAGGCCATTGCCGCCCTACAGCGAAACGTCACTGATATTATGGAAAAGGGAGTT
GCCAAAACGAACATGAAAGTGAAAGAACTAGGGGATAGGCTTGTTGCCACCAACGAGGAC
ATACAACAGAGCCTTACACAGGGCAACACCATGAGCGAACGAGCTTACACGGACTTACAG
AGGAGCTACACCAATCTTCGAGAAGAATTGCAAGGTTTCTCCAAAAATGAGCACGTGATG
CTGCAAACAGCGGACAATGTCATAGCCACAAAGAAACGCATTGAATATGGAGTACATCAG
ATATCATTAGAAGTTAGCGAGCTAATTAGAATTCAGAGCAATTTGTTGAACAAAACTATG
AATGAAAGGTTCGACAGCATAGAGTCCTCTATAGTGACAAACCAAAGCCGCGCTATGAAC
GCCCTGAGTGACAAGCTTGAGACGGACATGTCGCAGGTGTGGCGACAGATGGGTGTAGTG
TACACTCAGCTCACAGCTAGCAGACAGGCGCTCGATAAACTATCGGAACAAACCGCGCAA
TACGTTAATGGAAGCTCAAGTAAATTGGACAGCATGAAGGAGAAGGTGAGCGCGATAACA
ACACGCATGTCCGAAGTTGATGACAATTTGAACTATTTATTAGGAAGAATTTCATTAGTG
ACTCAGGAATTCAGTCTAATTAAAACTGGGCTGGGCATCGCACTGGACAAAGCGAAGAAC
GGCCTCGACGAGGTTCAAGCTAAACTGGACGATAACAGTCCAGGACCGCATCCCGTCGAG
GTTAAGGCGAATTAA

Protein sequence:

MTFNRKYRLVMRNSIDSEDWPLYDPMLEYHTKYEPEYLERLSSMSAGGMAAIEFRLKHRP
REKKKKDEADDEFQWSRDITESFIQIRMQNDWLFRDRKWAWSNLRQIMIEEYGFPHCLSS
RDLSRKWAAIYAEYQKAKATNNISWMYYSLFEVYFGESSMSLNPLLGWQEEWVINLISTR
TELEQLFKMWEKKKETPWREVEKKLRKMGIPLDHSLLEIEEIWRHLLKTFKWKQKFASKG
ILNEQWPYYEHVSRYVDQHEAKEANDGDFEDDVKLYELKKIAMEPKHEVTNVCRSCSSDD
GCVKIFEETDDEGLDVAYKLKVIGGIEIQRSDTLPTQICLQCLQELENAFKFRRQCQEVD
KNLRSSSSFIKVELQLDDKHHTNEICDGERQNYEIEMDRDGVTMATKKKTSPQMRPARKV
IRRKKVRKSEYEYLKVCEVCGKHTRNLKAHMDVHSKDKCYSCEICEKKFKFKSGLIVHKA
THNPTPKKTCEVCGKSFHILSQYRRHYAYHANERKYGCETCGKRFNSLDILKVHARIHTD
ERPFSCSECGKTFRTAGCVGRHKRIVHRNTKLDKQDELHFNMRGWWMIAVVVLVASETQG
RDVTHEDIRDAMLSLVHIVRASEDKLERHELREKALGDQLKKMMAGLEKKHRNLETLKGT
ISRLDDRLYNVENIFLQKEEREKETQKKTNEALEEIQKSLKSLTEMVSSNLKPISTTTEM
DNSLTPNEDPLTKRLDATDAKLDNIKVEIEKLKNSINKDALQAMCAEVAIDLNQLSETEK
LLNKYELKLNEYNGTASKVQTDFVPLSEVSLADEAWHSKMTEVMERQEKDIIKIRQLLSD
AESMWKDLPHLADIKRSTNDTLEAIAALQRNVTDIMEKGVAKTNMKVKELGDRLVATNED
IQQSLTQGNTMSERAYTDLQRSYTNLREELQGFSKNEHVMLQTADNVIATKKRIEYGVHQ
ISLEVSELIRIQSNLLNKTMNERFDSIESSIVTNQSRAMNALSDKLETDMSQVWRQMGVV
YTQLTASRQALDKLSEQTAQYVNGSSSKLDSMKEKVSAITTRMSEVDDNLNYLLGRISLV
TQEFSLIKTGLGIALDKAKNGLDEVQAKLDDNSPGPHPVEVKAN