Genomic Position | scaffold2326:- 14951-34387 |
---|---|
CDS Length | 3375 |
Paired RNAseq reads   | 2183 |
Single RNAseq reads   | 5418 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA008948 (3e-154) |
Best Drosophila hit   | CG5080, isoform A (6e-22) |
Best Human hit | zinc finger protein 155 (1e-21) |
Best NR hit (blastp)   | paramyosin, putative [Pediculus humanus corporis] (2e-96) |
Best NR hit (blastx)   | paramyosin, putative [Pediculus humanus corporis] (2e-93) |
GeneOntology terms | GO:0046872 metal ion binding; GO:0008150 biological_process; GO:0005575 cellular_component; GO:0003674 molecular_function; GO:0005634 nucleus |
InterPro families | IPR007087 Zinc finger, C2H2-type; IPR012934 Zinc finger, AD-type; IPR015880 Zinc finger, C2H2-like; IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding |
Orthology group | MCL15465 |
Nucleotide sequence:
ATGACATTTAATAGGAAGTACAGATTGGTAATGAGGAATAGTATAGATTCTGAGGATTGG
CCACTTTATGACCCCATGTTGGAGTATCACACTAAATATGAGCCAGAATATCTGGAACGC
CTCAGCAGTATGTCTGCTGGTGGTATGGCTGCTATAGAGTTCCGCTTAAAACATAGACCT
AGAGAGAAGAAAAAAAAAGATGAAGCAGATGATGAATTCCAATGGTCCAGAGATATCACG
GAATCATTCATTCAGATTAGAATGCAGAATGATTGGCTTTTTAGGGACAGGAAATGGGCG
TGGAGTAACCTGCGTCAGATTATGATAGAAGAGTACGGTTTCCCACATTGCCTGTCTAGC
AGAGACCTCAGCAGGAAGTGGGCTGCAATATATGCTGAGTACCAAAAAGCTAAAGCGACA
AACAATATCTCATGGATGTATTATTCTCTTTTTGAAGTTTATTTCGGAGAAAGCAGTATG
AGTCTCAACCCTTTGCTTGGCTGGCAAGAAGAGTGGGTGATTAATTTAATAAGTACCAGA
ACAGAATTAGAACAATTGTTTAAAATGTGGGAAAAGAAAAAGGAGACACCGTGGCGAGAA
GTGGAGAAAAAACTCAGGAAAATGGGAATTCCTTTGGATCATAGTCTTCTAGAAATAGAG
GAAATTTGGCGGCACTTATTGAAGACTTTTAAGTGGAAGCAGAAATTCGCTAGCAAAGGT
ATACTCAACGAGCAGTGGCCGTACTACGAACACGTGTCCAGATATGTCGACCAGCACGAA
GCAAAGGAGGCTAATGACGGAGATTTCGAAGACGACGTGAAGCTGTACGAGCTGAAGAAG
ATCGCCATGGAACCGAAGCATGAAGTGACCAATGTGTGCAGATCGTGCTCGAGCGACGAT
GGCTGTGTGAAAATATTTGAGGAAACAGACGACGAAGGTCTCGATGTGGCGTATAAGCTG
AAAGTCATCGGTGGCATAGAGATACAAAGATCAGATACCTTACCCACCCAAATATGTCTT
CAGTGTCTACAAGAGTTGGAGAACGCGTTCAAGTTCAGACGTCAGTGTCAAGAGGTGGAC
AAAAATCTCAGAAGCAGCTCCTCCTTCATCAAAGTGGAATTACAACTAGACGATAAACAT
CATACGAACGAAATCTGCGATGGAGAGAGACAGAACTATGAAATAGAGATGGATAGAGAC
GGCGTCACCATGGCAACGAAAAAAAAAACATCCCCGCAAATGAGACCCGCGAGGAAAGTT
ATAAGGAGGAAGAAGGTCCGCAAGTCCGAATACGAATATCTAAAGGTGTGCGAAGTGTGC
GGGAAACACACCAGAAACCTCAAGGCGCACATGGACGTACACTCGAAAGACAAATGTTAC
TCGTGTGAAATATGCGAGAAGAAATTTAAATTCAAAAGCGGGTTGATAGTCCACAAAGCC
ACCCACAATCCGACACCCAAAAAGACATGCGAAGTCTGCGGGAAGAGCTTCCATATATTG
TCTCAATACAGAAGACATTACGCCTACCACGCGAACGAAAGGAAATACGGTTGTGAGACA
TGCGGGAAAAGATTCAATTCTTTAGACATTTTAAAAGTCCACGCCAGAATCCACACGGAC
GAGAGACCGTTTAGCTGTTCCGAATGTGGTAAAACTTTCAGAACAGCCGGATGTGTGGGC
AGACACAAGAGGATAGTCCACAGGAATACAAAATTAGACAAACAGGACGAGCTACACTTC
AATATGAGAGGTTGGTGGATGATAGCTGTTGTTGTGCTTGTGGCATCAGAAACTCAAGGA
AGAGATGTCACACACGAGGACATCCGAGACGCCATGTTGTCTCTGGTTCATATCGTCCGC
GCCTCGGAGGACAAGTTGGAGCGACACGAACTACGAGAGAAAGCACTCGGCGATCAACTC
AAGAAGATGATGGCTGGTCTTGAGAAGAAACACAGGAACCTGGAGACATTGAAAGGCACG
ATATCGAGACTCGACGACAGATTATATAATGTAGAGAATATATTCCTGCAGAAGGAAGAG
AGAGAAAAGGAAACTCAGAAGAAAACAAATGAAGCTTTGGAAGAAATACAGAAATCACTA
AAATCACTTACGGAAATGGTATCAAGTAACTTAAAACCAATCAGCACAACTACCGAGATG
GACAATAGTTTAACTCCGAATGAGGATCCACTAACTAAGCGATTAGACGCGACCGACGCT
AAGTTGGATAATATTAAAGTCGAAATAGAAAAACTTAAAAACAGCATCAACAAGGATGCC
TTACAAGCAATGTGCGCAGAAGTGGCTATAGATTTAAATCAATTATCTGAGACGGAAAAG
CTCTTGAACAAGTATGAATTGAAGTTAAACGAGTACAATGGAACCGCTAGTAAAGTGCAG
ACGGACTTCGTGCCACTAAGTGAAGTATCGCTGGCTGATGAAGCATGGCACAGTAAAATG
ACTGAAGTAATGGAGCGTCAAGAGAAAGATATTATAAAGATACGACAGTTATTGTCTGAT
GCTGAGAGCATGTGGAAAGATTTACCGCATTTGGCTGACATCAAGCGTTCAACCAATGAC
ACACTAGAGGCCATTGCCGCCCTACAGCGAAACGTCACTGATATTATGGAAAAGGGAGTT
GCCAAAACGAACATGAAAGTGAAAGAACTAGGGGATAGGCTTGTTGCCACCAACGAGGAC
ATACAACAGAGCCTTACACAGGGCAACACCATGAGCGAACGAGCTTACACGGACTTACAG
AGGAGCTACACCAATCTTCGAGAAGAATTGCAAGGTTTCTCCAAAAATGAGCACGTGATG
CTGCAAACAGCGGACAATGTCATAGCCACAAAGAAACGCATTGAATATGGAGTACATCAG
ATATCATTAGAAGTTAGCGAGCTAATTAGAATTCAGAGCAATTTGTTGAACAAAACTATG
AATGAAAGGTTCGACAGCATAGAGTCCTCTATAGTGACAAACCAAAGCCGCGCTATGAAC
GCCCTGAGTGACAAGCTTGAGACGGACATGTCGCAGGTGTGGCGACAGATGGGTGTAGTG
TACACTCAGCTCACAGCTAGCAGACAGGCGCTCGATAAACTATCGGAACAAACCGCGCAA
TACGTTAATGGAAGCTCAAGTAAATTGGACAGCATGAAGGAGAAGGTGAGCGCGATAACA
ACACGCATGTCCGAAGTTGATGACAATTTGAACTATTTATTAGGAAGAATTTCATTAGTG
ACTCAGGAATTCAGTCTAATTAAAACTGGGCTGGGCATCGCACTGGACAAAGCGAAGAAC
GGCCTCGACGAGGTTCAAGCTAAACTGGACGATAACAGTCCAGGACCGCATCCCGTCGAG
GTTAAGGCGAATTAA
Protein sequence:
MTFNRKYRLVMRNSIDSEDWPLYDPMLEYHTKYEPEYLERLSSMSAGGMAAIEFRLKHRP
REKKKKDEADDEFQWSRDITESFIQIRMQNDWLFRDRKWAWSNLRQIMIEEYGFPHCLSS
RDLSRKWAAIYAEYQKAKATNNISWMYYSLFEVYFGESSMSLNPLLGWQEEWVINLISTR
TELEQLFKMWEKKKETPWREVEKKLRKMGIPLDHSLLEIEEIWRHLLKTFKWKQKFASKG
ILNEQWPYYEHVSRYVDQHEAKEANDGDFEDDVKLYELKKIAMEPKHEVTNVCRSCSSDD
GCVKIFEETDDEGLDVAYKLKVIGGIEIQRSDTLPTQICLQCLQELENAFKFRRQCQEVD
KNLRSSSSFIKVELQLDDKHHTNEICDGERQNYEIEMDRDGVTMATKKKTSPQMRPARKV
IRRKKVRKSEYEYLKVCEVCGKHTRNLKAHMDVHSKDKCYSCEICEKKFKFKSGLIVHKA
THNPTPKKTCEVCGKSFHILSQYRRHYAYHANERKYGCETCGKRFNSLDILKVHARIHTD
ERPFSCSECGKTFRTAGCVGRHKRIVHRNTKLDKQDELHFNMRGWWMIAVVVLVASETQG
RDVTHEDIRDAMLSLVHIVRASEDKLERHELREKALGDQLKKMMAGLEKKHRNLETLKGT
ISRLDDRLYNVENIFLQKEEREKETQKKTNEALEEIQKSLKSLTEMVSSNLKPISTTTEM
DNSLTPNEDPLTKRLDATDAKLDNIKVEIEKLKNSINKDALQAMCAEVAIDLNQLSETEK
LLNKYELKLNEYNGTASKVQTDFVPLSEVSLADEAWHSKMTEVMERQEKDIIKIRQLLSD
AESMWKDLPHLADIKRSTNDTLEAIAALQRNVTDIMEKGVAKTNMKVKELGDRLVATNED
IQQSLTQGNTMSERAYTDLQRSYTNLREELQGFSKNEHVMLQTADNVIATKKRIEYGVHQ
ISLEVSELIRIQSNLLNKTMNERFDSIESSIVTNQSRAMNALSDKLETDMSQVWRQMGVV
YTQLTASRQALDKLSEQTAQYVNGSSSKLDSMKEKVSAITTRMSEVDDNLNYLLGRISLV
TQEFSLIKTGLGIALDKAKNGLDEVQAKLDDNSPGPHPVEVKAN
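
The reported CDS length can be cross-checked against the protein sequence with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the CDS length of 3375 includes the terminal stop codon (the nucleotide sequence above ends in TAA), and that the protein is laid out as 18 full lines of 60 residues plus a final line of 44. The function name `expected_protein_length` is hypothetical.

```python
def expected_protein_length(cds_length: int, has_stop_codon: bool = True) -> int:
    """Return the number of residues encoded by a CDS of the given length.

    Assumes a complete, in-frame CDS; the stop codon (if counted in the
    CDS length) does not contribute a residue.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
cds_length = 3375                # "CDS Length" field
protein_length = 18 * 60 + 44    # 18 full lines of 60 residues + 44 on the last line

predicted = expected_protein_length(cds_length)
print(predicted, protein_length, predicted == protein_length)
```

Under these assumptions, 3375 nt gives 1125 codons, one of which is the stop codon, so the predicted and observed protein lengths agree at 1124 residues.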