New model in OGS2.0 | DPOGS209014  |
---|---|
Genomic Position | scaffold831:- 30639-36670 |
See gene structure | |
CDS Length | 3426 |
Paired RNAseq reads   | 984 |
Single RNAseq reads   | 2543 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012553 (3e-83) |
Best Drosophila hit   | CG10979 (6e-34) |
Best Human hit | zinc finger protein 800 (8e-10) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC011044 [Tribolium castaneum] (3e-59) |
Best NR hit (blastx)   | conserved hypothetical protein [Culex quinquefasciatus] (5e-46) |
GeneOntology terms    | GO:0008270 zinc ion binding GO:0005622 intracellular |
InterPro families    | IPR007087 Zinc finger, C2H2-type IPR015880 Zinc finger, C2H2-like |
Orthology group | MCL31078 |
Nucleotide sequence:
ATGACAGTGACGTCTATTAACAAGTTTCGTCAAAATTTAAGGGAGAGACTGTCAGTCTGC
GATCAGTTTGATGTCAGAATGGCTGGTAACAAAATAAACACTAAGAAAAAGGAAGAAAAG
GGTAAATCAAACAGAATTGCTGGTCAATCTATTGAGGAAACAGAGGACCTTGACTTCTCC
TTGCTGAGAAAACCAATACATACTAGTGTTACAGGCTTTGCTCAAGCAAGAAAAGTTTTC
GACTTAGCCACTGAGGAGCTCAAAGGTTTACTCAGCAATGAATGTGACTTATTATATGAA
TGTAAAGTATGCAGAAATATATTCAGAAGTTTAGCCAATTTTATATCACATAAGCGAGTC
TACTGTAAAGAAAAGTTTAATCCCTCTGAACATGGACATTTCATTAAAAATACCTCCTCT
CTAAATGAAATTTTGAAAATACGAAAACTCGAAGAGAGCTATCAGGAAATATTAAGAAAA
GAAAATGACTCCAATGAAATGGATACAGAGGAGACGGAAGAAAGGATCCCACTCACAAAG
GATCTTACAGATATAATAGAAAGGATATCTAAAACTAAAGGAGTACAAAAGAAGCAATTA
AAGGAACAAAATTTAGTGTTTCAAAAAATACCAAAAAGTAATGTTGCTGTTTTCCAAAAT
ATTGAAAGTGATGTGAACAAAACTGATACAATGAAGGCTGAGGTTAGTGAATTGGACAAG
ATGTTGTCTCAAGAGAATGCAGTGTTACAAAGCGATGGCACATTCAAAGTACAAACGAAC
GACGTACAAATGAATACAGAAAATGTTATACAAATCAGTGACGATGAAGATAATGAAGGT
GCACCATACTCGGTCAAACATGGTGTACTGAAATGTGAAATTTGTGATTTACAATTTTCA
ACTCAGAAAACCTTAAAGTTCCATATGAAATATAAACATTTAGAGAGCCGTTTGGTCTAT
CCCTGTCCCGATTGCTTGGATATCTTCTCAACATCCTGGAGTGTTTATAGACATCTGTTC
AAAGTACACAGAAAAACAGCTGCTCAAATCCGTCGACTCCGAGAGTCTATACAAGCTAAG
GCATTTAAAATGAACAACCCGCCAGCATTTTACGAGAAACGGAAGTCCGTTTTGAAAAAT
TTTCCAGCTCAAAAAATAACAGAGGAAGAGAGAATCTATCAAGAGAATCAGTCTTGGGAG
TTGGAGGTGGAAGGCGAAGGTCGTCGTTGCGGTGGTTGTGGGCGGTCGTTCGAGCGTCGA
GCCGCGCTCGCAGCACACGCACACACGTGCGCTAGGAGACACACACGAAGAATACAGATA
CAGATTAGGAAGGACTATCACAAGGAACAGAGCGCGCCGTACCTCGTGATGAACAGAAAC
AATGAAAATAAACCATCCGAGGAAAAGTCTGAAAAGCCGCCAGAAAAAGAAATCAAAGAG
AAAGAACCGAAACCGCTCGAGGAAGCTGTGATGAGTACAGTTGATGTGAAACAACAAGAG
ACCGAGGATTACAGAGATGACGACACTCAAGACGTGCCAGCTGGGAACACTCTACAGTAT
TACACGAATACGCTTATAAATAAATTACCATTCGCCCAACAAGCGGAGAAGAGCAATCTG
AACGCGTTCAAAAAGAGATTACAATCAGATGTCGAAATAGATCAACTTTTATGTAAAAAA
TGTAATAGTAAATTTGAACAAATAGGTGAATTATTAGAACACGTCGCTGGACATTATAAA
TGGTTGCGCTACGCCTGTAAACTTTGCAACTTCAAGCATTTCAACTTTGATAAACTCCCG
GAACACGTTAAAGTTGTCCACAAACTCAAAGGCGATACTGATTTCTACTATAGTACCGTA
AAAGCCATAGACGGTTCGGAAGCCAGCGAACTATCTTCCCCCGTGGAAGAATTAACCGAA
TCTAATGAAACTAGTCCAGATTCACGACGTCCAAGCAGATGTTCTAGTGACTCCAGCAGA
TTATCTGACGATAGCTCCTCCAGCAGTACACGAGTCGAAACCGGTTCGAGAAAACGCAAA
GCACGACTGGTCAAAAACATCGGAAAGAAGAAAAAGGATACTGTTGTTATAGATGACAAC
GAAGAAAGTAAAGAGGTTATGCATAAAGGAGTTTTGTTAGGAGAAAATGATTCGTCCTCC
AATTCAAAAATATTCGAAGAAAATTCATCAGATTTGGATGAAGTTGATGAGAAAATAGCA
AAGCGCGAAAACATGACATCCGTAGCATGCCGTAGACCAGTTCGTAAGAAAACTAAACGC
AAGAACGAAGATTTCGAATACGATCTGTCGAATTTGTTAAAAATGGAAGCGCAGGGCTAT
CGCGATTCACAAGTCACACCAAAAACTGCTCCTTCTAAGAAGAAAGTACAACAAGATGTT
AACCCTCAGTACGAGCTCATCAACAAAGAGTGTTGTGGTGCACTAGTGACGATGTCGAGG
TCCTCGGTAGAAAAAGCTCAAGCCCATATGAAGACTGCAACCTTTGCTGTGTTTAACACT
TCAAAAGAACCTCGTGTATCAAATATTTTTGTGAGGCCTCTGGTGCCTAAAATTAATAGA
GTAGATAAAATATCGCCTAAGAAGGCTGAAAATGAAGAAACAAAAGAAATCTCCCACCCT
AGTCCCACTAAAATAATAGACGCCTCCACTCTATCAAATCTCTGTAAGGAATTGGTGATA
ACTAAAGTTGTAAATAAAAAATGTGAGGAAAAAGAAGCAAATGTATCCGCTAATGAAACT
CCAAAAGAAACCGAGCCGATACCTCAAGTAGATAATAAAACGGTCAGCGACGACAGTAAA
GAGAAAAAGGAAGAAAAGAATAAGGTATCTGAAATAGAAGCAAAATCTGACGAGAGTGCC
TCATCCGAACAAACTAAAACTAATGTGAATGTACCAACAATACTTCCTATAAAATTCCGA
AGACAAAGTTTGGAGGTTATACAAAATCCCTTAATAAAGAAAAATATCACAGACTTCACA
AAAGCCGGTATGAAAACTAAAATTTTGGTAATCAAACCCATCAATAGGAGCACCGATGGA
ACAAAAACACTGAAATTTCAAACAATAAAATTGAAAGATCCGAACAAGACCACCACGAAA
AATGATGAAATGAAAACCGAACAGGTCGTCGTTGTGAAAGTTCCCAAAGTGGATTGTTCT
ATAAGCAGATCAATACCAGCCAGCGACGCCCCTGTGGCACTCGACGAGAAATGTGATGAG
AATGAAAACGAAAAAGTTAAAACGAATGCTGCAAATCCATCAAATCCTACCGGTGAAAAC
AGTGTGGAAGAACCTAAAAAAGACATTAAAATAGAAAATGACATAACTGACTTGGTGGAA
GACAAACCCGAATCAAAATTAATAGAATGTATAGAATTGGAAGAGGCCGTGATGCAATCT
GGTTGA
Protein sequence:
MTVTSINKFRQNLRERLSVCDQFDVRMAGNKINTKKKEEKGKSNRIAGQSIEETEDLDFS
LLRKPIHTSVTGFAQARKVFDLATEELKGLLSNECDLLYECKVCRNIFRSLANFISHKRV
YCKEKFNPSEHGHFIKNTSSLNEILKIRKLEESYQEILRKENDSNEMDTEETEERIPLTK
DLTDIIERISKTKGVQKKQLKEQNLVFQKIPKSNVAVFQNIESDVNKTDTMKAEVSELDK
MLSQENAVLQSDGTFKVQTNDVQMNTENVIQISDDEDNEGAPYSVKHGVLKCEICDLQFS
TQKTLKFHMKYKHLESRLVYPCPDCLDIFSTSWSVYRHLFKVHRKTAAQIRRLRESIQAK
AFKMNNPPAFYEKRKSVLKNFPAQKITEEERIYQENQSWELEVEGEGRRCGGCGRSFERR
AALAAHAHTCARRHTRRIQIQIRKDYHKEQSAPYLVMNRNNENKPSEEKSEKPPEKEIKE
KEPKPLEEAVMSTVDVKQQETEDYRDDDTQDVPAGNTLQYYTNTLINKLPFAQQAEKSNL
NAFKKRLQSDVEIDQLLCKKCNSKFEQIGELLEHVAGHYKWLRYACKLCNFKHFNFDKLP
EHVKVVHKLKGDTDFYYSTVKAIDGSEASELSSPVEELTESNETSPDSRRPSRCSSDSSR
LSDDSSSSSTRVETGSRKRKARLVKNIGKKKKDTVVIDDNEESKEVMHKGVLLGENDSSS
NSKIFEENSSDLDEVDEKIAKRENMTSVACRRPVRKKTKRKNEDFEYDLSNLLKMEAQGY
RDSQVTPKTAPSKKKVQQDVNPQYELINKECCGALVTMSRSSVEKAQAHMKTATFAVFNT
SKEPRVSNIFVRPLVPKINRVDKISPKKAENEETKEISHPSPTKIIDASTLSNLCKELVI
TKVVNKKCEEKEANVSANETPKETEPIPQVDNKTVSDDSKEKKEEKNKVSEIEAKSDESA
SSEQTKTNVNVPTILPIKFRRQSLEVIQNPLIKKNITDFTKAGMKTKILVIKPINRSTDG
TKTLKFQTIKLKDPNKTTTKNDEMKTEQVVVVKVPKVDCSISRSIPASDAPVALDEKCDE
NENEKVKTNAANPSNPTGENSVEEPKKDIKIENDITDLVEDKPESKLIECIELEEAVMQS
G