New model in OGS2.0 | DPOGS205369  |
---|---|
Genomic Position | scaffold100:- 86972-94172 |
See gene structure | |
CDS Length | 3279 |
Paired RNAseq reads   | 430 |
Single RNAseq reads   | 1102 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008762 (3e-24) |
Best Drosophila hit   | crooked legs, isoform A (4e-18) |
Best Human hit | zinc finger protein 208 (3e-24) |
Best NR hit (blastp)   | hypothetical protein BRAFLDRAFT_71028 [Branchiostoma floridae] (1e-47) |
Best NR hit (blastx)   | hypothetical protein BRAFLDRAFT_71028 [Branchiostoma floridae] (1e-62) |
GeneOntology terms    | GO:0003676 nucleic acid binding GO:0008270 zinc ion binding GO:0005622 intracellular |
InterPro families    | IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding IPR007087 Zinc finger, C2H2-type IPR015880 Zinc finger, C2H2-like |
Orthology group | MCL40326 |
Nucleotide sequence:
ATGCACGAGGAACATCAGACTTTTAAAGTCGAAACAGCGTTCGCACATTGCAATGAGGGA
TACTTGAAAGCTGATTGCACAGACCTCAAGTGCAGAATCTGTTGGGAACCATTTAAAAAG
TTGGATGATGTCGCCAAACATATTAACGATGTCCATAATATTAAAATACTATTTGAATTT
CACATTGGCATACAACCTTTTAAGTTCGATGATGAAAAGCTTTTATGCGGTATATGTGAT
AGGAATTTCCCTTGTCTCCGGCAGCTTAGTCGTCACATGACTTCCCATTACCAGAACTAT
ACCTGCGAAGAGTGCGGGAAGTCTTATACTACAAACAGCTCTCTACAGCAGCATATCAGA
TTTTCGCATATATCAAACAAAAGGATTTGCAGAAGGTGTAAGAAGACATTTAATTCGTTG
CTAGATAAAAGGGAACATGTTAAATCTTCATCAAAATGTTGGGCTTACCAGTGCGCCGTT
TGTGGTGTACGATTTATGACGTGGACTCTGAAAGAACAGCATTTAGAAAATGTTCATGGA
CAAGCTAAAAAGACACATAAGTGTCCCGAATGCGCAACAATATTCTTGACCAGGAACGCC
TACAGAAATCATTTTGCGACAATGCACGCGGGAATTAATTTCGTTTGCTCTTATTGCGGA
CCCACGAAGGATATTGACGGACTCAAAGAGAAAGAGAGAAAACAAAAAAAGGGGTTATTC
GAAAGATCCGTTAAACATAACCCCCAGCGTCGAAACGCTGTTTTGGTTCTTAGACATTCC
ACGGCTATTCCTTTTAAAACACGATTTAATAGAATACTCTGCTCTTATTGCCATGATGAA
TTCCAGCCCATGGAGGCTCTTAGAATACATATTAAGGAGAAACACATTAACGCTGATTTT
AACAGTGCTTTTTATAAGGTGGTCGATGATCTCAAAATTGATATAAGCCATTTCAAATGT
AACATATGCTCGCAAGACATTGAGAACGTTGACACATTTATGAATCATTTATCAGGGGAT
CATGGGAAACCAGTTAATTTTGATGTACCTTTCGGTGTGTTACCGTATAGACAGAATGAG
ACTGGTGCTTGGCTGTGCCTTCACTGTGATAAAATATATCCGGAATTCTCCCAAATAAAC
AGCCATTTACGAACCCACGCCAAAATTTCTACTTGCGATAAGTGCGGGGCGACTTTCCTC
TCGGAGCACGGTCTAAAACAACACGAGCGTAATTTCCAATGCTATAAAGCAACATACAAA
CCTCGCTTCGGTAAAGCCTTGAAGCATAAATACAATACTGAAATTATTTTACAATGTTCA
ACTGCATGTCCTTTCAGAACGTGGGGACAAAATTTTAACTGTGTCTTTTGCAGAGTGCAA
TCAAATGATCCCAATGGGCTACGAGCTCATATGGCATCCAGACATGCCAACTTTGACATA
CAACTAGTATTTAGCAGGAAATTACGAAAGGAATTTTTAAAAGTCGATATAACAGATTTG
CAATGTAAACTTTGCTTCATGCACATTGACACTTTAGATGATTTATTGACACATCTCAAA
AATGATCACAAACAACCGGTGAACATAGACGTCCAACCGGGGGTCTTGCCGTTCAAGTTG
AACGACGGCTCTTGTTGGAAATGTGCTATATGCAAAATACAGTTCTCCGATTTCATATCG
TTAAAAAAACACACAGCGGAACACTATCAGAACTACGTTTGCGACACATGTGGGGAGGGT
TTCATAACAGAAGTCGCATTGCGGGCGCACACGAAAATACCGCATGATAATAAATACACC
TGCAGTAGATGCGTTGCGACGTTCTCCACGTTAGAAGAGAGAAGTGTTCACATAAAAACA
CAACACACGAACCTACCGTACATGTGTACTTATTGCAAGGACAAACCGCGGTTCGCCACC
TGGGAGCTCAGGAAACGGCATTTATATGAGATACACAATTATAAATCAGGGGCGGAAATG
TACGAGTGTACCACCTGTCACATGATGTTCAAGACGCGATCTCAGAAATACCACCACAAC
GTCAAAGTTCATCGGACAAAAAAGGAAATAGATTTCGGTTTCTCTTGCGGCCACTGCGCT
AGAGGAAGAGGCAGAGAAACAGAAAATATCACTGATATAAACATACTCCAACAGTCACAT
CTCGATACAGATAATGACATTAAGGTTGGAGAGAAACGAAAGTATCAAAAAAGTGCTAGA
TCCCAAGCGAGATTCATGACAAAGAAGAACGCAAGCTCTATTCTCGAATGCTGGTCTGGA
ATACCATTCAGATGGAAAAAAAATAGATTTAAATGCGCCTATTGTGAAGAAAATTTTAAT
GAGTGTTCGGATTTGAGGGAGCATGTTAGATTATGTGCTACCCAGTACAATGTAGGCAGT
ATATTCAGTAAATTTAAAGAAATGACTCTCATAAATATGGATGTCAGTGAGGCCGCTTGT
CGAATATGTTCTGAGCCGTTCAGAGAACTTGATGGTATGCGAGAGCACGTCATTCGACAC
GGCTACGAATTAGATGTTTCGCATCCGGACGGTGTTATACCGTTTTGTCTCACGAAAGAA
TCCTGGTCGTGTGTCTTATGTCGCGAGACATTCAATAACTTTCTGAAACTTTACGAGCAT
ATGAACACGCATTATCAGTACCACATATGTTCTATATGCGGAAAAGGTTACATGACTGGA
CCGAGGCTAAGGAAACATCTAGAATTACACATAACGGGAACATTTCCTTGCGATAAATGC
AAGAAAGTTTTCACAAAACGCACAGGGAGAGACAATCACAAAGCCTACGCCCACGCCAAA
GGTCCGCGTTATGAATGCCCACAATGTAATATGAGATTTGAAGGTTATTATGATCGAATG
AATCATTTGAAACAAGCTCACAGAGAAAAGGAAGTGAAGTACGGCTGTTCACACTGTGAT
CTGTCATTTAAAACGAGCGGCAAGCGAGCCATCCACGTCAAAACGGTTCACTTTCCTCGT
CAGAGTAACTTTAGTTGCCCTTATTGCAAAACCCTATTCAAAACAGCCTTCGGTATGAAA
CGTCACATGGTAAAACATAATGGAGAAACCTGTACTGTTTGTGGTGAAAGTTTTACTAAA
AGTAAAGCATTGAAGGAACACTTGGCAGGTCACGCTGATGGACTTCGTTGTAAATGGTGC
GGAAATAATTTTAAGGAAGCAGCACTTCTGTCGACACATACGCGTGAGAAGCATCCAGAA
GTAGACGAATTGATGATGTCCGGGCTGGTTAATGTTTAG
Protein sequence:
MHEEHQTFKVETAFAHCNEGYLKADCTDLKCRICWEPFKKLDDVAKHINDVHNIKILFEF
HIGIQPFKFDDEKLLCGICDRNFPCLRQLSRHMTSHYQNYTCEECGKSYTTNSSLQQHIR
FSHISNKRICRRCKKTFNSLLDKREHVKSSSKCWAYQCAVCGVRFMTWTLKEQHLENVHG
QAKKTHKCPECATIFLTRNAYRNHFATMHAGINFVCSYCGPTKDIDGLKEKERKQKKGLF
ERSVKHNPQRRNAVLVLRHSTAIPFKTRFNRILCSYCHDEFQPMEALRIHIKEKHINADF
NSAFYKVVDDLKIDISHFKCNICSQDIENVDTFMNHLSGDHGKPVNFDVPFGVLPYRQNE
TGAWLCLHCDKIYPEFSQINSHLRTHAKISTCDKCGATFLSEHGLKQHERNFQCYKATYK
PRFGKALKHKYNTEIILQCSTACPFRTWGQNFNCVFCRVQSNDPNGLRAHMASRHANFDI
QLVFSRKLRKEFLKVDITDLQCKLCFMHIDTLDDLLTHLKNDHKQPVNIDVQPGVLPFKL
NDGSCWKCAICKIQFSDFISLKKHTAEHYQNYVCDTCGEGFITEVALRAHTKIPHDNKYT
CSRCVATFSTLEERSVHIKTQHTNLPYMCTYCKDKPRFATWELRKRHLYEIHNYKSGAEM
YECTTCHMMFKTRSQKYHHNVKVHRTKKEIDFGFSCGHCARGRGRETENITDINILQQSH
LDTDNDIKVGEKRKYQKSARSQARFMTKKNASSILECWSGIPFRWKKNRFKCAYCEENFN
ECSDLREHVRLCATQYNVGSIFSKFKEMTLINMDVSEAACRICSEPFRELDGMREHVIRH
GYELDVSHPDGVIPFCLTKESWSCVLCRETFNNFLKLYEHMNTHYQYHICSICGKGYMTG
PRLRKHLELHITGTFPCDKCKKVFTKRTGRDNHKAYAHAKGPRYECPQCNMRFEGYYDRM
NHLKQAHREKEVKYGCSHCDLSFKTSGKRAIHVKTVHFPRQSNFSCPYCKTLFKTAFGMK
RHMVKHNGETCTVCGESFTKSKALKEHLAGHADGLRCKWCGNNFKEAALLSTHTREKHPE
VDELMMSGLVNV