New model in OGS2.0 | DPOGS212670  |
---|---|
Genomic Position | scaffold2337:- 377-10985 |
See gene structure | |
CDS Length | 2931 |
Paired RNAseq reads   | 410 |
Single RNAseq reads   | 1533 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004432 (4e-98) |
Best Drosophila hit   | CG15011 (4e-28) |
Best Human hit | NF-X1-type zinc finger protein NFXL1 (3e-28) |
Best NR hit (blastp)   | nuclear transcription factor, X-box binding protein, putative [Pediculus humanus corporis] (9e-43) |
Best NR hit (blastx)   | PREDICTED: similar to ovarian zinc finger protein [Tribolium castaneum] (8e-49) |
GeneOntology terms    | GO:0003700 sequence-specific DNA binding transcription factor activity GO:0005634 nucleus GO:0006355 regulation of transcription, DNA-dependent GO:0008270 zinc ion binding |
InterPro families    | IPR000967 Zinc finger, NF-X1-type IPR015880 Zinc finger, C2H2-like IPR007087 Zinc finger, C2H2-type IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding IPR018908 Uncharacterised protein family UPF0546 |
Orthology group | MCL31134 |
Nucleotide sequence:
ATGGTTACATGCTTTGGTGAGCATGAAACAGATAACCAACCGTGTCACACTGCTTCAAGG
AAGCCGTGTGGAAGACAATGTGGTCGTCCATTAGCATGTGGAAACCATAAGTGTGAATTA
TCCTGTCATTTGTATGAGCCCAATGCAGATTATCCAAATGTACCATATACATGCAAACCA
TGTAACAGAGAGTGCTTGGTAGTCCGTCCACCGAAATGTACACACAAGTGTGCAAAACAG
GGCTGTCACCCTGGACCTTGTCCGCCATGTAATATACTAGAAAGGATACCCTGTCATTGT
GGTGTAACCGAGATATATGTGAGATGTCGTGAGTTACAGAGTGCTACAGAAGAAATGCTA
AGTTGCAAACAACAATGCCCTAAGAGTCTGGAATGTGGTCATAGATGTAAAAACCTGTGC
CATTCAGGTAGCTGTGGGCAGAATCAAGTATGCAACAAAAAGACTAAAATACACTGCCCA
TGTGGCAATTTAAAGAAAGAGGCAGCTTGCAAAGCTGTTAGGAATATGGAGGTGCAGGTC
ATTTGTGACGAGAGCTGTGAAGCCAAAAAAGTTGCTGCCCAATTAGAAAAAGAGAAAGAG
GCGAAAAGACTCAAGGAATTAGAAGAAGAACGGAATCGTAGAGAGTTAGAAGAATACACT
TGGAAATTAAGCGGCAAAAAGAAGAAATATAAAGAGAAGAAGATTGTCGTTGTTACTGAT
AACAGGAACTGGCTTCAGAAGTATTGGTTTCCTATTTTGTGTGTTTTGATTGGTTTGCTT
GTATTAACGGGGATATTATGGGGTTGTACAAACCCATTCATAAGACAAGGCACAAAAGGT
TTACGGAAAGTTTGCGCTAAAACGAAATTGGGCCAGGCTTACGCAGAGATTATTTTTCTC
TTAGGGAATTGGAGGTACGTTGTACCCTGGTTGATTAACCAATGCGGTTCGTTGGTGTAT
TTATCGGCTGTGCAGCGTGTGCCTTTGTCTCTTGCTGTGCCTACCGCCAACAGCCTTGCG
TTCGCCTTTACAGCACTAACGGGAGCAACGCTGGGTATTGAAGAGCCTTTGGATTTCGAA
ACTAAGCTTTACAACGTGCACCAATACCAACTGAAAACGTATTATGATGAAGTGATAAAT
AATATTAACTCCGATATGGATGATCTACCCCAATACTTCTGTTTCGAGTGTGCATATTTG
CTGTATAAGTTTCACAAGTTCAAAGAAAAGTGTAGTATTGGTTATAAAATACTGACAGAG
ATGTTGTGTCGAGGCCCAATAACCAACAACACAATTAACGATTCATATACATTTAATTCA
AATATGAAACCATCATTAAAGATAGTGAATGTGTTCGAAATTAACTATACTTTCAAAGAG
GAGCCGAAATATGAAGAAATTCATACAAAAGTTGAGAATGTTGATACTGAAATTGATAAC
AATGTATATACTGGTGAAGACAATCAGGATATAGATACAAATGCTAAATATATAGACTCA
CTTAATACAGAAGATGAGCTTAATATATATGAACATAAAAATCCAAATGAATCCAAATCA
CTACACACATATGATAATGAAAATGATAGTAAAAACGGAATACTAAGACATAAGATCTCA
TTGGATGAACCTTTTTGGAAGAAACATGAAATGTCAGAAGAAGAAGCAGTCGAGCAATTC
AAAGCAAGAGCGGAGAATGATAAATACAAAAGCGCACCATTCAAGTGTACAGATTGTTTT
AAAGGTTTCTCAAAAGAGAATATATTAAAGAGACATAGAATCTTGAGGCATAATGAAATA
TATAAAATAGAATGTCCATTTTGTCACATGCGTTTTAAATTAAATTGTTTCATGCAGAAA
CATTTGCGGGGCCATTACACTAAGTATGAATGCAGGAGGTGTAATGTGGTGTATCCCTTG
GAGGGTTCAGCATTGTTCCACGAGGAGTTCCACAGCGGTGTCATAAGAACCTGCAGACAC
TGCAATGAGGAGTTCCGTCACTCGTCAACATATTATTCTCACCTCCGAACACATAAGAGT
GAGTTCGTGTGTTGTGTGTGCGGGTCGTCGTTCGTCAGCGAGGCCGGACTCCATCAACAC
AAGCGGATCAAACACTGTGACAGTGTTGAATCCCCTGACGATGAGGAGATGAACACTTTT
TGCACTAAGTGTGACATCAGCTTTGAGAGCAGGCCGGCCTACGACGAGCATCTACTGCAT
TCCATCAAACACATAGAAGACATTGAGAATGATAATGAAGTTGTTCAAGACAAACGGAAG
AAGGTTTTGGGCAAGAGAATGAAGGAAAAGATAACCGGTCAATTGTCCAAGAAGTCAAGG
AATATGACAAAGTCGGAGAGGAAGAGTTGTAAGAGAAGAAGACAACCAAGGAAACCAACC
ACCTGTCATCAGTGCGGTAAACATTTCGACACCCAGGCGGCGTGCATGAAGCACCACGTG
ACGGAACATCCGCGGACGTCCTTCACGGCTCCACACCAGAGACACATCTGTGAGATATGC
GGAGCGTCCCTCGCGCCGGGGAGCGTCATCGCTCATCAGAACATGCACAGCAGAGAAAAG
GTCCACCCGTGTGAGACGTGCGGCAAACAGTTCTATACAACCATATCTCTCAAACGACAC
TCCGTGACTCACACCGGAGAGAAACCGTTCCCTTGTAGTTTATGCGACAAGAGGTTCACA
CAGAGCAACAGCATGAAACTCCACTACAGGACCTTCCATCTCAAACAACCTTACCCAAAA
AGAAACAGAAGAAAGAAAAAGATGAATGATAGCATGGAAGAATCTCACAGTGAAGACTCC
AGTGACGTGAAGACCAAGAAGAAAACAGTTCATGAACAGGGAGTCCAAGCCAGCGCTATC
ACTGTACAAGTTATAAGTGACACTAACAGCCTCTTCAACTTCTGTGGGTAG
Protein sequence:
MVTCFGEHETDNQPCHTASRKPCGRQCGRPLACGNHKCELSCHLYEPNADYPNVPYTCKP
CNRECLVVRPPKCTHKCAKQGCHPGPCPPCNILERIPCHCGVTEIYVRCRELQSATEEML
SCKQQCPKSLECGHRCKNLCHSGSCGQNQVCNKKTKIHCPCGNLKKEAACKAVRNMEVQV
ICDESCEAKKVAAQLEKEKEAKRLKELEEERNRRELEEYTWKLSGKKKKYKEKKIVVVTD
NRNWLQKYWFPILCVLIGLLVLTGILWGCTNPFIRQGTKGLRKVCAKTKLGQAYAEIIFL
LGNWRYVVPWLINQCGSLVYLSAVQRVPLSLAVPTANSLAFAFTALTGATLGIEEPLDFE
TKLYNVHQYQLKTYYDEVINNINSDMDDLPQYFCFECAYLLYKFHKFKEKCSIGYKILTE
MLCRGPITNNTINDSYTFNSNMKPSLKIVNVFEINYTFKEEPKYEEIHTKVENVDTEIDN
NVYTGEDNQDIDTNAKYIDSLNTEDELNIYEHKNPNESKSLHTYDNENDSKNGILRHKIS
LDEPFWKKHEMSEEEAVEQFKARAENDKYKSAPFKCTDCFKGFSKENILKRHRILRHNEI
YKIECPFCHMRFKLNCFMQKHLRGHYTKYECRRCNVVYPLEGSALFHEEFHSGVIRTCRH
CNEEFRHSSTYYSHLRTHKSEFVCCVCGSSFVSEAGLHQHKRIKHCDSVESPDDEEMNTF
CTKCDISFESRPAYDEHLLHSIKHIEDIENDNEVVQDKRKKVLGKRMKEKITGQLSKKSR
NMTKSERKSCKRRRQPRKPTTCHQCGKHFDTQAACMKHHVTEHPRTSFTAPHQRHICEIC
GASLAPGSVIAHQNMHSREKVHPCETCGKQFYTTISLKRHSVTHTGEKPFPCSLCDKRFT
QSNSMKLHYRTFHLKQPYPKRNRRKKKMNDSMEESHSEDSSDVKTKKKTVHEQGVQASAI
TVQVISDTNSLFNFCG