DPGLEAN17928 in OGS1.0

New model in OGS2.0DPOGS212670 
Genomic Positionscaffold2337:- 377-10985
See gene structure
CDS Length2931
Paired RNAseq reads  410
Single RNAseq reads  1533
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004432 (4e-98)
Best Drosophila hit  CG15011 (4e-28)
Best Human hitNF-X1-type zinc finger protein NFXL1 (3e-28)
Best NR hit (blastp)  nuclear transcription factor, X-box binding protein, putative [Pediculus humanus corporis] (9e-43)
Best NR hit (blastx)  PREDICTED: similar to ovarian zinc finger protein [Tribolium castaneum] (8e-49)
GeneOntology terms


  
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005634 nucleus
GO:0006355 regulation of transcription, DNA-dependent
GO:0008270 zinc ion binding
InterPro families



  
IPR000967 Zinc finger, NF-X1-type
IPR015880 Zinc finger, C2H2-like
IPR007087 Zinc finger, C2H2-type
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR018908 Uncharacterised protein family UPF0546
Orthology groupMCL31134

Nucleotide sequence:

ATGGTTACATGCTTTGGTGAGCATGAAACAGATAACCAACCGTGTCACACTGCTTCAAGG
AAGCCGTGTGGAAGACAATGTGGTCGTCCATTAGCATGTGGAAACCATAAGTGTGAATTA
TCCTGTCATTTGTATGAGCCCAATGCAGATTATCCAAATGTACCATATACATGCAAACCA
TGTAACAGAGAGTGCTTGGTAGTCCGTCCACCGAAATGTACACACAAGTGTGCAAAACAG
GGCTGTCACCCTGGACCTTGTCCGCCATGTAATATACTAGAAAGGATACCCTGTCATTGT
GGTGTAACCGAGATATATGTGAGATGTCGTGAGTTACAGAGTGCTACAGAAGAAATGCTA
AGTTGCAAACAACAATGCCCTAAGAGTCTGGAATGTGGTCATAGATGTAAAAACCTGTGC
CATTCAGGTAGCTGTGGGCAGAATCAAGTATGCAACAAAAAGACTAAAATACACTGCCCA
TGTGGCAATTTAAAGAAAGAGGCAGCTTGCAAAGCTGTTAGGAATATGGAGGTGCAGGTC
ATTTGTGACGAGAGCTGTGAAGCCAAAAAAGTTGCTGCCCAATTAGAAAAAGAGAAAGAG
GCGAAAAGACTCAAGGAATTAGAAGAAGAACGGAATCGTAGAGAGTTAGAAGAATACACT
TGGAAATTAAGCGGCAAAAAGAAGAAATATAAAGAGAAGAAGATTGTCGTTGTTACTGAT
AACAGGAACTGGCTTCAGAAGTATTGGTTTCCTATTTTGTGTGTTTTGATTGGTTTGCTT
GTATTAACGGGGATATTATGGGGTTGTACAAACCCATTCATAAGACAAGGCACAAAAGGT
TTACGGAAAGTTTGCGCTAAAACGAAATTGGGCCAGGCTTACGCAGAGATTATTTTTCTC
TTAGGGAATTGGAGGTACGTTGTACCCTGGTTGATTAACCAATGCGGTTCGTTGGTGTAT
TTATCGGCTGTGCAGCGTGTGCCTTTGTCTCTTGCTGTGCCTACCGCCAACAGCCTTGCG
TTCGCCTTTACAGCACTAACGGGAGCAACGCTGGGTATTGAAGAGCCTTTGGATTTCGAA
ACTAAGCTTTACAACGTGCACCAATACCAACTGAAAACGTATTATGATGAAGTGATAAAT
AATATTAACTCCGATATGGATGATCTACCCCAATACTTCTGTTTCGAGTGTGCATATTTG
CTGTATAAGTTTCACAAGTTCAAAGAAAAGTGTAGTATTGGTTATAAAATACTGACAGAG
ATGTTGTGTCGAGGCCCAATAACCAACAACACAATTAACGATTCATATACATTTAATTCA
AATATGAAACCATCATTAAAGATAGTGAATGTGTTCGAAATTAACTATACTTTCAAAGAG
GAGCCGAAATATGAAGAAATTCATACAAAAGTTGAGAATGTTGATACTGAAATTGATAAC
AATGTATATACTGGTGAAGACAATCAGGATATAGATACAAATGCTAAATATATAGACTCA
CTTAATACAGAAGATGAGCTTAATATATATGAACATAAAAATCCAAATGAATCCAAATCA
CTACACACATATGATAATGAAAATGATAGTAAAAACGGAATACTAAGACATAAGATCTCA
TTGGATGAACCTTTTTGGAAGAAACATGAAATGTCAGAAGAAGAAGCAGTCGAGCAATTC
AAAGCAAGAGCGGAGAATGATAAATACAAAAGCGCACCATTCAAGTGTACAGATTGTTTT
AAAGGTTTCTCAAAAGAGAATATATTAAAGAGACATAGAATCTTGAGGCATAATGAAATA
TATAAAATAGAATGTCCATTTTGTCACATGCGTTTTAAATTAAATTGTTTCATGCAGAAA
CATTTGCGGGGCCATTACACTAAGTATGAATGCAGGAGGTGTAATGTGGTGTATCCCTTG
GAGGGTTCAGCATTGTTCCACGAGGAGTTCCACAGCGGTGTCATAAGAACCTGCAGACAC
TGCAATGAGGAGTTCCGTCACTCGTCAACATATTATTCTCACCTCCGAACACATAAGAGT
GAGTTCGTGTGTTGTGTGTGCGGGTCGTCGTTCGTCAGCGAGGCCGGACTCCATCAACAC
AAGCGGATCAAACACTGTGACAGTGTTGAATCCCCTGACGATGAGGAGATGAACACTTTT
TGCACTAAGTGTGACATCAGCTTTGAGAGCAGGCCGGCCTACGACGAGCATCTACTGCAT
TCCATCAAACACATAGAAGACATTGAGAATGATAATGAAGTTGTTCAAGACAAACGGAAG
AAGGTTTTGGGCAAGAGAATGAAGGAAAAGATAACCGGTCAATTGTCCAAGAAGTCAAGG
AATATGACAAAGTCGGAGAGGAAGAGTTGTAAGAGAAGAAGACAACCAAGGAAACCAACC
ACCTGTCATCAGTGCGGTAAACATTTCGACACCCAGGCGGCGTGCATGAAGCACCACGTG
ACGGAACATCCGCGGACGTCCTTCACGGCTCCACACCAGAGACACATCTGTGAGATATGC
GGAGCGTCCCTCGCGCCGGGGAGCGTCATCGCTCATCAGAACATGCACAGCAGAGAAAAG
GTCCACCCGTGTGAGACGTGCGGCAAACAGTTCTATACAACCATATCTCTCAAACGACAC
TCCGTGACTCACACCGGAGAGAAACCGTTCCCTTGTAGTTTATGCGACAAGAGGTTCACA
CAGAGCAACAGCATGAAACTCCACTACAGGACCTTCCATCTCAAACAACCTTACCCAAAA
AGAAACAGAAGAAAGAAAAAGATGAATGATAGCATGGAAGAATCTCACAGTGAAGACTCC
AGTGACGTGAAGACCAAGAAGAAAACAGTTCATGAACAGGGAGTCCAAGCCAGCGCTATC
ACTGTACAAGTTATAAGTGACACTAACAGCCTCTTCAACTTCTGTGGGTAG

Protein sequence:

MVTCFGEHETDNQPCHTASRKPCGRQCGRPLACGNHKCELSCHLYEPNADYPNVPYTCKP
CNRECLVVRPPKCTHKCAKQGCHPGPCPPCNILERIPCHCGVTEIYVRCRELQSATEEML
SCKQQCPKSLECGHRCKNLCHSGSCGQNQVCNKKTKIHCPCGNLKKEAACKAVRNMEVQV
ICDESCEAKKVAAQLEKEKEAKRLKELEEERNRRELEEYTWKLSGKKKKYKEKKIVVVTD
NRNWLQKYWFPILCVLIGLLVLTGILWGCTNPFIRQGTKGLRKVCAKTKLGQAYAEIIFL
LGNWRYVVPWLINQCGSLVYLSAVQRVPLSLAVPTANSLAFAFTALTGATLGIEEPLDFE
TKLYNVHQYQLKTYYDEVINNINSDMDDLPQYFCFECAYLLYKFHKFKEKCSIGYKILTE
MLCRGPITNNTINDSYTFNSNMKPSLKIVNVFEINYTFKEEPKYEEIHTKVENVDTEIDN
NVYTGEDNQDIDTNAKYIDSLNTEDELNIYEHKNPNESKSLHTYDNENDSKNGILRHKIS
LDEPFWKKHEMSEEEAVEQFKARAENDKYKSAPFKCTDCFKGFSKENILKRHRILRHNEI
YKIECPFCHMRFKLNCFMQKHLRGHYTKYECRRCNVVYPLEGSALFHEEFHSGVIRTCRH
CNEEFRHSSTYYSHLRTHKSEFVCCVCGSSFVSEAGLHQHKRIKHCDSVESPDDEEMNTF
CTKCDISFESRPAYDEHLLHSIKHIEDIENDNEVVQDKRKKVLGKRMKEKITGQLSKKSR
NMTKSERKSCKRRRQPRKPTTCHQCGKHFDTQAACMKHHVTEHPRTSFTAPHQRHICEIC
GASLAPGSVIAHQNMHSREKVHPCETCGKQFYTTISLKRHSVTHTGEKPFPCSLCDKRFT
QSNSMKLHYRTFHLKQPYPKRNRRKKKMNDSMEESHSEDSSDVKTKKKTVHEQGVQASAI
TVQVISDTNSLFNFCG