DPGLEAN04396 in OGS1.0

New model in OGS2.0DPOGS215634 
Genomic Positionscaffold2955:+ 6323-13360
See gene structure
CDS Length2796
Paired RNAseq reads  1217
Single RNAseq reads  2728
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003534 (3e-27)
Best Drosophila hit  CG13350 (3e-142)
Best Human hitWD repeat and HMG-box DNA-binding protein 1 isoform 1 (4e-109)
Best NR hit (blastp)  acidic nucleoplasmic DNA-binding protein 1 [Bombyx mori] (0.0)
Best NR hit (blastx)  acidic nucleoplasmic DNA-binding protein 1 [Bombyx mori] (0.0)
GeneOntology terms  GO:0003677 DNA binding
InterPro families





  
IPR015943 WD40/YVTN repeat-like-containing domain
IPR019775 WD40 repeat, conserved site
IPR011046 WD40 repeat-like-containing domain
IPR001680 WD40 repeat
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR019781 WD40 repeat, subgroup
Orthology groupMCL14314

Nucleotide sequence:

ATGAAGATTGATAGTAAACCACTTCGATACGCTCATGCTGAGGGTCACACAGATGTATGC
TATACTGAGGATGGAAAACATATCATTACTTGTGGTCACGACGGAGACGTAAGAATTTGG
CTGGATATAGAAGATGACGACCCAAACTCTCACTGTGTTGGTGAAAGTGCTTTGGCCGTT
TGCTTCAAAGATGGCAGGTTGTATGTGGCCACTGATAATCATGCTGTTCAAGCATACACA
TTCCCGGCATTTGATAAAGATGGGATAATAACAAGGTTTACAGCACCAGTCACTCAAATT
ATGTCTAAACCTAAAGTAGAAGCACTGGGTTGTTCTTCTGAGAATATGGAAGCGAAAATT
TGCAATTTGGAAGGTGGTGCCCCATTATTTGTCATGTCCGAACAAAAAGGCCCAGTACTT
AGTATTGCAATATGTCCAAACATGAAGCATGCTTGTACAGCATCAGGTGATGGAATTATG
AGGGTCTGGGATATTGATACACAGAAAGTTATGAAAGAACTCTCCTGTGTTCCCAAGATA
AACACTTTCTATTCAGCTAAAGTGCTCTGTAGAATGGATTTTGAGCCACAAGAAGGAAAA
TCCTTAGCTTATCCCAATAATAGAGAGATCATTATATTGGATTGTGAATCATGGAATCAG
AGAGTAGCGTTCACACATAGTACAATTAAATGTGCTATTTCACAATGCCTTTTCTCTCCA
TCTGGTCAATATCTGGCGGGGAGCACAGTGGCTGGTCAGATAGCTGTCTGGGAGGTGGAT
TCCGGAGCCTGTATAGACATCATTGAACATCCCACATCACATAATGTGTCTTCTATGACT
TGGAATCCTAAAGATAATGGTGTTCTGGTGTACTGTGATGTTGCTGGTCAAATGGGTATG
TTGGTCAACTGTTATGGGAAGGACAGTAACAAAATTGGTGATGATAATACAGATGTGGAG
ATGGTCGAAAGAGATGACGAAAATGACTTGGACAATTTAATAGAGAATTATGAAAGTGAT
GACGATAATGCTATATCCTTGGAAAGAATCAAGAATCAGACGCTAAGGATGGTCGACCAG
GAGGATTCCAGGCCTGTGTCAAGAGCTACAGTGGTTCCGCAGAGTACATCTGCACAACTG
GCATTCCAACCGTCCTCAACACCTGTACATTTAGAACATAGATACATGTGTTGGAACGAT
ATCGGCATAGTCAGATGTCATACAGCCGAGAACGGCGAGTCTACGATAGATGTCGAATTT
CATGACTCCAACCTACACCATGGCATACATTTAAACAACTATTTAAATCACACCATGGCT
AGTCTGTCCGCCAATGTTCTGGTTTTAGCCTGTGAGACACCGAGCAAACTGGTGTGTATA
TCGCTAGCTGGTAGCAGTCGTGAGTGGAGCGTCTCCATGACGGAGACGGAGGAGGTGGTG
TGCGTCGGCGCTGGCGGAGTGGTCGCGTGCTGCACCAGCGCTAGACTACTGAGGCTGTAC
ACGCCGCTGGGAACCCAGAGACAGGTGATATGTCTCTCGGGGCCGGCTGTGACGCTCGCG
TGTTACAACACTACAGTCGCTGCCGTGTACCACAGGACGGACCCCGGACTGACTGACCAA
CATCTCGCTATGGACATCATTGCTCTTAATGGTCGTCAGGTGCGCAGTAAAACAGTACCG
GTGCCTCTATCGGCGGGGGCCAAGCTATCCTGGCTGGGCGTGTCAGACGCGGGGTCCCCC
TGTGCCCACGACTCCGCTGGGGTCCTACGACTCTATGACGTCACTAGCGCCCTCTGGTTA
CCAGTCTGCGACACCAGCCATCATTCTAAGGGGGTTTCGGACTCGTGGTTTATTGTGTCC
GTCAGTGAACCGACACAGAAAGTTCGAGCCATATTATGTAGAGGTGCCTCATTCCCGTTG
ACAGCACCGAAACCTATCATCTCTGAGTTAGCGATACAGATACCGTTATGTGAGTTAGAT
ACGGAAAAGGCGCAGTACGAGGAGCAGTTGGTGCGATGGGCGCATATGACGTCAGATGTT
GACATTAAAACCGCCAGAGAGACCGCCTTGAAATTGTTTGCTTTGGCGTGTCGCAGTGAA
ATAGAGCAAAGGGCGTTGGAACTAATGGAATTATTGAGAGACGATCGTCTGATACCTCTC
GCTGCGAAATACGCGTCGCGGCTAGGGCGGATACATCTCGCTGAGAAATTAACGAGCCTC
GCTGAGACCTGGGAGAGTGACGCGAGTAAGGTTAACGAAGCCCAAACGACGCACTTCCGA
GAGCTGGACACACAGGAAACTTATGACGTCACAGAACAACACGAGGACCTGAACACGAGC
TTAATAATACCACAGAAAACAACGAAAAAGAAAGAGACGGAAACTGCTTTAAAACCTGTA
CCAATCAAATCATCGCCCGGCGGCGCCAGGAATCCATTTAAGAAACATCTAGATAACAAA
CCTAGCCAGAGTCCACTCAGCCTCACAGAACGAACGCTGGTCGAAGTACATCAACAACAA
ACTGACGATACCGAAAACAGTGACTCTCTAAAACCGGCAGACGGTGAGACGTTTGTAGAA
TGGTTTTCAAGGAACAAGTCCATATTAGAAAAACAAAATCCGGATCTTACACCCGCGGAA
TTAACGAGACACTGCGTTAGGACGTTCAAGAGTTCGCAGAACAAAGTGCTAGAGAATGGC
CAGAAGAGAAAATTGGGAGAAGACGATGAGACCCCGGCCACTAGCGCGCCCAAGCAGTCC
AAGCTAAGCGCATTCGCATTCTCCAAAAAGACTTGA

Protein sequence:

MKIDSKPLRYAHAEGHTDVCYTEDGKHIITCGHDGDVRIWLDIEDDDPNSHCVGESALAV
CFKDGRLYVATDNHAVQAYTFPAFDKDGIITRFTAPVTQIMSKPKVEALGCSSENMEAKI
CNLEGGAPLFVMSEQKGPVLSIAICPNMKHACTASGDGIMRVWDIDTQKVMKELSCVPKI
NTFYSAKVLCRMDFEPQEGKSLAYPNNREIIILDCESWNQRVAFTHSTIKCAISQCLFSP
SGQYLAGSTVAGQIAVWEVDSGACIDIIEHPTSHNVSSMTWNPKDNGVLVYCDVAGQMGM
LVNCYGKDSNKIGDDNTDVEMVERDDENDLDNLIENYESDDDNAISLERIKNQTLRMVDQ
EDSRPVSRATVVPQSTSAQLAFQPSSTPVHLEHRYMCWNDIGIVRCHTAENGESTIDVEF
HDSNLHHGIHLNNYLNHTMASLSANVLVLACETPSKLVCISLAGSSREWSVSMTETEEVV
CVGAGGVVACCTSARLLRLYTPLGTQRQVICLSGPAVTLACYNTTVAAVYHRTDPGLTDQ
HLAMDIIALNGRQVRSKTVPVPLSAGAKLSWLGVSDAGSPCAHDSAGVLRLYDVTSALWL
PVCDTSHHSKGVSDSWFIVSVSEPTQKVRAILCRGASFPLTAPKPIISELAIQIPLCELD
TEKAQYEEQLVRWAHMTSDVDIKTARETALKLFALACRSEIEQRALELMELLRDDRLIPL
AAKYASRLGRIHLAEKLTSLAETWESDASKVNEAQTTHFRELDTQETYDVTEQHEDLNTS
LIIPQKTTKKKETETALKPVPIKSSPGGARNPFKKHLDNKPSQSPLSLTERTLVEVHQQQ
TDDTENSDSLKPADGETFVEWFSRNKSILEKQNPDLTPAELTRHCVRTFKSSQNKVLENG
QKRKLGEDDETPATSAPKQSKLSAFAFSKKT