DPGLEAN05604 in OGS1.0

New model in OGS2.0DPOGS209531 
Genomic Positionscaffold2488:+ 6693-10329
See gene structure
CDS Length1923
Paired RNAseq reads  930
Single RNAseq reads  2256
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013837 (0.0)
Best Drosophila hit  CG9062 (0.0)
Best Human hitWD repeat-containing protein 48 (0.0)
Best NR hit (blastp)  PREDICTED: similar to CG9062 CG9062-PB [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to CG9062 CG9062-PB [Tribolium castaneum] (0.0)
GeneOntology terms




  
GO:0005515 protein binding
GO:0044419 interspecies interaction between organisms
GO:0005764 lysosome
GO:0005634 nucleus
GO:0016579 protein deubiquitination
GO:0005737 cytoplasm
InterPro families







  
IPR021772 Protein of unknown function DUF3337
IPR019781 WD40 repeat, subgroup
IPR011046 WD40 repeat-like-containing domain
IPR001680 WD40 repeat
IPR020472 G-protein beta WD-40 repeat
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR019775 WD40 repeat, conserved site
IPR015943 WD40/YVTN repeat-like-containing domain
Orthology groupMCL14000

Nucleotide sequence:

ATGCGTAAAAAGACCCAGGTATCATTTGTGATTCGAGACGAAGAAGAGCGACGTCATAAA
AACGGTGTTAGCTCCTTACAATTGGATCCTATACAAGGCAGACTTTATTCAGCAGGCCGA
GATGGAATTATTCGTGTCTGGCATACAGGAGGCGGTACTCAGGATAGATACATACAGAGT
ATGGAACACCACACCGATTGGGTGAATGACATAGTATTATGTTGTGGCGGAAAAAATCTT
ATAAGCGCGTCATCTGACACTACTGTTAAAGTATGGAACGCACCAAAAGGTTTTTGTATG
TCCACATTAAGAACTCATAAAGATTATGTTCGGACTCTTGCGTACGCTAAAGATAAAGAG
CAGGTCGCCAGTGCAGGGCTTGATCGTGCCATATTCTTGTGGGATGTAAATACTTTAACC
GCTTTAACAGCTAGCAACAATACTGTTACTACGTCAAGTTTGGTAGGAAATAAAGAATCT
ATATACAGCTTGGCTATGAATCCTCCTGGGACGATTTTAGTCAGTGGCTCAACTGAAAAA
GTTCTTCGAGTTTGGGATCCCAGAAATTGCTCGCGTCTTATGAAGCTTAAGGGCCATGCT
GATAACGTAAAAGCGTTAGTCGTGAGCAGAGATGGCTCACAATGTGTATCTGGAAGCTCT
GATGGTACAATAAAATTATGGTCTCTGTCACAACAGAGATGCGTTTCTACTATACGTGTT
CATTCCGAGGCTGTGTGGGCCCTACTGGCGACTGAAAACTTCACACATATAATATCAGGT
GGTAGAGATCGTCTAGTCATCATAACAGAACTCAGGAACCCAGAAAACTACATGATAGTA
TGTGAGGAAACTGCCCCAATATTAAAATTGTGTTTCACTGCCGACCAACAAGGTATATGG
GTGGCAACATCAGATTCAGACATAAGATGTTGGAAATTACCACCACTGAACTCATTAAAC
TCAGATATGTATACTCAGAACAATTATAATACTAACAATGTGTACCAAACACAACCATTA
CACAACATAGTCGGCGGCAGAGCCATAAAACACTACACAGTTTTAAACGACAAACGACAC
ATTTTAACTAAAGACACCACCAACAATGTTGTATTGTATGATGTGCTGAAAGCATGCAAA
GTCGAAGATTTGGGTGAGGTTGATTATGAAGAGGAATTGAAGAAACGTTTCAAAATGGTT
TACGTACCAAATTGGTTTAACGTAGATTTAAAGACTGGAATGCTAACAATACATCTGGGG
CAAGATGAGACCGACTGTTTTAGCGCCTGGGTCAGCGCTAAAGAGGCCGGTTTGATAACG
GAGAATGATCAGAAAGTCAATTTTGGGGCTCTATTATTGCAAGCTTTGTTGGATCATTGG
AATCATCCTAATAGGGTTAATGAAGCAGGTCAAAAAGTCGTCGGTAACATATACTTCAGT
GTTCCGTTACACACTCCCCTAATATTTAGTGAAGTTGGCGGAAGAACGCTTTACAGATTG
CAGGTTGGTGACGCTGGCGGTGAAACGGAGGGCAACCTTCTCATGGAGACTGTTCCGTCG
TGGGTTGTAGATGTGGCCATAGAAATGGCTGCCCCAAAACTGAACAAACTACCATTCTAC
CTATTGCCACATTCAAGTTGTCAGAGTAAACAGGATCGGCAGAAAAAGGACCGTCTGGTG
GCAAATGATTTCATCCAAGTCCGTAAAGTCGGTGAGCATGTCGTGGAGAAGATTGTCGGC
GGCGGTGATGTAAACGGCAGTTCCAAGAACGAAGACTGTAACAACGACTCCCCGGAAGAA
AGAGTGGAACTGTTGTGCTGTGATCAGGTCCTCGACCCGAACATGGATTTACGTACGGTC
CGTCACTTCATATGGAAATCGAATGTGGAATTTACATTGCACTACAGAATATTGAAACAA
TGA

Protein sequence:

MRKKTQVSFVIRDEEERRHKNGVSSLQLDPIQGRLYSAGRDGIIRVWHTGGGTQDRYIQS
MEHHTDWVNDIVLCCGGKNLISASSDTTVKVWNAPKGFCMSTLRTHKDYVRTLAYAKDKE
QVASAGLDRAIFLWDVNTLTALTASNNTVTTSSLVGNKESIYSLAMNPPGTILVSGSTEK
VLRVWDPRNCSRLMKLKGHADNVKALVVSRDGSQCVSGSSDGTIKLWSLSQQRCVSTIRV
HSEAVWALLATENFTHIISGGRDRLVIITELRNPENYMIVCEETAPILKLCFTADQQGIW
VATSDSDIRCWKLPPLNSLNSDMYTQNNYNTNNVYQTQPLHNIVGGRAIKHYTVLNDKRH
ILTKDTTNNVVLYDVLKACKVEDLGEVDYEEELKKRFKMVYVPNWFNVDLKTGMLTIHLG
QDETDCFSAWVSAKEAGLITENDQKVNFGALLLQALLDHWNHPNRVNEAGQKVVGNIYFS
VPLHTPLIFSEVGGRTLYRLQVGDAGGETEGNLLMETVPSWVVDVAIEMAAPKLNKLPFY
LLPHSSCQSKQDRQKKDRLVANDFIQVRKVGEHVVEKIVGGGDVNGSSKNEDCNNDSPEE
RVELLCCDQVLDPNMDLRTVRHFIWKSNVEFTLHYRILKQ