DPGLEAN06152 in OGS1.0

New model in OGS2.0DPOGS205694 
Genomic Positionscaffold156:- 41470-45692
See gene structure
CDS Length2448
Paired RNAseq reads  1095
Single RNAseq reads  2737
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005916 (5e-06)
Best Drosophila hit  CG7609, isoform A (4e-108)
Best Human hitWD repeat-containing protein 24 (1e-96)
Best NR hit (blastp)  PREDICTED: similar to WD repeat domain 24 [Apis mellifera] (1e-146)
Best NR hit (blastx)  PREDICTED: similar to WD repeat domain 24 [Apis mellifera] (5e-138)
GeneOntology terms
  
GO:0005053 peroxisome matrix targeting signal-2 binding
GO:0007031 peroxisome organization
InterPro families





  
IPR011046 WD40 repeat-like-containing domain
IPR019781 WD40 repeat, subgroup
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR015943 WD40/YVTN repeat-like-containing domain
IPR001680 WD40 repeat
IPR019775 WD40 repeat, conserved site
Orthology groupMCL13541

Nucleotide sequence:

ATGATACCTTCAAACACTATTTGTGTATCTCAGGAAGGTCCAGCGAATGCCTTAGCACTT
AATAAAGATTGTACACAAGTCGTAATCGCTGGCAGGAATGTATTCAAAGTATTTTCGATT
GGTGAAAATGAATTTTCCGAAGTGTGCAACCTAAGAGTAGGCAAGAATTTGAACTTAAAT
TTCTCATCCATTGATGTAGCATGGAGTACTATTGAAGAGAACACATTGGCTACAGCTGCA
ACCAATGGAGCTGTGGTGGTTTGGAATCTGGGCAGGTCGGGACGGTCCAAGCAAGAGCAT
GTGTTCTCAGACCACAAGAGAACCGTCAATAAGGTCAGCTTCCACCTGACAGAGCCATCT
CTTCTCATATCAGGCTCCCAGGATGGCATGATGAAGTACTTCGACCTCCGCATGAAGGAG
GTAGCTAGAACTTTCATAAGCAACACCGAGTCCATCCGCGATGTCCAGTTCAGTCCCCAC
GCGGCGCACGTGTTCGCTGCGGTGTCCGAGAACGGCACGGTCCAGCTGTGGGACTCTCGC
AGACACGAGCGGGCCATGCATCAGTTCACGGCGCACTCGGGACCCGTGTTCGCCTGCGAC
TGGCATCCCGACGTGCCCTGGCTGGCCACCGCCTCCAGGGACAAGACCATCAAGGTGTGG
GACATCCACGCGCGTCCCAACTTGGAGCACACCATATACACCATCGCATCAGTAGGTCAC
GTCAAGTGGAGACCGCAGAGGAAGTTCCAGGTGGCGTCGTGTGCTCTAGTGCTGGATTGC
GCGGTGCACGTGTGGGATGTGAGGAGGCCGCACGTGCCCCTAGCCACGTTCGCGGAACAC
CGTGACGTCACCACCGCCATCGCCTGGCAGGCCACGCCGCACGCCTTTCTCTCTACCAGC
CGAGACTGCAGCCTGTACCGTCACAAGTTTAGCGAGGCGGCTCACCCGGTGCTGTGGGCC
AACCCACAGGCAGTGTGCGTGTCGTCTCGCGGTCAGCTCGCCAGCGCTCTTCCGGATGCT
CCCCTTCCTCGGCCGCGGACCTGCGCTCCCGAGAGACTCGTGCCCGGCCTCGGTCGTAAG
CAGCCCACGGCAGGATCGCTGAGTGCATCTGCGCAGGCGGCCTTAGAGCGCGCATTTCCG
GGTAGCGCGTCTTCCTCCCTGGCCCGACACACTCCCGTGGCCTCTCAGGACCAGCCGGTG
TTGGCCTGCGCACACCTTTACCGTATTCGGGGCCAGCCGCCTCATGTCCTGGCCGAGCAC
AACGCGAAAGTCGCGCGTGATCACCATCGGTACATGGTGAGTCACGTGTGGGAGGTGGTC
CGCAGCGTGTACAGCGCCCGCGCCGTGCGCAGCGCTCCCGCCCCGCCGCCGCCCCGCGCT
CCGCCACCGGTCGAGGATACGCCGCCCCTGCCGCTGTACAGCGCCGTCGAAGAGGAACCT
CTCGAGGAAGTCGAGGAGTGGGAGAATCGTCTGCACCAGCACAGCGGCGTGCTTGGACTG
CCGACACACGCTGTTTACATCCCGCCTGCCAAATACGCCAGGGAAGGAGAGGACGGCGGC
TGGGTGTCGGCGGCGTCGGCTCACTACGTGGACGTGGAGGCGGCGGACTGGACGCTGCCG
GACGAGGCCTTCCCGCTCCGAGCGCCGCCCGCACCCCCCGCACCGCCGCCCGCACCCTCG
CAGCACACGCGACCCAACGAACAAGAACACGACGCAGAGTCAGTAGTCGCAGGAGGCACG
GGCGTGGTCGGACATGCGGGGCCCTCGTCCCCCGGGTCCTCGGGCTCCGGCACGGGCTCC
AGTCGACACGACGGGCTACAGCGTTCCAGCAGCCAGAACTGTGGCAGTTCCGAGGGGGAG
GGTGGCGAGGGCGGAGAGGGTGCGGAGGGCGGAGCGCTGTGTGTGAGGGAGGGTGCGGGG
CGGGGCACCGGGCGAGCGGCACTGGACCCCGCGCCGCTGCTGGCCCTGGCACTGCGTCTG
CACGCGGACCTCGGCGACGTGCAGACGGCTGCCGTAGTCTGCCTGGCGCTGCAGGACCAC
CGGAGCGACCTGTTCCCGTACATCGACGAGAGCCTGCAGGAGCAGTGGCTGCTGGGGTAC
ATCGAGCTGCTGCAGCGCCACAAGCTGTACAACGCGGCCACGGAGGTGATCCGCTGCTCA
TGGGTGAGCGGCGTGTGGTCTCTGTCGCAGCAGTCCACGAGCGTGGCGGCGTGCTGCGCG
CGCTGCGGCCGTCGCAGCAGGCCGTGCGTGCGAGTGCGCGCCGCGGGCTCCGCCGGACCT
GTGCGCCGTGTGCCACCAGTGCGTGCGCGGCCTGTACGCGTGGTGCCAGGGCTGCGCGCA
CGGCGGCCACCTGCGACACGTTCAGTCGTGGCTCAAGGAACACCAGCTGTGCCCGGCCGG
CTGCGGACACGCCTGCCAGCTCGCATGACCTCGCGCCCCGACCAGTGA

Protein sequence:

MIPSNTICVSQEGPANALALNKDCTQVVIAGRNVFKVFSIGENEFSEVCNLRVGKNLNLN
FSSIDVAWSTIEENTLATAATNGAVVVWNLGRSGRSKQEHVFSDHKRTVNKVSFHLTEPS
LLISGSQDGMMKYFDLRMKEVARTFISNTESIRDVQFSPHAAHVFAAVSENGTVQLWDSR
RHERAMHQFTAHSGPVFACDWHPDVPWLATASRDKTIKVWDIHARPNLEHTIYTIASVGH
VKWRPQRKFQVASCALVLDCAVHVWDVRRPHVPLATFAEHRDVTTAIAWQATPHAFLSTS
RDCSLYRHKFSEAAHPVLWANPQAVCVSSRGQLASALPDAPLPRPRTCAPERLVPGLGRK
QPTAGSLSASAQAALERAFPGSASSSLARHTPVASQDQPVLACAHLYRIRGQPPHVLAEH
NAKVARDHHRYMVSHVWEVVRSVYSARAVRSAPAPPPPRAPPPVEDTPPLPLYSAVEEEP
LEEVEEWENRLHQHSGVLGLPTHAVYIPPAKYAREGEDGGWVSAASAHYVDVEAADWTLP
DEAFPLRAPPAPPAPPPAPSQHTRPNEQEHDAESVVAGGTGVVGHAGPSSPGSSGSGTGS
SRHDGLQRSSSQNCGSSEGEGGEGGEGAEGGALCVREGAGRGTGRAALDPAPLLALALRL
HADLGDVQTAAVVCLALQDHRSDLFPYIDESLQEQWLLGYIELLQRHKLYNAATEVIRCS
WVSGVWSLSQQSTSVAACCARCGRRSRPCVRVRAAGSAGPVRRVPPVRARPVRVVPGLRA
RRPPATRSVVAQGTPAVPGRLRTRLPARMTSRPDQ