New model in OGS2.0 | DPOGS205694  |
---|---|
Genomic Position | scaffold156:41470-45692 (- strand) |
CDS Length | 2448 |
Paired RNAseq reads   | 1095 |
Single RNAseq reads   | 2737 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA005916 (5e-06) |
Best Drosophila hit   | CG7609, isoform A (4e-108) |
Best Human hit | WD repeat-containing protein 24 (1e-96) |
Best NR hit (blastp)   | PREDICTED: similar to WD repeat domain 24 [Apis mellifera] (1e-146) |
Best NR hit (blastx)   | PREDICTED: similar to WD repeat domain 24 [Apis mellifera] (5e-138) |
GeneOntology terms    | GO:0005053 peroxisome matrix targeting signal-2 binding; GO:0007031 peroxisome organization |
InterPro families    | IPR011046 WD40 repeat-like-containing domain; IPR019781 WD40 repeat, subgroup; IPR019782 WD40 repeat 2; IPR017986 WD40-repeat-containing domain; IPR015943 WD40/YVTN repeat-like-containing domain; IPR001680 WD40 repeat; IPR019775 WD40 repeat, conserved site |
Orthology group | MCL13541 |
Nucleotide sequence:
ATGATACCTTCAAACACTATTTGTGTATCTCAGGAAGGTCCAGCGAATGCCTTAGCACTT
AATAAAGATTGTACACAAGTCGTAATCGCTGGCAGGAATGTATTCAAAGTATTTTCGATT
GGTGAAAATGAATTTTCCGAAGTGTGCAACCTAAGAGTAGGCAAGAATTTGAACTTAAAT
TTCTCATCCATTGATGTAGCATGGAGTACTATTGAAGAGAACACATTGGCTACAGCTGCA
ACCAATGGAGCTGTGGTGGTTTGGAATCTGGGCAGGTCGGGACGGTCCAAGCAAGAGCAT
GTGTTCTCAGACCACAAGAGAACCGTCAATAAGGTCAGCTTCCACCTGACAGAGCCATCT
CTTCTCATATCAGGCTCCCAGGATGGCATGATGAAGTACTTCGACCTCCGCATGAAGGAG
GTAGCTAGAACTTTCATAAGCAACACCGAGTCCATCCGCGATGTCCAGTTCAGTCCCCAC
GCGGCGCACGTGTTCGCTGCGGTGTCCGAGAACGGCACGGTCCAGCTGTGGGACTCTCGC
AGACACGAGCGGGCCATGCATCAGTTCACGGCGCACTCGGGACCCGTGTTCGCCTGCGAC
TGGCATCCCGACGTGCCCTGGCTGGCCACCGCCTCCAGGGACAAGACCATCAAGGTGTGG
GACATCCACGCGCGTCCCAACTTGGAGCACACCATATACACCATCGCATCAGTAGGTCAC
GTCAAGTGGAGACCGCAGAGGAAGTTCCAGGTGGCGTCGTGTGCTCTAGTGCTGGATTGC
GCGGTGCACGTGTGGGATGTGAGGAGGCCGCACGTGCCCCTAGCCACGTTCGCGGAACAC
CGTGACGTCACCACCGCCATCGCCTGGCAGGCCACGCCGCACGCCTTTCTCTCTACCAGC
CGAGACTGCAGCCTGTACCGTCACAAGTTTAGCGAGGCGGCTCACCCGGTGCTGTGGGCC
AACCCACAGGCAGTGTGCGTGTCGTCTCGCGGTCAGCTCGCCAGCGCTCTTCCGGATGCT
CCCCTTCCTCGGCCGCGGACCTGCGCTCCCGAGAGACTCGTGCCCGGCCTCGGTCGTAAG
CAGCCCACGGCAGGATCGCTGAGTGCATCTGCGCAGGCGGCCTTAGAGCGCGCATTTCCG
GGTAGCGCGTCTTCCTCCCTGGCCCGACACACTCCCGTGGCCTCTCAGGACCAGCCGGTG
TTGGCCTGCGCACACCTTTACCGTATTCGGGGCCAGCCGCCTCATGTCCTGGCCGAGCAC
AACGCGAAAGTCGCGCGTGATCACCATCGGTACATGGTGAGTCACGTGTGGGAGGTGGTC
CGCAGCGTGTACAGCGCCCGCGCCGTGCGCAGCGCTCCCGCCCCGCCGCCGCCCCGCGCT
CCGCCACCGGTCGAGGATACGCCGCCCCTGCCGCTGTACAGCGCCGTCGAAGAGGAACCT
CTCGAGGAAGTCGAGGAGTGGGAGAATCGTCTGCACCAGCACAGCGGCGTGCTTGGACTG
CCGACACACGCTGTTTACATCCCGCCTGCCAAATACGCCAGGGAAGGAGAGGACGGCGGC
TGGGTGTCGGCGGCGTCGGCTCACTACGTGGACGTGGAGGCGGCGGACTGGACGCTGCCG
GACGAGGCCTTCCCGCTCCGAGCGCCGCCCGCACCCCCCGCACCGCCGCCCGCACCCTCG
CAGCACACGCGACCCAACGAACAAGAACACGACGCAGAGTCAGTAGTCGCAGGAGGCACG
GGCGTGGTCGGACATGCGGGGCCCTCGTCCCCCGGGTCCTCGGGCTCCGGCACGGGCTCC
AGTCGACACGACGGGCTACAGCGTTCCAGCAGCCAGAACTGTGGCAGTTCCGAGGGGGAG
GGTGGCGAGGGCGGAGAGGGTGCGGAGGGCGGAGCGCTGTGTGTGAGGGAGGGTGCGGGG
CGGGGCACCGGGCGAGCGGCACTGGACCCCGCGCCGCTGCTGGCCCTGGCACTGCGTCTG
CACGCGGACCTCGGCGACGTGCAGACGGCTGCCGTAGTCTGCCTGGCGCTGCAGGACCAC
CGGAGCGACCTGTTCCCGTACATCGACGAGAGCCTGCAGGAGCAGTGGCTGCTGGGGTAC
ATCGAGCTGCTGCAGCGCCACAAGCTGTACAACGCGGCCACGGAGGTGATCCGCTGCTCA
TGGGTGAGCGGCGTGTGGTCTCTGTCGCAGCAGTCCACGAGCGTGGCGGCGTGCTGCGCG
CGCTGCGGCCGTCGCAGCAGGCCGTGCGTGCGAGTGCGCGCCGCGGGCTCCGCCGGACCT
GTGCGCCGTGTGCCACCAGTGCGTGCGCGGCCTGTACGCGTGGTGCCAGGGCTGCGCGCA
CGGCGGCCACCTGCGACACGTTCAGTCGTGGCTCAAGGAACACCAGCTGTGCCCGGCCGG
CTGCGGACACGCCTGCCAGCTCGCATGACCTCGCGCCCCGACCAGTGA
Protein sequence:
MIPSNTICVSQEGPANALALNKDCTQVVIAGRNVFKVFSIGENEFSEVCNLRVGKNLNLN
FSSIDVAWSTIEENTLATAATNGAVVVWNLGRSGRSKQEHVFSDHKRTVNKVSFHLTEPS
LLISGSQDGMMKYFDLRMKEVARTFISNTESIRDVQFSPHAAHVFAAVSENGTVQLWDSR
RHERAMHQFTAHSGPVFACDWHPDVPWLATASRDKTIKVWDIHARPNLEHTIYTIASVGH
VKWRPQRKFQVASCALVLDCAVHVWDVRRPHVPLATFAEHRDVTTAIAWQATPHAFLSTS
RDCSLYRHKFSEAAHPVLWANPQAVCVSSRGQLASALPDAPLPRPRTCAPERLVPGLGRK
QPTAGSLSASAQAALERAFPGSASSSLARHTPVASQDQPVLACAHLYRIRGQPPHVLAEH
NAKVARDHHRYMVSHVWEVVRSVYSARAVRSAPAPPPPRAPPPVEDTPPLPLYSAVEEEP
LEEVEEWENRLHQHSGVLGLPTHAVYIPPAKYAREGEDGGWVSAASAHYVDVEAADWTLP
DEAFPLRAPPAPPAPPPAPSQHTRPNEQEHDAESVVAGGTGVVGHAGPSSPGSSGSGTGS
SRHDGLQRSSSQNCGSSEGEGGEGGEGAEGGALCVREGAGRGTGRAALDPAPLLALALRL
HADLGDVQTAAVVCLALQDHRSDLFPYIDESLQEQWLLGYIELLQRHKLYNAATEVIRCS
WVSGVWSLSQQSTSVAACCARCGRRSRPCVRVRAAGSAGPVRRVPPVRARPVRVVPGLRA
RRPPATRSVVAQGTPAVPGRLRTRLPARMTSRPDQ