New model in OGS2.0 | DPOGS212160  |
---|---|
Genomic Position | scaffold378:3578-5173 (- strand) |
CDS Length | 1596 |
Paired RNAseq reads   | 461 |
Single RNAseq reads   | 1381 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA006608 (0.0) |
Best Drosophila hit   | CG14815, isoform B (9e-83) |
Best Human hit | peroxisomal biogenesis factor 5 isoform b (4e-69) |
Best NR hit (blastp)   | PREDICTED: similar to predicted protein [Tribolium castaneum] (8e-97) |
Best NR hit (blastx)   | PREDICTED: similar to predicted protein [Tribolium castaneum] (6e-92) |
GeneOntology terms    | GO:0000268 peroxisome targeting sequence binding; GO:0042221 response to chemical stimulus |
InterPro families    | IPR001440 Tetratricopeptide TPR-1; IPR019734 Tetratricopeptide repeat; IPR011990 Tetratricopeptide-like helical; IPR013026 Tetratricopeptide repeat-containing |
Orthology group | MCL15444 |
Nucleotide sequence:
ATGTCATTAAACAAACTTGTCGGAGGTGATTGTGGCGGTAACAATTCACTTGTTAAACTA
ACAAATATTGTAGGTAGAGATGGTTCCATAACACAAAACTTATCACAATCTGACAGATTT
GTCAATGAATTTCTGGCACAAAATTCTCAAGTTCCTCAAACTTTTAATATGAACGCCCTT
CTTAATAATATGCCAGAAGTGGAGAAAGTGTCAAACATTACTGCTCAACCATCTACAAGT
CAAATTTCTAATGTTCGTCCACATATGCCATCTCCTTGGATGCATGCTCCTTCAGCATCT
TTCATGCCCTCGGCAATGAGACCTTTTCAAACACCATTCCAAATAATGAGACAACCACAG
ACATCAAATGTTCAGATACAATATGTTAATGAATCCGAGTTGCAAAAATCTGATAGTGAT
GTGAAAACTAAAGCTCAAGAATATGTCAACAGTGTTAAAGAAGATGACGAACTTGCTTAT
AATCAATTCATGTCATTTATGAAAAGAATAAGTTCAGGTGAATTAAATCTCGGAGAAAGT
CTGGAGGGGGAACAAAAAAGTATGAGCAAAGATAAAATAGTCGAAGAGATGGCTGAAAAA
TACAAAGATGAATGGGCTAAGTTGAGTGATGTCAATGAATACTGGGATTCTGAAGCGGCA
AATGGAATAGCAAAAGAATATACATTCGCGGAAGGGAATATGATGTTGGAAAATAAAAGT
GCTCTAGAACTTGGTAAGGAGAAGTTGAAGATGGGTGATATTCCAGGTGCCGTTCTTTGT
TTTGAGGCGGCAGCTCAGCAGCAACCCGATTCAGCTGAAGCTTGGTTCTTACTTGGCACA
ACACAAGCTGAAAATGAACAAGATCCTCTAGCAATAACAGCACTAAAAAAATCCCTAGCA
ATTGATCCAAGGCAACTGGAAGCATATATAACCTTAGCAGCTGCATACACCAATGAGAAC
ATGGCTAAACATGCATATTTGACATTGCTGGATTGGTTGAAGGCCAGTAGTAAATATAGT
GATTTGGTTCCCCAAGACATTGATCCTAACAAAATGAGTATTAAAGAATTGGAGGCCTAT
TCAACATCACTATATCTGAAAGCGGCACAATTAAACCCTGTTCAAGTGGATCCTGATGTG
CAAAATGCATTGGGTGTAATTTGTAACATTAATCAGCAATATGATAAAGCGGTGGATTGT
TTTAAAGCAGCTCTGGCTGTGGCTTCGGATAATGCTAAACTGTGGAACAGGCTAGGAGCC
ACTCTTGCCAACAGTGACAGGTCTGAGGAAGCCCTGGATGCTTATCATGAGGCTCTCAAC
CTAGAACCGGGTTTCATAAGAGCTAGATATAATGTTGGTATCACATGCATGAATTTAGGA
GCTCATAAACAAGCAGCAGAGCATTTCTTAGTTGTACTGAATCAGCAATATAAAGCTCAA
AGTTCGAACCCCAATGCTTCATCAGATATAAGCTCTTCAACCATTTGGACAACATTAAGA
ATGGTTTGTTCCTTTATGGGCGAGCATGATGCTGCAAAATTAGTTGATGATAGAAATCTT
AGTGAGCTGAACAAATTTTTTGAAGTTGAGCCGTAA
Protein sequence:
MSLNKLVGGDCGGNNSLVKLTNIVGRDGSITQNLSQSDRFVNEFLAQNSQVPQTFNMNAL
LNNMPEVEKVSNITAQPSTSQISNVRPHMPSPWMHAPSASFMPSAMRPFQTPFQIMRQPQ
TSNVQIQYVNESELQKSDSDVKTKAQEYVNSVKEDDELAYNQFMSFMKRISSGELNLGES
LEGEQKSMSKDKIVEEMAEKYKDEWAKLSDVNEYWDSEAANGIAKEYTFAEGNMMLENKS
ALELGKEKLKMGDIPGAVLCFEAAAQQQPDSAEAWFLLGTTQAENEQDPLAITALKKSLA
IDPRQLEAYITLAAAYTNENMAKHAYLTLLDWLKASSKYSDLVPQDIDPNKMSIKELEAY
STSLYLKAAQLNPVQVDPDVQNALGVICNINQQYDKAVDCFKAALAVASDNAKLWNRLGA
TLANSDRSEEALDAYHEALNLEPGFIRARYNVGITCMNLGAHKQAAEHFLVVLNQQYKAQ
SSNPNASSDISSSTIWTTLRMVCSFMGEHDAAKLVDDRNLSELNKFFEVEP
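
The figures in this record can be cross-checked with a short script. The sketch below is not part of the database entry. It assumes that the genomic coordinates are 1-based and inclusive, that the model is a single uninterrupted CDS, and that the standard genetic code applies. The helper names `span_length` and `translate` are illustrative only.

```python
# Illustrative consistency check for this record; helper names are hypothetical.
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the nucleotide sequence listed in the record.
first_line = "ATGTCATTAAACAAACTTGTCGGAGGTGATTGTGGCGGTAACAATTCACTTGTTAAACTA"
last_line = "AGTGAGCTGAACAAATTTTTTGAAGTTGAGCCGTAA"

print(span_length(3578, 5173))  # 1596, equal to the listed CDS length
print(translate(first_line))    # MSLNKLVGGDCGGNNSLVKL
print(translate(last_line))     # SELNKFFEVEP*
```

A CDS of 1596 nt corresponds to 532 codons, that is, 531 residues plus the terminal stop codon, which agrees with the length of the protein sequence listed above.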