DPGLEAN05565 in OGS1.0

New model in OGS2.0DPOGS212160 
Genomic Positionscaffold378:- 3578-5173
See gene structure
CDS Length1596
Paired RNAseq reads  461
Single RNAseq reads  1381
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006608 (0.0)
Best Drosophila hit  CG14815, isoform B (9e-83)
Best Human hitperoxisomal biogenesis factor 5 isoform b (4e-69)
Best NR hit (blastp)  PREDICTED: similar to predicted protein [Tribolium castaneum] (8e-97)
Best NR hit (blastx)  PREDICTED: similar to predicted protein [Tribolium castaneum] (6e-92)
GeneOntology terms
  
GO:0000268 peroxisome targeting sequence binding
GO:0042221 response to chemical stimulus
InterPro families


  
IPR001440 Tetratricopeptide TPR-1
IPR019734 Tetratricopeptide repeat
IPR011990 Tetratricopeptide-like helical
IPR013026 Tetratricopeptide repeat-containing
Orthology groupMCL15444

Nucleotide sequence:

ATGTCATTAAACAAACTTGTCGGAGGTGATTGTGGCGGTAACAATTCACTTGTTAAACTA
ACAAATATTGTAGGTAGAGATGGTTCCATAACACAAAACTTATCACAATCTGACAGATTT
GTCAATGAATTTCTGGCACAAAATTCTCAAGTTCCTCAAACTTTTAATATGAACGCCCTT
CTTAATAATATGCCAGAAGTGGAGAAAGTGTCAAACATTACTGCTCAACCATCTACAAGT
CAAATTTCTAATGTTCGTCCACATATGCCATCTCCTTGGATGCATGCTCCTTCAGCATCT
TTCATGCCCTCGGCAATGAGACCTTTTCAAACACCATTCCAAATAATGAGACAACCACAG
ACATCAAATGTTCAGATACAATATGTTAATGAATCCGAGTTGCAAAAATCTGATAGTGAT
GTGAAAACTAAAGCTCAAGAATATGTCAACAGTGTTAAAGAAGATGACGAACTTGCTTAT
AATCAATTCATGTCATTTATGAAAAGAATAAGTTCAGGTGAATTAAATCTCGGAGAAAGT
CTGGAGGGGGAACAAAAAAGTATGAGCAAAGATAAAATAGTCGAAGAGATGGCTGAAAAA
TACAAAGATGAATGGGCTAAGTTGAGTGATGTCAATGAATACTGGGATTCTGAAGCGGCA
AATGGAATAGCAAAAGAATATACATTCGCGGAAGGGAATATGATGTTGGAAAATAAAAGT
GCTCTAGAACTTGGTAAGGAGAAGTTGAAGATGGGTGATATTCCAGGTGCCGTTCTTTGT
TTTGAGGCGGCAGCTCAGCAGCAACCCGATTCAGCTGAAGCTTGGTTCTTACTTGGCACA
ACACAAGCTGAAAATGAACAAGATCCTCTAGCAATAACAGCACTAAAAAAATCCCTAGCA
ATTGATCCAAGGCAACTGGAAGCATATATAACCTTAGCAGCTGCATACACCAATGAGAAC
ATGGCTAAACATGCATATTTGACATTGCTGGATTGGTTGAAGGCCAGTAGTAAATATAGT
GATTTGGTTCCCCAAGACATTGATCCTAACAAAATGAGTATTAAAGAATTGGAGGCCTAT
TCAACATCACTATATCTGAAAGCGGCACAATTAAACCCTGTTCAAGTGGATCCTGATGTG
CAAAATGCATTGGGTGTAATTTGTAACATTAATCAGCAATATGATAAAGCGGTGGATTGT
TTTAAAGCAGCTCTGGCTGTGGCTTCGGATAATGCTAAACTGTGGAACAGGCTAGGAGCC
ACTCTTGCCAACAGTGACAGGTCTGAGGAAGCCCTGGATGCTTATCATGAGGCTCTCAAC
CTAGAACCGGGTTTCATAAGAGCTAGATATAATGTTGGTATCACATGCATGAATTTAGGA
GCTCATAAACAAGCAGCAGAGCATTTCTTAGTTGTACTGAATCAGCAATATAAAGCTCAA
AGTTCGAACCCCAATGCTTCATCAGATATAAGCTCTTCAACCATTTGGACAACATTAAGA
ATGGTTTGTTCCTTTATGGGCGAGCATGATGCTGCAAAATTAGTTGATGATAGAAATCTT
AGTGAGCTGAACAAATTTTTTGAAGTTGAGCCGTAA

Protein sequence:

MSLNKLVGGDCGGNNSLVKLTNIVGRDGSITQNLSQSDRFVNEFLAQNSQVPQTFNMNAL
LNNMPEVEKVSNITAQPSTSQISNVRPHMPSPWMHAPSASFMPSAMRPFQTPFQIMRQPQ
TSNVQIQYVNESELQKSDSDVKTKAQEYVNSVKEDDELAYNQFMSFMKRISSGELNLGES
LEGEQKSMSKDKIVEEMAEKYKDEWAKLSDVNEYWDSEAANGIAKEYTFAEGNMMLENKS
ALELGKEKLKMGDIPGAVLCFEAAAQQQPDSAEAWFLLGTTQAENEQDPLAITALKKSLA
IDPRQLEAYITLAAAYTNENMAKHAYLTLLDWLKASSKYSDLVPQDIDPNKMSIKELEAY
STSLYLKAAQLNPVQVDPDVQNALGVICNINQQYDKAVDCFKAALAVASDNAKLWNRLGA
TLANSDRSEEALDAYHEALNLEPGFIRARYNVGITCMNLGAHKQAAEHFLVVLNQQYKAQ
SSNPNASSDISSSTIWTTLRMVCSFMGEHDAAKLVDDRNLSELNKFFEVEP