Combining one-class classification models based on diverse biological data for prediction of protein-protein interactions

Reyes, Jose A.; Gilbert, David; Bairoch, A; CohenBoulakia, S; Froidevaux, C

Abstract

This research addresses the problem of prediction of protein-protein interactions (PPI) when integrating diverse biological data. Cold Standard data sets frequently employed for this task contain a high proportion of instances related to ribosomal proteins. We demonstrate that this situation biases the classification results and additionally that the prediction of non-ribosomal based PPI is a much more difficult task. In order to improve the performance of this subtask we have integrated more biological data into the classification process, including data from mRNA expression experiments and protein secondary structure information. Furthermore we have investigated several strategies for combining diverse one-class classification (OCC) models generated from different subsets of biological data. The weighted average combination approach exhibits the best results, significantly improving the performance attained by any single classification model evaluated.

Más información

Título según WOS: ID WOS:000257304400018 Not found in local WOS DB
Título de la Revista: BIO-INSPIRED SYSTEMS AND APPLICATIONS: FROM ROBOTICS TO AMBIENT INTELLIGENCE, PT II
Volumen: 5109
Editorial: SPRINGER INTERNATIONAL PUBLISHING AG
Fecha de publicación: 2008
Página de inicio: 177
Página final: 191
Notas: ISI