𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Predicting protein subcellular location by fusing multiple classifiers

✍ Scribed by Kuo-Chen Chou; Hong-Bin Shen


Publisher
John Wiley and Sons
Year
2006
Tongue
English
Weight
290 KB
Volume
99
Category
Article
ISSN
0730-2312

No coin nor oath required. For personal study only.

✦ Synopsis


Abstract

One of the fundamental goals in cell biology and proteomics is to identify the functions of proteins in the context of compartments that organize them in the cellular environment. Knowledge of subcellular locations of proteins can provide key hints for revealing their functions and understanding how they interact with each other in cellular networking. Unfortunately, it is both time‐consuming and expensive to determine the localization of an uncharacterized protein in a living cell purely based on experiments. With the avalanche of newly found protein sequences emerging in the post genomic era, we are facing a critical challenge, that is, how to develop an automated method to fast and reliably identify their subcellular locations so as to be able to timely use them for basic research and drug discovery. In view of this, an ensemble classifier was developed by the approach of fusing many basic individual classifiers through a voting system. Each of these basic classifiers was trained in a different dimension of the amphiphilic pseudo amino acid composition (Chou [2005] Bioinformatics 21: 10–19). As a demonstration, predictions were performed with the fusion classifier for proteins among the following 14 localizations: (1) cell wall, (2) centriole, (3) chloroplast, (4) cytoplasm, (5) cytoskeleton, (6) endoplasmic reticulum, (7) extracellular, (8) Golgi apparatus, (9) lysosome, (10) mitochondria, (11) nucleus, (12) peroxisome, (13) plasma membrane, and (14) vacuole. The overall success rates thus obtained via the resubstitution test, jackknife test, and independent dataset test were all significantly higher than those by the existing classifiers. It is anticipated that the novel ensemble classifier may also become a very useful vehicle in classifying other attributes of proteins according to their sequences, such as membrane protein type, enzyme family/sub‐family, G‐protein coupled receptor (GPCR) type, and structural class, among many others. The fusion ensemble classifier will be available at www.pami.sjtu.edu.cn/people/hbshen. J. Cell. Biochem. 99: 517–527, 2006. © 2006 Wiley‐Liss, Inc.


📜 SIMILAR VOLUMES


Support vector machines for prediction o
✍ Yu-Dong Cai; Xiao-Jun Liu; Xue-biao Xu; Kuo-Chen Chou 📂 Article 📅 2002 🏛 John Wiley and Sons 🌐 English ⚖ 124 KB

## Abstract Support Vector Machine (SVM), which is one class of learning machines, was applied to predict the subcellular location of proteins by incorporating the quasi‐sequence‐order effect (Chou [2000] Biochem. Biophys. Res. Commun. 278:477–483). In this study, the proteins are classified into t

Multiple classifier integration for the
✍ Lei Chen; Lin Lu; Kairui Feng; Wenjin Li; Jie Song; Lulu Zheng; Youlang Yuan; Zh 📂 Article 📅 2009 🏛 John Wiley and Sons 🌐 English ⚖ 112 KB

## Abstract Supervised classifiers, such as artificial neural network, partition trees, and support vector machines, are often used for the prediction and analysis of biological data. However, choosing an appropriate classifier is not straightforward because each classifier has its own strengths an

Prediction and classification of protein
✍ Kuo-Chen Chou; Yu-Dong Cai 📂 Article 📅 2003 🏛 John Wiley and Sons 🌐 English ⚖ 206 KB

Given a protein sequence, how to identify its subcellular location? With the rapid increase in newly found protein sequences entering into databanks, the problem has become more and more important because the function of a protein is closely correlated with its localization. To practically deal with

Virus-PLoc: A fusion classifier for pred
✍ Hong-Bin Shen; Kuo-Chen Chou 📂 Article 📅 2007 🏛 Wiley (John Wiley & Sons) 🌐 English ⚖ 150 KB 👁 2 views

## Abstract Viruses can reproduce their progenies only within a host cell, and their actions depend both on its destructive tendencies toward a specific host cell and on environmental conditions. Therefore, knowledge of the subcellular localization of viral proteins in a host cell or virus‐infected