Fast Fourier Transform-based Support Vector Machine for Subcellular Localization Prediction Using Different Substitution Models
Authors: WANG, Zhimeng1; JIANG, Lin1; LI, Menglong; SUN, Lina1; LIN, Rongying1
Source: Acta Biochimica et Biophysica Sinica, Volume 39, Number 9, September 2007 , pp. 715-721(7)
Publisher: Blackwell Publishing
Abstract:
There are approximately 109 proteins in a cell. A hotspot in bioinformatics is how to identify a protein's subcellular localization, if its sequence is known. In this paper, a method using fast Fourier transform-based support vector machine is developed to predict the subcellular localization of proteins from their physicochemical properties and structural parameters. The prediction accuracies reached 83% in prokaryotic organisms and 84% in eukaryotic organisms with the substitution model of the c-p-v matrix (c, composition; p, polarity; and v, molecular volume). The overall prediction accuracy was also evaluated using the “leave-one-out” jackknife procedure. The influence of the substitution model on prediction accuracy has also been discussed in the work. The source code of the new program is available on request from the authors.Keywords: protein subcellular localization; prediction; substitution model; fast Fourier transform; support vector machine
Document Type: Research article
DOI: 10.1111/j.1745-7270.2007.00326.x
Affiliations: 1: College of Chemistry, Sichuan University, Chengdu 610064, China

Click here for Page Help