Acta Phys.-Chim. Sin., 2006, Vol. 22, Issue (09): 1052-1055. doi: 10.3866/PKU.WHXB20060903

• ARTICLE •

A New Peptide Sequences Representation Technique and Support Vector Machine for Quantitative Structure-Retention Modeling of Peptides in HPLC

LIANG Gui-Zhao; LI Zhi-Liang; ZHOU Yuan; He Liu; ZHOU Peng

  1. College of Chemistry and Chemical Engineering, Chongqing University, Chongqing 400030, P. R. China; Key Laboratory of Biomechanics and Tissue Engineering, MOE, Chongqing University, Chongqing 400030, P. R. China
  • Received: 2006-01-10; Revised: 2006-03-21; Published: 2006-09-04
  • Contact: LI Zhi-Liang


A new representation technique for peptide sequences, SZOTT (scores vector of zero dimension, one dimension, two dimension, and three dimension), was derived from 1369 parameters of the 20 coded amino acids using principal component analysis (PCA). It was then used to represent 71 peptide sequences of different lengths. Quantitative structure-retention models (QSRMs) were constructed with support vector machine (SVM) and partial least squares (PLS) regression. The results indicated that SZOTT represents the 71 peptide sequences well, offering rich structural information and easy manipulation. In addition, the fitting ability for internal (training) samples and the predictive ability for external (test) samples were better with SVM than with PLS. SZOTT and SVM can therefore be applied to develop QSRMs.
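The descriptor step summarized above (PCA scores for each of the 20 amino acids, then a residue-by-residue encoding of a peptide) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' procedure: the descriptor matrix is random placeholder data standing in for the 1369 parameters, the autoscaling step and the choice of three components are assumptions, and the names `pca_scores` and `encode_peptide` are hypothetical. The abstract does not state how peptides of different lengths are brought to a common descriptor length, so that step is left out.

```python
import numpy as np

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 coded amino acids


def pca_scores(descriptors, n_components):
    """Return the first n_components PCA score columns for each row."""
    # Autoscale each descriptor column (assumption: zero mean, unit variance).
    centered = descriptors - descriptors.mean(axis=0)
    std = centered.std(axis=0)
    std[std == 0] = 1.0  # guard against constant columns
    scaled = centered / std
    # SVD of the scaled matrix; the PCA scores are U * S.
    u, s, _ = np.linalg.svd(scaled, full_matrices=False)
    return u[:, :n_components] * s[:n_components]


def encode_peptide(sequence, score_table):
    """Stack per-residue score vectors into a (length, n_components) array."""
    return np.vstack([score_table[residue] for residue in sequence])


# Placeholder data only: random numbers standing in for the 1369 parameters.
rng = np.random.default_rng(0)
placeholder_descriptors = rng.normal(size=(20, 1369))

# Assumption: keep three components per amino acid for illustration.
scores = pca_scores(placeholder_descriptors, n_components=3)
score_table = dict(zip(AMINO_ACIDS, scores))

encoded = encode_peptide("GLY", score_table)  # hypothetical tripeptide G-L-Y
print(encoded.shape)
```

With real descriptor data, the encoded peptides would then serve as inputs to the SVM and PLS regression models against measured HPLC retention.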

Key words: Peptides, SVM, QSRM, SZOTT