Acta Phys. -Chim. Sin. ›› 2013, Vol. 29 ›› Issue (09): 1945-1953.doi: 10.3866/PKU.WHXB201306182


Prediction of HLA-A*0201 Restricted Cytotoxic T Lymphocyte Epitopes Based on High-Dimensional Descriptor Nonlinear Screening

HAN Na, YUAN Zhe-Ming, CHEN Yuan, DAI Zhi-Jun, WANG Zhi-Ming   

  1. Hunan Provincial Key Laboratory of Crop GermplasmInnovation and Utilization, Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha 410128, P. R. China
  • Received:2013-04-24 Revised:2013-06-14 Published:2013-08-28
  • Contact: YUAN Zhe-Ming
  • Supported by:

    The project was supported by the Science Foundation for Distinguished Young Scholars of Hunan Province, China (10JJ1005) and Specialized Research Fund for the Doctoral Program of Higher Education, China (20124320110002).


Determining highly active epitopes of the cytotoxic T lymphocyte (CTL) is essential for the computational design of peptide vaccines for tumors. In this study, we characterized each residue in the restricted CTL epitopes using 531 physicochemical properties. We selected 18 descriptors with clear meanings from 531×9 descriptors for each peptide of length nine using the binary matrix shuffling filter and worst descriptor elimination multi-round methods. Most of the 18 selected descriptors were the hydrophobic and steric properties of the residues. Among the 18 descriptors, 10 descriptors were related to the second, fourth, and ninth residues, which is consistent with the known facts. We then constructed a support-vectorregression-based quantitative sequence activity model (QSAM) using 18 selected descriptors. The values of the accuracies of fitting (R2), leave-one-out cross validation (Qcv2), and extra-sample prediction (Qext2, RMSEext) were 0.957, 0.708, 0.818, and 0.366, respectively. These results, which were tested on HLAA* 0201 data, showed that our QSAM was superior to those reported in the literature. Finally, we predicted the activities of peptides of all possible combinations of the nine residues. Several peptides were found with higher affinity activities than those of previously reported epitopes. Our study improves the understanding of the relationship between the compositional residues and the affinity activity of the peptide, which provides a valuable guideline for the design of highly active peptide vaccines. Our predicted high affinity peptides are potential candidates for further experimental verification.

Key words: Antigenic peptide, Quantitative sequence activity model, High-dimensional descriptor, Support vector regression, Peptide vaccine