物理化学学报 >> 2011, Vol. 27 >> Issue (09): 2111-2117.doi: 10.3866/PKU.WHXB20110831

理论与计算化学 上一篇    下一篇

基于特征选择的决策树方法在磷酸铝AlPO4-5定向合成中的应用

霍卫峰, 高娜, 颜岩, 李激扬, 于吉红, 徐如人   

  1. 吉林大学无机合成与制备化学国家重点实验室, 长春 130012
  • 收稿日期:2011-04-26 修回日期:2011-06-09 发布日期:2011-08-26
  • 通讯作者: 李激扬 E-mail:lijiyang@jlu.edu.cn
  • 基金资助:

    国家自然科学基金(20871051)资助项目

Decision Trees Combined with Feature Selection for the Rational Synthesis of Aluminophosphate AlPO4-5

HUO Wei-Feng, GAO Na, YAN Yan, LI Ji-Yang, YU Ji-Hong, XU Ru-Ren   

  1. State Key Laboratory of Inorganic Synthesis and Preparative Chemistry, Jilin University, Changchun 130012, P. R. China
  • Received:2011-04-26 Revised:2011-06-09 Published:2011-08-26
  • Contact: LI Ji-Yang E-mail:lijiyang@jlu.edu.cn
  • Supported by:

    The project was supported by the National Natural Science Foundation of China (20871051).

摘要: 分子筛类开放骨架材料的合成与结构关系研究对实现这类材料的定向合成起着至关重要的作用. 本文在建立开放骨架磷酸铝合成反应数据库的基础上, 提出了利用基于特征选择的决策树(C5.0)方法, 考察了不同反应条件(即各反应特征参数)对磷酸铝分子筛AlPO4-5 生成的影响. 基于决策树模型, 利用8 个反应特征参数,可以有效预测磷酸铝分子筛AlPO4-5的生成, 准确率达到88.18%, 接收者操作特性(ROC)曲线下面积(AUC)达到90%. 研究结果表明, 在众多的反应特征参数中, 有机模板剂的几何尺寸参数, 特别是模板剂的次长距离, 是影响AlPO4-5分子筛合成的重要因素.

关键词: 磷酸铝, 定向合成, 数据挖掘, 决策树, 特征选择

Abstract: The relationship between the synthetic features and the types of final product is critical for the rational synthesis of zeolite-type open-framework materials. In this paper, an AlPO4-5 prediction system based on C5.0 combined with a feature selection is proposed on the basis of the establishment of a database of AlPO syntheses. 26 synthetic parameters associated with gel composition, an organic amine template and a solvent were used as input to predict the formation of AlPO4-5. The effects of different synthetic parameters on the formation of AlPO4-5 were also studied. The performance of the method was evaluated using classification accuracy and a receiver operating characteristic (ROC) curve. The results show that the highest area under the ROC curve (90%) and the classification accuracy (88.18%) was obtained for the decision tree model that contains eight input features and some useful rules with high confidence degrees were extracted from the model. Among the various synthetic parameters the geometric size of the organic template, particularly the second longest distance of the template plays an important role in the formation of AlPO4-5.

Key words: Aluminophosphate, Rational synthesis, Data mining, Decision tree, Feature selection