TY - JOUR
T1 - Dealing with small sample size problems in process industry using virtual sample generation
T2 - a Kriging-based approach
AU - Zhu, Qun Xiong
AU - Chen, Zhong Sheng
AU - Zhang, Xiao Han
AU - Rajabifard, Abbas
AU - Xu, Yuan
AU - Chen, Yi Qun
N1 - Publisher Copyright:
© 2019, Springer-Verlag GmbH Germany, part of Springer Nature.
PY - 2020/5/1
Y1 - 2020/5/1
N2 - The operational data of advanced process systems have met with explosive growth, but its fluctuations are so slight that the number of the extracted representative samples is quite limited, making it difficult to reflect the nature of the process and to establish prediction models. In this study, inspired by the process of fisherman repairing nets, a Kriging-based virtual sample generation (VSG) named Kriging-VSG is proposed to generate feasible virtual samples in data sparse regions. Then, the accuracy of prediction models is further enhanced by applying the generated virtual samples. In order to reasonably find data sparse regions, a distance-based criterion is imposed on each dimension to identify important samples with large information gaps. Similar to the process of fisherman repairing nets, a certain dimension is initially fixed at different quantiles. A dimension-wise interpolation process using Kriging is then performed on the center between important samples with large information gaps. To validate the performance of the proposed Kriging-VSG, two numerical simulations and a real-world application from a cascade reaction process for high-density polyethylene are carried out. The results indicate that the proposed Kriging-VSG outperforms other methods.
AB - The operational data of advanced process systems have met with explosive growth, but its fluctuations are so slight that the number of the extracted representative samples is quite limited, making it difficult to reflect the nature of the process and to establish prediction models. In this study, inspired by the process of fisherman repairing nets, a Kriging-based virtual sample generation (VSG) named Kriging-VSG is proposed to generate feasible virtual samples in data sparse regions. Then, the accuracy of prediction models is further enhanced by applying the generated virtual samples. In order to reasonably find data sparse regions, a distance-based criterion is imposed on each dimension to identify important samples with large information gaps. Similar to the process of fisherman repairing nets, a certain dimension is initially fixed at different quantiles. A dimension-wise interpolation process using Kriging is then performed on the center between important samples with large information gaps. To validate the performance of the proposed Kriging-VSG, two numerical simulations and a real-world application from a cascade reaction process for high-density polyethylene are carried out. The results indicate that the proposed Kriging-VSG outperforms other methods.
KW - High-density polyethylene
KW - Kriging interpolation
KW - Small sample size problems
KW - Soft sensing modeling
KW - Virtual sample generation
UR - http://www.scopus.com/inward/record.url?scp=85074033092&partnerID=8YFLogxK
U2 - 10.1007/s00500-019-04326-3
DO - 10.1007/s00500-019-04326-3
M3 - Article
AN - SCOPUS:85074033092
SN - 1432-7643
VL - 24
SP - 6889
EP - 6902
JO - Soft Computing
JF - Soft Computing
IS - 9
ER -