跳至主導覽 跳至搜尋 跳過主要內容

Novel virtual sample generation method based on data augmentation and weighted interpolation for soft sensing with small data

  • Xiao Lu Song
  • , Yan Lin He
  • , Xing Yuan Li
  • , Qun Xiong Zhu
  • , Yuan Xu
  • Beijing University of Chemical Technology
  • Ministry of Education of China

研究成果: Article同行評審

37 引文 斯高帕斯(Scopus)

摘要

Data-driven soft sensing modeling plays an increasingly important role in the prediction of key variables in the process industry. Since data is an essential part of modeling, how to obtain sufficient samples to build more accurate soft sensors becomes a formidable challenge. In this paper, a virtual sample generation method based on data augmentation and weighted interpolation (DAWI-VSG) is proposed to expand the soft sensing dataset with high-quality samples. First, the original dataset is decomposed by singular value decomposition (SVD), and the features are extracted and then synthesized into a matrix to obtain new samples that can approximate the original sample set. Second, the two sample sets are merged, and outliers are detected with the improved Fast Angle-based Outlier Detection (FastABOD), which makes the data more uniformly distributed by weighted interpolation between the outliers. In addition, XGboost is utilized for predicting the outputs of the virtual samples. To verify that the effect of the proposed DAWI-VSG, simulations of the numerical function and the actual chemical process Pure terephthalic acid (PTA) were performed, and correlation analysis was introduced as a measure of whether the generated samples are consistent with the real ones. The results showed that the proposed DAWI-VSG can boost the predictive power of soft sensing by generating higher quality and more reasonable samples compared to other advanced methods.

原文English
文章編號120085
期刊Expert Systems with Applications
225
DOIs
出版狀態Published - 1 9月 2023
對外發佈

指紋

深入研究「Novel virtual sample generation method based on data augmentation and weighted interpolation for soft sensing with small data」主題。共同形成了獨特的指紋。

引用此