TY - GEN
T1 - Novel Virtual Sample Generation Using Score Based Model for Addressing Small Data in Soft Sensing
AU - Wang, Hai Lin
AU - Zhu, Qun Xiong
AU - He, Yan Lin
AU - Xu, Yuan
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - With the development of complex industries, soft sensors have extensive application prospects. For the optimization of intricate industrial processes, precise models are essential. However, due to the insufficient and poor-quality training data in industrial processes, the established models frequently exhibit low accuracy. We propose an effective method for virtual sample generation based on the Score Based Generative Model (SGM) to address this challenge. In this approach, the Local Outlier Factor (LOF) algorithm is initially employed to detect outliers in the data. Subsequently, the Score Based Generative Model generates virtual input samples around the identified outliers. Following this, the Mean Teacher approach for semi-supervised learning is utilized to forecast the outputs of the virtual samples. The student model's prediction accuracy is improved by incorporating the virtual samples into its updates. Finally, the synthetic dataset is formed by combining the input and output components of the virtual samples, augmenting the original dataset. In order to prove the efficiency and superiority of this approach, three-dimensional numerical simulations and industrial data purified terephthalic acid (PTA) were used for experiments. The results show that SGM-VSG can improve the prediction accuracy of soft sensor better than other methods of generating virtual samples.
AB - With the development of complex industries, soft sensors have extensive application prospects. For the optimization of intricate industrial processes, precise models are essential. However, due to the insufficient and poor-quality training data in industrial processes, the established models frequently exhibit low accuracy. We propose an effective method for virtual sample generation based on the Score Based Generative Model (SGM) to address this challenge. In this approach, the Local Outlier Factor (LOF) algorithm is initially employed to detect outliers in the data. Subsequently, the Score Based Generative Model generates virtual input samples around the identified outliers. Following this, the Mean Teacher approach for semi-supervised learning is utilized to forecast the outputs of the virtual samples. The student model's prediction accuracy is improved by incorporating the virtual samples into its updates. Finally, the synthetic dataset is formed by combining the input and output components of the virtual samples, augmenting the original dataset. In order to prove the efficiency and superiority of this approach, three-dimensional numerical simulations and industrial data purified terephthalic acid (PTA) were used for experiments. The results show that SGM-VSG can improve the prediction accuracy of soft sensor better than other methods of generating virtual samples.
UR - http://www.scopus.com/inward/record.url?scp=85208222864&partnerID=8YFLogxK
U2 - 10.1109/CoDIT62066.2024.10708328
DO - 10.1109/CoDIT62066.2024.10708328
M3 - Conference contribution
AN - SCOPUS:85208222864
T3 - 10th 2024 International Conference on Control, Decision and Information Technologies, CoDIT 2024
SP - 199
EP - 204
BT - 10th 2024 International Conference on Control, Decision and Information Technologies, CoDIT 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 10th International Conference on Control, Decision and Information Technologies, CoDIT 2024
Y2 - 1 July 2024 through 4 July 2024
ER -