TY - JOUR
T1 - Marginal region-integrated regressive conditional variational autoencoder-generative adversarial network
T2 - A soft sensing enhancement method
AU - Liu, Guo yu
AU - Zhu, Qun Xiong
AU - Luo, Yi
AU - Ke, Wei
AU - He, Yan Lin
AU - Zhang, Yang
AU - Zhang, Ming Qing
AU - Xu, Yuan
N1 - Publisher Copyright:
© 2025
PY - 2026/1/15
Y1 - 2026/1/15
N2 - In industrial processes, due to limitations of actual industrial production, many industrial data are difficult to obtain directly, which limits sample size and leads to uneven data distribution, ultimately affecting the fitting performance of soft sensing models. To address this challenge, we propose a marginal Isolation Mega Trend Diffusion with Regressor Conditional Variational Autoencoder-Generative Adversarial Network (IRCVGAN). designed to improve model accuracy by expanding the sample size. Specifically, the proposed method first applies the isolation forest algorithm to detect sparse marginal regions in the dataset, followed by Mega Trend Diffusion (MTD) to broaden the range of input data by generating virtual samples, thus increasing dataset diversity. Next, an improved regressive conditional Variational Autoencoder-Generative Adversarial Network (RCVAEGAN) is developed to perform fine-grained selection on the virtual samples generated by MTD. Furthermore, the mapping between input variables and production quality indicators is embedded in RCVAEGAN, enhancing the representativeness of the samples and improving the model's fitting accuracy, the effectiveness of our proposed method is validated through function fitting tests and real-world industrial data from a purified terephthalic acid (PTA) solvent system.
AB - In industrial processes, due to limitations of actual industrial production, many industrial data are difficult to obtain directly, which limits sample size and leads to uneven data distribution, ultimately affecting the fitting performance of soft sensing models. To address this challenge, we propose a marginal Isolation Mega Trend Diffusion with Regressor Conditional Variational Autoencoder-Generative Adversarial Network (IRCVGAN). designed to improve model accuracy by expanding the sample size. Specifically, the proposed method first applies the isolation forest algorithm to detect sparse marginal regions in the dataset, followed by Mega Trend Diffusion (MTD) to broaden the range of input data by generating virtual samples, thus increasing dataset diversity. Next, an improved regressive conditional Variational Autoencoder-Generative Adversarial Network (RCVAEGAN) is developed to perform fine-grained selection on the virtual samples generated by MTD. Furthermore, the mapping between input variables and production quality indicators is embedded in RCVAEGAN, enhancing the representativeness of the samples and improving the model's fitting accuracy, the effectiveness of our proposed method is validated through function fitting tests and real-world industrial data from a purified terephthalic acid (PTA) solvent system.
KW - Generative adversarial network
KW - Mega trend diffusion
KW - Soft sensor
KW - Virtual sample generation
UR - https://www.scopus.com/pages/publications/105022633851
U2 - 10.1016/j.chemolab.2025.105577
DO - 10.1016/j.chemolab.2025.105577
M3 - Article
AN - SCOPUS:105022633851
SN - 0169-7439
VL - 268
JO - Chemometrics and Intelligent Laboratory Systems
JF - Chemometrics and Intelligent Laboratory Systems
M1 - 105577
ER -