TY - GEN
T1 - Novel Imbalanced Fault Diagnosis Method based on CSMOTE integrated with LSDA and LightGBM for Industrial Process
AU - Zhu, Qun Xiong
AU - Zhang, Ning
AU - He, Yan Lin
AU - Xu, Yuan
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - With the coming of the big data era, the data collected in the process industry shows features of high volume, high-dimensional and non-linear. Meanwhile, these process data present imbalanced feature, leading to a lack of fault information. These problems mentioned above have brought difficulties to fault diagnosis. To solve the above difficulties, a new synthetic minority over-sampling technique (SMOTE) considering the correlation of sample integrated with locality sensitive discriminant analysis (LSDA) and LightGBM fault diagnosis methodology (CSMOTE-LSDA-LightGBM) is proposed in this article. In our proposed methodology, firstly, the SMOTE fully considering correlation (CSMOTE) which uses both Euclidean and Mahalanobis distance to calculate the nearest neighbor relationship is used to resample the imbalanced samples and expand the number of small classification fault samples; secondly, the LSDA is used to dimensionality reduction (DR) to extract the fault-related critical features; finally, the LightGBM classifier is used for fault classification. The Tennessee Eastman (TE) process case is selected for simulation to verify the effectiveness of the proposed CSMOTE-LSDA-LightGBM in fault diagnosis. The simulation results of TE process case show that the proposed method has improved the accuracy of fault diagnosis compared with imbalanced data and traditional DR methods indicating the CSMOTE-LSDA-LightGBM methodology is applicable to fault diagnosis for imbalanced samples.
AB - With the coming of the big data era, the data collected in the process industry shows features of high volume, high-dimensional and non-linear. Meanwhile, these process data present imbalanced feature, leading to a lack of fault information. These problems mentioned above have brought difficulties to fault diagnosis. To solve the above difficulties, a new synthetic minority over-sampling technique (SMOTE) considering the correlation of sample integrated with locality sensitive discriminant analysis (LSDA) and LightGBM fault diagnosis methodology (CSMOTE-LSDA-LightGBM) is proposed in this article. In our proposed methodology, firstly, the SMOTE fully considering correlation (CSMOTE) which uses both Euclidean and Mahalanobis distance to calculate the nearest neighbor relationship is used to resample the imbalanced samples and expand the number of small classification fault samples; secondly, the LSDA is used to dimensionality reduction (DR) to extract the fault-related critical features; finally, the LightGBM classifier is used for fault classification. The Tennessee Eastman (TE) process case is selected for simulation to verify the effectiveness of the proposed CSMOTE-LSDA-LightGBM in fault diagnosis. The simulation results of TE process case show that the proposed method has improved the accuracy of fault diagnosis compared with imbalanced data and traditional DR methods indicating the CSMOTE-LSDA-LightGBM methodology is applicable to fault diagnosis for imbalanced samples.
UR - http://www.scopus.com/inward/record.url?scp=85134310514&partnerID=8YFLogxK
U2 - 10.1109/CoDIT55151.2022.9803941
DO - 10.1109/CoDIT55151.2022.9803941
M3 - Conference contribution
AN - SCOPUS:85134310514
T3 - 2022 8th International Conference on Control, Decision and Information Technologies, CoDIT 2022
SP - 326
EP - 331
BT - 2022 8th International Conference on Control, Decision and Information Technologies, CoDIT 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 8th International Conference on Control, Decision and Information Technologies, CoDIT 2022
Y2 - 17 May 2022 through 20 May 2022
ER -