TY - JOUR
T1 - Water Quality Monitoring
T2 - A Water Quality Dataset from an On-Site Study in Macao
AU - Gao, Jiawei
AU - Chen, Bochao
AU - Tang, Su Kit
N1 - Publisher Copyright:
© 2025 by the authors.
PY - 2025/4
Y1 - 2025/4
N2 - The Building Safe Water Use Plan promoted by the Macao Marine and Water Bureau aims to encourage property management entities to regularly maintain building water supply systems to ensure the safety and stability of drinking water. However, traditional laboratory testing methods are often time-consuming and labor-intensive, making real-time and efficient water quality monitoring challenging. To address this issue, this study proposes a Raspberry Pi-based multi-sensor system for rapid water quality detection and improved monitoring efficiency. This system integrates multiple sensors to measure key water quality parameters, such as pH, total dissolved solids (TDSs), temperature, and turbidity, while recording data in real-time. The data were continuously collected over a period of five months (July to November 2024). The collected data were analyzed and validated using machine learning algorithms, including Isolation Forest, Random Forest, Logistic Regression, and Local Outlier Factor. Among these models, Random Forest exhibited the best overall performance, achieving an accuracy of 98.10% and an F1 score of 98.99%. These results show that the dataset demonstrates high reliability in anomaly detection and classification tasks, accurately identifying deviations in water quality. This approach not only enhances the efficiency of water quality monitoring but also provides technological support for urban drinking water safety management.
AB - The Building Safe Water Use Plan promoted by the Macao Marine and Water Bureau aims to encourage property management entities to regularly maintain building water supply systems to ensure the safety and stability of drinking water. However, traditional laboratory testing methods are often time-consuming and labor-intensive, making real-time and efficient water quality monitoring challenging. To address this issue, this study proposes a Raspberry Pi-based multi-sensor system for rapid water quality detection and improved monitoring efficiency. This system integrates multiple sensors to measure key water quality parameters, such as pH, total dissolved solids (TDSs), temperature, and turbidity, while recording data in real-time. The data were continuously collected over a period of five months (July to November 2024). The collected data were analyzed and validated using machine learning algorithms, including Isolation Forest, Random Forest, Logistic Regression, and Local Outlier Factor. Among these models, Random Forest exhibited the best overall performance, achieving an accuracy of 98.10% and an F1 score of 98.99%. These results show that the dataset demonstrates high reliability in anomaly detection and classification tasks, accurately identifying deviations in water quality. This approach not only enhances the efficiency of water quality monitoring but also provides technological support for urban drinking water safety management.
KW - anomaly detection
KW - dataset
KW - Macao
KW - water quality monitoring
UR - http://www.scopus.com/inward/record.url?scp=105003721942&partnerID=8YFLogxK
U2 - 10.3390/app15084130
DO - 10.3390/app15084130
M3 - Article
AN - SCOPUS:105003721942
SN - 2076-3417
VL - 15
JO - Applied Sciences (Switzerland)
JF - Applied Sciences (Switzerland)
IS - 8
M1 - 4130
ER -