TY - GEN
T1 - Vision-based mobile people counting system
AU - Yang, Xu
AU - Gaspar, Jose
AU - Lou, Weng Hong
AU - Ke, Wei
AU - Lam, Chan Tong
AU - Wang, Yapeng
N1 - Publisher Copyright:
© 2019 ACM.
PY - 2019/6/21
Y1 - 2019/6/21
N2 - People detection and counting systems are highly valuable in multiple situations including managing emergency situations and efficiently allocating resources. However, most people counting systems are based on fixed sensors or fixed cameras, which lack flexibility and convenience. In this paper, we have developed a vision-based mobile people counting system which uses Android smartphones to capture images, and state-of-the-art person detectors, based on artificial intelligence, to count the number of people in a designated area. The embedded devices in smartphones such as camera, clock, GPS, are utilized to provide additional information for data collection. Several person detection frameworks such as You Only Look Once v2 (YOLO2), Aggregate Channel Features (ACF) and Multi-Task cascade Convolutional Neural Network (MTCNN) were evaluated to determine the best performing algorithm capable of offering accurate counting results across different scenarios. The experiments results show that YOLO2 outperforms ACF and MTCNN detection algorithms in different scenarios. However, YOLO2 has its own limitations as it often outputs redundant detections, requiring an additional Non-Maxima Suppression (NMS) algorithm to output a single bounding box per detection. The NMS threshold has to be carefully pre-fixed to provide top detection and counting performance across different scenarios.
AB - People detection and counting systems are highly valuable in multiple situations including managing emergency situations and efficiently allocating resources. However, most people counting systems are based on fixed sensors or fixed cameras, which lack flexibility and convenience. In this paper, we have developed a vision-based mobile people counting system which uses Android smartphones to capture images, and state-of-the-art person detectors, based on artificial intelligence, to count the number of people in a designated area. The embedded devices in smartphones such as camera, clock, GPS, are utilized to provide additional information for data collection. Several person detection frameworks such as You Only Look Once v2 (YOLO2), Aggregate Channel Features (ACF) and Multi-Task cascade Convolutional Neural Network (MTCNN) were evaluated to determine the best performing algorithm capable of offering accurate counting results across different scenarios. The experiments results show that YOLO2 outperforms ACF and MTCNN detection algorithms in different scenarios. However, YOLO2 has its own limitations as it often outputs redundant detections, requiring an additional Non-Maxima Suppression (NMS) algorithm to output a single bounding box per detection. The NMS threshold has to be carefully pre-fixed to provide top detection and counting performance across different scenarios.
KW - ACF
KW - MTCNN
KW - Mobile Application
KW - People Counting System
KW - YOLO2
UR - http://www.scopus.com/inward/record.url?scp=85071138663&partnerID=8YFLogxK
U2 - 10.1145/3340997.3340999
DO - 10.1145/3340997.3340999
M3 - Conference contribution
AN - SCOPUS:85071138663
T3 - ACM International Conference Proceeding Series
SP - 42
EP - 46
BT - Proceedings of the 2019 4th International Conference on Machine Learning Technologies, ICMLT 2019
PB - Association for Computing Machinery
T2 - 4th International Conference on Machine Learning Technologies, ICMLT 2019
Y2 - 21 June 2019 through 23 June 2019
ER -