TY - GEN
T1 - Recognition of Score Word in Freestyle Kayaking
AU - Zhang, Qiyuan
AU - Yuan, Xiaochen
AU - Lam, Chan Tong
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Speech is the most natural information carrier for human beings, and it is likely to become the main way of human-computer interaction in the future. This paper presents an isolated score word recognition method using Mel-scale Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW). The processing stage of the speech signal is the basic stage of the speech recognition system, to analyze the speech signal and convert it into speech feature parameters. An endpoints detection method is proposed using the joint adjustment of short-term energy and zero-crossing rate. It can better detect the endpoints, and directly improve the accuracy of subsequent work. On this basis, the MFCC feature is then extracted from the preprocessed speech signal, and the DTW pattern matching is applied to the extracted features. In the experiments, speeches from multiple speakers were collected, each with a specific freestyle kayak action word. The results show that this method has better performance comparing with the existing methods.
AB - Speech is the most natural information carrier for human beings, and it is likely to become the main way of human-computer interaction in the future. This paper presents an isolated score word recognition method using Mel-scale Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW). The processing stage of the speech signal is the basic stage of the speech recognition system, to analyze the speech signal and convert it into speech feature parameters. An endpoints detection method is proposed using the joint adjustment of short-term energy and zero-crossing rate. It can better detect the endpoints, and directly improve the accuracy of subsequent work. On this basis, the MFCC feature is then extracted from the preprocessed speech signal, and the DTW pattern matching is applied to the extracted features. In the experiments, speeches from multiple speakers were collected, each with a specific freestyle kayak action word. The results show that this method has better performance comparing with the existing methods.
KW - Dynamic Time Warping
KW - End Point Detection
KW - Freestyle Kayaking
KW - Mel-scale Frequency Cepstral Coefficients
UR - http://www.scopus.com/inward/record.url?scp=85136213927&partnerID=8YFLogxK
U2 - 10.1109/ICEIEC54567.2022.9835045
DO - 10.1109/ICEIEC54567.2022.9835045
M3 - Conference contribution
AN - SCOPUS:85136213927
T3 - ICEIEC 2022 - Proceedings of 2022 IEEE 12th International Conference on Electronics Information and Emergency Communication
SP - 67
EP - 70
BT - ICEIEC 2022 - Proceedings of 2022 IEEE 12th International Conference on Electronics Information and Emergency Communication
A2 - Wenzheng, Li
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 12th IEEE International Conference on Electronics Information and Emergency Communication, ICEIEC 2022
Y2 - 15 July 2022 through 17 July 2022
ER -