An Automatic Speech Segmentation Algorithm of Portuguese based on Spectrogram Windowing

Research output: Conference contribution, peer-reviewed

Abstract

Sentence segmentation is important for improving the human readability of the output of Automatic Speech Recognition (ASR) systems. Although it has been explored in numerous interdisciplinary studies, segmentation of Portuguese speech remains time-consuming because efficient automatic segmentation methods are lacking and the work relies on qualified phonetic experts. This paper presents a novel algorithm that efficiently segments speech into sentences by learning the spectrogram of sentences through windows, using a classification model built with an Artificial Neural Network (ANN). In our experiments, the beginning of a European Portuguese (EP) sentence enables better identification of the sentence's boundaries. In addition, a spectrogram window composed of the final 100 milliseconds (ms) of the preceding sentence and the first 300 ms of the following sentence gives the best automatic sentence segmentation performance. As a result, the proposed algorithm can automatically segment Portuguese speech into sentences by analyzing its spectrogram, without knowing the semantics of the speech.
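The windowing idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The function names `spectrogram` and `boundary_window`, the 25 ms frame length, the 10 ms hop, the Hann window, and the log scaling are all assumptions chosen for illustration. Only the 100 ms / 300 ms split around a candidate boundary comes from the abstract. The resulting window is the kind of input that could be passed to a classifier such as an ANN.

```python
import numpy as np


def spectrogram(signal, sample_rate, frame_ms=25.0, hop_ms=10.0):
    """Return a log-magnitude spectrogram of shape (n_frames, n_bins).

    Frame length, hop, and log scaling are illustrative assumptions.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    hop_len = int(sample_rate * hop_ms / 1000)
    n_frames = 1 + (len(signal) - frame_len) // hop_len
    window = np.hanning(frame_len)
    # Cut the signal into overlapping frames and taper each one.
    frames = np.stack([
        signal[i * hop_len : i * hop_len + frame_len] * window
        for i in range(n_frames)
    ])
    return np.log1p(np.abs(np.fft.rfft(frames, axis=1)))


def boundary_window(spec, boundary_ms, hop_ms=10.0,
                    before_ms=100.0, after_ms=300.0):
    """Slice the frames from before_ms ahead of a candidate boundary
    to after_ms past it (100 ms and 300 ms, as in the abstract)."""
    center = int(round(boundary_ms / hop_ms))
    start = center - int(round(before_ms / hop_ms))
    stop = center + int(round(after_ms / hop_ms))
    if start < 0 or stop > spec.shape[0]:
        raise ValueError("window extends past the spectrogram")
    return spec[start:stop]
```

With a 10 ms hop, a candidate boundary at 1000 ms yields a window of 40 frames (10 before the boundary, 30 after), which could be flattened into a feature vector for a boundary / non-boundary classifier.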

Original language: English
Title of host publication: 2022 IEEE World AI IoT Congress, AIIoT 2022
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 290-295
Number of pages: 6
ISBN (Electronic): 9781665484534
Publication status: Published - 2022
Event: 2022 IEEE World AI IoT Congress, AIIoT 2022 - Seattle, United States
Duration: 6 Jun 2022 to 9 Jun 2022

Publication series

Name: 2022 IEEE World AI IoT Congress, AIIoT 2022

Conference

Conference: 2022 IEEE World AI IoT Congress, AIIoT 2022
Country/Territory: United States
City: Seattle
Period: 6/06/22 to 9/06/22
