An Automatic Speech Segmentation Algorithm of Portuguese based on Spectrogram Windowing

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Sentence segmentation is important for improving the human readability of Automatic Speech Recognition (ASR) systems. Although it has been explored through numerous interdisciplinary studies, segmentation of Portuguese is still time-consuming due to the lack of efficient automatic segmentation methods and the reliance on qualified phonetic experts. This paper presents a novel algorithm that efficiently segments speech into sentences by learning the spectrogram of sentences through windows using a classification model developed with an Artificial Neural Network (ANN). Based on our experiments, the beginning part of a European Portuguese (EP) sentence enables better identification of the sentence's boundaries. In addition, a window frame of spectrogram constructed by the previous ending of 100 milliseconds (ms) and the subsequent beginning of 300 ms presents the best performance in the automatic sentence segmentation. As a result, the proposed algorithm can automatically segment Portuguese speech into sentences by analyzing its spectrogram without knowing the speech semantics.

Original languageEnglish
Title of host publication2022 IEEE World AI IoT Congress, AIIoT 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages290-295
Number of pages6
ISBN (Electronic)9781665484534
DOIs
Publication statusPublished - 2022
Event2022 IEEE World AI IoT Congress, AIIoT 2022 - Seattle, United States
Duration: 6 Jun 20229 Jun 2022

Publication series

Name2022 IEEE World AI IoT Congress, AIIoT 2022

Conference

Conference2022 IEEE World AI IoT Congress, AIIoT 2022
Country/TerritoryUnited States
CitySeattle
Period6/06/229/06/22

Keywords

  • Portuguese speech
  • natural language processing
  • sentence segmentation
  • spectrogram

Fingerprint

Dive into the research topics of 'An Automatic Speech Segmentation Algorithm of Portuguese based on Spectrogram Windowing'. Together they form a unique fingerprint.

Cite this