Automatic Speech Recognition for Portuguese with Small Data Set

研究成果: Conference contribution同行評審

摘要

Voice recognition has become more and more popular in various systems and applications. To further promote Macau tourism worldwide, a mobile Macau tourism APP is being developing that supports voice control to facilitate Portuguese users. Consequently, this paper is about the research and implementation of an Automatic Speech Recognition (ASR) engine for Portuguese language. In this research, three well-known open-source ASR platforms were evaluated and compared. The complete ASR development procedure using Kaldi platform is discussed. Due to the limitation of collected voice data, a novel few-shot learning and transfer learning is implemented in this project. The final model achieved a stable 95.25% accuracy which is good enough for production use. The novel technics implemented in this research can be used for ASR trainings with limited training data and can be extended to a wide range of applications in the future.

原文English
主出版物標題Computer and Information Science 2021 - Fall
編輯Roger Lee
發行者Springer Science and Business Media Deutschland GmbH
頁面1-13
頁數13
ISBN(列印)9783030905279
DOIs
出版狀態Published - 2022
事件21st IEEE/ACIS International Fall Virtual Conference on Computer and Information Science, ICIS 2021 - Xi'an, China
持續時間: 13 10月 202115 10月 2021

出版系列

名字Studies in Computational Intelligence
1003 SCI
ISSN(列印)1860-949X
ISSN(電子)1860-9503

Conference

Conference21st IEEE/ACIS International Fall Virtual Conference on Computer and Information Science, ICIS 2021
國家/地區China
城市Xi'an
期間13/10/2115/10/21

指紋

深入研究「Automatic Speech Recognition for Portuguese with Small Data Set」主題。共同形成了獨特的指紋。

引用此