Automatic Speech Recognition for Portuguese with Small Data Set

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Voice recognition has become more and more popular in various systems and applications. To further promote Macau tourism worldwide, a mobile Macau tourism APP is being developing that supports voice control to facilitate Portuguese users. Consequently, this paper is about the research and implementation of an Automatic Speech Recognition (ASR) engine for Portuguese language. In this research, three well-known open-source ASR platforms were evaluated and compared. The complete ASR development procedure using Kaldi platform is discussed. Due to the limitation of collected voice data, a novel few-shot learning and transfer learning is implemented in this project. The final model achieved a stable 95.25% accuracy which is good enough for production use. The novel technics implemented in this research can be used for ASR trainings with limited training data and can be extended to a wide range of applications in the future.

Original languageEnglish
Title of host publicationComputer and Information Science 2021 - Fall
EditorsRoger Lee
PublisherSpringer Science and Business Media Deutschland GmbH
Pages1-13
Number of pages13
ISBN (Print)9783030905279
DOIs
Publication statusPublished - 2022
Event21st IEEE/ACIS International Fall Virtual Conference on Computer and Information Science, ICIS 2021 - Xi'an, China
Duration: 13 Oct 202115 Oct 2021

Publication series

NameStudies in Computational Intelligence
Volume1003 SCI
ISSN (Print)1860-949X
ISSN (Electronic)1860-9503

Conference

Conference21st IEEE/ACIS International Fall Virtual Conference on Computer and Information Science, ICIS 2021
Country/TerritoryChina
CityXi'an
Period13/10/2115/10/21

Keywords

  • ASR
  • Few-shot learning
  • Portuguese voice recognition
  • Transfer learning

Fingerprint

Dive into the research topics of 'Automatic Speech Recognition for Portuguese with Small Data Set'. Together they form a unique fingerprint.

Cite this