Audio post-processing detection and identification based on audio features

Yunzhen Zhan, Xiaochen Yuan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Citations (Scopus)

Abstract

As an important communication medium, audios are easily modified or tampered during transmission; thus the authenticity of audios is of high importance. This paper mainly introduces a method to detect audio post-processing based on audio features; the Support Vector Machine (SVM) is applied for classification during the detection. In the proposed method, the Mel Frequency Cepstral Coefficient (MFCC) and the Linear Prediction Coding (LPC) of host audios are calculated as audio features, to which SVM is applied to judge the authenticity of the audios. Experimental results show that the proposed audio feature based method can not only verify the authenticity of speech audio, but also have a significant effect on detecting different types of post-processing operations.

Original languageEnglish
Title of host publicationProceedings of 2017 International Conference on Wavelet Analysis and Pattern Recognition, ICWAPR 2017
PublisherIEEE Computer Society
Pages154-158
Number of pages5
ISBN (Electronic)9781538604106
DOIs
Publication statusPublished - 19 Oct 2017
Externally publishedYes
Event2017 International Conference on Wavelet Analysis and Pattern Recognition, ICWAPR 2017 - Ningbo, China
Duration: 9 Jul 201712 Jul 2017

Publication series

NameInternational Conference on Wavelet Analysis and Pattern Recognition
Volume1
ISSN (Print)2158-5695
ISSN (Electronic)2158-5709

Conference

Conference2017 International Conference on Wavelet Analysis and Pattern Recognition, ICWAPR 2017
Country/TerritoryChina
CityNingbo
Period9/07/1712/07/17

Keywords

  • Audio Feature
  • Audio Post-processing Detection
  • Linear Prediction Coding (LPC)
  • Mel Frequency Cepstral Coefficient (MFCC)
  • Support Vector Machine (SVM)

Fingerprint

Dive into the research topics of 'Audio post-processing detection and identification based on audio features'. Together they form a unique fingerprint.

Cite this