RNPM: Neural-Guided Embedding Region Selection and Error Correction for Robust Audio Multi-Watermarking

Qiutong Li, Tong Liu, Xiaochen Yuan

Research output: Contribution to journalArticlepeer-review

Abstract

Robust audio watermarking plays a crucial role in copyright protection; however, existing techniques suffer from low embedding capacity and limited robustness under severe signal distortions. To solve these limitations, this paper proposes a Robust Neural-Guided Parallel Multi-Watermarking (RNPM) scheme. In the RNPM, we propose a U-Net-Based Embedding Region Selection (ERSU-Net) module to accurately locate multiple embedding regions based on robustness characteristics. To better exploit the intrinsic frequency and energy distribution of audio signals, the ERSU-Net module is enhanced with dual-attention modules, thereby improving the robustness. After determining the embedding regions, they are segmented into multiple overlapping frames to facilitate embedding. To further enhance embedding capacity without compromising robustness, the proposed RNPM integrates Discrete Cosine Transform (DCT) and inter-frame difference-based embedding with Gram–Schmidt orthogonalization, enabling parallel multi-watermark embedding. Furthermore, to mitigate extraction errors caused by signal distortion, an error correction mechanism is integrated with the localized embedding regions, improving overall extraction reliability. Experimental results demonstrate that the proposed RNPM achieves superior robustness and inaudibility. In particular, RNPM maintains high robustness with a Bit Error Rate (BER) value of 0 under 20% cropping, MP3 compression at 64 kbps, and 22.5 kHz resampling attacks, surpassing existing state-of-the-art methods.

Original languageEnglish
Pages (from-to)4552-4562
Number of pages11
JournalIEEE Transactions on Audio, Speech and Language Processing
Volume33
DOIs
Publication statusPublished - 2025

Keywords

  • Gram–Schmidt orthogonalization
  • Robust audio watermarking
  • discrete cosine transform (DCT)
  • error correction
  • inter-frame projection

Fingerprint

Dive into the research topics of 'RNPM: Neural-Guided Embedding Region Selection and Error Correction for Robust Audio Multi-Watermarking'. Together they form a unique fingerprint.

Cite this