TY - JOUR
T1 - QLAW
T2 - An Improved Quantization-Based Local Audio Watermarking Scheme Using Inter-Frame Correlation
AU - Li, Qiutong
AU - Xing, Zheng
AU - Wang, Ju
AU - Huang, Guoheng
AU - Yuan, Xiaochen
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2025
Y1 - 2025
N2 - With the rapid development of the Internet, audio distribution has become more convenient with increasing copyright infringement. To address this problem, this paper proposes a quantization-based local audio watermarking scheme using inter-frame correlation, integrating machine learning techniques and traditional methods. To obtain the time-frequency spectrogram of the audio signal, a Short-Time Fourier Transform (STFT) is first applied to the audio signal. Then, Main Energy Region Extractor (MERE) is proposed to extract the main energy region of the spectrogram. Based on the main energy region, the Stable Frequency and Energy Region Extractor is conducted to find the local feature region for embedding. After segmenting the local feature embedding region into several frames, Adjacent Frame Extraction Process (AFEP) is conducted to select the adjacent frame. Then, Discrete Cosine Transform (DCT) is applied to each embedding frame and its adjacent frame to extract their corresponding frequency domain coefficients. To improve robustness, mid-frequency DCT coefficients are alternately selected to embed the watermark. By adjusting the difference between the embedding frame and its corresponding adjacent frame in a predefined range, the local watermark is embedded. Experimental results show that the proposed scheme outperforms existing schemes in inaudibility and robustness, achieving an average Signal-to-Noise Ratio (SNR) above 25 dB and a lower Bit Error Rate (BER) under various attacks.
AB - With the rapid development of the Internet, audio distribution has become more convenient with increasing copyright infringement. To address this problem, this paper proposes a quantization-based local audio watermarking scheme using inter-frame correlation, integrating machine learning techniques and traditional methods. To obtain the time-frequency spectrogram of the audio signal, a Short-Time Fourier Transform (STFT) is first applied to the audio signal. Then, Main Energy Region Extractor (MERE) is proposed to extract the main energy region of the spectrogram. Based on the main energy region, the Stable Frequency and Energy Region Extractor is conducted to find the local feature region for embedding. After segmenting the local feature embedding region into several frames, Adjacent Frame Extraction Process (AFEP) is conducted to select the adjacent frame. Then, Discrete Cosine Transform (DCT) is applied to each embedding frame and its adjacent frame to extract their corresponding frequency domain coefficients. To improve robustness, mid-frequency DCT coefficients are alternately selected to embed the watermark. By adjusting the difference between the embedding frame and its corresponding adjacent frame in a predefined range, the local watermark is embedded. Experimental results show that the proposed scheme outperforms existing schemes in inaudibility and robustness, achieving an average Signal-to-Noise Ratio (SNR) above 25 dB and a lower Bit Error Rate (BER) under various attacks.
KW - Audio watermarking technology
KW - DCT
KW - STFT
KW - machine learning
UR - http://www.scopus.com/inward/record.url?scp=105006886215&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2025.3573838
DO - 10.1109/ACCESS.2025.3573838
M3 - Article
AN - SCOPUS:105006886215
SN - 2169-3536
VL - 13
SP - 93359
EP - 93371
JO - IEEE Access
JF - IEEE Access
ER -