Noise-To-Mask Ratio Loss for Deep Neural Network Based Audio Watermarking
Moritz, Martin; Olan, Toni; Virtanen, Tuomas (2024)
Moritz, Martin
Olan, Toni
Virtanen, Tuomas
2024
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:tuni-202504043299
https://urn.fi/URN:NBN:fi:tuni-202504043299
Kuvaus
Peer reviewed
Tiivistelmä
<p>Digital audio watermarking consists in inserting a message into audio signals in a transparent way and can be used to allow automatic recognition of audio material and management of the copyrights. We propose a perceptual loss function to be used in deep neural network based audio watermarking systems. The loss is based on the noise-To-mask ratio (NMR), which is a model of the psychoacoustic masking effect characteristic of the human ear. We use the NMR loss between marked and host signals to train the deep neural models and we evaluate the objective quality with PEAQ and the subjective quality with a MUSHRA test.</p>
Kokoelmat
- TUNICRIS-julkaisut [20210]