Watermarking Capabilities #2940

camscottie · 2025-06-26T17:14:08Z

I have been thinking of implementing utilities for watermarking within the speechbrain library. I wanted to share my thoughts here and open a discussion to see whether others agree or have ideas of their own.

Watermarking in audio is not new, and recent developments in neural methods are allowing for more robust implementations.
The primary use cases I can see:

Responsible development: Conversational agents can be misused to deceive others (e.g. deepfakes), so it may benefit developers to be able to identify whether their agent has been used by bad actors to generate speech for such purposes.
Linking modified data to source data: A watermark could be used to identify a source file. For example, if your data goes through several iterations of augmentation or slicing, a key:value database keyed on the watermark payload would let you link back to the source file without having to build a graph of relations.
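
To make the key:value idea a bit more concrete, here is a rough sketch of what such a lookup could look like. Everything in it is hypothetical: `WatermarkRegistry`, `register`, `lookup` and the 16-bit payload size are names and values I made up for illustration, and an in-memory dict stands in for whatever database would actually be used.

```python
import secrets
from typing import Dict, Iterable, Optional, Tuple

# Assumed payload length in bits; the real value depends on the
# watermarking method that ends up being used.
PAYLOAD_BITS = 16


class WatermarkRegistry:
    """Hypothetical key:value store mapping a watermark payload to a source file."""

    def __init__(self) -> None:
        self._store: Dict[Tuple[int, ...], str] = {}

    def register(self, source_path: str) -> Tuple[int, ...]:
        """Assign an unused random payload to source_path and return it."""
        if len(self._store) >= 2 ** PAYLOAD_BITS:
            raise RuntimeError("payload space exhausted")
        while True:
            payload = tuple(secrets.randbelow(2) for _ in range(PAYLOAD_BITS))
            if payload not in self._store:
                self._store[payload] = source_path
                return payload

    def lookup(self, payload: Iterable[int]) -> Optional[str]:
        """Return the source file for a decoded payload, or None if unknown."""
        return self._store.get(tuple(int(b) for b in payload))
```

The payload returned by `register` would be embedded in the source audio before any augmentation or slicing; any derived clip whose watermark still decodes can then be passed to `lookup` to recover the source path. A 16-bit payload only distinguishes 65,536 files, so a larger dataset would need a longer payload or a different scheme.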

As a first step, I was thinking of adding an interface to wavmark with default settings, which could be expanded later.
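
As a starting point for discussion, a thin wrapper might look something like the sketch below. `WavMarkWatermarker`, `embed` and `extract` are hypothetical names, not existing speechbrain classes. The `load_model`, `encode_watermark` and `decode_watermark` calls follow my reading of the wavmark README and should be checked against the installed version. The `backend` argument is only there so the wrapper can be exercised without downloading the model.

```python
class WavMarkWatermarker:
    """Hypothetical thin wrapper around wavmark; not part of speechbrain."""

    def __init__(self, backend=None, model=None, device="cpu"):
        if backend is None:
            # Lazy import so wavmark stays an optional dependency.
            # Assumes `pip install wavmark` has been run.
            import wavmark

            backend = wavmark
            model = wavmark.load_model().to(device)
        self.backend = backend
        self.model = model

    def embed(self, signal, payload):
        """Return a watermarked copy of signal carrying payload.

        signal: mono waveform array (wavmark's README uses 16 kHz audio).
        payload: array of 0/1 bits (16 bits in the README example).
        """
        watermarked, _info = self.backend.encode_watermark(
            self.model, signal, payload, show_progress=False
        )
        return watermarked

    def extract(self, signal):
        """Return the decoded payload; may be None if no watermark is detected."""
        payload, _info = self.backend.decode_watermark(
            self.model, signal, show_progress=False
        )
        return payload
```

With wavmark installed, usage would be roughly `wm = WavMarkWatermarker()`, then `marked = wm.embed(signal, payload)` and `wm.extract(marked)`. Keeping the backend injectable should also make it easier to swap in other watermarking methods later.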

What do people think? Would this be a useful addition to the speechbrain library? I'm open to suggestions.
