Watermark for LLM-Generated Text – Schneier on Security
Watermark for LLM-Generated Text
Researchers at Google have developed a watermark for LLM-generated text. The basics are pretty obvious: the LLM chooses between tokens partly based on a cryptographic key, and someone with knowledge of the key can detect those choices. What makes this hard is (1) how much text is required for the watermark to work, and (2) how robust the watermark is to post-generation editing. Google’s version looks pretty good: it’s detectable in text as small as 200 tokens.
Posted on October 25, 2024 at 9:56 AM •
0 Comments
Sidebar photo of Bruce Schneier by Joe MacInnis.