Skip to content

plll4zzx/Awesome-LLM-Watermark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 

Repository files navigation

Awesome-LLM-Watermark

Page Views Count Stars

An UP-TO-DATE collection list for Large Language Model (LLM) Watermark

Star History Chart

1. LLM watermark

1.1. Token-level watermark

1.2. Sentence-level watermark (sentence embedding-based watermark)

  • WaterPool: A Watermark Mitigating Trade-offs among Imperceptibility, Efficacy and Robustness
  • SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation
  • k-SemStamp: A Clustering-Based Semantic Watermark for Detection of Machine-Generated Text
  • A Semantic Invariant Robust Watermark for Large Language Models
  • A Robust Semantics-based Watermark for Large Language Model against Paraphrasing
  • Context-aware Watermark with Semantic Balanced Green-red Lists for Large Language Models
  • Token-Specific Watermarking with Enhanced Detectability and Semantic Coherence for Large Language Models
  • SEFD: Semantic-Enhanced Framework for Detecting LLM-Generated Text paper
  • DeepTextMark: Deep Learning based Text Watermarking for Detection of Large Language Model Generated Text paper
  • Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding
  • PersonaMark: Personalized LLM watermarking for model protection and user attribution paper
  • REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models
    • USENIX Security 2024
  • In-Context Watermarks for Large Language Models
  • PMark: Towards Robust and Distortion-free Semantic-level Watermarking with Channel Constraints
  • Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy
    • AAAI
  • CoheMark: A Novel Sentence-Level Watermark for Enhanced Text Quality

1.3. Model-level watermark

1.4.Watermark for Multi-Modal

1.5.Watermark for New Area

1.6.Watermarking detection

1.7.COT Watermark

1.8.Low Entropy Watermark

  • Practical and Effective Code Watermarking for Large Language Models
  • Disappearing Ink: Obfuscation Breaks N-gram Code Watermarks in Theory and Practice
  • HeavyWater and SimplexWater: Watermarking Low-Entropy Text Distributions
  • Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation
  • Invisible Entropy: Towards Safe and Efficient Low-Entropy LLM Watermarking
  • CODEIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code
  • Who Wrote this Code? Watermarking for Code Generation

2. Attack for watermark

2.1. Watermark stealing attack

2.2. Watermark removal attack

2.3. Watermark spoofing attack & Learnability

  • Discovering Clues of Spoofed LM Watermarks
  • On the Learnability of Watermarks for Language Models
  • CAN WATERMARKS BE USED TO DETECT LARGE LANGUAGE MODEL INTELLECTUAL PROPERTY IN- FRINGEMENT FOR FREE?
  • STAMP Your Content: Proving Dataset Membership via Watermarked Rephrasings
  • Bileve: Securing Text Provenance in Large Language Models Against Spoofing with Bi-level Signature

2.4. Robust watermark

2.5. Anti-spoofing Watermark

3. Multi-bit watermark

4. Unbiased watermark

  • Unbiased Watermark for Large Language Models
  • Undetectable Watermarks for Language Models
  • Robust Distortion-free Watermarks for Language Models
  • A Watermark for Low-entropy and Unbiased Generation in Large Language Models
  • A Resilient and Accessible Distribution-Preserving Watermark for Large Language Models
  • Watermarking Language Models with Error Correcting Codes paper
  • Scalable watermarking for identifying large language model outputs
  • Multi-Bit Distortion-Free Watermarking for Large Language Models paper
  • Distortion-free Watermarks are not Truly Distortion-free under Watermark Key Collisions paper
    • Alias Pseudo- vs. True-Randomness: Rethinking Distortion-Free Watermarks of Language Models under Watermark Key Collisions paper
  • HeavyWater and SimplexWater: Distortion-free LLM Watermarks for Low-Entropy Distributions
  • An Ensemble Framework for Unbiased Language Model Watermarking
  • Analyzing and Evaluating Unbiased Language Model Watermark
  • Watermarking Large Language Models: An Unbiased and Low-risk Method
    • ACL 2025
  • BiMark: Unbiased Multilayer Watermarking for Large Language Models
  • LLM Watermarking Using Mixtures and Statistical-to-Computational Gaps
  • From Trade-off to Synergy: A Versatile Symbiotic Watermarking Framework for Large Language Models
  • Optimized Couplings for Watermarking Large Language Models
  • Improved Unbiased Watermark for Large Language Models
  • Debiasing Watermarks for Large Language Models via Maximal Coupling

5. Analysis of LLM watermark

6. Watermark for Diffusion Language Model

  • STEAD: Robust Provably Secure Linguistic Steganography with Diffusion Language Model
  • LR-DWM: Efficient Watermarking for Diffusion Language Models
  • Watermarking Discrete Diffusion Language Models
  • A watermark for order-agnostic language models
    • ICLR 2025
  • Every Step Counts: Decoding Trajectories as Authorship Fingerprints of dLLMs
  • DMark: Order-Agnostic Watermarking for Diffusion Large Language Models
  • Watermarking Diffusion Language Models
    • ICLR 2026

7. Survey

  • A Survey of Text Watermarking in the Era of Large Language Models
  • Mark My Words: Analyzing and Evaluating Language Model Watermarks paper
  • WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models
  • SoK: On the Role and Future of AIGC Watermarking in the Era of Gen-AI paper
  • SoK: Watermarking for AI-Generated Content paper

About

A collection list for Large Language Model (LLM) Watermark

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors