An UP-TO-DATE collection of papers on Large Language Model (LLM) watermarking
- A Watermark for Large Language Models
- ICML 2023
- http://arxiv.org/abs/2301.10226
- Publicly Detectable Watermarking for Language Models paper
- An Unforgeable Publicly Verifiable Watermark for Large Language Models
- On the Reliability of Watermarks for Large Language Models
- Improving the Generation Quality of Watermarked Large Language Models via Word Importance Scoring paper
- WatME: Towards Lossless Watermarking Through Lexical Redundancy
- ACL 2024
- https://aclanthology.org/2024.acl-long.496/
- Alias X-Mark: Towards Lossless Watermarking Through Lexical Redundancy paper
- Towards Optimal Statistical Watermarking paper
- Who Wrote this Code? Watermarking for Code Generation
- Natural language watermarking via paraphraser-based lexical substitution
- Artificial Intelligence
- https://linkinghub.elsevier.com/retrieve/pii/S000437022300005X
- Adaptive Text Watermark for Large Language Models
- ICML 2024
- paper
- Duwak: Dual Watermarks in Large Language Models
- ACL Findings 2024
- https://aclanthology.org/2024.findings-acl.678
- Permute-and-Flip: An optimally stable and watermarkable decoder for LLMs
- NeurIPS workshop 2024
- https://arxiv.org/pdf/2402.05864
- BiMarker: Enhancing Text Watermark Detection for Large Language Models with Bipolar Watermarks
- Publicly-Detectable Watermarking for Language Models
- PVMark: Enabling Public Verifiability for LLM Watermarking Schemes
- A watermark for order-agnostic language models
- ICLR 2025
- GumbelSoft: Diversified Language Model Watermarking via the GumbelMax-trick
- MorphMark: Flexible Adaptive Watermarking for Large Language Models
- WaterPool: A Watermark Mitigating Trade-offs among Imperceptibility, Efficacy and Robustness
- SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation
- NAACL 2024
- http://arxiv.org/abs/2310.03991
- k-SemStamp: A Clustering-Based Semantic Watermark for Detection of Machine-Generated Text
- ACL Findings 2024
- http://arxiv.org/abs/2402.11399
- A Semantic Invariant Robust Watermark for Large Language Models
- ICLR 2024
- http://arxiv.org/abs/2310.06356
- A Robust Semantics-based Watermark for Large Language Model against Paraphrasing
- NAACL Findings 2024
- https://aclanthology.org/2024.findings-naacl.40
- Context-aware Watermark with Semantic Balanced Green-red Lists for Large Language Models
- Token-Specific Watermarking with Enhanced Detectability and Semantic Coherence for Large Language Models
- ICML 2024
- http://arxiv.org/abs/2402.18059
- SEFD: Semantic-Enhanced Framework for Detecting LLM-Generated Text paper
- DeepTextMark: Deep Learning based Text Watermarking for Detection of Large Language Model Generated Text paper
- Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding
- IEEE S&P 2021
- https://ieeexplore.ieee.org/document/9519400/
- PersonaMark: Personalized LLM watermarking for model protection and user attribution paper
- REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models
- USENIX Security 2024
- In-Context Watermarks for Large Language Models
- PMark: Towards Robust and Distortion-free Semantic-level Watermarking with Channel Constraints
- Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy
- AAAI
- CoheMark: A Novel Sentence-Level Watermark for Enhanced Text Quality
- Provable Robust Watermarking for AI-Generated Text
- ICLR 2024
- http://arxiv.org/abs/2306.17439
- Watermarking LLMs with Weight Quantization paper
- EmMark: Robust Watermarks for IP Protection of Embedded Quantized Large Language Models paper
- Watermarking Counterfactual Explanations paper
- Provably Robust Watermarks for Open-Source Language Models paper
- Learning to Watermark LLM-generated Text via Reinforcement Learning paper
- Towards Watermarking of Open-Source LLMs paper
- GaussMark: A Practical Approach for Structural Watermarking of Language Models
- Can Watermarked LLMs be Identified by Users via Crafted Prompts?
- An Efficient White-box LLM Watermarking for IP Protection on Online Market Platforms
- VLA-Mark: A cross modal watermark for large vision-language alignment model
- A Watermark for Auto-Regressive Speech Generation Models
- From One Stolen Utterance: Assessing the Risks of Voice Cloning in the AIGC Era
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking
- KGMark: A Diffusion Watermark for Knowledge Graphs
- ICME 2025
- https://arxiv.org/abs/2505.23873
- AgentMark: Utility-Preserving Behavioral Watermarking for Agents
- Securing the Language of Life: Inheritable Watermarks from DNA Language Models to Proteins
- NeurIPS 2025
- https://openreview.net/pdf?id=lh5sXuGfk8
- FoldMark: Safeguarding Protein Structure Generative Models with Distributional and Evolutionary Watermarking
- Enhancing privacy in biosecurity with watermarked protein design
- TabularMark: Watermarking Tabular Datasets for Machine Learning
- An Entropy-based Text Watermarking Detection Method
- WaterSeeker: Efficient Detection of Watermarked Segments in Large Documents paper
- Optimal Detection for Language Watermarks with Pseudorandom Collision
- Adaptive Testing for Segmenting Watermarked Texts From Language Models
- On the Empirical Power of Goodness-of-Fit Tests in Watermark Detection
- Improving Detection of Watermarked Language Models
- Watermarking Cryptographic Capabilities
- 10.1145/2897518.2897651
- Black-Box Detection of Language Model Watermarks
- Distilling the Thought, Watermarking the Answer: A Principle Semantic Guided Watermark for Large Reasoning Models
- CoTGuard: Using Chain-of-Thought Triggering for Copyright Protection in Multi-Agent LLM Systems
- CoTSRF: Utilize Chain of Thought as Stealthy and Robust Fingerprint of Large Language Models
- Practical and Effective Code Watermarking for Large Language Models
- Disappearing Ink: Obfuscation Breaks N-gram Code Watermarks in Theory and Practice
- HeavyWater and SimplexWater: Watermarking Low-Entropy Text Distributions
- Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation
- Invisible Entropy: Towards Safe and Efficient Low-Entropy LLM Watermarking
- CODEIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code
- Who Wrote this Code? Watermarking for Code Generation
- Large Language Model Watermark Stealing With Mixed Integer Programming
- ACSAC 2024
- http://arxiv.org/abs/2405.19677
- Watermark Stealing in Large Language Models
- ICLR 2024 Workshop, ICML 2024
- http://arxiv.org/abs/2402.19361
- Bypassing LLM Watermarks with Color-Aware Substitutions
- Breaking Distortion-free Watermarks in Large Language Models
- Character-Level Perturbations Disrupt LLM Watermarks
- NDSS 2026
- http://arxiv.org/abs/2509.09112
- Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense
- NeurIPS 2023
- http://arxiv.org/abs/2303.13408
- Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models
- ACL 2024
- http://arxiv.org/abs/2402.14007
- Watermark Smoothing Attacks against Language Models
- De-mark: Watermark Removal in Large Language Models
- No Free Lunch in LLM Watermarking: Trade-offs in Watermarking Design Choices
- NeurIPS 2024
- paper
- Watermarks in the Sand: Impossibility of Strong Watermarking for Language Models
- ICML 2024
- paper
- official cite
- Watermark under Fire: A Robustness Evaluation of LLM Watermarking
- EMNLP Findings 2025
- https://aclanthology.org/2025.findings-emnlp.1148/
- Alias WaterPark: A Robustness Assessment of Language Model Watermarking
- $B^4$: A Black-Box Scrubbing Attack on LLM Watermarks
- Can AI-Generated Text be Reliably Detected? paper
- Lost in Overlap: Exploring Watermark Collision in LLMs paper
- RLCracker: Exposing the Vulnerability of LLM Watermarks with Adaptive RL Attacks
- Robustness Assessment and Enhancement of Text Watermarking for Google's SynthID
- Sandcastles in the Storm: Revisiting the (Im)possibility of Strong Watermarking
- Attacking LLM Watermarks by Exploiting Their Strengths
- Revealing Weaknesses in Text Watermarking Through Self-Information Rewrite Attacks
- ICML 2025
- http://arxiv.org/abs/2505.05190
- Warfare: Breaking the Watermark Protection of AI-Generated Content
- Optimizing Adaptive Attacks against Content Watermarks for Language Models
- Discovering Clues of Spoofed LM Watermarks
- On the Learnability of Watermarks for Language Models
- ICLR 2024
- http://arxiv.org/abs/2312.04469
- Can Watermarks Be Used to Detect Large Language Model Intellectual Property Infringement for Free?
- STAMP Your Content: Proving Dataset Membership via Watermarked Rephrasings
- Bileve: Securing Text Provenance in Large Language Models Against Spoofing with Bi-level Signature
- Edit Distance Robust Watermarks for Language Models
- NeurIPS 2024
- https://openreview.net/pdf?id=FZ45kf5pIA
- Waterfall: Framework for Robust and Scalable Text Watermarking paper
- Pseudorandom Error-Correcting Codes
- Advances in Cryptology – CRYPTO 2024
- http://arxiv.org/abs/2402.09370
- Can Watermarked LLMs be Identified by Users via Crafted Prompts?
- Robust Steganography from Large Language Models
- Post-Hoc Watermarking for Robust Detection in Text Generated by Large Language Models
- COLING 2025
- https://aclanthology.org/2025.coling-main.364/
- PostMark: A Robust Blackbox Watermark for Large Language Models
- WaterSearch: A Quality-Aware Search-based Watermarking Framework for Large Language Models
- DualGuard: Dual-stream Large Language Model Watermarking Defense against Paraphrase and Spoofing Attack
- Improved Pseudorandom Codes from Permuted Puzzles
- Watermarking Language Models with Error Correcting Codes
- Two Halves Make a Whole: How to Reconcile Soundness and Robustness in Watermarking for Large Language Models
- Robust and Efficient Watermarking of Large Language Models Using Error Correction Codes
- Proceedings on Privacy Enhancing Technologies
- Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation
- A Certified Robust Watermark For Large Language Models
- WaterMax: breaking the LLM watermark detectability-robustness-quality trade-off
- Is Multilingual LLM Watermarking Truly Multilingual? Scaling Robustness to 100+ Languages via Back-Translation
- Detecting Post-generation Edits to Watermarked LLM Outputs via Combinatorial Watermarking
- Discovering Spoofing Attempts on Language Model Watermarks
- DAMAGE: Detecting Adversarially Modified AI Generated Text
- Let Watermarks Speak: A Robust and Unforgeable Watermark for Language Models
- An Unforgeable Publicly Verifiable Watermark for Large Language Models
- DualGuard: Dual-stream Large Language Model Watermarking Defense against Paraphrase and Spoofing Attack
- Two Halves Make a Whole: How to Reconcile Soundness and Robustness in Watermarking for Large Language Models
- Mitigating Watermark Forgery in Generative Models via Multi-Key Watermarking
- Provably Robust and Secure Steganography in Asymmetric Resource Scenario
- IEEE S&P 2025
- Defending LLM Watermarking Against Spoofing Attacks with Contrastive Representation Learning
- Three Bricks to Consolidate Watermarks for Large Language Models paper
- Provably Robust Multi-bit Watermarking for AI-generated Text via Error Correction Code paper
- USENIX Security 2025
- Advancing Beyond Identification: Multi-bit Watermark for Large Language Models
- NAACL 2024
- https://aclanthology.org/2024.naacl-long.224
- Towards Codable Watermarking for Injecting Multi-bits Information to LLMs paper
- Robust Multi-bit Natural Language Watermarking through Invariant Features
- Multi-Bit Distortion-Free Watermarking for Large Language Models paper
- Robust Multi-bit Text Watermark with LLM-based Paraphrasers paper
- PersonaMark: Personalized LLM watermarking for model protection and user attribution paper
- CODEIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code
- EMNLP Findings 2024
- https://aclanthology.org/2024.findings-emnlp.541
- Enhancing Watermarked Language Models to Identify Users paper
- CredID: Credible Multi-Bit Watermark for Large Language Models Identification paper
- Watermarking Language Models for Many Adaptive Users
- IEEE S&P 2025
- SAEMark: Multi-bit LLM Watermarking with Inference-Time Scaling
- Majority Bit-Aware Watermarking For Large Language Models
- BiMark: Unbiased Multilayer Watermarking for Large Language Models
- ICML 2025
- StealthInk: A Multi-bit and Stealthy Watermark for Large Language Models
- ICML 2025
- http://arxiv.org/abs/2506.05502
- TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent
- DERMARK: A Dynamic, Efficient and Robust Multi-bit Watermark for Large Language Models
- Distributional Information Embedding: A Framework for Multi-bit Watermarking
- Unbiased Watermark for Large Language Models
- ICLR 2024
- http://arxiv.org/abs/2310.10669
- Undetectable Watermarks for Language Models
- Robust Distortion-free Watermarks for Language Models
- A Watermark for Low-entropy and Unbiased Generation in Large Language Models
- A Resilient and Accessible Distribution-Preserving Watermark for Large Language Models
- ICML 2024
- https://arxiv.org/abs/2310.07710
- Watermarking Language Models with Error Correcting Codes paper
- Scalable watermarking for identifying large language model outputs
- Multi-Bit Distortion-Free Watermarking for Large Language Models paper
- Distortion-free Watermarks are not Truly Distortion-free under Watermark Key Collisions paper
- Alias Pseudo- vs. True-Randomness: Rethinking Distortion-Free Watermarks of Language Models under Watermark Key Collisions paper
- HeavyWater and SimplexWater: Distortion-free LLM Watermarks for Low-Entropy Distributions
- An Ensemble Framework for Unbiased Language Model Watermarking
- Analyzing and Evaluating Unbiased Language Model Watermark
- Watermarking Large Language Models: An Unbiased and Low-risk Method
- ACL 2025
- BiMark: Unbiased Multilayer Watermarking for Large Language Models
- LLM Watermarking Using Mixtures and Statistical-to-Computational Gaps
- From Trade-off to Synergy: A Versatile Symbiotic Watermarking Framework for Large Language Models
- Optimized Couplings for Watermarking Large Language Models
- Improved Unbiased Watermark for Large Language Models
- Debiasing Watermarks for Large Language Models via Maximal Coupling
- Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?
- ICML workshop
- https://openreview.net/pdf?id=79NfpNZkXW
- On Evaluating The Performance of Watermarked Machine-Generated Texts Under Adversarial Attacks
- Optimizing Adaptive Attacks against Content Watermarks for Language Models
- Optimizing Watermarks for Large Language Models
- Performance Trade-offs of Watermarking Large Language Models paper
- Watermarking Makes Language Models Radioactive paper
- WaterJudge: Quality-Detection Trade-off when Watermarking Large Language Models paper
- Towards Better Statistical Understanding of Watermarking LLMs paper
- Inevitable Trade-off between Watermark Strength and Speculative Sampling Efficiency for Language Models paper
- LLM Watermarking Using Mixtures and Statistical-to-Computational Gaps
- Lost in Overlap: Exploring Logit-based Watermark Collision in LLMs
- STEAD: Robust Provably Secure Linguistic Steganography with Diffusion Language Model
- LR-DWM: Efficient Watermarking for Diffusion Language Models
- Watermarking Discrete Diffusion Language Models
- A watermark for order-agnostic language models
- ICLR 2025
- Every Step Counts: Decoding Trajectories as Authorship Fingerprints of dLLMs
- DMark: Order-Agnostic Watermarking for Diffusion Large Language Models
- Watermarking Diffusion Language Models
- ICLR 2026
- A Survey of Text Watermarking in the Era of Large Language Models
- ACM Computing Surveys 2024
- http://arxiv.org/abs/2312.07913
- Mark My Words: Analyzing and Evaluating Language Model Watermarks paper
- WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models
- SoK: On the Role and Future of AIGC Watermarking in the Era of Gen-AI paper
- SoK: Watermarking for AI-Generated Content paper
