CommonMark has DisallowedRawHtml extension bypass via whitespace in HTML tag names
Moderate severity
GitHub Reviewed
Published Mar 5, 2026 in thephpleague/commonmark
Updated Mar 20, 2026
Published to the GitHub Advisory Database: Mar 6, 2026
Reviewed: Mar 6, 2026
Published by the National Vulnerability Database: Mar 7, 2026
Last updated: Mar 20, 2026
Description
Impact
The DisallowedRawHtml extension can be bypassed by inserting a newline, tab, or other ASCII whitespace character between a disallowed HTML tag name and the closing >. For example, <script\n> would pass through unfiltered and be rendered as a valid HTML tag by browsers. This is a cross-site scripting (XSS) vector for any application that relies on this extension to sanitize untrusted user input.
All applications using the DisallowedRawHtml extension to process untrusted Markdown are affected. Applications that use a dedicated HTML sanitizer (such as HTML Purifier) on the rendered output are not affected.
Patches
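To make the bypass concrete, the following is a minimal sketch that compares the pre-fix terminator class [ \/>] with the patched class [\s\/>]. It is not the library's code: the library is PHP, Python's re module is used here only for illustration, and the tag list, build_filter, and escape_disallowed are hypothetical names chosen for this example. The two regex engines also differ in details, so treat this as a demonstration of the idea rather than of exact behavior.

```python
import re

# Hypothetical, shortened tag list for illustration only.
DISALLOWED_TAGS = ["script", "iframe", "style"]


def build_filter(terminator_class: str) -> re.Pattern:
    """Match '<tag' only when it is followed by a character from terminator_class."""
    names = "|".join(re.escape(tag) for tag in DISALLOWED_TAGS)
    return re.compile(rf"<({names})(?={terminator_class})", re.IGNORECASE)


def escape_disallowed(html: str, pattern: re.Pattern) -> str:
    """Neutralize a matched tag by escaping its opening angle bracket."""
    return pattern.sub(r"&lt;\1", html)


# Character classes as quoted in the advisory text.
PRE_FIX = build_filter(r"[ \/>]")    # space, slash, or '>' only
PATCHED = build_filter(r"[\s\/>]")   # any whitespace, slash, or '>'

# A newline between the tag name and the closing '>'.
payload = "<script\n>alert(1)</script>"

print(repr(escape_disallowed(payload, PRE_FIX)))
print(repr(escape_disallowed(payload, PATCHED)))
```

With the pre-fix class the payload is returned unchanged, because a newline is not one of the accepted terminators. With the patched class the opening bracket is escaped to &lt; and the tag is no longer live markup.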
Fixed in 2.8.1. The regex character class [ \/>] was changed to [\s\/>] to match all whitespace characters that browsers accept as valid tag name terminators.
Workarounds
Set the html_input configuration option to 'escape' or 'strip' to disable all raw HTML, though this is a broader restriction than the DisallowedRawHtml extension provides.
References