Description
html-sanitizer is an allowlist-based HTML cleaner. When `keep_typographic_whitespace=False` (the default), the sanitizer normalizes Unicode text to the NFKC form as its final step. Some Unicode characters normalize to chevrons (`<` and `>`), so specially crafted HTML can escape sanitization. The problem has been fixed in 2.4.2.
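The normalization step can be illustrated with the Python standard library alone. The sketch below does not call html-sanitizer and does not reproduce its internals or its actual fix; the payload and the `normalize_then_escape` helper are hypothetical, chosen only to show how NFKC turns fullwidth look-alike characters into ASCII chevrons and why the order of normalization and escaping matters.

```python
import html
import unicodedata

# Hypothetical payload: U+FF1C and U+FF1E are the fullwidth forms of "<" and ">".
payload = "\uff1cscript\uff1ealert(1)\uff1c/script\uff1e"

# Before normalization the string contains no ASCII chevrons, so an HTML
# parser treats it as ordinary text rather than as a tag.
assert "<" not in payload and ">" not in payload

# NFKC folds compatibility characters into their ASCII equivalents,
# which turns the text into real markup.
normalized = unicodedata.normalize("NFKC", payload)
print(normalized)


def normalize_then_escape(text: str) -> str:
    """Hypothetical helper: normalize first, then escape.

    Escaping after normalization means any chevrons produced by NFKC
    are neutralized instead of reaching the output as markup.
    """
    return html.escape(unicodedata.normalize("NFKC", text))


print(normalize_then_escape(payload))
```

Any step that rewrites characters after sanitization can reintroduce markup in this way, which is why the sketch applies normalization before escaping rather than after.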
References (3)
Vendor Advisory x_refsource_confirm
https://github.com/matthiask/html-sanitizer/security/advisories/GHSA-wvhx-q427-fgh3
Scores
CVSS v3: 6.1
EPSS: 0.0031
EPSS Percentile: 54.4%
Attack Vector: NETWORK
Vector: CVSS:3.1/AV:N/AC:L/PR:N/UI:R/S:C/C:L/I:L/A:N
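The 6.1 base score can be recomputed from the vector string. The following is a minimal sketch of the CVSS v3.1 base score equations, assuming the metric weights published in the v3.1 specification; the names `base_score` and `roundup` are illustrative, and temporal and environmental metrics are not handled.

```python
import math

# Metric weights from the CVSS v3.1 specification (base metrics only).
WEIGHTS = {
    "AV": {"N": 0.85, "A": 0.62, "L": 0.55, "P": 0.2},
    "AC": {"L": 0.77, "H": 0.44},
    "UI": {"N": 0.85, "R": 0.62},
    "C": {"H": 0.56, "L": 0.22, "N": 0.0},
    "I": {"H": 0.56, "L": 0.22, "N": 0.0},
    "A": {"H": 0.56, "L": 0.22, "N": 0.0},
}
# Privileges Required is weighted differently when Scope is changed.
PR_UNCHANGED = {"N": 0.85, "L": 0.62, "H": 0.27}
PR_CHANGED = {"N": 0.85, "L": 0.68, "H": 0.5}


def roundup(value: float) -> float:
    """Round up to one decimal place, as defined in the v3.1 specification."""
    scaled = round(value * 100000)
    if scaled % 10000 == 0:
        return scaled / 100000.0
    return (math.floor(scaled / 10000) + 1) / 10.0


def base_score(vector: str) -> float:
    """Compute the base score for a 'CVSS:3.1/...' vector string."""
    metrics = dict(part.split(":") for part in vector.split("/")[1:])
    changed = metrics["S"] == "C"
    pr = (PR_CHANGED if changed else PR_UNCHANGED)[metrics["PR"]]

    # Impact sub-score.
    iss = 1 - (
        (1 - WEIGHTS["C"][metrics["C"]])
        * (1 - WEIGHTS["I"][metrics["I"]])
        * (1 - WEIGHTS["A"][metrics["A"]])
    )
    if changed:
        impact = 7.52 * (iss - 0.029) - 3.25 * (iss - 0.02) ** 15
    else:
        impact = 6.42 * iss

    # Exploitability sub-score.
    exploitability = (
        8.22
        * WEIGHTS["AV"][metrics["AV"]]
        * WEIGHTS["AC"][metrics["AC"]]
        * pr
        * WEIGHTS["UI"][metrics["UI"]]
    )

    if impact <= 0:
        return 0.0
    if changed:
        return roundup(min(1.08 * (impact + exploitability), 10))
    return roundup(min(impact + exploitability, 10))


print(base_score("CVSS:3.1/AV:N/AC:L/PR:N/UI:R/S:C/C:L/I:L/A:N"))
```

For this vector the impact sub-score is about 2.73 and the exploitability sub-score about 2.84; because Scope is changed, their sum is multiplied by 1.08 before rounding up.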
CISA SSVC (Vulnrichment)
Exploitation: none
Automatable: no
Technical Impact: partial
Details
CWE: CWE-79
Status: published
Products (2)
matthiask/html-sanitizer: < 2.4.2
pypi/html-sanitizer: 0 - 2.4.2 (PyPI)
Published: May 06, 2024
Tracked Since: Feb 18, 2026