Home

Description

lxml_html_clean is a project for HTML cleaning functionalities copied from `lxml.html.clean`. Prior to version 0.4.4, the <base> tag passes through the default Cleaner configuration. While page_structure=True removes html, head, and title tags, there is no specific handling for <base>, allowing an attacker to inject it and hijack relative links on the page. This issue has been patched in version 0.4.4.

PUBLISHED Reserved 2026-02-26 | Published 2026-03-05 | Updated 2026-03-06 | Assigner GitHub_M




MEDIUM: 6.1CVSS:3.1/AV:N/AC:L/PR:N/UI:R/S:C/C:L/I:L/A:N

Problem types

CWE-116: Improper Encoding or Escaping of Output

Product status

< 0.4.4
affected

References

github.com/..._clean/security/advisories/GHSA-xvp8-3mhv-424c exploit

github.com/..._clean/security/advisories/GHSA-xvp8-3mhv-424c

github.com/...ommit/9c5612ca33b941eec4178abf8a5294b103403f34

cve.org (CVE-2026-28350)

nvd.nist.gov (CVE-2026-28350)

Download JSON