Description
lxml_html_clean is a project for HTML cleaning functionalities copied from `lxml.html.clean`. Prior to version 0.4.4, the <base> tag passes through the default Cleaner configuration. While page_structure=True removes html, head, and title tags, there is no specific handling for <base>, allowing an attacker to inject it and hijack relative links on the page. This issue has been patched in version 0.4.4.
Problem types
CWE-116: Improper Encoding or Escaping of Output
Product status
References
github.com/..._clean/security/advisories/GHSA-xvp8-3mhv-424c
github.com/..._clean/security/advisories/GHSA-xvp8-3mhv-424c
github.com/...ommit/9c5612ca33b941eec4178abf8a5294b103403f34