Description
A Regular Expression Denial of Service (ReDoS) vulnerability was discovered in the Hugging Face Transformers library, specifically affecting the MarianTokenizer's `remove_language_code()` method. This vulnerability is present in version 4.52.4 and has been fixed in version 4.53.0. The issue arises from inefficient regex processing, which can be exploited by crafted input strings containing malformed language code patterns, leading to excessive CPU consumption and potential denial of service.
Problem types
CWE-1333 Inefficient Regular Expression Complexity
Product status
References
huntr.com/bounties/6a6c933f-9ce8-4ded-8b3b-2c1444c61f36
github.com/...ommit/47c34fba5c303576560cb29767efb452ff12b8be