Home

Description

The comm utility in uutils coreutils silently corrupts data by performing lossy UTF-8 conversion on all output lines. The implementation uses String::from_utf8_lossy(), which replaces invalid UTF-8 byte sequences with the Unicode replacement character (U+FFFD). This behavior differs from GNU comm, which processes raw bytes and preserves the original input. This results in corrupted output when the utility is used to compare binary files or files using non-UTF-8 legacy encodings.

PUBLISHED Reserved 2026-04-02 | Published 2026-04-22 | Updated 2026-04-22 | Assigner canonical




LOW: 3.3CVSS:3.1/AV:L/AC:L/PR:L/UI:N/S:U/C:N/I:L/A:N

Problem types

CWE-176: Improper Handling of Unicode Encoding

Product status

Default status
unaffected

Any version before 0.6.0
affected

Credits

Zellic finder

References

github.com/uutils/coreutils/issues/10192 exploit

github.com/uutils/coreutils/pull/10206 patch

github.com/uutils/coreutils/issues/10192 issue-tracking

github.com/uutils/coreutils/releases/tag/0.6.0 vendor-advisory

cve.org (CVE-2026-35346)

nvd.nist.gov (CVE-2026-35346)

Download JSON