Description
Retrieval-based-Voice-Conversion-WebUI is a voice changing framework based on VITS. Versions 2.2.231006 and prior are vulnerable to command injection. The variables exp_dir1, np7 and f0method8 take user input and pass it into the extract_f0_feature function, which concatenates them into a command that is run on the server. This can lead to arbitrary command execution. As of time of publication, no known patches exist.
Problem types
CWE-77: Improper Neutralization of Special Elements used in a Command ('Command Injection')
Product status
References
securitylab.github.com/...eval-based-Voice-Conversion-WebUI/
github.com/...7780cf703841ebafb565a4e47d1ea86ff/infer-web.py
github.com/...7780cf703841ebafb565a4e47d1ea86ff/infer-web.py
github.com/...7780cf703841ebafb565a4e47d1ea86ff/infer-web.py
github.com/...7780cf703841ebafb565a4e47d1ea86ff/infer-web.py
github.com/...7780cf703841ebafb565a4e47d1ea86ff/infer-web.py
github.com/...7780cf703841ebafb565a4e47d1ea86ff/infer-web.py