Home

Description

LangChain is a framework for building LLM-powered applications. Prior to 1.1.14, the RecursiveUrlLoader class in @langchain/community is a web crawler that recursively follows links from a starting URL. Its preventOutside option (enabled by default) is intended to restrict crawling to the same site as the base URL. The implementation used String.startsWith() to compare URLs, which does not perform semantic URL validation. An attacker who controls content on a crawled page could include links to domains that share a string prefix with the target, causing the crawler to follow links to attacker-controlled or internal infrastructure. Additionally, the crawler performed no validation against private or reserved IP addresses. A crawled page could include links targeting cloud metadata services, localhost, or RFC 1918 addresses, and the crawler would fetch them without restriction. This vulnerability is fixed in 1.1.14.

PUBLISHED Reserved 2026-02-09 | Published 2026-02-11 | Updated 2026-02-12 | Assigner GitHub_M




MEDIUM: 4.1CVSS:3.1/AV:N/AC:L/PR:L/UI:R/S:C/C:L/I:N/A:N

Problem types

CWE-918: Server-Side Request Forgery (SSRF)

Product status

< 1.1.14
affected

References

github.com/...hainjs/security/advisories/GHSA-gf3v-fwqg-4vh7

github.com/langchain-ai/langchainjs/pull/9990

github.com/...ommit/d5e3db0d01ab321ec70a875805b2f74aefdadf9d

github.com/...ainjs/releases/tag/@langchain/community@1.1.14

cve.org (CVE-2026-26019)

nvd.nist.gov (CVE-2026-26019)

Download JSON