CVE-2021-41125

medium
Published 2021-10-06 · Modified 2023-11-08
CVSS v3
CVSS:3.1/AV:N/AC:L/PR:L/UI:R/S:U/C:H/I:N/A:N
CVSS v2
VIR risk
5.5

Description

Scrapy is a high-level web crawling and scraping framework for Python. If you use `HttpAuthMiddleware` (i.e. the `http_user` and `http_pass` spider attributes) for HTTP authentication, all requests will expose your credentials to the request target. This includes requests generated by Scrapy components, such as `robots.txt` requests sent by Scrapy when the `ROBOTSTXT_OBEY` setting is set to `True`, or as requests reached through redirects. Upgrade to Scrapy 2.5.1 and use the new `http_auth_domain` spider attribute to control which domains are allowed to receive the configured HTTP authentication credentials. If you are using Scrapy 1.8 or a lower version, and upgrading to Scrapy 2.5.1 is not an option, you may upgrade to Scrapy 1.8.1 instead. If you cannot upgrade, set your HTTP authentication credentials on a per-request basis, using for example the `w3lib.http.basic_auth_header` function to convert your credentials into a value that you can assign to the `Authorization` header of your request, instead of defining your credentials globally using `HttpAuthMiddleware`.

Predictions

Exploit likelihood
30%
Patch ETA

Heuristic predictions, AS-IS, for prioritization only.

Mitigations

vendor Authored 2026-05-27

Vendor advisory: debian — https://security-tracker.debian.org/tracker/CVE-2021-41125

OS impact

OSVersionStatusFixed in
arch archfixed2.5.1-1
debian debianbookwormfixed2.5.1-1
debian debianbullseyefixed2.4.1-2+deb11u1
debian debianforkyfixed2.5.1-1
debian debiansidfixed2.5.1-1
debian debiantrixiefixed2.5.1-1

Package impact

EcosystemPackageVulnerableFixed
python PyPIscrapy<1.8.11.8.1
python PyPIscrapy>=2.0.0,<2.5.12.5.1
python PyPIscrapy<b01d69a1bf48060daec8f751368622352d8b85a6||>=2.0.0,<2.5.1b01d69a1bf48060daec8f751368622352d8b85a6

References

Verify integrity in audit chain (admin only). AS-IS.