Commit e1b60c7

improve apify proxy middleware
1 parent a379eb4 commit e1b60c7

1 file changed, +5 -0 lines changed

src/apify/scrapy/middlewares/apify_proxy.py

Lines changed: 5 additions & 0 deletions
@@ -78,6 +78,11 @@ async def process_request(self, request: Request, spider: Spider) -> None:
         Raises:
             ValueError: If username and password are not provided in the proxy URL.
         """
+        # Do not use proxy for robots.txt, as it causes 403 Forbidden.
+        if request.url.endswith('/robots.txt'):
+            request.meta.pop('proxy', None)
+            return
+
         Actor.log.debug(f'ApifyHttpProxyMiddleware.process_request: request={request}, spider={spider}')
         url = await self._get_new_proxy_url()
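
The added check can be tried in isolation. The following is a minimal sketch, not the middleware itself: `is_robots_txt` and `strip_proxy_for_robots` are hypothetical helper names, and a plain `dict` stands in for `request.meta`. It also assumes that matching on the parsed URL path is acceptable, which differs slightly from the commit's `request.url.endswith('/robots.txt')`: the path-based check still matches when the URL carries a query string or fragment.

```python
from urllib.parse import urlparse


def is_robots_txt(url: str) -> bool:
    """Return True when the URL path points at a robots.txt file (hypothetical helper)."""
    # Compare the parsed path so a query string or fragment does not hide the match.
    return urlparse(url).path.endswith('/robots.txt')


def strip_proxy_for_robots(url: str, meta: dict) -> bool:
    """Drop the 'proxy' key for robots.txt URLs and report whether the request was skipped."""
    if is_robots_txt(url):
        # Same effect as request.meta.pop('proxy', None) in the commit.
        meta.pop('proxy', None)
        return True
    return False


if __name__ == '__main__':
    meta = {'proxy': 'http://proxy.example:8000'}
    skipped = strip_proxy_for_robots('https://example.com/robots.txt', meta)
    print(skipped, meta)
```

Whether the stricter `endswith` on the full URL or the path-based variant is preferable depends on how robots.txt requests are generated; the commit's version is sufficient when those URLs never carry a query string.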
