Skip to content

Commit 6a3e0e7

Browse files
authored
fix: Deduplicate requests by unique key before submitting them to the queue (#499)
This should fix the issue mentioned in apify/apify-sdk-python#261 (comment)
1 parent 164fff9 commit 6a3e0e7

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

src/crawlee/storages/_request_provider.py

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -90,10 +90,10 @@ def _transform_request(self, request: str | BaseRequestData | Request) -> Reques
9090

9191
def _transform_requests(self, requests: Sequence[str | BaseRequestData | Request]) -> list[Request]:
9292
"""Transforms a list of request-like objects into a list of Request objects."""
93-
processed_requests: list[Request] = []
93+
processed_requests = dict[str, Request]()
9494

9595
for request in requests:
9696
processed_request = self._transform_request(request)
97-
processed_requests.append(processed_request)
97+
processed_requests.setdefault(processed_request.unique_key, processed_request)
9898

99-
return processed_requests
99+
return list(processed_requests.values())

0 commit comments

Comments
 (0)