-
Notifications
You must be signed in to change notification settings - Fork 232
Open
Description
Hello @mvdbos
I haven't found time to look into the robots.txt filter discussed in the other issue. Sorry! I stumbled on a new question you might be able to shine some light on:
I'm trying to filter out URLs that have been redirected externally. I'm keen to implement a PostFetchFilter to keep it all within the spider. I was wondering if it possible to get the final URL (after redirects) in a PostFetchFilter? It seems like only the original URL is part of the Resource.
Appreciate any ideas on how you would approach this.
Cheers,
Peter
Metadata
Metadata
Assignees
Labels
No labels