Replies: 1 comment 1 reply
-
|
Hello, One way to do this is to use the This is very simple example, the pseudo URL can be made more precise. For more lively discussion, I recommend using our Discord community server. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I am using
Apify.utils.enqueueLinksand I'd like to prevent URLs that are not in the root domain so I don't crawl anything other than the website itself.For example if I
rushi.comwith several pages, and one of those pages links toblog.github.comI don't want to recurse into that URL because github.com != rushi.comWhat I'm looking for is Apify's equivalent of wget's
--no-parentoption. I also want to ensure relative URLs are picked up sobaseUrl: request.loadedUrlneeds to be presentThis is the code I have so far direct from the examples
Beta Was this translation helpful? Give feedback.
All reactions