- An [n8n instance](https://docs.n8n.io/getting-started/) (self‑hosted or cloud)
## Apify Scraper for AI Crawling
Apify Scraper for AI Crawling from [Apify](https://apify.com/) lets you extract text content from websites to feed AI models, LLM applications, vector databases, or Retrieval-Augmented Generation (RAG) pipelines. It supports rich formatting using Markdown, cleans the HTML of irrelevant elements, downloads linked files, and integrates with LLM frameworks such as LangChain and LlamaIndex.

To use this node, you need an [Apify account](https://console.apify.com) and an [API token](https://docs.apify.com/platform/integrations/api#api-token). You can find your token in the [Apify Console](https://console.apify.com/) under **Settings > API & Integrations**. After connecting, you can automate content extraction at scale and incorporate the results into your AI workflows.
## Install the Apify Node (self-hosted)
If you're running a self-hosted n8n instance, you can install the Apify community node directly from the editor. This process adds the node to your available tools, enabling Apify operations in workflows.
1. Open your n8n instance.
1. Go to **Settings > Community Nodes**.
1. Select **Install**.
1. Enter the npm package name `@apify/n8n-nodes-apify-content-crawler` to install the latest version. To install a specific [version](https://www.npmjs.com/package/@apify/n8n-nodes-apify-content-crawler?activeTab=versions), append it to the package name, e.g., `@apify/n8n-nodes-apify-content-crawler@<version>`.
1. Agree to the [risks](https://docs.n8n.io/integrations/community-nodes/risks/) of using community nodes and select **Install**.
1. You can now use the node in your workflows.

## Connect Apify Scraper for AI Crawling (self-hosted)
1. Create an account at [Apify](https://console.apify.com/). You can sign up using your email, Gmail, or GitHub account.

1. To connect your Apify account to n8n, you can use an OAuth connection (recommended) or an Apify API token. To get the Apify API token, navigate to **[Settings > API & Integrations](https://console.apify.com/settings/integrations)** in the Apify Console.

1. Find your token in the **Personal API tokens** section. You can also create a new API token with customizable permissions by clicking **+ Create a new token**.
1. Click the **Copy** icon next to your API token to copy it to your clipboard. Then, return to your n8n workflow interface.

1. In n8n, click **Create new credential** in the Apify Scraper node you chose.
1. In the **API key** field, paste the API token you copied from Apify and click **Save**.

Once connected, you can build workflows to automate website extraction and integrate results into your AI applications.
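
If the credential fails to save, it can help to check the token outside n8n first. The following is a minimal Python sketch, not part of the n8n node: it assumes the public Apify API endpoint `https://api.apify.com/v2/users/me`, Bearer-token authentication, and a response of the form `{"data": {"username": ...}}`. The helper names `build_token_check_request` and `check_token` are hypothetical.

```python
import json
import urllib.request

# Assumed endpoint of the public Apify API; it is not named in this guide.
APIFY_ME_URL = "https://api.apify.com/v2/users/me"


def build_token_check_request(token: str) -> urllib.request.Request:
    """Build (but do not send) a GET request authenticated with the token."""
    if not token or not token.strip():
        raise ValueError("API token must be a non-empty string")
    return urllib.request.Request(
        APIFY_ME_URL,
        # Strip stray whitespace that often sneaks in when pasting from the clipboard.
        headers={"Authorization": f"Bearer {token.strip()}"},
        method="GET",
    )


def check_token(token: str) -> str:
    """Send the request and return the account username (needs network access).

    Assumes the response body looks like {"data": {"username": "..."}}.
    An invalid token is expected to surface as urllib.error.HTTPError.
    """
    request = build_token_check_request(token)
    with urllib.request.urlopen(request, timeout=10) as response:
        payload = json.load(response)
    return payload["data"]["username"]
```

Calling `check_token` with the token copied from the Apify Console should return your username if the token is valid. Building the request in a separate function keeps the header logic testable without network access.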