Skip to content

Commit 20a90e6

Browse files
protoss70TC-MO
andauthored
fix: Update sources/platform/integrations/workflows-and-notifications/n8n/website-content-crawler.md
Co-authored-by: Michał Olender <[email protected]>
1 parent dc1ff9c commit 20a90e6

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

sources/platform/integrations/workflows-and-notifications/n8n/website-content-crawler.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -101,7 +101,7 @@ This module provides complete control over the content extraction process, allow
101101

102102
### How it works
103103

104-
The Advanced Settings module provides granular control over the entire crawling process. For _Crawler selection_, you can choose from Playwright (Firefox/Chrome) or Cheerio, depending on the complexity of the target website. _URL management_ allows you to define the crawling scope with include and exclude URL patterns. You can also exercise precise _DOM manipulation_ by controlling which HTML elements to keep or remove. To ensure the best results, you can apply specialized algorithms for _Content transformation_ and select from various _Output formatting_ options for better AI model compatibility.
104+
The **Advanced Settings** module provides granular control over the entire crawling process. For _Crawler selection_, you can choose from Playwright (Firefox/Chrome) or Cheerio, depending on the complexity of the target website. _URL management_ allows you to define the crawling scope with include and exclude URL patterns. You can also exercise precise _DOM manipulation_ by controlling which HTML elements to keep or remove. To ensure the best results, you can apply specialized algorithms for _Content transformation_ and select from various _Output formatting_ options for better AI model compatibility.
105105

106106
### Output data
107107

0 commit comments

Comments
 (0)