@chawboiii chawboiii commented Jul 5, 2025

No description provided.

- Confirmed Cheerio is unavailable in n8n Cloud Code Nodes.
- Script will use regex fallback when deployed to n8n Cloud.
- Improved the image-extraction regex to capture src and alt attributes more reliably; each image is returned as {src: url} only, as requested.
- Maintained regex for title and main text extraction with minor cleanup.
- Added extensive comments warning that regex is unreliable for HTML parsing, and strongly recommending n8n's native HTML node for robust scraping in n8n Cloud.
- Created the 'article_scraper.js' file with the modified code.
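
For readers without the diff open, the regex fallback described above might look roughly like the sketch below. It is not the contents of 'article_scraper.js': the function names (loadCheerio, getAttr, extractImages, extractTitle, extractMainText) and the choice of an article element as the main-text container are assumptions made for illustration. As the notes above warn, regex matching of HTML is fragile, so treat this as a best-effort approach.

```javascript
// Illustrative sketch only; all helper names here are hypothetical.

// Try to load Cheerio; return null where it is unavailable
// (the PR reports this is the case in n8n Cloud Code nodes).
function loadCheerio() {
  try {
    return require("cheerio");
  } catch (err) {
    return null;
  }
}

// Read one attribute from a single tag string. Handles double-quoted,
// single-quoted, and unquoted values; returns null if the attribute is absent.
function getAttr(tag, name) {
  const pattern = new RegExp(
    `\\s${name}\\s*=\\s*(?:"([^"]*)"|'([^']*)'|([^\\s"'>]+))`,
    "i"
  );
  const m = tag.match(pattern);
  if (!m) return null;
  return m[1] ?? m[2] ?? m[3];
}

// Collect every <img> tag that has a src and return only { src } objects.
function extractImages(html) {
  const images = [];
  for (const [tag] of html.matchAll(/<img\b[^>]*>/gi)) {
    const src = getAttr(tag, "src");
    if (src) images.push({ src });
  }
  return images;
}

// Return the <title> text with whitespace collapsed, or null if missing.
function extractTitle(html) {
  const m = html.match(/<title[^>]*>([\s\S]*?)<\/title>/i);
  return m ? m[1].replace(/\s+/g, " ").trim() : null;
}

// Assumption: the main text lives in the first <article>; otherwise the
// whole document is used. Scripts and styles are dropped, then all tags.
function extractMainText(html) {
  const m = html.match(/<article\b[^>]*>([\s\S]*?)<\/article>/i);
  const body = m ? m[1] : html;
  return body
    .replace(/<(script|style)\b[^>]*>[\s\S]*?<\/\1>/gi, " ")
    .replace(/<[^>]+>/g, " ")
    .replace(/\s+/g, " ")
    .trim();
}

// Use Cheerio for images when it loads; otherwise fall back to regex.
function scrapeImages(html) {
  const cheerio = loadCheerio();
  if (!cheerio) return extractImages(html);
  const $ = cheerio.load(html);
  return $("img")
    .map((i, el) => ({ src: $(el).attr("src") }))
    .get()
    .filter((img) => img.src);
}
```

Matching the whole img tag first and then reading attributes from it keeps the attribute order irrelevant, and requiring whitespace before the attribute name avoids picking up data-src in place of src. Cases such as a literal > inside an attribute value will still break it, which is why the native HTML node remains the recommended route in n8n Cloud.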