Extract data from websites using Claude's web_fetch and web_search tools. Convert any website to markdown or structured JSON.
Features:
- Extract content from any website URL
- Search the web and extract data from results
- Output as clean markdown or structured JSON
- Use natural language to specify what data to extract
Demo: https://claude-web-scraper.vercel.app
Requirements:
- Node.js 18+
- Anthropic API key
Install:
git clone https://github.com/kadoa-org/claude-web-scraper.git
cd claude-web-scraper
npm install
Configure:
Create .env.local:
ANTHROPIC_API_KEY=your_api_key_here
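A minimal sketch of how server-side code might check for the key before calling the API. The helper name `requireApiKey` is hypothetical and not taken from the repository; it only assumes the variable name shown above.

```typescript
// Hypothetical helper, not part of the repository: fail fast when the key is missing.
type Env = Record<string, string | undefined>;

function requireApiKey(env: Env): string {
  const key = env.ANTHROPIC_API_KEY?.trim();
  // Treat the placeholder value from the example above as "not configured".
  if (!key || key === "your_api_key_here") {
    throw new Error("ANTHROPIC_API_KEY is not set. Add it to .env.local.");
  }
  return key;
}
```

In a Next.js route handler this could be called as `requireApiKey(process.env)`, since Next.js loads `.env.local` into `process.env` on the server.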
Run:
npm run dev
Input options:
- Website URL: Extract from a specific page
- Web search: Search first, then extract from results
Output formats:
- Markdown: Clean formatted text
- JSON: Structured data with custom prompts
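The two output formats could be handled roughly as follows. This is a sketch under assumptions: `OutputFormat` and `parseOutput` are hypothetical names, and the fence-stripping step assumes the model may wrap JSON in a fenced block, which the repository may handle differently.

```typescript
// Hypothetical types and helper, not taken from the repository.
type OutputFormat = "markdown" | "json";

// Three backticks, built at runtime so this example stays readable inside a fenced block.
const FENCE = "`".repeat(3);

// Normalize the model's text reply for the chosen output format.
function parseOutput(text: string, format: OutputFormat): unknown {
  let body = text.trim();
  if (format === "markdown") return body;

  // If the reply is wrapped in a fenced block, drop the opening line and closing fence.
  if (body.startsWith(FENCE)) {
    body = body.slice(body.indexOf("\n") + 1);
    const end = body.lastIndexOf(FENCE);
    if (end !== -1) body = body.slice(0, end);
  }
  // Throws SyntaxError if the reply is not valid JSON.
  return JSON.parse(body);
}
```

Returning `unknown` for JSON forces callers to validate the shape before using it, which is a reasonable default when the structure depends on a free-form prompt.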
Examples:
Extract product data:
URL: https://example.com/products
Prompt: "Get product names, prices, and availability"
Search and extract:
Search: "quantum computing news 2024"
Prompt: "Extract titles, dates, and key points"
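The two examples above might translate into Claude Messages API request bodies along these lines. This is a sketch, not the repository's code: `buildRequest` and the interfaces are hypothetical, the model name is a placeholder, and the versioned tool type strings are assumptions that should be checked against the current Anthropic documentation (the web fetch tool may also require a beta header).

```typescript
// Hypothetical request shapes; field names follow the public Messages API as an assumption.
interface ToolSpec { type: string; name: string; max_uses?: number }

interface MessagesRequest {
  model: string;
  max_tokens: number;
  tools: ToolSpec[];
  messages: { role: "user"; content: string }[];
}

// Assumed tool versions; verify these strings before use.
const WEB_FETCH: ToolSpec = { type: "web_fetch_20250910", name: "web_fetch", max_uses: 5 };
const WEB_SEARCH: ToolSpec = { type: "web_search_20250305", name: "web_search", max_uses: 3 };

function buildRequest(input: { url?: string; search?: string; prompt: string }): MessagesRequest {
  if (!input.url && !input.search) throw new Error("Provide a url or a search query");
  // URL mode only needs fetch; search mode searches first, then fetches results.
  const tools = input.url ? [WEB_FETCH] : [WEB_SEARCH, WEB_FETCH];
  const task = input.url
    ? `Fetch ${input.url}.`
    : `Search the web for "${input.search}" and fetch the most relevant results.`;
  return {
    model: "claude-sonnet-4-5", // placeholder model name
    max_tokens: 4096,
    tools,
    messages: [{ role: "user", content: `${task} ${input.prompt}. Respond with JSON only.` }],
  };
}

// The two examples from this README.
const productRequest = buildRequest({
  url: "https://example.com/products",
  prompt: "Get product names, prices, and availability",
});
const newsRequest = buildRequest({
  search: "quantum computing news 2024",
  prompt: "Extract titles, dates, and key points",
});
```

Either object would then be sent as the JSON body of a POST to the Messages endpoint with the API key in the request headers.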
Tech stack:
- Next.js 15, TypeScript
- Tailwind CSS, shadcn/ui
- Claude API (web_fetch, web_search)
Scripts:
npm run dev    # Development server
npm run build  # Production build
npm run lint   # Linting
MIT License - see LICENSE
Made with ❤️ by Kadoa