-
Notifications
You must be signed in to change notification settings - Fork 0
feat: actorify web scrapping ai agent - hard #5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
- by running apify-cli command: `apify init` - update .gitignore
- also run scraper in a separate thread - SmartScraperGraph uses asyncio.run internally so it can't be ran within other asyncio.run
Inspired by our Apify templates e.g.: - https://github.com/apify/actor-templates/tree/master/templates/python-playwright - https://github.com/apify/actor-templates/tree/master/templates/python-start Features: - build `minify-html` with Rust - optimize Dockerfile by adding multi-stage builds and virtual environment - to reduce image size
To try it, run Other PR: #4 |
6f6f751
to
8abdc49
Compare
Actorification – Web Scraping AI Agent as an Apify Actor 🚀Hey Shubham, We love the improvements in this Web Scraping AI Agent! 🚀 This PR fully optimizes the app for deployment on Apify, enabling seamless scaling, automation, and concurrency handling—all while keeping the local version intact. Apify’s Actor model makes it easy to run web scraping tasks in a serverless, managed environment. (Learn more in the Actor Whitepaper). 🔥 What’s in this PR?This PR enhances the Web Scraping AI Agent by: ✅ Refactoring the app to remove Streamlit and focus purely on backend scraping logic. 🚀 How to Deploy on Apify?This project is already structured as an Apify Actor, making deployment incredibly simple:
📖 Learn more about Actor Development in the Apify Docs. 🏗️ Key Enhancements
🎨 How It Looks After DeploymentHere's how the Apify Actor Console will look once the Web Scraping AI Agent is deployed and running: 1️⃣ Actor Readme in Apify2️⃣ Input Configuration for Web Scraping3️⃣ Logs & Results After Execution🔗 Related PR: Alternative version of this PR This PR makes it easier than ever to deploy, run, and scale this AI-powered web scraping tool on Apify. Looking forward to your thoughts! 🚀 |
@tomasjindra |
Polished PR
apify init
initialize Apify:.actor/actor.json