Commit 7a3212d

feat: continue with intro
1 parent f2c09ac commit 7a3212d

3 files changed: +20 -7 lines changed

.github/styles/config/vocabularies/Docs/accept.txt

Lines changed: 2 additions & 1 deletion
@@ -87,19 +87,20 @@ preconfigured
 [Mm]ultiselect
 
 asyncio
-Langflow
 backlinks?
 captchas?
 Chatbot
 combinator
 deduplicating
+dev
 Fakestore
 Fandom('s)?
 IMDb
 influencers
 iPads?
 iPhones?
 jQuery
+Langflow
 learnings
 livestreams
 outro

sources/academy/webscraping/scraping_basics_python/13_platform.md

Lines changed: 18 additions & 0 deletions
@@ -19,6 +19,24 @@ Before starting with a scraping platform, let's highlight a few caveats in our c
 - **Manual data management:** To track prices over time, we'd have to figure out how to organize the exported data. If we wanted to process the data, we might discover that different data analysis tools require specific formats.
 - **Prone to anti-scraping:** If the target website detects we're scraping their data, they can rate-limit or even block us. We could take a laptop to a nearby coffee shop and run the program while connected to their public Wi-Fi, but eventually they'd probably block that one too, and we'd risk seriously annoying the barista.
 
+In this lesson, we'll use a platform to help us with all of the above. Generic cloud platforms, such as [GitHub Actions](https://github.com/features/actions), can provide sufficient tools for simple scenarios. Platforms dedicated to scraping, such as [Apify](https://apify.com/), offer additional features, such as monitoring scrapers, managing the retrieved data, and overcoming anti-scraping measures.
+
+:::info Why Apify
+
+Many scraping platforms exist, each offering a different range of products and approaches. As the course authors, we're obviously biased to think that Apify is the best and most complete.
+
+The main goal of this lesson is to **show how deploying to any platform can be useful**, which isn't specific to Apify. Also, rest assured that everything here fits within our [free tier](https://apify.com/pricing).
+
+:::
+
+## Registering
+
+##
+
+---
+
+<Exercises />
+
 :::danger Work in progress
 
 This course is incomplete. As we work on adding new lessons, we would love to hear your feedback. You can comment right here under each page or [file a GitHub Issue](https://github.com/apify/apify-docs/issues) to discuss a problem.

sources/academy/webscraping/scraping_basics_python/index.md

Lines changed: 0 additions & 6 deletions
@@ -12,12 +12,6 @@ import DocCardList from '@theme/DocCardList';
 
 ---
 
-:::danger Work in progress
-
-This course is incomplete. As we work on adding new lessons, we would love to hear your feedback. Comment right here under each page or [file a GitHub Issue](https://github.com/apify/apify-docs/issues) to discuss a problem.
-
-:::
-
 In this course we'll use Python to create an application for watching prices. It'll be able to scrape all product pages of an e-commerce website and record prices. Data from several runs of such a program would be useful for seeing trends in price changes, detecting discounts, etc.
 
 <!--
