Skip to content

Commit 8d094b9

Browse files
committed
academy: migrate some requirements from expert scraping course intro to beginners course intro
1 parent d28068b commit 8d094b9

File tree

4 files changed

+38
-25
lines changed

4 files changed

+38
-25
lines changed

content/academy/expert_scraping_with_apify.md

Lines changed: 2 additions & 24 deletions
Original file line numberDiff line numberDiff line change
@@ -17,38 +17,16 @@ Before developing a pro-level Apify scraper, there are some important things you
1717

1818
> If you've already gone through the [Web scraping for beginners course]({{@link web_scraping_for_beginners.md}}) and the first lessons of the [Apify platform course]({{@link apify_platform.md}}), you will be more than well equipped to continue on with the lessons in this course.
1919
20-
### [](#javascript-and-node) JavaScript + Node.js
20+
<!-- ### [](#puppeteer-playwright) Puppeteer/Playwright
2121
22-
It is recommended to understand at least the fundamentals of JavaScript and be proficient with Node.js prior to developing an actor on the Apify platform. If you are not yet comfortable with asynchronous programming (with promises and `async...await`), loops (and the different types of loops in JavaScript), modularity, or working with external packages, we would recommend studying the following resources before coming back and continuing this section:
23-
24-
- [`async...await` (YouTube)](https://www.youtube.com/watch?v=vn3tm0quoqE&ab_channel=Fireship)
25-
- [JavaScript loops (MDN)](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Guide/Loops_and_iteration)
26-
- [Modularity in Node.js](https://www.section.io/engineering-education/how-to-use-modular-patterns-in-nodejs/)
27-
28-
### [](#general-web-development) General web development
29-
30-
Throughout the next lessons, we will sometimes use certain technologies and terms related to the web without explaining them. This is because the knowledge of them will be **assumed** (unless we're showing something out of the ordinary).
31-
32-
- [HTML](https://developer.mozilla.org/en-US/docs/Web/HTML)
33-
- [HTTP protocol](https://developer.mozilla.org/en-US/docs/Web/HTTP)
34-
- [DevTools]({{@link web_scraping_for_beginners/data_collection/browser_devtools.md}})
35-
36-
### [](#jquery-or-cheerio) jQuery or Cheerio
37-
38-
We'll be using the [`cheerio`](https://www.npmjs.com/package/cheerio) package a whole lot to parse data from HTML. This package provides a simple API using jQuery syntax to help traverse downloaded HTML within Node.js.
39-
40-
### [](#puppeteer-playwright) Puppeteer/Playwright
41-
42-
[Puppeteer](https://pptr.dev/) is a library for running and controlling a [headless browser]({{@link web_scraping_for_beginners/crawling/headless_browser.md}}) in Node.js, and was developed at Google. The team working on it was hired by Microsoft to work on the [Playwright](https://playwright.dev/) project; therefore, many parallels can be seen between both the `puppeteer` and `playwright` packages. Proficiency in at least one of these will be good enough.
22+
[Puppeteer](https://pptr.dev/) is a library for running and controlling a [headless browser]({{@link web_scraping_for_beginners/crawling/headless_browser.md}}) in Node.js, and was developed at Google. The team working on it was hired by Microsoft to work on the [Playwright](https://playwright.dev/) project; therefore, many parallels can be seen between both the `puppeteer` and `playwright` packages. Proficiency in at least one of these will be good enough. -->
4323

4424
### [](#crawlee-apify-sdk-and-cli) Crawlee, Apify SDK, and the Apify CLI
4525

4626
If you're feeling ambitious, you don't need to have any prior experience with Crawlee to get started with this course; however, at least 5-10 minutes of exposure is recommended. If you haven't yet tried out Crawlee, you can refer to [this lesson]({{@link web_scraping_for_beginners/crawling/pro_scraping.md}}) in the **Web scraping for beginners** course (and ideally follow along). To familiarize with the Apify SDK,you can refer to the [Apify Platform]({{@link apify_platform.md}}) course.
4727

4828
The Apify CLI will play a core role in the running and testing of the actor you will build, so if you haven't gotten it installed already, please refer to [this short lesson]({{@link tools/apify_cli.md}}).
4929

50-
<!-- todo: remove all requirements up to this point -->
51-
5230
### [](#git) Git
5331

5432
In one of the later lessons, we'll be learning how to integrate our actor on the Apify platform with a Github repository. For this, you'll need to understand at least the basics of [Git](https://git-scm.com/docs). Here's a [great tutorial](https://product.hubspot.com/blog/git-and-github-tutorial-for-beginners) to help you get started with Git.

content/academy/web_scraping_for_beginners.md

Lines changed: 22 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -42,6 +42,28 @@ You don't need to be a developer or a software engineer to complete this course,
4242
4343
As you progress to the more advanced courses, the coding will get more challenging, but will still be manageable to a person with an intermediate level of programming skills.
4444

45+
Ideally, you should have at least a moderate understanding of the following concepts:
46+
47+
### [](#javascript-and-node) JavaScript + Node.js
48+
49+
It is recommended to understand at least the fundamentals of JavaScript and be proficient with Node.js prior to developing an actor on the Apify platform. If you are not yet comfortable with asynchronous programming (with promises and `async...await`), loops (and the different types of loops in JavaScript), modularity, or working with external packages, we would recommend studying the following resources before coming back and continuing this section:
50+
51+
* [`async...await` (YouTube)](https://www.youtube.com/watch?v=vn3tm0quoqE&ab_channel=Fireship)
52+
* [JavaScript loops (MDN)](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Guide/Loops_and_iteration)
53+
* [Modularity in Node.js](https://www.section.io/engineering-education/how-to-use-modular-patterns-in-nodejs/)
54+
55+
### [](#general-web-development) General web development
56+
57+
Throughout the next lessons, we will sometimes use certain technologies and terms related to the web without explaining them. This is because the knowledge of them will be **assumed** (unless we're showing something out of the ordinary).
58+
59+
* [HTML](https://developer.mozilla.org/en-US/docs/Web/HTML)
60+
* [HTTP protocol](https://developer.mozilla.org/en-US/docs/Web/HTTP)
61+
* [DevTools]({{@link web_scraping_for_beginners/data_collection/browser_devtools.md}})
62+
63+
### [](#jquery-or-cheerio) jQuery or Cheerio
64+
65+
We'll be using the [`cheerio`](https://www.npmjs.com/package/cheerio) package a whole lot to parse data from HTML. This package provides a simple API using jQuery syntax to help traverse downloaded HTML within Node.js.
66+
4567
## [](#next) Next up
4668

4769
The course begins with a small bit of theory and moves into some realistic and practical examples of collecting data from the most popular websites on the internet using your browser console. So [let's get to it!]({{@link web_scraping_for_beginners/introduction.md}})

content/academy/web_scraping_for_beginners/best_practices.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
22
title: Best practices
33
description: Understand the standards and best practices that we here at Apify abide by to write readable, scalable, and maintainable code.
4-
menuWeight: 1.4
4+
menuWeight: 1.5
55
paths:
66
- web-scraping-for-beginners/best-practices
77
---
Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,13 @@
1+
---
2+
title: Challenge
3+
description: Before moving on with the rest of the
4+
menuWeight: 1.4
5+
paths:
6+
- web-scraping-for-beginners/best-practices
7+
---
8+
9+
# Challenge
10+
11+
Before moving onto the other courses in the academy, we recommend following along with this section, as it combines everything you've learned in the previous lessons into one cohesive project that helps you prove to yourself that you've thoroughly understood the material.
12+
13+
<!-- blah -->

0 commit comments

Comments
 (0)