Skip to content

Commit f1be8fd

Browse files
authored
feat: add new typos PR check (#1141)
add PR check for new typos action add _typos.toml to configure ignores for the new action
1 parent 1730e55 commit f1be8fd

File tree

14 files changed

+51
-25
lines changed

14 files changed

+51
-25
lines changed

.github/workflows/typos-check.yaml

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,18 @@
1+
name: Typos Check
2+
3+
on:
4+
pull_request:
5+
branches: [master]
6+
7+
jobs:
8+
run:
9+
name: Spell Check with Typos
10+
runs-on: ubuntu-latest
11+
steps:
12+
- name: Checkout code
13+
uses: actions/checkout@v4
14+
15+
- name: Check spelling
16+
uses: crate-ci/typos@master
17+
with:
18+
files: ./sources

_typos.toml

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,8 @@
1+
[default]
2+
extend-ignore-re = [
3+
'`[^`\n]+`',
4+
'```[\s\S]*?```',
5+
]
6+
7+
[default.extend-words]
8+
SER = "SER"

sources/academy/glossary/tools/switchyomega.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ slug: /tools/switchyomega
1313

1414
SwitchyOmega is a Chrome extension for managing and switching between proxies which can be added in the [Chrome Webstore](https://chrome.google.com/webstore/detail/padekgcemlokbadohgkifijomclgjgif).
1515

16-
After adding it to Chrome, you can see the SwitchyOmega icon somewhere amongst all your other Chrome extension icons. Clicking on it will display a menu, where you can select various differnt connection profiles, as well as open the extension's options.
16+
After adding it to Chrome, you can see the SwitchyOmega icon somewhere amongst all your other Chrome extension icons. Clicking on it will display a menu, where you can select various different connection profiles, as well as open the extension's options.
1717

1818
![The SwitchyOmega interface](./images/switchyomega.png)
1919

sources/academy/tutorials/node_js/caching_responses_in_puppeteer.js

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -29,7 +29,7 @@ const crawler = new PuppeteerCrawler({
2929
try {
3030
buffer = await response.buffer();
3131
} catch (error) {
32-
// some responses do not contain buffer and do not need to be catched
32+
// some responses do not contain buffer and do not need to be cached
3333
return;
3434
}
3535

sources/academy/webscraping/anti_scraping/index.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -66,7 +66,7 @@ Anti-scraping protections can work on many different layers and use a large amou
6666

6767
1. **Where you are coming from** - The IP address of the incoming traffic is always available to the website. Proxies are used to emulate a different IP addresses but their quality matters a lot.
6868
2. **How you look** - With each request, the website can analyze its HTTP headers, TLS version, cyphers, and other information. Moreover, if you use a browser, the website can also analyze the whole browser fingerprint and run challenges to classify your hardware (like graphics hardware acceleration).
69-
3. **What you are scraping** - The same data can be extracted in many ways from a website. You can just get the inital HTML or you can use a browser to render the full page or you can reverse engineer internal APIs. Each of those endpoints can be protected differently.
69+
3. **What you are scraping** - The same data can be extracted in many ways from a website. You can just get the initial HTML or you can use a browser to render the full page or you can reverse engineer internal APIs. Each of those endpoints can be protected differently.
7070
4. **How you behave** - The website can see patterns in how you are ordering your requests, how fast you are scraping, etc. It can also analyze browser behavior like mouse movement, clicks or key presses.
7171

7272
These are the 4 main principles that anti-scraping protections are based on.

sources/academy/webscraping/anti_scraping/mitigation/cloudflare_challenge.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
---
2-
title: Bypasing Cloudflare browser check
2+
title: Bypassing Cloudflare browser check
33
description: Learn how to bypass Cloudflare browser challenge with Crawlee.
44
sidebar_position: 3
55
slug: /anti-scraping/mitigation/cloudflare-challenge.md

sources/platform/actors/development/actor_definition/docker.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ sidebar_position: 4
99

1010
---
1111

12-
When developing an [Actor](/sources/platform/actors/index.mdx) on the Apify platform, you can choose from a variety of pre-built Docker iamges to serve as the base for your Actor. These base images come with pre-installed dependencies and tools, making it easier to set up your development envrionment and ensuring consistent behavior across different environments.
12+
When developing an [Actor](/sources/platform/actors/index.mdx) on the Apify platform, you can choose from a variety of pre-built Docker images to serve as the base for your Actor. These base images come with pre-installed dependencies and tools, making it easier to set up your development environment and ensuring consistent behavior across different environments.
1313

1414
## Base Docker images
1515

@@ -105,7 +105,7 @@ By default, Apify base Docker images with the Apify SDK and Crawlee start your N
105105
}
106106
```
107107

108-
This means the system expects the source code to be in `main.js` by default. If you want to override this behavior, ues a custom `package.json` and/or `Dockerfile`.
108+
This means the system expects the source code to be in `main.js` by default. If you want to override this behavior, use a custom `package.json` and/or `Dockerfile`.
109109

110110
:::tip Optimization tips
111111

sources/platform/actors/development/actor_definition/output_schema.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -111,7 +111,7 @@ To set up the Actor's output tab UI using a single configuration file, use the f
111111
}
112112
```
113113

114-
The template above defines the configuration for the default dataset output view. Under the `views` property, there is one view titled _Overview_. The view configuartion consists of two main steps:
114+
The template above defines the configuration for the default dataset output view. Under the `views` property, there is one view titled _Overview_. The view configuration consists of two main steps:
115115

116116
1. `transformation` - set up how to fetch the data.
117117
2. `display` - set up how to visually present the fetched data.
@@ -124,7 +124,7 @@ The default behavior of the Output tab UI table is to display all fields from `t
124124

125125
Output configuration files need to be located in the `.actor` folder within the Actor's root directory.
126126

127-
You have two choices of how to organize files withing the `.actor` folder.
127+
You have two choices of how to organize files within the `.actor` folder.
128128

129129
### Single configuration file
130130

sources/platform/actors/development/programming_interface/actor_standby.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -70,7 +70,7 @@ async def main() -> None:
7070
</Tabs>
7171

7272
Please make sure to describe your Actors, their endpoints, and the schema for their
73-
inputs and ouputs in your README.
73+
inputs and outputs in your README.
7474

7575
## Can I monetize my Actor in the Standby mode
7676

sources/platform/api_v2/api_v2_reference.apib

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -989,7 +989,7 @@ received in the response JSON to the [Get items](#reference/datasets/item-collec
989989
otherwise it will have a transitional status (e.g. `RUNNING`).
990990
+ webhooks: `dGhpcyBpcyBqdXN0IGV4YW1wbGUK...` (string, optional) - Specifies optional webhooks associated with the actor run, which can be used to receive a notification
991991
e.g. when the actor finished or failed. The value is a Base64-encoded JSON array of objects defining the webhooks. For more information, see
992-
[Webhooks documenation](https://docs.apify.com/platform/integrations/webhooks).
992+
[Webhooks documentation](https://docs.apify.com/platform/integrations/webhooks).
993993

994994
+ Request
995995

@@ -1023,7 +1023,7 @@ received in the response JSON to the [Get items](#reference/datasets/item-collec
10231023
+ build: `0.1.234` (string, optional) - Specifies the actor build to run. It can be either a build tag or build number. By default, the run uses the build specified in the default run configuration for the actor (typically `latest`).
10241024
+ webhooks: `dGhpcyBpcyBqdXN0IGV4YW1wbGUK...` (string, optional) - Specifies optional webhooks associated with the actor run, which can be used to receive a notification
10251025
e.g. when the actor finished or failed. The value is a Base64-encoded JSON array of objects defining the webhooks. For more information, see
1026-
[Webhooks documenation](https://docs.apify.com/platform/integrations/webhooks).
1026+
[Webhooks documentation](https://docs.apify.com/platform/integrations/webhooks).
10271027

10281028
### With input [POST]
10291029

@@ -1141,7 +1141,7 @@ To run the actor asynchronously, use the [Run actor](#reference/actors/run-colle
11411141
+ build: `0.1.234` (string, optional) - Specifies the actor build to run. It can be either a build tag or build number. By default, the run uses the build specified in the default run configuration for the actor (typically `latest`).
11421142
+ webhooks: `dGhpcyBpcyBqdXN0IGV4YW1wbGUK...` (string, optional) - Specifies optional webhooks associated with the actor run, which can be used to receive a notification
11431143
e.g. when the actor finished or failed. The value is a Base64-encoded JSON array of objects defining the webhooks. For more information, see
1144-
[Webhooks documenation](https://docs.apify.com/platform/integrations/webhooks).
1144+
[Webhooks documentation](https://docs.apify.com/platform/integrations/webhooks).
11451145
+ format: `json` (string, optional) - Format of the results, possible values are: `json`, `jsonl`, `csv`, `html`, `xlsx`, `xml` and `rss`. The default value is `json`.
11461146
+ clean: `false` (boolean, optional) - If `true` or `1` then the API endpoint returns only non-empty items and skips hidden fields
11471147
(i.e. fields starting with the # character).
@@ -1758,7 +1758,7 @@ received in the response JSON to the [Get items](#reference/datasets/item-collec
17581758
e.g. when the actor finished or failed. The value is a Base64-encoded JSON array of objects defining the webhooks.
17591759
**Note**: if you already have a webhook set up for the actor or task, you do not have to add it again here.
17601760
For more information, see
1761-
[Webhooks documenation](https://docs.apify.com/platform/integrations/webhooks).
1761+
[Webhooks documentation](https://docs.apify.com/platform/integrations/webhooks).
17621762

17631763
+ Request
17641764

@@ -1792,7 +1792,7 @@ received in the response JSON to the [Get items](#reference/datasets/item-collec
17921792
in the response. By default, it is `OUTPUT`.
17931793
+ webhooks: `dGhpcyBpcyBqdXN0IGV4YW1wbGUK...` (string, optional) - Specifies optional webhooks associated with the actor run, which can be used to receive a notification
17941794
e.g. when the actor finished or failed. The value is a Base64-encoded JSON array of objects defining the webhooks. For more information, see
1795-
[Webhooks documenation](https://docs.apify.com/platform/integrations/webhooks).
1795+
[Webhooks documentation](https://docs.apify.com/platform/integrations/webhooks).
17961796

17971797
### Run task synchronously (POST) [POST]
17981798

@@ -1898,7 +1898,7 @@ To run the Task asynchronously, use the [Run task asynchronously](#reference/act
18981898
+ build: `0.1.234` (string, optional) - Specifies the actor build to run. It can be either a build tag or build number. By default, the run uses the build specified in the task settings (typically `latest`).
18991899
+ webhooks: `dGhpcyBpcyBqdXN0IGV4YW1wbGUK...` (string, optional) - Specifies optional webhooks associated with the actor run, which can be used to receive a notification
19001900
e.g. when the actor finished or failed. The value is a Base64-encoded JSON array of objects defining the webhooks. For more information, see
1901-
[Webhooks documenation](https://docs.apify.com/platform/integrations/webhooks).
1901+
[Webhooks documentation](https://docs.apify.com/platform/integrations/webhooks).
19021902
+ format: `json` (string, optional) - Format of the results, possible values are: `json`, `jsonl`, `csv`, `html`, `xlsx`, `xml` and `rss`. The default value is `json`.
19031903
+ clean: `false` (boolean, optional) - If `true` or `1` then the API endpoint returns only non-empty items and skips hidden fields
19041904
(i.e. fields starting with the # character).
@@ -3005,7 +3005,7 @@ The pagination is always performed with the granularity of a single item, regard
30053005
By default, the **Items** in the response are sorted by the time they were stored to the database, therefore you can use
30063006
pagination to incrementally fetch the items as they are being added.
30073007
The maximum number of items that will be returned in a single API call is limited to 250,000. <!-- GET_ITEMS_LIMIT -->
3008-
If you specify `desc=1` query paremeter, the results are returned in the reverse order
3008+
If you specify `desc=1` query parameter, the results are returned in the reverse order
30093009
than they were stored (i.e. from newest to oldest items).
30103010
Note that only the order of **Items** is reversed, but not the order of the `unwind` array elements.
30113011

@@ -3081,7 +3081,7 @@ The POST payload is a JSON object or a JSON array of objects to save into the da
30813081
**IMPORTANT:** The limit of request payload size for the dataset is 5 MB. If the array exceeds the size,
30823082
you'll need to split it into a number of smaller arrays.
30833083

3084-
If the dataset has fields schema defined, the push request can potentialy fail with `400 Bad Request` if any item does not match the schema.
3084+
If the dataset has fields schema defined, the push request can potentially fail with `400 Bad Request` if any item does not match the schema.
30853085
In such case, nothing will be inserted into the dataset and the response will contain an error message with a list of invalid items and their validation errors.
30863086

30873087
+ Parameters
@@ -3767,7 +3767,7 @@ parameter.
37673767

37683768
+ Parameters
37693769

3770-
+ dispatchId: `Zib4xbZsmvZeK55ua` (string, required) - Webhook dispacth ID.
3770+
+ dispatchId: `Zib4xbZsmvZeK55ua` (string, required) - Webhook dispatch ID.
37713771
+ token: `soSkq9ekdmfOslopH` (string, required) - API authentication token.
37723772

37733773
### Get webhook dispatch [GET]
@@ -4091,7 +4091,7 @@ a summary of your limits, and your current usage.
40914091
- taggedBuilds (object, nullable)
40924092
- latest (object, nullable)
40934093
- buildId: `z2EryhbfhgSyqj6Hn` (string, nullable)
4094-
- buldNumber: `0.0.2` (string, nullable)
4094+
- buildNumber: `0.0.2` (string, nullable)
40954095
- finishedAt: `2019-06-10T11:15:49.286Z` (string, nullable)
40964096

40974097
## ActCreate (object)

0 commit comments

Comments
 (0)