Skip to content

Commit 45c39d2

Browse files
committed
style: improve section about proxy links
1 parent 3312662 commit 45c39d2

File tree

1 file changed

+14
-4
lines changed
  • sources/academy/webscraping/anti_scraping/mitigation

1 file changed

+14
-4
lines changed

sources/academy/webscraping/anti_scraping/mitigation/proxies.md

Lines changed: 14 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -26,17 +26,27 @@ Although IP quality is still the most important factor when it comes to using pr
2626

2727
Fixing rate-limiting issues is only the tip of the iceberg of what proxies can do for your scrapers, though. By implementing proxies properly, you can successfully avoid the majority of anti-scraping measures listed in the [previous lesson](../index.md).
2828

29-
## A bit about proxy links {#understanding-proxy-links}
29+
## About proxy links {#understanding-proxy-links}
3030

31-
When using proxies in your crawlers, you'll most likely be using them in a format that looks like this:
31+
To use a proxy, you need a proxy link, which contains the connection details, sometimes including credentials.
3232

3333
```text
3434
http://proxy.example.com:8080
3535
```
3636

37-
This link is separated into two main components: the **host**, and the **port**. In our case, our hostname is `http://proxy.example.com`, and our port is `8080`. Sometimes, a proxy might use an IP address as the host, such as `103.130.104.33`.
37+
The proxy link above has several parts:
3838

39-
If authentication (a username and a password) is required, the format will look a bit different:
39+
- `http://` tells us we're using HTTP protocol,
40+
- `proxy.example.com` is a hostname, i.e. an address to the proxy server,
41+
- `8080` is a port number.
42+
43+
Sometimes the proxy server has no name, so the link contains an IP address instead:
44+
45+
```text
46+
http://123.456.789.10:8080
47+
```
48+
49+
If proxy requires authentication, the proxy link can contain username and password:
4050

4151
```text
4252
http://USERNAME:[email protected]:8080

0 commit comments

Comments
 (0)