Skip to content

Commit 79b5f45

Browse files
authored
[Bots] Update AI bots info (#22961)
* [Bots] Update AI bots info * one more perplexity * update to include ai audit
1 parent 942993f commit 79b5f45

File tree

4 files changed

+75
-45
lines changed

4 files changed

+75
-45
lines changed

src/content/docs/ai-audit/get-started.mdx

Lines changed: 11 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@ head:
1111
description: Learn how to set up AI Audit.
1212
---
1313

14-
import { Render, Steps } from "~/components";
14+
import { Details, Render, Steps } from "~/components";
1515

1616
This guide instructs you through
1717

@@ -36,6 +36,16 @@ To use AI Audit:
3636
4. From the **Bot traffic** page, under **Block AI Bots**, select **Enable**.
3737
</Steps>
3838

39+
<Details header="Which bots will Cloudflare block?">
40+
<Render file="list-ai-bots" product="bots" />
41+
</Details>
42+
43+
:::note
44+
45+
For more details on how this rule interacts with other Cloudflare settings, refer to [How it works](/bots/concepts/bot/#how-it-works).
46+
47+
:::
48+
3949
## 2. Block specific bot categories (Enterprise plan only)
4050

4151
Customers on the Enterprise plan -- and with a [Bot Management subscription](/bots/plans/bm-subscription/) -- can choose to only block specific AI crawlers, while allowing others.

src/content/docs/bots/concepts/bot/index.mdx

Lines changed: 33 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -6,18 +6,17 @@ sidebar:
66
learning_center:
77
title: What is a bot?
88
link: https://www.cloudflare.com/learning/bots/what-is-a-bot/
9-
109
---
1110

12-
import { Render } from "~/components"
11+
import { Render } from "~/components";
1312

1413
<Render file="what-is-a-bot" />
1514

1615
Bots can be used for good (chatbots, search engine crawlers) or for evil (inventory hoarding, credential stuffing).
1716

1817
:::note[More information]
1918

20-
For more background, refer to [What is a bot?](https://www.cloudflare.com/learning/bots/what-is-a-bot/).
19+
For more background, refer to [What is a bot?](https://www.cloudflare.com/learning/bots/what-is-a-bot/).
2120
:::
2221

2322
## Verified bots
@@ -26,14 +25,42 @@ For more background, refer to [What is a bot?](https://www.cloudflare.com/learni
2625

2726
:::note
2827

29-
The method for allowing or blocking verified bots depends on [your plan](/bots/get-started/).
28+
The method for allowing or blocking verified bots depends on [your plan](/bots/get-started/).
3029
:::
3130

3231
## AI bots
3332

34-
<Render file="ai-bots-definition" />
33+
You can block artificial intelligence (AI) bots, crawlers, and scrapers from scraping your website content and training large language models (LLM) to recreate it without your permission.
34+
35+
### Which bots are blocked
36+
37+
<Render file="list-ai-bots" />
38+
39+
### How it works
40+
41+
When you enable this feature via a pre-configured managed rule, Cloudflare can detect and block verified AI bots that comply with `robots.txt` and respect crawl rates, and do not hide their behavior from your website. The rule has also been expanded to include more signatures of AI bots that do not follow the rules.
42+
43+
The rule to block AI bots takes precedence over all other Super Bot Fight Mode rules. For example, if you have enabled **Block AI bots** and **Allow verified bots**, verified AI bots will also be blocked even if you allow other verified bots on your website or application.
44+
45+
For Bot Management customers, if you have set a rule to serve managed challenges to definitely automated bots, AI bots will also be challenged because custom rules run in a phase before Super Bot Fight Mode, which is the phase when the rule to block AI bots runs.
46+
47+
This behavior remains the same if the setting for verified, definitely automated, and likely bots is set to `block` or `allow`. If you have an action to `allow` for these rules, the request is not matched to any rule and proceeds to the next ruleset phase. Similarly, if the action is set to `block`, they will be blocked in the earlier phase and do not move on to match the AI rule at all. However, when the action is `challenge`, the request matches a rule and therefore will not be matched to any rules after.
48+
49+
For self-serve non-Bot Management customers, all rules for verified, definitely automated, and likely bots run in the phase following the AI bots rule.
50+
51+
```mermaid
52+
---
53+
title: Rule phases
54+
---
55+
flowchart LR
56+
accTitle: AI bots rule phases diagram
57+
accDescr: This diagram details the phases in which AI bots rules run.
58+
A[Custom rules] --> B[Block AI bots<br>managed rule] --> C[Allow verified bots rule]
59+
```
60+
61+
This feature is available on all Cloudflare plans.
3562

3663
:::note
3764

38-
The method for blocking AI bots depends on [your plan](/bots/get-started/).
65+
The method for blocking AI bots depends on [your plan](/bots/get-started/).
3966
:::

src/content/partials/bots/ai-bots-definition.mdx

Lines changed: 0 additions & 38 deletions
This file was deleted.
Lines changed: 31 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,31 @@
1+
---
2+
{}
3+
---
4+
5+
When you enable this feature, Cloudflare will block the following bots:
6+
7+
- `Amazonbot` (Amazon)
8+
- `Applebot` (Apple)
9+
- `Bytespider` (ByteDance)
10+
- `ChatGPT-User` (OpenAI)
11+
- `ClaudeBot` (Anthropic)
12+
- `Claude-SearchBot` (Anthropic)
13+
- `Claude-User` (Anthropic)
14+
- `DuckAssistBot` (DuckDuckGo)
15+
- `Google-CloudVertexBot` (Google)
16+
- `GoogleOther` (Google)
17+
- `GPTBot` (OpenAI)
18+
- `Meta-ExternalAgent` (Meta)
19+
- `OAI-SearchBot` (OpenAI)
20+
- `PerplexityBot` (Perplexity)
21+
- `Perplexity-User` (Perplexity)
22+
- `PetalBot` (Huawei)
23+
- `TikTokSpider` (ByteDance)
24+
25+
In addition to this list, all verified bots that are classified as `AI Search`, `AI Assistant`, `AI Crawler`, or an `Archiver`, as well as a number of unverified bots that fall into the [verified bot categories](/bots/concepts/bot/verified-bots/categories/) are blocked. It does not block verified bots that fall into the `Search Engine` categories.
26+
27+
:::note
28+
Some AI bots overlap with definitely automated bots and verified bots, the latter becoming verified AI bots.
29+
30+
For a partial list of verified AI Bots, refer to the [Cloudflare Radar](https://radar.cloudflare.com/bots#verified-bots) categories of AI Search, AI Assistant, or AI Crawler, as well as some other bots that harvest data for AI training.
31+
:::

0 commit comments

Comments
 (0)