Skip to content

Commit 92253b4

Browse files
Oxyjunnikitacano
authored andcommitted
[AI Audit] Initialising PPC docs (cloudflare#23350)
* Initialising PPC docs * Making pages visible * Edits * Fixing headings clash * Fixing bad link * Fixing bad redirect file * Update 2025-07-01-pay-per-crawl.mdx --------- Co-authored-by: Nikita Cano <[email protected]>
1 parent 42e8d4e commit 92253b4

27 files changed

+748
-35
lines changed

public/__redirects

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -158,7 +158,6 @@
158158
/ai-gateway/configuration/websockets-api/non-realtime-api/ /ai-gateway/websockets-api/non-realtime-api/ 301
159159
/ai-gateway/configuration/websockets-api/realtime-api/ /ai-gateway/websockets-api/realtime-api/ 301
160160

161-
162161
# agents
163162
/agents/capabilities/mcp-server/ /agents/model-context-protocol/ 301
164163

93.3 KB
Loading
341 KB
Loading
Lines changed: 20 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,20 @@
1+
---
2+
title: Introducing Pay Per Crawl (private beta)
3+
description: A new feature which allows site owners to charge crawlers for access to their content
4+
date: 2025-07-01T12:00:00Z
5+
---
6+
7+
We are introducing a new feature of [AI Audit](/ai-audit/) — Pay Per Crawl. [Pay Per Crawl](/ai-audit/features/pay-per-crawl/what-is-pay-per-crawl/) enables site owners to require payment from AI crawlers every time the crawlers access their content, thereby fostering a fairer Internet by enabling site owners to control and monetize how their content gets used by AI.
8+
9+
![Pay per crawl](~/assets/images/changelog/ai-audit/pay-per-crawl.png)
10+
11+
**For Site Owners:**
12+
- Set pricing and select which crawlers to charge for content access
13+
- Manage payments via Stripe
14+
- Monitor analytics on successful content deliveries
15+
16+
**For AI Crawler Owners:**
17+
- Use HTTP headers to request and accept pricing
18+
- Receive clear confirmations on charges for accessed content
19+
20+
Learn more in the [Pay Per Crawl documentation](/ai-audit/features/pay-per-crawl/what-is-pay-per-crawl/).
File renamed without changes.
Lines changed: 38 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,38 @@
1+
---
2+
title: AI Audit with Cloudflare Bots
3+
pcx_content_type: concept
4+
sidebar:
5+
order: 5
6+
---
7+
8+
import { GlossaryTooltip, Example } from "~/components";
9+
10+
AI Audit works alongside other Cloudflare products, such as Cloudflare [bot solutions](/bots/). Bot solutions identifies traffic matching patterns of known bots, and can challenge or block the bots as you wish.
11+
12+
## Order of precedence
13+
14+
- AI Audit's AI crawler blocking uses [WAF custom rules](/waf/custom-rules/), which take place before Cloudflare bot solutions.
15+
- AI Audit's pay per crawl takes place after Cloudflare bot solutions.
16+
17+
```mermaid
18+
graph LR
19+
A[Traffic] --> B[WAF custom rules<br>AI Audit: Crawler blocks]
20+
B --> C[Cloudflare<br>Bot Solutions]
21+
C --> D[AI Audit:<br>Pay Per Crawl]
22+
classDef highlight fill:#F6821F,color:white
23+
```
24+
25+
For more information on how Cloudflare bot solutions works with WAF custom rules, refer to [How it works](/bots/concepts/bot/#how-it-works).
26+
27+
## Examples
28+
29+
Consider the following examples.
30+
31+
### Bot rule which blocks all AI bots vs pay per crawl
32+
33+
You may have both of the following enabled:
34+
35+
- A selection of AI crawlers to be charged through AI Audit's pay per crawl
36+
- Bot configuration option to [Block AI Bots](/bots/get-started/bot-fight-mode/#block-ai-bots).
37+
38+
Since pay per crawl happens after bot solutions, you need to first turn off **Block AI Bots** to ensure pay per crawl works as intended.
Lines changed: 81 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,81 @@
1+
---
2+
title: AI Audit with Cloudflare WAF
3+
pcx_content_type: concept
4+
sidebar:
5+
order: 2
6+
7+
---
8+
9+
import { GlossaryTooltip, Example, Steps } from "~/components";
10+
11+
AI Audit works alongside other Cloudflare products, such as Cloudflare [Web Application Firewall (WAF)](/waf/). WAF checks incoming web and API requests, and filters undesired traffic based on rules. [WAF custom rules](/waf/custom-rules/) allow you to perform certain actions such as enforcing <GlossaryTooltip term="robots.txt">`robots.txt`</GlossaryTooltip>.
12+
13+
## Order of precedence
14+
15+
- AI Audit uses WAF custom rules to block the selection of AI crawlers the site owner has decided to block.
16+
- AI Audit's pay per crawl feature takes place after WAF.
17+
18+
```mermaid
19+
graph LR
20+
A[Traffic] --> B[WAF custom rules<br>AI Audit: Crawler blocks]
21+
B --> C[Cloudflare<br>Bot Solutions]
22+
C --> D[AI Audit:<br>Pay Per Crawl]
23+
classDef highlight fill:#F6821F,color:white
24+
```
25+
26+
For this reason, if you plan on using AI Audit to manage AI crawlers, you may wish to modify your existing WAF custom rules such that it does not affect AI crawlers. This will allow you to manage AI crawlers only from AI Audit, thereby streamlining your workflow.
27+
28+
:::note[How AI Audit uses WAF custom rules]
29+
When you block AI crawlers via AI Audit (either all or some), you are using **one** WAF custom rule to block those AI crawlers.
30+
31+
If you choose to allow all AI crawlers, AI Audit does not utilize any WAF custom rules.
32+
33+
Depending on the type of account you have, you may have a limited number of WAF custom rules.
34+
:::
35+
36+
## Examples of using WAF vs AI Audit
37+
38+
Consider the following examples.
39+
40+
### Traffic from a restricted country vs pay per crawl
41+
42+
You may have both of the following features enabled:
43+
44+
- [WAF custom rule to block traffic from specific countries](/waf/custom-rules/use-cases/block-traffic-from-specific-countries/)
45+
- AI Audit's [pay per crawl](/ai-audit/features/pay-per-crawl/what-is-pay-per-crawl/) to charge AI crawlers when they request access to your content
46+
47+
Since WAF custom rules are enforced before pay per crawl, traffic (including AI crawlers) from your blocked countries will continue to be blocked, even if they provide the [required headers](/ai-audit/features/pay-per-crawl/use-pay-per-crawl-as-ai-owner/crawl-pages/#1-include-required-headers) for pay per crawl.
48+
49+
### Allowed search engine bots via WAF custom rule vs pay per crawl
50+
51+
You may have both of the following features enabled:
52+
53+
- [WAF custom rule to allow search engine bots](/waf/custom-rules/use-cases/allow-traffic-from-verified-bots/)
54+
- AI Audit's [pay per crawl](/ai-audit/features/pay-per-crawl/what-is-pay-per-crawl/) to charge all AI crawlers when they request access to your content (including search engine bots).
55+
56+
Since WAF custom rules are enforced before pay per crawl:
57+
58+
- Only search engine bots will be able to access your site (enforced by WAF custom rule).
59+
- The search engine bots will then be charged for access to your content (enforced by AI Audit's pay per crawl).
60+
61+
:::note
62+
This example only serves to highlight the order of precedence between WAF and AI Audit.
63+
64+
Practically, it may be beneficial to allow well-behaved search engine bots to access your content to ensure your content is indexed.
65+
:::
66+
67+
### Conflict in AI crawler blocking logic
68+
69+
You may have both of the following features enabled:
70+
71+
- A WAF custom rule which blocks all bots.
72+
- AI Audit selection which allows certain AI crawlers.
73+
74+
In this scenario, you have two WAF custom rules, each directing a different logic for handling AI crawlers. To resolve this issue:
75+
76+
<Steps>
77+
1. Log in to the [Cloudflare dashboard](https://dash.cloudflare.com/), and select your account and domain.
78+
2. Go to **Rules** > **Overview**.
79+
3. Identify your WAF custom rule and the AI Audit rule.
80+
4. Drag the rule you wish to prioritize to the top, or modify your WAF custom rule to ensure it does not conflict with your AI Audit configurations.
81+
</Steps>
Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,12 @@
1+
---
2+
title: Configuration
3+
pcx_content_type: navigation
4+
sidebar:
5+
group:
6+
hideIndex: true
7+
order: 8
8+
---
9+
10+
import { DirectoryListing } from "~/components";
11+
12+
<DirectoryListing />

src/content/docs/ai-audit/features/analyze-ai-crawlers.mdx

Lines changed: 22 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -2,14 +2,14 @@
22
title: Analyze AI crawlers
33
pcx_content_type: concept
44
sidebar:
5-
order: 7
5+
order: 5
66
---
77

8-
import { Tabs, TabItem } from "~/components";
8+
import { Steps, Tabs, TabItem } from "~/components";
99

10-
AI Audit metrics provides you with key insights on how AI crawlers are interacting with your website.
10+
AI Audit metrics provide you with insight on how AI crawlers are interacting with your website ([Cloudflare zone](/fundamentals/concepts/accounts-and-zones/#zones)).
1111

12-
To analyze AI crawlers:
12+
To view AI Audit metrics:
1313

1414
1. Log in to the [Cloudflare dashboard](https://dash.cloudflare.com/), and select your account and domain.
1515
2. Go to **AI Audit**.
@@ -19,23 +19,29 @@ To analyze AI crawlers:
1919

2020
AI Audit provides you with the following metrics to help you understand how AI crawlers are interacting with your website.
2121

22-
| Metric | Description |
23-
| ---------------------------------- | ------------------------------------------------------------------------ |
24-
| Total requests | The total number of requests to crawl your website, from all AI crawlers |
25-
| Allowed requests | The number of requests you have allowed (by allowing AI crawlers) |
26-
| Blocked requests | The number of requests you have blocked (by blocking AI crawlers) |
27-
| Requests by AI crawlers | A graph which displays the number of crawl requests from each AI crawler |
28-
| Most popular paths by AI crawlers | The most popular pages crawled by AI crawlers, for each AI crawler |
29-
| Hosts | Top five most requested hosts (website domains) |
30-
| Paths | Top five most requested paths |
31-
| AI crawler operators | Top five most active AI crawler operators (by requests) |
22+
| Metric | Description |
23+
| ------------------------------------- | ----------------------------------------------------------------------------- |
24+
| Total requests | The total number of requests to crawl your website, from all AI crawlers |
25+
| Blocked requests | The number of requests you have blocked (by blocking AI crawlers) |
26+
| Allowed requests | The number of requests you have allowed (by allowing AI crawlers) |
27+
| Request by AI crawlers | A graph which displays the number of crawl requests from each AI crawler |
28+
| Most popular paths by AI crawlers | The most popular pages crawled by AI crawlers, for each AI crawler |
29+
| Hosts | The owner of the AI crawler |
30+
| Overall popular paths | The most popular pages crawled by AI crawlers, from all AI crawlers |
31+
| Most active AI crawlers by operators | The AI crawler owners with the highest number of requests to access your site |
32+
33+
:::note[Requests in AI Audit metrics]
34+
The number of requests in AI Audit metrics are specifically requests which were met with HTTP code 200 (the request was successfully served, with actual content).
35+
36+
AI Audit metrics filter all other HTTP codes.
37+
:::
3238

3339
## Filter date range
3440

3541
You can use the date filter to choose the period of time you wish to analyze.
3642

3743
<Tabs>
38-
<TabItem label="Free plan">
44+
<TabItem label="Free plans">
3945

4046
Filter options:
4147

@@ -53,4 +59,4 @@ Filter options:
5359
</TabItem>
5460
</Tabs>
5561

56-
The values of the AI Audit metrics will update according to your filter.
62+
The values of the AI Audit metrics will update according to your filter.

src/content/docs/ai-audit/features/manage-ai-crawlers.mdx

Lines changed: 61 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ sidebar:
55
order: 2
66
---
77

8-
import { Steps, GlossaryTooltip, Tabs, TabItem } from "~/components";
8+
import { Steps, GlossaryTooltip, Tabs, TabItem, Example } from "~/components";
99

1010
AI Audit enables you to take specific action for each AI crawler.
1111

@@ -17,9 +17,9 @@ To manage AI crawlers:
1717

1818
## View the list of AI crawlers
1919

20-
The **AI Crawlers** tab displays a table of AI crawlers that are requesting access to your content, and how they interact with your pages. The table provides the following information, depending on the type of plan you are on.
20+
The **AI Crawlers** tab displays a table of AI crawlers that are requesting access to your content, and how they interact with your pages. The table provides the following information.
2121

22-
<Tabs syncKey ="plan">
22+
<Tabs syncKey="plan">
2323
<TabItem label="Free plan">
2424
| Column | Details |
2525
| --------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
@@ -29,24 +29,75 @@ The **AI Crawlers** tab displays a table of AI crawlers that are requesting acce
2929
| Block | Toggle for blocking specific AI crawlers. |
3030

3131
</TabItem> <TabItem label="Paid plan">
32-
3332
| Column | Details |
3433
| --------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
3534
| AI Crawlers | The name of the AI crawler. |
3635
| Operator | The name of the entity who owns the AI crawler. Note that an operator may have multiple AI crawlers. |
3736
| Category | The category of the AI crawler. Refer to [Verified bot categories](/bots/concepts/bot/verified-bots/categories/). |
3837
| Requests | The number of times the AI crawler has requested to crawl your content. |
3938
| Robots.txt violations | The number of times the AI crawler has violated your <GlossaryTooltip term="robots.txt">`robots.txt`</GlossaryTooltip> file. |
40-
| Block | Toggle for blocking specific AI crawlers. |
39+
| Action | The action you wish to take for the AI crawler. Refer to [Take action for each AI crawler](/ai-audit/features/manage-ai-crawlers/#take-action-for-each-ai-crawler). |
4140

4241
</TabItem>
4342
</Tabs>
4443

45-
## Block access
44+
## Take action for each AI crawler
45+
46+
<Tabs>
47+
<TabItem label="With pay per crawl">
48+
49+
For each AI crawler, you can take one of three actions: allow, charge, or block.
50+
51+
<Example title="Allow access">
52+
53+
You can allow an AI crawler to scrape your content.
54+
55+
From the **Actions** column, select **Allow**.
56+
57+
Note that you can still choose to [Enforce `robots.txt`](/ai-audit/features/manage-ai-crawlers/#take-action-for-each-ai-crawler).
58+
59+
For more details on how this rule interacts with other Cloudflare settings, refer to [How it works](/bots/concepts/bot/#how-it-works).
60+
61+
</Example>
62+
63+
<Example title="Charge for crawl (Enterprise only)">
64+
65+
You can charge the owner of the AI crawler for each crawl request.
66+
67+
From the **Actions** column, select **Charge**.
68+
69+
For more information, refer to [What is Pay Per Crawl?](/ai-audit/features/pay-per-crawl/what-is-pay-per-crawl/).
70+
71+
</Example>
72+
73+
<Example title="Block access">
4674

4775
You can block an AI crawler to completely stop the AI crawler from scraping your webpage.
4876

77+
From the **Actions** column, select **Block**.
78+
</Example>
79+
80+
</TabItem>
81+
<TabItem label="Without pay per crawl">
82+
83+
For each AI crawler, you can choose to allow or block access.
84+
85+
<Example title="Allow access">
86+
87+
You can allow an AI crawler to scrape your content. Note that you can still choose to [Enforce `robots.txt`](/ai-audit/features/manage-ai-crawlers/#take-action-for-each-ai-crawler).
88+
89+
For more details on how this rule interacts with other Cloudflare settings, refer to [How it works](/bots/concepts/bot/#how-it-works).
90+
91+
</Example>
92+
93+
<Example title="Block access">
94+
You can block an AI crawler to completely stop the AI crawler from scraping your webpage.
95+
4996
Use the toggles in the **Block** column to block specific AI crawlers from accessing your content.
97+
</Example>
98+
99+
</TabItem>
100+
</Tabs>
50101

51102
## Filter AI crawler data
52103

@@ -57,3 +108,7 @@ You can use filters to narrow the scope of your result.
57108
- **Category:** Filter by the category of the AI crawler (for example, AI crawler, AI assistant, or archiver).
58109

59110
The values of the table will update according to your filter.
111+
112+
## Related resources
113+
114+
- Use [pay per crawl](/ai-audit/features/pay-per-crawl/what-is-pay-per-crawl/) to charge AI crawlers every time they access your content.

0 commit comments

Comments
 (0)