Skip to content

Commit a6ade34

Browse files
AI Crawl Control: Reflect Dashboard Changes and Update Pay Per Crawl Site Owner Onboarding (#25081)
* AI Crawl Control: updated to reflect dash changes and pay per crawl onboarding AI Crawl Control: removed duplicate FAQ AI Crawl Control: fixed missing links AI Crawl Control: fixed missing links * Setting up redirects, PCX review. * Nit fix --------- Co-authored-by: Jun Lee <[email protected]>
1 parent 0643a72 commit a6ade34

File tree

18 files changed

+327
-238
lines changed

18 files changed

+327
-238
lines changed

public/__redirects

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -153,6 +153,12 @@
153153
/ai-audit/features/enforce-robots-txt/ /ai-crawl-control/features/analyze-ai-crawlers/ 301
154154
/ai-audit/features/analyze-ai-crawlers/ /ai-crawl-control/features/analyze-ai-traffic/ 301
155155

156+
# ai crawl control
157+
/ai-crawl-control/reference/faq/ /ai-crawl-control/features/pay-per-crawl/faq/ 301
158+
/ai-crawl-control/features/pay-per-crawl/use-pay-per-crawl-as-site-owner/set-up-cloudflare-account/ /ai-crawl-control/features/pay-per-crawl/use-pay-per-crawl-as-site-owner/enable-in-account-settings/ 301
159+
/ai-crawl-control/features/pay-per-crawl/use-pay-per-crawl-as-site-owner/connect-to-stripe/ /ai-crawl-control/features/pay-per-crawl/use-pay-per-crawl-as-site-owner/manage-payouts/ 301
160+
/ai-crawl-control/features/pay-per-crawl/use-pay-per-crawl-as-site-owner/monitor-metrics/ /ai-crawl-control/features/pay-per-crawl/use-pay-per-crawl-as-site-owner/monitor-activity/ 301
161+
156162
# AI Gateway
157163
/ai-gateway/get-started/configuring-settings/ /ai-gateway/get-started/ 301
158164
/ai-gateway/get-started/connecting-applications/ /ai-gateway/get-started/ 301
@@ -209,7 +215,7 @@
209215
/analytics/analytics-integrations/google-cloud/ /analytics/analytics-integrations/ 301
210216
/analytics/dashboards/ /log-explorer/custom-dashboards/ 301
211217
/analytics/analytics-integrations/looker/ /analytics/analytics-integrations/ 301
212-
/analytics/network-analytics/reference/network-analytics-v1/ /analytics/graphql-api/migration-guides/network-analytics-v2/ 301
218+
/analytics/network-analytics/reference/network-analytics-v1/ /analytics/graphql-api/migration-guides/network-analytics-v2/ 301
213219

214220
# email-security
215221
/email-security/reporting/search/detection-search/ /email-security/reporting/search/ 301

src/content/docs/ai-crawl-control/features/analyze-ai-traffic.mdx

Lines changed: 14 additions & 21 deletions
Original file line numberDiff line numberDiff line change
@@ -16,38 +16,31 @@ To view AI Crawl Control metrics:
1616
1. Log in to the [Cloudflare dashboard](https://dash.cloudflare.com/), and select your account and domain.
1717
2. Go to **AI Crawl Control**.
1818

19-
You can find meaningful information across both **AI Crawlers** and **Metrics** tabs.
19+
You can find meaningful information across both **Crawlers** and **Metrics** tabs.
2020

2121
## View AI crawler activity
2222

23-
The **AI Crawlers** tab provides you with the following information:
23+
The **Crawlers** tab provides you with the following information:
2424

25-
- Complete list of detected AI crawlers and their operators
25+
- Total number of requests to crawl your website from common AI crawlers
2626
- Number of requests made by each AI crawler
2727
- Number of `robots.txt` violations for each crawler
2828

2929
## View AI Crawl Control metrics
3030

3131
The **Metrics** tab provides you with the following metrics to help you understand how AI crawlers are interacting with your website.
3232

33-
| Metric | Description |
34-
| ------------------------------------ | ----------------------------------------------------------------------------- |
35-
| Total requests | The total number of requests to crawl your website, from all AI crawlers |
36-
| Blocked requests | The number of crawler requests you have blocked, from any rule |
37-
| Allowed requests | The number of crawler requests you have allowed |
38-
| Hosts | The owner of the AI crawler |
39-
| Overall popular paths | The most popular pages crawled by AI crawlers, from all AI crawlers |
40-
| Most active AI crawlers by operators | The AI crawler owners with the highest number of requests to access your site |
41-
| Request by AI crawlers | A graph which displays the number of crawl requests from each AI crawler |
42-
| Most popular paths by AI crawlers | The most popular pages crawled by AI crawlers, for each AI crawler |
43-
| Referrals | A graph which displays the number of referrals from each AI operator |
44-
| Referers | The list of referers who directed traffic to your site |
45-
46-
:::note[Requests in AI Crawl Control metrics]
47-
The number of requests in AI Crawl Control metrics are specifically requests which were met with HTTP code 200 (the request was successfully served, with actual content).
48-
49-
AI Crawl Control metrics filter all other HTTP codes.
50-
:::
33+
| Metric | Description |
34+
| ------------------------------------ | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
35+
| Total requests | The total number of requests to crawl your website, from all AI crawlers |
36+
| Allowed requests | The number of crawler requests that received a successful response from your site |
37+
| Unsuccessful requests | The number of crawler requests that failed (HTTP 4xx or 5xx) as a result of a blocked request, other security rules, or website errors such as a crawler attempting to access a non-existent page |
38+
| Overall popular paths | The most popular pages crawled by AI crawlers, from all AI crawlers |
39+
| Most active AI crawlers by operators | The AI crawler owners with the highest number of requests to access your site |
40+
| Request by AI crawlers | A graph which displays the number of crawl requests from each AI crawler |
41+
| Most popular paths by AI crawlers | The most popular pages crawled by AI crawlers, for each AI crawler |
42+
| Referrals | A graph which displays the number of visits that were directed to your site from each AI operator |
43+
| Referers | The list of referers who directed visits to your site |
5144

5245
## Filter date range
5346

src/content/docs/ai-crawl-control/features/manage-ai-crawlers.mdx

Lines changed: 40 additions & 35 deletions
Original file line numberDiff line numberDiff line change
@@ -13,22 +13,22 @@ To manage AI crawlers:
1313

1414
1. Log in to the [Cloudflare dashboard](https://dash.cloudflare.com/), and select your account and domain.
1515
2. Go to **AI Crawl Control**.
16-
3. Go to the **AI Crawlers** tab.
16+
3. Go to the **Crawlers** tab.
1717

1818
## Review AI crawler activity
1919

2020
The **Crawlers** tab displays a table of AI crawlers that are requesting access to your content, and how they interact with your pages. The table provides the following information.
2121

22-
| Column | Details |
23-
| --------------------- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
24-
| Crawler | The name of the AI crawler and the operator that owns it. |
25-
| Category | The category of the AI crawler. Refer to [Verified bot categories](/bots/concepts/bot/verified-bots/#categories). |
26-
| Requests | Total allowed and blocked requests with trend chart. Blocked requests may come from any configured rule, not just the actions shown here. |
27-
| Robots.txt violations | The number of times the AI crawler has violated your <GlossaryTooltip term="robots.txt">`robots.txt`</GlossaryTooltip> file. |
28-
| Action | The action you wish to take for the AI crawler. Refer to [Take action for each AI crawler](/ai-crawl-control/features/manage-ai-crawlers/#take-action-for-each-ai-crawler). |
22+
| Column | Details |
23+
| --------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
24+
| Crawler | The name of the AI crawler and the operator that owns it. |
25+
| Category | The category of the AI crawler. Refer to [Verified bot categories](/bots/concepts/bot/verified-bots/#categories). |
26+
| Requests | The total number of allowed and unsuccessful requests, with trend chart. Unsuccessful requests may come from any rule or response error, not just the block action in AI Crawl Control. |
27+
| Robots.txt violations | The number of times the AI crawler has violated your <GlossaryTooltip term="robots.txt">`robots.txt`</GlossaryTooltip> file. |
28+
| Action | The action you wish to take for the AI crawler. Refer to [Take action for each AI crawler](/ai-crawl-control/features/manage-ai-crawlers/#take-action-for-each-ai-crawler). |
2929

3030
:::note[Quality of AI crawler detection]
31-
On the free plan, AI Crawl Control identifies AI crawlers based on their [user agent strings](https://developer.mozilla.org/en-US/docs/Web/HTTP/Reference/Headers/User-Agent). This enables AI Crawl Control to detect easy-to-detect (well-known) AI crawlers.
31+
On the free plan, AI Crawl Control identifies AI crawlers based on their [user agent strings](https://developer.mozilla.org/en-US/docs/Web/HTTP/Reference/Headers/User-Agent). This enables AI Crawl Control to detect well-known, self-identifying AI crawlers.
3232

3333
Upgrade your plan to enable a more thorough detection using Cloudflare's [Bot Management detection ID](/bots/reference/bot-management-variables/#ruleset-engine-fields) field.
3434
:::
@@ -46,31 +46,21 @@ The values of the table will update according to your filter.
4646
## Take action for each AI crawler
4747

4848
<Tabs>
49-
<TabItem label="With pay per crawl">
50-
51-
:::note[Pay per crawl closed beta]
52-
Pay per crawl is currently in closed beta.
53-
54-
To find out how to join the beta program, reach out to us at [Pay per crawl signup](http://www.cloudflare.com/paypercrawl-signup/), or contact your account executive if you are an existing Enterprise customer.
55-
56-
To learn more about pay per crawl, refer to Cloudflare blog: [Introducing pay per crawl: enabling content owners to charge AI crawlers for access](https://blog.cloudflare.com/introducing-pay-per-crawl/).
57-
:::
49+
<TabItem label="Without pay per crawl">
5850

59-
For each AI crawler, you can take one of three actions: allow, charge, or block.
51+
For each AI crawler, you can choose to allow or block access.
6052

6153
<Example title="Allow access">
6254

6355
- **Summary:** You can allow an AI crawler to scrape your content.
6456
- **When to use:** Allow AI crawlers that offer services which provide value through citations, referrals, or existing agreements.
6557
- **Implementation:** From the **Actions** column, select **Allow**.
66-
Note that you can still choose to [Enforce `robots.txt`](/ai-crawl-control/features/manage-ai-crawlers/#take-action-for-each-ai-crawler).
6758

68-
For more details on how this rule interacts with other Cloudflare settings, refer to [How it works](/bots/concepts/bot/#how-it-works).
59+
Note that you can still choose to [Enforce `robots.txt`](/ai-crawl-control/features/manage-ai-crawlers/#take-action-for-each-ai-crawler).
6960

7061
</Example>
7162

7263
<Example title="Block access">
73-
7464
- **Summary:** You can block an AI crawler to completely stop the AI crawler from scraping your webpage.
7565
- **When to use:** Block AI crawlers when their behavior do not align with your content strategy, or violate your policies.
7666
- **Implementation:** From the **Actions** column, select **Block**.
@@ -79,32 +69,32 @@ Note that you can configure the response that gets returned when blocking an AI
7969

8070
</Example>
8171

82-
<Example title="Charge for crawl (private beta)">
83-
84-
- **Summary:** You can charge the owner of the AI crawler for each crawl request.
85-
- **When to use:** Charge AI crawlers when your content has training value, and you want to explore monetization options
86-
- **Implementation:** From the **Actions** column, select **Charge**.
72+
</TabItem>
73+
<TabItem label="With pay per crawl">
8774

88-
For more information, refer to [What is Pay Per Crawl?](/ai-crawl-control/features/pay-per-crawl/what-is-pay-per-crawl/).
75+
:::note[Pay per crawl closed beta]
76+
Pay per crawl is currently in closed beta.
8977

90-
</Example>
78+
To find out how to join the beta program, reach out to us at [Pay per crawl signup](http://www.cloudflare.com/paypercrawl-signup/), or contact your account executive if you are an existing Enterprise customer.
9179

92-
</TabItem>
93-
<TabItem label="Without pay per crawl">
80+
To learn more about pay per crawl, refer to Cloudflare blog: [Introducing pay per crawl: enabling content owners to charge AI crawlers for access](https://blog.cloudflare.com/introducing-pay-per-crawl/).
81+
:::
9482

95-
For each AI crawler, you can choose to allow or block access.
83+
For each AI crawler, you can take one of three actions: allow, charge, or block.
9684

9785
<Example title="Allow access">
9886

9987
- **Summary:** You can allow an AI crawler to scrape your content.
10088
- **When to use:** Allow AI crawlers that offer services which provide value through citations, referrals, or existing agreements.
10189
- **Implementation:** From the **Actions** column, select **Allow**.
90+
Note that you can still choose to [Enforce `robots.txt`](/ai-crawl-control/features/manage-ai-crawlers/#take-action-for-each-ai-crawler).
10291

103-
Note that you can still choose to [Enforce `robots.txt`](/ai-crawl-control/features/manage-ai-crawlers/#take-action-for-each-ai-crawler).
92+
For more details on how this rule interacts with other Cloudflare settings, refer to [How it works](/bots/concepts/bot/#how-it-works).
10493

10594
</Example>
10695

10796
<Example title="Block access">
97+
10898
- **Summary:** You can block an AI crawler to completely stop the AI crawler from scraping your webpage.
10999
- **When to use:** Block AI crawlers when their behavior do not align with your content strategy, or violate your policies.
110100
- **Implementation:** From the **Actions** column, select **Block**.
@@ -113,9 +103,24 @@ Note that you can configure the response that gets returned when blocking an AI
113103

114104
</Example>
115105

106+
<Example title="Charge for crawl (private beta)">
107+
108+
- **Summary:** You can charge the owner of the AI crawler for each successful crawl request.
109+
- **When to use:** Charge AI crawlers when your content has training value, and you want to explore monetization options.
110+
- **Implementation:** From the **Actions** column, select **Charge**.
111+
112+
For more information, refer to [What is Pay Per Crawl?](/ai-crawl-control/features/pay-per-crawl/what-is-pay-per-crawl/).
113+
114+
</Example>
115+
116116
</TabItem>
117+
117118
</Tabs>
118119

120+
:::tip[Need more advanced control?]
121+
You can also create more complex rules when taking action on AI crawlers, using [Cloudflare WAF](/waf/). For more information on creating more specific rules, refer to [Create a custom rule in the dashboard](/waf/custom-rules/create-dashboard/).
122+
:::
123+
119124
## Configure block response
120125

121126
<Plan type="paid" />
@@ -147,8 +152,8 @@ You can choose which HTTP response code to return when blocking an AI crawler.
147152

148153
Use the dropdown menu to select the desired response code. You can choose from:
149154

150-
- `403 Forbidden`: Use this option if you wish to indicate that you do not want the AI crawler to access your content.
151-
- `402 Payment Required`: Use this option if you wish to indicate that the AI crawler must pay to access your content.
155+
- `403 Forbidden`: Use this option if you wish to indicate that you do not want the AI crawler to access your content.
156+
- `402 Payment Required`: Use this option if you wish to indicate that the AI crawler must pay to access your content.
152157

153158
:::note
154159
Behind the scenes, AI Crawl Control uses [Cloudflare WAF](/waf/) to return custom block responses.
Lines changed: 28 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,28 @@
1+
---
2+
title: Pay Per Crawl FAQ
3+
pcx_content_type: concept
4+
sidebar:
5+
order: 4
6+
---
7+
8+
import { Steps } from "~/components";
9+
10+
## Frequently asked questions for site owners
11+
12+
### Can I set different prices for different AI crawlers?
13+
14+
No. Pay per crawl allows you to configure different actions (Block, Charge, or Allow) for each crawler, but you can only set a single price that applies to all crawlers configured with the "Charge" option.
15+
16+
## Frequently asked questions for AI bot operators
17+
18+
### Will I be charged for re-crawling the same page?
19+
20+
Yes. Every time your AI crawler accesses content on a website protected with pay per crawl, it will incur the cost set by the site owner. You should implement mechanisms within your crawler to track expenditure and enforce any spending limits you want to set.
21+
22+
### Am I charged for error responses?
23+
24+
No. Charging events are only triggered for successful HTTP response codes. Error responses are not billed, even if you have sent the `crawler-exact-price` or `crawler-max-price` headers.
25+
26+
### What user agent should I use?
27+
28+
Use the standard user agents associated with your AI crawler that you have onboarded to Cloudflare and identified through Web Bot Auth.

src/content/docs/ai-crawl-control/features/pay-per-crawl/use-pay-per-crawl-as-ai-owner/crawl-pages.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -85,4 +85,4 @@ Cloudflare strongly recommends tracking and saving these values to keep an accur
8585

8686
You may wish to refer to the following resources.
8787

88-
- [AI Crawl Control FAQs](/ai-crawl-control/reference/faq/).
88+
- [Pay Per Crawl FAQs](/ai-crawl-control/features/pay-per-crawl/faq).

src/content/docs/ai-crawl-control/features/pay-per-crawl/use-pay-per-crawl-as-ai-owner/index.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ pcx_content_type: navigation
44
sidebar:
55
group:
66
hideIndex: true
7-
order: 5
7+
order: 3
88
---
99

1010
import { DirectoryListing } from "~/components";

0 commit comments

Comments
 (0)