Skip to content

Commit 14379cc

Browse files
committed
notes and disclaimers
1 parent 26ccc75 commit 14379cc

File tree

2 files changed

+11
-0
lines changed

2 files changed

+11
-0
lines changed

src/content/docs/autorag/configuration/data-source/website.mdx

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,10 @@ You can only crawl domains that you have onboarded onto the same Cloudflare acco
1313
Refer to [Onboard a domain](/fundamentals/manage-domains/add-site/) for more information on adding a domain to your Cloudflare account.
1414
:::
1515

16+
:::caution[Bot protection may block crawling]
17+
If you use Cloudflare products that control or restrict bot traffic such as [Bot Management](/bots/), [WAF](/waf/), or [Turnstile](/turnstile/), the same rules will apply to the AutoRAG crawler. Make sure to configure an exception or allow-list for the AutoRAG crawler in your settings.
18+
:::
19+
1620
## How website crawling works
1721
When you connect a domain, the crawler looks for your website’s sitemap to determine which pages to visit:
1822

src/content/docs/autorag/tutorial/brower-rendering-autorag-tutorial.mdx

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -9,6 +9,13 @@ import { TypeScriptExample } from "~/components";
99

1010
AutoRAG is designed to work out of the box with data in R2 buckets. But what if your content lives on a website or needs to be rendered dynamically?
1111

12+
:::note
13+
14+
AutoRAG now lets you use your [website](/autorag/configuration/data-source/website/) as a data source. When enabled, AutoRAG will automatically crawl and parse your site content for you.
15+
16+
:::
17+
18+
1219
In this tutorial, we’ll walk through how to:
1320

1421
1. Render your website using Cloudflare's Browser Rendering API

0 commit comments

Comments
 (0)