Commit fdc400b

PCX review
1 parent 4826f79 commit fdc400b

8 files changed (+60, -53 lines)

src/content/docs/r2/api/tokens.mdx

Lines changed: 3 additions & 3 deletions
@@ -47,14 +47,14 @@ Jurisdictional buckets can only be accessed via the corresponding jurisdictional
  | Permission | Description |
  | ------------------- | ----------- |
- | Admin Read & Write | Allows the ability to create, list, and delete buckets, edit bucket configuration, read, write, and list objects, and read and write access to data catalog tables and associated metadata. |
- | Admin Read only | Allows the ability to list buckets and view bucket configuration, read and list objects, and read access to data catalog tables and associated metadata. |
+ | Admin Read & Write | Allows the ability to create, list, and delete buckets, edit bucket configuration, read, write, and list objects, and read and write to data catalog tables and associated metadata. |
+ | Admin Read only | Allows the ability to list buckets and view bucket configuration, read and list objects, and read from the data catalog tables and associated metadata. |
  | Object Read & Write | Allows the ability to read, write, and list objects in specific buckets. |
  | Object Read only | Allows the ability to read and list objects in specific buckets. |

  :::note

- Currently Admin Read & Write or Admin Read only permission is required to interact with and query [R2 Data Catalog](/r2/data-catalog/).
+ Currently **Admin Read & Write** or **Admin Read only** permission is required to interact with [R2 Data Catalog](/r2/data-catalog/).

  :::

src/content/docs/r2/data-catalog/config-examples/pyiceberg.mdx

Lines changed: 2 additions & 2 deletions
@@ -8,8 +8,8 @@ Below is an example of using [PyIceberg](https://py.iceberg.apache.org/) to conn
  ## Prerequisites

  - Sign up for a [Cloudflare account](https://dash.cloudflare.com/sign-up/workers-and-pages).
- - Create an [R2 bucket](/r2/buckets/) and enable the data catalog.
- - Create an [R2 API token](/r2/api/tokens/) with both [R2 and data catalog permissions](/r2/api/tokens/#permissions).
+ - [Create an R2 bucket](/r2/buckets/create-buckets/) and enable the data catalog.
+ - [Create an R2 API token](/r2/api/tokens/) with both [R2 and data catalog permissions](/r2/api/tokens/#permissions).
  - Install the [PyIceberg](https://py.iceberg.apache.org/#installation) and [PyArrow](https://arrow.apache.org/docs/python/install.html) libraries.

  ## Example usage

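The hunk above ends where the page's example begins. For quick reference while reviewing, a minimal PyIceberg connection sketch looks like the following; the `uri`, `warehouse`, and `token` values are placeholders from your own catalog setup, not values introduced by this commit:

```python
# Minimal sketch: connect PyIceberg to an Iceberg REST catalog such as R2 Data Catalog.
# The three placeholder values come from enabling the catalog and creating an API token.
from pyiceberg.catalog.rest import RestCatalog

catalog = RestCatalog(
    name="r2-data-catalog",
    uri="<CATALOG_URI>",        # shown when you enable the catalog on a bucket
    warehouse="<WAREHOUSE>",    # also shown when you enable the catalog
    token="<TOKEN>",            # R2 API token, sent as a bearer token
)

print(catalog.list_namespaces())  # confirms the connection works
```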
src/content/docs/r2/data-catalog/config-examples/snowflake.mdx

Lines changed: 3 additions & 3 deletions
@@ -8,13 +8,13 @@ Below is an example of using [Snowflake](https://docs.snowflake.com/en/user-guid
  ## Prerequisites

  - Sign up for a [Cloudflare account](https://dash.cloudflare.com/sign-up/workers-and-pages).
- - Create an [R2 bucket](/r2/buckets/) and enable the data catalog.
- - Create an [R2 API token](/r2/api/tokens/) with both [R2 and data catalog permissions](/r2/api/tokens/#permissions).
+ - [Create an R2 bucket](/r2/buckets/create-buckets/) and enable the data catalog.
+ - [Create an R2 API token](/r2/api/tokens/) with both [R2 and data catalog permissions](/r2/api/tokens/#permissions).
  - A [Snowflake](https://www.snowflake.com/) account with the necessary privileges to create external volumes and catalog integrations.

  ## Example usage

- In your Snowflake [SQL worksheet](https://docs.snowflake.com/en/user-guide/ui-snowsight-worksheets-gs) or [notebook](https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks) run the following commands:
+ In your Snowflake [SQL worksheet](https://docs.snowflake.com/en/user-guide/ui-snowsight-worksheets-gs) or [notebook](https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks), run the following commands:

  ```sql
  -- Create a database (if you don't already have one) to organize your external data

src/content/docs/r2/data-catalog/config-examples/spark.mdx

Lines changed: 28 additions & 24 deletions
@@ -3,20 +3,25 @@ title: Spark
  pcx_content_type: example
  ---

- Below is an example of how you can build an [Apache Spark](https://spark.apache.org/) application (with Scala) which connects to the R2 Data Catalog. This application is built to run locally, but it can be adapted to run on a cluster.
+ import { FileTree } from "~/components"
+
+
+ Below is an example of how you can build an [Apache Spark](https://spark.apache.org/) application (with Scala) which connects to R2 Data Catalog. This application is built to run locally, but it can be adapted to run on a cluster.

  ## Prerequisites

  - Sign up for a [Cloudflare account](https://dash.cloudflare.com/sign-up/workers-and-pages).
- - Create an [R2 bucket](/r2/buckets/) and enable the data catalog.
- - Create an [R2 API token](/r2/api/tokens/) with both [R2 and data catalog permissions](/r2/api/tokens/#permissions).
+ - [Create an R2 bucket](/r2/buckets/create-buckets/) and enable the data catalog.
+ - [Create an R2 API token](/r2/api/tokens/) with both [R2 and data catalog permissions](/r2/api/tokens/#permissions).
  - Install Java 17, Spark 3.5.3, and SBT 1.10.11
    - Note: The specific versions of tools are critical for getting things to work in this example.
    - Tip: ["SDKMAN"](https://sdkman.io/) is a convenient package manager for installing SDKs.

  ## Example usage

- To start, create a new empty project directory somewhere on your machine. Inside that directory, create the following file at `src/main/scala/com/example/R2DataCatalogDemo.scala`. This will serve as the main entry point for your Spark application.
+ To start, create a new empty project directory somewhere on your machine.
+
+ Inside that directory, create the following file at `src/main/scala/com/example/R2DataCatalogDemo.scala`. This will serve as the main entry point for your Spark application.

  ```java
  package com.example

@@ -105,7 +110,7 @@ To enable the [sbt-assembly plugin](https://github.com/sbt/sbt-assembly?tab=read
  addSbtPlugin("com.eed3si9n" % "sbt-assembly" % "1.2.0")
  ```

- Make sure Java, Spark, and sbt are installed and available in your shell. If you're using SDKMAN, you can install them as shown below:
+ Make sure Java, Spark, and sbt are installed and available in your shell. If you are using SDKMAN, you can install them as shown below:

  ```bash
  sdk install java 17.0.14-amzn

@@ -121,7 +126,7 @@ sbt clean assembly
  After building, the output JAR should be located at `target/scala-2.12/R2DataCatalogDemo-assembly-1.0.jar`.

- To run the application, you'll use `spark-submit`. Below is an example shell script (`submit.sh`) that includes the necessary Java compatibility flags for Spark on Java 17:
+ To run the application, you will use `spark-submit`. Below is an example shell script (`submit.sh`) that includes the necessary Java compatibility flags for Spark on Java 17:

  ```
  # We need to set these "--add-opens" so that Spark can run on Java 17 (it needs access to

@@ -142,23 +147,22 @@ chmod +x submit.sh
  At this point, your project directory should be structured like this:

- ```
- .
- ├── Makefile
- ├── README.md
- ├── build.sbt
- ├── project
- │   ├── assembly.sbt
- │   ├── build.properties
- │   └── project
- ├── spark-submit.sh
- └── src
-     └── main
-         └── scala
-             └── com
-                 └── example
-                     └── R2DataCatalogDemo.scala
- ```
+ <FileTree>
+ - Makefile
+ - README.md
+ - build.sbt
+ - project
+   - assembly.sbt
+   - build.properties
+   - project
+ - spark-submit.sh
+ - src
+   - main
+     - scala
+       - com
+         - example
+           - R2DataCatalogDemo.scala
+ </FileTree>

  Before submitting the job, make sure you have the required environment variable set for your catalog URI, warehouse, and [Cloudflare API token](/r2/api/tokens/).

@@ -168,7 +172,7 @@ export WAREHOUSE=
  export TOKEN=
  ```

- You're now ready to run the job:
+ You are now ready to run the job:

  ```bash
  ./submit.sh

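As an aside for readers who would rather prototype in Python than Scala: the same catalog wiring can be expressed with PySpark. This is a sketch under assumptions; the catalog name `mydemo`, the Iceberg runtime coordinates, and the environment variable names mirror the Scala example and are not part of this commit:

```python
# Sketch: PySpark equivalent of the Scala example's REST catalog configuration.
import os
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder.appName("R2DataCatalogDemo")
    # The Iceberg runtime must match your Spark/Scala versions; this coordinate is an assumption.
    .config("spark.jars.packages", "org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.5.2")
    .config("spark.sql.extensions", "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions")
    .config("spark.sql.catalog.mydemo", "org.apache.iceberg.spark.SparkCatalog")
    .config("spark.sql.catalog.mydemo.type", "rest")
    .config("spark.sql.catalog.mydemo.uri", os.environ["CATALOG_URI"])
    .config("spark.sql.catalog.mydemo.warehouse", os.environ["WAREHOUSE"])
    .config("spark.sql.catalog.mydemo.token", os.environ["TOKEN"])
    .getOrCreate()
)

# Quick connectivity check against the configured catalog.
spark.sql("SHOW NAMESPACES IN mydemo").show()
```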
src/content/docs/r2/data-catalog/get-started.mdx

Lines changed: 8 additions & 8 deletions
@@ -1,6 +1,6 @@
  ---
  pcx_content_type: get-started
- title: Get started
+ title: Getting started
  head: []
  sidebar:
    order: 2

@@ -44,7 +44,7 @@ This guide will instruct you through:
  npx wrangler login
  ```

- 2. Then, enable the catalog on your chosen R2 bucket:
+ 2. Enable the catalog on your chosen R2 bucket:

  ```
  npx wrangler r2 bucket r2-data-catalog-tutorial

@@ -104,20 +104,20 @@ Iceberg clients (including [PyIceberg](https://py.iceberg.apache.org/)) must aut
  6. Select **Create API Token**.

- 7. Note the **Token value**, you will need this.
+ 7. Note the **Token value**.

  </Steps>

  ## 4. Install uv

- Next, you'll need to install a Python package manager, in this guide we'll be using [uv](https://docs.astral.sh/uv/). If you don't already have uv installed, follow the [installing uv guide](https://docs.astral.sh/uv/getting-started/installation/).
+ You need to install a Python package manager. In this guide, use [uv](https://docs.astral.sh/uv/). If you do not already have uv installed, follow the [installing uv guide](https://docs.astral.sh/uv/getting-started/installation/).

  ## 5. Install marimo

- We'll be using [marimo](https://github.com/marimo-team/marimo) as a Python notebook.
+ We will use [marimo](https://github.com/marimo-team/marimo) as a Python notebook.

  <Steps>
- 1. Create a directory where our notebook will live:
+ 1. Create a directory where our notebook will be stored:

  ```
  mkdir r2-data-catalog-notebook

@@ -269,7 +269,7 @@ We'll be using [marimo](https://github.com/marimo-team/marimo) as a Python noteb
  app.run()
  ```

- 3. Replace the `CATALOG_URI`, `WAREHOUSE` and `TOKEN` variables with your values from sections **2** and **3** respectively.
+ 3. Replace the `CATALOG_URI`, `WAREHOUSE`, and `TOKEN` variables with your values from sections **2** and **3** respectively.

  </Steps>
  In the Python notebook above, you:

@@ -286,7 +286,7 @@ In the Python notebook above, you:
  <LinkCard
    title="Managing catalogs"
-   href="/r2/data-catalog/managing-catalogs/"
+   href="/r2/data-catalog/manage-catalogs/"
    description="Enable or disable R2 Data Catalog on your bucket, retrieve configuration details, and authenticate your Iceberg engine."
  />

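For reference while reviewing step 3 above: once `CATALOG_URI`, `WAREHOUSE`, and `TOKEN` are filled in, the notebook's core flow reduces to a few PyIceberg and PyArrow calls. A condensed sketch follows; the namespace and table names are illustrative, not from the commit:

```python
# Condensed sketch of the notebook's flow; requires pyiceberg and pyarrow.
import pyarrow as pa
from pyiceberg.catalog.rest import RestCatalog

CATALOG_URI = "<CATALOG_URI>"  # from section 2
WAREHOUSE = "<WAREHOUSE>"      # from section 2
TOKEN = "<TOKEN>"              # from section 3

catalog = RestCatalog(name="r2-data-catalog", uri=CATALOG_URI, warehouse=WAREHOUSE, token=TOKEN)

catalog.create_namespace("default")  # raises if the namespace already exists
df = pa.table({"id": [1, 2, 3], "name": ["alpha", "beta", "gamma"]})
table = catalog.create_table("default.people", schema=df.schema)

table.append(df)                # write the PyArrow table into the Iceberg table
print(table.scan().to_arrow())  # read it back to verify
```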
src/content/docs/r2/data-catalog/index.mdx

Lines changed: 2 additions & 2 deletions
@@ -19,7 +19,7 @@ R2 Data Catalog is a managed [Apache Iceberg](https://iceberg.apache.org/) data
  R2 Data Catalog makes it easy to turn an R2 bucket into a data warehouse or lakehouse for a variety of analytical workloads including log analytics, business intelligence, and data pipelines. R2's zero-egress fee model means that data users and consumers can access and analyze data from different clouds, data platforms, or regions without incurring transfer costs.

- Refer to the [get started guide](/r2/data-catalog/get-started/) to start with R2 Data Catalog.
+ To get started with R2 Data Catalog, refer to the [R2 Data Catalog: Getting started](/r2/data-catalog/get-started/) guide.

  ## What is Apache Iceberg?

@@ -49,7 +49,7 @@ Similarly, data catalogs ensure consistent, coordinated access, which allows mul
  <LinkCard
    title="Managing catalogs"
-   href="/r2/data-catalog/managing-catalogs/"
+   href="/r2/data-catalog/manage-catalogs/"
    description="Enable or disable R2 Data Catalog on your bucket, retrieve configuration details, and authenticate your Iceberg engine."
  />

src/content/docs/r2/data-catalog/managing-catalogs.mdx renamed to src/content/docs/r2/data-catalog/manage-catalogs.mdx

Lines changed: 6 additions & 3 deletions
@@ -1,6 +1,6 @@
  ---
  pcx_content_type: configuration
- title: Managing catalogs
+ title: Manage catalogs
  description: Understand how to manage Iceberg REST catalogs associated with R2 buckets
  sidebar:
    order: 3

@@ -18,7 +18,10 @@ import {
  LinkCard,
  } from "~/components";

- Learn how to enable and disable [R2 Data Catalog](/r2/data-catalog/) on your buckets and authenticate Iceberg engines using API tokens.
+ Learn how to:
+
+ - Enable and disable [R2 Data Catalog](/r2/data-catalog/) on your buckets.
+ - Authenticate Iceberg engines using API tokens.

  ## Enable R2 Data Catalog on a bucket

@@ -72,7 +75,7 @@ To connect your Iceberg engine to R2 Data Catalog, you will need a Cloudflare AP
  2. Copy the **Token value** from your new API token.
  3. In your engine configuration, provide this token as a bearer token.
     Internally, this token will be sent as:
-
+
  ```
  Authorization: Bearer <TOKEN_VALUE>
  ```

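To make the bearer-token line above concrete: the same header works for any direct call against the catalog's REST endpoints. Here is a sketch using Python's `requests`, assuming the catalog URI is the REST base URL and that `GET /v1/config` behaves per the Iceberg REST catalog spec:

```python
# Sketch: verify a token by calling the Iceberg REST catalog config endpoint.
import requests

CATALOG_URI = "<CATALOG_URI>"   # catalog REST base URL for your bucket (placeholder)
TOKEN = "<TOKEN_VALUE>"         # the API token copied in step 2

resp = requests.get(
    f"{CATALOG_URI}/v1/config",                    # standard Iceberg REST endpoint
    headers={"Authorization": f"Bearer {TOKEN}"},  # exactly the header shown above
)
resp.raise_for_status()
print(resp.json())
```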
src/content/docs/r2/pricing.mdx

Lines changed: 8 additions & 8 deletions
@@ -24,13 +24,13 @@ To learn about potential cost savings from using R2, refer to the [R2 pricing ca
  ## R2 pricing

- |                                    | Standard storage         | Infrequent Access storage<InlineBadge preset="beta" />  |
- | ---------------------------------- | ------------------------ | ------------------------------------------------------ |
- | Storage                            | $0.015 / GB-month        | $0.01 / GB-month                                       |
- | Class A Operations                 | $4.50 / million requests | $9.00 / million requests                               |
- | Class B Operations                 | $0.36 / million requests | $0.90 / million requests                               |
- | Data Retrieval (processing)        | None                     | $0.01 / GB                                             |
- | Egress (data transfer to Internet) | Free [^1]                | Free [^1]                                              |
+ |                                    | Standard storage         | Infrequent Access storage <InlineBadge preset="beta" /> |
+ | ---------------------------------- | ------------------------ | ------------------------------------------------------- |
+ | Storage                            | $0.015 / GB-month        | $0.01 / GB-month                                        |
+ | Class A Operations                 | $4.50 / million requests | $9.00 / million requests                                |
+ | Class B Operations                 | $0.36 / million requests | $0.90 / million requests                                |
+ | Data Retrieval (processing)        | None                     | $0.01 / GB                                              |
+ | Egress (data transfer to Internet) | Free [^1]                | Free [^1]                                               |

  ### Free tier

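As a quick sanity check of the table above (a worked example with illustrative volumes, not part of the commit): storing 1 TB in Standard storage for a month with 1 million Class A and 10 million Class B operations costs, before the free tier is applied:

```python
# Worked example against the Standard storage column (illustrative volumes).
storage_gb = 1_000         # 1 TB stored for a full month, at $0.015 / GB-month
class_a_millions = 1       # Class A operations, in millions, at $4.50 / million
class_b_millions = 10      # Class B operations, in millions, at $0.36 / million

cost = storage_gb * 0.015 + class_a_millions * 4.50 + class_b_millions * 0.36
print(f"${cost:.2f} / month")  # 15.00 + 4.50 + 3.60 = $23.10
```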
@@ -82,7 +82,7 @@ For objects stored in Infrequent Access storage, you will be charged for the obj
  ## R2 Data Catalog pricing

- R2 Data Catalog is in **public beta**, and any developer with [R2 subscription](/r2/pricing/) can start using it. Currently, outside of standard R2 storage and operations, you will not be billed for your use of R2 Data Catalog. We'll provide at least 30 days notice before we make any changes or start charging for usage
+ R2 Data Catalog is in **public beta**, and any developer with an [R2 subscription](/r2/pricing/) can start using it. Currently, outside of standard R2 storage and operations, you will not be billed for your use of R2 Data Catalog. We will provide at least 30 days' notice before we make any changes or start charging for usage.

  To learn more about our thinking on future pricing, refer to the [R2 Data Catalog announcement blog](https://blog.cloudflare.com/r2-data-catalog-public-beta).
