 Please note that this approach may store the access token in plain text in the configuration file. A more secure way is to configure it through the OpenClaw onboarding wizard or the dashboard.
-
 ### Usage Examples
 
 After configuration, describe the OCR or document parsing task in natural language and provide a file URL or local path so the AI app can invoke the corresponding skill.
@@ -158,7 +156,7 @@ Make sure your working directory is the directory containing this file.
 
 2. Configure environment variables (see [Configure Environment Variables](#configure-environment-variables) for the list of variables). Choose one of the following methods:
 
-**Option A**: run the interactive configuration script.
skills/paddleocr-doc-parsing/SKILL.md
Lines changed: 9 additions & 14 deletions
@@ -1,9 +1,6 @@
 ---
 name: paddleocr-doc-parsing
-description: >
-  Advanced document parsing with PaddleOCR. Returns complete document
-  structure including text, tables, formulas, charts, and layout information. The AI agent extracts
-  relevant content based on user needs.
+description: Complex document parsing with PaddleOCR. Intelligently converts complex PDFs and document images into Markdown and JSON files that preserve the original structure.
 metadata:
   openclaw:
     requires:
@@ -198,11 +195,13 @@ Then return:
 
 ### First-Time Configuration
 
+You can generally assume that the required environment variables have already been configured. Only when a parsing task fails should you analyze the error message to determine whether it is caused by a configuration issue. If it is indeed a configuration problem, you should notify the user to fix it.
+
 **When API is not configured**:
 
 The error will show:
 ```
-PADDLEOCR_DOC_PARSING_API_URL not configured. Get your API at: https://paddleocr.com
+CONFIG_ERROR: PADDLEOCR_DOC_PARSING_API_URL not configured. Get your API at: https://paddleocr.com
 ```
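The `CONFIG_ERROR:` prefix added in this hunk makes it possible to tell configuration failures apart from other parsing failures by inspecting the script output. A minimal sketch of that check follows; the helper name `is_config_error` is hypothetical, and it assumes the script prints the message on a line of its own (the diff does not show how the script emits it).

```python
CONFIG_ERROR_PREFIX = "CONFIG_ERROR:"  # prefix taken from the error message in the diff


def is_config_error(output: str) -> bool:
    """Return True if any line of the script output starts with the CONFIG_ERROR prefix.

    Hypothetical helper: assumes the message appears at the start of a line.
    """
    return any(
        line.strip().startswith(CONFIG_ERROR_PREFIX)
        for line in output.splitlines()
    )


if __name__ == "__main__":
    failure = (
        "CONFIG_ERROR: PADDLEOCR_DOC_PARSING_API_URL not configured. "
        "Get your API at: https://paddleocr.com"
    )
    print(is_config_error(failure))
    print(is_config_error("Request timed out"))
```

Only when this check succeeds would the configuration workflow apply; any other failure is left to ordinary error handling.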
 
 **Configuration workflow**:
**Configuration workflow**:
@@ -217,29 +216,25 @@ PADDLEOCR_DOC_PARSING_API_URL not configured. Get your API at: https://paddleocr
 - PADDLEOCR_ACCESS_TOKEN
 - Optional: PADDLEOCR_DOC_PARSING_TIMEOUT
 ```
+- For security reasons, do not run `configure.py` or create a local `.env` file by default if the skill is installed under a host application directory (for example, `~/.claude/skills`). You should also advise the user not to do this.
 
-3. **If the user provides credentials in chat anyway** (accept any reasonable format):
+3. **If the user provides credentials in chat anyway** (accept any reasonable format), for example:
 - **Security note**: Warn the user that credentials shared in chat may be stored in conversation history. Recommend setting them through the host application's configuration instead when possible.
 
-4. **Parse and validate the values**:
+Then parse and validate the values:
 - Extract `PADDLEOCR_DOC_PARSING_API_URL` (look for URLs with `paddleocr.com` or similar)
 - Confirm `PADDLEOCR_DOC_PARSING_API_URL` is a full endpoint ending with `/layout-parsing`
 - Extract `PADDLEOCR_ACCESS_TOKEN` (long alphanumeric string, usually 40+ chars)
-- Tell the user exactly which environment variables to set
 
-5. **Ask the user to confirm the environment is configured**:
-- Wait for the user to confirm these values have been set in their host application, runtime environment, or appropriate config file
-- For security reasons, do not run `configure.py` or create a local `.env` file by default if the skill is installed under a host application directory (for example, `~/.claude/skills`)
+4. **Ask the user to confirm the environment is configured**.
 
-6. **Retry only after confirmation**:
+5. **Retry only after confirmation**:
 - Once the user confirms the environment variables are available, retry the original parsing task
 
-**IMPORTANT**: The error message format is STRICT and must be shown exactly as provided by the script. Do not modify or paraphrase it.
-
 ### Handling Large Files
 
 There is no file size limit for the API. For PDFs, the maximum is 100 pages per request.
description: Extracts text (with locations) from images and PDF documents using PaddleOCR.
 metadata:
   openclaw:
     requires:
@@ -156,6 +154,8 @@ The output JSON structure is as follows:
 
 ### First-Time Configuration
 
+You can generally assume that the required environment variables have already been configured. Only when an OCR task fails should you analyze the error message to determine whether it is caused by a configuration issue. If it is indeed a configuration problem, you should notify the user to fix it.
+
 **When API is not configured**:
 
 The error will show:
@@ -175,25 +175,23 @@ CONFIG_ERROR: PADDLEOCR_OCR_API_URL not configured. Get your API at: https://pad
 - PADDLEOCR_ACCESS_TOKEN
 - Optional: PADDLEOCR_OCR_TIMEOUT
 ```
+- For security reasons, do not run `configure.py` or create a local `.env` file by default if the skill is installed under a host application directory (for example, `~/.claude/skills`). You should also advise the user not to do this.
 
-3. **If the user provides credentials in chat anyway** (accept any reasonable format):
+3. **If the user provides credentials in chat anyway** (accept any reasonable format), for example:
 - **Security note**: Warn the user that credentials shared in chat may be stored in conversation history. Recommend setting them through the host application's configuration instead when possible.
 
-4. **Parse and validate the values**:
+Then parse and validate the values:
 - Extract `PADDLEOCR_OCR_API_URL` (look for URLs with `paddleocr.com` or similar)
 - Confirm `PADDLEOCR_OCR_API_URL` is a full endpoint ending with `/ocr`
 - Extract `PADDLEOCR_ACCESS_TOKEN` (long alphanumeric string, usually 40+ chars)
-- Tell the user exactly which environment variables to set
 
-5. **Ask the user to confirm the environment is configured**:
-- Wait for the user to confirm these values have been set in their host application, runtime environment, or appropriate config file
-- For security reasons, do not run `configure.py` or create a local `.env` file by default if the skill is installed under a host application directory (for example, `~/.claude/skills`)
+4. **Ask the user to confirm the environment is configured**.
 
-6. **Retry only after confirmation**:
+5. **Retry only after confirmation**:
 - Once the user confirms the environment variables are available, retry the original OCR task
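
The "parse and validate" checks that both SKILL.md files describe (a full endpoint ending with `/layout-parsing` or `/ocr`, and a long alphanumeric token) can be sketched as below. This is an illustration only: the function names are hypothetical, the 40-character threshold follows the "usually 40+ chars" wording and is a heuristic rather than a documented rule, and `https://example.com/...` is a placeholder, not a real endpoint.

```python
import re
from urllib.parse import urlparse

# Heuristic from the skill text: "long alphanumeric string, usually 40+ chars".
TOKEN_PATTERN = re.compile(r"[A-Za-z0-9]{40,}")


def is_valid_endpoint(url: str, required_suffix: str) -> bool:
    """Check that url is an absolute http(s) URL whose path ends with required_suffix."""
    parsed = urlparse(url.strip())
    return (
        parsed.scheme in ("http", "https")
        and bool(parsed.netloc)
        and parsed.path.endswith(required_suffix)
    )


def looks_like_token(value: str) -> bool:
    """Check that value consists only of 40 or more alphanumeric characters."""
    return TOKEN_PATTERN.fullmatch(value.strip()) is not None


if __name__ == "__main__":
    # Placeholder values for illustration.
    print(is_valid_endpoint("https://example.com/layout-parsing", "/layout-parsing"))
    print(is_valid_endpoint("https://example.com", "/ocr"))
    print(looks_like_token("a1" * 20))
```

Because the token rule is only described as "usually" holding, a failed `looks_like_token` check is better treated as a reason to ask the user to double-check the value than as a hard rejection.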