[Feature Request]: Leverage prompt caching by swapping instructions & content #1699
y0hnn started this conversation in Feature requests
What needs to be done?
The prompt should be reordered to give the instructions first, and only then the URL & content: https://github.com/unclecode/crawl4ai/blob/main/crawl4ai/prompts.py
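For illustration, here is a minimal sketch of the proposed ordering (the function and placeholder names are illustrative, not the actual template from crawl4ai/prompts.py): the static instructions form the leading, cacheable part of the prompt, and the page-specific URL and content come at the end.

```python
# Minimal sketch of the proposed prompt ordering (illustrative names only,
# not the real crawl4ai template): providers that cache identical prompt
# prefixes can reuse the static instruction block across every page, while
# the per-page URL and HTML only appear at the end.

def build_extraction_prompt(instructions: str, url: str, html: str) -> str:
    static_prefix = (
        "You are extracting structured data from web pages.\n"
        "Instructions:\n"
        f"{instructions}\n"
    )
    page_specific_suffix = (
        f"URL: {url}\n"
        "Content:\n"
        f"{html}\n"
    )
    # Shared, cache-friendly prefix first; page-specific suffix last.
    return static_prefix + page_specific_suffix
```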
What problem does this solve?
We could leverage prompt caching. Depending on the provider, cached input tokens can be up to 90% cheaper than regular input tokens. In my use case, I'm crawling many different pages with the same set of instructions.
If we restructure the prompt this way, we could also benefit from prompt caching on the HTML content itself, which is often largely the same from page to page (for example, a common body structure).
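As a rough, back-of-the-envelope illustration of the potential savings, here is a sketch assuming a provider that discounts cached input tokens by 90%; the prices and token counts are made-up placeholders, not taken from any pricing page.

```python
# Hypothetical numbers only: illustrates why caching the shared instruction
# prefix matters when the same instructions are reused across many pages.

INPUT_PRICE_PER_1M = 2.50      # USD per 1M uncached input tokens (placeholder)
CACHED_DISCOUNT = 0.90         # cached input tokens up to 90% cheaper (per the claim above)

SHARED_PREFIX_TOKENS = 1_500   # instructions reused on every page (placeholder)
PER_PAGE_TOKENS = 4_000        # URL + HTML, unique per page (placeholder)
PAGES = 10_000

def total_cost(prefix_is_cached: bool) -> float:
    prefix_rate = INPUT_PRICE_PER_1M * ((1 - CACHED_DISCOUNT) if prefix_is_cached else 1.0)
    prefix_cost = PAGES * SHARED_PREFIX_TOKENS / 1e6 * prefix_rate
    content_cost = PAGES * PER_PAGE_TOKENS / 1e6 * INPUT_PRICE_PER_1M
    return prefix_cost + content_cost

print(f"Instructions last (never cached):   ${total_cost(False):.2f}")
print(f"Instructions first (prefix cached): ${total_cost(True):.2f}")
```

With these placeholder numbers the cost of the instruction tokens drops from $37.50 to $3.75 while the page-content cost is unchanged, so the benefit grows with the size of the shared instruction block.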
Target users/beneficiaries
Everyone using LLM extraction; it would be cheaper overall. For example, with OpenAI: https://platform.openai.com/docs/pricing
Current alternatives/workarounds
No response
Proposed approach
No response