Skip to content

Commit 3e2f6f3

Browse files
committed
Prompt caching/conversation summarization
1 parent 4ee796f commit 3e2f6f3

File tree

2 files changed

+13
-1
lines changed

2 files changed

+13
-1
lines changed
97.4 KB
Loading

release-notes/v1_100.md

Lines changed: 13 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -97,7 +97,19 @@ The keyboard shortcut `kb(workbench.action.chat.open)` still just opens the chat
9797

9898
### Agent informed of undos and manual edits
9999

100-
Previously, making manual edits during an Agent mode session could confuse the model. Now, the agent is prompted about your changes, and should re-read files when necessary before editing files that may have changed.
100+
Previously, making manual edits during an agent mode session could confuse the model. Now, the agent is prompted about your changes, and should re-read files when necessary before editing files that may have changed.
101+
102+
### Conversation history summarized and optimized for prompt caching
103+
104+
We've made some changes to how our agent mode prompt is built to optimize for prompt caching. Prompt caching is a way to speed up model responses by maintaining a stable prefix for the prompt. The next request is able to resume from that prefix, and the result is that each request should be a bit faster. This is especially effective in a repetitive series of requests with large context, like you typically have in agent mode.
105+
106+
When your conversation gets long, or your context gets very large, you may see this message in your agent mode session:
107+
108+
![summarized conversation message](images/1_100/summarized-conversation.png)
109+
110+
Instead of keeping the whole conversation as a FIFO, breaking the cache, we compress the conversation so far into a summary of the most important information and the current state of your task. This keeps the prompt prefix stable, and your responses fast.
111+
112+
If you prefer, you can disable this with `setting(github.copilot.chat.advanced.summarizeAgentConversationHistory)`.
101113

102114
## Workbench
103115

0 commit comments

Comments
 (0)