Summarizing for Multi-Turn Conversations #818
base: main
Conversation
Code Review
This pull request introduces a mechanism to summarize multi-turn conversation histories when the context length exceeds a certain threshold. The implementation adds a new configuration flag summarize_chat, a method to construct and perform the summarization, and integrates this into the agent loop. My review focuses on improving the maintainability, robustness, and configurability of the new summarization logic. I've suggested moving a large hardcoded prompt to a constant, simplifying some redundant code, handling a potential failure case in summary extraction, and making the summarization threshold configurable. These changes should make the feature more robust and easier to manage.
```python
if match:
    summary_text = match.group(1).strip()
```
If the language model does not return the summary within <summary> tags, re.search will return None, and the original, unparsed summary_text will be used. This could lead to a malformed context for the next turn. It's safer to handle this case, for instance, by logging a warning.
Suggested change:
```python
if match:
    summary_text = match.group(1).strip()
else:
    logger.warning("Could not find <summary> tags in the summarization response. Using the full response as summary.")
```
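The suggested fallback can be pulled into a small standalone helper, which also makes it easy to unit-test. The function name `extract_summary` is an assumption; the regex and warning text follow the suggestion above.

```python
import logging
import re

logger = logging.getLogger(__name__)

def extract_summary(response: str) -> str:
    """Return the text inside <summary>...</summary>, or fall back to the
    whole response with a warning when the tags are missing."""
    match = re.search(r"<summary>(.*?)</summary>", response, re.DOTALL)
    if match:
        return match.group(1).strip()
    logger.warning(
        "Could not find <summary> tags in the summarization response. "
        "Using the full response as summary."
    )
    return response.strip()
```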
```python
summary_prompt = """
Your operational context is full. Generate a concise summary by populating the template below.
This summary will be your sole context for continuing this task. Be brief but ensure all critical data is present.

- Mission Objective
  - Original query: [State the user's verbatim query.]
  - Verification checklist: [Status (VERIFIED/PENDING)] [Checklist item]
- Key Findings
  - Sources: [List the most critical, verified facts with sources.]
  - Discrepancies: [Note any conflicting information found between sources.]
- Tactical Plan
  - Promising leads: [List the best remaining keywords, sources, or angles to investigate.]
  - Known dead ends: [List queries or sources that proved useless to avoid repetition.]
  - Immediate next action: [State the exact tool call or query you were about to execute next.]

Now generate the summary, and put your summary inside tag <summary></summary>.
"""
```
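The review summary above recommends moving this large hardcoded prompt to a constant. A minimal sketch of that refactor, with the constant name `SUMMARIZE_PROMPT` and the helper `append_summarize_prompt` assumed for illustration (the full template text from the PR is elided here):

```python
# Module-level constant so the prompt is defined once, not rebuilt
# inside the agent loop on every summarization.
SUMMARIZE_PROMPT = """\
Your operational context is full. Generate a concise summary by populating the template below.
(... full template as in the PR ...)
Now generate the summary, and put your summary inside tag <summary></summary>.
"""

def append_summarize_prompt(chat_history, prompt=SUMMARIZE_PROMPT):
    """Build the summarization request by appending the prompt as a user turn."""
    return chat_history + [{"role": "user", "content": prompt}]
```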
```python
history_to_summarize = self.chat_history[initial_chat_history_length:]
summarize_request = self.chat_history[:initial_chat_history_length].copy()
summarize_request.extend(history_to_summarize)
```
The logic to create summarize_request can be simplified. These three lines are equivalent to self.chat_history.copy().
Suggested change:
```python
summarize_request = self.chat_history.copy()
```
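The claimed equivalence is easy to verify: slicing a list at any index and re-extending with the remainder reproduces the original list. A quick check, with the PR's three-line construction wrapped in a hypothetical function:

```python
def build_summarize_request(chat_history, initial_chat_history_length):
    # The PR's original three-line construction, verbatim in structure.
    history_to_summarize = chat_history[initial_chat_history_length:]
    summarize_request = chat_history[:initial_chat_history_length].copy()
    summarize_request.extend(history_to_summarize)
    return summarize_request

history = [
    {"role": "system", "content": "s"},
    {"role": "user", "content": "u"},
    {"role": "assistant", "content": "a"},
]
# For every split point, the result equals a plain shallow copy.
for n in range(len(history) + 1):
    assert build_summarize_request(history, n) == history.copy()
```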
```python
    done=False,
)

threshold = int(max_input_length * 0.8)
```
The summarization threshold is hardcoded as 80% of max_input_length. This magic number makes the code harder to maintain. It's better to make it a configurable parameter with a default value.
Suggested change:
```python
threshold = int(max_input_length * self.generator_cfg.get("summarization_threshold_ratio", 0.8))
```
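The suggested config lookup can be isolated in a small helper, assuming `generator_cfg` is a plain dict (the helper name `summarization_threshold` is hypothetical):

```python
def summarization_threshold(max_input_length: int, generator_cfg: dict) -> int:
    """Derive the summarization trigger point from config, defaulting to
    80% of the model's maximum input length."""
    ratio = generator_cfg.get("summarization_threshold_ratio", 0.8)
    return int(max_input_length * ratio)
```

With this in place, operators can tune how aggressively summarization kicks in without touching the agent-loop code.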
No description provided.