Testing Fix/issue 400 clean #474

tylerslaton · 2025-10-03T18:08:31Z

testing #471

Replace fragile usage_metadata-based logic with robust streaming detection that checks multiple explicit streaming indicators. **Problem:** The original logic relied on `not adk_event.usage_metadata` to determine if an event should be processed as streaming. This was fragile because Claude models can include usage_metadata even in streaming chunks, causing responses to disappear. **Solution:** Implement comprehensive streaming detection that checks: - `partial` attribute (explicitly marked as partial) - `turn_complete` attribute (live streaming completion status) - `is_final_response()` method (final response indicator) - `finish_reason` attribute (fallback for content without finish reason) This ensures all streaming content is captured regardless of usage_metadata presence, fixing compatibility with Claude Sonnet 4 and other models. **Testing:** ✅ All 277 tests pass ✅ Streaming detection works across different model providers 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <[email protected]>

…k-agent Add regression test for partial final ADK chunks

Change TextMessageContentEvent to TextMessageChunkEvent in test to match actual AG-UI protocol event types. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <[email protected]>

…ttranslator

…ttranslator Add test for ADK streaming fallback branch

…e for streaming event

contextablemark · 2025-10-03T20:08:04Z

@tylerslaton I see the failures - I'll take a look this evening.

The Tool Based Generative UI haiku test was exhibiting flaky behavior where it would sometimes pass and sometimes fail with the same test conditions. The test was more reliable when run with --headed than when run headless, suggesting a timing-related issue. Root cause: The extractMainDisplayHaikuContent() method was concatenating ALL visible haiku lines from the main display, while the chat extraction only captured the most recent haiku. When multiple haikus were displayed simultaneously (due to rendering timing), this caused mismatches. Fix: Modified extractMainDisplayHaikuContent() to extract only the last 3 lines (the most recent haiku), matching the behavior of the chat extraction and eliminating timing-related flakiness. This affects all 10 platform integration tests that use ToolBaseGenUIPage. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <[email protected]>

Setup Workload Identity Federation (cherry picked from commit 979b3dc)

🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <[email protected]>

Add fallback logic to detect streaming completion using finish_reason when is_final_response returns False but finish_reason is set. **Problem:** Gemini returns events with partial=True and is_final_response()=False even on the final chunk that contains finish_reason="STOP". This caused streaming messages to remain open and require force-closing, resulting in warnings. **Solution:** Enhanced should_send_end logic to check for finish_reason as a fallback: - Check if finish_reason attribute exists and is truthy - If streaming is active and finish_reason is present, emit TEXT_MESSAGE_END - Formula: should_send_end = (is_final_response and not is_partial) or (has_finish_reason and self._is_streaming) **Testing:** ✅ All 277 tests pass ✅ Added test_partial_with_finish_reason to verify the fix ✅ Eliminates "Force-closing unterminated streaming message" warnings ✅ Properly emits TEXT_MESSAGE_END for events with finish_reason 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <[email protected]>

- Prefer LRO routing in ADKAgent when long‑running tool call IDs are present in event.content.parts (prevents misrouting into streaming path and tool loops; preserves HITL pause) - Force‑close any active streaming text before emitting LRO tool events (guarantees TEXT_MESSAGE_END precedes TOOL_CALL_START) - Harden EventTranslator.translate to filter out long‑running tool calls from the general path; only emit non‑LRO calls (avoids duplicate tool events) - Add tests: * test_lro_filtering.py (translator‑level filtering + LRO‑only emission) * test_integration_mixed_partials.py (streaming → non‑LRO → final LRO: order, no duplicates, correct IDs)

contextablemark and others added 10 commits October 2, 2025 00:00

Merge branch 'ag-ui-protocol:main' into fix/issue-400-clean

603e294

test: ensure partial final chunks use streaming translation

67d3fe5

Merge pull request #79 from Contextable/codex/add-asyncio-test-for-ad…

ed8d02c

…k-agent Add regression test for partial final ADK chunks

test: cover turn complete fallback in ADK agent

2b47630

fix: correct event type in partial final chunk test

d7e2fb9

Change TextMessageContentEvent to TextMessageChunkEvent in test to match actual AG-UI protocol event types. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <[email protected]>

Merge branch 'fix/issue-400-clean' into codex/add-async-test-for-even…

b76bcde

…ttranslator

Merge pull request #80 from Contextable/codex/add-async-test-for-even…

e38aaf0

…ttranslator Add test for ADK streaming fallback branch

Add test for streaming finish reason fallback

014d05b

Fix test_streaming_finish_reason_fallback: set is_final_response=Fals…

660b564

…e for streaming event

tylerslaton requested review from mme, ranst91, ataibarkai, maxkorp and NathanTarbert as code owners October 3, 2025 18:08

contextablemark and others added 13 commits October 3, 2025 22:41

Update dojo-e2e.yml

748fec6

Setup Workload Identity Federation (cherry picked from commit 979b3dc)

Reverting Workload Identity Federation

0bbcd9e

Re-adding linefeed so the file matches up exactly.

81f057c

test: update dojo-e2e workflow

c2db02d

🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <[email protected]>

Re-adding temporary removals.

d8fadcc

tests: make function-call detection assertion semantic

eb79c97

tests: align EventTranslator streaming expectations

4dde1b0

tests: reconcile EventTranslator comprehensive expectations

0479cb6

fix: restore LRO routing guard and streaming tests

a61605a

tests: stabilize ToolBaseGenUIPage haiku comparison

01cab61

contextablemark mentioned this pull request Oct 5, 2025

Fix/issue 400 #471

Closed

test(adk): restore SystemMessage between tests

5aca8b3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Testing Fix/issue 400 clean #474

Testing Fix/issue 400 clean #474

Uh oh!

tylerslaton commented Oct 3, 2025

Uh oh!

contextablemark commented Oct 3, 2025

Uh oh!

Uh oh!

Testing Fix/issue 400 clean #474

Are you sure you want to change the base?

Testing Fix/issue 400 clean #474

Uh oh!

Conversation

tylerslaton commented Oct 3, 2025

Uh oh!

contextablemark commented Oct 3, 2025

Uh oh!

Uh oh!