Adding ThinkingBlock to Ollama and Bedrock Converse #19936
Conversation
Ok, this should now be good to go. I tested e2e multiple times with both Ollama and Bedrock Converse, and both work well with thinking :))
additional_kwargs={
    "tool_calls": all_tool_calls,
    "thinking": thinking_txt,
    "tool_calls": list(set(all_tool_calls)),
This looks like a revert of the fix made about a day back: 8079db7
Mmh maybe the merge did this wrong, but technically we are addressing the tool_calls kwargs in this PR: #19947
Technically tool_calls won't be needed anymore :)
I think this will still break though; we should probably go back to all_tool_calls to avoid the error in the PR that tyson linked.
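For context, here is one way the set()-based dedup can fail, assuming the tool-call entries are unhashable objects such as plain dicts (the actual error in the linked PR may be something else):

# Illustration only: hypothetical tool-call entries; the real objects in the integration may differ.
all_tool_calls = [
    {"name": "multiply", "args": {"a": 2.0, "b": 3.0}},
    {"name": "multiply", "args": {"a": 2.0, "b": 3.0}},  # duplicate entry
]

try:
    deduped = list(set(all_tool_calls))
except TypeError as exc:
    print(exc)  # unhashable type: 'dict'

# Order-preserving dedup that also works for unhashable items:
deduped = [tc for i, tc in enumerate(all_tool_calls) if tc not in all_tool_calls[:i]]
print(deduped)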
Looks like some merge conflicts got created (sorry, that might be my fault lol)
if content_block_delta := chunk.get("contentBlockDelta"):
    content_delta = content_block_delta["delta"]
    content = join_two_dicts(content, content_delta)
    thinking = ""
Is it only returning the full thinking/reasoning text, or is it streaming?
If it's streaming, setting thinking = "" means we are removing the complete thinking string, right?
Might want to update the tests to check if the thinking is over, like, 50 chars?
As far as I can tell from testing the streaming behavior, this basically means we get a separate thinking chunk for each streamed response, instead of an incrementally growing one. So, rather than this:
The quick brown fox j
The quick brown fox jumps over th
The quick brown fox jumps over the lazy dog
We would have this:
The quick brown fox
jumps over the
lazy dog
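In other words, with deltas the consumer has to concatenate the chunks itself to rebuild the full thinking text. A toy illustration of the two shapes (plain strings only, not the actual chunk objects):

cumulative_chunks = [
    "The quick brown fox j",
    "The quick brown fox jumps over th",
    "The quick brown fox jumps over the lazy dog",
]
delta_chunks = ["The quick brown fox ", "jumps over the ", "lazy dog"]

# With cumulative chunks, the last chunk already holds the full text;
# with delta chunks, concatenation rebuilds it.
assert "".join(delta_chunks) == cumulative_chunks[-1]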
Also, the test for the streaming of thinking blocks does not check the character length; it checks the number of thinking blocks produced (which should be non-zero). I will add a test for character length, tho, to cover the streaming.
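Something like this could serve as that length check; it is only a sketch, and both the llm fixture and the additional_kwargs["thinking"] field are assumptions about the streaming interface rather than the final test:

import pytest

@pytest.mark.asyncio  # requires pytest-asyncio
async def test_stream_thinking_length(llm):  # hypothetical fixture providing a thinking-enabled LLM
    thinking = ""
    gen = await llm.astream_complete("Think step by step: what is 1244 * 12?")
    async for chunk in gen:
        # assumes each chunk carries only its new thinking text in additional_kwargs["thinking"]
        thinking += (chunk.additional_kwargs or {}).get("thinking") or ""
    assert len(thinking) > 50  # the model should emit a non-trivial reasoning trace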
delta=content_delta.get("text", None)
    or content_delta.get("thinking", None)
    or "",
This is a bit of a breaking change, right? Because now it's streaming the thinking and the regular text deltas in the same string field.
We might want to save this for when we have proper streaming content blocks.
(The other async streaming method also doesn't do this)
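A toy illustration of the concern, using plain dicts in place of the real content deltas (the "text" and "thinking" keys are taken from the diff above; everything else is made up):

content_deltas = [
    {"thinking": "First multiply 1244 by 12... "},
    {"text": "The result is 63.65."},
]

# Pattern from the diff: thinking falls back into the same string field as text,
# so a consumer that prints deltas as the answer now also prints the reasoning.
for content_delta in content_deltas:
    delta = content_delta.get("text", None) or content_delta.get("thinking", None) or ""
    print(delta, end="")
print()

# Keeping the two streams separate leaves the choice to the consumer:
for content_delta in content_deltas:
    if content_delta.get("thinking"):
        print("[thinking]", content_delta["thinking"])
    if content_delta.get("text"):
        print("[text]", content_delta["text"])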
llama-index-integrations/llms/llama-index-llms-ollama/llama_index/llms/ollama/base.py (resolved)
Ok @logan-markewich, just implemented your requested changes and merged main, hopefully I did not break anything ^_^
Made quite a few changes lol but got it working
fyi I was testing with this e2e script. It will probably be helpful for the tool calling block PR (and maybe for double-checking the other thinking LLMs):
from llama_index.core.agent import FunctionAgent, AgentStream
from llama_index.llms.ollama import Ollama
from llama_index.llms.bedrock_converse import BedrockConverse
from llama_index.core.workflow import Context

# Fill in with an Ollama or BedrockConverse instance (with thinking enabled) to test against.
llm = ...


async def get_the_greeting() -> str:
    """Useful for getting the current greeting for new people/users."""
    return "Good day sir, top of the morning to ye."


async def multiply(a: float, b: float) -> float:
    """Useful for multiplying two numbers."""
    return float(a) * float(b)


async def divide(a: float, b: float) -> float:
    """Useful for dividing two numbers."""
    return float(a) / float(b)


agent = FunctionAgent(
    llm=llm,
    tools=[get_the_greeting, multiply, divide],
)
ctx = Context(agent)


async def main():
    # Plain (non-agent) sanity checks
    resp = llm.complete("Hello!")
    resp = await llm.acomplete("Hello!")

    # Simple tool-calling turn, streaming the agent output
    handler = agent.run("Hello! I am a new user!", ctx=ctx)
    async for ev in handler.stream_events():
        if isinstance(ev, AgentStream):
            print(ev.delta, end="", flush=True)
    print()

    resp = await handler
    print(str(resp))
    print()
    for block in resp.response.blocks:
        print(block)
    print()

    # Turn that should trigger thinking plus multiple tool calls
    handler = agent.run("What is 1244 * 12 / 234.5? Think carefully", ctx=ctx)
    async for ev in handler.stream_events():
        if isinstance(ev, AgentStream):
            print(ev.delta, end="", flush=True)
    print()

    resp = await handler
    print(str(resp))
    print()
    for block in resp.response.blocks:
        print(block)
    print()

    resp = await agent.run("thanks!", ctx=ctx)
    print(resp)


if __name__ == "__main__":
    import asyncio

    asyncio.run(main())
weird,
With this PR we add: