Extend deepseek-r1 support #606

Szpadel · 2025-01-27T20:47:00Z

This PR extends reasoning preview support to the DeepSeek API.
It no longer uses a system message (the model is not designed to use one).
It merges any consecutive messages of the same role (the DeepSeek API rejects requests that do not follow an alternating user/assistant pattern). This was also extended to models loaded through OpenRouter - I assume they know that it might confuse the model.
While testing I discovered that, when using the DeepSeek API, a chunk in Cline.ts was sometimes undefined - I wasn't able to trace the source of it; the stack trace came directly from a Node.js microtask. I added a workaround that just ignores such chunks so the code won't crash. This should have no negative impact, but it would be great if anyone could explain where they come from.
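
For readers skimming the thread, the merging step described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the `ChatRole` and `ChatMessage` types and the `mergeConsecutiveMessages` name are hypothetical, message content is assumed to be plain text, and joining with a newline is an assumption.

```typescript
// Hypothetical message shape for illustration; the project uses provider SDK types.
type ChatRole = "user" | "assistant"

interface ChatMessage {
	role: ChatRole
	content: string
}

// Merge adjacent messages that share a role so that roles strictly alternate.
// The input array and its messages are left unmodified.
function mergeConsecutiveMessages(messages: ChatMessage[]): ChatMessage[] {
	const merged: ChatMessage[] = []
	for (const message of messages) {
		const last = merged[merged.length - 1]
		if (last !== undefined && last.role === message.role) {
			// Same role as the previous entry: append the text (newline separator is an assumption).
			last.content = `${last.content}\n${message.content}`
		} else {
			// Role changed (or first message): start a new entry with a shallow copy.
			merged.push({ ...message })
		}
	}
	return merged
}

const conversation: ChatMessage[] = [
	{ role: "user", content: "You are a coding assistant." },
	{ role: "user", content: "Fix the failing test." },
	{ role: "assistant", content: "Done." },
	{ role: "assistant", content: "Anything else?" },
	{ role: "user", content: "No, thanks." },
]

// Five input messages collapse to three, with roles user, assistant, user.
console.log(mergeConsecutiveMessages(conversation))
```

In this sketch the first two user messages are joined into one, which is also one possible way to carry instructions that would otherwise have gone into a system message.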

Description

Type of change

Bug fix (non-breaking change which fixes an issue)
New feature
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

How Has This Been Tested?

Tested the deepseek-r1 model using OpenRouter and the DeepSeek API (this took a few days, as they seem to have had a big outage since Saturday and almost all requests return only :keep-alive).
Tested a few other models to make sure there is no regression.

Checklist:

My code follows the patterns of this project
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation

Additional context

Related Issues

Reviewers

Important

Extend deepseek-r1 support by merging consecutive messages and handling undefined chunks.

Behavior:
- Extend support for deepseek-r1 model in openai.ts and openrouter.ts by merging consecutive messages of the same role using convertToR1Format().
- Remove system message usage for deepseek-reasoner in openai.ts.
- Add workaround in Cline.ts to ignore undefined chunks during streaming.
Functions:
- Add convertToR1Format() in r1-format.ts to merge consecutive messages of the same role.
Misc:
- Adjust message handling in openai.ts and openrouter.ts to accommodate deepseek-r1 model requirements.
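
The undefined-chunk workaround listed above amounts to a guard at the top of the chunk-handling loop. The sketch below only illustrates that idea: the `StreamChunk` shape and the `collectChunks` helper are hypothetical, and a plain array stands in for the real stream (which would normally be consumed with `for await`).

```typescript
// Hypothetical chunk shape for illustration only.
type StreamChunk = { type: "text"; text: string } | { type: "reasoning"; text: string }

// Process chunks in order, skipping any undefined entries instead of crashing on them.
function collectChunks(chunks: Array<StreamChunk | undefined>): string[] {
	const output: string[] = []
	for (const chunk of chunks) {
		if (!chunk) {
			// Source of these values is unknown (possibly keep-alives); ignore and continue.
			continue
		}
		output.push(`${chunk.type}:${chunk.text}`)
	}
	return output
}

const sampleChunks: Array<StreamChunk | undefined> = [
	{ type: "reasoning", text: "thinking" },
	undefined,
	{ type: "text", text: "answer" },
]

// The undefined entry is dropped; the two real chunks are kept in order.
console.log(collectChunks(sampleChunks))
```

The same `if (!chunk) continue` check would sit as the first statement inside an asynchronous streaming loop.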

This description was created by ellipsis-dev bot for cb23be6. It will automatically update as commits are pushed.

changeset-bot · 2025-01-27T20:47:03Z

⚠️ No Changeset found

Latest commit: 18c7f57

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types


mrubens

This is great, thank you! Do you mind adding a test for the transformer? Once you do I'm happy to merge this. Really appreciate the contributions 🙌

mrubens · 2025-01-28T05:16:29Z

Actually I just added a test. I would love to verify myself that this works, but haven't been able to get r1 working 😞 Hopefully by tomorrow I can verify it and then merge this in. Thanks again!

Szpadel · 2025-01-28T10:24:50Z

Thanks for introducing tests.
FYI, I currently have a much better chance of getting successful responses from DeepSeek, so if you have time it might be a good moment for testing (they start responding after about 2 min).

I discovered that the chunk issue also currently affects Cline when used with the DeepSeek API.
I still do not understand what emits those undefined values, but I suspect it might be related to the emitted keep-alives.

mrubens · 2025-01-28T12:30:23Z

If you’ve tested well let’s go for it 🙏

Claw256 · 2025-01-28T15:54:23Z

Hiya, does this change also apply to the new R1 Nitro model on OpenRouter?: https://openrouter.ai/deepseek/deepseek-r1:nitro

Szpadel · 2025-01-28T16:40:37Z

@Claw256 no, but it's a trivial change to make.
Do you know what the difference is between Nitro and the standard one?
The price and parameters reported by OpenRouter look identical for both models.

mrubens · 2025-01-29T16:24:58Z

@Szpadel I just saw this issue - any thoughts? Thank you! #641

Szpadel · 2025-01-29T18:25:01Z

I see the issue, I will provide a fix in 5 min.

Extend deepseek-r1 support

cb23be6

Szpadel requested review from ColemanRoo, mrubens and stea9499 as code owners January 27, 2025 20:47

mrubens reviewed Jan 27, 2025

Add test

18c7f57

mrubens approved these changes Jan 28, 2025

mrubens merged commit f07109b into RooCodeInc:main Jan 28, 2025
4 checks passed

Szpadel mentioned this pull request Jan 28, 2025

Run Local DeepSeek-R1-Distill-Qwen32B on Ollama, Error #611

Closed

3 participants
