This repository was archived by the owner on Jul 22, 2025. It is now read-only.
Request for small sonar model #118
Unanswered
liam-twist asked this question in Q&A
Replies: 2 comments
-
I would also ask for a mini model. When I test answers in the current Sonar, the answers for my language (Polish) were better in the older model, and it also ran much faster. When I have a CSV file with 150 queries, the earlier model worked faster; with the current one it takes a little too long.
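The comment above describes running a CSV of 150 queries through the model one after another, where per-query latency dominates total runtime. A minimal sketch of one common mitigation, sending the queries concurrently, is below. The function names (`load_queries`, `run_batch`) and the injectable `ask` callable are hypothetical, not part of any Perplexity SDK; the actual HTTP call to the Sonar API would be supplied by the caller.

```python
# Hypothetical batch runner for a CSV of queries. Assumes one query per
# row in the first column; the `ask` callable (which would wrap the real
# API request) is injected so the batching logic stays network-free.
import csv
import concurrent.futures
from typing import Callable, List


def load_queries(path: str) -> List[str]:
    """Read one query per row from the first column of a CSV file."""
    with open(path, newline="", encoding="utf-8") as f:
        return [row[0] for row in csv.reader(f) if row]


def run_batch(queries: List[str],
              ask: Callable[[str], str],
              max_workers: int = 8) -> List[str]:
    """Send queries concurrently; results come back in input order."""
    with concurrent.futures.ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(ask, queries))
```

Concurrency hides per-request latency but does not reduce it, so a genuinely faster small model would still help; it also assumes the API's rate limits allow several requests in flight, which should be checked against the account's tier.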
-
+1 for this feature request. There is a real need for a 7/8B-parameter model for low-latency responses, or for the existing sonar model's inference speed to be significantly increased.
-
We're currently using the llama-3.1-sonar-small-128k-online model, which is set to be deprecated on 2025-02-22. We're moving over to the new sonar model but noticing it is significantly slower than the legacy small model; it seems to have similar speed to the legacy large model. For our use case, speed is rather critical... are there any plans to release a smaller, faster variation of the sonar model?