We are pleased to submit our recently open-sourced model, Nanbeige4-3B-Thinking-2511, for evaluation in the Chatbot Arena. The model scores 60 on the Arena-Hard V2 benchmark, a result attained through advanced distillation and reinforcement learning optimization.
Model Details:
Despite its small size (3B parameters), Nanbeige4-3B-Thinking-2511 exhibits competitive performance across a broad range of tasks, making it a compelling case study for efficient language modeling. We believe its inclusion in Chatbot Arena will provide valuable insights into the capabilities of parameter-efficient models in real-world, human-preference-driven evaluations.
We are happy to provide any additional support required for evaluation, such as API access.
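For convenience, below is a minimal sketch of how evaluators could load the open-sourced checkpoint with Hugging Face Transformers; the repository ID and the generation settings shown are our assumptions for illustration, not a confirmed serving setup.

```python
# Minimal sketch: loading the open-sourced checkpoint with Hugging Face Transformers.
# The repository ID and generation settings below are assumptions for illustration.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Nanbeige/Nanbeige4-3B-Thinking-2511"  # assumed Hugging Face repo ID

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"
)

# Build a single-turn chat prompt and generate a response.
messages = [{"role": "user", "content": "Explain why the sky is blue in two sentences."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```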
Thank you for your consideration. We look forward to your feedback and the opportunity to contribute to the community.