Skip to content

Add support for OpenAI's gpt-oss open weight models #584

@0xdevalias

Description

@0xdevalias
  • https://openai.com/open-models/
    • Open models by OpenAI

    • gpt-oss-120b
      A large open model designed to run in data centers and on high-end desktops and laptops.

    • gpt-oss-20b
      A medium-sized open model that can run on most desktops and laptops.

  • https://node-llama-cpp.withcat.ai/blog/v3.12-gpt-oss
    • gpt-oss is here!

    • August 10, 2025

    • node-llama-cpp v3.12 is here, with full support for gpt-oss models!

    • gpt-oss

      gpt-oss comes in two flavors:

      • gpt-oss-20b - 21B parameters with 3.6B active parameters
      • gpt-oss-120b - 117B parameters with 5.1B active parameters

      Here are a few highlights of these models:

      • Due to the low number of active parameters, these models are very fast
      • These are reasoning models, and you can adjust their reasoning effort
      • They are very good at function calling, and are built with agentic capabilities in mind
      • These models were trained with native MXFP4 precision, so no need to quantize them further. They're small compared to their capabilities already
      • They are provided with an Apache 2.0 license, so you can use them in your commercial applications

See Also

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions