From e04551e1e7281b9b5f481ff628ac316cefd6efcd Mon Sep 17 00:00:00 2001 From: Hemendu Roy <89282414+hemenduroy@users.noreply.github.com> Date: Wed, 6 Aug 2025 12:25:36 -0700 Subject: [PATCH] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 7d4f2791..0a9f0d22 100644 --- a/README.md +++ b/README.md @@ -171,7 +171,7 @@ huggingface-cli download openai/gpt-oss-20b --include "original/*" --local-dir g We include an inefficient reference PyTorch implementation in [gpt_oss/torch/model.py](gpt_oss/torch/model.py). This code uses basic PyTorch operators to show the exact model architecture, with a small addition of supporting tensor parallelism in MoE so that the larger model can run with this code (e.g., on 4xH100 or 2xH200). In this implementation, we upcast all weights to BF16 and run the model in BF16. -To run the reference implementation. Install dependencies: +To run the reference implementation, install these dependencies: ```shell pip install -e .[torch]