Skip to content

Commit dd84d16

Browse files
committed
release v0.2.32
1 parent af4dfe3 commit dd84d16

File tree

3 files changed

+6
-6
lines changed

3 files changed

+6
-6
lines changed

README.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -66,7 +66,7 @@ pip3 install -e ".[model_worker,webui]"
6666

6767
## Model Weights
6868
### Vicuna Weights
69-
[Vicuna](https://lmsys.org/blog/2023-03-30-vicuna/) is based on LLaMA and should be used under LLaMA's [model license](https://github.com/facebookresearch/llama/blob/main/LICENSE).
69+
[Vicuna](https://lmsys.org/blog/2023-03-30-vicuna/) is based on Llama 2 and should be used under Llama's [model license](https://github.com/facebookresearch/llama/blob/main/LICENSE).
7070

7171
You can use the commands below to start chatting. It will automatically download the weights from Hugging Face repos.
7272
See more command options and how to handle out-of-memory in the "Inference with Command Line Interface" section below.
@@ -84,7 +84,7 @@ See more command options and how to handle out-of-memory in the "Inference with
8484
**Old weights**: see [docs/vicuna_weights_version.md](docs/vicuna_weights_version.md) for all versions of weights and their differences.
8585

8686
### LongChat
87-
We release [LongChat](https://lmsys.org/blog/2023-06-29-longchat/) models under LLaMA's [model license](https://github.com/facebookresearch/llama/blob/main/LICENSE).
87+
We release [LongChat](https://lmsys.org/blog/2023-06-29-longchat/) models under Llama's [model license](https://github.com/facebookresearch/llama/blob/main/LICENSE).
8888

8989
| Size | Chat Command | Hugging Face Repo |
9090
| --- | --- | --- |
@@ -276,7 +276,7 @@ MT-bench is the new recommended way to benchmark your models. If you are still l
276276
## Fine-tuning
277277
### Data
278278

279-
Vicuna is created by fine-tuning a LLaMA base model using approximately 125K user-shared conversations gathered from ShareGPT.com with public APIs. To ensure data quality, we convert the HTML back to markdown and filter out some inappropriate or low-quality samples. Additionally, we divide lengthy conversations into smaller segments that fit the model's maximum context length. For detailed instructions to clean the ShareGPT data, check out [here](docs/commands/data_cleaning.md).
279+
Vicuna is created by fine-tuning a Llama base model using approximately 125K user-shared conversations gathered from ShareGPT.com with public APIs. To ensure data quality, we convert the HTML back to markdown and filter out some inappropriate or low-quality samples. Additionally, we divide lengthy conversations into smaller segments that fit the model's maximum context length. For detailed instructions to clean the ShareGPT data, check out [here](docs/commands/data_cleaning.md).
280280

281281
We will not release the ShareGPT dataset. If you would like to try the fine-tuning code, you can run it with some dummy conversations in [dummy_conversation.json](data/dummy_conversation.json). You can follow the same format and plug in your own data.
282282

@@ -295,7 +295,7 @@ We use similar hyperparameters as the Stanford Alpaca.
295295
pip3 install -e ".[train]"
296296
```
297297

298-
- You can use the following command to train Vicuna-7B with 4 x A100 (40GB). Update `--model_name_or_path` with the actual path to LLaMA weights and `--data_path` with the actual path to data.
298+
- You can use the following command to train Vicuna-7B with 4 x A100 (40GB). Update `--model_name_or_path` with the actual path to Llama weights and `--data_path` with the actual path to data.
299299
```bash
300300
torchrun --nproc_per_node=4 --master_port=20001 fastchat/train/train_mem.py \
301301
--model_name_or_path meta-llama/Llama-2-7b-hf \

fastchat/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
__version__ = "0.2.31"
1+
__version__ = "0.2.32"

pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta"
44

55
[project]
66
name = "fschat"
7-
version = "0.2.31"
7+
version = "0.2.32"
88
description = "An open platform for training, serving, and evaluating large language model based chatbots."
99
readme = "README.md"
1010
requires-python = ">=3.8"

0 commit comments

Comments
 (0)