Skip to content

Commit 6f0578c

Browse files
Update model list and reproduction guide (#4)
* README: add Qwen3.5 models Made-with: Cursor * CI: add PyPI publish workflow on release Made-with: Cursor * Update model list and reproduction guide --------- Co-authored-by: Yesheng Liang <liang2kl@outlook.com>
1 parent 92a3385 commit 6f0578c

File tree

2 files changed

+66
-6
lines changed

2 files changed

+66
-6
lines changed
Lines changed: 51 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,51 @@
1+
name: Publish to PyPI
2+
3+
on:
4+
release:
5+
types: [published]
6+
7+
permissions:
8+
contents: read
9+
id-token: write
10+
11+
concurrency:
12+
group: pypi-publish
13+
cancel-in-progress: false
14+
15+
jobs:
16+
build:
17+
runs-on: ubuntu-latest
18+
steps:
19+
- name: Checkout
20+
uses: actions/checkout@v4
21+
22+
- name: Set up Python
23+
uses: actions/setup-python@v5
24+
with:
25+
python-version: "3.11"
26+
27+
- name: Install build tools
28+
run: pip install build
29+
30+
- name: Build sdist and wheel
31+
run: python -m build
32+
33+
- name: Upload dist artifacts
34+
uses: actions/upload-artifact@v4
35+
with:
36+
name: dist
37+
path: dist/
38+
39+
publish:
40+
runs-on: ubuntu-latest
41+
needs: build
42+
environment: pypi
43+
steps:
44+
- name: Download dist artifacts
45+
uses: actions/download-artifact@v4
46+
with:
47+
name: dist
48+
path: dist/
49+
50+
- name: Publish to PyPI
51+
uses: pypa/gh-action-pypi-publish@release/v1

README.md

Lines changed: 15 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -50,19 +50,32 @@ All models are available on [Hugging Face](https://huggingface.co/collections/z-
5050
| Qwen3-4B | [`z-lab/Qwen3-4B-PARO`](https://huggingface.co/z-lab/Qwen3-4B-PARO) |
5151
| Qwen3-8B | [`z-lab/Qwen3-8B-PARO`](https://huggingface.co/z-lab/Qwen3-8B-PARO) |
5252
| Qwen3-14B | [`z-lab/Qwen3-14B-PARO`](https://huggingface.co/z-lab/Qwen3-14B-PARO) |
53-
| Qwen3-4B-Thinking-2507 | [`z-lab/Qwen3-4B-Thinking-2507-PARO`](https://huggingface.co/z-lab/Qwen3-4B-Thinking-2507-PARO) |
53+
54+
**Qwen3.5**
55+
56+
| Model | Checkpoint |
57+
|---|---|
58+
| Qwen3.5-0.8B | [`z-lab/Qwen3.5-0.8B-PARO`](https://huggingface.co/z-lab/Qwen3.5-0.8B-PARO) |
59+
| Qwen3.5-2B | [`z-lab/Qwen3.5-2B-PARO`](https://huggingface.co/z-lab/Qwen3.5-2B-PARO) |
60+
| Qwen3.5-4B | [`z-lab/Qwen3.5-4B-PARO`](https://huggingface.co/z-lab/Qwen3.5-4B-PARO) |
61+
| Qwen3.5-9B | [`z-lab/Qwen3.5-9B-PARO`](https://huggingface.co/z-lab/Qwen3.5-9B-PARO) |
5462

5563
**Llama**
5664

5765
| Model | Checkpoint |
5866
|---|---|
5967
| Llama-2-7B | [`z-lab/Llama-2-7b-hf-PARO`](https://huggingface.co/z-lab/Llama-2-7b-hf-PARO) |
6068
| Llama-3-8B | [`z-lab/Meta-Llama-3-8B-PARO`](https://huggingface.co/z-lab/Meta-Llama-3-8B-PARO) |
61-
| Llama-3-70B | [`z-lab/Meta-Llama-3-70B-PARO`](https://huggingface.co/z-lab/Meta-Llama-3-70B-PARO) |
6269
| Llama-3.1-8B-Instruct | [`z-lab/Llama-3.1-8B-Instruct-PARO`](https://huggingface.co/z-lab/Llama-3.1-8B-Instruct-PARO) |
6370

6471
Want a model that's not listed? [Open an issue](https://github.com/z-lab/paroquant/issues/new) and let us know.
6572

73+
## Reproduction
74+
75+
> [!NOTE]
76+
> The main branch of this repository is under active development, and reproducibility is not guaranteed.
77+
> Please use the [`legacy`](https://github.com/z-lab/paroquant/tree/legacy) branch to reproduce results from the paper.
78+
6679
## Installation
6780

6881
```bash
@@ -89,10 +102,6 @@ python -m paroquant.cli.convert \
89102
--output-path models/Qwen3-8B-PARO
90103
```
91104

92-
## Reproduction
93-
94-
See [`experiments/README.md`](./experiments/README.md) for scripts to reproduce all results in the paper.
95-
96105
## Docker Images
97106

98107
| Image | Purpose |

0 commit comments

Comments
 (0)