Start the vLLM server
In thinking mode:
vllm serve Qwen/Qwen3-32B \
--dtype auto \
--reasoning-parser deepseek_r1 \
--task generate \
--disable-log-requests \
--max-model-len 8192 \
--gpu-memory-utilization 0.95 \
--enable-chunked-prefill

Use non-thinking mode, as described in the Qwen3 docs:
vllm serve Qwen/Qwen3-32B \
--dtype auto \
--chat-template ./qwen3_nonthinking.jinja \
--task generate \
--disable-log-requests \
--max-model-len 8192 \
--gpu-memory-utilization 0.95 \
--enable-chunked-prefill

Start the server and run the inference:
python inference.py \
--model_id Qwen/Qwen3-4B \
--gen_kwargs thinking \
--datasets VariErrNLI \
--template_id 01 \
--remote_call_concurrency 10 \
--n_examples 10 \
--vllm.port 8000 \
--vllm.start_server=False

Create the sbatch files:
cd lewidi2025/slurm
python create_sbatch_files.py

Submit all the jobs:
cd slurm_scripts/
ls | xargs -n 1 sbatch

Check the status of the jobs:
squeue -u $USER

After installing the package, you can plot the metrics by running:
lewidi-plot --log_file /dss/dssfs02/lwp-dss-0001/pn76je/pn76je-dss-0000/lewidi-data/sbatch/di38bec/Qwen_Qwen3-32B_thinking/out.logs

where out.logs is generated by the sbatch file.
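When several models have been run, each model/mode combination gets its own subdirectory (e.g. Qwen_Qwen3-32B_thinking) with its own out.logs. A short Python sketch can collect all of them so each can be passed to lewidi-plot in turn; the directory layout assumed here is inferred from the path above:

```python
from pathlib import Path

def find_log_files(sbatch_root: str) -> list[Path]:
    """Collect every out.logs file found one level below the sbatch
    output root, i.e. one per model/mode subdirectory."""
    return sorted(Path(sbatch_root).glob("*/out.logs"))
```

For example, `find_log_files("lewidi-data/sbatch/di38bec")` (a hypothetical root; adjust to your own output directory) returns the log paths in sorted order.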
find . -name '*_responses.jsonl' | xargs wc -l
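Note that when the file list is long, xargs may split it across several wc invocations, leaving you with multiple "total" lines to add up by hand. A stdlib Python sketch that produces a single per-file and overall count:

```python
from pathlib import Path

def count_response_lines(root: str) -> dict[str, int]:
    """Count lines in every *_responses.jsonl under root, keyed by path."""
    counts = {}
    for p in sorted(Path(root).rglob("*_responses.jsonl")):
        with p.open() as f:
            counts[str(p)] = sum(1 for _ in f)
    return counts
```

`sum(count_response_lines(".").values())` then gives the overall total regardless of how many files there are.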
find . -name '*_responses.jsonl' | xargs cat > combined.jsonl

duckdb -c "COPY (SELECT * FROM read_json_auto('combined.jsonl', union_by_name=True)) TO 'combined.parquet'"

On the cluster, the same conversion can be run on a CPU node:

srun --partition=lrz-cpu --cpus-per-task=20 --time=01:00:00 --qos=cpu duckdb -c "COPY (SELECT * FROM read_json_auto('combined.jsonl', union_by_name=True)) TO 'combined.parquet'"

Alternatively, convert with pandas:
import pandas as pd
df = pd.read_json("combined.jsonl", lines=True, dtype={"error": "string"})
df.to_parquet("combined.parquet")
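If either conversion chokes on a malformed row (e.g. a truncated write from a job that was killed mid-flight), a quick stdlib pass can locate the offending lines before retrying. This is a sketch operating on the combined JSONL file produced above:

```python
import json

def find_bad_lines(path: str) -> list[int]:
    """Return the 1-based line numbers that are not valid JSON."""
    bad = []
    with open(path) as f:
        for i, line in enumerate(f, start=1):
            if not line.strip():
                continue  # blank lines are harmless, skip them
            try:
                json.loads(line)
            except json.JSONDecodeError:
                bad.append(i)
    return bad
```

Running `find_bad_lines("combined.jsonl")` on a clean file returns an empty list; any line numbers it reports can be inspected (or dropped) before re-running the conversion.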