Skip to content

Commit 9006401

Browse files
authored
minor
1 parent b00f23e commit 9006401

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

examples/main/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -312,7 +312,7 @@ These options help improve the performance and memory usage of the LLaMA models.
312312

313313
- `-ub N`, `--ubatch-size N`: Physical batch size. This is the maximum number of tokens that may be processed at a time. Increasing this value may improve performance during prompt processing, at the expense of higher memory usage. Default: `512`.
314314

315-
- `-b N`, `--batch-size N`: Logical batch size. Increasing this value above the value of the physical batch size may improve prompt processing performance when using multiple GPUs with pipeline parallelism. Default: `2048`
315+
- `-b N`, `--batch-size N`: Logical batch size. Increasing this value above the value of the physical batch size may improve prompt processing performance when using multiple GPUs with pipeline parallelism. Default: `2048`.
316316

317317
### Prompt Caching
318318

0 commit comments

Comments
 (0)