Commit 2889dce

Merge branch 'main' into feat/max-error-rate
2 parents fa56258 + 6fff1bf commit 2889dce

6 files changed: +13 -45 lines changed


.github/workflows/development.yml

Lines changed: 0 additions & 16 deletions
@@ -37,22 +37,6 @@ jobs:
       - name: Run quality checks
         run: tox -e types
 
-  link-checks:
-    runs-on: ubuntu-latest
-    strategy:
-      matrix:
-        python: ["3.9"]
-    steps:
-      - uses: actions/checkout@v4
-      - name: Set up Python
-        uses: actions/setup-python@v5
-        with:
-          python-version: ${{ matrix.python }}
-      - name: Install dependencies
-        run: pip install tox
-      - name: Run link checks
-        run: tox -e links
-
   precommit-checks:
     runs-on: ubuntu-latest
     strategy:

.github/workflows/main.yml

Lines changed: 0 additions & 16 deletions
@@ -38,22 +38,6 @@ jobs:
       - name: Run quality checks
         run: tox -e types
 
-  link-checks:
-    runs-on: ubuntu-latest
-    strategy:
-      matrix:
-        python: ["3.9"]
-    steps:
-      - uses: actions/checkout@v4
-      - name: Set up Python
-        uses: actions/setup-python@v5
-        with:
-          python-version: ${{ matrix.python }}
-      - name: Install dependencies
-        run: pip install tox
-      - name: Run link checks
-        run: tox -e links
-
   precommit-checks:
     runs-on: ubuntu-latest
     strategy:

CONTRIBUTING.md

Lines changed: 3 additions & 3 deletions
@@ -47,7 +47,7 @@ You can either clone the repository directly or fork it if you plan to contribut
 cd guidellm
 ```
 
-For detailed instructions on setting up your development environment, please refer to the [DEVELOPING.md](https://github.com/neuralmagic/speculators/blob/main/DEVELOPING.md) file. It includes step-by-step guidance on:
+For detailed instructions on setting up your development environment, please refer to the [DEVELOPING.md](https://github.com/neuralmagic/guidellm/blob/main/DEVELOPING.md) file. It includes step-by-step guidance on:
 
 - Installing dependencies
 - Running tests
@@ -114,8 +114,8 @@ If you encounter a bug or have a feature request, please open an issue on GitHub
 
 ## Community Standards
 
-We are committed to fostering a welcoming and inclusive community. Please read and adhere to our [Code of Conduct](https://github.com/neuralmagic/speculators/blob/main/CODE_OF_CONDUCT.md).
+We are committed to fostering a welcoming and inclusive community. Please read and adhere to our [Code of Conduct](https://github.com/neuralmagic/guidellm/blob/main/CODE_OF_CONDUCT.md).
 
 ## License
 
-By contributing to Speculators, you agree that your contributions will be licensed under the [Apache License 2.0](https://github.com/neuralmagic/speculators/blob/main/LICENSE).
+By contributing to GuideLLM, you agree that your contributions will be licensed under the [Apache License 2.0](https://github.com/neuralmagic/guidellm/blob/main/LICENSE).

DEVELOPING.md

Lines changed: 7 additions & 7 deletions
@@ -1,6 +1,6 @@
-# Developing for Speculators
+# Developing for GuideLLM
 
-Thank you for your interest in contributing to Speculators! This document provides detailed instructions for setting up your development environment, implementing changes, and adhering to the project's best practices. Your contributions help us grow and improve this project.
+Thank you for your interest in contributing to GuideLLM! This document provides detailed instructions for setting up your development environment, implementing changes, and adhering to the project's best practices. Your contributions help us grow and improve this project.
 
 ## Setting Up Your Development Environment
 
@@ -142,7 +142,7 @@ tox
 To ensure your changes are covered by tests, run:
 
 ```bash
-tox -e test-unit -- --cov=speculators --cov-report=html
+tox -e test-unit -- --cov=guidellm --cov-report=html
 ```
 
 Review the coverage report to confirm that your new code is adequately tested.
@@ -181,7 +181,7 @@ Review the coverage report to confirm that your new code is adequately tested.
 
 ## Additional Resources
 
-- [CONTRIBUTING.md](https://github.com/neuralmagic/speculators/blob/main/CONTRIBUTING.md): Guidelines for contributing to the project.
-- [CODE_OF_CONDUCT.md](https://github.com/neuralmagic/speculators/blob/main/CODE_OF_CONDUCT.md): Our expectations for community behavior.
-- [tox.ini](https://github.com/neuralmagic/speculators/blob/main/tox.ini): Configuration for Tox environments.
-- [.pre-commit-config.yaml](https://github.com/neuralmagic/speculators/blob/main/.pre-commit-config.yaml): Configuration for pre-commit hooks.
+- [CONTRIBUTING.md](https://github.com/neuralmagic/guidellm/blob/main/CONTRIBUTING.md): Guidelines for contributing to the project.
+- [CODE_OF_CONDUCT.md](https://github.com/neuralmagic/guidellm/blob/main/CODE_OF_CONDUCT.md): Our expectations for community behavior.
+- [tox.ini](https://github.com/neuralmagic/guidellm/blob/main/tox.ini): Configuration for Tox environments.
+- [.pre-commit-config.yaml](https://github.com/neuralmagic/guidellm/blob/main/.pre-commit-config.yaml): Configuration for pre-commit hooks.

README.md

Lines changed: 2 additions & 2 deletions
@@ -92,7 +92,7 @@ After the evaluation is completed, GuideLLM will summarize the results into thre
 
 The sections will look similar to the following: <img alt="Sample GuideLLM benchmark output" src="https://raw.githubusercontent.com/neuralmagic/guidellm/main/docs/assets/sample-output.png" />
 
-For more details about the metrics and definitions, please refer to the [Metrics documentation](https://raw.githubusercontent.com/neuralmagic/guidellm/main/docs/metrics.md).
+For more details about the metrics and definitions, please refer to the [Metrics documentation](https://github.com/neuralmagic/guidellm/blob/main/docs/metrics.md).
 
 #### 4. Explore the Results File
 
@@ -106,7 +106,7 @@ The results from GuideLLM are used to optimize your LLM deployment for performan
 
 For example, when deploying a chat application, we likely want to ensure that our time to first token (TTFT) and inter-token latency (ITL) are under certain thresholds to meet our service level objectives (SLOs) or service level agreements (SLAs). For example, setting TTFT to 200ms and ITL 25ms for the sample data provided in the example above, we can see that even though the server is capable of handling up to 13 requests per second, we would only be able to meet our SLOs for 99% of users at a request rate of 3.5 requests per second. If we relax our constraints on ITL to 50 ms, then we can meet the TTFT SLA for 99% of users at a request rate of approximately 10 requests per second.
 
-For further details on determining the optimal request rate and SLOs, refer to the [SLOs documentation](https://raw.githubusercontent.com/neuralmagic/guidellm/main/docs/service_level_objectives.md).
+For further details on determining the optimal request rate and SLOs, refer to the [SLOs documentation](https://github.com/neuralmagic/guidellm/blob/main/docs/service_level_objectives.md).
 
 ### Configurations
 
src/guidellm/utils/random.py

Lines changed: 1 addition & 1 deletion
@@ -37,7 +37,7 @@ def __iter__(self) -> Iterator[int]:
             if calc_min == calc_max:
                 yield calc_min
             elif not self.variance:
-                yield self.rng.randint(calc_min, calc_max + 1)
+                yield self.rng.randint(calc_min, calc_max)
             else:
                 rand = self.rng.gauss(self.average, self.variance)
                 yield round(max(calc_min, min(calc_max, rand)))
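
The one-line change in `random.py` hinges on the bounds of `randint`. Assuming `self.rng` is a standard-library `random.Random` instance (the diff does not show how it is constructed), `randint(a, b)` includes both endpoints, so the old `calc_max + 1` could yield a value one above the intended maximum. NumPy's `randint`, by contrast, excludes the upper bound, which may be where the `+ 1` habit comes from. A minimal sketch with hypothetical bounds, not the project's own test:

```python
import random

# Hypothetical bounds chosen for illustration; not taken from the commit.
calc_min, calc_max = 1, 3

# Assumption: the sampler's rng behaves like random.Random.
rng = random.Random(42)

# Old call: randint's upper bound is inclusive, so calc_max + 1 can be drawn.
old_draws = {rng.randint(calc_min, calc_max + 1) for _ in range(10_000)}

# New call: every draw stays within [calc_min, calc_max].
new_draws = {rng.randint(calc_min, calc_max) for _ in range(10_000)}

print(sorted(old_draws))  # [1, 2, 3, 4]
print(sorted(new_draws))  # [1, 2, 3]
```

The clamped Gaussian branch in the same method already enforces `calc_max` through `min(calc_max, rand)`, so after this change both branches respect the same upper limit.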
