Skip to content

Commit 35ae2d1

Browse files
authored
Sample to sample compare for integration tests (#977)
* Track parquet files with Git LFS * init * remove logging in slow tests and log nvidia smi
1 parent 6ee4ff8 commit 35ae2d1

File tree

67 files changed

+499
-19
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

67 files changed

+499
-19
lines changed

.gitattributes

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,2 +1,3 @@
11
*.json filter=lfs diff=lfs merge=lfs -text
22
tests/unit/metrics/test_cases/*.json -filter -diff -merge text
3+
*.parquet filter=lfs diff=lfs merge=lfs -text

.github/workflows/slow_tests.yaml

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -38,5 +38,8 @@ jobs:
3838
run: uv sync --extra dev
3939

4040

41+
- name: run nvidia-smi
42+
run: nvidia-smi
43+
4144
- name: Run tests
42-
run: uv run pytest --disable-pytest-warnings -o log_cli=true -o log_cli_level=INFO --runslow tests/slow_tests/
45+
run: uv run pytest --disable-pytest-warnings --runslow tests/slow_tests/

examples/model_configs/vllm_model_config.yaml

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -21,13 +21,11 @@ model_parameters:
2121
presence_penalty: 0.0
2222
repetition_penalty: 1.0
2323
frequency_penalty: 0.0
24-
temperature: 0.1
24+
temperature: 0.0
2525
top_k: null
2626
min_p: 0.0
2727
top_p: 0.9
2828
seed: 42
2929
stop_tokens: null
3030
max_new_tokens: 2048
3131
min_new_tokens: 0
32-
metrics_options:
33-
yo: null

src/lighteval/main_accelerate.py

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -146,7 +146,8 @@ def accelerate( # noqa C901
146146
pipeline.show_results()
147147

148148
results = pipeline.get_results()
149+
details = pipeline.get_details()
149150

150151
pipeline.save_and_push_results()
151152

152-
return results
153+
return results, details

src/lighteval/main_vllm.py

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -131,7 +131,8 @@ def vllm(
131131
pipeline.show_results()
132132

133133
results = pipeline.get_results()
134+
details = pipeline.get_details()
134135

135136
pipeline.save_and_push_results()
136137

137-
return results
138+
return results, details

src/lighteval/pipeline.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -437,3 +437,6 @@ def show_results(self):
437437
def get_results(self):
438438
self._init_final_dict()
439439
return self.final_dict
440+
441+
def get_details(self):
442+
return self.evaluation_tracker.details_logger.details
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,3 @@
1+
version https://git-lfs.github.com/spec/v1
2+
oid sha256:228e58884c153beb351f47a3d0c5704d9e764844aa6990375855ebdddfe234cb
3+
size 11827875
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,3 @@
1+
version https://git-lfs.github.com/spec/v1
2+
oid sha256:1f470cfb11d5901a644ccf28b267829448c511929034ee520a4f870253e3be99
3+
size 11540572
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,3 @@
1+
version https://git-lfs.github.com/spec/v1
2+
oid sha256:c2168453c9482af24bcb501f7b1b5f4f83f261e416c40db8e23ec123a13ef27f
3+
size 27746331
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,3 @@
1+
version https://git-lfs.github.com/spec/v1
2+
oid sha256:bf8c0061d55f76265ec3d88762e4d806ce2d932909384cd03637413fd5cb89be
3+
size 88248

0 commit comments

Comments
 (0)