You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
flop_counter: bool=False# Enable Flop counter to measure model throughput, can not be used with pytorch profiler at the same time.
54
-
flop_counter_startpoint: int=3# The step to start profiling, default is 3, which means after 3 steps of warmup stage, the profiler will start to count flops.
55
-
use_profiler: bool=False# Enable pytorch profiler, can not be used with flop counter at the same time.
53
+
flop_counter: bool=False# Enable FLOPS counter to measure model throughput, can not be used with pytorch profiler at the same time.
54
+
flop_counter_start: int=3# The step to start profiling, default is 3, which means after 3 steps of warm-up stage, the profiler will start to count FLOPS.
55
+
use_profiler: bool=False# Enable pytorch profiler, can not be used with FLOPS counter at the same time.
56
56
profiler_dir: str="PATH/to/save/profiler/results"# will be used if using profiler
57
57
```
58
58
@@ -94,8 +94,8 @@ You'll be able to access a dedicated project or run link on [wandb.ai](https://w
To help with benchmarking effort, we are adding the support for counting the flops during the fine-tuning process. You can achieve this by setting `--flop_counter` when launching your single/multi GPU fine-tuning. Use `--flop_counter_startpoint` to choose which step to count the flops. It is recommended to allow a warmup stage before using the flop counter.
99
+
To help with benchmarking effort, we are adding the support for counting the FLOPS during the fine-tuning process. You can achieve this by setting `--flop_counter` when launching your single/multi GPU fine-tuning. Use `--flop_counter_start` to choose which step to count the FLOPS. It is recommended to allow a warm-up stage before using the FLOPS counter.
100
100
101
-
Similarly, you can set `--use_profiler` flag and pass a profiling output path using `--profiler_dir` to capture the profile traces of your model using [PyTorch profiler](https://pytorch.org/tutorials/intermediate/tensorboard_profiler_tutorial.html). This would be helpful for debugging purposes. However, the `--flop_counter` and `--use_profiler` can not be used in the same time to ensure the measurement accuarcy.
101
+
Similarly, you can set `--use_profiler` flag and pass a profiling output path using `--profiler_dir` to capture the profile traces of your model using [PyTorch profiler](https://pytorch.org/tutorials/intermediate/tensorboard_profiler_tutorial.html). This would be helpful for debugging purposes. However, the `--flop_counter` and `--use_profiler` can not be used in the same time to ensure the measurement accuracy.
Copy file name to clipboardExpand all lines: src/llama_recipes/configs/training.py
+2-2Lines changed: 2 additions & 2 deletions
Original file line number
Diff line number
Diff line change
@@ -42,7 +42,7 @@ class train_config:
42
42
use_fast_kernels: bool=False# Enable using SDPA from PyTroch Accelerated Transformers, make use Flash Attention and Xformer memory-efficient kernels
43
43
use_wandb: bool=False# Enable wandb for experient tracking
44
44
save_metrics: bool=False# saves training metrics to a json file for later plotting
45
-
flop_counter: bool=False# Enable Flop counter to measure model throughput, can not be used with pytorch profiler at the same time.
46
-
flop_counter_startpoint: int=3# The step to start profiling, default is 3, which means after 3 steps of warmup stage, the profiler will start to count flops.
45
+
flop_counter: bool=False# Enable flop counter to measure model throughput, can not be used with pytorch profiler at the same time.
46
+
flop_counter_start: int=3# The step to start profiling, default is 3, which means after 3 steps of warmup stage, the profiler will start to count flops.
47
47
use_profiler: bool=False# Enable pytorch profiler, can not be used with flop counter at the same time.
48
48
profiler_dir: str="PATH/to/save/profiler/results"# will be used if using profiler
raiseValueError(f"flop counter requires at least {cfg.flop_counter_startpoint} train steps, please increase the max_train_step, current max_train_step {cfg.max_train_step}")
raiseValueError(f"flop counter requires at least {cfg.flop_counter_start} train steps, please increase the max_train_step, current max_train_step {cfg.max_train_step}")
0 commit comments