This procedure explains how to perform calibration for a single Intel® Gaudi® node.
Before you start, familiarize with notes and recommendations.
-
Build and install the latest version of vLLM Hardware Plugin for Intel® Gaudi® by following the Installation procedure.
-
In the plugin project directory, navigate to the
calibrationsubdirectory and install the required dependencies.cd calibration pip install -r requirements.txt -
Download the dataset.
-
Run the
calibrate_model.shscript with the obligatory-m,-d, and-oarguments, as in the following example:./calibrate_model.sh -m /path/to/local/llama3.1/Meta-Llama-3.1-405B-Instruct/ -d dataset-processed.pkl -o /path/to/measurements/vllm-benchmarks/inc -b 128 -t 8 -l 4096 # OR ./calibrate_model.sh -m facebook/opt-125m -d dataset-processed.pkl -o inc/ # OR Calibrate DeepSeek models with dataset NeelNanda/pile-10k ./calibrate_model.sh -m deepseek-ai/DeepSeek-R1 -d NeelNanda/pile-10k -o inc/ -t 8
Where:
-m <path/ID>: Path to a locally stored model or the model ID from the Hugging Face hub.-d <path>: Path to the source dataset in the pickle (.pkl) format.-o <path>: Path to the directory where you want to save the generated measurements.