Calibrating a Single Node

This procedure explains how to perform calibration for a single Intel® Gaudi® node.

Prerequisites

Before you start, familiarize yourself with the notes and recommendations.

Calibration procedure

  1. Build and install the latest version of vLLM Hardware Plugin for Intel® Gaudi® by following the Installation procedure.

  2. In the plugin project directory, navigate to the calibration subdirectory and install the required dependencies.

    cd calibration
    pip install -r requirements.txt

  3. Download the dataset.

  4. Run the calibrate_model.sh script with the required -m, -d, and -o arguments, as in the following examples:

    ./calibrate_model.sh -m /path/to/local/llama3.1/Meta-Llama-3.1-405B-Instruct/ -d dataset-processed.pkl -o /path/to/measurements/vllm-benchmarks/inc -b 128 -t 8 -l 4096
    # OR
    ./calibrate_model.sh -m facebook/opt-125m -d dataset-processed.pkl -o inc/
    # OR Calibrate DeepSeek models with dataset NeelNanda/pile-10k
    ./calibrate_model.sh -m deepseek-ai/DeepSeek-R1 -d NeelNanda/pile-10k -o inc/ -t 8

    Where:

    • -m <path/ID>: Path to a locally stored model or the model ID from the Hugging Face hub.
    • -d <path>: Path to the source dataset in the pickle (.pkl) format. The DeepSeek example above instead passes the dataset name NeelNanda/pile-10k.
    • -o <path>: Path to the directory where you want to save the generated measurements.
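
Because calibration of a large model can run for a long time, it may help to check the -d and -o values before launching the script. The following is a minimal sketch, not part of the plugin: the function name preflight_calibration and its messages are hypothetical, and it covers only the case where -d points to a local .pkl file (it does not apply when a dataset name such as NeelNanda/pile-10k is passed).

```shell
# Hypothetical pre-flight check; not shipped with calibrate_model.sh.
# Usage: preflight_calibration <dataset.pkl> <output-dir>
preflight_calibration() {
    dataset="$1"
    outdir="$2"

    # The -d argument is documented as a path to a .pkl file.
    case "$dataset" in
        *.pkl) ;;
        *) echo "dataset is not a .pkl file: $dataset" >&2; return 1 ;;
    esac

    # Fail early if the dataset file does not exist.
    if [ ! -f "$dataset" ]; then
        echo "dataset not found: $dataset" >&2
        return 1
    fi

    # Create the measurements directory up front so that a permissions
    # problem surfaces before the calibration run starts.
    mkdir -p "$outdir" || return 1
}
```

One possible use is to chain the check with the real invocation, for example `preflight_calibration dataset-processed.pkl inc/ && ./calibrate_model.sh -m facebook/opt-125m -d dataset-processed.pkl -o inc/`, so that the script starts only when both paths look usable.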