LLM-for-HLS

██╗     ██╗     ███╗   ███╗██╗  ██╗██╗  ██╗██╗     ███████╗
██║     ██║     ████╗ ████║██║  ██║██║  ██║██║     ██╔════╝
██║     ██║     ██╔████╔██║███████║███████║██║     ███████╗
██║     ██║     ██║╚██╔╝██║╚════██║██╔══██║██║     ╚════██║
███████╗███████╗██║ ╚═╝ ██║     ██║██║  ██║███████╗███████║
╚══════╝╚══════╝╚═╝     ╚═╝     ╚═╝╚═╝  ╚═╝╚══════╝╚══════╝

Framework demo

Framework overview

LLM-for-HLS is an open-source project focused on leveraging Large Language Models (LLMs) for High-Level Synthesis (HLS). This project aims to construct datasets, fine-tune LLMs, and evaluate their performance in generating HLS code based on natural language instructions. It also involves some techniques like feedback loop and chain of thoughts.

Project structure

LLM-for-HLS/
├── axolotl                  # Directory for the Axolotl project
├── data                     # Directory containing data files used by the project
├── functionality_data       # Directory storing data specifically for illustrating functionality
├── last_run_prepared        # Directory for prepared data cache
├── qlora-out                # Directory for outputs from qlora model
├── src                      # Source code directory where the main project code is located
├── inference.sh             # Shell script for running inference tasks
├── functionality_check.sh   # Script for executing functional or syntax checks on generated results.
├── README.md                # Markdown file providing an overview and general information about the project
├── requirements.txt         # Text file listing dependencies needed by the project
└── train.sh                 # Shell script for training models or running training processes

Dataset construction

We have filtered the original labeled design benchmark dataset according to the 'perf' value to construct a dataset optimized for HLS code generation. Later, we extended the orignial dataset by adding tens of new source file after thoroughly inspecting the repository https://github.com/UT-LCA/ML4Accel-Dataset/tree/main/fpga_ml_dataset/HLS_dataset.

dataset source codes path: ./data/sources
dataset design path: ./data/designs
lately added dataset sources: ./data/new_data
dataset prepared path: ./data/gpt35/["processed_sources_train_c.jsonl", "processed_sources_test_c.jsonl"]

Code to Text(instruction)

used gpt3.5(will probably to gpt4) to generate the corresponding instructions for generating HLS codes

instruction fine-tuning

used alpaca format .jsonl dataset

{"instruction": "...", "input": "...", "output": "...", "source": "..."}

adapter: qlora
trained model path: ./qlora-out/merged
predicted_data_dir: ./test_output*

Environment setup

Install vLLM with CUDA 11.8.

export VLLM_VERSION=0.4.0
export PYTHON_VERSION=38 # Your Python version

pip install https://github.com/vllm-project/vllm/releases/download/v${VLLM_VERSION}/vllm-${VLLM_VERSION}+cu118-cp${PYTHON_VERSION}-cp${PYTHON_VERSION}-manylinux1_x86_64.whl --extra-index-url https://download.pytorch.org/whl/cu118

then, run:

pip install -r requirements.txt

then, cd into axlotl dir, and run:

pip install -e .

Prepare Pre-trained Models

Create a new folder named "models" under the axolotl folder, then download the CodeLlama-7b-hf model and save it in the newly created "models" folder.

fine-tuning command

Modify the axolotl/examples/code-llama/7b/qlora.yml file, changing the base_model to the absolute path of the model just downloaded. Before running the script, modify the axolotl/examples/code-llama/7b/qlora.yml file and set the load_4bit parameter to true.

sh train.sh

inference/test command

Description: This script is a wrapper for running various Python scripts related to feedback. Before running the script, modify the axolotl/examples/code-llama/7b/qlora.yml file and set the load_4bit parameter to false. And you must run pip uninstall flash-attn

Usage:

The script takes one or two command-line arguments:

The first argument specifies the type of feedback or inference to run:
- -syntax_feedback: Run syntax feedback inference
- -functionality_feedback: Run functionality feedback inference
- -cot: Run CoT (Contextualized Transformer) inference
The second argument is optional and only applies to the first two options:
- -cot: Use CoT for inference (only applicable with -syntax_feedback or -functionality_feedback)

Examples:

inference.sh -syntax_feedback: Run syntax feedback inference without CoT
inference.sh -syntax_feedback -cot: Run syntax feedback inference with CoT
inference.sh -functionality_feedback: Run functionality feedback inference without CoT
inference.sh -functionality_feedback -cot: Run functionality feedback inference with CoT
inference.sh -cot: Run CoT inference
inference.sh: Run inference without CoT (default behavior)

syntax/functionality check

First argument:
- -woCot: Without cot
- -cot: With cot
Second argument:
- -woFd: Without any feedback loop
- -synFd: With syntax feedback loop
- -funFd: With functionality feedback loop

Example Commands

./syntax_check.sh -woCot -woFd
./functionality_check.sh -cot -synFd

Name		Name	Last commit message	Last commit date
Latest commit History 118 Commits
axolotl		axolotl
data		data
functionality_data		functionality_data
src		src
test_output		test_output
test_output_cot_funFd		test_output_cot_funFd
test_output_cot_functionality_feedback_loop		test_output_cot_functionality_feedback_loop
test_output_cot_synFd		test_output_cot_synFd
test_output_cot_syntax_feedback_loop		test_output_cot_syntax_feedback_loop
test_output_cot_woFd		test_output_cot_woFd
test_output_with_feedback_loop		test_output_with_feedback_loop
test_output_without_cot		test_output_without_cot
test_output_woCot_funFd		test_output_woCot_funFd
test_output_woCot_synFd		test_output_woCot_synFd
test_output_woCot_synFd2		test_output_woCot_synFd2
test_output_woCot_woFd		test_output_woCot_woFd
webui		webui
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
demo.gif		demo.gif
functionality_check.sh		functionality_check.sh
inference.sh		inference.sh
requirements.txt		requirements.txt
syntax_check.sh		syntax_check.sh
train.sh		train.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM-for-HLS

Framework demo

Framework overview

Project structure

Dataset construction

Code to Text(instruction)

instruction fine-tuning

Environment setup

Prepare Pre-trained Models

fine-tuning command

inference/test command

syntax/functionality check

Example Commands

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Languages

jiahaogai/LLM-for-HLS

Folders and files

Latest commit

History

Repository files navigation

LLM-for-HLS

Framework demo

Framework overview

Project structure

Dataset construction

Code to Text(instruction)

instruction fine-tuning

Environment setup

Prepare Pre-trained Models

fine-tuning command

inference/test command

syntax/functionality check

Example Commands

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Languages

Packages