LLM4VKG is a framework that leverages Large Language Models (LLMs) for Virtual Knowledge Graph (VKG) construction. By integrating established mapping patterns, LLM4VKG effectively structures and maps ontologies, making them more comprehensive and practical. Additionally, we developed an automated evaluation framework to simplify the assessment process.
First, install UV (a fast Python package installer and resolver). You can install it using one of the following methods:
Using pip:
```bash
pip install uv
```

Using curl (Linux/macOS):

```bash
curl -LsSf https://astral.sh/uv/install.sh | sh
```

Using Homebrew (macOS):

```bash
brew install uv
```

For more installation options, visit: https://github.com/astral-sh/uv
After installing UV, install the project dependencies:
```bash
uv sync
```

This will create a virtual environment and install all dependencies specified in pyproject.toml.
Please refer to the pyproject.toml file for a list of dependencies.
The following external resources are required. Please download and place them in the ./resources directory:
- Instantiate the database according to the SQL dump files in ./datasets/rodi/*/dump.sql, then set the corresponding DB config in src/db_utils/db_utils.py.
- Set the API config for LLMs in src/llm/resources/ampi.json.
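For orientation, the snippet below illustrates the kind of values such a DB config typically holds and how they combine into a connection URL. All names here (`DB_CONFIG`, the key names, the PostgreSQL defaults) are hypothetical examples, not the actual contents of src/db_utils/db_utils.py; check that file for the real variable names.

```python
# Hypothetical example of a DB config; the actual keys and values expected
# by src/db_utils/db_utils.py may differ.
DB_CONFIG = {
    "host": "localhost",
    "port": 5432,
    "user": "rodi",
    "password": "change-me",
    "database": "rodi",
}


def connection_url(cfg: dict) -> str:
    """Build a SQLAlchemy-style PostgreSQL connection URL from a config dict."""
    return (
        f"postgresql://{cfg['user']}:{cfg['password']}"
        f"@{cfg['host']}:{cfg['port']}/{cfg['database']}"
    )
```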
All scripts are located in the script/ directory and use UV to run the Python programs. Make sure you have completed the installation steps above before running them.
- Mapping pattern recognition:

  ```bash
  ./script/MPR.sh
  ```

- Ontology completion and mapping generation:

  ```bash
  ./script/OC_MG.sh
  ```

- Evaluation:

  ```bash
  uv run python rodi_evaluate.py
  ```
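To give a sense of what a mapping evaluation computes, the sketch below scores predicted mappings against a gold standard with set-based precision/recall/F1. This is a hypothetical illustration only: the function name, the (table, column, ontology_term) tuple shape, and the metric are assumptions, and rodi_evaluate.py's actual scoring (RODI compares query results, not mapping tuples directly) may differ.

```python
def mapping_prf(
    predicted: set[tuple], gold: set[tuple]
) -> tuple[float, float, float]:
    """Precision, recall, and F1 over sets of mapping tuples.

    Hypothetical sketch of a set-based comparison; not the actual
    logic of rodi_evaluate.py.
    """
    tp = len(predicted & gold)  # true positives: mappings found in both sets
    precision = tp / len(predicted) if predicted else 0.0
    recall = tp / len(gold) if gold else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1
```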
Additional scripts:

- script/MPR_infk.sh / script/MPR_nofk.sh: Mapping pattern recognition with different configurations
- script/OC_MG_infk.sh / script/OC_MG_nofk.sh: Ontology completion and mapping generation with different configurations
- script/dataEnrichment.sh: Data enrichment script
Note: Make sure the scripts have execute permissions. If not, run:
```bash
chmod +x script/*.sh
```

The directory outputs/ will contain the full outputs of LLM4VKG. This includes the generated ontology, mappings, and a comprehensive evaluation report detailing performance metrics and validation outcomes.
This work utilizes the RODI (Relational-to-Ontology Mapping Quality Benchmark) dataset. We thank the creators and maintainers for their contribution.
The RODI benchmark can be found at: https://github.com/chrpin/rodi
If you find this work useful, please consider citing our paper accepted at IJCAI 2025:
@inproceedings{Xiao2025LLM4VKG,
author = {Guohui Xiao and Lin Ren and Guilin Qi and Haohan Xue and Marco Di Panfilo and Davide Lanti},
title = {LLM4VKG: Leveraging Large Language Models for Virtual Knowledge Graph Construction},
booktitle = {Proceedings of the 34th International Joint Conference on Artificial Intelligence (IJCAI-25)},
year = {2025}
}