Add LLM-KG-Bench evaluation for the rudof MCP server #492

@samuel-bustamante

Description

Summary

This issue tracks the implementation of an evaluation framework for the rudof MCP server using a fork of LLM-KG-Bench.

Motivation

The rudof_mcp crate exposes the main functionality of the rudof library as an MCP (Model Context Protocol) server. To measure how well LLMs can use this server to solve Knowledge Graph tasks, we need a systematic evaluation framework.

Approach

We forked LLM-KG-Bench into rudof-project/LLM-KG-Bench-rudof and will adapt it to support MCP servers, allowing LLMs to interact with external tools through the Model Context Protocol during evaluation. This will enable us to evaluate the rudof MCP server specifically.
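A core piece of the adaptation is exposing the MCP server's tools to the model under evaluation. As a minimal, purely illustrative sketch (the `validate_shex` tool name and schema below are assumptions, not the actual rudof_mcp tool listing), this converts MCP tool descriptors, as returned by an MCP `tools/list` call, into the generic function-calling format most LLM chat APIs accept:

```python
def mcp_tools_to_llm_functions(tools: list[dict]) -> list[dict]:
    """Convert MCP tool descriptors (name / description / inputSchema)
    into the function-calling spec format common to LLM chat APIs.
    Field names follow the MCP tools/list response shape."""
    return [
        {
            "type": "function",
            "function": {
                "name": t["name"],
                "description": t.get("description", ""),
                # MCP inputSchema is already JSON Schema, which is what
                # function-calling APIs expect for "parameters".
                "parameters": t.get(
                    "inputSchema", {"type": "object", "properties": {}}
                ),
            },
        }
        for t in tools
    ]

# Hypothetical example of a tool the rudof MCP server might expose:
tools = [{
    "name": "validate_shex",
    "description": "Validate RDF data against a ShEx schema",
    "inputSchema": {
        "type": "object",
        "properties": {
            "data": {"type": "string"},
            "schema": {"type": "string"},
        },
        "required": ["data", "schema"],
    },
}]
specs = mcp_tools_to_llm_functions(tools)
print(specs[0]["function"]["name"])  # validate_shex
```

During a benchmark run, the adapted framework would fetch the real tool list from the running rudof MCP server, pass the converted specs to the LLM, and route any tool calls the model makes back through the MCP session.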

Tasks

  • Adapt LLM-KG-Bench to support MCP servers as tool providers for LLMs during evaluation
  • Document how to run the benchmark against the rudof MCP server
  • Add BENCHMARKING.md to the rudof_mcp crate linking to the benchmark repo

Metadata

Labels

  • MCP: Related to Model Context Protocol
  • tests: Related to the tests of the implemented features
