Add C API for LLMPipeline #1778
Conversation
Wovchena left a comment
Answering your question about the naming style: stick to the OpenVINO style, but instead of the ov_ prefix use ov_genai_.
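For illustration, a minimal sketch of what that convention could look like. These declarations are hypothetical, not the exact signatures from this PR; `ov_status_e` is assumed to come from the OpenVINO C API headers.

```C
/* Hypothetical declarations, shown only to illustrate the requested naming:
 * follow the OpenVINO C API style, but use the ov_genai_ prefix instead of ov_.
 * ov_status_e is the status type of the OpenVINO C API (openvino/c/ov_common.h). */
typedef struct ov_genai_llm_pipeline ov_genai_llm_pipeline;

ov_status_e ov_genai_llm_pipeline_create(const char* models_path,
                                         const char* device,
                                         ov_genai_llm_pipeline** pipeline);

void ov_genai_llm_pipeline_free(ov_genai_llm_pipeline* pipeline);
```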
build_jenkins
All tests passed except one job. @ilya-lavrenov, could you please kindly review it? Thanks!
from conftest import SAMPLES_PY_DIR, SAMPLES_CPP_DIR, SAMPLES_C_DIR
from test_utils import run_sample

class TestGreedyCausalLM:
Could you please add tests for the other C samples as well?
I've added a test for benchmark_genai_c, aligned with the corresponding tests in the C++ and Python samples. I have not found a chat_sample test under the openvino.genai/tests/python_tests/samples folder for either C++ or Python; I plan to add it later.
build_jenkins
@apinge it looks like to fix the macOS Node.JS job we need to wait for openvinotoolkit/openvino#29320 and then for the nightly builds. Could you please disable this job temporarily to unblock your PR?
const char* inputs,
const ov_genai_generation_config* config,
const stream_callback* streamer,
char* output,
Could you please clarify: how can the caller find out the size of output that is sufficient for a successful generation?
If the memory is insufficient, it returns only the first part of the generated tokens; how can I get the remaining part?
Each token is 2-3 symbols, so you can allocate max_new_tokens * num_of_symbols_in_token.
But I agree, maybe it's better to allocate the required size inside the generate() function and return it to the end user? In that case the output will not be truncated.
In that case the output buffer needs to be freed on the app side.
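As a rough illustration of that heuristic, a minimal sketch assuming the current signature where the caller owns the output buffer; the constant and helper name below are made up for the example, not part of the API.

```C
#include <stdlib.h>

/* Heuristic from the comment above: a generated token decodes to roughly
 * 2-3 symbols, so max_new_tokens * symbols-per-token is a generous upper bound.
 * SYMBOLS_PER_TOKEN and alloc_output_buffer are illustrative names only. */
enum { SYMBOLS_PER_TOKEN = 4 };

static char* alloc_output_buffer(size_t max_new_tokens) {
    /* +1 for a terminating '\0' */
    return (char*)malloc(max_new_tokens * SYMBOLS_PER_TOKEN + 1);
}
```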
Let's fix it in a separate PR.
We think we can allow ov_genai_llm_pipeline_generate's output argument to be NULL, and in that scenario obtain the result only through the streamer. That way the size of output is no longer a limitation.
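A sketch of that streamer-only idea, assuming the stream_callback type from the diff above is a plain function pointer that receives each decoded chunk; the actual typedef and the full generate signature in the PR may differ.

```C
#include <stdio.h>

/* Assumed shape of the callback type; only for this sketch. */
typedef void (*stream_callback)(const char* chunk);

/* Print each decoded chunk as it arrives, so no output buffer is needed. */
static void print_chunk(const char* chunk) {
    printf("%s", chunk);
    fflush(stdout);
}

/* Proposed call pattern (illustrative): pass NULL for output and rely on the
 * streamer only, so the output buffer size no longer limits the result:
 *
 *   stream_callback cb = print_chunk;
 *   ov_genai_llm_pipeline_generate(pipeline, prompt, config, &cb, /* output = */ NULL, ...);
 */
```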
The same issue exists with ov_genai_decoded_results_get_string: it does not allow extracting the full text from the decoded results.
But what if users want output without streaming? They need a way to get the full, untruncated output anyway.
I've created another PR, #1871.
One aspect is obtaining the required buffer size from ov_genai_decoded_results_get_string. Another is allowing the ov_genai_llm_pipeline_generate interface to take either the output or the streamer as an option.
Looks like Node.JS is not mandatory for precommit. So, merged as is.
### Details: - Required for GenAI JS API as GenAI will depend on C API after openvinotoolkit/openvino.genai#1778
… sufficient size for the output. (#1871)

Based on the discussion in #1778, I have adjusted the LLM pipeline C APIs to ensure the caller can determine the required sufficient size for the output string. `ov_genai_llm_pipeline_generate_decoded_results` has been removed and `ov_genai_llm_pipeline_generate` has been modified to return decoded results.

```C
ov_genai_decoded_results* results = NULL;
size_t output_size = 0;
char* output = NULL;  // the caller is responsible for allocating and freeing the memory

ov_genai_llm_pipeline_generate(pipeline, prompt, config, NULL, &results);
ov_genai_decoded_results_get_string(results, NULL, &output_size);  // called with NULL output to determine the required buffer size
output = (char*)malloc(output_size);
// check the allocation ...
ov_genai_decoded_results_get_string(results, output, &output_size);  // get the actual output string
// print and free
```

Another change allows `ov_genai_llm_pipeline_generate` to take either `results` or `streamer` as an option, but one of them must not be NULL. This facilitates users who only need the streamer functionality, preventing them from allocating excessive unnecessary memory.
I've added a test for the C API sample chat_sample_c, following the discussion in #1778.
I've added a C API for the LLMPipeline class. The purpose of the C API is to enable the use of cgo to build a Go wrapper, which will serve as the backend for Ollama.
Closes #888