-
Notifications
You must be signed in to change notification settings - Fork 13.9k
Add OpenVINO backend #15307
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
wine99
wants to merge
179
commits into
ggml-org:master
Choose a base branch
from
ravi9:dev_backend_openvino
base: master
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
+5,725
−34
Open
Add OpenVINO backend #15307
Changes from 165 commits
Commits
Show all changes
179 commits
Select commit
Hold shift + click to select a range
5ef91fc
Add ggml-openvino base files
YangleiZouIntel 2e42b6c
add openvino as optional backend for Llama.cpp ggml
zhanmyz bfba5b9
* Configure the device(default CPU) that uses OpenVINO to compile th…
zhanmyz e3b1386
Solve the issue of abnormal model output caused by using OpenVINO ADD…
zhanmyz 9ba111e
Add OpenVINO MUL operator to GGML of Llama.cpp.
zhanmyz 543d929
Add compile options
zhanmyz 684086c
add OpenVINO frontend convert process steps
zhanmyz e71e41a
add get openvino available ops function
zhanmyz 311674e
Add PoC of integration of openvino frontend. Main changes: ggml-ov-fr…
yumengbo 51ecdf4
Implement GgmlOvDecoder. Add dump functions.
yumengbo 727246e
Convert subgraph with add, sub, mul, div op to ov model and do infer …
yumengbo 0802220
Add GGML_OV_FRONTEND option. Add readme.
yumengbo d6c148b
Change output for infer request to set output tensor. Support scale, …
yumengbo 8769d9e
add GET_ROWS operator of OpenVINO to GGML of llama.cpp
zhanmyz e4754ab
Update build.md and add operation mapping(GGML to OpenVINO)
zhanmyz 76ee005
add the rms_norm operator implemented using OpenVINO to the GGML back…
zhanmyz 2a86e7b
Fix issue for output memory copy of infer request
yumengbo 0689ee3
Change to implementation following pytorch frontend
yumengbo 1c301ce
Add support for UNARY SILU op . Fix pytorch impl bugs.
yumengbo 6028316
Support Softmax op
yumengbo 213761e
Support Softmax op
yumengbo b0406c2
Support ROPE op.
yumengbo 9b4d445
Add support for RMS_NORM OP
zhanmyz 60e899c
Add MUL_MAT,CPY,CONT as operators implemented in OpenVINO for GGML ba…
zhanmyz 5749e82
Move CPY from GGML OV Backend to OV Frontend
zhanmyz ad57734
add implementation of MUL_MAT, CPY, CONT of GGML ops using OV ops
zhanmyz 015f11e
add implementation of CPY when the output tensor is non-contiguous
zhanmyz cc3066b
add tmp source code files
zhanmyz 81f8c75
Execute singel CONT operator is OK
zhanmyz 28acc0e
Execute CONT & VIEW operators in OV Frontend is OK
zhanmyz 3b4f3ac
OV Frontend supports GET_ROWS/RMS_NORM/MUL/MUL_MAT graph conversion o…
zhanmyz dceeefa
OV Frontend supports GET_ROWS/RMS_NORM/MUL/MUL_MAT/ROPE/SCALE/SOFTMAX…
zhanmyz b794841
Change the input parameter shape of CONT operator
zhanmyz f4bb7d2
Change the input and ouput node shape of MUL_MAT operator
zhanmyz e077a7c
Change the input and ouput node shape of MUL_MAT operator
zhanmyz 171deac
change CONT and MULMAT input node shape
zhanmyz a0672d3
All adjacent ops can conversion but calculation result is wrong and n…
zhanmyz f508c15
1. All operators implemented using OpenVINO can be successfully execu…
zhanmyz ee35e8c
1. Update the implementation of CPY node when it's non-contiguous
zhanmyz 0ee0781
Minor Update
zhanmyz a9f6725
Try to add VIEW node to OV Frontend and have some issues that need to…
zhanmyz a6da47b
1. In the Prompt process and predict first token stage, the PERMUTE n…
zhanmyz 952dbc4
add debug info
zhanmyz 1b7ed3d
Process Prompt and predict first token is OK
zhanmyz cc21645
1. Solve the AC issue of Permute+VIEW and MULMAL issue in the phase o…
zhanmyz bdd0962
1. Delete some comments
zhanmyz 3f40786
* Use find_package in CMake to configure OpenVINO
wine99 d75fee7
change op mappings to list in openvino_supports_op
wine99 c53e290
2nd+ token correct by fix CPY in OV, remove single op backend compute…
wine99 d424199
Arbitrary token len (>32) work; Fix bug in mulmat
wine99 87f691d
FEAT: do PERMUTE eagerly
wine99 dafb10e
FEAT: Add interleaved mode for ROPE
wine99 70c234a
REFACTOR: support weigts as constant
wine99 216fdc2
STYLE: minor refactor
wine99 3314ef0
PERF: share const nodes for weights for diff infer
wine99 f27e526
BUILD: update build doc, add cmake preset, add CACHE_DIR env var
wine99 9d3ee0b
FEAT: improve debug capability
wine99 18be2ca
PERF: compile once (dynamic graph + cache)
wine99 4e1d196
Rebase - Bring up to date and fix build process
virajwad ce5df66
fix build error
wine99 0036a21
FIX: backend buffer type issue
wine99 79449d7
STYLE: clang-format
wine99 3e8e678
FEAT: Add all conversion code from ov side
wine99 3a3d776
PERF: favor low precision matmul
wine99 f881c58
STYLE and minor REFACTOR
wine99 a3be048
FIX: Re-add tensor names in cgraph, Add another case for RESHAPE
wine99 7ce1783
FIX: input shape of KQ_mask
wine99 ea520a3
PERF: add weight constant in parallel
wine99 264011b
FIX: set_max_token_len
wine99 f6de4c1
PERF: use Slice+Concat in writing cache_v
wine99 c632aed
Update build doc
wine99 3427daa
Add cgraph tensor output name to OV op name
wine99 c6d3e92
Update openvino build instructions
ravi9 aa2f495
Add initial NPU support
wine99 5984be4
draft NPU support version 2: prefill + kvcache
wine99 1a9411f
NPU support version 2: prefill + kvcache
wine99 ee36029
Change due to ggml cgraph changes, not correct yet
wine99 51f7698
Change due to ggml cgraph changes, llama-3.2 CPU work
wine99 f922d18
Add AMD64 to CMakeLists
wine99 0fc9477
Change due to ggml cgraph changes, all device work
wine99 43d57f3
Refactor: clean, fix warning
wine99 e8ce78f
Update clang-format
wine99 a63cfb2
Statful transformation for CPU GPU
wine99 389d3c4
Add SwiGLU
wine99 0200596
Fuse to SDPA
wine99 25d5197
Replace Concat with Broadcast in MulMat for GQA
wine99 d30f6f7
Pull out indices creation for kv cache update
wine99 93ac991
Refactor: remove past_token_len from extra_inputs
wine99 5de7da5
Fix Phi3 SwiGLU and SoftMax
wine99 2df2e39
Pull out sin cos from rope
wine99 bc2bfaf
Reduce memory: free ov weights node after graph conversion
wine99 01b858a
Fix CPY due to cgraph change
wine99 c5313d3
Added OpenVINO CI/CD. Updated docs
ravi9 2a8d318
Fix llama-cli
wine99 e0c370c
Fix Phi3 ROPE; Add test-backend-ops
wine99 2e5ebb7
Fix NPU
wine99 d388d7e
Fix llama-bench; Clang-format
wine99 3a5eb95
Fix llama-perplexity
wine99 407114f
temp. changes for mark decomp
cavusmustafa 5f47e95
matmul in fp32
wine99 9e34ea4
mulmat input conversion fix
cavusmustafa 1ab7de3
mulmat type conversion update
cavusmustafa cc7c17b
add mark decomp pass
cavusmustafa e2cfd6e
Revert changes in fuse_to_sdpa
wine99 4dced3a
Update build.md
ravi9 d693fda
Fix test-backend-ops
wine99 164bfeb
Skip test-thread-safety; Run ctest only in ci/run.sh
wine99 42577f7
Use CiD for NPU
wine99 2197129
Optimize tensor conversion, improve TTFT
wine99 fb758ff
Support op SET_ROWS
wine99 2541b9d
Fix NPU
wine99 7424136
Remove CPY
wine99 49c75c2
Fix test-backend-ops
wine99 006f6e8
Minor updates for raising PR
wine99 c7f165a
Perf: RMS fused to OV internal RMS op
wine99 bcb7053
Fix after rebasing
wine99 04dba82
Change openvino device_type to GPU; Enable flash_attn
wine99 13c0d71
Update supports_buft and supports_op for quantized models
wine99 f7f9273
Add quant weight conversion functions from genai gguf reader
wine99 604adc3
Quant models run with accuracy issue
wine99 b35884a
Fix accuracy: disable cpu_repack
wine99 85247b6
Fix CI; Disable test-backend-ops
wine99 e1235b9
Fix Q4_1
wine99 63792a1
Fix test-thread-safety
wine99 e1f9aab
Fix test-backend-ops: Treat quantized tensors as weights
wine99 715fd26
Add NPU Q4_0 support
wine99 ca5ceb7
NPU perf: eliminate zp
wine99 9623246
Dequantize q4_1 q4_k q6_k for NPU
wine99 7a0b852
Add custom quant type: q8_1_c, q4_0_128
wine99 c02d362
Set m_is_static=false as default in decoder
wine99 e7a3ab9
Simpilfy translation of get_rows
wine99 404fac9
Fix after rebasing
wine99 dc2eeb4
Improve debug util; Eliminate nop ReshapeReshape
wine99 c3b8963
STYLE: make get_types_to_requant a function
wine99 3cd3def
Support BF16 model
wine99 a482f40
Fix NPU compile
wine99 bd862a0
WA for npu 1st token acc issue
wine99 4eb3819
Apply EliminateZP only for npu
wine99 7f69755
Add GeGLU
wine99 244ec02
Fix Hunyuan
wine99 29b4e72
Support iSWA
wine99 51f9bea
Fix NPU accuracy
wine99 dd416f7
Fix ROPE accuracy when freq_scale != 1
wine99 72833f2
Minor: not add attention_size_swa for non-swa model
wine99 0e50ed9
Minor refactor
wine99 cee3982
Add Q5_K to support phi-3-q4_k_m
wine99 8825c3d
Requantize Q6_K (gs16) to gs32 on GPU
wine99 3e18759
Fix after rebasing
wine99 47e253a
Always apply Eliminate_ZP to fix GPU compile issue on some platforms
wine99 3dc9a72
kvcachefusion support
cavusmustafa 61d007d
env variable GGML_OPENVINO_DISABLE_SDPA_OPTIMIZATION added
cavusmustafa ba62f7b
Fix for Phi3
cavusmustafa de961a0
Fix llama-cli (need to run with --no-warmup)
wine99 fa18b7b
Fix add_sliced_mask; Revert mulmat, softmax; Remove input attention_s…
wine99 4c1f60f
fix after rebasing
wine99 8cc6cd0
Fix llama-3-8b and phi3-mini q4_0 NPU
wine99 8af46c4
Update to OV-2025.3 and CMakeLists.txt
ravi9 509c5f4
Add OV CI cache
wine99 cfd40a9
Apply CISC review and update CI to OV2025.3
ravi9 4c280cc
Update CI to run OV dep install before build
ravi9 3feac74
Update OV dockerfile to use OV2025.3 and update build docs
ravi9 7ac02a8
Style: use switch in supports_ops
wine99 7c8a4a5
Style: middle ptr and ref align, omit optional struct keyword
wine99 0f97715
NPU Unify PD (#14)
wine99 d5038aa
Clean placeholders in ggml-openvino.cpp
wine99 e866ed0
Update .github/workflows/docker.yml
wine99 75c720a
NPU unify PD (handled internally)
wine99 0981dec
Update ggml-decoder.cpp
I-N-T-E-L 02eb109
Update ggml-decoder.cpp
I-N-T-E-L 546cabd
Update ggml-decoder.cpp
I-N-T-E-L 6b2153d
Update ggml-decoder.cpp
I-N-T-E-L 51167ab
Update ggml-decoder.cpp
I-N-T-E-L b8d0e2a
Update ggml-decoder.cpp
I-N-T-E-L 5070d2d
change graph to 4d, support multi sequences
wine99 6be0146
Fix llama-bench
wine99 bbecac0
Fix NPU
wine99 1c05c32
Merge pull request #17 from I-N-T-E-L/fix---unsetenv()
ynimmaga 5d433c8
Remove the second decoder for node. Moving the function into the mode…
zhaixuejun1993 33a5b45
Fix error for naive
zhaixuejun1993 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,134 @@ | ||
| ARG OPENVINO_VERSION_MAJOR=2025.3 | ||
| ARG OPENVINO_VERSION_FULL=2025.3.0.19807.44526285f24 | ||
| ARG UBUNTU_VERSION=24.04 | ||
|
|
||
| # Optional proxy build arguments - empty by default | ||
| ARG http_proxy= | ||
| ARG https_proxy= | ||
|
|
||
| ## Build Image | ||
| FROM ubuntu:${UBUNTU_VERSION} AS build | ||
|
|
||
| # Pass proxy args to build stage | ||
| ARG http_proxy | ||
| ARG https_proxy | ||
|
|
||
| RUN apt-get update && \ | ||
| apt-get install -y --no-install-recommends \ | ||
| ca-certificates \ | ||
| gnupg \ | ||
| wget \ | ||
| git \ | ||
| cmake \ | ||
| ninja-build \ | ||
| build-essential \ | ||
| libtbb12 \ | ||
| libcurl4-openssl-dev && \ | ||
| rm -rf /var/lib/apt/lists/* | ||
|
|
||
| # Install OpenVINO for Ubuntu 24.04 | ||
| ARG OPENVINO_VERSION_MAJOR | ||
| ARG OPENVINO_VERSION_FULL | ||
| RUN mkdir -p /opt/intel && \ | ||
| wget https://storage.openvinotoolkit.org/repositories/openvino/packages/${OPENVINO_VERSION_MAJOR}/linux/openvino_toolkit_ubuntu24_${OPENVINO_VERSION_FULL}_x86_64.tgz && \ | ||
| tar -xf openvino_toolkit_ubuntu24_${OPENVINO_VERSION_FULL}_x86_64.tgz && \ | ||
| mv openvino_toolkit_ubuntu24_${OPENVINO_VERSION_FULL}_x86_64 /opt/intel/openvino_${OPENVINO_VERSION_MAJOR} && \ | ||
| cd /opt/intel/openvino_${OPENVINO_VERSION_MAJOR} && \ | ||
| echo "Y" | ./install_dependencies/install_openvino_dependencies.sh && \ | ||
| cd - && \ | ||
| ln -s /opt/intel/openvino_${OPENVINO_VERSION_MAJOR} /opt/intel/openvino | ||
|
|
||
| ENV OpenVINO_DIR=/opt/intel/openvino | ||
|
|
||
| WORKDIR /app | ||
|
|
||
| COPY . . | ||
|
|
||
| # Build Stage | ||
| RUN bash -c "source ${OpenVINO_DIR}/setupvars.sh && \ | ||
| cmake -B build/ReleaseOV -G Ninja \ | ||
| -DCMAKE_BUILD_TYPE=Release \ | ||
| -DGGML_OPENVINO=ON && \ | ||
| cmake --build build/ReleaseOV -j$(nproc)" | ||
|
|
||
| # Copy all necessary libraries | ||
| RUN mkdir -p /app/lib && \ | ||
| find build/ReleaseOV -name '*.so*' -exec cp {} /app/lib \; && \ | ||
| find ${OpenVINO_DIR}/runtime/lib/intel64 -name '*.so*' -exec cp -P {} /app/lib \; 2>/dev/null || \ | ||
| find ${OpenVINO_DIR}/lib/intel64 -name '*.so*' -exec cp -P {} /app/lib \; | ||
|
|
||
| # Create runtime directories and copy binaries | ||
| RUN mkdir -p /app/full \ | ||
| && cp build/ReleaseOV/bin/* /app/full/ \ | ||
| && cp *.py /app/full \ | ||
| && cp -r gguf-py /app/full \ | ||
| && cp -r requirements /app/full \ | ||
| && cp requirements.txt /app/full \ | ||
| && cp .devops/tools.sh /app/full/tools.sh | ||
|
|
||
| ## Base Runtime Image | ||
| FROM ubuntu:${UBUNTU_VERSION} AS base | ||
|
|
||
| # Pass proxy args to runtime stage | ||
| ARG http_proxy | ||
| ARG https_proxy | ||
|
|
||
| RUN apt-get update \ | ||
| && apt-get install -y libgomp1 libtbb12 curl\ | ||
| && apt autoremove -y \ | ||
| && apt clean -y \ | ||
| && rm -rf /tmp/* /var/tmp/* \ | ||
| && find /var/cache/apt/archives /var/lib/apt/lists -not -name lock -type f -delete \ | ||
| && find /var/cache -type f -delete | ||
|
|
||
| COPY --from=build /app/lib/ /app/ | ||
|
|
||
| ### Full (all binaries) | ||
| FROM base AS full | ||
|
|
||
| ARG http_proxy | ||
| ARG https_proxy | ||
|
|
||
| COPY --from=build /app/full /app/ | ||
|
|
||
| WORKDIR /app | ||
|
|
||
| RUN apt-get update && \ | ||
| apt-get install -y --no-install-recommends \ | ||
| git \ | ||
| python3 \ | ||
| python3-venv \ | ||
| python3-pip && \ | ||
| python3 -m venv /ov-venv && \ | ||
| /ov-venv/bin/pip install --no-cache-dir --upgrade pip setuptools wheel && \ | ||
| /ov-venv/bin/pip install --no-cache-dir -r requirements.txt && \ | ||
| apt-get autoremove -y && \ | ||
| apt-get clean && \ | ||
| rm -rf /tmp/* /var/tmp/* && \ | ||
| find /var/cache/apt/archives /var/lib/apt/lists -not -name lock -type f -delete && \ | ||
| find /var/cache -type f -delete | ||
|
|
||
| ENTRYPOINT ["/bin/bash", "-c", "source /ov-venv/bin/activate && exec /app/tools.sh \"$@\"", "--"] | ||
|
|
||
|
|
||
| ### Light, CLI only | ||
| FROM base AS light | ||
|
|
||
| COPY --from=build /app/full/llama-cli /app/ | ||
|
|
||
| WORKDIR /app | ||
|
|
||
| ENTRYPOINT [ "/app/llama-cli" ] | ||
|
|
||
| ### Server, Server only | ||
| FROM base AS server | ||
|
|
||
| ENV LLAMA_ARG_HOST=0.0.0.0 | ||
|
|
||
| COPY --from=build /app/full/llama-server /app/ | ||
|
|
||
| WORKDIR /app | ||
|
|
||
| HEALTHCHECK CMD [ "curl", "-f", "http://localhost:8080/health" ] | ||
|
|
||
| ENTRYPOINT [ "/app/llama-server" ] |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,25 @@ | ||
| name: "Linux - Setup OpenVINO Toolkit" | ||
| description: "Setup OpenVINO Toolkit for Linux" | ||
| inputs: | ||
| path: | ||
| description: "Installation path" | ||
| required: true | ||
| version_major: | ||
| description: "OpenVINO major version (e.g., 2025.3)" | ||
| required: true | ||
| version_full: | ||
| description: "OpenVINO full version (e.g., 2025.3.0.19807.44526285f24)" | ||
| required: true | ||
|
|
||
| runs: | ||
| using: "composite" | ||
| steps: | ||
| - name: Setup OpenVINO Toolkit | ||
| id: setup | ||
| uses: ./.github/actions/unarchive-tar | ||
| with: | ||
| url: https://storage.openvinotoolkit.org/repositories/openvino/packages/${{ inputs.version_major }}/linux/openvino_toolkit_ubuntu24_${{ inputs.version_full }}_x86_64.tgz | ||
| path: ${{ inputs.path }} | ||
| type: z | ||
| strip: 1 | ||
|
|
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -700,6 +700,61 @@ jobs: | |
| -DGGML_SYCL_F16=ON | ||
| cmake --build build --config Release -j $(nproc) | ||
|
|
||
| ubuntu-24-cmake-openvino: | ||
| runs-on: ubuntu-24.04 | ||
|
|
||
| env: | ||
| # Make sure this is in sync with build-cache.yml | ||
| OPENVINO_VERSION_MAJOR: "2025.3" | ||
| OPENVINO_VERSION_FULL: "2025.3.0.19807.44526285f24" | ||
|
|
||
| steps: | ||
| - name: Clone | ||
| id: checkout | ||
| uses: actions/checkout@v4 | ||
|
|
||
| - name: ccache | ||
| uses: ggml-org/[email protected] | ||
| with: | ||
| key: ubuntu-24-cmake-openvino-no-preset-v1 | ||
| evict-old-files: 1d | ||
|
|
||
| - name: Dependencies | ||
| id: depends | ||
| run: | | ||
| sudo apt-get update | ||
| sudo apt-get install -y build-essential libcurl4-openssl-dev libtbb12 cmake ninja-build python3-pip | ||
|
|
||
| - name: Use OpenVINO Toolkit Cache | ||
| uses: actions/cache@v4 | ||
| id: cache-openvino | ||
| with: | ||
| path: ./openvino_toolkit | ||
| key: openvino-toolkit-v${{ env.OPENVINO_VERSION_FULL }}-${{ runner.os }} | ||
|
|
||
| - name: Setup OpenVINO Toolkit | ||
| if: steps.cache-openvino.outputs.cache-hit != 'true' | ||
| uses: ./.github/actions/linux-setup-openvino | ||
| with: | ||
| path: ./openvino_toolkit | ||
| version_major: ${{ env.OPENVINO_VERSION_MAJOR }} | ||
| version_full: ${{ env.OPENVINO_VERSION_FULL }} | ||
|
|
||
| - name: Install OpenVINO dependencies | ||
| run: | | ||
| cd ./openvino_toolkit | ||
| chmod +x ./install_dependencies/install_openvino_dependencies.sh | ||
| echo "Y" | sudo -E ./install_dependencies/install_openvino_dependencies.sh | ||
|
|
||
| - name: Build | ||
| id: cmake_build | ||
| run: | | ||
| source ./openvino_toolkit/setupvars.sh | ||
| cmake -B build/ReleaseOV -G Ninja \ | ||
| -DCMAKE_BUILD_TYPE=Release \ | ||
| -DGGML_OPENVINO=ON | ||
| cmake --build build/ReleaseOV --config Release -j $(nproc) | ||
|
|
||
| build-linux-cross: | ||
| uses: ./.github/workflows/build-linux-cross.yml | ||
|
|
||
|
|
@@ -1625,27 +1680,27 @@ jobs: | |
| GG_BUILD_VULKAN=1 bash ./ci/run.sh ~/results/llama.cpp ~/mnt/llama.cpp | ||
|
|
||
| ggml-ci-arm64-cpu-kleidiai: | ||
| runs-on: ubuntu-22.04-arm | ||
|
|
||
| steps: | ||
| - name: Clone | ||
| id: checkout | ||
| uses: actions/checkout@v4 | ||
|
|
||
| - name: ccache | ||
| uses: ggml-org/[email protected] | ||
| with: | ||
| key: ggml-ci-arm64-cpu-kleidiai | ||
| evict-old-files: 1d | ||
|
|
||
| - name: Dependencies | ||
| id: depends | ||
| run: | | ||
| sudo apt-get update | ||
| sudo apt-get install -y build-essential libcurl4-openssl-dev | ||
|
|
||
| - name: Test | ||
| id: ggml-ci | ||
| run: | | ||
| GG_BUILD_KLEIDIAI=1 GG_BUILD_EXTRA_TESTS_0=1 bash ./ci/run.sh ./tmp/results ./tmp/mnt | ||
| runs-on: ubuntu-22.04-arm | ||
|
|
||
| steps: | ||
| - name: Clone | ||
| id: checkout | ||
| uses: actions/checkout@v4 | ||
|
|
||
| - name: ccache | ||
| uses: ggml-org/[email protected] | ||
| with: | ||
| key: ggml-ci-arm64-cpu-kleidiai | ||
| evict-old-files: 1d | ||
|
|
||
| - name: Dependencies | ||
| id: depends | ||
| run: | | ||
| sudo apt-get update | ||
| sudo apt-get install -y build-essential libcurl4-openssl-dev | ||
|
|
||
| - name: Test | ||
| id: ggml-ci | ||
| run: | | ||
| GG_BUILD_KLEIDIAI=1 GG_BUILD_EXTRA_TESTS_0=1 bash ./ci/run.sh ./tmp/results ./tmp/mnt | ||
|
|
||
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.