Merged
Changes from all commits
186 commits
9c55e5c
fix: check model pointer validity before use (#13631)
D2hugging May 19, 2025
60aea02
ggml : Fix missing backtrace on Linux (ggml/1228)
danielzgtg May 17, 2025
8b5e19a
ggml : fix apple OS check in ggml_print_backtrace (ggml/1229)
slaren May 19, 2025
6c35981
mnist: fix segmentation fault (ggml/1227)
JohannesGaessler May 19, 2025
d30cb5a
sync : ggml
ggerganov May 19, 2025
f71f40a
ci : upgraded oneAPI version in SYCL workflows and dockerfile (#13532)
Alcpz May 19, 2025
92ecdcc
mtmd : add vision support for llama 4 (#13282)
ngxson May 19, 2025
725f23f
sycl : backend documentation review (#13544)
Alcpz May 19, 2025
8960efd
Vulkan: Add f32 accumulator support to quantized mul mat to fix GLM4 …
0cc4m May 19, 2025
1dfbf2c
common : add load_progress_callback (#13617)
psocolovsky May 19, 2025
f7c9429
sycl : Overcoming workaround for mmap() allocation on Windows (#13482)
s-Nick May 20, 2025
f0adb80
CANN: Update CANN model support (#13162)
bachelor-dou May 20, 2025
e298d2f
kv-cache : add SWA support (#13194)
ggerganov May 20, 2025
c00a263
metal : fix typo in FA kernel comments (#13651)
ggerganov May 20, 2025
c9c64de
Set GLM4 blk.*.attn_output.weight, kqv_out-* matmul to GGML_PREC_F32 …
0cc4m May 20, 2025
4245e62
sycl: disable reorder for sycl mulmat (#13536)
sgeor255 May 20, 2025
759e37b
tests : avoid github urls due to throttling (#13654)
CISC May 20, 2025
b69f164
CUDA: skip fully masked-out KV in FA vec kernel (#13584)
JohannesGaessler May 20, 2025
a4090d1
llama : remove llama_kv_cache_view API + remove deprecated (#13653)
ggerganov May 20, 2025
be02396
model : fix llama4 graph (#13663)
ggerganov May 20, 2025
b7a1746
mtmd-helper : bug fix to token batching in mtmd (#13650)
l3utterfly May 20, 2025
fb1cab2
vulkan: fix warnings (#13626)
netrunnereve May 20, 2025
3398305
musa: Upgrade MUSA SDK version to rc4.0.1 and use mudnn::Unary::IDENT…
yeahdongcn May 21, 2025
b44890d
model : disable SWA for Phi models (#13676)
ggerganov May 21, 2025
797f2ac
kv-cache : simplify the interface (#13660)
ggerganov May 21, 2025
42158ae
server : fix first message identification (#13634)
doringeman May 21, 2025
0d5c742
server : Add the endpoints /api/tags and /api/chat (#13659)
R-Dson May 21, 2025
cf4cb59
ggml : add ggml_gelu_erf() (#13667)
ngxson May 21, 2025
eb0f5c2
gguf-py : display the invalid gguf type (#13687)
emmanuel-ferdman May 21, 2025
2aa777d
examples : switch retrieval to llama_encode (#13685)
CISC May 21, 2025
c76532e
convert : add qwen2vl support for unsloth merges (#13686)
antichristHater May 21, 2025
5fbfe38
server : improve error reporting (#13680)
ggerganov May 21, 2025
8e186ef
hparams : support models for which all layers use SWA (#13682)
ggerganov May 21, 2025
d643bb2
releases : build CPU backend separately (windows) (#13642)
slaren May 21, 2025
edbf42e
opencl: fix couple crashes (#12795)
May 21, 2025
a4e8912
opencl: Add support for multiple devices (#12622)
May 21, 2025
6b56a64
SYCL: Avoid using with SYCL-Graph for unsupported nodes (#13587)
May 22, 2025
d394a9a
sycl : Remove waits from function calls (#13702)
s-Nick May 22, 2025
5be24af
gguf-py : correct charsmap parameter typing (#13701)
CISC May 22, 2025
cc74d5b
server : pad small embedding batches (#13692)
ggerganov May 22, 2025
ab86335
common: Include torch package for s390x (#13699)
taronaeo May 22, 2025
797990c
mtmd : add ultravox audio input (#13623)
ngxson May 22, 2025
8a1d206
tts : fix n_ubatch + make WavTokenizer cache-less (#13713)
ggerganov May 22, 2025
3079e9a
release : fix windows hip release (#13707)
slaren May 22, 2025
a190692
ggml-qnn: add Qualcomm QNN backend for GGML
jeffzhou2000 Feb 14, 2025
e935dca
ggml-qnn: santiy check
jeffzhou2000 Feb 15, 2025
63120a0
ggml-qnn: update script build-run-android.sh to compare peformance of…
jeffzhou2000 Feb 16, 2025
e39eff8
ggml-qnn: fix minor issue in test-backend-ops.cpp
jeffzhou2000 Feb 17, 2025
360e243
ggml-qnn: merge QNN RPC feature from https://github.com/zhouwg/kantv/…
jeffzhou2000 Feb 18, 2025
ebf2a04
ggml-qnn: sync from branch kantvai-ggmlqnn-npurpc
jeffzhou2000 Feb 18, 2025
76b7b24
ggml-qnn: a concise approach to offload mulmat to QNN backend(sync fr…
jeffzhou2000 Feb 19, 2025
ff75d64
ggml-qnn: remove redundant codes
jeffzhou2000 Feb 20, 2025
1e0992a
ggml-qnn: sync from branch kantvai-ggmlqnn-npurpc
jeffzhou2000 Feb 20, 2025
21da7b6
ggml-qnn: sync from branch kantvai-ggmlqnn-npurpc
jeffzhou2000 Feb 20, 2025
153265c
ggml-qnn: sync from branch kantvai-ggmlqnn-npurpc
jeffzhou2000 Feb 21, 2025
978ca54
ggml-qnn: add Qualcomm QNN backend for GGML
jeffzhou2000 Feb 14, 2025
a31dd75
ggml-qnn: merge QNN RPC feature from https://github.com/zhouwg/kantv/…
jeffzhou2000 Feb 18, 2025
cd276af
ggml-qnn: sync from branch kantvai-ggmlqnn-npurpc
jeffzhou2000 Feb 18, 2025
8bfdddf
ggml-qnn: a concise approach to offload mulmat to QNN backend(sync fr…
jeffzhou2000 Feb 19, 2025
69e4790
ggml-qnn: remove redundant codes
jeffzhou2000 Feb 20, 2025
0c5027b
ggml-qnn: sync from branch kantvai-ggmlqnn-npurpc
jeffzhou2000 Feb 20, 2025
aa4f410
ggml-qnn: sync from branch kantvai-ggmlqnn-npurpc
jeffzhou2000 Feb 20, 2025
2cdc65d
ggml-qnn: sync from branch kantvai-ggmlqnn-npurpc
jeffzhou2000 Feb 21, 2025
cf2ab29
ggml-qnn: fix a minior typo in internal doc
jeffzhou2000 Feb 23, 2025
b103c55
ggml-qnn: refine function ggml_qnn_create_general_tensor() to avoid c…
jeffzhou2000 Feb 23, 2025
a22e0aa
ggml-qnn: fix a minor typo in source code
jeffzhou2000 Feb 24, 2025
c5fd6ec
build: avoid ggml-qnn backend breaking other backend's builds
jeffzhou2000 Feb 24, 2025
735cdb3
ggml-qnn: remove redundant codes to make PR reviewers happy
jeffzhou2000 Feb 25, 2025
8d12dc1
ggml-qnn: refine code format
jeffzhou2000 Feb 25, 2025
c6535a1
ggml-qnn: offload quantized type mulmat to QNN backend
jeffzhou2000 Feb 26, 2025
9393fa2
ggml-qnn: refine source code structure to make code more clearly
jeffzhou2000 Feb 27, 2025
91e94da
ggml-qnn: enable release build with necessary logs to make reviewers …
jeffzhou2000 Feb 27, 2025
7a12df9
ggml-qnn: enable all quantize type with 2d mulmat
jeffzhou2000 Feb 27, 2025
9a82567
ggml-qnn: enable log output of GGMLQNN_LOG_INFO in command line mode …
jeffzhou2000 Feb 28, 2025
abf6b72
ggml-qnn: Windows port --- step2
jeffzhou2000 Feb 28, 2025
53bfce0
ggml-qnn: merge UT code and corresponding script from local dev branc…
jeffzhou2000 Mar 2, 2025
318fe52
ggml-qnn: merge ggml_qnn_mul_mat_4d from local dev branch to make wor…
jeffzhou2000 Mar 2, 2025
a520f1b
ggml-qnn: submit AI-assisted ggml_qnn_mul_mat_4d(not worked currently…
jeffzhou2000 Mar 2, 2025
42339d6
ggml-qnn: AI-assisted ggml_qnn_mul_mat_4d by Grok 3 --- step2
jeffzhou2000 Mar 2, 2025
1692920
ggml-qnn: AI-assisted ggml_qnn_mul_mat_4d by Grok 3 --- step3
jeffzhou2000 Mar 2, 2025
fbcab48
ggml-qnn: AI-assisted ggml_qnn_mul_mat_4d by Grok 3 --- step4
jeffzhou2000 Mar 2, 2025
d1a2eeb
ggml-qnn: AI-assisted ggml_qnn_mul_mat_4d by Grok 3 --- step5
jeffzhou2000 Mar 2, 2025
cd80ec2
ggml-qnn: AI-assisted ggml_qnn_mul_mat_4d by Grok 3 --- step6
jeffzhou2000 Mar 2, 2025
99ee1b5
ggml-qnn: AI-assisted ggml_qnn_mul_mat_4d by Grok 3 --- step7
jeffzhou2000 Mar 2, 2025
fac1422
ggml-qnn: AI-assisted ggml_qnn_mul_mat_4d by Grok 3 --- step8
jeffzhou2000 Mar 2, 2025
b18325e
ggml-qnn: AI-assisted ggml_qnn_mul_mat_4d by Grok 3 --- good in step9
jeffzhou2000 Mar 2, 2025
c85c847
ggml-qnn: AI-assisted ggml_qnn_mul_mat_4d by Grok 3 --- narrow down t…
jeffzhou2000 Mar 2, 2025
f02d81a
ggml-qnn: AI-assisted ggml_qnn_mul_mat_4d by Grok 3 --- step10
jeffzhou2000 Mar 2, 2025
b5d19d1
ggml-qnn: AI-assisted ggml_qnn_mul_mat_4d by Grok 3 --- narrow down t…
jeffzhou2000 Mar 2, 2025
08f0b68
ggml-qnn: AI-assisted ggml_qnn_mul_mat_4d by Grok 3 --- step11
jeffzhou2000 Mar 2, 2025
c9c4f61
ggml-qnn: AI-assisted ggml_qnn_mul_mat_4d by Grok 3 --- both ok in st…
jeffzhou2000 Mar 2, 2025
0bf8d73
ggml-qnn: AI-assisted ggml_qnn_mul_mat_4d by Grok 3 ---finalizing ver…
jeffzhou2000 Mar 2, 2025
02a4cd3
ggml-qnn: refine ggml_qnn_mul_mat and ggml_qnn_general_node according…
jeffzhou2000 Mar 2, 2025
ad8d974
ggml-qnn: remove no-needed comments
jeffzhou2000 Mar 2, 2025
c40837d
ggml-qnn: Windows port --- step3
jeffzhou2000 Mar 3, 2025
7b09b8d
ggml-qnn: remove un-needed function
jeffzhou2000 Mar 4, 2025
f8c2f82
ggml-qnn:rebase to upstream
jeffzhou2000 Mar 4, 2025
0adece2
ggml-qnn: fix a minior issue during rebase to upstream
jeffzhou2000 Mar 4, 2025
3562385
ggml-qnn: update script according to https://github.com/ggml-org/llam…
jeffzhou2000 Mar 4, 2025
145d5c9
ggml-qnn: fix a minior issue in ggmlqnn_create_general_tensor()
jeffzhou2000 Mar 4, 2025
08a8dc5
ggml-qnn: active member variable _device_id in class qnn_instance
jeffzhou2000 Mar 4, 2025
198fa0a
ggml-qnn: refine ggml_qnn_general_node and ggml_qnn_mul_mat to make c…
jeffzhou2000 Mar 4, 2025
e64e2b9
ggml-qnn: Windows port --- step4
jeffzhou2000 Mar 6, 2025
6cb26f4
ggml-qnn: Windows port -- step5
jeffzhou2000 Mar 7, 2025
567c1d2
ggml-qnn: WoA(Windows on ARM) -- step6
jeffzhou2000 Mar 8, 2025
27d3a7f
ggml-qnn: rebase to upstream
jeffzhou2000 Mar 9, 2025
0d8a2a7
ggml-qnn: pr to upstream
jeffzhou2000 Mar 11, 2025
ce322d8
ggml-qnn: rebase to upstream
jeffzhou2000 Mar 18, 2025
0e9013a
ggml-qnn: self code-review
jeffzhou2000 Mar 18, 2025
3e279c7
ggml-qnn: rebase upstream
jeffzhou2000 Mar 19, 2025
6a2a205
ggml-qnn: add approach through Hexagon cDSP
jeffzhou2000 Mar 22, 2025
7923cca
ggml-qnn: refine general approach through Hexagon cDSP
jeffzhou2000 Mar 23, 2025
05a5967
ggml-qnn: refine the entire ggml-qnn.cpp to make code more clear
jeffzhou2000 Mar 24, 2025
a318d90
ggml-qnn: refine the entire ggml-qnn.cpp to make code more clear
jeffzhou2000 Mar 24, 2025
a3b9dd8
ggml-qnn: add build script for libggmlop_skel.so
jeffzhou2000 Mar 24, 2025
8060bb4
ggml-qnn: remove redundant functions in this PR and make codes more c…
jeffzhou2000 Mar 25, 2025
24ebb53
ggml-qnn: original ggml_compute_forward_add and ggml_compute_forward_…
jeffzhou2000 Mar 25, 2025
dd72238
ggml-qnn: modify build-run-android.sh to verify mulmat and validate m…
jeffzhou2000 Mar 25, 2025
5f04a3a
ggml-qnn: make host code(ggml-qnn.cpp) more clear and more stable
jeffzhou2000 Mar 26, 2025
7de9a23
ggml-qnn: refine code according to self code-review and make code mor…
jeffzhou2000 Mar 26, 2025
be10a78
ggml-qnn: offload more ggml op to Hexagon cDSP
jeffzhou2000 Mar 27, 2025
59fe066
ggml-hexagon: code on AP(arm-cpu) side is stable now
jeffzhou2000 Mar 28, 2025
b07e3cd
ggml-hexagon: optimize GGML_OP_ADD on cDSP side
jeffzhou2000 Mar 28, 2025
2a6a6a0
ggml-hexagon: simplify hexagon-kernel build logic in CMakeLists.txt
jeffzhou2000 Mar 29, 2025
1a0504e
ggml-hexagon: release ggml-hexagon v0.98
jeffzhou2000 Mar 29, 2025
752b8b5
ggml-hexagon: release ggml-hexagon v0.99
jeffzhou2000 Mar 29, 2025
48452c2
ggml-hexagon: try to offload q6_k mulmat to cDSP
jeffzhou2000 Mar 29, 2025
5655c22
ggml-hexagon: fix minior issue in ggml-hexagon.cpp after self code-re…
jeffzhou2000 Mar 29, 2025
e349d20
ggml-hexagon: check validation of ggml-hexagon.cfg before create appr…
jeffzhou2000 Mar 30, 2025
7649509
ggml-hexagon: fix all compiler warnings in ggml-hexagon.cpp
jeffzhou2000 Mar 30, 2025
a7a1d33
ggml-hexagon: enable only one backend device for HWACCEL_CDSP and ena…
jeffzhou2000 Mar 31, 2025
4cefb8f
ggml-hexagon: rpc ion memory pool and test-backend-ops works fine in …
jeffzhou2000 Mar 31, 2025
073cc9b
ggml-hexagon: make comprision of mulmat performance between HWACCEL_Q…
jeffzhou2000 Mar 31, 2025
3ad316e
ggml-hexagon: release ggml-hexagon v1.00
jeffzhou2000 Mar 31, 2025
5b9167d
ggml-hexagon: rebase to upstream
jeffzhou2000 Apr 1, 2025
ca45d04
ggml-hexagon: check configuration of enable_rpc_dma_mempool in functi…
jeffzhou2000 Apr 1, 2025
7474110
ggml-hexagon: uniform rpc_ion_memsize and rpc_ion_usage between HWACC…
jeffzhou2000 Apr 1, 2025
978e8db
ggml-hexagon: make buffer mechanism more clear in HWACCEL_CDSP approach
jeffzhou2000 Apr 1, 2025
34daad4
ggml-hexagon: add perf function in hexagon kernerls on cDSP side
jeffzhou2000 Apr 2, 2025
2cacf87
ggml-hexagon: fix a stupid issue of why set rpc latency failure and i…
jeffzhou2000 Apr 2, 2025
cec7ae6
ggml-hexagon: make helper function ggmlhexagon_get_timestring() threa…
jeffzhou2000 Apr 2, 2025
33bac5b
ggml-hexagon: fix a typo in ggml-hexagon.cpp
jeffzhou2000 Apr 2, 2025
e88761e
ggml-hexagon: list all known todo and fixme tasks in ggml-hexagon.cpp
jeffzhou2000 Apr 2, 2025
0345da5
ggml-hexagon: fix units MB -> MiB
jeffzhou2000 Apr 2, 2025
32ab3f2
ggml-hexagon: try to make ggml-hexagon backend works fine in a standa…
jeffzhou2000 Apr 3, 2025
2a4fa3f
ggml-hexagon: remove reduament code and make debug log more clear
jeffzhou2000 Apr 3, 2025
6f72449
ggml-hexagon: add gemma-3-4b-it-Q8_0.gguf to verify q8_0 mulmat on cDSP
jeffzhou2000 Apr 3, 2025
3e5b4ab
ggml-hexagon:add skeleton code of offload GGML_OP_SOFT_MAX/GGML_OP_RM…
jeffzhou2000 Apr 3, 2025
f8704f4
ggml-hexagon: release ggml-dsp v0.60 on cDSP side
jeffzhou2000 Apr 4, 2025
6c94b3a
ggml-hexagon: merge build logic in kernels/Makefile to ggml-hexagon/C…
jeffzhou2000 Apr 5, 2025
9aea8ac
ggml-hexagon: fix a typo in ggml-hexagon.cpp
jeffzhou2000 Apr 5, 2025
232bd29
ggml-hexagon: uniform NDEBUG usage in ggml-hexagon.cpp and ggml-dsp.c
jeffzhou2000 Apr 6, 2025
2090679
ggml-hexagon: add profiler feature for purpose of visualize NPU perfo…
jeffzhou2000 Apr 7, 2025
c35d7c1
ggml-hexagon: remove so-called dma memory pool to avoid confusion and…
jeffzhou2000 Apr 8, 2025
1d140a7
ggml-hexagon: make function ggmlhexagon_init_rpcmempool in ggml-hexag…
jeffzhou2000 Apr 8, 2025
aaafc75
ggml-hexagon: fix potential resource leak in class hexagon_profiler
jeffzhou2000 Apr 8, 2025
f83535b
ggml-hexagon: enable multi-threading feature on cDSP side
jeffzhou2000 Apr 8, 2025
670cb9d
ggml-hexagon: upgrade QNN SDK to v2.33.0.250327
jeffzhou2000 Apr 9, 2025
8c3f88a
ggml-hexagon: fix typo in ggml-hexagon.cpp
jeffzhou2000 Apr 9, 2025
ea95d1d
ggml-dsp: probe QuRT RTOS information in function ggmlop_dsp_open
jeffzhou2000 Apr 9, 2025
afd7099
ggml-hexagon: setting enable_rpc_ion_mempool to 1 and make test-backe…
jeffzhou2000 Apr 10, 2025
d8b6340
ggml-hexagon: check whether user's specified htp arch is valid in CMa…
jeffzhou2000 Apr 10, 2025
644a9ea
ggml-hexagon: sync with upstream
jeffzhou2000 Apr 11, 2025
59640a5
ggml-hexagon: refine pinned-memory feature
jeffzhou2000 Apr 11, 2025
26d498f
ggml-hexagon: refine build system in ggml-hexagon
jeffzhou2000 Apr 11, 2025
55e9044
ggml-hexagon: remove redundant code in struct ggml_backend_hexagon_bu…
jeffzhou2000 Apr 11, 2025
5d48d7e
ggml-hexagon: upgrade Android NDK to android-ndk-r28
jeffzhou2000 Apr 11, 2025
fa825e4
ggml-dsp: split ggml-dsp.c into multiple files and cleanup
jeffzhou2000 Apr 11, 2025
9e12dfd
ggml-dsp: refine ggml-dsp and make ggml-dsp more clear
jeffzhou2000 Apr 12, 2025
148f674
ggml-hexagon: fix a minior issue in dev ops
jeffzhou2000 Apr 12, 2025
ae07b3d
ggml-hexagon: fix a build issue in CI
jeffzhou2000 Apr 12, 2025
8c9bf8f
ggml-dsp: cleanup code
jeffzhou2000 Apr 15, 2025
7e0a440
ggml-hexagon: sync with upstream
jeffzhou2000 Apr 15, 2025
8706d83
ggml-dsp: cleanup code
jeffzhou2000 Apr 16, 2025
5279547
ggml-dsp:refine ggmlhexagon_dsp_add_f32
jeffzhou2000 Apr 16, 2025
1eee593
ggml-dsp: refine logic of thread_counts
jeffzhou2000 Apr 17, 2025
d48f16e
ggml-hexagon: release v1.06 and ready for code review
jeffzhou2000 Apr 17, 2025
14d00f1
ggml-dsp: make GGML_OP_ADD more faster on cDSP side
jeffzhou2000 Apr 19, 2025
5b84e56
ggml-hexagon: sync from project kantv(make ggml-hexagon backend can w…
jeffzhou2000 Apr 24, 2025
2ebfa2d
sync with upstream llama.cpp and sync ggml-hexagon.cpp from project k…
jeffzhou2000 Apr 29, 2025
2b9d91f
sync with upstream
jeffzhou2000 May 7, 2025
addfaaf
sync with upstream
jeffzhou2000 May 10, 2025
947e43a
ggml-hexagon: upgrade QNN SDK to v2.34.0.250424
jeffzhou2000 May 11, 2025
32fd16a
sync with upstream
jeffzhou2000 May 16, 2025
cc46b70
ggml-hexagon: sync from project kantv(fix a long-term issue which int…
jeffzhou2000 May 17, 2025
0389cf3
ggml-hexagon: sync with upstream llama.cpp
jeffzhou2000 May 23, 2025
2 changes: 1 addition & 1 deletion .devops/intel.Dockerfile
@@ -1,4 +1,4 @@
-ARG ONEAPI_VERSION=2025.0.0-0-devel-ubuntu22.04
+ARG ONEAPI_VERSION=2025.1.1-0-devel-ubuntu24.04

## Build Image

15 changes: 4 additions & 11 deletions .devops/musa.Dockerfile
@@ -1,10 +1,10 @@
ARG UBUNTU_VERSION=22.04
# This needs to generally match the container host's environment.
-ARG MUSA_VERSION=rc3.1.1
+ARG MUSA_VERSION=rc4.0.1
# Target the MUSA build image
-ARG BASE_MUSA_DEV_CONTAINER=mthreads/musa:${MUSA_VERSION}-devel-ubuntu${UBUNTU_VERSION}
+ARG BASE_MUSA_DEV_CONTAINER=mthreads/musa:${MUSA_VERSION}-mudnn-devel-ubuntu${UBUNTU_VERSION}

-ARG BASE_MUSA_RUN_CONTAINER=mthreads/musa:${MUSA_VERSION}-runtime-ubuntu${UBUNTU_VERSION}
+ARG BASE_MUSA_RUN_CONTAINER=mthreads/musa:${MUSA_VERSION}-mudnn-runtime-ubuntu${UBUNTU_VERSION}

FROM ${BASE_MUSA_DEV_CONTAINER} AS build

@@ -21,21 +21,14 @@ RUN apt-get update && \
libcurl4-openssl-dev \
libgomp1

-COPY requirements.txt requirements.txt
-COPY requirements requirements
-
-RUN pip install --upgrade pip setuptools wheel \
-&& pip install -r requirements.txt
-
WORKDIR /app

COPY . .

# Use the default MUSA archs if not specified
RUN if [ "${MUSA_DOCKER_ARCH}" != "default" ]; then \
export CMAKE_ARGS="-DMUSA_ARCHITECTURES=${MUSA_DOCKER_ARCH}"; \
fi && \
-cmake -B build -DGGML_NATIVE=OFF -DGGML_MUSA=ON -DLLAMA_BUILD_TESTS=OFF -DGGML_BACKEND_DL=ON -DGGML_CPU_ALL_VARIANTS=ON ${CMAKE_ARGS} -DCMAKE_EXE_LINKER_FLAGS=-Wl,--allow-shlib-undefined . && \
+cmake -B build -DGGML_NATIVE=OFF -DGGML_MUSA=ON -DGGML_BACKEND_DL=ON -DGGML_CPU_ALL_VARIANTS=ON -DLLAMA_BUILD_TESTS=OFF ${CMAKE_ARGS} -DCMAKE_EXE_LINKER_FLAGS=-Wl,--allow-shlib-undefined . && \
cmake --build build --config Release -j$(nproc)

RUN mkdir -p /app/lib && \
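Note on the hunk above: the conditional only adds an architecture flag when `MUSA_DOCKER_ARCH` is set to something other than `default`. A minimal shell sketch of that logic outside Docker, assuming a hypothetical helper named `musa_cmake_args` and an illustrative arch value `21` (neither appears in this PR):

```shell
#!/bin/sh
# Sketch only: mirrors the Dockerfile's `[ "${MUSA_DOCKER_ARCH}" != "default" ]` check.
# musa_cmake_args is a hypothetical helper, not part of the PR.
musa_cmake_args() {
    arch="${1:-default}"   # treat a missing argument as "default"
    if [ "$arch" != "default" ]; then
        # Same flag the Dockerfile exports through CMAKE_ARGS
        printf '%s\n' "-DMUSA_ARCHITECTURES=$arch"
    fi
}

musa_cmake_args default   # prints nothing, so cmake falls back to its default archs
musa_cmake_args 21        # prints -DMUSA_ARCHITECTURES=21
```

With no extra flag emitted, the `cmake -B build ...` line in the hunk runs with an empty `${CMAKE_ARGS}`, which is what the "Use the default MUSA archs if not specified" comment describes.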
4 changes: 4 additions & 0 deletions .editorconfig
@@ -48,3 +48,7 @@ end_of_line = unset
charset = unset
trim_trailing_whitespace = unset
insert_final_newline = unset
+
+[tools/mtmd/miniaudio.h]
+trim_trailing_whitespace = unset
+insert_final_newline = unset
4 changes: 2 additions & 2 deletions .github/workflows/build.yml
@@ -351,7 +351,7 @@ jobs:

ubuntu-22-cmake-musa:
runs-on: ubuntu-22.04
-container: mthreads/musa:rc3.1.1-devel-ubuntu22.04
+container: mthreads/musa:rc4.0.1-mudnn-devel-ubuntu22.04

steps:
- name: Clone
@@ -899,7 +899,7 @@ jobs:
shell: bash

env:
-WINDOWS_BASEKIT_URL: https://registrationcenter-download.intel.com/akdlm/IRC_NAS/b380d914-366b-4b77-a74a-05e3c38b3514/intel-oneapi-base-toolkit-2025.0.0.882_offline.exe
+WINDOWS_BASEKIT_URL: https://registrationcenter-download.intel.com/akdlm/IRC_NAS/7cd9bba0-7aab-4e30-b3ae-2221006a4a05/intel-oneapi-base-toolkit-2025.1.1.34_offline.exe
WINDOWS_DPCPP_MKL: intel.oneapi.win.cpp-dpcpp-common:intel.oneapi.win.mkl.devel:intel.oneapi.win.dnnl:intel.oneapi.win.tbb.devel
ONEAPI_ROOT: "C:/Program Files (x86)/Intel/oneAPI"
steps: