Skip to content

Conversation

@jan-service-account
Copy link

Updates dev branch with latest release (b5121) from ggml-org/llama.cpp

taronaeo and others added 15 commits April 11, 2025 08:20
* ggml: fixes ggml-org#12846 compilation error

Signed-off-by: Aaron Teo <[email protected]>

Co-authored-by: Aleksei Nikiforov <[email protected]>

* ggml: add documentation for code change

Signed-off-by: Aaron Teo <[email protected]>

Co-authored-by: Aleksei Nikiforov <[email protected]>

* ggml: refactor to type-cast and update documentation

Signed-off-by: Aaron Teo <[email protected]>

Co-authored-by: Aleksei Nikiforov <[email protected]>

* ggml: update documentation to provide full issue link

Signed-off-by: Aaron Teo <[email protected]>

Co-authored-by: Aleksei Nikiforov <[email protected]>

---------

Co-authored-by: Aleksei Nikiforov <[email protected]>
* Llama-4 mapping

* remove hacky renaming

---------

Co-authored-by: Daniel Han <[email protected]>
This commit adds a check for the visionos build version used with vtool
in build-xcframework.sh. The script now checks the Xcode version and
determines whether to use "xros" or "visionos" for the build version.

This commit also uses xcrun for the vtool so that the version of vtool
in xcode command line tools is used instead of the one in the system
path.

Refs: ggml-org/whisper.cpp#2994 (comment)
* SYCL: Add fp16 support to some elementwise OP kernels

* remove comment

ggml-ci

* Use static_cast directly

* remove not needed cast from tanh

* Use static cast and remove unneeded castings

* Adjust device_support_op for unary OPs

* Use cast_data and typed_data struct to deduplicate casting code
* clip : use smart pointers

* fix warmup

* add forward declaration

* misisng include

* fix include (2)

* composite

* simplify batch ptr

* fix conflict
…2867)

* GLM-4-0414

* use original one

* Using with tensor map

* fix bug

* change order

* change order

* format with flask8
* support download from modelscope

* support login

* remove comments

* add arguments

* fix code

* fix win32

* test passed

* fix readme

* revert readme

* change to MODEL_ENDPOINT

* revert tail line

* fix readme

* refactor model endpoint

* remove blank line

* fix header

* fix as comments

* update comment

* update readme

---------

Co-authored-by: tastelikefeet <yuze.zyz@alibaba-inc/com>
The current usage of the SYCL-Graph extension checks for
the `sycl_ext_oneapi_graph` device aspect. However, it is also
possible to support `sycl_ext_oneapi_limied_graph` devices that
don't support update
…templates (ggml-org#12900)

* `tool-call`: don't call common_chat_params_init_hermes_2_pro when there aren't tools (or when there's a schema)

* test all chat formats w/o tools
* server : add VSCode's Github Copilot Chat support

* cont : update handler name
@jan-service-account jan-service-account merged commit 932e858 into dev Apr 12, 2025
15 checks passed
@jan-service-account jan-service-account deleted the update-dev-from-master-2025-04-12-00-08 branch April 12, 2025 00:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.