Releases · ngxson/llama.cpp

05 Mar 14:04

074c4fd

b4829

ci : add fetch-depth to xcframework upload (#12195)

This commit adds the fetch-depth: 0 option to the checkout action in the
build.yml workflow file (0 meaning that it fetches the complete
history). The default value is 1 when not specified which only fetches
the latest commit.

This is necessary to ensure that `git rev-list --count HEAD` counts the
total number of commits in the history. Currently because the default is
being used the name of the xcframework artifact is always
llama-b1-xcframework.

Assets 26

05 Mar 10:07

github-actions

b4827

fa31c43

b4827

ci : fix xcframework artifact tag (#12191)

The commit add the name parameter to the upload-artifact action to
ensure that the artifact is uploaded with the correct name.

The motivation for this is that currently the uploaded xcframework
is named as llama-b1-xcframework.zip. With this change the name of this
artifact should contain the build number like the other artifacts.

Assets 26

05 Mar 08:25

github-actions

b4826

3ccbfe5

b4826

ci : remove xframework upload (#12190)

* ci : remove xframework upload

This commit removes the upload of the xframework zip file as an
artifact.

The motivation for this change is that the xframework zip file is
currently being uploaded as part of strategy and will therefore be
attempted to be uploaded multiple times and will fail the build.

The uploading should be moved to somewhere else in the build to avoid
this.

* ci : add xcframework upload to macos-latest job

Assets 26

05 Mar 06:18

github-actions

b4824

a057897

b4824

llama : add xcframework build script (#11996)

* llama : add xcframework build script

This commit adds a script to build an XCFramework for Apple
ios, macos, visionos, and tvos platforms.

The generated XCFramework can then be added to a project and used in
the same way as a regular framework. The llama.swiftui example project
has been updated to use the XCFramework and can be started using the
following command:
```console
$ open examples/llama.swiftui/llama.swiftui.xcodeproj/
```

Refs: https://github.com/ggml-org/llama.cpp/issues/10747

* examples : remove llama.cpp (source dir ref) from project.pbxproj

This commit removes the reference to llama.cpp from the project.pbxproj
file since Package.swift has been removed.

* ci : updated build.yml to use build-xcframework.sh

* ci : add xcframework build to github releases

This commit adds the ability to create a GitHub release with the
xcframework build artifact.

* scripts : add apple app validation scripts

This commit adds scripts that can validate the iOS, macOS, tvOS, and
VisionOS applications. The scripts create a simple test app project,
copy the llama.xcframework to the test project, build and archive the
app, create an IPA from the archive, and validate the IPA using altool.

The motivation for this is to provide some basic validation and
hopefully avoid having to manually validate apps in Xcode.

* llama : remove Package.swift

This commit removes the Package.swift file, as we are now building an
XCFramework for the project.

* llama : remove Sources and spm-headers directories

* llama : use TargetConditionals.h for visionOS/tvOS

Assets 26

04 Mar 17:46

github-actions

b4823

5bbe6a9

b4823

ggml : portability fixes for VS 2017 (#12150)

* Add include files for std::min/max and std::toupper/tolower

* win32: move _USE_MATH_DEFINES before includes to ensure M_PI is defined

* Use GGML_RESTRICT instead of "restrict" keyword everywhere, and use "__restrict" in MSVC plain C mode

* win32: only use __restrict in MSVC if C11/C17 support is not enabled

---------

Co-authored-by: Marcus Groeber <[email protected]>

Assets 25

04 Mar 17:07

github-actions

b4821

56d7a9f

b4821

main: allow preloading conversation with -p and add -st / --single-tu…

Assets 25

04 Mar 07:05

github-actions

b4820

1a24c46

b4820

`server`: fix deadly typo in response_format.json_schema.schema handl…

Assets 25

03 Mar 22:07

github-actions

b4819

becade5

b4819

HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (#12032)

Adds GGML_HIP_ROCWMMA_FATTN and rocwmma header check
Adds rocWMMA support to fattn-wmma-f16

---

Signed-off-by: Carl Klemm <[email protected]>
Co-authored-by: Johannes Gäßler <[email protected]>
Co-authored-by: Ben Jackson <[email protected]>

Assets 25

03 Mar 17:06

github-actions

b4818

dfd6b2c

b4818

sync : ggml

ggml-ci

Assets 25

03 Mar 14:08

github-actions

b4806

c43af92

b4806

tts: add speaker file support (#12048)

* tts: add speaker file support

Signed-off-by: dm4 <[email protected]>

* tts: handle outetts-0.3

* tts : add new line in error message

---------

Signed-off-by: dm4 <[email protected]>
Co-authored-by: Georgi Gerganov <[email protected]>

Assets 25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Releases: ngxson/llama.cpp

b4829

Uh oh!

b4827

Uh oh!

b4826

Uh oh!

b4824

Uh oh!

b4823

Uh oh!

b4821

Uh oh!

b4820

Uh oh!

b4819

Uh oh!

b4818

Uh oh!

b4806

Uh oh!