LLMBasic

A minimal example of:

  • loading a model, including downloading weights
  • setting up a ChatSession
  • a simple UI for a back and forth session with the model

ChatModel defines a few parameters at the top of the file; edit them if you want to try a different model or system prompt.
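The load-then-chat flow described above can be sketched roughly as follows. This is a minimal sketch, not the code in ChatModel: it assumes the `loadModel(id:)` and `ChatSession` APIs from MLXLMCommon in the MLX Swift packages, and the model id, system prompt, and the `runChat` helper are hypothetical placeholders rather than this example's actual defaults.

```swift
import MLXLLM        // assumed to be needed so the LLM model types are registered
import MLXLMCommon   // assumed to provide loadModel(id:) and ChatSession

// Hypothetical parameters; the values defined at the top of ChatModel may differ.
let modelID = "mlx-community/Qwen3-4B-4bit"
let systemPrompt = "You are a helpful assistant."

// Hypothetical helper: load the model, then send one prompt and print the reply.
func runChat() async throws {
    // Downloads the weights from Hugging Face on first use, then loads the model.
    let model = try await loadModel(id: modelID)

    // The session keeps the conversation history between calls to respond(to:).
    let session = ChatSession(model, instructions: systemPrompt)

    let reply = try await session.respond(to: "What are two things to see in San Francisco?")
    print(reply)
}
```

Calling `respond(to:)` again on the same session continues the conversation, which is what the back-and-forth UI relies on.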

The goal of this example is to be a minimal application that loads and interacts with an LLM.

See LLMEval and MLXChatExample for more fully featured applications.

As always, you must set the Team on the LLMBasic target.

Some notes about the setup:

  • this downloads models from Hugging Face, so LLMBasic -> Signing & Capabilities has "Outgoing Connections (Client)" enabled in the App Sandbox
  • LLMs are large, so this uses the Increased Memory Limit entitlement on iOS to allow a higher memory limit on devices that have more memory
  • Memory.cacheLimit = 20 * 1024 * 1024 limits the buffer cache size to 20 MiB