Replies: 1 comment
You can run llama.cpp in server mode in the background and perform inference on your dataset by calling the API from a Python script. I believe this is the easiest way to accomplish the task.
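For illustration, here is a minimal sketch of that approach. It assumes a llama.cpp server is already running locally on port 8080 (e.g. started with `./server -m models/llama-2-7b-chat.Q4_K_M.gguf --port 8080`; the binary is called `llama-server` in newer builds) and that the `requests` and `datasets` packages are installed. The GGUF filename, port, dataset slice, and prompt format are placeholders to adapt to your setup:

```python
# Minimal sketch (assumptions: a llama.cpp server is already running on
# http://127.0.0.1:8080, started e.g. with
#   ./server -m models/llama-2-7b-chat.Q4_K_M.gguf --port 8080
# the GGUF filename and the prompt format are placeholders).
import requests
from datasets import load_dataset

SERVER_URL = "http://127.0.0.1:8080/completion"  # llama.cpp server's native completion endpoint

# Load a small slice of the wmt16 German-English test split from Hugging Face.
dataset = load_dataset("wmt16", "de-en", split="test[:10]")

for example in dataset:
    source = example["translation"]["de"]
    prompt = (
        "Translate the following German sentence to English.\n"
        f"German: {source}\nEnglish:"
    )

    # POST the prompt to the running server and read back the generated text.
    response = requests.post(
        SERVER_URL,
        json={"prompt": prompt, "n_predict": 128, "temperature": 0.0},
        timeout=300,
    )
    response.raise_for_status()
    print(response.json()["content"].strip())
```

Because the model runs in a separate server process, the Python script stays lightweight, and it is straightforward to attach a power-measurement tool to the server process alone.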
-
Prerequisites
Please answer the following questions for yourself before submitting an issue.
Feature Description
Please provide a detailed written description of what you were trying to do, and what you expected llama.cpp to do as an enhancement.

I want to run inference with llama.cpp (model: 7B-chat) locally on a dataset from Hugging Face (e.g. wmt16, link: https://huggingface.co/datasets/wmt16). How can I accomplish this? I could not find any specific answers on the internet.
Motivation
Please provide a detailed written description of reasons why this feature is necessary and how it is useful to llama.cpp users.

I need this for a university project in which I have to measure the power consumption of Llama 2 on a local machine.
Possible Implementation
I am currently trying to learn LangChain to create an inference script; a rough sketch of that direction is shown below.
If you have an idea as to how it can be implemented, please write a detailed description. Feel free to give links to external sources or share visuals that might be helpful to understand the details better.
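As a rough, non-authoritative sketch of the LangChain direction mentioned above (assuming the `langchain-community` and `llama-cpp-python` packages are installed, and that a GGUF conversion of the 7B-chat model exists at the placeholder path below):

```python
# Rough sketch (assumptions: langchain-community and llama-cpp-python are
# installed, and a GGUF conversion of the 7B-chat model exists at the path below).
from langchain_community.llms import LlamaCpp

llm = LlamaCpp(
    model_path="./models/llama-2-7b-chat.Q4_K_M.gguf",  # placeholder path
    n_ctx=2048,
    temperature=0.0,
    max_tokens=128,
)

# Single prompt; in practice this would loop over the wmt16 examples,
# similar to the server-based script above.
print(llm.invoke(
    "Translate the following German sentence to English.\nGerman: Guten Morgen!\nEnglish:"
))
```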