Description
Which version of Nextcloud are you using?
32.0.0
Which version of PHP context_chat are you using?
4.5.0
Which version of backend context_chat are you using?
4.5.0
Which browser are you using? In case you are using the phone App, specify the Android or iOS version and device please.
Chrome 128
Nextcloud deployment method?
docker compose
Describe the Bug
```
decode: cannot decode batches with this context (calling encode() instead)
init: embeddings required but some input tokens were not marked as outputs -> overriding
```
The context length of the embedding model is 512 tokens, which is exceeded by both our config and the chunk size of the texts we pass in; that is what produces the warnings above. It would be tricky to fix without re-indexing, so our first effort should be to keep the impact of the change minimal in terms of doc search quality.
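For illustration, a minimal sketch of how such an overflow could be detected, assuming llama-cpp-python (consistent with the llama.cpp-style log lines above); the model path and chunk text are placeholders, not values from this deployment:

```python
# Sketch: check whether an indexed chunk exceeds the embedding model's
# native context window. Not the app's actual code.
from llama_cpp import Llama

model = Llama(
    model_path="embedding-model.gguf",  # placeholder path
    embedding=True,
    n_ctx=512,  # the model's native context, per the diagnosis above
)

chunk = "some text produced by the indexer " * 200  # oversized on purpose
tokens = model.tokenize(chunk.encode("utf-8"))
if len(tokens) > model.n_ctx():
    # per the diagnosis above, chunks like this are what trigger the
    # warnings seen in em_server.log
    print(f"chunk is {len(tokens)} tokens, exceeds n_ctx={model.n_ctx()}")
```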
One solution would be to simply reduce the chunk size and the context size config to match the native context size of the model. Doc search quality may not change much for previously indexed docs, since the embedding of the query we use to search them would usually be short enough. For newly indexed docs it remains to be seen: 512 is a token count, so chunks would end up around that size, and even smaller for non-English languages where tokenization yields more tokens per character. A sketch of what this could look like follows.
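A minimal sketch of this option, assuming a LangChain-style splitter; the splitter class is illustrative rather than the app's actual config, and the chars-per-token ratio is a rough rule of thumb:

```python
# Sketch of option 1: cap chunk size so chunks fit the model's native
# 512-token context. chunk_size here is in characters; ~3-4 chars/token
# is a rough English average, so this leaves headroom under 512 tokens.
from langchain_text_splitters import RecursiveCharacterTextSplitter

EMBED_CTX_TOKENS = 512

splitter = RecursiveCharacterTextSplitter(
    chunk_size=EMBED_CTX_TOKENS * 3,  # ~1536 chars, conservative
    chunk_overlap=100,
)

document_text = open("some_document.txt").read()  # placeholder input
chunks = splitter.split_text(document_text)
```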
Another solution would be to use RoPE scaling and try to increase the effective context length of the model through the config alone. That would allow larger context lengths and better doc search than the option above. It is, however, yet to be seen how far we can scale it while keeping the results good. See the discussion in the linked llama.cpp issue and the sketch below.
ggml-org/llama.cpp#1965
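A minimal sketch of this option, again assuming llama-cpp-python: rope_freq_scale and n_ctx are real Llama() parameters, but the specific values are untested guesses:

```python
# Sketch of option 2: linear RoPE scaling to double the effective
# context from 512 to 1024 tokens via config only. rope_freq_scale=0.5
# compresses positions by 2x, letting the model attend over a 2x
# longer context than it was trained with.
from llama_cpp import Llama

model = Llama(
    model_path="embedding-model.gguf",  # placeholder path
    embedding=True,
    n_ctx=1024,           # the scaled context we want to run at
    rope_freq_scale=0.5,  # 512 / 1024; linear scaling factor
)
```

Whether embeddings computed at the scaled length stay close enough in quality to native 512-token embeddings is exactly what would need evaluating before shipping this.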
To Reproduce
Embed some docs into the vector db and inspect the output of "<persistent_storage>/logs/em_server.log".
PHP logs (Warning these might contain sensitive information)
No response
Ex-App logs (Warning these might contain sensitive information)
No response
Server logs (if applicable)
No response