Feature Request: SINQ and more SINQ based functionality

### Prerequisites

- [x] I am running the latest code. Mention the version if possible as well.
- [x] I carefully followed the [README.md](https://github.com/ggml-org/llama.cpp/blob/master/README.md).
- [x] I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
- [x] I reviewed the [Discussions](https://github.com/ggml-org/llama.cpp/discussions), and have a new and useful enhancement to share.

### Feature Description

huawei SINQ quant method might be low hanging fruit quant to support in llama.cpp:

https://github.com/huawei-csl/SINQ/blob/main/README.md#3-quantize-any-llm-with-sinq

### Motivation

`⚡️ A fast, plug-and-play, model-agnostic quantization technique delivering state-of-the-art performance for Large Language Models without sacrificing accuracy.

💡 Want to run a large model on your GPU but don’t have enough memory? With SINQ, you can deploy models that would otherwise be too big drastically reducing memory usage while preserving LLM quality.

⏱️ SINQ quantizes Qwen3-14B in just ~21 sec and DeepSeekV2.5-236B in ~5 min`

### Possible Implementation

https://github.com/huawei-csl/SINQ/blob/main/README.md#3-quantize-any-llm-with-sinq apache 2.0 copy a lot of stuff from here

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature Request: SINQ and more SINQ based functionality #16478

Prerequisites

Feature Description

Motivation

Possible Implementation

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Feature Request: SINQ and more SINQ based functionality #16478

Description

Prerequisites

Feature Description

Motivation

Possible Implementation

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions