UseExtraBuffers

UseExtraBuffers is an advanced llama.cpp flag that enables extra buffer types used by the weight-repacking path. Rarely tuned in practice; leave at default unless specifically instructed.

Quick reference


Type	`bool?`
Default	`null` (use native default)
Category	Advanced
Field on	`ModelInferenceParameters.UseExtraBuffers`

What it does

Internal to llama.cpp. Controls whether the engine uses additional buffer types during weight repacking for specific hardware paths. The exact behavior depends on the backend and release tag.

null — native default. Correct for almost all users.
true / false — override. Not useful without specific backend expertise.

When to change it

Scenario	Value
Default	`null`
Backend-specific advice from SDK docs	As instructed

Do not speculate. If you are not sure whether you need this flag, you do not need it.

Example

var preset = new Qwen25Preset();
// preset.BaseModelInferenceParameters.UseExtraBuffers = null; // default

Interactions

Backend-specific. Effects vary by acceleration variant.

What’s next

Model inference hub — all inference knobs.

CheckTensors TensorSplit