UseExtraBuffers

UseExtraBuffers is an advanced llama.cpp flag that enables the extra buffer types used by the weight-repacking path. It is rarely tuned in practice; leave it at the default unless specifically instructed.

Quick reference

Type: bool?
Default: null (use native default)
Category: Advanced
Field on: ModelInferenceParameters.UseExtraBuffers

What it does

This setting is internal to llama.cpp. It controls whether the engine uses additional buffer types during weight repacking for specific hardware paths. The exact behavior depends on the backend and release tag.

  • null — native default. Correct for almost all users.
  • true / false — override. Not useful without specific backend expertise.
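The three states above can be pictured with a small, self-contained sketch. Everything in it is illustrative: InferenceParametersSketch, Resolve, and the nativeDefault placeholder are hypothetical names, not part of the SDK, and the real resolution happens inside the SDK and llama.cpp. It only shows the usual way a nullable flag falls back to an engine default.

```csharp
using System;

// Hypothetical stand-in for the SDK's parameter object; only the
// nullable flag is modeled here.
public sealed class InferenceParametersSketch
{
    public bool? UseExtraBuffers { get; set; }
}

public static class ExtraBuffersDemo
{
    // Assumed helper: null defers to the engine's default, while an
    // explicit true/false overrides it.
    public static bool Resolve(bool? requested, bool nativeDefault) =>
        requested ?? nativeDefault;

    public static void Main()
    {
        var parameters = new InferenceParametersSketch();

        // Placeholder only; the actual default depends on the llama.cpp
        // build and backend and is not stated in this document.
        bool nativeDefault = true;

        // Unset (null): the result follows nativeDefault.
        Console.WriteLine(Resolve(parameters.UseExtraBuffers, nativeDefault));

        // Explicit override: the result follows the assigned value.
        parameters.UseExtraBuffers = false;
        Console.WriteLine(Resolve(parameters.UseExtraBuffers, nativeDefault));
    }
}
```

Keeping the field nullable lets "not set" stay distinct from an explicit false, which is why null rather than false is the recommended value for almost all users.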

When to change it

Scenario                                 Value
Default                                  null
Backend-specific advice from SDK docs    As instructed

Do not speculate. If you are not sure whether you need this flag, you do not need it.

Example

var preset = new Qwen25Preset();
// Default: leave UseExtraBuffers unset (null) so the native llama.cpp default applies.
// preset.BaseModelInferenceParameters.UseExtraBuffers = null;

Interactions

  • Backend-specific. Effects vary by acceleration variant.

What’s next