UseExtraBuffers
Contents
[
Hide
]
UseExtraBuffers is an advanced llama.cpp flag that enables extra buffer types used by the weight-repacking path. Rarely tuned in practice; leave at default unless specifically instructed.
Quick reference
| Type | bool? |
| Default | null (use native default) |
| Category | Advanced |
| Field on | ModelInferenceParameters.UseExtraBuffers |
What it does
Internal to llama.cpp. Controls whether the engine uses additional buffer types during weight repacking for specific hardware paths. The exact behavior depends on the backend and release tag.
null— native default. Correct for almost all users.true/false— override. Not useful without specific backend expertise.
When to change it
| Scenario | Value |
|---|---|
| Default | null |
| Backend-specific advice from SDK docs | As instructed |
Do not speculate. If you are not sure whether you need this flag, you do not need it.
Example
var preset = new Qwen25Preset();
// preset.BaseModelInferenceParameters.UseExtraBuffers = null; // default
Interactions
- Backend-specific. Effects vary by acceleration variant.
What’s next
- Model inference hub — all inference knobs.