>>108408191
Ah yes I'm going to fork llamacpp to make a single hacky line change.
>>108408195
>I just keep it turned off and manually inject <think> tags.
the qwen3.5 jinja template does this bullshit
{%- if enable_thinking is defined and enable_thinking is false -%}
{{- "<think>\n\n</think>\n\n" -}}
{%- else -%}
{{- "<think>\n" -}}