>>106967317
Can you have a look at my settings? I'm still getting used to ik_llama.cpp, so I may be missing/misconfiguring something.
# Change to the directory this script is in
Set-Location -Path $PSScriptRoot
# === Full path to your GLM-4.6 model ===
$MODEL = "G:\LLM\Models\Qwen3-235B-A22B-Instruct-2507-UD-Q4_K_XL\Qwen3-235B-A22B-Instruct-2507-UD-Q4_K_XL-00001-of-00003.gguf"
# === Launch llama-server with recommended GLM-4.6 settings ===
& .\llama-server.exe `
--model "$MODEL" `
--alias "Qwen3-235B-A22B" `
--ctx-size 16384 `
-fa -fmoe `
-ub 4096 -b 4096 `
-ngl 999 `
-ot exps=CPU `
--n-cpu-moe 999 `
--parallel 1 `
--threads 20 `
--host 127.0.0.1 `
--port 5001 `
--no-mmap `
--verbosity 2 `
--color
Pause