>>109185577
~/BH/llama.cpp/build/bin/llama-server \
--model ~/CB/models/gemma-4-26B-A4B-heretic-APEX-I-Compact.gguf \
-ngl 22 \
-c 122880 \
-np 1 \
-fa on \
-ctk q4_0 \
-ctv q4_0 \
--no-kv-offload \
-b 512 \
-ub 128 \
--host 0.0.0.0 \
--port 8080 \
Do you remember your speeds? ~14t/s feels terribly slow for this purpose.