>>102447333
here
>>102447389
>>102447449
you're right, I forgot Speccy was stupid as shit -- GPU-Z correctly reports 8192MB VRAM
I did get "Mistral-Nemo-Instruct-2407-IQ4_XS.gguf" loaded (the other ones I found didn't load right) with the Mistral presets in SillyTavern
I had been going for Q4 quants when possible, and leaving out the "--quantkv 2 --flashattention" args. Right now my CLI looks like:
call bin-kobold\koboldcpp_cu12.exe --model "N:\IGGER\F\A\I\Mistral-Nemo-Instruct-2407-IQ4_XS.gguf" --contextsize 12288 --threads 7 --blasthreads 14 --usecublas normal 0 1 --gpulayers -1 --blasbatchsize 512 --highpriority --foreground --skiplauncher --nommap --usemlock --onready "SillyTavern.bat" %*
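
For reference, a rough sketch of how much VRAM the KV cache alone would take at --contextsize 12288, with and without --quantkv. The model shape (40 layers, 8 KV heads, head dim 128) is my assumption for Mistral Nemo, and so is the mapping of --quantkv 0/1/2 to f16/q8_0/q4_0 with the usual ggml block sizes. It ignores the weights and compute buffers, so treat it as a ballpark and not what koboldcpp will actually allocate.

```python
# Back-of-the-envelope KV cache size estimate.
# ASSUMPTIONS (not from the post): Mistral Nemo has 40 layers, 8 KV heads,
# head dim 128; --quantkv 0/1/2 map to f16/q8_0/q4_0 cache types.

BLOCK = 32  # ggml quantizes values in blocks of 32 elements

# Bytes needed to store one block of 32 elements in each cache type.
BYTES_PER_BLOCK = {
    "f16": 64,   # 32 values * 2 bytes
    "q8_0": 34,  # 32 int8 values + one fp16 scale
    "q4_0": 18,  # 32 4-bit values + one fp16 scale
}


def kv_cache_mib(context, cache_type, n_layers=40, n_kv_heads=8, head_dim=128):
    """Estimate KV cache size in MiB for a given context length."""
    # K and V each hold n_kv_heads * head_dim values per layer per token.
    elems = 2 * n_layers * n_kv_heads * head_dim * context
    return elems // BLOCK * BYTES_PER_BLOCK[cache_type] / (1024 * 1024)


if __name__ == "__main__":
    for level, cache_type in enumerate(BYTES_PER_BLOCK):
        size = kv_cache_mib(12288, cache_type)
        print(f"--quantkv {level} ({cache_type}): {size:.0f} MiB")
```

If those assumptions hold, that works out to about 1920 MiB for the f16 cache, 1020 MiB at q8 and 540 MiB at q4, which is a decent chunk of an 8192MB card either way.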