>>103539211
>>103539220
Kek, same happened to me as well. I eventually solved it. Don't remember exactly how. Oh wait nvm I do remember. I eventually found that I needed to delete the nvcc in my usr/bin folder, and run with these commands.
export PATH=/usr/local/cuda/bin:$PATH
cmake -B build -DGGML_CUDA=ON -DGGML_LLAMAFILE=OFF
cmake --build build --config Release --target llama-server llama-quantize llama-perplexity -j 8
I actually tried to get ShatGPT to help me at first but it didn't find the issue. I found the issue myself. Maybe Claude would've gotten it.