>>108814077
>>108814086
ik_llama token count:
curl http://localhost:8080/slots/list |jq -r .[0].token_count
ik_llama.cpp tokenized prompt:
curl http://localhost:8080/slots/list |jq -r .[0].prompt
llama.cpp token count (choose the correct slot):
curl http://localhost:8080/slots |jq .[1].next_token
You can also see the chat template, samplers etc with
curl http://localhost:8080/props