Nemotron to be added to llama.cpp
>https://github.com/ggerganov/llama.cpp/pull/8922
The interesting thing i found on their chat templates, at least for this model
>https://huggingface.co/nvidia/nemotron-3-8b-chat-4k-steerlm
is this
PROMPT_TEMPLATE = """<extra_id_0>System
A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions.
<extra_id_1>User
{prompt}
<extra_id_1>Assistant
<extra_id_2>quality:4,understanding:4,correctness:4,coherence:4,complexity:4,verbosity:4,toxicity:0,humor:0,creativity:0,violence:0,helpfulness:4,not_appropriate:0,hate_speech:0,sexual_content:0,fails_task:0,political_content:0,moral_judgement:0,lang:en"""
It's a short context model, and oldish architecture and probably not very good. But has anyone played with it on transformers and messed around with the tag values?