>>108562387
I finetuned E4B but when I set reasoning to off it's still including thoughts. Default model does that too but when loaded in llama-server it doesn't add "thought" at the beginning
tuned reasoning off:
[64164] Parsing PEG input with format peg-gemma4: <|turn>model
[64164] <|channel>thought
[64164] <channel|>thought
[64164] Thinking Process:
[64164]
[64164] 1. **Identify the core request:** The user said "hi" and asked me to say it back.
[64164] 2. **Determine the direct action:** The action is to repeat the greeting.
[64164] 3. **Apply conversational rules:** The response must be friendly and direct.
[64164] 4. **Execute:** Say "hi" back!<channel|>
[64164] *Hi*! How can I help you today?
default model reasoning off:
[64309] Parsing PEG input with format peg-gemma4: <|turn>model
[64309] <|channel>thought
[64309] <channel|>**Thinking Process:**
[64309]
[64309] 1. **Analyze the input:** The user simply says "hi."
[64309] 2. **Goal:** To mirror or respond appropriately to the greeting.
[64309] 3. **Tone/Register:** Friendly, casual (like speaking to a real human).
[64309] 4. **Constraint Check:** Use common conversational greetings, match tone. No complex constraints (e.g., use alliteration, end with a question).
[64309]
[64309] 5. **Generate Options:**
[64309] * "Hey there!"
[64309] * "Hi!"
[64309] * "Oh hey, good to see ya."
[64309] * "Hello!"
[64309] 6. **Select Best Option:** Keeping it simple and matching the casual tone is best.
[64309] * *Selection:* "Hi there!"<channel|>Hi there! How can I help you out today?
Trying to figure out where the issue is