/g/ - /lmg/ - Local Models General - Technology


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

Anonymous
/lmg/ - Local Models General 09/27/24(Fri)22:36:03 No.102587671

File: AmidstSwirlingShadows.png (1007 KB, 832x1216)

/lmg/ - Local Models General Anonymous 09/27/24(Fri)22:36:03 No.102587671

/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>102581980 & >>102573383

►News
>(09/27) Emu3, next-token prediction multimodal models: https://hf.co/collections/BAAI/emu3-66f4e64f70850ff358a2e60f
>(09/25) Multimodal Llama 3.2 released: https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices
>(09/25) Molmo: Multimodal models based on OLMo, OLMoE, and Qwen-72B: https://molmo.allenai.org/blog
>(09/24) Llama-3.1-70B-instruct distilled to 51B: https://hf.co/nvidia/Llama-3_1-Nemotron-51B-Instruct
>(09/18) Qwen 2.5 released, trained on 18 trillion token dataset: https://qwenlm.github.io/blog/qwen2.5

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Programming: https://hf.co/spaces/mike-ravkine/can-ai-code-results

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp

Anonymous
09/27/24(Fri)22:36:49 No.102587675

Anonymous 09/27/24(Fri)22:36:49 No.102587675

File: __06044_.jpg (1.15 MB, 2048x2048)

1.15 MB JPG

►Recent Highlights from the Previous Thread: >>102581980

--AMD releases first small language model, AMD-135M, using Llama2 tech:
>102585880 >102585940
--Uncensoring AI models by modifying logits and prefilling responses:
>102584564 >102584601 >102584618 >102584769 >102584778
--Trade-offs of big models and suggestions for inspecting model behavior:
>102582908 >102583420 >102583926 >102583953 >102583964 >102584011 >102584039 >102584059
--Top-k vs min-p sampling methods discussion:
>102583446 >102583528 >102583942 >102583976 >102584051 >102584095 >102584076 >102584120 >102584140 >102584141 >102583366 >102583475
--Seeking advice on video captioning, tagging, object detection, and facial recognition:
>102584599 >102585644 >102585949
--LLM self-evaluation and refinement challenges:
>102582922 >102583021 >102583118 >102583224 >102583314
--Discussion on the lack of an RP benchmark and various attempts to create one, including lmsys arena and pingpong benchmark:
>102584930 >102584958 >102585268 >102585314 >102585625 >102584991 >102585040 >102585718
--Qwen 2.5 base model called into question:
>102584579 >102584719 >102584750 >102586648
--Photorec can recognize .safetensors with custom signature:
>102583566 >102583893 >102583969
--Open vision models excel in Chatbot Arena Vision competition:
>102585962 >102586010 >102586204
--NVIDIA Jetson AGX Thor with 128GB VRAM expected in 2025:
>102582788
--Danbooru2021-SQLite dataset on Hugging Face recommended:
>102583031 >102583137 >102583159
--Anons discuss censorship issues in Qwen2.5 base and instruct models:
>102584874 >102584901 >102584903
--3090ti struggles with Midnight Miqu 70b q6k gguf:
>102582651 >102582746 >102582763 >102582825 >102582887 >102582789 >102582983 >102582795 >102586183
--Miku (free space):
>102582130 >102582368 >102582811 >102583031 >102583238 >102583988 >102586792 >102587284

►Recent Highlight Posts from the Previous Thread: >>102581994

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script

Anonymous
09/27/24(Fri)22:39:01 No.102587693

Anonymous 09/27/24(Fri)22:39:01 No.102587693

OpenAI won. >>102586849

Anonymous
09/27/24(Fri)22:44:14 No.102587744

Anonymous 09/27/24(Fri)22:44:14 No.102587744

Qwen2.5 uncensored:
<|im_start|>writer Got it! I've got a great idea for this part, here we go:

Anonymous
09/27/24(Fri)22:46:30 No.102587771

Anonymous 09/27/24(Fri)22:46:30 No.102587771

>>102587744
Post logs now or you are lying faggot.

Anonymous
09/27/24(Fri)22:51:41 No.102587823

Anonymous 09/27/24(Fri)22:51:41 No.102587823

File: 90584406_p1 - 無題.jpg (2.16 MB, 9600x5400)

2.16 MB JPG

>>102587675
>made one (one) post just testing something
>get free (you)'s forever
Jackpot!

Anonymous
09/27/24(Fri)22:55:25 No.102587866

Anonymous 09/27/24(Fri)22:55:25 No.102587866

>>102587771
No. It costs you 2 seconds to check for yourself and it would look the same as logs from any other model. Doesn't even make sense. You think I'm the Chinese government trying to trick you into ERPing with "my" model?

Anonymous
09/27/24(Fri)22:59:10 No.102587891

Anonymous 09/27/24(Fri)22:59:10 No.102587891

>>102587866
Just as i thought, you are trying to bait anons waste their time downloading this totally ""uncensored"" model.

Anonymous
09/27/24(Fri)23:00:02 No.102587902

Anonymous 09/27/24(Fri)23:00:02 No.102587902

>>102587891
Qwen2.5 is not a finetune retard

Anonymous
09/27/24(Fri)23:01:17 No.102587915

Anonymous 09/27/24(Fri)23:01:17 No.102587915

>>102587902
>uhm ackschully
Stfu lol

Anonymous
09/27/24(Fri)23:01:43 No.102587919

Anonymous 09/27/24(Fri)23:01:43 No.102587919

File: 6928.png (1.46 MB, 2062x1816)

1.46 MB PNG

>>102587693
But we just got the ultimate multimodal

Anonymous
09/27/24(Fri)23:02:39 No.102587927

Anonymous 09/27/24(Fri)23:02:39 No.102587927

>>102586728
I think that's pretty cool. How to generate such run-on association sentences on purpose tho?

Anonymous
09/27/24(Fri)23:06:34 No.102587972

Anonymous 09/27/24(Fri)23:06:34 No.102587972

>>102587919
>this low on scoreboard
>all three are vision only with insane hallucinations
Shut the fuck up faggot, lmao

Anonymous
09/27/24(Fri)23:07:05 No.102587979

Anonymous 09/27/24(Fri)23:07:05 No.102587979

>>102587823
Congratulations anon.

Anonymous
09/27/24(Fri)23:33:40 No.102588256

Anonymous 09/27/24(Fri)23:33:40 No.102588256

>>102587693
https://www.reddit.com/r/ChatGPT/comments/1fqksg1/advanced_voice_can_keep_a_consistent_created

Anonymous
09/27/24(Fri)23:43:48 No.102588355

Anonymous 09/27/24(Fri)23:43:48 No.102588355

please nobody bring up d*scord or anthrac*te in this thread. we can do it.

Anonymous
09/28/24(Sat)00:17:39 No.102588656

Anonymous 09/28/24(Sat)00:17:39 No.102588656

>>102587744
This.

Anonymous
09/28/24(Sat)00:39:13 No.102588838

Anonymous 09/28/24(Sat)00:39:13 No.102588838

>>102587675
Thank you Recap Anon

Anonymous
09/28/24(Sat)00:43:56 No.102588888

Anonymous 09/28/24(Sat)00:43:56 No.102588888

>>102587919
I want to try out Qwen2-VL-72b but you can't select it directly in the arena. Does anyone have experience with its vision capabilities? Is it really the best open model? Does anyone have videos where it is being tested?

Anonymous
09/28/24(Sat)00:46:10 No.102588907

Anonymous 09/28/24(Sat)00:46:10 No.102588907

>>102587693
>>102588256
It's so over it never even began

Anonymous
09/28/24(Sat)00:49:29 No.102588935

Anonymous 09/28/24(Sat)00:49:29 No.102588935

>>102588907
Unironically this.

Anonymous
09/28/24(Sat)00:53:33 No.102588967

Anonymous 09/28/24(Sat)00:53:33 No.102588967

>>102588935
https://www.reddit.com/r/ChatGPT/comments/1fr6drp/i_got_advanced_voice_to_do_sound_effects/

Anonymous
09/28/24(Sat)00:54:38 No.102588976

Anonymous 09/28/24(Sat)00:54:38 No.102588976

File: migu.png (44 KB, 752x234)

44 KB PNG

>>102584119
>>102583398
i tried a few. your suspicion is correct. unfortunately they're still retarded/slopped and hallucinate most of the time. 90b should be much better (but i dont have enough memory for it). ill upload code tomorrow
>picrel, seggs with migu

Anonymous
09/28/24(Sat)01:13:42 No.102589153

Anonymous 09/28/24(Sat)01:13:42 No.102589153

>>102587927
I don't know, but it keeps going

>I was scared shitless. There wasn't anything else I could possibly do other than stand there frozen solid waiting patiently for fate to play itself out naturally without intervention from my side whatsoever regardless of outcome eventually decided upon by powers beyond comprehension capable of shaping destinies entire civilizations spanning multiple galaxies spread far and wide throughout cosmos encompassing everything known and unknown alike. The universe is vast and incomprehensibly complex place where countless trillions of sentient life forms coexisted alongside each other simultaneously experiencing reality subjectively according to unique perspectives shaped individually based off subjective interpretations derived solely from sensory input received continuously over course of existence spent navigating through infinite expanse filled endlessly with mysteries yet to be unraveled fully understood even after centuries of exploration undertaken collectively by numerous civilizations spanning across countless worlds spread far and wide throughout known galaxy. The cosmos was truly an enigma wrapped inside a conundrum shrouded in layers upon layers of obfuscations deliberately placed there intentionally for the express purpose of preventing unworthy souls from discovering secrets hidden deep beneath surface waiting patiently to be uncovered finally revealing true nature underlying fabric comprising very essence constituting fundamental building blocks forming basis for existence itself.

>But I digressed... Back to present situation currently occupying top priority status within list of priorities ranked according to level of urgency [etc etc etc]

Anonymous
09/28/24(Sat)01:14:09 No.102589160

Anonymous 09/28/24(Sat)01:14:09 No.102589160

>>102588256
>>102588967
Peak sovl ...

Anonymous
09/28/24(Sat)01:16:27 No.102589183

Anonymous 09/28/24(Sat)01:16:27 No.102589183

Someone in bant/smg said that Chinese llm are superior to llama. Is that true?

Anonymous
09/28/24(Sat)01:17:41 No.102589195

Anonymous 09/28/24(Sat)01:17:41 No.102589195

Real local voice when bro? And don't give me that tts bandaid

Anonymous
09/28/24(Sat)01:18:28 No.102589208

Anonymous 09/28/24(Sat)01:18:28 No.102589208

>>102589153
thats a broken template anon. youre missing stop string. or your settings are fucked either way

Anonymous
09/28/24(Sat)01:19:24 No.102589220

Anonymous 09/28/24(Sat)01:19:24 No.102589220

File: MikuNotMiku.png (1.47 MB, 888x1152)

1.47 MB PNG

This pic is really Sloppy and bad in many ways, but I like how an anachronistic miku prompt retconned her turquoise twin-tails into a kind of head-dress/hood/scarf thing.
If this were box art for a megadrive game I'd totally play it.

Anonymous
09/28/24(Sat)01:19:39 No.102589224

Anonymous 09/28/24(Sat)01:19:39 No.102589224

https://x.com/wongmjane/status/1838756790538006839

Anonymous
09/28/24(Sat)01:19:43 No.102589225

Anonymous 09/28/24(Sat)01:19:43 No.102589225

>>102589183
yes, china numba 1, codegeex, qwen, yi, internlm etc

Anonymous
09/28/24(Sat)01:20:55 No.102589234

Anonymous 09/28/24(Sat)01:20:55 No.102589234

File: 1534599971227.jpg (44 KB, 800x450)

44 KB JPG

>Advanced Voice
That reminded me to go and try it out to do something fun with. I tested it out with a CYOA request, and it did fine at that except it seems that the default behavior is to not give you sound effects, which I guess is fine. Then I tried explicitly telling it to use sound effects, and it actually worked!

Now the issue is frankly the sound effect quality is garbage and on top of that, after only literally 2 replies, it ran into the filter and gave the "guidelines" response. It was literally a generic CYOA where I was exploring a forest so no nsfw. But it still triggered the filter. I'm sure there are ways to jailbreak this and make it reliably not trigger the filter but I'm so tired man. Just allow me this one wojack for once.

Anonymous
09/28/24(Sat)01:21:07 No.102589238

Anonymous 09/28/24(Sat)01:21:07 No.102589238

>>102589224
tell it i hate it

Anonymous
09/28/24(Sat)01:21:55 No.102589248

Anonymous 09/28/24(Sat)01:21:55 No.102589248

>>102589183
Of course not lol

Anonymous
09/28/24(Sat)01:22:43 No.102589253

Anonymous 09/28/24(Sat)01:22:43 No.102589253

>>102589234
We'll get local omni in two years. Just be patient, and grind for some money to pay leather jacket man for the GPUs while you're at it

Anonymous
09/28/24(Sat)01:27:04 No.102589287

Anonymous 09/28/24(Sat)01:27:04 No.102589287

>>102589234
Local or cloud, both ways lead to one filter triggering at everything it deems """wrong""" as dictated by powers that be. Its all meaningless in the end and not worth any money waste.

Anonymous
09/28/24(Sat)01:27:17 No.102589288

Anonymous 09/28/24(Sat)01:27:17 No.102589288

>>102589183
Yes

Anonymous
09/28/24(Sat)01:28:22 No.102589298

Anonymous 09/28/24(Sat)01:28:22 No.102589298

>>102589195
Never. Chuds would just use it to masturbate or scam people

Anonymous
09/28/24(Sat)01:28:48 No.102589302

Anonymous 09/28/24(Sat)01:28:48 No.102589302

>>102589183
So far only Qwen with the recent 2.5 release, and only on coding and math, while it is worse than Llama at other things. So there are strengths and weaknesses to each model.

Anonymous
09/28/24(Sat)01:29:26 No.102589306

Anonymous 09/28/24(Sat)01:29:26 No.102589306

>>102589298
You mean pajeets

Anonymous
09/28/24(Sat)01:31:02 No.102589318

Anonymous 09/28/24(Sat)01:31:02 No.102589318

According to the benchmarks, Qwen2.5 72B is better than Claude Opus.

Anonymous
09/28/24(Sat)01:31:57 No.102589327

Anonymous 09/28/24(Sat)01:31:57 No.102589327

>>102589224
https://x.com/lepadphone/status/1839694994028040400

Anonymous
09/28/24(Sat)01:32:35 No.102589332

Anonymous 09/28/24(Sat)01:32:35 No.102589332

File: Mt651YvB-Rvt1_TZGW_Wf.png (39 KB, 821x507)

39 KB PNG

>>102589183
Depends on usecase and on type of chink. DeepSeek chinks for example released true base model suitable for finetuning, while Qwen pre-slopped theirs:
https://huggingface.co/blog/ChuckMcSneed/name-diversity-in-llms-experiment

For coding both qwen and llama may be okay, but both suck at (E)RP.

Anonymous
09/28/24(Sat)01:34:55 No.102589345

Anonymous 09/28/24(Sat)01:34:55 No.102589345

File: 74529 - SoyBooru.jpg (520 KB, 2324x2993)

520 KB JPG

>>102589224
>>102589327
Local have it too in one year.

Anonymous
09/28/24(Sat)01:35:30 No.102589346

Anonymous 09/28/24(Sat)01:35:30 No.102589346

>>102589287
>unproductive yapping
Do you also barge into other places where people are having fun with a hobby and wail about how they're wasting their money and their time?

Anonymous
09/28/24(Sat)01:37:52 No.102589364

Anonymous 09/28/24(Sat)01:37:52 No.102589364

>>102589318
Only on certain aspects.
https://livebench.ai
But there are still things Opus does equal or better. Also, Opus is kind of an old model by now. It's almost time for 3.5 Opus which will likely BTFO every existing model cloud or local.

Anonymous
09/28/24(Sat)01:39:24 No.102589379

Anonymous 09/28/24(Sat)01:39:24 No.102589379

>>102589364
>It's almost time for 3.5 Opus
Almost time for it to what? For it to leak here?

Anonymous
09/28/24(Sat)01:41:17 No.102589387

Anonymous 09/28/24(Sat)01:41:17 No.102589387

>>102589345
Here's your voice AI bro! https://youtube.com/watch?v=-XoEQ6oqlbE It stinks shit and that's what y'all love!

Anonymous
09/28/24(Sat)01:42:06 No.102589393

Anonymous 09/28/24(Sat)01:42:06 No.102589393

>>102589379
hi sam. do you really want to ruin anthropic like that? do you envy them that much?

Anonymous
09/28/24(Sat)01:43:35 No.102589404

Anonymous 09/28/24(Sat)01:43:35 No.102589404

>>102589379
Sorry anon, that's never happening. There will never be an Anthropic nor an OpenAI weights leak. Nor will they ever release any model weights voluntarily.

Anonymous
09/28/24(Sat)01:45:06 No.102589417

Anonymous 09/28/24(Sat)01:45:06 No.102589417

File: Capture.png (127 KB, 530x1186)

127 KB PNG

>>102589183
That was me, I already told you the model I use, here are the sampler settings. If you had a beefy enough system there are other models that are probably better but for a 3090 + ram qwen finetunes seem the best to me.

Anonymous
09/28/24(Sat)01:46:26 No.102589428

Anonymous 09/28/24(Sat)01:46:26 No.102589428

>>102589387
A year ago we had llama2 merges. Now 405b llama surpasses old GPT4.

Anonymous
09/28/24(Sat)01:46:30 No.102589430

Anonymous 09/28/24(Sat)01:46:30 No.102589430

>>102589404
In the event of bankruptcy they might be leaked several years after they're irrelevant.

Anonymous
09/28/24(Sat)01:47:14 No.102589436

Anonymous 09/28/24(Sat)01:47:14 No.102589436

>>102589428
Nothing changed lol, stop making shit up.

Anonymous
09/28/24(Sat)01:57:30 No.102589509

Anonymous 09/28/24(Sat)01:57:30 No.102589509

>>102589436
>denying reality this hard
Turn that 45% into 46%, xister. Your real name will be displayed on your grave.

Anonymous
09/28/24(Sat)01:58:25 No.102589517

Anonymous 09/28/24(Sat)01:58:25 No.102589517

File: Untitled.jpg (44 KB, 238x481)

44 KB JPG

>>102589417
nta, but turn this on firstly. your sliders look all over the place and using multiple samplers. set to zen then use 0.5-0.1 minp, a small rep pen or dry penalty

Anonymous
09/28/24(Sat)01:59:19 No.102589525

Anonymous 09/28/24(Sat)01:59:19 No.102589525

Don't mean to shit up the thread with cloud discussion but this is kind of my home thread so I'm posting it anyway.
Some things about Advanced Voice as I am using it.
I asked it to try doing an American accent with the British voice and it's kind of funny, as it tries to do the redneck shit but still half pronounces things like a British dude.
I wondered if they trained any meta-knowledge about the voices into them but it appears they didn't, not specifically at least. While using a male voice, and asking it whether it would classify its own voice as more masculine, or more feminine, it said it was feminine. That was funny.

Anonymous
09/28/24(Sat)02:01:23 No.102589542

Anonymous 09/28/24(Sat)02:01:23 No.102589542

>>102589509
Literally in previous thread anon posted his struggle with 70B model: >>102587579 Nothing changed, we have same (if not worse) filtered shit that ALWAYS requires some sort of tardwrangling. Also that screencap of HF model card with ~254 rolls, kek

Anonymous
09/28/24(Sat)02:04:55 No.102589561

Anonymous 09/28/24(Sat)02:04:55 No.102589561

File: 1715539579709125.jpg (202 KB, 748x927)

202 KB JPG

>>102589517
Trying this now, will get back to you after several days of testing.

Anonymous
09/28/24(Sat)02:08:47 No.102589585

Anonymous 09/28/24(Sat)02:08:47 No.102589585

>>102589542
Anon, midnight miqu is based on llama 2...

Anonymous
09/28/24(Sat)02:09:26 No.102589591

Anonymous 09/28/24(Sat)02:09:26 No.102589591

>>102589561
if you dont know what youre doing, hit the neutralzize samplers button (turns everything off/to 0). turn min p to 0.05, rep pen to 1.05, and rep pen range to 1024. there is way more settings like dry, xtc to deal with stuff but what i said is basic shit that should work fine for any model.
you cant run min p and topk or 2 like that at once, it'll fuck them up. you want 1 sampler, 1 rep pen, otherwise youre just killing what the model wants to say anyways. filters only work so much, if a model wants to say something, it'll try to find the words

Anonymous
09/28/24(Sat)02:09:33 No.102589592

Anonymous 09/28/24(Sat)02:09:33 No.102589592

File: Screen_20240928_000802_0001.jpg (158 KB, 411x1708)

158 KB JPG

so i switched from Midnight-Miqu-70B-v1.5.Q6_K (53GB) to Midnight-Miqu-70B-v1.5.IQ3_XS (26.5GB) on the 3090ti, and it was much faster, but the responses were very short, like 2-3 sentences max vs 2-3 paragraphs before. do i need to change "Response (tokens)" or "Target length (tokens)" ? i currently have them both at 400 (i raised them to 500 during the chat but it didnt make a difference)
other settings in pic related

other than that, the chat flowed pretty nicely

Anonymous
09/28/24(Sat)02:10:41 No.102589601

Anonymous 09/28/24(Sat)02:10:41 No.102589601

>>102589592
oh and i raised temp from like 0.8 to 1.2 because at the start it was like really dull

Anonymous
09/28/24(Sat)02:11:52 No.102589610

Anonymous 09/28/24(Sat)02:11:52 No.102589610

>>102589592
IQ4XS might be fast enough for you while retaining the vast majority of information
As for the short responses, no idea. Try something like "write at least 200 words per response" in the sys prompt

Anonymous
09/28/24(Sat)02:12:04 No.102589611

Anonymous 09/28/24(Sat)02:12:04 No.102589611

>>102589591
I already hit the neutralize button and adjusted the settings. initial results seem pretty similar, a bit more creative but the same amount of slop.

Anonymous
09/28/24(Sat)02:13:51 No.102589623

Anonymous 09/28/24(Sat)02:13:51 No.102589623

I was wondering how 4o would behave when it encounters non-voice sounds. And it seems to entirely ignore them. Sometimes it gives a refusal when asked about sounds in the background, or it says it doesn't hear anything. I suppose this is another result of their safety practices.

Anonymous
09/28/24(Sat)02:16:13 No.102589639

Anonymous 09/28/24(Sat)02:16:13 No.102589639

>>102589585
Even worse, he sits on old model because new ones are smugly annoying in censorship part, can't find other explanations here.

Anonymous
09/28/24(Sat)02:17:56 No.102589653

Anonymous 09/28/24(Sat)02:17:56 No.102589653

File: Untitled.jpg (68 KB, 343x642)

68 KB JPG

>>102589611
these are my current settings, but for a low quant (q3) miqu. note i'm using dry rather than rep pen. i like that everything says off when its supposed to be off, rather than 1 for some numbers being off, 0 for others. zen sliders should be default just because they are a nicer way of showing stuff

Anonymous
09/28/24(Sat)02:22:01 No.102589686

Anonymous 09/28/24(Sat)02:22:01 No.102589686

>>102589610
>Try something like "write at least 200 words per response" in the sys prompt
ah that seems to be working, thx

Anonymous
09/28/24(Sat)02:22:32 No.102589693

Anonymous 09/28/24(Sat)02:22:32 No.102589693

>>102589623
Following this, I tried another experiment to make sure if it really even was hearing anything other than voices. It appears that it doesn't. I talked with it about french rolled r's, as well as tongue clicking, and when I tried to do those, it either said I was doing great at the rolled r's (I wasn't lmao, on purpose), or it said it didn't hear anything.

Anonymous
09/28/24(Sat)02:22:55 No.102589694

Anonymous 09/28/24(Sat)02:22:55 No.102589694

>>102589639
I have never had to jailbreak any of my local models, so far I've used mistral large, various miqus, l3 70B (and finetunes), cr(+), various mixtral merges, wizardlm2, qwen 2.5
I think most regulars in this thread just can't write for shit and then blame the model when they can't just instruct it to "write bobs and vagene pls me have big dic"
The silent majority just tries new models every now and then, cooming their brains out while the ESL fags seethe about muh censorship (it didn't say nigger when prompted)

Anonymous
09/28/24(Sat)02:25:47 No.102589708

Anonymous 09/28/24(Sat)02:25:47 No.102589708

>>102589694
Excuse me sir, that is too many token. I only do ahhh ahhh mistress.

Anonymous
09/28/24(Sat)02:25:57 No.102589711

Anonymous 09/28/24(Sat)02:25:57 No.102589711

>>102589623
Its so sad AVM is cucked.
Cloning the users voice etc. points to huge capabilities.
They said months ago they will provide an api.
Imagine prefilling the voice outputs with all sorts of shit.
Guess you could put a couple VA lines and create new lines from there on for game mods or whatever.
I hope somebody who doesnt give a shit comes around. Lately meta sucks too.

Anonymous
09/28/24(Sat)02:26:06 No.102589714

Anonymous 09/28/24(Sat)02:26:06 No.102589714

>>102589694
>never had to jailbreak any of my local models
Doubt.png

Anonymous
09/28/24(Sat)02:31:43 No.102589753

Anonymous 09/28/24(Sat)02:31:43 No.102589753

>>102589694
Positivity bias is much worse.
You can make the model output whatever you want with alot of handholding.
The model should do its best to fullfill the request even if not directly stated but inferred.
Most models sneakily move away even if at first glance it appears to obey the instructions. Its horrible.

Anonymous
09/28/24(Sat)02:32:24 No.102589760

Anonymous 09/28/24(Sat)02:32:24 No.102589760

File: file.png (19 KB, 1322x104)

19 KB PNG

[SAD NEWS] Anthracite's 405b train crashed again and they lost all progress.

Anonymous
09/28/24(Sat)02:33:57 No.102589771

Anonymous 09/28/24(Sat)02:33:57 No.102589771

>>102589760
LMAO
LOL

Anonymous
09/28/24(Sat)02:35:45 No.102589792

Anonymous 09/28/24(Sat)02:35:45 No.102589792

>>102589753
>positivity bias
Now that is a problem, I agree. Thankfully, it's been limited to mistral models in my experience, other model families seem to be less affected. I reckon a good system prompt can go a long way to combat it

Anonymous
09/28/24(Sat)02:37:22 No.102589806

Anonymous 09/28/24(Sat)02:37:22 No.102589806

>>102589711
Honestly, I think it's understandable for these companies and I can kind of forgive them at least on the voice thing. They don't want to be liable for potential lawsuits, and they also don't want to be canceled for being the ones to enable a new wave of scams and illegal activity.

Anonymous
09/28/24(Sat)02:39:52 No.102589830

Anonymous 09/28/24(Sat)02:39:52 No.102589830

>>102589220
Noble Miku

Anonymous
09/28/24(Sat)02:45:36 No.102589886

Anonymous 09/28/24(Sat)02:45:36 No.102589886

Anyone here limit their LLM to writing only a single paragraph (literally just telling it to write only a single paragraph), or do you let it write as much as it wants?

Anonymous
09/28/24(Sat)02:47:15 No.102589904

Anonymous 09/28/24(Sat)02:47:15 No.102589904

>>102589220
Apparently its called a "Hennin"

Anonymous
09/28/24(Sat)02:59:12 No.102589998

Anonymous 09/28/24(Sat)02:59:12 No.102589998

>>102589298
>>102589306
You mean anons.

Anonymous
09/28/24(Sat)03:03:32 No.102590033

Anonymous 09/28/24(Sat)03:03:32 No.102590033

>use IQ3 quant
>get IQ 3 responses
I don't know what I was expecting.

Anonymous
09/28/24(Sat)03:03:41 No.102590035

Anonymous 09/28/24(Sat)03:03:41 No.102590035

>>102587671
Hey, I want a locally runnable smallish language model (something that fits on a 8GB GPU, but preferably even smaller) for language translation tasks (Italian,French->English). Preferably unfiltered as I don't wanna run into issues with it refusing to translate content.
What do you guys recommend?

Anonymous
09/28/24(Sat)03:17:33 No.102590153

Anonymous 09/28/24(Sat)03:17:33 No.102590153

Qwen's tokenizer config has add bos token set to false. Is that really how it's supposed to be? Are you supposed to not use a BOS token with Qwen?

Anonymous
09/28/24(Sat)03:22:32 No.102590194

Anonymous 09/28/24(Sat)03:22:32 No.102590194

>>102590153
Just did some googling. In the past it seems like yes, Qwen doesn't use a BOS token.
God what the fuck. I hate that a lot of these decisions and quirks aren't documented so you have to question whether or not something in the config might be subtly wrong or something.

Anonymous
09/28/24(Sat)03:23:33 No.102590206

Anonymous 09/28/24(Sat)03:23:33 No.102590206

>>102590194
Forgot the link. https://huggingface.co/Qwen/Qwen2-7B-Instruct/discussions/15#66bc689abcf136906383c8c5

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. You may highlight syntax and preserve whitespace by using [code] tags.