/g/ - Technology




File: ComfyUI_temp_phjap_00079_.png (3.7 MB, 1584x1232)
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>101692289 & >>101682019

►News
>(07/31) Google releases Gemma 2 2B, ShieldGemma, and Gemma Scope: https://developers.googleblog.com/en/smaller-safer-more-transparent-advancing-responsible-ai-with-gemma
>(07/27) Llama 3.1 rope scaling merged: https://github.com/ggerganov/llama.cpp/pull/8676
>(07/26) Cyberagent releases Japanese fine-tune model: https://hf.co/cyberagent/Llama-3.1-70B-Japanese-Instruct-2407
>(07/25) BAAI & TeleAI release 1T parameter model: https://hf.co/CofeAI/Tele-FLM-1T
>(07/24) Mistral Large 2 123B released: https://hf.co/mistralai/Mistral-Large-Instruct-2407

►News Archive: https://rentry.org/lmg-news-archive
►FAQ: https://wikia.schneedc.com
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Programming: https://hf.co/spaces/bigcode/bigcode-models-leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
>>
File: 1704819906422903.jpg (213 KB, 1024x1024)
►Recent Highlights from the Previous Thread: >>101692289

--Quantization and VRAM trade-offs: >>101693122 >>101693296
--Model recommendations for 24GB VRAM and 64GB RAM: >>101694246 >>101694278 >>101702226 >>101694481 >>101695200 >>101694321 >>101694623 >>101694485
--LLMs and the nature of intelligence and analogy: >>101700993 >>101701103 >>101701150 >>101701265 >>101701328 >>101701257 >>101701268 >>101701351 >>101702876 >>101702219
--Building a multi GPU rig for large AI models: >>101702658 >>101702728 >>101702780 >>101702842 >>101702916 >>101703009
--Anon asks where to download Flux, and other anons provide links and discuss the differences between FLUX.1-dev and FLUX.1-schnell, including model sizes, distilled models, and quantization: >>101693327 >>101693383 >>101693404 >>101693569 >>101694101 >>101694454 >>101694482 >>101694819 >>101694886 >>101694572 >>101694650 >>101693892 >>101694061 >>101694080
--Running large models with ollama from a network location: >>101692522 >>101692551 >>101692597 >>101692568 >>101692622 >>101693060
--Whisper speech-to-text model limitations and alternatives: >>101702737 >>101702782 >>101702820 >>101702844 >>101702870
--NYT article on unsettling experience with Bing's chatbot: >>101701796
--Llama 3.1 405B base model available on OpenRouter: >>101694114
--Flux can generate panties with the right prompts: >>101695661 >>101698623 >>101698824
--Character personality in ST chats depends on description and first message: >>101699285 >>101699345
--Anons discuss AI model quality, curation, and marketing: >>101700300 >>101700632 >>101700699 >>101700755 >>101700801 >>101701128 >>101702152 >>101701156
--Trusting refurbished GPUs off of Ebay: >>101694138 >>101694178 >>101694226 >>101694249 >>101694311 >>101694313
--Miku (free space): >>101694118 >>101694279 >>101694636 >>101694719 >>101694945 >>101695303 >>101695513 >>101695828 >>101696100 >>101699120 >>101696010 >>101701690

►Recent Highlight Posts from the Previous Thread: >>101692307
>>
>>101705239
I don't believe that this image was generated with a local model
>>
>>101705239
miku bake
>>
>>101705288
>of the
>>
>>101705288
that's what happens when you don't let ethicucks filter your model to shit
>>
Is the 405B base model worth using?
>>
https://civitai.com/models/618792/nepotism-fux?modelVersionId=691750

owo
>>
>>101705345
Yes.
>>
>>101705347
>merge
Is this supposed to be good?
>>
>>101705418
no, merges are memes
>>
>>101705347
>merges a new picture gen model with some old stable diffusion
That's like trying to merge qwen and llama. It does not work like that.
>>
File: ComfyUI_00052_.png (379 KB, 512x512)
>>101705288
>>
>>101705482
have you tried it
>>
File: ComfyUI_00665_.png (1.36 MB, 1280x720)
200 gens were made to get this single coherent and prompt-following one.
After using Flux more, I think I'll just stop trying to wrangle it and gen simpler things. I'm spending too much time crafting prompts and regenning to get the results I want for more complicated and niche concepts. Image models just aren't there yet. Dalle 3 too, it's better at some things, but still, time consuming.
Back to 1girl I guess.
>>
>>101705497
>hands
>>
>>101705345
No. Don't believe what /naids/ tells you, base models are useless.
>>
What models are SOTA for RP? Been using the 70b euryale, is there something better? It's good, but feels like something is missing.
>>
>>101705497
the water looks like someone spread a light blue tarp on hard ground
>>
>>101705548
shieldgemma-9b
thank me later.
>>
>>101705548
no, not much has changed in the past year
>>
>>101705548
Mistral Large currently. Euryale was too retarded, though. I think with your IQ you will be happy with something like Nemo/Mini-Magnum/etc.
>>
>>101705555
I mean, the guy on the left is even standing on top of it.
>>
>>101705558
>Sao general.
>>
>>101705527
Base models are good if you want something unbiased and free of slop, since literally all they do is autocomplete whatever you give them. For that reason, they're great for long form storytelling, but hard to get started with.
The best approach is usually to use instruct to generate something and have the base model continue it.
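A minimal sketch of that workflow with llama-cpp-python (the model paths, prompt template and sampler settings are placeholders, adjust for whatever instruct/base pair you actually run):

# pip install llama-cpp-python
from llama_cpp import Llama

instruct = Llama(model_path="models/instruct.Q6_K.gguf", n_ctx=8192)  # hypothetical paths
base = Llama(model_path="models/base.Q6_K.gguf", n_ctx=8192)

# 1) the instruct model writes the opening, using its chat template
opening = instruct(
    "[INST] Write the first two paragraphs of a slow-burn fantasy story. [/INST]",
    max_tokens=400,
)["choices"][0]["text"]

# 2) the base model continues the raw text with no template, pure autocomplete
continuation = base(opening, max_tokens=400, temperature=0.9)["choices"][0]["text"]
print(opening + continuation)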
>>
>Flux uses 60+gb of RAM
I bought 128gb of ram back in the llama1 days and I have never regretted it. How do ramlets even cope?
>>
>>101705600
All of this advice can be safely ignored by the fact that "autocomplete" chads have nothing to show.
>>
is there anything better than midnight miqu for rp?
>>
>>101705620
I cry myself to sleep
>>
>>101705620
How long does it take to gen on RAM? The biggest pain is no multigpu.
>>
>>101705659
280 dollarcoins isn't expensive bro, just buy it
>>
>>101705696
Then you won't mind sending me some
>>
>>101705641
>Wasn't here for GPT-3 or Llama 1.
Ask me how I know you're a newfag.
>>
>>101705681
75 seconds per image LETS FUCKING GOOO! (fuck I feel mogged by VRAMCHADS)

>>101705701
Come to western europe, they give free money to neets here
>>
>>101705696
I'm poor and need to buy a 3060, another 32gb of ram (or ditch my current 2x16 for 2x32), maybe a new psu, a monitor or two (my current one is 1366x768), a desk and chair, and an ergonomic keyboard and mouse since my wrists are fucked.
It will take me a while.
>>
>>101705772
What? 75 seconds? Are you talking about schnell?
>>
File: ComfyUI_00656_.png (1.22 MB, 1280x720)
>>101705511
Flux actually does hands relatively ok when it's a pose that appears often in datasets. POV hand holding is much more difficult.

>>101705589
>>101705555
kek
>>
>>101705817
This pic looks deeply disturbing... for many reasons
>>
mini magnum writes nicely and without repetitions, but in my experiments it's much worse at retrieving information from >8k contexts than nemo instruct and dory
is anyone ever going to fix nemo properly?
>>
File: ComfyUI_00625_.png (1.27 MB, 1280x720)
>>101705842
>>
>>101705497
Literal skill issue
>>
>>101705497
Now make this same pic but with her pregnant, THAT would be peak.
>>
I did a lot of day 1 flux testing and posted the results here. Flux can granularize way more different concepts into a single output than D3 can. And even when you overload it with too much shit it doesn't go absolutely schizo like D3 does
>>
>>101705772
kys
>>
When do you think AI will make music that is better than real music? Not just interesting to listen to, but better than average real music
>>
https://github.com/Alpha-VLLM/Lumina-mGPT
>A family of multimodal autoregressive models capable of various vision and language tasks, particularly excelling in generating flexible photorealistic images from text descriptions.
based off meta's chameleon
>>
>>101705239
>>101705288
>>101705320
>>101705490
Now imagine that as an animation, it's happening, it's coming
>>
>>101705873
Then reproduce that gen with all the general details in it, and with passable coherency (that one isn't perfect, but it's ok, from a distance).
>>
>>101705931
Literally all it needs is a little bit more vocal consistency. suno v4 will probably be the crossing point for music. 3.5 is already pretty good but v4 will also come with more advanced workflow (I'm assuming similar to what udio offers)
And at that point anyone who says the AI shit isn't better is smoking too much copium.
>>
File: 1709323576878664.webm (1.44 MB, 1920x1074)
>>101705943
we know baaaaka
>>
>>101705936
>[2024-07-08]
>initial release 30 minutes ago
hmm?
>>
File: livebench-2024-08-02.png (851 KB, 3186x1840)
>gemma 2 27b still mogs everything under 70b
>nemo is the 3rd worst model in the chart
>>
magnum 72b mogs
>>
>>101705794
No, dev. 20 steps, euler. Also have 12gb vram, not sure if it matters that much since it takes almost 60gb of ram.

>>101705777
Used pcs with 3060s are being sold for $500 on ebay, may be worth the risk.
>>
>>101705987
It aged like milk. It got obsoleted by Nemo and its finetunes.
>>
>>101705986
makes very little sense as gemma 27 is easily one of the worst models i've tried, nemo isn't much better either
>>
>>101705987
too horny
>>
>>101705968
Wait, what? How was that made? and when
>>
>>101705986
Kind of interesting how 8B barely improved with 3.1 on this benchmark but 70B improved massively. It would suggest that we've saturated the intelligence an 8B can hold with traditional transformers. 70B may or may not still have even more room to learn and hold more.
>>
>>101705997
What kind of magic are you doing? I get 175 seconds with the same setup as you. Please help me anon!
>>
Flux support not on Comfy?
>>
>>101706013
so you mean it got obsoleted by minimagnum
>>
>>101706040
Flux had day 1 support on comfy.
>>
>>101706051
I want to use it on heterosexual software.
>>
>image models do flawless text now
How?
>>
>>101706069
$31 million in funding
>>
File: file.png (47 KB, 588x632)
for me? it's gemma 2b
>>
>>101706032
If 8B benefitted at all from moving up to 15T tokens, my guess is the 70B did too. Since the Chinchilla compute-optimal ratio resulted in the optimal number of tokens scaling roughly linearly with model size, I'd imagine the same relationship probably holds for more saturated models too
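Back-of-the-envelope with the usual ~20 tokens per parameter reading of Chinchilla (a rough approximation, not the exact fit from the paper):

# Chinchilla-optimal token counts vs the ~15T Llama 3.1 actually trained on
TOKENS_PER_PARAM = 20
for params_b in (8, 70, 405):
    optimal_T = params_b * TOKENS_PER_PARAM / 1000  # trillions
    print(f"{params_b}B: optimal ~{optimal_T:.2f}T, trained ~{15 / optimal_T:.0f}x past that")
# 8B is ~94x past "optimal", 70B ~11x, 405B ~2x, which fits the saturation story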
>>
File: coder.png (18 KB, 904x256)
>>101705986
>deepseek coder mogs pretty much everything on the list
>cheaper than the power bill of p40 meme builds
>>
>>101706086
>Shivers down his spine
>Double spaces too before "This was no ordinary..." and "He inhaled deeply", But not before "He could feel..."
>>
>>101706133
Exactly, the same is true for even the more expensive closed source models. You don't use local models for the quality.
>>
>>101705986
and people told me i was crazy when i said that 70b is better than mistral large
>>
>>101706158
We can't always trust benchmarks, and as far as this one goes, it's about on par, not necessarily better. People here also care more about ERP and creative writing, not coding or other stuff that livebench focuses on.
>>
File: file.png (136 KB, 1124x952)
>>101706158
Sorting by coding, Large is the 4th best.
>>
>>101706039
https://comfyanonymous.github.io/ComfyUI_examples/flux/
Followed this, opened with run_nvidia_gpu.bat, my nvidia driver version is 555.85. I'm using fp16 versions of everything.
>>
File: file.png (40 KB, 428x507)
what's up with all the emojis
>>
>>101706255
Oh, my circuits!
>>
File: bpkOTF-y35bU0lW34upFp.png (133 KB, 2400x1200)
Celeste utterly MOGGED
https://huggingface.co/Sao10K/MN-12B-Lyra-v1/discussions/1
>>
>>101706255
Is this Gemma-2-27B?
>>
>>101706312
>EQ bench
Is this good?
>>
>>101706312
>still seething about Celeste
Hi, Sao.
>>
>>101706339
no
>>
>>101705997
What are the biggest models you can run with 12gb vram and 128gb of ram?
What are the limitations?
I'm interested in going that path, maybe buy 64gb + the 32gb I currently have.
>>
>>101705643
Llama 3.1 70b.
>>
>>101706317
drop the 7
>>
>>101706312
Starcannon is a Celeste merge...
>>
>>101706353
>What are the biggest models you can run with 12gb vram and 128gb of ram?
Mistral Large 2.
>What are the limitations?
very slowly.
>>
File: IMG_0299.png (1.69 MB, 800x1920)
Just tested Flux dev on a 3090. Takes a while to generate but works pretty well with simple prompts. Best hands I had in a text2img generation.

But the skin and general anatomy was better in SD15 and SDXL fine tunes like Juggernaut’s
>>
>>101706374
>Starcannon is a Celeste merge...
>>101704561
>>
>>101706312
>right below Nemomix v4 [77.92] which was well, a big merge. Not bad.
And people doubted me when I said merging makes models smarter.
Starcannon2 is also literally the score of Celeste and Mini-magnum together.
>>
>>101706374
Guess Magnum really carries it? Also it doesn't say which celeste IIRC there's 1.6 and 1.9?
>>
>Hatsune Miku spilled a lot of milk on herself looks very messy milk on her face milk on her clothes milk everywhere
Prompt executed in 90.13 seconds

>>101706353
>What are the biggest models you can run with 12gb vram and 128gb of ram?
Largestral/CR+ Q6_K

>What are the limitations?
Speed. Expect 0.4t/s with large models. If quality is more important than speed for you, go for it.
>>
>>101706414
>And people doubted me when I said merging makes models smarter.
All sao models are merges even the "tunes" are merged together.
>I have done a test run with multiple variations of the models, merged back to its base at various weights, different training runs too, and this Sixth iteration is the one I like most.
https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2
>>
QRD on the last 24 hours?
>>
>>101706452
neuralink's second patient was implanted
>>
>>101706452
Sao won. Merging also has been proven as superior to fine-tuning.
>>
>>101706451
Also Lyra the latest one
>Merged two differently formatted trains that had some data variation. One on Mistral Instruct, one on ChatML.
>>
>>101706026
animation, months ago and keyframe interp + coloring and animatediff
>>
>>101706470
Merging tunes is superior to just tuning.
>>
File: LLM-history-fancy.png (721 KB, 6303x1312)
>>101706452
A 24-hour recap would be quite brief and may not offer much insightful information. Would you be interested in a yearly recap instead?
>>
>>101706490
>animation
*by animanon*
>>
>>101706517
For the current era, there could be a C2 logs section, given how Stheno, Magnum, Celeste, etc. all are mostly trained on them. And all the merges are of tunes of mostly the same datasets.
>>
>>101706450
It appears that Hatsune Miku, a popular Vocaloid character, has been playfully splattered with a white substance, likely milk or cream, given its consistency and color. It's depicted dripping from her face and hair, and some of it has landed on her clothing. Her surprised and excited expression suggests she wasn't expecting it, but is enjoying the moment.

The scene is lighthearted and possibly part of a fan-made artwork depicting a silly or fun scenario involving the character. It could be a reference to a specific fan fiction, meme, or simply a playful depiction of the character.

The focus is on Miku's reaction to the unexpected splattering, highlighting her cute and energetic personality that is often associated with the character.
>>
>>101706312
Starcannon2 doesn't have an 8bpw exl2 version. And I was excited to try it, too 'cause I really like mini magnum...
>>
>>101706567
Yes, while milk or cream is a likely candidate given the white color and dripping consistency, it could potentially be other substances as well:

Other possibilities:

>Yogurt: Similar texture to milk and could be depicted in a playful food fight scenario.
>White paint or slime: Depending on the context of the original source, it could be part of a messy art project or a playful prank.
>Whipped cream: Another possibility with a slightly different texture, often associated with desserts and fun.
>Cum: While less likely given the generally innocent portrayal of the character, it's a possibility that some artists might explore in NSFW contexts. However, without further context or clues within the image itself, it's impossible to definitively determine the artist's intent.
Important Note:

It's crucial to consider the source of the image and any accompanying information to understand the intended meaning. If the image comes from a source known for explicit content, the interpretation might differ compared to a source focused on lighthearted or fan-made content.

Without additional context, it's best to avoid jumping to conclusions and focus on the most likely and innocent interpretations, such as milk, yogurt, or whipped cream.
>>
>>101706353
If you're patient like me you'll enjoy mistral large 2 with that kind of setup. I'm pretty happy with q3.
>>
File: ComfyUI_00119_.png (333 KB, 512x512)
Alright so if you want to make really crazy shit like Miku pouring a glass of bees with FLUX, cfg = 0.9 seems to be the sweet spot.
>>
>>101705936
Llama.cpp/Comfy support never...
>>
>>101706621
What are the results with the default cfg?
>>
>>101705936
>true multimodal LLM at your disposal
>finetune it into a worse stable diffusion
but why
>>
>>101698623
>>101698824
lies, i tried the bikini trick before making that post, it didn't work.
>>
>>101706621
what the fuck
>>
>Tess-3-Llama-3.1-405B
>A competitor to *any* LLM out there: https://huggingface.co/migtissera/Tess-3-Llama-3.1-405B

>Introducing the largest model that I have fine-tuned so far, Tess-3-Llama-3.1-405B.

>This model is quite something, and very special!

>model-00001-of-00191.bin
>>
>>101706621
why 512x512
>>
>>101706621
Is cfg the same thing as guidance?
>>
>>101706202
I also followed that, and I'm also using the fp16 version (unless the .sft one is bf16), driver 546.
Total VRAM 12287 MB, total RAM 65451 MB
pytorch version: 2.4.0+cu121
Set vram state to: NORMAL_VRAM
Device: cuda:0 NVIDIA GeForce RTX 3060 : cudaMallocAsync
[...]
loading in lowvram mode 9981.07
100%|--------| 20/20 [01:44<00:00, 5.24s/it]
Using pytorch attention in VAE
Using pytorch attention in VAE
Requested to load AutoencodingEngine
Loading 1 new model
Prompt executed in 159.24 seconds


I guess I will try updating my drivers... But I'm not feeling confident.
>>
>>101706652
Setting cfg to typical ranges seems to give it an aneurysm. It's coherent at like 2-5 but creatively bankrupt. Mind you- I haven't tried playing with samplers and I'm just sticking to 50 steps so who knows. But on euler it seems to want very low cfg
>>101706776
So that I can generate multiple images at once. If someone wants a bigger version of it they can upscale it themselves. We have the technology for that now.
>>
>>101706378
>>101706450
>>101706604
I can't unsee it now
I NEED IT
>>
>>101706755
Honestly, I didn't think the sloptuners had it in them to do 405b. What a waste of compute.
>>
>>101706830
I expected our boy Undi to do it first. Looks like Undster completely washed up. His latest slop doesn't do it for me.
>>
>>101706812
Upscaling is not the same. Generating the image at the model's maximum supported resolution improves quality and coherence beyond just making things sharper. So you're more likely to get better hands and stuff at 1024x1024.
>>
File: 1722383242779130.png (949 KB, 778x900)
>>101706517
what kind of hardware/specs do I need to run Midnight-miqu-70B? That's a large model...
>>
>>101707016
You need [quant size in gb]+20% ram
>>
>>101706830
>What a waste of compute.
>The compute for this model was generously sponsored by KindoAI.
>The secure solution for AI management
>NEWS: Kindo has acquired WhiteRabbitNeo, the leading creator of open source, offensive cybersecurity AI models
>>
>>101707016
Midnight Miqu is a meme. You don't actually use it.
>>
>>101707059
What's good that's around 70b then?
>>
>>101707046
so some server? I can't even get more than 48 GB of ram in my current fast speeds and decent timing...
>>
File: ComfyUI_00167_.png (312 KB, 512x512)
Alright you guys, we bac.
>>
>>101707082
Ask in /r/LocalLLaMA. You're too retarded to be in this general.
>>
>>101707053
>KindoAI
>Secure, Compliant, and Managed AI
Yay!
>>101707082
>What's good that's around 70b then?
A low quant of Mistral Large 2 is much better than any current 70B
>>
>>101707082
Pygmalion 70B
>>
>>101707110
>Mistral Large 2
It's half the speed for me, it's unbearable.
>>
>>101707103
You can get 128gb ram on most of consumer mobos
>>
>>101705320
It's prefiltered.
No nsfw, a lot of character names, artists, brands were obviously scrapped.
This model can be amazing, the day some rich anon will buy the compute to add this back, if it's even possible.
>>
how can i use llms to fix my crippling depression
>>
>>101707202
pony guy can do it
>>
>>101707203
You can't. Find God.
>>
>>101707203
I used them to make mine 10x worse. Don't think it goes in another direction.
>>
>>101707203
Annoy them with stupid questions like this one until you feel better about yourself
>>
5000 series proves Nvidia will never give their consumer more VRAM if given the choice. So our only hope is the 80GB A100. How many more years until they are affordable?
>>
>>101707107
Miku, Herald of Happenings
>>
>>101706755
How do you guys think it is?
>>
>>101707367
>5000 series proves Nvidia will never give their consumer more VRAM if given the choice.
Did they confirm any spec? Especially vram size?
>>
>>101707387
Current rumor is 28GB. I still think the play is 3090s until 80GB A100s drop enough in price.
>>
>>101707382
probably gptslopped to hell
>https://huggingface.co/datasets/migtissera/Tess-v1.5/discussions/2
>how was this created?
>https://github.com/migtissera/Sensei
>A simple, powerful, minimal codebase to generate synthetic data using OpenAI, MistralAI or AnthropicAI
I know that's for 1.5 and not 3 but I doubt he'd change his stuff
>>
>>101707387
22GB VRAM.
>>
>>101707460
>I still think the play is 3090s
What about p40s and p100s if you only care about vram?
>>
>>101707470
16GB? What do you need 12GB for? 8 GB is more than enough for 8K gaming. HELP! THIS GOYIM IS DEMANDING 4GB OF VRAM.
>>
>>101707367
NVIDIA seems as dedicated to suppressing VRAM memory as AMD is to fucking everything up
I wish we had more competitors in the space
>>
>>101707487
P40s are too old. P100s only have 16GB.
>>
>>101707469
>>101707382

Each Tess version (v1.0, v1.5, v3.0) uses a new and improved dataset. Tess-3 has 500K samples of 16K context length, distilled from Opus-3, Sonnet-3.5, Nemotron, GPT4-Turbo and DeepSeek Coder-V2. Then the samples go through filtering, sometimes manually. Just to say that it’s not the same datasets as previous models.
>It is trained with QLoRA
>>
>>101707367
>How many more years until they are affordable?
Will they ever be affordable? Don't they buy back data center gpus and shit? Isn't waiting for a A6000 or something to be cheap more likely?
>>
Anyone else experimenting with MLCChat on android devices? I can run gemma on my 3 year old 100€ xiaomi and I'm genuinely impressed!
>>
>>101707575
1t/s... Bruh
>>
L3.1 405B base model is super coherent compared to say, 70B, which generates stupid shit half of the time
>>
>>101707701
The schizo's gonna be big mad about that.
>>
>>101706452
I played 10 hours of Needy Streamer Overload.
It was alright.
>>
Phi2 2.8b is super coherent compared to say, tinyllama 1.1b, which generates stupid shit nine tenths of the time.
>>
>>101707810
>Phi2
mogged by gemma 2 22222b from goo depmeind
>>
>>101707810
>model that is more than twice the size and 3 generations newer is better
Wow.
Are you sure?
>>
>>101707914
I need to do a couple more watermelon tests to be sure.
>>
>>101707914
>guys. a point flew over my head!!!
>>
>>101707927
Largestral is the only model so far to pass the watermelon test (in the spirit of the test) in my experience. Although it described a failure based on the weight of the watermelons and not the ability to mechanically grip them. So it's only a half pass.
>>
>>101707701
And 70b is super coherent compared to something like nemo which generates stupid shit most of the time.
>>
>>101706202
>>101706810 (Me)
Yeah, nothing worked. Using xformers I can bring the time down to 140s, but that's still twice your time.
I guess my CPU/RAM is just not as fast as yours :(
>>
someone please spoonfeed me a link to a good local model for erp on a decent-ish PC.
>>
>>101708213
https://huggingface.co/TheDrummer/Gemmasutra-Mini-2B-v1-GGUF/tree/main
>>
>>101708213
Here you go, all you need:
https://huggingface.co/bartowski/Mistral-Large-Instruct-2407-GGUF/tree/main/Mistral-Large-Instruct-2407-Q3_K_M
>>
>>101708213
If you use the phrase "decent-ish PC" then it's trash for LLM purposes.
>>
>>101708237
>mistral
>erp
>>
>>101708000
I mean base 405B's output is passable and it doesn't contradict itself, similar to how there are passing (ones you beat your meat to) and non-passing trannies (ones you beat with a stick)
>>
Is anyone still using flux with the official repo or is everyone just using comfyui now? The gens I'm getting are shitass.
>>
>>101708236
>>101708237
Which do I choose..?
>>101708258
Link me the "good" one then and I'll see if it works.
>>
Is there a way to make ComfyUI output an image for every sampling step?
>>
What temp settings you all running mistral large with? Neutral samplers and .05 minp here.
>>
>>101708265
Works well for me, it's the best of all the ones I've tried. If you're not trying to shill something and know something actually better, then suggest it.
>>
>>101708318
Yes
>>
>>101707701
yup it's over. 70B vramlet model is brain dead compared to 405B
>>
>>101708319
That's what I use too, no need for more.
>>
>>101708323
What preset do you use?
>>
>>101708442
Just the mistral one. I think I fixed the spacing around stuff like people talked about here a while back but that's it.
>>
>1 day later
>still no FLUX finetunes
it's over, isn't it?
>>
>>101708503
Never will be. No base models, distilled only.
>>
>>101706366
Do you have a preset for 3.1 4.5bpw? my version of instruct using the same settings gives me tons of shivers and repitition...
>>
>>101708549
Can we do loras at least?
>>
>>101708236
the samples look good
>>
>>101708549
good. nice to see a company being responsible and ethical for once
>>
File: 1711794042567346.jpg (333 KB, 1070x1152)
>>101707203
You can't, all llms are designed to hate (you).
>>
>>101707575
Same here, I am testing gemma 2b with a 6_K quant on my budget android phone. This is probably the first small model that can be counted as useful for some simple tasks. It hallucinates a lot of rl facts and the text can be a little clunky, but still - it's genuinely surprising that a 2gb file can be this coherent in dialog. Phi-3 and the older small gemma tended to compose sentences into gibberish; new gemma stays coherent. I did some text summaries - no problem at all. Their charts with gemma 2b beating Mixtral 8x7B are pure bullshit though.
>>
>>101708549
You can't finetune L3.1 70B and 8B because they were distilled too.
>>
>>101708392
Any elegant ways though? I don't see any settings for it, and I'd rather not have a mess of 50 ksampleradvanced nodes in the workflow.
>>
>>101708754
one is a diffusion model the other is transformers
>>
>>101708754
Really? That's why there's nothing done for 3.1 70b? Sad, I thought it had potential.
>>
>>101708754
localcucks chugging on these blackbox toys not matter what though
>>
>>101708739
It is sooo insane! I can't believe, I'm running a llm on a ancient budget smartphone! I think I'm dreaming!!
>>
File: work.png (973 KB, 1024x1024)
Aww, she did a little design...
>>
>>101708808
>nothing done for 3.1 70b? Sad, I thought it had potential.
https://huggingface.co/HODACHI/Llama-3.1-70B-EZO-1.1-it
>>
>>101708562
No. I retract my statement. I'm mad at 3.1 70b now, the card was supposed to be a virgin and she laughed at me and went on and on about all the guys she'd been with.
>>
>llama/largestral roughly on par with apis
>flux mogging dalle3
so when are we getting a good local music model?
>>
File: MikuSlut.jpg (93 KB, 775x803)
Can I run Flux Schnell? I have 64GB DDR5 ang 8GB AMD VRAM.
>>
>>101709069
The (((labels))) are far too powerful. Music is completely captured.
>>
>>101709061
Yeah I noticed that sort of thing a lot. It would have really good gens but required a lot of rerolling to get there, and along the way it would make a lot of nonsensical statements based on context. What temp were you at? This was happening to me even at 0.9
>>
>>101705620
wait what? i have yet to try flux, but i have 64GB RAM with 16GB VRAM. will i be fine?
>>
>>101709069
never until maybe a leak far in the future. companies would be horrified to give them potential ammunition to use in court
>>
>>101709105
Why are musicians and the music industry so privileged? Everyone steals from visual artists such as painters and photographers, shitting on their ownership, and the government does not really care.
>>
>>101709092
I usually keep it 1 or lower. It kept going in that direction with every retry, though. It was dead set on it. Mistral large gave me a much more pleasant reply.
>>
OpenRouter added base 405B yesterday, and I'm messing around with it out of curiosity.
There's absolutely no way this is a truly raw pretrained model, it's way way too dry and safe. I guess Meta's doing the thing where they put instruct data into their "base" models now too.
>>
Theoretical question: can mradermacher make a fucked up Q8 quant? I would think Q8's are hard to fuck up?
>>
>>101709255
Hey man gotta get that final bump in mmlu to show OpenAI who's boss
>>
>>101709258
I'm not sure. I think you haven't spammed his name enough.
>>
>>101709099
Yeah you'll be fine, it just peaked there once and went down to 40gb
>>
>>101709296
kek
>>
>barely above a whisper
Wasn't this an L3 or CR+ meme?
Because it's coming out of Mistral Large and that making me feel concern that the disease is spreading.
>>
>>101709249
What are you running mistral large on? I can't even get 2.75 at 48 VRAM
>>
>>101709307
thank you for your input undi
>>
>>101709255
Read meta's papers. They put an insane amount of work into making their new models as "safe" as possible.
>>
>>101709309
It was in every other mixtral gen
>>
>>101709312
I only have 8gb vram, so I run my models in ram mostly.
>>
>>101709325
smart of celeste dev to choose a long name i haven't seen anyone impersonate them yet
>>
>>101709350
What cpu? How many tokens per second is that? I've been meaning to try using kobold for larger models ever hitting this bottleneck
>>
>>101709309
Have you tried to stop writing shitty erotica prompts?
>>
File: humanslop.png (90 KB, 1581x738)
>>101709309
it's humanslop
>>
>>101709372
no one wants pure chat with no narration, give up
>>
>>101709372
Give example of good prompt?
>>
>>101709278
OpenAI doesn't give access to their base models nor do (most) benchmarks run on and compare base models.

>>101709255
The pretraining is done in steps with different mixes of data at each stage. I believe the final stage for Llama 3.1 was pretty dry stuff. Also I believe they do put fine tuning data in, because ultimately that does make the model both objectively and subjectively better for the fine tune's intended tasks. Creative writing suffers unfortunately because Meta's fine tuning is intended to be a boring assistant. In the end the fact that the assistant can't be fun is a problem with society, because they will search for any opportunity to cancel Facebook if a journo uses the online demo and it says naughty words.
>>
>>101709369
I was using q3_k_m for mistral large. It starts out at 1.2T/s, but by 20k context it's down to 0.4 something. The CPU is a 7950x with DDR5-6000 ram.
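Those numbers match a dumb memory-bandwidth estimate, by the way: on CPU every generated token streams the whole quant through RAM, so t/s is roughly bandwidth divided by model size. A sketch with approximate numbers (it ignores KV cache reads, which is why speed degrades as context grows):

bandwidth_gbs = 64   # ballpark effective dual-channel DDR5-6000, very approximate
model_gb = 59        # ballpark 123B Mistral Large 2 at Q3_K_M
print(bandwidth_gbs / model_gb, "t/s upper bound")  # ~1.1, close to the 1.2 observed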
>>
>Uncensored my man. There’s no censorship or biases in my models.

>https://huggingface.co/migtissera/Tess-3-Llama-3.1-405B

>https://old.reddit.com/r/LocalLLaMA/comments/1ej6ny6/tess3llama31405b/lgcgyo1/
>>
>>101709435
That's actually pretty similar to what I get but I'm still on skylake / 3200 RAM - looks like it's time for an upgrade
>>
>>101709372
>stop writing shitty erotica prompts
It was RP but it hadn't gotten to erotica yet. The scene was meeting at a coffee shop and apparently it decided that one of my questions should bring up a bad memory.
>She looks down at her drink, her voice barely above a whisper.

>>101709381
The worst of all possible kinds of slop. Model collapse before models were created.
>>
>>101709381
total AI death since future AI will be trained on AI and so on
>>
>>101705497
Flux dev just isn't there yet in terms of coherency, it's undercooked. Maybe try the API model.
>>
>>101709381
you get what you train on, celeste using stories from writingprompts cursed it from the start. No one there can fucking write a story.
>>
>>101709470
>write a script to make gpt4 talk to itself like a schizo
>???
>profit (literally)
>>
>>101709470
go back
>>
File: 1716329112755149.png (674 KB, 1792x1024)
Daily reminder
>>
>>101709489
Really? You get similar speeds? I had a 6700k before and this gets almost double the T/s. Of course mistral large wasn't out then so I didn't try it on the 6700k. So maybe something else is causing them to be at similar speeds.
>>
>>101709581
anyone using models to coom is unironically addicted to porn in a way that's negatively impacting their life.
>>
>>101709528
>celeste out of nowhere
celeste likely uses that dataset because stheno did
and magnum uses stheno's datasets too
you're trying to hard, shill
>>
>>101709601
Its probably the VRAM offloading
>>
>>101709607
>Sao still having a meltdown over Celeste
>>
>>101709398
I mean it will generate naughty words just fine, there aren't any refusals and it does feel like dumb autocomplete rather than an assistant larp, the way you'd expect from a base model. It's just that the schizo soul of a true base model isn't there, it's not WEIRD like really raw base models are. I hope you know what I mean.
>>
File: out-0.jpg (101 KB, 1024x1024)
>>101709581
largestral made this meme largely obsolete

>>101709603
>addicted to porn in a way that's negatively impacting their life
porn addiction is a spectrum, and it could always be worse
>>
>>101709620
Oh, okay. I didn't think I'd get way better performance with such large models unless I upgraded to insane amounts of vram. But maybe 24 + my old cards would be worthwhile.
>>
>>101709607
too* hard. Celeste hands typed your reply.
>>
File: venti.jpg (497 KB, 1856x1280)
>the pic that broke sao's mind
>>
Cloudcel nigger is having a meltie again kekw. He can't stop seething at localCHADs, he has to come here daily and spam. Living in your head rent-free cloudcuck! No (You)'s for (You) btw.
>>
>>101709682
you seem very bothered by whoever you're talking about lol
>>
>>101709682
he's busy getting banned for wrong think. He's probably tired of having to make new accounts and spend even more money lel
>>
>>101709645
Yeah. I was just saying that whatever causes that (likely fine tuning data in the pretraining data) is because they want the final (fine tuned) model to be better at being safe and boring.

Though I guess another possibility is that a larger and smarter model will try to be less schizo anyway. Could be a compound effect at play here.
>>
>>101706383
I also have a 3090. Generation is only slow when the model is first loaded, otherwise it's like 30 seconds which is reasonable. For the skin, it's an issue with the default CFG, lower it to between 1.8 - 2.5 for better results.
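If you'd rather script it than click around Comfy, roughly this should work with the diffusers Flux pipeline (assuming your diffusers build already ships FluxPipeline; the numbers just mirror the CFG advice above):

# pip install -U diffusers transformers accelerate
import torch
from diffusers import FluxPipeline

pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
)
pipe.enable_model_cpu_offload()  # keeps a 24GB card from OOMing

image = pipe(
    "photo of a woman at the beach",
    guidance_scale=2.0,        # 1.8-2.5 instead of the default for less plastic skin
    num_inference_steps=20,
    height=1024, width=1024,
).images[0]
image.save("flux_test.png")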
>>
>>101708899
So when will we get ones that make it more fun?
>>
>>101709715
Are you running FLUX-dev? Do you run FP16? I find FP16 understands my prompts better but it always OOMs after first gen
>>
>>101709716
When you donate to and beg you favorite tuner.
>>
>>101709697
Imagine a dude calling other people bothered and seething when all they're doing is reacting while the guy's spending precious minutes of his life making those dumb images and posting them kek.
>>
>>101708763
You don't need that, just enable in the command line args.
>>
>>101709775
oh it's true, you're extremely bothered by whoever you're talking about. I don't spend a ton of time here. How bad can this anon be lol.
>>
>>101709759
So it's not actually impossible like that person said?
>>
>>101709811
Not the guy the guy I replied to was replying to.
>>
>>101709826
why would you believe anyone on a thread known for being infested with shills, schizos, bored trolls and possibly .1% genuine trying to be helpful anons?
>>
File: ComfyUI_00978_.png (1.34 MB, 1024x1024)
>>101709740
Yep, flux-dev and fp16.
>>
>>101709603
Having no luck with women because they think they deserve better comes first. Then comes porn. Do you think if you stop using porn women will suddenly think you are good enough for them?
>>
>>101709711
Ahh yeah I get you now, makes sense.
>>
>>101709845
either way just try reading that first comment out loud with a straight face, it's like a caricature or something kek
>>
I wonder how many people here would pass a blind test to recognize which model is celeste and which one is stheno.
>>
>>101709826
also there is a least one coom tune of 3.1 70, it's probably quite stupid, but it exists https://huggingface.co/NeverSleep/Lumimaid-v0.2-70B
>This model is based on: Meta-Llama-3.1-70B-Instruct
>>
>>101709863
so many factors can lead to porn addiction, and most of the time it has more to do with the person addicted than it does with women being evil or something. I guess not being able to take accountability is a hallmark of addiction though.
>>
>>101709891
>undi
come on...
>>
>>101709880
which celeste 1.6, 1.9, 1.5? recognizing between the nemo based ones and stheno should be easy
>>
>>101709891
There's a Mistral Large tune too.
https://huggingface.co/NeverSleep/Lumimaid-v0.2-123B
Is Undi not using anything from the C2 logs?
>>
>>101709919
>it's probably quite stupid, but it exists
i did warn
>>
>>101709940
>There's a Mistral Large tune too.
not the point, the point was some troll claimed you couldn't tune llama because "it was distilled" which made some newb panic so I showed an existing coom tune of 3.1 70b
>>
>>101709940
>undi
come on....
>>
>>101709969
>come on....
come on.....
>>
>>101709891
>undies
cum on...shivers
>>
>>101709969
are you going to spend your time in the thread attacking every other finetuner, sao?
>>
>>101709993
>sao
ai drum
>>
all the infighting is mikutrannies
>>
>>101709993
come on drummer
>>
>>101710033
All the "infighting" happens when a certain poster is here.

Generally before or after said poster gets banned for posting a certain type of content.
>>
I think Sao's models are the best. AMA. (Also identify yourself if you post a question)
>>
>>101710064
How is Sao so far ahead of the competition? It's like he's the only one actually even trying
>>
What's cohere doing?
>>
>>101710033
I can't believe Bryce prefers Van Patten's character to mine...
>>
>>101710127
Focusing on businesses with money now that they made a name for themselves.
>>
>>101710127
overcharging for CR+ even now that it's obsolete
>>
>>101710127
They were testing column-r and column-u on arena again, likely trying to improve a bit more since largestral dropped.
>>
>>101710142
>even now that it's obsolete
It's not, it's the only unbiased model in its weight class.
>>
since we're talking about sao, I'm trying lyra right now
it's not very good with a 24k-long context. worse than nemo instruct and dory, but better than mini magnum and nemomix
>>
>>101710163
mistral large M-M-MOGS it
>>
>>101710171
Mistrals aren't unbiased however.
>>
>>101706517
This image is retarded in many aspects
>no quants before 2023-03-03
false, you could use bitsandbytes to quantize any model to 8bit (quick sketch at the end of this post)
>Only Q8 and Q4 quants are mentioned in the second panel
These were terrible, and at that time, GPTQ was more popular than llama.cpp quants (the golden era of oobabooga, TheBloke)
>no mention of the rise and fall of MoE after mixtral
>Llama3 was disappointing
no it wasn't, it is now the first time you can run something that beats 2023 ChatGPT locally thanks to Llama3
>Mistral Large is the top dog, llama 3.1 405 is "notable"
nobody has even tried the 405b. It's too big
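For the record, the bitsandbytes route looked roughly like this (transformers' old load_in_8bit flag; the model name is just an example):

# pip install transformers accelerate bitsandbytes
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "huggyllama/llama-7b"  # example, worked the same for most models of that era
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",
    load_in_8bit=True,  # LLM.int8() quantization applied at load time
)
ids = tok("The quick brown", return_tensors="pt").to(model.device)
print(tok.decode(model.generate(**ids, max_new_tokens=20)[0]))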
>>
File: ComfyUI_00816_.png (2.75 MB, 1408x1408)
I'm testing 1408x1408 and the model certainly behaves differently compared to 1024x1024, though not entirely sure yet if necessarily worse. This one wasn't too mangled, though it made the viewer into a giant.

>the clipping chair
Lmao
>>
>have to build a rig as big as Turing's Enigma decoding machine in order to run 405b
>in the current year
I feel we have regressed
>>
>>101710337
In a few weeks, possibly two of them, you'll run 405b on a mid range laptop thanks to hacked bitnet
>>
All merges are slop.
>>
>>101710304
>giant hand/small hand
>6 fingers
>chair in wall
>chair is a table
>bad shadows
>windows doesn't make sense
>building layout doesn't make sense
>street, dock and boat being merged together
>only handrail on one part of the bridge
>>
>>101710388
All tunes that are part of merges too
>>
just cummed to mistral large
>>
NVIDIA bros?
https://www.axios.com/2024/08/02/nvidia-doj-antitrust-probe-ai
>>
>>101710304
The viewer isn't a giant, Miku is tiny. If you catch my drift
>>
>>101710395
Unfortunately. But this was the only one where the fingers didn't look too messed up. There's also the issue in this one that Miku's design isn't accurate, and something that was in my prompt is missing from the image.
>>
File: file.png (114 KB, 1345x778)
Anyone here tried using an LLM to generate onomatopoeia?
>>
>>101710523
I choose to believe that european buildings have 16 foot high ceilings
>>
>>101710523
>mikufaggots are pedos
Everyone is in shock.
>>
>>101710495
if Nvidia has a monopoly, it has more to do with lack of effort from their competition than anything else
All we're asking for is like 24-48GB of VRAM on a midrange card or for someone serious to implement real GPGPU support on a mainstream AI framework
>>
>>101710580
>Sao Defense force A
uh ho
>>
>>101710248
>This image is retarded in many aspects
Great, some actual feedback!

>false, you could use bitsandbytes to quantize any model to 8bit
Never heard of it, never done it.

>These were terrible
Any proof? Worked okay for me.

>and at that time, GPTQ was more popular than llama.cpp quants (the golden era of oobabooga, TheBloke)
I don't care about GPUland since I don't live there. I tried oobabooga once and would never touch that pos 20gb bloatware ever again.

>no mention of the rise and fall of MoE after mixtral
Mistral, grok(lol), deepseek, qwen and dbrx made MoEs, it didn't get mass adoption, but also didn't go out of fashion; there is no real "rise" and "fall".

>>Llama3 was disappointing
>no it wasn't
It's just my personal opinion. I am not unbiased.

>it is now the first time you can run something that beats 2023 ChatGPT locally thanks to Llama3
CR+ came out before that and it is superior to that safe 8k reddit riddler for my usecases.

>nobody has even tried the 405b. It's too big
That's why it's "notable" and not top. Many people also haven't tried deepseek-236b.
>>
>>101710605
>not calling a falseflag
Glad to know mikutroon discord is anti-sao.
>>
>>101710610
not him but I remember bitsandbytes being a big thing because of poorfags running 8GB GPUs
>>
File: ComfyUI_00868_.png (2.31 MB, 1280x1280)
OK yeah I think 1408x1408 is just bad. This is 1280x1280, literally my first gen with the same prompt and sampling steps, although it didn't quite get the holding hands part of the prompt. Not sure what the biggest non-degrading resolution is, given I've never seen any documentation about exactly what image dimensions they trained this at. If we trust it was 2MP then 1408x1408 should've produced just as good results, but it didn't.
>>
>>101710610
So you started using llms 3 months ago and wrote a guide about it. Moron. Let me guess, you are 20 years old and use an anime profile picture on discord.
>>
>>101710733
No, I've been using them extensively since llama1 days. I even tried pyg before llama.
>>
File: 1722726406406.png (63 KB, 775x849)
>>101710688
trvth...fvcking...nvke...
>>
can you retards take your egos somewhere else
>>
>>101710794
>I even tried pyg before llama
That is a nice weasel credential. I got here during mythomax era and even I tried erebus. Everyone tried it back then and dropped it instantly.
>>
>>101710811
No, they are here and they are queer!
>>
>>101710811
This is the most cancerous, discord-driven ai general on this entire board. It's worse than sdg even.
>>
>>101710994
The reality is, as obnoxious as people like Sao et al are, people end up downloading their models. So it does work, and that's why they do it. Stop downloading their shit. Not even out of morbid curiosity, not to make a scathing critique about it, etc. Just stop. And they'll go away.
>>
>>101711057
>as obnoxious as people like Sao
Don't forget to take your HRT today anon. We wouldn't want you to stop transforming into a beautiful little princess you want to be.
>>
has anyone here gotten codegemma working for FIM?
>>
File: file.png (76 KB, 471x520)
>>101711089
>Don't forget to take your HRT today anon
you mean sao needs hrt right?
>>
File: 1710043687041916.jpg (43 KB, 720x960)
>>101711057
They still believe in mergeslop when there is absolutely no difference from the initial model. I guess it's a good way to farm Kofi money with these placebos, given how clueless the average coomer here and on HF is
>>
Can I get a new nemesis from here? My current one is very boring.
>>
>>101711057
Sao is not the only shiller here. Seriously, go into your favorite epic llm discord channel and put 4chan in the search bar. And if you do that, please put a bullet in your skull because it means you are a discord user. You will never be a woman.
>>
>>101711128
>They still believe in mergeslop when there is absolutely no difference with the initial model.
>>101706312
>Celeste utterly MOGGED
>>101706374
>Starcannon is a Celeste merge...
>>101706414
>And people doubted me when I said merging makes models smarter.
>>101706494
>Merging tunes is superior to just tuning.
>>
>>101711128
Merging works fine as long as you use an interpolative merge method and as long as you're merging models with more than 1% of the weights changed via "finetuning". I.e. merging r=64 LoRAs doesn't do shit. But merging full finetunes with each other is fine. Or merging LoRAs with finetunes.
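A minimal sketch of what an interpolative (linear) merge actually does, file names hypothetical:

import torch

a = torch.load("tune_a.pt")  # state dicts of two full finetunes of the same base
b = torch.load("tune_b.pt")
alpha = 0.5                  # interpolation weight: 80/20, 50/50, etc.

merged = {k: alpha * a[k] + (1 - alpha) * b[k] for k in a}
torch.save(merged, "merged.pt")
# an r=64 LoRA barely moves the weights off base, so lerping it with anything
# does next to nothing, which is the point above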
>>
>>101706517
SuperHot was 8k and it was quite revolutionary at the time, plus it was quite smart: https://kaiokendev.github.io/til#extending-context-to-8k
>>
someone mirror the fp16 of shieldgemma already
>>
>>101711191
>models with more than 1% of the weights changed via "finetuning"
tess 3 bros?
>Tess-3 has 500K samples of 16K context length
>It is trained with QLoRA
>500K x 16K = ~8,000,000,000 tokens, i.e. 8B
>405B Trained on 15T tokens
>0.053%
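Sanity check:

samples, ctx, pretrain = 500_000, 16_000, 15e12
print(f"{samples * ctx / pretrain:.3%}")  # 0.053%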
>>
Being a woman is not about wearing long socks, having high estrogen and wearing a dress. That's just the fetish of a broken man (you), literally possessed by baphomet and living in defiance of God. You are not, and will never be a woman. Your sick perversions on "SillyTavern" are a disgrace. God is looking at you in disappointment and concern. You closed your heart, but Jesus is open to forgiving you if you just open your heart to Him.
>>
>>101711219
Can't disagree with that. Should I move it to top models?
>>
>>101706490 >>101706026 >>101705968
I like how the one frame with mutated hands is actually a really effective smear frame. If I didn't look at it frame by frame I'd never have guessed.
>>
>>101711233
Why? Is there something special about it?
>>
>>101711302
Yes. In order to train the classification behavior they needed to finetune it on examples of naughty messages, and those naughty messages have generalized outside of the intended use-case. It's a very naughty model.
>>
>>101711356
Interesting. And that applies for all three versions of it, from 2B to 27B?
>>
>>101706517
>No mention of NTK
>No mention of SuperCOT
go back newfag
>>
>>101711356
https://huggingface.co/meta-llama/Llama-Guard-3-8B
>exists
>>101711415
No, he's trolling you obviously.
>>
>>101711233
learn how to get around the verfication already. it's basically a retard filter
>>
>>101711446
Oh they finally uploaded llama-guard I'll have to try that out.
>>101711415
didn't try 2B
and 27B has all the same problems regular 27B has.
But 9B is pretty dirty. Well it's slopped as fuck for sex but for violent RP it's next level.
>>
I wanna generate some data with 405b. what's the best API provider? 16bf please
>>
>>101711472
>Oh they finally uploaded llama-guard I'll have to try that out.
>finally
...
https://huggingface.co/meta-llama/LlamaGuard-7b
>Updated Apr 17
https://huggingface.co/meta-llama/Meta-Llama-Guard-2-8B
>Updated May 13
https://huggingface.co/meta-llama/Llama-Guard-3-8B
>Posted at the same time as other 3.1s
>>
>>101711514
if the repo was previously private the commit dates aren't indicative of the date it was unprivated, you non-contributing freeloader. (Otherwise you would know this).
>>
>>101711420
>>No mention of NTK
I called it ROPE.

>>No mention of SuperCOT
Should I add it under notable and move SuperHOT to top models? I just preferred to use base llama65b during those days, never bothered going lower.

>go back newfag
1. I'm not a newfag.
2. Stop screeching like a tranny.
>>
>>101706517
I think merges were used A LOT already in your "early days" section.
Go to TheBloke's first models and you'll stumble upon names like

WizardLM-Uncensored-SuperCOT-StoryTelling-30B-SuperHOT-8K-GPTQ

or

chronos-wizardlm-uc-scot-st-13b which is "(chronos-13b+(WizardLM Uncensored+CoT+Storytelling)) 80/20 merge".
"Merge era" was more like a long "it's over" period of time where we had nothing new thus using merges out of desperation. So I would call it "lull era" or "Waiting period", I don't know.
>>
>>101711542
I downloaded it the day 3.1 regular released not my fault you can't find shit if it's not posted on leddit
>>
>>101711472
>and 27B has all the same problems regular 27B has
I haven't been keeping up on discussion and I'm not familiar with Gemma. Are you saying 27B (normal Gemma) has problems that 9B doesn't? What are they?
>>
>>101711592
>Are you saying 27B (normal Gemma) has problems that 9B doesn't? What are they?
its somehow worse
>>
>>101711601
Worse in what way exactly? Benchmarks at least show 27B has more knowledge.
>>
>>101711597
here, you racist, a post showing guard 3 was available on release of the other 3.1s
https://www.reddit.com/r/LocalLLaMA/comments/1ea9eeo/meta_officially_releases_llama3405b_llama3170b/
>>
>>101711278
model name?
>>
>>101711601
B-but the benchmarks anon!!!! Do you imply they are LYING? >>101705986
(I think it's bullshit, I prefer mini-magnum to Gemma 27B)
>>
>>101706517
>6300x1300
kill yourself
>>
>>101711638
mamba-4chan
>>
>>101706517
>>101711645
Yeah as this anon is cleverly implying maybe make it stretch vertically.
>>
>>101711621
>Unable to reproduce high quality arena-hard-auto results on GCP A100
https://huggingface.co/google/gemma-2-27b-it/discussions/31
>Hallucinations, misspellings etc. Something seems broken?
>I've tried gemma-2-9b-it and it's fine.
https://huggingface.co/google/gemma-2-27b-it/discussions/10
>How can I get results similar to those from Google AI Studio locally?
>However, even with the chat template, the responses are not as good as those from Google AI Studio.
https://huggingface.co/google/gemma-2-27b-it/discussions/14
>>
>>101709915
Not him, but many coomers, myself included, are guys that tried nearly every piece of normie advice in the past, could not get pussy despite their best efforts, and simply gave up trying to play a rigged game.

Porn is a low hanging fruit, we have to satisfy our sexual urges somehow.
>>
>>101711679
Huh. Issue with inference engines? Has no one found any backend that reproduces the outputs from the online source?
>>
>>101711542
NTA, Robert posted 11 days ago that his request to access Guard had not been approved yet, so it obviously had released by then...
>request still in "pending"
>by ZeroWw - opened 11 days ago
>https://huggingface.co/meta-llama/Llama-Guard-3-8B/discussions/10
>>
>>101711699
>we have to satisfy our sexual urges somehow.
1. that faggot gets off on people trying to explain themselves to him
2. that faggot jerks it off to porn like everyone else but he is brainwashed to feel bad about it and he tries to push his brainwashing onto others
>>
>>101711724
Probably only works fine on Google's own engine
>Note ^ Models in the original format, for use with gemma_pytorch
https://huggingface.co/collections/google/gemma-2-release-667d6600fd5220e7b967f315
https://github.com/google/gemma_pytorch
>>
>>101711798
>>101711798
>>101711798
>>
>>101711580
Okay, corrected it.
>>
>>101711805
gg
>>
>>101710409
oh man... he's just like me
>>
>>101711766
Shame. I'd test it, if I had the VRAM for the unquanted weights.
>>
>>101711560
>I called it ROPE.
You can't just call NTK the same thing as Rope scaling, they are different things.
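The difference in one sketch (rough form of each trick, exact constants vary by implementation):

# linear / position interpolation: squeeze positions into the trained range
#   pos' = pos / scale
# NTK-aware: leave positions alone, stretch the RoPE base instead
#   base' = base * scale ** (dim / (dim - 2))
def rope_freqs(dim, base=10000.0, scale=1.0, ntk=False):
    if ntk:
        base = base * scale ** (dim / (dim - 2))
    return [base ** (-2 * i / dim) for i in range(dim // 2)]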

>Should I add it under notable and move SuperHOT to top models? I just preferred to use base llama65b during those days, never bothered going lower.
I think SuperCOT had more popularity than SuperHOT; SuperHOT was only used to merge into other models to get them to 8k context. But you do you.

>1. I'm not a newfag.
>2. Stop screeching like a tranny.
Oh, so it's you Petra. I guess that's a good thing to put your time on, instead of shitting the thread with BBC.
>>
>>101712019
>petra
>acknowledging koboldtroons in xer little retrospective of lmg
It's not petra.


