/g/ - >"Dude, just use your own local AI! It's better an - Technology

Anonymous

05/23/26(Sat)00:32:15 No.108885822

Anonymous 05/23/26(Sat)00:32:15 No.108885822 Archived

>"Dude, just use your own local AI! It's better and not censored!"

>actually listen to /g/ and install local AI (gemma4, 26b)
>try it out
>fans spin up like I'm playing Crysis in 2007
>several minutes pass
>its response is barely better than grok, nevermind Claude/Gemini/ChatGPT

lmao, I was bamboozled again

Anonymous
05/23/26(Sat)01:14:45 No.108885992

Anonymous 05/23/26(Sat)01:14:45 No.108885992

>>several minutes pass
stopped reading there

Anonymous
05/23/26(Sat)01:25:41 No.108886032

Anonymous 05/23/26(Sat)01:25:41 No.108886032

>>108885822
>26b
lol

Anonymous
05/23/26(Sat)01:48:42 No.108886130

Anonymous 05/23/26(Sat)01:48:42 No.108886130

Specs? fastfetch? speccy (bleh)?

Anonymous
05/23/26(Sat)01:50:03 No.108886136

Anonymous 05/23/26(Sat)01:50:03 No.108886136

local AI is only free if your daddy pays for your expensive gpu and electricity

Anonymous
05/23/26(Sat)01:55:18 No.108886155

Anonymous 05/23/26(Sat)01:55:18 No.108886155

>>108885822
Seriously is there an open source model I with enough billions parameter that’s decent for either 24-32gb gpus?

Anonymous
05/23/26(Sat)02:43:13 No.108886382

Anonymous 05/23/26(Sat)02:43:13 No.108886382

>>108885822
>several minutes
>on a 26b3 moe
maybe you should get a job so you can buy a computer from this decade instead of playing with ai

Helios/TON 618 !!XUajJUbEW2t
05/23/26(Sat)03:23:43 No.108886520

Helios/TON 618 !!XUajJUbEW2t 05/23/26(Sat)03:23:43 No.108886520

>>108885822
"I don't know how to use computers"

Anonymous
05/23/26(Sat)08:48:35 No.108887897

Anonymous 05/23/26(Sat)08:48:35 No.108887897

>>108885992
>>108886382
>"Run AI locally bro, just buy a $5000 card and do it yourself, bro."

Anonymous
05/23/26(Sat)09:28:29 No.108888107

Anonymous 05/23/26(Sat)09:28:29 No.108888107

>>108885822
skill issue

Anonymous
05/23/26(Sat)09:30:50 No.108888125

Anonymous 05/23/26(Sat)09:30:50 No.108888125

File: 1564953104693.jpg (209 KB, 1607x617)

209 KB JPG

does the model fit in vram? is there space left over in vram for context? if not youre doing it wrong

Anonymous
05/23/26(Sat)09:35:47 No.108888148

Anonymous 05/23/26(Sat)09:35:47 No.108888148

>>108887897
Btw I did 0 research and now I'm whining on /g/

Anonymous
05/23/26(Sat)09:41:23 No.108888183

Anonymous 05/23/26(Sat)09:41:23 No.108888183

File: 1748383026443316.jpg (16 KB, 260x282)

16 KB JPG

I have a RX 7900 XTX/64GB DDR5 RAM and the results aren't great on local AI either. Maybe only worth doing local AI with Nvidia GPU specific? Tried with gemma4 and phi4, could be bad models too I dunno fuck about the intricacies of AI, I just wanted to see if it was really better than online ones and it wasn't. Only plus was being able to run uncensored models which is okay I guess? Most prompts don't need to be censored anyway, so who cares.

Anonymous
05/23/26(Sat)09:54:39 No.108888249

Anonymous 05/23/26(Sat)09:54:39 No.108888249

ive found the uncensored prompts to just be for loli and saying nigger, but will refuse racism against jews, it fills the context with <IM END>

Anonymous
05/23/26(Sat)09:59:44 No.108888290

Anonymous 05/23/26(Sat)09:59:44 No.108888290

>>108888148
>listening to /g/'s recommendations doesn't count as research

Anonymous
05/23/26(Sat)10:03:29 No.108888316

Anonymous 05/23/26(Sat)10:03:29 No.108888316

File: GGzQBrnaAAEI5TK.jpg (34 KB, 600x360)

34 KB JPG

>>108885822
>fans spin up like I'm playing Crysis in 2007

Anonymous
05/23/26(Sat)10:51:36 No.108888581

Anonymous 05/23/26(Sat)10:51:36 No.108888581

Bigger the model, the better it is.
Depends on use case, but for me m2.7 at q4 is the minimum I'd ever use locally and that fits into about 180 gb of vram

Anonymous
05/23/26(Sat)12:03:25 No.108889064

Anonymous 05/23/26(Sat)12:03:25 No.108889064

have you tried not having a dog shit GPU?

Anonymous
05/23/26(Sat)12:10:03 No.108889125

Anonymous 05/23/26(Sat)12:10:03 No.108889125

>>108885822
>26b Moe
Lmao
What you even tried to do OP? 24b is fine for RP session or gooning but if you tried to vibrcode your dream game forget it, you need to fit in cards +120b model and still have space for context size

Anonymous
05/23/26(Sat)12:56:43 No.108889421

Anonymous 05/23/26(Sat)12:56:43 No.108889421

>>108885822
You found us out! Clever OP we were trolling you

Anonymous
05/23/26(Sat)18:02:27 No.108891310

Anonymous 05/23/26(Sat)18:02:27 No.108891310

>>108885822
>its response is barely better than grok, nevermind Claude/Gemini/ChatGPT

Getting around 60-70 tps with MoE models like Gemma 4 or Qwen 3.6, using q4 or iq4_xs quants.

Yes, responses are as good as or better than the 'frontier' models in a lot of categories. For high-stakes work, like legal drafting, I'd stick with frontier models. For coding, Gemma 4 or Qwen 3.6 are both very competent even at q4 quants tha will fit in 24 GB vram with full context (q8 quantized kv cache).

The dense models are slower but better quality overall, especially as context length grows.

Anonymous
05/23/26(Sat)18:03:35 No.108891316

Anonymous 05/23/26(Sat)18:03:35 No.108891316

>have shit hardware
>use shit model
>get shit results
wow
>>108887897
A 3060 barely costs 300 bucks, bitch.

Anonymous
05/23/26(Sat)18:05:18 No.108891320

Anonymous 05/23/26(Sat)18:05:18 No.108891320

>>108888183
What model did you use? With that amount of VRAM you should try Qwen 3.6 27B quanted at Q5, or Gemma 4 31B quanted at Q4. Anyways, it won't be better than cloud models, but it'll be yours to do as you please.

Anonymous
05/23/26(Sat)18:07:27 No.108891340

Anonymous 05/23/26(Sat)18:07:27 No.108891340

>>108887897
>"Run AI locally bro, just buy a $5000 card and do it yourself, bro."

You can build a very good system with two 16 GB VRAM cards like 5060. The prices on 3090s is dropping, and one 3090 is 24 GB. They also support nvlink which lets you share VRAM between cards and can further boost performance. Even if you pay the overpriced $1500+ prices for 2 3090s you're still getting a far better deal than a single 48 GB card that will likely cost $5K+

Anonymous
05/23/26(Sat)18:17:36 No.108891400

Anonymous 05/23/26(Sat)18:17:36 No.108891400

local AI is not there yet
trannies are lying to you again