/g/ - OK YOU GOT ME - Technology



File: Screenshot 2026-06-25 at (...).png (915 KB, 1110x934)

OK YOU GOT ME Anonymous 06/28/26(Sun)21:04:56 No.109158099

I need more VRAM. Like, a lot. Don't fucking ask why, but I need local enterprise AI. It doesn't even need to be on the cheap. My department has just been authorized one hell of a budget.
But again, it needs to be local and cannot talk to the internet. What do?

Anonymous 06/28/26(Sun)21:07:02 No.109158105

>>109158099
If it's just LLMs, buy a bunch of Mac minis or Mac Studios with huge RAM and run the models locally on them. If it's something like image gen, then, well, a Mac won't do.

Anonymous 06/28/26(Sun)21:10:23 No.109158125

>>109158099
ask ai

Anonymous 06/28/26(Sun)21:12:11 No.109158132

>>109158125
y'all are better than AI until I get this thing running

Anonymous 06/28/26(Sun)21:14:43 No.109158141

>>109158099
Okay. If money is no object, then buy it.

Anonymous 06/28/26(Sun)21:19:02 No.109158154

File: shiggydiggy.jpg (223 KB, 1729x1426)

>>109158141
I have 1.6M

Anonymous 06/28/26(Sun)21:21:13 No.109158164

>>109158099
sudo systemd-run --quiet --scope -p IPAddressDeny=any -p IPAddressAllow=127.0.0.1 sudo -u ai ./llama-server
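
One way to sanity-check that the sandbox really blocks outbound traffic is to run a small probe under the same wrapper in place of ./llama-server. This is only a sketch: the 1.1.1.1:443 target is an arbitrary public endpoint picked for illustration, and the helper name `can_connect` is made up here.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, unreachable networks, permission
        # errors and timeouts alike.
        return False


if __name__ == "__main__":
    # Assumption: any public address will do as a target; 1.1.1.1:443 is
    # only an example. Inside the sandbox this should print False.
    print("outbound reachable:", can_connect("1.1.1.1", 443))
```

If it prints True when launched under the systemd-run wrapper, the process can still reach the network and the isolation needs another look.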

Anonymous 06/28/26(Sun)21:24:33 No.109158181

>>109158099
To run any of the big boy models you'll need at least an H200 HGX node, which is extremely expensive on its own, but in order to rack it, cable it and power it you'll need a ton of other equipment.
If your company already has a server room with rack space and someone who knows how to hook it up and configure everything then just convince them to buy an HGX node.

Anonymous 06/28/26(Sun)21:24:55 No.109158187

>>109158099
Sounds like you got everything you need. Buy Nvidia TI series cards, the datacenter ones. The DCGLs that work in series.
https://resources.nvidia.com/l/en-us-gpu#referrer=vanity

Get in touch with them, leave overhead in your project for contingency, and enjoy having your own personal sexbot. Get the minimum requirements for DeepL or Minimax

Minimax is the best open weight model but not the best model. It's the one you can run no questions asked.
https://www.aimadetools.com/blog/how-to-run-minimax-m3-locally/
Look at the requirements
Setup                    VRAM needed    Hardware                       Cost
Full model (estimated)   400-800GB      4-8× A100 80GB or 4-8× H100    $30K-80K
Multi-node               Distributed    2+ servers with NVLink         Enterprise
If you have 30K available in your project, get the needed GPUs to fill the minimum or recommended VRAM capacity using A100 series cards or whatever the Nvidia salesman can hook you up with.
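
For a rough card count, divide the VRAM target by the memory per card and round up. The sketch below uses the 400-800GB estimate quoted above as its input; it ignores KV cache, activations and framework overhead, so treat the result as a floor rather than a spec. The function name is made up for illustration.

```python
import math


def gpus_needed(model_vram_gb: float, per_gpu_gb: float) -> int:
    """Smallest number of cards whose combined memory covers the target.

    Assumption: memory pools perfectly across cards and nothing else
    (KV cache, activations, runtime overhead) needs room.
    """
    return math.ceil(model_vram_gb / per_gpu_gb)


if __name__ == "__main__":
    for target_gb in (400, 800):
        for card, card_gb in (("A100/H100 80GB", 80), ("H200 141GB", 141)):
            count = gpus_needed(target_gb, card_gb)
            print(f"{target_gb} GB on {card}: at least {count} cards")
```

Note that eight 80GB cards only add up to 640GB, so the top of that 400-800GB range would not fit on a single 8× 80GB node.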

Good job anon, you hit the fucking mother lode.

Anonymous 06/28/26(Sun)21:26:37 No.109158193

>>109158181
Oh yeah, this is the other thing: make sure you have the power capacity for this, because 4-8 H100s or H200s WILL strain whatever closet you have now if it's not rated for IT loads. H100 and H200 SXM GPUs are each rated for up to 700W. Contact an electrician and ask them to upgrade the service to the room this is going in so it can handle 4-8 of these.
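
A back-of-the-envelope version of that power math, as a sketch only. The 700 W per GPU, the 208 V circuit and the 80% continuous-load margin are all assumptions for illustration; the figures cover the GPUs alone (no CPUs, fans or networking), and the electrician's numbers win over anything here.

```python
def gpu_power_watts(gpu_count: int, watts_per_gpu: float) -> float:
    """Total draw of the GPUs alone, in watts."""
    return gpu_count * watts_per_gpu


def circuit_amps(total_watts: float, volts: float) -> float:
    """Current at the given voltage using plain P = V * I.

    Ignores power factor and three-phase wiring (simplifying assumption).
    """
    return total_watts / volts


def min_breaker_amps(load_amps: float, continuous_fraction: float = 0.8) -> float:
    """Breaker size if a continuous load may only use a fraction of its rating.

    The 0.8 default is an assumed rule of thumb, not a code citation.
    """
    return load_amps / continuous_fraction


if __name__ == "__main__":
    for count in (4, 8):
        watts = gpu_power_watts(count, 700)   # assumed 700 W per GPU
        amps = circuit_amps(watts, 208)       # assumed 208 V circuit
        breaker = min_breaker_amps(amps)
        print(f"{count} GPUs: {watts:.0f} W, {amps:.1f} A at 208 V, "
              f"breaker of at least {breaker:.1f} A")
```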

Anonymous 06/28/26(Sun)21:29:50 No.109158210

>>109158193
Yup this is true. My company needed to have the electrical reconfigured and rewired to accommodate adding some AI compute capacity.
Those fuckers use a LOT of power.

Anonymous 06/28/26(Sun)21:31:28 No.109158221

File: gelbooru.com 14306519 1gi(...).jpg (3.01 MB, 2784x4416)

>>109158099
Pomni owes me sex

Anonymous 06/28/26(Sun)21:31:51 No.109158223

>>109158210
Oh, and lots of power means lots of heat. The room needs to be really well ventilated and the HVAC needs to be extremely robust.

Anonymous 06/28/26(Sun)21:36:36 No.109158242

>>109158193
Thank you, good sir, I am on it. Our firm can more than handle it; it's just that, being a dinosaur corpo technically owned by BlackRock, things... go slow. But we can and will do it (eventually).
Two 8x H200 servers will be on order before the fourth. I'm not an IT guy, but it's not like those guys get paid to sit around.
Again thank you for checking my "market research" box in my fucking docuserv account. We are a go.

Anonymous 06/28/26(Sun)21:36:44 No.109158243

>>109158223
Right, that too. The spec sheet for this equipment lists the hardware's "heat load", usually on the power supply or alongside the power supply specs. Your HVAC capacity needs to be upgraded to match that heat load or the room WILL turn into an oven.

Thankfully, we fixed it at my place of work with mini-split units, which are fairly cheap and can operate in parallel.
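
If the spec sheet only gives watts, the heat load can be estimated by conversion, since essentially all the electrical power ends up as heat in the room. A minimal sketch; the 5600 W input is an assumed example (8 GPUs at 700 W each, GPUs only), and the function names are made up here.

```python
BTU_PER_HR_PER_WATT = 3.412   # 1 W of load is roughly 3.412 BTU/hr of heat
BTU_PER_HR_PER_TON = 12_000   # 1 ton of cooling is 12,000 BTU/hr


def watts_to_btu_per_hr(watts: float) -> float:
    """Heat output in BTU/hr for a given electrical load in watts."""
    return watts * BTU_PER_HR_PER_WATT


def btu_per_hr_to_tons(btu_per_hr: float) -> float:
    """Cooling capacity in tons needed to remove the given heat load."""
    return btu_per_hr / BTU_PER_HR_PER_TON


if __name__ == "__main__":
    load_watts = 5600  # assumed example load, not a measured figure
    btu = watts_to_btu_per_hr(load_watts)
    print(f"{load_watts} W is about {btu:.0f} BTU/hr, "
          f"or {btu_per_hr_to_tons(btu):.2f} tons of cooling")
```

That is cooling for the hardware alone; whatever the room already needs for people, lighting and other equipment comes on top.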

Oh, I almost forgot one other thing: you will also need an additional server to handle database operations and queries to your new cluster. Budget for that too.

Anonymous 06/28/26(Sun)21:37:07 No.109158247

>>109158221
based and Jax pilled

Anonymous 06/28/26(Sun)21:37:46 No.109158252

File: png-transparent-emoji-smi(...).png (80 KB, 920x804)

>>109158242
based, good luck, also remember our consulting fees are cheap

Anonymous 06/28/26(Sun)21:44:35 No.109158295

>>109158252
I mean, the other departments are lucky and get to use public frontier models. We can't, for... reasons.

Anonymous 06/28/26(Sun)22:01:21 No.109158400

Sam, post from your normal 4chan Premium+ account.

Anonymous 06/28/26(Sun)22:25:43 No.109158497

>>109158154
You're not gonna buy shit with that. 1.6m is like the fee Nvidia demands just for a quote.

Anonymous 06/28/26(Sun)22:56:42 No.109158609

>>109158099
No amount of VRAM is going to unabstract him
I'm sorry :(

Anonymous 06/29/26(Mon)00:08:33 No.109158906

>>109158609
>him

Anonymous 06/29/26(Mon)01:10:47 No.109159115

>>109158099
If it's for LMs, then Mac Studios linked together.
