/g/ - Technology

I need more VRAM. Like, a lot. Don't fucking ask why, but I need local enterprise AI. It doesn't even need to be on the cheap. My department has just been authorized one hell of a budget.
But again, it needs to be local and cannot talk to the internet. What do?
>>
>>109158099
If it's just LLMs, buy a bunch of Mac minis or Mac Studios with huge RAM and run the models locally on those. If it's something like image gen, then a Mac won't do.
>>
>>109158099
ask ai
>>
>>109158125
y'all are better than ai until I get this nigger running
>>
>>109158099
Okay. If money is no object, then buy it.
>>
File: shiggydiggy.jpg (223 KB, 1729x1426)
>>109158141
I have 1,6M
>>
>>109158099
sudo systemd-run --quiet --scope -p IPAddressDeny=any -p IPAddressAllow=127.0.0.1 sudo -u ai ./llama-server
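For anyone scripting this, here is a minimal sketch that builds the same command line as an argv list, so the sandbox flags stay in one place. `build_isolated_command` is a made-up helper name, and the `ai` user and `./llama-server` path are simply the ones from the command above. It only constructs the command and does not run it.

```python
import shlex


def build_isolated_command(server_cmd, run_as="ai", allow="127.0.0.1"):
    """Build an argv list that runs server_cmd inside a transient systemd scope.

    The scope denies all IP traffic except the `allow` address (loopback
    by default), then drops to the unprivileged `run_as` user.
    """
    return [
        "sudo", "systemd-run", "--quiet", "--scope",
        "-p", "IPAddressDeny=any",        # block every address by default
        "-p", f"IPAddressAllow={allow}",  # then allow only this one
        "sudo", "-u", run_as, *server_cmd,
    ]


if __name__ == "__main__":
    # Print the command for inspection instead of executing it.
    print(shlex.join(build_isolated_command(["./llama-server"])))
```

Pass the resulting list to `subprocess.run` if you actually want to launch it; check that the unit's IP filtering works on your distro before trusting it as the only barrier.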
>>
>>109158099
To run any of the big boy models you'll need at least an H200 HGX node, which is extremely expensive on its own, but in order to rack it, cable it and power it you'll need a ton of other equipment.
If your company already has a server room with rack space and someone who knows how to hook it up and configure everything then just convince them to buy an HGX node.
>>
>>109158099
Sounds like you got everything you need. Buy Nvidia TI series cards, the datacenter ones. The DCGLs that work in series.
https://resources.nvidia.com/l/en-us-gpu#referrer=vanity

Get in touch with them, leave overhead in your project for contingency, and enjoy having your own personal sexbot. Get the minimum requirements for DeepL or Minimax

Minimax is the best open weight model but not the best model. It's the one you can run no questions asked.
https://www.aimadetools.com/blog/how-to-run-minimax-m3-locally/
Look at the requirements

Setup                  | VRAM needed | Hardware                    | Cost
Full model (estimated) | 400-800GB   | 4-8× A100 80GB or 4-8× H100 | $30K-80K
Multi-node             | Distributed | 2+ servers with NVLink      | Enterprise


If you have 30K available in your project, get the needed GPUs to fill the minimum or recommended VRAM capacity using A100 series cards or whatever the Nvidia salesman can hook you up with.
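As a rough sanity check on those numbers, the card count is just the VRAM target divided by per-card memory, rounded up. A minimal sketch, assuming 80GB cards as in the requirements above; `gpus_needed` is a made-up helper name, and it ignores the extra headroom real deployments need for KV cache and activations.

```python
import math


def gpus_needed(model_vram_gb, per_gpu_vram_gb=80):
    """Smallest number of cards whose combined VRAM covers the target."""
    return math.ceil(model_vram_gb / per_gpu_vram_gb)


if __name__ == "__main__":
    for target_gb in (400, 800):
        print(f"{target_gb}GB -> {gpus_needed(target_gb)} x 80GB cards")
    # 400GB -> 5 x 80GB cards
    # 800GB -> 10 x 80GB cards
```

By this arithmetic 400-800GB works out to 5-10 cards at 80GB each, slightly more than the 4-8 listed, so treat the quoted figures as estimates and size from the actual model you plan to run.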

Good job anon, you hit the fucking mother lode.
>>
>>109158181
Oh yeah, this is the other thing: make sure you have the power capacity for this, because 4-8 H100s or H200s WILL put strain on any closet you currently have if it's not IT rated. The SXM versions of the H100 and H200 are both rated at up to 700W each, and that's before CPUs, fans and networking. Contact an electrician and ask them to upgrade the service to the room this is going in to account for 4-8 of these.
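A back-of-the-envelope version of that power math, as a minimal sketch: the per-card wattage is an input you should take from the spec sheet, the 208V figure is only an example supply voltage, and both function names are made up. It covers the GPUs alone, so the whole server will draw more.

```python
def gpu_power_watts(gpu_count, watts_per_gpu):
    """Total GPU-only draw in watts (excludes CPUs, fans, NICs, PSU losses)."""
    return gpu_count * watts_per_gpu


def circuit_amps(watts, volts):
    """Current in amps for a given load and supply voltage (I = P / V)."""
    return watts / volts


if __name__ == "__main__":
    total = gpu_power_watts(8, 700)  # 8 cards at 700W each
    print(total)                     # 5600
    print(round(circuit_amps(total, 208), 1))  # 26.9
```

Hand the electrician the vendor's full-system power rating, not this number; the sketch is only there to show the order of magnitude.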
>>
>>109158193
Yup this is true. My company needed to have the electrical reconfigured and rewired to accommodate adding some AI compute capacity.
Those fuckers use a LOT of power.
>>
>>109158099
Pomni owes me sex
>>
>>109158210
Oh, and lots of power means lots of heat. The room needs to be really well ventilated and the HVAC needs to be extremely robust.
>>
>>109158193
Thank you, good sir, I am on it. Our firm can more than handle it; it's just that, being a dinosaur corpo technically owned by BlackRock, things... go slow. But we can and will do it (eventually).
Two 8x H200 servers will be on order before the fourth. I'm not an IT guy but it's not like those counts get paid to sit around.
Again, thank you for checking my "market research" box in my fucking docuserv account. We are a go.
>>
>>109158223
Right, that too. This equipment lists the "heat load" of the hardware in its specs. It's usually on the power supply or listed with the power supply specs. Your HVAC capacity needs to be upgraded to match that heat load or the room WILL turn into an oven.

Thankfully, we fixed it at my place of work with Minisplit units, fairly cheap, can operate in parallel.
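If the spec sheet only gives watts, the heat load conversion is simple: essentially all electrical power ends up as heat, 1W is about 3.412 BTU/hr, and one ton of cooling is 12,000 BTU/hr. A minimal sketch with made-up function names; the 5600W input is just an example load, not a measured figure.

```python
BTU_PER_HR_PER_WATT = 3.412142  # standard conversion factor
BTU_PER_HR_PER_TON = 12_000     # one ton of refrigeration


def watts_to_btu_per_hr(watts):
    """Convert electrical load in watts to heat output in BTU/hr."""
    return watts * BTU_PER_HR_PER_WATT


def cooling_tons(watts):
    """Cooling capacity in tons needed to remove the heat from a load."""
    return watts_to_btu_per_hr(watts) / BTU_PER_HR_PER_TON


if __name__ == "__main__":
    load_w = 5600  # example load in watts
    print(round(watts_to_btu_per_hr(load_w)))  # 19108
    print(round(cooling_tons(load_w), 2))      # 1.59
```

Size the minisplits against the full-system wattage plus margin, since this only converts whatever number you feed it.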

Oh I almost forgot one other thing: Also account for the fact you will need an additional server for database operations and queries to your new cluster. Don't forget to account for that.
>>
>>109158221
based and Jax pilled
>>
>>109158242
based, good luck, also remember our consulting fees are cheap
>>
>>109158252
I mean, the other departments are lucky and get to use public frontier models. We are not for.... reasons.
>>
Sam, post from your normal 4chan Premium+ account.
>>
>>109158154
You're not gonna buy shit with that. 1.6m is like the fee Nvidia demands just for a quote.
>>
>>109158099
No amount of VRAM is going to unabstract him
I'm sorry :(
>>
>>109158609
>him


