/g/ - memory prices coming back down to normal sooner th - Technology

Anonymous
03/26/26(Thu)14:07:00 No.108460578 Archived

File: tq.png (190 KB, 621x649)

memory prices coming back down to normal sooner than you think

Anonymous
03/26/26(Thu)14:09:44 No.108460591

>>108460578
That's just the cache though. Newer models have already been able to do 1 GB per 10k tokens for cache. If it comes down further, great, but it's not going to change the biggest LLMs requiring 1.5 TB of VRAM to run.

Anonymous
03/26/26(Thu)14:16:03 No.108460618

>>108460591
anon, can you tell the class where cache is being stored and how that might affect memory prices?

Anonymous
03/26/26(Thu)14:20:00 No.108460635

>>108460618
It's a series of tubes

Anonymous
03/26/26(Thu)14:25:21 No.108460661

>>108460578
>memory prices coming back down to normal sooner than you think
https://en.wikipedia.org/w/index.php?title=Jevons_paradox

Anonymous
03/26/26(Thu)14:27:23 No.108460669

>>108460618
On whatever medium happens to be faster than whatever you're originally loading it from. If you're loading it from memory, the cache is on the CPU. If you're loading it from an NVMe, the page cache is in memory. If you're loading it from the network, you could cache it on mass storage.

Anonymous
03/26/26(Thu)14:35:53 No.108460699

>>108460661
It's going to get much worse in the future whenever LLMs get replaced by something better.

Anonymous
03/26/26(Thu)14:41:49 No.108460727

>>108460578
As if that's going to fix anything. Even if memory use was made 100x more efficient, they'd just understand that they can keep their orders in place and use 100x as much bullshit.

Anonymous
03/26/26(Thu)14:41:49 No.108460728

>>108460578
>sooner than you think
less than two weeks?

Anonymous
03/26/26(Thu)15:21:34 No.108460937

>>108460578
quantizing kv cache to 8 bit completely mindrapes the model and makes it hallucinate and go off the rails even worse than they do normally, 4 bit amplifies that even more. There is no way you're getting 3 bit KV quant with "no accuracy loss"

Anonymous
03/26/26(Thu)15:25:24 No.108460960

>>108460591
it's KV cache not CPU/GPU cache, and you know it, you are one of these SK Hynix / Samsung bag holders screeching at your stocks dumping

Anonymous
03/26/26(Thu)15:30:04 No.108460981

File: bad.png (134 KB, 500x462)

>>108460578
what if Jevons paradox applies to this?

Anonymous
03/26/26(Thu)15:38:36 No.108461028

>>108460635
Bags of sand

Anonymous
03/26/26(Thu)15:42:41 No.108461049

>>108461028
Shoop do woop with milk and pennies, my mudkip

Anonymous
03/26/26(Thu)15:49:35 No.108461088

this paper is from q1 of last year and its effects are already in place

Anonymous
03/26/26(Thu)15:49:59 No.108461093

>>108460578
Even if this works
>Implying they won't run more and bigger shit on same hardware
>Implying they'll stop buying up supply which sucks out oxygen from competition
>Implying the plan isn't to bully out personal computing as a concept and force everything to be hardware as a service

Anonymous
03/26/26(Thu)15:55:53 No.108461139

>>108460591
>>108460618
so is this very good news or not???

>>108460981
>Jevons paradox
you know what, 100% a nothingburger. with these greedy techniggers we can only lose.

Anonymous
03/26/26(Thu)16:00:07 No.108461160

>>108461139
It's not.

Anonymous
03/26/26(Thu)16:12:27 No.108461228

>>108460578
does this mean anything for coomer image and video gen

Anonymous
03/26/26(Thu)16:15:14 No.108461246

File: file.png (861 KB, 1557x1376)

i am begging people to stop reading pc gay men consumer blogs for ai news

Anonymous
03/26/26(Thu)16:17:00 No.108461258

>>108461139
>so is this very good news or not???
It's good news, but not something revolutionary for home use. The biggest benefit is when you run concurrent requests that all need their own context. Even at 1 GB per 10k tokens it adds up pretty quick. 10 GB for 100k tokens, but if you have 20 concurrent requests then that's 200 GB.
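
Rough sketch of that arithmetic in Python, assuming the cache grows linearly with context length and every request keeps its own cache (no prefix sharing). The function name is made up for illustration, and the 1 GB per 10k tokens default is just the figure quoted above, not a measured number for any particular model.

```python
# Ballpark figure from the post above; real values depend on the model.
GB_PER_10K_TOKENS = 1.0


def kv_cache_gb(context_tokens: int,
                concurrent_requests: int = 1,
                gb_per_10k_tokens: float = GB_PER_10K_TOKENS) -> float:
    """Estimate total KV cache size in GB.

    Assumes linear scaling in both context length and request count.
    """
    per_request_gb = context_tokens / 10_000 * gb_per_10k_tokens
    return per_request_gb * concurrent_requests


if __name__ == "__main__":
    print(kv_cache_gb(100_000))      # one 100k-token request -> 10.0
    print(kv_cache_gb(100_000, 20))  # twenty at once -> 200.0
```

Pass a smaller gb_per_10k_tokens to see what a more compact cache would buy you at the same concurrency.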

Anonymous
03/26/26(Thu)16:21:20 No.108461275

>>108461258
>Even at 1 GB per 10k tokens it adds up pretty quick. 10 GB for 100k tokens, but if you have 20 concurrent requests then that's 200 GB.
what? does this make any sense? I think he speaks gibberish like a tard and has no idea what's up.

Anonymous
03/26/26(Thu)16:22:59 No.108461281

>>108460578
That's just wrong. It means they will be able to do even more with the memory they already have, and can have.

Anonymous
03/26/26(Thu)17:04:46 No.108461492

>>108460578
Wait... over UNQUANTIZED bits???
What about comparison to the existing quantization?

Anonymous
03/26/26(Thu)17:32:01 No.108461621

>>108460981
Jevons paradox applies to everything that consumes resources.

Anonymous
03/26/26(Thu)18:05:35 No.108461805

>>108460578
>google
>not even 10x
Prices are never coming down.

Anonymous
03/26/26(Thu)18:06:49 No.108461812

>>108460578
okay can I run a 1 trillion qubit ai locally yet?

Anonymous
03/26/26(Thu)18:29:57 No.108461917

>>108461621
>Jevons paradox applies to everything that consumes resources
Even me?
Wait, every time when I get better I consume more.
When my gf gets better penis, she wants more.

This checks out. AI could 10x tomorrow and it would only mean more porn and more slop generation. Humans are a virus.

Anonymous
03/26/26(Thu)18:31:08 No.108461925

>>108460578
>google's
Yeah no, the prices are gonna go up thanks to them.

Anonymous
03/26/26(Thu)19:38:35 No.108462304

>>108460661
this
>thanks to leds outdoor lighting is going to consume much less electricity
>proceeds to install 100x more outdoor lights
>oh for some reason outdoor lighting consumes even more electricity than before