[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: 8543xtby1vvg1.jpg (118 KB, 1203x759)
118 KB JPG
DS V4 leak
>>
>benchmarks
kek
>>
>>108630010
Big benis if true.
>>
>>108630064
How else would you evaluate an llm is better than an other ?
>>
>>108631506
Human surveys of people actually using it. Error benchmarks mean jack shit in the real world.
>>
>>108630010
>numbers
waow line went up
>>
>it'll come out before new years
>oh it's just delayed because of new years
>it'll come out in february
>it'll come out in march
>it'll come out in april

>we will never le accept external funding because we are le amazing hedge fund and will le fund ourselves
>pls gib 10bn usd
>>
Has anyone used Deepseek R or v3 for 'vibe coding'? How does it compare to a model like Claude Sonnet 4.6?
>>
>>108630010
China is unironically desperate if they keep putting off the actual release and just only throwing blatant fake news about it
>>
>>108631506
Could start by not using benchmarks that have all the answers online.
>>
>>108630010
mention taiwan is a country once and the rest of the conversation is no longer usable, no matter how hard you try to shift the topic back.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.