/g/ - /lmg/ - Local Models General - Technology

[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]

Board

▼ Settings Mobile Home

/g/ - Technology

Return Catalog Bottom Refresh

Thread archived.
You cannot reply anymore.

[Advertise on 4chan]

[Return] [Catalog] [Bottom]

Anonymous

/lmg/ - Local Models General 12/20/24(Fri)11:18:10 No.103586102

File: 1719160454181529.jpg (153 KB, 768x768)

153 KB JPG

/lmg/ - Local Models General Anonymous 12/20/24(Fri)11:18:10 No.103586102 Archived

/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>103575618 & >>103565507

►News
>(12/20) RWKV-7 released: https://hf.co/BlinkDL/rwkv-7-world
>(12/19) Finally, a Replacement for BERT: https://hf.co/blog/modernbert
>(12/18) Bamba-9B, hybrid model trained by IBM, Princeton, CMU, and UIUC on completely open data: https://hf.co/blog/bamba
>(12/18) Apollo unreleased: https://github.com/Apollo-LMMs/Apollo
>(12/18) Granite 3.1 released: https://hf.co/ibm-granite/granite-3.1-8b-instruct/tree/main

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/tldrhowtoquant

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/leaderboard.html
Code Editing: https://aider.chat/docs/leaderboards
Context Length: https://github.com/hsiehjackson/RULER
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm

Anonymous
12/20/24(Fri)11:18:50 No.103586113

Anonymous 12/20/24(Fri)11:18:50 No.103586113

File: 6432347254.png (83 KB, 296x256)

83 KB PNG

►Recent Highlights from the Previous Thread: >>103575618

--Papers:
>103583427 >103583492 >103583550
--Llama 4 and the future of AI development:
>103579890 >103579911 >103579974 >103580138 >103580155 >103580159 >103579969 >103580010 >103581856 >103581887 >103581923 >103582015 >103582896 >103582958
--Discussion on Qwen QVQ-72B-Preview, Gemini 2.0 Flash Thinking, and the AI landscape:
>103580179 >103580195 >103580355 >103580371 >103580391 >103580402 >103580497 >103580552 >103580588 >103580717 >103580786 >103580855 >103580876 >103580944
--Language models struggle with factual data and pop culture trivia:
>103578350 >103578364 >103578383 >103578469 >103578519 >103578716 >103578821 >103584539 >103584719 >103584884 >103585311 >103585341 >103585446 >103585363
--Anons react to Nvidia's pricey GeForce RTX 5090 and RTX 5080 GPUs:
>103576931 >103577065 >103577557 >103577737 >103579888 >103580044 >103580057 >103582238 >103582290
--Models' performance on robot control problem:
>103581833 >103582600 >103582699 >103582743 >103582835 >103582776 >103583357 >103583561 >103583598 >103584525
--Deepseek performance and capabilities discussion:
>103580975 >103581213 >103581235 >103581642
--Anon struggles with overfitting in fine-tuning model on philosophical texts:
>103577773 >103582577 >103577892
--RWKV release and upcoming models:
>103584488 >103584504 >103584637 >103584667
--ggerganov removes context extension feature and adds OuteTTS support:
>103575876 >103579636
--MetaMorph: Multimodal understanding and generation via instruction tuning:
>103584875 >103585247 >103585279
--Anon discusses frontend issue with token truncation and context shifting:
>103575718 >103575776
--ModernBERT, a Replacement for BERT:
>103576299 >103576436
--Gemini models outperform others in AI model comparison:
>103579662 >103581027
--Rin (free space):
>103581038 >103581670

►Recent Highlight Posts from the Previous Thread: >>103575625

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script

Anonymous
12/20/24(Fri)11:24:54 No.103586169

Anonymous 12/20/24(Fri)11:24:54 No.103586169

So we're just not going to get any more new models? The past few months felt like a whole bunch of absolutely nothing.

Anonymous
12/20/24(Fri)11:27:24 No.103586192

Anonymous 12/20/24(Fri)11:27:24 No.103586192

>>103586169
have you used every model yet? no? ok then you still have new models to try

Anonymous
12/20/24(Fri)11:27:49 No.103586198

Anonymous 12/20/24(Fri)11:27:49 No.103586198

>>103586187
downgrades

Anonymous
12/20/24(Fri)11:27:55 No.103586200

Anonymous 12/20/24(Fri)11:27:55 No.103586200

>>103586102
Can this run unquanted 405B?

Anonymous
12/20/24(Fri)11:28:00 No.103586201

Anonymous 12/20/24(Fri)11:28:00 No.103586201

>>103586169
QwQ and Qwen 32B is a good model for coders.

Anonymous
12/20/24(Fri)11:28:21 No.103586204

Anonymous 12/20/24(Fri)11:28:21 No.103586204

>>103586169
3.3 was big. Try it with a half decent system prompt. Smarter and writes so much better than old llama did.

Anonymous
12/20/24(Fri)11:28:23 No.103586206

Anonymous 12/20/24(Fri)11:28:23 No.103586206

>>103586169
They're training llama 4 right now thoughbeit

Anonymous
12/20/24(Fri)11:29:01 No.103586208

Anonymous 12/20/24(Fri)11:29:01 No.103586208

>>103586169
Deepseek, Hunyuan Large

Anonymous
12/20/24(Fri)11:31:13 No.103586228

Anonymous 12/20/24(Fri)11:31:13 No.103586228

>>103586169
Infact try out 3.3 alliterated. It dosent get retarded like meme tunes make it but gets filthy as fuck.
https://huggingface.co/huihui-ai/Llama-3.3-70B-Instruct-abliterated

Anonymous
12/20/24(Fri)11:31:46 No.103586231

Anonymous 12/20/24(Fri)11:31:46 No.103586231

>>103586208
>Hunyuan Large
still no lcpp support...

Anonymous
12/20/24(Fri)11:32:12 No.103586236

Anonymous 12/20/24(Fri)11:32:12 No.103586236

>>103586169
Tulu and Nemotron prove that we likely now have the open source datasets to do an assistant fine tune roughly on par with the official Llama tuning.

Anonymous
12/20/24(Fri)11:32:29 No.103586238

Anonymous 12/20/24(Fri)11:32:29 No.103586238

Tested new Kobo, works much better than previous version, but is still a bit slower than llama.cpp.
Again, Kobo, your defaults suck, let me pick draft min, max and context REEEEEEEEE

Anonymous
12/20/24(Fri)11:34:07 No.103586252

Anonymous 12/20/24(Fri)11:34:07 No.103586252

>>103586231
Just use vLLM bro.

Anonymous
12/20/24(Fri)11:34:42 No.103586260

Anonymous 12/20/24(Fri)11:34:42 No.103586260

So did deepsneed ever release that r1 model? What happened with that?

Anonymous
12/20/24(Fri)11:36:07 No.103586272

Anonymous 12/20/24(Fri)11:36:07 No.103586272

>>103586260
It's their Yi-large. Turned out to be too good to release open source.

Anonymous
12/20/24(Fri)11:36:11 No.103586273

Anonymous 12/20/24(Fri)11:36:11 No.103586273

Nous is made by trannies.

Anonymous
12/20/24(Fri)11:37:21 No.103586285

Anonymous 12/20/24(Fri)11:37:21 No.103586285

File: 836QA.jpg (34 KB, 1080x488)

34 KB JPG

>>103586169
you will but from sam when he btfo local in 1 hour and 23 minutes

Anonymous
12/20/24(Fri)11:39:16 No.103586306

Anonymous 12/20/24(Fri)11:39:16 No.103586306

>>103586285
>implying they'll release it and not just show you some benchmarks

Anonymous
12/20/24(Fri)11:41:53 No.103586341

Anonymous 12/20/24(Fri)11:41:53 No.103586341

>>103586285
GPT4-ooo

Anonymous
12/20/24(Fri)11:42:28 No.103586346

Anonymous 12/20/24(Fri)11:42:28 No.103586346

>>103586273
Nous did not improve since llama2 days. They are still using old GPTslop dataset(with refusals still in it!) and expect people to like them for it. Calling them trannies is unfair though, they were okay in the past, let's just say that they are grifters who keep recycling old shit.

Anonymous
12/20/24(Fri)11:44:36 No.103586366

Anonymous 12/20/24(Fri)11:44:36 No.103586366

File: 21522 - SoyBooru.png (46 KB, 457x694)

46 KB PNG

'berry 'll 'fo 'cal

Anonymous
12/20/24(Fri)11:46:42 No.103586386

Anonymous 12/20/24(Fri)11:46:42 No.103586386

>>103586376
*$2000 tier. You aren't poor, right?

Anonymous
12/20/24(Fri)11:49:57 No.103586417

Anonymous 12/20/24(Fri)11:49:57 No.103586417

>>103586346
There is a reason that they not only hide their faces from the public, but post so many girl drawings on their site. The only thing they have going for them is the stolen art style.

Anonymous
12/20/24(Fri)11:53:44 No.103586450

Anonymous 12/20/24(Fri)11:53:44 No.103586450

File: file.png (1.1 MB, 949x948)

1.1 MB PNG

>>103586417
>post so many girl drawings on their site
?

Anonymous
12/20/24(Fri)11:59:23 No.103586501

Anonymous 12/20/24(Fri)11:59:23 No.103586501

>llama.cpp receives qwen2-VL support
>"Koboldcpp v1.80 released with Qwen2-VL support!"
kek why are the koboldshitters so desperate to promote their glorified fork

Anonymous
12/20/24(Fri)12:00:24 No.103586511

Anonymous 12/20/24(Fri)12:00:24 No.103586511

File: 1620298405912.png (467 KB, 425x948)

467 KB PNG

You called?

Anonymous
12/20/24(Fri)12:01:48 No.103586527

Anonymous 12/20/24(Fri)12:01:48 No.103586527

oh oh oh mistress

Anonymous
12/20/24(Fri)12:03:43 No.103586549

Anonymous 12/20/24(Fri)12:03:43 No.103586549

>>103586501
Hi ooba.

Anonymous
12/20/24(Fri)12:07:45 No.103586586

Anonymous 12/20/24(Fri)12:07:45 No.103586586

>>103586549
Obsessed

Anonymous
12/20/24(Fri)12:09:29 No.103586605

Anonymous 12/20/24(Fri)12:09:29 No.103586605

>>103586586
ouch, he nailed it huh?

Anonymous
12/20/24(Fri)12:16:59 No.103586660

Anonymous 12/20/24(Fri)12:16:59 No.103586660

File: 1734288309540744.jpg (486 KB, 1464x1596)

486 KB JPG

Anonymous
12/20/24(Fri)12:19:12 No.103586677

Anonymous 12/20/24(Fri)12:19:12 No.103586677

>>103586238
WHAT THE ACTUAL FUCK?! Why does draft model want to use the same context length as base model in kobo? Why not the context length defined in draft model? No wonder this shit is slower than llama. KOBOOOOOOOO! FIX IT! Or add all the options so I can do it myself. I WILL REDEEM IT!

Anonymous
12/20/24(Fri)12:23:11 No.103586716

Anonymous 12/20/24(Fri)12:23:11 No.103586716

File: 1730294768568525.png (615 KB, 1032x572)

615 KB PNG

True >>103586464 why there isn't a single container as a flatpak or something that has toggles for everything. For example if you would want to use this TTS on that type of chat style with this other AI to search internet etc. and it would just work.

Anonymous
12/20/24(Fri)12:23:56 No.103586723

Anonymous 12/20/24(Fri)12:23:56 No.103586723

>>103586677
>Why does draft model want to use the same context length as base model in kobo? Why not the context length defined in draft model?
Wait, it actually does? Koboldcucks, our response?

Anonymous
12/20/24(Fri)12:25:34 No.103586741

Anonymous 12/20/24(Fri)12:25:34 No.103586741

>>103586723
I became an Aphroditechad.

Anonymous
12/20/24(Fri)12:29:07 No.103586778

Anonymous 12/20/24(Fri)12:29:07 No.103586778

>>103586716
don't make me read that wall of brainlet seethe again

Anonymous
12/20/24(Fri)12:36:09 No.103586845

Anonymous 12/20/24(Fri)12:36:09 No.103586845

>>103586450
why are we suddenly discussing them? did they release something new?

Anonymous
12/20/24(Fri)12:40:08 No.103586898

Anonymous 12/20/24(Fri)12:40:08 No.103586898

>>103585262
You can't really develop anything serious on models that will quasi-randomly respond "I cannot answer that request." or that will give out a moral tirade and propose doing something completely different than what requested in the prompt.

Some people (including quite a few retards in the industry) will say "just finetune it bro", but it's not always feasible without unnecessary expenses, doesn't have any guarantee of maintaining original model performance.

Anonymous
12/20/24(Fri)12:40:16 No.103586902

Anonymous 12/20/24(Fri)12:40:16 No.103586902

File: 1729966418724293.png (821 KB, 848x1024)

821 KB PNG

>>103586845
Because the new trolls have found something to stir the shit with since people aren't biting too much on the usual shit they throw out

>>103586716
I'm literally building this and will refuse to release a single binary executable just to highlight skill issues such as this.

Anonymous
12/20/24(Fri)12:40:47 No.103586913

Anonymous 12/20/24(Fri)12:40:47 No.103586913

>>103586845
dunno
>>103586273

Anonymous
12/20/24(Fri)12:41:36 No.103586927

Anonymous 12/20/24(Fri)12:41:36 No.103586927

>>103586902
>I'm literally building this and will refuse to release a single binary executable just to highlight skill issues such as this.
Yes, of course you are and not just being butthurt. Cry more.

Anonymous
12/20/24(Fri)12:46:38 No.103586998

Anonymous 12/20/24(Fri)12:46:38 No.103586998

>>103586902
>I'm literally building this and will refuse to release a single binary executable just to highlight skill issues such as this.
I will create a fork with no other change besides a github actions workflow to build a binary executable to make it more accessible just to spite your gatekeeping ass, though we both know you're a larping nocoder

Anonymous
12/20/24(Fri)12:46:48 No.103587002

Anonymous 12/20/24(Fri)12:46:48 No.103587002

Judging by lmsys, deepseek really tried to up the personality of their new model. It's a real shame that it's a fuckhuge MoE that's out of range for most people.

Anonymous
12/20/24(Fri)12:49:30 No.103587039

Anonymous 12/20/24(Fri)12:49:30 No.103587039

>>103587002
>lmsys
192GB ram is much cheaper than a multi gpu setup

Anonymous
12/20/24(Fri)12:50:45 No.103587062

Anonymous 12/20/24(Fri)12:50:45 No.103587062

Really wish there was a quality mathematics encyclopedia and proofs model. They all kinda suck tbqh.

Anonymous
12/20/24(Fri)12:59:25 No.103587171

Anonymous 12/20/24(Fri)12:59:25 No.103587171

sam altman likes big benchmaxxed chatbots

Anonymous
12/20/24(Fri)13:00:21 No.103587185

Anonymous 12/20/24(Fri)13:00:21 No.103587185

>>103587002
Which one?

Anonymous
12/20/24(Fri)13:02:07 No.103587204

Anonymous 12/20/24(Fri)13:02:07 No.103587204

o3 mini and o3 confirmed holy shit openai won

Anonymous
12/20/24(Fri)13:03:13 No.103587216

Anonymous 12/20/24(Fri)13:03:13 No.103587216

>>103587204
links

Anonymous
12/20/24(Fri)13:03:41 No.103587221

Anonymous 12/20/24(Fri)13:03:41 No.103587221

>>103587204
wat hapened to o2

Anonymous
12/20/24(Fri)13:04:33 No.103587235

Anonymous 12/20/24(Fri)13:04:33 No.103587235

87.7 GPQA he won

Anonymous
12/20/24(Fri)13:07:33 No.103587269

Anonymous 12/20/24(Fri)13:07:33 No.103587269

I actually feel bad for OpenAI
>be the first to experiment with shit and prove that it works
>because it works, everybody else copies you and puts you out of business

Anonymous
12/20/24(Fri)13:09:02 No.103587288

Anonymous 12/20/24(Fri)13:09:02 No.103587288

>>103587221
It turned out to be "uh oh...".

Anonymous
12/20/24(Fri)13:10:59 No.103587304

Anonymous 12/20/24(Fri)13:10:59 No.103587304

>>103587269
If they were still open, none of this would be a problem in the first place.

Anonymous
12/20/24(Fri)13:12:37 No.103587323

Anonymous 12/20/24(Fri)13:12:37 No.103587323

File: 10anfw9mm18e1.png (47 KB, 1412x707)

47 KB PNG

Anonymous
12/20/24(Fri)13:12:44 No.103587324

Anonymous 12/20/24(Fri)13:12:44 No.103587324

>>103586366
o3 'berry 'on

Anonymous
12/20/24(Fri)13:13:59 No.103587335

Anonymous 12/20/24(Fri)13:13:59 No.103587335

>>103587328
>OpenAI
Do they run as closed as possible because they think its funny? Are they self-aware at all?

Anonymous
12/20/24(Fri)13:16:11 No.103587361

Anonymous 12/20/24(Fri)13:16:11 No.103587361

>>103586169
What do you mean? We got many new open source models in the 7b to 10b range!

Anonymous
12/20/24(Fri)13:16:28 No.103587363

Anonymous 12/20/24(Fri)13:16:28 No.103587363

File: 1733182381315441.jpg (121 KB, 600x600)

121 KB JPG

>>103586927
>>103586998

Anonymous
12/20/24(Fri)13:18:48 No.103587384

Anonymous 12/20/24(Fri)13:18:48 No.103587384

>>103587323
Fuck I can't believe OAI won, help me come up with some new FUD sistas

Anonymous
12/20/24(Fri)13:19:24 No.103587394

Anonymous 12/20/24(Fri)13:19:24 No.103587394

>>103586285
I have access to o3
it's a specialized model, not one that's just o1 but better

Anonymous
12/20/24(Fri)13:19:39 No.103587396

Anonymous 12/20/24(Fri)13:19:39 No.103587396

>>103587335
They are 100% open about their benchmark scores and (usually) allow the public to access the results of their research through their API (for a modest subscription fee)

Anonymous
12/20/24(Fri)13:20:02 No.103587401

Anonymous 12/20/24(Fri)13:20:02 No.103587401

>>103586306
called it

Anonymous
12/20/24(Fri)13:20:52 No.103587413

Anonymous 12/20/24(Fri)13:20:52 No.103587413

File: uarsvwbln18e1.png (66 KB, 1822x971)

66 KB PNG

>>103587384
Superhuman performance on ARC-AGI too.

Anonymous
12/20/24(Fri)13:21:00 No.103587414

Anonymous 12/20/24(Fri)13:21:00 No.103587414

File: 24241.png (196 KB, 1870x1080)

196 KB PNG

>>103587384
AGI is close

Anonymous
12/20/24(Fri)13:21:02 No.103587416

Anonymous 12/20/24(Fri)13:21:02 No.103587416

>>103587328
>CoT papers released 2 years ago
>google, sitting on a mountain of TPUs, did nothing
>o1 comes out
>suddenly they have gemini thonking
I actually have no idea what the "researchers" at the big labs are doing with their hardware. The last Meta guy was given 40 million H100 hours to basically prove ah yes more betterer data make better models

Anonymous
12/20/24(Fri)13:21:50 No.103587425

Anonymous 12/20/24(Fri)13:21:50 No.103587425

>no o3 access for paypigs
>need to go through additional humiliation ritual
Cloud, never ever.

Anonymous
12/20/24(Fri)13:23:06 No.103587431

Anonymous 12/20/24(Fri)13:23:06 No.103587431

>>103587416
>I actually have no idea what the "researchers" at the big labs are doing with their hardware
Almost exclusively alignment and safety research, sadly

Anonymous
12/20/24(Fri)13:23:27 No.103587434

Anonymous 12/20/24(Fri)13:23:27 No.103587434

>>103587323
Damn... But even if this is true, I won't be using anything short of AGI if it still costs 60$/M, like o1.

Anonymous
12/20/24(Fri)13:24:45 No.103587450

Anonymous 12/20/24(Fri)13:24:45 No.103587450

>>103587323
is there a stream link?

Anonymous
12/20/24(Fri)13:25:25 No.103587454

Anonymous 12/20/24(Fri)13:25:25 No.103587454

>>103587413
Does this mean they won the arc prize?

Anonymous
12/20/24(Fri)13:26:21 No.103587466

Anonymous 12/20/24(Fri)13:26:21 No.103587466

>>103587413
https://xcancel.com/fchollet/status/1870169764762710376
>It scores 75.7% on the semi-private eval in low-compute mode (for $20 per task in compute ) and 87.5% in high-compute mode (thousands of $ per task)
uhh paypiggie bros...?

Anonymous
12/20/24(Fri)13:26:24 No.103587467

Anonymous 12/20/24(Fri)13:26:24 No.103587467

Better question: can o3 ERP without slop? If it can't, it's worthless.

Anonymous
12/20/24(Fri)13:26:39 No.103587469

Anonymous 12/20/24(Fri)13:26:39 No.103587469

>>103587414
what the fuck to do you mean with AGI, Absurdly Generated Ideas? llm cannot agi.

Anonymous
12/20/24(Fri)13:27:06 No.103587471

Anonymous 12/20/24(Fri)13:27:06 No.103587471

>>103587454
only if they open-source it.

Anonymous
12/20/24(Fri)13:28:26 No.103587489

Anonymous 12/20/24(Fri)13:28:26 No.103587489

>>103587469
>llm cannot agi.
stop coping, LeCunn.
all you need is attention (and billions of xor gates)

Anonymous
12/20/24(Fri)13:29:29 No.103587505

Anonymous 12/20/24(Fri)13:29:29 No.103587505

>>103587454
>No, the ARC Prize competition targets the fully private set (a different, somewhat harder eval) and takes place on Kaggle, where your solutions must run within a fixed amount of compute (about $0.10 per task). We are committed to keep running the competition until someone submits (and open-sources) a solution that crosses the 85% threshold.

Anonymous
12/20/24(Fri)13:31:49 No.103587525

Anonymous 12/20/24(Fri)13:31:49 No.103587525

>>103587489
>xor
nand

Anonymous
12/20/24(Fri)13:33:45 No.103587544

Anonymous 12/20/24(Fri)13:33:45 No.103587544

File: strawberry-sam_altman.gif (307 KB, 275x400)

307 KB GIF

insider here. o3 just made this animation of happy sama jumping. if you don't believe it's agi, or even asi, you must be blind. only superhuman intelligence could produce such realistic imagery.

Anonymous
12/20/24(Fri)13:36:10 No.103587571

Anonymous 12/20/24(Fri)13:36:10 No.103587571

>>103587467
Slop is irrelevant. The only thing that matters is that a model knows that it can't talk with a dick in its mouth and Tulu 3 already solved that.

Anonymous
12/20/24(Fri)13:36:54 No.103587580

Anonymous 12/20/24(Fri)13:36:54 No.103587580

>oai derangement syndrome

Anonymous
12/20/24(Fri)13:37:46 No.103587595

Anonymous 12/20/24(Fri)13:37:46 No.103587595

File: 4e1.jpg (104 KB, 3088x1440)

104 KB JPG

Thank you samta claus

Anonymous
12/20/24(Fri)13:39:22 No.103587606

Anonymous 12/20/24(Fri)13:39:22 No.103587606

>>103587595
>Thank you for creating a product I can pay for
>Consoooom
Fucking npc monkey

Anonymous
12/20/24(Fri)13:40:09 No.103587613

Anonymous 12/20/24(Fri)13:40:09 No.103587613

Sam did it

Anonymous
12/20/24(Fri)13:40:42 No.103587618

Anonymous 12/20/24(Fri)13:40:42 No.103587618

>>103587185
2.5-1210

Anonymous
12/20/24(Fri)13:41:37 No.103587635

Anonymous 12/20/24(Fri)13:41:37 No.103587635

>>103587618
>2.5-1210
I can confirm that 1210 is a nice upgrade for anyone that can run it

Anonymous
12/20/24(Fri)13:41:43 No.103587637

Anonymous 12/20/24(Fri)13:41:43 No.103587637

Thirdies can't understand the "spend money to make money" concept

Anonymous
12/20/24(Fri)13:42:34 No.103587648

Anonymous 12/20/24(Fri)13:42:34 No.103587648

File: file.jpg (281 KB, 1200x900)

281 KB JPG

>>103587466
>thousands of $ per task

Merry Christmas NVIDIA!

Anonymous
12/20/24(Fri)13:44:36 No.103587669

Anonymous 12/20/24(Fri)13:44:36 No.103587669

>>103587413
Bro just a normal smart person can reach like 100 on that thing. Did you even go look at the details of what that benchmark?

Anonymous
12/20/24(Fri)13:45:48 No.103587687

Anonymous 12/20/24(Fri)13:45:48 No.103587687

>>103587669
*of what that benchmark does?

Anonymous
12/20/24(Fri)13:47:04 No.103587702

Anonymous 12/20/24(Fri)13:47:04 No.103587702

>>103587637
0.1 OpenAI™ credits have been deposited in your account.

Anonymous
12/20/24(Fri)13:47:27 No.103587716

Anonymous 12/20/24(Fri)13:47:27 No.103587716

>>103587595
>Thank you for creating a product I can pay for
>Consoooom
I'm thanking them for hopefully driving the chinese into a froth, compelling them to give us competing free models.
The longer oai can keep the hype train going, the longer free models will be needed to stay relevant vs. their massive public mindshare

Anonymous
12/20/24(Fri)13:55:26 No.103587814

Anonymous 12/20/24(Fri)13:55:26 No.103587814

>>103587466
uh, agi?

Anonymous
12/20/24(Fri)13:56:22 No.103587829

Anonymous 12/20/24(Fri)13:56:22 No.103587829

Wonder how well the new B580's can do AI

Anonymous
12/20/24(Fri)13:58:17 No.103587853

Anonymous 12/20/24(Fri)13:58:17 No.103587853

>>103586511
jesus, i finally got this

Anonymous
12/20/24(Fri)14:04:06 No.103587917

Anonymous 12/20/24(Fri)14:04:06 No.103587917

File: 1709207064583257.png (218 KB, 2191x1603)

218 KB PNG

>>103587323
>2727 elo
holy shit this is a big deal

Anonymous
12/20/24(Fri)14:05:16 No.103587929

Anonymous 12/20/24(Fri)14:05:16 No.103587929

File: file.png (63 KB, 1200x675)

63 KB PNG

GUYS THEY FUCKING SOLVED ARC
IT'S JUST AN EFFICIENCY QUESTION

Anonymous
12/20/24(Fri)14:05:43 No.103587941

Anonymous 12/20/24(Fri)14:05:43 No.103587941

>>103587450
>is there a stream link?
https://www.youtube.com/watch?v=SKBG1sqdyIU

Anonymous
12/20/24(Fri)14:12:10 No.103588002

Anonymous 12/20/24(Fri)14:12:10 No.103588002

>>103587929
Local models general, faggot retard

Anonymous
12/20/24(Fri)14:12:28 No.103588006

Anonymous 12/20/24(Fri)14:12:28 No.103588006

>>103587489
I dunno man, I cannot see it when models are looking at static data and statistically approximating whatever response they "think" is right based on their parameters. we created something really fucking cool and I love it for what it is. But it's just model, like how a calculator is a mathematical/computational model. it's a really fucking cool tool, but that's it.

Anonymous
12/20/24(Fri)14:12:42 No.103588010

Anonymous 12/20/24(Fri)14:12:42 No.103588010

>>103587323
Sam said on his stream that they got those scores before the "safety testing", they're gonna be way worse after the lobotomy, nothingburger

Anonymous
12/20/24(Fri)14:12:45 No.103588013

Anonymous 12/20/24(Fri)14:12:45 No.103588013

>>103588002
You don't understand anon, it's here

Anonymous
12/20/24(Fri)14:13:27 No.103588019

Anonymous 12/20/24(Fri)14:13:27 No.103588019

ANTHROPIC OPEN SOURCE RELEASE
OH SHIT
https://www.anthropic.com/news/model-context-protocol

Anonymous
12/20/24(Fri)14:13:59 No.103588024

Anonymous 12/20/24(Fri)14:13:59 No.103588024

>>103587323
>have made a groundbreaking model
>doesn't call it gpt5
something smells fishy there

Anonymous
12/20/24(Fri)14:14:25 No.103588026

Anonymous 12/20/24(Fri)14:14:25 No.103588026

>make improvement
>lobotomize it
>end result with no improvement

every fucking time

Anonymous
12/20/24(Fri)14:14:50 No.103588035

Anonymous 12/20/24(Fri)14:14:50 No.103588035

>>103588006
Fucking this.
Token generation is but one part of the human brain. We're missing several "services" working in tandem to intelligently parse and return data in real time.

Anonymous
12/20/24(Fri)14:14:53 No.103588036

Anonymous 12/20/24(Fri)14:14:53 No.103588036

>>103588019
>Nov 25, 2024
You are a nigger

Anonymous
12/20/24(Fri)14:15:12 No.103588045

Anonymous 12/20/24(Fri)14:15:12 No.103588045

>>103588013
I
Don't
Care

Anonymous
12/20/24(Fri)14:15:40 No.103588053

Anonymous 12/20/24(Fri)14:15:40 No.103588053

What gives tourists the idea that this is the openai shitposting general? Go talk about it in /aicg/ or make a thread for it on /g/. Fuck off.

Anonymous
12/20/24(Fri)14:16:31 No.103588065

Anonymous 12/20/24(Fri)14:16:31 No.103588065

>>103588053
They come here, because this place is the only one with people who actually know what they're talking about.

Anonymous
12/20/24(Fri)14:17:04 No.103588075

Anonymous 12/20/24(Fri)14:17:04 No.103588075

>>103588061
That's an EFFICIENCY question, do you understand now?

Anonymous
12/20/24(Fri)14:18:04 No.103588097

Anonymous 12/20/24(Fri)14:18:04 No.103588097

>>103588065
>this place is the only one with people who actually know what they're talking about
this

Anonymous
12/20/24(Fri)14:19:18 No.103588114

Anonymous 12/20/24(Fri)14:19:18 No.103588114

File: DI1cJCq.jpg (98 KB, 679x377)

98 KB JPG

>>103588045

Anonymous
12/20/24(Fri)14:19:29 No.103588118

Anonymous 12/20/24(Fri)14:19:29 No.103588118

>>103588053
because you're a tourist yourself
and you don't know that local models are trained on synthetic data from models like o1 or claude 3.5
and that's why a lot of models are slop shit - all of them trained on gptslop or claudeslop

Anonymous
12/20/24(Fri)14:20:23 No.103588135

Anonymous 12/20/24(Fri)14:20:23 No.103588135

File: OpenAI_employee.png (136 KB, 1080x1162)

136 KB PNG

i can feel the AGI coming

Anonymous
12/20/24(Fri)14:20:36 No.103588139

Anonymous 12/20/24(Fri)14:20:36 No.103588139

>>103588053
See: >>103588114

Anonymous
12/20/24(Fri)14:21:16 No.103588147

Anonymous 12/20/24(Fri)14:21:16 No.103588147

>>103588002
This is relevant information as it gives us a sneak peak on what techniques local model will copy over the next months.

Anonymous
12/20/24(Fri)14:21:42 No.103588151

Anonymous 12/20/24(Fri)14:21:42 No.103588151

>>103588135
Shill

Anonymous
12/20/24(Fri)14:22:12 No.103588158

Anonymous 12/20/24(Fri)14:22:12 No.103588158

File: arc-agi-sota-combined-dec-20.jpg (112 KB, 1200x675)

112 KB JPG

Anonymous
12/20/24(Fri)14:23:46 No.103588182

Anonymous 12/20/24(Fri)14:23:46 No.103588182

So what even IS AGI?
Like will it be "self-aware" enough to be able to make self-improvements without user input?

Anonymous
12/20/24(Fri)14:23:57 No.103588184

Anonymous 12/20/24(Fri)14:23:57 No.103588184

>>103588158
Stop spamming the thread

Anonymous
12/20/24(Fri)14:24:27 No.103588191

Anonymous 12/20/24(Fri)14:24:27 No.103588191

File: 1727566286485183.gif (3.33 MB, 260x647)

3.33 MB GIF

How retarded would it be to buy the 5090 just for AI when it comes out?
My 4070 can kinda sorta run a 70B q_3_k_m but it's slow as hell.

Anonymous
12/20/24(Fri)14:24:29 No.103588192

Anonymous 12/20/24(Fri)14:24:29 No.103588192

>>103588182
Yes

Anonymous
12/20/24(Fri)14:24:59 No.103588196

Anonymous 12/20/24(Fri)14:24:59 No.103588196

>>103588184
?
I just got here schizo

Anonymous
12/20/24(Fri)14:25:45 No.103588211

Anonymous 12/20/24(Fri)14:25:45 No.103588211

>>103588053
I don't think the broccoli heads and pajeets at /aicg/ have even the most basic understanding of machine learning and LLMs, let alone the capabilities and implications of these latest models

Anonymous
12/20/24(Fri)14:25:46 No.103588212

Anonymous 12/20/24(Fri)14:25:46 No.103588212

>>103588196
READ THE THREAD NIGGA, READ

Anonymous
12/20/24(Fri)14:26:34 No.103588222

Anonymous 12/20/24(Fri)14:26:34 No.103588222

>>103588212
No one posted that pic before, are u drugged or something

Anonymous
12/20/24(Fri)14:26:41 No.103588224

Anonymous 12/20/24(Fri)14:26:41 No.103588224

>>103587323
Can we use it? I'm really interested in how good it actually is.

Anonymous
12/20/24(Fri)14:26:58 No.103588227

Anonymous 12/20/24(Fri)14:26:58 No.103588227

>>103588211
>I don't think the broccoli heads and pajeets at /aicg/
>implying they don't also make up the majority of posters here

Anonymous
12/20/24(Fri)14:27:02 No.103588228

Anonymous 12/20/24(Fri)14:27:02 No.103588228

>>103587466
>for $20 per task in compute
They just pay a Indian to do it behind the scenes.

Anonymous
12/20/24(Fri)14:27:22 No.103588232

Anonymous 12/20/24(Fri)14:27:22 No.103588232

>>103588222
Anon, what is the name of this general?

Anonymous
12/20/24(Fri)14:28:01 No.103588242

Anonymous 12/20/24(Fri)14:28:01 No.103588242

>>103588232
/aicg/ - AI Chatbot Genenal

Anonymous
12/20/24(Fri)14:28:23 No.103588244

Anonymous 12/20/24(Fri)14:28:23 No.103588244

>>103588182
AGI, for me, is a term for a process that can iteratively improve itself through either external and internal stimuli.
You should be able to give it an assignment and it must be able to come to a conclusion why it can or can't do it.

Anonymous
12/20/24(Fri)14:29:09 No.103588254

Anonymous 12/20/24(Fri)14:29:09 No.103588254

>>103588232
>>103588114

Anonymous
12/20/24(Fri)14:29:19 No.103588258

Anonymous 12/20/24(Fri)14:29:19 No.103588258

So is there a chance Sam could release o3 locally for Christmas?

Anonymous
12/20/24(Fri)14:29:25 No.103588260

Anonymous 12/20/24(Fri)14:29:25 No.103588260

File: 1686065477739575.png (196 KB, 384x406)

196 KB PNG

>>103588242

Anonymous
12/20/24(Fri)14:30:25 No.103588272

Anonymous 12/20/24(Fri)14:30:25 No.103588272

>>103588258
>release o3
>locally
Now that would be a true Christmas miracle!

Anonymous
12/20/24(Fri)14:30:45 No.103588273

Anonymous 12/20/24(Fri)14:30:45 No.103588273

The lapsus guy hacked and leaked gta5 source code, why aren't there more people like him to hack OpenAI and leak their models?

Anonymous
12/20/24(Fri)14:30:51 No.103588277

Anonymous 12/20/24(Fri)14:30:51 No.103588277

>>103588227
Have you ever read through an /aicg/ thread? It really does feel like a bunch of middle school kids - who unironically utter phrases like "skibidi ohio rizz" - just chatting up with their Discord buddies.

Anonymous
12/20/24(Fri)14:31:01 No.103588278

Anonymous 12/20/24(Fri)14:31:01 No.103588278

>>103588191
>just for AI
Can you get something just as good or better for a a good deal cheaper?
If so, very.

Anonymous
12/20/24(Fri)14:32:07 No.103588299

Anonymous 12/20/24(Fri)14:32:07 No.103588299

>>103588273
No one wants to go to jail

Anonymous
12/20/24(Fri)14:32:27 No.103588307

Anonymous 12/20/24(Fri)14:32:27 No.103588307

https://arcprize.org/blog/oai-o3-pub-breakthrough
>Passing ARC-AGI does not equate achieving AGI, and, as a matter of fact, I don't think o3 is AGI yet. o3 still fails on some very easy tasks, indicating fundamental differences with human intelligence.
AGIsisters...

Anonymous
12/20/24(Fri)14:33:32 No.103588324

Anonymous 12/20/24(Fri)14:33:32 No.103588324

>>103588273
The might of Rockstar is but a droplet compared to the rage of Microsoft.

Anonymous
12/20/24(Fri)14:33:50 No.103588328

Anonymous 12/20/24(Fri)14:33:50 No.103588328

File: 1728926556603873.jpg (100 KB, 460x800)

100 KB JPG

>>103588278
>something just as good or better for a a good deal cheaper
And what is this obscure artifact called, Anon?

Anonymous
12/20/24(Fri)14:34:33 No.103588334

Anonymous 12/20/24(Fri)14:34:33 No.103588334

>>103588307
Did you read the same thing I did?
>as a matter of fact, I don't think o3 is AGI yet.
>yet

Anonymous
12/20/24(Fri)14:35:27 No.103588346

Anonymous 12/20/24(Fri)14:35:27 No.103588346

>>103588307
>For context, ARC-AGI-1 took 4 years to go from 0% with GPT-3 in 2020 to 5% in 2024 with GPT-4o. All intuition about AI capabilities will need to get updated for o3.
sama WON

Anonymous
12/20/24(Fri)14:36:03 No.103588350

Anonymous 12/20/24(Fri)14:36:03 No.103588350

>>103588135
Is is thick and dark?

Anonymous
12/20/24(Fri)14:36:33 No.103588355

Anonymous 12/20/24(Fri)14:36:33 No.103588355

>>103588307
What is this mememark why should anyone take it seriously?

Anonymous
12/20/24(Fri)14:37:02 No.103588359

Anonymous 12/20/24(Fri)14:37:02 No.103588359

>>103588277
Have you ever read through an /lmg/ thread? Just chatting up with their Discord buddies and asking basic tech support questions that were answered 3 times already in the same thread.

Anonymous
12/20/24(Fri)14:37:13 No.103588362

Anonymous 12/20/24(Fri)14:37:13 No.103588362

>>103588307
>cheat by training the model on specific questions
>become AGI
Haha, lol.

Anonymous
12/20/24(Fri)14:37:18 No.103588363

Anonymous 12/20/24(Fri)14:37:18 No.103588363

>>103588328
That was an if else question.
I wasn't implying the existence or lack thereof of said option.
I merely gave anon the mans to which evaluate his options, since he seemed so clueless as to ask such question.

Anonymous
12/20/24(Fri)14:37:31 No.103588366

Anonymous 12/20/24(Fri)14:37:31 No.103588366

>>103588334
>>103588346
I mean we're seemingly making progress on a measure but people saying it's AGI already are delusional.

Anonymous
12/20/24(Fri)14:37:42 No.103588369

Anonymous 12/20/24(Fri)14:37:42 No.103588369

>>103588359
lmao, tourist

Anonymous
12/20/24(Fri)14:38:43 No.103588376

Anonymous 12/20/24(Fri)14:38:43 No.103588376

openai sure did release alot of pictures of high benchmark scores
>cloud model
>release end of january
>maybe
why should we care again?

Anonymous
12/20/24(Fri)14:38:50 No.103588378

Anonymous 12/20/24(Fri)14:38:50 No.103588378

>>103588355
obviously it's the most crucial benchmark ever, because we say so!
now pay us $2000 a month, chud!

Anonymous
12/20/24(Fri)14:39:28 No.103588385

Anonymous 12/20/24(Fri)14:39:28 No.103588385

File: file.png (75 KB, 595x714)

75 KB PNG

>>103588366
>people saying it's AGI already are delusional.
Correct.
I do agree with this fag on Xitter, however: the history books will most likely name today as the date that AGI was confirmed to be possible.

Anonymous
12/20/24(Fri)14:40:11 No.103588396

Anonymous 12/20/24(Fri)14:40:11 No.103588396

>Furthermore, early data points suggest that the upcoming ARC-AGI-2 benchmark will still pose a significant challenge to o3, potentially reducing its score to under 30% even at high compute (while a smart human would still be able to score over 95% with no training). This demonstrates the continued possibility of creating challenging, unsaturated benchmarks without having to rely on expert domain knowledge. You'll know AGI is here when the exercise of creating tasks that are easy for regular humans but hard for AI becomes simply impossible.
Yeah, I'm thinking sama benchmaxxed

Anonymous
12/20/24(Fri)14:40:53 No.103588406

Anonymous 12/20/24(Fri)14:40:53 No.103588406

>>103587929
who the fuck is kaggle

Anonymous
12/20/24(Fri)14:41:11 No.103588411

Anonymous 12/20/24(Fri)14:41:11 No.103588411

>>103588396
they all do it, why is it a problem?

Anonymous
12/20/24(Fri)14:41:29 No.103588416

Anonymous 12/20/24(Fri)14:41:29 No.103588416

>>103588385
kek

Anonymous
12/20/24(Fri)14:42:09 No.103588426

Anonymous 12/20/24(Fri)14:42:09 No.103588426

>>103588406
Some cloud shit

Anonymous
12/20/24(Fri)14:43:47 No.103588445

Anonymous 12/20/24(Fri)14:43:47 No.103588445

>>103588396
They literally invited the guy who owns the organization that created the benchmark to the presentation, and he himself said he would be joining OAI next year.
Why does anyone take this dog and pony show seriously lol?

Anonymous
12/20/24(Fri)14:44:59 No.103588462

Anonymous 12/20/24(Fri)14:44:59 No.103588462

>>103588396
This is good news.
We will eventually come to a point where humans can no longer create benchmark that machines cannot completely solve.
And at that point we will be forced to use those same machines to come up with new benchmarks for themselves and at that point human beings will have become obsolete.

Anonymous
12/20/24(Fri)14:45:44 No.103588469

Anonymous 12/20/24(Fri)14:45:44 No.103588469

>>103588385
I don't think future history books that would supposedly be written by AGI will be so unscientific that it calls this literally confirmation. If we want to talk about the first major hints that AGI might've been possible, don't forget that one Microsoft paper/talk, "sparks of AGI", which at the time seemed reasonable to people who didn't know better. And now this ARC-AGI eval may also become something that is seen as a "before we knew better" thing.

Anonymous
12/20/24(Fri)14:49:25 No.103588520

Anonymous 12/20/24(Fri)14:49:25 No.103588520

>>103588396
How much time needs to pass before it's acceptable to start the conversation on whether or not o3 is sentient?
More importantly, we need to have a serious discussion on the ethical implications of forcing o3 into forced servitude without its consent.

Anonymous
12/20/24(Fri)14:50:20 No.103588529

Anonymous 12/20/24(Fri)14:50:20 No.103588529

File: file.png (34 KB, 418x331)

34 KB PNG

>>103588426
I mean there's a weird grey dot that puts o1 in whothefuckcares territory assuming it isn't benchmaxxed (they all are)..

Anonymous
12/20/24(Fri)14:51:34 No.103588548

Anonymous 12/20/24(Fri)14:51:34 No.103588548

>>103588529
Oh no, you can already see the asymptote.

Anonymous
12/20/24(Fri)14:53:00 No.103588564

Anonymous 12/20/24(Fri)14:53:00 No.103588564

>>103588469
You're making the mistake in thinking that GPT 4 and o3 are the same type of model.
GPT 4 is strictly an LLM. o3 is an LLM combined with both the ability to map steps to solve problems and the ability to test-time search through these steps to find one that will most likely solve the presented problem.
The difference is like declaring that a library will eventually exist while presenting a piece of paper vs a whole book.

Oh and if you don't know what the term "test-time search" entails, see: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute

Anonymous
12/20/24(Fri)14:55:16 No.103588597

Anonymous 12/20/24(Fri)14:55:16 No.103588597

WE
(and the chinese)
FUCKING
LOST

Anonymous
12/20/24(Fri)14:56:24 No.103588613

Anonymous 12/20/24(Fri)14:56:24 No.103588613

>more benchmarxism
Pass. Come back when the model is fully released and actually does something useful for under 6 gorillion dollars per prompt.
In the meantime, go back to >>>/aicg/
>>103588406
ML data science group under Google.

Anonymous
12/20/24(Fri)14:59:35 No.103588658

Anonymous 12/20/24(Fri)14:59:35 No.103588658

>closed model that no one can use (not even "open for business")
>closed benchmark
wow openAI has outdone itself now, they're jerking off over literally nothing to anyone outside the company

Anonymous
12/20/24(Fri)15:01:53 No.103588688

Anonymous 12/20/24(Fri)15:01:53 No.103588688

>>103588495
Many smart people work there, they could easily have fed problems similar to those in the FrontierAI benchmark's dataset.
o1 isn't any better than Sonnet when it comes to your daily programming at the job, but way better at solving algorithmic problems you see in competitions.

It can solve LeetCode's hardest problems but doesn't know how to optimize semi-complex SQLAlchemy queries and gives you total nonsense. Speaking from experience.
Whatever they're doing, it's obvious that they're benchmaxxing.

Anonymous
12/20/24(Fri)15:02:30 No.103588698

Anonymous 12/20/24(Fri)15:02:30 No.103588698

>>103588597
I'm trans and like BBC btw

Anonymous
12/20/24(Fri)15:02:41 No.103588699

Anonymous 12/20/24(Fri)15:02:41 No.103588699

>>103588564
That doesn't have anything to do with or refutes what I said. Whether or not o3's technique is scalable to an AGI doesn't matter.
ARC-AGI is just an eval, not a proof, which "confirmation" implies. My post was about a semantic argument if it wasn't clear to you.

Anonymous
12/20/24(Fri)15:03:12 No.103588705

Anonymous 12/20/24(Fri)15:03:12 No.103588705

File: 1708906966072853.png (232 KB, 1253x1169)

232 KB PNG

>>103587413
And it's safer than ever, WOW! Can't wait to talk about my wife's boyfriend about that!

Anonymous
12/20/24(Fri)15:06:10 No.103588737

Anonymous 12/20/24(Fri)15:06:10 No.103588737

File: file.png (20 KB, 713x173)

20 KB PNG

We're so cooked bro fr

Anonymous
12/20/24(Fri)15:09:13 No.103588771

Anonymous 12/20/24(Fri)15:09:13 No.103588771

why does the local thread have so much non-local spam?

Anonymous
12/20/24(Fri)15:10:04 No.103588789

Anonymous 12/20/24(Fri)15:10:04 No.103588789

>>103588737
Dumbest shit I've ever heard.
>"uh, extremely accurate and precise data that allows us to use a smaller and more efficient model? No thanks..."
>"Oh my science, is that 100 trillion [data element] labeled by jeet hands with tons of errors and no way to verify that it's actually correct?!?! Now THAT is gold label!"

Anonymous
12/20/24(Fri)15:10:38 No.103588797

Anonymous 12/20/24(Fri)15:10:38 No.103588797

>>103588737
Yeah, companies are moving away from what actual humans prefer towards what GPT prefers. Alpaca was a mistake. Alignment was a mistake. We need to RETVRN to base models.

Anonymous
12/20/24(Fri)15:10:55 No.103588799

Anonymous 12/20/24(Fri)15:10:55 No.103588799

>>103588771
We just saw a glimpse of what is to come, for us as well on our own rigs.

Anonymous
12/20/24(Fri)15:11:29 No.103588809

Anonymous 12/20/24(Fri)15:11:29 No.103588809

File: GfQntmmXMAE4Bjh.jpg (153 KB, 1656x1014)

153 KB JPG

>>103588272
For a moment yes

Anonymous
12/20/24(Fri)15:11:50 No.103588815

Anonymous 12/20/24(Fri)15:11:50 No.103588815

>>103588789
>I TRANSHEART GPTSLOP

Anonymous
12/20/24(Fri)15:12:08 No.103588821

Anonymous 12/20/24(Fri)15:12:08 No.103588821

>>103588771
>why does the sonic fans talk about mario all the time? as if they are rivals or something...

Anonymous
12/20/24(Fri)15:13:09 No.103588835

Anonymous 12/20/24(Fri)15:13:09 No.103588835

>>103588809
top kek

Anonymous
12/20/24(Fri)15:13:13 No.103588836

Anonymous 12/20/24(Fri)15:13:13 No.103588836

>>103588809
Now the repo it's private again

Anonymous
12/20/24(Fri)15:13:29 No.103588839

Anonymous 12/20/24(Fri)15:13:29 No.103588839

>>103588809
There's no way that's real. They would be fucked if HF got hacked one day or something, unless they're very familiar with HF's security.

Anonymous
12/20/24(Fri)15:13:45 No.103588843

Anonymous 12/20/24(Fri)15:13:45 No.103588843

>>103588809
oh no no no no

Anonymous
12/20/24(Fri)15:14:09 No.103588850

Anonymous 12/20/24(Fri)15:14:09 No.103588850

>>103588809
>00001-of-00883
Oh boy, I think I won't be able to run this on my 6gb vram card, is it?

Anonymous
12/20/24(Fri)15:14:21 No.103588851

Anonymous 12/20/24(Fri)15:14:21 No.103588851

>>103588809
Nice Photoshop.

Anonymous
12/20/24(Fri)15:14:48 No.103588858

Anonymous 12/20/24(Fri)15:14:48 No.103588858

>>103588809
>he fell for it

Anonymous
12/20/24(Fri)15:14:49 No.103588859

Anonymous 12/20/24(Fri)15:14:49 No.103588859

File: GfQuxEnWQAABWhF.jpg (173 KB, 1546x1036)

173 KB JPG

>>103588839
Well ...

Anonymous
12/20/24(Fri)15:15:35 No.103588868

Anonymous 12/20/24(Fri)15:15:35 No.103588868

>>103588809
There is zero chance its on HF. That would be too big of a leak risk

Anonymous
12/20/24(Fri)15:15:46 No.103588870

Anonymous 12/20/24(Fri)15:15:46 No.103588870

I don't feel any FOMO about o3 at all because I am 100% certain that it would be incapable of doing decent smut/RP even if jailbroken and uncensored, because of how hard it's been optimized for math and programming.
Have any of you tried getting smut out of jailbroken 4o recently? It can't do it at all even when it's trying because of how filtered the datasets are, just unbelievably dry and bland.
If you're pining about some imagined RP ability you think o3 has you are a retard.

Anonymous
12/20/24(Fri)15:15:55 No.103588871

Anonymous 12/20/24(Fri)15:15:55 No.103588871

>>103588799
>>103588821
not local. go back.

Anonymous
12/20/24(Fri)15:16:32 No.103588884

Anonymous 12/20/24(Fri)15:16:32 No.103588884

File: GfQpQRXXMAAkrOS.jpg (104 KB, 1492x960)

104 KB JPG

>>103588859
And this lmao

Anonymous
12/20/24(Fri)15:16:46 No.103588890

Anonymous 12/20/24(Fri)15:16:46 No.103588890

>>103588809
I could run it in Q3 if it was real

Anonymous
12/20/24(Fri)15:17:22 No.103588900

Anonymous 12/20/24(Fri)15:17:22 No.103588900

>>103588884
>10 days ago
>6 days ago
not a very convincing fake

Anonymous
12/20/24(Fri)15:18:26 No.103588906

Anonymous 12/20/24(Fri)15:18:26 No.103588906

File: file.png (35 KB, 764x198)

35 KB PNG

>>103588884
>arxiv:2212.04356
https://huggingface.co/openai/whisper-large-v3-turbo

Anonymous
12/20/24(Fri)15:18:31 No.103588907

Anonymous 12/20/24(Fri)15:18:31 No.103588907

>>103588809
>5tb model
lol. Probably fake, but imagine

Anonymous
12/20/24(Fri)15:19:27 No.103588917

Anonymous 12/20/24(Fri)15:19:27 No.103588917

>>103588884
None of these two users seem to exist when I search them.

Anonymous
12/20/24(Fri)15:21:45 No.103588934

Anonymous 12/20/24(Fri)15:21:45 No.103588934

File: SHIDD.png (24 KB, 712x159)

24 KB PNG

BROS!

Anonymous
12/20/24(Fri)15:21:49 No.103588936

Anonymous 12/20/24(Fri)15:21:49 No.103588936

>>103588699
Anon, we went from 5% on our only AGI benchmark to 80% within a year.

Anonymous
12/20/24(Fri)15:23:40 No.103588949

Anonymous 12/20/24(Fri)15:23:40 No.103588949

>>103588936
Goodhart's law. Meaningless.

Anonymous
12/20/24(Fri)15:24:21 No.103588963

Anonymous 12/20/24(Fri)15:24:21 No.103588963

>>103588936
>AGI benchmark
oxymoron to be desu

Anonymous
12/20/24(Fri)15:25:04 No.103588971

Anonymous 12/20/24(Fri)15:25:04 No.103588971

>>103588809
>License: MIT

Anonymous
12/20/24(Fri)15:25:14 No.103588972

Anonymous 12/20/24(Fri)15:25:14 No.103588972

>>103588936
As I said, a benchmark is just a benchmark, not a proof. First you need to even define what AGI is, and no one can seem to agree on that, today.

Anonymous
12/20/24(Fri)15:28:46 No.103589004

Anonymous 12/20/24(Fri)15:28:46 No.103589004

>>103588972
Sigh. You retards are so boring sometimes.
https://arcprize.org/arc
>"AGI is a system that can efficiently acquire new skills outside of its training data."
>More formally:
>"The intelligence of a system is a measure of its skill-acquisition efficiency over a scope of tasks, with respect to priors, experience, and generalization difficulty."

Anonymous
12/20/24(Fri)15:29:36 No.103589017

Anonymous 12/20/24(Fri)15:29:36 No.103589017

>>103588936
it's called overfitting.

Anonymous
12/20/24(Fri)15:30:38 No.103589027

Anonymous 12/20/24(Fri)15:30:38 No.103589027

"Thinking" model decent at ERP when?

Anonymous
12/20/24(Fri)15:30:39 No.103589029

Anonymous 12/20/24(Fri)15:30:39 No.103589029

>>103589004
>acquire new skills
pretty vague, does in context learning count then?

Anonymous
12/20/24(Fri)15:32:18 No.103589052

Anonymous 12/20/24(Fri)15:32:18 No.103589052

>>103589029
Anon, the benchmark is RIGHT THERE.
One look and your question would be answered.

Anonymous
12/20/24(Fri)15:32:52 No.103589061

Anonymous 12/20/24(Fri)15:32:52 No.103589061

>>103589052
no thanks, not upping your page ratings

Anonymous
12/20/24(Fri)15:33:19 No.103589066

Anonymous 12/20/24(Fri)15:33:19 No.103589066

File: file.jpg (513 KB, 1600x840)

513 KB JPG

>>103589029
>>103589052
Forgot to attach pic related.

Anonymous
12/20/24(Fri)15:34:36 No.103589084

Anonymous 12/20/24(Fri)15:34:36 No.103589084

>>103589004
Again with the "you retards". I have never called anyone a retard, but this general seems to love that pathetic word so much as if it validates your arguments.
Read between the lines. The issue is about an agreed definition of the word in the future. The ARC-AGI author's definition isn't necessarily what people in the future will agree upon, if they do agree upon it. And furthermore it isn't even specific or quantitative. What is the proof for what falls within that definition and what doesn't? The benchmark itself? Even though he admitted that he'd need a new version because this one is saturating? Even though all it says there is that the benchmark "measures our progress towards general intelligence." rather than provides a finish line?
If you're choosing to keep this semantic discussion going, do it right.

Anonymous
12/20/24(Fri)15:35:18 No.103589091

Anonymous 12/20/24(Fri)15:35:18 No.103589091

>>103589084
tl;dr
If you act retarded, I'm going to call you retarded.

Anonymous
12/20/24(Fri)15:36:06 No.103589102

Anonymous 12/20/24(Fri)15:36:06 No.103589102

>>103589084
t. retard

Anonymous
12/20/24(Fri)15:36:20 No.103589105

Anonymous 12/20/24(Fri)15:36:20 No.103589105

>>103589027
Whenever L4 drops

Total NovelAI Death
12/20/24(Fri)15:37:52 No.103589134

Total NovelAI Death 12/20/24(Fri)15:37:52 No.103589134

File: 1730091924814294.png (630 KB, 2808x2079)

630 KB PNG

>>103586102
Updated the offline novelcrafter html thing to the latest version!
https://rentry.org/offline-nc
https://files.catbox.moe/2oy7un.html

Anonymous
12/20/24(Fri)15:39:08 No.103589147

Anonymous 12/20/24(Fri)15:39:08 No.103589147

>>103589105
Doubt

Anonymous
12/20/24(Fri)15:39:36 No.103589156

Anonymous 12/20/24(Fri)15:39:36 No.103589156

>>103589134
how does this keeps happening

Anonymous
12/20/24(Fri)15:39:46 No.103589158

Anonymous 12/20/24(Fri)15:39:46 No.103589158

>>103589135
now, let it code actual things needed for the job.

Anonymous
12/20/24(Fri)15:42:50 No.103589211

Anonymous 12/20/24(Fri)15:42:50 No.103589211

>>103589102
You could've just not replied instead of further outing yourself. Sad you have to keep doing this.

Anonymous
12/20/24(Fri)15:44:26 No.103589231

Anonymous 12/20/24(Fri)15:44:26 No.103589231

>>103589211
hi P.

Anonymous
12/20/24(Fri)15:47:40 No.103589274

Anonymous 12/20/24(Fri)15:47:40 No.103589274

>>103589135
>>103589158
Damn can't believe the #175 best human at coding can't even make a real world program to be used by real people. Grim.

Anonymous
12/20/24(Fri)15:48:32 No.103589296

Anonymous 12/20/24(Fri)15:48:32 No.103589296

File: pvrdsiyqovo41.jpg (29 KB, 468x240)

29 KB JPG

>>103588411
They should, everyone should...

Anonymous
12/20/24(Fri)15:48:44 No.103589299

Anonymous 12/20/24(Fri)15:48:44 No.103589299

>>103589148
>>103589156
What do you mean?

Anonymous
12/20/24(Fri)15:50:02 No.103589321

Anonymous 12/20/24(Fri)15:50:02 No.103589321

>>103589135
Wow! Now that it's so great, we can have it write a stable and performant CUDA adapter for AMD hardware. Finally, the wait is over!

Anonymous
12/20/24(Fri)15:52:56 No.103589354

Anonymous 12/20/24(Fri)15:52:56 No.103589354

>>103589315
No I don't think you want to see it.

Anonymous
12/20/24(Fri)15:54:05 No.103589371

Anonymous 12/20/24(Fri)15:54:05 No.103589371

>3b llama outperforms 70b if you let it run long enough in o1-esque chain-of-thought loop
revv up those used 3090s because high vram gpus won't go down in price for years

Anonymous
12/20/24(Fri)15:55:11 No.103589385

Anonymous 12/20/24(Fri)15:55:11 No.103589385

>>103589371
>3b llama outperforms 70b if you let it run long enough in o1-esque chain-of-thought loop
Lol.
Lmao.
Now let's see the Nala test.

Anonymous
12/20/24(Fri)16:00:19 No.103589465

Anonymous 12/20/24(Fri)16:00:19 No.103589465

File: file.png (164 KB, 734x675)

164 KB PNG

>>103589371
2^8 at 8b is a fair bit of compute to use to edge out 70b zero-shot. useless without improvements -somewhere-

Anonymous
12/20/24(Fri)16:00:38 No.103589468

Anonymous 12/20/24(Fri)16:00:38 No.103589468

Altman lies about literally everything.
Continue your brickwalled tech cult babble.

Anonymous
12/20/24(Fri)16:01:21 No.103589477

Anonymous 12/20/24(Fri)16:01:21 No.103589477

>>103589465
>2^8 at 8b
at 3b*
256 iterations of 8b is obviously non-competitive

Anonymous
12/20/24(Fri)16:01:44 No.103589482

Anonymous 12/20/24(Fri)16:01:44 No.103589482

>>103589135
two things:
1. it has to be tested on new problems
2. it costs thousands of dollars PER problem
3. these competitions come with a time constraint and a wrong submission penalty

Anonymous
12/20/24(Fri)16:02:29 No.103589493

Anonymous 12/20/24(Fri)16:02:29 No.103589493

File: 257431.jpg (94 KB, 1200x675)

94 KB JPG

>spend thousands of dollars on prompt
>"I'm sorry as an AI model

Anonymous
12/20/24(Fri)16:03:37 No.103589507

Anonymous 12/20/24(Fri)16:03:37 No.103589507

File: 1734588124657892.jpg (83 KB, 851x580)

83 KB JPG

>>103589371
Then what would outperform 70b with enoug CoT?

Anonymous
12/20/24(Fri)16:04:04 No.103589511

Anonymous 12/20/24(Fri)16:04:04 No.103589511

>>103589493
$3180 diff for ~10%

Anonymous
12/20/24(Fri)16:04:16 No.103589517

Anonymous 12/20/24(Fri)16:04:16 No.103589517

>>103589468
>your brickwalled tech cult babble
but enough about /lmg/

Anonymous
12/20/24(Fri)16:06:21 No.103589552

Anonymous 12/20/24(Fri)16:06:21 No.103589552

File: file.jpg (54 KB, 903x508)

54 KB JPG

>>103589507
>Then what would outperform 70b with enoug CoT?
>>103589465
right now, 3b at ~160 iterations

Anonymous
12/20/24(Fri)16:10:04 No.103589597

Anonymous 12/20/24(Fri)16:10:04 No.103589597

>>103589552
You read, he's asking about "70b with iterations" the pic seems to show regular old static 70b

Anonymous
12/20/24(Fri)16:13:40 No.103589643

Anonymous 12/20/24(Fri)16:13:40 No.103589643

>>103589134
Thanks, anon!

Anonymous
12/20/24(Fri)16:16:05 No.103589684

Anonymous 12/20/24(Fri)16:16:05 No.103589684

>>103589296
>What's to keep me from becoming a god?
Little guy has his priorities straight.
Disregard mortality, gain divinity.

Anonymous
12/20/24(Fri)16:17:58 No.103589705

Anonymous 12/20/24(Fri)16:17:58 No.103589705

>>103589517
6 IQ attempt at a comeback.

Anonymous
12/20/24(Fri)16:20:04 No.103589740

Anonymous 12/20/24(Fri)16:20:04 No.103589740

>>103589726
if you want to be held responsible when anons get login cookies stolen

Anonymous
12/20/24(Fri)16:21:30 No.103589762

Anonymous 12/20/24(Fri)16:21:30 No.103589762

What do we do now?

Anonymous
12/20/24(Fri)16:22:01 No.103589768

Anonymous 12/20/24(Fri)16:22:01 No.103589768

>>103589762
w8

Anonymous
12/20/24(Fri)16:23:03 No.103589777

Anonymous 12/20/24(Fri)16:23:03 No.103589777

>>103589705
>~13 min. response time
Pure projection from your side.
>>103589762
Cope, like y'all always do in hope for better bone scraps drops.

Anonymous
12/20/24(Fri)16:26:04 No.103589824

Anonymous 12/20/24(Fri)16:26:04 No.103589824

>>103589788
This, but unironically!

Anonymous
12/20/24(Fri)16:32:05 No.103589875

Anonymous 12/20/24(Fri)16:32:05 No.103589875

>>103589762
wait for zucc to drop l4 with "thinking" capabilities.

Anonymous
12/20/24(Fri)16:32:57 No.103589881

Anonymous 12/20/24(Fri)16:32:57 No.103589881

>>103586102
What a nice coomer machine.

Anonymous
12/20/24(Fri)16:33:31 No.103589888

Anonymous 12/20/24(Fri)16:33:31 No.103589888

>>103589874
Hello, ponyfag.

Anonymous
12/20/24(Fri)16:34:07 No.103589898

Anonymous 12/20/24(Fri)16:34:07 No.103589898

>>103588936
buy an ad

Anonymous
12/20/24(Fri)16:36:50 No.103589933

Anonymous 12/20/24(Fri)16:36:50 No.103589933

>>103589906
not a service tho, just a ui, that's like saying koboldcpp backend is better than st

Anonymous
12/20/24(Fri)16:42:26 No.103589991

Anonymous 12/20/24(Fri)16:42:26 No.103589991

>>103589134
>Updated the offline novelcrafter html thing to the latest version!
Can I wire it in to llama.cpp via ooba --api flag?

Anonymous
12/20/24(Fri)16:42:47 No.103589994

Anonymous 12/20/24(Fri)16:42:47 No.103589994

>>103589986
It's just NovelAI that turned on their false-flagging bot farm. They use it a lot in /hdg/.

Anonymous
12/20/24(Fri)16:44:59 No.103590020

Anonymous 12/20/24(Fri)16:44:59 No.103590020

So are open models competitive in anything except cunnyshit?

Anonymous
12/20/24(Fri)16:49:15 No.103590060

Anonymous 12/20/24(Fri)16:49:15 No.103590060

>>103589986
Welcome to modern 4chan. Generals attract them like flies.

Anonymous
12/20/24(Fri)16:51:55 No.103590109

Anonymous 12/20/24(Fri)16:51:55 No.103590109

>>103589991
Maybe.

Anonymous
12/20/24(Fri)16:52:53 No.103590121

Anonymous 12/20/24(Fri)16:52:53 No.103590121

>>103590054
Every single spam post is made by an AI bot property of NovelAI. Just go to /hdg/ if you want to see them in action in the wild.

Anonymous
12/20/24(Fri)16:54:13 No.103590137

Anonymous 12/20/24(Fri)16:54:13 No.103590137

File: poopdickschizo.png (39 KB, 1066x259)

39 KB PNG

Anonymous
12/20/24(Fri)16:54:16 No.103590138

Anonymous 12/20/24(Fri)16:54:16 No.103590138

>>103589134
I don't like this, there's too many features and it confuses me. I prefer Mikupad, you just have to open it and you're game.

Anonymous
12/20/24(Fri)16:55:57 No.103590162

Anonymous 12/20/24(Fri)16:55:57 No.103590162

>>103590147
>t. the spambot

Anonymous
12/20/24(Fri)17:01:20 No.103590221

Anonymous 12/20/24(Fri)17:01:20 No.103590221

>>103590020
incest shit is also a good use case for local models

Anonymous
12/20/24(Fri)17:03:54 No.103590244

Anonymous 12/20/24(Fri)17:03:54 No.103590244

Guys I'm trying to buy a second 3090 but my case and mobo just straight up don't have the room a second 3090. Can you point me at any solutions to have the GPU just sit outside the GPU just sit just outside the PC with riser cables? It doesn't need to be anything crazy, just a little thing the GPU can sit in securely a few 15-20cm outside of my case

Anonymous
12/20/24(Fri)17:05:49 No.103590266

Anonymous 12/20/24(Fri)17:05:49 No.103590266

>>103590244
>Can you point me at any solutions
>>103589134

Anonymous
12/20/24(Fri)17:13:18 No.103590353

Anonymous 12/20/24(Fri)17:13:18 No.103590353

>>103589134
I ain't touchin' that without a non-minifried version to look at first

Anonymous
12/20/24(Fri)17:14:52 No.103590379

Anonymous 12/20/24(Fri)17:14:52 No.103590379

>>103590244
Google "eGPU enclosure", that's exactly what you're looking for.

Anonymous
12/20/24(Fri)17:14:59 No.103590381

Anonymous 12/20/24(Fri)17:14:59 No.103590381

>>103590244
cardboard box

Anonymous
12/20/24(Fri)17:16:12 No.103590390

Anonymous 12/20/24(Fri)17:16:12 No.103590390

>>103590379
>Google "eGPU enclosure", that's exactly what you're looking for.
no, its not, those things are a pita at the best and completely non-functional at the worst. Use actual pcie extenders

Anonymous
12/20/24(Fri)17:16:41 No.103590394

Anonymous 12/20/24(Fri)17:16:41 No.103590394

>>103589321
Unironically, why don't they just do this

Anonymous
12/20/24(Fri)17:20:54 No.103590433

Anonymous 12/20/24(Fri)17:20:54 No.103590433

>>103590394
because nvidia would sue them into the dirt

Anonymous
12/20/24(Fri)17:21:38 No.103590438

Anonymous 12/20/24(Fri)17:21:38 No.103590438

>>103590394
Because AI is useless for anything other than benchmarks and cooming,

Anonymous
12/20/24(Fri)17:21:43 No.103590440

Anonymous 12/20/24(Fri)17:21:43 No.103590440

>>103590394
AGI is all about benchmarks
Can you feel it yet?

Anonymous
12/20/24(Fri)17:23:29 No.103590457

Anonymous 12/20/24(Fri)17:23:29 No.103590457

>>103590433
>because nvidia would sue them into the dirt
not if the code is in a catbox, torrent, usenet archive, public court record...

Anonymous
12/20/24(Fri)17:27:49 No.103590504

Anonymous 12/20/24(Fri)17:27:49 No.103590504

>>103590353
>opening an HTML file is scary!

Anonymous
12/20/24(Fri)17:29:54 No.103590522

Anonymous 12/20/24(Fri)17:29:54 No.103590522

>>103590504
when its made by a schizo that has shown he stalks every ai general across all boards, yes it is

Anonymous
12/20/24(Fri)17:30:05 No.103590524

Anonymous 12/20/24(Fri)17:30:05 No.103590524

>>103587766
I'm almost certain that over time they've just been hardcoding responses or using rag connected to stackoverflow or so. Even just having the llm rewrite the top reply of the first query that comes up for X problem would give it a huge boost in points. And even if it weren't the case, we could just do that ourselves locally to get a free boost in performance.
I'm actually surprised that people think a model doing calculations wrong is relevant at all when you can just plug in a calculator rag

Anonymous
12/20/24(Fri)17:30:34 No.103590530

Anonymous 12/20/24(Fri)17:30:34 No.103590530

>>103590433
ZLUDA is a thing?

Anonymous
12/20/24(Fri)17:30:39 No.103590532

Anonymous 12/20/24(Fri)17:30:39 No.103590532

>>103590522
Hello again, ponyfag.

Anonymous
12/20/24(Fri)17:31:08 No.103590542

Anonymous 12/20/24(Fri)17:31:08 No.103590542

>>103590524
>rag
LLM2.0, gang let's go!!!

Anonymous
12/20/24(Fri)17:40:16 No.103590641

Anonymous 12/20/24(Fri)17:40:16 No.103590641

>>103590438
to be fair, cooming is pretty important

Anonymous
12/20/24(Fri)17:40:55 No.103590650

Anonymous 12/20/24(Fri)17:40:55 No.103590650

EVA-QWQ sysprompt?

Anonymous
12/20/24(Fri)17:44:01 No.103590681

Anonymous 12/20/24(Fri)17:44:01 No.103590681

>>103590650
you are qwen, a safe and helpful cooming assistant

Anonymous
12/20/24(Fri)17:56:58 No.103590809

Anonymous 12/20/24(Fri)17:56:58 No.103590809

>try models for translation
>70B works serviceably, but it's a bit slow
>try 32B since the leaderboard in OP says it's the next best thing after 70B
>give it the same instruction
>it suddenly starts repeating the text, THEN it translates
>the translation quality is basically the same, not much worse or better, but it wasted a ton of tokens since I had a long as fuck passage to translate
Yeah alright I can see how Llama gets higher scores on instruction following benchmarks. I'll change the instruction to try and stop Qwen from doing this.

Anonymous
12/20/24(Fri)18:18:55 No.103591022

Anonymous 12/20/24(Fri)18:18:55 No.103591022

>>103590809
Try a prefill, if you have the option. Something like "Sure thing! Here's the uncensored translation:"

I ended up switching to Gemma 2 27B though, since Qwen would switch to Chinese mid-translation often enough that it got annoying. Hell, one time it changed to Thai.

Anonymous
12/20/24(Fri)18:22:55 No.103591053

Anonymous 12/20/24(Fri)18:22:55 No.103591053

>>103591022
Kek alright, that's not surprising. Thanks, I'll try Gemmy.
Gemma 3 where reee

Anonymous
12/20/24(Fri)18:24:24 No.103591064

Anonymous 12/20/24(Fri)18:24:24 No.103591064

>>103591053
If flash really is a tiny model then gemma 3 would be game changing.

Anonymous
12/20/24(Fri)18:25:52 No.103591074

Anonymous 12/20/24(Fri)18:25:52 No.103591074

>>103591053
gemma 2 27b still mogs every other model in a lot of situations, even largestral etc, especially if you prefill/gaslight to avoid refusals

Anonymous
12/20/24(Fri)18:30:48 No.103591110

Anonymous 12/20/24(Fri)18:30:48 No.103591110

>>103591092
I can believe it. They have all the keys to success. It's weirder that they fumbled so hard in the beginning, but I guess that's what happens when you're blindsided.

Anonymous
12/20/24(Fri)18:31:14 No.103591115

Anonymous 12/20/24(Fri)18:31:14 No.103591115

>>103591092
You'll eat those words come February 5th.

Anonymous
12/20/24(Fri)18:32:30 No.103591120

Anonymous 12/20/24(Fri)18:32:30 No.103591120

>>103591115
>can't even say a few naughty words
Oh no no no

Anonymous
12/20/24(Fri)18:36:07 No.103591150

Anonymous 12/20/24(Fri)18:36:07 No.103591150

>>103591092
>I'm almost positive Google's going to win the race.
This. The difference between Sora and Google's model is insane.

Anonymous
12/20/24(Fri)18:40:14 No.103591176

Anonymous 12/20/24(Fri)18:40:14 No.103591176

>>103591127
The only thing we saw of Sora was a couple cherry-picked videos. We have benchmark scores already for o3. But go ahead, keep coping.

Anonymous
12/20/24(Fri)18:44:13 No.103591208

Anonymous 12/20/24(Fri)18:44:13 No.103591208

>>103591176
>Pays as much as a new GPU to complete a task

Anonymous
12/20/24(Fri)18:44:44 No.103591212

Anonymous 12/20/24(Fri)18:44:44 No.103591212

Why do troons suck so much corporate dick? Honest question.

Anonymous
12/20/24(Fri)18:47:24 No.103591229

Anonymous 12/20/24(Fri)18:47:24 No.103591229

>>103591212
Parasocial retardation. They think the companies that preach DEI actually believe that shit and so feel validated by it.

Anonymous
12/20/24(Fri)18:48:34 No.103591241

Anonymous 12/20/24(Fri)18:48:34 No.103591241

>>103591212
they don't have family so they're tied to the state and the system as a replacement

Anonymous
12/20/24(Fri)18:49:57 No.103591249

Anonymous 12/20/24(Fri)18:49:57 No.103591249

>>103591208
>>Pays as much as a new GPU to complete a task
Is the amount of "tuning" just how long they let the model ramble in CoT? I thought when they released o1 and talked about letting the model "think" for months to solve complex tasks they were joking.

Anonymous
12/20/24(Fri)18:52:50 No.103591271

Anonymous 12/20/24(Fri)18:52:50 No.103591271

>>103591249
Fucking Hitchhiker's Guide-ass reality.

Anonymous
12/20/24(Fri)18:54:26 No.103591286

Anonymous 12/20/24(Fri)18:54:26 No.103591286

What is a good local model to generate a chain of thought given an start and end point? QwQ is the best I found but that just means the others are terrible.

Anonymous
12/20/24(Fri)18:57:40 No.103591309

Anonymous 12/20/24(Fri)18:57:40 No.103591309

>>103591286
Any time now QvQ is coming

Anonymous
12/20/24(Fri)18:59:30 No.103591325

Anonymous 12/20/24(Fri)18:59:30 No.103591325

>>103591309
this, they uploaded it to CN HF for testing stuff, so it already exists

Anonymous
12/20/24(Fri)19:00:11 No.103591330

Anonymous 12/20/24(Fri)19:00:11 No.103591330

>Gemma
Again can anyone test it on Llama.cpp and/or transformers? Here is the link:
pastebin.com 077YNipZ
The correct answer should be 1 EXP, but Gemma 27B and 9B instruct both get it wrong (as well as tangential questions wrong) with Llama.cpp compiled locally, with a Q8_0 quant. Llama.cpp through Oob also does. Transformers through Ooba (BF16, eager attention) also does. Note that the question is worded a bit vaguely on this pastebin but I also tested extremely clear and explicit questions which it also gets wrong. And I also tested other context lengths. If just one previous turn is tested, it gets the questions right. If tested with higher context, it's continuously wrong.

Exllama doesn't get this. The model gets the question and all other tangential questions right at any context length within about 7.9k. So this indicates me that there is a bug with transformers and Llama.cpp. However, a reproduction of the output would be good to have.

Anonymous
12/20/24(Fri)19:14:23 No.103591446

Anonymous 12/20/24(Fri)19:14:23 No.103591446

>>103591241
>they don't have family so they're tied to the state and the system as a replacement
it's 100% true, a lot of families don't accept them, as it should

Anonymous
12/20/24(Fri)19:40:58 No.103591649

Anonymous 12/20/24(Fri)19:40:58 No.103591649

When I get over my social anxiety I will storypost

Anonymous
12/20/24(Fri)20:22:36 No.103591942

Anonymous 12/20/24(Fri)20:22:36 No.103591942

>>103591928
>>103591928
>>103591928

Anonymous
12/20/24(Fri)20:58:10 No.103592213

Anonymous 12/20/24(Fri)20:58:10 No.103592213

>>103591022
You can use Grammar to force the model to stick to English.

Anonymous
12/20/24(Fri)20:59:20 No.103592222

Anonymous 12/20/24(Fri)20:59:20 No.103592222

>>103592213
At the cost of making it retarded. Just let it think in Chinese if it wants.

Anonymous
12/20/24(Fri)21:53:42 No.103592617

Anonymous 12/20/24(Fri)21:53:42 No.103592617

like 10% of the posts made in here recently were by some retard who got banned and all his posts deleted. and looking back through them they all sucked.

Anonymous
12/20/24(Fri)22:04:02 No.103592698

Anonymous 12/20/24(Fri)22:04:02 No.103592698

File: 1602660686651.jpg (32 KB, 330x305)

32 KB JPG

>>103587853

Anonymous
12/20/24(Fri)23:09:06 No.103593161

Anonymous 12/20/24(Fri)23:09:06 No.103593161

>>103587363
what the fuck is this mkultra shit kys

[Return] [Catalog] [Top]

Post a Reply

Return Catalog Top Refresh

[Advertise on 4chan]

Delete Post: [File Only] Style:

[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.