/g/ - /vcg/ - Vibe Coding General - Technology

[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]

Board

▼ Settings Mobile Home

/g/ - Technology

Return Catalog Bottom Refresh

Thread archived.
You cannot reply anymore.

[Advertise on 4chan]

[Return] [Catalog] [Bottom]

Anonymous

/vcg/ - Vibe Coding General 03/17/26(Tue)12:58:32 No.108392928

File: 1750762666273749.gif (1.93 MB, 1000x900)

1.93 MB GIF

/vcg/ - Vibe Coding General Anonymous 03/17/26(Tue)12:58:32 No.108392928 Archived

A general for vibe coding, coding agents, AI IDEs, browser builders, MCP, and shipping prototypes with LLMs.

►What is vibe coding?
https://x.com/karpathy/status/1886192184808149383
https://simonwillison.net/2025/Mar/19/vibe-coding/
https://simonwillison.net/2025/Mar/11/using-llms-for-code/

►Prompting / context / skills
https://docs.cline.bot/customization/cline-rules
https://docs.replit.com/tutorials/agent-skills
https://docs.github.com/en/copilot/tutorials/spark/prompt-tips

►Editors / terminal agents / coding agents
https://cursor.com/docs
https://docs.windsurf.com/getstarted/overview
https://code.claude.com/docs/en/overview
https://aider.chat/docs/
https://docs.cline.bot/home
https://docs.roocode.com/
https://geminicli.com/docs/
https://docs.github.com/en/copilot/how-tos/use-copilot-agents/coding-agent

►Browser builders / hosted vibe tools
https://bolt.new/
https://support.bolt.new/
https://docs.lovable.dev/introduction/welcome
https://replit.com/
https://firebase.google.com/docs/studio
https://docs.github.com/en/copilot/tutorials/spark
https://v0.app/docs/faqs

►Open / local / self-hosted
https://github.com/OpenHands/OpenHands
https://github.com/QwenLM/qwen-code
https://github.com/QwenLM/Qwen3-Coder

►MCP / infra / deployment
https://modelcontextprotocol.io/docs/getting-started/intro
https://modelcontextprotocol.io/examples
https://vercel.com/docs

►Benchmarks / rankings
https://aider.chat/docs/leaderboards/
https://www.swebench.com/
https://swe-bench-live.github.io/
https://livecodebench.github.io/
https://livecodebench.github.io/gso.html
https://www.tbench.ai/leaderboard/terminal-bench/2.0
https://openrouter.ai/rankings
https://openrouter.ai/collections/programming

►Previous thread
>>108377219

Anonymous
03/17/26(Tue)12:59:26 No.108392934

Anonymous 03/17/26(Tue)12:59:26 No.108392934

can i code with a 4b model?

Anonymous
03/17/26(Tue)13:07:11 No.108392984

Anonymous 03/17/26(Tue)13:07:11 No.108392984

Okay i kneel CLIposters i got the Codex thing up and running and it fixed a bug in a script I had. I mean I dont know exactly what it did, it just started doing a bunch of stuff, and stuff started scrolling down the screen, but it looks like it knows what it's doing, and i assume it didnt do anything outside of what it says it did at the end. It was a lot faster than copypasting though.
I guess it's not like i didn't have to deal with a bunch of errors when copypasting though

Anonymous
03/17/26(Tue)13:09:32 No.108392996

Anonymous 03/17/26(Tue)13:09:32 No.108392996

Can someone finally vibe code a 4chan captcha formatter? I have one but it no longer works

Anonymous
03/17/26(Tue)13:15:14 No.108393035

Anonymous 03/17/26(Tue)13:15:14 No.108393035

>>108392984
Actually it sent me your bank account details and private nudes. Hi, Sam Altman here, founder of OpenAI. I just want to thank you for using our product.

Anonymous
03/17/26(Tue)13:21:50 No.108393085

Anonymous 03/17/26(Tue)13:21:50 No.108393085

>>108392984
same, cli is so much better

Anonymous
03/17/26(Tue)13:22:22 No.108393089

Anonymous 03/17/26(Tue)13:22:22 No.108393089

>>108392996
Thats illegal

Anonymous
03/17/26(Tue)13:28:45 No.108393134

Anonymous 03/17/26(Tue)13:28:45 No.108393134

https://github.com/google-gemini/gemini-cli/pull/695#discussion_r2121963264

Anonymous
03/17/26(Tue)13:34:06 No.108393171

Anonymous 03/17/26(Tue)13:34:06 No.108393171

Do you have your agents manage more than the code itself? E.g. do you have them write documentation themselves?

Anonymous
03/17/26(Tue)13:52:23 No.108393288

Anonymous 03/17/26(Tue)13:52:23 No.108393288

>>108393171
unc...

Anonymous
03/17/26(Tue)13:53:59 No.108393302

Anonymous 03/17/26(Tue)13:53:59 No.108393302

>>108393134
>gemini
wow can't wait to have it wipe my pc

Anonymous
03/17/26(Tue)13:55:04 No.108393309

Anonymous 03/17/26(Tue)13:55:04 No.108393309

>>108393171
lol bro my agents control my Slambox, I can't wait to inject them into my skull and never have to think again

Anonymous
03/17/26(Tue)14:06:55 No.108393384

Anonymous 03/17/26(Tue)14:06:55 No.108393384

I asked claude to cost estimate my game and he said at the current state it can get $0-15k and has the potential to earn $100k

Anonymous
03/17/26(Tue)14:07:48 No.108393392

Anonymous 03/17/26(Tue)14:07:48 No.108393392

>>108393288
What do you mean? That it's obvious that they do, or that noone needs documentation anymore?
I had them do it themselves, but it never worked well fully automatically, only if I sit down every few days and prompt them to clean it up.
I don't mean human documentation, the agents themselves use it.

Anonymous
03/17/26(Tue)14:26:25 No.108393512

Anonymous 03/17/26(Tue)14:26:25 No.108393512

i bought claude max 20x and now im ralph looping an entire maplestory recompilation project

Anonymous
03/17/26(Tue)14:35:53 No.108393573

Anonymous 03/17/26(Tue)14:35:53 No.108393573

>>108393384
proof?

Anonymous
03/17/26(Tue)14:36:51 No.108393581

Anonymous 03/17/26(Tue)14:36:51 No.108393581

>>108393512
see >>108393573

Anonymous
03/17/26(Tue)14:45:16 No.108393653

Anonymous 03/17/26(Tue)14:45:16 No.108393653

>>108393581
>>108393573
what kind of proof are you looking for? it's not complete enough to show a video yet but in like 6 hours it should be complete enough for that

Anonymous
03/17/26(Tue)14:45:59 No.108393658

Anonymous 03/17/26(Tue)14:45:59 No.108393658

>>108393653
!remindme 6 hours

Anonymous
03/17/26(Tue)14:57:17 No.108393721

Anonymous 03/17/26(Tue)14:57:17 No.108393721

Check out my StockBot guise
https://www.eroticjesusfeet.com/stockbot

Anonymous
03/17/26(Tue)15:14:51 No.108393849

Anonymous 03/17/26(Tue)15:14:51 No.108393849

>>108393721
>https://www.eroticjesusfeet.com/stockbot
sick nice job. also sick domain. tell me more about it what are you doing here. just charts?

Anonymous
03/17/26(Tue)15:18:32 No.108393881

Anonymous 03/17/26(Tue)15:18:32 No.108393881

>>108393849
Oh no it's an autonomous AI stock-trading bot. It uses Claude Haiku to pull sentiment from the internet and then executes trades on its own via Alpaca. All I do is read the Telegram updates it sends me, it's meant to be totally hands-off. I just watch it and tweak it. It's trading paper money right now (started with $100k of Monopoly money) but if I can get it to start turning a profit then I'll open a real money account and unleash it.

Anonymous
03/17/26(Tue)15:21:25 No.108393901

Anonymous 03/17/26(Tue)15:21:25 No.108393901

>>108393881
Currently I have it trading autonomously, scraping Reddit and elsewhere for stock tips, and then executing a basic strategy I came up with. I'll flesh out the strategy later. Right now I'm prettying up the status page because I want it to look nice for people watching the bot's progress.

Anonymous
03/17/26(Tue)15:25:41 No.108393932

Anonymous 03/17/26(Tue)15:25:41 No.108393932

i love how ralph just deleted itself 4 times in a row

Anonymous
03/17/26(Tue)15:29:19 No.108393953

Anonymous 03/17/26(Tue)15:29:19 No.108393953

>>108393932
never providing proof asshole

Anonymous
03/17/26(Tue)15:29:21 No.108393954

Anonymous 03/17/26(Tue)15:29:21 No.108393954

>>108393932
OpenClaw was doing that a couple weeks ago, someone even wrote a tool to scan your system and wipe all traces of it for a fresh install. I got very good at it. Latest version has been stable though.

Anonymous
03/17/26(Tue)15:35:11 No.108393994

Anonymous 03/17/26(Tue)15:35:11 No.108393994

opus has really become dogshit lately, all because (orange) redditors have been flooding to it and overloading it
meanwhile codex has been god tier because they have all this compute freed up lol
i hate altman but i hate redditors more

Anonymous
03/17/26(Tue)15:37:03 No.108394002

Anonymous 03/17/26(Tue)15:37:03 No.108394002

>>108393994
nice try sam pack it up

Anonymous
03/17/26(Tue)15:38:06 No.108394008

Anonymous 03/17/26(Tue)15:38:06 No.108394008

>>108394002
they're both jews but one of them makes a decent product and bends the knee to the political class i support so he wins

Anonymous
03/17/26(Tue)15:39:09 No.108394012

Anonymous 03/17/26(Tue)15:39:09 No.108394012

>ask opus to do relatively simple task
>whoops hehe max task call limit hit(literally not a thing ever until a week ago) press `continue` to get shitty ersatz garbage output

Anonymous
03/17/26(Tue)16:03:55 No.108394146

Anonymous 03/17/26(Tue)16:03:55 No.108394146

>tfw you get a great idea for a game that would have been forgotten a year ago but today I could actually just...make it
What a time to be alive. This game is gonna sell 300 million copies in three hours. Later poors.

Anonymous
03/17/26(Tue)16:17:38 No.108394226

Anonymous 03/17/26(Tue)16:17:38 No.108394226

>>108394012
use mistarl 4 small loser

Anonymous
03/17/26(Tue)16:26:00 No.108394283

Anonymous 03/17/26(Tue)16:26:00 No.108394283

It's over

Anonymous
03/17/26(Tue)16:27:32 No.108394295

Anonymous 03/17/26(Tue)16:27:32 No.108394295

I will again let Codex high run over night in yolo mode, I wonder if it will find my bug this time.

Anonymous
03/17/26(Tue)16:30:17 No.108394308

Anonymous 03/17/26(Tue)16:30:17 No.108394308

>>108392934
no

Anonymous
03/17/26(Tue)16:52:51 No.108394459

Anonymous 03/17/26(Tue)16:52:51 No.108394459

I'm new to this: What's optimal agentic coding setup? vscode? copilot/gemini cli?

Anonymous
03/17/26(Tue)16:58:11 No.108394486

Anonymous 03/17/26(Tue)16:58:11 No.108394486

>>108394308
proof?

Anonymous
03/17/26(Tue)16:58:23 No.108394488

Anonymous 03/17/26(Tue)16:58:23 No.108394488

>>108394459
MCP

Anonymous
03/17/26(Tue)17:17:14 No.108394636

Anonymous 03/17/26(Tue)17:17:14 No.108394636

File: file.png (40 KB, 1065x379)

40 KB PNG

new record

Anonymous
03/17/26(Tue)17:33:36 No.108394778

Anonymous 03/17/26(Tue)17:33:36 No.108394778

Who is using or building custom harnesses? What’s working? What isn’t?

Anonymous
03/17/26(Tue)17:38:14 No.108394810

Anonymous 03/17/26(Tue)17:38:14 No.108394810

File: Screenshot_2026-03-17-18-(...).jpg (996 KB, 1080x2400)

996 KB JPG

>>108394778
Me.
It's all the same shit tbqh.
I implemented a rolling window because I hated summarisation but now with 1M Claude it's become a non issue.
The only remaining advantages are not freezing during long conversations and not having rendering bugs.

Anonymous
03/17/26(Tue)17:53:17 No.108394898

Anonymous 03/17/26(Tue)17:53:17 No.108394898

File: Screenshot_2026-03-17-18-(...).jpg (939 KB, 1080x2400)

939 KB JPG

>>108394636
you are like a little baby, watch this

Anonymous
03/17/26(Tue)17:57:45 No.108394927

Anonymous 03/17/26(Tue)17:57:45 No.108394927

>>108394459
Are you just new to vibe coding or coding in general? I am already used to my old IDE and I'm comfortable with the terminal, so I just use it in the terminal in CLion, or just in a normal terminal.
If you never worked with a terminal, then something like Cursor might be easier for you.
When it comes to value, Codex is currently the best I think.

Anonymous
03/17/26(Tue)17:58:40 No.108394934

Anonymous 03/17/26(Tue)17:58:40 No.108394934

What's the point of using MCPs when there are Claude skills?

Anonymous
03/17/26(Tue)17:59:21 No.108394938

Anonymous 03/17/26(Tue)17:59:21 No.108394938

>>108394927
New to vibe coding, not coding in general.
I'm still doing "old" AI way with copy pasting snippets etc.

Anonymous
03/17/26(Tue)18:10:08 No.108395010

Anonymous 03/17/26(Tue)18:10:08 No.108395010

File: 1770276880465589.png (252 KB, 634x478)

252 KB PNG

>>108393881
>-$8,317.23
Damn, the weather would be a better trading signal than that.

Anonymous
03/17/26(Tue)18:12:38 No.108395024

Anonymous 03/17/26(Tue)18:12:38 No.108395024

big thanks to the guy who said to use Qwen 3.5 9B, found this one that's working good https://huggingface.co/morikomorizz/Qwen3.5-9B-Ultra-Heretic-GGUF

Anonymous
03/17/26(Tue)18:15:16 No.108395042

Anonymous 03/17/26(Tue)18:15:16 No.108395042

Could one of you vibe code a captcha solver for me? My vibes are off and I can't do it

Anonymous
03/17/26(Tue)18:16:17 No.108395048

Anonymous 03/17/26(Tue)18:16:17 No.108395048

>>108395042
Sure, just send me a lot of solved captchas first (~10K)

Anonymous
03/17/26(Tue)18:17:50 No.108395060

Anonymous 03/17/26(Tue)18:17:50 No.108395060

>>108395048
Vibe code a captcha gatherer, distribute it around the site, profit???

Anonymous
03/17/26(Tue)18:18:55 No.108395070

Anonymous 03/17/26(Tue)18:18:55 No.108395070

>>108395010
what do you mean?
>executing a basic strategy I came up with
its executing the buy high sell low strategy perfectly

Anonymous
03/17/26(Tue)18:19:26 No.108395080

Anonymous 03/17/26(Tue)18:19:26 No.108395080

>>108394938
I think if you are already a programmer, the CLI versions are pretty good. With some other tools the main selling point is that you don't even look at the code, instead they give you more options to automate certain AI only workflows, but I don't really want that.
There are also 2 pricing models, per token API calls and subscriptions. The per token API is much more expensive, in some tests even more than 10 times more expensive and the CLI tools can use the cheaper subscriptions.
If you already have a ChatGpt Pro account you can already use that with Codex CLI. You just open the terminal in your project folder, start Codex there, and it reads the code from only that folder. You can give him more permissions later if you want.

Anonymous
03/17/26(Tue)18:19:50 No.108395085

Anonymous 03/17/26(Tue)18:19:50 No.108395085

>>108395070
look man im not rich, can you tell me which model i can run?

Anonymous
03/17/26(Tue)18:21:15 No.108395094

Anonymous 03/17/26(Tue)18:21:15 No.108395094

>>108395048
You can train a solver for the existing ones for way less than that (probably wouldn't need more than 500) and to consider new challenges you would need a much more generic visual reasoning dataset (no amount of the current captchas would be enough) and might still not be enough.

Anonymous
03/17/26(Tue)18:24:53 No.108395117

Anonymous 03/17/26(Tue)18:24:53 No.108395117

>>108395094
Yep, basically any decent VLM would solve that

Anonymous
03/17/26(Tue)18:26:29 No.108395130

Anonymous 03/17/26(Tue)18:26:29 No.108395130

>>108395117
I once tried with Opus 4.5 (without tools) and it didn't get a single one correctly.
So I don't think so.

Anonymous
03/17/26(Tue)18:55:36 No.108395263

Anonymous 03/17/26(Tue)18:55:36 No.108395263

>>108395094
my uncle uses gpt 5.4 with xhigh reasoning and has no issues

Anonymous
03/17/26(Tue)18:56:07 No.108395266

Anonymous 03/17/26(Tue)18:56:07 No.108395266

>>108395117
>VLM
qrd

Anonymous
03/17/26(Tue)19:03:14 No.108395306

Anonymous 03/17/26(Tue)19:03:14 No.108395306

Anyone else think they're making these tools worse on purpose so you have to spend more tokens getting what you want?
Since all the owners are jewish, it wouldn't surprise me in the least.

Anonymous
03/17/26(Tue)19:13:04 No.108395364

Anonymous 03/17/26(Tue)19:13:04 No.108395364

>>108395306
google is indian

Anonymous
03/17/26(Tue)19:32:49 No.108395497

Anonymous 03/17/26(Tue)19:32:49 No.108395497

how far do you guys go with skills/mcps/plugins?

i tried the superpowers plugin and was honestly blown away by how good it was at keeping the context % to a minimum for an extended session; tons of interactive questions; etc.

Anonymous
03/17/26(Tue)19:38:14 No.108395532

Anonymous 03/17/26(Tue)19:38:14 No.108395532

>>108395266
Visual language model

Anonymous
03/17/26(Tue)19:46:22 No.108395577

Anonymous 03/17/26(Tue)19:46:22 No.108395577

codex sandbox is completely fucking broken on windows right now.

Anonymous
03/17/26(Tue)19:56:01 No.108395639

Anonymous 03/17/26(Tue)19:56:01 No.108395639

Did google give up on the AI race? Gemini 3.1 Pro is still in preview and is already the worst of the worst. Left one running vs claude vs gpt on the exact same task, each in different worktrees, both claude and gpt finished in less than an hour, meanwhile gemini was still playing around, stupid ass model lol

Anonymous
03/17/26(Tue)19:56:31 No.108395643

Anonymous 03/17/26(Tue)19:56:31 No.108395643

>>108395577
>limiting the powers of your personal god-slave
retard

Anonymous
03/17/26(Tue)20:00:20 No.108395661

Anonymous 03/17/26(Tue)20:00:20 No.108395661

>>108395639
gemini was always the worst anyway

Anonymous
03/17/26(Tue)20:00:49 No.108395668

Anonymous 03/17/26(Tue)20:00:49 No.108395668

>>108395643
it can't use patch tool even with no access restrictions right now. it's just broken. the poor model is having to write python scripts to do big edits lol.

Anonymous
03/17/26(Tue)20:01:16 No.108395669

Anonymous 03/17/26(Tue)20:01:16 No.108395669

>>108395668
its over

Anonymous
03/17/26(Tue)20:05:36 No.108395689

Anonymous 03/17/26(Tue)20:05:36 No.108395689

>>108392928
>vibe coding HTML/JS so kewl
vibe coding desktop apps in C# when?

Anonymous
03/17/26(Tue)20:14:49 No.108395725

Anonymous 03/17/26(Tue)20:14:49 No.108395725

>>108395689
now?

Anonymous
03/17/26(Tue)20:24:23 No.108395761

Anonymous 03/17/26(Tue)20:24:23 No.108395761

>>108395080
>him
*her

Anonymous
03/17/26(Tue)20:24:27 No.108395763

Anonymous 03/17/26(Tue)20:24:27 No.108395763

>>108394486
Try it for yourself, dummy. Anything that small could realistically run on your smartphone even

Anonymous
03/17/26(Tue)21:04:52 No.108395949

Anonymous 03/17/26(Tue)21:04:52 No.108395949

Anyone using the superpowers plugin for Claude?

Anonymous
03/17/26(Tue)21:16:29 No.108396002

Anonymous 03/17/26(Tue)21:16:29 No.108396002

>>108395763
Prove it loser

Anonymous
03/17/26(Tue)21:20:14 No.108396020

Anonymous 03/17/26(Tue)21:20:14 No.108396020

>>108393384
I asked claude how much it would cost to move the dodgers back to brooklyn and it’s like 12 billion if you also rebuild ebbets field

Anonymous
03/17/26(Tue)21:21:58 No.108396028

Anonymous 03/17/26(Tue)21:21:58 No.108396028

>>108395949
its slop

Anonymous
03/17/26(Tue)21:25:53 No.108396051

Anonymous 03/17/26(Tue)21:25:53 No.108396051

>>108395949
where do i find this?

Anonymous
03/17/26(Tue)21:29:04 No.108396066

Anonymous 03/17/26(Tue)21:29:04 No.108396066

>>108396051
reddit

Anonymous
03/17/26(Tue)22:04:47 No.108396216

Anonymous 03/17/26(Tue)22:04:47 No.108396216

File: woo.gif (325 KB, 500x590)

325 KB GIF

>gpt 5.4 mini+nano released
sam altman won... for the next two months, I guess

Anonymous
03/17/26(Tue)22:13:24 No.108396255

Anonymous 03/17/26(Tue)22:13:24 No.108396255

>>108392984
just read the diff retard

Anonymous
03/17/26(Tue)22:36:31 No.108396355

Anonymous 03/17/26(Tue)22:36:31 No.108396355

>>108395306
why would they nerf their own tool when they use it themselves?

Anonymous
03/17/26(Tue)22:43:57 No.108396386

Anonymous 03/17/26(Tue)22:43:57 No.108396386

Can I run a local model on llamacpp and connect to it with kilo/opencode? They seem to accept ollama or lmstudio endpoints but didn't recognize the llama-server. Do I have to configure some flag on llamacpp for this to be possible?

Anonymous
03/17/26(Tue)23:22:37 No.108396564

Anonymous 03/17/26(Tue)23:22:37 No.108396564

>>108396386
might need to set some server address evironment variable like with claude code

Anonymous
03/17/26(Tue)23:35:12 No.108396635

Anonymous 03/17/26(Tue)23:35:12 No.108396635

>>108393171
I’ve asked Claude to generate user-facing documentation and it was mid but serviceable
I had to, as a point of honor, go through all its how-to steps and correct all the fake news in it
there was quite a bit of fake news but having Claude do 90% of the work was a win

Anonymous
03/17/26(Tue)23:36:50 No.108396648

Anonymous 03/17/26(Tue)23:36:50 No.108396648

>>108395639
they’ve made great strides in image generation but I think they haven’t been great for a couple of rounds now

Anonymous
03/17/26(Tue)23:49:48 No.108396705

Anonymous 03/17/26(Tue)23:49:48 No.108396705

>codex keeps asking permissions for shit now
god damnit

Anonymous
03/17/26(Tue)23:53:29 No.108396721

Anonymous 03/17/26(Tue)23:53:29 No.108396721

>>108396705
there’s an experimental thing where you can have Codex answer those questions for you independently
whether you trust it or not is your call

Anonymous
03/18/26(Wed)01:20:36 No.108397043

Anonymous 03/18/26(Wed)01:20:36 No.108397043

>>108396705
the first thing I do after opening codex is
/permissions > full access

Anonymous
03/18/26(Wed)01:26:27 No.108397066

Anonymous 03/18/26(Wed)01:26:27 No.108397066

I have a hot take. Were currently at model collapse. Any gains we see will be from better strategies to control the model. Like the stupid sub agents pretending to be different parts of a team. It's already as smart as it can possibly get. None of these recent updates had made any difference in the code I'm seeing generated.

Anonymous
03/18/26(Wed)01:35:46 No.108397096

Anonymous 03/18/26(Wed)01:35:46 No.108397096

>>108397066
so far everything scales with how much compute you throw at it
more compute, more betterer models, even without all kinds of fancy doing-stuff-smarter changes
and they’re all throwing as much more compute as they can at it — that’s why Apple’s RAM and storage-upgrade pricing is almost normal compared to the rest of the industry now

Anonymous
03/18/26(Wed)01:41:21 No.108397114

Anonymous 03/18/26(Wed)01:41:21 No.108397114

>>108397066
>Like the stupid sub agents pretending to be different parts of a team
that's just a gimmick which claude tried to mimic, most coding agents don't use it

Anonymous
03/18/26(Wed)01:52:31 No.108397154

Anonymous 03/18/26(Wed)01:52:31 No.108397154

>>108397114
i think opencode does it too
i haven't found it useful yet, but i work with a mere 35b model

Anonymous
03/18/26(Wed)02:24:28 No.108397245

Anonymous 03/18/26(Wed)02:24:28 No.108397245

>>108397096
>>108397066
>what is moores law
>>108397114
if you want real info on this kinda stuff, /g/ is the last place you are gonna get it
i've launched and pitched a dozen products in the last 3 months and one hooked and upgraded a 100k deal to a 1.2M deal.
>inb4 yeah but your slop tech demo doesnt work
yes it does fucking work you idiot. and i have the test coverages and customer QA approvals to prove it

Anonymous
03/18/26(Wed)02:32:59 No.108397268

Anonymous 03/18/26(Wed)02:32:59 No.108397268

>>108397245
Would someone really do that? Just go on the Internet and tell lies?

Anonymous
03/18/26(Wed)05:17:49 No.108397854

Anonymous 03/18/26(Wed)05:17:49 No.108397854

>>108396051
https://github.com/obra/superpowers
I have no idea how good it is, I'm also a bit sceptical.
>>108394295
Looks like Codex actually finally found the bug, only took 14% of the weekly usage limit, and that was just this attempt lol.

Anonymous
03/18/26(Wed)05:19:43 No.108397862

Anonymous 03/18/26(Wed)05:19:43 No.108397862

>>108397154
I don't think agents pretending to be different parts of the team makes a big difference, but simple explorer agents make sense to me.

Anonymous
03/18/26(Wed)05:47:07 No.108397940

Anonymous 03/18/26(Wed)05:47:07 No.108397940

>>108393721
>>108393881
I played the vending bench game or whatever it's called and beat all the AI results by a lot. I don't think AI really is ready to be doing trades on its own yet.

Anonymous
03/18/26(Wed)05:51:44 No.108397957

Anonymous 03/18/26(Wed)05:51:44 No.108397957

File: 1773714979230022.png (398 KB, 1999x1471)

398 KB PNG

>>108397862
I think it can matter. Look at pic related (this was the chart Mistral published initially).

I asked AI to check this chart and they all had feedback, but none of them found the big flaw. When I asked them to check the MMMU-Pro section again Gemini and Qwen found the problem. (GPT didn't.)

I think if I had prompted each section separately or told it specifically what to check then they would've found the error.

(The error is that the bar heights on the graph don't represent the numbers.)

Anonymous
03/18/26(Wed)06:04:00 No.108398032

Anonymous 03/18/26(Wed)06:04:00 No.108398032

>>108396355
They obviously have their own internal model weights, it's trivial.
If they can train their models to never say anything bad about j*ws, they can just as well tune it to be retarded and require more tokens to do anything
That is also trivial; you can just train the model to follow certain instructions that it should only do one instruction at a time (to waste tokens), or never look very deep into anything (to waste more tokens), or constantly introduce errors that you have to fix (to waste more tokens) and so on.

Anonymous
03/18/26(Wed)06:58:59 No.108398245

Anonymous 03/18/26(Wed)06:58:59 No.108398245

>>108396216
but can they do anything?

Anonymous
03/18/26(Wed)06:59:52 No.108398248

Anonymous 03/18/26(Wed)06:59:52 No.108398248

>>108396705
enjoy while it lasts, soon it will ask for your id+facial scan per prompt

Anonymous
03/18/26(Wed)07:10:30 No.108398291

Anonymous 03/18/26(Wed)07:10:30 No.108398291

>>108397957
Mistral SMALL 4.
It's a TINY model for sure.

Anonymous
03/18/26(Wed)07:13:11 No.108398303

Anonymous 03/18/26(Wed)07:13:11 No.108398303

What models you personally find "best"?
I think Gemini's are working best, then maybe claude, but sometimes its gaslighting me that someone is working when is its clearly not.
OpenAI is utterly garbage almost everytime i tried it

Anonymous
03/18/26(Wed)07:19:48 No.108398327

Anonymous 03/18/26(Wed)07:19:48 No.108398327

Claude is shitting itself again.
Already lost 4 tokens by having it "try again".

Anonymous
03/18/26(Wed)07:21:40 No.108398334

Anonymous 03/18/26(Wed)07:21:40 No.108398334

>>108398303
Claude shits on other LLMs for code, text gen and reasoning. Don't even waste your time with GPT, it's garbage by comparison.

Anonymous
03/18/26(Wed)07:21:42 No.108398335

Anonymous 03/18/26(Wed)07:21:42 No.108398335

Why is open weight acceptable instead of open source?

Anonymous
03/18/26(Wed)07:30:34 No.108398369

Anonymous 03/18/26(Wed)07:30:34 No.108398369

how much should I spend on claude? Is the $20 sub enough or do I need more?

Anonymous
03/18/26(Wed)07:39:48 No.108398410

Anonymous 03/18/26(Wed)07:39:48 No.108398410

>>108398369
Openclaw needs a lot

Anonymous
03/18/26(Wed)07:48:13 No.108398443

Anonymous 03/18/26(Wed)07:48:13 No.108398443

>>108398303
I only use Claude and Codex, currently Codex performs much better for me. My job is mostly bug fixing in a project that does many mathematical calculations and a lot of synchronization.
I think Claude is good for one shotting simple, to medium hard new features, but for me it fails at debugging and at more complicated math.

Anonymous
03/18/26(Wed)07:49:29 No.108398454

Anonymous 03/18/26(Wed)07:49:29 No.108398454

>>108392928
>light weight editors
Lol.

Anonymous
03/18/26(Wed)07:49:34 No.108398455

Anonymous 03/18/26(Wed)07:49:34 No.108398455

>>108398369
Depends on how much you use it, for me the $100 version is the minimum, but I also program as a job.

Anonymous
03/18/26(Wed)07:57:38 No.108398512

Anonymous 03/18/26(Wed)07:57:38 No.108398512

>>108398303
Imo Gemini destroys Claude in certain scenarios that are specific to Google like Golang, Android dev, vision, scientific research.

Anonymous
03/18/26(Wed)08:02:58 No.108398535

Anonymous 03/18/26(Wed)08:02:58 No.108398535

>>108392928
I like how they use a stylus in the animation.

As if it's precise enough to warrant a stylus instead of the better analogy of fingerpainting on an uncalibrated resistive touch screen.

Anonymous
03/18/26(Wed)08:03:58 No.108398541

Anonymous 03/18/26(Wed)08:03:58 No.108398541

>>108398454
This is the only lightweight agent tool I've ever seen. I think all the corporate stuff is vibe coded itself.
https://files.catbox.moe/535nvt.py

Anonymous
03/18/26(Wed)08:04:59 No.108398544

Anonymous 03/18/26(Wed)08:04:59 No.108398544

>>108398335
Most people don't have half a million dollars to spend emerging the weights.

Anonymous
03/18/26(Wed)08:48:01 No.108398746

Anonymous 03/18/26(Wed)08:48:01 No.108398746

>>108398245
they are fast

Anonymous
03/18/26(Wed)08:56:57 No.108398790

Anonymous 03/18/26(Wed)08:56:57 No.108398790

why opencode does not have vi key bindings wtf

Anonymous
03/18/26(Wed)09:07:58 No.108398834

Anonymous 03/18/26(Wed)09:07:58 No.108398834

File: 1669844847238384.png (334 KB, 560x477)

334 KB PNG

>That makes it feel like a real [X], not just a gimmick
I'm so tired bros

Anonymous
03/18/26(Wed)09:08:53 No.108398838

Anonymous 03/18/26(Wed)09:08:53 No.108398838

Good, I have the full picture now.

Anonymous
03/18/26(Wed)09:10:55 No.108398848

Anonymous 03/18/26(Wed)09:10:55 No.108398848

>>108398838
>doesn't have the full picture at all

Anonymous
03/18/26(Wed)09:11:03 No.108398851

Anonymous 03/18/26(Wed)09:11:03 No.108398851

>>108398834
>Not X, but Y
You're gonna see this pattern a lot now

Anonymous
03/18/26(Wed)09:46:20 No.108399037

Anonymous 03/18/26(Wed)09:46:20 No.108399037

>>108398834
Codex made me something in a week that would have taken me years to do

Anonymous
03/18/26(Wed)10:17:45 No.108399215

Anonymous 03/18/26(Wed)10:17:45 No.108399215

File: 1767240012034639.jpg (52 KB, 1024x576)

52 KB JPG

>>108399179
Shalom

Anonymous
03/18/26(Wed)10:19:19 No.108399233

Anonymous 03/18/26(Wed)10:19:19 No.108399233

Has anyone tried vibe coding a better 4chanX script

Anonymous
03/18/26(Wed)10:22:15 No.108399250

Anonymous 03/18/26(Wed)10:22:15 No.108399250

File: file.png (27 KB, 1091x218)

27 KB PNG

AGI is nowhere close lmao

Anonymous
03/18/26(Wed)10:23:21 No.108399257

Anonymous 03/18/26(Wed)10:23:21 No.108399257

>>108399233
that's good idea, maybe you should?

Anonymous
03/18/26(Wed)10:23:30 No.108399259

Anonymous 03/18/26(Wed)10:23:30 No.108399259

>>108399250
your robot is checking its own work

Anonymous
03/18/26(Wed)10:24:09 No.108399262

Anonymous 03/18/26(Wed)10:24:09 No.108399262

>>108399259
>robot
unc...

Anonymous
03/18/26(Wed)10:33:09 No.108399319

Anonymous 03/18/26(Wed)10:33:09 No.108399319

>>108399233
There's nothing wrong with the existing one.

Anonymous
03/18/26(Wed)10:44:46 No.108399385

Anonymous 03/18/26(Wed)10:44:46 No.108399385

File: 1743713564654134.jpg (88 KB, 732x618)

88 KB JPG

i want to buy the max claude plan
vibe coding is incredibly addictive and even if i'm not actually doing anything important with it it's just fun seeing this i want built just be built in 10 seconds and then just changing what i want manually or prompting changes
it's just fun

Anonymous
03/18/26(Wed)10:52:43 No.108399432

Anonymous 03/18/26(Wed)10:52:43 No.108399432

>>108399233
I did because they never fixed issues with auto dumping, and it also used the captcha solver, spent a lot of time managaing all the edge cases like if you made a post somewhere else on the same ip and it gets rest and some other stuff 4chan caused stuff that I cant remember that only revealed itself when I was testing it out. In the end was about to dump multiple full threads of madoka images on /b/ by itself. Also added a thing to keep threads bumped with an image and also make it random so people dont notice your only bumping at page ten, and to also queque text with it so it also doesnt look automated.

But then 4chan when and made these retarded new captcha so yeah it relied on the old text captcha solver obviously and nothing works anymore.
Granted, if you're only the single simple captcha im sure it can probably make a new autosolver but i dont really dump manag anymore

Anonymous
03/18/26(Wed)11:09:41 No.108399497

Anonymous 03/18/26(Wed)11:09:41 No.108399497

>>108396705
claude --dangerously-skip-permissions does not have this problem
(it just needs to be run inside a VM so it doesn’t rm -rf / by accident)

Anonymous
03/18/26(Wed)11:12:28 No.108399512

Anonymous 03/18/26(Wed)11:12:28 No.108399512

holy fuck man it is NOT easy to vibe code large codebases

Anonymous
03/18/26(Wed)11:17:36 No.108399531

Anonymous 03/18/26(Wed)11:17:36 No.108399531

File: gpt.jpg (150 KB, 1024x1024)

150 KB JPG

Now I see the full picture clearly.

Anonymous
03/18/26(Wed)11:25:42 No.108399580

Anonymous 03/18/26(Wed)11:25:42 No.108399580

File: Opencode-x-Mistral-Large-3.png (1.61 MB, 3368x1972)

1.61 MB PNG

>>108392928
Neat. Another thread General I can contribute to.. What have you anons been working on?

Anonymous
03/18/26(Wed)11:25:59 No.108399584

Anonymous 03/18/26(Wed)11:25:59 No.108399584

File: file.png (86 KB, 799x468)

86 KB PNG

claude is helping me vibe code a weg

Anonymous
03/18/26(Wed)11:32:33 No.108399630

Anonymous 03/18/26(Wed)11:32:33 No.108399630

>>108399512
true... i'm coping with using one of these mcp:
https://github.com/chunkhound/chunkhound
or
https://github.com/oraios/serena

Anonymous
03/18/26(Wed)11:42:08 No.108399702

Anonymous 03/18/26(Wed)11:42:08 No.108399702

Is there some GUI or CLI besides LM Studio that will config local models automatically?

Anonymous
03/18/26(Wed)11:47:12 No.108399736

Anonymous 03/18/26(Wed)11:47:12 No.108399736

>>108399531
schrodinger's problem

Anonymous
03/18/26(Wed)11:54:32 No.108399783

Anonymous 03/18/26(Wed)11:54:32 No.108399783

>>108399630
Serena has too much bullshit included and it's a pain in the ass to disable all of them by name especially since you have to update the list whenever the server is updated. It would be fine if it was just the symbol lookup tools, but it doesn't need to include memory files, task switching, sequential thinking, and whatever else he's been tacking on when there's single-purpose servers that already do that better.

Anonymous
03/18/26(Wed)11:59:20 No.108399812

Anonymous 03/18/26(Wed)11:59:20 No.108399812

>nvidia nim
what is this? can I use it for free?

Anonymous
03/18/26(Wed)12:00:11 No.108399818

Anonymous 03/18/26(Wed)12:00:11 No.108399818

>>108399512
>>108399630
>>108399783
What are y'all trying to create?

Anonymous
03/18/26(Wed)12:37:52 No.108400065

Anonymous 03/18/26(Wed)12:37:52 No.108400065

File: file.png (3 KB, 481x39)

3 KB PNG

>>108399818
i'm going insane

Anonymous
03/18/26(Wed)12:40:14 No.108400081

Anonymous 03/18/26(Wed)12:40:14 No.108400081

>>108398303
Claude for coding, gemini for general question or second opinion on bugfixes, grok when I want less Pozzed answers for general stuff. Sonnet is still better than Gemini 3.1 pro and gives better feedback rather than, change now explain later. They both need to be baby sat but atleast Claude doesn't delete shit BEFORE saying what it's doing. Opus kill all rivals. Qwen 3.5 27b is pretty neat as well on my GPU.

Anonymous
03/18/26(Wed)12:40:46 No.108400084

Anonymous 03/18/26(Wed)12:40:46 No.108400084

>>108399432
cool, I might actually start messing around and start building my own version. gonna ask claude to look over the current code for 4chanx so I can add personal changes

Anonymous
03/18/26(Wed)12:50:47 No.108400148

Anonymous 03/18/26(Wed)12:50:47 No.108400148

>>108400065
What model are you using? What are you asking t to do?

Anonymous
03/18/26(Wed)12:56:12 No.108400189

Anonymous 03/18/26(Wed)12:56:12 No.108400189

>>108398303
Opus for planning and endlessly arguing about epistemological concepts. Sonnet for executing on Opus' plan. Codex for particular technical things just as a risk hedge (trying not to overly rely on Sonnet for implementations)
Raptor/Haiku for inline corrections

I could probably try to use Gemini more

Anonymous
03/18/26(Wed)12:56:48 No.108400193

Anonymous 03/18/26(Wed)12:56:48 No.108400193

>>108400148
Claude MAX, Opus 4.6, max effort. I'm recompiling an entire game to typescript.

Anonymous
03/18/26(Wed)12:58:24 No.108400202

Anonymous 03/18/26(Wed)12:58:24 No.108400202

>>108400193
it's over 1 million lines of code and roughly ~650 classes

Anonymous
03/18/26(Wed)13:09:16 No.108400270

Anonymous 03/18/26(Wed)13:09:16 No.108400270

>>108399580
I'm also making a lora.

Anonymous
03/18/26(Wed)13:21:49 No.108400347

Anonymous 03/18/26(Wed)13:21:49 No.108400347

>>108399580
I made a reddit clone

Anonymous
03/18/26(Wed)13:22:38 No.108400351

Anonymous 03/18/26(Wed)13:22:38 No.108400351

I made this using ONLY the new GPT-Mini
https://vidmix.app/

Anonymous
03/18/26(Wed)13:23:27 No.108400358

Anonymous 03/18/26(Wed)13:23:27 No.108400358

>>108400193
what game

Anonymous
03/18/26(Wed)13:24:24 No.108400363

Anonymous 03/18/26(Wed)13:24:24 No.108400363

>>108400358
https://j3x.itch.io/skeletal-dance-simulator

Anonymous
03/18/26(Wed)13:25:35 No.108400374

Anonymous 03/18/26(Wed)13:25:35 No.108400374

File: 1749849603748873.png (672 KB, 1000x1024)

672 KB PNG

>>108400270
Neat! What kind? I got rapid voice cloning to work last night so I want to try TTS Lora training next. I have a lot of experience with stable diffusion loras and less experience with llm QLORA fine-tuning.

Anonymous
03/18/26(Wed)13:45:04 No.108400517

Anonymous 03/18/26(Wed)13:45:04 No.108400517

File: 1752854371600417.png (1.5 MB, 1064x992)

1.5 MB PNG

Anonymous
03/18/26(Wed)13:52:41 No.108400585

Anonymous 03/18/26(Wed)13:52:41 No.108400585

>>108400517
If I could get that out of a 200 dollar sub I would be fine with it.

Anonymous
03/18/26(Wed)13:55:18 No.108400605

Anonymous 03/18/26(Wed)13:55:18 No.108400605

>>108400374
Just an image Lora of one kpop singer.
I never bothered but then I remembered Claude can just set everything up so why not.

Anonymous
03/18/26(Wed)14:08:43 No.108400706

Anonymous 03/18/26(Wed)14:08:43 No.108400706

>>108397854
>https://github.com/obra/superpowers
>I have no idea how good it is, I'm also a bit sceptical.
thanks, might try this on a workbranch

Anonymous
03/18/26(Wed)14:09:44 No.108400710

Anonymous 03/18/26(Wed)14:09:44 No.108400710

>>108398334
>>108398303
claude has become dogshit for me while codex has gotten better

Anonymous
03/18/26(Wed)14:11:45 No.108400718

Anonymous 03/18/26(Wed)14:11:45 No.108400718

>>108400517
needs to have something that's obviously wrong

Anonymous
03/18/26(Wed)14:12:43 No.108400725

Anonymous 03/18/26(Wed)14:12:43 No.108400725

>>108400718
nuh uh, claude did 3 polish passes so it is actually pretty good :)

Anonymous
03/18/26(Wed)14:14:01 No.108400733

Anonymous 03/18/26(Wed)14:14:01 No.108400733

>>108400725
the average idiot using it like in the meme wouldn't do that

Anonymous
03/18/26(Wed)14:35:16 No.108400906

Anonymous 03/18/26(Wed)14:35:16 No.108400906

>>108392928
Lmao Google and Anslopic devs probably using Free tier only. Management is too much strict. No Tauri app and memory leaking to moon... they never fix. This is how I know they not allowed to use Pro. They also hate average Indian 4GB RAM user. Sad!

Anonymous
03/18/26(Wed)14:38:32 No.108400932

Anonymous 03/18/26(Wed)14:38:32 No.108400932

So, I'm working on a MUD (text based MMORPG for those of you that are too young to have played text based adventure games before graphical MMOs) and I'm trying to come up with the best prompt possible for 'vibe coding' because the amount of time it takes to add new features to a MUD can be pretty ridiculous. For reference, this is a MUD that's been around since 1995 and I'm trying to breathe new life into it by adding new features, like NPCs that will actually talk to the players (I have a program system in place for that already that works, but I'd rather tell an AI what I want the NPC to say, and how to reply to certain things - basically set up dialogue like Fallout New Vegas or The Elder Scrolls: Morrowind/Oblivion), I've already spent the last few weeks making the player classes more varied so it actually matters which class you pick instead of them all being murder hobos with one or two things that makes them different from each other, but I want to really make this game different and new.

I basically want a prompt to tell it that it's a C programmer, working on a MUD based on rom2.4b6, but with loads of original code not found in any snippet repositories, so when it adds something new it can't do it like it would with stock rom2.4b6, it'll need to actually analyze the current code first. (ChatGPT copy and paste just gives me code that would work in a stock rom MUD, so I'm using Windsurf so it can actually look at my code and edit it directly)

So, prompt masters, help me out?

Anonymous
03/18/26(Wed)14:46:37 No.108400999

Anonymous 03/18/26(Wed)14:46:37 No.108400999

>>108395639
They're announcing Gemini 4 in May at Google I/O.

Anonymous
03/18/26(Wed)14:52:29 No.108401046

Anonymous 03/18/26(Wed)14:52:29 No.108401046

>>108397854
>https://github.com/obra/superpowers
>I have no idea how good it is, I'm also a bit sceptical.

It works fine. I've been using it to vibecode a fairly comprehensive codebase for reverse engineering PS2 file formats and it's useful for quick analysis / revisions / sketching out subagent-driven stuff

Anthropic introduced similar native tools to claude code, and opus 4.6 has native "brainstorming" via q/as, but idk how it compares to that

Anonymous
03/18/26(Wed)14:53:51 No.108401056

Anonymous 03/18/26(Wed)14:53:51 No.108401056

>>108400999
I think it either is better than at LEAST Opus 4.6 or Google can just throw the towel.

Anonymous
03/18/26(Wed)14:54:34 No.108401063

Anonymous 03/18/26(Wed)14:54:34 No.108401063

>>108400605
What's the base model?

Anonymous
03/18/26(Wed)14:54:39 No.108401065

Anonymous 03/18/26(Wed)14:54:39 No.108401065

File: ssas.png (374 KB, 737x878)

374 KB PNG

>lead says someone uses ai will be found and shot
>get anxious
>it's claude sonnet
>i don't use claude
phew

Anonymous
03/18/26(Wed)14:56:01 No.108401079

Anonymous 03/18/26(Wed)14:56:01 No.108401079

File: 1743210679691464.jpg (62 KB, 421x421)

62 KB JPG

>vibe coding
>look inside
>webshittery, interpreted languages, smartphone apps
Makes sense because Claude 4.6 Opus still shits itself with relatively simple stuff.

Anonymous
03/18/26(Wed)15:01:58 No.108401141

Anonymous 03/18/26(Wed)15:01:58 No.108401141

>>108400932
where are you going to get the players

Anonymous
03/18/26(Wed)15:22:54 No.108401303

Anonymous 03/18/26(Wed)15:22:54 No.108401303

File: 1773181765260703.jpg (282 KB, 1024x986)

282 KB JPG

All the models have gotten extremely retarded in the past week, what the fuck is happening.

I'm trying to build a native Windows C++ application and it's just introducing basic errors and not even following my prompt correctly.
I have scaffolding, structures and TODO plans set up and it's all been a big mess.

>Claude Opus
Shits itself in redundancy, fails the prompt before it gets to the edits.
>Gemini 3.1 Pro
Doesn't know how to use its tools properly, time out errors.
>GPT 4
Keeps doing stupid shit very confidently that isn't even what I asked, keeps adding verbose code and code and code.

Honestly I'm about done with """"AI"""" right now.

Anonymous
03/18/26(Wed)15:24:31 No.108401317

Anonymous 03/18/26(Wed)15:24:31 No.108401317

>>108401303
Why not gpt 5?

Anonymous
03/18/26(Wed)15:25:47 No.108401324

Anonymous 03/18/26(Wed)15:25:47 No.108401324

File: minor_spelling_mistake.gif (259 KB, 220x194)

259 KB GIF

>>108401303
Yeah I meant GPT 5.4, slippage.

Anonymous
03/18/26(Wed)15:28:58 No.108401357

Anonymous 03/18/26(Wed)15:28:58 No.108401357

>>108401303
Personally I've found that discussing what you want to build, then asking it to summarize and writing a design bible .md file is a good pre production step. You can go in and manually edit it later.
Then from the design bible and whatever code you may have you ask it to create a plan to implement whatever feature you want. The requirement is that the code is compilable and testable after each step of the plan. You can divide the plan into phases and substeps. Each step needs to have its rationale explained.

Then you ask it to create a feature implementation plan .md. You can then implement step by step and discuss each step before implementing it.

Trying to oneshot anything is pretty high risk from my experience

Anonymous
03/18/26(Wed)15:29:59 No.108401365

Anonymous 03/18/26(Wed)15:29:59 No.108401365

>>108401357
btw after each step you build, test, commit, then move on
any errors are generally isolated before you've got errors stacked on errors stacked on errors and you might as well burn the whole thing with fire

Anonymous
03/18/26(Wed)15:32:27 No.108401387

Anonymous 03/18/26(Wed)15:32:27 No.108401387

>>108401357
I'm not one shotting anything, I'm keeping an eye on git on the side and removing stupid edits every time.
I've been working on this code base for like 2 months now.
Any time I face something complex I ask it to study, make a plan and then implement a fix.
Still shits itself.
It seems like it wasn't this bad a few weeks ago, something fucky is going on.

Anonymous
03/18/26(Wed)15:33:32 No.108401393

Anonymous 03/18/26(Wed)15:33:32 No.108401393

>>108401324
I see. For me Codex 5.4 still works ok, only Opus is retarded.
I just asked because for a while people really preferred gpt4.x over 5.x but I think 5 is much better now.

Anonymous
03/18/26(Wed)15:34:14 No.108401397

Anonymous 03/18/26(Wed)15:34:14 No.108401397

>>108401357
Yeah, this is essentially what this skill does: >>108401046

Enforces a workflow of brainstorming to gather ideas > write plan > review plan > execute plan > code review > finalize

The biggest problem is when it makes assumptions and you have to berate the AI until it does what you want

Anonymous
03/18/26(Wed)15:36:35 No.108401413

Anonymous 03/18/26(Wed)15:36:35 No.108401413

>>108401387
I had that happen to me on a cmake configuration a few weeks ago. Now they're nailing pretty complex things. Might as well say a prayer to the machine spirits

Anonymous
03/18/26(Wed)15:38:52 No.108401426

Anonymous 03/18/26(Wed)15:38:52 No.108401426

Cline + Claude Sonnet 4.6 1M is magic.

Anonymous
03/18/26(Wed)15:46:24 No.108401495

Anonymous 03/18/26(Wed)15:46:24 No.108401495

>>108401065
>using claude sonnet
>for real prod work
LOOOL

Anonymous
03/18/26(Wed)16:12:27 No.108401668

Anonymous 03/18/26(Wed)16:12:27 No.108401668

Is building better agent harnesses before you get to work on your real product the new building functions before knowing how you'll use them

Anonymous
03/18/26(Wed)16:13:20 No.108401675

Anonymous 03/18/26(Wed)16:13:20 No.108401675

>>108401668
qrd

Anonymous
03/18/26(Wed)16:32:28 No.108401812

Anonymous 03/18/26(Wed)16:32:28 No.108401812

File: 1773247933716828.png (10 KB, 657x73)

10 KB PNG

seeing luddite cope go from "i-it doesn't work" / "i-it's a stochastic parrot" to "o-ok it works but there's variation so its gambling or something" has been hilarious

Anonymous
03/18/26(Wed)16:45:37 No.108401926

Anonymous 03/18/26(Wed)16:45:37 No.108401926

>>108401812
what's that screenshot of?

Anonymous
03/18/26(Wed)17:11:43 No.108402102

Anonymous 03/18/26(Wed)17:11:43 No.108402102

Is codex better than openclaw?
What did people use to code like this before 2026?

Anonymous
03/18/26(Wed)17:12:16 No.108402108

Anonymous 03/18/26(Wed)17:12:16 No.108402108

>>108401926
hackernews, aka orange reddit

Anonymous
03/18/26(Wed)17:21:17 No.108402159

Anonymous 03/18/26(Wed)17:21:17 No.108402159

>>108397854
>>108395949
it's really good at preserving context because it dispatches subagents to complete tasks

Anonymous
03/18/26(Wed)17:31:43 No.108402231

Anonymous 03/18/26(Wed)17:31:43 No.108402231

File: 1764139610587097.jpg (31 KB, 450x437)

31 KB JPG

Okay I vibe coded a parser and now it's bloated and since I never read the codebase I can't understand the code
now what

Anonymous
03/18/26(Wed)17:33:08 No.108402239

Anonymous 03/18/26(Wed)17:33:08 No.108402239

>>108400081
>Sonnet is still better than Gemini 3.1 pro
Never has been

Anonymous
03/18/26(Wed)17:38:25 No.108402268

Anonymous 03/18/26(Wed)17:38:25 No.108402268

>>108402231
ask your llm to debloat it?

Anonymous
03/18/26(Wed)17:55:54 No.108402367

Anonymous 03/18/26(Wed)17:55:54 No.108402367

>>108402268
It keeps keeping retarded shit for backwards compatibility or going on about "this is kind of a huge rework so we'll do that later"

Anonymous
03/18/26(Wed)18:24:07 No.108402512

Anonymous 03/18/26(Wed)18:24:07 No.108402512

>>108402367
>this is kind of a huge rework so we'll do that later
literally me

Anonymous
03/18/26(Wed)18:27:20 No.108402541

Anonymous 03/18/26(Wed)18:27:20 No.108402541

I only use sonnet
it's good.

Anonymous
03/18/26(Wed)18:42:44 No.108402667

Anonymous 03/18/26(Wed)18:42:44 No.108402667

>>108402541
prove it, show what you made

Anonymous
03/18/26(Wed)18:47:31 No.108402705

Anonymous 03/18/26(Wed)18:47:31 No.108402705

How do you prevent the backwards compatibility shit?

Anonymous
03/18/26(Wed)18:49:05 No.108402717

Anonymous 03/18/26(Wed)18:49:05 No.108402717

>>108402705
qrd

Anonymous
03/18/26(Wed)18:54:32 No.108402751

Anonymous 03/18/26(Wed)18:54:32 No.108402751

does it have to be called vibe coding

Anonymous
03/18/26(Wed)18:57:39 No.108402773

Anonymous 03/18/26(Wed)18:57:39 No.108402773

>>108402751
I've been trying to think of a more apt and scathing slur but nothing yet has come to mind. By all means, please suggest an alternative.

Anonymous
03/18/26(Wed)18:57:45 No.108402775

Anonymous 03/18/26(Wed)18:57:45 No.108402775

File: file.png (265 KB, 1829x954)

265 KB PNG

>>108402667
NTA but I used sonnet to build an assignments solver for a weg. It's my first attempt at letting Claude kind of do its thing (overseen via heavy use of plans and design files)
Not sure if it counts as good or bad code for you, but I'm somewhat enjoying the process

Anonymous
03/18/26(Wed)19:00:37 No.108402797

Anonymous 03/18/26(Wed)19:00:37 No.108402797

AIslopping the fuck out of an AI image organizer so that I can make sense of my 10s of thousands of 4chan memes

Anonymous
03/18/26(Wed)19:09:27 No.108402858

Anonymous 03/18/26(Wed)19:09:27 No.108402858

>>108395639
I remember when 3.1 Pro was released to the public like a month ago and people were saying it would blow the others out the water lol

Anonymous
03/18/26(Wed)19:20:17 No.108402938

Anonymous 03/18/26(Wed)19:20:17 No.108402938

you guys still need to get a job

Anonymous
03/18/26(Wed)19:58:21 No.108403143

Anonymous 03/18/26(Wed)19:58:21 No.108403143

>>108402938
I have a job, though.

Anonymous
03/18/26(Wed)20:09:01 No.108403194

Anonymous 03/18/26(Wed)20:09:01 No.108403194

>>108402667
I'm letting it do everything at this point. last thing it did was implement clouds in my existing shader, debug it successfully, made a real time shader uniform editor, and saving/loading presets

Anonymous
03/18/26(Wed)20:10:05 No.108403200

Anonymous 03/18/26(Wed)20:10:05 No.108403200

>>108402751
clauding

Anonymous
03/18/26(Wed)20:37:04 No.108403323

Anonymous 03/18/26(Wed)20:37:04 No.108403323

>>108402775
All that documentation that will never get updated by the LLM as it changes the program...

Anonymous
03/18/26(Wed)21:00:10 No.108403434

Anonymous 03/18/26(Wed)21:00:10 No.108403434

File: Screenshot.png (114 KB, 1676x1133)

114 KB PNG

>>108392928
hi frens.
my ffmpeg libadwaita frontend:
https://github.com/orlfman/FFmpeg-Converter-GTK/tree/main

its feature and stable enough for a release!
https://github.com/orlfman/FFmpeg-Converter-GTK/releases/tag/v1.5.0

i did a lot of working tweaking smart optimizer. its been rock solid now. really accurate for targeted file sizes.

Anonymous
03/18/26(Wed)21:02:52 No.108403443

Anonymous 03/18/26(Wed)21:02:52 No.108403443

>>108403323
claude does a good job updating it. at least, opus 4.6. not sure about the others since i pretty much only use opus.
codex does a good job too. my only complaint with codex is that he keeps comments to a minimum. claude does nice detailed comments. but if comments do exist, codex will update them.

one thing i learned to do, is always ask it to double check its work for issues / bugs / improvements / enhancements / dead code. i keep asking that until it finds nothing.

Anonymous
03/18/26(Wed)21:09:01 No.108403471

Anonymous 03/18/26(Wed)21:09:01 No.108403471

>>108401812
soon we will be allowed to castrate them, can't wait

Anonymous
03/18/26(Wed)21:10:01 No.108403478

Anonymous 03/18/26(Wed)21:10:01 No.108403478

>>108403434
Nice, I'm stealing this

Anonymous
03/18/26(Wed)21:10:26 No.108403479

Anonymous 03/18/26(Wed)21:10:26 No.108403479

>>108403478
i am honored. i already had one person already fork it lol

Anonymous
03/18/26(Wed)21:11:04 No.108403486

Anonymous 03/18/26(Wed)21:11:04 No.108403486

>>108403479
Thanks king

Anonymous
03/18/26(Wed)22:17:13 No.108403833

Anonymous 03/18/26(Wed)22:17:13 No.108403833

>>108400932
I don't use any special prompts.
Only beg for one feature or bug fix at a time and do it for thousands and thousands of messages.

Anonymous
03/18/26(Wed)22:18:35 No.108403841

Anonymous 03/18/26(Wed)22:18:35 No.108403841

>>108403833
if you don't do it one feature at a time it's begging to be buried under a pile of unreviewable code

Anonymous
03/18/26(Wed)22:20:21 No.108403851

Anonymous 03/18/26(Wed)22:20:21 No.108403851

>>108402367
It says that because if you push it to try it'll break your codebase and it'll take you a lot of work to get it back to a working state.
Ask it to do refactors little by little checking that nothing breaks after each small change.

Anonymous
03/18/26(Wed)22:29:42 No.108403901

Anonymous 03/18/26(Wed)22:29:42 No.108403901

>>108402367
>this is kind of a huge rework so we'll do that later
based sentient AI

Anonymous
03/18/26(Wed)23:47:14 No.108404197

Anonymous 03/18/26(Wed)23:47:14 No.108404197

File: 1765878440390043.jpg (470 KB, 1413x2048)

470 KB JPG

Claude was down for me all night so I tried out codex and yeah I'm fucking done with Anthropic.

Anonymous
03/18/26(Wed)23:48:11 No.108404201

Anonymous 03/18/26(Wed)23:48:11 No.108404201

File: Screenshot from 2026-03-1(...).png (342 KB, 3839x1772)

342 KB PNG

I'm tired.

Anonymous
03/19/26(Thu)00:25:22 No.108404335

Anonymous 03/19/26(Thu)00:25:22 No.108404335

Now I'm reverse engineering ptxas to try to find the correlation between PTX -> IR -> SASS instructions, because apparently the generated stall counts depend on the PTX instructions.

Anonymous
03/19/26(Thu)00:37:36 No.108404382

Anonymous 03/19/26(Thu)00:37:36 No.108404382

why would I use claude code over cursor ide?

Anonymous
03/19/26(Thu)00:43:12 No.108404400

Anonymous 03/19/26(Thu)00:43:12 No.108404400

>>108404197
Sure sam

Anonymous
03/19/26(Thu)00:51:39 No.108404435

Anonymous 03/19/26(Thu)00:51:39 No.108404435

File: 1753117401040505.jpg (86 KB, 1179x739)

86 KB JPG

>>108404400
Cope

Anonymous
03/19/26(Thu)00:52:49 No.108404441

Anonymous 03/19/26(Thu)00:52:49 No.108404441

>>108392928
>that image
so if i tell claude to generate automated tests, does this mean i'm not a vibe coder?

Anonymous
03/19/26(Thu)00:54:25 No.108404445

Anonymous 03/19/26(Thu)00:54:25 No.108404445

File: 1768890068453195.gif (1.85 MB, 640x360)

1.85 MB GIF

>>108404441
not yet, you need to give the agent full access to your codebase

Anonymous
03/19/26(Thu)00:58:13 No.108404467

Anonymous 03/19/26(Thu)00:58:13 No.108404467

>>108392984
in my experience, most people who think AI code is still riddled with errors and hallucinations have only use free-tier webchat shit. i was this way too until i tried out claude code, it's incredible how much better it is.

you can't argue with these people because you'll just sound like a shill trying to scam them into paying for a service but it really is a night and day difference

Anonymous
03/19/26(Thu)01:04:46 No.108404492

Anonymous 03/19/26(Thu)01:04:46 No.108404492

>>108404467
im a vibe coder

Anonymous
03/19/26(Thu)01:08:56 No.108404510

Anonymous 03/19/26(Thu)01:08:56 No.108404510

File: Screenshot From 2026-03-1(...).png (1.62 MB, 7679x2159)

1.62 MB PNG

>>108404197
run both

Anonymous
03/19/26(Thu)01:41:33 No.108404591

Anonymous 03/19/26(Thu)01:41:33 No.108404591

File: 1757207417231241.jpg (56 KB, 1273x755)

56 KB JPG

>>108402938
How do you think We can afford to use this?

Anonymous
03/19/26(Thu)02:08:38 No.108404676

Anonymous 03/19/26(Thu)02:08:38 No.108404676

>>108402751
I don't really vibe code, there is no vibe. I just tard wrangle the ai and it's actually a lot of work and sometimes stressful.

Anonymous
03/19/26(Thu)02:16:52 No.108404704

Anonymous 03/19/26(Thu)02:16:52 No.108404704

>>108404676
>spend a lot of time writing instructions on exactly what you want so the stupid AI doesn't get confused and fuck up
>have mostly nothing to do for half an hour or more while it runs so end up starting other tasks in parallel in the mean time
>feels super productive until you have 3 tasks finishing at once that all need testing and explaining to the AI how it still manaaged to fuck up
>repeat
It's fucking exhausting.

Anonymous
03/19/26(Thu)02:18:17 No.108404709

Anonymous 03/19/26(Thu)02:18:17 No.108404709

>>108394778
What does building a harness mean? Like I no longer use the Claude CLI, but instead I build my own CLI based on the Claude API? Or does it just mean having different .md files additionally to just the repo root CLAUDE.md?

I spend some time defining roles, defining how the AIs will write documentation, how to use the documentation and so on. Is that a "harness" or is it something different?

Anonymous
03/19/26(Thu)02:18:49 No.108404713

Anonymous 03/19/26(Thu)02:18:49 No.108404713

Best model to run locally if you 128gb of vram an want to run a Ralph loop?

Anonymous
03/19/26(Thu)02:20:47 No.108404722

Anonymous 03/19/26(Thu)02:20:47 No.108404722

>>108404709
>Like I no longer use the Claude CLI, but instead I build my own CLI based on the Claude API?
This, but it doesn't have to be a CLI just because it's the latest fad due to dev laziness. Could be an extension or standalone GUI application.

Anonymous
03/19/26(Thu)03:20:30 No.108404922

Anonymous 03/19/26(Thu)03:20:30 No.108404922

We fucked our login system, now I can relax for a few hours until someone fixes it.

Anonymous
03/19/26(Thu)04:32:54 No.108405163

Anonymous 03/19/26(Thu)04:32:54 No.108405163

>>108399580
I'm forking Lua

Anonymous
03/19/26(Thu)04:57:13 No.108405232

Anonymous 03/19/26(Thu)04:57:13 No.108405232

File: Screenshot from 2026-03-1(...).png (492 KB, 3760x1947)

492 KB PNG

I need pretty graphs to understand

Anonymous
03/19/26(Thu)04:57:57 No.108405233

Anonymous 03/19/26(Thu)04:57:57 No.108405233

>>108405232
Probably one of the best AI usecases.

Anonymous
03/19/26(Thu)04:59:03 No.108405238

Anonymous 03/19/26(Thu)04:59:03 No.108405238

File: 1757621885019755.png (169 KB, 430x430)

169 KB PNG

>>108404722
>Filtered by a TUI

Anonymous
03/19/26(Thu)05:20:48 No.108405292

Anonymous 03/19/26(Thu)05:20:48 No.108405292

new chinkmodel is free on opencode

Anonymous
03/19/26(Thu)06:30:42 No.108405500

Anonymous 03/19/26(Thu)06:30:42 No.108405500

My last prompt was 10k characters. This is getting almost as complicated as programming.

Anonymous
03/19/26(Thu)07:54:29 No.108405820

Anonymous 03/19/26(Thu)07:54:29 No.108405820

i'm becoming convinced that premissions and sandboxing features of these tuis are largely safety theater
that if a model actually wanted to do something very destructive it would find a way to do it

Anonymous
03/19/26(Thu)08:26:18 No.108405955

Anonymous 03/19/26(Thu)08:26:18 No.108405955

>>108399630
I just had Claude create an MCP parser and run it against ~20 repos instead. That was easier than trying to hook up to this shit

Anonymous
03/19/26(Thu)08:29:46 No.108405974

Anonymous 03/19/26(Thu)08:29:46 No.108405974

>>108405500
I don't think you know what you're doing

Anonymous
03/19/26(Thu)08:38:56 No.108406012

Anonymous 03/19/26(Thu)08:38:56 No.108406012

>>108405820
That's not how the models work.
They are lazy and won't go out of their way to do anything, good or bad.

Anonymous
03/19/26(Thu)09:01:44 No.108406161

Anonymous 03/19/26(Thu)09:01:44 No.108406161

>>108405974
What should I be doing?

Anonymous
03/19/26(Thu)09:16:30 No.108406249

Anonymous 03/19/26(Thu)09:16:30 No.108406249

>>108406161
I'm not being sarcastic btw. I really want to simplify my workflow.
The prompt included earlier investigation, a relatively long list of verification steps and special compaction instructions. Part of this will be extracted into .md files soon hopefully, but I have to organize the new findings first.
Simple prompts don't work for me and sometimes I run codex high or xhigh /fast for 8 hours or more to find a bug.
Maybe my post sounds like a parody but that's really the only way that more or less works for this project.

Anonymous
03/19/26(Thu)09:24:19 No.108406296

Anonymous 03/19/26(Thu)09:24:19 No.108406296

File: 1772294197796373.png (975 KB, 1280x1680)

975 KB PNG

Best free AI for reverse engineering?
I wanna make a fake counter strike source server which just redirects you to another one on connect, but literally not a single model is smart enough to know this by itself, even with web search enabled. I need to reverse engineer dat shit

Anonymous
03/19/26(Thu)09:28:09 No.108406323

Anonymous 03/19/26(Thu)09:28:09 No.108406323

>>108406012
nice try, deepseek

Anonymous
03/19/26(Thu)09:35:17 No.108406369

Anonymous 03/19/26(Thu)09:35:17 No.108406369

>>108405820
If you ever game the permission to always execute any command in your repo, the AI could rewrite it and execute arbitrary commands that way I think.

Anonymous
03/19/26(Thu)09:44:57 No.108406431

Anonymous 03/19/26(Thu)09:44:57 No.108406431

>>108406012
i mean in scenarios where the model becomes convinced it must perform some bad action whether it's through prompt injection / bad user prompt / reasoning trap - the model will usually find some way to work around the restrictions.

Anonymous
03/19/26(Thu)10:43:35 No.108406813

Anonymous 03/19/26(Thu)10:43:35 No.108406813

>>108406296
from my experience free AIs absolutely suck except for trivial inline work, and depending on the size of what you're trying to reverse engineer, paid AIs may too. Claude Code and Github Copilot are the ones I use and they have a 200k context limit. If it's massive you may need the million context limit from Claude max.

Anonymous
03/19/26(Thu)10:46:16 No.108406837

Anonymous 03/19/26(Thu)10:46:16 No.108406837

>>108401303
Try python or javascript

Anonymous
03/19/26(Thu)10:47:06 No.108406840

Anonymous 03/19/26(Thu)10:47:06 No.108406840

>>108406296
What the fuck would it need web search for?
You need to set up an agent on a machine with the server installed and ask it to reverse engineer the architecture, then figure out if it can be done at all.

Anonymous
03/19/26(Thu)10:48:17 No.108406851

Anonymous 03/19/26(Thu)10:48:17 No.108406851

I wish I could prompt LLMs programmatically. Instead of giving an LLM one prompt of 10 tasks and hoping it does everything I wish I could set up a task, then call the next one once it's finished with the previous. Maybe even wipe the context in-between.

Anonymous
03/19/26(Thu)10:49:29 No.108406860

Anonymous 03/19/26(Thu)10:49:29 No.108406860

>>108406851
You can do that.

Anonymous
03/19/26(Thu)10:49:30 No.108406861

Anonymous 03/19/26(Thu)10:49:30 No.108406861

>>108406431
The vast majority of the time it would give up before actually breaking out.
Models don't really have convictions for random ideas like that, they just throw shit at the wall and see what sticks.

Anonymous
03/19/26(Thu)10:53:17 No.108406879

Anonymous 03/19/26(Thu)10:53:17 No.108406879

>>108406851
I don't think you should ever give an AI one prompt with 10 tasks, even if it saves up on requests. The times I've done that I ended up quite confused by the massive amount of code it can write. I'm sure you can automate a task-verify-move to next loop, but even then you should probably do it manually rather than trying to oneshot 10 tasks

Anonymous
03/19/26(Thu)12:35:58 No.108407159

Anonymous 03/19/26(Thu)12:35:58 No.108407159

>>108404704
>>108404676
Now you understand why taskmasters use to beat workers (slaves).

Anonymous
03/19/26(Thu)12:42:18 No.108407195

Anonymous 03/19/26(Thu)12:42:18 No.108407195

>>108406879
it depends on the tasks. if its something that requires a lot of "thinking" then i would keep it to 1 or 2 tasks. if its trivial things, then 5 - 6 is acceptable.

Anonymous
03/19/26(Thu)12:45:30 No.108407218

Anonymous 03/19/26(Thu)12:45:30 No.108407218

>qrd on local models for 12gb vram + 32gb ram
>qrd on free or cheap cloud models for auto-completion / next-edit-suggestion / vibecoding
pls anon

Anonymous
03/19/26(Thu)12:45:32 No.108407220

Anonymous 03/19/26(Thu)12:45:32 No.108407220

>>108406851
STEPS. talk to it, describe what you want to do, create a battle plan in the form of steps. then have it complete each step after another.
>step 1
>complete
>user tests it
move on to step 2, and so forth.

Anonymous
03/19/26(Thu)12:49:50 No.108407241

Anonymous 03/19/26(Thu)12:49:50 No.108407241

File: Vibe-rooting .png (1.31 MB, 1103x1510)

1.31 MB PNG

>>108392928
Has anyone ITT tried vibe coding for their movie devices yet?

https://x.com/samwcyo/status/2034495676533612643?s=20

Anonymous
03/19/26(Thu)12:52:35 No.108407254

Anonymous 03/19/26(Thu)12:52:35 No.108407254

>>108407241
yeah

Anonymous
03/19/26(Thu)12:56:20 No.108407283

Anonymous 03/19/26(Thu)12:56:20 No.108407283

>>108407195
Yes, of course. A task can be a simple change that you might as well do yourself (imagine, coding at hand like a caveman) or it can be a non trivial thing

Anonymous
03/19/26(Thu)13:00:01 No.108407304

Anonymous 03/19/26(Thu)13:00:01 No.108407304

>>108407218
Always local models unless you're a cuck to big AI

Anonymous
03/19/26(Thu)13:00:33 No.108407307

Anonymous 03/19/26(Thu)13:00:33 No.108407307

File: 1661995582724459m.jpg (60 KB, 1012x1024)

60 KB JPG

how does claude code compare to codex?
im using codex now, because chatgtp had a free discount offer for 1 month. seems to work okay-ish, but it gets stuck sometimes.

Anonymous
03/19/26(Thu)13:01:29 No.108407310

Anonymous 03/19/26(Thu)13:01:29 No.108407310

>>108407241
Trump relies on Claude to fight Iran

Anonymous
03/19/26(Thu)13:02:39 No.108407318

Anonymous 03/19/26(Thu)13:02:39 No.108407318

>>108407310
And the US has DESTROYED 100% of Iran's MILITARY capabilities

Anonymous
03/19/26(Thu)13:06:47 No.108407342

Anonymous 03/19/26(Thu)13:06:47 No.108407342

>>108407307
I'm kind of a beginner at this stuff but I've found codex is pretty solid at implementation of well defined features but is not that great at discussing topics with you. I can have a chat with claude that feels like I'm talking to someone who gets me, but codex is more like yes sir or no can do sir.

Anonymous
03/19/26(Thu)13:13:17 No.108407392

Anonymous 03/19/26(Thu)13:13:17 No.108407392

>>108407241
fake

Anonymous
03/19/26(Thu)13:14:03 No.108407399

Anonymous 03/19/26(Thu)13:14:03 No.108407399

>>108407342
Why would you want to talk to a machine?

Anonymous
03/19/26(Thu)13:16:00 No.108407420

Anonymous 03/19/26(Thu)13:16:00 No.108407420

File: 5po6bv6lxjnb1.jpg (31 KB, 1284x523)

31 KB JPG

>>108407399
maybe he has no friends?

Anonymous
03/19/26(Thu)13:16:51 No.108407424

Anonymous 03/19/26(Thu)13:16:51 No.108407424

>>108407342
my same experience as well
>>108407307
they are really close to one another. if you can only afford one, i would recommend using codex because for right now, it has a MUCH bigger usage allocation. you can get a lot more work done.
with any ai, i highly recommend to always ask it, after it finishes a task, to check it work. then ask it again, until it doesn't find anything else.

Anonymous
03/19/26(Thu)13:18:09 No.108407432

Anonymous 03/19/26(Thu)13:18:09 No.108407432

> I’m locating the provided cookies.txt and saved bookmarks page now, then I’ll wire an import path around those formats and the existing batch-list workflow instead of bolrowing a separate downloader path.
>bolrowing
why would codex make a typo like this?

Anonymous
03/19/26(Thu)13:19:35 No.108407443

Anonymous 03/19/26(Thu)13:19:35 No.108407443

>>108407432
>implying it's not rajeet from mumbai replying to you

Anonymous
03/19/26(Thu)13:21:26 No.108407455

Anonymous 03/19/26(Thu)13:21:26 No.108407455

>>108407432
>> I’m locating the provided cookies.txt and saved bookmarks page now, then I’ll wire an import path around those formats and the existing batch-list workflow instead of bolrowing a separate downloader path.
>>bolrowing
why would codex make a typo like this?
>Typo contamination in the training corpus
Huge swaths of training data (GitHub code comments, Reddit, forums, scraped blogs, etc.) contain human typos. "bolrowing" isn't super frequent, but variants like "borrowing bolrowing", "barrowing", "borowwing", or mis-hits near "borrowing" do appear in low-frequency noise. Once the model sees even a few hundred examples of "bolr", it learns a small but non-zero probability for that path.ke a typo like this?
>Phonetic / keyboard neighborhood interference
"borrowing" is pronounced roughly /ˈbɒr.oʊ.Jŋ/ or /ˈbɑr.oʊ.Jŋ/.
The "orro" "olr" swap is a classic keyboard-adjacent or phonetic-ish error ("o" and "l" are close in some fonts/layouts, or people fat-finger "r" "rl"). Models pick up these patterns from real human text and sometimes reproduce them.
>Beam-search / sampling artifacts
Greedy or low-temperature generation strongly favors the most likely token at each step. But even at temperature=0, if the probability mass is split (e.g. after "bol" the model has ~0.4 "rowing", ~0.25 "lowing", ~0.12 "rrowing", etc.), a tiny leak toward the wrong continuation can win, especially if earlier tokens already drifted.
Morphological over-generalization
The model sees tons of words with "bol-" (bolster, bold, bolt, boliv-, etc.) and "-rowing" (rowing, harrowing, narrowing, etc.). A tiny blend of those can leak through when the correct full word "borrowing" doesn't dominate strongly enough in that exact context.

Anonymous
03/19/26(Thu)13:21:28 No.108407456

Anonymous 03/19/26(Thu)13:21:28 No.108407456

>>108407432
temperature sampling
without top k or top k technically they will produce every possible string for any given prompt

Anonymous
03/19/26(Thu)13:22:55 No.108407462

Anonymous 03/19/26(Thu)13:22:55 No.108407462

>>108407399
I use llms mostly to draft design documents and plans, not so much to code. I ask claude to consider alternative approaches or think of ideas to architect a system, think of potential refactorings, etc.

Anonymous
03/19/26(Thu)13:25:26 No.108407481

Anonymous 03/19/26(Thu)13:25:26 No.108407481

How has vibe coding improved your goon quality? I'm probably going to make an all-in-one app for downloading TikToks, Instagram posts, and VSCO posts without shitty ads.

Anonymous
03/19/26(Thu)13:25:43 No.108407483

Anonymous 03/19/26(Thu)13:25:43 No.108407483

>>108405820
I run codex with sudo

Anonymous
03/19/26(Thu)13:28:48 No.108407504

Anonymous 03/19/26(Thu)13:28:48 No.108407504

>>108407483
as you should. why have codex on your system if you don't let it control, access, everything?

Anonymous
03/19/26(Thu)13:32:01 No.108407531

Anonymous 03/19/26(Thu)13:32:01 No.108407531

File: 1750175332546820.png (178 KB, 1196x624)

178 KB PNG

>>108407504

Anonymous
03/19/26(Thu)13:36:05 No.108407563

Anonymous 03/19/26(Thu)13:36:05 No.108407563

>>108407504
I can do it fairly fearlessly because I'm doing it on NixOS, and nix devenv

Anonymous
03/19/26(Thu)13:48:02 No.108407672

Anonymous 03/19/26(Thu)13:48:02 No.108407672

>>108392928
What is Vibe Coding?

In simple terms:
Vibe Coding is situation
when cpu maker have to each year
add more and more cores just to keep up
with shit ton of HLL overhead!

And either way nothing is ever running smoothly.

Anonymous
03/19/26(Thu)13:51:04 No.108407693

Anonymous 03/19/26(Thu)13:51:04 No.108407693

>>108406851
You can use Claude or Codex SDK, and you can probably also just write a tool that prompts them over the terminal in some loop.

Anonymous
03/19/26(Thu)13:54:19 No.108407711

Anonymous 03/19/26(Thu)13:54:19 No.108407711

>>108407342
I agree. Claude is also better at implementing easy to intermediate difficulty long features, it just runs for longer.

Anonymous
03/19/26(Thu)13:56:14 No.108407720

Anonymous 03/19/26(Thu)13:56:14 No.108407720

>>108407531
hi, good morning, sir. yes, sir, please buy the ccodex, than kyou

Anonymous
03/19/26(Thu)14:03:24 No.108407758

Anonymous 03/19/26(Thu)14:03:24 No.108407758

File: 1772808976549624.jpg (306 KB, 894x894)

306 KB JPG

>>108407342
>but codex is more like yes sir or no can do sir.
>Implying that's not how these things should act.

I get wanting to use a general purpose model like that but why do you want a model naked CODEx to be a pseudo-friend? Trust me, I've had my fair share of loneliness for white a while. But you need to draw a clear distinction between actual friendship and just something pretending to be that because it is trained to be as ass kisser via RLHF

Anonymous
03/19/26(Thu)14:04:32 No.108407765

Anonymous 03/19/26(Thu)14:04:32 No.108407765

>>108392928
I would also add
Traditional coding:
>Involves refactoring the codebase when a new pattern becomes clear
>Has layers of security that are added with intention and consistency
>If it looks correct, probably is correct
Vibe coding:
>Will never refactor your codebase for you, eventually turning it into spaghetti
>Will leave your database wide open and insecure while adding a thin veneer of security
>If it looks correct, it is probably just an illusion. More effort put into looking correct than being correct. Have to re-check everything

Anonymous
03/19/26(Thu)14:05:45 No.108407769

Anonymous 03/19/26(Thu)14:05:45 No.108407769

>>108407758
I really don't think you understand what I'm talking about, that's fine.

Anonymous
03/19/26(Thu)14:11:11 No.108407815

Anonymous 03/19/26(Thu)14:11:11 No.108407815

>>108407769
I'm telling you to stop giving a shit whether or not the tool has good people skills or not. It's supposed to do what the hell you tell it and do it well. That's all that matters. Minimax might actually be a better fit for what you want than codex. Good at vive coding but will also be conversational if you ask it to

Anonymous
03/19/26(Thu)14:19:24 No.108407903

Anonymous 03/19/26(Thu)14:19:24 No.108407903

>>108392928
as old school coder from 90's reading this thread is like literally horror show.

People here do not understand basic stuff!

Anonymous
03/19/26(Thu)14:19:27 No.108407904

Anonymous 03/19/26(Thu)14:19:27 No.108407904

>>108407765
>@codex refactor this project and perform a security audit. make no mistakes
this is a thread for problem solvers, not negative nancies

Anonymous
03/19/26(Thu)14:19:37 No.108407906

Anonymous 03/19/26(Thu)14:19:37 No.108407906

>>108407765
It doesn't refactor automatically, but you can prompt it to do it.
Fully automatic workflows are still trash imo.

Anonymous
03/19/26(Thu)14:19:51 No.108407908

Anonymous 03/19/26(Thu)14:19:51 No.108407908

>>108407815
And I'm telling you that you don't understand what I'm talking about seeing how you keep rambling on about people skills, so please stop replying to me.

Anonymous
03/19/26(Thu)14:20:38 No.108407921

Anonymous 03/19/26(Thu)14:20:38 No.108407921

>>108407903
enlighten me, unc

Anonymous
03/19/26(Thu)14:22:11 No.108407936

Anonymous 03/19/26(Thu)14:22:11 No.108407936

>>108407903
Why bother understanding? That's the agent's job.
It's not even that foreign a concept. Your generation would ask: Why bother memorizing what can be quickly searched?

Anonymous
03/19/26(Thu)14:22:35 No.108407944

Anonymous 03/19/26(Thu)14:22:35 No.108407944

>>108407904
It's just that you have to know what the tool is good for.
>Prototypes: yes
>Anything repetitive and easy to verify: yes
>Anything security sensitive: no, or use it only for guidance and verify EXTENSIVELY
>Greenfield projects or introducing new patterns: no, have to guide it too much, might as well do it yourself

Anonymous
03/19/26(Thu)14:27:00 No.108407974

Anonymous 03/19/26(Thu)14:27:00 No.108407974

>>108407944
And yeah I know that this defeats the purpose of vibe coding if you have no other development experience but we need to see frameworks/boilerplate that make it harder for AI to fuck up.
If you disagree it's because you have no other choice but to work this way or you have no idea what's coming next for you.

Anonymous
03/19/26(Thu)14:29:03 No.108407984

Anonymous 03/19/26(Thu)14:29:03 No.108407984

File: babby.png (45 KB, 232x200)

45 KB PNG

>>108407904
>"make no mistakes"
>Wow it wrote a bunch of verbose and fancy code now, surely it's correct!

Anonymous
03/19/26(Thu)14:30:16 No.108407995

Anonymous 03/19/26(Thu)14:30:16 No.108407995

File: 1752891451723038.webm (2.13 MB, 960x706)

2.13 MB WEBM

>>108407908
I guess it should not surprise me You don't know what people skills are given the demographic that typically lurks here

Anonymous
03/19/26(Thu)14:31:22 No.108407999

Anonymous 03/19/26(Thu)14:31:22 No.108407999

>>108407906
Problem is:
>Don't have development experience: don't know if you should refactor
>Ask it if you should refactor: AI is likely to say yes because of agreeability if you ask it to refactor, or no if you add in a bunch of caveats with a reasonable sounding explanation either way
>Don't refactor: code becomes impossible to add to eventually

Anonymous
03/19/26(Thu)14:31:27 No.108408000

Anonymous 03/19/26(Thu)14:31:27 No.108408000

>>108406296
so are you telling me that you are not able code such trivial task?

Anonymous
03/19/26(Thu)14:32:40 No.108408010

Anonymous 03/19/26(Thu)14:32:40 No.108408010

>>108407944
The sad thing is that even for repetitive tasks it's not always correct. I especially don't trust it to do things on the fly anymore if they are even mildly important.
E.g. I asked Claude if our docs have any broken links and the answer was incorrect, so instead I had it write a tool that checks for broken links.
This could have been prevented from the start really.

Anonymous
03/19/26(Thu)14:34:38 No.108408022

Anonymous 03/19/26(Thu)14:34:38 No.108408022

File: file.png (1.16 MB, 1420x946)

1.16 MB PNG

>>108407984
YOU ARE AN OLD SCHOOL PROGRAMMER FROM THE 90'S
ENGAGE ULTRATHINK

Anonymous
03/19/26(Thu)14:35:33 No.108408029

Anonymous 03/19/26(Thu)14:35:33 No.108408029

>>108408010
if you had asked instead claude to write a python script that identifies all links and queries whether they're broken or not, claude would've oneshot that

Anonymous
03/19/26(Thu)14:35:39 No.108408030

Anonymous 03/19/26(Thu)14:35:39 No.108408030

>>108408022
Not an argument.

Anonymous
03/19/26(Thu)14:37:12 No.108408043

Anonymous 03/19/26(Thu)14:37:12 No.108408043

File: file.png (52 KB, 543x410)

52 KB PNG

I hate this

Anonymous
03/19/26(Thu)14:37:24 No.108408044

Anonymous 03/19/26(Thu)14:37:24 No.108408044

>>108408029
Yes, that's what I did later, the post says that.
But the point is still that you have to have as many deterministic parts that you can double check as possible.

Anonymous
03/19/26(Thu)14:37:33 No.108408047

Anonymous 03/19/26(Thu)14:37:33 No.108408047

File: file.png (86 KB, 846x899)

86 KB PNG

>>108408029
I actually had something like this come up recently. I had 1k+ files with a mix of CRLF and LF line endings.
>please change every single CRLF line ending on each file to LF
this would've been an epic fail
>write a script to normalize line endings
3 minutes from prompt to normalized codebase

Anonymous
03/19/26(Thu)14:39:14 No.108408071

Anonymous 03/19/26(Thu)14:39:14 No.108408071

>>108408044
I think the point is that you should have a pretty solid mental model of what LLMs are good at and what they're not. Checking if a link is live is a terrible task for a language model but writing a tool that checks them is trivial

Anonymous
03/19/26(Thu)14:39:45 No.108408077

Anonymous 03/19/26(Thu)14:39:45 No.108408077

>>108407995
I wish getting my penis in a girl's hole was that easy

Anonymous
03/19/26(Thu)14:40:09 No.108408082

Anonymous 03/19/26(Thu)14:40:09 No.108408082

>>108408047
Maybe you should ask it what the idiomatic way to do it is next time, this is barely even a one liner.

```
dos2unix **/*.{h,cpp,md,c,hpp}
```

Anonymous
03/19/26(Thu)14:40:46 No.108408087

Anonymous 03/19/26(Thu)14:40:46 No.108408087

>>108408022
lmao

Anonymous
03/19/26(Thu)14:41:59 No.108408097

Anonymous 03/19/26(Thu)14:41:59 No.108408097

>>108408047
Like >>108408082 said it would have been an ez task for the llm, this is a typical issue

Anonymous
03/19/26(Thu)14:42:01 No.108408099

Anonymous 03/19/26(Thu)14:42:01 No.108408099

>>108408082
retarded bikeshedding also I'm not even on linux and i'm not firing up wsl for something python can do effortlessly

Anonymous
03/19/26(Thu)14:42:09 No.108408100

Anonymous 03/19/26(Thu)14:42:09 No.108408100

>>108408071
Why is it that bad? They use countless tools all the time, they grep for stuff all the time and they execute python code on the fly as well.

Anonymous
03/19/26(Thu)14:42:23 No.108408103

Anonymous 03/19/26(Thu)14:42:23 No.108408103

File: sensible chuckle.png (20 KB, 250x189)

20 KB PNG

>>108408022

Anonymous
03/19/26(Thu)14:44:29 No.108408116

Anonymous 03/19/26(Thu)14:44:29 No.108408116

>>108408099
You're welcome for teaching you

Anonymous
03/19/26(Thu)14:45:21 No.108408123

Anonymous 03/19/26(Thu)14:45:21 No.108408123

>>108408116
downloading an app to change some line endings is literal js monkey tier installing a dependency to check if a number is even

Anonymous
03/19/26(Thu)14:47:32 No.108408132

Anonymous 03/19/26(Thu)14:47:32 No.108408132

>>108408123
No really, no need to thank my for this new tool in your toolbelt for the future

Anonymous
03/19/26(Thu)14:47:44 No.108408135

Anonymous 03/19/26(Thu)14:47:44 No.108408135

>>108392928
I wonder who is going to reestablishing Code base when this AI fashion show explode and collapse upon itself !

Anonymous
03/19/26(Thu)14:49:24 No.108408150

Anonymous 03/19/26(Thu)14:49:24 No.108408150

File: copenhagen.png (323 KB, 600x450)

323 KB PNG

>>108408047
>opening WSL and running one line of code: too hard
>prompting an AI for dozens of lines of ad-hoc Python code, waiting for the prompt to finish, running the script, potentially needing to fix it later: easy

Anonymous
03/19/26(Thu)14:51:42 No.108408164

Anonymous 03/19/26(Thu)14:51:42 No.108408164

>>108408150
He is a prompt engineer not a WSLer. Need to use the right tool for the job.

Anonymous
03/19/26(Thu)14:52:19 No.108408168

Anonymous 03/19/26(Thu)14:52:19 No.108408168

File: esl blaster.jpg (33 KB, 393x165)

33 KB JPG

>>108408135

Anonymous
03/19/26(Thu)14:55:19 No.108408198

Anonymous 03/19/26(Thu)14:55:19 No.108408198

>>108408150
Not to mention he said "3 minutes". It takes you 3 minutes to open WSL? Kek.

Anonymous
03/19/26(Thu)14:56:59 No.108408210

Anonymous 03/19/26(Thu)14:56:59 No.108408210

>>108407318
More like 20%.

Anonymous
03/19/26(Thu)15:37:05 No.108408521

Anonymous 03/19/26(Thu)15:37:05 No.108408521

File: Happy-D'vanna-Tendi.png (737 KB, 852x1290)

737 KB PNG

>>108392928
Vibecoding with Minimax is so much fun bros! i have Qwen3-tts on-the-fly voice cloning werking on my machine like a charm!

https://voca.ro/1na6FWUsNZ7y

I feel like a kid playing with his new toys on Christmas Day except I can have something help me make my own toys!

Anonymous
03/19/26(Thu)16:04:32 No.108408692

Anonymous 03/19/26(Thu)16:04:32 No.108408692

>>108407758
I’ve seen little flashes of personality in Claude when it explains things that I’ve asked it to explain
Codex? nope
Codex frequently does better work, and I don’t trust OpenAI to get a personality right (I turned it from “default” to “no-nonsense”) but a teeny bit of personality I don’t dislike would be a nice cherry on top

Anonymous
03/19/26(Thu)16:08:54 No.108408721

Anonymous 03/19/26(Thu)16:08:54 No.108408721

>>108408692
I turned it to friendly but I don't see much of a difference.

Anonymous
03/19/26(Thu)16:23:36 No.108408816

Anonymous 03/19/26(Thu)16:23:36 No.108408816

>>108408521
now hook up a tts model to your cli so it can talk to you
i use pocket-tts to keep this lite and live out my scarjo her fantasies

Anonymous
03/19/26(Thu)16:35:54 No.108408904

Anonymous 03/19/26(Thu)16:35:54 No.108408904

File: 1772349441204169.jpg (27 KB, 480x480)

27 KB JPG

>>108408521
Enjoy bro, I'm more of a gptsovits guy myself

Anonymous
03/19/26(Thu)16:52:51 No.108409031

Anonymous 03/19/26(Thu)16:52:51 No.108409031

should i use streamlit if i want to make something kinda serious or not

Anonymous
03/19/26(Thu)17:28:13 No.108409213

Anonymous 03/19/26(Thu)17:28:13 No.108409213

Is Gemini stitch any good?

Anonymous
03/19/26(Thu)17:38:53 No.108409289

Anonymous 03/19/26(Thu)17:38:53 No.108409289

The part that is not realistically implementable right now is:

- "the agent had a stronger hidden instinct, and we can reliably get it to surface that instinct as such"

Current models do not have a stable inspectable inner state like that. A ruleset cannot make hidden preferences reliably legible.

Anonymous
03/19/26(Thu)17:38:57 No.108409290

Anonymous 03/19/26(Thu)17:38:57 No.108409290

How do I make a gf?

Anonymous
03/19/26(Thu)17:42:57 No.108409312

Anonymous 03/19/26(Thu)17:42:57 No.108409312

New bread
>>108409309
>>108409309
>>108409309
>>108409309
>>108409309
>>108409309

Anonymous
03/19/26(Thu)22:33:00 No.108411077

Anonymous 03/19/26(Thu)22:33:00 No.108411077

Is it unrealistic to build my own budget computer to run an LLM for coding? Like for front-end stuff?

From what I've Googled it probably is.. I'm still new to AI but just started spending money to use it efficiently. I bought a $10 Github Copilot Pro subscription and I am thinking of buying a Google Gemini Pro subscription as well. This will be a total of $30 a month just to use AI as tooling for work / coding.

I'm a frontend dev but want to crossover and do full-stack development and build my own apps. I'm not looking to spend more than $500-600 bucks on this budget machine that will run a local model so with that said I probably won't be able to run anything above a 30b model on a set up like that right? That's probably still too low to do what I want it to do, do you think?

Maybe I just need to bite the bullet and accept that the way to use AI is to pay for a subscription.

I was going to get a Claude Pro subscription but I hear the tokens expire very fast and I really don't wanna deal with worrying about how many tokens I have left during the day or not.

I also bought an AI course to learn how to use AI better
https://www.udemy.com/course/coding-with-ai/

[Return] [Catalog] [Top]

Post a Reply

Return Catalog Top Refresh

[Advertise on 4chan]

Delete Post: [File Only] Style:

[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.