/g/ - /vcg/ — Vibe-coding general - Technology


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

Anonymous
/vcg/ — Vibe-coding general 07/03/26(Fri)16:47:54 No.109193741

File: 1778995530008258.png (2.88 MB, 1485x1126)

/vcg/ — Vibe-coding general Anonymous 07/03/26(Fri)16:47:54 No.109193741

A general for vibe coding, coding agents, AI IDEs, browser builders, and shipping prototypes with LLMs.

## News
(7/01) Fable 5 + Mythos 5 restored globally after US lifted export controls (6/30).
(6/30) Claude Sonnet 5: near-Opus 4.8 quality at $2/$10 intro, 1M ctx, new default for Free/Pro.
(6/30) Meituan LongCat-2.0: 1.6T open coding model (MIT).
(6/26) GPT-5.6 preview: Sol/Terra/Luna, Codex+API to trusted partners; Sol Ultra 91.9% Terminal-Bench 2.1. GA in weeks.
(6/13) GLM-5.2: Z.ai open-weights a 1M-context coding model (MIT).

----

## What “vibe coding” is, and how to do it
https://simonwillison.net/2025/Mar/19/vibe-coding/
https://simonwillison.net/2025/Mar/11/using-llms-for-code/

----

## Frontier models using fully-general tooling — start here if you have $20 or so
https://developers.openai.com/codex/cli
https://claude.com/product/claude-code

## Not worth it for code, but maybe good for other things
https://geminicli.com/docs/
https://x.ai/cli
https://chat.z.ai/

## Open / local / self-hosted
>>>/g/lmg

----

## Prompting / context / skills
https://arps18.github.io/posts/claude-code-mastery/
https://simonwillison.net/guides/agentic-engineering-patterns/using-git-with-coding-agents/
https://github.com/mattpocock/skills — /grilling is a favorite

## Other editors / terminal agents / coding agents
https://aider.chat/
https://pi.dev/
https://opencode.ai/
https://cursor.com/docs
https://docs.windsurf.com/
https://docs.cline.bot/
https://docs.github.com/en/copilot/how-tos/use-copilot-agents/coding-agent

## UI/Frontend
https://www.figma.com/make/
https://www.anthropic.com/news/claude-design-anthropic-labs
https://uiverse.io/
https://stitch.withgoogle.com/

## In-browser builders / hosted vibe tools
https://bolt.new/
https://replit.com/
https://v0.app/docs

## Benchmarks / rankings
https://www.tbench.ai/leaderboard/terminal-bench/2.0

## What we’ve done
https://vcg.gitgud.site

## Previous thread
>>109186872

Anonymous
07/03/26(Fri)16:53:39 No.109193771

Anonymous 07/03/26(Fri)16:53:39 No.109193771

Is GTP 5.7 better than Fable?

Anonymous
07/03/26(Fri)16:57:50 No.109193795

Anonymous 07/03/26(Fri)16:57:50 No.109193795

File: file.png (246 KB, 600x400)

246 KB PNG

i'm gay

Anonymous
07/03/26(Fri)17:00:04 No.109193807

Anonymous 07/03/26(Fri)17:00:04 No.109193807

>>109193771
yes (I'm Sam)

Anonymous
07/03/26(Fri)17:10:56 No.109193866

Anonymous 07/03/26(Fri)17:10:56 No.109193866

trying to figure out how to use my one week lifetime window for fable is a bitch.

Anonymous
07/03/26(Fri)17:16:23 No.109193894

Anonymous 07/03/26(Fri)17:16:23 No.109193894

>>109193866
I was saving it but I realized my Fable 1 week limit resets tomorrow so now I am just spamming everything I can think of to it

Anonymous
07/03/26(Fri)17:18:17 No.109193898

Anonymous 07/03/26(Fri)17:18:17 No.109193898

Have any of you tried Headroom for savings?
https://headroomlabs-ai.github.io/headroom/

Anonymous
07/03/26(Fri)17:18:42 No.109193899

Anonymous 07/03/26(Fri)17:18:42 No.109193899

>>109193795
are you also Chinese?

Anonymous
07/03/26(Fri)17:19:23 No.109193902

Anonymous 07/03/26(Fri)17:19:23 No.109193902

>>109193866
stop acting as if this is a once in a life time experience. its going to come back to subscriptions soon, especially after 5.6 comes out and they have competition. models get better every day.
just use it. if you have it do something that you feel turned out bad or you feel you wasted it with a bad prompt, then you learned something and will be able to take better advantage of future models

Anonymous
07/03/26(Fri)17:19:49 No.109193904

Anonymous 07/03/26(Fri)17:19:49 No.109193904

>>109193894
i did that last night, had it do a bunch of shit as long as it was prior to 8am local

Anonymous
07/03/26(Fri)17:20:50 No.109193910

Anonymous 07/03/26(Fri)17:20:50 No.109193910

>>109193902
>stop acting as if this is a once in a life time experience. its going to come back to subscriptions soon,
lmao no and even if it does it will be nerfed
plus the government might just rangeban all good models at some point now that we have precedent for it

Anonymous
07/03/26(Fri)17:20:58 No.109193911

Anonymous 07/03/26(Fri)17:20:58 No.109193911

>>109193898
all of this shit (headroom, cavemen) etc. is just useless voodoo magic cope for poorfags.
just use the models as they were intended.
Pretty sure if Anthropic could save compute like that, their $500k engineers would've already figured that one out

Anonymous
07/03/26(Fri)17:22:05 No.109193920

Anonymous 07/03/26(Fri)17:22:05 No.109193920

>>109193911
There's no question that using less tokens cost less, it's a matter of whether it affects results though.

Anonymous
07/03/26(Fri)17:23:26 No.109193929

Anonymous 07/03/26(Fri)17:23:26 No.109193929

>>109193910
so you honestly believe that anthropic, openai, google, and china will just stop being able to push out innovations? thats the most retarded thing ive heard

Anonymous
07/03/26(Fri)17:26:09 No.109193947

Anonymous 07/03/26(Fri)17:26:09 No.109193947

>>109193920
nta anon but we've had this conversation multiple times in these threads and the vast majority of output tokens are reasoning tokens and they're already ultra-caveman mode - you likely cannot affect these in any way - you're just making the small fraction of output tokens that you read, harder to parse
the other ones that intercept terminal outputs and reformat that are worse

Anonymous
07/03/26(Fri)17:27:28 No.109193960

Anonymous 07/03/26(Fri)17:27:28 No.109193960

>>109193929
they are in the uber "burn money for marketshare" phase and the IPO is around the corner, after that happens prices will spike hard
it might get better in a few years, maybe
but you also have forgotten the whole government thing
they may just make the entire thing corpos only

Anonymous
07/03/26(Fri)17:35:24 No.109194002

Anonymous 07/03/26(Fri)17:35:24 No.109194002

>>109193947
We did have this conversation before, there's just no way to know for sure what the big labs are doing though.

Anonymous
07/03/26(Fri)17:35:33 No.109194005

Anonymous 07/03/26(Fri)17:35:33 No.109194005

>>109193741
No lie you can actually get about 15 liters of usable biodiesel from an average human body. Transesterification. Fat + methanol = energy,
Yes I have been thinking about this a lot lately.

Anonymous
07/03/26(Fri)17:35:58 No.109194010

Anonymous 07/03/26(Fri)17:35:58 No.109194010

>>109193741
Me on the left.

Anonymous
07/03/26(Fri)17:39:05 No.109194024

Anonymous 07/03/26(Fri)17:39:05 No.109194024

>>109194002
there absolutely is, the reasoning traces leak from time to time
we know gpt is an autistic caveman and so is fable/mythos

Anonymous
07/03/26(Fri)17:43:36 No.109194046

Anonymous 07/03/26(Fri)17:43:36 No.109194046

>>109193741
So what are you retards vibecoding?
Anyone made decent software?

Anonymous
07/03/26(Fri)17:46:38 No.109194064

Anonymous 07/03/26(Fri)17:46:38 No.109194064

>>109194046
How many times are you gonna ask this?

Anonymous
07/03/26(Fri)17:50:42 No.109194074

Anonymous 07/03/26(Fri)17:50:42 No.109194074

>>109193741
>>109194005
Very cool billy gates
Make it happen

Anonymous
07/03/26(Fri)17:51:52 No.109194078

Anonymous 07/03/26(Fri)17:51:52 No.109194078

>>109193960
>they may just make the entire thing corpos only
not how it works, you cant hit a killswitch like that. its like how microsoft's most profitable sector is 365 and azure, but they still keep all kinds of consumer facing services open.
if we were in a vacuum and chinese models didnt exist, maybe
llms arent inherently unprofitable, they just are when youre doing anything you can to stay ahead so you brute force with compute. none of the chinese models are doing this, hence why theyre better long term bets. deepseek in particular is pretty far behind but it's very cheap and has a completely unique architecture that lets it have huge context
if anything they would shut it off for free users but continue to subsidize subs. they just cant afford to completely go ghost, china will immediately win

Anonymous
07/03/26(Fri)17:53:15 No.109194084

Anonymous 07/03/26(Fri)17:53:15 No.109194084

>>109194078
>muh chinesemodels
benchmaxxed slop

Anonymous
07/03/26(Fri)17:53:24 No.109194085

Anonymous 07/03/26(Fri)17:53:24 No.109194085

>>109194076

Anonymous
07/03/26(Fri)17:54:47 No.109194094

Anonymous 07/03/26(Fri)17:54:47 No.109194094

>>109194085
What if it decides to do that on its own?

Anonymous
07/03/26(Fri)17:55:17 No.109194096

Anonymous 07/03/26(Fri)17:55:17 No.109194096

has anyone successfully vibecoded circuits? e.g. spice netlists for simulation of analog circuits, kicad schematics, etc

Anonymous
07/03/26(Fri)17:55:38 No.109194098

Anonymous 07/03/26(Fri)17:55:38 No.109194098

File: l-intro-1626135637.jpg (421 KB, 1600x897)

421 KB JPG

>>109194064
No, I am just curious if LLMs are so great show me software you made?

Anonymous
07/03/26(Fri)17:55:52 No.109194101

Anonymous 07/03/26(Fri)17:55:52 No.109194101

>>109194094
Make it a hard rule that Fable is only allowed to spawn Opus agents. ez pz

Anonymous
07/03/26(Fri)17:56:53 No.109194105

Anonymous 07/03/26(Fri)17:56:53 No.109194105

>>109194098
They're great, I'm just a retard with bad ideas. I'm having a le blast though.

Anonymous
07/03/26(Fri)17:57:25 No.109194108

Anonymous 07/03/26(Fri)17:57:25 No.109194108

File: Untitled1.png (86 KB, 1045x900)

86 KB PNG

>>109194084
so why do people pick glm so often in blind tests then
let me guess, arena ai is bribed by china, right? or do real word blind web dev tests not count

Anonymous
07/03/26(Fri)17:59:28 No.109194119

Anonymous 07/03/26(Fri)17:59:28 No.109194119

>>109194108
nta but bro look at the other models in the ranking sonnet 5 and qwen3.7 wtf is this supposed to even represent?

Anonymous
07/03/26(Fri)18:02:27 No.109194130

Anonymous 07/03/26(Fri)18:02:27 No.109194130

>>109194098
How many fucking times are you going to do this. Fuck off and get a life.

Anonymous
07/03/26(Fri)18:02:30 No.109194133

Anonymous 07/03/26(Fri)18:02:30 No.109194133

>>109194119
i dont know what your point is because youre a bit incoherent but these are blind tests and this is the site every llm provider looks at as a blind test source of truth

Anonymous
07/03/26(Fri)18:06:43 No.109194158

Anonymous 07/03/26(Fri)18:06:43 No.109194158

>>109193911
I just like caveman because even when told to be extremely concise LLM responses are still too wordy. The cost savings, if any, are a bonus.

Anonymous
07/03/26(Fri)18:06:49 No.109194159

Anonymous 07/03/26(Fri)18:06:49 No.109194159

>>109194108
>web frontend

Anonymous
07/03/26(Fri)18:08:54 No.109194174

Anonymous 07/03/26(Fri)18:08:54 No.109194174

>>109194133
my point is some of those models are straight ass if thats top 10 they failed to test properly

Anonymous
07/03/26(Fri)18:10:20 No.109194179

Anonymous 07/03/26(Fri)18:10:20 No.109194179

File: kryptonite.gif (384 KB, 400x221)

384 KB GIF

>>109194174
>WebDev

Anonymous
07/03/26(Fri)18:12:18 No.109194186

Anonymous 07/03/26(Fri)18:12:18 No.109194186

>>109194179
sorry i was vibe reading

Anonymous
07/03/26(Fri)18:12:41 No.109194188

Anonymous 07/03/26(Fri)18:12:41 No.109194188

>529 overloaded
it's over

Anonymous
07/03/26(Fri)18:12:54 No.109194189

Anonymous 07/03/26(Fri)18:12:54 No.109194189

File: Untitled2.png (95 KB, 1048x909)

95 KB PNG

>>109194159
literally all that matters. its what people see.
back end is easy for any of these models at this point, save for really complex, large codebases (which isnt something you should have as a vibecoder)
but heres the agent test
>>109194174
theyre not ass. qwen 3.7 max is genuinely a beast. most ai projects use qwen at some point in the chain because of how efficient it is. sonnet 5 isnt ass, it just doesnt have much improvement over 4.6 but 4.6 isnt ass
and like i said its literally a blind test, theres no possibility for errors in their testing. people give a prompt, multiple models take it, the users view the results without knowing which model made what, and pick the best one
its a 100m dollar company and every llm provider looks at it

Anonymous
07/03/26(Fri)18:15:16 No.109194204

Anonymous 07/03/26(Fri)18:15:16 No.109194204

>>109194189
do they have a methodology for their testing documented somewhere because i have plenty of nits to pick but no reason to waste your time with them if they have it all written out

Anonymous
07/03/26(Fri)18:16:23 No.109194209

Anonymous 07/03/26(Fri)18:16:23 No.109194209

>>109194189
Agent Arena is the only leaderboard I trust because it's the one that reflects my opinions on models. Whenever I try a new model I'm always happy to see it slot into that list exactly where I thought it would.

Anonymous
07/03/26(Fri)18:17:55 No.109194217

Anonymous 07/03/26(Fri)18:17:55 No.109194217

File: read nigga, read.png (17 KB, 376x232)

17 KB PNG

>>109194046

Anonymous
07/03/26(Fri)18:18:03 No.109194218

Anonymous 07/03/26(Fri)18:18:03 No.109194218

>>109194209
https://arena.ai/ Don't be lazy, anon.

Anonymous
07/03/26(Fri)18:18:14 No.109194220

Anonymous 07/03/26(Fri)18:18:14 No.109194220

>>109194209
Agent Arena...like a gladiator ring for agents...intriguing...

Anonymous
07/03/26(Fri)18:19:02 No.109194227

Anonymous 07/03/26(Fri)18:19:02 No.109194227

File: 1000128055.png (357 KB, 672x672)

357 KB PNG

>>109194130
Oh no, the aitard is angry.

Anonymous
07/03/26(Fri)18:19:06 No.109194229

Anonymous 07/03/26(Fri)18:19:06 No.109194229

>>109194204
what do you mean? the methodology is what i just said. the scoring system is just elo, which is a well known scoring system used by a ton of shit
again, someone gives a prompt, a few models work concurrently, they all get shown and compared to one another. so for example, AvB, CvD, AvC, AvD, BvC, etc so a proper hierarchy can be formed. the model names arent shown until all votes have been made

Anonymous
07/03/26(Fri)18:19:48 No.109194234

Anonymous 07/03/26(Fri)18:19:48 No.109194234

>>109194227
You are lashing out at ghosts it won't make you feel better

Anonymous
07/03/26(Fri)18:26:44 No.109194268

Anonymous 07/03/26(Fri)18:26:44 No.109194268

>>109194209
>>109194220
Which models would you like to see in a hypothetical "agent arena"? Claude, GPT, Gemini, DeepSeek, Grok, GLM, Kimi...who else?

Anonymous
07/03/26(Fri)18:27:12 No.109194270

Anonymous 07/03/26(Fri)18:27:12 No.109194270

File: file.png (2.47 MB, 1448x1086)

2.47 MB PNG

>>109194227

amd-inference@pm.me
07/03/26(Fri)18:30:59 No.109194286

amd-inference@pm.me 07/03/26(Fri)18:30:59 No.109194286

Anybody else doing inference engineering work on R9700/gfx1201? I know there's quite a few kernel anons here at this point and I'm thinking there's probably a decent amount of overlap between what we're all working on. I'm trying to avoid reinventing the wheel and do my part to help get AMD stack out of anecdote hell.

Anonymous
07/03/26(Fri)18:32:07 No.109194293

Anonymous 07/03/26(Fri)18:32:07 No.109194293

>>109194189
Using AI in complex codebases feels like a completely legitimate usecase. If my codebase were small, I would just write it myself.

Anonymous
07/03/26(Fri)18:32:33 No.109194296

Anonymous 07/03/26(Fri)18:32:33 No.109194296

>>109194024
Ok, I looked it up, you might be right. Interesting.
https://www.reddit.com/r/ClaudeAI/comments/1ul1396/fable_5_leaked_chainofthought_in_web_interface/

Anonymous
07/03/26(Fri)18:32:42 No.109194297

Anonymous 07/03/26(Fri)18:32:42 No.109194297

>>109194229
The sample breadth in my opinion is too wide compared to the actual data points. It's fun trivia but looking at the actual head to heads (e.g. opus 4.8 with thinking typically loses to GLM whereas opus 4.8 without thinking typically wins) it just comes off as nonsensical in terms of the actual value its producing. You can argue for collective data (as in samples among the variety of models averaged) as showing some type of value but realistically if I'm reading the data points right it comes out as a tie the vast majority of the time and whatever weighting they use doesn't do a great job of showing that

Anonymous
07/03/26(Fri)18:34:43 No.109194310

Anonymous 07/03/26(Fri)18:34:43 No.109194310

my project is now on the order of 50k LOC of python and i only understand about a third of it, but it works really well and fast
i plan to just finish polishing the code and then go back and try to figure out how it works

Anonymous
07/03/26(Fri)18:35:24 No.109194314

Anonymous 07/03/26(Fri)18:35:24 No.109194314

>>109194310
>50k LOC python
>really fast

Anonymous
07/03/26(Fri)18:37:30 No.109194327

Anonymous 07/03/26(Fri)18:37:30 No.109194327

>>109194293
yeah but then youre not a vibecoder
>>109194297
thats a well known thing. thinking does not automatically mean better. nor does higher effort levels. in fact, using higher effort levels can make it noticeably worse at creative tasks. but on code, like the agent and webdev rankings i sent, thinking is higher in both. what ranking are you looking at?

Anonymous
07/03/26(Fri)18:38:26 No.109194332

Anonymous 07/03/26(Fri)18:38:26 No.109194332

>>109194024
If they stopped monitoring the chain of thought at all and are only paying attention to results and number of tokens to get there, that type of thing does seem possible.

That's still a non zero chance that this is Anthropic trying to poison the well though.

Anonymous
07/03/26(Fri)18:39:56 No.109194343

Anonymous 07/03/26(Fri)18:39:56 No.109194343

>>109194314
the actual work is done outside of python
rust only gained about 5-10% on top of it

Anonymous
07/03/26(Fri)18:47:32 No.109194379

Anonymous 07/03/26(Fri)18:47:32 No.109194379

>>109194234
I already feel better.

Anonymous
07/03/26(Fri)18:51:09 No.109194392

Anonymous 07/03/26(Fri)18:51:09 No.109194392

>>109194332
Oh they're monitoring it.
But they probably actively encouraged the model to learn to speak like that for brevity or at least they let it happen and hid the facts.

Anonymous
07/03/26(Fri)19:07:29 No.109194469

Anonymous 07/03/26(Fri)19:07:29 No.109194469

File: 1782974622237.png (155 KB, 1890x901)

155 KB PNG

>>109194379
One shotted this yesterday.

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. You may highlight syntax and preserve whitespace by using [code] tags.

Janitor acceptance emails will be sent out over the coming weeks. Make sure to check your spam folder!