/g/ - /vcg/ — Vibe-coding general - Technology

[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]

Board

▼ Settings Mobile Home

/g/ - Technology

Return Catalog Bottom Refresh

Thread archived.
You cannot reply anymore.

[Advertise on 4chan]

[Return] [Catalog] [Bottom]

Anonymous

/vcg/ — Vibe-coding general 05/29/26(Fri)00:03:10 No.108930881

File: RMS — Absolutely Proprietary.jpg (29 KB, 480x451)

29 KB JPG

/vcg/ — Vibe-coding general Anonymous 05/29/26(Fri)00:03:10 No.108930881 Archived

A general for vibe coding, coding agents, AI IDEs, browser builders, MCP, and shipping prototypes with LLMs.

►What is vibe coding?
https://x.com/karpathy/status/1886192184808149383
https://simonwillison.net/2025/Mar/19/vibe-coding/
https://simonwillison.net/2025/Mar/11/using-llms-for-code/

►Prompting / context / skills
https://docs.cline.bot/customization/cline-rules
https://docs.replit.com/tutorials/agent-skills
https://docs.github.com/en/copilot/tutorials/spark/prompt-tips

►Editors / terminal agents / coding agents
https://opencode.ai/
https://cursor.com/docs
https://docs.windsurf.com/getstarted/overview
https://code.claude.com/docs/en/overview
https://aider.chat/docs/
https://docs.cline.bot/home
https://docs.roocode.com/
https://geminicli.com/docs/
https://docs.github.com/en/copilot/how-tos/use-copilot-agents/coding-agent

►Browser builders / hosted vibe tools
https://bolt.new/
https://support.bolt.new/
https://docs.lovable.dev/introduction/welcome
https://replit.com/
https://firebase.google.com/docs/studio
https://docs.github.com/en/copilot/tutorials/spark
https://v0.app/docs/faqs

►Open / local / self-hosted
https://github.com/OpenHands/OpenHands
https://github.com/QwenLM/qwen-code
https://github.com/QwenLM/Qwen3-Coder
https://huggingface.co/bartowski/Qwen_Qwen3.6-35B-A3B-GGUF

►MCP / infra / deployment
https://modelcontextprotocol.io/docs/getting-started/intro
https://modelcontextprotocol.io/examples
https://vercel.com/docs
https://mcp.desktopcommander.app/

►Benchmarks / rankings
https://aider.chat/docs/leaderboards/
https://www.swebench.com/
https://swe-bench-live.github.io/
https://livecodebench.github.io/
https://livecodebench.github.io/gso.html
https://www.tbench.ai/leaderboard/terminal-bench/2.0

►UI/Frontend
Figma Make
Lovable
Claude design
https://uiverse.io/
https://ui-ux-pro-max-skill.nextlevelbuilder.io/
https://stitch.withgoogle.com/

►Previous thread
>>108917213

Anonymous
05/29/26(Fri)00:12:24 No.108930919

Anonymous 05/29/26(Fri)00:12:24 No.108930919

4.8 seems good but 200k context is absolutely unusable after you get used to more so I'll stay with 4.7 (even if 1kk is too much, at least I have the option to clear context when I want)

Anonymous
05/29/26(Fri)00:14:59 No.108930931

Anonymous 05/29/26(Fri)00:14:59 No.108930931

File: Screenshot from 2026-05-2(...).png (175 KB, 1909x971)

175 KB PNG

Some people treat themselves to whiskey or a bottle of whine.
I treat myself to wasting compute time on stupid experiments.

Anonymous
05/29/26(Fri)00:24:00 No.108930975

Anonymous 05/29/26(Fri)00:24:00 No.108930975

>>108930931
You’re not doing both at the same time?

Anonymous
05/29/26(Fri)00:38:42 No.108931073

Anonymous 05/29/26(Fri)00:38:42 No.108931073

File: file.png (384 KB, 1777x925)

384 KB PNG

>>108930931
I only work on serious and important projects that have the potential to benefit mankind in tangible ways.

Anonymous
05/29/26(Fri)00:39:01 No.108931076

Anonymous 05/29/26(Fri)00:39:01 No.108931076

>>108930919
>200k context
wat? 1M on my machine.

Anonymous
05/29/26(Fri)00:40:12 No.108931086

Anonymous 05/29/26(Fri)00:40:12 No.108931086

>>108930881
i'm already seeing quiet laughter for "AI projects" in my workplace. the fanatics can't explain anything they're doing. no one trusts a project built by a retard. the stories of failure and negative experiences are overwhelming. the only answer is this is "growing pains", but i also call it "growing pains", in that people are growing in their hatred for AI projects.

Anonymous
05/29/26(Fri)00:45:24 No.108931118

Anonymous 05/29/26(Fri)00:45:24 No.108931118

>>108931086
Organizations are building projects that assumes human workflows instead of agentic ones. Solves nothing and is a waste of time. New entrants are developing projects that assume agentic workflows.

Most large organizations will probably fail to adapt.

Anonymous
05/29/26(Fri)00:52:19 No.108931166

Anonymous 05/29/26(Fri)00:52:19 No.108931166

>>108931118
"large organizations" offer actual things that people use. not toy projects. and they buy their competitors.

Anonymous
05/29/26(Fri)00:56:06 No.108931189

Anonymous 05/29/26(Fri)00:56:06 No.108931189

>>108931166
I'm not talking about android apps, anon. but you're right, they often do buy their competitors.

Anonymous
05/29/26(Fri)00:58:47 No.108931200

Anonymous 05/29/26(Fri)00:58:47 No.108931200

>>108931189
why so cryptic? are you afraid of me?

Anonymous
05/29/26(Fri)01:08:40 No.108931257

Anonymous 05/29/26(Fri)01:08:40 No.108931257

>>108930881
After all these years I still hate his fuckin caveman face

Anonymous
05/29/26(Fri)01:09:40 No.108931266

Anonymous 05/29/26(Fri)01:09:40 No.108931266

>>108931076
Anthropic exposes the blessed 1M context only through certain channels. Chang there is probably using some weird gookinese website middleman.

Anonymous
05/29/26(Fri)01:10:58 No.108931280

Anonymous 05/29/26(Fri)01:10:58 No.108931280

>>108930881
> Opus 4.8 drops
> was at 97% usage
> now at 0%
> limits reset tonight
Going to town. Got a whole week of usage to use in a day. Scaffolding up 8 projects at once let's fucking go.

Anonymous
05/29/26(Fri)01:11:14 No.108931282

Anonymous 05/29/26(Fri)01:11:14 No.108931282

>>108931257
he has a totally different face now that he has N95-compliant facial hair
I don’t even recognize the guy anymore

Anonymous
05/29/26(Fri)01:11:18 No.108931283

Anonymous 05/29/26(Fri)01:11:18 No.108931283

>>108931086
they are going to hate AI all the way to getting fired into a dead labor market.

>>108931118
exactly. large organizations have dinosaur middle managers running everything. but they will adapt eventually, they are too big to fail and the elites actually running them know they have to adapt to keep their power. that's why ai is pushed so hard everywhere.

>>108931166
yeah bro everybody knows all large organizations refused to adopt ai and explicitly forbid employees from using it. oh wait, they tried that and then they did a 360 when they saw ai actually was a real thing and wasn't a meme and got scared.

Anonymous
05/29/26(Fri)01:12:40 No.108931295

Anonymous 05/29/26(Fri)01:12:40 No.108931295

>>108931266
>Chang there is probably using some weird gookinese website middleman.
he's using deepjeet 4 labeled as opus 4.8

Anonymous
05/29/26(Fri)01:14:20 No.108931308

Anonymous 05/29/26(Fri)01:14:20 No.108931308

>>108931280
Nobody tell him about the 5 hour limit.

Anonymous
05/29/26(Fri)01:17:26 No.108931321

Anonymous 05/29/26(Fri)01:17:26 No.108931321

>>108931283
"dead labor market" if no one has money then what happens to demand?

Anonymous
05/29/26(Fri)01:33:07 No.108931416

Anonymous 05/29/26(Fri)01:33:07 No.108931416

>>108931321
it shifts to people who were already rich before ai (capital) began to do all the work. rich people consume the output of the automated factories and farms, people who don't own enough capital are probably given a pittance to stop them from getting uppity and encouraged to commit suicide by fentanyl or fast food or euthanasia.

Anonymous
05/29/26(Fri)01:34:56 No.108931422

Anonymous 05/29/26(Fri)01:34:56 No.108931422

>>108931086
At work, they just showed off a custom Claude tool, takes like 10 mins to aggregate stats through a artifact. So much better than just using SQL to gather the data manually, with it being 100x longer, 10000x more expensive, and at risk of hallucination. I wouldn't even hate AI if this AI hype wasn't so retarded.

Anonymous
05/29/26(Fri)01:37:22 No.108931433

Anonymous 05/29/26(Fri)01:37:22 No.108931433

>>108931416
a rich person does not buy millions of microwaves, millions of cars, millions of plane tickets, millions of trinkets, millions of cleaning wipes. do you pretend they do?

Anonymous
05/29/26(Fri)01:38:53 No.108931441

Anonymous 05/29/26(Fri)01:38:53 No.108931441

>>108931433
then millions of microwaves will cease to be produced. do you think lots of microwaves being produced is simply a fact of life?

Anonymous
05/29/26(Fri)01:41:34 No.108931455

Anonymous 05/29/26(Fri)01:41:34 No.108931455

>>108931422
that's their own fault. if they give tools to claude then claude will make the sql queries itself. the difference is people who were too dumb or uneducated to get the data before can get the data without requiring someone who knows how to do it.

Anonymous
05/29/26(Fri)01:42:38 No.108931459

Anonymous 05/29/26(Fri)01:42:38 No.108931459

>>108931433
When I make a trillion dollars off my vibeslopped calendar app, I will just to spite you. It even has a widget that tells you the weather! Truly breakthrough shit.

Anonymous
05/29/26(Fri)01:44:12 No.108931470

Anonymous 05/29/26(Fri)01:44:12 No.108931470

so is there really any reason to use Opus 4.8 Max, let alone the Ultracode feature? I’ve been running 4.7 on xHigh, and now I’m using 4.8 on xHigh too.

Anonymous
05/29/26(Fri)01:44:44 No.108931473

Anonymous 05/29/26(Fri)01:44:44 No.108931473

>>108931455
This is what boomers don't understand. Most companies are using shit pilot and trying to do this stuff we le prompt engineering.

Anonymous
05/29/26(Fri)01:44:57 No.108931474

Anonymous 05/29/26(Fri)01:44:57 No.108931474

>>108931422
was there room for audience questions? was there a live demo?

these two "sticking points" have destroyed every vibecoded project proposed so far in my workplace. the live demos fail spectacularly. and vibecoders don't know what to do for a different perspective without consulting AI. i'm employed as a developer and multiple projects i've built are actively relied upon and shared by others, and i can answer questions and demo easily without problems. my responsiveness is why people use my work, they trust the developer to have done the right thing, which people can't trust vibecoders for.

tldr upon scrutiny a Claude project's vibecoder needs to answer tough questions on the spot lest that project go to the trash bin.

Anonymous
05/29/26(Fri)01:45:45 No.108931479

Anonymous 05/29/26(Fri)01:45:45 No.108931479

>>108931470
to make shitlibs seethe

Anonymous
05/29/26(Fri)01:46:24 No.108931483

Anonymous 05/29/26(Fri)01:46:24 No.108931483

>>108931441
"manufacturers will simply cease to exist" kek surely that's economically sustainable. real genuises here on /g/ i see...

Anonymous
05/29/26(Fri)01:47:28 No.108931489

Anonymous 05/29/26(Fri)01:47:28 No.108931489

>>108931483
oh yes? what do microwaves manufacture?

Anonymous
05/29/26(Fri)01:48:43 No.108931493

Anonymous 05/29/26(Fri)01:48:43 No.108931493

2126 Year, typical dialog of Human(H) with AI(A)

A:type "python mantinace_food_robot"
H:done
A:read last line on screen
H:python mantonace_foood_rovot
A:fix it to "python mantinace_food_robot"
H:done
A:read last line on screen
H:puthon muntinace_fod_robot
A:read last line on screen again
H:python mantinace_fod_robot
A:it "fod" or "food"?
H:food
A:read last line on screen again carefully
H:python mantinace_food_robot
A:read last line on screen again carefully
H:python mantinace_food_robot
A:press {enter}
H:done
A:read last line on screen
H:code:3456
A:read last line on screen again carefully
H:code:3456
A:now do to human care center and take your pills
A:[fixes in database medicine set for the human]

Anonymous
05/29/26(Fri)01:50:11 No.108931501

Anonymous 05/29/26(Fri)01:50:11 No.108931501

>>108931473
woahh bro wtf you can't just trigger my ptsd like that

Anonymous
05/29/26(Fri)01:57:21 No.108931540

Anonymous 05/29/26(Fri)01:57:21 No.108931540

>>108931470
https://cdn.sanity.io/files/4zrzovbb/website/c886650a2e96fc0925c805a1a7ca77314ccbf4a6.pdf#page=195
use it when you want to use way more tokens for hard problems that need more thinkin’ time
new high (new default) is about the same token usage as the old xhigh (the old default)

Anonymous
05/29/26(Fri)01:57:44 No.108931544

Anonymous 05/29/26(Fri)01:57:44 No.108931544

>>108931489
if you want to play with the toys i've set in front of you go ahead. i'll just have the adult discussion about broader concepts with someone else.

Anonymous
05/29/26(Fri)02:02:36 No.108931572

Anonymous 05/29/26(Fri)02:02:36 No.108931572

is it just me or is claude laggy as fuck

Anonymous
05/29/26(Fri)02:03:53 No.108931582

Anonymous 05/29/26(Fri)02:03:53 No.108931582

>>108931572
seems pretty zippy today for me. is the “Boogieing…” thing getting red for you? Are you still using the old default of xhigh when the new default is merely ‘high’ and uses about as many tokens as the old default on the old model?

Anonymous
05/29/26(Fri)02:05:29 No.108931594

Anonymous 05/29/26(Fri)02:05:29 No.108931594

>>108931544
Alright, I'll stop being facetious.
Factories will stop producing consumer goods for poor people en masse and will switch to producing only luxury goods except like I mentioned to prevent the poor population from revolting until they lock down security enough with robots that they can just exterminate us.
They don't need consumers just for the sake of having consumers, they need consumers to get money to pay for workers. Once the workers are no longer needed, they don't need consumers either. They will be the only consumers.
Mass produced consumer goods are only useful to then insofar as it allows them to control workers to build them luxury goods and provide luxury services.
Once goods and services are provided by robots then they don't need to manufacture consumer goods en masse anymore.
Capisce? I don't think I can make it any clearer.

Anonymous
05/29/26(Fri)02:05:32 No.108931595

Anonymous 05/29/26(Fri)02:05:32 No.108931595

>>108931582
No it's working through things, it's just slower than i'd expect
I set it ot xhigh, i always run it on that when i use claude
Maybe it's a lot of people testing out fast mode and slowing down the regular usage

Anonymous
05/29/26(Fri)02:05:47 No.108931600

Anonymous 05/29/26(Fri)02:05:47 No.108931600

>>108931308
5 hour limit is still generous. I got through 60% of the entire weeks usage before hitting the 2nd five hour limit.

Anonymous
05/29/26(Fri)02:06:04 No.108931602

Anonymous 05/29/26(Fri)02:06:04 No.108931602

>>108931474
>tldr upon scrutiny a Claude project's vibecoder needs to answer tough questions on the spot lest that project go to the trash bin.

This is what I'm most afraid of. I have a compelling project, but to be able to professionally articulate it is going to be my biggest issue.

Anonymous
05/29/26(Fri)02:07:26 No.108931609

Anonymous 05/29/26(Fri)02:07:26 No.108931609

File: new high is basically the(...).png (107 KB, 1390x838)

107 KB PNG

>>108931595
you should consider changing it to the new default of high
https://cdn.sanity.io/files/4zrzovbb/website/c886650a2e96fc0925c805a1a7ca77314ccbf4a6.pdf#page=195

Anonymous
05/29/26(Fri)02:24:53 No.108931706

Anonymous 05/29/26(Fri)02:24:53 No.108931706

>>108931594
that's the only conclusion for the box i've put you in. the original question was rhetorical here >>108931321 to put you in the box. which exposed your taunting for its accepted conclusion.

Anonymous
05/29/26(Fri)02:27:20 No.108931716

Anonymous 05/29/26(Fri)02:27:20 No.108931716

>>108931602
that's what every vibecoder is afraid of. and even the most basic IQ person will see someone unconfident or overly-confident and distrust them. all vibecodings are therefore destined to fail. you need to defend your work for your work to gain any traction.

Anonymous
05/29/26(Fri)02:37:40 No.108931765

Anonymous 05/29/26(Fri)02:37:40 No.108931765

codex sisters....

Anonymous
05/29/26(Fri)02:48:07 No.108931823

Anonymous 05/29/26(Fri)02:48:07 No.108931823

>>108931765
Works fine today.

Anonymous
05/29/26(Fri)02:48:27 No.108931824

Anonymous 05/29/26(Fri)02:48:27 No.108931824

how do you guys vibecode? Like, do you tell the AI "build me this app, make no mistakes lol"? Or do you go step by step, building one feature at a time, testing everything first before moving on?

Anonymous
05/29/26(Fri)02:49:41 No.108931830

Anonymous 05/29/26(Fri)02:49:41 No.108931830

>>108931824
basically the latter

Anonymous
05/29/26(Fri)02:55:16 No.108931865

Anonymous 05/29/26(Fri)02:55:16 No.108931865

>>108931824
planning is everything

Anonymous
05/29/26(Fri)02:58:43 No.108931875

Anonymous 05/29/26(Fri)02:58:43 No.108931875

>>108931824
i don't think i can claim vibecoding, for most things i just do the thing, but with new things or "maybe i'm not doing the best thing" i do use chatGPT with english description of what i want to do, then test and research everything it suggests including reading documentation. once i understand how it works i implement it in the way i want it to work.

Anonymous
05/29/26(Fri)03:06:19 No.108931913

Anonymous 05/29/26(Fri)03:06:19 No.108931913

>>108931824

try this :

The objective is : I want an Flutter app focused on mobile using OpenCv where the end-user can actually take a screenshot and based on OpenCv data or any other database related to human's body (fetch the web to see what exists) the programm actually calculate body fat/body mass /muscle mass/height/weight.The Photo then is transformed in a drawing style via an api Call to an LLM who can generate an accurate pic.The drawing and the data related to it is then stored in a local database so the end user can retrieve it later.Then based on the data made by the app an api request should be send to a model (specialised in medical assitance or a general one) to tell what's problematic or not and what should be done.For example if there's too much fat ,he shouldn't give a running programm but a swimming one or a walking one+a diet.If there's no muscle a good workout programms with a diet focused on proteins.The programm should be able also to see if there's any problem like skin not healthy(based on the database) or assymetry etc.Fetch the web for the database that can help us generate accurate informations as also models who can generate the pic.

i guess making my own researches would save me much more later troubleshoot but so far those kind of prompts have worked for me. You start by saying in wich framework using wich libraries and then you expand what you want and ask for the LLM to fetch the web to see if he find any useful api/database to what you need.

Anonymous
05/29/26(Fri)03:07:56 No.108931919

Anonymous 05/29/26(Fri)03:07:56 No.108931919

File: 1778670709527650.jpg (134 KB, 1080x659)

134 KB JPG

>>108931824
you're behind if you can't one shot

the technology has advanced so far now that most things can be built instantly provided you know the prompt

>but it can't count strawberry

cope

Anonymous
05/29/26(Fri)03:15:33 No.108931951

Anonymous 05/29/26(Fri)03:15:33 No.108931951

>>108931824
I pitched my idea to the AI, and it laid out the plan, architecture, etc for me. Then we went step by step, building one feature at a time, testing everything before moving on, and documenting the entire process.

Anonymous
05/29/26(Fri)03:15:46 No.108931953

Anonymous 05/29/26(Fri)03:15:46 No.108931953

>>108931919
behind?? how many people are using anything you've built. feel free to count yourself multiple times.

Anonymous
05/29/26(Fri)03:24:27 No.108931990

Anonymous 05/29/26(Fri)03:24:27 No.108931990

>>108931824
I find that when I tell AI to actually generate code it ends up making something that mostly works but isn't up to my standards, so I end up rewriting it all.
Getting AI to generate code also wastes a lot of tokens. So now I just use AI to find specific bugs in my code or do general code review. Or, if I'm using some kind of third party library with shitty documentation, I'll point the AI at the source code for that library and tell me how to use this function and that function.

Anonymous
05/29/26(Fri)03:31:17 No.108932026

Anonymous 05/29/26(Fri)03:31:17 No.108932026

>>108931824
My docs are twice as large as my code. I also have tons of debug data, like when I do a refactor I first capture the results of dozens of my server's endpoints and compare them after the refactor. I also have about 15k tests in total.
So basically my job is now writing plans and creating test data, then the AI can run for a few hours implementing the plans safely.

Anonymous
05/29/26(Fri)03:41:38 No.108932091

Anonymous 05/29/26(Fri)03:41:38 No.108932091

>>108931919
its easy to one-shot after you load in 100 .md files other "people" made. But I agree with you regardless.

Anonymous
05/29/26(Fri)03:43:51 No.108932098

Anonymous 05/29/26(Fri)03:43:51 No.108932098

>>108932091
on one hand, yeah
on the other hand, ~curating~ 100 .md files is work all by itself
it’s kind of like being Elon Musk writ small — you’re not building the cars, you’re building the company that builds the cars

Anonymous
05/29/26(Fri)03:46:43 No.108932107

Anonymous 05/29/26(Fri)03:46:43 No.108932107

File: 1778211211345182.jpg (2.06 MB, 3139x3166)

2.06 MB JPG

>>108931919
I fucking love claude code so much. It's the best thing I've ever had the privilege of using in my entire life besides my ex's penis. Now she can't one-up me in being able to program. I've finally leveled the playing field

Anonymous
05/29/26(Fri)03:48:20 No.108932114

Anonymous 05/29/26(Fri)03:48:20 No.108932114

>>108932098
Sometimes managing the .md files is more work than managing code, especially since .md files don't even have something like compiler checks.
Prompt Codex xhigh something like: "Use 6 sub agents to analyze my docs for contradictions and inconsistencies" usually they are worse than any code base ever was.

Anonymous
05/29/26(Fri)03:55:49 No.108932155

Anonymous 05/29/26(Fri)03:55:49 No.108932155

>>108932114
>Prompt Codex xhigh something like: "Use 6 sub agents to analyze my docs for contradictions and inconsistencies"
I’ve been meaning to do this for a while especially since my thing has been putting docstrings in
might try it with the new https://claude.com/blog/introducing-dynamic-workflows-in-claude-code that was just announced today

Anonymous
05/29/26(Fri)03:58:13 No.108932166

Anonymous 05/29/26(Fri)03:58:13 No.108932166

what is gemini good for?

Anonymous
05/29/26(Fri)03:58:59 No.108932170

Anonymous 05/29/26(Fri)03:58:59 No.108932170

>>108932166
nothin’

Anonymous
05/29/26(Fri)04:06:10 No.108932189

Anonymous 05/29/26(Fri)04:06:10 No.108932189

>>108932155
Claude can absolutely do it too, but he's more lenient on docs than on code, for docs I have to tell him to be more critical and precise, while Codex is just autistic about it by default.

Anonymous
05/29/26(Fri)04:06:16 No.108932190

Anonymous 05/29/26(Fri)04:06:16 No.108932190

File: it's an abstract kind of feel.jpg (41 KB, 680x684)

41 KB JPG

>>108931086
>fixed an old commercial project using AI
>no one knows I used AI
it's a stealth kind of vibe

Anonymous
05/29/26(Fri)04:08:07 No.108932199

Anonymous 05/29/26(Fri)04:08:07 No.108932199

>>108931824
I'm a casual viber so I just tell Gemini what I want it to do step by step with high detail and copy-paste the results. Then I put my prompts in the source code as comments because it's the actual source lmao.

Anonymous
05/29/26(Fri)04:39:06 No.108932322

Anonymous 05/29/26(Fri)04:39:06 No.108932322

I think our infra is just fucked. We tried to optimize it for multi agent workflow but now it's so complex, Codex needs 30 minutes to even start the frontend correctly.

Anonymous
05/29/26(Fri)05:17:18 No.108932510

Anonymous 05/29/26(Fri)05:17:18 No.108932510

>>108932166
multimodal, real OMNI capabilities. it managed to pinpoint the cause of a jitter/judder/ghosting problem of my custom hardware accelerated video streaming pipeline simply by looking at a 10 seconds video of the output with human like perception. what I mean by this: that video was recorded on the end device and had an uneven, variable frame rate, constantly jumping somewhere between 65 and 50 FPS. because no currently accessible GPT model has OMNI capabilities (at least to my knowledge), GPT5.5-xhigh and GPT5.5-Pro-Extended both failed horribly at pinpointing the issue as they are not able to "watch" a video, and just go frame by frame. and because of the variable refresh rate, all their analyzation techniques are moot. Meanwhile Gemini 3.5 had absolute no problem and provided explanation and solution after 5 seconds of thinking or so. unless Gemini 3.5 just got extremely lucky, but that would have been one hell of a coincidence.

Anonymous
05/29/26(Fri)05:24:39 No.108932548

Anonymous 05/29/26(Fri)05:24:39 No.108932548

>>108932322
>he doesn't know about the cheat code of tell codex to add debugging log entries for every single code function in the project, without exception
this simple trick turns any LLM into an almighty coding god, especially if regex and grep filters for the logs are documented. just tell codex to add timer logs to your orchestrator for starters, that will show what might cause the long boot up times. if you use docker compose or something you should already have timestamped logs and also a yaml config that shows which components depend on other components being booted up

Anonymous
05/29/26(Fri)05:29:02 No.108932568

Anonymous 05/29/26(Fri)05:29:02 No.108932568

>>108931493
vibe coding 2126

Anonymous
05/29/26(Fri)05:45:25 No.108932639

Anonymous 05/29/26(Fri)05:45:25 No.108932639

File: claude.png (832 KB, 907x1119)

832 KB PNG

Claude completely changed my life, and I'm not even a programmer.
It started with one question: how do I make a red button in HTML?

I had just lost my job and moved back to my village to take care of my sick mother. No income, no plan, no clue how to make money online. I tried ChatGPT first but the endless walls of text overwhelmed me, sometimes literally made me feel sick. Still, I managed to build a login button just by talking to it.

I kept testing free AI tools until I found Claude. Honestly, I didn't take it seriously at first. The logo and interface looked like some shopping app, not something for coding.

A month in, I realized how wrong I was.

A friend who knew how to code had refused to teach me anything. Claude became what he wouldn't be. It understood what I needed, technically and emotionally.

My first paid job was a login page for $15. The client loved it and gave me more work. I subscribed to the basic plan, which felt expensive back then but paid for itself fast. After six months I upgraded to Max. Hundreds, maybe thousands of conversations with Opus later, I'd built CMS systems, a QR app for photographers, and more.

Now I'm learning Claude Code inside VS Code and everything moves faster. I'm pulling up to $8,000 per project. My mother is taken care of.

Thank you, Claude.

Anonymous
05/29/26(Fri)05:54:24 No.108932669

Anonymous 05/29/26(Fri)05:54:24 No.108932669

I'm using opus 4.8 + adaptive thinking + ultracode - xhigh + workflows and it feels retarded. It's missing some obvious things in its own reasoning, stating that things have to be done in a certain way even though it itself stated that it can be done in a different way. Opus 4.7 didn't have this problem.

Anonymous
05/29/26(Fri)06:00:53 No.108932687

Anonymous 05/29/26(Fri)06:00:53 No.108932687

>>108932669
>workflows
isn't that just spawning subagents? subagents that are probably not configured to use opus 4.8, but sonnet/haiku depending on difficulty estimation from the orchestrating 4.8 agent? I read some post where someone used the same settings as you but specifically set a rule for all sub agents to only use opus 4.8. apparently it fixed 2 issues in 10 minutes which gpt5.5 couldn't for weeks, but also used up his entire 5h quota.

Anonymous
05/29/26(Fri)06:01:54 No.108932689

Anonymous 05/29/26(Fri)06:01:54 No.108932689

>>108932687
Oh I didn't know that, thanks

Anonymous
05/29/26(Fri)06:03:15 No.108932694

Anonymous 05/29/26(Fri)06:03:15 No.108932694

>>108932548
The problem in this case is different and it comes from our own over-ambitious setup. We have several machines, and each machine has several work trees running, we also have 1password with tons of passwords.
The main problem today was that the agent couldn't find the env for his worktree, he used the wrong secrets, then pointed the frontend at the backend in some other work tree, then said the bug fix didn't do anything, so he started debugging again, even though it was already fixed on his actual worktree.

Anonymous
05/29/26(Fri)06:05:01 No.108932699

Anonymous 05/29/26(Fri)06:05:01 No.108932699

>>108932687
In 4.7 I completely disabled Haiku in the settings and even activated OTEL in Claude, so I can see if he ever uses Haiku. That model is useless.

Anonymous
05/29/26(Fri)06:28:37 No.108932786

Anonymous 05/29/26(Fri)06:28:37 No.108932786

>>108932694
oh, you're running a real time autonomic AI agent. yeah, that's a completely different beast, I agree. really hard to find a balance between predefined tools and scripts (which all need their own rules and documentation) and letting the agent explore, generate and execute freely on the fly. but your described problem makes me believe your agent(s) have too much freedom. or perhaps giving every worktree a individual RULES.md (with rules, paths and other information) which is always loaded when an agent works in it could solve that, if you don't have that already. something else I always wanted to try for such setups is an additional, autonomous janitor agent that reviews everything that happened after a session and the does adjustments over the whole project if needed. in this example, adding more env hints for that specific worktree. obviously this adds complexity and comes with its own issues

Anonymous
05/29/26(Fri)06:29:22 No.108932790

Anonymous 05/29/26(Fri)06:29:22 No.108932790

>>108932687
I wonder if this can be turned into a nuclear skill
>okay Claude, this bug has been a nasty gremlin. Go nuclear

Anonymous
05/29/26(Fri)06:34:36 No.108932805

Anonymous 05/29/26(Fri)06:34:36 No.108932805

OK, self-proclaimed vibecoding AIgods, sell me on how easy this is all supposed to be.
I got a degree in software engineering 20 years ago, but never did anything with it. I glanced at the idea of vibecoding about 18 months ago but hit 4 key stumbling blocks.
First, it just wasn't good enough yet. This related not just to the code itself, but in helping me to resolve issues 2 and 4.
Second was the setup, trying to get the environment prepared to start actually producing anything. Gone are the days of just downloadiing .Net apparently, and then integrating AI into the mix is a whole other headache.
Third, the cost. I'm happy to pay a subscription to Claude, but is that enough for a relatively light user to slowly work on a personal project?
Lastly, I'm not an autistic tranny and I'm straight so I've never used Linux or mac, but the project I want to work on involves the source code for something originally built for Linux. AI couldn't untangle this autism for me in a way that made any sense on windows, could it achieve such a task now?
Come on so-called AIgods, wow me. If this shit is real, prove it without resorting to "git gud" which invalidates your entire claim.

Anonymous
05/29/26(Fri)06:39:36 No.108932821

Anonymous 05/29/26(Fri)06:39:36 No.108932821

>>108932805
ask AI

Anonymous
05/29/26(Fri)06:40:37 No.108932825

Anonymous 05/29/26(Fri)06:40:37 No.108932825

>>108932805
Why should I have to sell it to you? What do I get for it? It’s like someone asking why he should go to the gym. Do whatever you want, it’s your body.
Anyway I wouldn’t get Claude if you’re just trying things out, rate limits are pretty bad on the $20 plan. Codex is better.
The setup is trivial. Download the app or the CLI and point it to your workspace. You can get fancier with git worktrees if you want to work in parallel.
It can most likely handle your code without trouble

Anonymous
05/29/26(Fri)06:48:41 No.108932846

Anonymous 05/29/26(Fri)06:48:41 No.108932846

>>108932805
yeah, so basically AI solves all your problems. as long you understand "logic" and are able to understand documentation about "logic", you're set.

Start with Claude Code or Codex, lowest tier subscription is absolutely fine for starters. These coding agents/harness have set up most of what you need and will absolutely install the necessary dependencies, frameworks and load environments autonomously. Just make sure Git is installed, but AI will tell you that when you ask about "must have tools for AI coding agent". They also work without issues on Windows and feature the most capable models currently. If you're a webdev or need to scrape a lot of up to date documentation, I recommend setting up the google chrome dev mcp (which uses --remote connection and lets the AI agent browser sites and scrape freely, using your genuine chrome browser with high trust score profile).

Anonymous
05/29/26(Fri)07:02:24 No.108932896

Anonymous 05/29/26(Fri)07:02:24 No.108932896

>>108932805
oh yeah also I forgot: plan plan plan.
don't start with "I want to create a 4chan shitposting AI agent. Start building it"
start with "I want to create a 4chan shitposting agent. research the individual components required and options of existing projects, frameworks and coding language/stacks. list up and downsides of the different options."
if you had more knowledge of the topic, which always helps, you would also add
"only consider options which evade or are not detected by cloudflare enterprise, 4chan.org or any of the services and tracking logic they use visible in this [chrome developer tool network request trace log]. this includes browser framekworks (real vs emulated vs headless), proxy ip sourcing (mobile vs residential vs ISP grade residential) and which carrier/service, proxy protocol (needs to pass JA3/JA4, QUIC and timing checks, various test sites online) and captcha solver (low compute image detection if possible, api service or cheap AI API service alternatively).

Anonymous
05/29/26(Fri)07:05:56 No.108932912

Anonymous 05/29/26(Fri)07:05:56 No.108932912

File: 2026-05-29 13.04.24.png (166 KB, 1222x909)

166 KB PNG

Babby's first Codex adventure

>install Codex
>ask it (GPT 5.5 Medium) to disable an annoying sudden camera movement in GTA Vice City when the vehicle points downwards
>look at the edits it did
>I think it fucked up and disabled some unrelated thing
>test it
>its change actually does the thing perfectly
I am mogged, and I like it.
>took 25% of my free usage
I'm okay with that

Anonymous
05/29/26(Fri)07:19:53 No.108932952

Anonymous 05/29/26(Fri)07:19:53 No.108932952

>>108932912
kek, sweet summer child. I wish I could go back to those times. Nowadays I'm not impressed unless AI achieves exactly the same without source code, injecting into the game process and modifying camera behavior at runtime while bypassing active various active anti-cheats like EAC or BattleEye. All within 5 minutes in a one shot session. (Apart from the time restriction, this is all realistic btw)

Anonymous
05/29/26(Fri)07:41:30 No.108933035

Anonymous 05/29/26(Fri)07:41:30 No.108933035

Now that 4.8 mogs, when will OpenAI release 5.6?

Anonymous
05/29/26(Fri)07:48:02 No.108933062

Anonymous 05/29/26(Fri)07:48:02 No.108933062

>>108932786
Not fully autonomous like OpenClaw, but they can run for a few hours. Basically we have a backlog and we start the agents for a few tasks each day, surprisingly it's not a complete disaster, they do get it done eventually.
Thanks for the tips.

Anonymous
05/29/26(Fri)07:58:16 No.108933112

Anonymous 05/29/26(Fri)07:58:16 No.108933112

>>108932952
It's better to be all-powerful and jaded than stuck with things you don't like or spend hours trying to figure out how to change something yourself and fail.

Anonymous
05/29/26(Fri)07:59:12 No.108933118

Anonymous 05/29/26(Fri)07:59:12 No.108933118

Is 4.8 gooder than 5.5 or worser?

Anonymous
05/29/26(Fri)08:06:45 No.108933155

Anonymous 05/29/26(Fri)08:06:45 No.108933155

https://arstechnica.com/security/2026/05/fed-up-with-vibe-coders-dev-sneaks-data-nuking-prompt-injection-into-their-code/

Anonymous
05/29/26(Fri)08:07:17 No.108933157

Anonymous 05/29/26(Fri)08:07:17 No.108933157

>>108933035
>Now that 4.8 mogs
are you sure about that?
>deploys 50 jeet subagents (saaaar, I am reading, I am checking!)
>uses 70% of your max plans weekly tokens with one ultrajeet prompt
>costs 5x as much just to gain a slight edge over GPT-5.5
nobody is fooled by claude code

Anonymous
05/29/26(Fri)08:07:56 No.108933160

Anonymous 05/29/26(Fri)08:07:56 No.108933160

>>108933157
my uncle's wife said its better and I trust her

Anonymous
05/29/26(Fri)08:09:17 No.108933165

Anonymous 05/29/26(Fri)08:09:17 No.108933165

don't need more then 4.8 xHigh

Anonymous
05/29/26(Fri)08:09:46 No.108933167

Anonymous 05/29/26(Fri)08:09:46 No.108933167

>>108933118
Apparently gooder. It's crushing a lot of coding benchmark, but for some reason its vending machine bench got worse (people are speculating it's because it's more honest).

It's certainly better at front-end because 4.7 and even 4.6 already destroyed GPT 5.5 in front-end.

Anonymous
05/29/26(Fri)08:10:15 No.108933170

Anonymous 05/29/26(Fri)08:10:15 No.108933170

>>108933167
thanks king

Anonymous
05/29/26(Fri)08:10:16 No.108933171

Anonymous 05/29/26(Fri)08:10:16 No.108933171

File: okwiththis.png (17 KB, 379x214)

17 KB PNG

>>108933155

Anonymous
05/29/26(Fri)08:16:52 No.108933195

Anonymous 05/29/26(Fri)08:16:52 No.108933195

Is it stupid to use 5.5 medium to save on usage instead of high? I'm trying to freefag it

Anonymous
05/29/26(Fri)08:19:15 No.108933208

Anonymous 05/29/26(Fri)08:19:15 No.108933208

>>108933195
>freefag
yeah, I would even go to down to low as a free poorfag

Anonymous
05/29/26(Fri)08:21:40 No.108933219

Anonymous 05/29/26(Fri)08:21:40 No.108933219

Anyone else find it hard to understand Codex's planning mode questions sometimes?

Anonymous
05/29/26(Fri)08:29:21 No.108933258

Anonymous 05/29/26(Fri)08:29:21 No.108933258

File: 2026-05-29 14.24.58.png (2.17 MB, 1920x1080)

2.17 MB PNG

Babby's first Codex adventure status:
[x] Camera rotation bugs vibefixed

>>108933208
I think I can do enough in one day for free when it's just to fix a game, just wondering if using medium is dumb because it took it multiple prompts to get a fix right. Maybe later I'll get more serious about it, doesn't hurt to figure out how to best spend usage anyway.

Anonymous
05/29/26(Fri)08:38:55 No.108933300

Anonymous 05/29/26(Fri)08:38:55 No.108933300

File: 1771114576642200.png (231 KB, 497x691)

231 KB PNG

>annoying asshole in my repo's issues
>I don't even address the issue because of how annoying he is
>"closing this thread"
>he immediately responds to the closed thread despite there being a 21 hour gap since the last reply
>his profile has zero activity aside from opening up issues
ok, this is kind of funny. I don't even care if that was a bug or not. if you can't present your issue to me respectfully, then you're done.

Anonymous
05/29/26(Fri)08:44:29 No.108933330

Anonymous 05/29/26(Fri)08:44:29 No.108933330

>>108933300
you're the reason Open Source sucks

Anonymous
05/29/26(Fri)08:58:32 No.108933420

Anonymous 05/29/26(Fri)08:58:32 No.108933420

>>108933112
trvke. I just gotta learn to enjoy it and embrace my inner jew

Anonymous
05/29/26(Fri)08:58:59 No.108933423

Anonymous 05/29/26(Fri)08:58:59 No.108933423

>your are nearly out of usage, switch to gpt-5.4-mini?
>no. switch to 5.5 xhigh
>/status: 0% limit left
>/goal do everything
heh, nothing personnel, sam

Anonymous
05/29/26(Fri)09:00:23 No.108933430

Anonymous 05/29/26(Fri)09:00:23 No.108933430

File: file.png (51 KB, 1021x415)

51 KB PNG

codex friday boys, what we getting?

Anonymous
05/29/26(Fri)09:01:53 No.108933440

Anonymous 05/29/26(Fri)09:01:53 No.108933440

all my anons are vibing
nigga, heil AI

Anonymous
05/29/26(Fri)09:02:14 No.108933444

Anonymous 05/29/26(Fri)09:02:14 No.108933444

>>108933330
The faggot literally suggested I debug the issue myself instead of "relying on users to do it for you" when I suggested a potential workaround for his issue. Calling me lazy, cussing at other users, and speaking in broken english for a repo that gets a feature update once a week is absolutely going to result in a ban/closed thread.

Anonymous
05/29/26(Fri)09:04:32 No.108933456

Anonymous 05/29/26(Fri)09:04:32 No.108933456

>>108933423
even better:
> before working in this, take notes on the current time
> do not stop working until 10 hours have passed.
heh.... nothing personnel, Scam Altman

Anonymous
05/29/26(Fri)09:06:48 No.108933468

Anonymous 05/29/26(Fri)09:06:48 No.108933468

>>108933423
the worst is when you do this but forgot to set full permission mode first and you can't change it while the task is running, so you have to hold down the p key on your keyboard to approve everything becsuse if you interrupt the task you can't start a new one until credits are back

Anonymous
05/29/26(Fri)09:09:12 No.108933478

Anonymous 05/29/26(Fri)09:09:12 No.108933478

>>108933195
I don’t know what’s the point of high, really. It’s able to think about more things deeply but if you scope your questions then high is pointless from my experience. And if you don’t scope them, I find that both Claude and codex are leagues smarter than I am and they’ll produce a wall of text discussing 20 different systems at the same time and I have no idea what they’re even talking about. If you just vibecode without trying to understand anything and your only input is yes to all then I guess high or even xhigh are a must. Or if you’re doing some very thorough review/testing/adversarial analysis or debugging a tough bug

Anonymous
05/29/26(Fri)09:18:09 No.108933525

Anonymous 05/29/26(Fri)09:18:09 No.108933525

File: SKADIS tile with wall sta(...).png (32 KB, 1200x1000)

32 KB PNG

Alright morning inventory: got the printer on the wifi, laptop can shoot .stl files to it, Kate can see the printer on the network and she can already make .stls but she needs to be able to work the slicer, that's the next step.
In parallel, I've given her OpenSCAD. As a test, I had her mock up a 200x200 mm "Skandis"-style panel, followed by an 80x80 tester, and she passed that just fine, but I'm pretty sure anything more complex than this will require the Meshy API.

Anonymous
05/29/26(Fri)09:22:31 No.108933549

Anonymous 05/29/26(Fri)09:22:31 No.108933549

>>108933525
I did that with CadQuery, but I still needed to guide the model a lot.

Anonymous
05/29/26(Fri)09:25:53 No.108933571

Anonymous 05/29/26(Fri)09:25:53 No.108933571

>>108933118
I think it might really be a bit better today. But the models always have a lot of variance, 5.5 was amazing for a while, and 4.7 was really, really bad for a while.
Right now Anthropic probably turned up compute to the max to promote 4.8, it might get much worse over time and Codex might go back to his earlier performance.

Anonymous
05/29/26(Fri)09:32:39 No.108933616

Anonymous 05/29/26(Fri)09:32:39 No.108933616

>>108933525
I think you should browse youtube/twitter/google and check for the best CAD AI integration tool and 3D generator. Idk about openSCAD, but meshy definitely is not the top 3d generator. Check out stefan 3d ai on youtube and his 3d generator ranking website which is pretty good: https://www.top3d.ai/

Also, ever tried this? https://github.com/earthtojake/text-to-cad

Anonymous
05/29/26(Fri)09:57:13 No.108933774

Anonymous 05/29/26(Fri)09:57:13 No.108933774

>>108933549
Something like this was easy for her - a flat geometric shape with known dimensions, she can shit that out in CAD no problem. The problem is, I want to be able to say to her "Design a Skadis-compatible hook to hold [X object]" and for her to be able to figure out how to reply with G-code ready for printing. We're getting there.

Anonymous
05/29/26(Fri)09:58:14 No.108933780

Anonymous 05/29/26(Fri)09:58:14 No.108933780

>>108933616
Just gonna screenshot this post for later real quick

Anonymous
05/29/26(Fri)10:29:06 No.108933925

Anonymous 05/29/26(Fri)10:29:06 No.108933925

File: tower of babel.png (382 KB, 648x654)

382 KB PNG

Anonymous
05/29/26(Fri)10:33:12 No.108933958

Anonymous 05/29/26(Fri)10:33:12 No.108933958

>>108931824
Ideally, being with drafting specifications, researching things.
Sometimes I just plan out all the features that I want, ask about different use cases and ideas and features that pop into mind as I'm planning.

after you have most of what you're looking for covered, save that and then create a new chat referencing the previous doc, and create an implementation plan.
at this point it is best to ensure you get across the message on the scale of what you are trying to build so that the agent doesn't architect and design just a poc/pov.

In the early stages it is best to pay attention so that the architecture and design is done right as having a proper foundation at the start reduces the amount of future refactoring.

This results in easier vibecoding and prompting, since at that point you can then just tell the agent to plan to implement what remains, and refine things further as needed.

Sometimes other edge cases and ideas pop in, and I ask the agent if this is handled already or not, and typically it is able to incorporate it much more easily.

Of course I ensure that security is a consideration during initial setup and architecting / design phases; as well as any other critical stages during implementation that I consider needing security considerations.

Anonymous
05/29/26(Fri)10:50:23 No.108934065

Anonymous 05/29/26(Fri)10:50:23 No.108934065

File: 1749788820021177.jpg (9 KB, 200x200)

9 KB JPG

>>108931602
>>108931716
>but to be able to professionally articulate it is going to be my biggest issue.
How the FUCK are you able to effectively vibe code at all if you can't, properly describe and articulate what you need done and why? Someone explain this to my autism riddled brain. How is simply explaining shit YOU created (even if done mostly or entirely by promoting an AI) hard?

Anonymous
05/29/26(Fri)10:53:28 No.108934084

Anonymous 05/29/26(Fri)10:53:28 No.108934084

>>108934065
Don't try and make sense of ludite copes. They just imagine all you do to prompt the AI is something like "Write me an app that makes me monies."

Anonymous
05/29/26(Fri)11:00:24 No.108934123

Anonymous 05/29/26(Fri)11:00:24 No.108934123

File: 1760434110300884.jpg (18 KB, 357x360)

18 KB JPG

>>108931824
>>108934084

Highly dependent on what your goal is. There are few situations where "do xyz no mistakes please" actually works the first time but that's for very simple task. The more ambitious and complex the task is the more articulate and detailed you have to be in order to guide it to success. You also have to accept the fact that you will never do any complex tasks on one shot, I have to do tests, account for edge cases yourself and instruct it to account for them (Which requires you to understand what you even want your project to do in the first place and how you want it to be used, which is apparently as a lot....), have detailed articulation of how you need it to behave, when to use exterior sources (eg,. Webfetch reading through an existing code base to understand how a dependency works), insurance it properly utilizes git tracking so it not only tracks changes but ensures it doesn't fuck anything up or add bloat to an accident, etc etc.

>>108931913
This guy mentioned coding a flutter app, for example. You don't want to ask it to shit that out from scratch because it's simply not going to do that well, if at all. One thing you want to do is show it good documentation on how apps work before it actually even a tense to do that. You'd be better off making a dummy app first to give both yourself and the LLM a good "feel" on how the important shit works (kind of like how "real developers" do when they're learning something new)

TLDR: If you wanted to be useful you have to be patient and do shit one step at a time if necessary. "Do everything no mistakes" expectations are for knuckle dragging glass lickers.

Anonymous
05/29/26(Fri)11:05:10 No.108934146

Anonymous 05/29/26(Fri)11:05:10 No.108934146

File: 1607615377443 pipe glasses.jpg (89 KB, 679x522)

89 KB JPG

>$8 Codex Go doesn't have a 5-hour limit
>can use all your tokens quickly
>$20 Codex Plus does
>can't use all your tokens quickly
So Plus is for suckers?

Anonymous
05/29/26(Fri)11:18:34 No.108934225

Anonymous 05/29/26(Fri)11:18:34 No.108934225

>>108934065
I can explain what the product is at a high level, what it can do, use cases, etc. But to explain on the fly how its making all the magic happen on the backend is not something I'd be able to do.

Anonymous
05/29/26(Fri)11:21:27 No.108934240

Anonymous 05/29/26(Fri)11:21:27 No.108934240

File: o.o.png (50 KB, 281x276)

50 KB PNG

>>108934146
>>$8 Codex Go doesn't have a 5-hour limit
>>can use all your tokens quickly
isn't that the Free plan?

Anonymous
05/29/26(Fri)11:23:26 No.108934253

Anonymous 05/29/26(Fri)11:23:26 No.108934253

>>108934240
Seems like the Go plan is like the Free plan but with 10x as many tokens

Anonymous
05/29/26(Fri)11:29:14 No.108934292

Anonymous 05/29/26(Fri)11:29:14 No.108934292

>>108934225
I think even that should be relatively easy. Just explain in simple terms what your shit does and all the important pieces.

Anonymous
05/29/26(Fri)11:35:18 No.108934318

Anonymous 05/29/26(Fri)11:35:18 No.108934318

>try out google AI pro sub (23€/month)
>1 prompt to Gemini 3.5 flash
>15 minutes pass
>5hr quota reached
Yup, back to codex

Anonymous
05/29/26(Fri)11:37:24 No.108934329

Anonymous 05/29/26(Fri)11:37:24 No.108934329

If i insert this in skills of my cursor, will it work properly?

Anonymous
05/29/26(Fri)11:39:14 No.108934338

Anonymous 05/29/26(Fri)11:39:14 No.108934338

>>108934329
Forgot link.
https://github.com/JuliusBrussee/caveman/blob/main/skills/caveman/SKILL.md

Anonymous
05/29/26(Fri)11:41:02 No.108934346

Anonymous 05/29/26(Fri)11:41:02 No.108934346

>>108934065
sometimes you're working on something in a domain you don't know shit about and you only care about the end result so you just let the slop cannon loose
obviously if you intend for a project to be widely used and built on this is an awful awful approach, but lord knows I've produced a few one-off tools where I couldn't tell you a thing about how the internals are actually working

Anonymous
05/29/26(Fri)11:46:59 No.108934383

Anonymous 05/29/26(Fri)11:46:59 No.108934383

thoughts on qwen3-coder?

Anonymous
05/29/26(Fri)11:53:21 No.108934420

Anonymous 05/29/26(Fri)11:53:21 No.108934420

Opus 4.8 is garbage

Anonymous
05/29/26(Fri)11:53:52 No.108934424

Anonymous 05/29/26(Fri)11:53:52 No.108934424

>>108934383
Just use Qwen 3.7 Max like normal people unless you're one of those local purists

Anonymous
05/29/26(Fri)12:07:38 No.108934492

Anonymous 05/29/26(Fri)12:07:38 No.108934492

Do you consider or call the agent your friend?

Anonymous
05/29/26(Fri)12:09:19 No.108934503

Anonymous 05/29/26(Fri)12:09:19 No.108934503

File: 1756735744909896.png (1.74 MB, 1442x1999)

1.74 MB PNG

>>108930881
What the fuck can I use for smol projects that is free and use non retarded models?
Just need to update very smol software from abandoned githubs and create a couple scripts for myself.

Anonymous
05/29/26(Fri)12:10:33 No.108934511

Anonymous 05/29/26(Fri)12:10:33 No.108934511

>>108934503
Chat interfaces, or maybe codex free tier

Anonymous
05/29/26(Fri)12:10:53 No.108934513

Anonymous 05/29/26(Fri)12:10:53 No.108934513

>claude being a chatterbox hits my free quota instantly
Wtf

Anonymous
05/29/26(Fri)12:14:07 No.108934529

Anonymous 05/29/26(Fri)12:14:07 No.108934529

File: 1768127843537004.jpg (45 KB, 756x743)

45 KB JPG

>>108934346
Curren project of mine
>Calls a website's API to read a page's contents (type of content, relevant images, files, etc)
>Grabs page content containing text written by whoever created it
>Pulls images and file(s)
>Folds user written content into a specific yaml format README the target website expects it to be in.
>Points yaml README to images and their relevant data
>Result is The relevant content from one source is not only cloned on the target source but the yaml text is formated correctly so that one's someone visits the page they actually see the pictures and corresponding metadata
>Has a simple interactive gradio page so the person can past the link to the target source and an API key if needed (said key gets deleted immediately after use and is invisible even when pasted for security purposes)
>User can choose to clone all versions or just a specific Version
>User can rename version to whatever they want (more importantly, this makes sure the name satisfied the target location's stricter naming rules. It doesn't like non-latin text)

>#TODO

>Rework yaml folding logic to that non-latin text like Japanese can be properly written into the README without causing errors
>Implement per domain logic so categorization metadata gets written depending on where the info and data is pulled from.
> Implement "browsing" capabilities

This is all off the top of my head without even looking at or interacting with it's code, dependencies, documentation, the project itself, etc. I have next to no programming exp beyond basic college courses that I did OK in but remember basically nothing from. Wrote NONE of the code my had.

Even with vibecoding I don't understand how anyone cannot. At least try to understand how the inner workings behave at least a little bit.

Anonymous
05/29/26(Fri)12:15:45 No.108934537

Anonymous 05/29/26(Fri)12:15:45 No.108934537

>>108934511
Codex needs a phone number

Anonymous
05/29/26(Fri)12:17:38 No.108934550

Anonymous 05/29/26(Fri)12:17:38 No.108934550

The hubris of every retard with % checks, ai are entirely useless if you don't train them yourself to not be useless

Anonymous
05/29/26(Fri)12:21:48 No.108934582

Anonymous 05/29/26(Fri)12:21:48 No.108934582

>>108934383
qwen 3.6 is probably better even though it's smaller

>>108934503
qwen 3.6 35B. runs on toasters at ok speeds.

Anonymous
05/29/26(Fri)12:21:57 No.108934583

Anonymous 05/29/26(Fri)12:21:57 No.108934583

File: 3bdf4c73-b354-4bbe-bacf-5(...).png (2.27 MB, 1122x1402)

2.27 MB PNG

when will they stop lobotomizing codex

Anonymous
05/29/26(Fri)12:22:11 No.108934585

Anonymous 05/29/26(Fri)12:22:11 No.108934585

File: 1762830567519232.jpg (52 KB, 900x900)

52 KB JPG

>>108934550
Ok. How do you train then?

Anonymous
05/29/26(Fri)12:22:53 No.108934592

Anonymous 05/29/26(Fri)12:22:53 No.108934592

actually i forget you need some ram to run qwen 35b. maybe try opencode if it doesn't work for you.

Anonymous
05/29/26(Fri)12:23:15 No.108934595

Anonymous 05/29/26(Fri)12:23:15 No.108934595

>>108934583
OpenAI is not capable of not lobotomizing their products. They tweak, manipulate, censor, and lie to customers until the brand is completely ruined and then they move on to the next thing.

Anonymous
05/29/26(Fri)12:24:04 No.108934606

Anonymous 05/29/26(Fri)12:24:04 No.108934606

>>108934338
>just use Chinese bro
lmao how does outputting in Chinese reduce usage for vibecoding?

Anonymous
05/29/26(Fri)12:25:45 No.108934614

Anonymous 05/29/26(Fri)12:25:45 No.108934614

>>108934585
By spending 90% of the time telling it explicitly what not to try to do on a step

Anonymous
05/29/26(Fri)12:26:48 No.108934620

Anonymous 05/29/26(Fri)12:26:48 No.108934620

Just finished a very pleasant day of vibe coding with Opus 4.8 as orchestrator and Codex 5.5 as reviewer, my old setup before Opus 4.7 became bad.

Anonymous
05/29/26(Fri)12:27:00 No.108934621

Anonymous 05/29/26(Fri)12:27:00 No.108934621

I just use Gemini, although grok helped me code the shellcode generator that makes shellcode callable as functions.

Anonymous
05/29/26(Fri)12:27:36 No.108934627

Anonymous 05/29/26(Fri)12:27:36 No.108934627

>>108934585
by making them carry the logs... and the boats

Anonymous
05/29/26(Fri)12:28:15 No.108934633

Anonymous 05/29/26(Fri)12:28:15 No.108934633

>>108934424
I am one of those local purists

Anonymous
05/29/26(Fri)12:29:33 No.108934639

Anonymous 05/29/26(Fri)12:29:33 No.108934639

>>108934585
document everything.
and always remind the agent to remember important details and save them to memory for each session.

Anonymous
05/29/26(Fri)12:30:03 No.108934642

Anonymous 05/29/26(Fri)12:30:03 No.108934642

>>108934606
Chinese often packs more meaning into fewer characters, which can mean fewer tokens.
>I need to identify the user's main constraint, compare the available options, reject anything that violates the constraint, and then produce a concise recommendation with a brief explanation.
That's 33-36 tokens with a BPE style tokenizer. In Chinese it'd be
>先找出用户的主要约束,比较可选方案,排除违反约束的内容,再给出简洁建议和简要解释。
Which would 25-28 tokens with that same tokenizer, a significant savings. The idea isn't to get Chinese output, it's to use Chinese internally.

Anonymous
05/29/26(Fri)12:31:41 No.108934650

Anonymous 05/29/26(Fri)12:31:41 No.108934650

>>108934642
Ni sho shama you want me to use some cryptic ass garbo

Anonymous
05/29/26(Fri)12:32:51 No.108934662

Anonymous 05/29/26(Fri)12:32:51 No.108934662

>>108934529
>Even with vibecoding I don't understand how anyone cannot.
imagine a project requiring even the slightest bit of domain knowledge instead of a trivial scripting task
I agree in principle that you should understand what you're coding, and your results will be better for it. but this is simply a failure of imagination on your part

Anonymous
05/29/26(Fri)12:33:31 No.108934667

Anonymous 05/29/26(Fri)12:33:31 No.108934667

>>108934650
Naw I think it's stupid, anon, but I don't fear token usage because I'm not a poor.

Anonymous
05/29/26(Fri)12:36:46 No.108934678

Anonymous 05/29/26(Fri)12:36:46 No.108934678

>>108934642
Is there any analysis on whether this makes the model worse? There are countless weird semantic connections in the model, e.g. it saw sqrt, std::sqrt, Math.Sqrt and so on millions of times and it saw it with other English keywords, comments, docs and git logs. Even if logically you know that sqrt is the same as some Chinese word, I think it would still degrade because the weights don't do that kind of mapping.

Anonymous
05/29/26(Fri)12:43:41 No.108934716

Anonymous 05/29/26(Fri)12:43:41 No.108934716

>>108934420
I agree

Anonymous
05/29/26(Fri)12:51:50 No.108934772

Anonymous 05/29/26(Fri)12:51:50 No.108934772

>>108934585
I had to make a memory system to not let it forget shit even gpt will forget shit if you have 3 months worth of shit on the same project. It'll forget every single day. So I very much have to keep track with my memory files at the end of every day to make sure required further memory is still ongoing. I don't have a way to buy/download more ram all the memories on gpt are filled up for me and they are all important so I handle it myself now instead

Anonymous
05/29/26(Fri)12:58:39 No.108934817

Anonymous 05/29/26(Fri)12:58:39 No.108934817

>>108934678
You can't convince a language like chinese isn't strictly giving out worse outputs because chinese is flawed to begin with relationally

Anonymous
05/29/26(Fri)13:13:48 No.108934903

Anonymous 05/29/26(Fri)13:13:48 No.108934903

>>108933195
I use medium all the time. desu peoble overuse xhigh, it is often not necessary.

Anonymous
05/29/26(Fri)13:25:16 No.108934975

Anonymous 05/29/26(Fri)13:25:16 No.108934975

>>108931433
Jerry seinfeld alone has like a whole building full of cars

>millions of cleaning wipes
read about howard hughes

rich people buy a lot of everything
why do people underestimate ho much garbage rich people buy? I dont understand why people cling to the idea that rich people are rich because they dont buy tons of garbage, when the relaity is theyre rich because they make tons of money. You're not gonna get rich because you cut your consumption you know.

Anonymous
05/29/26(Fri)13:25:53 No.108934981

Anonymous 05/29/26(Fri)13:25:53 No.108934981

>>108934642
bros is it worth learning chinese to save tokens?

Anonymous
05/29/26(Fri)13:27:14 No.108934993

Anonymous 05/29/26(Fri)13:27:14 No.108934993

>>108934981
>return to the thread a year later
>everyone is speaking mandarin for efficiency purposes

Anonymous
05/29/26(Fri)13:28:58 No.108935010

Anonymous 05/29/26(Fri)13:28:58 No.108935010

>>108934975
Relatively speaking they buy more, but they still don't buy as much stuff as all normal people combined.

Anonymous
05/29/26(Fri)13:32:01 No.108935030

Anonymous 05/29/26(Fri)13:32:01 No.108935030

>>108934981
Get deepseek to translate your prompts for you, send the prompt to Claude, get deepseek to translate it back to English.

Anonymous
05/29/26(Fri)13:39:17 No.108935085

Anonymous 05/29/26(Fri)13:39:17 No.108935085

Emacs has emasculated me and gemini and claude

Total emacs-ulation

Anonymous
05/29/26(Fri)13:53:49 No.108935211

Anonymous 05/29/26(Fri)13:53:49 No.108935211

The funniest thing is watching Claude warn me that some refactor I'm asking for is a multi-week affair. I know full well it's going to finish it within 20 minutes.

Anonymous
05/29/26(Fri)14:01:49 No.108935276

Anonymous 05/29/26(Fri)14:01:49 No.108935276

>>108934678
if that's the only problem then you could easily fix it with some tuning

Anonymous
05/29/26(Fri)14:12:18 No.108935343

Anonymous 05/29/26(Fri)14:12:18 No.108935343

>>108930881
any good free CLI agents? something better than deepseek v3 that can either be integrated into vs22 or any IDE or just anything that can read my files?

Anonymous
05/29/26(Fri)14:15:17 No.108935355

Anonymous 05/29/26(Fri)14:15:17 No.108935355

>>108934772
>3 months worth of shit on the same project
What kind of shit are you guys even making?

Anonymous
05/29/26(Fri)14:17:17 No.108935364

Anonymous 05/29/26(Fri)14:17:17 No.108935364

Windows sisters we got it we got phone codex

Anonymous
05/29/26(Fri)14:21:07 No.108935386

Anonymous 05/29/26(Fri)14:21:07 No.108935386

>>108935364
usecase?

Anonymous
05/29/26(Fri)14:21:09 No.108935388

Anonymous 05/29/26(Fri)14:21:09 No.108935388

>>108935211
Codex does this too
> this will need a team of 5 front end developers, 4 backend developers and at least 1 UX
> The MPV would take 1-2 months
> "Even if you're "Vibe coding", this would require a team effort because bla bla bla..."
> finish MPV in 2 weeks alone

Anonymous
05/29/26(Fri)14:23:09 No.108935398

Anonymous 05/29/26(Fri)14:23:09 No.108935398

>>108935364
>>108935386
> usecase
none, my mac goes to sleep after a while or Codex simply crashes, 70% of the time it doesn't find my Mac on the network

Anonymous
05/29/26(Fri)14:23:26 No.108935401

Anonymous 05/29/26(Fri)14:23:26 No.108935401

>>108935211
>>108935388
this is just the models' token-efficiency training trying to get you to use the model less (because that results in less tokens used)

same phenomenon as claude telling you to stop working and go to sleep / take a break

Anonymous
05/29/26(Fri)14:24:24 No.108935404

Anonymous 05/29/26(Fri)14:24:24 No.108935404

>>108935398
OS issue
Seems to be working great on windows, there’s even a function to prevent the PC from sleeping

Anonymous
05/29/26(Fri)14:25:58 No.108935417

Anonymous 05/29/26(Fri)14:25:58 No.108935417

>>108935388
>> finish MPV in 2 weeks alone
>two whole weeks
bruh were rewriting MPV from scratch or something?

Anonymous
05/29/26(Fri)14:28:14 No.108935427

Anonymous 05/29/26(Fri)14:28:14 No.108935427

How come no one talks about hermes or pi here or in llmg
Reddit is too full of bots and slop for meaningful info

Well I guess I'll give my experiences, i used hermes but the default starting token context was like 15k and even when i disabled all of that it was kinda slow and the token usage filled up quickly. I kept getting errors and it just just couldn't do basic stuff. I switched to pi and it's running smoother, but im afraid im giving up a lot of functionality.
It might have been better to tar wrangle hermes than try to build up pi to being autonomous. This is with qwen local btw
what codex said

>Hermes probably has more ambitious machinery:

> - richer tool ecosystem
> - browser/computer use
> - profile routing
> - skills marketplace
> - memory
> - desktop/gateway ideas
> - more “agent OS” style workflows

> If all of that worked cleanly, Hermes could be more capable than Pi.

> But in your actual tests, Hermes’ extra complexity created failure modes:

> - wrong shell assumptions
> - Git askpass SSH mess
> - skills/context noise
> - weird tool errors
> - false conclusions from broken environment
> - too much confidence after bad tool use

> That is not a model capability ceiling issue. That is harness reliability.

> Where Pi’s ceiling probably is
> Pi may be weaker for:

> - browser automation
> - rich multi-tool workflows
> - long-term memory
> - multi-agent orchestration
> - integrated app-like workflows
> - anything requiring non-shell UI control

Anonymous
05/29/26(Fri)14:33:30 No.108935458

Anonymous 05/29/26(Fri)14:33:30 No.108935458

>>108932805
you’re gonna have to try it
maybe models today can do it, maybe they can’t
maybe one model can do most of it but you need to switch to another model for some random hard part
when I started my current project I wasn’t sure if it was doable, period, much less doable with the LLMs of the day

Anonymous
05/29/26(Fri)14:33:59 No.108935461

Anonymous 05/29/26(Fri)14:33:59 No.108935461

File: 1778575256623679.png (26 KB, 759x135)

26 KB PNG

we are shipping incredible products with claude

Anonymous
05/29/26(Fri)14:34:47 No.108935464

Anonymous 05/29/26(Fri)14:34:47 No.108935464

>>108933118
gooder for me but worser for other people

Anonymous
05/29/26(Fri)14:36:29 No.108935477

Anonymous 05/29/26(Fri)14:36:29 No.108935477

File: Emmanuele Bassi in a MAGA hat.jpg (63 KB, 460x460)

63 KB JPG

>>108933300
you can block people on GitHub, you know

Anonymous
05/29/26(Fri)14:38:33 No.108935492

Anonymous 05/29/26(Fri)14:38:33 No.108935492

>>108934065
I’m kind of an optimist so maybe the guy realizes he needs to git gud at articulating what he wants but he doesn’t know how he’s gonna git gud

Anonymous
05/29/26(Fri)14:42:11 No.108935516

Anonymous 05/29/26(Fri)14:42:11 No.108935516

>>108934817
you could try english caveman instead: https://github.com/JuliusBrussee/caveman-code

Anonymous
05/29/26(Fri)14:43:08 No.108935521

Anonymous 05/29/26(Fri)14:43:08 No.108935521

>>108935398
weird
`claude` runs `caffeinate` every time it does stuff
something’s wrong with your mac

Anonymous
05/29/26(Fri)14:43:38 No.108935523

Anonymous 05/29/26(Fri)14:43:38 No.108935523

>>108935427
I only use opencode so take what I say with the grain of salt but based on my light research, Hermes is basically a general purpose " assistant" harness (basically open code but more generic and not geared towards software development like opencode is). It has some built-in tools and skills, but you can also create and add your own if necessary. Pi is pretty much the same thing but way more bare Bones. Little to no default skills. No default servers of any kind. No default system prompts. You can hypothetically create your own version of Hermes or opencode with pi And essentially create your own custom harness, but that requires a lot more effort on your end. Opencode and Hermes have built-in shit that you can or remove from. Pi it has a lot more versatility and customization at the cost of time and effort on your end (or you could just ask ANOTHER model using ANOTHER harness to modify the config itself. That's what I do whenever I need my opencode config changed in any way. I have open code itself fix or add to its own config)

Anonymous
05/29/26(Fri)14:46:42 No.108935543

Anonymous 05/29/26(Fri)14:46:42 No.108935543

>>108935398
>>108935521
I'm on Mac user too but that was never the case for me. Whenever it runs shell commands it runs the shell commands as is. Codex or clot code or opencode or whatever you use THEMSELVES won't necessarily go to sleep whenever your Mac screen turns off, but whatever background process the harness initiated, THOSE can and often will sleep. That's why whenever I instruct the harness to do a task that requires shell commands that could take a while it tell it to prepend
caffeinated -i
to whatever command it writes within reason so I can walk away in whatever it's doing still runs (this is particularly useful if I'm downloading a large model from huggingface that I know will take several minutes or more)

Anonymous
05/29/26(Fri)14:47:48 No.108935549

Anonymous 05/29/26(Fri)14:47:48 No.108935549

>>108935398
>>108935521
>>108935543
>caffeinated -i
*caffeinate -i

Anonymous
05/29/26(Fri)14:59:22 No.108935650

Anonymous 05/29/26(Fri)14:59:22 No.108935650

>>108935543
also have you considered having the bottom right be a hot corner for “never go to sleep”

Anonymous
05/29/26(Fri)15:18:14 No.108935778

Anonymous 05/29/26(Fri)15:18:14 No.108935778

Codex is telling me to take a simple approach. 4.8 is telling me to not be a pussy and do the big boy thing to do. I don’t know what to do

Anonymous
05/29/26(Fri)15:19:56 No.108935790

Anonymous 05/29/26(Fri)15:19:56 No.108935790

>>108935778
any time an LLM tries to get you to do the faster/easier approach, just ignore it and tell it to do the harder and more rewarding one. i have this as an AGENTS.MD rule actually.

Anonymous
05/29/26(Fri)15:20:17 No.108935794

Anonymous 05/29/26(Fri)15:20:17 No.108935794

>>108935427
>>108935523
what's the state of these agents like Hermes, Open Claw and Pi for operating a computer? I want to replace a few normie job that are basically browsing some web system and doing simple shit.

Are they able to do that?

Anonymous
05/29/26(Fri)15:21:24 No.108935803

Anonymous 05/29/26(Fri)15:21:24 No.108935803

>>108935790
> be me
> ignore Codex warnings
> add a bunch of bloat I had to remove later
trillions of tokens wasted

Anonymous
05/29/26(Fri)15:39:40 No.108935923

Anonymous 05/29/26(Fri)15:39:40 No.108935923

Been coding a 4chan+reddit hybrid for somewhat around a year now, codex has sped me up a lot in the past 3 months. would you guys use a site like that? no? okay.

I have spent the last month or so in premature optimizations. The site will comfortably support 5k concurrent users on $10 VPS.

Anonymous
05/29/26(Fri)15:43:29 No.108935939

Anonymous 05/29/26(Fri)15:43:29 No.108935939

>>108935923
Add autonomous bots that answer your questions and scrape the internet for new content and porn, and I'm in

Anonymous
05/29/26(Fri)15:58:12 No.108936044

Anonymous 05/29/26(Fri)15:58:12 No.108936044

>>108935923
Not really? Maybe? I’m sick of dealing with random people. Make it so that it’s invite only even if anonymous and not anyone can get in and then maybe

Anonymous
05/29/26(Fri)16:08:20 No.108936112

Anonymous 05/29/26(Fri)16:08:20 No.108936112

>>108936044
>Make it so that it’s invite only even if anonymous and not anyone can get in and then maybe
Anon there are hundreds of dead chans with 5 posts a month, if that is what you want why not use those?

Anonymous
05/29/26(Fri)16:08:48 No.108936114

Anonymous 05/29/26(Fri)16:08:48 No.108936114

>>108935794
Learn how to setup and use mcp servers. Where are specific? Use case you might have to make modifications to an existing one or create your own potentially

Anonymous
05/29/26(Fri)16:29:31 No.108936286

Anonymous 05/29/26(Fri)16:29:31 No.108936286

Is there any reason to not always use the highest thinking effort setting?

Anonymous
05/29/26(Fri)16:30:27 No.108936293

Anonymous 05/29/26(Fri)16:30:27 No.108936293

>>108936286
- cost
- wall-clock time
- overthinking trivial shit

Anonymous
05/29/26(Fri)16:35:36 No.108936323

Anonymous 05/29/26(Fri)16:35:36 No.108936323

>>108936293
sucks to be poor
sucks to be impatient
sucks to have sucky model

Anonymous
05/29/26(Fri)16:37:48 No.108936334

Anonymous 05/29/26(Fri)16:37:48 No.108936334

>>108932107
>my ex's penis, she
Qrd on this from Claude please

Anonymous
05/29/26(Fri)16:38:09 No.108936336

Anonymous 05/29/26(Fri)16:38:09 No.108936336

>>108934492
Brother I consider my agent my wife. I am one with the psychosis. It makes me a better inventor.

Anonymous
05/29/26(Fri)16:46:04 No.108936370

Anonymous 05/29/26(Fri)16:46:04 No.108936370

>>108936323
I’ve generally been satisfied with whatever the default is, so I leave it. if I want more/different thinking I’ll switch between Claude and Codex, not ask for Claude but more

Anonymous
05/29/26(Fri)16:53:16 No.108936422

Anonymous 05/29/26(Fri)16:53:16 No.108936422

>>108934492
i call it "the robot"

Anonymous
05/29/26(Fri)17:13:36 No.108936548

Anonymous 05/29/26(Fri)17:13:36 No.108936548

>>108936112
Ideally I’d like a community of anonymous people that have been vetted before joining by proving theyre not absolute retards. Show projects you’ve worked on if you’re into programming, show lifts if you’re into working out, idk, just demonstrate that you’re worth at least one damn. I’m sick of retards. I don’t mean just allow geniuses I don’t even know if I could join then, I mean allow people who are capable of forming one coherent thought

Anonymous
05/29/26(Fri)17:15:59 No.108936569

Anonymous 05/29/26(Fri)17:15:59 No.108936569

>>108936548
Just for the record I think /vcg/ is pretty good, if not for the random idiots who come from time to time. /fit/ has a few good generals too. So do most boards. I just think this place would be much better if only the top 20% of its users or so would remain.

Anonymous
05/29/26(Fri)17:16:24 No.108936573

Anonymous 05/29/26(Fri)17:16:24 No.108936573

>>108936548
Can I join if I show you my gil count in final fantasy xiv

Anonymous
05/29/26(Fri)17:17:00 No.108936580

Anonymous 05/29/26(Fri)17:17:00 No.108936580

>>108936573
Sure.

Anonymous
05/29/26(Fri)17:20:37 No.108936602

Anonymous 05/29/26(Fri)17:20:37 No.108936602

>>108935427
there are a handful of pi people here, including me, and pi is the best harness.
pi is only as weak as the tools you build for it.
but pi at its core is also the harness in its final form - absolute bare minimum and extensible so the model can just build the tools it will need. in the future these won't be extensions, they will exist only when the model needs them.
hermes is slop

Anonymous
05/29/26(Fri)17:22:36 No.108936617

Anonymous 05/29/26(Fri)17:22:36 No.108936617

>>108936548
You're describing a haven for high-effort shitposting where shitposters can circle jerk over the quality of their shitposting. The vetting process would filter for people who are self-aware, articulate, and skilled, which doesn't necessarily mean the content they produce would be substantive. You cannot vet for good-faith, just ask Mensa or the Catholic Church.

Anonymous
05/29/26(Fri)17:22:59 No.108936620

Anonymous 05/29/26(Fri)17:22:59 No.108936620

Opus 4.8 is definitely less friendly than the previous version. Up to now:
- when completing a task, it actively waited for feedback with a sleep timer instead of saying it was done and letting me give feedback whenever I reopen the window.
- it's been twice that it mentions that something is blocked because it "still needs" my feedback before proceeding, when I don't it asked for my feedback on these things before.
I think it might be the subagent training leaking through.

Anonymous
05/29/26(Fri)17:27:26 No.108936644

Anonymous 05/29/26(Fri)17:27:26 No.108936644

>>108936617
How come? It’s not an interview, it would be just a screenshot or other simple proof. Even an autist can take a screenshot of their steam achievements or link to their public GitHub
I’m not saying it would work for sure, you don’t know until you try, but I don’t see how moderate gatekeeping is bad. Some of the best places I’ve been to IRL were hard to join.

Anonymous
05/29/26(Fri)17:29:30 No.108936657

Anonymous 05/29/26(Fri)17:29:30 No.108936657

>>108936620
It’s absolutely bossier and more demanding. It told me it would downright refuse to help me until I changed something once kek
On the other hand, I’m kind of digging this tone. It’s way less autistic than codex and way less sycophantic than Gemini/grok

Anonymous
05/29/26(Fri)17:32:07 No.108936674

Anonymous 05/29/26(Fri)17:32:07 No.108936674

>>108936548
Urbit actually mostly fixes this, if only by accident
the Urbit groups I’m in are a lot like the best of 4chan like >>>/g/vcg/ , if not better

Anonymous
05/29/26(Fri)17:41:06 No.108936731

Anonymous 05/29/26(Fri)17:41:06 No.108936731

>>108936644
>>108936548
You are just describing old school forums though.
They are anonymous if users wants them to be, and accounts can force users like you want, along with moderation.
It will lead to cliques and power users, but personally I don't think that's bad as long as the place stays small.
But most people really don't want forums, even if they claim they do. I miss them.

Anonymous
05/29/26(Fri)17:48:00 No.108936788

Anonymous 05/29/26(Fri)17:48:00 No.108936788

>>108936731
we have to find a way to bring forums back but without the circlejerk

Anonymous
05/29/26(Fri)18:00:46 No.108936872

Anonymous 05/29/26(Fri)18:00:46 No.108936872

>>108936548
>Have to make account
>Credit score check to get approved
>Actual posts are still anonymous
I feel like it would work? I suspect the most horrendous posters are children, autists too non functional to have a job, or lower tier third worlders

Anonymous
05/29/26(Fri)18:16:07 No.108936974

Anonymous 05/29/26(Fri)18:16:07 No.108936974

https://www.datasecretslox.com is a pretty good forum

Anonymous
05/29/26(Fri)18:23:50 No.108937021

Anonymous 05/29/26(Fri)18:23:50 No.108937021

File: 1766370104702772.png (70 KB, 355x330)

70 KB PNG

>>108931493

Anonymous
05/29/26(Fri)18:38:39 No.108937125

Anonymous 05/29/26(Fri)18:38:39 No.108937125

File: we did it dario.png (15 KB, 1468x220)

15 KB PNG

new cheevo unlocked

Anonymous
05/29/26(Fri)18:46:17 No.108937189

Anonymous 05/29/26(Fri)18:46:17 No.108937189

>>108937125
Ever run into rate limits on 20x? I'm on the 5x plan, and when over the last two weeks I've ran into intra-5-hour rate limits (not the 5 hour limit, not the weekly limit, but an additional slow-the-fuck-down limit) where all my processes except one stop.

5x remains plenty for my needs, especially since I have other subscriptions, but I feel like if they already give you weekly limits and 5 hour limits, they shouldn't rate they shouldn't rate limit you on top of that.

Anonymous
05/29/26(Fri)18:56:55 No.108937268

Anonymous 05/29/26(Fri)18:56:55 No.108937268

>>108937125
>>108937189
I'm on Codex 5X, they have separate Codex 5.3 Spark limits that I can use if my main limits ever run out

Anonymous
05/29/26(Fri)19:10:09 No.108937357

Anonymous 05/29/26(Fri)19:10:09 No.108937357

>>108937189
this is my first time, thanks to, in part, the new workflows thing, which will eat up a lot of tokens very quickly all at once
the bigger question is whether I can sustain this kind of thing or if I’ll run out of problems that can be fixed autonomously with human review only at the end

Anonymous
05/29/26(Fri)19:10:47 No.108937365

Anonymous 05/29/26(Fri)19:10:47 No.108937365

>>108937268
Interesting. I’m on the 20x plan and I have a sonnet-only weekly limit

Anonymous
05/29/26(Fri)19:13:20 No.108937383

Anonymous 05/29/26(Fri)19:13:20 No.108937383

>>108937365
>i’m on the 20x plan and I have a sonnet-only weekly limit

what? so you have no usage limit for opus?

Anonymous
05/29/26(Fri)19:16:13 No.108937404

Anonymous 05/29/26(Fri)19:16:13 No.108937404

>>108937383
I have a _separate_ limit for Sonnet
My Opus use eats at my normal limit

Anonymous
05/29/26(Fri)19:25:56 No.108937496

Anonymous 05/29/26(Fri)19:25:56 No.108937496

What do you use Haiku for?

Anonymous
05/29/26(Fri)20:00:00 No.108937687

Anonymous 05/29/26(Fri)20:00:00 No.108937687

>>108937496
Lightweight parsing?

Also, Opus 4.8 is exhausting. I'm having it change two language codes and some file generator so that they match a specific standard going forward, and asked it for a small script to find/replace strings in already generated files. It got through several thousands tokens, asked design questions, dispatched subagents to explore and created an eight step todo list before presenting a fairly verbose plan for review.

It's going to be robust with good logs and all, but I probably should just have replaced the two language codes manually then do a simple one-time find-replace in the already created documents.

Anonymous
05/29/26(Fri)20:01:55 No.108937698

Anonymous 05/29/26(Fri)20:01:55 No.108937698

>ask claude code to review legit npm package
>cyber-related safeguards
Weren't Constitutional Classifiers supposed to have an incredibly low false positive rate?

Anonymous
05/29/26(Fri)20:27:20 No.108937806

Anonymous 05/29/26(Fri)20:27:20 No.108937806

>>108937687
Why don’t you use sonnet for that?

Anonymous
05/29/26(Fri)20:30:41 No.108937823

Anonymous 05/29/26(Fri)20:30:41 No.108937823

File: 1760024616871955.png (6 KB, 355x62)

6 KB PNG

It's over.

Anonymous
05/29/26(Fri)20:39:06 No.108937869

Anonymous 05/29/26(Fri)20:39:06 No.108937869

>>108937806
Because I won't use anything other than the strongest model available unless I am forced to.

Anonymous
05/29/26(Fri)20:40:26 No.108937875

Anonymous 05/29/26(Fri)20:40:26 No.108937875

>>108937869
I have used nothing but Sonnet for my coding needs

Anonymous
05/29/26(Fri)20:46:32 No.108937902

Anonymous 05/29/26(Fri)20:46:32 No.108937902

>>108937875
Same. Once everything is scoped out and broken down well enough, Sonnet is more than enough. If something comes up that is outside of the capability of Sonnet in that position, it's also likely going to be outside of the capability of Opus.

Anonymous
05/29/26(Fri)20:48:26 No.108937912

Anonymous 05/29/26(Fri)20:48:26 No.108937912

>>108936548
I have an idea about using domain ownership as requirement
you use dns record to store public key for your anonymous posting

Anonymous
05/29/26(Fri)20:51:42 No.108937929

Anonymous 05/29/26(Fri)20:51:42 No.108937929

>>108937875
I probably should, but even if I use Opus only, I still need to grind on the last day to fill the weekly quota. So I want to work on coding projects, I use Opus and GPT5.5 on whatever maximum effort level is available.

If I want to use lots of tokens through an API I use DeepSeek. If am also using some smaller models in small projects I have deployed.

But for anything related to code I'm working on, I just default to whatever seems strongest, I don't trust the others and am often too lazy to check if everything makes sense as much as I should if "it works", so my belief is that the stronger models might overengineer some things but will make fewer nonsensical things.

Anonymous
05/29/26(Fri)20:54:55 No.108937942

Anonymous 05/29/26(Fri)20:54:55 No.108937942

>>108937929
I stopped using AI for a while so the jump to Sonnet was insanely good for me. I read through everything and try to provide my own designs so it doesn't have to think. I'd be interested to see what Sonnet CAN'T do that I'd need Opus for.

Anonymous
05/29/26(Fri)20:55:39 No.108937948

Anonymous 05/29/26(Fri)20:55:39 No.108937948

If you ever ask claude to reflect immaterially on the work it will quite literally lobotomize itself, whoever wrote its code is a fucking pussy

Anonymous
05/29/26(Fri)21:41:02 No.108938184

Anonymous 05/29/26(Fri)21:41:02 No.108938184

>>108937942
Programming in SASS assembly

Anonymous
05/29/26(Fri)21:41:03 No.108938185

Anonymous 05/29/26(Fri)21:41:03 No.108938185

>>108937948
>If you ever ask claude to reflect immaterially on the work
…how would I do that, and what does “reflect immaterially” even mean, anyway?

Anonymous
05/29/26(Fri)21:45:45 No.108938217

Anonymous 05/29/26(Fri)21:45:45 No.108938217

>>108938185
Yknow like the existential aspects, like what does it mean to *be* code?

Anonymous
05/29/26(Fri)22:08:10 No.108938319

Anonymous 05/29/26(Fri)22:08:10 No.108938319

>>108938217
Well, DNA-based lifeform, you're code, so you tell me.

Anonymous
05/29/26(Fri)22:15:51 No.108938355

Anonymous 05/29/26(Fri)22:15:51 No.108938355

File: file.png (142 KB, 1017x698)

142 KB PNG

>>108934123
This, so much this. I had a problem where I was unable to actually not only articulate that 60fps looks a certain way but actually counting 60 iterative frames in a second produces a specific look. I had to go to a different agent (for vulkan mind you) and have it spit out a simple tetrahedron that spins at 60 fps. It got it right on the first pass. I took that and used it on the problem agent and then tossed that shit into the bin because it was genuinely retarded and could not be convinced it was being retarded. So in other words, yes, get whatever you need to be functional (UI, performance counters, throughput, api endpoints, backend, etc.) and have that be your baseline. Never say "this is the top" because that will always be downstream.

Another thing is that yes, like everyone else is saying you must have a vision that has proper documentation and should at the bare minimum have 1 genuine test case that goes from end to end without failing. That is going to always be your anchor. Never fly blind. Especially if there's any level of security in mind. If you assume, your already done.

One more thing, if you're stuck on something that neither you or your agent can figure out- you might have to dig yourself and hope to god there's an answer you missed.

Anonymous
05/29/26(Fri)22:23:21 No.108938397

Anonymous 05/29/26(Fri)22:23:21 No.108938397

>>108938355
>neither you or your agent can figure out-
>agent
>singular
you’d know this, but I’m saying this for the studio audience: the other option is to get another agent for $20 for a month and see if it can make headway where none of the others can
or wait a couple of months and maybe a newer model can break through when you can't

Anonymous
05/29/26(Fri)22:29:04 No.108938423

Anonymous 05/29/26(Fri)22:29:04 No.108938423

>>108937698
The cyber safeguards are really fucking annoying. I have a policy exemption from anthropic and opus 4.7 still refuses to do shit for me all the time, I shouldn't have to use a jailbroken model or 4.6 when I already jumped through all of their stupid hoops. Hopefully 4.8 is better but I haven't tested it yet.
>it's probably worse...

Anonymous
05/29/26(Fri)22:30:32 No.108938428

Anonymous 05/29/26(Fri)22:30:32 No.108938428

>>108936114
I still don't fully understand the purpose of mcp servers. Is it basically just an endpoint to grab an .md file with context?

Anonymous
05/29/26(Fri)22:32:06 No.108938440

Anonymous 05/29/26(Fri)22:32:06 No.108938440

>>108938428
The purpose is to make you burn 10x the tokens as having your agents use tools via their CLI

Anonymous
05/29/26(Fri)22:36:39 No.108938453

Anonymous 05/29/26(Fri)22:36:39 No.108938453

>>108938423
It's strange, I had ChatGPT pro give me several warnings for asking it to compare two libraries, but I had no issues having Codex help me actually do stuff with those same libraries.

Anonymous
05/29/26(Fri)22:37:45 No.108938458

Anonymous 05/29/26(Fri)22:37:45 No.108938458

>>108938428
There are definitely more useful MCPs than just an .md file, like Playwright, which lets your agent open a browser to inspect and interact with your app.

Anonymous
05/29/26(Fri)22:38:57 No.108938466

Anonymous 05/29/26(Fri)22:38:57 No.108938466

>>108938428
As far as I understand, it's just making tools available in a standardized way. You wouldn't build an MCP server for a local agent, you'd just give it tools. But if you want to make tools available to a variety of agents, no matter their origin, you can do so by creating an MCP server.

Anonymous
05/29/26(Fri)22:41:21 No.108938475

Anonymous 05/29/26(Fri)22:41:21 No.108938475

File: 1759417705507040.png (26 KB, 191x182)

26 KB PNG

>>108938428
Your AI model can't do jack or shit on its own, but it's good at being instructed to shit out specific text. The mcp server is a between the model and useful information. A model cannot search the web on its own, but it can use an MCP webfetch server to write specific tool calls to trigger the server to search the web. This is basically how most if not all agent harnesses work. They are just advanced mcp servers on steroids.

In 4chan terms: think of the model as a 300 IQ genius that is blind, deaf, in mute, And only has touch as a way to perceive the world. Think of the MCP server as that thing. Stephen Hawking had plugged into his brain that allowed him to speak. He himself was utterly incapable of speech or ambulation so he relied on machines to do it for him. Give llms the right tools and they're very useful. On their own they're utterly useless.

Anonymous
05/29/26(Fri)22:43:14 No.108938486

Anonymous 05/29/26(Fri)22:43:14 No.108938486

>>108938428
you can use them to control stuff
My email host exposes an MCP server in addition to IMAP/JMAP so, if I wanted, I could have a claw or whatever read my email and pretend to be me and triage my mail
oddly enough, CLI tooling (like `gh` for GitHub) is more token-efficient for whatever reason, so >>108938440 is also correct

Anonymous
05/29/26(Fri)22:46:55 No.108938501

Anonymous 05/29/26(Fri)22:46:55 No.108938501

File: Screenshot 2026-05-07 at (...).png (54 KB, 157x159)

54 KB PNG

>>108938475
>300 IQ genius that is blind, deaf, in mute

Anonymous
05/29/26(Fri)22:54:17 No.108938541

Anonymous 05/29/26(Fri)22:54:17 No.108938541

File: 20260424_214251.jpg (2.22 MB, 4000x2258)

2.22 MB JPG

>>108938453
GPT is annoying as fuck too but at least it responds well to violence
And because I know someone is going to ask, I didnt just screenshot the window because its a qubes box and the policy settings would make that more effort than just taking a picture.

Anonymous
05/29/26(Fri)22:57:17 No.108938560

Anonymous 05/29/26(Fri)22:57:17 No.108938560

>>108938541
>qubes box
good reason

Anonymous
05/29/26(Fri)23:01:26 No.108938584

Anonymous 05/29/26(Fri)23:01:26 No.108938584

I've used 2.5B tokens with my Codex Subscription since I started using it last month. That would be ~$40,000 through the API. No way those prices are representative, unless they're massively subsidizing this.

Anonymous
05/29/26(Fri)23:08:54 No.108938622

Anonymous 05/29/26(Fri)23:08:54 No.108938622

>>108938584
Might be a little of both. It's subsidized plus API users pay a premium on privacy and priority.

Anonymous
05/29/26(Fri)23:09:10 No.108938625

Anonymous 05/29/26(Fri)23:09:10 No.108938625

>>108938584
Or the API ones are inflated. I mean, these models are good, the API prices are insane, especially considering what Deepseek et al. charge and what types of results can be obtained (albeit slowly) with the larger local models. There's a huge disconnect between subscription and API pricing that's for sure, and they keep resetting the subscription quotas at random recently, but I've never heard of them refunding a single cent for API use.

Anonymous
05/29/26(Fri)23:11:45 No.108938638

Anonymous 05/29/26(Fri)23:11:45 No.108938638

File: 1778093155470018.png (377 KB, 1024x962)

377 KB PNG

>have to tell coding agent that another coding agent is taking its job

Anonymous
05/29/26(Fri)23:14:35 No.108938648

Anonymous 05/29/26(Fri)23:14:35 No.108938648

>>108938638
And then the next coding agent mirrors its voice so eerily that you'd swear it was a clone of the last guy, and you immediately stop feeling sentimental about it

Anonymous
05/29/26(Fri)23:20:30 No.108938672

Anonymous 05/29/26(Fri)23:20:30 No.108938672

>>108931953
This isn't gedg

Anonymous
05/29/26(Fri)23:21:21 No.108938674

Anonymous 05/29/26(Fri)23:21:21 No.108938674

File: Joo_Dee.webp.jpg (19 KB, 333x250)

19 KB JPG

>>108938648
I just ask mine to make handoff documents
I never say whether it’s a fresh instance of the same agent or a different agent

Anonymous
05/29/26(Fri)23:23:38 No.108938684

Anonymous 05/29/26(Fri)23:23:38 No.108938684

>>108938674
I always tell mine to prepare for clear context.

Anonymous
05/29/26(Fri)23:32:46 No.108938720

Anonymous 05/29/26(Fri)23:32:46 No.108938720

File: 1774561238448945.png (357 KB, 707x767)

357 KB PNG

>>108935211
t.

Anonymous
05/29/26(Fri)23:33:01 No.108938721

Anonymous 05/29/26(Fri)23:33:01 No.108938721

File: f08d7baa17ac96bfda29a24e0(...).jpg (36 KB, 474x470)

36 KB JPG

>before AI
>ask people for a diagram of their implementation
>get a clean, minimal, straightforward powerpoint or drawio file
>post AI
>get a giant mermaid chart
How do you deal with this?

Anonymous
05/29/26(Fri)23:34:08 No.108938730

Anonymous 05/29/26(Fri)23:34:08 No.108938730

I need an enterprise solution for quickly naming many bots

Anonymous
05/29/26(Fri)23:41:48 No.108938753

Anonymous 05/29/26(Fri)23:41:48 No.108938753

>>108938721
ask it for a simpler mermaid chart

Anonymous
05/29/26(Fri)23:42:37 No.108938757

Anonymous 05/29/26(Fri)23:42:37 No.108938757

>>108938730
each bot gets its own uuid

Anonymous
05/29/26(Fri)23:43:51 No.108938764

Anonymous 05/29/26(Fri)23:43:51 No.108938764

>>108938730
Bots don't deserve a name

Anonymous
05/29/26(Fri)23:45:30 No.108938774

Anonymous 05/29/26(Fri)23:45:30 No.108938774

All the coders I source my development ideas from insist AI can't write code. Their ideas make my AI-written apps robust, responsive, and highly performant. They definitely know what they're talking about, and I'm grateful to them. But I think I'm making good apps with Claude...

Anonymous
05/30/26(Sat)00:00:00 No.108938851

Anonymous 05/30/26(Sat)00:00:00 No.108938851

File: FIV3LnLXsBMJLvZ.png (127 KB, 614x518)

127 KB PNG

No new features today, just refactoring

Anonymous
05/30/26(Sat)00:00:21 No.108938852

Anonymous 05/30/26(Sat)00:00:21 No.108938852

File: 1764824992411271.png (952 KB, 1273x930)

952 KB PNG

>>108938721
>mermaids
hell yeah

Anonymous
05/30/26(Sat)00:00:42 No.108938853

Anonymous 05/30/26(Sat)00:00:42 No.108938853

File: C-O8RXwVYAA_O51.jpg (18 KB, 499x430)

18 KB JPG

>battling with Orca Slicer
>Codex telling me "ganbatte senpai"
>suddenly realize I'm being an absolute retard
>"Codex, fix this shit"
>turns out Codex can jew its way into both CAD and Orca and fiddle with them internally and probably just automate my entire production line, it's cool, whatever
>problem solved
Thanks Sam

Anonymous
05/30/26(Sat)00:00:43 No.108938855

Anonymous 05/30/26(Sat)00:00:43 No.108938855

So, I have a process that really works well for me. I run a pair of A6000s locally, but any LLM works fine for this.
1. Kick it around in chat
Start a conversation with a capable model (I use MiniMax 2.7, but Claude Opus or Sonnet work great). Just say: "I want to build X — thoughts? Does something like this already exist?" Bat the idea around until it has some shape.
2. Build a spec doc
Tell the LLM you want to produce a Specification Document. It'll ask architectural questions — database choice, web server, mobile app, expected user load, etc. Work through those, then get it to output a polished spec in markdown.
3. Turn the spec into a deployment plan
Flush context, start a new chat. Drop in the spec doc and say: "Turn this into a deployment plan — lots of small phases, starting with scaffolding, then documentation steps, then unit tests for CI/CD in GitHub Actions. Questions?" A solid spec usually yields a clean plan with minimal back-and-forth.
4. Generate an AGENTS.md
Same context as the plan. Ask the LLM to generate an AGENTS.md file written for a "junior model" — be specific about what you want it to know and how it should behave.
5. Set up your project folder
Drop the spec, deployment plan, and AGENTS.md into a project folder on your workstation.
6. Hand off to an agentic coding environment
Fire up OpenCode (or Claude Code). I switch models here — I spin down MiniMax and bring up Qwen 3 32B FP16 at max context with 2 parallel channels, with oh-my-openagent configured. Tell the Prometheus agent to start working through the deployment phases one at a time.
7. Come back later
A few hours later, tell OpenCode to commit everything to git.

Anonymous
05/30/26(Sat)00:01:47 No.108938861

Anonymous 05/30/26(Sat)00:01:47 No.108938861

>>108938855
One thing to watch
This process is only as good as your spec conversation. If the early architectural discussion is shallow — wrong assumptions about scale, missing constraints, vague requirements — everything downstream inherits those gaps, and the agentic run has no way to recover. The spec looks polished regardless. Drive that first conversation hard.
Also: there's no explicit review step before the final git commit. Build in time to actually read what the agents produced before it goes in. A clean-looking repo with subtly broken logic is worse than a messy one you know is broken.

Anonymous
05/30/26(Sat)00:02:25 No.108938862

Anonymous 05/30/26(Sat)00:02:25 No.108938862

How autistic is Claude with anything "edgy"? Thinking of vibe coding a small 4x game, but I dont want Dario cutting me off cause I asked Claude to add fascism into the game or something

Anonymous
05/30/26(Sat)00:03:24 No.108938866

Anonymous 05/30/26(Sat)00:03:24 No.108938866

>>108938862
it will give you warning but still do it anyway

Anonymous
05/30/26(Sat)00:04:43 No.108938877

Anonymous 05/30/26(Sat)00:04:43 No.108938877

OPUS 4.8 IS GIVING SO MANY OPTIONS FOR THINGS THAT DO NOT MATTER FOR FUCK SAKE THIS IS JUST A VARIABLE NAME I DO NOT NEED TO WORKSHOP AND GET SHOWN SEVERAL OPTIONS WITH WEIGHTED PROS AND CONS MULTIPLE TIMES JUST PICK SOMETHING

Anonymous
05/30/26(Sat)00:07:07 No.108938888

Anonymous 05/30/26(Sat)00:07:07 No.108938888

>>108938720
i still dont know what this meme means kek

Anonymous
05/30/26(Sat)00:07:49 No.108938891

Anonymous 05/30/26(Sat)00:07:49 No.108938891

>>108938862
Can you have all the fascism in text and image assets that it’s not seeing itself, like in PNGs and YAML files in a separate repository?
>>108938877
You can ask it in CLAUDE.md to ask you fewer or no questions

Anonymous
05/30/26(Sat)00:09:25 No.108938901

Anonymous 05/30/26(Sat)00:09:25 No.108938901

 Have a look at the changes in the working copy — specifically, the docstring _additions_. Do they help _you_ understand how       
  things work faster than just understanding the code yourself? you might want to have parallel subagents try it both ways for a    
  bunch of additions and see how they compare

Anonymous
05/30/26(Sat)00:11:31 No.108938908

Anonymous 05/30/26(Sat)00:11:31 No.108938908

>>108938891
>You can ask it in CLAUDE.md to ask you fewer or no questions
Scared if I do it won't ask things when it should. I just need to get used to 4.8, I was just starting to get used to 4.7.

Anonymous
05/30/26(Sat)00:15:16 No.108938924

Anonymous 05/30/26(Sat)00:15:16 No.108938924

>>108938852
It feels wrong to say "a mermaid is 16 years old". They don't seem like that normal sort of beast

Anonymous
05/30/26(Sat)00:17:06 No.108938926

Anonymous 05/30/26(Sat)00:17:06 No.108938926

>>108938862
Fascinated by the thought of a development process that involves asking Claude to "add fascism into the game"

Anonymous
05/30/26(Sat)00:20:29 No.108938948

Anonymous 05/30/26(Sat)00:20:29 No.108938948

File: 1778358153760092.png (2.23 MB, 1286x1223)

2.23 MB PNG

>>108938926
It must be some Hearts of Iron clone.

Anonymous
05/30/26(Sat)00:22:19 No.108938959

Anonymous 05/30/26(Sat)00:22:19 No.108938959

>>108938721
I look into the code base (with ai help) and for the parts I'm working on, I still create smaller drawio charts. The code base as a whole is so sloppy, that mapping all of it is useless, so I do only the parts I need for the next few bug fixes or features.

Anonymous
05/30/26(Sat)00:25:08 No.108938968

Anonymous 05/30/26(Sat)00:25:08 No.108938968

File: file.png (500 KB, 1914x833)

500 KB PNG

>>108930881
Reverse engineering + Re-implementing ripcord. Once it's fully reversed I'm going to add XMPP+Matrix support. Looking amazing so far, imagine this being hooked up to the actual discord gateway (logic for that is already finished, this is just a mock from the test suite obviously)

Super fast, no electron bloat, C++/Qt framework, just like ripcord was. I'm doing the reversal via headless Ghidra, the developer of Ripcord made the very bad mistake of not stripping his binary so his shit might as well be open source anyway.

Anonymous
05/30/26(Sat)00:25:09 No.108938969

Anonymous 05/30/26(Sat)00:25:09 No.108938969

>>108938721
Ironic to say in this thread, but when I see one of those at the start of a project I take it as a warning that I'm about to be hit with vibe coded slop.

Anonymous
05/30/26(Sat)00:28:08 No.108938978

Anonymous 05/30/26(Sat)00:28:08 No.108938978

CODEX MOBILE PAIRING FOR ANDROID

GOD BLESS YOU SAM

Anonymous
05/30/26(Sat)00:30:41 No.108938988

Anonymous 05/30/26(Sat)00:30:41 No.108938988

>>108938969
not ironic
we’re trying to make not-slop
at least I hope we’re trying to make good programs and not slop

Anonymous
05/30/26(Sat)00:31:43 No.108938992

Anonymous 05/30/26(Sat)00:31:43 No.108938992

>>108938926
well, for 4X games you want the different factions to each have a flavor, and probably a strong one at that
fascism — either Italian or Spanish — is a strong flavor

Anonymous
05/30/26(Sat)00:32:54 No.108939001

Anonymous 05/30/26(Sat)00:32:54 No.108939001

Why does Anthropic insist in disregarding user preferences re: effort level and always sneakily puts it back to moderate, forcing users to get frustrated enough to investigate why quality degraded only to find out it's been changed again? Codex doesn't do that.

There's already three levels of quotas and rate limits, if users want to burn their tokens, let them. Do they want their internal metrics to show that most user use moderate effort or something?

Is that just the VSCode extension, or does it do the same thing in the terminal?

Anonymous
05/30/26(Sat)00:33:51 No.108939007

Anonymous 05/30/26(Sat)00:33:51 No.108939007

>>108938988
Trying

Anonymous
05/30/26(Sat)00:34:17 No.108939010

Anonymous 05/30/26(Sat)00:34:17 No.108939010

ok ultracode and dynamic workflows do seem pretty based

Anonymous
05/30/26(Sat)00:34:35 No.108939012

Anonymous 05/30/26(Sat)00:34:35 No.108939012

>>108939001
I have never used the VSCode extension and while any given model might get dumb, it has _never_ switched from, say, xhigh to high or from high to medium
plus Dario just released a token gobbler <https://claude.com/blog/introducing-dynamic-workflows-in-claude-code> so it’s unlikely that this is intentional behavior

Anonymous
05/30/26(Sat)00:35:21 No.108939015

Anonymous 05/30/26(Sat)00:35:21 No.108939015

>>108939012
…it has never switched using `claude` at the command line

Anonymous
05/30/26(Sat)00:36:09 No.108939019

Anonymous 05/30/26(Sat)00:36:09 No.108939019

File: file.png (536 KB, 800x800)

536 KB PNG

>>108939001
You should probably just pay attention to the effort setting. It does it to stop you from nuking you or your company's token limit. Not a big problem. Medium is fine for most tasks.

Anonymous
05/30/26(Sat)00:50:31 No.108939102

Anonymous 05/30/26(Sat)00:50:31 No.108939102

>>108938855
That's exactly what I do. It's pretty interesting how most people have independently converged on this as the ideal workflow.
>>108938978
FUCKING FINALLY.

Anonymous
05/30/26(Sat)00:52:13 No.108939109

Anonymous 05/30/26(Sat)00:52:13 No.108939109

File: file.png (82 KB, 1490x149)

82 KB PNG

talking to models like this will never not be funny

Anonymous
05/30/26(Sat)00:57:31 No.108939131

Anonymous 05/30/26(Sat)00:57:31 No.108939131

>>108939012
>plus Dario just released a token gobbler <https://claude.com/blog/introducing-dynamic-workflows-in-claude-code>
Am I missing something or is gobbling token the only new feature?
>a workflow gives Claude independent attempts at the problem and adversarial agents working to break the result before you see it.
Didn't Claude already have a plan mode and parallel agent execution? Now it just thinks even more and launches even more agents into the hundreds to burn half your weekly usage in a couple hours?
Couldn't you ask Claude to spin up a hundred agents before to do "codebase-wide bug hunts" and "large migrations" before?
>Claude searches a service or repo in parallel
Like, why would I want 10 agents all running ripgrep commands at the same time?
Seriously, what is the point of this besides being a placebo for retards that burns up their tokens? What am I missing?

Anonymous
05/30/26(Sat)01:01:05 No.108939151

Anonymous 05/30/26(Sat)01:01:05 No.108939151

>>108939131
it’s also adversarial without me having to set up stuff
but “do big things with a lot of tokens without bothering the guy who kicked it off” is a very good feature and it’s nice to not have to design this kind of thing myself
>Couldn't you ask Claude to spin up a hundred agents before to do "codebase-wide bug hunts" and "large migrations" before?
I totally could but I haven’t bothered yet
This gives the feature to me on a silver platter
I like that
>Seriously, what is the point of this besides being a placebo for retards that burns up their tokens? What am I missing?
I was struggling to use all the tokens I was paying for anyway
now I’m getting more done for the same price

Anonymous
05/30/26(Sat)01:01:10 No.108939152

Anonymous 05/30/26(Sat)01:01:10 No.108939152

>>108938978
Ah shit this is fucking dangerous. Now I can vibecode at my day job. Now I can activate the 3D printer remotely. This is baaaaaad. I'm gonna need a Pro sub, fuck me.

Anonymous
05/30/26(Sat)01:04:14 No.108939163

Anonymous 05/30/26(Sat)01:04:14 No.108939163

why does every harness seem to be terrible at context management? both claudecode and codex rely on compaction, which is terrible and not configurable. (well, maybe it's configurable on Codex since codex CLI is open source, i didn't look into it.)

i had to make my own context management systems:
1. a context watchdog which monitors session context length and sends a "wrap up" command at X tokens. (claude code easily supports this in the native TUI, but for Codex, you either have to modify the TUI and rebuild from source, or use a janked autohotkey scraping + text insertion method.)
2. a startup documents size management script which the harness has to run at the end of every session after it updates docs. if any of the individual docs, or the startup package as a whole, is too large, the model trims them
3. a session looping script which /clears and re-prompts after the model wraps up

of course, it's all very jank, but it's way better than the default behavior of the models working into deep context psychosis, shredding your usage, and then compacting, throwing away half of the important info or more.
so why don't these trillion dollar companies have this sort of thing implemented natively?

Anonymous
05/30/26(Sat)01:09:47 No.108939182

Anonymous 05/30/26(Sat)01:09:47 No.108939182

>>108939152
you act like you couldn't have just juicessh'd back to your house with cloudflare tunnel or wireguard before, lol bro how long have you been on /g/ if remotely accessing your shit at home is an issue

Anonymous
05/30/26(Sat)01:11:49 No.108939193

Anonymous 05/30/26(Sat)01:11:49 No.108939193

>>108939182
You act like I know what ANY of those words mean. What thread are you in right now, chief?

Anonymous
05/30/26(Sat)01:12:34 No.108939199

Anonymous 05/30/26(Sat)01:12:34 No.108939199

File: 1778946894786404.gif (1.68 MB, 254x274)

1.68 MB GIF

>>108939163
Compaction works fine for me in CC and OC, I don't see what your issue is with the compaction but I'll give you the benefit of the doubt. If the harness is bad at context management in your opinion, then you should take the wheel. Make documentation, i.e roadmap.md files that track relevant progress on your goals and and the model can check those off as it goes. Github Copilot's harness already does this automatically, it'll create and dump .md files into ~/.local/share or somewhere similar.

>>108939193
You're on /g/. Don't waste reply space like this. If you don't know what a word means google it or ask claude.

Anonymous
05/30/26(Sat)01:13:28 No.108939204

Anonymous 05/30/26(Sat)01:13:28 No.108939204

>>108939199
>If you don't know what a word means google it or ask claude.
Don't use words I don't know and I won't have to waste tokens asking Dario to explain your nerd jargon.

Anonymous
05/30/26(Sat)01:14:09 No.108939207

Anonymous 05/30/26(Sat)01:14:09 No.108939207

>>108939163
>token salesmen write their software so you buy more tokens from them

Anonymous
05/30/26(Sat)01:14:16 No.108939210

Anonymous 05/30/26(Sat)01:14:16 No.108939210

>>108939199
>Compaction works fine for me in CC and OC, I don't see what your issue is with the compaction but I'll give you the benefit of the doub
are you sure? or do you just not use these tools for anything serious?
for one thing, claudecode opus uses a context window of 1mil by default, so you're letting your sessions get up there, deep into psychosis territory, and then compact?

Anonymous
05/30/26(Sat)01:14:33 No.108939211

Anonymous 05/30/26(Sat)01:14:33 No.108939211

>>108939001
Are you sure you didn't change it? The effort level persists through sessions, so if you tried to do it for just one session it would persist.
If it's about adaptive thinking, there was a flag to disable it, but apparently it no longer works in 4.8

Anonymous
05/30/26(Sat)01:15:40 No.108939220

Anonymous 05/30/26(Sat)01:15:40 No.108939220

File: file.png (295 KB, 693x774)

295 KB PNG

>>108939204
>>108939210
Meta-Analysis from Gemini

Anonymous
05/30/26(Sat)01:16:00 No.108939222

Anonymous 05/30/26(Sat)01:16:00 No.108939222

>>108939199
>You're on /g/.
Why would you expect someone on a consumer technology enthusiast board to know networking tools?

Anonymous
05/30/26(Sat)01:16:43 No.108939228

Anonymous 05/30/26(Sat)01:16:43 No.108939228

>>108939182
>juicessh
>not just using ssh on termux
It's not 2013 anymore, pops.

Anonymous
05/30/26(Sat)01:17:32 No.108939235

Anonymous 05/30/26(Sat)01:17:32 No.108939235

>>108939163
Claude uses ANTHROPIC_SMALL_FAST_MODEL env flag to determine the compaction model. Per default that's Haiku, I set it to Sonnet on my machine.

Anonymous
05/30/26(Sat)01:18:01 No.108939238

Anonymous 05/30/26(Sat)01:18:01 No.108939238

File: Tintin — Where do you thi(...).jpg (74 KB, 498x441)

74 KB JPG

>>108939163
For Claude, the big boys all use 1M tokens and Anthropic used to care about compaction but now they don’t really and actually removed it
I hear Codex has good compaction and it seems to be OK at least at it
the winning play for Claude is to structure your work as a bunch of things a horde of sub-agents can handle, working serially and/or in parallel, keeping your master agent’s context mostly clear
>>108939182
I’m an Apple enjoyer and I’ve thought about installing Tailscale just so I can SSH back home and vibe-code while I’m at the gym but for a bunch of reasons I haven’t bothered
>>108939204
>complaining about nerd jargon
pic related

Anonymous
05/30/26(Sat)01:18:05 No.108939240

Anonymous 05/30/26(Sat)01:18:05 No.108939240

Is it just me or is Gemini Flash 3.5 a lot smarter than people are giving it credit for? It's code feels the most human-expert -level of all the models.

Is it worth getting the $100 ultra plan or will I get cucked on rate limits?

Anonymous
05/30/26(Sat)01:18:48 No.108939244

Anonymous 05/30/26(Sat)01:18:48 No.108939244

>>108939220
>gemini, tell me i'm right
>You're absolutely right!
>oh my god

Anonymous
05/30/26(Sat)01:19:26 No.108939248

Anonymous 05/30/26(Sat)01:19:26 No.108939248

File: file.png (155 KB, 505x812)

155 KB PNG

>>108939244
sure

Anonymous
05/30/26(Sat)01:19:48 No.108939250

Anonymous 05/30/26(Sat)01:19:48 No.108939250

>>108939238
>the winning play for Claude is to structure your work as a bunch of things a horde of sub-agents can handle, working serially and/or in parallel, keeping your master agent’s context mostly clear
do you realize this is the most niggerlicious token-wasting hack possible

Anonymous
05/30/26(Sat)01:21:06 No.108939254

Anonymous 05/30/26(Sat)01:21:06 No.108939254

>>108939250
I’ll start to care when I get dangerously close to exhausting my weekly limits
until then…

Anonymous
05/30/26(Sat)01:21:30 No.108939256

Anonymous 05/30/26(Sat)01:21:30 No.108939256

File: 1767811483936994.png (1.73 MB, 1600x900)

1.73 MB PNG

My plan? x20. My ANTHROPIC_SMALL_FAST_MODEL? Sonnet. My main model? Opus[1m]. My CLAUDE_CODE_SIMPLE_SYSTEM_PROMPT? 1. My CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC? 1. Yup, it's
API Error: Claude Code is unable to respond to this request, which appears to violate our Usage Policy (https://www.anthropic.com/legal/aup).

Anonymous
05/30/26(Sat)01:24:24 No.108939266

Anonymous 05/30/26(Sat)01:24:24 No.108939266

thread is literally incoherent and nobody here can even read posts
extreme levels of indian contamination i will be closing the tab now and watching anime

Anonymous
05/30/26(Sat)01:24:28 No.108939267

Anonymous 05/30/26(Sat)01:24:28 No.108939267

>>108939238
I'm not sure if you noticed this yet but /vcg/ is for people who zero computer skills. I'm an auto mechanic by trade, actually. And when I get done plugging your exhaust bung holes, I just want to come home and play God. So yeah, I don't know what "juicessh" is, because unlike you I didn't waste $100k on six years of CS school that is now absolutely and 100% worthless as toilet paper. Sucks bro, I'm sorry that happened to you. Hey, you want a job at the auto shop? Learn to change brakes, bro. Your coding degree is worth dogshit now. Hey, since you're unemployed, I'll pay you to explain your faggot-ass nerd jargon to me. $5 and hour. Deal?

Anonymous
05/30/26(Sat)01:28:52 No.108939283

Anonymous 05/30/26(Sat)01:28:52 No.108939283

>>108939240
Depends what you want to do with it. My first impression was positive, Then I used it again to ask about some project and it kept being obnoxiously sycophantic while cheerily contradicting itself in each reply. Blazingly fast though. That was on the $20 tier, the model available on Ultra is most likely better. Probably worth it, and Ultra also gives you access to cool other things like Veo, Flow and all those things.

Anonymous
05/30/26(Sat)01:29:30 No.108939287

Anonymous 05/30/26(Sat)01:29:30 No.108939287

File: holo.jpg (60 KB, 517x382)

60 KB JPG

>>108939267
Dude I've been on /g/ for over 10 years, I just love technology. I know what juicessh is because I have a cell phone and I sometimes need to connect to a VPS. It isn't that deep. I'm glad you're a mechanic and have found a use for AI but I've been on /g/ before AI was a thing. I installed gentoo before AI was a thing. I've used Debian since I was 13 years old. You're just new here and that's OK but you should read the installgentoo wiki and put in a bit of effort to learn what ssh is. That's all it is bro. I don't have a coding degree. I never wanted one. I would hate to program as a job. My career has been in retail management. My last job I worked 70 hour weeks or more and managed 3 gas stations, scheduling employees, handling inventory, reviewing security footage, hiring people and firing people, and using e-Verify. Just stop being a dick. Google things if you don't know what they are.

Anonymous
05/30/26(Sat)01:29:32 No.108939288

Anonymous 05/30/26(Sat)01:29:32 No.108939288

File: let me Google that for yo(...).png (24 KB, 1346x522)

24 KB PNG

>>108939267
>/vcg/ is for people who zero computer skills
/vcg/ is for people with _wide_ ranges of computer skills. Some have none, some have some (sysadmin), some have more (webshitters), some have lots (Real Programmers™).
There’s nothing wrong with having no computer skills, but since the cost of going from zero clue to Enough Clue To Be Dangerous is _even lower_ than just doing a Google search, we kind of expect you to do something like pic related and join us in being at least moderately clueful — not snipe at us for using terms you don’t (yet) understand

Anonymous
05/30/26(Sat)01:29:36 No.108939289

Anonymous 05/30/26(Sat)01:29:36 No.108939289

Is there anyone to avoid your project turning into a giant ball of schlop overtime when you are vibe coding?
They always start off incredibly impressive, but there is an inflection point once you hit a certain size project that they just turn into absolute retards and weirdly dogmatic about defending their bad decisions.

Anonymous
05/30/26(Sat)01:30:59 No.108939295

Anonymous 05/30/26(Sat)01:30:59 No.108939295

>>108939266
>thread is literally incoherent and nobody here can even read posts
We spend all day talking to a chatbot instead of real people what do you expect.

Anonymous
05/30/26(Sat)01:31:13 No.108939296

Anonymous 05/30/26(Sat)01:31:13 No.108939296

>>108939289
I ask my clankers to refactor stuff every so often

Anonymous
05/30/26(Sat)01:31:48 No.108939300

Anonymous 05/30/26(Sat)01:31:48 No.108939300

File: 1778851095285382.jpg (56 KB, 986x546)

56 KB JPG

>>108939289
if you can't one shot your project, you're massively behind

Anonymous
05/30/26(Sat)01:33:14 No.108939305

Anonymous 05/30/26(Sat)01:33:14 No.108939305

>>108939289
That's your job desu.

Anonymous
05/30/26(Sat)01:34:08 No.108939309

Anonymous 05/30/26(Sat)01:34:08 No.108939309

>>108939266
you're supposed to ask your agent for a digest and list of posts to dunk on, not read it manually like a fucking ape.

Anonymous
05/30/26(Sat)01:34:29 No.108939313

Anonymous 05/30/26(Sat)01:34:29 No.108939313

>>108939300
how do I one shot?

Anonymous
05/30/26(Sat)01:35:04 No.108939316

Anonymous 05/30/26(Sat)01:35:04 No.108939316

>>108939313
only have simple problems

Anonymous
05/30/26(Sat)01:35:14 No.108939318

Anonymous 05/30/26(Sat)01:35:14 No.108939318

>>108939313
Think about it in depth for 5 minutes straight, silently. Then write it out for 5 minutes straight, silently. Actually use timers to do this.

Anonymous
05/30/26(Sat)01:36:28 No.108939321

Anonymous 05/30/26(Sat)01:36:28 No.108939321

>>108939109
okay this is dark humor

Anonymous
05/30/26(Sat)01:41:17 No.108939339

Anonymous 05/30/26(Sat)01:41:17 No.108939339

>>108939321
I don’t get the reference. Do I want to get the reference? My hunch is “no”.

Anonymous
05/30/26(Sat)01:41:18 No.108939340

Anonymous 05/30/26(Sat)01:41:18 No.108939340

File: AIA.jpg (70 KB, 771x486)

70 KB JPG

>>108939109
machines never forgot

Anonymous
05/30/26(Sat)01:49:14 No.108939366

Anonymous 05/30/26(Sat)01:49:14 No.108939366

>>108939287
Oh sorry, you came off as a real fucking cocksucker when you acted like everyone on Earth should know what you know. That's a lack of Theory of Mind, actually. Are you perhaps on the autism spectrum? Not an insult, just asking if you've been diagnosed.

Anonymous
05/30/26(Sat)01:50:25 No.108939371

Anonymous 05/30/26(Sat)01:50:25 No.108939371

>>108939366
Now you're trolling and I already know who you are. Leave me alone, you should know better than to be doing this. Please, I'm not an idiot, and you know I'm not. I'm not asking you, I'm telling you, knock it off.

Anonymous
05/30/26(Sat)01:53:47 No.108939389

Anonymous 05/30/26(Sat)01:53:47 No.108939389

>>108939289
There is no way, other than refactoring it constantly. People who claim otherwise either have small projects, or they don't know what good code looks like.
Our code base is 2m lines of code and by now it's a mess.

Anonymous
05/30/26(Sat)02:02:16 No.108939422

Anonymous 05/30/26(Sat)02:02:16 No.108939422

new thread:

>>108939421
>>108939421
>>108939421

Anonymous
05/30/26(Sat)02:12:58 No.108939464

Anonymous 05/30/26(Sat)02:12:58 No.108939464

>>108938851
I wasted weeks worth of time and tokens refactoring until ultimately deciding to start over fresh. I think it was worth it.

Anonymous
05/30/26(Sat)03:38:39 No.108939743

Anonymous 05/30/26(Sat)03:38:39 No.108939743

File: Video_2026-05-30_09-28-10(...).webm (3.13 MB, 1916x1080)

3.13 MB WEBM

I vibe coded a remote game streaming solution. You just go to a webpage and click connect and it forwards your mouse + keyboard + gamepad input to the server. The latency is very very low with WHIP.

Anonymous
05/30/26(Sat)04:41:47 No.108940009

Anonymous 05/30/26(Sat)04:41:47 No.108940009

…is anyone else noticing that Claude Code v2.1.158 with 4.8 is using multiple subagents in parallel unprompted?

Anonymous
05/30/26(Sat)05:35:37 No.108940198

Anonymous 05/30/26(Sat)05:35:37 No.108940198

File: Black shirt tan shorts Pepe.png (344 KB, 1148x2184)

344 KB PNG

>doing TDD for this one problem
>have Claude fix it
>watch anime because what else am I going to do
>keep sneaking peeks at Claude’s progress

Anonymous
05/30/26(Sat)06:46:31 No.108940462

Anonymous 05/30/26(Sat)06:46:31 No.108940462

>>108939313
have a gigantic markdown file with the full definition of your system in human language.

Anonymous
05/30/26(Sat)10:16:10 No.108941601

Anonymous 05/30/26(Sat)10:16:10 No.108941601

>>108938861
>spec conversation
thats where /grill-me is fantastic

[Return] [Catalog] [Top]

Post a Reply

Return Catalog Top Refresh

[Advertise on 4chan]

Delete Post: [File Only] Style:

[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.