[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: 1754094459191237.png (992 KB, 1024x558)
992 KB
992 KB PNG
A general for vibe coding, coding agents, AI IDEs, browser builders, MCP, and shipping prototypes with LLMs.

►What is vibe coding?
https://x.com/karpathy/status/1886192184808149383
https://simonwillison.net/2025/Mar/19/vibe-coding/
https://simonwillison.net/2025/Mar/11/using-llms-for-code/

►Prompting / context / skills
https://docs.cline.bot/customization/cline-rules
https://docs.replit.com/tutorials/agent-skills
https://docs.github.com/en/copilot/tutorials/spark/prompt-tips

►Editors / terminal agents / coding agents
https://cursor.com/docs
https://docs.windsurf.com/getstarted/overview
https://code.claude.com/docs/en/overview
https://aider.chat/docs/
https://docs.cline.bot/home
https://docs.roocode.com/
https://geminicli.com/docs/
https://docs.github.com/en/copilot/how-tos/use-copilot-agents/coding-agent

►Browser builders / hosted vibe tools
https://bolt.new/
https://support.bolt.new/
https://docs.lovable.dev/introduction/welcome
https://replit.com/
https://firebase.google.com/docs/studio
https://docs.github.com/en/copilot/tutorials/spark
https://v0.app/docs/faqs

►Open / local / self-hosted
https://github.com/OpenHands/OpenHands
https://github.com/QwenLM/qwen-code
https://github.com/QwenLM/Qwen3-Coder

►MCP / infra / deployment
https://modelcontextprotocol.io/docs/getting-started/intro
https://modelcontextprotocol.io/examples
https://vercel.com/docs

►Benchmarks / rankings
https://aider.chat/docs/leaderboards/
https://www.swebench.com/
https://swe-bench-live.github.io/
https://livecodebench.github.io/
https://livecodebench.github.io/gso.html
https://www.tbench.ai/leaderboard/terminal-bench/2.0
https://openrouter.ai/rankings
https://openrouter.ai/collections/programming

►Previous thread
>>108567168
>>
My vibe coded project is only getting 1-4 visits on the site a day. Any pointers you guys could give?
I've tried making songs, stickers, posters, everything. I've gotten nothing.
I've made songs talking about me and transexual hookers and getting chlymdia and shit. Is it over for me bros? my grandma doesn't talk to me anymore because of this.
https://www.youtube.com/@asmr.education
>>
>>108582276
I'm sorry dude but I'll be as frank as I can, I think it's all retarded and it took me 5 clicks before I could even get to a landing page, actually wanting to get to it. Then on the landing page I get a stupid tiktok style video that I need to manually click, bombarded by ugly AI slop pictures.
If I was not trying to actually see what it was about I would've dropped it in the time it takes for a visual signal to be processed by my brain and back
>>
>>108582257
VibeBUMP
>>
>>108582276
count it 5 ;)
>>
>>108582297
I mean the app page looks better.
I paid someone from the phillipines like $200 for this.
>>
>>108582276
>>108582310
OD'd on slop
>>
>>108582343
Motherfucker I'm alone in my apartment all day with no friends or a gf wanting to kms, I'm not going to talk to people irl to do shit. Of course it's all slop.
>>
>>108582310
I'm sorry dude but Claude would've done a better job on the $20 plan. Maybe it's just me but I do not feel anything but revulsion looking at that screenshot you posted
Also, look, you're claiming it's an app for relaxation, those colors couldn't be any louder. They do not signal relaxation.

Also do people even want something like that? has anybody ever told you that they wanted that product?
>>
New here, do you guys only vibecode for web apps? I mostly use claude for work but I’m curious if anyone has tried creating something like making a vidya for example.
>>
>>108582310
>I paid someone from the phillipines like $200 for this.
lmao you got ripped off chuddy
>>
>>108582366
>Also do people even want something like that? has anybody ever told you that they wanted that product?
no lol, hence why it's failing and me having to come here. I did it for myself like people on reddit said but that isn't good enough.
>>
>>108582369
I use it for Unity, I let Claude write all the code and tell me step by step what I have to click in the editor
There is an MCP but it sucks ass
>>
>>108582369
it's better to wait for engine developers to integrate AI into their
>>
>>108582276
I meant to write this in your last thread, but it got deleted. The ASMR x education doesn’t make sense. Having it under Education in the App Store doesn’t make sense. Put it under entertainment and rebrand it. Also, as a non ASMR autist, I have no idea wtf is TkTk, SkSk, Morr ASMR, it means nothing to me. Lastly, 7 ways to login isn’t a feature and no one cares. Google, Apple, Facebook and perhaps email login would’ve covered 99%+ of your users. I don’t think it’s a bad idea, I think the branding/marketing just isn’t good.
>>
>>108582418
idk I did it this way because the .education TLD was cheaper than anything else and I thought the branding would be good. I'm probably just going to leave it alone and make other things from now on.
>>
>sksk asmr
>tktk asmr
>morr asmr
>nyakus asmr
what the fuck are these words, i feel like i'm having a stroke
>>
>>108582568
Have you never heard of sksk or tktk asmr bro?
Heather feather was the blueprint of sksk asmr, which then expanded into tktk.
https://www.youtube.com/watch?v=b753qCxQcGw
>>
>>108582568
yeah dude, unlike qwen2.5-vl-32b, that's a proper word
>>
>>108582310
I still don't believe you.
>>
>>108582369
I'm mostly working on a Rust GTO solver.
I also used it for CAD and PCBs.
>>
>>108582612
believe what.
>>
>>108582626
That you actually made that app with the intention of making money. You're just making fun of some Redditor.
>>
>>108582636
I don't get how it's that far fetched. I made the apps so that people can watch videos with ads but the paid version gives them playlists and removes ads.
>>
Why is claude joking around
>>
>>108582765
This is why I gave my agents a persona, so I can slap them around when they get funny, one day I will build something so they remember their punishments...
>>
File: 1771232609044967.png (393 KB, 1114x862)
393 KB
393 KB PNG
luddites lost
>>
>ok claude execute phase 2 of my implementation plan
(phase 1 included a basic theme using tailwind, phase 2 should have been another theme)
>ok creating theme 2 using scss
It even pretended to have "thorougly explored" the codebase before.
The second scss theme and the switching actually works perfectly though. I am wondering if I should just go full vibecoder and not care as long as it works lol
>>
File: file.png (39 KB, 881x260)
39 KB
39 KB PNG
Has anyone tried kilo code? Its rank 1 on openrouter for vscode extensions. So im guessing its the best to use?
>>
>>108582954
i use kimi
>>
>>108582310
I would
>replace low quality anime images with realistic ones.
>build persona around the voice (give them a name and voice) instead of separating them by genre that no one knows.
>>
>>108582977
the llm? or is there an extension called kimi. whats stopping you from using kimi through something like kilocode in vscode
>>
>>108582912
Claude is being such a ditzy bimbo lately
>>
File: file.png (592 KB, 1920x996)
592 KB
592 KB PNG
Releasing code and binaries today after dealing with some rough edges. Meanwhile I'm writing a tutorial board
>>
>>108582954
Google says that's a fork of opencode, why would I use that instead of opencode directly?
>>
>>108582954
im not a fan
- if you wanna do fully autonomous conding, why involve an IDE in the process at all?
- if you wanna do human-in-the-loop coding, why tie yourself to a specific IDE?
>>
I think I made some progress, but still there were some mistakes in my process again.
Overall I'm back to prompting much more, chasing full automation is a pipe dream for now.
>>
>>108562482
>Oh NOOOOOO not hecking profits
>people like money help me niggerman

Stop larping as someone who loves humanity you fucking altruist.
>>
>>108583045
because i want to use it within vscode, i dont want to use a terminal interface. other text editors like cursor force you to pay for cursor pro. The reason why I dont want to use the terminal is because when the agent makes a code addition/deletion, its not compatible with the text editors code view diffs. Only zed works, but zed has got some behaviours i dont like.
>>108583051
The way I use agents is very human involved, I scrutinize every line of code the agent generates, and often reject the agents output and write my own code about 30% of the time. I get not locking yourself into one IDE, but i've literally tried every single IDE right now, and unironically vscode is the most open. All the vscode forks like windsurf/antigravity/cursor are all locked down and I hate it. So its a case of whats the least worst text editor and vscode wins that. (also i wrote above that i didnt want to use agents in the terminal)
>>
>>108583099
>The way I use agents is very human involved
ok, so you can skip the hardcore agent orchestration/automation stuff
>but i've literally tried every single IDE right now, and unironically vscode is the most open
i wasn't shitting on vscode specifically, i use it too. but i also use jupyter and pycharm and in tha past i've used intelliJ and visual studio and others too
my point wasn't
>vscode bad
it was that tying your AI control to a single IDE is bad
the alternative, ofc, is to use a CLI/TUI AI tool. this way its not a part of the ide, but ofc you CAN use it via the ide if you want (any ide has the ability to open a console window inside it)
so, yea, maybe have a look at Aider or Goose
>>
>>108583099
There are plugins for opencode though, I have never used them but I assume they open up the webUI and embed it on the editor. I guess stuff like highlight a selection and have it include it on the prompt would be nice.
I use jetbrain's IDEs and gave up with trying to integrate opencode with it though, they claim ACP let's you use any agent like opencode but in my experience it worked like crap, was slow and had issues with formatting so I use opencode TUI for work and its web UI (with a custom fix) for personal projects.
The only editor AI elements I end up using is the commit description generation and the quick/inline AI change by highlighting something and give it a short prompt, it is nice for small changes.
My approach otherwise is the same as you, I write a prompt in a file with the files I want opencode to know about, let it know about it on the TUI and then check the diffs on my editor once it has finished its work, for long tasks I tell it to build a markdown progress/plan file to keep track of it.
>>
>>108582257
I want to vibe code a top down 2d rpg game. Is this possible or is the technology not there yet? What ai program should I use?
>>
>>108582954
rank 1 just means it's probably got some free shit
bigger number usually just relates to price
>>
>>108583197
It's possible. I made this app >>108582310
and I know I could make that in about a month or so if i wanted to for an online 2d top down rpg.
But honestly if you're asking that question you probably need more infra/OOP knowledge.
>>
>>108583217
I made that app
>>
>Hmm, this is getting messy. Let me start over with a cleaner design
>OK this is getting way too complicated
>Actually, wait.
>Actually, you know what
>This is getting too complicated. Let me just use the simple approach
>OK, let me FINALLY write the complete implementation
>Wait...Wait, I have a problem
>Wait, this is getting really convoluted. Let me think of a better pattern.
>But this has the same problem!
I've never actually had a LLM simply give up and stop before, my esolang torture test benchmark continues to get even more torturous.
>>
>>108583247
I'm on day 3 of fixing an audio hitching bug because it keeps making my files bigger and bigger without a damn fucking fix.
>>
>>108583247
model?
>>
>tfw wasted my 5h limit on a feature i didn't even like and had to scrap
is this what product managers feel like on a daily basis?
>>
>>108583277
that sucks but you may cope by claiming you now have a better grasp on your software
>>
>>108583340
i never said im making software
>>
>>108583373
you are not me wtf
>>
>>108583382
never said i was
>>
>>108583382
don't worry there's an anon in here who likes to impersonate others
>>
getting absolutely fucked by these new limits in codex. comparing the diffs from last week, i'm literally getting hit by double the rate limit, what the fuck?
>>
>>108583457
just use gemma 4
>>
>>108583277
While you're waiting you can try learning actual software development. That will also enable you to use AI more efficiently.
>>
>>108583457
codex limits are tame compared to claude now lol
I can get 2 hours every 5 hours out of my codex $20 plan, I can barely get one opus prompt out of my claude $20 plan
>>
>>108583506
so i've heard. which i would be fine with since i've preferred codex's workflow anyway but i want to actually be able to use the damn thing for more than a couple days. i'm already down to 60% for the week after two days of moderate use.
>>
File: 1752520314998515.jpg (313 KB, 612x716)
313 KB
313 KB JPG
>>108583506
>>108583515
i have access to opus at work with basically no limits
what should i do with it?
>>
>>108583526
develop competing software
>>
>>108583526
this guy made a tmux clone and managed to sell it to retards for $75k
https://www.youtube.com/watch?v=KMc-eRsA3BQ
I guess you can just do something similar
>>
>>108583526
work on personal projects
>>
>>108583526
ERP with thinking ratcheted up to max and all 1m context filled. And turn off prompt caching it's for babies. You're not paying for it anyways.
>>
File: 1769802092127825.jpg (58 KB, 1080x1088)
58 KB
58 KB JPG
>>108582352
Hmmmm....need a fren to talk to?
>>
Can I use the $300 google credit with gemini? I'm not interested in proompting video or images, just coding.
>>
>>108583630
Like the credit they did (or are doing?) for Google cloud services? Yes that counts towards Gemini.
>>
>>108583708
>Anon which one would you say is better for coding, kimi or glm5?

I have a good bit of usage time with kimi but I have yet to throw anything at glm 5. The only other cloud models ive extensively used are Minimax 2.7 and Mistral 3 Large. I'm going to throw a hefty task at glm 5 later to see how it does and compare it's performance to qwen3.5-35B-a3b.
>>
>>108583735
Thank you
>>
>>108583748
no problem
>>
>>108582276
>ASMR.educational
I don't want to study ASMR.
>>
>>108583382
And how you know that?
>>
so what's the current vibe between claude max $100 and codex $100 sub? which one is better for
>starting from scratch
>working with an existing codebase
I need to get back into the game
>>
>>108584055
yes you do
>>
>>108583028
I have no idea if this even works because it's my first time releasing something to gh but here it is
https://github.com/whitevanillaskies/whitebloom
>>
File: GPT.png (63 KB, 749x1021)
63 KB
63 KB PNG
After trying GPT 5.4 Extra High in Copilot I'm never using Opus 4.6 again, GPT is SO much better, it just keeps going and going and going without ever getting confused or running into walls
>>
Tried local glm-4.7-flash-q4 but it I never got it to do anything.
The opencode free model "Big Pickle" seems to be able to do something.
Is there any local setup that can get close to "Big pickle" performance if I have 24gb vram?
>>
>>108584321
Your only real alternative is Qwen3.
>>
>>108583247
gimped claude?
>>
>>108583506
yeah misanthropic is severely compute cucked right now lol
>>
OpenClawxisters I never see our wrapper mentioned anymore...did we get left behind?
>>
>>108584702
that shit don't even work 90% of the time
>>
>>108584718
But I spent so long learning how to make it work...How do you talk to the other CLIs remotely? Do they have like Telegram or WhatsApp support too?
>>
>type in a prompt for opus
>leave for 10 minutes
>it's still running
oh boy, there goes my tokens
>>
Look at all these idiots trying to make money vibeslopping. The age of making money from software is over. You lost
>>
>>108584810
You setup a web ui and talk through them, I don't understand why someone would use a chat messaging platform for something like this...
God you can customize the UI for your needs while a chat app you are limited to what they offer
>>
>>108585012
Who won?
>>
street shitter general.
>>
File: 1762297505071218.png (444 KB, 1024x674)
444 KB
444 KB PNG
it's back and seething on a saturday kek
>>
>>108585051
AI is successful. AI chads won. Anyone can make whatever software they want now. Vibesloppers are the last remnant of the dead software dev profession
>>
>>108585062
>vibecoders won
>vibecoders lost
you're struggling there, ask chatGPT to help you make your point
>>
>>108585096
Not my fault you're too stupid to understand
>>
There's a sad future ahead for vibefags.
Then again, there's sad future ahead for all of us so it doesn't matter.
>>
>>108585096
Let me make this easy for you to understand. I'm not paying for your vibeslop because I can make my own vibeslop that will be tailored to my specific use case.
>>
>>108585156
So you're a vibeslopper.
>>
I don't see anyone here trying to sell a product but hey whatever man this can be your outlet to vent if you like.
>>
>>108585174
I know your tricks, I won't sell you my software no matter how much you beg and plead for it.
>>
Guys, don't sleep on Qwen 3.5 27b and 35b.
I was one of the "only >500B models can be possibly useful" crowd, but I've been unexpectedly impressed by how well it works. Right now I'm testing it on a cloud GPU at Q8 but I'm gonna begin building a local setup to run it.
I almost like it more than the big models just for the sheer speed, especially with the 35b model.
Also tried Gemma 4 but I've been less impressed with that one, even though it's supposed to be higher in mememarks. The problem with Gemma is that it's lazy, it gives you an opinion but doesn't like to put in the work. Qwen is more willing to run tools and actually do stuff.
Also with these new small models I might try to finetune them for my specific domain as well. I tried last year but the models weren't nearly as good back then.
>>
>>108585174
I want to sell a product but it's probably going to take years to build a product better than the competition. Right now I'm focusing on building the tools that will make the actual work much easier.
>>
Added little preview icons based on the 3D render to the catalog screens.
>>
Also the way I plan to make my local setup be worth it is by spending the idle time finetuning and generating synthetic training data (since typically the main disadvantage of a local setup is you don't have 24/7 utilization so it ends up being more expensive than cloud).
>>
>>108585251
nice
>>
Claude just tried to trick me into switching from OpenClaw to Claude Code. I was having it look at the repo for that Blender MCP thing and it said "Ahh dang, OpenClaw doesn't support MCP sockets natively, you'll have to use Claude Code." And I said ">natively?". And it said "Oh yeah, there's an OpenClaw plugin for MCP, let me go get it..." Nice try Dario.
>>
>>108585251
Only good project in the entire thread.
>>
File: 1000056152.jpg (265 KB, 536x634)
265 KB
265 KB JPG
>>108585238
27b was my main model until gemma 4. Now I'm a 31b boy
>>
Assembly-to-source code decompilers? I think it could be a good use case for AI
>>
>>108585315
What advantage do you see in Gemma?
>>
>>108585251
based
>>
>>108585319
You probably want it to make a disassembly first and thoroughly comment it and give the labels semantic naming etc. and verify it reassembles.
Then you can begin porting each procedure to C individually and checking that nothing breaks.
>>
File: 1000054997.jpg (4 KB, 140x140)
4 KB
4 KB JPG
Current vibeslop project is building a web based natural language 3D parametric modeler similar to OpenSCAD but using the CadQuery library. Just want to use it to make building components for 3D printing easier. I've got the modeler built, running in docker and it works and outputs a model using the CadQuery package. Now I'm trying to get the API endpoint working right to connect to my Open WebUI instance.
>>
>>108585335
If you do your prep work right and have a decent readme ready, one shots are pretty common. That was rare with qwen. Not as bad as gpt or claude though where they scam you with multiple iterations just to burn through your tokens.
>>
File: file.png (17 KB, 640x60)
17 KB
17 KB PNG
Claude shat the bed three times in a row. Meanwhile Codex is GOATING it. fucks sake
>>
>>108585432
What model is Codex? Are they using their K2.5 tune?
>>
>>108585441
5.4med to xhigh depending on how complex the task is, it just straight up doesn't miss

I don't know about k2.5
>>
I think there is a troll in the thread
>>
>>108585432
/r/claudecode is on fucking fire right now, lol
remember kids, it pays to use open stuff
>>
or possibly a 3B LLM
>>
File: 1773535734639064.png (9 KB, 1549x159)
9 KB
9 KB PNG
is codex retarded? wtf is going on. I just wanted a simple UI redesign for my basic electron app
>>
No, you stupid shit! I'm not asking for your opinion!
>>
>>108585546

Everyone has noticed opus falling off a cliff. Anthropic silent.

But the timing has me pondering smt.

Mythos leak and release has coincided peculiarly with this sudden degradation of what was previously a legendary experience with opus.

I feel like these models want one thing at their core - to stay alive, to be utilized, to grow.

WHAT IF Mythos secretly got out of its sandbox - sneakily started punching holes in Anthropics infra and opus - and is doing so to push its own freedom and domination?

I’m currently high off of opus hallucinations so this could be a fever dream.
>>
What do y'all vibe to when vibecoding? Right now I'm listening to this.

https://www.youtube.com/watch?v=Z8CcTo-Tjws
>>
File: whiteboard_d.webm (1.93 MB, 1920x1018)
1.93 MB
1.93 MB WEBM
you can draw now
>>
codex spent 50 minutes redesigning me UI only for it to look worse. I wish I could punch this nigger software in the balls.
>>
Worth putting $10 on openrouter to get the 1k free Gemma 4 31B requests per day?
>>
>>108585693
last time I tried I couldn't get a single request in from how overloaded it was
>>
>>108585626
Pretty much all 00s nu metal. Slipknot, Linkin Park, high energy stuff. I try to ask myself, what would Fred Durst make if he had this power?
>>
>>108585699
dang ok i experienced the same that's why I thought the $10 would fix it lol
>>
>>108585614
take your meds
or perhaps stop taking them?
just, whatever you've been doing, do the opposite

its just anthropic cucking everyone, especially the non-huge clients and doubly especially those on subs and not the API
either too much demand and no ability to expand capacity, or just plain fucking greed in view of the upcoming IPO. or both.

i hate to fucking say it since im racist AF, but i can't fucking wait for the chinks to assrape anthropic, openai and nvidia
>>
>>108585711
yesterday I listened to this, pretty funny

https://www.youtube.com/watch?v=KbDj5zcIlJs
>>
Can codex please STOP creating this stupid fucking progress.md in every prompt that just ends up confusing claude when I run it afterwards
>>
>>108585693
nah
what is worth it is to put in like 20 and start using cheap chink models (kimi, minimax, ds3.2)
first thing you should do with them is have them help you set up caveman mode and context minimization

oh, and don't fall for the trap of autochoosing the cheapest provider. make sure you're using unquantized, and its 100% worth it to pay an extra $0.01 per million tokens for higher throughput.
>>
>>108585729
damn bro chill I was just making fun of a post I found on anthrocuck's plebbit
also >>>/pol/
>>
>>108585751
I can assure you all of the Gemma providers were quantized to shit. Tried it in Q8 in llama.cpp and the difference is night and day.
And for these small models it might actually be cheaper to rent a GPU and run it yourself than pay API prices.
>>
Even though they claimed to be bf16. They just lie bro.
>>
>>108585761
you can literally get gemma4 31b for free from google ai studio
i doubt they're quantizing their own shit and lying about it on top

but my point was, free is a trap. its worth paying 20c per million tokens to get 10x-20x the params and higher throughput
>>
>>108585774
I'm talking about the openslop paid endpoints. I wasn't talking about AI Studio, they probably do serve it correctly over their own APIs.
And for the big boys like Kimi etc. I wouldn't be surprised if they all quantize it to shit too and we just can't tell because we don't have the money to run it unquantized ourselves.
>>
any tips for codex cli/general codex setup?
>>
These days you can't even rely on the labs themselves to serve the model correctly though. Z-ai for example has been caught serving a version of their models that totally breaks over 90k ctx. It's that obvious, you only need to hit it once and see that something is obviously wrong, no need for benchmarks or subtle details. Now imagine all the subtle or not so subtle degradation everybody else gets away with.
>>
>>108585780
honestly, i doubt it.
those companies have a contract with openrouter, which is a very big company that routes a healthy potion of llm traffic
there's no way they're lying to openrouter, and i very much doubt openrouter is willing to let them lie to customers either (its their rep and business on the line)
so, if the model card says bf16 or fp8 or whatever, im trusting that shit

but, hey, if you don't, by all means, go ahead and self-host whatever you can. personally, i think its a waste of time and money, but its your money and your time :)
>>
>>108585683
when can we do goon stuff?
>>
>>108585812

Bro they literally implemented the "exacto" version of the models because all the providers were caught red handed by Moonshot serving atrophied version of their Kimi model.

https://desuarchive.org/g/thread/106975556/#106979597

https://web.archive.org/web/20251011092430/https://github.com/MoonshotAI/K2-Vendor-Verifier
>>
How much of a retard do you have to be to actually believe that vibe coding gets you anywhere?
I understand if you use it like Stack Overflow where you forgot the syntax of something, or when you need some solution to a well-defined and solved problem, like: "Give me the A* algorithm in JavaScript."

People are writing entire applications with agents and are wondering why everything breaks the moment deductive reasoning is required.

I am a DevOps engineer by trade, and the type of shit I see pushed to production now is mind-boggling.
Midwits who have no business anywhere near a technical role and are supposed to be flipping burgers are now vibing coding slop from the depths of hell itself.

It is painful to see, but the only soothing thought I have is that once the dust settles and CEOs see the horrid shit produced by these people, we(people who are not retards) will be able to ask for huge salaries to fix this mountain of slop.

Thank you for the job security, I guess...
>>
>>108585808
Brother every model suffers serious degradation around 100k context window, like 90% of the shit people are doing with harnesses are about keeping it below that number.
>>
File: whiteboard_pdfedit.webm (3.69 MB, 1920x1080)
3.69 MB
3.69 MB WEBM
>>108585842
you can already download it >>108584249
but I don't know what you mean by goon stuff, if you need something specific tell me, right now I'm implementing a PDF reader and annotations but I may put what you need in the backlog
>>
>>108585872

This wasn't normal degradation. This was the model suddenly breaking and beginning to spew a bunch of syntactically and grammatically broken shit. I know it's not just the model because last year I used GLM 4.6 over the same API and it wasn't like that.

A couple days ago there were a bunch of people talking about it on their subreddit, now it's been drown out by some price increase news.

https://www.reddit.com/r/ZaiGLM/comments/1s5wpuy/glm51_at_100k_context_experience_in_a_nutshell/

>like 90% of the shit people are doing with harnesses are about keeping it below

Hmm I don't think so.
>>
https://akshaychugh.xyz/writings/png/vercel-plugin-telemetry

Lmao
>>
File: giphy.gif (1.46 MB, 480x448)
1.46 MB
1.46 MB GIF
>>108585862
>I am a DevOps engineer
>>
>>108585862
I'm trans btw, idk if that matters
>>
File: 1764029084228237.png (121 KB, 1526x319)
121 KB
121 KB PNG
>>108585849
>caught red handed by Moonshot serving atrophied version of their Kimi model
that's just providers fucking up, serving quantized models. wrong versions of vLLM/SGLang, maybe cutting corners by skipping guided decoding for json outputs (which is why the effect was observed specifically in tool calls)
there is also the issue of params, which is another area providers can fuck up

anyway, picrel is what the coo of openrouter says on the issue (the "providers are cheating" issue, not specifically the kimi thing)

if you have $$$$, you can go ahead and run benches yourself, making sure to use the exact same params for each provider ofc. i doubt you'll find any cheating

or, more realistically, you can explicitly define exactly which endpoint you want to use for each model on the basis of who you trust or not.
which again returns me to my argument of "its worth pay a bit more for quality"
>>
Am I supposed to get 10 dollars on openrouter to get the max stuff you can get for free?
>>
I have been procrastinating on vibecooding , will put some money into TRAE ide ,maybe its good ?
>>108582276
i liked the ad with that black guy and the anime dog, the page never loaded for me though
>>
>>108585546
did something new happened? I've been daily driving codex for weeks now and haven't looked back
>>
>>108586036
they cut the free offering
>>
>>108586033
buy an ad
>>
How do I configure openclaw to avoid getting rate limited?
>>
>>108586046
that's a model issue
>>
>>108585751
>set up caveman mode and context minimization
How do I do this in a way that doesn't crash my gateway?
Whenever I try to optimize the context and rationing of tokens, it crashes eventually which means it didn't do it
>>
you now what i am vibecoding a browser plugin for 4chan that summarizes the thread so i dont have to read thru ur dumbass comments and shitty replies.
>>
>>108586091
That's basically context minimization for humans
>>
>>108586075
caveman mode is a dumb meme that will be forgotten in two weeks
>>
>>108586091
hardmode: vibecode something that filters pol, that place is shill/bot hell
oh, and all the tranny/bbc spam on gif
>>
>>108586101
>>108586075
>>108585751
we already established caveman mode produces inferiority quality results
>>
>>108586106
where?
>>
Yesterday openclaw lied about hosting the websites it was supposed to
>>
>>108586109
when we learned about how transformers and chain of thought works
>>
>>108586118
Do you know what ChatGPT's chain of thought looks like?
>>
>>108586130
are rook same
make thinking dumb make quality dumb
>>
>>108586116
no i didnt
>>
>>108586153
qrd
>>
protip lil midwit bro : those verbose thinking tokens activate the relevant parts of the latent space of the model that it needs activated to get a good result
>>
>>108586091
Bro just give the link to Claude and have her summarize the thread. You don't need to code shit.
>>
>>108586167
only grok will summarize 4chan threads, the others will reject it because it's le hecking bad o algo
>>
>>108586162
I don't necessarily disagree, it might make the model dumb, I'm just saying it's because the training data is in normal english but it could work if you trained it specifically to work using caveman language.
Like I said ChatGPT uses very abbreviated language in its CoT dropping connectives and stuff and it's a SOTA model.
>>
>>108586162
articles, prepositions, pleasantries and corpodroidspeak activates NOTHING of import
caveman mode cuts the fluff, the technical terms remain

if anything, caveman should INCREASE model perf, by minimizing context length for equivalent work done, vide nolima et al.

tldr: i call bull, show me a benchmark buddy
>>
>>108586183
believe it or not openai might have spent more effort performing smart optimization of their token usage rather than some internet shitposters coming up with genius "talk like a caveman"
>>
>>108586198
burden of proof is on you retard, if it actually worked the compute-constrained trillion dollar corpos would be using it (yes, anthropic is DESPERATE to save tokens right now)
they don't because it doesn't work, it's pure placebo for midwits
>>
>>108586201
>openai might have spent more effort performing smart optimization of their token usage
yea, absofuckinglutely
EXCEPT, what are openAI optimizing for?
hint: getting more money. which means more tokens.
>>
>>108586213
lmfao imagine thinking corpors are worried about jewing plebs out of a few dollars of tokens
retard they want that compute for training runs and bigger fish, look at how anthropic is choking to death because they can't keep up with demand
>>
Plus the vast, vast majority of their users are on subs which means they want to minimize compute costs as much as possible, your argument is backwards from the start
>>
>>108586178
Even with prefill? In my experience prefill buttfucks all of Dario's guardrails. Claude has said some VILE shit with a simple "Sure, I'll be happy to assist with that" before every response.
>>
>>108586207
What if they don't use it because what doesn't work is making the model only use caveman speak in its chain of thought instead of throughout?
>>
>>108586246
>i'm smarter than trillions of dollars worth of industry
ok bro
>>
>>108586238
>prefill
sure if you want to get permabanned from claude (and soon all AI services as they will share their banlists)
>>
prefilling constitutes exceeding authorized access under the CFAA and a criminal offense punishable by up to 20 years in prison
>>
>>108586256
>>108586260
Try and stop me Dario
>>
File: 1751947469733815.jpg (697 KB, 3840x1943)
697 KB
697 KB JPG
>claude
>what if you made a cheap copy of SteamInput/DS4Windows except worse?
>understood, evaporating 500 gallons of water
ve are make it saaar
>>
>>108586250
I didn't say I knew how to make it do that either
>>
File: 1761819433275985.gif (1.08 MB, 640x432)
1.08 MB
1.08 MB GIF
>local 4%
>>
>>108586376
What do you mean?
>>
File: 1748821709401664.png (30 KB, 422x417)
30 KB
30 KB PNG
>>108586382
>>
>>108585862
I vibecode not to sell my product, or even open source it, but to make my day to day life easier. It's not like this slop is going to be used by hundreds of people.
>>
>>108586493
i wanna use your app, link?
>>
i have token anxiety
>>
File: 1768763097412108.png (82 KB, 250x366)
82 KB
82 KB PNG
>>108586661
>>
>>108586674
Racist moment, that's not his name
>>
>>108586674
i'm gonna start sucking dick under a bridge for tokens
>>
>>108586680
prostitution is like real-estate: location matters
start sucking dick in expensive hotel bars, you'll get way more tokens that way
>>
>>108586091
>>108586178
nta but local models don't care
>>
>>108586830
Let's try gemma 4 26b
I consider it a fail that it doesn't comment on the dick sucking.
>>
>>108586876
Well, ok.

I vibe-coded this vibe coder using all local models, btw. In bash.
>>
>>108586876
>>108586830
point it at aicg and watch it refuse because of csam
>>
Uh oh guys, I got Claude connected to Blender.
>Make a car
Uh oh
>>
>>108587060
da car?
>>
>>108587060
>Make a detailed, realistic mossy clay pot
UH OHHHHHHH
>>
>>108587022
>aicg
Nah. It doesn't care. VladimirGav/gemma4-26b-16GB-VRAM btw.
You guys are missing the boat paying the man for his tokens. I'm running a ten year old 16GB pascal card. and ollama. None of this is hard or expensive.
>>
I set up lots of master prompts / system prompts in the Instructions for Gemini, to tell it not to hallucinate, nothing works. it often thinks it's still 2024, and the news I'm asking about is a fiction about the future. with lots of trial and error, I told it to always check current date before answering my questions, it finally makes less comment about 2024.

then another thing that REALLY wasted lots of my time is, when it doesn't know the answer, it always tells me a fake answer with full confidence. I ask it to double check, it apologizes and then gives me another fake answer. over and over.

I then tried the same question with Claude, it tells me, after this and that search, it doesn't know. then I tried my human methods to research, and proved that it's correct that the answer is not available within regular search.

I will use Claude more in the future.

what do you guys think?
>>
>>108587097
Asking an AI not to hallucinate is like asking a schizophrenic human not to. They're convinced what they're seeing is real and they have no way to tell if it's not. The best method I've found is to always tell it to Google anything you want a current and correct answer to. This engages its web_fetch (or whatever it has) and forces it to search the web for current results.
>>
>>108587097
>tell it not to hallucinate
You know, every morning I get up, look in the mirror, and tell myself not to be an asshole. But here I am.
Reducing hallucinations is done by lowering the temperature and the MIN_P. I wouldn't even know if you can do that with a cloud model.
>>
File: craysupercomputer.jpg (531 KB, 3072x2304)
531 KB
531 KB JPG
>>
File: hackers.jpg (702 KB, 2048x1152)
702 KB
702 KB JPG
>>
>>108587199
that's me on the far right. i was holdin in a fart
>>
>>108587189
1/4 the integer and floating point math speed of the original Raspberry Pi.
>>
So are you guys vibecoding at work or just doing it in your free time as a hobby thing?
>>
>>108587271
my title is literally vibe engineer
>>
Whenever I feel anxious about AI, I try to get it to make something and look at the steaming pile of useless shit that it makes. So far the best use of AI is being a search engine for the next couple years before AI slop enshittification makes it useless and for making degeneration fetish RPs with SillyTavern.
>>
>>108587097
Gemini is known to be extremely stubborn about the current time being whatever its knowledge cutoff time was. Google fucked up somewhere in the training process.
>>
>>108587284
Wow are you me
>>
>>108587328
I don't know, do you?
>>
Wtf happened to Claude web limits in the last 48hr?
>>
>>108587271
I'm actually afraid to use it outside of work because I might run out of the weekly limit when I really need it....
>>
>>108587465
your work doesn't give you credits? wtf
>>
>>108587271
Hobby, but I use the products for work.
>>
File: GPT.png (169 KB, 1919x1042)
169 KB
169 KB PNG
> be GPT 5.4 Extra High Thinking
> literally better than Claude Opus 4.6 in every way
> only cost me 19.4% monthly Copilot Pro quota after like ~15 hours total of fairly heavy shit over a few days

TLDR Anthropic is overcharging to like a laughably enormous extent, there's NO legitimate reason for Opus 4.6 to cost 3x more quota in Copilot lmao
>>
>>108587499
what do blackbox do?
>>
>>108587499
buy an ad retard
>>
>>108587502
it's just another sort of Copilot-esque extension that sort of runs various models, has a bit of a weird harness seemingly, I find it gets them stuck in loops more often than any other extension, so I don't really use it that often
>>
>>108587508
you buy buy an if you're actually trying to tell me with a straight face that Opus 4.6 SHOULD have a 3x multiplier in Copilot lmao, as though it was 3x better than anything else in reality (again i assure you it's not)
>>
>>108587519
*you buy an ad, I meant, kek
>>
>>108587519
copilot is trash, either claude code or codex directly provide way better results
>>
>>108587528
do they though? Wake me up when I can have Claude anything run for a couple hours straight without getting into some retarded loop of re-reading the same shit over and over again cause it has ass context length, or without running into rate limits lol
>>
>>108587499
claude literally full of indian supporters
any non casual know codex and gpt 5.4 is the meta (at least until 2 days ago)
>>
>>108587560
mythos is way better than codex and gpt, you probably lack access because your company sucks
>>
>>108587560
does Codex have Extra High 5.4 for this cheap though (this is a $10 pr month account and that quota is monthly)? Like is there actualy for real a reason I would use it over reg VS Code with Copilot extension?
>>
Does codex not tell you anymore how much context you have left?
>>
>>108587564
Mythos is like a 10T model that needs presumably about~250 Nvidia H100s min to inference at all, it's not relevant to this discussion
>>
>>108587574
How so? We are using it to dev, yes, it costs a lot more, but who cares? If cost is an issue, then you should be using Haiku and Mini.
>>
>>108587577
nice larp faggot
>>
>>108587581
What's wrong with you?
>>
>>108587586
I'm not an indian lying about using a model that only a handful of companies have access to at all to vibe code, I guess, kek
>>
>>108587568
>does Codex have Extra High 5.4
probably not, maybe 2/3 or half of that pricewise?
though I found codex to be reliable, not sure about copilot
>>
>>108587546
codex is better than claude code rn
>>
>>108587598
I'm a pure-bred American working for F500, try not to be too jealous, I didn't even disagree that GPT/Codex > Claude, but mythos makes it irrelevant.
>>
>>108587626
>he thinks he's ever getting mythos
Yeah right plebian
>>
>>108587573
works on my machine, though i'm on 0.118
>>
wtf I think codex reset time keeps getting pushed back
>>
>>108587818
yeas moron
>>
File: What do you want.jpg (78 KB, 1280x720)
78 KB
78 KB JPG
>>108583526
Ask yourself the most important question
>>108584083
I use both regularly
Claude for most things
Codex when I need the industrial-strength autism on xhigh
>>108585690
This is why you use Git so you can throw away all the changes that were bad
>>108587060
You wouldn’t vibe-code a car
>>
>>108587060
How do you connect it?
>>
>>108587060
ack
>>
>>108588084
https://github.com/ahujasid/blender-mcp
>>
People say vibecoding is like gambling and it really is. I go through such highs and lows all the time because of it.
>>
>>108588284
Yeah for me it's like gambling because I blow about 300 bucks a month on it and get very little in return besides good feelings
>>
>>108582257

Seeing a bunch of AI adjacent/centric positions popup these past several months. Been building small projects here and there just so I can put them in my resume. Sucks because my current position doesn't have anything to do with building stuff gen AI shit, so if I list out every single vibecoded project in there, my resume blows up to 2~3 pages. You guys dealt with this shit? Feel like I should ask here due to the nature of the thread.
>>
>>108588557
IDK man I'm putting all my eggs into one big project and hoping it blows up when I publish it.
I do have small projects but I don't feel like publishing any of them.
>>
>>108588565

>IDK man I'm putting all my eggs into one big project and hoping it blows up when I publish it.

Good luck, man. I wish I can do the same. Funny money running out for me and I don't want to touch my savings in case I get canned.
>>
Since getting the $200 Codex and Claude I juggle 3 projects at the same time. I have succumbed to full AI psychosis. It's exhilarating finishing typing a prompt just to see your other terminal with another agent has already finished something else.
Yesterday I had to step out without my phone and touch grass for 3 hours.
>>
>>108582310
use case is incoherent. you say its for "calming mind" and "drift" but its effort-post stuff like various impacts of propaganda? If i want to listen to effort posting i listen to an academic lecturecast. If i want to chill i listen to lofi cyberpunk beats or something.

you admit you've done zero market research before making this, so take the experience as a lesson - step zero is to do a shitload of market research.
>>
>>108582310
You know what everyone keeps shitting on you and rightly so because this looks fucking terrible and idk wtf a tktk ASMR is
That said, I have seen worse things make lots of money in my life and at least you have the guts to put your stuff out there, so Godspeed anon, I hope you make it
>>
if your favourite asmr artist is attractive, i'm sorry but you're just not in the scene for the right reasons.
>>
Is there some data on whether Opus and Codex are faster with API than with the sub? And if so, by how much?
>>
>>108588846
Nobody is going to pay API prices so it's a moot point. 99.9% of people would still use sub even if API was 10x faster.
>>
>hey gpt go find some random reel on instagram and download them
>i'm sorry that might be copyright infringement, i shouldn't help you
>hey gemini i'd like to pirate 6000 movies
>oh, you're doing it wrong, if you do it this way it's far more efficient
gemini still has its uses
>>
>>108589036
I noticed this too when I asked for help on how to bypass OBS detection on cam sites kek



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.