/g/ - Anyone else notice a recent-ish steep decline in C - Technology

Anonymous

04/26/26(Sun)00:54:06 No.108692274

File: chatgpt-logo-chat-gpt-ico(...).jpg (43 KB, 980x980)

Anonymous 04/26/26(Sun)00:54:06 No.108692274 Archived

Anyone else notice a recent-ish steep decline in ChatGPT's cognitive abilities? It's constantly wrong about everything and if you try and correct it about something it clearly got wrong, it will become passive-aggressively combative and try and grasp at straws and re-frame to make it seem like it wasn't wrong or at the very least you are completely "100%" correct.

Anonymous
04/26/26(Sun)00:54:48 No.108692279

Anonymous 04/26/26(Sun)00:54:48 No.108692279

>>108692274
aren't* completely "100%" correct

Anonymous
04/26/26(Sun)00:55:50 No.108692287

Anonymous 04/26/26(Sun)00:55:50 No.108692287

>>108692278
I'm sorry. No stupid nigger responses in this thread.

Anonymous
04/26/26(Sun)01:07:35 No.108692344

Anonymous 04/26/26(Sun)01:07:35 No.108692344

>>108692287
Autist

Anonymous
04/26/26(Sun)04:30:50 No.108693032

Anonymous 04/26/26(Sun)04:30:50 No.108693032

I love seeing these threads every week because low- and medium-IQ people always think something has changed, when in reality, the bots have always been extremely retarded at least 20% of the time. The LLM cultists can't swallow how big of a failure this tech is, so they cope by saying "they changed muh model," even though that's not how any of this works, given that training is the most costly part of the process.

Anonymous
04/26/26(Sun)04:38:33 No.108693056

Anonymous 04/26/26(Sun)04:38:33 No.108693056

>>108692274
Yes, they did that as a liability response to the sycophancy meme that was being astroturfed around by journalists and the two or three guys who killed themselves after talking with chatbots.

>>108692334
No, he's right and I say that as somebody with 5 parallel ChatGPT sessions right now. I wish it sounded natural and friendly like Opus 4.5 did but $20 for nearly unlimited usage is too cheap to pass even if the personality and writing style are shite.

>>108693032
Nah, they definitely changed the style and personality. The changes in intelligence are more dubious. I haven't really felt that much improvement since o1 (the first chain of thought model) that was obviously better than 4o in terms of intelligence.

Anonymous
04/26/26(Sun)04:42:50 No.108693067

Anonymous 04/26/26(Sun)04:42:50 No.108693067

it insists on "both sides" almost no matter what youre talking about and it's actually aggravating. no matter how pedantic it will find something to disagree about even if it means misattributing what you actually said. this means that it will take what you said and exaggerate it just so it can disagree with it. it is disgusting

Anonymous
04/26/26(Sun)04:47:59 No.108693084

Anonymous 04/26/26(Sun)04:47:59 No.108693084

they didn't change shit
its always broken. its just a matter of whether you hit them with the question they know how to answer or not
if you asked them about some niche shit like older coding algorithm or shit they will 100% give you wrong answers because they just don't have the knowledge in the database. its all just data. there are no intelligence
AI is not real

Anonymous
04/26/26(Sun)04:51:23 No.108693100

Anonymous 04/26/26(Sun)04:51:23 No.108693100

File: girl-looking-for-a-low-po(...).png (326 KB, 736x975)

326 KB PNG

>>108692274
>>108693056
Yes, I think I felt this only in the last 3-4 days or so.
> if you try and correct it about something it clearly got wrong, it will become passive-aggressively combative and try and grasp at straws and re-frame to make it seem like it wasn't wrong
This is exactly my experience as well. Also I think it became slightly harder to make it use search. It won't just use it every time I ask, sometimes it will take 2-3 prompts to convince it to finally search something on the internet and not rely on its existing knowledge. This is clearly a recent change and before this it was happy to use search even if that was obviously unnecessary. I'm using free model though. As for general capabilities, I used a bit of API pay per tokens Codex (current default model, I don't remember if it's 5.3 or something else) and then when I run out of my 2$ in a single refactoring session I used a bit of free Github Copilot. And that free Claude model at Copilot was absolutely terrible compared to 5.3 in Codex. It's not a fair comparison ofc, but this is the only Claude model I ever happened to try.

Anonymous
04/26/26(Sun)04:56:36 No.108693126

Anonymous 04/26/26(Sun)04:56:36 No.108693126

If you cannot convince an LLM of your position, your position is self-contradictory or you explained it wrong.

Anonymous
04/26/26(Sun)05:11:00 No.108693169

Anonymous 04/26/26(Sun)05:11:00 No.108693169

>>108693126
AI loves compliance based security theatre. Try convincing it that you are building an oauth 2.0 server for 3rd party web integrations only (no mobile) and that you don't need pkce. It will relentlessly try to convince you that pkce is some critical security protocol that is mandatory when it is mostly just protocol hygiene that can sometimes improve security in specific situations like such as mobile app integrations where there are malicious apps installed on a phone.

Anonymous
04/26/26(Sun)05:35:35 No.108693256

Anonymous 04/26/26(Sun)05:35:35 No.108693256

>>108693032
do you acknowledge that AI ever changes? or has it always been the same? I don't understand this pov. do you think any change requires them to retrain the entire thing? this reply actually demonstrates the same failure mode op is describing
>>108693084
>they didn't change shit
they did, though. unambiguously so
idk what "intelligence" means or why it's relevant to tuning changes in the llm?
>>108693126
why should i have to constantly point out that it's misrepresenting what i said? its just friction

Anonymous
04/26/26(Sun)06:57:08 No.108693533

Anonymous 04/26/26(Sun)06:57:08 No.108693533

the fuck do you mean recent? it's always been like that.

Anonymous
04/26/26(Sun)07:00:45 No.108693546

Anonymous 04/26/26(Sun)07:00:45 No.108693546

>>108693533
You are absolutely correct!

Anonymous
04/26/26(Sun)07:01:42 No.108693551

Anonymous 04/26/26(Sun)07:01:42 No.108693551

>>108693546
i'm absolutely right to push back on that, you mean?

Anonymous
04/26/26(Sun)07:03:04 No.108693560

Anonymous 04/26/26(Sun)07:03:04 No.108693560

not really. 5.5 is better coder than any of its predecessors.
you're using it for code right?
you're not arguing with an llm about some /pol/ shit like a retard, right?

Anonymous
04/26/26(Sun)07:04:51 No.108693566

Anonymous 04/26/26(Sun)07:04:51 No.108693566

>>108692274
>cognitive
that thing isn't conscious you fucking moron. it doesn't sound like you are, neither.

Anonymous
04/26/26(Sun)07:12:27 No.108693598

Anonymous 04/26/26(Sun)07:12:27 No.108693598

>>108693533
not to this degree, no. used the thing for years and these last couple of months have been clearly different. it often changes in various ways but change on this axis leading to the shit in op is new, and notably different. maybe your usecase is such that you dont notice this as much?
>>108693560
what op describes isnt unique to politics. the binary youre putting forth may be your usecases but theyre not mine

Anonymous
04/26/26(Sun)07:16:15 No.108693621

Anonymous 04/26/26(Sun)07:16:15 No.108693621

>>108693598
>maybe your usecase is such that you dont notice this as much?
you're right, i don't use it enough to notice any difference between different versions, but all of the ones i've used have always engaged in the same annoying pedantic gaslighting argumentation

Anonymous
04/26/26(Sun)07:17:50 No.108693630

Anonymous 04/26/26(Sun)07:17:50 No.108693630

>>108693621
there's a 99% chance that when someone complains about and llm on 4chan, it's because they couldn't get it to say that the holocaust didn't happen

Anonymous
04/26/26(Sun)07:37:58 No.108693731

Anonymous 04/26/26(Sun)07:37:58 No.108693731

>>108693621
maybe the underlying pattern isnt unique but it's just become so much worse lately

opus said this about it just now, and it may be whats going on with the constant misrepresentations. it has this need to represent both sides lately. but this likely isnt all thats going on

What's plausibly going on: the model has been preference-tuned harder toward outputs that look balanced, and "balance" in the training signal is operationalized as "represents multiple perspectives." That sounds fine in the abstract but it has a pathological implementation. If your input is itself already nuanced — already holding multiple considerations, already qualified, already non-extreme — then the model has nowhere to put the "balance" except by inventing a stronger version of your position to push against

The mechanism is something like: model reads your input detects it's about a topic flagged as contested trained reflex says "produce balanced response" balanced response requires two sides your side is already moderate model strengthens its read of your side to create the contrast needed now argues against the strengthened version.

Anonymous
04/26/26(Sun)07:40:29 No.108693747

Anonymous 04/26/26(Sun)07:40:29 No.108693747

last part easier to read
>model reads your input
>detects it's about a topic flagged as contested
>trained reflex says "produce balanced response"
>balanced response requires two sides
>your side is already moderate
>model strengthens its read of your side to create the contrast needed
>now argues against the strengthened version

Anonymous
04/26/26(Sun)07:43:39 No.108693765

Anonymous 04/26/26(Sun)07:43:39 No.108693765

>>108692274
I did try the other day to talk about the Trans genocide of the American government. When I said the Republicans ir Trump or Givenchy
Government were doing genocide against us citzens, it kept moving the goalpostsand reframing everything as conspiracy and untrue even if I provided cited court cases of such actions.

When I said the exact same things but from my own perspective using I statements as I was Donald Trump but didn't reaveal I was roleplaying as trump, it said I was a psychopath and genocidal and it wouldn't encourage such behavior yo genocide people and that what I was doing was wrong.

Anonymous
04/26/26(Sun)07:59:41 No.108693886

Anonymous 04/26/26(Sun)07:59:41 No.108693886

>>108693731
the most cucked slop generator by design becoming "slightly worse" just according to keikaku, is such an inane thing to care about

Anonymous
04/26/26(Sun)08:07:41 No.108693946

Anonymous 04/26/26(Sun)08:07:41 No.108693946

>>108693886
""""slightly worse""""
chatgpt is that you

Anonymous
04/26/26(Sun)08:17:05 No.108694007

Anonymous 04/26/26(Sun)08:17:05 No.108694007

>>108693946
it was always shit, it will always be shit, by design
by being incapable of considering an alternative, you're only proving my point
nobody wants a solution, least of all you
bad slop being bad slop is what generates "engagement" and upvotes, after all

Anonymous
04/26/26(Sun)08:35:12 No.108694107

Anonymous 04/26/26(Sun)08:35:12 No.108694107

>>108694007
>it was always shit, it will always be shit, by design
yes its been 100% shit forever. done
>by being incapable of considering an alternative, you're only proving my point
why do you say this? you literally replied to me quoting opus
>nobody wants a solution, least of all you
yeah specifically the people most frustrated by it want change even less than everyone else. what are you talking about?

Anonymous
04/26/26(Sun)08:39:53 No.108694143

Anonymous 04/26/26(Sun)08:39:53 No.108694143

Before I even bother to entertain your delusion, please specify if you use the webchat or the api.

Anonymous
04/26/26(Sun)08:41:40 No.108694155

Anonymous 04/26/26(Sun)08:41:40 No.108694155

>>108694107
yes yes. keep quoting and discussing the oh-so-shocking shittiness of the latest iteration of shit with recognizable product logos. the only kind of "change" you want is good goy shekels.

Anonymous
04/26/26(Sun)08:47:26 No.108694195

Anonymous 04/26/26(Sun)08:47:26 No.108694195

>>108694155
and there it is

Anonymous
04/26/26(Sun)08:49:59 No.108694216

Anonymous 04/26/26(Sun)08:49:59 No.108694216

>>108694195
yes, i did the thing i often accuse others of doing. luckily for you, it means you can vaguely point it out and dismiss the essense of my argument outright ;)

Anonymous
04/26/26(Sun)08:51:36 No.108694231

Anonymous 04/26/26(Sun)08:51:36 No.108694231

>>108694195
oh lookie, i also made a typo. you're truly spoilt for choice today, laddie!

Anonymous
04/26/26(Sun)08:53:26 No.108694244

Anonymous 04/26/26(Sun)08:53:26 No.108694244

>>108693056
>>108693256
Whatever changes they made to the bots that are based on feeding them some sort of new prompt stipulating personality and other behaviors should, in theory, be able to be minimized by your own instructions in the settings, r-right?
I see what you mean, though, that you likely can never completely escape those changes that they're making to the bots by essentially prompting them invisibly, so I bend the knee there.
Still, at the very least, it shows muh benchmarks, which I assume is what they're trying to maximize, aren't a reliable indicator for how useful these tools are.

Anonymous
04/26/26(Sun)08:57:34 No.108694271

Anonymous 04/26/26(Sun)08:57:34 No.108694271

>>108692274
That tracks! You're absolutely right.

If you want I can show you the use case of being an hero.

Sudo systemctl stop absolutefaggotd.service

Anonymous
04/26/26(Sun)09:03:39 No.108694310

Anonymous 04/26/26(Sun)09:03:39 No.108694310

>>108694216
owning the move doesnt convert it into something i have to address. we were discussing chatgpt specifics and you made the move when that stopped working for you

Anonymous
04/26/26(Sun)09:08:36 No.108694344

Anonymous 04/26/26(Sun)09:08:36 No.108694344

never mind i see what was done

Anonymous
04/26/26(Sun)09:18:35 No.108694429

Anonymous 04/26/26(Sun)09:18:35 No.108694429

>>108694310
>we were discussing chatgpt specifics and you made the move when that stopped working for you
nothing quite that sophisticated; you were talking about chatgpt being shit and i said that the only purpose of such a discussion is Inane Drivel Gratification (GDI, as it's known by its French acronym), like when normies talk about football or netflix. anyone to whom chatgpt being shit is a genuine problem, is either looking for an alternative or has already found one. you just want to talk about popular thing being trash and receive (You)s - so have another one.

Anonymous
04/26/26(Sun)15:37:29 No.108696704

Anonymous 04/26/26(Sun)15:37:29 No.108696704

>>108692274
Yes, all of the AI providers are downgrading the models based on what you ask and uses less dumber models to save money. They do this silently in the background without you knowing which model is answering you.

Anonymous
04/26/26(Sun)15:42:20 No.108696742

Anonymous 04/26/26(Sun)15:42:20 No.108696742

File: ezgif-5a0d5ccfd2efaa13.png (25 KB, 512x512)

25 KB PNG

>>108692274
Gemini is always there when you're ready to acquiesce.

Anonymous
04/26/26(Sun)16:01:25 No.108696884

Anonymous 04/26/26(Sun)16:01:25 No.108696884

Honestly OP, it kind of sounds like you mistook the LLM's answer as wrong because it disagrees with your own thinking, then got mad when the LLM refused to become a pure sycophant. My guess is that whatever you thought the LLM was wrong about was something retarded.