I have a quad 3090 setup running Qwen Coder 30B with 30k context. I send one message with the Qwen Code CLI telling it to review my files and explain the project as a test. It gets through 3 files before the context runs out. This was with maybe 500 lines of code total. What gives? A chatbot can go on forever, but the moment I try to use an agent it's pretty much worthless. I'm at the point of trying to set up context management and RAG to mimic even a fraction of what Gemini's app builder does. I don't want the cloud, I just want my LLM on my hardware. Hundreds of gigs of RAM and VRAM. I should be able to at least have it review some files. Is this shit just ass or is it me who is the brainlet?
first of all, 30k context is absolutely minuscule for what you're asking of it; much smaller models can stay coherent with much bigger contexts. second of all, the big chatbots have code that intervenes when the context fills up and summarizes what's happened so far (compaction). that's why after a while they'll still forget who said what and gaslight the shit out of you.
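the compaction loop is conceptually something like this. rough python sketch, assuming a local OpenAI-compatible server like llama-server on :8080; the model name, budget, and prompt are all made up:

from openai import OpenAI

client = OpenAI(base_url="http://localhost:8080/v1", api_key="none")
CTX_BUDGET = 24000  # leave headroom below a 30k window

def approx_tokens(messages):
    # crude estimate: ~4 chars per token
    return sum(len(m["content"]) for m in messages) // 4

def compact(history):
    # when near the budget, collapse the old turns into a one-message summary
    if approx_tokens(history) < CTX_BUDGET:
        return history
    old, recent = history[:-4], history[-4:]
    summary = client.chat.completions.create(
        model="qwen-coder",
        messages=[{
            "role": "user",
            "content": "Summarize this conversation, keeping file names and decisions:\n"
                       + "\n".join(m["content"] for m in old),
        }],
    ).choices[0].message.content
    return [{"role": "system", "content": "Summary so far: " + summary}] + recent

the summary is lossy, which is exactly why they start misremembering who said what.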
>>107652113
I mean the context options go like 16k, 30k, 60k, then 128k. Idk if 30k is really that small for reviewing a few .cs files.
>>107652098
>quad 3090
Ayo nigga gimme one, you don't need all 4.
>>107652098
yea this happens with legacy cards
>>107652098
I guess you want to increase the context to something closer to what the remote LLMs give you.
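if you're serving with llama.cpp that's just the context flag, e.g. llama-server -m qwen-coder-30b.gguf -c 65536 (filename made up, and check what your model actually supports before cranking it). the KV cache eats more VRAM the bigger you set it, but with 4x3090 you have the room.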
>>107652098
Try to find out how much of the context window it's using as it goes, and/or what it's actually putting in there. I know the first is possible because I've seen tools that show that stat, but I can't tell you exactly how since I only use proprietary internal tooling at work.
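if you just want a rough count yourself, tokenize the files and see how fast they eat the window. sketch, assumes the transformers package; the tokenizer repo here is a guess, point it at whatever you're actually running:

from pathlib import Path
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-Coder-32B-Instruct")
total = 0
for f in sorted(Path("src").rglob("*.cs")):
    # count tokens per file so you can see which ones blow the budget
    n = len(tok.encode(f.read_text(errors="ignore")))
    total += n
    print(f"{f}: {n} tokens")
print(f"total: {total} of your 30k window, before the agent's own prompts")

the agent's system prompt plus tool-call chatter can be tens of k on its own, which is probably where your 30k actually went.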