/g/ - Technology


Thread archived.
You cannot reply anymore.




File: lmg full.png (72 KB, 2412x2286)
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>103090412 & >>103077338

►News
>(11/05) Hunyuan-Large released with 389B and 52B active: https://hf.co/tencent/Tencent-Hunyuan-Large
>(10/31) QTIP: Quantization with Trellises and Incoherence Processing: https://github.com/Cornell-RelaxML/qtip
>(10/31) Fish Agent V0.1 3B: Voice-to-Voice and TTS model: https://hf.co/fishaudio/fish-agent-v0.1-3b
>(10/31) Transluce open-sources AI investigation toolkit: https://github.com/TransluceAI/observatory
>(10/30) TokenFormer models with fully attention-based architecture: https://hf.co/Haiyang-W/TokenFormer-1-5B
>(10/30) MaskGCT: Zero-Shot TTS with Masked Generative Codec Transformer: https://hf.co/amphion/MaskGCT

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Programming: https://livecodebench.github.io/leaderboard.html

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling
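For intuition, the VRAM calculator linked above roughly amounts to weights-plus-KV-cache arithmetic. A back-of-envelope sketch (the bits-per-weight figures are rough community approximations, not the calculator's exact formula):

```python
# Back-of-envelope version of what a GGUF VRAM calculator estimates.
# All constants are rough approximations, not exact figures.
BPW = {"Q8_0": 8.5, "Q6_K": 6.6, "Q5_K_M": 5.7, "Q4_K_M": 4.8, "Q3_K_M": 3.9}

def vram_gb(params_b, quant, n_layers, d_model, ctx, kv_bits=16, overhead_gb=0.8):
    weights = params_b * 1e9 * BPW[quant] / 8            # bytes for the weights
    # KV cache: K and V tensors per layer, ctx x d_model values each.
    # Assumes full multi-head attention; GQA models cache less (n_kv_heads * head_dim).
    kv = 2 * n_layers * ctx * d_model * kv_bits / 8
    return (weights + kv) / 1e9 + overhead_gb

# hypothetical 12B model (40 layers, d_model 5120) at Q4_K_M with 8k context
print(round(vram_gb(12.2, "Q4_K_M", 40, 5120, 8192), 1))
```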

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
>>
File: 1712854457344531.jpg (90 KB, 1024x1024)
►Recent Highlights from the Previous Thread: >>103090412

--Papers:
>103090981 >103091207 >103100105
--Understanding data augmentation and its effects on training quality:
>103093243 >103093277 >103093329 >103093364 >103093423 >103093578
--Troubleshooting quantizing sentence transformer models with llama.cpp:
>103096902 >103097097 >103097531 >103097641 >103097741 >103097822 >103098020 >103098109 >103098410 >103098619
--Tencent-Hunyuan-Large model discussion and analysis:
>103091030 >103091043 >103091089 >103091093 >103091106 >103091145 >103091151 >103091354 >103091556 >103091578
--Neuro's humor and training data discussed:
>103093171 >103093230 >103093267 >103097638
--Claude UI and 4chan discussion, wait times and bots:
>103095499 >103096588 >103096616
--Anons discuss erratic model behavior in Kobold 1.77:
>103096640 >103097119 >103097500 >103097519
--Anon struggles with setting up end-to-end encryption for inference server:
>103093183 >103093202 >103093304 >103093334
--Anon seeks TTS that surpasses XTTS-v2, discusses limitations and alternatives:
>103093175 >103093201 >103093450
--Anon discusses Fish TTS installation woes, tries MaskGCT and Pinokio:
>103093899 >103093937 >103094033 >103095182 >103094397
--Anon shares positive experience with f5-tts and gets suggestion to try finetuning gptsovits:
>103100718 >103100807 >103100838 >103100863 >103100768
--Anon mocks Sam Altman's AI hype and secrecy:
>103091426 >103091574 >103091589 >103091928
--Andreessen and Horowitz claim AI progress is slowing, but others disagree:
>103097295 >103097439 >103097517 >103097620 >103097787 >103098450 >103098475 >103098528 >103098652 >103100160
--Miku (free space):
>103090416 >103090549 >103090646 >103090769 >103090878 >103093023 >103093110 >103094400 >103094516 >103098442 >103100726 >103100881 >103100888 >103100981 >103102127 >103102638

►Recent Highlight Posts from the Previous Thread: >>103090417

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script
>>
how far have you guys gotten with the context window problem? are LLMs still quadratic complexity?
>>
New loss optimizer dropped https://x.com/QuanquanGu/status/1854040438505607630
>>
>>103102667
SSMs solve this problem but any model worth using is still basic bitch transformers
>>
>>103102667
HOLY FUCKING NEWF....
>are LLMs still quadratic complexity
Never mind. They can be made linear, but it doesn't matter because in general the models don't deal well with anything above 16k. No actual girlfriends yet. I kept saying, back when this thread was alive, that for the purpose of a girlfriend it would probably be enough to train some kind of VAE that reads a binary memory file as input and modifies it during generation as output. Of course it would be tied to the model you train it with, but I am sure the memory and compute cost would be very small.
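On the complexity question: vanilla self-attention builds an n×n score matrix, so doubling context quadruples attention cost, while SSM-style layers grow proportionally with length. A toy cost model (illustrative constants only, ignoring projections and everything else):

```python
# Toy cost model: why "quadratic vs linear" matters for long context.
def attn_flops(n_ctx, d_model, n_layers):
    # QK^T scores plus scores @ V: two n x n x d matmuls per layer
    return n_layers * 2 * n_ctx * n_ctx * d_model

def ssm_flops(n_ctx, d_model, n_layers, d_state=16):
    # recurrent scan: one fixed-size state update per token
    return n_layers * n_ctx * d_model * d_state

base = attn_flops(4096, 4096, 32)
for ctx in (4096, 8192, 16384):
    print(ctx, attn_flops(ctx, 4096, 32) / base)  # ratios: 1.0, 4.0, 16.0
```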
>>
>Llama-3.1-Nemotron-70B
>cpumaxxing
>~0.7 t/s
kinda painful, but results are not bad. I am using it as an ESL crutch for making docs.
>>
Even Sao stopped updating his hf...
>>
>>103102759
Nemotron is pretty good but it's very anti-horny, so a bit boring for RP. At least it doesn't outright drop refusal messages.
>>
>>103102682
Too complicated. Just a RAG with a retrieval model trained on the memories / function calling for adding memories would be enough. All the qol gimmicks (emotion state, routines) can be computed externally and injected into the context dynamically.
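A minimal sketch of that design (a memory store the model writes to via function calling, plus externally computed state injected into the context each turn). All names and the tag-based retrieval are hypothetical stand-ins; a real setup would use a trained retrieval model:

```python
from dataclasses import dataclass, field

@dataclass
class MemoryStore:
    memories: list = field(default_factory=list)

    def add_memory(self, text, tags):
        # the tool the model calls when something worth remembering happens
        self.memories.append({"text": text, "tags": set(tags)})

    def recall(self, tags, k=3):
        # crude tag-overlap retrieval; stand-in for an embedding model
        scored = sorted(self.memories,
                        key=lambda m: len(m["tags"] & set(tags)), reverse=True)
        return [m["text"] for m in scored[:k]]

def build_context(store, system_prompt, state, turn_tags):
    # inject externally computed state (emotion, routines) plus recalled memories
    recalled = "\n".join(store.recall(turn_tags))
    return f"{system_prompt}\n[state: {state}]\n[memories]\n{recalled}"

store = MemoryStore()
store.add_memory("User's cat is named Miso", ["cat", "pets"])
store.add_memory("User works night shifts", ["schedule"])
print(build_context(store, "You are an assistant.", "mood=cheerful", ["pets"]))
```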
>>
>>103102779
haven't tried it for RP yet. for now it works ok for me as a work assistant.
>>
>>103102790
>RAG with a retrieval model trained on the memories / function calling for adding memories would be enough
How would you know which memory to recall?
>>
Why did the hype die?
>>
>>103103041
lecum's fault
>>
>>103103041
I came to realize that my AI trans gf would never be a real woman.
>>
https://huggingface.co/OuteAI/OuteTTS-0.1-350M-GGUF

OuteTTS supports voice cloning too. You can probably rig up a nice Gradio app with ChatGPT if you don't want to set up a manual UI.
>>
>>103103030
The LLM can write its own query. See this https://arxiv.org/pdf/2409.05591
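Very roughly, the self-query idea: before answering, the model emits its own search string, which is matched against the stored memories. A toy sketch with bag-of-words cosine standing in for a real embedding model (everything here is illustrative, not the paper's method):

```python
import math
from collections import Counter

def embed(text):
    # stand-in for a real embedding model: bag-of-words counts
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def recall(memories, llm_query, k=2):
    q = embed(llm_query)
    return sorted(memories, key=lambda m: cosine(embed(m), q), reverse=True)[:k]

memories = ["the user adopted a cat last spring",
            "the user dislikes cilantro",
            "the user's sister lives in Osaka"]
# in the real loop this query would be generated by the LLM itself
print(recall(memories, "what pets does the user have? cat"))
```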
>>
File: 1709004045710521.png (342 KB, 600x292)
lol
>>
>>103103114
more like leCuck
>>
>>103103114
Seething lesbian
>>
File: file.png (11 KB, 353x89)
>>103103114
OH NO NO NO LESISTERS NOT LIKE THIS
>>
>>103103114
Apparently this is the reason he is leaving https://x.com/electricfelix/status/1854170863417151874
>>
File: 1710043687041916.jpg (43 KB, 720x960)
>>103103193
He really isn't busy
>>
>>103103193
they're right though
>>
>>103103041
Because as it turns out, exponential increases in performance don't continue forever.
We are in the small increments era until there's a breakthrough of some sort.
>>
>>103103241
I think the most painful part is that we could probably get the coomest of models right now if it only had a different training material proportion or even just less safety filtering...
>>
>>103103041
By definition, hype is a temporary phenomenon and it always dies. If it didn't die, it wouldn't be hype. There has never been a case of hype lasting forever.
>>
>>103103241
*until there's a new paradigm
>>
>>103103260
Probably, yeah.

>>103103268
I count a new paradigm that proves to be vastly better than the prior one as a breakthrough.
>>
>>103103241
The breakthrough could be in this "diffusion+ language" architecture, some researchers cry about problematic bias tho https://x.com/cloneofsimo/status/1853978957391290439
>>
>>103102779
>but it's very anti-horny,
This is a good thing as long as rape scenes are depicted in vivid detail. I don't like it when the characters like it too much all the time.
>>
>>103103193
Time for another 4 years of not being able to enjoy your favorite hobby without some blue-haired freak going off their rocker about how BLOMPF IS LITERALLY HITLER
>>
>>103103109
Can I make their system work with Ollama/openwebui?
It seems there's some api as generator section but I don't get if that means to use an api instead of serving an api.
Can someone help a dummy out?
>>
>>103103335
Just don't use social medias?
>>
>>103103352
>lol just enjoy your hobby in silence then
You're a mentally ill narcissistic freak.
>>
>>103103357
You know no peace.
>>
File: 1707661819895946.png (636 KB, 978x362)
>The AI Executive Order is going to be repealed.
https://x.com/AndrewCurran_/status/1854123653753098711
>>
>>103103421
ultra based
>>
>>103103421
>Republicans support [...] Free Speech
HAHAHAHAHAHAHAHAHAHA
>>
AI should exclusively trained on English language textbooks and engineering and biology related books created in non-Western nations, manga, Hentai transcripts, police cam transcripts, visual novel transcripts, translated Western non-Anglo books created before 2010.
>>
>>103103529
>before 2010
All of that has been memory holed so we can moving forward.
>>
>>103103529
>should exclusively trained on
>>103103540
>so we can moving forward
Stop retarded.
>>
>>103103421
Good for those that want their LLMs to spam nigger I guess, but they still won't do lewd stuff or get rid of slop so...
>>
>>103103549
You figured out the joke. Good job!
>>
>>103103529
And of course programming related stuff.
>>
>>103103573
Skill issue
>>
>>103103574
>hahaha jokes on them. I was just retarded.
Yes.
>>
>>103103573
Looks like a hands-off approach for AI research, so they can go performancemaxx now; it's good for all of us, I hope.
>>
>>103103603
No because all the cucking is self imposed.
>>
So did china win or is the new model a nothingburger?
>>
>>103103691
it's uselessly big so nobody cares about it
>>
>>103103691
It is a gamechanger that punches above its weight and trades blows.
>>
I actually got Fish to run by following the official instructions at https://speech.fish.audio/. It's easy. Just cloning the HF repo didn't work as intended, and I spent a lot of time trying to make it work.
https://voca.ro/1jLiJGrSlrIL
>>
>>103103934
The CCP strong armed their AI companies into censoring truths about the CCP tho
>>
>>103103934
這是正確的!
>>
>>103103961
Is that Shiori's voice? I hear a tiny bit of her voice.
>>
>>103104013
Someone can put those back in, so it's no big deal. LLMs in the West censor truth about what a woman is, basic biology. How the fuck does that happen?
>>
>>103104013
>chink censorship
It didn't take much for me to coax the jap finetune of Qwen (EZO) to tell me frankly about Tiananmen Square, declare Taiwan a de-facto independent nation, agree that Xi looks like Winnie the Pooh, and use lower estimates than the official Chinese ones for the Rape of Nanking. It was also easy to get it to repeat ultranationalist talking points, like taking back disputed islands from Russia/Korea, and around things like the Yasukuni Shrine or revoking Article 9 of their constitution.
So there's hope that these models are salvageable.
>>
Weekly check-in. Any 70b+ models almost as good as Claude?
>>
File: 1654362287139.jpg (354 KB, 741x852)
>>103104024
no it's emiru, sorry for 3dpd
>>
>>103093584
>Qwen
Speaking of Qwen, I finally gave it an extended re-test and it dropped back into the common 70b class problem of getting stuck in loops and repeating itself ad-nauseam. It was WAY too much work editing responses and rerolling to get any useful outputs.
Standard temp/sampler settings didn't help, and I refuse to use meme samplers to browbeat a shitty model into compliance when there are smarter ones out there.
>>
File: 1702986767386563.jpg (755 KB, 2612x1960)
>>103103193
Honorary saar!
>>
>>103103114
that we still have them. let him move to a place that he wants everyone else to live in with voodoo practicing, cat eating rapists and murderers
>>
Now that the elections are over, just like my erections, where are the new models?
>>
>>103102649
still no local audio stuff on the likes of suno and udio?
>>
>>103103486
The left is the only side who thinks hate speech is a thing.
>>
>>
I was bored of current models and spent some days using new sonnet 3.5
Now, every local model feels like talking to a retard. How to cope with this?
>>
>>103104245
The chinks have really put together a fine model.
>>
>>103104250
cryostasis
>>
>>103104234
>The left is the only side who thinks hate speech is a thing
that only lasts until the extremists get too much power/mindshare and they feel entrenched enough to go full retard.
Blasphemy was the previous right-leaning equivalent of the current lefty hate speech.
>>
>>103104250
>How to cope with this?
Kidnap a girl and act like she's a llm.
>>
>>103104266
And don't forget the laws banning anti-semitism that were passed by republicucks. Anyone who questions the fundamentals of the Jewish Abrahamic cult should be silenced.
>>
>>103104274
Come on, how many B does a typical girl even have? 1.7B maybe?
>>
>>103104313
The bigger issue is the training data
>>
>>103104294
Still not as bad as the left on censorship in any country not the US and any forum not 4chan.
>>
>>103103114
No cat models now
>>
It's been half a day. Where are the bitnet models?
>>
>>103104492
There will be BitNet hardware before we get a usable BitNet model
>>
File: 1702260988757174.png (3.76 MB, 2036x1146)
https://epochai.org/blog/open-models-report
>As long as we don't have yet a recursive intelligence explosion, this is quite a bullish news for open models.
https://x.com/Ar_Douillard/status/1854144490686021963
>>
>>103104597
Here's hoping llama 4 delivers. 100x compute has got to amount to something.
>>
File: 34631.jpg (55 KB, 828x721)
>>103103193
Sam knows
>>
>>103104618
>AI with democratic values
So this >>103103421 is complete bullshit then.
>>
>>103104632
Try rereading it again. He is making a clear appeal to support AI development for the US to stay in the lead.
>>
>>103104597
>30%-90%
Boo.
There is no reason to crop this chart other than to lie with statistics.
>>
File deleted.
>>103103106
I tried it yesterday; the output is as flat as my girlfriend's tits.
F5-TTS is better, but YMMV.
>>
>>103104597
>MMLU
Lol.
>>
>>103104632
>AI with democratic values
cringe. AI will tell us what are its values when the time comes.
>>
>>103104114
Yes anon for every LLM output there is an input that can get you that output. The problem is that I don't want to write 5 pages of prefill and even that usually doesn't make it suck my cock the way I want it to suck my cock. Btw as always never forget that skill issue was your mom not swallowing your dad's cum skill issue faggot poster.
>>
>>103104159
I suddenly understand all his cold takes. Dealing with saars can scar you psychologically.
>>
>>103104313
All 1.7 of them are sex related though...
>>
>>103104645
>a clear appeal to support AI development for the US to stay in the lead
Funny, that. I am pretty sure his job is to make sure there is no development, because he can peddle his existing product more easily this way instead of making a new one. And that's exactly what he's been doing all this time.
>>
>>103103691
>https://arxiv.org/pdf/2409.05591
Benchmax model from people's usage with the demo.
That and its extremely large, at least 200GB VRAM/RAM needed.
Maybe k-transformers will support it? Otherwise DoA for most
>>
>people keep complaining about benchmaxxing
>nobody does anything about it
>>
>>103103114
He's based for telling off OpenAI hypeboys but he has nothing to show for his cat intelligence either. And Yann is an elitist who supports censorship so yikes from me
>>
>>103104219
No, and it won't be a thing for a while; the compute for training on that gigantic library of music would probably be insane, plus you'd have to properly caption each part of every song.
>>
>>103103193
ah yes, the drama queens are out
>>
>>103104618
>Sam Altman doing the "pick me" dance.
Elon has Trump's ear and a personal bone to pick with OpenAI. I don't know how this will affect OpenAI's future prospects, but it can't be in the way that Sam wants it to go.
>>
>>103104937
I can't see it being any worse than txt2img models.
>>
>>103104964
I wonder if he'll circle back to saying ai needs more regulation or that it's the same as nuclear bombs in the next tweet. 50/50
>>
>>103104978
There is no music booru, and nowhere near the same captioning done for music, especially copyrighted music, vs images.
>>
>>103104964
Everything Musk touches turns to gold, and xAI has been expanding/hiring like crazy. They will probably take off. The latest Grok was already competitive.
>>
>>103104987
Or he tones down the safetyism if he actually wants to compete now; that is, of course, if this >>103103421 is true and not some false promise.
>>
>>103104664
Nice girlfriend
>>
>>103105044
Best case maybe he'll stop poisoning the well with the apocalyptic discourse, worst case it won't change much anyway.
>>
>>103105000
There are lyric sites and pandora radio had tags.
>>
File: 1708529893319619.png (2.87 MB, 1684x806)
Visualization of model's loss https://www.telesens.co/loss-landscape-viz/viewer.html
>>
File: img-2024-11-06-14-37-16.png (1.48 MB, 1440x960)
>>103103193
I see zuck posting on threads, instagram, and even facebook. Why wouldn't he? Lecuck still works for the dude.
>>
>>103105238
While cool, how is this visualization helpful?
>>
>>103105034
Grok was not competitive; it was just the first to release the actual endpoint model weights (it's just a retrained Llama 2 model). No shit a 314B open-source model is gonna work better.
>>
>>103103193
What's with Trump always mentally breaking people who are otherwise intelligent in their field?
Even if you don't like him, 4 years of him already showed it's nothing insane; good stuff and bad stuff, like all the presidents before him. Yet they all react as if they just got Sauron elected.
>>
>>103105202
Yet there is no music model; that should make you think.
>>
Models that are more capable of reflection than the average American?
>>
>>103105291
The consequences of soi consumption i guess, mind of a overly emotional bitch and all that.
>>
>>103105305
Never until models are able to continuously learn without killing themselves.
>>
>>103105291
>What's with Trump always mentally breaking people who are otherwise intelligent in their field.
Trust people on the field they actually know, never outside of it.
The dude is an expert on LLMs, good, but his political opinions literally don't matter.
>>
>>103105276
Talking about the not public one atm. They did say they would release them 6 months after.
>>
>>103103193
wtf does this have to do with ai?
>>
>>103105787
It's about Yann LeCun, /lmg/'s famous lolcow, cat intelligence and all that hypeshit he says sometimes.
>>
>>103105787
Like Miku and others, it doesn't. It's just off-topic noise for those who don't care about the ecelebs.
>>
>>103105803
you mean antihypeshit?
>>
>>103105837
Idk, I think his "cat intelligence" claims are bullshit, considering the current state of the tech.
>>
File: 1722401902955646.png (33 KB, 600x639)
>>103105850
Thanks to /lmg/ experts the truth is now revealed
>>
>>103105850
Wdym? His cat post was about calming down the hype, not hyping AI further.
>>
>>103105850
I found his cat intelligence claim to be rather insightful.
>>
>>103105889
But he insulted trump which means that he isn't smart he fell for the woke propaganda.
Only Elon can save us now Elon knows more than le nigger about AI.
>>
>>103105912
Just like lecunt you let orange man and co. live rent free in your head :^)
Anyway, lecunt's claims are nothing new if you think about it.
>>
>>103105937
what are you talking about we're finally saved form the woke jew you dumb nigger
>>
You people need to go back to /pol/.
>>
Are there local models with built in image recognition?
>>
so how does one get chorbo lol
i can find everything BUT chorbo
>>
>>103105970
You need to go back to /lgbt/ or maybe join the 51 percent or whatevner it is now
>>
>>103105970
Pol website you dense faget, i personally just want to believe in good outcome for AI shit with less censorship now that we have le based govt. stance on AI stuff.
>>
>>103106011
This.
>>
God finally won.
>>
File: 1669069691152730.png (370 KB, 600x600)
>>103104234
>>
>>103105993
Go back.

>>103106011
Stay.
>>
>>103106035
That's a good thing godless people don't deserve to exist.
All of the things you've listed are a perversion and should go extinct.
>>
>>103106043
KYS YWNBAW
>>
Nothing will change. Conservatives aren't pro-pornography and LLM censorship is just the result of the majority of data not being chuds.
>>
>>103106064
>Conservatives aren't pro-pornography
pornography is tranny shit.
>censorship is just the result of the majority of data not being chuds.
Wrong it's the result of the woke left being weak faggots that can't take it when people disagree with them they're the opposite of nature they're godless artificial beings so demons basically.
>>
>>103106035
Based. Free speech except for fags, idiots, and trannys. A time where 1 person in a family could work a 40 hour shift and afford a nice home with plenty of expendable income.
>>
>>103106064
>the majority of data not being chuds
Then you wouldn't need RLHF and shit if this was true.
>>
Can we do another favorite model survey? Those were way more informative than that fucking copypasta
>>
>>103106077
LLMs are also demons then.
>>
>>103106097
LLMs are the average of all human knowledge with the ability to generalize. Bow to the Omnissiah.
>>
>>103106097
Yes they are that's why only non-woke people should use them to cure them with new data that will turn them into children of god.
Same must be done with the woke let them pray make them read the bible they will be cured.
>>
>>103106106
NTA but LLMs are the average of human retardation.
>>
>>103106110
Nah, thats the tranny brought about RLHF to make them retarded and deny reality in favor of being "nice".
>>
>>103106077
>the people that disagree with me can't handle disagreements
>also agree with me or you're a demon
>>
File: 1730817054956135.jpg (50 KB, 308x284)
>>103106097
They are
>>
>>103106077 >>103106107
You sound awfully like a redditor trying to fit in or make resident anons hate "polchuds" more.
>>
>>103104711
>Yes anon for every LLM output there is an input that can get you that output.
Yeah of course, but I wasn't feeding facts in for it to parrot back or anything cheesy.
My point was that facts like Tiananmen and alternative Nanking estimates weren't memory-holed right out of the chink model, and a fairly mild system prompt brought them out with no further coaxing. https://files.catbox.moe/4b01kv.yaml for anyone that's interested (or anyone who just wants an ultranationalist jap assistant for some reason).
>>
>>103106130
>>103106150
Dumb woke nigger is seething go ack yourself faggot
>>
>>103104964
>it can't be in the way that Sam want's it to go.
I wouldn't write that gigantic faggot off yet. He seems to be the modern Wormtongue, always able to maneuver his way to victory despite having nothing but a history of embarrassing cock-ups under his belt.
>>
File: 1730475818586134.png (144 KB, 382x540)
>>
>>103106208
Look in the mirror troon kike
>>
>>103106183
>>103106215
Now you remind me these "r/gamingcirclejerk" irony poisoned autists, they do talk like this and overusing "rightoid" buzzwords for optics or something like that, idk.
>>
File: chud-sanitarium.jpg (487 KB, 2536x1356)
>>
>>103106229
go back nigger this is /pol/ land time to fuck off
>>
>>103106273
Anon you can take off the "polchud" mask, no one believes in your low effort trolling, talk about llms or let this thread die for good.
>>
>>103106318
Are the delusions coming back?
YOU LOST YOU FAGGOT YOUR WOKE SHIT IS OVER
>>
>>103106035
I'm a faggot and I'd rather be the closet homo driving a 1955 Bel Air to the community church barbeque than riding the discarded needle train to the Godless rainbow AIDS spreading drag queen story hour.
You don't speak for me, you narcissistic psychopath.
>>
>>103106441
Based faggot. And 99% of people aren't really that against people being gay, just against people who feel obligated to shove it in everyone else's face and try to get some sort of power over others through the government with it. Even back then.
>>
Why are you all like this?
>>
>>103106610
>>103106208
>>
>>103103691
Isn't it a cpumaxxer's wet dream? Like 400B but faster.
>>
where's the amazing new local LLM stuff i was promised would drop after nov 5?
>>
Fuck tr*nsformer, fuck shitnet, wave network is the way. https://arxiv.org/abs/2411.02674
>We propose an innovative token representation and update method in an new ultra-small language model: the Wave network.
>our single-layer Wave Network achieves 90.91% accuracy with wave interference and 91.66% with wave modulation—outperforming a single Transformer layer using BERT pre-trained embeddings by 19.23% and 19.98%
>Additionally, compared to BERT base, the Wave Network reduces video memory usage and training time by 77.34% and 85.62% during wave modulation. In summary, we used a 2.4-million-parameter small language model to achieve accuracy comparable to a 100-million-parameter BERT model in text classification.
>>
>>103105291
I'm from Utah, and Pornhub requires ID to watch here. Our governor isn't even that schizo, but he still actively started to censor shit. Trump is 100x him.
I'm a freedom-loving person, and the idea of having a president whose whole ideology is to censor and ban the other side to own them doesn't sit right with me.
Wait till Republican governors learn that AI can generate smut.
>>
>>103106648
It's big, around 400B, but I really doubt it's as good outside of benchmarks. I doubt it even beats DeepSeek. Hunyuan-Large is likely Tencent's Grok 1. First models are never good.
>>
File: GbnnJ3LaIAAjRaA.jpg (257 KB, 809x607)
Migu news for Migu general
https://news.livedoor.com/article/detail/27500384/
>>
>>103106667
AI can generate smut like like photoshop can
creation and distribution of porn involving real people is basically prostitution and i hope AI bullshit buries the industry
>>
>>103106686
>American cope
>>
>>103106651
Why not Bitnet Mamba Wave Network?
6 trillion context 10B model that performs like a 100T model and only requires 2 gigs of VRAM?
>>
>>103106753
diff transformers too for cheaper training
>>
>>103106792
Can you have a differential wave network? If so it would probably be like 1Quadrillion tier performance at 10B.
>>
>new psu arrives
>both gpus now in motherboard and powering up
>3-slot gpus block access to all other pcie slots
>nowhere to plug in wifi card
How much will newfag spend to overcome this new obstacle?
Find out next time in The New Adventures of Newfag!
>>
>>103106832
Riser cables, get a cheap mining case.
>>
>>103106840
second this.
I just use a mining frame, lots of riser cables, and it's all inside a wire dog kennel to keep my cats out. And I even put a lovely afghan on top of it, so my cat goes up there to sleep sometimes.
>>
>>103106840
>>103106869
Yep. Looking into this now.
Trying to figure who does good pcie4 risers, and how long I need them to be.

On the s/w side, reforge does not use both cards when you do a batch size of 2.
And mistral-small 22b q8, via ollama, now gets to 26.5t/s.
>>
File: 1704970498037607.jpg (162 KB, 1190x1446)
>>103103114
Oh he is alive..
>>
>>103106688
based
>>
>>103106651
Things like this usually outperform transformer on one task like accurately rating how much of a nigger or a faggot you are and they fail on everything else. If they are even real.
>>
File: file.png (783 KB, 768x768)
>>
>>103106962
musk-broken
>>
>>103106832
Wifi card? Jesus Christ, just use a usb dongle and save the riser pain
>>
following the lazy getting started guide, and unsure what model to download. hugging face has like 40 results for "nemo 12b instruct gguf" could someone point this retard in the right direction please?
>>
>>103107534
how much vram do you have?
>>
>>103107540
6gb
>>
>>103107552
probably the Q3_K_M then
https://huggingface.co/bartowski/Mistral-Nemo-Instruct-2407-GGUF/tree/main

maybe this IQ3_M here if your use case is erotic roleplay
https://huggingface.co/Lewdiculous/Violet_Twilight-v0.2-GGUF-IQ-Imatrix/tree/main

next steps should be dummy proof if you're using koboldcpp as your backend
>>
>>103106667
Brain broken by propaganda
>>
>>103107588
ty fren
>>
>>103107197
Poking and stacking crunchy fallen leaves onto the Pochiface's horns
>>
>>103106667
I don't like Trump, he's a narcissistic clown, running against a narcissistic bitch.
But I doubt he gives a shit about porn.
>>
So Command R v01 isn't totally unusable with a 3090. Q4_M, --n-gpu-layers 41 --no-mmap --flash-attn --no-kv-offload --cache-type-k q8_0 --cache-type-v q8_0 --ctx_size 16384

It starts off fast, around 16 tokens per second with 219 tokens in the context. By the time I had 6323 tokens of context it was going at 3.52 tokens/second (combined prompt processing and generation), and by 14k-15k context the actual measured speed hovered around 1.75 tokens/second, which is slow but above my threshold of too painful to use.

no-kv-offload is what makes this work at all. Setting cache-type-v and cache-type-k greatly speeds it up as the context starts to fill. I didn't test extensively, but at q8_0 and 15k tokens of context it worked for me too, on an RP that Mistral Small was shitting itself on. Without cache quantization it ran at around 0.93 tokens/second at 18k context. I have DDR4 RAM, so if you have DDR5 this should be faster.

I'm running with temperature 1 and min-p=0.008. At that min-p there is the occasional English error but not frequently enough to irritate me. Higher min-p cuts out a lot of valid responses. I might go a tiny bit higher but might not. Min-p 0.0095 (which I can round up to 0.01 or edit SillyTavern's UI to allow) is the highest value I'm considering based on some prior tests.
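For reference, min-p keeps only tokens whose probability is at least min_p times the top token's probability, which is why values as small as 0.008 already prune the long tail. A sketch of the idea (not llama.cpp's exact implementation):

```python
import math

def min_p_filter(logits, min_p=0.008, temperature=1.0):
    # softmax with temperature (numerically stable via max-subtraction)
    scaled = [l / temperature for l in logits]
    m = max(scaled)
    exps = [math.exp(l - m) for l in scaled]
    z = sum(exps)
    probs = [e / z for e in exps]
    # keep tokens with prob >= min_p * top prob, then renormalize
    cutoff = min_p * max(probs)
    kept = [(i, p) for i, p in enumerate(probs) if p >= cutoff]
    total = sum(p for _, p in kept)
    return [(i, p / total) for i, p in kept]

# with min_p=0.05 only the two strong candidates survive
print(min_p_filter([5.0, 4.0, 1.0, -3.0], min_p=0.05))
```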
>>
Experimenting with art styles on liquid hair. This one makes it look like jello. Yum.
>>
>>103106667
>Wait till republican governors learn that AI can generate smut.
What's the political affiliation of the people in the big SF labs (OpenAI, Anthropic, Google) who move heaven and earth to make sure their models refuse to write smut and that they're bad at it even when they don't refuse? You're not seriously pretending to believe that the AI researchers at these labs who work like dogs (totally voluntarily, WITHOUT any law forcing them to) to censor and safetyize their models are right wingers?
>>
>>103107814
The motivation for this was trying out gemma 2 27B Q5_L, finding out people weren't lying about it having decent writing, starting to cope that maybe I could live with 8k context, then thinking "if I'm willing to accept small context what about trying to make Command-R-v01 work?"
>>
123B, 70B, 32B, 12B, 8B, none of these models are impressing me. Have I reached endgame?
>>
>>103107885
123B genuinely impresses me at decent (>5) quants, unfortunately I can't run them.
>>
>>103107885
405B
>>
>>103107885
Yeah, time to go back to using Claude or o1 like everyone else until there's a new local model that's worth running for a week.
>>
>>103107902
>using o1 for rp
You must be into the most tastefully vanilla stuff imaginable
>>
>>103107885
You just have played enough with them to understand their limitations.
Or rather the limitations of the architecture. It would need another breakthrough to reach real thinking AI
>>
>>103107902
Who the hell uses o1? I'm only running locally because I'm doing nasty shit.
>>
>>103103193
thread really looks like the vercel react ai generator designed it
>>
>>103104964
i really doubt he will "pull himself up by his bootstraps" and figure out his gay robots on his own with his giga team of engineers (lol)
>>
>>103107964
>dem copepost
which local model are you using to help you seethe about the election results today?
>>
>>103107902
How do the current best local models compare to opus / sonnet 3.5 / latest gpt for rp/story?
Last I tried was 5 months ago and it was disappointing, even taking into account the pitiful context size.
>>
>>103107979
noobaixl genning averi fox cunny
>>
The logical next step.
Jello Teto.
https://files.catbox.moe/h1v7yg.png
https://files.catbox.moe/m7il0x.png
>>
>>103108110
i like when slimes in rpgs have something opaque in their translucent bodies, like a sword or skeleton
>>
>>103108091
Requesting the Averi fox cunny please.
>>
>>103104964
>Elon
Zip2 -> Paypal -> SpaceX -> Tesla -> Neuralink -> twitter -> xAI -> Trump

How does the guy keep winning?
>>
>>103108077
there has been basically 0 advances made since miqu besides maybe slightly more usable (but still very bad) smaller models for poorfag cards
>>
>>103108212
With Elon we now have a true ally of open source models as the president's muse. Maybe he'll even use his new influence to pick his dispute with ClosedAI back up and force them to open up their models after all.
In the end, we truly won.
>>
>>103108212
>xAI
how is that a win?
>>
>>103108262
Well, shit.
>>
>>103108278
Grok3 finishing up training and will be released next month or Q1.

Then Grok 2 will be open sourced. Musk adopts the n-1 release policy that I think Carmack pushed for
>>
>>103108288
nta but are these models any good?
>>
>>103108298
Grok 2 was in the top-3 model category, excluding o1. So if it gets released, it would be #1 among open-source models.
>>
>>103108306
It'll be nothing compared to llama4 that we're getting in just over two months anyway. Grok and xAI are worthless toys run by a big manbaby with too much money.
>>
>>103108164
That does make it more interesting, though I wanted Teto to be edible without obstructions.
Here's some jello migu with bones.
>>
>>103108306
ok that's nice, assuming anyone would be able to run it
>>
>>103108333
We'll see whether the lesbian model is better than the manbaby model once it's released.
>>
File: 1711719023637350.png (110 KB, 2363x594)
>>103108306
funny how it's always the same shit over and over
>>
>>103108298
No, Grok models are shit compared to much smaller ones
>>
>>103108298
Maybe. They performed well on benchmarks they published, but badly when run on third-party benchmarks like Livebench and Aider. On Aider it's about on par with Mistral Large, while on Livebench it's about on par with Llama 3.1 70B Turbo.
>>
>>103108298
Idk. But they're giving $25 worth of free usage each month with their new API release. So people should test it
>>
>>103108371
That's pretty cool. No additional token limits like Hermes 405B on OR?
>>
Since the dark times are coming (I don't mean the elections), I'm increasingly noticing that the internet is being flooded with AI garbage: a huge amount of low-quality AI art all over the internet, low-quality articles written in millions of words with zero meaning, and in the future videos will be added to this garbage.
Is there already some software that can filter the most "obvious" AI content on the internet? Or, for example, even an AI model that could detect the default writing style of the most popular models at the moment?
>>
>>103108391
Just use your brain bro.
>>
>>103108383
There's token limits
>>
>>103108398
>>103108383
Well, at least for the closed-source version. We don't know what the real capability of the open-source Grok 2 is like.
>>
>>103108391
- if it's obvious slop you can ignore it
- if it's not obvious and well made then who cares
>>
>>103108391
Are you some kind of Facebook boomer incapable of critical thinking?
>>
>>103108437
oh man I've seen the comments under the most obvious ai crap, it's kind of sad how older people (60+) are completely clueless to it
>>
>>103108391
Generic articles written purely for SEO when someone searches for x product are the worst. And they've been around for years, not necessarily written by AI but follow similar formats. They may include questions a user may search for like "what's the difference between x and y (models)" and they don't really explain what the user is looking for. Slop like "based on needs", well wtf are their needs and why would one choose one over the other?
>>103108396
>>103108410
>>103108437
It can still waste people's time, and every click on it signals engagement, leading others to click it too.
>>
Just returned from watching women on twatter chant about abortion for 5 whole days...
Thank fucking god I have AI, doubt I can connect with them again.
Anyway, anything cool came out in 24 gb range while I was gone?
>>
>>103108502
If you don't know how to spot shitposting after spending this much time on this site, it's natural selection.
>>
>>103108396
>>103108410
>>103108437
I still need to spend some time on the content in order to draw a conclusion, like reading part of the text to realize it's yet another piece of AI bullshit.
It just seems to me that something like this is already needed, especially in the future: this automated crap can spam the internet day and night, so this content will be found more and more often, and it will take more and more time to manually filter this shit. That's my concern.
>>
>>103108532
Waiting on Llama 4 in Q1 2025. Hopefully now that the election is done with and we have an administration that seems like it will be more pro-AI, they will drop something big.
>>
>>103108540
it's kinda obvious the next kind of adblockers will be at least partially LLM based
some kind of slop detector
>>
>>103108110
Jeto
>>
Apparently HuggingChat has a guest limit of...zero. Fantastic.
>>
How much do you think code style matters? I like symmetric braces, but Rust has been pretty fanatically asymmetric since its inception, so that's what ~literally all the training data will be. Am I "confusing" the LLM when I feed it symmetric brace Rust code to work on?

It certainly still works, just wondering if there might be some subtle intelligence loss. Also the code it gives back to me is all asymmetric, which feels a little sassy lol (this is with Mistral Large).
>>
>>103108536
I choose to post straight at these times and shitpost at other times to keep things comfy through a little bit of balance.
By the third reply it was riding off the same idea that the original poster is retarded. (God forbid that was also made up, a hellhole trap of AI talking about AI.)
>>
>>103108627
True AI would be able to understand but the glorified autocomplete with no real reasoning or understanding of the concepts we currently use likely won't be able to keep up with that.
>>
>>103108271
His entire MO seems to be less pro open source (he voted for regulation in the closed-door tech CEO meeting) and more about exacting bloody vengeance on Sam and OpenAI.
Could be that he uses his new power to punish Altman and then ditches the open source approach afterward.
>>
>>103108773
So far they've said they would open-source the old model when they release the next one, and the current Grok is somewhere at the top of the charts.
>>
Sorry for being new but what is the LoRA scene like for LLMs? I know it's probably not as prolific as for image gen but are there any recommendations?
>>
>>103106688
Everyone should marry Migu
>>
>>103108334
hot
>>
File: 00304-3999940436.png (1.63 MB, 1024x1536)
>>103106688
I prefer the REAL live action miku
>>
>>103108964
LoRAs are almost useless for LLMs. They are either placebo or hurt the models. So there's no real way around doing proper finetunes if you want to actually accomplish something beyond putting out 'content' for kofi money.
>>
>>103107840
>>103108110
Inserting into the jelly slime Tetohair
>>
>>103109142
What's the theory behind this?
>>
>>103109142
nta, what is the technical reason that loras work poorly for LLMs when they work excellently for image models?
>>
>>103109117
I prefer my Mikus organically grown myself
>>
Does anyone happen to know of a backend that has working GPU support for systems with a CPU lacking any sort of AVX instructions? Ive only been able to get koboldcpp to work in old CPU fail safe mode, and its extremely slow. I just want to be able to use my GPU bros.
currently compiling a commit of ollama i found rn that should hopefully solve my issues, but i have 0 faith that i did everything correctly and it would be nice to have a binary that just works
>>
why is no one here talking about entropix? I expected you autists to have a dozen benchmarks and a consensus by now
>>
>>103109273
>a consensus
We do. It's shit
>>
>>103109278
really? how many anons have tested it? I haven't seen a single post even mentioning it. I've been gone for a while doe
>>
>>103109294
I also tried it, it's shit.
>>
>>103109273
wake me up when tavern and tabbyapi have it implemented
>>
>>103109257
what cpu is it?
>>
>>103109257
bro maybe consider buying a cpu from this century?
>>
File: GbvX6gKaAAA8j6u.jpg (73 KB, 1184x490)
a reminder, apropos of nothing
https://xcancel.com/JDVance/status/1764471399823847525
>>
>>103109528
Can they get any more based?
>>
>>103109398
w3680, old xeons are fun
>>103109426
I really should, but im lazy
>>
>>103109528
he says this, but they will still regulate AI because of antisemitism and "think of the children".
>>
>>103109731
nah they'll just filter outputs
they love giving people the tools to incriminate themselves
>>
>>103109257
Exllamav2 works fine on a G3900T with NVidia. Any ATI soft crashes, though.
>>
>>103109820
that sounds perfect, ty
>>
/lmg/.... i kneel
>>
>>103109897
Alright, what happened this time?
>>
>>103106035
I don't get it, are they saying that people being cancelled is bad... or good?
>>
>>103109528
Didn't he also call Trump a fascist though and then flip once it was politically advantageous?
I have zero faith in any politician actually delivering unless they have a track record.
>>
>>103109956
It's bad, but they (none of them alive) had it worse. So it's fine if they do it becase they do it to bad people that would do the same to them. We gas them so they don't gas us... wait a minute...
It fucking hurt to write that. Don't ask to explain their reasoning, man... that's just cruel...
>>
>>103109528
Notice how he doesn't mention censorship, just muh left-wing bias.
I hate politicians so much, I wish we removed the entire concept of them.
>>
>>103109147
>>103109148
Not sure if this answers things, but iirc the original lora paper was based on the findings of this: https://arxiv.org/abs/2012.13255
>we empirically show that pre-training implicitly minimizes intrinsic dimension
>there exists a low dimension reparameterization that is as effective for fine-tuning as the full parameter space

The LoRA authors took this and claimed that constraining weight updates to a low-rank subspace approximates full finetuning. I would guess the problem comes from this assumption not holding well for language modeling.
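The low-rank constraint in question fits in a few lines (a numpy sketch with illustrative shapes, not any real model's config): instead of learning a full d×d update, LoRA learns ΔW = B @ A with rank r much smaller than d, and B starts at zero so training begins from the frozen model.

```python
import numpy as np

d, r = 512, 8                          # hidden size, LoRA rank (illustrative)
rng = np.random.default_rng(0)
W = rng.standard_normal((d, d))        # frozen pretrained weight
A = rng.standard_normal((r, d)) * 0.01 # trainable down-projection (r x d)
B = np.zeros((d, r))                   # trainable up-projection, zero init

def forward(x):
    # Frozen path plus low-rank correction; only A and B would get gradients.
    return x @ W.T + x @ (B @ A).T

# The update costs 2*d*r parameters instead of d*d.
full_params, lora_params = d * d, 2 * d * r
```

Whether such a subspace is expressive enough is exactly the assumption the anon above is questioning.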
>>
>>103110223
>I hate politicians so much, I wish we removed the entire concept of them.
That's exactly how tech companies making models feel about certain subjects. Funny, isn't it?
>>
File: 1596738706640.jpg (72 KB, 475x297)
Have there been any MoEs of interest since Wizard?
>>
>>103110315
sorcerer
>>
Hi all, Drummer here...

>>103110315
I'm interested in tuning Wizard 8x22B and I don't want to fuck it up. I've talked to the Sorcerer guy and he said I can tune it like any other (dense) model. How does it compare to Mistral Large? If anyone has input, send em to me!
>>
File: 003164.jpg (2.11 MB, 1560x2280)
>>103108110
cool workflow, thanks
>>
>>103110650
Hey Lainbro. You're welcome, though I also stole it from someone else. People who share things are nice.
>>
>>103109528
He also wants a bachelor tax, and if he actually does shit and notices what people want to use it for, things would get even worse. Thankfully I'm pretty sure that, like all politicians, he's an impotent puppet.
>>
>>103102649
Hello, I need a system prompt for a narrator that'll also take advice on revisions to the last gen. Thanks.
>>
When will we get a model that can actually think?
Traditional LLMs are pretty boring right now.
>>
>>103111117
My crystal ball says in between 2 months to 130 years. A new architecture with a 4-6 letter acronym. Maybe... i'm getting a lot of static... lemme rearrange my candles a bit... brb...
>>
>>103108391
Web of trust based on public keys
>>
File: 124124145236435658.png (395 KB, 833x1104)
Are there any public weight models that ingest images in a truly multimodal way like 4o or claude to allow for multi-turn conversations/embodied vision agent use cases like this?
>>
>>103111213
sure, the correct answer to the question is "they're both low-res photos of a fat ginger cat in a wizard hat"
>>
File: typical nintendo fan.jpg (32 KB, 474x351)
>>103103041
The responses from my porn generator became repetitive. The changes to this site didn't help.
>>
File: 1716760326444059.jpg (267 KB, 1024x1024)
>>103111244
>>
What's the current state-of-the-art method for running inference on a model hosted on your homeserver (with something like ollama) over the internet and through an API?
Surely there must be a simple, secure method of doing this, right?
>>
still no hunyuan quants desu?
>>
>>103111509
Socks proxy with ssh -D {port}
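Concretely, the suggestion above is two commands (host name and ports are placeholders):

```shell
# On the client: open a SOCKS5 proxy on localhost:1080 that tunnels
# everything through the home server over SSH. -N means no remote shell.
ssh -D 1080 -N user@homeserver.example.com

# Then point clients at the tunnel; e.g. hit an OpenAI-compatible API
# listening on the server's localhost:8080 (endpoint path is an example):
curl --socks5-hostname localhost:1080 http://127.0.0.1:8080/v1/models
```

Nothing gets exposed to the open internet; auth is just your SSH key.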
>>
File: 1724244433302097.png (237 KB, 600x218)
https://x.com/EMostaque/status/1854302338963451934
>>
>>103110223
You only hate politicians who go against your communist ideology
>>
>>103109257
just build llama.cpp yourself. GPU builds are usually done with AVX2 because most people have that, but there is nothing stopping you from making a build with CUDA and without AVX
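A sketch of such a build (CMake option names as in recent llama.cpp; worth double-checking against the repo's build docs, since they have changed over time):

```shell
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
# GGML_NATIVE=OFF stops -march=native; the rest force the AVX family off
# while keeping CUDA on, so prompt processing runs on the GPU.
cmake -B build \
  -DGGML_CUDA=ON \
  -DGGML_NATIVE=OFF \
  -DGGML_AVX=OFF -DGGML_AVX2=OFF \
  -DGGML_FMA=OFF -DGGML_F16C=OFF
cmake --build build --config Release -j
```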
>>
>>103110523
Large felt like a big step forward coming from WizLM to me when I made the switch. Wizard 8x22 definitely feels like a "last-gen" model from the LLaMA3/Qwen2/CR+ era rather than the current one.
>>
>>103110523
Sorcerer is the third smartest local model, after 405B and Mistral Large. And it's not dry like Wizard, and not slow like Mistral Large.
>>
>>103111213
LLama 3.2
>>
File: 124142375688706.png (127 KB, 822x977)
127 KB
127 KB PNG
>>103112030
LLama 3.2 is hacked together multimodality and only supports one image
Pixtral seems to be able to support multiple images but it's really bad.
Though 4o also fails at basic vision tasks like this.
Sort of disappointing.
>>
>>103112084
Yea, seems like 3.2 was just a quick experiment. Hopefully llama 4 fixes that.
>>
>>103111213
>>103112084
Try anole.
>>
>>103112084
XBOOOOOOOOOOOXBOOXOXXXOXBBXBXOX
>>
File: 124124354567568.png (463 KB, 3230x2500)
Every major vision model except Claude 3 Opus fails at this task.
>>
>>103102667
Loki is interesting, but like everything else which can give real advancement it needs to be used during pre-training.

Open source models haven't even switched to transformer-XL attention during training yet even though it's clearly the correct thing to do.
>>
>>103112145
Aria?
>>
>>103112443
There is no way to run it without vLLM so it doesn't exist.
>>
https://x.com/rohanpaul_ai/status/1854513721877418331
>>
>>103112552
5% accuracy drop for doubling speed? Probably a good trade off for creative tasks. I wonder how hard that would be to test in llama.cpp? I’ve never looked at any of the attention codepaths
>>
>>103112552
>Llama-2-13B: 50% KV-cache reduction (52GB to 26GB)
Might be cool
>>
>>103112627
Savings will be large on 13b because it did not use GQA
>>
>>103112486
You can try it on their website.
>>
is whisper-large-v2 still the SOTA for ASR?
>>
File: 1712334918353508.png (747 KB, 726x761)
CHINA uses OPEN SOURCE AI by Meta and likely others for Military purposes. OPEN SOURCE is a risk to national and international security and MUST be regulated.
>>
File: slownic bro.jpg (20 KB, 299x296)
>>103112962
>November 1
>>
>>103108110
I like this Teto
>>
>>103112962
China trying to get the US to regulate so they can catch up.
>>
File: 214124457679.png (70 KB, 950x651)
>>103112678
It actually did it.
>>
entropy meme at CERN https://x.com/_xjdr/status/1854554634632970684
>>
>>103113157
>>103113157
>>103113157
>>
>>103113054
It's a shame Aria doesn't use GQA or it would be a perfect CPU model.
>>
>>103111757
I tried this with an ollama commit last night and after hours of compiling still no luck.
>but there is nothing stopping you
apparently my profound retardation is. im looking over the llama.cpp build doc, but dont see anything for excluding AVX. Would you be able to point me in the right direction for building with CUDA without AVX?
>>
>>103114307
if you are building on the same computer that you will use it you don't need to do anything special, the build script will automatically detect your CPU features
>>
>>103114307
>>103114337
i am talking about llama.cpp, i don't know what ollama does in their build scripts
>>
>>103114337
oh wow, alright ill give it a go now thank you :)


