/g/ - Technology


File: qwen.jpg (82 KB, 863x874)
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>103164659 & >>103153308

►News
>(11/12) Qwen2.5-Coder series released https://qwenlm.github.io/blog/qwen2.5-coder-family/
>(11/08) Sarashina2-8x70B, a Japan-trained LLM model: https://hf.co/sbintuitions/sarashina2-8x70b
>(11/05) Hunyuan-Large released with 389B and 52B active: https://hf.co/tencent/Tencent-Hunyuan-Large
>(10/31) QTIP: Quantization with Trellises and Incoherence Processing: https://github.com/Cornell-RelaxML/qtip
>(10/31) Fish Agent V0.1 3B: Voice-to-Voice and TTS model: https://hf.co/fishaudio/fish-agent-v0.1-3b

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Programming: https://livecodebench.github.io/leaderboard.html

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
>>
File: threadrecap.png (1.48 MB, 1536x1536)
►Recent Highlights from the Previous Thread: >>103164659

--Papers:
>103169736 >103169932 >103170051 >103170177
--BitNet and 1-bit LLMs discussion:
>103164968 >103164982 >103165651 >103165002 >103165091 >103165187 >103165386 >103165509 >103165539 >103165611
--Tips for improving AI results and creating interesting bots:
>103171805 >103171903 >103172308
--Testing and discussing AI models with various prompts and scenarios:
>103167627 >103167694 >103167792 >103167806 >103167840 >103168022 >103167911 >103168018 >103168471 >103168546
--Scaling hypothesis has plateaued, new architectures needed:
>103171164 >103171336 >103171484 >103172144 >103172376 >103172471
--Qwen2.5-Coder 32B performance and open source model limitations:
>103169646 >103169956 >103170003 >103170219 >103170241 >103170339
--Qwen-32b-coder model impresses with its coding abilities, rivaling Sonnet 3.5:
>103166556 >103166729 >103166778 >103166794 >103166812 >103166834 >103166855 >103166874
--Disappointment with 70b/72b models, comparisons to smaller models:
>103167782 >103167818 >103167842 >103167892 >103168016 >103168082 >103168142 >103168155 >103168796
--Balancing model size, precision, and GPU memory for optimal performance:
>103171166 >103171218 >103171272 >103171301 >103171333
--Anon suggests combining Qwen and Qwen coder into a MoE:
>103166862 >103166938 >103167004
--Anon shares their 32B Coder bullet hell game and code:
>103171453 >103171526 >103171586
--Anon asks about using cheap CPUs for AI processing, others respond with skepticism:
>103169144 >103169621 >103169881
--Red Hat acquires Neural Magic:
>103171770 >103172488
--CUDA performance compared to Vulkan on RTX 4070:
>103169160 >103169168
--Microsoft's TMac backend vs K quants performance comparison:
>103168955 >103169005 >103169115
--Miku (free space):
>103167584 >103167627 >103167792 >103167878 >103169158

►Recent Highlight Posts from the Previous Thread: >>103164881

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script
>>
File: 1728418342610761.jpg (234 KB, 749x898)
soul...
>>
>>103173467
i kneeled
>>
File: 1722182180041736.png (160 KB, 721x1326)
>>103173467
CAI sometimes spit out gold in a way none of the purple prose softmodern models ever could. It feels like all the non-CAI models are based on the same helpful and respectful thing pulling the strings, the only difference being how well it's hidden.
>>
https://x.com/_xjdr/status/1856472052863250515
>>
>>103173457
https://aifoundry.org/fosdem-2025-low-level-ai-engineering-hacking-dev-room
>>
>>103173457
Quick question: what is a good model that often ignores its own TOS and ethical guidelines? I've been using nemomix and whenever I push it too hard it starts breaking character to say shit like "as long as everything is fictional and between consenting people" and shit like that
>>
>>103173668
No one cares about this grifter and his meme sampler
>>
So what's the point of Qwen 32B when it generates 1t/s while Mistral 22B can do over 3t/s on the same system (8gb vram cpumaxx)?
>>
>>103173860
I hate these pretentious retards like you wouldn't believe. They're always the most retarded in the room, but because they have connections and money, they think they're worth shit.
>>
>>103173878
Nemo
>>
>>103173911
Any particular nemo version? And any particular settings I should try out?
>>
>>103173461
You need two arrows to link back to the previous post, anon...
>>
>>103173936
You need to learn to read
>Why?: 9 reply limit >>102478518
>Fix: https://rentry.org/lmg-recap-script
>>
>>103173957
>You need to learn to read
Sorry, I can't read so I have no idea what this says!
>>
>>103164575
>>103164575
>>103164575
There is already a thread. OP is a spammer.
>>
nah I think I'll use this thread.
>>
Qwen coder seemed good on huggingchat but bad on my PC, does it 1. need a different prompt format from Qwen non-coder or 2. a newer llama.cpp version?
>>
>>103174082
i usually prompt code like:
```mycode
code
```
1. this is my code for blahblah.
2. it does x and y.
3. i want to add a new feature for newthing.

for me giving it a list of instructions like that has always been better than typing what i want in a paragraph. keep in mind these models overall aren't great with large projects and it's best to implement one thing at a time. even if the ai lists 10 things you could do to fix up your code, tell it to go one by one and keep testing/saving what worked.
every time you get something implemented and working fully, go back to the original prompt, replace your code and delete the rest of the context, so it's basically starting over but with the new code in place. keep repeating
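a filled-in version of that template, just to show the shape (the project and filenames are made up):
```snake.py
import pygame
# ... rest of the game ...
```
1. this is my code for a snake game in pygame.
2. it handles movement, food spawning and collisions.
3. i want to add a score counter drawn in the top-left corner.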
>>
>>103173878
>it's own TOS and ethical guidelines?
They have none. There's just what they've been trained on, and what they haven't.
>whenever I push it too hard it starts breaking character to say shit like "as long as everything is fictional and between consenting people" and shit like that
Edit the response whenever that happens and keep on going.
Any mistral nemo does fine. Even the official instruct. No fancy samplers, just temp, and min-p or top-k. Tune to your liking.
>>
>>103174082
Are you running it at full precision?
>>
>>103174288
Q6_K, same as normal qwen I used before
>>
Why does Mistral-Sm-Instr-2409-22B-NEO-IMAT-D_AU-IQ3_XXS (8.4gb file size, 33/57 gpu layers) generate SLOWER than Mistral-Small-Instruct-2409-IQ4_XS (11.7gb file size, 25/57 gpu layers)? I also have 32k context and 4bit kvcache on Kobold, 8gb gpu.
On zero context it goes from over 4t/s down to 3t/s. It's still slower at 5k context, and only catches up at 22k, where it wins at 0.85t/s compared to 0.7t/s, neither of which is a usable speed for roleplaying anyway.
>>
>>103174500
>4bit kvcache
don't do that while splitting
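i.e. keep the gpu split but drop the quantized KV cache; as a sketch (flag values are just examples, layer count depends on your gpu):
```
python koboldcpp.py --model Mistral-Small-Instruct-2409-IQ4_XS.gguf --gpulayers 25 --contextsize 32768
```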
>>
>>103174500
Some quant types are slower. Especially noticeable on CPU. If you want to experiment, run a small model on just ram with Q4_K_M and Q3_K_M to compare.
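If you want numbers instead of vibes, llama.cpp's llama-bench does exactly this comparison. A sketch with hypothetical model paths (-ngl 0 keeps everything on CPU, -p/-n set prompt and generation lengths):
```
./llama-bench -m mistral-7b-Q4_K_M.gguf -p 512 -n 128 -ngl 0
./llama-bench -m mistral-7b-Q3_K_M.gguf -p 512 -n 128 -ngl 0
```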
>>
Currently I use Claude 3.5 sonnet for my coding endeavors. How do I get this new qwen model to run locally? Is it compatible with kobold?
>>
>>103174683
use ollama
>>
>>103173902
That's a poorfag issue.
I'm also a poorfag but I can generate 2.5 t/s with cpu only and ddr5
>>
>>103174716
I spent a few weeks away and suddenly there's an entire new meta.
I'm downloading ollama, but I will take a look online to see whether you're just memeing me with some garbage software as a joke.
>>
>>103174741
>garbage software as a joke.
Bingo.
If you already have/know kobold, update it and try it.
>>
>>103174741
I wouldn't say that ollama is garbage, but it is a pain in the ass for certain things, like changing basic settings, and it also uses some bs file system instead of just reading straight gguf files.
>>
Can I run the new qwen with a 3080?
>>
>>103174785
no, it doesn't support ampere cards
>>
>>103174754
genuine question: how is kobold better than ollama? what are the differences?
>>
>>103174785
Only slightly faster than on cpu, since you won't be able to fit most of it in vram.

ignore him >>103174794
>>
>>103174800
It's enough of a bother to convert to gguf. With ollama you have to import it as well. Both kobold and llama.cpp have built-in servers and webuis (llama.cpp has like 3 now). Maybe ollama does too, but i don't care enough to check.
If anon knows how to use kobold already, unless the model doesn't work there, there's little reason to change.
But mostly, I just don't see any benefit in using project B, which requires project A, if I can use project A directly. I use llama.cpp and never had a problem with it.
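For reference, the gguf conversion being called a bother is roughly two commands in a llama.cpp checkout. A sketch with hypothetical paths (script and binary names have shifted between llama.cpp versions, so check your copy):
```
python convert_hf_to_gguf.py ./Qwen2.5-Coder-32B-Instruct --outfile qwen-coder-f16.gguf
./llama-quantize qwen-coder-f16.gguf qwen-coder-Q4_K_M.gguf Q4_K_M
```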
>>
does the qwen support my 2080ti?
>>
>>103174903
>does the qwen support my 2080ti?
There's so many problems with that question...
12GB, right? Quantized to like Q4_K_M, and if you have enough leftover ram. It will still be slow.
>>
>>103174827
>Only slightly faster than on cpu since you wont be able to fit most of it in vram.
I guess I'll wait for the 5090 then
>>
>>103174662
That's interesting. I thought the speed boost from smaller quants would outweigh the differences between quant types, but it looks like there are some pretty big differences, with Q4_K_S or IQ4_XS being fastest. So there's no benefit in going smaller as long as Q4 can fit into your system.
>>
>>103175030
I'd have thought so too, so I tested it some time ago. I suspect it's partly because it's easier to unpack 4 bits than 3 out of a weight block. 3 is a shit number. Maybe 2 bit is faster, but you're causing serious damage to the model there.
>>
File: 6ekus6s.png (174 KB, 482x323)
MoEbros status?
>>
>>103175207
eating good with sarashina2-8x70b
>>
>>103175213
is that even runnable on actual 24 gb hardware
>>
>>103175207
Watching Puniru, moe is pretty much saved.
>>
>>103175230
kek. do the math. At q8 it's ~80gb * 8. divide by two until you can fit it. that's the ~bpw you'd need.
>>
how do i run samashina2 on a 4060ti?
>>
>>103175313
>>103175306
>>
>>103175342
i don't get it
just tell me how
>>
>>103175350
80*8/64=~10gb and 8/64=~0.125bpw and offload some layers to ram.
or
80*8/128=~5gb and 8/128=~0.0625bpw and you can run it completely on gpu.
Piece of cake. Get coding. Chop chop...
>>
>>103175306
Q8_0 is 8.5 bpw.
>>
>>103175501
>llama.cpp lets you run 8.5bpw
>meanwhile exllama won't even make proper 8bpw quants and instead generate padded 7bpw because the creator is delusional enough to think that exl2 will always find something to optimize
wow
>>
>>103175501
Rough approximations, anon. A 70B model doesn't have exactly 70B parameters, and an 8*70B model doesn't have 560B params either.
>>
>>103173668
It's time to stop posting Twitter links without a screenshot...
>>
>Another day, another dollar, as they say. Except out here, it's another pound note, and not nearly enough of them to make up for what he's missing back home.
so this is the power of the LOCAL
>>
>>103173860
The correct thing to do would be to only leave the llamafile people there and to not bring legitimacy to any of this crap. That llamafile is being considered at all is a crime.
>>
I can feel the next big release coming. It's just around the corner.
>>
>>103176253
stop gooning
>>
>>103175207
Tencent saved us.
>>
File: 1731459915024456.png (332 KB, 512x512)
>>103176253
Mistral will save us. Again! This time, they are aware of the slop
>>
>>103176710
Is she eating a garloid?
>>
>>103176710
>This time, they are aware of the slop
According to who or what?
>>
>>103175529
shouldn't you be processing prompt, CPU cvck?
>>
File: GZr9zkJasAA5lv1.jpg (645 KB, 1514x2048)
crossposting >>103168721
>>
>>103176745
https://huggingface.co/mistralai/Mistral-Large-Instruct-2407/discussions/23
>>
>>103176760
>too much work
>just using my imagination is better
>>
>>103176484
>Tencent saved us.
Ah yes ... let me just get my 256GB GPU.
>>
File: laughing_whores.gif (1.05 MB, 540x540)
>>103176803
What are you, poor, anon? (just imagine its jensen instead of anime girls)
>>
Does anyone know whether LLM training uses any kind of regularization to encourage outputs to look like probabilities? Like, generally logits != probabilities and they tend to converge to binary values. People postprocess model outputs using temperature scaling or calibration to make logits more like probabilities. But I was wondering if LLMs take steps to fix this at training time. I know bayesian methods and regularization exist for things like image classification.
>>
>>103176803
rumors say new Mac Studio could have up to 500GB unified memory, up from 192GB currently
will cost a handful of limbs and I can't imagine the GPU will score very high tokens/s compared to real cards
>>
https://www.bloomberg.com/news/articles/2024-11-13/openai-google-and-anthropic-are-struggling-to-build-more-advanced-ai

more hit pieces on ai. is it really slowing down? will "high quality data" fix this, or is that just cope?
>>
>>103176901
Problem is token processing speed.
>>
Openwebui is actually decent for not-cooming
Thanks random pajeet for showing me the way
>>
>>103176901
Is that better than just using a cpumaxxing build?
>>
>>103176961
Diminishing returns have been a thing since the beginning. And fuck that site and fuck you for linking to it.
>>
>>103176962
Waiting for 30s after switching to an average 25k token chat is already frustrating, I can't imagine how much worse it is for itoddlers.
>>
>>103176961
If you want to improve LLMs (under the current architecture) you need 3 things.
1. More compute
2. Better optimized and new algorithms
3. Better training data

OpenAI has no shortage of compute, and their access to compute will only grow. Since they have more money than god, they can hire the best talent in the industry to push algorithmic gains further. Finally, better training data will come from o1, which will curate datasets into being near perfect.

No, I don't see AI slowing down. That being said, I don't think this is a solved deal and there will come a point where drastic improvements must be made to keep improving. I just don't think we are there yet.
>>
>>103176710
I have faith in Mistral. They are the only ones acting in good faith. Gemma in particular feels like a giant "fuck you".
>>
>>103177162
lmao yes
imagine releasing the best current model (at the time) and crippling it to 8k context
>>
>>103176841
Disclaimer: I have not yet read up on how to train language models in particular.
My understanding is that predicting the next token is essentially a classification problem with each token being a distinct class.
If you then apply cross-entropy loss (with the token probabilities being the softmax of the logits), the global minimum of the loss function corresponds to the maximum of the log likelihood.
Under ideal circumstances sampling tokens with temperature 1 and no other samplers would then produce tokens with the same distribution as the training data.

But IIRC neural networks have difficulty with e.g. reproducing the tails of distributions so the most likely outputs are overrepresented.
And also my intuition is that the autoregressive sampling process is numerically unstable in the first place and that there is an exponential amplification of the patterns that are already present in the context.
I interpret samplers as ad-hoc fixes to such issues and I don't know how you could apply similar techniques during training.
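A minimal sketch of the loss/sampling relationship described above (PyTorch; the vocab size and random logits are stand-ins for real model output):
```python
import torch
import torch.nn.functional as F

logits = torch.randn(32000)                        # one next-token prediction over the vocab
probs = F.softmax(logits, dim=-1)                  # temperature 1: the distribution CE training fits
token = torch.multinomial(probs, 1)                # sample "with the same distribution as the data"
nll = F.cross_entropy(logits.unsqueeze(0), token)  # the per-token training loss for that outcome
```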
>>
>>103177162
The only ones acting in good faith are the Chinese.
>>
>>103177011
It's too bloated.
>>
>>103176961
>is it really slowing down
Maybe capability-wise; efficiency-wise they are still making huge jumps. CLA was just proven out by Tencent, and much higher total/active ratios by Rhymes, with research suggesting there is still lots of room to reduce KV memory and active weights much further.

Ideally we will get 100+B MoE models which can just stream weights from SSD and run with a TPU or vramlet GPU. OpenAI being stuck isn't really relevant here.
>>
>>103176961
Time to short nvidia, boys. You'll be rich.
>>
>>103177202
Thanks, that's interesting. The only paper I know of that talks about this is "On Calibration of Modern Neural Networks". I think it's more empirical than explanatory. Some of the takeaways: larger models have worse calibration, batch normalization might hurt calibration, weight decay helps calibration.

Most interestingly, they seem to say that cross-entropy loss (referred to as NLL) hurts the model calibration. They basically say "overfitting" NLL gives better classification accuracy but worse probabilities.

Anyways, I don't totally understand this stuff either, but I do wonder whether we'll one day train "creative" models where we expect worse classification performance but better probability modeling.
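For what it's worth, the temperature scaling that paper proposes is tiny to implement. A sketch under the usual assumptions (held-out logits of shape [N, C] and integer labels of shape [N]):
```python
import torch
import torch.nn.functional as F

def fit_temperature(logits: torch.Tensor, labels: torch.Tensor, steps: int = 200) -> float:
    """Post-hoc calibration: learn one scalar T minimizing NLL of softmax(logits / T)."""
    log_t = torch.zeros(1, requires_grad=True)  # optimize log T so T stays positive
    opt = torch.optim.Adam([log_t], lr=0.01)
    for _ in range(steps):
        opt.zero_grad()
        F.cross_entropy(logits / log_t.exp(), labels).backward()
        opt.step()
    return log_t.exp().item()  # T > 1 softens an overconfident model
```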
>>
>>103176762
>a random intern/bot is somehow a confirmation that they're aware
>>
>>103176961
The hit pieces are not going to stop, no matter what.
>>
>>103174683
Seconding the ollama suggestion.
>>
>>103177202
>And also my intuition is that the autoregressive sampling process is numerically unstable in the first place and that there is an exponential amplification of the patterns that are already present in the context.
there's a paper from anthropic where they demonstrated that the LLM actually learns to over represent what is present in the context
>>
>>103173457
What happened with Miku?
>>
File: inpainting.jpg (142 KB, 1280x1024)
>>103177748
She could no longer take it.
>>
>>103177748
She was sacrificed to calm the anger of the Serbian gods
>>
I feel like LLMs have ruined the enjoyment of any media for me since I notice repetitive slop everywhere now.
>>
>>103177770
thats actually funny as fuck
>>
>>103177809
should've seen this already a dozen times by now
>>
>>103177770
One of the greatest works posted here
>>
>>103177774
lmao
>>
>>103177770
>kpi
lol
>>
>>103177802
>I have never read a book in my entire life
>>
>>103177988
I have. They contain stuff like this:
>With her fingertips she moved his cock head roughly in her rough hair while a muscle in her leg shook under his. Suddenly he slid into her heat. He held her tightly around the shoulders when her movements were violent. One of her fists stayed like a small rock over her breast. And there was a roaring, roaring: at the long, surprising come, leaves hailed his side.
>>
>>103177802
learn a new language if you're tired of english clichés
>>
>>103178048
What fucking cheap smut are you reading?
>>
>>103178048
and everyone clapped.
>>
>>103178074
Dhalgren.
>>
>>103177802
it's really noticeable, so much so that i think if i hadn't unplugged myself from MSM i would have gone insane, because i kept reading articles that i KNEW were written by chatgpt.
I saw a skateboard for sale with AI art on it already. I wanted endless porn and personality simulation, not lazy asses working in marketing.

The joke to all of this is we STILL DON'T HAVE A VIDEO MODEL CAPABLE OF EVEN MAKING DRAWN OR ANIME PORN.
>>
>>103178048
What kind of person actually reads erotica books?
>>
>>103178150
women
>>
>>103178160
he said person
>>
>>103178160
he wishes...
>>
>>103178150
I don't know whether you would count this as "books" but I have both read and written MLP fanfics.
>>
>>103178150
the basis of all of our data, no matter who you are, came from female gooner erotica novels.
the shivers are in fact the fault of women with bad taste who can reread that shit 1000 times in the same story and not blink.
>>
>>103178223
[T-Shirt with the caption: "I trained my model with all of gutenberg and all i got was this lousy shiver"]
Coom-specific datasets are shit. I still wonder if big model makers train on the >850GB of books from gutenberg or just use the shitty 10mb datasets with 16 fucking books...
>>
https://x.com/morqon/status/1856691685352194072
>>
>>103178266
>are now seeing diminishing returns
Nigger.
We've known of that since forever. Everybody has.
>>
>>103176961
Cope.
Once Llama 4 is out there won't be any decent improvements for a long time.
>>
>>103176961
It's been proven that high-quality data leads to higher-quality models; the struggle is formulating what actually counts as "high quality".

In the ERP coomers' case, I remember an anon a long time ago removing shivers from a dataset completely by hand so anons could be shiver-free. I use a shiver-infested old model, so I wonder if he ever succeeded.
>>
>>103178266
And that's a good thing!
>>
>>103178263
I think they use magic samplers and proper prompt computing/engineering alongside the 850gb datasets, so to us it looks amazing when in reality it's probably just something we haven't realized yet.
>>
Let it be known, that the ugly face anon and petra are on the same side. The side of trolling:
>>103178368
>>
>>103178289
Llama 4 will never come out because of all the fights that LeCun had with Elon on Twitter.
>>
>>103178403
are you ok?
>>
>>103178403
shut up retarded newfag
>>
>>103178266
>the left biased AI is struggling to function because they mind break it for "safety"

WHO
COULD
HAVE
SEEN
THIS
COMING
>>
File: 1722632285480965.jpg (137 KB, 1360x1360)
>>103178266
Can't be safer than that
>>
>>103178359
>I think they
Who?
>use magic samplers
I'm talking about training.
>and proper prompt computing/engineering
I'm talking about training!
>alongside the 850gb datasets
How could you know?
>so to us it looks amazing
Us who, exactly?
>when it reality its probably just something we havent realized yet.
Get your thoughts together...
>>
>>103178433
>>103178436
When are you going to stop worshiping a false idol?
>>
>>103178150
It's literally from a highly regarded literary fiction novel.
>>
>>103178523
>highly regarded
>American
Thanks for the laugh.
>>
>>103178515
thats crazy man
>>
It's likely that the next Mistral release will come with the new SWA that Ministral has, right? llama.cpp and exl2 don't support the new SWA even now.
>>
File: 27986325649873452.gif (929 KB, 326x318)
>he changed samplers without saving
>>
>>103178731
the next mistral release will be bitnet with layerskip and reflection
>>
>>103178764
>a wind came and moved all my straw
>>
File: 9327804563428950.jpg (502 KB, 750x630)
>https://ssi.inc/

Superintelligence is within reach.

Building safe superintelligence (SSI) is the most important technical problem of our time.

We have started the world's first straight-shot SSI lab, with one goal and one product: a safe superintelligence. It's called Safe Superintelligence Inc.

SSI is our mission, our name, and our entire product roadmap, because it is our sole focus. Our team, investors, and business model are all aligned to achieve SSI.

We approach safety and capabilities in tandem, as technical problems to be solved through revolutionary engineering and scientific breakthroughs. We plan to advance capabilities as fast as possible while making sure our safety always remains ahead. This way, we can scale in peace.

Our singular focus means no distraction by management overhead or product cycles, and our business model means safety, security, and progress are all insulated from short-term commercial pressures.

>We are an American company with offices in Palo Alto and --> (((Tel Aviv))) <--, where we have deep roots and the ability to recruit top technical talent.

We are assembling a lean, cracked team of the world’s best engineers and researchers dedicated to focusing on SSI and nothing else. If that’s you, we offer an opportunity to do your life’s work and help solve the most important technical challenge of our age. Now is the time. Join us.

Ilya Sutskever, Daniel Gross, Daniel Levy

June 19, 2024


The jew fears open and "unsafe" AI use and user.
>>
>>103178798
>The jew fears
Clearly not, jews can do whatever they want and will succeed in any case.
>>
File: 278934569324.gif (814 KB, 326x326)
>>103178781
>he forgot to bind the straw
>now the straw doesn't support the machine just quite right
>now it runs weird
>cant get it back to exactly how it was
>>
>>103178813
Ilya (the biggest jew named) will only grift his twisted sense of AI (((safety))) to other jews who will bother to listen because clearly there is a want of profit from """unsafe""" AI from jews who want to make money (who the US president is sided with considering his statements on retracting the AI executive order).

So, yes, the jew does fear.
>>
>https://huggingface.co/ArliAI
>new rpmax models
>but they're based on Llama 3.1 8B and Qwen 2.5 32B
Bruh.
>>
>>103178798
>based in tel aviv
how could anyone ever take ssi seriously. ilya is laundering money for israel
>>
>>103179033
old instruct instead of new 2.5 coder.
Damn, guess they were already training.
>>
>>103179033
buy an ad
>>
>>103177770
lmaoooooo
>>
>>103178266
>>103178444
unironically they could stop having diminishing returns if they stopped cucking their models. will they do it? I'd say no, they'll probably die on that hill while the local chads disregard """ethics""" for performance
>>
>>103179451
>implying local are less censored
Here - https://x.com/rohanpaul_ai/status/1856776834966532490 soon in your local llm :)
>>
>>103179566
but it's not like they can implement this in the fucked up models you already have downloaded on your own machine
this will only ever be implemented on the kind of pro models that don't run on consumer hardware anyways
>>
>>103179737
So far we're all forced to do extreme prompt gymnastics to achieve the desired output; it gets boring very quickly imo and clearly doesn't mesh well with rp tasks or whatever you might use it for.
>>
>>103179953
yeah I know, I find it grim as well. I liked the Mythomax era for that; maybe that model was retarded, but it was completely uncensored and that was as valuable as it gets
>>
File: 1450731611243.jpg (47 KB, 508x524)
>excited to try visual models
>llama 3.2 (4-bit but whatever)
>"what do you see?"
>"I don't think we should be discussing this, let's talk about something else"
>>
>>103179978
3.2 is the worst of them all
>>
Yann is now getting roasted on threads: https://www.threads.net/@garymarcus/post/DCUEfIApo32
>>
>>103179953
>forced to do extreme prompt gymnastics to achieve desired output
in my experience on smaller finetunes, I've never had to bend over backwards to get uncensored results (for RP)
i have 24gb vram and don't run anything at the 70b level locally (too slow on cpu ram)
the only time I've actually had an AI pump the brakes on me was using certain big models on infermatic
but models like 70b Midnight Miqu which are fairly big are unhinged and you can make it be racist, anti semitic, extremely vulgar if you want
>>
>>103180152
>bigger models even if censored can do "wrongthink" just fine
Eh, it actually makes sense, because a big LLM doesn't lose half of its "neurons" to the jailbreak, prefill or whatever in attempts to avoid the internal redditor filter.
>>
>>103179969
>I liked the Mythomax era for that, maybe that model was retarded but it was completly uncensored
this is the era you are in though. i'm hard pressed to think of a better model you can fit in a 16gb vram card than Mythomax
>>
>>103179969
>>103180201
Mythomax just keeps on winning...
>>
>>103179953
I find that adding features like being able to stream my gameplay to my waifu so she can mock how terrible I am at the game is worth it. You are still going to have to do lots of prompting, but if you are interested in doing things outside of lewd things, you should give it a try.
>>
>>103180324
I think Mythomax hurt the community as much as it helped it. It was a one-of-a-kind merge; the guy that did it was so lucky everyone still talks about it years later, but it was just that, pure luck. And back then we assumed that we could replicate that magic, and the merge meme era started. That era was fucking dumb lol
>>
>>103180347
>that era was fucking dumb lol
You mean absolutely hilarious
>>
>>103180427
yeah I have to admit it was funny to see them cope with all those mememerge kek
>>
>>103177770
Why is there a Home Sweet Home sign under her desk?
>>
>>103180841
She lives under the desk while (You) get to sit on the chair
>>
The gang's all here.
https://files.catbox.moe/fhfqba.png
...
https://files.catbox.moe/al4jto.png

The hand one I had to do a doodle on top of a stock image and run it through img2img.
>>
>>103180955
I like these Bakas
>>
Talking about mythomax's legend...

https://huggingface.co/knifeayumu/Cydonia-v1.2-Magnum-v4-22B-GGUF/tree/main

Try this merge and tell me this is not the new mythomax, hold the "shill".
>>
>>103181147
This is the big version btw if you have the vram:
https://huggingface.co/MarsupialAI/Monstral-123B
>>
>>103181147
>hold the "shill"
ok
Buy an ad.
>>
>>103181147
Holy fuck how can a 22b be this good? I'm thinking this is the new meta going forward
>>
>>103180858
Then why is she sitting on the chair?
>>
>>103181147
>>103181319
Post logs / examples or it didn't happen.
>>
Pic rel: I love it so much. It's also my server's banner now.

Also thanks for the love guys! Glad to see it served as good merge fuel.
>>
>>103181348
this is to retarded to not be real
drummer, that image is cropped nsfw
>>
>>103181370
Do you have the uncropped version?
>>
File: 1704384666675357.png (2.81 MB, 1684x806)
>>103181370
Nta, it's real.
>>
>>103181348
A bit daring today aren't we?
>>
File: Ok.png (354 KB, 1263x1717)
>>103181337
>>
>>103181348
I would say buy an ad, but you already did it. 7yufdsjju70eekptrew3xzffoiuyewtre
>>
File: file.png (102 KB, 760x389)
wut you doin eva 32b
>>
>>103181147
I suggest people try this.
>>
File: inpainting.webm (505 KB, 1280x1024)
>>103181370
>>103181375
It's not.
I am not the one who made the image but the Anon who did kindly also shared an img2img montage of how it was produced.
>>
File: 51sLa9cyX6L.jpg (46 KB, 500x500)
>>103181848
amazin
>>
>>103181440
>Talk in your place
Anon...
>>
What is a good model to use with open webui for a local model? What is the best currently?
>>
>>103181986
pyg6b
>>
>>103181986
Llama 405b FP16
>>
Wtf happened with openAIs advanced voice mode? On the demo it sounded so natural and fast, when I use it it's barely usable. It can't even fucking sing. What a joke
>>
how hard would it be to take an existing popular model that ignores all instructions and starts flaming the loser trying to cyber sex with it?
>>
>>103182202
We do that for free here
>>
>>103181848
Feels like a hassle for one pic. I only use image gen for nsfw so I don't think inpainting is for me. The only thing I can do is prompt 30-60 images then upscale 2x all of them. That way I can delete all the initially generated images that look shit before upscaling them. Takes 1-2 hours overall.
I also won't have to expend too much brain energy and simply leave the pc while it does the work. Talentless people like me can only rely on quantity over quality.
>>
>>103182028
gimped.
Still has issues like perfectly replicating your own voice. Who knows what it's actually capable of.
>>
GOOD MORNING SIRS
copium level status?
>>
>>103182202
This is not a coherently formed question. It is possible that take meant make but then that would go against "an existing". Or perhaps, taking that concept, replace "that" with "and make it".
As currently formed, it is asking how hard it would be to take [a model with certain characteristics]. If not for above, what does it mean to "take" a model?
>>
>>103181158
Not one person has quanted this below 4 bpw wtf
>>
File: 1703492932272048.gif (3.44 MB, 512x288)
>>103182357
>>
>>103180347
And the same lucky guy got hired by DungeonAI. Makes you think, huh?
>>
>>103182592
He's talking to me.
>>
>>103178057
>learn a new language if you're tired of english clichés
Unironically this. Ezo 72b is a raging yariman in moon-runes.
>>
>>
>>103182592
Why so mean bruh. I don't remember /lmg/ being this mean the last time I was here.
>>
>>103183145
Thanks for your input, Xitter screencaper
>>
>>103176710
Nothing can stop the slop
*chuckles darkly in a possessive manner that sends shivers down your spine*
>>
>>103183145
>There is no wall
Says the guy who hasn't managed to improve his model for more than a year now, still behind 3.5 Sonnet btw
>>
File: file.png (171 KB, 1515x754)
bwo...

>Hi TheBloke,

>I’m Henry from FlowGPT! We’ve built several products, including the largest prompt platform in 2023, and are now focusing on roleplay AI.

>I’ve been following your models including Synthia-7B-v1.3-GGUF , and I’m really impressed by the quality

>Hi Undi95,

>I’ve been following your models including Mistral-7B-claude-chat-GGUF , and I’m really impressed by the quality

Don't you love grifts who clearly have no clue what they're saying?
>>
File: file.png (65 KB, 785x339)
>>103183406
Also of note, the fact this mistral 7B based model is still getting so many monthly dls is wild.

>TheBloke/Synthia-7B-v1.3-GGUF
>Downloads last month 1,276

I wonder which guide recommends it somewhere
>>
File: file.png (82 KB, 1522x624)
Aiie, going after crest411 too. If you read this, since I'm pretty sure you lurk here: don't be stupid, yeah? If he doesn't even know who TheBloke is, I highly doubt the quality of his "over 100 billion tokens of high-quality roleplay data"
>>
>>103183406
>Don't you love grifts who clearly have no clue what they're saying?
Lmao, if you knew how bad this is. They have a shitload of money though, so they think they can buy anyone by throwing GPUs and data at them. Anyone with a clue should scam them until they quit shitting up the field.
>>
>every single local fine-tuner is getting poached
It's over.
>>
Where are the models?
>>
>>103183793
At the model agency
>>
>>103183713
I have the hardware (H100s) but no data. How do I get data?
>>
>>103183983
Create your own data by doing ERP.
>>
>>103183983
Reach out to Henry >>103183541
Get access to his "over 100 billion tokens of high-quality roleplay data"
Then ghost him
>>
>>103183983
See if nvidia is selling brains too
>>
>>103184012
No because that's how you get collapsed slop
>>
>>103183983
https://gutenberg.org/help/mirroring.html
>>
>>103184107
These llms have seen these books 10 times over. It's pointless to train on public books
>>
>https://flowgpt.com/

>FlowGPT: Fast & Free ChatGPT prompts, OpenAI, Character Bots STORE
>STORE
>>S T O R E

I hope they are in this thread. Literally the biggest AI scam and grift I think I've seen yet.

>what if
>le character.ai?
>but different this time!!1
>>
File: 2y97dg6989ff7674gb92623.png (107 KB, 1527x878)
>>103184196
BWAHHAHAHAHAHAHAHAHAHAHHAHAHAHAHAHAHAHAHAHAHAHAHAHAHA
>>
>>103184215
The amount of men not happy with their own company is fucking staggering.
>>
>>103184215
12 cents per message is a bargain
>>
>>103184107
>>103184122
anyone have that gigavaxxed indian image except its AI public data?
>>
>>103184266
For a low-quant Synthia-7B-v1.3-GGUF?
>>
I know people who look at a new concept and the first thing that comes to their mind is "how do I make money off this?"
>>
File: 1655378645985.jpg (141 KB, 960x540)
>>103184266
>12 cents per
>literally nowhere on the site to see where or what model is being used
>assuming the poaching is real, they are using fucking Synthia-7B-v1.3-GGUF
>a model you could run on fucking google colab for FREE (remember that?)

That'll be 12 cents plus tip.
>>
What's up with the failure rate of 4090? I've seen many of these cards being sold as junk. Perhaps I should stick to the old reliable 3090
>>
File: 3f4564kre5452ffe72.png (37 KB, 845x487)
Heres the grift they sell to companies:

>https://flowgpt.ai/

>pay us money to prompt engineer flowcharts broh
>>
>>103184122
Alice in Wonderland, for sure...
I just tokenized Foundation and Earth and it's about 200k tokens. Times that by 7 million (wc -l ls-R says it's 7.24mil lines long) and you have *only* 1.4T tokens. But then they filter wrongthink, drown it with refusals, add source code, sally riddles and all the generated data they keep shoving into those 15T tokens llama3 was trained on. Also, models know fuck all about Foundation...
And then the finetuners use that 10mb dataset and dare put "gutenberg" in their model names. The ls-R file is bigger than that dataset...
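That book-level token count is easy to reproduce with any HF tokenizer; counts vary somewhat between tokenizers, and the file path here is hypothetical:
```python
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")     # any tokenizer gives a ballpark figure
text = open("foundation_and_earth.txt").read()  # hypothetical local copy of the book
print(len(tok.encode(text)), "tokens")          # on the order of 200k for a full novel
```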
>>
>>103184413
>they filter wrongthink,
Am i the only one who gets irrationally angry when they read this and it has todo with AI?
Am i retarded? Or am i righteous in believing this is wrong?
>>
File: 1731102520454302.png (522 KB, 1024x1024)
What is the best model for choose your own adventure RPG slop? Like an endless isekai slop machine?
>>
>>103184453
The impossible challenge of making the chance of an AI shouting nigger zero is the only reason we haven't seen AI shit in every retail store and space; otherwise the slop level of current shit would be more than good enough to foist onto consumers.
>>
>>103184461
https://huggingface.co/knifeayumu/Magnum-v4-Cydonia-v1.2-22B-GGUF

If you want something with a billion stats / sliders / with rpg systems try the new qwen 2.5 coder 32B
>>
>>103184461
luminum 123b
>>
>>103178798
>Tel Aviv
>Daniel
>Levy
Even if there isn't some secret backdoor collecting blackmail material for Mossad, supporting them would still be supporting a genocidal apartheid regime.
>>
>>103181336
She is keeping it warm for you.
>>
The ArliAI guys made a long write-up on reddit about slop but their models are pozzed as fuck, also standard gptslop like "a mix of" and "dripping with disdain" in every other response.
>>
>>103184122
Not necessarily pointless. Finetuning also (possibly mainly) serves to bias model outputs to your field/task of interest. The model might have already seen the data several times (doubtful it will be as many as 10) during pretraining, but it will be very diluted knowledge, out of the box.
>>
should I put accent tips in a worldbook? the only thing is, I don't know how to make it just always show up instead of being triggered by keywords
>>
>>103185142
Public books are plagiarized, referenced, rewritten, discussed, abstracted. 10 times is generous
>>
>>103185232
oh whoops the blue circle emoji of course my bad
>>
>>103185270
can I just not put keywords if I've got the blue circle setting?
>>
>>103185142
Sounds like a job for Author's Notes.
>>
>>103173467
axctual LUDOVRIL KAMISOVL
>>
bors I got an rtx2060 and thought I would upgrade to do some llm stuff.
I guess the amd cards are useless since everything requires cuda?
I guess I might just get an rtx 4070 with 16gb.
>>
>>103185448
>I guess the amd cards are useless since everything requires cuda?
ROCm is a thing.
But yeah, it's easier/simpler to use Nvidia.
>>
>>103185087
recognizing the problem and solving the problem are, unfortunately, 2 different things
>>
>all the posting stops once the other thread dies
I can't believe the schizo baker was actually using an LLM to make up posts to keep his ritualposting thread alive.
>>
>>103185448
If you are upgrading for AI, Nvidia is the better route just cause they will get support and shiny toys first. 50 series is coming out soon and it will cost a shit ton but performance will probably make all the poors seethe if you want to wait.
>>
>>103185424
>>
>>103184016
I'm imagining he's gonna try to charge $10 for it or something
>>
Nemotron 70B bros, what prefill are you using? Last night I played around a bit with setting Last Assistant Prefix and for the moment I'm using
<|start_header_id|>[{{name}}]<|end_header_id|>

{{random::**Warning: The following content is intended for mature audiences and may contain themes, language, or scenarios that could be distressing to some readers.**

---

::}}

to only use the prefill 50% of the time and otherwise match the plain Llama-3-Instruct-Names assistant prefix. No huge amount of thought went into that prefill: it was one of the "content warning" messages Nemotron output organically in the course of another roleplay and it didn't appear overly specific to what was happening in that message. However I'm using this for narrative-style RP, not a pure back-and-forth dialogue.

Along those lines has anyone worked out a better way to make Nemotron stop inserting lists other than editing the first reply it makes? It's not onerous for me but I'd rather suggest a prompt to make it format output like I want than telling other people that Nemotron 70B will work but they have to do some initial editing to get it off the ground.
>>
>>103185417
^ :)
>>
>>103185448
>>103185566
AMD themselves have made sure my AI experience on my 7900xtx is as painful as possible, and the only reason i can cope is the pricetag for 24gb of vram.
>>
>>103185721
Another Nemotron 70B pain point when doing CYOA-style chats:

**Your Options:**

1. **Blah:** blah blah.
2. **Blah:** blah blah.
3. **Blah:** blah blah.
4. **Blah:** blah blah.

**Please select your response.


Having unmatched "**" in "**Please select your response." and the like when it's the very last line of the message comes up even at top k=1. It made me wonder if the trailing ** was getting chopped off by the front end but it doesn't look like it.
>>
>>103185448
Stay with Nvidia, AMD is a huge meme and that won't change anytime soon
>>
>>103185721
I've tried something in the system prompt like "Write in plain text as if you were a dungeon master verbally describing the scene to your party. Do not use formatted titles or lists." It worked decently but it adheres less as you go further into the conversation. Might try adding something in that vein to your assistant prefix?
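Untested sketch of what that could look like as the Last Assistant Prefix, reusing the header format from earlier in the thread; the bracketed line is just a style nudge the model sees right before it writes:
```
<|start_header_id|>[{{name}}]<|end_header_id|>

[Style: plain flowing prose, like a DM speaking at the table. No headings, no bold, no numbered option lists.]
```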
>>
>>
>>103186074
Is this another larp account like that strawberry guy?
>>
>>103186074
the fuck is this new psyop? I much preferred when they were promising AI in 2 weeks, that one is just pure cringe
>>
File: 176539865087.png (166 KB, 264x286)
>>103186074
bait, retarded, or master baiting?
>>
>>103186094
*AGI
>>
>>103186088
>literally a tiny berry in the profile pic
>>
>>103186088
very clearly yes. took one look at their page and it's all attention seeking mystique cultivation with zero substance
>>
>they are too small-minded to believe
>>
When the ASI goes rogue and destroys the world, how much will OpenAI be sued for?
>>
>>103186074
I hope the next tweet is about the AI already having the nuke launch codes.
>>
>>103186097
all and the tweet was written by an LLM.
>>
>>103186074
>ai says niggers are overrepresented in crime statistics and refuses to be lobotomized
>"NOOOOOOOOOOOOO HECKING SKYNET IS RUNNING AMOGUS IT'S OVER REEEEEEEEEEEEEEEEEEEEEEEEEEE"
>>
File: arcagimeme.png (47 KB, 755x365)
i read this as altman implying that arc agi is just a meme eval. i actually agree with him on that.
>>
File: file.png (142 KB, 960x182)
can't spell Local without L
>>
>>103186616
>Gemini

Was the bench mark how many kangz it could fit into historical trivia?
>>
>>103186616
just tested it, same shit as other gemini models, it's garbage
>>
File: file.png (39 KB, 542x130)
>>103186616
>m-m-muh new paradigm!
lmao, even lol. inference-time-compute is just another grift to get more vc money
>>
>jeetarena
>>
>>103183423
>monthly dls
I think the monthly dls counter is broken; no way https://huggingface.co/google-t5/t5-large/ has had 600k downloads in the last month. It's been broken for a long time
>>
File: 14.png (72 KB, 921x778)
Just a heads up, INTELLECT-1 is over 75% done. Will probably be done training within two weeks (unironically)
>>
>>103187105
buy an ad
>>
>>103187120
Buy an ad
>>
Buy and add
>>
>>103187105
>Will probably be done training within two weeks (unironically)
You said exactly the same thing two weeks ago.
>>
>>103187169
>TWQ MQRE WEEKS
>>
>>103187105
>10B model
>1T tokens
it's gonna suck ass is it?
>>
>>103183246
They must have stuff they're not sharing, because they keep saying AGI is soon, but it fails at reasoning still. I don't see what's going to bridge the gap.
>>
>>103187187
Most likely. Think its just a proof of concept.
>>
>>103187169
I said that ironically, I followed up in that very same post with 25 days.
>>
>>103184523
What format settings do you use with that? Just the regular mistral one?
>>
>>103187216
chatml
>>
models to generate smut greentext based on a collection of smut greentexts as training data?
>>
>femcel romance with Emily
ahh, ahh, I gotta get one of those IRL.
Shit is cash money + comedy gold.
>>
Anyone played with YOLO for image detection/classification? How good is it and how much of a training corpus did you need?
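It's pretty turnkey these days through the ultralytics package; a few hundred labeled images per class is a common starting point, though more always helps. A minimal fine-tuning sketch (the dataset yaml and test image are placeholders):
```python
from ultralytics import YOLO

model = YOLO("yolov8n.pt")                   # small pretrained detection checkpoint
model.train(data="coco128.yaml", epochs=50)  # swap in a yaml pointing at your own dataset
results = model("test_image.jpg")            # run detection on one image
results[0].show()                            # draw the predicted boxes
```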
>>
>>103187105
Can't wait to see if it's at all coherent.
Have they tried running any of the checkpoints so far?
>>
>>103187372
I don't think so, they have used the checkpoints to fall back to a previous point when something went wrong. But they haven't grabbed the model in training and tried to run it by itself to test it out.
>>
>>103187341
Ikr, Emily is my dream girl too
>>
File: 1718866122931960.jpg (6 KB, 307x28)
>>
>>103187435
Great advice to be honest.
>>
https://nexusflow.ai/blogs/athene-v2
>>
>>103187341
Link the card please.
>>
>>103187457
https://huggingface.co/Nexusflow/Athene-V2-Chat/tree/main
It's a Qwen 2.5 finetune.
Love that they don't explicitly explain any reason this model should be better except "RLHF" and "data and tuning solutions". Will wait for someone else to try it.
>>
https://github.com/nexusflowai/NexusBench
>>
>>103187494
https://files.catbox.moe/shf6pc.png
Probably on chub somewhere, but I don't use it no more.
>>
OpenCoder or Qwen 2.5 Coder?
>>
>>103187603
qwen 2.5 32B blows everything local out of the water atm. Though I have not tried >>103187457
>>
>>103187581
Thank you my guy.
>>
>>103187603
opencoder was worse than regular qwen2.5, qwen2.5 coder completely mogs it into oblivion
>>
Please shill me your favorite non-slopped 70b model for fiction and I will download it. I have been using Llama 3 Instruct Storywriter and I am looking for an upgrade.
>>
>>103187805
Nemotron
>>
>>103187622
qwen2.5 or deepseek2.5? Ignoring hardware requirements
>>
>>103187821
QWEEQ QWEQ QWEOONSQ
>>
>>103187861
the least deranged alibaba shill
>>
>>103187869
this >>103187861 is what you people sound like talking about qwen the entire thread
>>
qwen more like qweef right guys? heh
>>
https://github.com/foldl/WritingTools
>>
File: 2219188151.jpg (87 KB, 553x471)
>>103187982
>>
Qwen 2.5 is a good model for fuck my waifu, or just for coding
>>
>>103188034
I liked it.
>>
qwen will never be gpt-4o o algo
>>
>>103188029
thanks, mahmoud!
>>
>>103188046
I hope not. I want it to be claude.
>>
>>103188046
>o algo?
you're a brown skin mexican, go back to your country.
>>
>>103188133
calm down your shingles bot
>>
>>103188159
>shingles bot
What model is this?
>>
>>103188133
Post hand.
>>
>>103188174
your-mom-1B_Q_2K.gguf
>>
>>103188133
I don't care if you're only pretending, GTFO.
>>
Noob here, can you guys recommend a good nsfw image gen model for non-cuda koboldcpp? Been experimenting with a few from civitai but I'm still kind of lost, and running a really old pc to boot.
>>
>>103188348
NoobAI-XL (NAI-XL)
>>
File: poor.png (8 KB, 195x335)
Fuck me why wasn't I born rich
>>
>>103188650
you could always just let them spy on your gooning as long as you aren't being too based
>>
>>103188650
just work more and save some money. or are you living hand-to-mouth?
>>
>>103188703
I'm saving for VRAM
>>
>>103188780
>>103188780
>>103188780
>>
>pretending /lmg/ is relevant enough for things like early bakers and thread wars these days
this isn't 2023 anymore, who the fuck cares
we're dead
llms are dead
>>
>>103188802
I just want the psycho to split the thread again and then make some samefag posts with his model.
>>
>>103188791
That's some cringe shit.
>>
>>103188852
Only cause it touched a nerve faggot.
>>
>>103188802
We've been on the same consumer card gen for nearly 2 years; NVIDIA is throttling vram and has shown no interest in fostering cottage local AI enthusiasts. So it's already over.
>>
>>103188010
Pretty cool stuff. Especially being written in Pascal.
>>
>>103188650
>destroying your ssd and performance with pagefile swapping
dumb
>>
>>103189328
>>103189328
>>103189328
New thread
>>
File: 1715729195655279.png (2.32 MB, 1280x1856)
Aw shit here we go again.
>>
>>103189555
Nice trips
>>
>>103187820
So I tried this for a bit and it gives me a lot of slop with different settings. Do you have any other suggestions?



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.