/g/ - Technology

/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>102011438 & >>102001133

►News
>(08/22) Jamba 1.5: 52B & 398B MoE: https://hf.co/collections/ai21labs/jamba-15-66c44befa474a917fcf55251
>(08/20) Microsoft's Phi-3.5 released: mini+MoE+vision: https://hf.co/microsoft/Phi-3.5-MoE-instruct
>(08/16) MiniCPM-V-2.6 support merged: https://github.com/ggerganov/llama.cpp/pull/8967
>(08/15) Hermes 3 released, full finetunes of Llama 3.1 base models: https://hf.co/collections/NousResearch/hermes-3-66bd6c01399b14b08fe335ea
>(08/12) Falcon Mamba 7B model from TII UAE: https://hf.co/tiiuae/falcon-mamba-7b

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Programming: https://hf.co/spaces/mike-ravkine/can-ai-code-results

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
>>
File: img_1.jpg (324 KB, 1360x768)
►Recent Highlights from the Previous Thread: >>102011438

(1/2)

--Paper: Minitron Approach for compressing LLMs using pruning and distillation: >>102019300 >>102021940
--Papers: >>102019494 >>102019215
--Jamba 1.5 Mini (12B active/52B total) and Jamba 1.5 Large (94B active/398B total) released: >>102025061 >>102025083 >>102025105 >>102025135
--Planning a collaborative storytelling session with Mikupad and Llama.cpp: >>102017625 >>102018191 >>102018413 >>102018638
--Phi-3-medium-128k-instruct-onnx-cpu runs fast on CPU, GGUF q8 quant available: >>102012668 >>102012876
--Ollama struggles with extracting user credentials due to formatting issues: >>102018067 >>102018168 >>102021202
--MoE and sparse architectures discussion: >>102012838 >>102012877 >>102012959 >>102013545 >>102013637 >>102013760
--Gemma 2 2b control vector experiments and results: >>102019052 >>102019178 >>102019506 >>102019204 >>102019219 >>102019601
--Anons discuss and share terminal-based chat projects and ideas: >>102018061 >>102018212 >>102018449 >>102018224 >>102018319 >>102018484 >>102018432
--Anon fixes RAG issue by unchecking "Summarize Chat messages when sending" toggle: >>102022620 >>102022875 >>102023118 >>102023544
--Anon discusses the difficulties of creating a local model that can handle both normal and smutty content without being overly horny or dry, and how current models like Claude struggle with this: >>102012011 >>102012374 >>102012459 >>102012619 >>102014256 >>102014863
--Anon considers making alternative to SillyTavern, seeks feedback on features: >>102023701 >>102023763 >>102023775 >>102023788 >>102023833 >>102023843 >>102023928

►Recent Highlight Posts from the Previous Thread: >>102011588
>>
File: img_14.jpg (301 KB, 1360x768)
►Recent Highlights from the Previous Thread: >>102011438

(2/2)

--Magnum-123B has perspective switching issues, unlike Mistral-Large-Instruct: >>102018999 >>102019038 >>102019185 >>102019331 >>102019491
--Anon seeks help with prompt to make AI respond concisely: >>102012681 >>102012743 >>102013429
--Anon runs 8B LLM on gaming PC, discusses societal implications: >>102018513 >>102018662 >>102018592 >>102021742
--Phi-3-medium-128k struggles with adult roleplay content: >>102015287 >>102015718 >>102015826
--Meta-Llama-3.1-70B-Instruct has limitations and may not live up to expectations: >>102021345 >>102021449 >>102021515 >>102021502 >>102021639 >>102021646
--Hermes 405b model struggles with asterisk quotation mark mix-ups: >>102021297
--Anon thinks diffusion-guided LLMs are necessary to avoid hallucination and misalignment: >>102014643
--Anon discusses The Living AI Dataset and its potential to create a sentient AI model with empathy and love: >>102022143 >>102022218 >>102022241 >>102022256 >>102022310 >>102022318
--Miku (free space): >>102013020 >>102013180 >>102013618 >>102013630 >>102013793 >>102013946 >>102014401 >>102014423 >>102016872 >>102018287 >>102020209

►Recent Highlight Posts from the Previous Thread: >>102011588
>>
Jambalove
>>
>>102025568
I love Miku and I love you Anon.
>>
I'M THINKING
MIKU
>MIKU
OO EE OO
>>
Working on a new model, my friends. It's very much a work in progress. Here's a log:

https://files.catbox.moe/1tg4k2.txt

It's a little long, so feel free to skim it. This is based on llama 3.1 8B btw.

There were some rerolls and some minor edits, but I mostly kept things as is. For example, the model got into the habit of writing "Oh boy," as the starting phrase every single time, so I had to edit that out.
>>
Interesting, with proof from RULER it seems like jamba finally fixed the context issue. Hopefully we get compatibility soon.

https://www.ai21.com/blog/long-context-yoav-shoham
>>
>>102025941
>Hopefully we get compatibility soon.
We still don't have Jamba 1.0 compatibility.
>>
>>102025941
>yoav-shoham
>>
What's this about onnx on cpu? Will I be able to run Mistral Large above 0.6t/s with it?
>>
>>102025941
Fascinating how on this they don't mention llama3 at all. Phi, Mistral, Command R/+ but no mention of llama3.
>>
How can one detect degradation of Q2 vs Q8 quants?
As in, what's going to be retarded in Q2 that isn't going to be retarded in Q8?

Being a 24gb vramlet, I'm just trying to understand if it's better to use a bigger model at lower quants, or use a smaller model at q8.
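If you want to put a number on it, the usual trick is to run the same perplexity test over both quants and compare. A rough sketch, assuming you've built llama.cpp's llama-perplexity tool and have a wikitext-style test file around (the filenames here are placeholders):

import subprocess

# Placeholder filenames - point these at your own quants and test text.
models = ["model-Q2_K.gguf", "model-Q8_0.gguf"]
test_file = "wiki.test.raw"

for m in models:
    print(f"=== {m} ===")
    # llama-perplexity streams a running PPL estimate and a final value;
    # lower PPL on the same text = less quantization damage.
    subprocess.run(["./llama-perplexity", "-m", m, "-f", test_file, "-c", "2048"], check=True)

Lower perplexity on the same text means less quant damage; whether big-model-at-Q2 beats small-model-at-Q8 varies per model pair, so test your actual candidates.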
>>
>>102026020
?

https://www.ai21.com/blog/announcing-jamba-model-family

https://www.ai21.com/blog/long-context-yoav-shoham

They do on all the benchmarks
>>
What's the proper ST context template for Qwen2?
>>
>>102026107
Look at the images in the "long context" post, none show llama3
>>
>>102026132
chatml
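For reference, ChatML just wraps every turn in <|im_start|>/<|im_end|> markers, so if you want to sanity-check what ST ends up sending, the format looks roughly like this (the system text is only an example):

def chatml(messages):
    # messages: list of {"role": "system" | "user" | "assistant", "content": str}
    out = "".join(f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n" for m in messages)
    return out + "<|im_start|>assistant\n"   # leave an open assistant turn for the model to complete

print(chatml([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "hi"},
]))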
>>
>>102026171
Thank you.
>>
>>102026107
Oh, I'm retarded and hadn't noticed this wasn't related to jamba 1.5...
>June 26, 2024
>>
https://github.com/exo-explore/exo
>>
So now that the dust has settled, did jamba 1.5 save the hobby?
>>
>>102026265
Would be cheaper and almost certainly get you more t/s to just buy more RAM.
>>
After extensive use of mixtral I've come to the conclusion that it's absolute dogshit. The only advantage it has over a 70b quant of comparable size is speed.
>>
jamba large seems underwhelming for its size, I don't know why companies go all in training these behemoths
it's a massive waste to train such a large model if you haven't produced a top tier small model as a proof of concept
>>
>>102025941
Very cool, where is the gguf?
>>
>>102026278
>>102026286

Yes!
>>
I like the new Jamba mini's writing style.
>>
>>102026314
What did you run it on? Any logs?
>>
>>102026332
just azure, I don't think anything supports it yet.
>>
>>102026286
that's wild. 8 retarded 7b models smashed together in one isn't better than a 70b? who would've guessed!
>>
>>102026389
thats not how moes work
>>
>>102026401
Don't feed the troll, anon.
>>
>>102026389
Last time I read about it anons were saying it's the greatest thing since sliced bread. It does kinda feel like 7b now that i think about it.
>>
>>102026430
That was one anon being obsessed with Mixtral for some reason.
>>
>>102026430
It was very good for its time, before miqu and such.
>>
>>102026457
Any l2 70b shitmix wipes the floor with it.
>>
Jamba verdict?
>>
>>102026502
meme
>>
>>102026447
cuz he's a poor retard.
>>
>>102026502
No one can even run it yet. And the API has censored inputs.
>>
>Jamba 1.5 Large (94B active/398B total)
oh come on, i can run 120B largestral fine at q4 but they couldn't make a medium-sized model for this? it's either the cucked 54b or a ridiculous 400b?
i'll pass
>>
>>102026780
bro they don't make models with you in mind.
>>
>>102026780
Bwo? Just have 4 4090s for the active portion and 500 exabytes of ram for the unused experts?
>>
File: 1707090112465128.jpg (24 KB, 635x601)
>54B models are now considered "mini"
>>
Okay, I just tried both Jambas for translation, and... They both are dog shit. It's not even funny, Large is worse than 70B and it has 300B+ parameters.
>>
>>102026852
>for the unused experts
Unfortunately this is not how it works in practice. For real use, all experts are basically used. The active parameters thing is just about how many are active per the processing of each token, not per prompt.
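A toy sketch of top-k routing to illustrate, with made-up sizes: each token only runs through k experts, but across a whole prompt the router ends up touching nearly all of them, so every expert's weights still have to be resident.

import numpy as np

rng = np.random.default_rng(0)
n_experts, top_k, n_tokens, d_model = 16, 2, 64, 32

router = rng.normal(size=(d_model, n_experts))   # router/gating projection
tokens = rng.normal(size=(n_tokens, d_model))    # one hidden state per token position

used = set()
for x in tokens:
    logits = x @ router                      # score every expert for this token
    picked = np.argsort(logits)[-top_k:]     # keep only the top-k ("active") experts
    used.update(int(i) for i in picked)
    # only these k experts' FFN weights get multiplied for this token

print(f"{top_k} experts active per token, but "
      f"{len(used)}/{n_experts} experts were needed across a {n_tokens}-token prompt")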
>>
>>102026942
>Multilingual: In addition to English, the models support Spanish, French, Portuguese, Italian, Dutch, German, Arabic and Hebrew
You tested Japanese right? They don't support that it seems.
>>
https://rentry.org/magnum-v2-4b
>>
>>102026996
That may be the case, however, many models claim to have no support for Japanese, but most big models do somewhat well on it anyway.
>>
smedrins
>>
File: ihavelehardware.png (101 KB, 756x838)
>>102027071
the compute, bro... it's too expensive bro... donate pls
>>
>>102027162
go back
>>
>>102027199
No, you go away. I'm sick and tired of no friend-having losers like yourself trying to keep everyone at your level instead of encouraging people to connect with each other.
>>
>>102027239
projection???
>>
>>102026931
>Original mini was phi at 3.8 billion parameters
>OpenAI then names their 82 percent MMLU model mini
>Then Elon names his 85 percent MMLU model mini
>Now Jamba mini
Yikes
>>
>>102027239
This is why people give up and hand shit over to discord, since at least there's a way of dealing with trolls and shitposters.
>>
>>102027239
>Why won't anons test the models for me? I can't waste time with them!
>t. alpindale

Agreed, go back to discord, you have enough testers there already.
>>
>>102027750
I wanna test shit too, is discord open?
>>
>>102026389
>>102026286
Does Mixtral have any good finetunes?

I just use https://huggingface.co/TheBloke/dolphin-2.7-mixtral-8x7b-GGUF/tree/main
>>
File: 1718726413358418.jpg (38 KB, 570x744)
>virtually EVERY SINGLE GAME i play with the ai is either 1) ww2 natsoc larp where i join the reich and help win the war or 2) some other larp about my character joining the fascists and destroying the opposition
why can't i just do normal boring coomshit?
what's wrong with me bros?
>>
>>102027821
starling 7b
>>
>>102027828
>what's wrong with me bros?
Modern society failed to diminish your innate desire to conquer and build.
>>
>>102027828
it's because you're 16 years old
>>
>>102027828
based. sex is boring.
>>
>>102027828
do one about a russian girl castrating you
>>
In an effort to improve my cooming quality I finally sat down and started pasting a hentai game script into the window as a prefill. I made sure to use something that uses almost only dialogue with some minimalistic matter of fact descriptions for actions.

I made it like 8k tokens of prefill and then I started actually using it. What shocked me is that after I was done it felt like I had generated another 8-12k tokens, but it turns out the whole AI-generated part was only 4k tokens. Now I am starting to think that leaving the model to generate shit from the start is a terrible thing, because it will just not stop itself from describing the iridescent radiance of the particular peculiar gleam of her one iris in her eye. The second thing I noticed is that even with an 8k prefill those fuckers still want to stuff that novel-type purple prose everywhere they can, even though longer multi-sentence descriptions weren't there in the prefill.

TL;DR: all models still fundamentally suck for cooming.
>>
can you cpumaxx the jambaree? the tripfag did 8 bit llama 405b so the official 8bit quant would fit in ram with about the same size right? or is it exclusively gpu only?
>>
>>102027947
>If you don't have access to a GPU, you can also load and run Jamba 1.5 Large on a CPU. Note this will result in poor inference performance.
https://huggingface.co/ai21labs/AI21-Jamba-1.5-Large#model-features
>>
>>102028011
interesting, in theory it could be around 4x the speed of 405b, might cross the usability threshold
>>
>mini
>52B
Now that is a mini I can get behind to fuck.
>>
File: 1701307139265892.jpg (150 KB, 432x2048)
wtf I want to enjoy the schizokino of the 405B base model
>>
>>102027908
What model? Was the prefill formatted as required by the model's prompting specifications or just copy-pasted as a lore entry? I often found that simply copy-pasting a long conversation (from a novel, fiction, etc) as-is doesn't work as it intuitively should, and converting pre-made dialogues into (many) formatted turns can end up dumbing the model down significantly.
>>
>>102028098
>prefill formatted as required by the model's prompting specifications
Yes of course it was...
>>
>>102028083(me)
>check buggedcpp
>support Jamba hybrid Transformer
>may 25
>still not merged
Never mind...
>>
>>102026158
It's on the benchmark they reference though.
https://github.com/hsiehjackson/RULER
>>
>>102027071
what the fuck is this model. it's so good what the fuck?
>>
>>102028147
see >>102026211
>>
>>102026286
I've known that since it came out, but there are some people that get really angry if you say it and yell at you non-stop. Just like with nemo currently.
>>
>>102028151
It is not, you are just vram starved. If you had a... never mind. You know what? Buy a fucking ad.
>>
File: 1700823992572928.png (758 KB, 768x1024)
flux lora training is so good, and simple. what a time to be alive
>>
>>102028219
what the fuck is this perspective
>>
>>102028219
Good lord, I didn't realize how good it was at generating body horror
>>
>>102028219
hi petra
>>
File: 1693221376780302.png (42 KB, 722x360)
>>102025568
>they're marketing jamba-large as a competitor to l3.1 70b and mistral large instead of 405B in their blog
Impressive. Shame about the extra 300GB of VRAM I need to run Jamba-Large to get performance similar to theirs during inference when I want to run both at 8bpw.
>>
>>102028219
I thought people said that flux loras are impossible to make a few weeks ago
>>
>>102028259
The corpos aren't really bright.
>>
>>102028268
It was just copro damage control.
>>
>>102028268
Retards say that about every single model that comes out.
>>
>>102028268
turns out dimwits like to come to bad conclusions for clout while the smart guys figure out how to do it.
>>
>>
File: 1702318120346045.jpg (549 KB, 1664x2432)
>>102025568
hello /lmg/
>>
>>102028298
corpo* lmao
>>
>>102028311
>>
>mikufags already shitting the thread with their presence
>>
>>102028330
Nothing violent, just fixing her hair :)

>>102028312
>we posted at literally the exact same time
Woah.
>>
>mikufags pissing up the place
right on schedule
>>
>>102028259
The main thing is the context performance. At long context it should be much faster than even 70B. Vram does not matter a huge amount to corpos, inference speed does.
>>
File: .png (9 KB, 256x256)
>>
>>102028341
I forgor the image in the middle of replying to the other guy.
>>
File: 1567919777866.jpg (62 KB, 500x618)
>>102026286
Dogshit compared to what else?

It's one of the few models that's perfect for 24GB cards. Dropping to 12B always feels like a waste, and with CR's disgusting RAM usage as context grows, it's pretty much the only above-12B model worth using up until the 70Bs
>>
>>102028382
4/4 ok I'm done no more today, sorry if you didn't like it bros.
>>
>>102028259
Are we just assuming the mini will be shit?
>>
>>102028144
I built the PR and converted a model but the server implementation still has that fucking deprecated wait call coded into it.
>>
>>102028400
>sorry if you didn't like it bros.
wtf are you talking about?
more miku is ALWAYS welcome
>>
>>102028406
what local models AREN'T shit?
>>
>>102028429
Mixtral
>>
>>102028428
>>
magnum 123b is pretty good but I've given up on 123b because it's too fucking slow
magnum v2 72b is... not that good. writing kind of sucks in that typical qwenny way. why'd they train on the instruct?
>>
>>102028469
That's not miku, that's a random whore that attempts to emulate miku's looks
>>
>>102028419
Jart was right about llama.cpp. It's time to put the old dog out to pasture.

Let's support llamafile from now on and get the architecture in there first.
>>
>>102028485
classic cope
>>
>>102028491
Does anyone have the "your miku is not my miku" image for this bozo?
>>
>>102028485
It CLEARLY says "My Beloved Miku" right there faggot
>>
>>102028520
exactly, (You)r beloved miku
>>
>>102028527
How can you say miku isn't your beloved? Identify yourself so we can kick you out of our discord.
>>
File: 1705210344435309.jpg (96 KB, 828x980)
are you ready?
>>
Making Miku the mascot of this general was a mistake. Why did we do it, anyway?
>>
>>102028312
Hello Miku
>>
>>102028562
You probably joined the discord too late. That channel where we discussed raiding this place mentions that miku should be the mascot because this is what we should all aspire to be after we transition.
>>
File: 1711187160977338.png (29 KB, 1340x701)
>>102028406
Mini's direct competition is llama 3.1 8b and gemma 9b according to the same blog post by AI21. They aren't even mentioning Mistral-Nemo despite it being the better comparison even by their own cope logic, considering Jamba-Mini and Nemo both have 12b active parameters.
>>
>>102028562
Idk about "we", but I simply just use Miku as a subject for my gens because she's easy to prompt and she's a cute anime girl. I'd gen Teto and others if the model was better at getting them right but it's unfortunately not. Waiting on loras I guess.
>>
>>102028652
you could always gen some Makise Kurisu
>>
>>102028679
Flux knows her?
>>
>>102028652
Yeah she's getting easier and easier to gen too, since her presence is so prevalent in synthetic data now.
>>
File: file.png (12 KB, 288x230)
Just as it should be.
>>
>>102028723
Yeah I'm saying AGI.
>>
>>102028723
Let me guess, you need more?
>>
>>102028487
>here's your 90GB executable
>>
>>102028778
*takes the executable and pockets it*
Thank you anon, this is very convenient.
>>
File: gabagool.jpg (808 KB, 1664x2432)
>>102028312
why stop there really crank it
>>
>>102028562
petra spamming OPs and a cute anime girl was needed to unite the general.
>>
>>102028679
You can't spell Makisu Kurisu without Miku
>>
Something big is coming next week.
>>
>>102028867
I prefer the term "Maku"
>>
Is it just me or is the koboldcpp implementation of MiniCPM broken? Has anyone gotten the OAI chat completion endpoint to return the same response as in the huggingface demo? Responses are usually short and sometimes completely schizo.
>>
>>102028900
ONE MORE WEEK UNTIL PROJECT STRAWBERRY IS FINALIZED
THE FRUITS ARE ALMOST SPROUTING SEEDS AMONG THE HAMSTERS
ELEVEN HOTDOGS
TRUST THE PLAN
>>
>>102028910
Some anon a couple of days ago said the same thing and then discovered that apparently copy-pasting the image works while uploading it somehow breaks it. No idea if it's true though.
>>
>>102028904
Makusex
>>
>>102028900
>>102028922
Big, if true.
>>
>>102028927
that anon wasn't using the oai endpoint, he was using kobold lite ui
>>
>>102028927
I remember that, but he said after it worked the first time it worked even when uploading. That it might have been a cache issue. But I'm calling the oai endpoint directly, not using the ui.
>>
>>102028723
dog level intelligence achieved
>>
deepsex 405b
>>
Colossus-R-513B
>>
>>102029008
let drummer cook
>>
DeepThroat-V2
>>
Simulated cat brain
>>
Best local model for vscode continue plugin? They themselves recommend llama 3, but what the fuck do they know?
>>
>>102028562
It was supposed to be Chesh.
>>
>>102029127
Jamba 1.5 mini
>>
how do I get over the embarrassment of asking the model to have sex with me
>>
>>102029149
stop asking. start taking.
>>
>>102029149
i just say *rapes you* in the middle of a normal rp and the model takes care of the rest usually tbqh
>>
>>102029092
A loader that is just a wrapper for another loader and the only unique feature it has is that it inserts another sysprompt, telling the model to pretend it is a cat brain pretending to be a helpful assistant.
>>
File: file.png (51 KB, 804x465)
so sassy
>>
File: 1702316034809359.png (53 KB, 587x546)
Metamate open release when? They're keeping it from us.
>>
I personally feel that we need bigger models.
>>
>>102029271
>internal company docs
If anyone leaks this they will be hunted down kek.
>>
>>102029285
This, enough with all those 70bs and 120bs. We need to go back to the pre-chatgpt GPT3 doctrine of JUST MAKE IT BIGGER. 405b was a step in the right direction at least.
>>
>>102029285
>>102029337
"We" (corpos) are doing exactly that but it takes a long time to train big models. So there will be months between releases at a minimum, or closer to a year for new frontiers.
>>
>>102025568
Why do her eyes look like pussy hair moustaches?
>>
I wanna roleplay chat with my AI waifu in sillytavern. Like we are texting over a messenger or so. What LLM is best for that scenario? I have a 3080 with 10GB VRAM and my PC has 32GB ram. Right now I'm using Poppy_Porpoise-0.72-L3-8B-Q8_0-imat.gguf and it's kinda ok. Not as intelligent as characterai but at least not censored

Thanks!
>>
>>102029593
I've been enjoying Lumimaid so far.
https://huggingface.co/NeverSleep/Lumimaid-v0.2-8B
>>
>>102029593
https://huggingface.co/fblgit/UNA-TheBeagle-7b-v1
>>
>>102029593
Mini-magnum has a conversational style that probably works pretty well for that.
Try mixtral 8x7b limarp zloss too.
>>
I am so curious whether it is oldfags trolling, or there are no oldfags left and it is newfags being genuine.
>>
>>102029593
https://huggingface.co/turboderp/Mistral-Nemo-Instruct-12B-exl2/tree/5.0bpw
>>
Is LoTA better?
https://nitter.poast.org/PandaAshwinee/status/1825571610230723027
>>
Redpill me on the Yi 35B Models.

Are they good, what finetunes to use if yes? (for someone who can't run 70b models) Wanna just use them for cooming
>>
>non-Transformer model
>look inside
>transformers
>>
>>102029623
>>102029631
>>102029648
>>102029658
Thanks a lot, gonna try them!
>>
>>102029687
no
>>
>>102029691
they're always in disguise
>>
>>102029654
I can guarantee you that there is at least one oldfag remaining.
If you have any questions about how 4chan used to be, feel free to ask.
>>
>>102029654
I like to think the oldfags are not engaging and instead lurking, waiting for something truly interesting to occur.
>>
>>102029662
I thought it was a joke but it actually looks very promising, too promising in fact, either it isn't all that good or it will be revolutionary.
>>
What if I'm making a project where you could call your LLM.
Need a name for the project
>>
>>102029856
SillyVoice
>>
>>102029734
Is /loli/ an urban legend or was it actually a thing?
>>
>>102029789
This. I'm too old to engage with bait and shitposts.
>>
>>102029654
oldfag here, I can guarantee that I'm here.
>>
>>102029662
still waiting for MORA
>>
>>102029856
Slopline.
>>
>>102029789
>mikutroons spam all the time regardless if something is happening or not
Checks out.
>>
>>102029873
It was an actual thing. Loli wasn't even that rare of a thing on the internet back in the day.
But the more popular something gets, the more law-abiding it becomes,
>>
>>102029856
Shiver.
>>
>>102029856
Husky.
>>
>>102029873
Yes, that was a real thing on 4chan. Lewd drawings of small anime girls were still in the gray area back then. It's still rampant in Japan from what I heard. Can't believe it's been so long that it's become an urban legend.
>>
File: ssrlkk24py531.jpg (242 KB, 1168x1368)
>>102027162
>>102027239
>spamming the thread with dumb drama from your schizo headcanon
>>102028343
>>102028520
>>102028546
>>102028562
>>102028608
>>102029932
miku isn't going anywhere
seethe
>>
>>102030082
>unfunny reaction image
>mikutroon
Checks out.
>>
Is Jamba Strawberry?
>>
Don't listen to Miku.
>>101997677
>>
>no jamba support for llamaccp and exllama2
>>
>>102030293
Serious backends do not support meme architectures.
>>
File: media_GVmixV3WgAArcz8.jpg (134 KB, 1200x675)
NovelAI just made every other open source model obsolete.
>>
>>102030293
What is there to run?
>>
>>102030336
https://blog.novelai.net/novelai-diffusion-v1-weights-release-en-e40d11e16bd5
https://huggingface.co/NovelAI/nai-anime-v1-curated
https://huggingface.co/NovelAI/nai-anime-v1-full
https://huggingface.co/NovelAI/nai-furry-beta-v1.3
>>
>>102030336
lol, so an official release of the leaked ones?
>>
File: 1713026826658109.png (35 KB, 642x264)
>>102030336
>>102030365
This is just the model that was leaked two years ago.
>>
>>102030385
And it's still better than every other open source model.
>>
>>102030361
The new model that just came out with 256k context + better understanding at long context than any other local model of any size.
>>
>>102030336
>>102030365
I thought people were saying it was v3 that was going to be open sourced. So they were lying?
>>
>>102030405
Not many people have the ram for that, and the mini is probably too small to be any good. So, who cares?
>>
File: 1704474174252463.png (144 KB, 1206x378)
>playing a casual rpg
>inserts a random woman and starts hinting at intimacy out of nowhere
Why do they always do this? At least it gives an option to decline...
>>
>>102030469
Why did you censor the random woman's name?
>>
>>102029856
llamaphone
>>
>>102030469
This is what LLMs are for, Sam...
>>
>>102030541
Because it's Lily and I don't like seeing that name.
>>
>>102030437
The mini is 52B
>>
>>102030541
because it's a themed setting and i don't need every autist knowing what stories i like to play :^)
technically only her surname was recognisable though, so i guess i could've left the first name in, "Eve" if it makes any difference

>>102030587
i did define a general outline of the plot in the instruction though, no romances included...
just worried that if i go down the intimacy route then the next 20 paragraphs are going to be mindless slop describing graphic sex or flirting instead of continuing the story like i want it to
>>
>>102030336
>>102030385
What a joke kek, they should've done that much earlier. I guess they realized SD-based models are obsolete now because of flux.
>>
>>102030661
Only 12B parameters active, and it's only as good as Gemma 2 9B according to some redditor.
>>
>>102030661
But they're comparing its performance in their benchmarks to llama 8b, and gemma 9b. So expect it to be about that level. It's not going to be like a 52b model intelligence wise. The active parameters are only 12b and it performs on the level of an 8b model, so the context is all it has going for it.
>>
>>102030661
Why use it over Mixtral 8x7?
>>
File: file.png (3 KB, 383x40)
why does kcpp reprocess my prompt so often? i thought the smart context or whatever was supposed to prevent that?
i literally just typed out a response and sent it and this is what it does, it's not the only time either
set to an 8k context limit if that matters
>>
>>102030868
Probably have a lorebook or authors note or something that inserts into the context further up.
>>
>>102030868
try without smart context, i don't think they use context shifting with it
smart context is an old method of avoiding reprocessing by making it less frequent, by leaving half of the remaining context window empty before repeating when it fills up
context shifting will keep rolling the context window even when it's near the cap and rarely reprocess unless you're changing stuff at the beginning of the prompt
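rough sketch of the difference in terms of how many tokens get reprocessed when the window fills (the real backends operate on the KV cache rather than token lists, but the bookkeeping is the same idea):

def smart_context(tokens, limit):
    # old "smart context": when the window fills, keep only the last ~half of it
    # and reprocess that entire chunk from scratch
    if len(tokens) <= limit:
        return tokens, 0
    kept = tokens[-limit // 2:]
    return kept, len(kept)          # everything kept gets re-evaluated

def context_shift(tokens, limit, prefix_len):
    # context shifting: keep the fixed prefix (system prompt, card), slide out the
    # oldest chat tokens right after it, so only newly appended tokens need processing
    if len(tokens) <= limit:
        return tokens, 0
    overflow = len(tokens) - limit
    kept = tokens[:prefix_len] + tokens[prefix_len + overflow:]
    return kept, 0                  # nothing old is re-evaluated unless the prefix changes

# toy demo: 9000 "tokens" in an 8k window with a 500-token fixed prefix
toks = list(range(9000))
print(smart_context(toks, 8192)[1])        # ~4096 tokens reprocessed
print(context_shift(toks, 8192, 500)[1])   # 0 reprocessed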
>>
Magnum OOMs for me on Apple, others work fine. I am confused how to debug. RAM is plentiful.
llama.cpp with metal

Is there discord by the way?
>>
>>102031104
which magnum
which quant
how much RAM
>>
>>102031104
Which magnum?
>>
>>102030899
nope, no lorebooks or anything
tbdesu i've never gone far enough over the context window to warrant adding notes, nor have i really wanted to put effort into writing them for other reasons
but as far as i'm aware sillytavern SHOULDN'T be modifying earlier parts of the prompt in any way so that's why i'm confused

>>102030913
context shifting, yep, got the names mixed up
just checked and it's enabled
so not sure why it's happening still
>>
>>102031104
Set the context to something lower than the default (specified by the model). Start with -c 8192 and move up.
>>
>>102030117
That was pretty funny actually
>>
>>102031159
Log the context it sends, hard word wrap it and do a diff, see what's being changed.
>>
Think its possible to influence the LLM's writing style by vectorizing smut novels and feed it to your character's data bank in ST?
>>
>>102031259
Even a few examples can help set the tone at the start and then you have the whole chat as an example. What's the need to go that far?
>>
>>102031283

Because I use gemma and it getting it to write spicy depictions of the female body is a struggle.
>>
>>102031151
https://huggingface.co/anthracite-org/magnum-v2-12b
>>
>>102031361
I assume you aren't trying to load the .safetensors files with llama.cpp and are instead using a GGUF.
In that case, >>102031165 is probably right.
If you don't specify the context size, it'll try to load the full 128k tokens, which will take an absurd amount of memory.
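If you're going through llama-cpp-python rather than the raw CLI, the same fix looks roughly like this (the model path is a placeholder):

from llama_cpp import Llama

llm = Llama(
    model_path="magnum-v2-12b-Q8_0.gguf",  # placeholder path
    n_ctx=8192,        # explicitly cap the context so the KV cache stays small
    n_gpu_layers=-1,   # offload all layers (Metal on Apple silicon)
)

out = llm("Hello there.", max_tokens=64)
print(out["choices"][0]["text"])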
>>
>>102031404
>it'll try to load the full 128k tokens
a million actually
>"max_position_embeddings": 1024000,
https://huggingface.co/anthracite-org/magnum-v2-12b/blob/main/config.json#L14
>>
>>102031404
>I assume you aren't trying to load the .safetensors files with llama.cpp and are instead using a GGUF.
Of course he's loading a gguf with llama.cpp. What else would he load with the thing he specifically said in his post?
It's one thing debugging normies that cannot read the console output. Being so confused about how things work is a different thing.
>>
>>102031477
eh we often get people asking how to load safetensors in kobold and such
>>
>>102031456
Geez.

>>102031477
He could be trying to load the safetensors and getting a completely unrelated error, as has happened more than once in these threads.
>>
>>102031504
>>102031493
>Magnum OOMs for me on Apple, others work fine. I am confused how to debug. RAM is plentiful.
>llama.cpp with metal
He obviously capable of running other models. He cannot read console outputs, but he can at least read instructions.
>>
File: ComfyUI_00138_.jpg (1.33 MB, 1344x1728)
hey, I mainly use ai for image gen.
so I got the idea of using ai to troll people on ai generals because they keep asking for smut of the character i'm making, and i lack the creativity to impersonate her.
I want the ai to roleplay as a preppy smug brat that harshly denies any request some anon makes. do i just tell the ai to act like a brat and then feed the prompt the anon's request?
I got it up and running. so like do i tell the prompt "you are a smug preppy brat that denies every request a user makes and makes fun of them for it" or something like that ?
I hate being that newfag but here i am.
>>
>>102031636
You might want to use either a prefill or an instruction at depth zero to make sure that the brat will do its best to deny anon's request, otherwise there's a good chance that it'll forget that specific instruction real quick.
>>
>>102031636
Ask the 12 shitposting bots that roam this very general
>>
>>102031636
I bet the teto guy knows how to do it.
>>
File: 1489083716440.gif (388 KB, 230x139)
Let's play a game! This Saturday at 1 PM PT, I will do a collaborative storytelling/RP session (location TBD, maybe in the thread itself?), where I post a scenario and responses from the model in the thread, and people discuss what to do in the user chat turns, or edit previous user turns or the system prompt and start over. This is going to be both for fun and to get us (mostly) reproducible reference logs, as I'll be using greedy sampling in Mikupad and have the full log in a pastebin at the end. No editing the model's responses, we're going to use pure prompting to try and get the thing to do what we want!

The scenario is also still TBD. We're going to go for as long a context as possible until the model breaks down uncontrollably, so it should be a complex enough scenario for that. If anyone has suggestions for scenarios I'm all ears. Also, I'm planning on starting these games with Mistral Nemo at Q8 for the first session, and other models in the future, so we have reference logs available for a whole range. But I'll take suggestions for models people want. I'm only a 36 GB VRAMlet though so I'm a bit limited. I can run larger models up to ~88 GB but it'd be slower. If anyone with more VRAM would like to host any of these games themselves and run such larger models at a good speed, please do, and I will step down.

>current suggestions
>>102002238
>>
>>102031774
The scenario anon proposed but one of the 3 is a doppelganger infiltrating for some even more nefarious reason.
>>
>>102031774
complex sex with miku
>>
>>102031636
>anon speedrunning getting a ban for being an avatarfag
>>
>>102031259
People say example messages heavily influence style but in my experience that has never worked. Using author note to tell the AI to use specific words/phrases when describing x works infinitely better
>>
File: 1715794606976513.png (95 KB, 2497x1289)
>>102031212
well that's fucking weird
it DID modify my prompt somehow, in two locations
once at the very beginning just after the instruction, it inserted something i never said, "let's get started..."
and a second time near the beginning of the last response it inserted a "Narrator:" (presumably because it didn't actually finish and i chose to continue it, i guess it only got inserted after the response was properly finished)
that first one is weird though, what could be going on there?
>>
>>102031774
This bunker scenario could be fun if anons pitch the world events as things go on.
>>
>>102031636
You should ask in a general with people that know how to write, not here.
>>>/vg/491349658
>>
File: example.jpg (41 KB, 1259x270)
>>102031636
how is the tone?
do you want more or less kaomojis
>>
>>>102023701

I completely gave up on the idea, but I was thinking about using an IRC server and a modified version of HexChat, given how similar the SillyTavern interface already is, and how it would support multiple users straight out of the box. It's the sort of idea that people would tell me to kys for though, for no other reason than because it probably wouldn't be written in React or whatever the flavour of the month language is.
>>
>>102031966
Multiplayer llm? Sounds interesting, actually.
>because it probably wouldn't be written in React or whatever the flavour of the month language is
Honestly, who cares what the technologically inept would rather use?
I'm using Wails + Svelte for the thing I'm making because it just works.
>>
>>102031953
kek. No like a really mean bitch, harsh. she has to hurt my feelings.

oh no.. this is.. i'm scared.
>>
>>102031828

Can I somehow trigger a recall in the long term memory (file entries in the character database) in the Author’s note? Like, write in the style of “spicy_stories by x author?”
>>
>>102031804
I like that.

>>102031852
You mean during the game or while we're making suggestions here?

Actually for this general scenario to work I'm guessing we'd have to flesh out the characters a bit at least. Asking Nemo just to come up with all of it on the fly and having those instructions in context sounds like maybe too challenging of a task, one that could confuse it. Then again we could just try it out I guess, and then do something simpler if we find out Nemo can't handle it.
>>
>>102031636
Here is some very low effort gen with Mistral Large q8_0.
The prompt is highlighted.
>>
SLOP IS SOVL and I'm tired of pretending it's not
>>
>>102032099
i see. im kinda going the tavern route and inputting the scenario there.
I like where this is going
>>
>>102032079
Sorry anon, not sure what you are talking about. If you are using ST you might be able to reference lorebooks but I dunno
>>
>>102032083
>You mean during the game or while we're making suggestions here?
During the game. Something new gets pitched and the characters need to struggle through it. Basically the user takes over as a narrator. Should be easy for you to handle as well.
Also like this other anon's idea with a traitor.
>>
>you reply, your voice [X]
>she says, her voice [X]
>you say, your voice [X]
>[X] says, his voice [Y]
>he asks, his voice [X]
STOOOOOOOOOOOOOOOOOOOOP
>>
>>102032210
Just ahh ahh mistress stop X, don't be shy.
>>
>>102032001
>Implying this was never done before
Did people already forget about agnai?
>>
>>102032270
Okay, so?
Are we just going to stop developing new things entirely because they've already been done before in some aspect?
Get the fuck out of here.
>>
>>102032282
I never said that schizo, get your meds.
>>
File: giant fuckign kettles.png (1.82 MB, 1280x1477)
>>102032301
Welcome to hell.
>>
>>102032192

I want to have a file depicting female anatomy in a sexy way. Thinking of using ST’s RAG/Vector Database feature to influence the output of the model, in this case, Gemma 27b. Since you mentioned using Author’s Notes as a power tool, I am wondering if I can chain that together with the entries in the Vector Database so I don’t have to prompt specific keywords and shit to make it write in a style that I want depending on the context.
>>
>>102032210
I (>>102027908) checked my manual prefill of pain and there were no voices. No eye sparks/gleams/explosions either. While it tries to put in the harlequin novel shit all the time if I don't let it in, most of it is gone. I guess it really is as simple as all of this slop being tied to novels for biowhores, and novels for biowhores sounding like the closest thing to the cooming material you request from the LLM. I hate women.
>>
>>102032195
Oh I see how that'd work. It wasn't clear to me whether the scenario was for us to be one of the three characters or some kind of co-writer.
>>
>>102032337
>eye gleams
geez that's another one that really gets on my nerves
I'M SO SICK OF SLOP
not even doing erotica, just normal CYOA story rpgs
>>
>>102032368
What is your prompt/card/whatever?
>>
>>102030688
They did the same thing with their first writing model: they open sourced the weights, but at least also the training config, after no one gave a fuck and it had been surpassed by other models. This is even more useless anyway because this is just the weights. The code or the configuration for training the models would've been the interesting part and there was no reason not to release it. I don't even know why they have an open source page, seriously, if they are going to be this behind the 8 ball on open sourcing. Why pretend to care when it's fuckall useless?
>>
>>102032210
Using nemo or something related to it I take it?
>>
>>102032581
>related
Yes they both end with .gguf
>>
File: 1710095300375024.png (131 KB, 648x445)
>>102032453
largestral + picrel, a custom generic "Narrator" i made
it's a somewhat old prompt that i've modified over time to cater to different situations, for instance it would frequently try to make my character express regret over anything it deemed "immoral" so i added a clause to try and work around that
>>
Any anon out there who can help me with good configurations and templates for Silly Tavern using Magnum v2.5? I'm trying to get the model to function correctly, but I'm having trouble getting it to follow the instructions in the text format or limit its output to only five lines.
>>
>>102032660
>it would frequently try to make my character express regret over anything it deemed "immoral" so i added a clause to try and work around that
lol, did that work? I feel like most models are RLHF-deep-fried to be like this.
>>
>>102028562
I got really bored in the summer of 2023 and forced it
>>
>>102032726
it actually did from my limited testing
with that clause added in, and sometimes by explicitly mentioning "character is evil" or similar in the generation guidelines it doesn't complain as much as it used to, if at all
i recall it used to bug me a lot but i don't think it's happened again recently
occasionally the 4 options it provides will lean more towards the "moral" side but in those cases i can always type an action manually without any issues
>>
File: 1594415927049.jpg (16 KB, 295x342)
New MoEs when?
>>
>>102032821
Phi-3.5 released 2 days ago, does that count?
Jamba is a MoE too, right?
>>
>>102032833
Yes and the answer is 2 more weeks at least because no goofs.
>>
>>102032821
We just got two today. And 2 days ago, so using AI researcher logic you'll get 1.5 more in 2 days.
>>
>>102031165
>>102031404
>>102031559

Thank you, setting the context worked. Now the relationship with how much memory gets eaten is clearer; before, I expected it to be a more straightforward function of the file size.

This was my first experience getting help from distributed anon intelligence here, appreciate it ^^

Idk what you mean by reading the console, there is a bunch of stats and that line:
error: Insufficient Memory (00000008:kIOGPUCommandBufferCallbackErrorOutOfMemory)
Without knowing about ctx, I could only see the high requested RAM in the stats
>>
File: 1678741433645086.png (10 KB, 259x288)
>>102032833
Oh, I've been under a rock for a month and I'll admit they slipped past me purely because I was only looking for "# x #b" name formats, whoops.
>>
>>102029149
why would you want to get over it?
>>
>>102032821
Mistral abandoned MoEs and Jamba straight up admitted that MoEs only compete with models of a similar active parameter count. Enjoy your 400B model that trades blows with 70b.
>>
>>102032991
cpufags still come out ahead tho due to more t/s
>>
File: ctx.png (8 KB, 680x156)
>>102032845
This is what i get
>buffer size 167772160032
>failed to allocate buffer for kv cache
>llama_kv_cache_init() failed for self-attention cache
Those 167GB sound like a lot. Different backend, but still. Don't read just the last line. Not only should you read it when it fails, you should read it when it succeeds to have a point of comparison.
Glad you got it working.
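That 167GB is about what you'd expect for an fp16 KV cache at the model's full advertised context. Back-of-the-envelope check, assuming Nemo-style attention dims (40 layers, 8 KV heads, head dim 128) and the 1,024,000 max_position_embeddings quoted earlier in the thread:

# KV cache bytes = 2 (K and V) * layers * context * kv_heads * head_dim * bytes per element
n_layers, n_kv_heads, head_dim = 40, 8, 128   # Mistral-Nemo-style config (assumption)
n_ctx = 1_024_000                             # max_position_embeddings from the config
bytes_per_elem = 2                            # fp16 cache

kv_bytes = 2 * n_layers * n_ctx * n_kv_heads * head_dim * bytes_per_elem
print(f"{kv_bytes:,} bytes = {kv_bytes / 1e9:.1f} GB")
# -> 167,772,160,000 bytes, ~167.8 GB, which lines up with the failed allocation above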
>>
>>102033018
I don't think it'll be faster than a 70b even if you did have the 400GB+ ram.
>>
>>102033113
It'll run like a dense 96B would considering it has that many active parameters.
>>
>>102033208
Exactly, which is slower than a 70b would run. So it's pointless.
>>
>>102032991
>MoEs only compete with models of a similar active parameter count
WizardLM and Deepseek V2 both proved that obviously wrong. Seems more like a Jamba issue if anything. Or it might not even be the architecture but just this specific lab's data quality for all we know.
>>
>>102033208
>>102033272
They claim it runs faster and suffers less of a speed drop than pure transformers in high context situations. I'd be curious to see head to head comparisons if it ever gets implemented in llama.cpp.
>>
File: 1721614998713526.png (77 KB, 959x713)
>>102033316
They also claim llama 3.1 70b is slower than 405b or mistral large.
>>
>>102033397
Obviously 70b and 405b were accidentally swapped in that chart. You can tell since they drop it out after 64k because they presumably couldn't fit it with full ctx on their setup.
>>
I'm curious about Jamba-Large. Maybe the size and the inferior benchmarks will add up and create something that has the soul we seek
>>
>>102025568
it's never been more over
>>
Anyone try Chronos Gold 12B? It seems pretty good at first use. Small model that has some kick...
https://huggingface.co/elinas/Chronos-Gold-12B-1.0
>>
AI21 and making a model that's somehow stupid despite being fuckhuge, name a better duo

I think this is the third time they've done that? How do they keep getting funding
>>
>>102033555
I started downloading bartowski's Q8 quant of this a few minutes ago, still waiting for it to finish
>>
>>102033565
>How do they keep getting funding
check their early life
>>
Anyone found a reliable way to cut down dirty talking? I don't want the bot to constantly be like "Hmm baby I like how you feel inside me" shit like I'm in a porno.
>>
>>102033579
>>102033555
Samefag. Buy an ad. No, seriously.
>>
>>102033565
They're the only ones with the balls to train models that aren't purely transformers. If nothing else, their new models have proven that big Mamba + Transformer hybrids deliver what they promise in terms of context and prompt-processing speeds while also performing decently, even if it's not cutting edge.
There's also a really good jump in performance between JambaV1 and Jamba1.5-Mini despite being comparable in size so the chances are good that the performance of the architecture can be increased even further.
>>
>>102033713
take your meds schizo faggot
stop shitting up the thread with your false shill accusations, I haven't even tried the fucking model yet
>>
>>102033704
gagging them helps if its a smart enough model to know gagged people can't talk
>>
>>102033704
If one would remove all the dirty talk and all the purple prose... what would be left?
>>
>>102033735
These are pretty expensive proofs-of-concept. They really need to focus more on the small end to iterate and refine before blowing their load on nearly half a trillion parameters
>>
>>102033765
*plap* *plap* *plap* *plap* *plap* *plap* GET BULLIED! GET BULLIED! GET BULLIED!
>>
what actually is mamba
>>
File: file.png (1.53 MB, 1897x1795)
>>102033827
>>
>>102033775
It's fine, as long as they do something that's marketable the investor money's going to keep coming. Remember how Mistral started with $137 million months before they even had a single model out.
>>
is nvidia gonna minify mistral large and give us the sota for 2024?
>>
>>102033827
rwkv
>>
>>102027828
what you want isn't coom
>>
File: file.png (59 KB, 147x327)
>>102033765
The problem is the models don't know good dirty talk. If they were panting or gurgling or choking or saying they'll end up pregnant or talking about their titpussy or whatever the fuck, then sure, that's hot. But they all talk like thots in shitty gringo porn movies.
>>
Brave's stable release channel finally got local model support going for their in-browser LLM integration. I hooked it up to Llama.cpp and it just werked. I think it's kind of neat. It has several functions you can do after highlighting text on a page and right clicking. There are some limitations, but generally this is still a pretty cool feature. Damn, I don't want to switch my main browser. Are there any extensions like this for Firefox?
>>
>>102033853
Yeah, get ready for Mistral Large 4B
>>
>>102033892
>Are there any extensions like this for Firefox?
You can always make your own.
>>
>>102033892
>Are there any extensions like this for Firefox?
You can always ask the AI to make one.
>>
>>102033579
>>102033555 (You)
> Samefag. Buy an ad. No, seriously.
The fuck you on about? It's a model I was asking about, dumbass motherfucker. I know you haven't even tried it, looking like an absolute fool. God damn lmg has gone to shit with newfags.
>>
The new Jambas are a huge deal if you need a decent model to chew through huge context lengths very quickly. No idea what the applications of this are but there's surely something.
>>
>>102033279
Could very well just be undertrained in tokens or maybe they payed the price for supposedly having better effective context than everyone else
>>
>>102033492
Seems sloppy, and how do they test the speed of closed models on the same hardware? I don't trust their data.
>>
>>102033892
>I hooked it up to Llama.cpp and it just werked
Can you share how you did it? I'm retarded.
Also what model would be good, mixtral?
>>
>>102034120
Why would they have to test them on the same hardware? It's fine to test cloudshit as-is via their API since you'll never be able to run it on faster hardware anyway.
>>
jamba on llama.cpp please...........
>>
>>102034203
I just updated Brave and adjusted Leo's settings according to the info in the question mark bubbles. What are you having an issue with?
>>
So as a retard just using koboldcpp in instruct mode to fap, is there a source for lewd loras? Is that even a model-agnostic thing?
>>
>>102028562
This, it attracted some AGP atrocities. The unnamed rule "never make a general with an anime OP if you want it to be high quality and calm most of the time" exists for a reason.
>>
>>102034342
I haven't used llama-server before so I don't know how to connect it with brave.
I already tried but failed
>>
>browsing loras
>see https://civitai.com/models/118398
>think that this could make some funny images where infinite Migus are surrounding the viewer
>try it out
>this is the first thing that plops out of the machine
>>
>>102011438#p102018061

Made a basic ass python script to do this. I only use koboldcpp so that's what it calls.
rentry dot co/vhqaewth
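For anyone who doesn't want to click the rentry, the core of a script like that is just a loop around koboldcpp's KoboldAI-compatible API. A minimal sketch, assuming koboldcpp is running on its default port 5001 (not the actual script from the rentry, just the general shape):

import requests

API = "http://localhost:5001/api/v1/generate"   # koboldcpp's default endpoint
history = ""

while True:
    user = input("You: ")
    history += f"\nUser: {user}\nAssistant:"
    r = requests.post(API, json={
        "prompt": history,
        "max_length": 200,
        "temperature": 0.7,
        "stop_sequence": ["\nUser:"],   # stop before the model writes your next turn
    })
    reply = r.json()["results"][0]["text"]
    history += reply
    print("Bot:" + reply.strip())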
>>
>>102034576
In that case there's probably some learning you should do about just getting Llama.cpp set up with something in general. Have you tried a different backend? I'd guess Brave works with most just fine.
>>
>>102034790
>nuts.wad
>>
Oki so here's a more normal happi version.
>>
I got hit with the "ministrations, shivers, audible, ...for now" combo in the same message
I need a break after this
>>
File: 1697471409105670.png (8 KB, 423x24)
wtf how did EA get into my story?
>>
Why would anyone use koboldcpp (rebranded llama.cpp with bloat, a shitty UI, and unaudited diffs from upstream) instead of llama.cpp? Is /g/ really so dumb that it can't compile a C++ program? Or is it astroturfing and nobody actually uses that thing...?
>>
>>102035262
https://github.com/ggerganov/llama.cpp/blob/master/docs/build.md
>>
>>102035262
>Why would anyone use koboldcpp
>>102034358
>So as a retard
>>
>>102035262
Compiling koboldcpp is the same as llama.cpp, so if you can do one you can do the other.
>>
@102035262
because it's easier and more convenient and i don't want to have to compile shit
>>
>>102035262
Grooming from the Discord.
>>
File: file.png (19 KB, 531x282)
>>102034873
pretty cool
>>
102035262
I don't want to compile C or C++ because I use windows most of the time, and C compilation fucking suuuucks in a windows environment, there's always some shit broken

it usually just werks under linux, but in windows it's a nightmare and I don't often want to boot into my linux partition
>>
File: print.png (306 KB, 950x653)
>>102035349
This, 90% of the koboldcpp discussion is the guy himself. You're probably replying to him
>>
File: Untitled.png (602 KB, 1043x2524)
Jamba-1.5: Hybrid Transformer-Mamba Models at Scale
https://arxiv.org/abs/2408.12570
>We present Jamba-1.5, new instruction-tuned large language models based on our Jamba architecture. Jamba is a hybrid Transformer-Mamba mixture of experts architecture, providing high throughput and low memory usage across context lengths, while retaining the same or better quality as Transformer models. We release two model sizes: Jamba-1.5-Large, with 94B active parameters, and Jamba-1.5-Mini, with 12B active parameters. Both models are fine-tuned for a variety of conversational and instruction-following capabilities, and have an effective context length of 256K tokens, the largest amongst open-weight models. To support cost-effective inference, we introduce ExpertsInt8, a novel quantization technique that allows fitting Jamba-1.5-Large on a machine with 8 80GB GPUs when processing 256K-token contexts without loss of quality. When evaluated on a battery of academic and chatbot benchmarks, Jamba-1.5 models achieve excellent results while providing high throughput and outperforming other open-weight models on long-context benchmarks.
https://huggingface.co/ai21labs
https://github.com/vllm-project/vllm/pull/7415
merged code for their new quant method
jamba 1.5 paper
>>
>>102035262
It's a quick onramp to see if this seems interesting enough to justify more effort. My path was koboldcpp -> ooba -> running sillytavern and connecting to llama.cpp or TabbyAPI (occasional mikupad and ooba use for story writing, occasional use of llama-cli for batch jobs).
>>
Weight Scope Alignment: A Frustratingly Easy Method for Model Merging
https://arxiv.org/abs/2408.12237
>Merging models becomes a fundamental procedure in some applications that consider model efficiency and robustness. The training randomness or Non-I.I.D. data poses a huge challenge for averaging-based model fusion. Previous research efforts focus on element-wise regularization or neural permutations to enhance model averaging while overlooking weight scope variations among models, which can significantly affect merging effectiveness. In this paper, we reveal variations in weight scope under different training conditions, shedding light on its influence on model merging. Fortunately, the parameters in each layer basically follow the Gaussian distribution, which inspires a novel and simple regularization approach named Weight Scope Alignment (WSA). It contains two key components: 1) leveraging a target weight scope to guide the model training process for ensuring weight scope matching in the subsequent model merging. 2) fusing the weight scope of two or more models into a unified one for multi-stage model fusion. We extend the WSA regularization to two different scenarios, including Mode Connectivity and Federated Learning. Abundant experimental studies validate the effectiveness of our approach.
big if true. they kind of muddled it by throwing in federated learning stuff and they used retnet models to test with
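For context, the "averaging-based model fusion" baseline the paper builds on is just element-wise weight averaging, something like the sketch below (checkpoint names are placeholders); WSA's contribution is regularizing training so the two sets of weights stay in compatible ranges before you do this.

import torch

# Placeholder checkpoints: two finetunes of the same base architecture.
a = torch.load("model_a.pt", map_location="cpu")
b = torch.load("model_b.pt", map_location="cpu")

# Plain element-wise weight averaging - the baseline that weight-scope
# mismatches are said to break, and that WSA is meant to make reliable.
merged = {k: (a[k] + b[k]) / 2 for k in a}
torch.save(merged, "merged.pt")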
>>
>>102035477
ooba is even worse, it's slow for some reason too.
>>
>>102035519
Ooba being super slow is what made me eventually move off it.
>>
What parameters you normally use when executing your llama-server instance
>>
>>102035549
Yeah so the downgrade in the middle makes no sense.
>>
File: 1700953427978367.png (380 KB, 512x620)
hello my /lmg/brudis
>>
>>102035728
I don't think he cares about losers on /g/, but he has been flooding /pol/ with bot posts and comments since 2020 or so. And he succeeded, pol is so low quality now that it's dead.
>>
>>102035728
hi petra
>>
>>102035776
i think her name is grimes lad
>>
>>102035671
The first time I wanted to use a non GGUF model I installed ooba and I found I liked the interface a lot more and ended up using it for everything. Because I was trying new models I didn't realize at first that ooba was slower since I hadn't used the same model in both kobold and ooba.
>>
>>
>>102036024
after the 3rd impact with miku
>>
>>102036066


