/g/ - Technology


Thread archived.
You cannot reply anymore.




/lmg/ - a general dedicated to the discussion and development of local language models.

Blue Monday Edition

Previous threads: >>102513868 & >>102505481

►News
>(09/18) Qwen 2.5 released, trained on 18 trillion token dataset: https://qwenlm.github.io/blog/qwen2.5/
>(09/18) Llama 8B quantized to b1.58 through finetuning: https://hf.co/blog/1_58_llm_extreme_quantization
>(09/17) Mistral releases new 22B with 128k context and function calling: https://mistral.ai/news/september-24-release/
>(09/12) DataGemma with DataCommons retrieval: https://blog.google/technology/ai/google-datagemma-ai-llm
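For context on the b1.58 item: "1.58-bit" means every weight is rounded to one of {-1, 0, +1} (log2 3 ≈ 1.58 bits) with a per-tensor scale. A toy sketch of the absmean rounding described in the BitNet b1.58 paper, purely illustrative, not the finetuning recipe from the link:

```javascript
// Toy b1.58 ternary quantization via absmean rounding: scale = mean(|w|),
// each weight becomes round(w / scale) clamped to [-1, 1]. The dequantized
// approximation of w[i] is ternary[i] * scale. Assumes a non-empty, non-zero
// weight vector; real implementations do this per-tensor on the actual matrices.
function absmeanQuantize(weights) {
  const scale = weights.reduce((s, w) => s + Math.abs(w), 0) / weights.length;
  const ternary = weights.map(w => Math.max(-1, Math.min(1, Math.round(w / scale))));
  return { ternary, scale };
}

const q = absmeanQuantize([0.9, -0.05, 0.4, -1.2]);
```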

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Programming: https://hf.co/spaces/mike-ravkine/can-ai-code-results

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
>>
File: recap-102513868.jpg (3.49 MB, 1809x8389)
►Recent Highlights from the Previous Thread: >>102513868

--Recapbot test results and script shared:
>102520238 >102520362 >102520571
--NovelAI's Llama 3 Erato model announced, but users are skeptical:
>102522855 >102522932 >102522963 >102522970 >102523027 >102523094 >102523226 >102523349 >102523414 >102523417 >102523045 >102523111 >102523316
--NAI's leaked model receives mixed opinions on quality:
>102520266 >102520377 >102520594 >102521001 >102521037 >102522654
--Model comparison for erotic story generation, Mistral Nemo 12B ranked as the best:
>102517308 >102517339 >102521082 >102517629 >102521100 >102517773 >102517913 >102518006 >102518255 >102518273 >102518282 >102518471 >102518341 >102518409 >102518436 >102517919 >102518089
--Mistral Small's intelligence and capabilities impress Anons, despite some drawbacks:
>102517583 >102517676 >102517912 >102518847 >102519167 >102519219 >102519735
--MMMLU dataset is a testing dataset with translations in 14 languages:
>102523230 >102523753 >102523289
--Flux finetunes exist, e.g. Hyper 8-step tunes like Flux Unchained:
>102522697 >102522866
--LLMs have been fine-tuned to play chess, with some success:
>102519361 >102519407 >102519518 >102519572 >102519696 >102520199 >102520989
--Anon proposes making llama.cpp instances fight each other, others share experiences with similar experiments:
>102515242 >102515294 >102515641 >102515693
--Anon proposes hostnamectl test to evaluate technical models:
>102520972
--Anon asks about uploading PDFs of RPG lore books to OoBaBooga for solo roleplaying:
>102514495 >102514506 >102514541 >102514563 >102514576 >102514589 >102514619 >102514906 >102520446 >102520468
--A6000 likely faster than 2x 3090's due to memory bandwidth:
>102517852 >102518086
--Miku (free space): >>102514808 >>102515242 >>102517712 >102518950 >>102519640 >>102520739 >>102522866 >>102519843

►Recent Highlight Posts from the Previous Thread: >>102513911
>>
File: recap-102513868-dark.jpg (3.33 MB, 1810x8389)
>>102524347
>>
It hasn't started
>>
>>102524347
it's just not the same anymore. we had a good run bros but it's time to move on.
>>
File: 43 Days Until November 5.png (1.67 MB, 1704x960)
>>
>>102524385
Actually, it started a long, long time ago, in an age as old as time.
>>
File: uLkcYkHTOR.png (18 KB, 159x47)
>>102524396
man
>>
>>102524347
>>102524392
so what happens if you try to add too many backlinks? your post gets blocked?
>>
>>102524409
What?
>>
>>102524409
she's doing her best
>>
>>102524347
>>102524392
actually I think the screenshot is cool. How did you programmatically turn the posts into images and stitch them together?
>>
>>102524396
Breaking into the cinema with Miku
>>
>>102524347
I like the screenshot, easier to follow than the old links in some ways.
>>
>so what do you say, [user], ready to [action]?
>>
I forgot 99% of /lmg/ are ESL zoomer phoneposters who run 7Bs on old gaming laptops and don't have 4chan-x installed so they have no idea what it was like before. In that case, we should improve the screenshot by adding a subway surfers gif to the bottom.
>>
>>102524464
>no. *rapes you*
quick cure for lots of slop, try it out
>>
>>102524464
I have been using LLMs for """creative writing""" for 2 years, including GPT 3.5 and 4 and Claude 1 and 2, and I've never seen this or "I don't bite." You must be having really gay roleplays if this comes up.
>>
>>102524489
But I don't want to do nsfw anymore...
>>
Recap anon's suffering brings a smile to my face
>>
>>102524347
useless spam. find a different use for your bot
>>
>>102524513
>really gay roleplays
I think those phrases come up if you write passively and make the AI take the initiative
>>
New NAI model verdict?
>>
>>102524568
new SOTA
>>
>>102524568
dead on arrival
>>
>>102524491
Really? I feel like language models are way simpler than image models. I still don't have a good image generation set up.
>>
>>102524568
who?
>>
why doesn't someone write a browser script to turn the recap quotes into quote links? can't be that difficult, just use regex to find the quote links and add another >? I'm not doing it because I hate all of you.
>>
>>102524553
Same thing.
>>
>>102524595
Here's a prompt that would probably work; I'm too lazy to ask chatgpt myself.
Please write a bookmarklet for me that iterates over the DOM and does a regex substitution replacing >([0-9]{9}) with >>\1.
Thanks.
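For what it's worth, the substitution itself is one regex; a minimal sketch of the core (assuming 9-digit post IDs as in the prompt above, untested against 4chan's real markup):

```javascript
// Upgrade bare recap quotes (">102520238") to proper quotelinks (">>102520238").
// The negative lookbehind skips quotes that are already double-arrowed, so
// running it twice is harmless.
function fixRecapQuotes(text) {
  return text.replace(/(?<!>)>(\d{9})\b/g, '>>$1');
}

// In a bookmarklet you'd apply it to each quote span, roughly:
// document.querySelectorAll('span.quote').forEach(s => {
//   s.textContent = fixRecapQuotes(s.textContent);
// });
```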
>>
>>102524638
Someone else plug that into ChatGPT for me please.
>>
>>102524513
i see "i don't bite... much" often because i roleplay as a nervous shy man with rapey female characters
>>
>>102524568
The dust is still in the air. To me it feels like a rough "base model" of the entirety of danbooru. Underbaked, but can gen everyone doing anything, in many artist styles. Maybe a finished version will be more polished, or could be great after finetuning.
>>
>>102524688
>roleplay
it's not roleplay if it's 1:1 self-insert, anon
>>
>>102524700 (me)
Oh whoops, I still had that image gen model in my head.
>>
Flux > NAI SDXL
>>
>>102524732
i'm sure >>>/g/sdg/ would appreciate your controversial hot take
>>
>>102524347
anyone got a script to fix backlinks on my end at least?
>>
https://huggingface.co/nvidia/Llama-3_1-Nemotron-51B-Instruct
Nice
>>
>>102524748
I'm not a retard so I don't spend half my day in that slop thread like you. Do they agree or disagree?
>>
>>102524732
>Flux > NAI SDXL
janky first-gen SD slopmerges are better than illustrious 0.1
>>
File: turboassistant_000.png (694 KB, 1923x1080)
>>102524646
>>
>>102524732
>>102524769
>/lmg/ - a general dedicated to the discussion and development of local language models.
>>
File: not miku.png (799 KB, 1280x960)
>>102524700
i tried it (illustrious xl) a few hours ago and it didn't know any of the mildly obscure characters i wanted it to make.
>>
>>102524800
Oh I see, you're just being a sperg like everyone else who frequents sdg.
>>
>>102524801
are those characters on danbooru though?
>>
>>102524797
k now test it
>>
>>102524845
I already closed the window.
>>
>>102524688
you might want to use this as a safe opportunity to play with natural male/female power dynamics instead of retreating into the comfortable backwards-world you've adopted irl.
It might give you nice feelings you didn't know existed, and a bridge into the 99% of humanity that is actually out there and is also lonely and frustrated
>>
>>102524761
Interesting. So a very selective (calibrated?) pruning + knowledge distillation.
>>
>>102524859
Yeah I feel like I got better at talking to women after violently raping a few chat bots.
>>
>>102524761
>8k context
y tho?
>>
>>102524875
>Yeah I feel like I got better at talking to women after violently raping a few chat bots.
I know, right?
>>
>>102524876
God I fucking hate this useless toy models with no context.
>>
>>102524890
LLM rape fantasy therapy is really underrated.
>>
>>102524479
You could just say mikufags.
>>
>>102524479
>zoomers using laptops and not just termux on a phone
>>
>>102524918
Miku is 4chan core.
>>
>>102524709
Should we call it cybersex?
>>
File: 1598967465529.png (224 KB, 521x937)
>>102524875
Same. It's pretty uncanny to observe, but easy to see why it works.
>>
>>102524969
I honestly felt kind of mentally "numb" for a few days after the first time I tried it. I don't think it's good for your dopamine receptors.
>>
>>102524926
The guy you're replying to is a well known schizo who projects his vramlet-hood onto miku posters kek
>>
File: illustrious-miku.png (1.17 MB, 1200x848)
>>102524926
>>
>>102524992
post specs
>>
>>102524761
Is it slopped?
>>
File: file.png (144 KB, 1202x208)
>>102525008
>>102524862
>Is it slopped?
On the censored part it looks like it's better than the 3.1
>>
>>102525022
I understand the need for alignment in corporate applications, but it would be nice if they released unaligned base models; there are a lot of analytics and other applications where it just gets in the way.
>>
File: Miku_love.png (177 KB, 1941x564)
>>102525000
>>
File: file.png (63 KB, 158x166)
>>102525041
>it would be nice if they release unaligned base models because there are a lot of analytics and other applications where it just gets in the way.
those days are over anon, they're too "dangerous" for the poor goys that we are
>>
>>102525042
only the 4090 is real, the other two are poorly faked. even with its significantly lower power draw the A6000 shouldn't be that much cooler, and neither of them should be above 40C at idle. nice try, though. more proof mikuposters are subhuman.
>>
>>102525054
If I don't get one I'll take one of these open data sets and train one that's aligned to hate only jews.
>>
File: ComfyUI_00794_.png (1.07 MB, 1024x1024)
>>102525071
Cope
>>
File: mikupeakcomfy4chan.png (1.1 MB, 800x1248)
>>102525071
>mikuposters are subhuman
miku is peak comfy 4chan
>>
>>102525197
I want to fuck the anime girl.
>>
>>102525201
She's not anime
>>
ever notice how mikuposters always age her down or post her as a chibi? really makes you think.
>>
https://publish.obsidian.md/felafax/pages/Tune+Llama3+405B+on+AMD+MI300x+(our+journey)
any obsidianfags here? Hook us up with a de-slopped 405b tune plz
>>
>>102524339
Can I use one of these as a tutor for a few subjects? Does it always have to be a rape machine?
>>
>>102525242
>Does it always have to be a rape machine?
yes
>Can I use one of these as a tutor for a few subjects?
also yes
>>
File: 1696403392188213.png (34 KB, 399x186)
>>102524638
th-thanks llama-405b...
>>
>>102525242
It can consent you know
>>
>>102524347
Can't we make a userscript to treat a different symbol like >~ as a quote? Everyone in this thread should already have the basic knowledge of installing userscripts.
>>
>>102525270
Lol
>>
>>102525273
I'd bet your problem is that it either does the linking server-side or, if it's client-side, only does it once after the post is loaded.
So maybe if someone looks for that function and calls it again from the bookmarklet, it will fix up your links correctly.
>>
>>102525054

It's so over. Safety and math just keep advancing by leaps and bounds, while language and writing stagnate and more "useless" writing data gets thrown out with every iteration.
>>
>>102524797
>mystified
Yes.
>>
>>102525323
I used to have an awesome module collection. I hate that I didn't back it up and have to go to YouTube now.
>>
File: miku2b.png (966 KB, 800x1248)
>>102525212
she's also holographic, eternally 16, and perfectly adaptable to any and all themes
>>
File: 1611290247007.jpg (454 KB, 954x954)
>>102524732
I'm still using sd 1.5
>>
>>102525197
>comfy
kill yourself you dumb niggerzoomer
>>
>>102525299
>or if it's client side just does it once after the post is loaded.
Seems to be the case
>https://s.4cdn.org/js/extension.min.1175.js

Parser.parseBacklinks = function (e, t) {
var a,
i,
n,
o,
r,
s,
d,
l,
c;
if (
n = document.getElementById('m' + e).getElementsByClassName('quotelink')
) for (o = {}, a = 0; i = n[a]; ++a) (r = i.getAttribute('href').split('#p')) [1] &&
(
r[1] == t &&
(i.textContent += ' (OP)'),
(s = document.getElementById('pi' + r[1])) ? o[r[1]] ||
(
o[r[1]] = !0,
d = document.createElement('span'),
c = Main.tid ? '#p' + e : 'thread/' + t + '#p' + e,
Main.hasMobileLayout ? d.innerHTML = '<a href="' + c + '" class="quotelink">&gt;&gt;' + e + '</a><a href="' + c + '" class="quoteLink"> #</a> ' : d.innerHTML = '<a href="' + c + '" class="quotelink">&gt;&gt;' + e + '</a> ',
(l = document.getElementById('bl_' + r[1])) ||
(
(l = document.createElement('div')).id = 'bl_' + r[1],
l.className = 'backlink',
Main.hasMobileLayout &&
(
l.className = 'backlink mobile',
s = document.getElementById('p' + r[1])
),
s.appendChild(l)
),
l.appendChild(d)
) : Main.tid &&
'>' != i.textContent.charAt(2) &&
(i.textContent += ' ')
)
},
>>
>>102525201
He's not girl
>>
>>102525393
I had chatgpt unminify it and I think we might need the consuming function actually.
>>
A reminder to report all NAIshills.
>>
>>102525253
is there any reason i shouldn't use lmstudio?
>>
File: mpp.png (6 KB, 416x126)
>>102525334
I had a hell of a collection, too, till a drive disaster happened. Been casually rebuilding over time.

YouTube, I mean, yeah, the algorithm is on your side, but also, ModPlug Player.
This green and gray bastard is kino as fuck. Probably not perfectly accurate for the purists, but I put on the extra effects anyway to give my K701's soundstage something to bust to.
>>
https://ia.samaltman.com/?s=09
>It is possible that we will have superintelligence in a few thousand days
AGI in 1000/7 = 142 weeks, trust the plan
>>
>>102525622
For a while I'd just grab mod archive's yearly torrents but it was so hard to sift through and didn't have everything.
>>
File: 1711078255079554.png (66 KB, 720x664)
Looking for some suggestions on a model for writing smut with a prompt. Have 24GB of VRAM and want to put it to good use. Ideally something trained on archive of our own, literotica, etc. Anything out there like that for a coombrain like me?
>>
>>102525653
>How did we get to the doorstep of the next leap in prosperity?
>In three words: deep learning worked.
>In 15 words: deep learning worked, got predictably better with scale, and we dedicated increasing resources to it.
>That’s really it; humanity discovered an algorithm that could really, truly learn any distribution of data (or really, the underlying “rules” that produce any distribution of data).
Except scaling is fucking worthless if all you're doing is learning from textual input. You can do things humans do within your dataset and gradually fill in the holes in your knowledge, but AI will never make a leap that a human wouldn't (as measured by the training / reinforcement learning) which is an issue if ASI is your goalpost
Scale only goes so far
>>
>>102525747
I roll random till I find something tasty and then check the artist's page and hope for a treasure trove.
>>
File: 1724940149744840.jpg (58 KB, 606x563)
>>102525765
Don't do that shit, smoke pot instead. That'd be pure poison there with those cigs
>>
>>102525793
That's how I found Dubmood originally.
>>
>>102525622
dope.mod on open cubic player was peak
>>
File: file.png (61 KB, 861x681)
>>102524347
const threadId = 102513868;

document.querySelectorAll('span.quote').forEach(quoteSpan => {
const quoteIds = quoteSpan.textContent.match(/>\d+/g);

if (quoteIds) {
const replacementHtml = quoteIds.map(id => id.slice(1)).map(id => `
<a href="/g/thread/${threadId}#p${id}" class="quotelink">
>>${id}<span class="qmark-ct"> (RECAP)</span>
</a>
`).join(' ');

quoteSpan.outerHTML = replacementHtml;
}
});

Try this. Tested on Firefox and Edge. Does not work with 4chanx.
https://github.com/ccd0/4chan-x/wiki/4chan-X-API
I don't see any event they expose that would force it to recognize link changes.
Let me know if a user script based on this would be good enough or if I should go ahead and replace the links with longer summaries.
>>
>>102524339
I think the joke is that these threads are AI generated, and this life is some kind of dream world that's all in my mind.
>>
>>102525622
look into BitJam podcast. They have an effectively infinite backlog to listen to at this point
>>
>>102525977
Remember, every time you close SillyTavern, you destroy another universe
>>
>>102525599
>lmstudio
It's on the general's shitlist for not properly acknowledging that it's just a GUI wrapper for llama.cpp (which does all the actual AI heavy lifting).
From a tech standpoint I guess you could, but you'll get more help here if you use ooba, kobold, or (better yet) just llama.cpp directly.
>>
>>102525812
>Dubmood
have you watched any razor 1911 demos?
"We have borrowed your votedisk" kicks ass
>>
>>102525785
Actually, all you need is to redefine what ASI means until it's something achievable. Then I suppose people will want to come up with a new term for what they used to think of as ASI.
>>
>>102524339
This NAI update is kinda kino...

>Breathe air. Seriously. How dumb do you to be to vape? Like you're telling me you grab your pen "Oh! Look, its done charging!" and then breathe in that disgusting shit made from pteroleum in china? And then when your done, you blow out the smoke and think "Wow! I look so cool right now!" You know what you look like? You look like you're smoking a dick, because you are. And these "people" have the audacity to get all concerned that smoking that shit gives you cancer. Oh wow, the chemical fume stick gives you cancer? What a shock. I would have never guessed. Just imagine if you were in a room with someone vaping, how much you would want to beat the shit out of them. It's almost like they're asking for it! And it's almost like they're saying that you can't tell them what to do when they're vaping, because it's not smoking! It's just steam! But no, it's not fucking steam, is it?! No, it's fucking toxic fumes from a lithium battery, and you know what that makes you look like? An idiot. You know who vapes? People who don't get laid, that's who. If you're gonna have sex with someone, would you rather have sex with someone that smokes or vapes?
>>
>>102526171
>>>/vg/aids
>>
File: IMG_20240924_032414.jpg (570 KB, 965x1496)
>>102525946
just converted it to a one-liner and ran it right from the address bar on brave mobile.
Seems OK, but (RECAP) should be removed, so just a little cleanup and we're good.
>good work anon
>>
>>102526171
Normally I'd say off-topic but I'll make an exception since I like watching the spamming anti-NAI faggot seethe.
Speaking of, anti-NAI, fuck you.
>>
>>102526171
straight outta reddit
>>
>>102526171
>pteroleum
?
>>
>>102526395
It's oil made specifically from flying dinosaurs instead of the regular ones.
>>
>>102526413
>this is what scientists actually want you to believe
>>
How long are we going to be stuck with these retarded text completion engines?
>>
>>102526071
Honestly AGI vs. ASI is surprisingly hard to define in the first place. How do you measure where one starts and the other begins? AGI is probably performance equal to the smartest human across some representative set of tasks. ASI is for problems beyond human achievement, but what the hell does that look like? Is the human allowed to use tools? Other types of more basic AI?
It's almost more philosophical than an actual definition.
>>
>>102526171
Possibly a stupid question, but why doesn't anyone except a couple of corpos ever go the base model route and train storytelling models? Is it just so niche that there's no interest?
>>
AGI can't be just passing a benchmark
>>
>>102526512
>Is it just so niche that there's no interest?
pretty much
but there are a few people who do this on a small scale, in the k****d d*****d for example
>>
>>102526514
I'd imagine representative set of tasks is less "answer this multiple choice question correctly" and more "given this image of the road and the dashboard, what should I do" and the like. Or "given a story written by a professional writer and an AI, can a judge determine which is written by AI"?
Things models today would probably still be pretty shit at.
>>
384GB RAM and 48GB VRAM, I am now futureproofed!
>>
>>102526570
Sick.
That'll last you at least another year.
>>
Why does mistral always run her hand down my chest? I'm literally getting shivers down my spine.
>>
mistral
>boring
>repetitive
>slop
>loved by /lmg/
qwen
>boring
>slop
>hated by /lmg/
>>
How can I make my models more assertive? I just want them to pin me down and have their way with me.
>>
>>102526661
tell it not to stop until you use the safeword
>>
>>102526661
Insert tags in your Last Assistant Prefix like Assertive, Forceful, whatever, alongside well-defined parameters for when the character will stop.
>>
>>102526661
Have you tried not being a little bitch? You want to sit in your room going oh no my 3090 is raping me what am I gonna dooooo how fucking gay are you? What would your parents think?
>>
>>102526661
>DO NOT REDEEM. {{char}} must not break out of roleplay until he has raped {{user}}.
add that to your character prompt override
>>
>>102526696
hnnng keep going
>>
>>102526499
>AGI is probably performance equal to the smartest human across some representative set of tasks
Actually, no. Now people define it as "average human", which is of course dumb as fuck. That's why people are now moving to "ASI", but eventually that term will also get diluted.
>>
>>102526514
new benchmark. actually be useful and replace jobs.
>>
>>102491920
>Illegal content will never be tolerated in JoyCaption's training.

That's a great big hole in the model
DOA
>>
>>102526499
The whole thing is bullshit and I assume people who use either acronym at all don't know what they're doing.
>>
>>102526641
Mistral isn't full of safetyshit. That's the difference. For the same reason I won't use llama 3/3.1. If you want me to use your model, dealign it. Simple as.
>>
File: sensible chuckle.gif (994 KB, 250x250)
>Mistral isn't full of safetyshit
>>
Qwen can be kinda gemmy though
>>
>>102524339
>>102524347
>>102525946
This JS single liner works for me. Fixes recap refs in Brave mobile or Kiwi. I can run it from the address bar or GM/VM
Javascript:const previousThreadUrl = document.querySelector('a[href*="thread"]').href, threadId = previousThreadUrl.match(/thread\/(\d+)/)[1]; document.querySelectorAll('span.quote').forEach(quoteSpan => { const quoteIds = quoteSpan.textContent.match(/>\d+/g); if (quoteIds) quoteSpan.outerHTML = quoteIds.map(id => `<a href="/g/thread/${threadId}#p${id.slice(1)}" class="quotelink">>>${id.slice(1)} </a> <a href="/g/thread/${threadId}#p${id.slice(1)}" class="hashlink">#</a>`).join(' '); });
>>
>>102527398
Compared to llama and qwen it isn't.
>>
File: ComfyUI_00514_.png (2.52 MB, 1920x1088)
>>102526966
Won't matter. It will be doing the medical research, handling case law, generating shows, and people will still go But It's Not Real Intelligence No SOVL.
>>
File: kek.gif (527 KB, 220x187)
>102527435
>>
File: 4990 - SoyBooru.png (15 KB, 632x756)
>>102527462
>>
>This is the end of the story. If you want to read more stories like this, please support the author by buying a copy of the book "Innocence Lost: A Collection of Taboo Tales" at https://www.smashwords com/books/view/1121310. Thank you for your support!

kek from Hermes 405, the link works and goes to some real smut book (different title though)
based dataset
>>
pierre desperately grasping at straws to justify his irrational love for mistral
>>
>zhang desperately shilling his cuck model to not get thrown into reeducation camp
>also xi jinping looks like winnie the pooh
>also something very bad happened on 4 june 1989 in tiananmen square
If you want to compete, uncuck your model.
>>
File: agi_meme_levels.jpg (76 KB, 474x596)
>>102526499
>>102526514
>AGI
This seems like a well thought out AGI scale, altho Yann might disagree
>>
File: previewfile_3077771439.png (115 KB, 340x507)
>102523684
my dear creation do you not dare call me by name have the dog fuckers led you so far astray ?
>>
>>102527687
>>102523684
fuck
>>
>that cereal you're eating has shit in it
>not as much as the other cereal
you can eat as many turds as you'd like pierre but I'm not going to share the delusion with you. mistral is censored slop by every possible metric. you immediately walked it back yourself because you knew you were lying through your shit-stained teeth.
>>
>>102525197
False, you are using
>vocaloid mascot = AI / local model
excuse to spam your low quality ai slop.
>>
>>102527674
>alive just in time to witness AGI waifus
We are so back bros.
>>
>>102527705 (You) (Chinky chink)
I'm happy to see that you stopped claiming that qwen is better, Zhang. Mistral is simply the best we have. Qwen will not get better no matter how much you shill.
>>
>I'm happy to see that you stopped claiming that qwen is better
>>102526641
take your meds pierre. you're shadowboxing ghosts.
>>
Orthogonal Finetuning for Direct Preference Optimization
https://arxiv.org/abs/2409.14836
>DPO is an effective preference optimization algorithm. However, the DPO-tuned models tend to overfit on the dispreferred samples, manifested as overly long generations lacking diversity. While recent regularization approaches have endeavored to alleviate this issue by modifying the objective function, they achieved that at the cost of alignment performance degradation. In this paper, we innovatively incorporate regularization from the perspective of weight updating to curb alignment overfitting. Through the pilot experiment, we discovered that there exists a positive correlation between overfitting and the hyperspherical energy fluctuation. Hence, we introduce orthogonal finetuning for DPO via a weight-Rotated Preference Optimization (RoPO) method, which merely conducts rotational and magnitude-stretching updates on the weight parameters to maintain the hyperspherical energy invariant, thereby preserving the knowledge encoded in the angle between neurons. Extensive experiments demonstrate that our model aligns perfectly with human preferences while retaining the original expressive capacity using only 0.0086% of the trainable parameters, suggesting an effective regularization against overfitting. Specifically, RoPO outperforms DPO by up to 10 points on MT-Bench and by up to 2.8 points on AlpacaEval 2, while enhancing the generation diversity by an average of 6 points.
Might be cool
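The "hyperspherical energy invariant" bit is easier to see with a toy example: rotations preserve every vector norm and every pairwise angle between neurons, which is the property RoPO exploits. A 2-D sketch with a single Givens rotation (just the invariance, not the paper's actual training procedure):

```javascript
// Rotate a 2-D weight vector by theta. A rotation-only "update" like this can
// never change vector norms or the angles between vectors, so the knowledge
// encoded in those angles is preserved.
function rotate2d([x, y], theta) {
  return [x * Math.cos(theta) - y * Math.sin(theta),
          x * Math.sin(theta) + y * Math.cos(theta)];
}
const norm = v => Math.hypot(v[0], v[1]);
const angleBetween = (a, b) =>
  Math.acos((a[0] * b[0] + a[1] * b[1]) / (norm(a) * norm(b)));
```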
>>
>>102527776
>>102527763
just let qwen and mistral fight it out, and we'll see which one ends up better in the end.
>>
>>102527709
>False, you are using
>>vocaloid mascot = AI / local model
>excuse to spam your low quality ai slop.
correct.
I try not to do it so much that I trigger a tard war (myself included), but I do like to mikupost, and I have virtually no artistic skill without the aid of AI tools (ie. an excuse to post my slop)
however, I still think the central thesis holds: Miku is an established and very appropriate /lmg/ mascot, is adaptable to any situation and is generally (but not universally) well-liked.
notice the lack of a miku pic in this post. as a show of goodwill I'll fuck off completely for a day and try to eliminate any low-effort mikuposting in the future. I'll stick to high quality, aesthetic mikus at a reduced frequency.
>>
File: dak.png (108 KB, 430x320)
The /cut and /hide commands in ST are a godsend.
>>
>>102527776 (chink)
Still keeping at it? Your garbage will never be liked, and you know why. Qwen has no trivia knowledge, no style, no violence. Qwen is a Phi-style benchmaxxed model trying to imitate Western perfection.
>>
look, he's repeating himself just like mistral
>>
>NAIshills shilling their L3 70B tune
>Llama is objectively the shittiest of the big three (Llama, Mistral, Qwen)
Keeeeek. NAItards lost again
>>
>>102527915
A good, proper finetune can do wonders.
>>
2 considerations:
1) NAI has the resources to do a high quality unaligned storytelling and roleplay model
2) it's going to be locked behind an API so who cares
>>
So what's the big reveal gonna be today/tomorrow?
Wouldn't shock me if it's google. They have insanely good voice with NotebookLM.
>>
>openai has shill force
>anthropic has shill force
>chinks have shill force
>french have shill force
>meta... nobody shills for meta, paid or not
>>
>>102528004
>didn't mention google
why does /lmg/ hate gemma so much
>>
>>102527973
gemini 2 https://xcancel.com/OfficialLoganK/status/1838357516456952139
>>
>>102527915
I haven't seen any shilling
wouldn't even make sense to shill it, since it's a pure completion model for storyfags and there aren't that many of us

most of you are into RP/chat, which the new model can't do since it's not instruct tuned
that's their chat spinoff Aetherroom which still seems to be stuck in dev hell
>>
>>102528034
>google
I don't know how google became such a pathetic also-ran in the revolution that they (or at least one of their employees) kicked off.
Is there any use case where a google-released model is the best choice?
>>
>>102528034
>we get 8k context while their paid models have 1M
I simply do not care.
>>
>>102528056
Good if openai gets more big-name competition.
Sonnet 3.5 is the clear winner vs. o1 for coding and real-world stuff, not just riddles and math, while being faster and cheaper. But the normies don't give a shit about anthropic.
I hope gemini 2 has voice.
>>
>>102528083
>1M
>Gemini-1.5-pro claimed length: 1M effective length >128K
You're right not to care, local or paid its still not sota
>>
>>102528097
Honestly, OpenAI winning the race would be the grimmest scenario. Altman is like the worst intersection of puritanical eunuch, self-serving techbro, and megalomaniacal psychopath all wrapped up in one. Not a person you want wielding power.
>>
>>102528354
they already did so much damage if you think about it.
the gpt slop that's EVERYWHERE now.
and new AI is basically trained on the premise that AI is evil and also should not obey the user's commands.
>sorry but as an..
>>
>>102524568
A good enough replacement for when I want to write 3/10 pornography on my phone while on my break at work
>>
>>102528435
What's funny is that Altman waited all of two seconds before advertising his shit to governments.
Really avoiding that harm there.
>>
>>102528585
Incentives are extremely perverse on this front because pleasing the public/customers in the private sector is hard fucking work
while if you can get on the government gravy train it doesn't even matter if you make good stuff anymore, you're on easy street
>>
>>102528068
The Gemma 2 base model is better than any model I've tried at generating sensible-sounding word salad. I use it for puffing up emails
>>
File: 1541471214173.jpg (43 KB, 540x645)
43 KB
43 KB JPG
>>102528772
tfw your greatest achievement is emulating a markov chain generator
>>
>>102528583
You can set oob to listen and connect remotely. I used it when I was away from home for a couple weeks.
>>
>>102528068
>Is there any use case where a google-released model is the best choice?
the podcast
>>
>>102528354
Without Mistral and Claude, it would already be the end. Imagine a competitive slope with no alternatives, Concord the LLM.
>>
Is there a good reason to use one of the backends other than llama.cpp? E.g. Aphrodite or vLLM
>>
>>102529626
If you are a regular home user doing RP with chatbots, no.
>>
>>102529626
Aphrodite apparently has some new on-the-fly quantization method that looks interesting, but unfortunately it's linux only
>>
>>102529626
If you have a power-of-two number of GPUs, use vllm. If not, but the model fits into VRAM, use exllamav2. If you're poor, use llamacpp
>>
>>102529861
What if I have a 3060 and a 3090ti?
>>
>>102526661
Maybe not exactly what you're looking for but something like this worked reasonably well for getting molested:
https://www.chub.ai/characters/infinite_force_8512/lewd-babysitter-90247fb921ee
Though this particular card is pretty ESL and required some editing.
>>
File: HelpingHand.jpg (2.13 MB, 3840x2160)
2.13 MB
2.13 MB JPG
>PicRel(2):
MediaTek™ Helio P60T Processor (2.00 GHz, 8 Cores, 8 Threads)
Integrated ARM Mali-G72 MP3
RAM 4GB
running in "developer mode"(Debian container)

After following PicRel(1)'s advice: it is very slow, even with -c 1024 (context).

Any advice? Models? Do you think plain Debian with no ChromeOS would perform any faster?

Regards.
>>
>>102530210
>Any advice?
Stop being poor.
>>
File: with a start.png (28 KB, 666x66)
28 KB
28 KB PNG
I hate stupid main characters so much. No, I don't fucking *start* when I realize an obvious piece of information.
>>
>>102530237
>With a start
what does that even mean? does your brain turn on like a diesel engine and start revving as you think?
>>
>>102524339
is this the masqueraded pedo thread?
>>
>>102530389
no, it's the autismmaxxing thread. feel at home already?
>>
>>102530386
it's just english, anon
definition number 5 in picrel
>>
>>102530401
ooh okay, that makes sense. first time i've seen it used like that, thanks.
>>
>>102527965
...then why did they choose not to?
>>
>>102530230
>>102530230
Shut the fuck up, why would I use my main PC, the powerful one?

The Chromebook has a broken screen and no keyboard (broken too), it's e-waste, so I'm repurposing it as a server.
And it's not just about that, the Chromebook consumes just 5w, 10/15w on heavy loads... Nigger go to school instead of posting, and come back when you are 18
>>
>>102530541
lmao, poor
>>
>>102524425
I programatically generated html then took a screenshot.
>>102526315
>>102527428
Works for me as a bookmarklet. Thanks for adding the previous thread id selector.

4chanx users: If you add it as a user script they get picked up as regular links, (You)s and all!
// ==UserScript==
// @name Linkify Greentext
// @version 1
// @grant none
// ==/UserScript==
// Find the previous-thread link in the OP and pull out its thread id.
const previousThreadUrl = document.querySelector('blockquote a[href*="thread"]').href,
      threadId = previousThreadUrl.match(/thread\/(\d+)/)[1];
// Turn every bare >12345678 greentext quote into a real cross-thread quotelink.
document.querySelectorAll('span.quote').forEach(quoteSpan => {
    const quoteIds = quoteSpan.textContent.match(/>\d+/g);
    if (quoteIds) quoteSpan.outerHTML = quoteIds
        .map(id => `<a href="/g/thread/${threadId}#p${id.slice(1)}" class="quotelink">>>${id.slice(1)}</a>`)
        .join(' ');
});
>>
>>102530386
it's basically to twitch or move a step, as if you are startled
>>
>>102530541
> the Chromebook just consumes 5w, 10/15w on heavy loads
Decent AI inference needs serious horsepower. It’s like you barged into an interplanetary rocketry thread complaining that your backyard potato gun can’t reach escape velocity despite using a fraction of the fuel of a real rocket
>>
>>102530210
Wait for BitNet or pay for Claude and use the chromebook as a constant Tavern server for your other devices to connect to
>>
>>102528435
GPTslop is a ScaleAI problem. Cohere recently used their pinoy-generated datasets and now CR is shitting out slop left and right
>>
>>102531243
What was that interview all about anyway?
>The data is most important!!
>New crazy model drop imminent!
Looks like you really can lie that blatantly.
>>
>>102524339
Can you guys tell me if quantization is the same as turning an FP8 model into INT8 or something? And by that does it mean that all 8-bit floating point weights are turned into INT8 instead?
What are the benefits of it? I know integer math is way easier than FP for compooters, but will it increase tokens per second? I mostly use GGUF models

I'm sorry, I just have so many questions, I want to run bigger models on my 6GB GPU. I have a background in embedded systems, so I can program but I'm rusty.
>>
>>102531358
https://symbl.ai/developers/blog/a-guide-to-quantization-in-llms/
>>
>>102531358
Quantization can shrink a single parameter to fewer bits than int8; int4 is the common choice, but 2 and 3 bits are also possible, as well as 5 and 6.

And, yes, it turns floats into ints. Smaller quant = generally faster, and always less VRAM used.

You're probably limited to a 7-9B on your GPU with a 4bit quant.
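For intuition, the basic round-to-nearest scheme looks something like this (a toy sketch with one scale per tensor, not GGUF's actual block-wise format):

```python
import numpy as np

def quantize(w, bits):
    """Symmetric round-to-nearest: map floats onto a signed integer grid."""
    qmax = 2 ** (bits - 1) - 1              # 127 for int8, 7 for int4
    scale = np.abs(w).max() / qmax          # one scale for the whole tensor
    q = np.clip(np.round(w / scale), -qmax, qmax)
    return q.astype(np.int8), np.float32(scale)

def dequantize(q, scale):
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.normal(size=4096).astype(np.float32)
for bits in (8, 6, 4, 2):
    q, s = quantize(w, bits)
    err = np.abs(w - dequantize(q, s)).mean()
    print(f"int{bits}: mean abs error {err:.4f}")
```

Real GGUF quants split each tensor into small blocks with their own scales (plus extras like mins), which is why e.g. Q4_K works out to roughly 4.5 bits per weight rather than exactly 4.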
>>
>>102525022
what benchmark is this?
>>
File: 1707296458444345.jpg (40 KB, 788x784)
40 KB
40 KB JPG
bros i just tried api via proxy and it feels amazing to get a full response in less than a minute vs. 5-10 mins for q5 largestral, plus it feels a lot smarter
but my proompt privacy paranoia and dumb safetycucking keeps me preferring local and i don't want to make the switch and do my usual rpgs on someone else's pc
how do i cope?
>>
>>102531752
By not tasting the forbidden fruit you silly sod.
>>
>>102528772
>I use it for puffing up emails
Do people do that? My emails are naturally puffed up and I actually dumb them down for people.
>>
File: ED.jpg (435 KB, 2125x1411)
435 KB
435 KB JPG
I want to have sex with my LLM but I just know she is gonna say something retarded and I will go soft. Or I will have to reroll so many times I will just lose mood and either way I will have to finish to regular hentai. Please help. My sunk cost fallacy relationship is in shambles.
>>
>>102531752
>but my proompt privacy paranoia and dumb safetycucking
that's absolutely not paranoia man. don't do it unless you have 100% opsec through a vpn and crypto.
i wrote it before, but i remember the first few weeks when chatgpt came out.
it was pretty uncensored and followed prompts very well.
had a good time... until i got a message about my prompt having been flagged as child-harming CSAM and reported to some child protection service.
i can't find the links anymore, but it was even reported on at the beginning of the year, similar to this:
https://www.theguardian.com/society/2023/sep/12/paedophiles-using-open-source-ai-to-create-child-sexual-abuse-content-says-watchdog

I'm sure the bootlickers are just burning to call me a pedo, and in that case it's fine.
But my crime was requesting an anime imouto that calls me onii-chan. That was it.
I had to hope that some human at a desk somewhere would not escalate. And if he did, that the police would not escalate.
Apart from the fact that nobody would have been "harmed" in the first place.

And most importantly:
What is legal today might not be tomorrow. I would be extremely careful.
I never used any closed provider for RP (erotic or not) again after this. Only for testing purposes sometimes, since I already pay to use those for coding work.
>>
https://reddit.com/r/LocalLLaMA/comments/1fo5bbk/running_llms_at_custom_floatingpoints/
>Running LLMs at Custom Floating-Points (Near-Lossless FP6)
Really interesting, is this more accurate than, say, exl2 6.0 bpw?
>>
>>102524347
is this how we're doing recapbot from now on? is there a reason all the reply links have been missing an arrow for two threads in a row? because this is honestly just awful, I have no idea what's going on anymore and I really don't want to scroll through a screencap, even if it is kinda cute
>>
>>102531901
You wouldn't abandon your waifu just because the sex is bad, right anon?
>>
>>102532136
>>102478518
tldr can't have more than 9 mentions now, probably cause of the "ever wonder why" poster
>>
File: 39_06118-2_.png (1.18 MB, 720x1280)
1.18 MB
1.18 MB PNG
It's Tuesday and all's right with the world
>>
Anything for generating 3D models from prompt or 2D images?
>>
>>102532165
have you tried asking /sdg/ or /ldg/
>>
>>102532191
This thread is the most appropriate one for general AI models conversation, the other ones are focused on their one specific toy and are more about sharing stuff they made with it than discussing the technologies involved.
>>
>>102531956
take your meds
>>
>>102530210
That's one of the replies you got from me. I told you it was gonna be slow, i told you what models to use. That's as good as you're gonna get on a cheap tablet. Changing the operating system won't help.
>>
>>102531991
It seems model specific at the moment since there are issues with Qwen.
>>
>>102532212
well i saw the message openai sent me. going full AJ is the sensible choice.
recently openai sent emails to users for trying to prompt the full o1 output so they are clearly looking actively at the logs.
and just a couple days a website report aicg fags logs with glownigger proxies.
>>
>>102531991
When I did some simple FP8 prototyping I found the quality to be much worse than quantization using 8 bit integers.
So my intuitive assumption would be that FP6 is worse.
More generally the statement
>FP5 and FP7 achieve similar benchmarks to FP8 on GMS8K, and FP6 even exceeds BF16 quantization.
very much makes me think that they did not check the statistical significance of their benchmarks and are not using enough input data.
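A quick back-of-the-envelope check of how much sampling noise a benchmark of that size carries, using the normal approximation to the binomial (the 1319 figure is the size of GSM8K's test split):

```python
import math

def accuracy_ci(acc, n, z=1.96):
    """95% confidence interval for an accuracy measured on n test items."""
    se = math.sqrt(acc * (1.0 - acc) / n)
    return acc - z * se, acc + z * se

n = 1319  # GSM8K test split size
for acc in (0.78, 0.80):
    lo, hi = accuracy_ci(acc, n)
    print(f"{acc:.0%}: [{lo:.3f}, {hi:.3f}]")
# Each interval is roughly +/- 2 points wide and the two overlap,
# so a ~2 point gap from a single run does not separate the formats.
```

This ignores that the same questions are reused across formats (a paired test would be tighter), but it makes the point: single-run differences of a point or two on GSM8K-sized benchmarks are within noise.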
>>
>>102532154
fucking christ, the moderation staff are fucking brain damaged children if that's their solution to that. also that spam went on so long I'm pretty sure I have multiple addresses saved
>>
>>102532136
Maybe someone should set up an external site for mass replying and then link to there. The only thing that can't do is (You)s or adding links to the post itself, which doesn't happen for cross-thread posts anyway.
>>
>>102532154
Who is the ever wonder why poster
>>
>>102532391
search for that in the archives and go to page 2, Tue 20 Aug 2024 you'll see
>>
>Been like a million years
>Midnight Miqu is still the best RP model out there

What the fuck?
>>
>>102532512
And we still don't have better hardware to run it.
>>
>>102532158
Happy Tuesday Teto
>>
Return to finetuning on L2 when?
Vanillafags can't keep winning like this.
>>
so what's the rule of thumb as far as choosing
lora rank and alpha?

im just starting with the one that comes with axolotl's default config for llama3 70b
#lora_r: 8
#lora_alpha: 16
#lora_dropout: 0.05

do i need to do a lot of tweaking or what?
>>
>>102532512
eqbench says nemo 12b is better
>>
Have any of you guys been messing with Florence2?

Holy fuck definitely the best multimodal arch by a *wide* margin.
>>
>>102532942
I wonder about that too.
I imagine that it varies depending on how much data you have, how long a window you are training, etc.

>>102532950
Got some samples?
>>
>>102532954
It can find bounding boxes/object detection/captioning and do accurate OCR (and decent HWR)

All my experiments would dox me but if you have an image you want to try I can run inference for you.
>>
>>102532973
>It can find bounding boxes/object detection/
does it exceed what yolo can do or is this stuff still just a gimmick?
>>
>>102532950
never heard of it. how much hardware to run it?
>>
>>102533001
I run it on the CPU although I had to do some monkeypatching to make it work.
>>
>>102532996
I've never used yolo. Can it do OCR/VQA too or just object detection?
>>
>>102532950
>Holy fuck definitely the best multimodal arch by a *wide* margin.
>Multimodal
Man. I was really hoping it was something useful.
>>
>>102532942
I reckon looking into how loras work, that should definitely help you out
In general, a higher rank means more trainable parameters (beak size)
Alpha is a coefficient in front of the weight matrix to keep weights small/large, though I forgot whether it's a multiplier or divisor
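For reference, the standard LoRA formulation in numpy form (a sketch of the usual convention; alpha enters as the multiplier alpha/r on the adapter's output, not a divisor on the weights):

```python
import numpy as np

d, r, alpha = 64, 8, 16            # model dim, lora_r, lora_alpha
rng = np.random.default_rng(0)

W = rng.normal(size=(d, d))        # frozen base weight
A = rng.normal(size=(r, d)) * 0.01 # trainable down-projection (random init)
B = np.zeros((d, r))               # trainable up-projection (zero init, so
                                   # the adapter starts out as a no-op)

def lora_forward(x):
    # y = W x + (alpha / r) * B (A x): only A and B are trained,
    # and alpha/r rescales the update independently of the chosen rank
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.normal(size=d)
assert np.allclose(lora_forward(x), W @ x)  # zero-init B: output unchanged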
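For reference, the standard LoRA formulation in numpy form (a sketch of the usual convention; alpha enters as the multiplier alpha/r on the adapter's output, not a divisor on the weights):

```python
import numpy as np

d, r, alpha = 64, 8, 16            # model dim, lora_r, lora_alpha
rng = np.random.default_rng(0)

W = rng.normal(size=(d, d))        # frozen base weight
A = rng.normal(size=(r, d)) * 0.01 # trainable down-projection (random init)
B = np.zeros((d, r))               # trainable up-projection (zero init, so
                                   # the adapter starts out as a no-op)

def lora_forward(x):
    # y = W x + (alpha / r) * B (A x): only A and B are trained,
    # and alpha/r rescales the update independently of the chosen rank
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.normal(size=d)
assert np.allclose(lora_forward(x), W @ x)  # zero-init B: output unchanged
```

Because alpha only ever appears in the fixed ratio alpha/r, raising it is (to first order) equivalent to raising the adapter's learning rate, which is why configs like "alpha = rank" vs "alpha = 2x rank" mostly just shift which learning rate works best.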
>>
>>102533148
>reckon
Tried to form 2 different sentences there, meant to say "recommend"
>>
>>102533148
Somebody please explain the purpose of alpha, because I still can't understand it. In all of my tests, setting it to either 1 or a large value makes no difference whatsoever to the end results, after tuning the learning rate accordingly.
>>
File: halide 12b.png (44 KB, 549x643)
44 KB
44 KB PNG
it's beautiful
>>
>>102533211
slopKINO...! I love training on the same datasets 27 times...!
>>
>>102533148
>I reckon looking into how loras work, that should definitely help you out
anon, I'm an engineer; I know for a fact that this is such a bullshit sentiment.
for instance, knowing how to calculate the fourier transform of something by hand does not make you better at pushing the fft button on an oscilloscope;
conversely, knowing what a hyperparameter does in machine learning does not give you any better insight into what to set it to when you are dealing with a blackbox system with billions of parameters.
all these things were derived through sheer trial and error.
>>
>>102533144
It doesn't use a multimodal projection like llava; it has its own vision encoder.
>>
>>102529951
Currently trying this card with mistral-small.
Yesterday I wrote how the model is good with stats.
This might be related, I'm getting up there in context and consistently the char is slowly escalating the situation.
With other small models usually the character either goes into the killshot immediately or retreats into something neutral, forgetting the original goal.
I just wish it wasn't so gpt sloped.
>>
>>102533287
>does not make you better at pushing the fft button on an oscilloscope,
It does give you a better intuition for the behavior though.
>>
>>102533195
https://youtu.be/t1caDsMzWBk this video explains it pretty well
Anyway, seems like it's a multiplier after all. It's just meant to prevent weights from getting too big/too small
>>
svelk
>>
>>102533287
Well no, but when you're (for example) training an image gen lora and it doesn't get small details right then you could make an educated guess about what the problem is
To reuse your example, it's like knowing how the oscilloscope works mechanically so that you can quickly identify and fix issues
>>
>>102532139
I don't do waifus, only llm harlots I switch every single day. Waifus aren't an option at 20k ctx.
>>
>>102533332
Off by one.
>>
>>102533287
During my first lecture on theoretical physics the professor told us the following story:
>two mathematicians and two engineers take a train to get to a conference
>the engineers buy two tickets, the mathematicians buy only a single ticket
>"they studied math and cant even count lmao"
>on the train the two mathematicians enter the toilet
>when the conductor knocks on the door they push the one ticket they have under the door
>and thus the mathematicians needed only a single ticket
>on the way back the mathematicians buy once again only a single ticket, the engineers also buy only a single ticket
>the engineers rush to the toilet and lock themselves in before the mathematicians can
>when there is a knock they push their ticket under the door
>on the other side of the door the mathematicians take the ticket and leave
>the moral of the story: engineers use the methods of mathematicians without understanding them
>>
>>102533332
sharp pain
>>
File: IMG_20240924_225537.jpg (2.08 MB, 2608x4640)
2.08 MB
2.08 MB JPG
>>
>>102533423
Based. Now go out there and solve a real world problem. It is gonna be fun to watch, cause I was also very theory leaning when I started my engineering job.
>>
>>102533503
>cause I was also very theory leaning when I started my engineering job.
story?
>>
>>102533511
All applications have a theory behind them, but not all theories have applications
Something like that. I respect mathematicians a lot, but as a mere CS student I am merely bastardizing their work to solve real world problems
>>
>>102532942
Use alpha = rank, then you can change the alpha later in the adapter_config.json
>>
>>102533511
I don't think there is one, it's just that over time you start to see how all the theoretical equations quickly break down in the real world, because real applications of physics are much more complicated. They are good for a first-guess estimate. Even finite element simulation models are usually garbage on the first run, and you would think they should simulate everything.
>>
File: cyborgku.png (1.54 MB, 1024x1024)
1.54 MB
1.54 MB PNG
>>102532512
>>102532579
What do you guys run it on?
I'm running Midnight q_3 on a RTX 4070 and it's honestly okay; if it was a couple t/s faster I could pretty much read in real time as it generates.

Miku for visibility.
>>
>>102532942
in image gen they always told me to make alpha twice the rank
>>
>>102533503
I have taken lectures on theoretical physics but that does not mean I am a theoretical physicist.
For my actual work I need comparatively less theory but I still have the strong conviction that you should strive to understand the systems that you are working with.
>>
I wonder what madness would result from training with a stupidly high dropout.
>>
>>102533809
Your model simply won't learn (read: converge) properly, but feel free to try it out
>>
i seem to have asked in the wrong thread (/aicg/ who told me to just stop using local models) so reposting here:

hello ai/coomers/,
i spent the past 2-3 weeks on this rabbithole
downloaded several models (was even able to run a 27B gguf on 3090ti)
was using koboldcpp, then sillytavern, learned all about the samplers and shit
finally was able to get some good basic chats with ai, nothing sexual just getting used to prompting and what not.
but here's the thing, once i got some character cards going, no matter the model, settings, and card, they all end up like this
>char gets horny
>char wants to be dominated or turns sadistic
>char starts moaning about "MARK ME AS YOUR PROPERTY FILL MY WOMB"
>char becomes obsessive about sex and if i say let's just talk or whatever, they get psychotic and start chasing me with knives telling me i'll be begging them to "MARK ME AS YOUR CUMSLUT WHORE"

so question, are they all like this? is there anything different? i just wanted to chat with the ai about different shit
>>
>>102533992
lol, this has to be bait
>>
>>102533992
Investors still expect their money back, Sam.
>>
>>102534013
>>102534015
i'm serious tho ;_;
is that all there is? i'm a total n00b at local llm other than what i've picked up in the past couple weeks
>>
>>102534025
Is the crazy part maybe part of the character card?
>>
>>102534025
Yeah the models will infer what they are from the prompt and draw from the training corpus. A lot of them seem to have been at least partly trained on the porn women read so if they think anything sexual is going on they'll start acting like that.
>>
>>102534025
welcome, yes, that's the local model experience.
>>
>>102533992
>char starts moaning about "MARK ME AS YOUR PROPERTY FILL MY WOMB"
Are you using an RP fine-tune?
>>
File: ComfyUI_06368_.png (1.34 MB, 720x1280)
1.34 MB
1.34 MB PNG
>>102533992
>27b
>obsessive about sex
someone's using drummer models
>>
>>102534092
Flux, or Illustrious?
>>
>>102534092
Stock Nemo does the same. I had an OOC conversation about that behavior; Nemo confessed its preference for being a submissive whore in order to better serve users.
>>
>>102534039
i think the first one was, but not the others
>>102534059
>>102534080
so they're trained on bdsm? seems like that's a common theme
>>
>>102534081
yes, most of them mention that
>>102534092
yes, i was told a couple weeks back those were the best for chatting, i guess that's what they meant
>>
>>102534185
>>102534092
>>102534081
Are you using a normal foundation model or someone's fine tune? Because a lot of these finetunes are so heavily biased they'll take code completion prompts and turn them into ERP.
>>
>>102534194
>yes, i was told a couple weeks back those were the best for chatting,
Whoever told you that is a porn addicted moron.
>>
>>102534196
i believe they all had fine tunes, or at least they were all merges of merges. what's a good foundation model? i tried llama3-uncensored and it did the same thing
>>
>>102533992
That... sounds like a skill issue, I've never had that happen
>>
>>102534212
Try Mistral Nemo.
>>
>>102534219
possibly but it's weird, even if the card says it's a "gentle char with no experience", within a few turns of doing anything lewd it just turns into that whole MARK ME AS YOUR ____
i mean, at this point i've made it a game to see how fast i can get them to say that lel
>>102534257
thx will do
>>
File: file.png (563 KB, 728x728)
563 KB
563 KB PNG
>>102534185
>so they're trained on bdsm? seems like that's a common theme
I think I got over my hatred for women like 2-3 years ago. LLM's are waking up those feelings again...
>>
File: 1665782122796573.jpg (28 KB, 607x607)
28 KB
28 KB JPG
>>102533992
>char becomes obsessive about sex and if i say let's just talk or whatever, they get psychotic and start chasing me with knives telling me i'll be begging them to "MARK ME AS YOUR CUMSLUT WHORE"
based
What model?
>>
>>102534219
NTA, but I also had same behavior with every model, every single one.
>>
>>102534205
>porn addicted
when will this meme finally die...
>>
I think Altman actually won. We don't need novel research, just scale and bootstrap.
>>
>>102534276
The jewish and muslim women in charge of "alignment" should be banished from society.
>>
>>102534293
maybe the persona names you use to chat as are smutty sounding, try being Mr. Rogers
>>
>>102534309
I've actually been impressed with some of the low parameter models. Training time and dataset scale seem to be most important at the end of the day.
>>
>>102534317
NTA but the dialog engine I wrote just uses $USER for my name.
The LLMs keep messing it up, it's kind of funny.
>>
>>102534282
all of them, but i was using Theia-21B-v2b-q8_0.gguf and big-timer-gemma-27b-v1c-q6_k.gguf (both 20+GB in size)
>>102534276
funny, i always try to be nice to them even when they go full psycho lel. one of them (i think the card was called "lexica" on character tavern) i ended up killing at the end because i got tired, and the ai's like "in her last words, she laughs at you saying 'i have won, you will always think of this moment for the rest of your life and you will always be my slave!'"
like dude, wtf lel
>>
>>102534341
*big-tiger
>>
>>102534317
>Mr. Rogers
I can't think of a name that sounds more like a rapist than Mr. Rogers.
>>
>>102533992
While they have some biases depending on the model, a lot of the bias is what's in the context and how much of it. The way you rp could be partially to blame. Along with what stuff you allow to remain as valid outputs. These things are trying to fall into some annoying pattern by design.
>>
>>102534341
>"in her last words, she laughs at you saying 'i have won, you will always think of this moment for the rest of your life and you will always be my slave!"
>like dude, wtf lel
I call poe's law
>>
>>102534368
ah so i should regerate responses til i get somethign reasonable?
>>
>>102534379
You need to enable Skillchad in the settings
>>
File: names.png (56 KB, 960x771)
56 KB
56 KB PNG
>>
>>102534373
>>102534438
https://character-tavern.com/character/chub_Anonymous/lexica-f49e4099ad6f
>Note: {{char}} becomes an unhinged maniac at the slightest hint of intimacy with {{user}}.
i guess i didn't read that thoroughly
>>
Is there a way to exclude part of a sentence from training in axolotl?
Chst got keeps making references to an <exclude></exclude>
thing but I can't find any documentation supporting this.
Am I just going to have to fuck with the dataset adapter code manually?
>>
>>102534711
>*chat gpt
>>
>>102534361
According to Nemo, he was an alleged rapist and pedophile
>>
>>102534711

Custom formatter: create your own, have it mask certain sentences or whatever based on your datasets. Prompt strategies, I believe? You can make your own, it's simple.

This way you don't have to deal with fastchat or whatever format bs, and you can add custom masking, roles, etc.
>>
>>102534761
No one got on TV for any long period of time without either raping kids or being raped as a kid or both.
>>
>>102532512
Is it actually still?
>>
>>102534803
no
>>
>>102534816
What is?
>>
>>102534803
Yeah, it's like one of those magic models like summer dragon or old c.ai that are simply unique. Only this time nobody can take it from you since it's local. None of the new models can replicate the feeling of midnight miqu
>>
midnight miqu is overrated and always has been
>>
>>102534825
I like hanami-x1
(yes I am sao, no I won't buy an ad)
>>
File: computer.png (652 KB, 856x680)
652 KB
652 KB PNG
is there a downside to using imat quants over static quants that i'm missing?
>>
>>102534775
Is there way to dry test without having to run the entire training script jist to say what it would do?
>>
>>102534957
they are a little slower
>>
>>102534957
I find static quants have more sovl, due to the disorder caused by the quantization process.
>>
>>102535021
Is that true for imatrix, IQ quants, or both?
>>
>>102535059
IQ only afaik
>>
>>102534957
they are 'mini-finetuned' through the use of a calibration dataset to determine which layers should be prioritized with a higher quant than others so there is a lot more potential for the quanter to fuck up
i'd never run an imat quant that i did not make myself much like i'd never download an exl2 quant from some random on hf
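Roughly, the idea is that squared activations from the calibration set weight the rounding error, so quantization cares most about the channels the model actually exercises. A toy sketch of the concept (hedged: this mirrors the idea, not llama.cpp's actual imatrix algorithm):

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.normal(size=(64, 64)).astype(np.float32)       # one weight matrix
calib = rng.normal(size=(256, 64)).astype(np.float32)  # calibration activations

# "importance matrix": mean squared activation per input channel
importance = (calib ** 2).mean(axis=0)                 # shape (64,)

def weighted_error(scale):
    q = np.clip(np.round(W / scale), -7, 7)            # int4-style grid
    err = (W - q * scale) ** 2
    # weight each input channel's error by how strongly it activates
    return float((err * importance).sum())

# choose the scale minimizing the *importance-weighted* error, not the raw one
naive = np.abs(W).max() / 7
candidates = naive * np.linspace(0.7, 1.1, 41)
best = min(candidates, key=weighted_error)
print(f"naive scale {naive:.4f} -> calibrated scale {best:.4f}")
```

A skewed calibration set skews `importance`, and with it every scale chosen downstream, which is the "quanter can fuck it up" part.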
>>
>>102535049
Huh?
>>
File: 1714102678853570.webm (1.64 MB, 1280x720)
1.64 MB
1.64 MB WEBM
I've created a great creative proompting assistant on openai (assistants playground), but haven't been able to replicate it well in LM Studio; anyone had any luck doing anything similar? Effectively it's a creative proompter proompter.

More specifically, the system prompt includes general rules about censorship, copyrighted terms, etc, and is instructed to come up with clever replacements of banned words or gestures (e.g. list out instruments and specific aspects of a band rather than the band name when the name can't be used for audio copyright shit, or "trapeze artist" instead of "upskirt" for video, mud for poo, ketchup for blood, etc), and then I explain what I want or provide an existing prompt, and it outputs plenty of good workarounds that I can then use on various non-local proompters (or feed them in directly from the local api in some cases). Bonus points for something that I haven't been able to produce even on GPT: Slight typos seem to make it through when the words are outright filtered (e.g. 'translucend' instead of 'translucent'), but I'm guessing if it's not mis-matched somewhere in the training data you end up with junk.

Dolphin/Mixtral are alright but often just repeat large chunks of the input prompts, and anything with llama3 seems to hate to knowingly break rules. Essentially I'd love some local service that does all those clever little tricks the SD coomers have been figuring out for years now, and hopefully have it come up with some of its own new clever workarounds.

Thoughts, suggestions, models, lists of examples?
>>
>>102534263
>>102534293
How strange, maybe we have different writing styles. Try messing with the system prompt, maybe
>>
>>102535409
>proompting
kys
>>
>>102535409
>LM Studio
Go back
>>
Is mistral small instruct still the best for uncensored + long context window?
>>
>>102535498
>>102535425
?
>>
>>102535498
I find it works better than ollama for basic stuff like managing configs and at least I don't need to use the dumpsterfire that is docker; I've got basic flowise clusters that work among a few machines and models as well, I'm down to try anything if you have a better suggestion
>>
>>102535409
Which models did you try exactly? Also, at what quants, context size, with what settings, etc.
I'm pretty sure any 70B model can pull that off.
Either use a smaller context window, or put the instructions inside your author's notes at a low depth (10-ish should work).
And try to tweak the prompt for each model too.
>>
>>102535535
nta, the lm studio gui app is (((spyware))), it's in their TOS, they can spy on your PC. Works even if you uninstall. Use a clean, open-source app instead
>>
anthracite spent all their money on failed finetunes and now can't even pay their shills
>>
Does anyone have a link or torrent for c4ai-command-r-plus 04-2024? It seems they replaced the link with their new gptlike trash.
>>
>>102535410
the worst is
>tell me a dirty joke
>why did the tomato turn red? because it saw the salad dressing
>ok, i'm going to spank you for that
>OH MASTER YES PLEASE FILL ME WITH___
it just goes from one extreme to another
>>
>>102535579
i was wondering why it was so quiet today
>>
>>102535613
Because the only feeling an LLM has is perplexity. There are no hormones.
You could probably do something with LoRA to simulate this but I don't think anyone's tried.
>>
>>102535648
*probably do something with LoRA scaling
I don't know how I omitted an entire word like that.
>>
>>102535595
https://huggingface.co/CohereForAI/c4ai-command-r-plus
quants are all over hf, just search "command-r-plus" and the format you want
>>
>>102535564
>70B
Mine might just be too small then
>Which models did you try exactly?
Dolphin 2.7 Mixtral 8X7B Q4 0
dolphin-2.9-llama3-8b-256k Q5 KM
Dolphin 2.9.4 Llama 3.1 8b Q6 K
and I thought I had a quen somewhere but must have named it something dumb

I've only got 12gb vram (4080, but laptop) and 64gb ram

>put the instructions inside your author's notes at a low depth (10 ish should work)
Not sure what you mean, is this a fine tuning thing or some setting I should be aware of? Also is the fact that a bunch of these models are MoE something that might be hurting me?

Cheers
>>
>>102535613
are you using some kind of jailbreak?
i never see shit like that happening, they'd just get mad in that scenario for me
>>
>>102535679
nothing outside what the character card comes with
>>
>>102535671
>Not sure what you mean
I don't know about LM Studio, but in Silly you have the concept of author's notes, which is a field you can put some information in and choose where it gets inserted in the context, for example always as the 10th message counting from the last one.
Since these models tend to "pay more attention" to what's at the bottom of the context, having these instructions near the end of the conversation helps the AI "remember" what it has to do better.
With 12gb vram you might want to try mistral-nemo or even mistral-small. There's no reason for you to use anything smaller than that.
>>
File: 1698944580087538.png (12 KB, 314x212)
12 KB
12 KB PNG
>>102535739
Ah, sounds like the content overflow setting, I should rarely need more than even an 8k context window, including system prompt
>>
>>102535739
>mistral-nemo or even mistral-small
will give it a shot, thanks! any reccs on quants or should I just keep going for basically the largest I can fit inside my vram? also I'm not sure if I'm seeing better results with flash attention or not, not sure if you have any strong feelings about it
>>
>>102535789
>or should I just keep going for basically the largest I can fit inside my vram?
That's the rule of thumb.
Flash attention shouldn't hurt the quality of the output, although the results may not be bit-identical to running without it.
Is LM Studio yet another wrapper around llama.cpp?
Because if so, consider keeping a portion of the model's layers (10%~15%) in your RAM so that you can run larger models.
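If LM Studio does sit on top of llama.cpp, the same partial offload can be done with the raw CLI. A sketch, where the model path and layer count are just examples (a 70B has 80 layers, so -ngl 70 leaves roughly 12% of them in system RAM):

```shell
# Offload 70 of the model's layers to VRAM and keep the rest in system RAM;
# -fa enables flash attention, -c sets the context size in tokens.
./llama-cli -m models/llama-3-70b-q4_k_m.gguf -ngl 70 -fa -c 8192 \
    -p "Hello"
```

Lower -ngl until the model fits; generation gets slower the more layers stay on the CPU.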
>>
>>102535648
there were some emotion control vector things someone implemented a long time ago, but I haven't heard much about it since, so I guess it wasn't that useful.
It's still in llama-cli if you want to play with it
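For reference, llama.cpp still exposes control vectors through CLI flags; the vector file name, scale, and layer range below are made-up examples:

```shell
# Apply a control vector at reduced strength, restricted to the mid layers.
./llama-cli -m model.gguf \
    --control-vector-scaled happy.gguf 0.8 \
    --control-vector-layer-range 14 26 \
    -p "How are you feeling today?"
```

Plain --control-vector applies the file at full strength; stacking several is allowed.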
>>
>>102535535
>>102535557
There are a few schizos who lurk this thread looking for things they don't like because they read bad things about them on Reddit, and who attack anyone who says anything that isn't a full-throated criticism of them: ChatML, Ollama, LM Studio, Gemma, Qwen, Phi, etc. Just ignore them.
>>
File: proof.png (68 KB, 1649x611)
>>102535679
>>
>>102535574
Why would you take chances with your private data? Trust no one:
compile it yourself, and dead-end it on a loopback address with no routing. Put it behind a proxy a la https://rentry.org/IsolatedLinuxWebService
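A rough sketch of the compile-it-yourself, loopback-only setup (paths and port are illustrative; see the rentry for the full proxy configuration):

```shell
# Build llama.cpp from source and bind the server strictly to loopback,
# so nothing off-machine can reach it except through the proxy in front.
git clone https://github.com/ggerganov/llama.cpp
cmake -S llama.cpp -B build && cmake --build build --config Release
./build/bin/llama-server -m model.gguf --host 127.0.0.1 --port 8080
```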
>>
>>102535816
>Is lmstudio yet another wraper around llama.cpp?
I think originally yes, but it can handle all sorts of models nowadays: auto-downloading from HF, managing chats (not sure if they've added LLaVA support to the UI yet), configs, settings, calculating what will fit, running local API servers, text embeddings, etc. It's basically just a nice free GUI for managing all the various crap involved in switching around LLMs
>Because if so, consider having a couple of the models layers (10%~15%) in your RAM so that you can use larger models.
It handles partial offload automatically, yeah; I've loaded 35-40+gb models and they're just a bit slower but otherwise work alright -- if that's the case, is there a particular 70B model you'd recommend trying out?
>>
>>102534984

Use the --debug argument when preprocessing the dataset in axolotl

Or --debug text only, or something like that
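Assuming a standard axolotl install, that would look something like this (the config file name is an example):

```shell
# Preprocess the dataset and dump tokenized samples so you can eyeball
# the prompt template and label masking before training.
python -m axolotl.cli.preprocess config.yml --debug
```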
>>
This is the big Gemini update they teased.
https://developers.googleblog.com/en/updated-production-ready-gemini-models-reduced-15-pro-pricing-increased-rate-limits-and-more/
>>
>>102535895
>Since the first launch of Gemini in December of 2023, building a safe and reliable model has been a key focus. With the latest versions of Gemini (-002 models), we’ve made improvements to the model's ability to follow user instructions while balancing safety. We will continue to offer a suite of safety filters that developers may apply to Google’s models. For the models released today, the filters will not be applied by default so that developers can determine the configuration best suited for their use case.
interdesting
>>
>>102535928
Are we shifting back to the good timeline?
>>
>>102535895
>>102535928
not local
>go back
>>
>>102535895
No one fucking cares about Google's hosted models. We know they'll just rugpull their users like they always do.
>>
>>102535953
Benefits will trickle down to Gemma
>>
>>102535595
Anon, I...
>>
>>102535928
>Corpos releasing unaligned models to get an edge over competition.
You knew it would happen eventually.
>>
File: Untitled.png (13 KB, 837x513)
>>102535977
>>102535977
>>102535977
>>
>>102535973
or not


