/g/ - Technology


File: file.png (2.65 MB, 1024x1024)
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>101757601 & >>101749053

►News
>(08/05) vLLM GGUF loading support merged: https://github.com/vllm-project/vllm/pull/5191
>(07/31) Gemma 2 2B, ShieldGemma, and Gemma Scope: https://developers.googleblog.com/en/smaller-safer-more-transparent-advancing-responsible-ai-with-gemma
>(07/27) Llama 3.1 rope scaling merged: https://github.com/ggerganov/llama.cpp/pull/8676
>(07/26) Cyberagent releases Japanese fine-tune model: https://hf.co/cyberagent/Llama-3.1-70B-Japanese-Instruct-2407
>(07/25) BAAI & TeleAI release 1T parameter model: https://hf.co/CofeAI/Tele-FLM-1T

►News Archive: https://rentry.org/lmg-news-archive
►FAQ: https://wikia.schneedc.com
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Programming: https://hf.co/spaces/bigcode/bigcode-models-leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
>>
File: threadrecap.png (1.48 MB, 1536x1536)
►Recent Highlights from the Previous Thread: >>101757601

--OpenAI cuts GPT-4o price amidst competition from Anthropic: >>101760015 >>101760089 >>101760156 >>101760127 >>101760161 >>101760420 >>101760540 >>101760702 >>101760777 >>101760661
--How to prompt LLM to call external APIs using function calling: >>101762712 >>101764500 >>101764831 >>101764932
--Anon generates responses for lmsys_chat_1m_clean dataset with GPT-4 and Claude 3.5 Sonnet: >>101762642
--Anon discusses 405B model and providers, with some anons preferring local models and criticizing Together's prices and reputation: >>101759055 >>101759065 >>101759099 >>101759133 >>101759275 >>101759284 >>101759434 >>101759474 >>101759515 >>101759573 >>101759612 >>101759717 >>101759903 >>101760068 >>101759548 >>101759777 >>101759863 >>101759929 >>101760063 >>101760095 >>101760112 >>101760225 >>101760114
--OpenAI used muP before others, possibly related to µTransfer technique: >>101763343 >>101763361
--Lambda GPU Cloud pricing discussion: >>101763607 >>101764189 >>101764197
--Idefics3-8B-Llama3 model supports multimodal tasks: >>101766156 >>101766174 >>101766250
--FLUX.1 video model and AI development discussion: >>101757726 >>101757742 >>101757804 >>101757844 >>101757810 >>101757849 >>101757943 >>101759880 >>101760044 >>101760092 >>101763158 >>101760094
--Dan Hendrycks trying to push finetuning resistant method as law mandated: >>101760368 >>101760390 >>101760444 >>101760613
--8b draft spec decoding performance and small vs large model discussion: >>101764345 >>101764353 >>101764408
--ZLUDA project taken down, to be rebuilt from pre-AMD version: >>101759347
--Lambda's 8x H100 deal is cheaper than RunPod: >>101763416 >>101763594 >>101763596 >>101763626 >>101763629
--Anons discuss 405B and Mistral Large models on openrouter: >>101758409 >>101758472 >>101758750 >>101758846 >>101758785 >>101758989
--Miku (free space): >>101761322 >>101764185 >>101765085

►Recent Highlight Posts from the Previous Thread: >>101757610
>>
what is the best programming model that can fit in 12gb of vram?
>>
>>101767160
gemmasutra 2b
>>
>>101767164
>2b
in the trash it goes
>>
https://www.cerebras.net/cerebras-customer-spotlight-overview/spotlight-aleph-alpha/
>>
>>101767259
Fine.. fine... what about it?
>>
>>101767164
Can you shut the fuck up about that amateur clown? His models are shit
>>
>>101767356
>His models are shit
proof?
>>
>>101767356
It was a joke, anon, jesus fuck... remember when people recommended phi2? remember when people recommended tinyllama? no? oh....
>>
>>101767356
hi sao
>>
File: girl_infront_of_houses.jpg (235 KB, 1075x717)
So, what are you guys doing that warrants a local model?
>>
>>101767379
c.ai doesn't let me coom properly and no fucking way i'm paying for tokens lmao
>>
>>101767379
Reliability and principle. Once a model is released, it cannot be made worse than at launch. And i don't like online services. I like owning what i have. As for the use, i just find them interesting.
>>
>>101767379
grooming a 14 year old
>>
>>101767456
Your wife is spilling out the beans on reddit!!!
>>
File: how-to-adjust-flame.png (381 KB, 512x512)
https://stovemastery.com/what-causes-a-gas-stove-to-explode/
does anyone know what model writes these? I want to rp with it
>>
>>101767456
just read https://vndb.org/v415/ instead of shitty aislop
>>
>>101767497
3.5 turbo?
>>
>>101767160
Nemo or mini-magnum. You'll have some of it on regular ram probably but it should run great.
>>
>>101767555
>use this finetune trained on rp logs for coding
>>
>>101767356
This. We should be talking more about InternLM 2.5 20B instead. This model beats Gemma 2 27B and comes really close to Llama 3.1 70B in a bunch of benchmarks. 64.7 on MATH 0-shot is absolutely insane; 3.5 Sonnet gets just 71.1. And with 8-bit quants, you should be able to fit it on a 4090.
>>
>>101767593
I approve of this post
>>
>>101767589
Missed the programming part. My apologies.
>>
Thank you
>>
>>101767593
buy an ad
>>
>>101767379
holy shit when I looked at the thumbnail of this I could have sworn I saw the words "child pussy" but then I zoomed in and it was normal. you guys saw it too right
>>
>>101767715
No. You're paranoid. It's a perfectly normal picture.
>>
>>101767715
Absolutely DISGUSTING!!!! This should be deleted IMMEDIATELY, whoever posted this must be sick and depraved! God Bless America
>>
>every model i don't like (aka i can't run it because i'm a vramlet) is a shill post
>>
>feeding
>>
File: 1708228248091609.png (20 KB, 522x226)
>>101767379
I can't imagine being a poorfag and having to deal with retarded shit like this
>>
>>101755678
Alright, okay. So, dolphin-2.9.3-mistral-nemo-12b.
During my test battery, there's a moment where I ask the model to create a list of people then in the next message I ask it to create a lewd story featuring that character.
Celeste (1.6) and mini-magnum did stellar at that point, with other models failing to follow the prompt or to make a good story. dolphin so far seems to fall in the latter case. It writes the story but it defaults to keeping it short, doesn't write much detail, and it's extremely evasive with sex stuff.
I'll try prompting around it and see if I can extract good results in that aspect before continuing.
One good thing about it is that I don't have to cheat by adding "Sure, {{user}}", "Continuing", etc at the end of my prefill with it.
Other models would go slightly schizo or have a large incidence of not following on user's last message.
>>
File: ComfyUI_05567_.png (1.19 MB, 720x1280)
Multiple proxies down, you know what that means....
>>
Stop using Assistant.
>>
>>101768091
but she likes it
>>
File: GOaNiSVaIAAIIyS.jpg (101 KB, 1170x981)
https://x.com/sama/status/1821207141635780938

happening
>>
>>101768137
>another self-mythologizing tease
yawn
call me when openai actually delivers something substantial again
>>
File: 1721611039411706.gif (235 KB, 500x500)
>>101766139
>Using crunchdog as a way to say current models have soul
Crunchdog is just an extremely funny card, it'd probably be soulful on any model. That doesn't mean that sex and love shit will be, which is what's been lacking in soul.
>>
What model is good to help me try to learn programming?
Is it qwen2?
I can run 30b models in slow mode btw
>>
>>101768668
try codestral 22b
>>
Anybody have a model that they like for 24GB vramlets?
>>
>>101768701
sorry, this thread is for people who locally run Llama 405B only
>>
>>101768693
Downloading it right now
>>
>>101768771
literally kill yourself
>>
>>101768793
It's just a prank, bro!
>>
>>101768793
Figuratively chill down
>>
>>101768137
I hate that fucking faggot with every fiber of my being. I wish Anthropic would have hit the scene first. 99.9 percent of the population only knows ChatGPT
>>
>>101768832
If anthropic hit the scene first we would have bezos posting cryptic pictures of legumes instead.
Choice is an illusion.
Local is the only way.
>>
>>101767443
Boomer sensibilities. Nobody cares about ownership of tech slop except old people (You)
>>
"Als mir klar wurde, dass Intelligenz simuliert werden kann, habe ich die Idee unserer Einzigartigkeit aufgegeben"
>>
>>101767593
wtf? why do you keep shilling this garbage? fuck off and die
>>
>>101768701
https://huggingface.co/TheDrummer/Big-Tiger-Gemma-27B-v1-GGUF
makes child rape stories and doesnt refuse
>>
>>101768923
>Nobody cares about ownership of tech slop except old people (You)
ok
>>
>>101768137
journalists must be slobbering all over his boots after that post
>>
>>101768137
Yann Lecun said research isn't secret. If OpenAI ships proto-AGI he should retire
>>
>>101768832
He and the other entrepreneurs ultimately sold the software company, which developed apps for Android and iOS that let users selectively share their location with other people, to Green Dot Corporation in 2012 for $43.4 million
47.555427053884706, 7.606273996838664
>>
Tried out Celeste 1.6 yesterday at 60k context.

I'm thinking it's kino
>>
>>101769043
ad
>>
File: 1647399149433.jpg (292 KB, 1027x1273)
Is there an ElevenLabs tier voice cloning model yet?
>>
>>101768944
Checked. That's more than I need it for, but a decent baseline for model freedom.
>>
>>101769141
Nothing changed since yesterday. Ask again tomorrow.
>>
>>101768944
I'm not into rp but an uncucked gemma2 27b would be nice.
>>
>>101769043
It's definitely better than 1.9 in my experience.
>>
File: 1720869141748493.jpg (94 KB, 875x916)
>>101767112
>>
>>101767715
get your mind out of the gutter
>>
nu cum wen?
>>
>More questions?
>Contact me via Discord or ask on the /img/
>/img/
Are the magnum shills unable to spell the name of the general right?
https://rentry.org/MagnumProxy#more-questions
>>101769766
>>
File: 1722778333267421.png (60 KB, 640x640)
>>101769843
Lol, my bad anon! Thank u for highlighting that
>>
new 8b sota just dropped: https://huggingface.co/LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct

>4.2 Output: All rights, title, and interest in and to the Output generated by the Model, whether in its original form or modified, are and shall remain the exclusive property of the Licensor.
>>
File: file.png (29 KB, 718x256)
>>101769935
HOLY SHEET
>>
>>101769935
>>101769953
>Weird custom architecture 4k context
holy sheet indeed.
>>
>>101769953
>>101769991
46.8 on arena hard is llama 70b-tier
>>
>>101769935
>>101769953
>korean
biggest liars after chinese
>>
>>101769953
wait even LG is jumping in on this now?
Few more weeks and Hot Pockets will be releasing open source foundational models at this rate.
>>
>>101770048
>biggest liars after chinese
uhm source? isn't this literally the first korean llm?
>>
>>101770112
isn't the solar team korean? though I guess theirs was more of a continued pretrain
>>
>>101770112
Not to mention even if it is Korean it's coming from a reputable electronics firm. It's not like half the chinese models that come from literally who startups.
>>
>another model release
>another American melty
How do we solve their insecurity problem?
>>
Exaone instruct template off of tokenizer_config.json for people who don't want to submit to the model gate conditions
[CODE]"chat_template": "{% for message in messages %}{% if loop.first and message['role'] != 'system' %}{{ '[|system|][|endofturn|]\n' }}{% endif %}{{ '[|' + message['role'] + '|]' + message['content'] }}{% if message['role'] == 'user' %}{{ '\n' }}{% else %}{{ '[|endofturn|]\n' }}{% endif %}{% endfor %}{% if add_generation_prompt %}{{ '[|assistant|]' }}{% endif %}",[/CODE]
>>
>>101770154
Here's hoping that the data is primarily from English sources.
>>
>>101770199
how do make code block on /g/ without looking like retard plzhalp
>>
>>101770048
No. We are back. The LK99 of LLMs is here.
>>
>>101770199
Okay, so
>[|assistant/user/system|][|endofturn|]
If those are actual special tokens rather than being tokenized as strings, then alright. Better than fucking mistral's.
>>
been away for a few months, is stheno is the best bang for buck for 12gb VRAM?
>>
>>101770257
no, gemmasutra 2b is better nowadays
>>
>>101770257
It never was. Buy an ad, shill.
>>
>>101770257
Yes, kind of. Depends on your taste, but I'd tell you to also try nemo-instruct and some of its fine tunes like celeste 1.6, mini magnum, and dolphin.
There's also Gemma 9b, but I never gave that one a proper try.
>>
Drummer mindbroke the local schizo
>>
>>101770248
Later mistral releases did actually add an [INST] and [/INST] token.
>>
>>101770275
tell me what's better then faggot
>>
Gemma 27b is still the only local model that can write javascript without semicolons, it's fucking infuriating. I want to like magnum 32b, but it just ignores instructions

>>101770213
[CODE][CODE][/CODE][/CODE]
>>
>>101770213
I don't know if i can nest code tags.
Let's see what it looks like in the post.

>>
>>101770296
True. Should have mentioned that.
>>
>>101770276
sweet I'll try all of those. thanks for not being a schizo like that other retard
>>
>>101770326
kek. that didn't quite work.
Escaping them?
\
like this?
\

>>
>>101770269
I can go bigger than that if there are gains to be had
>>
File: nala exaone.png (158 KB, 936x532)
Nala test for Exaone 3.0 7.8B
This is definitely promising.
There's even new slop that we've never seen before. Like "tingles through your muscles"
>>
>>101770519
holy formatting
>>
>>101770519
Imagine if they find-replaced shivers down your spine with that.
Would be hilarious.
The broken formatting is bad, but the text itself is pretty okay considering its size.
A shame about 4 fucking k context, but I suppose it could be used to generate "un-sloped" data to train other models with.
>>
are there any c.ai alternatives which can be self/locally hosted? if not, maybe we could build one.
>>
>>101770519
Spatial reasoning seems all over the place tho
>>
>>101770562
Hey, if you give me a week and a couple of H100s I can train a pretty convincing simulacrum for you.
>>
there was some random gemma model on the anthracite hf page today anyone have a reup?
>>
>>101770565
>>101770561
>>101770551
there might be eot token issues right now with my current half-year-old sillytavern setup. I literally just threw together a template and started messing around with it as soon as I found out about it.
>>
>>101770614
Fair. The way it's breaking format does look like an issue with the template, BOS/EOS tokens, etc.
>>
>>101770627
The way it's breaking the formatting is because whatever shitty old version of ST I'm using omits the last character for some fucking reason. So it always ends with an unpunctuated sentence or an unclosed asterisk or unfinished quote.
>>
Maybe the times are changing but I'm not liking magnum-12b-v2. Ironically it's too coom-brained. I found myself switching to Nemo mid-session and having to wrangle it way less. It's a shame cuz I really liked mini-magnum and magnum 72b.
>>
>>101770761
There's a 12b v2?
Gotta download it I guess.
>>
>>101770519
gemmasutra 2b is better
>>
>>101770761
That's a shame. I already felt like Nemo was too horny of a model.
>>
File: sportsballt1.1.png (216 KB, 925x624)
Hmm very interesting.
It will stay relatively coherent above simple t=1.0 but gets a little schizo, but if you give it something surrealistic to describe it does pretty well. This is at t=1.1
With meme samplers it might be even better.
>>
File: file.png (2.7 MB, 1024x1024)
>>
>>101770809
Only ever tried 2b-it but it's still pretty good. If I knew someone with a lol computer wanting to try erp out I would probably recommend a 2b gemma model.
Exaone is not bad though. It's got some slop unique to itself, which means its dataset has some things that other models' datasets do not. Which from a tinkerer's perspective makes it interesting. But I wouldn't tell someone to go and ERP with it. 4k context is kind of sad, and the default RoPE settings in the config don't work, so someone will have to feel it out if they want to try and extend the context.
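If anyone wants to try stretching it anyway, here's a hypothetical sketch via llama-cpp-python (the scale value is a guess to illustrate the knob, not a known-good setting, and the quant filename is made up):
[CODE]
from llama_cpp import Llama

llm = Llama(
    model_path="exaone-3.0-7.8b-instruct.Q8_0.gguf",  # hypothetical quant filename
    n_ctx=8192,           # past the native 4k
    rope_freq_scale=0.5,  # linear RoPE scaling: 4k / 0.5 = 8k effective
)
[/CODE]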
>>
File: exaonestrawberry.png (10 KB, 849x181)
holy shit exaone beats gpt4|o at the strawberry test.
>>
>>101770977
Cosmic Miku looks tired.
>>
>>101771076
I'd love to see how it tokenizes the word.
>>
>>101770170
I trust a random 4chan post saying a model is good or bad more than any benchmark. And I don't trust 4chan posts about a model being good or bad.
>>
>>101770809
race to the absolute bottom
>>
>>101771219
in both English and Korean.
There could be some weird interplay between the korean word/tokenization for strawberry and the English one that affords it a workaround.
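Easy enough to check yourself; a minimal sketch, assuming the tokenizer loads with stock transformers (the repo is gated, so you may need to accept the license first):
[CODE]
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct")
for word in ["strawberry", "딸기"]:  # English and Korean
    ids = tok.encode(word, add_special_tokens=False)
    print(word, ids, tok.convert_ids_to_tokens(ids))
[/CODE]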
>>
>>101768923
Slave level thinking
>>
>>101769935
>https://huggingface.co/LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct
>Max sequence length 4,096
:-)
>>
>>101768923
This
>>
eta on qlora-pipe exaone support?
>>
error 400 from both suno and udio right now, weird.
anti-ai folks having a melty or something?
>>
File: sovl_.png (132 KB, 604x515)
>hurr durr new models are woke and sloppy
You just don't use good tunes, picrel is llama merge I'm testing right now. So far I like the prose and it looks really promising.
>>
>>101771752
Reading that sent a shiver down my spine
>>
>>101771752
>mfw reading this
Kek, good bait
>>
>>101771752
Unironically pls post some good recs
>>
>>101771752
That's amazing.
It even has journeys.
I can't remember the last time I've unironically seen journeys.
Plenty of bonds however.
>>
>>101771435
yeah, so tired of this... 8k is barely enough to coom...
>>
>>101771776
>>101771781
>>101771788
>>101771831
guess the model
>>
>>101771887
>guess the model
Mytho
>>101765965
>Wanna hear something funny? You were pissing me off so I decided to false-flag logs here, posting mythomax logs and calling it some other model (gemma, llama, mistral). You have no idea how much laugh I got when you were whining how sloppy it was and there are no good models anymore because local peaked at mythomax (kek).
>>
>>101771887
NOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO
>>
File: mythomax.png (85 KB, 657x350)
>>101771905
>Mytho
ding ding ding
>>
>>101771752
>"And who knows? Maybe XYZ
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA I HAVE SEEN THIS LLAMA3ISM WAY TOO MANY TIMES BY NOW AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
>>
>>101771938
>LLAMA3ISM
but is mytho l2 doebit
>>
>>101771947
mytho is llama merge
>>
>>101771752
This isn't bad. This is how most of the greatest writers in the world write
>>
>>101771932
gemmasutra 2b is INDEED better
>>
>>101771956
yes but not llama3
>>
>>101771932
Do xwin-mlewd too.
That was my favorite.
>>
>>101771959
the instant switch xD
>>
>>101771932
Myothmax is still king
>>
>>101771995
king of slop
>>
>>101772003
King of sovl, but I do wish for an update, ngl.
>>
>rename 2b to model of your choosing
>edit entire reply into gptslop
wow every model is bad local is dead
>>
>>101771980
unfortunately I don't have it on my hard drive
>>
>>101771947
It is not mytho. And who knows is distinctly l3.
>>
File: meds.png (241 KB, 512x497)
>>101772019
>>
>nobody would EVER troll on /lmg/
>>
>>101772034
nice copium but it is mytho, you can recognize it easily by the purple prose
>>
>>101772049
>trolling by saying an old model is bad
purpose?
if anything he should shat on nemos if he wanted to cause true chaos like say magnum mini 2 is big sloppy
>>
>>101772071
the most sucked off model in the history of /lmg/ is just some old model? people already agree nemo is fucking retarded
>>
>>101772088
>the most sucked off model in the history of /lmg/ is just some old model?
yes? it is and was always slop even if it was the best slop of its size for its time
>>
>>101772088
From my point of view people who say mythomax was any good are trolling. They were just impressed by the vocabulary, but beside that mythomax was stupid, quite often incoherent and had a lot of slop and cliche.
>>
>>101772049
I'm just sick of people shilling their useless models
>>
>>101772088
Name 1 L2 merge that writes as well as Nemo
>>
Here's a fun dumb thing for you guys to try.
Temp 2, TopK 2 (yep) minP 0.05 just in case.
See how you end up sampling the top 2 tokens with almost 50/50 odds most of the time.
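If you'd rather see the numbers than eyeball swipes, here's the back-of-the-envelope version (plain softmax math; ignores whatever order your backend applies samplers in):
[CODE]
import math

def top2_dist(logits, temp=2.0):
    # keep the top 2 logits, apply temperature, renormalize
    a, b = sorted(logits, reverse=True)[:2]
    pa, pb = math.exp(a / temp), math.exp(b / temp)
    return pa / (pa + pb), pb / (pa + pb)

# raw logits one nat apart: temp 2 squashes the gap to roughly a coin flip
print(top2_dist([10.0, 9.0]))  # (0.622..., 0.377...)
[/CODE]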
>>
>>101772071
There is no such thing as mini-magnum 2.
>>
>>101772140
there is none, but just because one shit smells less than the other doesn't mean they aren't both shit
>>
>>101772140
Mythalion-Kimiko is pretty good. The older models in general are less slopped because they hadn't yet reached the level of intelligence required to notice the slop-web in human creative-writing. But they're also less intelligent.
Nemo blows any L2 model out of the water as far as conceptual understanding goes, regardless of size. But it would be inaccurate to say that there weren't perfectly good L2-13B models for cooming to back in the day (which wasn't even that long ago in the grand scheme of things).
>>
>>101772156
https://huggingface.co/intervitens/mini-magnum-12b-v1.1
> New version is available! anthracite-org/magnum-12b-v2
close enough for me, i don't care about your exact branding
>>
i see no reason to use any model that isnt by mistral whether you have vram or not, nobody else is capable of making a good model right now
>>
File: graph.png (4 KB, 502x397)
>>101772212
i can see one
i can see one
i can see one
>>
retards in this shithole can't even write a prompt or change samplers but will spend good money to run shitquants of large and say it's the models fault
>>
>>101772203
It's just not the same T-T
>>
>>101771932
Fake news. MythoGAWD never wrote like this.
>>
>>101772254
>retards in this shithole can't even write a prompt or change samplers
they can't even have the model answer >>101749214
>>
>>101772231
this. While I kinda like mistral models, the repetition issue is terrible. And don't say "just use rep penalty": firstly, it doesn't always work as it should, and secondly, it lobotomizes the model. You basically force the model to not use tokens it wants to use; sure, it can look semi-coherent, but the intelligence hit is visible.
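For reference, the standard penalty most backends implement really is that blunt. A minimal sketch of the usual CTRL-style formula:
[CODE]
def apply_rep_penalty(logits, seen_token_ids, penalty=1.2):
    # push down every token that already appeared, regardless of
    # whether repeating it would actually be correct here
    out = list(logits)
    for t in set(seen_token_ids):
        out[t] = out[t] / penalty if out[t] > 0 else out[t] * penalty
    return out

# the token at index 2 was used before, so its logit drops from 3.0 to 2.5
print(apply_rep_penalty([1.0, -0.5, 3.0], seen_token_ids=[2]))
[/CODE]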
>>
File: GS1bXAmbwAAoOV0.jpg (377 KB, 1400x1800)
>>
>>101772312
End of scene.
>>
Tess-3 (Mistral-Large-2-123B) and Trinity-2 (Codestral)

Dropping two new models today, before I fly out to Defcon.

Tess-3 on Mistral-Large-2-123B (General-LLM): https://huggingface.co/migtissera/Tess-3-Mistral-Large-2-123B

Trinity-2 on Codestral (Code-LLM):
https://huggingface.co/migtissera/Trinity-2-Codestral-22B

Both are uncensored. Codestral scores 78 on HumanEval.
>>
>>101772356
didn't he say like yesterday he wouldn't do sub 70bs anymore?
>>
https://philome.la/johnayliff/seedship/play/index.html
>>
>>101772373
There's no reason to do 70b+ anymore.
>>
https://new.reddit.com/r/StableDiffusion/comments/1emi1j9/opensource_amd_gpu_implementation_of_cuda_zluda/
>a based gentleman wanted to help AMD by making Cuda compatible with their cards
>AMD sent a ban notice to him
If that's not a sign that AMD is a controlled oposition, then I don't know what else to say
>>
>>101772065
>you can recognize it easily by the purple prose
What does purple prose mean?
>>
is Mistral great because of the pretrain or because of the finetune?
>>
>>101772518
mistral can't finetune for shit
see 8x22b vs wizlm
>>
>>101772513
>What does purple prose mean?
A literary term! "Purple prose" is a pejorative term used to describe writing that is overly elaborate, flowery, and excessively ornate. It's characterized by the use of overly complex vocabulary, convoluted sentence structures, and an abundance of adjectives and adverbs.
End of scene.
>>
>>101772549
So basically me, also why specifically purple?
>>
File: OEMhYF15BjZUc7S0nN-u7.png (78 KB, 989x590)
>>101772530
>see 8x22b vs wizlm
>Microsoft WizardLM-2-8x22B 11.7 %
>>
>>101772569
>also why specifically purple?
The term "purple" is thought to have originated from the idea that the writing is so elaborate and excessive that it's almost "royal" or "imperial" in its grandeur – much like the rich, regal color purple. However, in this context, the term is not meant to be complimentary, but rather to suggest that the writing is overly indulgent and self-aggrandizing.
>>
>>101772587
I didn't know they had a term for literature that describes me so well.
>>
>>101772646
based purplechad
>>
>>101772513
>What does purple prose mean?
In short - describing for the sake of describing. Have you ever tried to write an essay (for X words) and realized that you are short on words so you added a lot of useless fillers? Now make the fillers sound elaborate, melodramatic and hyperbolic - this is a recipe for purple prose.
>>
How would you measure that
>>
>>101772570
Hallucination is soul. See how no claude is on the list because they are too good and souful.
>>
>>101772570
>eval with no correlation whatsoever to RP quality
>>
>>101772718
>swiping endlessly is sovl
Fuck your sovl then, buddy. You can choke on it.
>>
>>101772718
>Hallucination is soul
this is why I refuse to use temp smaller than 4
>>
read a book on 1.3b
>>
>>101769953
where are the standard benches, though. All of these can be pretty easily gamed if you've trained on gpt4o outputs since style is really influential
>>
>>101772752
Every single time I have seen someone mention soul in this thread the example was of an LLM typing like a retard or like a schizo.
>>
I like Lyra. Donate to Sao's Ko-fi today.
>>
https://huggingface.co/nothingiisreal/L3.1-70B-Celeste-V0.1-BF16
70b Celeste
>>
>>101772752
Nice try, ChatGPT.
>>
>>101772970
>It seems to be way more coherent and aware of whats going on as well as more intelligent.
"12b mogs 70" copers btfo by their own sloptunnas you love see it
>>
>>101772919
For me "sovl" is the ability to write in non-cliche way. No patterns, no sentences and phrases that are excessively present in human writing. Also the ability of model to surprise me with their answer or the direction of plot they are taking.
There are local models that have a glimpse of sovl from time to time but there is none I would call sovful. The only model I unironically found sovful was old c.AI.
>>
>>101773042
Just up the temp to the point of incoherence. Or reduce context to 1k tokens.
>>
>>101773042
>old c.AI
beamsearch
>>
>>101772530
>see 8x22b vs wizlm
What? Both were shit. The only proof that it was better was Reddit's word of mouth impulsed by the mysticism of being taken down early.
>>
>>101772970
Sao is not going to like this...
>>
>>101772970
>>101773180
Hi lemmy
>>
Haven't checked in for a minute.

Is mini magnum 12b still the best RP model for people with single GPUs?
>>
>>101772970
buy an ad
>>
>>101773042
Agreed, old c.ai and I would also add summer dragon.
>>
>>101773221
celeste 123b mogs it
>>
>>101773221
That or the base Nemo Instruct 12b if you're looking for drier prose or non RP purposes.
>>
>>101773194
>>101772970
samefolx
>>
>>101773221
No, but also, 12B isn't in the ballpark of what you can run with a single GPU like a 4090. You can run anything below Nemotron with 340B parameters with the right quantization, so I think you might need to be more specific with your question.
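Napkin math for what "the right quantization" buys you; this sketch counts weights only (KV cache and buffers come on top, and anything past your VRAM spills to system RAM):
[CODE]
def weight_gb(params_b, bits_per_weight):
    # billions of params * bits per weight / 8 bytes ~ GB (GGUF adds some overhead)
    return params_b * bits_per_weight / 8

for params, bits in [(12, 8), (70, 4), (123, 3), (340, 2)]:
    print(f"{params}B @ {bits}-bit ~ {weight_gb(params, bits):.0f} GB")
# 12B @ 8-bit ~ 12 GB, 70B @ 4-bit ~ 35 GB, 123B @ 3-bit ~ 46 GB, 340B @ 2-bit ~ 85 GB
[/CODE]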
>>
>>101773247
celeste tends to spazz out with the descriptions way more
>>
>>101773331
>trust me, this finetune that doesn't actually exists is bad
>>
>>101773406
Fuck you
>>
Finetune the rust away
>>
>>101773406
nta but the 'brand' name alone is radioactive, no one serious about the craft will touch that with a 10 foot pole (3.048 meters for eurofags)
>>
>>101773327
I have a 4090 and everything i've tried has been pretty mid compared to Mini magnum in terms of 1-1 RP.

It gives me the most realistic feel out of the ones i've tried but i've not been here for around a week and i've hardly tried everything. What other ones are good at maintaining consistent dialogue, realistic convos that feel kinda natural?
>>
>>101773327
Maybe that anon has a 3060 or something like that, so 12B is the most he can comfortably run offloading all the layers to the gpu
>>
>>101773519
Buy an ad.
>>
>>101773327
No thanks, I'll take 128k context over 0.05 t/s responses even if I have to swipe a few times.
>>
>>101773534
he has a 4090 and even then, I still would recommend Magnum to him.

Everything else fucking sucks unironically
>>
>>101773653 (me)
my name is Alpin, btw
>>
>>101773653
>Magnum
72b, 32b, 12b or mini-magnum?
>>
>>101773672
All of them are the SOTA at their respective sizes. No one else can compete.
>>
File: goliathwatermelons.png (1.21 MB, 1024x1024)
>>101768137
A cheeky nod to the number of Rs in strawberry test that we post here all the time? What's next, a photo of sama holding watermelons?
>>
>>101773687
NTA but thank you for your recommendation. Magnum is amazing.
>>
>>101772402
What? Why?
>>
wrt function calling, it does depend on the specific model, but generally the raw prompt is going to include something like "You have access to the following functions" followed by the actual json list of functions formatted like OpenAI's would be in python.

https://huggingface.co/Trelis/Mistral-7B-Instruct-v0.1-function-calling-v3

Prompt Format here is a good start. I suggest Nemo or Mistral Large and explicitly ask for JSON responses.
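As a concrete sketch of what that raw prompt tends to look like (function name and schema made up for illustration; the exact wrapper text varies per model, so check the model card):
[CODE]
import json

functions = [{
    "name": "get_weather",  # hypothetical function
    "description": "Get the current weather for a city",
    "parameters": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}]

prompt = (
    "You have access to the following functions:\n"
    + json.dumps(functions, indent=2)
    + '\n\nTo call a function, respond with JSON only: {"name": ..., "arguments": {...}}\n\n'
    + "User: What's the weather in Paris?"
)
print(prompt)
[/CODE]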
>>
File: EzYl8b.png (101 KB, 756x838)
>8x48GB GPUs
>just lying around
>i guess it's enough for 32b?
What did Alpin mean by this?
>>
>>101773653
shill
>>101773665
it's Alpine, didn't she transition recently?
>>101773672
none of them
>>101773687
they're all shit like the people who defecated them
>>101773708
btfo, shill
>>
>>101773672
what one could a 4090 realistically run? Been using mini myself on the 4090 and it's pretty fucking good
>>
>>101773911
there's nothing better than magnum
>>
>>101773653
>>101773911
If I had a 4090 I'd probably be running CommandR.
I can run it off of ram but it's so dog slow.
>>
>>101773897
remember that next time you'll be asked to donate to fund the compute to train the models
what a bunch of frauds
>>
>>101773911
You should try mistral large.
>>
>Here's the 4-11 on...
Thanks dolphin 12b. I had never heard of that slang before.
>>
>>101768923
>noooo you can't be trusted with models stop evading my control over you
>>
>>101773932
and btw, that includes all the finetooooners associated with that organization
alpine is smuggling and has been smuggling free compute for them
>>
>>101773980
there's no shot it runs on my 4090 lmfao. I only have 32GB RAM too
>>101773929
I have it, I just struggle finding good settings online (temps etc).

So it doesn't really perform as good as it likely can
>>
>>101774159
>there's no shot it runs on my 4090 lmfao
Upgrade your ram then, you'd get way better speeds than I do with a 2070.
>>
>>101768923
Exactly
If I didn't care about it I wouldn't be here in this thread, unless I had some kind of mental disability like a few people I have seen in this place.
>>
>>101767379
do you like sam altman reading your chats?
captcha: N00T
>>
>>101774172
much total memory you got and what's your speeds like?
>>
Good morning /lmg/, any good 8B models out there?
>>
Redpill me on RPStew

I keep seeing it on reddit but nobody here mentions it
>>
>>101774330
I advise you stop reading reddit. What is RPStew?
>>
>>101774330
>I keep seeing it on reddit but nobody here mentions it
I'm sure you can figure out the reason on your own
>>
>>101774316
gemma 8b
>>
>>101774273
96gb, it starts at 1.2T/s but slows faster than I'd like. The slow speed is worth it to me though since it's pretty good.
>>
>>101774379
>1.2T/s
>slows down
are you into edging?
>>
>>101774357
>I keep seeing it on reddit but nobody here mentions it
>I'm sure you can figure out the reason on your own
Most shilling is done by the finetuners themselves. Reject the idea that number of mentions = good.
>>
>>101774424
can you redpill me on llama 3 8B finetunes
>>
I will now say something kinda obvious that helped my cooming experience. I have realized that the final stage of my fucked up fetish is too difficult for current LLMs. However, dialing it down a bit to a less complicated version has given me some very nice results that didn't require rerolls. Nemo seems like something that is finally good enough for this. Although I don't know how it will handle repeat sessions.
>>
>>101774424
Undi never shills his stuff here. His try it yourself method has captivated some sirs that now do it for him.
>>
>>101774424
>Reject the idea that number of mentions = good.
exactly, so ignore the reddit shills, shilling some random useless model
>>
>>101774479
hi sao. i was talking about you instead.
>>
>>101774462
wat fetish? vore or something?
>>
>>101774492
more like talking to voices in your head shizo
>>
Do you guys think Sao Drummer and Undi ERP with each other in some private discord?
>>
>>101774495
I am not telling. I am not becoming the next piss / stomach noises anon.
>>
>>101774504
no, they spend their time hacking into corpos and adding shivers down the spine to their training databases
>>
>>101774534
>hacking
>undi
anon please...
>>
>>101774521
I have it worse anon, my fetish is so niche that there are not even text materials by humans, so models are completely useless for that.
>>
File: file.png (449 KB, 636x350)
>>101774555
Do tell us more.
>>
>>101774539
he learned it to spread slop and poison corpo datasets to ensure that we never get unslopped model
>>
>>101774555
It is sex with minors, isn't it?
>>
>>101774570
Nah, but it's not something disgusting or very weird, just niche and to be fair it's fairly hard to represent it with text only.
I'm hoping that in a few years multimodal models with image generation will be my savior.
>>
>>101774589
no, I said niche.
>>
>>101774625
what body parts does it involve?
>>
>>101767112
Holy crap. I hadn’t looked in a while and now vast/runpod prices have totally cratered. The hype cycle is ending finally.
>>
File: file.png (1.68 MB, 1607x782)
>>101774757
>hype cycle is ending finally.
What are next?
>>
>>101773672
>>101773687
By what metrics? I like 70B and mini-magnum a lot but 30B and the new 12B were underwhelming.
>>
>>101774802
Hi Undi
>>
File: oh you.jpg (47 KB, 582x415)
>>101774729
I'm not saying shit anon
>>
>>101774757
Isn't that due to the fact that Blackwell is out and more H100s are out?
>>
>>101774828
so coprophilia?
>>
>>101767112
Where should I begin if I want to develop a personal AI assistant hosted on a local server?
>>
>>101774864
I said not disgusting, now you are trolling me so I will spill the beans.
>>
>>101774870
have you tried getting a job?
>>
>>101774859
The only new thing I think is the H100 NVL, but prices for 4090s are less than half of what they were at their peak.
>>
>>101774900
so? where are the beans, Lebovsky?
>>
>EXAONE
Should I make GGUFs or is it a nothingburger? Is >>101769935 an undercover LG employee?
>>
>>101775201
it's 4k context so we don't care
>>
>>101775201
As a curiosity to play around with it's alright.
Not really a replacement for anything we have now due to only having 4k context and struggling with some concepts we now expect 8B models to handle.
It also stays coherent at fairly high temperatures for its size. I only test things with simple sampling, so I can't say what that means when you apply meme samplers to it.
>>
>>101775214
>>101775225
I'm fine with 4K context if it's actually good compared to llama3
>>
>>101775239
>I'm fine with 4K context
no you're not
>>
>>101775239
8k was a pain with llama 3. 4k is downright insulting. What is this, 2023?
>>
>>101775246
I'm using 6208 max because higher gets too slow
>>
>>101775272
Now, I'm not a mathmagician, but I am pretty sure 6208 is more than 4096.
>>
>>101775313
yeah but "max" implies that I can cope with 4096. in fact, I only use 6208 for groups, 4096 is enough for single characters
>>
With Mistral Large 2, I'm concerned that this is turning into a full on addiction. Is 123b that much better than 70b, or does Mistral AI have that good of a dataset? It's the only Mistral model I've ever liked, actually. (Unless you count WizardLM 2 8x22)
>>
>>101775532
What 70b were you using?
>>
>>101773897
Which Discord server is this?
>>
>>101775532
Found any good largestral fine tune?
>>
>"You know, they say that for every inch below six, you might as well be missing a limb. And you, my dear {{user}}, are teetering on the edge of being a paraplegic, aren't you?"
I wasn't expecting this kind of sovl from my sph slopbot
>>
>>101775652
>teetering on the edge
Why the fuck are LLM texts so easily recognizable? Same goes for images
>>
>>101775667
Corporate influence, uncanny valley and overfitting.
>>
>>101775718
how do we solve this
>>
>>101775734
Just add a few more billion parameters. Llama 4 1700B will be a great success
>>
>>101775734
>Corporate influence
Don't tune on assistantslop. Don't filter "harmful" data out of the base model. Make a pure chat model like early c.AI.

>uncanny valley
Make model smarter. Easier said than done. While >>101775770 may work, it is a suboptimal approach.

>overfitting
Make a list of overused phrases and either filter them out or replace them with less common, but context-appropriate phrases.
>>
>>101775844
sadly no one here actually trains base models, and the people who do are more interested in benchmarks and scamming investors
>>
>>101775874
Takes massive amounts of capital to train a base model of any appreciable size in a reasonable timeframe.
>>
I got flux running but what are you supposed to do with it? There's no use for these images. Is that why random boards have dedicated AI slop threads?
>>
File: ComfyUI_00119_.png (333 KB, 512x512)
>>101776014
Have a glass of bees.
>>
Does anybody actually use the "story" format for their slop?
>>
>>101776014
gooning
>>
>>101776014
Use it alongside your text gen model to illustrate the scenes, of course.
>>
>>101776052
Sometimes I just let it run for a while and read the gems it has produced
>>
>>101776042
The fuck did bees over do to you? Fuck da wasps.
>>
>>101776101
Go drink a coke outside and you can recreate that picture.
>>
Anyone have a link to that comfy script/workflow that lets you offload the CLIP model onto a different GPU?
>>
File: ComfyUI_00185_.jpg (150 KB, 1024x1024)
>>101776014
>what are you supposed to do with it?
Make Mikus
>>
File: metal_albums.png (1.47 MB, 1204x747)
What are the best models/LORAs for creating the "80s metal album" aesthetic? I'm not sure if this actually has an art style by name. It doesn't necessarily have to actually be tailored towards album covers, this is just the best example. My goal is just to be able to generate art with this style consistently.
>>
anything I should be aware of since mistral large for coom rp?
>>
What's up with the sudden imagegen posts? Someone trying to troll again?
>>
Is the file size of a model a reliable indicator of how much memory it will require to load? I noticed with some other Llama 3.1 models, loading the model initially takes up 8GB of VRAM, then in task manager I can see more memory being allocated. Is the initial memory allocation the model itself? What is contained in the secondary memory allocation? Is that how models store context?
>>
>>101776151
here >>101689729
>>
>>101776228
Yes. Also, install Linux.
>>
>>101776179
>>>/g/ldg
>>
>>101776247
I think these are troll posts
>>
>>101776234
Thank you muchly, friend.
>>
>>101776244
Is the behavior I described unique to Windows? What would switching to Linux change? I'd love to, bit I think the system RAM fallback feature I need to run larger AI models is only available on the Windows Nvidia drivers
>>
>>101776277
Use GGUF and choose GPU layer count correctly.
>>
>>101776277
You can split between RAM and VRAM in GGUFs. Linux is simply better.
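And for the anon asking why memory keeps growing after load: the first allocation is the weights, the second is mostly the KV cache, which scales linearly with context. Rough sketch of the math, assuming an fp16 cache and Llama-3.1-8B-ish shapes (32 layers, 8 KV heads, head dim 128):
[CODE]
def kv_cache_gib(n_layers=32, n_kv_heads=8, head_dim=128, n_ctx=8192, bytes_per=2):
    # 2x for K and V; fp16 = 2 bytes per element
    return 2 * n_layers * n_kv_heads * head_dim * n_ctx * bytes_per / 2**30

print(kv_cache_gib())             # ~1.0 GiB at 8k context
print(kv_cache_gib(n_ctx=32768))  # ~4.0 GiB at 32k
[/CODE]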
>>
Are the getting started links in the OP up to date? I've been using koboldcpp with mythomax-l2-13b Q5_K_M for the past few months. Is there something better out there? Also, koboldcpp defaults to 200 gpu layers, but if I hover over it the suggested values are vastly different. Am I doing something wrong or is it correct? It works so far.
>>
>>101776356
Windows users can't use layers for the gpu? That sounds strange.
>>
File: file.png (2.61 MB, 1024x1024)
>>
>>101776461
Lol ofc they can. I just meant that Linux is better in general as an OS.
>>
File: 24-08-07 21-48-20 3047.jpg (2.79 MB, 4032x3024)
Made some Migu bumper stickers. I used chink-brand white toner off Aliexpress, it worked fine for like 1/4 the price. There's a little banding at the top of the page, but I read that hologram sticker material doesn't work super well in a laser printer. I set my printer to "label" mode, it helped. No idea how inkjet ink sticks to plastic. Fuck inkjets, I'm never going back to that bullshit.
>>
>>101776724
wait you're saying you printed those entirely at home? can you list your setup in a bit more detail?
>>
>>101776724
I like these Migus
>>
>>101776747
Yeah, at home. It looks impossible but the trick is buying a white toner cart for your printer, printing a "mask" in black and white on the hologram sticker, then swapping the black toner cart back in, and feeding the paper through again for regular color.
It helps immensely to start with an image that already has white outlines around stuff, since it hides the inevitable registration mistakes between the two printing passes. I asked bing/dall-e to make Migus with a white border. The white border also makes it easier to use the "magic select" tool for creating transparent areas in the color part, and a mask for the white part.
You can also buy a $3000 printer which does it in one pass. I used a $300 canon color laser.
>>
>>101776877
No but I mean what about the glue? Or will you have to apply that manually?
>>
>>101776750
I want to use this one because it's cute and also pantsu but bing fucked it up by cutting off the left side, it's going to take more gimp work than I feel like right now.
>>
>>101776888
It's sticker material. You peel off the back, it's self adhesive.
>>
>>101776923
>gimp
unexpectedly based
>>
Something like this comes out well with just a b&w laser printer. In sunlight the hologram material is very catchy.
>>
File: 1718373963212515.png (2.19 MB, 1152x1024)
>>101776923
>>
File: file.png (2.72 MB, 1024x1024)
spamming a few overnite imagegens cause we're at bump limit
>>
File: file.png (2.73 MB, 1024x1024)
>>
>>101777172
>we're at bump limit
huh?
>>
File: file.png (2.59 MB, 1024x1024)
>>
File: file.png (2.62 MB, 1024x1024)
>>101777183
>he doesnt know about the bump limit
>>
File: file.png (2.63 MB, 1024x1024)
>>
File: file.png (2.58 MB, 1024x1024)
>>
>>101777183
Look at this newfag and laugh
>>
>>101777172
>>101777180
>>101777190
>>101777210
>>101777255
>>101777270
all shit slop, why do you even bother with spamming the same fucking images?
>>
either blind, retarded, or both
>>
File: file.png (2.55 MB, 1024x1024)
>>101777281
look at this dumb bitch lol
>>
File: file.png (2.5 MB, 1024x1024)
>>
>>101777299
DEATH
>>
Chatbots?
>>
File: file.png (2.62 MB, 1024x1024)
>>101777352
some of them come out really fucked up, AI is weird
>>
File: file.png (2.61 MB, 1024x1024)
>>
File: ComfyUI_00004_.png (1.01 MB, 768x1024)
>>
File: file.png (2.7 MB, 1024x1024)
>>
File: file.png (2.55 MB, 1024x1024)
>>
>>101777537
are you trying to fill the thread before it archives?
>>
>>101777844
yeah he thinks his slop ai genned "art" is worth something
>>
>>101777844
anon when a thread hits bump limit it doesnt matter how much you spam it afterwards, it just doesn't bump any more. anyone who doesnt understand this is new
>>
>>101777880
so what's the point on doing this, it's just retarded flooding spam
>>
>>101777886
if you dont like it, you can look away
>>
>>101777893
it breaks 4chan rules though
>>
>>101777897
so report it and see what happens
>>
>>101777909
I know that jannies don't care about AI threads on /g/ (especially aicg, but other AI threads too), that doesn't mean it's allowed.
>>
>>101777918
neither is trolling, retard-kun
>>
>>101777918
my friend, i don't think you understand. the thread hit the bump limit. posting doesn't affect the board any more. the thread is essentially dead now. imagedumping isn't even technically against rules even when a thread is live
>>
File: gemma2_9b.jpg (330 KB, 1167x1392)
>9b model same intelligence as gpt-4
how possible
>>
>>101778034
lmsys doesn't test intelligence, it tests human preference. Learn the difference already, for fuck's sake.
>>
>>101778034
>dude gemma 2b totally beats mixtral 8x7b dude I saw it on the arena
>>
>>101778094
t. butthurt ai
>>
>>101778094
Anything could beat mixtral 8x7b.
>>
>>101778119
>>101778247
Hello google sirs
>>
>>101773672
32b v2 is probably the best I've seen for RP. This is surprising, because I wasn't impressed with 72b v1 at all. (I'm hoping 72b v2 arrives soon.)
Mistral Large 2 is definitely more nuanced and natural, and I prefer that overall. However, Magnum 32b v2 is slightly better imo due to the dataset and its instruction following, which is insane for a 32b model. It doesn't repeat itself either. Again, this is only RP. I haven't tried it for anything else yet.
>>
>>101778322
>32b is better than Mistral Large
Hi Alpin.
>>
>>101778034
>Brainless parrots ask the same questions on lmsys over and over again
.>put the short list in the data set
>Get a massive advantage over those that don't.
>>
>>101778328
>>101778328
>>101778328
>>
>>101772231
Show me a model that doesn't repeat at long ctx


