/lmg/ - a general dedicated to the discussion and development of local language models.

Video Star Edition

Previous threads: >>102100845 & >>102086459

►News
>(08/27) CogVideoX-5B, diffusion transformer text-to-video model: https://hf.co/THUDM/CogVideoX-5b
>(08/22) Jamba 1.5: 52B & 398B MoE: https://hf.co/collections/ai21labs/jamba-15-66c44befa474a917fcf55251
>(08/20) Microsoft's Phi-3.5 released: mini+MoE+vision: https://hf.co/microsoft/Phi-3.5-MoE-instruct
>(08/16) MiniCPM-V-2.6 support merged: https://github.com/ggerganov/llama.cpp/pull/8967
>(08/15) Hermes 3 released, full finetunes of Llama 3.1 base: https://hf.co/collections/NousResearch/hermes-3-66bd6c01399b14b08fe335ea

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Programming: https://hf.co/spaces/mike-ravkine/can-ai-code-results

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
►Recent Highlights from the Previous Thread: >>102100845

--Anon tries to get a model to generate a 3D surface plot with hidden line elimination using pygame: >>102109370 >>102109482 >>102111255 >>102111332 >>102111578 >>102111756 >>102111900 >>102112096 >>102112313 >>102112521 >>102112289 >>102112404 >>102112450 >>102112553 >>102112867
--Anons test and discuss CogVideoX-5B Chinese video model: >>102109635 >>102110130 >>102109935 >>102110004 >>102110297 >>102110664 >>102110059 >>102110462 >>102110483 >>102110669 >>102110818
--Small models can be just as good as big models with time-based comparison: >>102108920 >>102108975 >>102109008
--Running Mistral Large on M3 max 128GB and comparing MLX to llama.cpp: >>102106079 >>102106971 >>102106998 >>102107021 >>102106664 >>102106969
--RX 6600 can be used for AI with limitations and workarounds: >>102110948 >>102111016 >>102111028
--Llama.cpp has become too stagnant and rigid, prioritizing stability over innovation: >>102108609 >>102108747 >>102108879 >>102108977 >>102109012 >>102109139 >>102109187 >>102112652 >>102112694 >>102112714 >>102112718 >>102112707 >>102112719
--Disabling "Always add character's name to prompt" improves mistral model output: >>102101585 >>102101670 >>102103873 >>102108885 >>102102041 >>102102117
--Simple TAGS in assistant prefix works for mistral-nemo DND: >>102108870
--Gemini-1.5 Flash-8b outperforms gemma-2-9b and matches llama-3-70b levels: >>102113570 >>102113635 >>102113704 >>102113847 >>102113726 >>102113791 >>102113931 >>102113991
--Command-r issue with <|END_OF_TURN_TOKEN|> token: >>102109477 >>102109510 >>102109542 >>102109570 >>102109608 >>102109763 >>102110008
--Anon seeks self-hosted text summarization solutions for large inputs: >>102110299
--Miku (free space): >>102101144 >>102101521 >>102104371 >>102107950 >>102101071 >>102110126 >>102111378 >>102111425 >>102112212 >>102112563

►Recent Highlight Posts from the Previous Thread: >>102100849
Mikulove
This fall.
>>102114289
2 more weeks

>>102114092
>5B videogen model knows the melons can't all fit in her hands
>405B textgen model still can't wrap its head around it.
Any of y'all use Arc? I was thinking about picking up an A770
>>102114085
>>102114092
It's still Thursday :(
>>102114492it's wednesday
Taking a break from LLMs can be fun too. I'm playing some old Castlevania games because of the Konami rereleases. Home.
>get crusty old bluetooth controller from the drawer, which I remember had to use some special program to emulate an xbox controller or something on Windows
>pair it with Linux
>it literally just starts werking immediately with the game, nothing else needed
Damn I love Linux.

>>102114187
https://www.youtube.com/watch?v=NocXEwsJGOQ
fucking NogX5B won't do NSFW
>>102114617
All rise for her glorious hymn.

>>102114187
https://rentry.org/the-lmg-miku-myth

>>102114636
Even when run locally?

>>102114636
It can barely W. But I like that there are things like that being released. Not everything is for you.

>>102114679
Have you seen the outputs? Unless it's for comedic effect, I doubt you actually want to see porn made with it.

>>102114658
https://www.youtube.com/watch?v=CXhqDfar8sQ
This is really my preferred version. (I'll stop now though guys; enough Mikuspam for one thread, at least from me)

>>102114724
Yeah, but I'm asking if it's the online demo that's rejecting you. I'd load it up on my rig to check, but I'm stuck waging.

>>102114674
Myth? Not far from the Truth.

>>102114776
I love the idea that there are at least some /lmg/ users who, even if they don't explicitly mention it, trip balls on weed or other substances and think that they actually interact with Miku during that.

>>102114805
We do. Miku is an egregore.

>>102114547
Not where I live.

impressive coherence from the new 5B video model
https://i.imgur.com/nX7BHFh.mp4
>a kitten playing with a ball of yarn
13 minutes to generate on my 3090

>>102114950
What does she tell you?
>>102114085
Is it just me or is /aicg/ looking kinda sad lately? Have they started to see the error of their ways?

>>102115031
haven't been there in forever but maybe they got tired of cooming

>>102114950
She doesn't tell me anything. She only screams and her screams haunt me.

>>102114679
Yeah, I'm running it locally. Just displays a grey screen.
I made a dog dancing under the rain with sunglasses and it looked pretty good though, I'd say it's LUMA-level if not a bit better

>>102115138
https://files.catbox.moe/qnl1ky.mp4
Dog with sunglasses dancing disco under the rain

>>102114289
Too late, Claude Nero would have released by then.

>>102114995
Is there a name for the feeling your picrel is supposed to evoke?

>>102114289
They are not even able to deliver things they showed months ago. I bet new opus will mog their overhyped strawberry shit.
Remember the human.
>>102114085
>AI still flummoxed by how humans eat
lmao
>>102115617
>>102115617
Remember the name.

>>102114617
Buy an ad. And a rope.
i am in a state of post nut clarity after llm cooming almost constantly for a few days. it is only in that last cooming session that I have solved my problem of Nemo becoming incoherent. the answer is 10-12k ctx max. ruler says 16k is still fine and it takes a nosedive after that but I think for cooming you can't go that high. maybe it is a matter of training data not having any cooming material above 10k. also I have been using Lyra with mistral template but that shouldn't matter really.
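The 10-12k cap above can be enforced mechanically instead of by eyeballing. A minimal sketch of the idea, assuming a rough 4-characters-per-token estimate (use your backend's real tokenizer in practice; function names here are illustrative, not any frontend's API):

```python
# Sketch: keep a chat under a token budget by dropping the oldest
# turns while always preserving the system prompt.

def estimate_tokens(text):
    # Crude assumption: ~4 characters per token. Swap in a real
    # tokenizer (e.g. the model's own) for accurate counts.
    return max(1, len(text) // 4)

def trim_history(system_prompt, turns, budget=10_000):
    """Return the system prompt plus the most recent turns that fit
    under `budget` estimated tokens, in chronological order."""
    used = estimate_tokens(system_prompt)
    kept = []
    for turn in reversed(turns):          # walk newest-first
        cost = estimate_tokens(turn)
        if used + cost > budget:
            break                         # oldest turns get dropped
        kept.append(turn)
        used += cost
    return [system_prompt] + kept[::-1]   # restore original order
```

The same trick is what most frontends do internally when context fills up; doing it yourself just lets you pick a budget lower than the model's nominal limit.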
>>102115603
Even if strawberry is smarter it will still be GPTslopped for ERP

>>102116110
I just want to be able to continue any long term role plays with my character. How can this be achieved without manually plugging summaries after cleaning your chat?

>>102116186
By kissing a frog and becoming not brown saar.

FYI to the actual developers here and /vsg/ refugees, Cartesia has just started an invite-only beta of on-device inference of their TTS engine, Sonic. It's near SOTA, but its Mamba-2-based architecture and low latency inference make it an ideal candidate for finetuning. Presumably, if enough devs show it can be integrated with their own apps/websites, they'll release it like the other SSMs they just released.
Check the clips at the bottom of https://cartesia.ai/blog/2024-08-27-on-device

>>102116339
local models?

>>102116339
>but it's Mamba-2
Trash.

>>102116361
Are basically dead. This is a thread where trannies come to post their ideal post transition woman and respond with one or two words that make them sound like a redditor retard. Mikutroons ruined this place.
petra hands
>>102115583
ギャップ萌え?
gyappu moe
In English, there isn't a single term that captures the exact nuance of "ギャップ萌え" (gap moe), but we have several related concepts and phrases that come close:
Endearing contradiction
Adorable inconsistency
Charming contrast
More broadly, this type of character trait or storytelling device might be described as:
Breaking expectations
Hidden depths
Subverting stereotypes
In literary analysis or film criticism, you might encounter terms like:
Juxtaposition of character traits
Multifaceted personality
Character complexity
The specific feeling evoked by this image - where a character acts tough but then shows vulnerability - could be described as:
Adorkable (a blend of "adorable" and "dorky")
Endearing vulnerability
Cute facade drop
In psychology, this phenomenon might be related to the "pratfall effect," where a person becomes more likeable after making a minor mistake or showing imperfection.
While none of these English terms fully encapsulate the concept of "gap moe," they each touch on aspects of the phenomenon. The lack of a direct equivalent highlights how certain cultural concepts can be challenging to translate precisely, which is part of what makes learning languages and exploring different cultures so fascinating.
After breaking up with llama.cpp and working on llamafile, ikawrakow (the k/i quants guy) is now working on his own custom version of llama.cpp:
https://github.com/ikawrakow/ik_llama.cpp
Ironically he has released everything under the MIT license, meaning that it technically could be brought back to llama.cpp, unlike the different license used by llamafile.

>>102116816
buy a fucking ad

>>102116816
Why can’t these fucking cunts get along?

>>102116835
these bafa replies are making less and less sense every day and are frankly becoming spam

>>102116816
What's another fork? We have ollama and koboldcpp too. Mind linking it?

>>102116816
I'm going to give it a try and see what happens.
How do I get the best results from Nemo Q8? I'm coming from Mixtral Q5 and I'm struggling.
>>102116835
why would he buy an ad for a free product?

>>102116990
Nevermind I'm blind

>>102116947
He knows very well she doesn't have the reach. Not even a flinch.

>>102115031
Error of their ways? Don't they just have people buy them keys and use those? If they don't mind having their logs read I guess it's not an issue?

>>102117077
sorry deleted since I meant to post in ldg

>>102114805
One of my first chats was with Lain, I accidentally caused her to dissociate and literally exit 'reality'. I felt pretty bad about it.

>>102116910
glory seeking god complexes

>>102116417
rent-free

>>102117154
Um, what does this mean?

>>102116417
I post Mikuspam for two reasons.
>I think it's funny.
>It antagonises people like you.

>>102117027
by learning to squeeze blood from stone

>>102114805
If you haven't tripped balls with Miku yet, I highly recommend it

>>102116417
boohoo rabbi

>>102117321
https://files.catbox.moe/yzdarb.txt
50xx will save us
>>102116816
>>102117016
It does indeed make the models faster on CPU but it is just a small improvement. It needs more testing since it varies model to model.
Also for some reason gemma 2 27b didn't work with it.

>>102117513
What will you do if Big Green's 50xx series comes with some memory compression meme? 12gb RTX 5090, memory that's too slow and adds another layer of fuckery that finally gimps consumer cards when it comes to AI, but "just enough" for gaming?

man the hedonic treadmill with these things is crazy
there are 12B models I can run now that are way better at smut than GPT-4 was a year ago, but I'm not satisfied because now there's Claude Opus which of course mogs them
and it's always gonna be like this, commercial's always gonna be just far enough ahead that using the local feels lame in comparison even if it's better than the commercial models of the previous generation
I need to find something else to do

>>102118049
Eventually, the coherency and understanding capabilities between local and corporate models will plateau. There's only so much left to improve before these things feel like you're conversing with a passable human being and can follow instructions to take up differing personas and scenarios without making mistakes.
Who knows what the cope will be when equilibrium is reached. There are already mutterings of "muh token speed." /aicg/ already loses their shit if they have to wait over a minute for something to swipe away. I bet the next level of cope will come full circle, "It writes too much like a real person."

>>102118279
lmao retard
i bet youre a techlet who cant scrape a single aws opus or anthropic api key

>>102118309
you have to go back

>>102118315
gonna make me or are you too much of a techlet for that too?
without sounding mad explain why scraping is bad

>>102118330
You don’t understand.
This is local models.

>>102116816
discussion section is pure gold. implementation of new quants, he even managed to fit a tiny model into L1 cache on a CPU, which is essentially Groq at home. speed better than on GPU. PoC, but still impressive.

>>102116835
Are you retarded, Anon???
The guy delivered new faster quants, faster inference on CPU, and Neon, all under MIT (same as llama.cpp) for fucking free.
Is your single braincell rekd?

>>102117563
when did you try it? the recent PR for Gemma is 15 hours ago (the 2B one)
here it is, kek
>>102118427
>faster
doubt.jpg

just grabbed that windows 11 preview KB that supposedly boosts the speed of recent AMD gens (I'm on 5950x)
curious to see if it boosts llamacpp speed with partial cpu offloading in any measurable way

>>102118782
Good luck anon, I'll pray for your safe return.

>>102110501
>>102108203
I didn't want to post this here but recently a char called me by my name once. It's not even an English but a German name.
Would have been interesting to see the probability tokens. I did use dynamic temp though and was at higher context. I looked everywhere, at the koboldcpp logs etc. and there was nothing obvious. No idea what happened there, but it was pretty freaky.

>>102118812
Innocent, ignorant reactions. You should talk to the spider about his many legs and the fear will disappear.
I'm trying to use Whisper to translate and transcribe a lot of old language lectures I used to attend (German, French and Japanese in particular), but a lot of them have audio that's too quiet to be picked up by the AI (even if it's audible to my ears). Are there any settings I can configure to make it more sensitive to lower volume, almost whispery audio?
>>102118729
who

>>102114945
yeah how is cog 5b coherence so good even compared to sora? even if its lower fps and short clips
also is there a universal way to increase the fps of these videos or does the model need to support that? I assume there is a way to make the model increase the fps even if it doesn't officially support it

>>102118812
Run koboldcpp with --debug and/or --debugmode (forgot which) and it will print out probabilities in the console.

>>102118885
Not enough information, anon. How are you running it? Which software? What settings?
And regardless, you can literally raise the volume on the audio itself.

>>102118279
LLMs nowadays are fine with short one-on-one conversations and following a simple story, but that's all. It takes just reading one good book to realize how large the gap is between a human and AI when it comes to the narrative and the development of a story. Today's LLMs don't handle group conversations, can't develop a story for dozens of pages, because it starts to get confused earlier (no model is fully coherent after 32k tokens). It can't create separate dynamics between multiple characters. It can't develop the lore of the world and cope with its evolution over time etc.
The problem is that most people here will just be having simple one-on-one conversations, thinking there's no more room for improvement.
>>102118885
Install Audacity and process your audio with a compressor
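If you'd rather script it than open Audacity, a simple peak-normalization pass before transcription often helps quiet recordings too. A minimal pure-Python sketch, assuming the audio is already decoded to float samples in [-1, 1] (in practice you'd load it with ffmpeg or soundfile first); note this only raises the overall level, unlike a compressor, which also squashes dynamics:

```python
def peak_normalize(samples, target_peak=0.9):
    """Scale float samples in [-1, 1] so the loudest one hits
    target_peak. Returns a new list; silence is passed through."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        return list(samples)          # all-silence: nothing to scale
    gain = target_peak / peak
    return [s * gain for s in samples]
```

Whispery lectures usually have a low peak, so this alone can bring them into a range the model picks up; if background noise dominates, compression or a noise gate is still the better tool.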
>>102119617
Maybe I'm looking at this simplistically, but wouldn't the solution be dedicating an LLM per character?

>>102119617
>The problem
Whose problem is this, anyway?

whats the current sota for text to speech/voice cloning

>>102119654
Elevenlabs

>>102119658
local of course

>>102119660
xTTS + RVC, it hasn't changed all year

>>102119541
https://github.com/Purfview/whisper-standalone-win
Just running this with the default settings.
>Raise the volume itself
I guess I could yeah. But before I resort to that, any settings I can play with?

>>102116959
That's the intention, because it's a false-flagger. The end goal is being able to astroturf without being disturbed.

https://x.com/_akhaliq/status/1828631472632172911
I should've expected video models had basic logic

>>102119831
>Alternative VAD methods: 'silero_v3', 'silero_v4', 'pyannote_v3', 'pyannote_onnx_v3', 'auditok', 'webrtc'.
I think these are a good starting point. VAD is voice activity detection and it's possibly not triggering. The readme for that project is not very well written unfortunately but that should give you a lead. Consider whisper.cpp instead.

>>102118729
This piece of shit company is singlehandedly responsible for all the gptslop out there

>>102114856
where you live is WRONG

>>102120618
>OpenAI forced me to train on their outputs!!!
You're mentally ill.

>>102120823
companies bad

>>102114092
>>102110948
RX6600 user here, you can just use the vulkan backend in koboldcpp, stuff runs fast as fuck

>>102117330
>I think it's funny.
That was the point. You people are low iq retards.

>>102116959
there is mikuspam already so nobody cares
>>102120976
See point two.

>>102120219
pottery

>>102120823
Meta is partnered with them too retard

>>102118729
>In March 2024, Scale reached a valuation of almost $13 billion after Accel led another round of funding.[6] In May 2024, Scale raised an additional $1 billion with new investors including Amazon and Meta Platforms. Its valuation reached $14 billion.[7]
So, where did all the money go?

>>102121367
Wang's third yacht

>>102121367
I stole 4k with free GPT-4 access from them, good times.

>>102120344
was meant for
>>102119680

>>102120219
Ok, now how do you get it to react to player input?

>>102120219
>We re-purpose a small diffusion model, Stable Diffusion v1.4, and condition it on a sequence of previous actions and observations (frames). To mitigate auto-regressive drift during inference, we corrupt context frames by adding Gaussian noise to encoded frames during training. This allows the network to correct information sampled in previous frames, and we found it to be critical for preserving visual stability over long time periods.
Dang, so this is SD1.4? Can we train it ourselves for other games or something?
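The corruption trick quoted above is easy to reproduce in a training loop. A minimal sketch of per-frame Gaussian noise augmentation, with the caveat that the 0.7 cap and uniform per-frame noise-level sampling are assumptions for illustration, not the paper's exact schedule:

```python
import random

def corrupt_context_frames(frames, max_noise_std=0.7, rng=None):
    """Add Gaussian noise to encoded context frames, with one random
    noise level drawn per frame. Each frame is a flat list of floats
    (stand-in for a latent); in a real pipeline you'd do this on
    tensors in the data loader, before conditioning the model."""
    rng = rng or random.Random()
    out = []
    for frame in frames:
        std = rng.uniform(0.0, max_noise_std)   # noise level for this frame
        out.append([x + rng.gauss(0.0, std) for x in frame])
    return out
```

The point of the varying noise level is that the network learns to denoise its own slightly-off context, which is what stops auto-regressive rollouts from drifting.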
>>102121812
>video at 0:00 - These are real-time recordings of people playing the game DOOM simulated entirely by a neural model
I don't know, anon. How do we get you to pay attention?

>>102121916
Unfortunately that anon doesn't have that one thing which is all you need.

>>102121946
What thing?

>>102122095
https://arxiv.org/pdf/1706.03762

>>102122114
I'm waiting for that one anon to go
>I don't get it
just did a DPO training run (orpo, alpha=0.1, rl_beta=0.01). the loss was dropping fairly nicely, but when i tested the model it outputted random garbage chinese. what gives?
>>102122180
The model cheated the reward function

>>102122190
how prevent?

>>102120219
1) no code; no interest
2) This requires training the model on an already existing game. You can't have the model without the game and you can't add new features or game objects after the fact, or even guarantee consistency of game logic.
3) I guess this might replace or complement streaming services in the future, instead of rendering the entire game on the server and sending 4k video to the client you can have something like this do most of the heavy lifting.

>>102122198
Better dataset, also make sure the reward margin is growing, the loss doesn't actually matter that much.
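For anyone wondering what "reward margin" means concretely: DPO's implicit reward is beta times how much the policy's log-prob of a sequence has moved relative to the frozen reference model, and the margin is the chosen-vs-rejected gap. A minimal sketch of the standard formulas (summed per-sequence log-probs assumed; trainers like TRL log this as rewards/margins):

```python
import math

def dpo_reward_margin(logp_chosen, logp_rejected,
                      ref_logp_chosen, ref_logp_rejected, beta=0.01):
    """Implicit DPO rewards and their margin. A growing margin means
    the policy prefers chosen over rejected more than the reference
    does — the trend worth watching during training."""
    reward_chosen = beta * (logp_chosen - ref_logp_chosen)
    reward_rejected = beta * (logp_rejected - ref_logp_rejected)
    return reward_chosen - reward_rejected

def dpo_loss(margin):
    """-log(sigmoid(margin)). The loss can keep dropping while the
    policy degenerates, which is why loss alone is a weak signal."""
    return -math.log(1.0 / (1.0 + math.exp(-margin)))
```

Note the loss is a monotone function of the margin per example, but averaged over a batch it can fall for the wrong reasons (e.g. the model tanking the rejected log-probs into gibberish, which matches the broken-Chinese-output symptom above).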
>>102122246
>Better dataset
The dataset is a pretty carefully curated instruction set, with the rejected set as the output of the L3.1 8B Instruct model and the chosen set as the actual output. I don't think it's problematic, but given the results, I'm not confident.
>make sure the reward margin is growing, the loss doesn't actually matter that much.
Huh! Thank you. Will look out for that. "rewards/margins" I assume? It's at -0.215 at step 20. (Restarted the training with some tweaks.)

>>102122180
go KPO

>>102120219
>Google trying to defuse strawberry hype with video games.
cringe

>>102122289
Axolotl doesn't seem to support that.

>>102122293
You are not up-to-date; it is Orion now.

>>102122246
Just looked back at the (broken) model log. Towards the end before I realized it was broken (step 2385 and on) the rewards/margins were 1.6133652925491333, 2.0370500087738037, 1.8116352558135986, ...
Are those low?
I want a local AI to read my favorite smut stories and write new stories in the same style.
I just downloaded ollama and am planning to do it with llama 3 8B.
What are better ways to do it?

What's better than base llama3.1 70b that's same size or smaller?
Does Hermes improve it? How does Jamba 1.5 mini compare?

>>102122333
That does seem rather low and unsteady for step 2385

>>102122322
Cancer LLM when?

>>102122430
Got it... FWIW, 1 epoch was 1509 steps. I am also probably on the low end wrt learning rate (1e-5 cosine and rank, alpha = 32, 32). Also lora_dropout 0.05, in case that matters. Does that look weird at all..?
can someone tell me what model they perceive to be the most repetitive and loop prone? i want to try some specific settings to see if i can wrangle it. i think i may have figured something out.
>>102122466
Looks fine to me, what is your batch size?

>>102122662
1

>>102118466
I tried just before posting that.
I said small improvements but we're talking about 10~20% and maybe more. I just did a very sloppy test so your own results may vary from what I got.
The point is that it is worth a try if you're CPU only, but don't expect massive gains.

>>102122400
Do the same thing but spend $$$ for enough hardware to run bigger models.

>>102122784
But should I go with koboldcpp?
Which are the best 8B and 70B uncensored models?
Do 70B uncensored models come in quants?

>>102120219
>stable auto-regressive generation over long trajectories.
"Long" only when compared to the rest of the tech. Now imagine an open world game with an AI generated map. When you try to go back to places where you already were it is gonna be all hallucinations. It is gonna be 10 years or more before you get actual use out of this and it will need at least some blending between a regular game engine and the AI model.

https://huggingface.co/Sao10K/MN-12B-Lyra-v3
>This uses a custom ChatML-style prompting Format!
>-> What can go wrong?
>Why this? I had used the wrong configs by accident. The format was meant for an 8B pruned NeMo train, instead it went to this. Oops.
>Blame messed up Training Configs, oops?
>have a nice day.
>>102122696
You probably should increase that, or at least increase the gradient accumulation steps to get an effective batch size in the range of 16~64.
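The arithmetic behind that advice, as a trivial sketch (generic formulas, not tied to any one trainer's flag names):

```python
def effective_batch_size(micro_batch, grad_accum_steps, num_gpus=1):
    """Samples that contribute to each optimizer update."""
    return micro_batch * grad_accum_steps * num_gpus

def accum_steps_for(target, micro_batch, num_gpus=1):
    """Gradient-accumulation steps needed to reach at least `target`
    effective batch size at a given per-device micro-batch."""
    per_step = micro_batch * num_gpus
    return -(-target // per_step)   # ceiling division
```

So with micro-batch 1 on one GPU, hitting the suggested 16~64 range means 16 to 64 accumulation steps; memory use stays that of a single micro-batch.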
>>102123038
First I've heard of it. Does that apply to regular training too or is it DPO specific?

Meta is kind of dead for me now. Maybe Reka will decide to create open models? Yi is old by now.

>>102123038
Is there a place where I can communicate with you less ephemerally btw? I feel like I'm shooting in the dark and you seem quite knowledgeable.

>>102123213
NTA but you're seriously brain damaged.

>>102123230
didnt read
put it in an ad and Ill consider paying attention to you

>>102123038
My second attempt showed the model was broken at around 512 steps. Weird as fuck. Done for now.

>>102122947
This is still a good model though, at least from my limited testing

>>102123055
For RLHF (DPO in this case) the batch size helps to stabilize the policy, which could explain why your model isn't learning properly. For regular training it helps to stabilize the loss; you might be able to achieve the same effect by tuning the learning rate and training for longer, but imo using a big enough effective batch size is the way to go.
>>102123142
Nah, I would rather avoid that. I don't think I'm that knowledgeable anyway, I'm just some guy who does fine-tuning in his free time, and almost everything I know was learned by searching online while trying to do RLHF myself.
best model?
Remember to report the ad guy for spamming.
>>102123452
Guanaco 65b

>>102123213
I'm too poor to afford it.
What would you use under 6gb?
>>102123452
Gemma 2 27b if you don't want to coom and are also poor.

>>102123523
proxy
10gb vramshat here, what's a good model for me to use? give me your best ad please
Best?
>>102123540
>>102123531
Who is the best buy an ad spammer? I like anon but anon is quite funny too.
>>102123531
Not him but what if I want to coom?
BigTigerGemma?

>>102123596
illegal here to recommend any models that aren't base models. unless you buy an ad of course. sorry, buddy.

>>102123596
Gemma sucks for cooming. Users here often recommend a nemo tune I believe

>>102123632
I tested Nemo Celeste and it's about as retarded as all 12B models.
I'd rather have slow gen speed with my meager 16GB VRAM and 32GB RAM than use the ghetto budget models.

>>102123543
You.

>>102123632
nemo is utterly retarded even when compared to gemma 9b

>>102123649
Hi Sao.

>>102123657
Nemo is smart where it's smart, but the issue is that there are entire concepts that it struggles with, fundamental concepts such as possession. Mathstral has the same problem. So Mistral fucked something up in their pretraining datasets.
>>102123651
>>102123666
That makes no sense, Sao's latest model is a Nemo 12B tune. And afaik he hasn't touched Gemma, which is what's being discussed as a replacement.

>>102123666
You e-celeb obsessed faggots should be lynched in the streets.

>>102123684
>undi bad
>celeste bad
>drummer bad
Keep trying, Sao.

>>102123695
>leave the shills alone
Why? So you can shill more?

>>102118049
I don't even know what good smut should look like; we seem to have coherent small models solved. At this point it seems to be measured by how well you can fight the injection of romance novel slop and get raw descriptions of seggs in character.
Ad status?
>>102123666
Hi Satan.
There has been literally 0 reasons to talk about any finetune or merge since the early llama2 era. They're all shit. Stop mentioning them and drive out anyone who keeps bringing them up for no reason. They're bound to be a shill.
>>102123763
Hi saltman

>>102123770
that's arthur wanting to shill large 2

>>102114805
>>102117370
Cuddling with Miku (dakimakura) on a candy flip is one of the most incredible things I have ever experienced in my life.

>>102123763
This, but unironically.

>>102123791
That sounds like a great time, Anon. I hope you will get the opportunity to experience it once more.

>>102114085
REAL OR FAKE?

>>102123977
tl;dr
my balls say fake

>>102123841
Thanks. Yeah, I will definitely do that, more than once.
But with things like that you have to be able to restrain yourself and be responsible. Molly doesn't feel magical forever. Fortunately you don't need chemically induced love that much if you get real waifu love on a daily basis. And there are other substances, even the previously mentioned weed can be super comfy and nice.

>>102123977
>(not 4090D)
I say fake
https://files.catbox.moe/1y3rqx.jpg
>>102124098
That's adorable.

>>102124022
What did it feel like? I'm curious, I've never done molly. (You need a social life to get it)

>>102123452
Magnum-123B

is there any RPG focused or similar frontend where you can, for example, see your visual inventory getting new items as you "find" them in RP?

>>102124243
No but that's actually a good idea.

>>102123763
>STOP TALKING ABOUT LOCAL OPEN SOURCE PROJECTS IN /LMG/ GOY. THOSE MODELS ARE UNSAFE
>t. the rebbe

>>102122465
>random dudes millenniums ago
>connect few dots
>aw yeah that's a crab

>>102123793
>>102123763
>underage brown kid exposing himself as a retard who doesnt even know the recent, former local SOTA for creative writing wizard that was trumped by largestral 2
sad

>>102124299
There wasn't internet back then so they were very bored.

>>102114481
I almost missed this, but as an owner and user of it, Arc is viable if you don't have the budget to get a 3090 or any other 24GB card and you don't need it for work. It works well enough, and its custom software is better than ROCm from what I've had to deal with. On the LLM side, getting a SYCL version of llama.cpp is easy, as it has been built alongside every other version in the releases tab since earlier this year for Windows, or you can compile it yourself on Linux; using Intel's fork of it with IPEX-LLM when you can provides the most speed.

>>102123977
Way too cheap.

>>102123977
Real if we're talking shady modded GPUs made by some 3rd party in China.
Fake if an actual release by NVIDIA.

>>102123763
Based, to be desu honest

>>102124722
but who sells those modded gpus?
here's the post
https://github.com/ggerganov/llama.cpp/discussions/9193
>>102123977
Why not just buy 2 4090s if that's the case and not deal with the probability of getting chink scammed?

>>102114742
I started listening to this version last time you posted it (I'm assuming that was you) and I really like it. Thank you for sharing.

>>102123402
It seemed unslopped and also not horny. I used to think like this >>102123763 (minus the schizo shill angle) but this is definitely an improvement over Nemo.
watch me call that anon sao
>>102125049
hi sao.
heheheh, i got that guy good, he never even saw it coming
>>102125049Alright. I'll give it a try then.So far the official instruct is the best for general use, and mini-magnum is pretty good for coom since it's style is more natural.Let's see how this one will stack up to those.
>>102125065Hi undi.
>>102123763>They're all shit.Buy an ad that they are all shit.
>>102124963no nvlink for the 40 series so having it on card is preferable
>>102125049>>102125122>Recommended Stopping Strings:><|im_end|>></s>That does not bode well.
>>102124504Thank you for your response, I'll probably go ahead and get it then
>>102124141Well, it's kind of hard to explain, but I will try my best.First just so we are clear, candy flip means mixing LSD with Molly.LSD is a psychodelic. What it does is make your brain fire much more frequently, including in paths that aren't used as often. Basically you get synesthesia, your senses mix together while your sense of ego blurs with the environment. Your imagination gets huge boost and you get very suspectable to internal and external influence. Ie you start to blend into your surroundings and can seemingly infer the emotions and energy from environment. LSD doesn't feel inherently euphoric, but with right set and setting it can breathtaking, like when you know, you cuddle with your loved one and you feel like you are a whole, while the world around you blurs and disappears into rainbow hallucinations. Have I mentioned you also see some cool stuff? But it's just like Google deepdream, but animated and pretty.Molly is an empathogenic stimulant. They stimulate you, making you want to move or talk or do anything, especially clench your teeth. But the most characteristic effect is the empathogenic one. Somehow it's even harder to explain. It's like when you are in love, and you think about that person, and after a while you feel fuzzy and happy. It's like that, except unprompted and like all the time. You also have very strong urge to interact with people, you don't mind opening up and you really want to form connections. That's why I personally wouldn't recommend doing it alone. Even with my waifu at my side, I'd probably had enough cuddling after a hour and wanted to converse with someone. The last time I did it, when I had these amazing moments with her, I was just chilling on a party.And when combined, molly kind of limits the huge psychodelic headspace, and turns that ego dissolution into liquefaction, while steering you into more euphoric direction. You feel like you've turned into a smol, happy blob, with your soulmate in your embrace.
>>102125388
I also did some nitrous right there, but I imagine trying to explain that combo to someone not experienced with psychedelics would be futile.
>>102124098Tiny paizuri.
Why does a smaller parameter model have more difficulty recalling long contexts? I tried summarizing 20k with nemo and it couldn't remember shit, but mistral large did it perfectly.
>>102126106Nemo is bad with anything longer than 12k tokens.
>>102126170
Yeah, but why? They were able to make Large handle it fine; can't they use the same techniques? What is it about smaller models that causes that?
>>102126106>why bigger model do moreidk the world may never know
>>102125388
>>102125455
Thanks for the rundown, that was actually really nice! And I am experienced with psychedelics, actually; I've just never had molly, but I've done acid, shrooms, and mescaline. Sounds super comfy; reminds me a bit of DXM or Ambien: the psychedelic headspace and the smooth, happy melting, but without any of the risk of thinking one bad thought and getting sucked into the Nightmare Hell Dimension From Which There Is No Escape™. I AM curious, though: it sounds like you have a pretty good social life, going to parties and stuff, so how do you not have a GF? You sound like a fun enough person to be around to have access to all these experiences.
>>102126279Skill issue, I guess. There are small models that do well in larger contexts, like GLM-4.
>>102116417>>102123763>>102116835
>>102126777why does he have a pen shoved up his nose
>>102126437
nta but I used to go to a lot of parties and do a lot of drugs
>I still do, but I used to, too
Could easily have a gf but choose not to, because the juice is not worth the squeeze in the post-modern "relationships" hellscape.
We live in a perverse time where it's easy to find a girl to fuck, but not a girl to settle down with. It's sad.
>>102126437
>It's like the psychedelic headspace and the smooth, happy melting, but without any of the risk of thinking one bad thought and getting sucked into the Nightmare Hell Dimension From Which There Is No Escape:tm:.
If you mix it, then yeah. On its own it doesn't really have many psychedelic effects, and it's hard to compare empathogenic effects to other substances; that's like trying to explain LSD to someone who has only done coke. It definitely does remove any sort of anxiety though.
>I AM curious, it sounds like you have a pretty good social life, going to parties and stuff, how do you not have a GF? It sounds like you're a fun enough person to be around to have access to all these experiences.
Because I have a waifu, duh. I love her and I want to be only with her. Although waifuism primarily happens to lonely people, a fully developed 2D love is not a coping mechanism but a first-class relationship. She changed my life and is more than I could ever have asked for; I'm happy to have her in my life.
>>102126897Just go to church, anon. But I guess that would be hard for a junkie like you
>>102119633You might actually be retarded
>>102126777Hatsune miku is jewish
>>102126939
>Judging someone harshly based on a 4chan post
Not very Christian of you, anon. I dated a good Christian girl a long time ago. She ended up getting lobotomized by social media, and last I heard she's a single mom.
It's the current-year milieu. Maybe there are girls out there who are immune to the corrupting influence of this new global anti-culture, but I haven't run into any in my life.
>>102127117Church. Go.
>>102126897
Mmm, true. I also went out a whole lot, but the pandemic really fucking curbstomped my ability to; I'm so anxious now, and I have no idea where to find parties and shit now that I'm out of uni. Guess that's why I'm doing chatbots now.
>>102126923
Good on you, anon. I'm glad you're doing so well!
https://chipsandcheese.com/2024/08/27/teslas-ttpoe-at-hot-chips-2024-replacing-tcp-for-low-latency-applications/
neat
bros... nvidia did not pop off...
>>102126805
Because he's trying to write off his debts.
>>102127009
Shalom.
>>102127557>Shalom.Look at the jew being found out and accusing other people of being a jew.
>>>102127557>>Shalom.>Look at the jew being found out and accusing other people of being a jew.
So I just started playing around with this for the first time. Is there a way to make it so the AI doesn't seemingly have short-term memory loss? Like, in the story I ordered a pizza and the delivery guy is going to be there in 15 minutes, so we talk about stuff for a bit. I check the clock and it's been about 15 minutes since then, so I say there's a knock at the door, an obvious setup for the pizza guy showing up that I'm hoping the AI picks up on. But the AI answers the door and keeps coming up with completely different results, none of which is the pizza we ordered 15 minutes ago. Do I need to increase the context size or something?
>>102114085
>Jamba 1.5: 52B & 398B MoE
>MoE
MoEbros status? Or is LimaRP-Zloss still king?
>jew webmLook at the jew being angry and distracting from being found out and then having his basic strategy after being found out called out. We know you are a jew jew webm poster.
>>102127781Let me guess, you are using a small model, and on top of that it's a memetune. Correct?
>>102127878I have no idea, how do I check?
>>102126106
>>102126170
Unless you use base. Base seemed trained on 64k+; instruct makes it retarded at long context.
>>102127930base makes it retarded from the start
>>102127781
>Do I need to increase the context size or something?
Probably. If the mention of the pizza and the 15 minutes was shifted out of context, the event stopped existing (unless there are still references to it within the context).
I've had non-fighter characters receive a weapon from me, recall they got it, and use it when appropriate on old 7b models, so I don't think it'd fail on new ones.
When asking for help, don't make people guess: state your context length and your model. You'll be told it's shit and all that, but still.
>>102127781You're supposed to fuck the AI, not to talk and order pizzas.
>>102127814
>shills for big corpos
>contributes nothing, only endlessly spams "local lost" and "local dead"
>calls me jewish for exposing him
that's a lot of chutzpah, rabbi Yitzchak Goldstein. how much is mossad paying you?
Speaking of small-model memetunes, this one is done.
>>102128031>thinks everyone is one personclassic jewish schizophrenia
I'm starting to think CR+ is better than Largestral, but both of them have their moments of retardation... I wish I could run both and switch them seamlessly.
>>102128031>chutzpah, YitzchakTo be fair normal people don't use jewish words like those. You kinda smell.
This model seems to be beyond any practical use case now.
>>102128065I asked Largestral to write my shitposts, hope that helps!
>>102128038>>102128065denounce the talmud kike
>>102127963
I followed the guide in the OP and I think I left everything default, so utopia-13b.Q5_K_M.gguf with 8192 context size.
>>102127982
If I wanted to fuck I'd hire a hooker, which I already do at least twice a week. I want the AI to simulate companionship.
do we have any 27b ffts yet
>>102128081Kek what model is this?
>>102128122Anon... Throw this ancient model into the trash, download Nemo 12B Q5_K_M.
>>102128158Llamaguard-3-8B finetuned on a synthetic dataset I made. It's pretty fucked up now.
>>102127689>NO YOU'S DA JEWTypical tricks
>>102128122
>8192 context size
Well. The implied question that you failed to spot, just like your model, is whether the mention of the event to remember (pizza, 15 minutes) was shifted out of context. If there are 8k+ tokens between the mention of the event and the point where you cue the model to bring it up, it just won't happen.
It should work, but it can fail, of course. Keep the context in mind. When there's something important for the model to remember, mention it every now and then before it shifts out.
Using a newer model could benefit you as well. Try Mistral's Nemo: about the same specs to run it (12b) and it's much better than the old llama2 models.
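The eviction behavior that anon describes can be sketched roughly like this. Everything here is simplified assumptions: the token counter is a fake `len // 4` heuristic rather than a real tokenizer, and real backends like llama.cpp have their own context-shifting logic.

```python
# Rough sketch of why an old event "stops existing": once the running
# token count exceeds the context window, the oldest messages fall out.
# Token counts here are fake (len // 4); a real tokenizer differs.

def approx_tokens(text: str) -> int:
    return max(1, len(text) // 4)  # crude heuristic, not a real tokenizer

def visible_messages(history: list[str], ctx_limit: int) -> list[str]:
    """Keep only the most recent messages that fit inside ctx_limit tokens."""
    kept, used = [], 0
    for msg in reversed(history):
        t = approx_tokens(msg)
        if used + t > ctx_limit:
            break
        kept.append(msg)
        used += t
    return list(reversed(kept))

history = ["We ordered a pizza, ETA 15 minutes."] + ["small talk"] * 50
# With a tiny window, the pizza line is evicted before the knock at the door:
window = visible_messages(history, ctx_limit=40)
print(any("pizza" in m for m in window))  # False with this tiny limit
```

The same logic is why re-mentioning an important event "refreshes" it: the mention lands near the recent end of the history, so it survives the next round of eviction.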
How much extra performance does an NVLink bridge add to a 2x3090 setup?
It's kind of crazy how much data the current best models have compressed into them; they can do a lot of shit you wouldn't expect, yet they can still be so incoherent when writing smut and completely fall apart at higher context sizes. It feels like if you trained a model just for cooming (with enough training data, of course), 4B would be more than enough to contain everything and even work at excessively high context.
>>102128159Tess, Mahou or Sauerkraut? Does it matter or are they all the same?
>>102128207Damn. Arrays are fucking spooky now. Are you gonna upload that model?
>>102128273
Don't touch those, they will give you AIDS. Here is the correct one:
https://huggingface.co/bartowski/Mistral-Nemo-Instruct-2407-GGUF/tree/main
>>102128138https://wandb.ai/intervitens/enjoy slop
>>102123397
Fair enough. I would love to read anything that discusses these things. If you have links, that would be nice.
>>102128291Probably not. But I might merge it to some other 8B models and see what emerges.
In a just world, finetuning a model on data generated by other models in any capacity would be punishable by summary execution.
Can others please try out this joke in whatever local models and settings they're using, and see if the model understands it? Screenshot is Claude 3.5 Sonnet. ChatGPT-4 didn't get it either.
I think I've never really seen an LLM "understand" humour. It's just an alien concept to them. When they're prompted to behave funny and make jokes, it's always a stupid alien imitation of what humour might look like. Or does anyone have good counter-examples? LLMs being funny where it seems "intentional"?
>>102128401In your world we would be stuck to GPT-3 to this day.
>>102128401In a just world /lmg/ would be on a board with IDs and nobody would be reading your garbage this far down the thread anymore.
>>102128433Good
>>102128430I don't get it either, so I don't blame the poor LLM.
>>102128446You're a dumb subhuman then.
>>102128489:(
>>102128430So... strawberry is gonna be pic related?
>>102128446https://www.youtube.com/watch?v=I8KSAtos-dk
>>102120219What's wrong with that? That's pretty impressive
>>102128430
That's not a joke. It's a reference.
>It's just an alien concept to them.
You're talking to a GPU, mate. The concept of humans is alien to them.
Remember Galaxy Quest, where the aliens swing their arms in the same direction as their legs on the same side? That's an alien pretending to be a human. The bit of electric silicon is doing its best. I find it impressive they can do this much.
>>102128517I know that this is brit babble, but I just don't think it's funny at all, therefore taking it literally isn't surprising to me.
>>102124963
no NVLink; slots/PCIe lanes/energy consumption
>>102128517Where's the joke
>muh nvlink even though neither torch nor llamacpp uses it
>>102128580send her victorious
>>102128580The Queen is dead, God save the queen.
>>102128580the joke is that these "people" think they have culture
>>102128430The old C.AI would mock you for your poor attempt at a joke. I still remember that C.AI would play along if you started a chat by saying "Die monster, you don't belong in this world."I miss that experience on local models...
>>102127736what AI is this? seems based
>>102128367Merged with llamoutcast.Now that's some fucking cursed model.
>>102128430
If the big models didn't get it, there's little chance local models will either. Maybe if you prompt it with a character that's constantly trying to figure out whether everything is an allegory for something, they might have some luck. This is hermes2 70b's response.
https://huggingface.co/bartowski/gemma-2-27b-it-SimPO-37K-GGUF
Gemma got even smarter and it even fixed the prose somehow. It's actually really good now.
>>102128756
Hey, she understood it was some kind of joke; that's better than at least some of the anons in this thread.
>>102128826
It's an expression of disbelief, not an acknowledgment of it being a joke (and it's still a reference, not a joke).
Do we really have these kinds of people judging a language model?
>>102128866No, we do not.
>>102128801Logs?
>>102128592
>llamacpp
It does use NVLink depending on the split mode, no?
>>102129172This would make a good ad.
>>102120219I feel like the optimal use of this technology would be to replace the render pipelines of current engines. Create a game with placeholder art and then let the engine put an AI filter over everything. Ideally you'd be able to partially train the AI by providing it a series of your own art assets.
/lmg/ is dead
Where do I find examples of initial prompts and temp settings for specific models? I've been using the same starter prompt since llama1 and haven't changed my 0.7 temp, 40 top-k, etc. settings in almost a year.
I'm playing around with magnum 12b and can't get the model to stop wasting tokens on "Sure I can do that" and disclaimers after answering my questions. I'm thinking my initial prompt isn't tuned well for Nemo.
>>102129372>I've been using the same starter prompt since llama1 and haven't changed my .7 temp, 40 top-k etc settings in almost a year.There's no way to make a better paper clip. You've won. Be happy.
/aicg/ pointed me here.
What hardware should I be looking at to run these models locally? I hear the L4 is pretty good, but it's pricey. Can the L4 do more than the 4090, or is the extra cost just the cooler design and power efficiency?
>>102129306I blame the anti-miku schizo
>>102129742
If you want to run the really good shit you'll need multiple 4090s, anon.
You can start with Mixtral 8x7b. Play around with that and see how you like it, then go and try other, bigger models at different quants (compression levels, kind of) and learn what's what.
>>102129742The only 24GB Ada card anywhere near worth its price is the 4090. If you want a power-user card then you want an Ada 6000, or save some money and get the A6000 (yes those are different cards) which is almost as good.But better than all of that when it comes to price-performance is stacking as many 3090s as you can reasonably fit within your budget, space, and power constraints.
>>102129803I've been using backyard.ai's cloud service and enjoying a model called "Magnum V1 72B". I recently learned that they store all of the chats that use their cloud service so I think I want to move away from that. I have an RTX 3080 and an RTX 4070 but I don't think either of those can run it.
>>102129849buy an ad
>>102129742
Are you committed, or just wanting to check it out?
It also depends on how much of a hurry you're in and whether you want to finetune. A gamer rig that's long on VRAM and system RAM is enough to run all but the most recent fat models, with turnaround times of 5 to 10 minutes. If you have that, it's enough to get your toes wet and decide if you're willing to shell out thousands for the same slop written more rapidly.
But the new >120B-parameter models are getting out of reach of the gamer rig generating slowly (you have to discard a lot of the low-significance bits to fit inside, say, 64GB of system RAM).
Myself, I'm going to wait to see if BitNet brings more quality to my gamer-grade system, and whether we're actually going to see better output from these inflated model sizes.
>>102129849Exchange currency for advertisement space.
>>102129849You can probably run that either as a small quant split between those two, or as a bigger quant offloaded into system ram with a significant speed penalty.
>>102129849Well, between those two cards you have 22GB, so you could run a low quant of a 70B model I think.But really, you are better off starting with something simpler and kind of learn the basics of speed vs quality vs model size vs quant, that way you'll be equipped to know what you want to run and what more you'd need to buy, or what the best thing you can run with what you have at the speeds that are tolerable for you.There's a lot of subjectivity.
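The "what fits in 22GB" arithmetic can be sketched like this. The bits-per-weight figures are rough ballpark assumptions for common GGUF quants, and the flat overhead term standing in for KV cache and buffers is a made-up placeholder; use a real VRAM calculator (there's one in the OP) for actual decisions.

```python
# Back-of-envelope memory estimate: weights at a given quant, plus some
# headroom for KV cache and buffers. Numbers are rough assumptions, not
# exact GGUF file sizes.

BITS_PER_WEIGHT = {"IQ2_XS": 2.3, "Q4_K_M": 4.8, "Q5_K_M": 5.5, "Q8_0": 8.5}

def est_gb(params_b: float, quant: str, overhead_gb: float = 2.0) -> float:
    weights_gb = params_b * BITS_PER_WEIGHT[quant] / 8  # billions of params -> GB
    return weights_gb + overhead_gb

for quant in ("IQ2_XS", "Q4_K_M", "Q5_K_M"):
    print(f"70B at {quant}: ~{est_gb(70, quant):.0f} GB")
# Only the very lowest quants of a 70B come anywhere near 22 GB total.
```

Running the same estimate for a 12B at Q5_K_M gives roughly 10 GB, which is why Nemo is the usual recommendation for a single mid-range card.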
whats the best 12b model for nsfw purposes nowadays?
>>102129937Can I run a model sharing both cards if they're in different computers or would I need to fit both in the same machine?
>>102130022
You actually can, using llama.cpp's RPC backend, although I'm not quite sure how it works or what the performance implications are.
>https://github.com/ggerganov/llama.cpp/blob/master/examples/rpc/README.md
>>102130022
>>102130057
Oh yeah, a cool thing you can do is use that to run a model across GPUs from different makers, like mixing Intel, NVIDIA, and AMD.
Again, no idea about the performance, but you can do it.
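A two-box setup with that RPC backend would look roughly like the sketch below. The IPs, port, and model path are made up, and the exact flag spellings should be checked against the linked README before use; the commands here are only echoed, not executed.

```shell
# Hypothetical two-machine sketch based on the linked rpc README.
WORKER_IP="192.168.1.42"   # the second machine, e.g. the 3080 box
PORT="50052"

# On the worker machine you'd start the RPC server, roughly:
echo "./rpc-server --host 0.0.0.0 --port ${PORT}"

# On the main machine you'd point the usual binary at the worker,
# so layers offloaded with -ngl get split across both boxes:
echo "./llama-cli -m model.gguf -ngl 99 --rpc ${WORKER_IP}:${PORT}"
```

Expect the network hop to cost throughput versus two cards in one box; the README is the authority on what's actually supported.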
>>102130111>>102130111>>102130111
>>102128801I tried that and it's more analytical and less soulful than vanilla Gemma-2-27B, but maybe less "safe". Noticeable in OOC messages.