/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>101191862 & >>101186500

►News
>(06/28) Inference support for Gemma 2 merged: https://github.com/ggerganov/llama.cpp/pull/8156
>(06/27) Meta announces LLM Compiler, based on Code Llama, for code optimization and disassembly: https://go.fb.me/tdd3dw
>(06/27) Gemma 2 released: https://hf.co/collections/google/gemma-2-release-667d6600fd5220e7b967f315
>(06/25) Cambrian-1: Collection of vision-centric multimodal LLMs: https://cambrian-mllm.github.io
>(06/23) Support for BitnetForCausalLM merged: https://github.com/ggerganov/llama.cpp/pull/7931

►News Archive: https://rentry.org/lmg-news-archive
►FAQ: https://wikia.schneedc.com
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Programming: https://hf.co/spaces/bigcode/bigcode-models-leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
►Recent Highlights from the Previous Thread: >>101191862
--The Struggle is Real: Cleaning Datasets for Machine Learning Models: >>101191983 >>101192038 >>101192091 >>101192171 >>101192277 >>101192315 >>101192363 >>101192395 >>101192396 >>101192335
--Seeking EXL2 Compatible Server with OpenAI API and Context-Free Grammar Support: >>101192778 >>101193053 >>101193098 >>101193144 >>101193166
--Qwen 2's Tess-v2.5.2-Qwen2-72B Variant: A Promising AI Model: >>101193192 >>101193460 >>101193516
--Perplexity Improves with 9b Base Model: >>101192484 >>101192604 >>101192644 >>101192655 >>101192662
--Llama.cpp's Token Generation Delay with Cached Prompts: >>101195573 >>101195626 >>101195675 >>101195962 >>101195977 >>101196044 >>101196192
--LLM Compiler: Code Optimization and Disassembly Research Experiment: >>101191929 >>101192026 >>101192176 >>101193178
--Counting Letters and Custom Compiling Gemma-2 Support in Llama.cpp: >>101192460 >>101192555 >>101192897 >>101192940 >>101192964 >>101193630 >>101193760
--Gemma's Performance in Real-World RP and Potential Combinations: >>101193239 >>101193271 >>101193316 >>101193566 >>101193637
--Gemma 2's 8K Context Limitations and Meta's Unfulfilled Promises: >>101195909 >>101195953 >>101196152 >>101196394
--AI-Generated Cat Image and LLM Writing Quality: >>101193118 >>101193151 >>101193260 >>101193552 >>101193311 >>101194021 >>101194047 >>101194093 >>101194110 >>101194134 >>101194150 >>101194234 >>101194251
--27B's Performance Improvement and Schizo Fix: >>101193819 >>101193846 >>101193868 >>101193906 >>101193945 >>101195648 >>101193954 >>101193967
--Llama-70B and Gemma-27B VRAM Performance Issues: >>101194991 >>101195001 >>101195070
--gemma2's Repetitive Answers: A Potential Inference Issue: >>101192975 >>101192983 >>101193055
--Miku (free space): >>101192212 >>101192496 >>101195485 >>101196114 >>101196225 >>101196269 >>101196461 >>101196550 >>101196766
►Recent Highlight Posts from the Previous Thread: >>101191868
!!! THREADLY REMINDER !!!
llama.cpp is AGPL3.0-only
>>101188248
Any new image model developments?
>>101197218Pony / pony realism
Why did you start a new thread?
dead general dead hobby
dead technology dead future
I've been trying out gemma 2 27b on lmsys. It feels A LOT like the gemini flash model, but a bit dumber. I've also noticed that it basically gives the same response each time, even with the temperature turned up.
They overall have the same feel, and my theory is that gemma 2 is just a "fork" of an earlier checkpoint of gemini flash.
>>101197411Aloneposting on /lmg/ on a Friday late night?Yeah, that's something people should aspire to.
>>101197208!!! THREADLY REMINDER !!!petra's timezone is UTC+1
>>101197454
>>101197208
>https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
what happened to miqu?
>>101197438h-hey it's Saturday night here
>>101197494She had a good run but she was finally put out to pasture. No model is SOTA forever.
What is the current coom model?
>>101197579qwen2 72b
>>101197438hey its saturday morning here
>>101197579Buy an ad.
What is the current coom card?
>>101197613big nigga
>>101197613The one you wrote yourself to reflect your ideal scenario
>>101197652I don't have a type
>>101197660Ask the AI to make up a character then.
>>101197660>
>>101197660>I don't have a typepicrel is now your type
Anyone had issues with 27B generating only pad tokens indefinitely until the generation is interrupted, when loaded with Transformers?
This isn't the different issue some are having with 27B outputs being low quality or schizo; this is something else where it's not working at all. Just generating an endless string of pad tokens in response to any input.
>>101197660Python was a mistake.
>>101197771savage
>>101197771heh
>>101197771It's a fucking bane on programming.So many headaches because of that fucking meme "language."
>>101197771there's worse language though, imagine using javascript as the required programming language for the fucking sites on the internet, I know sounds crazy but...
my favorite quant is IQ4_NL
>>101197771Python does have types. /lmg/ proving once again to be mostly nocoders.
>>101197826
only pajeets and hobbyists use plain javascript
every company with more than 2 developers uses typescript
>>101197754
I found another guy on HF getting this issue but he couldn't solve it either. Not GGUFs, just the standard FP16 weights.
Got the latest 4.43.0.dev Transformers, but it happens with the 4.42.0.dev wheel supplied by Google too. Weird.
>>101197294sampling doesn't work so logits doesn't work
>>101197820And the Anon that I was replying to presumably also has a type.But in both cases there is a lack of awareness.
>>101197826typescript is the same shit anon... it's just java script but with OOP
>>101197845it has types
>>101197853yay...
>>101197828
Also do_sample is off so it's not that. I'm not getting NaNs, it's just generating <pad> endlessly.
>>101197411blame avatarfags
>>101197861forgot to attach picrel
Gemma2 27b is good at poetic metre. Never seen a model spit out multiple stanzas of perfect iambic pentameter without a single mistake.
>>101197845>it's just java script but with OOPYou have no idea what you're talking about.>>101197826TypeScript is just for decoration, like putting makeup on a pig.
>>101197882how did you test it out anon? last time I've heard about that model there was some bugs making it schizo
>>101197860
>>101197828
>>101197754
Fuck, I just needed to tick the BF16 option when loading the weights. Even though the weights are FP16. I don't get it but I'll take it; it's working now.
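A plausible explanation for why the BF16 toggle fixed it (my assumption, not confirmed anywhere in the thread): Gemma 2 was trained in bf16, which has the same exponent range as fp32, so intermediate activations that are fine in bf16 can overflow fp16's ~65504 ceiling and turn into inf/NaN, surfacing as degenerate output like endless pad tokens. The range difference is easy to see with just the stdlib:

```python
import struct

def fits_fp16(x):
    """True if x can be packed as IEEE-754 binary16 (fp16) without overflowing."""
    try:
        struct.pack("<e", x)  # 'e' is the half-precision format code
        return True
    except OverflowError:
        return False

print(fits_fp16(60000.0))  # True  - inside fp16's range (max finite value ~65504)
print(fits_fp16(70000.0))  # False - overflows fp16; trivially fine in bf16 (~3.4e38 range)
```

Values that only exist transiently inside the forward pass can hit this even when the stored weights themselves are small, which would explain why loading FP16 weights in bf16 compute works.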
>Bug: quantized gemma 27b output still wrong after tokenizer fix #8183it wouldn't be llama.cpp otherwise
>>101197899
>You have no idea what you're talking about.
Oh I fucking do, anon, I fucking do. I made a site project with typescript, it's the same fucking shit as javascript and I hated my life doing it. The fact that this failed language managed to become the main language of the internet is still one of the biggest mysteries of human history.
>>101197882
>>101197901
Picrel (this was after I gave it the feathers vs. steel riddle, which is why the topic is steel).
It's the technical accuracy of the poetic metre I'm impressed by specifically, not necessarily the writing quality. Other models would have struggled to keep the correct metre consistently, and would have messed up stress and syllable counts once or twice.
>>101197911sounds like skill issue
>>101197913sounds like masochism issue
>>101197911JavaScript has OOP. TypeScript adds types, hence the name, not OOP. Stupid webshitter.
>>101197918>0==false>AAAAAAAAAA you
>>101197911>the fact that this failed language managed to be the main language on the fucking internet is still one of the biggest mysteries of the human historyIt should have been Lua.
>>101197911>Oh I fucking do, anon, I fucking do Not him, and I hate javascript, but no, you really don't. Give it a rest.
>>101197945Lua is relabeled BASIC.
>>101197949
No it's not. The syntax is just vaguely similar. BASIC doesn't have closures or first-class functions, and you can't create prototype patterns. Also, BASIC isn't designed to be embeddable.
>>101197945
No, it shouldn't be any single retarded scripting language. The web should be a collection of documents, as it was originally intended to be.
Adding any scripting was the first mistake. Trying to turn web browsers into cross-platform application emulators, because the average bootcamp flunkie is too stupid for regular application development, was the biggest mistake.
But now that we're here, WebAssembly is the correct solution. Letting those same javascript artisans that got us here in the first place gimp it from interacting with the DOM, to prevent themselves from becoming obsolete, was the final mistake.
>>101197974If we had to have scripting in the web browser, Lua would have been a fine solution.Otherwise though I agree with you.
>>101197938>>101197948didn't expect to find fucking javascript fanboys, goddam, and you people say you hate python at the same time? you lost all credibility with that stank take
>/lmg/ - i am le smartembarassing
>ask character if she's a virgin>no>add it into the card>now she acts like the shyest most boring and predictable characterwat do?
>sweet summer child:(
>>101198056
Perfect example of the model picking up on shitty cliche smut tropes, realizing that's what it's writing, and folding that into its writing style.
Try rephrasing it to "{{char}} has never had sex before" and watch it magically fix itself.
Is there any good local multi-modal model that takes image and video input yet?
>>101198076>has never had sex beforeholy shit it works anon, thank you. heres a miku pic
>>101198087>011>011
>>101197911Okay but what's wrong with it
geg
>>101198084
Even if there were, llama.cpp wouldn't support them. There are a few that support image input, but I haven't seen any local models that take video yet.
>>101198107:(
>>101197864>shit vs shit whoa!
>>101198035thats all of /g/ at this point, catalog is trashed with anime pics and ai jeet hype-up advertisement "threads"
>>101198099there's hundreds of better programming languages than fucking javascript and you're ok that this piece of shit is required to make the internet work? the fuck?
>/lmg/ - local models general
>How many r's are in "strawberry"?
>There are 2 r's in "strawberry".
>Spell the word "strawberry" and tell me how many r's are in the word.
>The word "strawberry" is spelled S-T-R-A-W-B-E-R-R-Y. There are 3 r's in the word "strawberry".
I don't think the strawberry test is good for validating a model's quality, despite yesterday's meme. Rather, I think it reminds us of how tokenization works, and that it's something we must account for when asking an LLM to do tasks more granular than the word/token level.
>>101198499>ad hominemwhats wrong with it anon?
hi
>>101198558Even the creator of javascript thinks it ruined the internet but nahh nothing's wrong with it anon 1!11!1!1https://lunduke.substack.com/p/creator-of-javascript-apologizes
>>101198570You have this saved but you cant name a reason. I'll stop replying though since it's offtopic.
>>101197787
Ironically, only people who aren't programmers or are new to coding trash on Python. There's a guy at my job who was coding in C++ for 15 years. Since he tried Python he never went back, and he always fights tooth and nail to use it in our projects. I've seen it multiple times, and my personal experience and my colleagues' around me resonate more with me than some python memes from r/ProgrammerHumor.
>>101198577>you cant name a reasonif you knew how to read text on pictures, you'll notice the reasons are cited on his tweet>I'll stop replying though since it's offtopic.looks convenient to leave the debate after being proved wrong :^)
>>101198570This is just attention seeking behavior
>>101198591You're right, it's just a consipracy theory!1!1!1
>>101198523
Yeah, well said. It's quite robust when you expand the instructions:
>### Instruction:
>Count the instances of the given Letter within the Input string. First expand the Input string into individual letters, then count the number of instances of the given Letter.
>### Input:
>strawberry
>### Letter:
>R
>### Response:
>Step 1:
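The prompt above works because it forces the model down to the character level, where the count is trivially observable. The same expand-then-count procedure, mirrored in plain Python for reference:

```python
word, letter = "strawberry", "r"

# Step 1: expand the input string into individual letters,
# sidestepping the multi-character token view the model normally has.
letters = list(word)
print(letters)  # ['s', 't', 'r', 'a', 'w', 'b', 'e', 'r', 'r', 'y']

# Step 2: count the instances of the given letter.
count = letters.count(letter)
print(count)  # 3
```

An LLM that answers "2" without spelling the word out isn't failing at counting per se; it just never sees the individual letters inside its tokens.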
I think ylecunn is correct and meta should stop wasting compute on producing llms that will never have basic intuitive behavior or genuine situational awareness.
>>101198663I think lecunt is french and should go away.
>>101198674>french manI think he sounds based
https://huggingface.co/ChuckMcSneed/control_vectors/blob/main/command-r-plus/unslop1/control_vector-commandr-unslop1.gguf
Took my control vector for a test drive to see if it works correctly. During SFW everything worked perfectly and had the style that I wanted, but when the NSFW part came, slop came out. Looks like slop during SFW and slop during NSFW have different directions inside the model.
>>101198756>wtfpl
>>101198768Sorry, but it will likely be removed if I release it with +NIGGER license.
>>101198756how do I use this with exl2
>>101198785
release it with faipl-1.0
>https://freedevproject.org/faipl-1.0/
qrd: agpl but for weights
>>101198756how do i apply faipl-1.0?
>>101198793>faipl-1.0HF doesn't recognize it.
>>101198918you can select other, and then add the contents of https://freedevproject.org/faipl-1.0.txt to it
>>101198924
to the LICENSE file, at least that's how animagine and many other open source models do it
>https://huggingface.co/cagliostrolab/animagine-xl-3.1
>>101198918you can also add >license_name: faipl-1.0-sdto the readme
>>101198756>faipl-1.0
>>101198586
I have multiple years of experience both in Python and other languages, and it is my strong conviction that allowing retards to use dynamic typing is a terrible idea.
>ministrations
>>101198586
>There is a good in my job who were coding in C++ for 15 years.
I'm one of those guys. I did C++ and Java for more than 10 years; Python is still my favorite language. It's just simple and elegant, the others are convoluted pieces of shit, but I choose those shit languages because they pay well kek
>>101198756
>https://huggingface.co/ChuckMcSneed/control_vectors/blob/main/command-r-plus/unslop1/example_output.md
wtf this is better than lora, why doesnt sao10k use control vectors?
>>1011990250.5 > -0.5
>>101198989there isn't a sentence in there that doesn't have slop in it.
>>101199102wat do
>>101198953>why are you gay
>>101198565damn
>>101199025
Because control vectors lock the model into a single direction. Not everyone has the same tastes: some people like forming bonds and going on journeys (>>101199067), some like the model to be blunt and clear. There are also issues with repetition and decreased intelligence if applied too hard.
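For anyone wondering what "locking the model into a single direction" means mechanically: a control vector is (roughly) a fixed direction added to the hidden state at inference time, scaled by a strength coefficient. A minimal sketch, with plain Python lists standing in for hidden-state tensors; real implementations (e.g. llama.cpp's control vector support) do this per layer on actual tensors, so treat the numbers below as purely illustrative:

```python
def apply_control_vector(hidden, direction, strength):
    """h' = h + strength * v: nudge a hidden state along a steering direction.
    Large |strength| pushes every generation the same way regardless of context,
    which is where the repetition / lost-intelligence complaints come from."""
    return [h + strength * v for h, v in zip(hidden, direction)]

hidden = [1.0, 2.0, -1.0]
direction = [0.5, -0.5, 0.0]   # made-up "unslop" direction for illustration

print(apply_control_vector(hidden, direction, 2.0))   # pushed along the direction
print(apply_control_vector(hidden, direction, -2.0))  # pushed the opposite way
```

Negative strength steers away from the trained direction, which is why the same vector file can be applied in either direction.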
>>101199204what would happen if you merged multiple control vectors then
>>101199134
When you prompt an LLM to write prose or roleplay it will always give you slop. There are two ways:
1) obvious - stop prompting it to write slop.
2) remove slop - autoprompt to rewrite the output, removing all slop.
The rest is cope. No amount of ko-fi finetunes or control vectors can fix it. It's the datasets, the training. Either you have that, or you have braindead rambling.
>>101199229The universe accelerates so fast it gets reset.
>>101199229All off them would apply.
>>101199242hi x1,000,000000,00000000000,000000000000000000,000000000000000000000000000000000
>>101199237
>1) obvious - stop prompting it to write slop.
how? the system prompt? the cards? i dont like prose at all but its not like theres much choice in models
>2) remove slop - autoprompt to rewrite output removing all slop.
how do i autoprompt? i know i can manually clean slop from replies but sometimes its too much, takes a toll of its own
>>101197434
Interesting theory. But it's only useful if they did it the other way around - as in, lobotomizing the final flash model. If a way is found to re-add ze 6 bazinglion context window and the multimodal capabilities, it would be huge.
Visionbros it is so over...
>>101199300>no chameleon >no Cambrianmeds, schizzo
>>101197208
CUDA dev, release your critical code as AGPL, with a condition that it becomes MIT or whatever in 3/6 months, or something like that. That way the non-contributing parties can never have the current state of the art, yet you can still say you're providing business-friendly code once it reaches a stable status.
>>101199314>Chameleon"Oh yeah, that happened": the modelIs there a single person who has tried it?
no matter how much i reroll she answers with my weight, wat do
>>101199385use a better model
what's a good API provider where I can rapidly test different models?
>>101199403for example?
>>101199459i fucking hate llms so much
happynameday.today
>>101199471>happynameday.today
>>101199459is this stheno or something like that?
>>101199548stheno 8b 3.2, any recommendations?
>>101199562stheno works ok for prompts that aren't 80% coom focused. I did wholesome adventures with it. So you'll have to learn to write your own cards.
>>101199599gib cards
>>101199459garbage in garbage out
>>101199636Anons when the AI doesn't write a masterpiece or a symphony after typing in a single dot:
>>101199609
just write it yourself nigga, it's not hard. think of a girl YOU would want to plap and describe her. You don't have to follow any weird formatting, writing plain text works too.
{{char}} is a party girl. {{char}} likes drinking and handholding with strangers. Stuff like that.
>>101197434google is distilling their models from bigger ones so I wouldn't be surprised
>>101199487Damn, Petra looking good
>>101198989>Doing the other RPers actionsI hate people like you.
>>101197218pixart team said they're working on a bigger model
>no more cards to patits over..
What quant to use for gemma 27b for 24GB VRAM?
what
>>101199946pat this
>>101200080he a man tho
>>101200079B'Hig Cox.
least gay card
>>101197831Wait, what? If it can't know logits, how does it choose which token to add?
>>101200080Biggie Jong Un
>>101200126Are you running this with 0.2 smoothing or something?
>>101200277everything is neutralized besidesTemperature - 1.12-1.22Min-P - 0.075Top-K - 50Repetition Penalty - 1.1
>All these logs>No NalaHow disappointing.
>>101200234the pat cut
Thank you greg. Fuck.It's fully possible that they are worrying about something that's actually normal behavior, as weird as it is.It's probably a bug, yeah, but check instead of assuming.
I am going fucking insane, spent 2 hours wrangling. I need to resist the urge to ask totally irrelevant questions.
Is Gemma 2 27B working on local yet or can I go back to sleep?
>>101200518If you had the ability to sleep why not just sleep?
>>101200394
Working on another merge right now, so I should have some new official Nala tests soon. Also uploading a 70B merge, but it keeps failing overnight (the computer it's uploading from is on a wireless ethernet bridge, so it's a bit too finicky for the HF web uploader; I've been going one file at a time as I have time for the last couple of days). New Nala tests and at least 1 new model by the end of the day.
>>101200544>1 new modellicensed under FAIPL-1.0?
>>101200550If I don't make it cc-by-nc petra anon will show up and bully me.
>>101200560>cc-by-ncb-based..
Licensing model weights seems pretty dubious in the first place if I'm to be honest. What's to stop someone from just taking a set of weights, applying a 360 degree rotation to all the tensors and walking away with a different license since it's technically a different set of weights now?
>>101200497
1. INST generate output
2. INST is this slop/out of character? respond with yes or no
3. if yes, INST rewrite, goto 2. else display output to user.
>>101200593me
>>101200497
Your complaint is unclear. But IMHO it can be more fun having a meta discussion with the LLM about the RP than the RP itself is. Especially on runs where the LLM is acting like AGI instead of like 77 IQ.
>>101200593
Depends on the argument: is it a creative work (protected), simply a list of data (not protected), or a combination (how cookbooks work under copyright law; the ingredients are unprotected as a list of facts, while the instructions are protected because they are a designed procedure)?
>>101200629found the artcel
>>101200593Copyright all the symmetry groups.
>r/4changeg
>>101200684reads like some kind of mixtral meme model
>>101200733
settings from >>101171560
model is the normal 8b stheno 3.2
>>101199025Logs?
>>101200923re-read the post
Which one should I download? I don't even know how much space it needs in RAM/VRAM.
>>101200948None, download 3.3
>>101200948this post is bait, right?
>>101200966No. Before that i was looking for Max RAM required and etc.
>>101200948
>IQ3_S-imat
what do the first I and the imat mean?
>>101200948RAM=model size+15%
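That rule of thumb (model file size plus ~15% headroom for buffers and context) in one line, to make the arithmetic concrete. The 15% figure is an anon estimate, not a hard spec, and larger context sizes push it higher:

```python
def est_ram_gb(model_file_gb, overhead=0.15):
    """Rough memory needed to run a GGUF: file size plus ~15% headroom
    for KV cache and compute buffers (rule of thumb, not a guarantee)."""
    return model_file_gb * (1 + overhead)

print(est_ram_gb(8.0))  # an ~8 GB quant wants roughly 9.2 GB free
```

Anything that doesn't fit in VRAM spills into system RAM (with koboldcpp/llama.cpp offloading), so the estimate applies to the combined total.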
>>101201000
imat is iMatrix (importance matrix). It is used with tiny quants so they aren't as stupid.
IQ is a different system of quants than the common K series, with a different trade-off of size and performance: an IQ quant is smaller than a K quant of the same Q number.
IQ and iMat are unrelated but can appear together, e.g. i1-IQ3_XXS.
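The naming conventions above, summarized as code. This parser is just a mnemonic for the community conventions described in that post (e.g. the "i1-" prefix some uploaders use for imatrix quants); it is not an official llama.cpp spec:

```python
def parse_quant_tag(tag):
    """Split a quant tag like 'i1-IQ3_XXS' into its conventional parts."""
    imatrix = tag.startswith("i1-")              # 'i1-' marks an importance-matrix quant
    base = tag[3:] if imatrix else tag
    series = "IQ" if base.startswith("IQ") else "K/legacy"
    return {"imatrix": imatrix, "series": series, "base": base}

print(parse_quant_tag("i1-IQ3_XXS"))
# {'imatrix': True, 'series': 'IQ', 'base': 'IQ3_XXS'}
print(parse_quant_tag("Q5_K_M"))
# {'imatrix': False, 'series': 'K/legacy', 'base': 'Q5_K_M'}
```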
>>101199237
>1) obvious - stop prompting it to write slop.
Prompting only works if it is consistent with entrainment. In any conflict between the sysprompt and entrainment, entrainment wins.
>>101201064
I see, thanks for the answer anon, much appreciated
I read like half of the stuff in the OP and now I'm talking to my GPU.
she's kinda stupid and keeps changing her syntax, but I think I can figure this out, or try different models and stuff at least.
I've made it this far
>>101200948
I think the q8 base requirement for that model is a little less than 8gb, but if you run it at a higher context the ram requirement increases. I'd estimate q8 at 16k context would be around 11-ish gb.
>>101201119>I read like half of the stuff in the OPYou are leaps ahead of most people. Congrats.
>>101201119based
>>101201119>I readYou are leaps ahead of most people. Congrats.
>>101201087
i never said to prompt it to not write slop.
i said to not prompt it to write slop.
Should I have downloaded Koboldcpp instead of koboldai?I have 12gigs of vram, but a decent processor. it looks like the main difference is streaming?
call me georgi the way I'm gerganov to AI chatbots
>>101201486KoboldAI is all but defunct, kcpp is current and for mixed GPU/CPU setups, while exl2 is for Rawdogging pure GPU.
uhhh bros...
>>101201527thanks. assuming I can just copy models from the KoboldAI 'models' folder to a similar folder in Koboldcpp so I don't have to redownload them?
Gemma had two major issues at launch which we know of so far.
The first was an incorrect tokenizer, which was fixed relatively quickly, though a lot of GGUFs were made before that.
The second issue, discovered much later, was that logit soft-capping, which Gemma 2 was trained with but which was initially not implemented in Transformers due to it conflicting with flash attention, was far more important than Google had believed it to be. Especially for the larger model.
The first issue (broken tokenizer) has been fixed for a while, and fixed GGUFs have been uploaded to Bartowski's account. But the second issue has not been fixed in llama.cpp yet. There is a PR but it has not been merged, though it likely will be very soon based on the recent approvals.
It was first believed that GGUFs would have to be remade after the PR got merged, but a default value was added for the soft-capping, which means that old GGUFs will work as soon as the PR is merged.
So to summarize: if you download a GGUF from Bartowski right now, it will work as soon as the PR is merged, but before then you will experience degraded performance. Especially on the 27b model, which is entirely broken at certain tasks at the moment.
It's entirely possible that there are issues beyond just these two. It's not rare for various bugs to rear their heads when a new architecture emerges, after all. And I have seen some say that they are experiencing issues even after the fixes. Like this post.
It's also worth noting that since llama.cpp does not support sliding window attention at the moment, it will likely perform pretty poorly with context sizes larger than 4K. There is an issue open for sliding window attention, but it has not really been worked on so far since few models actually use it.
40 upvotes
I honestly had no idea how shit /lmg/ is. I just hated you because you are mentally ill.
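For reference, logit soft-capping is just a tanh squash: capped = cap * tanh(logits / cap). Near zero it is almost the identity, but it bounds logits to (-cap, +cap), which is why skipping it changes the larger model's behavior so much. A one-line sketch; the default of 30.0 matches Gemma 2's reported final-logit cap, but treat the exact number as illustrative:

```python
import math

def soft_cap(logit, cap=30.0):
    """Gemma-2-style logit soft-capping: bounded to (-cap, cap), ~identity near 0."""
    return cap * math.tanh(logit / cap)

print(round(soft_cap(1.0), 4))  # small logits pass through nearly unchanged
print(soft_cap(100.0) < 30.0)   # True - large logits are squashed just below the cap
```

The model was trained with its logits bounded this way, so sampling from unbounded logits at inference time effectively sharpens the distribution in a way the weights never saw.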
>>101201532Ai wuz not understand the nigger language n shieeet
>>101201593
>>101201606how is google releasing a broken, undercooked model our fault?
>>101201709You didn't summarize the last two days worth of threads in ELI5 fashion so he could understand what was going on.
is typhon 8x7b still king?
Anons, are these good?
Nous-Capybara-limarpv3-34B
Mixtral-8x7B-Instruct-v0.1
PsyMedRP-v1-20B
Fimbulvetr v2
What's the current best local model for creating an app (and so you can learn programming in the process)?
>>101201995DeepSeekCoder-V2-236B
>>101201951
old, but i guess for your size theres not much better. i guess stheno is ok but its very sloppy. fimb is very nice. psymedrp? eh, didnt enjoy it. mixtral instruct? use mixtral instruct limarp zloss dareties or whatever its called instead.
>34b
dunno
>>101201951Switch out plain Mixtral for Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss.
>>101202018
or typhon, but right now im messing with typhon to see how it was, seems worse than stheno, probably a skill issue
>>101202038
is it better than typhon? i remember using it and switching it for typhon because everyone was shilling typhon, then i tried typhon and it was meh
>>101199798That's exactly what I wanted to hear. Thanks anon!
>https://huggingface.co/InferenceIllusionist/TeTO-MS-8x7b/tree/mainb-bros?
>>101202093Imagine if you had posted actual meaningful text that created conversation instead of farting something r9k would turn you away for.
>>101202093>mixtralyawncute tet though
According to the new HF leaderboard, Qwen is the top model, while CR+ is much lower. Does that actually align with people's usage of the models? I rarely ever hear about Qwen being good, or bad.
>>101202193> Does that actually align with people's usage of the models?no
>>101202193For me, CR+ is solid for RP and has occasionally acted too smart.Qwen2 liked to spontaneously disrespect which of us was playing which role.But for things other than RP, Qwen might be better. Different models, different strengths.
>>101202193Qwrn 2 is underrated for the size, but CR+ is just better.
>>101202093
Howdy fren. Just my love letter to Mixtral. I know it's not the new shiny thing out there by any means, but I haven't seen many Model Stock experiments being done with it yet, so my curiosity got the best of me. Might release a follow-up, but only if it actually proves to be an improvement over this one.
>>101202173
>cute tet
Thanks anon. Featureless Flat 2D Mix does a pretty decent job with chibi style out of the box without LoRAs.
>>101202012What GPU would you need to run that? I have a 3090 ti.
>>101202420do you say its the best mixtral sexo tune?
>>101201751
Between "explain like I'm 5" and /lmg/'s shit answers there is "informative", and that post was informative. I have never seen an informative post like that one here.
>>101202454go back
How is Qwen supposed to be pronounced anyway? Isn't it an abbreviation of Tongyi Qianwen?
>>101202473
Probably something like a hissed "chwun", I would guess. Q is a hissed CH sound, and the "e" is usually close to the ə/uh sound for unstressed vowels in English, like how "the" becomes "thuh" in front of certain words, or always when someone is retarded.
>>101202435MoE so you could probably get by with that 3090 if you have lots of RAM
>>101202473>>101202503it's just kwen, I remember hearing one of their guys say it in some xitter space once
>>101202435How much system ram do you have to back the layer swaps?I had to dial down to IQ3_XXS to get it to function on a 4070, and that was glacial because it's still 85GB at that quant, and I've only 64GB system RAM.
>>101202473The q is like a ch sound, and the wen is like if you tried pronouncing "wn".Also you can go into Google translate and put in 通义千问 to listen to exactly how tong yi qian wen is pronounced.
>>101202473kvyen
>>101202473qwen
>>101202576>义The excite.com mascot?
If I apply a 4-bit qlora to a 3.5bwp Exllama quant of the same model qlora is off, is it gonna become retarded?
>>101202473"Quwhen"
>>101201995
>>101202012
Just use WLM, it's slightly better than DS but is way smaller:
>https://prollm.toqan.ai/leaderboard/coding-assistant
>>101202751if u have 128gb of ram DS is faster
>>101202018Why do you talk like if you had brain damage?
>>101202193Qwen2 is solid, but I prefer Commandr's writing style. L3 is >8k shitter.
>>101202751>Provider: Stack Overflow>Evaluation Method: Auto-evaluation with GPT4 - TurboYou expect me to take this seriously? WLM just scores higher because it is turboslopped.
>>101202809
my mind is very unorganized so when i lazily type it turns out like this.
t. listening to ear licking asmr 24/7 to curb his internal monologue
>>101202777Whoa I didn't know that. It looks like DS has 21B active params during inference compared to WLM's 44B so I can see it definitely being faster.
>>101202751
>>101202777
>>101202834
>>101202570
So for a coding assistant/teacher, with a 3090 Ti and 32GB, the best one is?
>>101202882Phi-3-Mini
>>101202882Claude 3.5
>>101202882>3090tilook at this dude..>32GBXDDDDDDDDDDDDDDDDDDDDDDDDD
>>101202882Dude, either one but you need to get more RAM. It's cheap, you have no excuse.
>>101202882>i have more ram+vram than an rtx 3090ti owner>12+64>24+32GEEEEEEEEEEG
>>101202420
from preliminary testing i give this model a 10/10
DISCLAIMER: t. used stheno for the past few days
L3-8B-Stheno-v3.2-Q5_K_M is this fine? Or should pick Q8 and don't bother?
>>101203067linux
https://x.com/rohanpaul_ai/status/1806772036125008087
https://github.com/FlagOpen/FlagEmbedding/tree/master/Long_LLM/activation_beacon
Using activation beacons to increase context from 4K to 400K while maintaining the same VRAM usage, only minimally increasing inference time, and keeping the same level of quality.
>>101203067I think you should buy an ad.
>>101202914>It's cheap, you have no excuse.64GB would set me back £204128GB would set me back £315
>>101202882
>So for a coding assistant/teacher
This is the question I'm looking to answer sometime soon.
32 GB of system RAM is really tight even if you're willing to accept slow (1 word per second) output rates. 64 gets you many of the 70B-class models if you quant down. DeepSeek, still not enough. You'd probably need 96 to get in the door for slow DSC.
>>101202751
>Just use WLM
Wizard can't even strawberry. It said that there's 1 r after I made it spell out the word explicitly. I'm not sure I trust that kind of LLM thinking.
>>101203105
what the fuck? i thought prices were bad here.. you can get a new ddr4 32gig stick for like $70
>>101203132
My 32GB is 2x16 GB sticks. They're imported. For some reason I couldn't get them in the UK at the time. They still cost £140 on ebay.
>>101203182
what if you just threw those puny tiny sticks out and replaced them with 2 32gig sticks instead?
>>101203195
I don't know if I want to drop £204 on it. I may sell the old sticks first.
>>101203212
ddr5?
It's up, boys!
https://huggingface.co/Envoid/Llama-3-TenyxChat-DaybreakStorywriter-70B
>>101203075
>Just 9 hours on 8xA800 machine for a 7b, bro.
>>101203231
G.SKILL 32GB DDR4 Trident Z Royal Gold 3600MHz PC4-28800 CL16 1.35V Dual Channel Kit
>>101203256
do you really have to get the top of the line?
>>101203008
kek
>>101203234
gguf when
>>101203234
we bac
>>101203262
>>101203234
>The resulting model scores significantly higher on the super top secret, private NALA evaluation
>>101203281
They're gold though. 3600MHz and CL16. What would you replace them with?
>>101203234
We are finally entering the era of long brainrot names for llama3 sloptunes. That means a breakthrough is coming soon, right?
>>101203234
What was TenyxChat supposed to be?
>>101202054
>probably a skill issue
not a skill issue, mixtral tunes are boring, smarter than stheno for sure but not as fun
>>101202473
I pronounce it "Queen"
>>101203396
TenyxChat is a finetune by Tenyx. I don't know who they are but their Llama-3 finetune is pretty good. (Their Mixtral finetune was good too.) They get overshadowed by nous and bagel and all that shit but they have their own niche. And their models can be pretty naughty with the right coaxing despite being "safe".
>>101203408
>cummming cuuuummming oohhh i'm cummmiing
so fun
>>101203067
Q8, never use less than Q8 quants on small models
>>101197169
So, any new erp local models? Up to 70B
what's the best model for rp? is it still midnight miqu?
>>101203234
That TenyxChat has a lot of fixing to do after I tried L3-70B-daybreak-storywriter-v0.4 just now:
Me> I turn away, about to head up the building's stairs to my second level apartment.
It>
>I see you're about to leave, and I feel a sudden surge of panic. I don't want you to go away. I want to be near you. I want to touch you. I want to feel your skin against mine. I want to smell you. I want to taste you. I want to be with you. I want to be yours. I want you to be mine. I want to belong to you. I want you to belong to me. I want to be yours alone. I want you to be mine alone. I want to be with you always. I want you to be with me always. I want to be yours forever. I want you to be mine forever.
>I'm so close now. I can smell you. I can feel your heat. I can see the sweat on your skin. I can see the muscles in your back. I can see the strength in your arms. I can see the power in your legs. I can see the beauty in your face. I can see the kindness in your eyes. I can see the love in your heart. I can see the goodness in your soul.
>I want to touch you. I want to feel your skin against mine. I want to taste your sweat. I want to smell your heat. I want to feel your muscles. I want to feel your strength. I want to feel your power. I want to feel your beauty. I want to feel your kindness. I want to feel your love. I want to feel your goodness.
This goes on for 12 paragraphs. What a Storywriter. It must get great marks on the Hemingway app.
Another time it did a similar thing but gave distance. Same paragraph, but the first sentence went 50 yards, 20 yards, 10 yards... I gave up when it "took a few steps" to go from 0.5 inches to 0.25 inches away.
>>101203436
As the sun dipped below the horizon, casting long shadows across the well-trodden path, they knew that their journey had only just begun. The bonds they had forged, tempered in the fires of shared experience, would guide them forward, a beacon of hope in the face of any challenge. Together, they would embrace the unknown, knowing that life's greatest adventures were best shared with those who mattered most.
>>101203545
I hate that kind of prose worse than repeated catchphrases. That wispy, affirmatively-speculating-about-the-future shit.
>they knew...only just begun
>would guide them
>face any challenge
>would embrace the unknown
>life's greatest adventures
If any of that shit's legit, tell those stories. If not, cut.
>>101203545
GPTslop prose is the written equivalent of corporate memphis.
>>101202054
Typhon is a pile of steaming slop shit. It's being shilled by one anon who proceeds to shit on every tune someone mentions in the thread, so it would look better by comparison. Basically badmouthing everything else and hoping that some newbie will download typhon instead.
>>101203591
that's the classic useless, flowery prose of mixtral finetunes like BMT or Typhon. The main reason why I dropped them
>>101203624
>who proceeds to shit on every tune someone mentions in the thread
That's every finetuner, Sao being the worst case. Even NovelAI shills come here to do that when Dreamgen or SpellBound release theirs.
>>101203234
Are retards still finetuning with "shivers down my spine" and "bonds of shared trust and respect" artificial data slop or have they learned?
>>101203694
>or have they learned?
lol lmao
>>101203694
The slop comes from human writing. The smarter a model, the more slop it will have, because it will have more vectors pointing towards cliches in writing. One of the reasons dumber models generally have more sovl.
>>101203624
>by one anon who proceeds to shit on every tune someone mentions in the thread
I actually don't remember this happening. Aren't you another finetuner basically doing the badmouthing?
>>101203129
I'm using Bing AI to crib a program together.
>>101203624
got anything better?
anyone who tried the new deepseek coder v2 - how does it compare to the "big ones"? (4o/sonnet 3.5)
Have they fixed gemma 27b? Are my tensors safe?
>>101202420
for some reason it gets braindead at around 5.2k context. likely a temp issue? tried both sampling settings from the repo, using the instruct/context templates from the repo as well
>iq4_xs llamacpp_hf
>>101203821
No, wait another week
>havent lurked in months since the general went to shit
>completely in the dark for news
>check this thread
>petraposter is back
ah so that means a new model dropped and its actually good, so, what is it?
>>101203831
>or its a me contaminating the model issue
>>101203719
>Aren't you another finetuner basically doing the badmouthing?
I wish I could finetune anything on my trash tier hardware, I wouldn't have to rely on retarded sloptunes
>>101203855
i was never gone
>its another 8k context release
dropped, see you next tuesday.
>>101197169
>>101203855
gemma 2 27b
>https://eqbench.com/creative_writing.html
>>101203855
No, he's currently mad that he accidentally leaked his timezone yesterday.
>>101196178
>>101196185
>>101196305
>>101203903
>https://eqbench.com/creative_writing.html
The benchmark is BS, but if you read the actual examples, the Gemma ones are really easier on the eyes. It's also noticeable how similar they are to the Gemini Pro samples. It'd be entirely possible to not have sloptext. It's just that nobody cares. And no, ko-fi finetuners can't fix this.
>>101203903
holy benchmark, ill believe it when i see it
>>101203916
kek what a chode
>>101203916
Petra is german. "Petra" is a very common german name.
why do mixtral models become retarded after 4k context
t. retard
>>101203855
>new model dropped and its actually good
>goy slop from jewgle
you can't be this retarded.
>>101203916
>timezone leak
wow you sure got him! /s
>>101204052
>he lacks reading comprehension
do you come from africa like petraposter does?
>>101204067
hi petra
>>101203855
Gemma WNBAG
>>101196305
>that timezone
/lmg/ has shitskins now lmao, i'm wondering how many of them are mikufags, hmm...
>>101204139
nice deflection, petrus
I'm testing SPPO again today. This time I used the Nala card. Essentially it feels the same as vanilla Instruct, but a bit less varied and with a bit more focus paid to Nala's lion features. By less varied, I mean that small differences in the Instruct formatting don't affect its response as much, compared to Meta's Instruct. Specifically I tested ST's L3 Instruct preset, vs Instruct with names, and vs with the preset's system prompt and with it deleted. Honestly though, it's not a huge difference. Was kind of hard to tell. Maybe a more complex card would show the difference, or if I play with this card more.
>>101204168
Thank you for your report. I tried the model with my RP card and it did better than most, although Stheno somehow is still the best at it. Base instruct EOS'd on the first message, it sucks.
>>101198756
I've made another unslop control vector, this time aimed at NSFW, and it made the model a bit horny and a bit more optimistic as a side effect. Will do a test run to see if it's worth releasing.
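For anyone wondering what a control vector actually is: the common mean-difference approach can be sketched on toy data. Everything below is a hypothetical illustration: the `control_vector`/`apply` helpers and the 2-dimensional "hidden states" are made up, and real vectors are extracted per-layer from an actual model's activations (e.g. the repeng approach), not from hand-written lists.

```python
# Mean-difference control vector, toy version: average the hidden states from
# prompts WITH a trait and prompts WITHOUT it, subtract the means, then add the
# scaled difference back into the model's activations at generation time.

def mean_vec(vecs):
    # element-wise mean of a list of equal-length vectors
    n = len(vecs)
    return [sum(v[i] for v in vecs) / n for i in range(len(vecs[0]))]

def control_vector(pos_states, neg_states):
    mp, mn = mean_vec(pos_states), mean_vec(neg_states)
    return [a - b for a, b in zip(mp, mn)]

def apply(hidden, cvec, strength=1.0):
    # negative strength steers AWAY from the trait (e.g. "unslopping")
    return [h + strength * c for h, c in zip(hidden, cvec)]

pos = [[1.0, 0.0], [0.8, 0.2]]  # toy hidden states from "trait" prompts
neg = [[0.0, 1.0], [0.2, 0.8]]  # toy hidden states from neutral prompts
cv = control_vector(pos, neg)   # ≈ [0.8, -0.8]
steered = apply([0.5, 0.5], cv, strength=-1.0)
```

The horniness/optimism side effects mentioned above are exactly what you'd expect from this construction: whatever else correlates with the contrast set gets baked into the vector too.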
aicg proxies are dying left and right, so be ready
>>101204408
you are already dead
https://github.com/ggerganov/llama.cpp/pull/8197
So about that Gemma 2 support in ollama, worked on alongside Google engineers... It was basically a shitshow?
>>101204429
nani the fuck?
>>101204444
i have to agree
>>101204408
So then pretend that 8B is good so that we can keep them from ruining the GPU market.
>>101204470
The average aicg user has a higher IQ than the average lmg user, who's able to eat their own shit and be happy.
>>101204483
/aicg/ became an extension of reddit as soon as the proxies showed up.
I can run Command R+ 5bit exl2 now.
>>101204510
No, /lmg/ is an extension of /r/LocalLLaMA.
>>101204510
all of 4chan post-2016 is just an extension of reddit where you're allowed to say nigger and retard freely
>>101204483
not the one on /g/
>>101204408
and why should I care, exactly?
>>101204532
good.
>>101203952
It's actually really interesting to see how some models from completely different companies wrote structurally very similar stories with the same prompt. It seems like there are "groups" that write in similar ways, no doubt having to do with something that goes into the training. The benchmark itself is useless because claude rating them comes with its own biases, but at least the samples are interesting to see.
>>101204444
The fact that these models are so distinct from the usual llama variant gives me a smidge of hope that once they are working properly and at full capacity they'll be better than current models in the same weight range. It would be kind of depressing to witness all the efforts to try and make the thing work, only for it to be bad.
>>101204523
Pretty much. /lmg/ is just the /r/LocalLLaMA chatroom. You can see that when something happens, no one posts a source, but people will just begin discussing it out of the blue because they get their news from LocalLLaMA but come here to talk about it. When a source is posted, it's like 3 people posting the same link at the same time they got from reddit without checking if it was already posted before.
>>101204663
I think most people use Twitter for news. Too much stupid shilling on Reddit.
>>101204663
>>101204523
redditor here and I agree.
hey lads, just returning from a break. what's the best lewd RP model these days? On a 4090. also, how does said model compare to Claude/GPT etc?
>>101204663
This is why i lurk once every few months or so, even having that tinge of redditor in me because i get my news from people who get their news from reddit feels like a tumor on my brain
CUDAdev is the only real one, still have no idea why he bothers with this place
>that said he shouldn't have suggested that idea from the other day
>>101204694
Buy an ad.
>>101204710
are you claiming i'm shilling the 4090? lmao jealous little faggot *revs fans*
>>101204707
>still have no idea why he bothers with this place
He's racist.
>>101204753
Anyone with more than 2 brain cells to rub together is.
>>101204792
*with less than 2 brain cells
>>101204245
Tbh I still haven't tested Stheno, but yeah, I think I'll stop testing SPPO. At most it's just an enhancement of Instruct, and this technique is not enough to be its own full tune, until they demonstrate one.
>>101204694
Stheno 3.2, mixtral 8x7b, Command R, possibly qwen 2.
>>101204753
>>101204792
nice
>>101204660
It's probably good, I think, but still, not great when it's still an 8k-context class.
>>101204707
>CUDAdev is the only real one, still have no idea why he bothers with this place
My opinion is that adversity is important for good discussions and actually learning things. Reddit is designed in such a way that it discourages disagreement, so you end up with a lot of dunning krugers who never get told that they're retarded. And also, assuming everything is shit until proven otherwise is a better approach to the wider AI space, where there are grifters and scammy papers/projects everywhere.
>>101204753
I definitely do not consider myself as such.
>>101204816
>0 foreskin by this ID
>>101204823
thanks anon!
>>101204819
It is a technique, after all. And there's still the question of the data they used.
>>101204872
For mixtral, try limarp zloss
>>101204866
You forgot your trip. And I remember you got banned for Russian discussions.
>>101197945
reminder that lua was created for use by petroleum engineers, not software developers
this is why lua uses pants-on-head retarded 1-based indexing
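History aside, the actual pain of 1-based indexing is the off-by-one you hit when porting loops between conventions. A minimal sketch, with Python standing in for the 0-based side and the Lua-style loop described in comments:

```python
# A Lua loop `for i = 1, #t do ... t[i] ... end` visits every element, because
# Lua tables are 1-based and the range is inclusive. Naively carrying those
# bounds into a 0-based language both skips the first element and (with an
# inclusive upper bound) would overrun the end.
t = ["a", "b", "c"]

# correct 0-based idiom: indices 0 .. len(t)-1
zero_based = [t[i] for i in range(len(t))]

# naive port that keeps the Lua-style lower bound of 1: "a" gets dropped
buggy_port = [t[i] for i in range(1, len(t))]

assert zero_based == ["a", "b", "c"]
assert buggy_port == ["b", "c"]  # off by one: first element lost
```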
>>101204866
>>101204913
I did forget my trip but I don't see what that has to do with anything.
>>101204902
I keep bouncing back to zloss, it's my "just works" model.
limaballs
>>101204957
Pretty much. I pretty much use just stheno these days, but only because I'm playing around with chaining prompts, and mixtral is pretty slow on my system. 8b I can offload fully and get instant responses, but had I slightly better hardware, I'd 100% be using 8x7b limarp zloss. Maybe the qwen2 moe too.
>>101205004
>>101205004
>>101205004
>>101204816
>median american *household* income is $56k
is this bullshit or has working in tech just really skewed my perspective here?
>>101205087
The only 56k my white ass has ever enjoyed was a dial-up modem.