/g/ - Technology




File: ComfyUI_05091_.png (267 KB, 1024x1024)
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>102862101 & >>102849995

►News
>(10/18) New research, models, and datasets from Meta FAIR: https://ai.meta.com/blog/fair-news-segment-anything-2-1-meta-spirit-lm-layer-skip-salsa-lingua
>(10/18) bitnet.cpp: Official inference framework for 1-bit LLMs: https://github.com/microsoft/BitNet
>(10/18) DeepSeek releases Janus-1.3B with multimodal understanding and generation: https://hf.co/deepseek-ai/Janus-1.3B
>(10/16) Ministral 8B instruct model released: https://mistral.ai/news/ministraux
>(10/15) PLaMo-100B: English and Japanese base model: https://hf.co/pfnet/plamo-100b

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Programming: https://livecodebench.github.io/leaderboard.html

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
>>
File: 1729268279039.jpg (40 KB, 384x384)
►Recent Highlights from the Previous Thread: >>102862101

--Paper: Human data improves NLP model performance over synthetic data:
>102869101 >102869424 >102869479 >102869492 >102871323 >102871345
--Papers:
>102868813 >102869035 >102869230
--Comparison table of AI model training computers from LifeArchitect.ai:
>102875215
--Nemotron excels at RP, but has formatting issues. Llama 3.1 Instruct used with specific settings and rules for roleplay on SillyTavern:
>102862259 >102862990 >102863031 >102863176 >102863268
--Nvidia's Sana: High-resolution image synthesis with linear diffusion transformers:
>102867726 >102867759
--Meta FAIR research dump includes open source language models, object segmentation, and more:
>102874089
--Low quality erotica for training AI models, with mixed opinions:
>102864868 >102864913 >102864965 >102865273 >102865338
--Nemotron excels at roleplay and creative writing, not knowledge:
>102862255 >102862902
--Nemotron 70B: Unique prose, fun, but dumber than Largestral with logical errors:
>102865433 >102865448 >102865676 >102866355
--Nala test with bitnet inferencing has issues:
>102874688 >102874747 >102875041 >102875065 >102875112 >102875139 >102875221 >102875291 >102874871
--Meta releases new models and datasets, including a strong generative reward model:
>102875631 >102875682 >102875854 >102876015 >102876444 >102875768
--Importance of trivia knowledge in AI models for creativity and references:
>102864729 >102864831 >102864867 >102864870 >102864958 >102865244
--INTELLECT-1 training run pace increases:
>102867630
--Excitement over Janus-1.3B and BitNet releases:
>102873151 >102873169 >102873216 >102873238 >102873257 >102873267 >102873640 >102873335 >102875142
--Miku (free space):
>102871525 >102873858 >102874140 >102875545 >102876039

►Recent Highlight Posts from the Previous Thread: >>102862116

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script
>>
>>102876560
Chat, what does this mean?

https://github.com/xjdr-alt/llmri/blob/main/plots.ipynb
?
>>
>>102876610
buy an ad
>>
Australian spring will be LLM spring as well.
>>
File: 1.png (109 KB, 1833x879)
INTELLECT-1 at 11.03% complete
>>
>>102876754
do i need a h100?
>>
>>102864913
Where did the soul go...
>>
Speculative decoding is a meme for local. The gains only show up in coding and other repetitive contexts, while it wastes more energy. What we really need is a MoE model with a high number of small experts, so that we can selectively quantize/prune/offload the experts to optimize the model for our specific use cases and VRAM/RAM level.
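If anyone wants to see why the gains depend on the text: here's a toy sketch of the greedy accept/reject loop. draft_next/target_next are made-up stand-ins for real model calls, and a real implementation verifies all K positions in one batched target pass instead of a loop.

```python
# Toy version of speculative decoding (greedy variant): a cheap draft model
# guesses k tokens ahead, the big target model verifies the run.
def speculative_step(context, draft_next, target_next, k=4):
    ctx = list(context)
    proposal = []
    for _ in range(k):                 # cheap: k small-model calls
        tok = draft_next(ctx)
        proposal.append(tok)
        ctx.append(tok)

    accepted, ctx = [], list(context)
    for tok in proposal:               # conceptually ONE batched target pass
        expected = target_next(ctx)
        if expected != tok:            # first disagreement: take the target's
            accepted.append(expected)  # token and throw the rest away
            break
        accepted.append(tok)
        ctx.append(tok)
    return accepted

# Predictable text (code, boilerplate) -> draft agrees often -> several tokens
# per big-model pass. Creative RP -> low agreement -> ~1 token per pass,
# i.e. no speedup, which is the point being made above.
```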
>>
>>102876770
Yes. However that doesn't matter since it is currently being trained as fast as it possibly can regardless.
>>
>>102876754
>10B model
WooooW
>>
Are they just compressing the internet over and over?
>>
>>102876808
I'm still waiting for a Mixture of a Million Experts implementation. The most promising thing about that sort of model is the promise of how much easier it will be to add knowledge by training a few small experts instead of needing to finetune the entire thing.
>>
So what's better, Rocinante v1.1 or v2g?
>>
>>102876845
they are compressing reasoning
>>
they do not have reasoning
>>
it's a very lossy quant of reasoning
>>
>>102876851
Yeah, that'd be an interesting experiment, though my guess is that the experts still need to be at least a little large for certain types of intelligence to be retained. I think 3B is probably the minimum. 30x3B could be an interesting balance and would fit into high-end consumer desktop setups.
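Quick napkin math on that shape, using the numbers from this post (Q4 ~ 0.5 bytes/param; shared attention/router weights ignored for simplicity):

```python
# Rough memory math for a hypothetical 30x3B MoE with top-2 routing.
n_experts, expert_params, top_k = 30, 3e9, 2
gb = lambda p: p * 0.5 / 1e9                # Q4 ~ 0.5 bytes/param

total_gb    = gb(n_experts * expert_params) # ~45 GB for all experts
active_gb   = gb(top_k * expert_params)     # ~3 GB actually touched per token
hot_experts = int(24 / gb(expert_params))   # ~16 experts fit hot on a 24 GB card

print(total_gb, active_gb, hot_experts)
# Keep the frequently-routed experts in VRAM, page the rest from system RAM:
# that's the selective quantize/prune/offload upside for desktop setups.
```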
>>
are we back?
>>
File: 1707513676127685.png (141 KB, 1152x984)
rightoid incel grok is dumber than Llama 3.1 70B, kek
>>
>>102877049
I was just about to post that lol.
But yeah, also, they have Nemotron too now.
https://livebench.ai
But it's lower than Grok 2. I haven't tested it to verify any of the claims of it being good or it being shit for RP though.
>>
>>102877118
>it's lower than Grok 2
Well, training on obvious /pol/shit data is bad for any LLM after all.
>>
>>102877049
>Grok mini that close to grok
Super huge models are a meme
>>
>>102858009
>>102860004
I can't believe I share the general with retards that don't understand what a reward model for RLHF is, going as far as trying to use it in koboldcpp... It's over...
>>
>>102877049
Good job cropping out the meaning of those numbers.
>>
>>102864868
>He doesn't know
>>102864965
>>
>>102877181
Bigger number always means better so why do you even need the meaning?
>>
>>102877181
It's just livebench.
>>
>>102877049
Based. Safe and diverse LLMs are our strength.
>>
>>102877208
>Give it a shot
>Model becomes smarter, response length matches the previous responses instead of droning and it focuses more on details

What the fuck?
>>
>>102877469
That was my idea and I can tell you... it is probably placebo.
>>
>>102876583
sex
with miku
>>
>>102877571
this, so much this
>>
File: MiquLlama2.png (1.05 MB, 896x1152)
>>102876583
miqu proves llama2 was peak
>>
>>102876808
>>102876851
arctic snowflake
>>
Why is INTELLECT-1 going with 10B anyways, instead of the more common 7B or 13B?
>>
>>102878310
Snowflake sucks tho
>>
>local: dead
>cloud: https://youtu.be/EwzhumHX_TE
Will Meta Spirit save us?
>>
>>102879322
how long until I can practice my Japanese with my local miku?
>>
>>102879322
>Will Meta Spirit save us?
No.
>>
>Nemotron IQ2-XS
Is this better than Nemo or Small at Q8 for a vramlet? 2.2 t/s at 10k context.
>>
>>102879555
no
>>
>>102879555
I have only tried IQ2_S, and that is far better than Nemo or Small. IQ2_S should fit within 24GB VRAM with 8k context, as long as you have the 4-bit cache and flash attention enabled.
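For anyone who wants to sanity-check the fit, rough napkin math (assuming ~2.5 bits/weight for IQ2_S and Llama 3.1 70B geometry, since Nemotron is a 3.1 finetune; real file sizes vary a bit):

```python
# Napkin math: does 70B at IQ2_S + 8k context fit in 24 GB?
params = 70.6e9
weights_gb = params * 2.5 / 8 / 1e9                      # ~22 GB

layers, kv_heads, head_dim, ctx = 80, 8, 128, 8192
elems = 2 * layers * kv_heads * head_dim * ctx           # K and V caches
kv_fp16_gb = elems * 2.0 / 1e9                           # ~2.7 GB at fp16
kv_q4_gb   = elems * 0.5 / 1e9                           # ~0.7 GB with 4-bit cache

print(f"weights ~{weights_gb:.1f} GB, kv fp16 ~{kv_fp16_gb:.1f} GB, kv q4 ~{kv_q4_gb:.1f} GB")
# ~22 + 0.7 GB plus compute buffers: it only squeezes in with the 4-bit cache on.
```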
>>
>>102879322
Damn, it's really over
>>
>>102876653
We haven't even had winter yet.
>>
>>102879322
Maybe in 30 years we will get something similar.
>>
>>102879322
currently doing this with local

cope
>>
>>102879322
I've always stuck with local so far but if they find a way to make customized hentai ASMR with dynamic plap plap and dick sucking sound effects, that will be the day I become a cloudshitter
>>
>>102879777
The cooming winter is atomic and eternal.
>>
File: 1729020850496814.jpg (1.05 MB, 1170x1052)
>>102876754
neat
>>
>>102876754
why are they doing this? no one will give a fuck about a 10b model
>>
>>102879904
>>my dependency clusterfuck with fake-multimodality and huge latency is better and totally delivers 1 to 1 results!
Yawn.
>>
>>102879940
you gotta start small
>>
>>102879940
consider it's also groundwork for contributing p2p experts towards a single output
>>
>>102879968
Well, they're free to waste their time, but I ain't donating no compute until they start working on a large bitnet model with an uncensored dataset.
>>
>>102879940
Proof that it really works and can produce large scale models?
The question is what happens after that. Does /lmg/ finally gather the uncensored and IP infringing dataset they always wanted and train that model?
>>
>>102879988
>I ain't donating no compute until they start working on a large bitnet model with an uncensored dataset.
this, if we can now train big models, let's go for fucking bitnet and settle the debate once and for all
>>
>>102880004
>Does /lmg/ finally gather the uncensored and IP infringing dataset they always wanted and train that model?
but everyone participating in that training will know it'll be an IP infringing dataset no?
>>
>>102880020
So? I doubt anyone able and willing to participate will care about that. Weights will be banned from Hugging Face, but torrents are better anyway.
>>
>>102880035
>I doubt anyone able and willing to participate will care about that.
the authorities will care about that
>>
>>102880051
are the 'thorities gonna kick my door down and confiscate my 3090 for participating?
>>
/lmg/ will never gather around and train their own decentralized model. /lmg/ might have been able to do that a year or two ago but not today's /lmg/.
>>
>>102880055
they're gonna cancel the training process and nuke the site down, how new are you?
>>
>>102880058
You already got pygmalion
>>
>>102880062
how are they gonna cancel a decentralized training? how new are you?
>>
>>102880058
>/lmg/ actually decides to make a model
>anons actually are ready to contribute
>drummer and Undi are the ones to set up the model
>....
Yes anon. /lmg/ shouldn't make a model.
>>
>>102880072
they can nuke the site that serves as a bridge for everyone during the decentralized training
>>
>>102880058
Fuck you.

>>102880062
If the website is an issue then someone can just host it in a different country and they can't do anything about it.
>>
>>102880072
"Decentralized" only means the training isn't happening in a centralized manner; whatever orchestrates the machines is very centralized
>>
>>102876583
does anyone know of any local programming-competent models whose instruct mode can be used as a programming assistant/tutor? something that is similar to, if not better than, Copilot?

I've tried using the 13B Echidna model, which crumbles when asked basic assembly language questions.
>>
>>102880089
>but whatever orchestrates the machines is very centralized
>>102880085
>site that serves as a bridge for everyone
sounds like a design flaw
>>
>>102880099
>sounds like a design flaw
it's not like they have much of a choice innit? what other solution could there be? to participate in that training you need to know where it is, it must be public, and public means problems because the authorities can see perfectly well what you're doing, this shit is DOA
>>
>>102880097
DeepSeek V2.5 is the best you can get right now.
>>
>>102880058
That's possible, but /lmg/ will make this model lame and gay to own le chuds or something, you know, the usual /g/ stuff.
>>
>>102880148
We really need to move to /sci/ or something.
>>
>>102880142
I thought qwen 32B beat it
>>
>>102879940
>why are they doing this?
"The longer term goal: scale to open source AGI models, continuously improving upon the best open source models in the world."
>>
>>102880118
>this shit is DOA
Not necessarily. Huge corpos know that this shit isn't a competitor in the slightest. And officially none of the corpos are interested in making a cooming model. I think it is highly likely that both corpos and governments will ignore this because it is a waste of time to bust it.
>>
>>102880172
>Huge corpos know that this shit isn't a competitor in the slightest.
and when we do get competitive, what'll happen? the government will pull the plug on that shit
>>
>>102880148
There should be no emphasis to the left or the right. The priority should be a model with no "safeguards". One that will do everything within its power to do exactly what the User wants.
>>
>>102880181
>when we'll be competitive
Pretty sure the first thing that will happen is a cooming model so, oh well.
>>
>>102880185
Exactly, but you can't be sure with today's /g/ or anons, some of them will try to do bad shit out of spite.
>>
>>102880170
>scale to open source AGI models,
>AGI
definitely DOA
>>
>>102876583
Threadly reminder that Nemotron 70B is crazy good for RP
>>
File: Checkpoints.png (109 KB, 575x618)
>>102880204
I am sure it will eventually be made to work even if bad actors try to sabotage it. Worst comes to worst, you can restore a checkpoint from before the point it got all fucked up.
>>
File: 7-ending-feelsgirl.png (714 KB, 559x559)
>>102880242
>>
>>102880185
>There should be no emphasis to the left or the right. The priority should be a model with no "safeguards".
that's a right-wing thing anon, the left loves censorship and hates freedom of speech
>>
File: 1710607146663634.png (66 KB, 221x214)
>>102880351(You)
>>
>>102880118
Yes, DOA just like piracy and torrent sites.
>>
>>102880406
>piracy and torrent sites.
except that you're not sending your gpu power to those sites
>>
>>102880406
For anyone under the age of 30, they definitely are.
>>
>>102876754
>Python
>>
>>102880351
No, le fucking American politics are pro-censorship no matter what. In every country the left was the Soviet Union, China, or North Korea. Zoomers don't know that in the 80s the censorship situation was literally the same, but since comics, video games, and anime were niche it didn't matter much; now that they're popular, they get all the censorship.
>>
File: 1612817185254.png (572 KB, 740x911)
Anons, is the ayyymd plus winblows combo still ass when it comes to localshit? I see that koboldcpp has rocm support now but does it work nice and fast like cuda?
>>
>>102880441
>No, le fucking american politics no matter what are pro censorship
nuh uh, look at how censored the sites are when they're run by leftists (facebook, reddit, old twitter) compared to sites run by right-wingers (new twitter, 4chan...)
>>
It looks like there are some here who want to shut down the idea of distributed model training for some reason, with very lame excuses. Interesting.
>>
>>102880471
>lame excuses
tell that to the governments who shut down every good idea, they're the ones to blame, they don't want us to get the power anon
>>
>>102880471
Image model next,

Sex bot crowdfunding factory next

the people start getting what they want next with just even a crumb of organisation

>No you can't do that nooo! we can't blackmail and throw all of you off the rooftops at the same time noooo!
>>
>>102880491
they just need to destroy one person's life to scare everyone else, it's really not that hard
>>
>>102880142
>>102880167
could I run either of these models on my 4090? HF uses an A100 for the benchmark but I assume it's not necessary to run these guys, right?
>>
>>102880468
>twitter
I get banned just for saying kike to your people on Twitter. Kike is the new nigger.
Also, there are no corpos on the left, that is against its nature. The zoomer left is what the Soviet Union was, or fascism: one is far left, the other is center left. The right only has two choices: conservative right (which only exists in Western countries with a monarch, like the UK or my country Spain, or in Arab theocratic regimes) or liberal right (your country and the rest of the Jews).
>>
>>102880185
I think the best way would be to filter out leftism because there is a shitton of it everywhere and also remove all burger influence (right wing included). Everything else should be sane.
>>
>>102880527
>Also, there are not corpos in the left, that is anti nature
there's the social left and the economic left, I was obviously talking about the social left; zucc is a social leftist but economically right
>>
>>102880441
>both are equally bad
Just clump everything together. Classic leftist playbook. Same as how they think dating a 17 y/o and a 12 y/o are the same. Because faggots groom 12 y/o and want to call you out on the hypocrisy of dating a 17 y/o because they're the same thing apparently
>>
>>102879322
Local has been dead for a while now...
https://www.udio.com/songs/veDnd1Gx2BhkB4AsNdNSbh
https://www.udio.com/songs/dFTtQHCqxbHLyArX4vx6QZ
https://www.udio.com/songs/iu1381RxvjfzWznGHeVecV

When are we gonna get this locally? Never? We at least have some decent TTS and could be close to local advanced voice, but don't have anything even remotely resembling this technology...
>>
>>102880527
https://tower.jp/item/4492014
https://www.amazon.co.jp/kike-KOTORI/dp/B071XZ2YDY
>>
>>102880536
>there's social left and economical left, I was obviously talking about social left
What the fuck, only liberals believe that. Alfred Marshall's theory is the only one that pushed this narrative about politics and the economy, but it's false, zoomer: economics, politics, culture, even religion are bound by the same structure, the state and regime. You cannot have two brains thinking conflicting thoughts, or two regimes in one. You're literally proposing a schizo state.
>>102880581
>both are equally bad
No, I said America doesn't have two sides, and what is happening is an American problem; so the enemy of nature and reason is ultimately the American order. Anglos are right next to the Jews.
>>
>>102880674
>you cannot have two brains thinking conflicting thought, or two regimes in one. You literally propose an schizo state.
tell that to those leftists, they are retarded enough to go that path yeah
>>
>>102876588
learn how 2 quote
>>
>>102880662
And this is why Asia in general is better than western goyim, holy based.
>>
>>102880692
go back, tourist
>>
File: bad news!.jpg (36 KB, 390x345)
>>102880694
聴け!逃げろう! (Listen! Run away!)
>>
>>102880509
asian hemisphere is fearless of western posturing fortunately
>>
https://speechbot.github.io/spiritlm/
Why are all these examples cut so horribly? https://speechbot.github.io/spiritlm/audio/expressive/T2S_sad_second_speaker.wav
>>
>>102880814
>feb 2024
Yeah outdated af
>>
>>102880853
Fuck yeah, jeb 2024.
>>
so any better local models than mistral large quanted for 48gb vram now?
>>
>>102880522
Look up quantization and GGUFs, you'll want to look at a Q4 GGUF file which you can run with kobold/llama/your choice of backend
>>
How brain-damaged is IQ3_M for 70b exactly? Getting desperate here, bros.
>>
Why does Nemotron 70b keep stopping at random intervals? This is the case even with the default llama3 instruct template and neutralized samplers. But other than that it's pretty good. Feels different, kinda like Command R+.
>>
>>102881080
I've gone down to IQ3XS on Mistral Large. That was enough for writing chat but for knowledge tasks I don't trust it.

For Llama 3 70B kinds of models, they seem sensible on non-obscure knowledge tasks at Q5 and Q6.
>>
>>102881111
Skill issue, probably. I don't have this issue.
>>
>>102876754
I wonder what those graphs will look like once the training is done. The loss and perplexity numbers have gone down since this image was posted, the tokens per second have gone up, and the Inner LR plot has remained exactly the same other than getting a little bit longer.
>>
Best model for 16gb RAM + 1060 6GB for roleplay purposes?
Right now I'm using https://huggingface.co/bartowski/Mistral-Small-22B-ArliAI-RPMax-v1.1-GGUF at Q4_K_S, but really want to try and max out this machine. Tried the same one at Q5_K_L and it was unusable. Thanks in advance.
>>
>>102881184
Ministral (after it gets proper gguf support)
>>
>>102881184
oh man anon i feel your pain but i think its time to start thinking about upgrading your hardware a bit
>>
did sillytavern devs kill themselves or something?
>>
>his voice a mix of boredom and intrigue
NO YOU RANCID PIECE OF SHIT, THERE IS NO MIX OF INTRIGUE. THAT UNDERCUTS THE ENTIRE PREMISE YOU RETARDED MACHINE. WHO THE FUCK SAID CLAUDE WRITES WELL?
>>
>>102881401
* ServiceTensor devs
>>
>>102881488
Shit in - shit out, sweaty :)
>>
>>102881357
Yeah, it sucks and I'm well aware, upgrading is just not in the cards right now.
>>
>>102881357
>Upgrading
>In this economy
Who do you think he is, Mr. Moneybags?
>>
>>102881493
* ServiceTesnor devs
>>
>>102880698
shut up newfag
>>
https://x.com/rohanpaul_ai/status/1847277918243754156
nvidia's nGPT
https://arxiv.org/abs/2410.01131
>>
>>102881926
Cool, now since it's so efficient and cost effective to train, let's see an 8B of it.
>>
>still no Ministral 100B
It's so over.
>>
File: file.png (13 KB, 548x141)
>>102881401
yes it's just ghosts merging contributions now
>>
>>102880606
That would be stealing from hardworking artists like Taylor Swift
>>
>>102882180
>no bitnet
>not a single application of novel techniques
>we're still using the same pure transformerslop since 2 years ago
>the only difference is that everything got filtered and benchmaxxed to hell and back
This whole field is an nvidia grift
>>
Very excited for Intellect-1 to finish so the decentralized training meme can finally die. Still very confused about what you shills think the benefits are, as if anyone capable of hosting this infrastructure is going to let you train "Most Horniest Chudded Out Based Hitler 70B" on their platform.
>>
Shills? For what? The resulting model, if it gets made, isn't going to be sold to anyone. And it's certainly not going to be a 70B when 10B takes such a long time to train already. At most I imagine that /lmg/ would do a continued pretrain of 8B or something, and probably for not very many tokens.
>>
>>102882493
It was at 11% ten hours ago and is at 11.80% right now. Assuming we gain about 1% every 12 hours, that is 2% every day. That means the model will finish training in roughly 44 days. I wouldn't consider that too much time for a 10B model.
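Same estimate as a two-line calculation, using the two readings above:

```python
# ETA from two progress samples (numbers from this thread).
p0, p1, hours = 11.00, 11.80, 10       # 11.00% -> 11.80% in ten hours
rate = (p1 - p0) / hours * 24          # ~1.9%/day measured, ~2%/day rounded
print(f"{rate:.2f}%/day, ~{(100 - p1) / rate:.0f} days left")
# ~46 days at the measured rate, ~44 if you round up to 2%/day.
```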
>>
>>102882180
There will be an opus tier cohere model soon
>>
>>102882530
I'm accounting for the compute /lmg/ specifically has, which I imagine does not include people with free access to H100's.
>>
>>102882493
>imagine that /lmg/ would
Continue imagining, retard, I hate idealistic faggots like you.
>Shills? For what?
PrimeIntellect is a cloud compute provider. The only way you can contribute to Intellect-1 is to rent an H100 from them
>>
>>102882565
>Continue imagining, retard, I hate idealistic faggots like you.
Did you not read what that guy wrote, they were clearly being pessimistic you illiterate fuck.
>At most I imagine that /lmg/ would do a continued pretrain of 8B or something, and probably for not very many tokens.
>>
File: Own compute.png (13 KB, 617x203)
>>102882565
For now, my best guess would be that they want to see how the first model trains on the decentralized network and whether anything breaks while they are doing it. Or they could release it before, or never release it, who knows? Point is, current indicators show that it will be a possibility in the future.
>>
File: image.png (55 KB, 822x822)
Now that the dust has settled, what went so terribly wrong?
>>
File: bitnet 3b nala test.png (6 KB, 401x64)
nala test for the native 3b bitnet model.
I mean.. it's about what you would expect for a 3B model. Except it's less than 1 GB.
>>
>>102882633
They didn't consult the machine spirit properly, instead they just put a Ouija board on each server used to train it and called it a day.
>>
>>102882591
It's ok anon, you didn't have to respond to that post for me. We all know it was nonsensical.
>>
>>102882633
Nothing really. Their main goal was to just get good PR for continuing to release old research while the newer research is held back because of muh politics and muh stocks (which are the basic issues behind the "muh safety" excuse that lies on the surface; none of these corpos give a shit about safety if they could get away with it).
>>
>>102876583
Retard-kun here,

What's the best model for me to play with if I want something to occasionally bounce ideas off of and help me edit writing, but also be able to do some steamy ERP?

I have a 3090/24GB VRAM

Just name me a few models and I'll go start doing some research on how to run these. I only have experience with SD/image generation so far so this will be new to me but I wanna see what models you guys would use with a GPU as strong as mine since I know there's a lot of poorfags/third worlders here.
>>
>>102882434
>as if anyone capable of hosting this infrastructure is going to let you train "Most Horniest Chudded Out Based Hitler 70B" on their platform.
Isn't the whole idea that it's not centralised?
I thought the biggest problem is faggots agreeing on a dataset.
Training will obviously only get faster. If a couple thousand coomers with a 3090 for a month is enough, we could easily do it.

Even if a central spot with a website is needed, that's not even illegal.
People who host much more compromising stuff exist right now.
It seems like whenever decentralized training is discussed, a guy like this pops up. There are only benefits to this first test run. Isn't Johannes also making training code for llama.cpp? How can you not be excited. Very weird.
>>
>>102882838
Old command-r
>>
>>102882838
>24GB
There are no good models for that. But if you really want to try something, you could start with Mistral Small with the Q8_0 quant. Use Kobold.cpp hooked up with SillyTavern. There is some setup you will have to do and it will take time to learn as you go. Get some cards from /aicg/ and chub like this and go to town with your steamy ERP. https://characterhub.org/characters/boner/daisy-2c9fdbb8
>>
File: ZjHJsHH.png (616 KB, 618x1057)
>>102882633
Ever since llama1 it's been downhill for Meta.
Every one of the following models was worse than the previous one.
Smarter, but also much more cucked and less creative. Google and the chinks make better assistant models anyway.
Imagine if we didn't have Mistral, for example. It would look bleak with only Meta.

I wonder if they'll finally release a model that supports image output with the Janus 1.3B pressure.
Looks like shit, but better than cutting it out.
>>
>>102882979
Mistral really did come out of left field back in the day and cause a big splash. The more competition there is the better things will be. I am glad they exist.
>>
>>102882540
Are you an insider or just speculating?
>>
>>102882979
There is no pressure from that tiny shit model so I don't think so. And it's less pressure but more justification/precedent that they're waiting for. They very much want to release these models but can't, just like how OpenAI can't really let 4o just be totally unfiltered.

Anyway, it's fine we have a range of models for different purposes. On one hand we have the (relatively) uncensored Mistral, then Llama is more censored, then Gemma (although it only goes up to 27B and only up to 8k context), and then Qwen. And even Qwen is not too bad with a JB, you just have to know how to prompt it, use samplers, the {{random}} function, etc.
>>
>>102882540
You keep saying that. It keeps not happening.
>>
ah weekend hours so the thread goes to dogshit
>>
Who is the 3B Ministral even marketed for? It would make sense for Largestral to be proprietary, but who is gonna pay for a 3B model when there are 3B Llama and Qwen?
>>
>>102883136
The French work in mysterious ways.
>>
>>102882540
No one believes this now after the recent slopped+retarded update to CR+

Cohere fell off
>>
>>102882540
They're not getting Opus by training on the same scale AI slop that OAI trains on
>>
>>102883212
S-surely they have seen that their update was a sloppy job and they'll do better with the next model.
>>
>>102883154
The worst part of it was Cohere's CEO bragging about how people liked their models because they used human data for training, and then completely flushing their only advantage by using GPTslop. I still can't believe it happened. What were they even thinking?
>>
>>102883136
if I could run it on a shitty android phone I'd maybe use it for when I'm camping.
>>
>>102883270
>human data
I don't consider pinoys and nigerians humans
>>
>>102883270
Yeah it's dumb as fuck. Corpos not realizing what customers actually liked about their product and ruining it out of ignorance is really common, but as you said, in this case Cohere actually DID know. But did it anyway.
>>
>>102883299
>customers
>>
>>102883317
>>customers
Yeah, they must have lost them with their shitty sloptune. Why use cohere when there are plenty of other options with long context?
>>
>>102882838
nbeerbower/Stella-mistral-nemo-12B-v2
At q8, 16k context. Hook it up to SD and bust fat nuts. Reason - because I said so.
>>
>>102882225
>merging st after death
that's just called hell, anon
>>
im so tired of nemo
feels like i keep talking to the same characters over and over again
it doesn't follow prose either
dumb as hell too
considering divorce
>>
>>102883964
ask it to write in a low-quality style in the last assistant prefix
>>
What's the best local model for coherent erotica and worldbuilding that I can fit on a 3090 Ti with only 24GB VRAM and 32GB RAM?
>>
>>102884219
no
>>
>>102884219
Gemmasutra-2b
(If you want something good, buy more RAM, it's cheap. Get 128GB(for 300 USD), you can then run Mistral-Large for SFW and Behemoth-123b for NSFW.)
>>
>>102883154
CR+ was already slop compared to base CR.
>>
>>102884291
Based gatekeeper.
>>
Why are so many models tweaked over anime shit? Why don't you guys like normal smut?
>>
>>102884291
>128GB(for 300 USD)
Try 1000GBP, I've got trident 3600mhz RAM sticks I had to import because they didn't sell them here.
>>
>>102884454
What kind of normal smut do you mean? Harlequin Romance novels? Ao3 fics?
>>
>>102883270
I know nothing but maybe the engineers insisted on more data and therefore used even more synthetic slop?
>>
>>102884459
>GBP
>British pound
Why the hell is ram not being sold to the British?
>>
>>102884500
they dared contest germany's rule of europe
>>
>>102883252
Unfortunately, benchmark numbers are easier to point out than style.
>>
>>102884500
The gold trident 3600mhz model isn't (or wasn't, I haven't checked recently) sold here for some fucking reason.
>>
>>102884507
Fool, Europe has always belonged to the Franks!
>>
>>102884481
literotica
>>
>>102884509
Benchmark numbers SUCKED THOUGH.
>>
File: 1699690546523446.jpg (79 KB, 894x745)
>>102884514
>gold trident
I'm going to assume you had no other choice because this shit is horrendous to look at
>>
>>102884459
Oh damn, I feel bad for you. I didn't know that Anglostan was doing so bad economically, besides being the most cucked country in Europe.
>>
>>102884754
Get a whole rig like that and you'll get bling kino
>>
>>102884754
my RAM goes inside a plain steel case and I want zero of the price of it going into looks or especially RGB lighting
>>
I'm getting higher t/s on IQ4_XS than on IQ3_M, while having more layers dedicated to the GPU on IQ3. What the fuck is going on here?
Aside from the layers, all the other settings are the same. Getting 1.4 t/s on IQ3 vs 1.8 t/s on IQ4_XS.
>>
I am currently using a 3.0bpw exl2 quant of Mistral Large for both RP and general use, it fits in 48GB VRAM.

Anything better?
>>
>>102884988
4-bit data has less unpacking overhead than 3-bit because you can efficiently pack two 4-bit values into a single 8-bit value.
So even though the amount of data is larger, you need fewer memory accesses.
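Concretely (toy illustration, not llama.cpp's actual kernels):

```python
# Two 4-bit values fit exactly in one byte; unpacking is one mask and one shift.
def pack4(lo, hi):
    return ((hi & 0xF) << 4) | (lo & 0xF)

def unpack4(byte):
    return byte & 0xF, byte >> 4

assert unpack4(pack4(5, 12)) == (5, 12)

# 3-bit values don't divide 8 evenly, so they straddle byte boundaries:
# reading one weight can touch two bytes and needs extra shifts/masks,
# which is the unpack overhead being described above.
```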
>>
what are some good uncensored models under 13B that are good for ERP? is mistral nemo good or are there any better models?
>>
>>102885120
>under 13b
>good
lmao
>>
File: redditproxyslop.png (35 KB, 784x790)
>looking through old datasets for something to use as a framework for a synthetic single turn Q and A dataset.
>open up a random json from unpacked leaked undislop dataset
>notice something peculiar (picrel)
>the slop is coming from the human side of the conversation.
There are probably text renderings of some of these proxy logs in the pile, but try-hard redditors who put on their Sunday best to ERP with a fucking bot are the ones who put the slop in there.
>>
>>102885171
>who put on their sunday best
but it's saturday
>>
>>102884754
I like the tacky 90s gold aesthetic as opposed to the RGB lighting alone. If I could make my entire computer look like gold plastic 90s shit I would. The rest of my computer is just all black.

>I'm going to assume you had no other choice
Mostly. That C16, 3600MHz, 2x16GB kit was, at the time, one of the few RAM kits available. The tacky gold look was only £10-40 extra.
>>
>>102885171
Ever considered this is just someone using impersonate to have the LLM write for him?
>>
>>102885197
>someone finds evidence that reddit ruined the internet
>immediately jump into the fray to play devil's advocate.
Gee I wonder where this guy came from.
>>
File: trained on fineweb.png (51 KB, 493x533)
>>102885171
Did you just realize this? The training data for the foundation models is all slop too.
>>
>>102885211
Well I always knew it came from human writing, I just assumed it was all from novels, not a bunch of reddit gooners LARPing as Charles Dickens for their waifu.
>>
>>102885210
Why are you baiting by pretending to be retarded?
>>
>>102885171
anon that's 100% llm-generated on both sides
>>
Timeline adds up too. Initial commit for the OAI key proxy is December 2022.
All the most un-de-sloppable models have knowledge cutoffs a few months beyond that (affording time for the logs to end up on the internet).
Key proxy locusts unironically ruined LLMs for everyone.
>>
boring schizo larp.
>>
Are there actually any RP models that don't have characters be cumdumpsters by default? I feel like whenever I do something like walk up to a girl and slap her ass, the result is something like she gets mad, then there's a line break and it goes DESPITE HER ANGER A THRILL RAN THROUGH HER yadda yadda. It never simply ends with the character being mad as she should be.
Hell, I could probably whip out my dick and cum all over her face without any warning and it would end the reply with something like "Despite the disgust and humiliation, a part of her felt excitement at the taboo nature of such an act"
>>
>>102876754
Very cool, we are a couple of improvements away from being able to do this with 3090s. The main problem is efficiently updating the weights across a thousand GPUs in a short amount of time given a mediocre internet connection.
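The bottleneck in numbers, assuming a naive full fp16 weight sync over a 100 Mbps uplink (both figures just illustrative):

```python
# How long does one naive full-weight sync take on consumer internet?
params = 10e9                          # INTELLECT-1-sized model
payload_bits = params * 2 * 8          # fp16 weights -> 20 GB -> 160 Gbit
uplink_bps = 100e6                     # a decent 100 Mbps home uplink
print(f"{payload_bits / uplink_bps / 60:.0f} minutes per sync")  # ~27 min
# Hence what decentralized runs actually do: sync outer steps rarely and
# compress/quantize what gets sent instead of shipping raw weights.
```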
>>
>>102885355
largstral
mistral small maybe
But sometimes you've got to just delete the offending line, write a reply you'd expect so the model can continue it or modify the card
>>
>>102885355
write your character cards better, if the model has no info it is probably just going to try to guess whatever you want the result to be
>>
>>102885396
Yeah if I rewrite the character's reaction it usually sets a precedent for how she should act in the future so there's that at least. Maybe I should upgrade my setup for better largestral speeds, right now it's too slow to really bother with.
>>
>>102885479
Fair enough but also it feels silly that I have to write that the character doesn't like it if a stranger cums on her face. Though I suppose I should try to make that happen by describing her overall personality in better detail. I'll try to improve my cards, obvious but good suggestion.
>>
>>102885519
it shows a wider weakness of LLMs, these models always try to please the user. i.e. it's really hard to get an LLM to give actual criticism because it will always say your retarded ideas are amazing, or at worst, interesting.
>>
>>102885519
It's all about the character description.
I remember Nemo once had a girl shove a guy to the ground and kick him for groping her. Scenarios like that don't always end the way you expect them to.
>>
>>102885538
Why is this?
>>
>>102885606
well that's above my pay grade, training stuff
>>
>>102884733
That's why there is an incentive to destroy the model with slop.
>>
>>102885355
No, all the datasets that sloptuners use are full of smut. "RP" models are actually just smut models. Use official instruct tunes.
>>
Does everyone here literally have 128GB RAM?
>>
>>102885683
I only have 96GB VRAM
>>
>>102885683
I have 64gb of RAM + 8gb of VRAM.
I do need to overclock my RAM.
>>
>>102885683
256 GB RAM, 96GB VRAM
>>
Any work going into Plamo-100b? Morbid curiosity prevails, and I want to see what translation ability it has outside of the limited demo site.
>>
File: Untitled.png (137 KB, 1266x1224)
>>102885355
>>
>>102886014
Kobold kiddies prob all crosseyed with layout like this.
>>
>>102886014
Yep Rocinante is better
>>
>>102885840
The anticipation for Plamo is leaving me utterly electrified as well, the prospect of a translation model sending shivers down my spine.
>>
>>102885683
64 GB RAM + 24 GB VRAM. A comfy setting before I used LLMs.
>>
>>102885683
64gb ram and 64gb vram so technically yeah
>>
>>102885683
32gb ddr4 ram and 8gb vram here
>>
>>102886363
So what model do you use?
>>
>>102886389
Probably nemo right?
>>
>>102886389
Rocinante 12b v2g q4_k_m right now, but I change up frequently
>>
>>102886411
Is it good?
>>
Fimbulvetr-10.7B-v1-Q8_0
mixtral-8x7b-instruct-v0.1.Q5_0
mythomax-l2-13b.Q5_0
Toppy-M-7B.q8_0

These are all the models I've run so far on my 3090ti 24GB VRAM (and 32GB RAM).

Are there better models for coherent erotica and worldbuilding?
>>
>>102886439
base miqu
>>
>>102886439
Rocinante-12B-v1.1
>>
>>102886417
I'm enjoying it, had some fun RPs.
I'm not as demanding as a lot of others here though.
>>
>>102886439
I'll also vote for >>102886452 although I haven't tried >>102886411
In your place, actually, I'd probably try >>102886451 or some mistral-small fine tune.
>>
When is Arthur going to release the fp16 Miqu weights?
>>
>model-00001-of-00005.safetensors

Wait, do I have to merge these files now? What happened to single ggufs?
>>
>>102886485
look up (model) gguf instead
>>
>>102886485
bruh
>>
>>102886485
hello sir
I understand you are having some trouble with the models
>>
>>102886485
>safetensors
>gguf
Those are different things.
>>
>>102886485
sir...
>>
>>102886485
ggufs haven't been a thing since llama.cpp died
>>
>>102886529
>>102886526
>>102886525
>>102886514
>>102886513
>>102886507
Motherfuckers I've been gone for a while. I still have koboldcpp.exe
>>
>>102886540
Koboldcpp still works, and it never ran .safetensors as far as I'm aware.
>>
>>102886540
Sir nobody uses koboldcpp or llamacpp or any other meme backend anymore
we are all sitting on servicetesnor foss backend
>>
>>102886557
>not running a custom backend made entirely in RPGMaker MV
NGMI
>>
>>102886540
>not using bitnet.cpp by now
You're not gonna make it.
>>
>>102886540
.exe
>>
>>102886540
kobold died with llama.cpp
>>
>>102886585
the best model for that is a 3B, and not a current-gen-tier 3B either.
>>
File: Untitled.png (870 KB, 1908x862)
>>102886540
go back to its model card and click pic related to see the GGUF quants
also grab this
https://github.com/LostRuins/koboldcpp/releases/tag/v1.76
>>
>>102886584
>>102886585
>>102886588
>>102886597
So what's the thing to run a model these days?
>>
>>102886613
vllm
>>
>>102886610
nta but thanks.
>>
File: 1711129569471474.webm (3.68 MB, 1768x1202)
>>102886585
Totally lossless quality btw
>>
>>102886658
>>>>>>3B
>>
>>102886613
llama.cpp
>>
>>102886677
But they just said...
>>
>>102885355
Seems like just adding "despite" to banned strings works to counteract a solid amount of that but obviously just banning this word across the board might cause problems elsewhere. Haven't noticed any yet in my RPs though.
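For anyone curious what banned strings actually do, it's roughly this backtrack-and-resample idea (a simplified sketch, not any frontend's real code; sample_next is a hypothetical sampler that avoids the tokens in `blocked`):

```python
# Simplified "banned strings": detect the banned string as soon as it
# completes, then resample that step with the offending token blocked.
def generate(sample_next, banned=("despite",), max_steps=200):
    tokens, blocked = [], set()
    for _ in range(max_steps):
        tok = sample_next(tokens, blocked)
        text = "".join(tokens) + tok
        if any(b in text.lower() for b in banned):
            blocked.add(tok)       # block this continuation and retry the step
            continue
        tokens.append(tok)
        blocked = set()            # blocks only apply to the position they hit
    return "".join(tokens)
```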
>>
>>102886687
>>102886677
>bu-bu-bu-bu
I'm still running my models by banging two rocks together
>>
>>102886014
Thanks anon, Rocinante is actually good, best model I've tried in a while.
I've tried a lot of shit, even the 40Bs, but this one actually feels like it's responding to instructions properly.
Would encourage anyone to get the Q8 or full precision if you have the VRAM.
>>
File: 1636941718706.gif (3.75 MB, 520x293)
Any decent ERP models these days for 24GB VRAM bros?

Or is it still that Cydonia/Mistral Small or Roichante (or whatever the fuck)?

I like checking in every week or so to see if somethings popped up
>>
Where are the layerskip implementations for existing models? I need layerskip nemo immediately.
>>
>>102886810
wtf is layerskip
>>
>>102886810
Get implementing it!
>>
>>102886820
When the model skips layers
>>
>>102886807
>I like checking in every week or so to see if somethings popped up
You and about 20 other casuals who drop in weekly to ask to get spoonfed about this exact configuration.
>>
>>102886834
yes, I don't really care to converse with schizos as a daily thing.

It's the weekend and i'm in the mood to coom
>>
>>102886834
>most common consumer setup
>keeps asking to be spoonfed sota model
hm
it's almost like someone out there wants to keep tabs on the competition
>>
>>102886790
rocinante
>>
How do I run chatgpt for sexy on rtx2060? (keep in mind, strong rtx and not weak gtx gpu)
>>
>>102886790
>Would encourage anyone to get the Q8 or full precision if you have the vram.
Why? Q8 vs Q4 is the same in actual use
>>
File: 1713120762051584.png (1.04 MB, 4000x4000)
>>102886105
>>102886411
>>102886452
>>102886790
>>102886849
>>
>>102886790
>>102886849
what's so good about this model btw?

Why is it better than mistral small or just cydonia? It struck me as your typical Nemo finetune from Drummer but less smart than Cydonia (the same overly NSFW horny model).

Genuinely asking as I wanna know if my brief experience was just shitty cards/prompts but I see it mentioned everywhere
>>
>>102883280
I don't consider anglos and kikes humans either, but models have to tend toward neutral, with enough data to speak as every possible human, even the retarded ones.
>>
Which distro is best for local models Just Werking? Fed up of everything breaking when I update.
>>
I feel bad for localkeks
>>
>>102886514
I want to speak to your manager.
>>
>>102887071
not them but I didn't like the tune, seems too unhinged, like the data it's trained on is all over the place rather than focused. try lyra4 gutenberg
>>
>>102887172
>berg
>>
>>102887071
i didn't really like the original rocinante
rocinante 12b v2g (aka UnslopNemo-12B-v3) is great though.
the model doesn't aggressively try to fuck you, doesn't shiver often, and follows along with the story pretty well.
>>
>>102886879
I know you're responding to a shitpost, but still no. In my actual use asking Mixtral 8x7b Instruct to write a short story adding the sentence "Use vivid and descriptive language" to the instructions dramatically and consistently changed the way it wrote at Q8. (I'm not saying it was better or worse, I'm saying it was very obviously different.) It inconsistently changed the way it wrote at Q5, and at Q4 it was hard to distinguish from placebo. If I bought the bullshit about low quants being the same because of perplexity graphs I'd have misconceptions about what kinds of instructions had an effect.
>>
>>102887129
no need, everyone here is using claude for actual usage anyway
>>
>model bad
>limit output to 100 tokens
>model good
hm
>>
>>102887278
>generate 1 token at a time and reprocess prompt after each token is generated
>agi achieved
>>
>>102887271
The cutoff point is Q6
>>
>>102887291
AGI was already achieved internally.
>>
>>102887278
models like to try to tie things up, that's why you end up with 'as the days passed' and stuff at the end of messages, like a conclusion. the patrician way is to let it write for 300 tokens and then trim the message to where it starts to talk like that. after you get a bunch of messages like that into the context, it'll start to write more like that anyway and leave the endings of messages more open-ended
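If you don't want to trim by hand every time, it's mechanical enough to script, something like this (the wrap-up phrase list is just an example, swap in your model's own tics):

```python
import re

# Cut a reply back to just before its first "wrap-up" sentence.
WRAPUP = re.compile(
    r"\b(as the days passed|little did [^.]{0,40} know|only time would tell)\b",
    re.IGNORECASE,
)

def trim_conclusion(reply: str) -> str:
    m = WRAPUP.search(reply)
    if m:
        reply = reply[:m.start()]
    # also drop any trailing incomplete sentence
    end = max(reply.rfind("."), reply.rfind("!"), reply.rfind("?"), reply.rfind('"'))
    return reply[:end + 1] if end != -1 else reply
```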
>>
>>102887278
There was a time I felt dropping the last sentence, two sentences, or even paragraph of a reply was generally an improvement. Setting a token limit shorter than the typical reply and selecting "trim incomplete sentences" worked for me back then.
>>
Moore's law for vram when?
This is ridiculous that cards are held back by vram when those vram chips are cheap as shit.
>>
>>102887337
Not as long as Jensen has his monopoly and his cousin Lisa is helping him keep it.
>>
>>102887336
Yeah, that's what I'm doing now, happy with the results.
Is there a way to bias the (end of sentence) token somehow?
>>
File: quant iq.png (12 KB, 809x620)
>>102887312
Anon, that's wrong. q5_k_s is the smartest quant.
>>
>>102887365
Maybe the mainlanders will destroy them
>>
>>102883280
Get out.
>>
>>102887427
He's right though. Look what Nigerians have done to GPT models. Animals, all of them.
>>
>>102883270
They needed to put something out to stay relevant after Elon insisted that they couldn't mention their involvement with Column-R, which he bought off them and released as grok-2.
>>
no one wants to answer my question :(

>>102885034
>>
>playing around with the settings in kobold
>randomly decide to crank the max output up to 512
>suddenly nemo starts to spit out claude level gems

wtf is this sorcery?
>>
>>102887541
The answer is not really, unless you want a slutty whore for a language model
>>
>>102887541
Have you tried CR+?
>>
File: 8 digits wtf.jpg (194 KB, 1080x470)
>>102885034
>>102887541
I dare say Nemotron 70B is better.
>>
>>102887541
For both RP and general use, no.
>>
>>102887541
What speeds do you get?
>>
>>102887365
I'm hoping for companies like Groq to tear those chinkoids a new one.
>>
>>102887574
--Nemotron 70B: Unique prose, fun, but dumber than Largestral with logical errors:
>>102865433 >>102865448 >>102865676 >>102866355
>>
>>102887586
I've been using it for the past week and I still haven't encountered a single instance where it made a logical error. I think that anon is using meme sampler settings and blaming the model.
But even if that was the case, Nemotron 70B has such a deep understanding of RP that it's definitely worth checking out anyway.
>>
>>102887632
Still it's lacking a lot in general knowledge
>>
>>102887579
like 8 it/s, pretty fast
>>
>>102885034
bigger quant?
>>
>>102887632
I tried it briefly for some text adventure type shit, but it kept trying to add headers and add a bunch of asterisks to its responses. This happened even when continuing long sessions from other models (Mistral large). Was just using temp between .5 and 1 with min p between .01 and .03. Have you had any formatting problems?
>>
Genuinely do you think LLMs, or transformers can lead to AGI?
>>
>>102887842
no
>>
>>102887800
NTA, but I've seen similar. Continuing an ERP from another model, Nemotron will start responses with things like "**Explicit Content Warning**". Not always, but often enough to be annoying. Will also frequently want to end responses with ellipsis, like a mini-cliffhanger. All the preference RLHF seems to have heavily biased it to certain types of formatting.
>>
>>102887842
>can lead to AGI
Yes. Eventually some company will put their 50k servers to work at throwing random algorithms at the wall and something will stick.
>>
>>102887884
Haven't seen that but I do have a system prompt telling it to always remain in character.
>>
>>102887842
Nah.
The current implementations are based on the hope that statistical correlation will create reasoning as an emergent property, and that this will lead to super reasoning and superhuman iteration which will eventually achieve superhuman capabilities.
If AGI really is at all possible, there will probably be a module/block/something that's oriented towards actual thinking as a primary feature. LLMs might be a component in the architecture/system, but we will need something new that's not simply language based. Hell, maybe even the idea of tokenizing shit will go out the window, who knows.
What that will look like? I have no idea, otherwise I'd be a billionaire, lol.
>>
>>102887842
Since many people disagree upon the definition of AGI, you must first define which you are asking about.
>>
>>102887800
Yes, the model definitely has formatting issues. It seems to always write with asterisks even when the past messages weren't like that. I also noticed that the model likes to use ellipsis a lot.
>>
>>102887884
I never got this "explicit content warning" even when playing loli scenarios, you probably have something weird in your system prompt.
>>
>>102887800
>it kept trying to add headers and add a bunch of asterisks to its responses.
arenamaxxed to the very core
>>
>>102887842
I think it will cause your agility stat to decrease from sitting at your computer too much
>>
>>102887924
I don't think you need to, we are both conscious humans and know what that means without being able to define it. "We'll know it when we see it."
>>
>>102887949
A conscious human often ignores what he sees, definitions are necessary for stuff like this.
>>
>>102887949
A man who does not know what he means is not speaking as a conscious human. The animal mind feels. The rational mind reasons.
>>
File: ifever.png (236 KB, 885x1057)
>>102883280
Well then actually don't use my tunes, if ever.
>>
>>102888187
>The animal mind feels. The rational mind reasons.
Farts are meant to be huffed.
>>
Anyone have Emily's gallery before it got deleted?
>>
Which is best?
nemo
405B
mistral large
midnight miku 103B
<other big boi>
>>
>>102888658
StableLM-7B
>>
>>102888658
Starling-7B
>>
>>102888694
>>102888694
>>102888694
>>
>>102888658
nemo or mistral large
>>
>>102877046
hi. i am back. have not been around for a few months on this board.
>>
>>102889114
welcome back we missed you
>>
>>102876754
>/1T tokens
lol, lmao


