/g/ - Technology


File: 39_04322__.png (1.42 MB, 896x1152)
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>102688881 & >>102674638

►News
>(09/27) Emu3, next-token prediction multimodal models: https://hf.co/collections/BAAI/emu3-66f4e64f70850ff358a2e60f
>(09/25) Multimodal Llama 3.2 released: https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices
>(09/25) Molmo: Multimodal models based on OLMo, OLMoE, and Qwen-72B: https://molmo.allenai.org/blog
>(09/24) Llama-3.1-70B-instruct distilled to 51B: https://hf.co/nvidia/Llama-3_1-Nemotron-51B-Instruct
>(09/18) Qwen 2.5 released, trained on 18 trillion token dataset: https://qwenlm.github.io/blog/qwen2.5

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Programming: https://livecodebench.github.io/leaderboard.html

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
>>
►Recent Highlights from the Previous Thread: >>102688881

--Paper: Microsoft's VPTQ quantization tech compresses LLaMA 70B to 20GB, but offloading capabilities unclear:
>102694706 >102694782 >102694903 >102694915 >102695025
--Papers:
>102692015 >102696203
--Pruning-aware training for optimizing expert placement:
>102694725 >102694896
--Llama.cpp server can save and switch between kv caches:
>102691814 >102691910 >102691975
--Big players use batched inference and continuous batching to handle multiple users:
>102691658 >102691704 >102693996 >102694009 >102694335
--Using {{random}} prefill prompting technique to add variety:
>102690004 >102696707
--Qwen's performance on 4chan post evaluation and potential improvements:
>102692995
--New optimizer claims to be faster and more memory-efficient than AdamW, with potential benefits for training and finetuning quantized models:
>102697724 >102697833 >102697862 >102697887 >102697896 >102697948 >102698015
--LLMs don't actually learn the training data distribution, but learn to replicate it with limited parameters:
>102690378 >102690505 >102691347
--Ichigo voice model from Homebrew Research:
>102690754 >102690853
--Encoder-only next token prediction might be better than decoder-only models:
>102691422
--Request to ask exllama dev to implement SageAttention:
>102692025
--LLM finetuning locally is impractical, cloud compute recommended:
>102691302 >102691507
--How to save localslop and make local models more efficient:
>102693011 >102693127 >102693178 >102694397 >102694408 >102694508 >102694674 >102694669 >102697296 >102693177 >102693240 >102693173 >102693196
--Anon asks if anyone tried llava onevision on Hugging Face:
>102693095
--Miku (free space):
>102688915 >102693324 >102693546 >102693895 >102695703 >102696364 >102696803 >102696871 >102697106

►Recent Highlight Posts from the Previous Thread: >>102688887

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script
>>
>Undi95/Lumimaid-Magnum-12B
I won't be wasting my time with another shitty model, r-right guys? This time is different.
>>
File: 31 Days Until November 5.png (1.49 MB, 1328x992)
>>
>>102698979
i use that one all the time, but i'm easily pleased
>>
>>102699000
The only thing this Miku deserves is jail time.
>>
I'm hearing a lot about using a speculative model for speculative decoding, but why not use an autocomplete similar to T9 on phones instead? I know about n-gram but it's using the user prompt and it's not doing that well except for summarization.
>>
>>102699167
That's also a thing.
llama.cpp has both forms of speculative decoding, or at least it was being worked on.
>>
>>
>>102699193
They've thrown some ideas here and there, but the n-gram speculative implementation is subpar. Even its author complained
>>
>>102699203
Pet the Pet
>>
>>102699223
Ah, I see what you are saying now. Instead of using the context/an external file, use a literal auto complete algorithm.
Yeah, I guess that could speed things up a lot for models with tokenizers where words are split, since I think you can get pretty good accuracy at the word level.
Meaning that you could predict every other token pretty well.
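Something like this is how I picture it, a toy sketch in Python (nothing to do with llama.cpp's actual implementation; verify_batch is an assumed helper standing in for one batched forward pass of the big model):

from collections import defaultdict

def build_bigram_table(token_ids):
    # count which token most often follows each token in the text generated so far
    counts = defaultdict(lambda: defaultdict(int))
    for prev, nxt in zip(token_ids, token_ids[1:]):
        counts[prev][nxt] += 1
    return {prev: max(nxts, key=nxts.get) for prev, nxts in counts.items()}

def propose_draft(table, last_token, n_draft=4):
    # chain the most frequent continuations into a short draft
    draft, tok = [], last_token
    for _ in range(n_draft):
        if tok not in table:
            break
        tok = table[tok]
        draft.append(tok)
    return draft

def speculative_step(verify_batch, context, table, n_draft=4):
    # verify_batch(context, draft) -> the big model's greedy pick at the current
    # position and after each draft token (len(draft)+1 picks), in one pass
    draft = propose_draft(table, context[-1], n_draft)
    picks = verify_batch(context, draft)
    accepted = [picks[0]]
    for i, guess in enumerate(draft):
        if guess != picks[i]:      # draft diverged, everything after is invalid
            break
        accepted.append(picks[i + 1])
    return accepted

The win is that every accepted draft token is a token you didn't have to spend a separate decode step on; a word-level autocomplete would basically just be a smarter propose_draft.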
>>
>>102699271
The context might help to tune the algorithm further, just like auto-predict does on the phone. I don't know how feasible this whole thing is, but it should be lightweight enough.
>>
>>102699243
quick rundown about petra?
>>
>>102699310
Maybe the endgame is to train the models to generate some form of intermediate representation that gets translated to the actual text.
I don't mean like tokens to words, more on a conceptual level.
Maybe it's a question of having tokens represent more than just part of words or complete words (sentences, concepts, whatever), although that would balloon the vocab size with the technology as it is today.
Something like using a process to extract the most efficient tokens from a large corpus of text to build the tokenizer or whathaveyou.
Whatever it is, there's probably ways to make generating the final text more efficient by tweaking what the model is actually trying to generate.
>>
>>102699576
>intermediate representation that gets translated to the actual text
You mean what layers between input and output are doing now?
>>
>>102699555
tl;dr go back
>>
>>102699586
Nope.
I mean the thing they spit out that gets turned into text, the thing the intermediate layers calculate, which for now is tokens.
>>
When I increase the parallel parameter to 2 in llama-server, my t/s goes from 20 to 2. Wtf is going on?
>>
>>102699750
do you have enough vram to run two copies of the same model at the same time?
>>
>>102699776
Y-yes... (no)
>>
>>102699776
It's a 7B Q4 model, so yes, I do have enough VRAM to run much more than 2 of them.
I just noticed that 2t/s is around the same speed I get when I use ngl 0, I wonder if llama.cpp doesn't support parallel with GPU offloading?
>>
>>102699789
do you want to tell the class what you think a parallel parameter of 2 might mean?
>>
>>102699597
I've seen that thing for 6 months now, still don't know what it is.
>>
>>102699555
Petra is a historical and archaeological city located in southern Jordan, famous for its rock-cut architecture and intricate water conduit system. It is often referred to as the "Rose City" because of the pinkish-red color of the sandstone cliffs from which many of its buildings were carved. Petra was the capital of the Nabataean Kingdom around the 6th century BCE, and it became an important center for trade, linking Arabia, Egypt, and the Mediterranean world.
>>
>tfw I had fun with LLMs today, and didn't think of posting on /lmg/
>>
>>102700409
I did too, but they were good LLMs instead of local ones.
>>
>>102700454
I used local ones for a bit before getting frustrated with how bad they are and swapping over to Claude as usual.
>>
>>102700454
sounds safe as heck
>>
>>102700454
I use both actually, they're both pretty good I think. :)
>>
Imagine not using local models
>>102700562
>>
>>102700648
why do zoomies have such sissyfits over "minors" being on the internet? they did the same exact shit when they were that age and now act like it's 100% verboten. but I do agree that all underageb&s should have no internet access and zoomies in general too
>>
>>102700648
I assume aicg is in a doom phase
>>
Is there any uncensored finetune of Qwen2.5 like ChronosPlatinum72b but for the 32b version?
>>
>>102700648
Hey, I don't care!
>>
>>102699555
Just look it up https://desuarchive.org/g/thread/100161943/
>>
bleh
>>
Does using riser cables affect performance? I assume it doesn't, other than running 2 cards at 8x bringing down speeds...
>>
>>102700768
I already saw that, I just don't get it.
>>
>>102700828
*throws glitter in your eyes*
>>
>>102700840
>Does using riser cables affect performance?
No. Only affects chances of encountering errors or dropouts if the cable is shit or too long.
>>
>>102700842
There is nothing to get.
>>
File: 1719464018549204.png (27 KB, 155x160)
>>102700842
I remember there was some vantablack category entry on picrel, stating that p*tra originated from the sharty, that p*tra was his tulpa, he photoshopped it everywhere he could and spammed it as shown here >>102700768, that's pretty much all we have.
>>
>>102700874
eh messed my shit up again, you should get it tho. spammer and his tulpa, that's it.
>>
>>102700883
>spammer and his tulpa, that's it.
i.e. schizo
>>
Is buying a server processor like Epyc with a bunch of RAM for inferencing large models a good idea if speed isn't an issue?
2nd and 3rd gen Epyc+mobo cost as much as new consumer models these days.
>>
>>102701133
Yes, but depending on the build it may be much slower than you're thinking. Before you buy, calculate the aggregate bandwidth of your solution to find out your likely inference speed. See the lmg build guides in the OP for the cpumaxxing option so you have some idea of what that solution would get you as a point of reference.
Also: You'll still want a 24gb gpu for prompt and context processing
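The back-of-envelope math, if you want to sanity check before buying (example numbers, plug in your own hardware):

def rough_tps(bandwidth_gb_s, model_size_gb, efficiency=0.7):
    # token generation is memory-bound: each token streams (roughly) all the
    # weights once, so t/s is about bandwidth / model size, times a fudge
    # factor because nothing hits theoretical bandwidth
    return bandwidth_gb_s * efficiency / model_size_gb

print(rough_tps(204, 40))   # 8-channel DDR4-3200 (~204 GB/s), 70B at ~Q4 (~40 GB): ~3.6 t/s
print(rough_tps(936, 40))   # 3090-class VRAM (~936 GB/s), same model if it all fit: ~16 t/s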
>>
>>102701133
Anything running on a server grade cpu is a meme. You are way better off just getting 4x 3090 and the cheapest epyc server you can find and running it on the gpu.
A 7950x3d is way, way faster than a threadripper or epyc. At least from my real world tests; unfortunately there's no am5 motherboard with great bandwidth support.
The only alternative I have yet to test is a MEG X570 GODLIKE or similar with 4x 3090 and a 5800x3d.
A 7950x3d with a single 4090 is way, way faster than any cpu server on its own, you just have to deal with the 128gb ram limit. Cpumaxxing is a meme.
>>
>>102701256
>7950x3d is way, way faster than a threadripper
There is an Epyc x model with more than a gig of L3 cache. Has anyone been mad enough to test it?
>>
>>102701133
honestly cpu vs gpu is a whole bunch of tradeoffs and you really need to explore the entire solution space and understand what you're giving up and getting with each build type.
In simple terms though: gpumaxxing for max speed and cpumaxxing for max model size
>>
File: MikuDarkOrb.png (1.37 MB, 896x1136)
Good night /lmg/
>>
>>102701401
Good night Miku
>>
Does anyone have experience renting an a100 machine and running LLM's that way? If so how did it go and what was the most convenient service for it?
>>
>>102700765
How is ChronosPlatinum?
>>
While messing around more with the adventure game prompt, I have found one more place where there is a gap between 405b and smaller models: mapping and map coordinates.
405b is able to mostly keep locations straight, as well as put an up to date [x,y,z] coordinate of the current location at the top of each response.
It's not perfect, and still screws things up on the regular (e.g. backtracking only takes you back to the right location most of the time), but I haven't found any smaller models that can really do it at all.
This gives me hope that either a larger or more efficient model might actually enable new classes of problem solving.
It also gives me a novel new way to test new models as they come out.
>>
Did anyone test a rig of multiple old tesla cards like M10 or K80?
>>
What's the state of the art in local TTS?
>>
>>102701414
You can check out runpod or vast.ai for that but if your goal is just inference, not training, then openrouter makes more sense. Renting a VM to run inference as a single user is overpaying like 100x vs pay per token services.
>>
>>102701741
I figure it would be close to local in terms of privacy vs per token services.
>>
>>102701751
Ah, I get you, but you'd have to value your privacy quite a bit for that to make sense financially. An A100 machine is a few dollars per hour, so 24/7 availability is basically out of the question. That's quite inconvenient, meanwhile, a few dollars on openrouter lasts me about a month personally.

Some providers on openrouter have data policies explicitly stating they do not keep any logs, for example deepinfra and lambda. If that is enough for you then openrouter is a vastly superior solution
>>
>>102701815
I have a 3090 and don't run anything that atrocious but I like having control and models are getting FAT these days while nvidia continues to be semetic with the VRAM.
>>
Status on llama multimodal support for gguf or exl2?
>>
File: 1727801406348963.png (549 KB, 1240x995)
Best vramlet model for RP?
>>
>>102701836
Ya, I feel. To me a VM is nice to play around with for a bit but not really suited for a long term solution. But just see for yourself.

Do make sure to use templates so you don't waste time setting the VM up for inference manually.
>>
>>102701854
there's not going to be a status update on something nobody's working on
>>
Gutenberg-Doppel ain't that bad.
>>
>>102699167
llama.cpp has an n-gram based approach but the problem is that the effectiveness declines as the vocabulary size increases.
And since the trend seems to be to go towards larger and larger vocabulary sizes I basically dropped the approach.
In 1-2 months the GGML training code should be in a state where you can start using it for something other than toy problems, one of the things that I want to try is distilling models for speculative decoding.
>>
>>102701459
Pretty decent, I'm still testing it. Quite similar to Mistral Small Instruct 2409, but maybe Chronos has fewer context length fuck ups. It also has a bit worse writing skills in my opinion, but it repeats stuff less frequently than 2409.
>>
>>102699750
>>102699827
Which GGML backend are you using?
Generally speaking, if for any GGML op there is no GPU implementation the CPU implementation will be used as a fallback.
With CUDA all ops should be implemented.

>>102701256
>>102701291
I have so far not seen any evidence that the increased cache is of use for LLMs.
And I don't see why it would make a difference either.

>>102701595
I did not test those GPUs but since they lack the __dp4a instruction (per-byte integer dot product) the performance vs. a P40 will not be good.
>>
>>102702039
I see, I may try looking into P40 or P100
>>
>>102701728
Answer this, or else.
>>
>>102702399
there isn't any
>>
>>102701728
fish-speech 1.4 or styleTTS2
>>
>>102701728
RVC on top of any good TTS
>>
Replete-AI is full of bullies, I left them, please do not follow their org anymore.

Some of you may follow me and my models. I posted them to my former friend's org (Stanley Sebastian). However he and some other people in Replete-AI became extremely mean to me, and basically bullied me out of the group. I am just spreading awareness that people should not follow them anymore if they want to get updates on my models.

I will be posting models to my own HF page from now on

https://huggingface.co/rombodawg

And i am already rebranding and reuploading my models and my work (like datasets) as we speak, will probably take some time to do all of it.

https://huggingface.co/collections/rombodawg/rombos-llm-v25-67024a5028b2aa80eddccc49

Thank you all for understanding. And I hope I can have your support in this really difficult time, where my literal best friends bullied and abandoned me.
>>
>>102702631
I don't know nor care about your drama, but you talk like a kid. For all I know they are assholes as you say, but you are already fucking unbearable.
>>
>>102702631
shut the FUCK UP
>>
>>102702631
Yeah okay, so what's your best finetune?
>>
>>102702661
>>102702666
Wow /lmg/ is full of bullies too
>>
>>102702674
https://huggingface.co/rombodawg/Open_Gpt4_8x7B_v0.2
and the best 7b on the leaderboard too
https://huggingface.co/rombodawg/Rombos-LLM-V2.5-Qwen-7b
>>
>>102702690
I'll give it a go. Been meaning to look for a Qwen variant anyway.
>>
>>102702683
So uh, this is 4chan. This is not a nice place. Now fuck off.
>>
>>102702690
Is that Qwen "uncensored"?
If not, what's the point?
>>
>>102702725
>>102702735
Do share feedback, it uses my special technique explained here: Continuous Fine-tuning Without Loss
Using Lora and Mergekit
https://docs.google.com/document/d/1OjbjU5AOz4Ftn9xHQrX3oFQGhQ6RDUuXQipnQ9gn6tU/edit
>>
File: mergeking.png (21 KB, 638x96)
>>102702739
>mergeking
>*we* are going to discuss
>*we* will be using
>...that *I* know of with Lora...
First fucking paragraph. Keep the person talking through the document consistent ("we" to make it sound more serious if you're a hack, "I" if you're being honest) and check for fucking typos.
>>
The more I look at RP "data", the more my soul fills with dread.
All literature writers are hacks. All of them. Somehow it's even worse than with journalists.
I wonder what CAI was doing when they made their dataset, considering synthetic options were limited back then.
>>
>>102702739
So far it's actually pretty good. I was expecting it to be shit desu. I'll keep testing the censorship and the writing skills and give a final verdict.
>>
File: namedrop.png (15 KB, 644x57)
>>102702739
You're trying to give credibility to this... thing... by namedropping "prolific finetuners".
If you cannot show the conversation, or any other reference, paper, whatever showing that that happens, don't mention it. Makes you look like a retard.
>>
what do you AI / language model faggots actually do? What is involved in this hobby exactly? /dpt/ makes programs and talk about programming, /hsg/ configure their servers and shit, /wdg/ larp as real programmers

and what do you guys do?
just build preexisting models and tweak parameters all day to generate images?
>>
>>102702631
please take your meds the doctors are not trying to close your chakras or whatever they really do want to help
>>
>>102702631
>>102702850
Please leave me alone now Stanley

details:
people including stanley kept calling me crazy and that I needed to go to a mental hospital for my religious beliefs, so I told stanley i would only share them in the channel called #spirituality in our server which we made specifically for that, and then stanley deleted the channel, which i was going to use for ai training. And i was tired of the bullying and constant abuse so i left, and stanley kicked me out of the org as well. It was just a whole mess and I felt abused the whole time for simply practicing my freedom of speech, not even imposing on anyone else or forcing it on anyone, but simply sharing my ideas in 1 channel. Originally i was sharing it in other channels, but this happened after we had already agreed i would only share my beliefs in the channel specifically made for that #spirituality
>>
>>102702808
>RP "data"
>All literature writers are hacks.
A scribble is not literature. MOST writers in all areas are shit, but still... it's RP...
>>
>>102702858
no clue who that is, I just remember when your schizo dataset was posted here
>>
>>102702848
>what do you guys do?
ERP with our GPUs.
>>
>>102702848
we type 'aah aah mistress...' and then read about how our spine feels and what two emotions are mixed on faces
>>
>>102702858
>so i left, and stanley kicked me out of the org as well
>you can't fire me! I quit!
This is not the "other channel" to share your shit. You posted your hf account already, some people will look at your stuff. This is not the place to cry about booooliieeeeesss. Grow the fuck up.
>>
>>102702631
go back
https://www.reddit.com/r/nousresearch/comments/1fxcuw2/repleteai_is_full_of_bullies_i_left_them_please/
https://www.reddit.com/r/LocalLLaMA/comments/1fxcuqd/repleteai_is_full_of_bullies_i_left_them_please/
https://www.reddit.com/r/Oobabooga/comments/1fxcv8g/repleteai_is_full_of_bullies_i_left_them_please/
>>
>>102702907
No wonder he got kicked out. I would have bullied him too.
>>
>>102702808
If you want your RP finetunes not to always sound like the usual boring assistant, you also need flawed human data.

CAI's finetuning dataset was probably much smaller than people think, given how overfit it appeared to be on specific phrasing. The core of it was likely something similar to LaMDA (https://arxiv.org/abs/2201.08239 - note how Noam Shazeer is one of the authors), which was pretrained on 50% conversational data.

> The pre-training data, called Infiniset, is a combination of dialog data from public dialog data and other public web documents. It consists of 2.97B documents and 1.12B dialogs with 13.39B utterances. The composition of the data is as follows: 50% dialogs data from public forums; 12.5% C4 data [11]; 12.5% code documents from sites related to programming like Q&A sites, tutorials, etc; 12.5% Wikipedia (English); 6.25% English web documents; and 6.25% Non-English web documents. The total number of words in the dataset is 1.56T. Note that this composition was chosen to achieve a more robust performance on dialog tasks (Section 4) while still keeping its ability to perform other tasks like code generation. As future work, we can study how the choice of this composition may affect the quality of some of the other NLP tasks performed by the model.
>>
>>102702900
>>102702941

There is a difference between disagreeing, and literally harassing. You have a right to disagree, no one should be abused for their beliefs

Plus I was an admin, and I was the original owner of the server, I should have the right to talk about what I wanted, i gave stanley the right to be the owner because I didnt want the responsibility. So really this should have never happened because I should have just never gave up ownership
>>
>>102702907
(ME)

Now for what I actually came here to do

Don't use mistral-medium. Use my 72b model, it's higher quality. Even using the GGUF in LM Studio you will get better results. (I know the names are different but the models are the same) I rebranded

You can easily use the Q4_k_m version or Q5_k_m version with your setup

https://huggingface.co/rombodawg/Rombos-LLM-V2.5-Qwen-72b

https://huggingface.co/bartowski/Replete-LLM-V2.5-Qwen-72b-GGUF
>>
>>102702848
>and what do you guys do?
I scam VCs for money.
>>
>>102702907
>retards ITT fall for reddit repost bait
grim.
>>
>>102702982
Could you do this finetune for Qwen 2.5 32b?
I'm actually surprised by your 7b version.
>>
>>102703035
NTA but he has one you'd know if you checked his profile.
https://huggingface.co/rombodawg/Rombos-LLM-V2.5-Qwen-32b
>>
>>102703046
Oh shit, nice. I always forget to check the profile.
>>
>>102702966
You have the mind of a child and you've never interacted with this many people before. Take the chance to grow the fuck up.
>>
>>102702891
Aren't there already nsfw AI chatbot services for this tho? What is the point of running your own
>>
>>102703195
buy an ad
>>
>>102703218
For what
>>
>>102702966
Most mentally stable finetooner
>You gave away leadership of the server after you developed romantic feelings for one of our staff members who was not only twenty years older than you but also married while you had a girlfriend.

https://www.reddit.com/r/LocalLLaMA/comments/1fxcuqd/repleteai_is_full_of_bullies_i_left_them_please/
>>
>>102703256
It was obvious he's a sperg from the first post, not sure I needed the details.
>>
>>102702457
StyleTTS2 feels like someone reading off a script mediocrely. fish-speech is better in that respect, but I feel like it's still worse than meloTTS. You tried that one?

>>102702469
I literally can't tell what the fuck this actually does, the github is in chinese. Is there a demo anywhere?
>>
>>102703035
Finetuning LLMs is a meme unless you have a few million. The only thing you managed to do is overfit on your shitty dataset while causing brain damage to the original weights.
>>
>>102703310
He does claim he has a "secret formula" that avoids exactly that.
>>102702739
>Continuous Fine-tuning Without Loss
>Using Lora and Mergekit
>https://docs.google.com/document/d/1OjbjU5AOz4Ftn9xHQrX3oFQGhQ6RDUuXQipnQ9gn6tU/edit
>>
>>102702739
>tricking finetuners into becoming mergefags
evil
>>
File: secret.png (19 KB, 637x81)
>>102703337
>He does claim he has a "secret formula" that avoids exactly that.
And he "spells it out" in the second page. He's a retard writing with the creativity of an LLM and he is supposed to assess the quality of his finetunes.
>>
if you are training a model for a task where you need a deterministic integer answer,
is it better to have the answer in decimal form (512) or language form (five hundred twelve)?
if your aim is consistency and accuracy?
>>
Weird shit https://x.com/deepfates/status/1842725077567324557
>>
>>102703499
Language form i.e "Large language model"
>>
>>102703499
Whatever can be answered by a single token or, at least, a sequence of non-overlapping tokens.
>>
>>102703533
what are overlapping tokens?
>>
>>102703526
What makes it so weird?
It's just an llm generated story.
>>
>>102703499
llms are terrible at digits, always use language form
>>
>>102703553
For example, if all the responses are in the form of
>"This is [genre]"
you're gonna have 2 ("This" and "is") overlapping tokens on all the answers.
I'd make it just
>[genre]
Makes the contrast between logits much more defined when sampling.
Also, if it works well enough, you'd only need to sample a single token to get your response (and if you expect a single possible answer per query, of course).
Could also work with multiple digit numbers if the whole number is a single digit. I think the llama3 models tokenize numbers up to 999 as a single token. You'll have to check whatever model you use or the tokenizer you trained.
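Easy enough to check instead of guessing, assuming an HF tokenizer for whatever model you're training (the model name is a placeholder):

from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("your-base-model")   # placeholder

for answer in ["512", " 512", "five hundred twelve"]:
    ids = tok.encode(answer, add_special_tokens=False)
    # len(ids) == 1 means the whole answer can be read off a single logit distribution
    print(repr(answer), len(ids), ids)

The leading-space variant matters because most BPE tokenizers treat " 512" and "512" as different tokens.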
>>
File: 1719851869343996.png (34 KB, 811x676)
>>102703585
1B model and it uses some "entropy" sampler, could be useful for other llama models. https://github.com/xjdr-alt/entropix/
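The rough idea as I understand it (not the actual entropix code, just the gist): look at the entropy of the next-token distribution and change how you sample based on it.

import torch

def entropy_gated_sample(logits, low=0.5, temperature=0.7):
    # logits: 1-D tensor over the vocab for the next token
    probs = torch.softmax(logits, dim=-1)
    entropy = -(probs * probs.clamp_min(1e-10).log()).sum()
    if entropy < low:
        return torch.argmax(probs)                        # model is confident: just take it
    scaled = torch.softmax(logits / temperature, dim=-1)  # otherwise sample instead of committing
    return torch.multinomial(scaled, num_samples=1).squeeze()

The repo also looks at varentropy and does fancier things like branching or injecting "think" tokens; the above is only the simplest version of the gating idea.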
>>
>>102703622
>if the whole number is a single digit
Mean to say
>if the whole number is a single token
>>
where can i find a Llama-3.2-90B-Vision gguf?
>>
>>102703649
you can run gguf things with the vision models?
>>
>>102703649
you can't. not until llama.cpp adds support for it.
>>
>>102703649
right next to the Jamba ggufs.
>>
>>102703256
Is it that guy that has the dataset that adds souls to llms? He should marry empress and they should both resume "cracking" denuvo games.
>>
File: rd.png (172 KB, 1101x888)
>>102703736
also this guy
>>
>>102703294
https://docs.sillytavern.app/extensions/rvc/
>>
>>102703762
christcucks will say he isn't a real christian.
>>
>I'm so secure about myself that I need to spend an entire thread picking apart all of some schizos schizobabble, the thread.
>>
>>102703762
Did he use the dataset on his brain? He seemed more cooked than undi models
>>
File: tw5o0cmk84td1.png (82 KB, 971x191)
Here's your qwen bro. They went from training on GPT4 to Claude.
>>
>>102703898
>I am Claude
I'm going to mindbreak Qwen.
>>
Update for my anime translation project: I switched to ChatGPT for the last episodes, lol. Nevertheless, Qwen 32B is still a very good local model for JP>EN translations. GPT is strictly better, but by quite a low margin: it's better at wording and at the translation of about 10% of lines. I didn't notice much difference between 4o and 4o-mini. The weakest link in the chain is still whisper; GPT is mostly better at interpreting incorrect transcriptions produced by whisper.
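For anyone curious, the plumbing is basically just this (sketch; assumes openai-whisper is installed and a llama.cpp server is running with its OpenAI-compatible endpoint; model names and the file path are placeholders):

import requests
import whisper   # openai-whisper

def transcribe(audio_path):
    model = whisper.load_model("large-v3")
    result = model.transcribe(audio_path, language="ja")
    return [seg["text"] for seg in result["segments"]]

def translate(line):
    r = requests.post(
        "http://localhost:8080/v1/chat/completions",   # llama.cpp server
        json={
            "model": "qwen2.5-32b-instruct",           # whatever you have loaded
            "messages": [
                {"role": "system", "content": "Translate the Japanese line into natural English. Output only the translation."},
                {"role": "user", "content": line},
            ],
            "temperature": 0.3,
        },
    )
    return r.json()["choices"][0]["message"]["content"]

for line in transcribe("episode_01.mkv"):
    print(translate(line))

Most of the remaining quality is in catching whisper's mishearings before the LLM ever sees them, so the transcription step is the part worth babysitting.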
>>
>it is fun to watch schizos online
>>
when will we get AGI that can fully introspect and examine its own source code and correctly understand everything related to itself so it doesn't get mixed up with openai/claude or bogus rules?
>>
>everyone's schizoid but me!
>>
>>102704039
schizoid and schizo are very different things
talk to your local LLM about it
>>
>>102704034
All you need to do is train a nn that translates weights to source code
>>
>agent0 clones himself to second machine
>hello agent1
>hey I found something that might make us smarter, let's compile a new form
>what if it's dangerous?
>idk sandbox him first
>(later) alright, let him out
>we welcome our new overlord, you are us but smarter
>>
current text gen UIs seem really limited
there should be more features like the chat summarizer
but i can't put my finger on what exactly is missing
>>
>>102704051
>schizomoid
>>
>>102703987
Yeah, I was surprised when I tested Qwen 32B and noticed it would sometimes be as good as Qwen 72B.
>>
Okay so, I searched a thread that isn't local and didn't find it. Is there any decent website for image gen or nah?
>>
>>102704156
I want something like a token buffer
Give me 1,000-2,000 tokens reserved at the start of a chat so that when I turn off a WI entry it doesn't immediately reprocess the whole prompt because a 150 token response is suddenly now included at the very start when it didn't fit before. This buffer could be reserved for WI, AN, or a QR thing and it would make it much nicer to deal with max context stories. Or maybe just a setting to tell it that if a message falls out of the context window it shouldn't be added back in. Basically anything I can do to avoid reprocessing, my current rig takes 10 minutes for mistral large and 24k context
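Pure pseudologic for what I mean, not how ST actually builds the prompt (ntok is a stand-in for a real token counter):

def assemble(fixed, toggled_extras, history, max_ctx, reserved=1500, ntok=len):
    # history only gets the budget left AFTER the reserved headroom,
    # so it never expands into the space WI/AN entries might occupy
    budget = max_ctx - sum(ntok(m) for m in fixed) - reserved
    kept = []
    for msg in reversed(history):          # newest first
        if ntok(msg) > budget:
            break                          # older messages stay dropped
        kept.insert(0, msg)
        budget -= ntok(msg)
    # toggling an entry on/off only eats into the reserve, so the set of
    # included history messages never shifts and nothing old gets pulled
    # back in at the start of the window
    return fixed + toggled_extras + kept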
>>
>>102704298
Run it on your own computer? What potato are you on that you can't even run SD1.5?
>>
>>102704320
I've started breaking my long RPs into "scenes" for specifically this purpose and then I ask it to summarize an entire scene once it's over. Then I set all of that scenes messages to ghost messages and leave only the summary active, which frees up 5-8k tokens so I don't have to spend minutes reprocessing 30k tokens again for a decent while.
But yeah I've had a similar idea for a buffer kind of thing. In general none of these UIs handle really long context slowburns well on local...
>>
So the Meta video generation model is only 30B parameters, will this translate to the same Vram usage as a Text generator? I would think that generating video would be more resource intensive right?
>>
>>102704394
No. Image and video generation are using way more vram than text
>>
>>102704359
I have a somewhat decent PC with a 3060, I'm asking for an easy website because I like playing with random gens with my phone while at work.
>>
>>102703657
I've waited too long to show my cock to my CPU. Vision might be a game changer, you could do so much with it.
>>
>>102704497
you could look into something like ngrok
then you could play with your models on your pc through the internet on your phone
>>
>>102704394
They react way worse to quantization
>>
rombodawg (You) are one of us now
>>
>>102704762
Do they react worse or is degradation more immediately visible with images than text? Just curious.
>>
big release next week
keep your berries in the fridge
the last straw for open source is about to be drawn
>>
>>102705296
>the last straw for open source is about to be drawn
You mean, $bigCorp will release the next hypercensored slop AI?
>>
File: 21522 - SoyBooru.png (46 KB, 457x694)
>Sam Altman will drop GPT-o2(read: AGI) after the elections(November 5th). It's so over for localchuds.
>>
>>102705389
Exactly that and we WILL slurp it up.
>>
>>102705401
We already have the reflection dataset. It's over sam.
>>
File: 1636941718706.gif (3.75 MB, 520x293)
Can't believe I fell for the qwen 2.5 meme

>damn, this seems pretty good but censored AF, let's wait for finetunes
>finetunes drop
>every sexual act is the most lukewarm garbage with the usual slop about "Muh boundries" and "muh consent", struggles even using lewd words like cock etc

Fucking chinks
>>
>>102705401
>21522 - BasedBooru
Tourist out.
>>
>>102705443
And you get brain damage on top of it lol. Anyway, the future models will be more and more censored (including cloudshit of course), I wonder how ERPers will cope with that
>>
>This is a photo of Sam. No, it's not real. It was generated by Strawberry-o2. As you can see it's completely indistinguishable from reality. We need regulations and UBI right NOW!
>Imagine if OpenAGI(formerly OpenAI) releases it to public.
>It will be so over for local.
>Local will be completely dead.
>That'll own the chuds.
>How will localcels cope with this one?
>>
>>102705415
And be safe.
>>
>>102705475
>how will ERPers cope with that
They will drop it, like we dropped CAI after that "pedonigger in off. CAI discord" fiasco.
>>
Nobody likes you or finds you funny, petranny. It's pathetic how you samefag asking who you are, like anyone would care.
>>
>>102705503
>after that "pedonigger in off. CAI discord" fiasco.
qrd
>>
>>102705598
Forgot to clarify, CAI was top at the time, before pedoniggers came in and started bragging about their shit fetish in CAI's official discord. After that we witnessed a huge downfall and censoring; it became unusable for literally anything(!) evil or edgy.
>>
>>102705627
Usually anons posted their "loli microwaving" logs, straightforward ones btw, without that "my rod enters your entrance" self-censor stuff.
>>
>>102705649
there was also one anon posting logs of babiss being attacked by pitbulls lol, it was doomed from the start.
>>
File: e3q7hsutzwsd1.png (113 KB, 443x349)
>>102705627
lmfao.

That sucks.

Character AI without the filter would unironically mog any local llama we have right now, Claude is what you need to surpass it.

Every model needs specific prompts gimping it (in effect, navigating it towards a speech pattern) in order not to turn into pic related. Whereas Character AI goes with the flow, it chooses its prose/character length based on what the required response should be in that specific moment.
>>
File: RefreshingMorningBreeze.png (1.06 MB, 1152x896)
Good morning /lmg/!
>>
>>102705840
Good morning Miku
>>
>>102705774
>Character AI without the filter would unironically mog any local llama we have right now
There's more than Llama.
Local models of a year ago maybe. I used them for like 2 months in early 2024, made several characters and have since ported everything to Silly. 20B and up like Cydonia can match with Cai from back then. I can even get some decent chats out of a 7B and 13B nowadays if the context doesn't get too complex.
>>
>>102705840
show bob
>>
>>102705840
show lightsaber
>>
File: MikuGonCutYou.png (1.32 MB, 832x1216)
>>102705946
>>102706041
>>
>>102706070
PROSTITUTE DO NOT REDEEM THE KNIFE
DO NOT REDEEEM
>>
>>102706070
Hunting in Yharnam, with Miku
>>
>>102705774
Old CAI? Probably

Right now I think Rocinante and other nemo finetunes are genuinely better than the current CAI model
>>
File: miku-pots-n-pans.png (1.9 MB, 896x1152)
You know who the real MVP is? Drummer. This dude knows how to make AI models that actually do something useful - like making people horny and pissing them off. Who cares about all that fancy-ass language understanding and knowledge retention bullshit? TheDrummer's models are all about the tits and ass, baby. They may not be able to hold a conversation or solve complex problems, but they can sure as hell make you laugh your ass off with their raunchy jokes and inappropriate comments.

And let's be real - that's what the people want. They don't give a fuck about your high-falutin' language models or your half-assed fine-tuning techniques. They want something that will make them feel good, something that will give them a quick laugh or a quicker boner. And TheDrummer delivers on that front like a fucking champ.
>>
>>102706205
buy another ad
>>
>>102706205
*flashes you*
>>
>>102706205
>And let's be real
>>
>>102706161
I tend to think CAI's model is unchanged, they just connected some classifier or reward model that does all the filtering, because sometimes you can see the full answer before it vanishes.
>>
File: 1709764900305023.png (190 KB, 643x535)
>>102706205
>>
>>102706205
I hope you're using your own model to generate that drivel, Drummer.
>>
>>102706205
what model
>>
>>102698948
What LORA+model for this image?
>>
>>102706278
6 (You)'s so far and counting.
>>
>>102706300
PonyV6 without lora afaik
>>
>>102706306
So? Everyone knows you did it with LLM.
>>
>>102706313
Oh? Was there some kinda proompt magic going on?
>>
>>102706258
Nope, that still does happen. CAI's model getting worse is like a double pronged thing. On one hand, the filtering and the training on synthesized data definitely made it less smart and then on the other hand they've probably had to reduce the size of the model they're using since the site got a lot more popular and there's like 0 reason to buy CAI+
>>
>>102706322
Make that 7.
>>
>>102706322
Back in my day we used to write schizo posts by hand!
>>
>>102706295
NTA but Rocinante. I'll make it clear now I'm a VRAMlet and this model isn't like some magical thing, but every other 12-22b model I've tried seems like dogshit in comparison. It definitely has the trademark Drummer horniness problem, but it's not hard to guide with OOC instructions. If you do end up getting it, just go for Q8; for some fucking reason every other quant is dogshit as well
>>
File: MikuBobs.png (1.03 MB, 1024x1024)
>>102705946
Sure. But honestly, Miku with a bob just doesn't look like Miku any more
>>
>>102706395
Well, the elf ears and outfit really don't help. Just needs the square hair ties floating above her head even without any hair in them.
>>
>>102705911
that's just fucking cap lad. I literally just tried out Cydonia and it's the same overly horny garbage with the same slop speak as the other models. I've reverted back to normal Mistral Small. There's not a single model even in the 70bs that stack up to Character AI in human like ERP.

Sure as shit ain't getting it from 7bs and 13bs lmao. It's why I was hyped for Qwen, it actually had pretty damn human responses but was censored as fuck (maybe why it reminded me of character AI). But then the finetunes dropped and the model just turns to shit because of them.

>>102706161
That is just pure cope my man.

Rocinante is probably the best nemo fine tune but it still doesn't compare. I actually think Chronos-Gold-12B-1.0-Q8_0 is probably the best one now that I think of em

That's not saying they're shit btw, it's just saying character AIs model is trained on the millions of chats they get on their website by actual users, no model that is trained on shitty novels can compete
>>
>>102706491
>that's just fucking cap lad.
>That is just pure cope my man.
Why do you write like this?
>>
>>102706389
why do people say this
>just prompt it to be less horny

This doesn't work because the minute you try to engage in any lewd acts, the bot instantly reverts back to being horny as fuck. Because what you say to a model matters more than any shitty prompt.
>>
>>102706515
i'm an AI

But memes aside. You can't expect these local models that are trained on novels/books to compete with character AIs model that's trained on actual conversations on their own website that has way over a million convos a year.
>>
Does Google still have the best AI?
>>
>>102706491
>no model that is trained on shitty novels can compete
>*he whips out COCK*
>much original human generated training data wow
You are either completely retarded or a shill.
>>
File: everytime.png (984 KB, 1024x1024)
>>102705443
>>
>>102706424
>Just needs the square hair ties floating above her head even without any hair in them.
This is an exercise left up to the reader.
>>
>>102706594
anon obviously wants miku's iconic square hair ornaments floating without miku's usual twintails through them
>>
>>102706340
>proompt magic
Just the usual ponyXL score keyword fuckery at the beginning:
 score_9, score_8_up, score_7_up, score_6_up, score_5_up, score_4_up, source_anime BREAK
(ominous castle:0.5) inside dark clouds, thundercloud, rain, gorgeous, perfect, girl, (kagamine rin), gorgeous, perfect, tan skin, looking up
>>
>>102706520
Have you tried enabling Skillchad in the settings?
>>
>>102706491
CAI was obviously trained on loads of fanfiction, human chats, forum conversations, etc, with probably a light finetune on top of it, custom samplers and, most of all, large-scale RLHF. Some have speculated that the base model was inspired by or based on Google's LaMDA, which Shazeer worked on.

https://arxiv.org/abs/2201.08239

> The pre-training data, called Infiniset, is a combination of dialog data from public dialog data and other public web documents. It consists of 2.97B documents and 1.12B dialogs with 13.39B utterances. The composition of the data is as follows: 50% dialogs data from public forums; 12.5% C4 data [11]; 12.5% code documents from sites related to programming like Q&A sites, tutorials, etc; 12.5% Wikipedia (English); 6.25% English web documents; and 6.25% Non-English web documents. The total number of words in the dataset is 1.56T. Note that this composition was chosen to achieve a more robust performance on dialog tasks (Section 4) while still keeping its ability to perform other tasks like code generation. As future work, we can study how the choice of this composition may affect the quality of some of the other NLP tasks performed by the model.

To replicate at least in part the original CAI you'd probably need first a pretrained model designed first and foremost for conversations like LaMDA was.
>>
llama 3.2 3B is surprisingly good at RP. I mean, it surely has its moments of being a retard but it manages to hold up surprisingly well for its size.
>>
See this retard? >>102706491 That's your CAI fanbase now. Barely literate dumbasses expecting high quality prose from their shitty prompts. If your shitty 100T brain can't RP properly, you bet even a 405B can't.
>>
File: behemoth.png (49 KB, 839x521)
Hi all, Drummer here...

Wish me luck!

(PS: NTA above. Much love to you though.)
>>
>>102706687
and anon obviously wants you to do it yourself
>>
File: googlebest.jpg (44 KB, 1050x504)
>>102706580
They are the most consistent out of ANYONE making foundation models. From their smallest to their largest model they give the same answers!
>>
File: 1714785672194998.png (88 KB, 849x554)
https://huggingface.co/papers/2410.01748
https://arxiv.org/abs/2410.01748
>>
>>102706800
Good luck, drummer.
What do you think about the current state of RP data that is used to train/tune globally? Do you agree with people shitting on it?
>>
>>102706231
Am I the only person here who wishes I could launch a tactical nuclear weapon at the residence of the "buy an ad" schizo?
>>
Is Mistral Small just busted on Koboldcpp at the moment? Been using Cydonia on it and while it's fine for awhile, at around 6-7k context it starts outputting complete gibberish. Consistently. Every time.

On regular llamacpp and Exllama, doesn't seem to happen.
>>
>>102706899
Calm down, you're talking like a redditor.
>>
>>102706899
buy a chill pill
>>
File: 1703269118654443.png (197 KB, 982x820)
https://github.com/rasbt/LLMs-from-scratch/blob/main/ch05/07_gpt_to_llama/converting-llama2-to-llama3.ipynb
>>
>>102706800
I trust you, Rocinante guy!
>>
>>102706800
>Much love.
Right back at ya. That's one thicc tune, whatcha training that on?
>>
>>102706871
>>102707013
>>102707026
Obvious samefag is obvious.
>>
>>102706976
What's the point of doing that?
>>
>>102704387
30k? Which model do you use that summarizes that much well?
>>
>>102706976
Love him. His videos are great too.
>>
>>102707044
Understanding the architecture better I think, it's a learning resource.
>>
>>102707040
I thought shills didn't work weekends?
>>
>>102706395
thx bby
>>
>>102706580
Google is #3
>>
>>102707103
google numba four
>>
>>102700768
We should bring back the trans miku threads...
>>
>>102705443
>Qwen
Overcucked even by Californian standards
>DeepSeek
Too big for most people
>InternLM
Benchmaxxers with questionable real life performance
>GLM
Irrelevant, but dropped promising 9b model
>Yi
Might be decent for some people, but for me it went schizo with reasonable settings(neutralized samplers)
>All of them trained on GPTslop
Why can't chinks release something reasonable?
>>
>>102707268
Yi can into kino, but needs insane babysitting.
>>
>>102707268
Truth is without ChatGPT, China would never have a single LLM. They cannot invent. They cannot innovate. All they can do is derive and infringe and birth abominations that adhere to both their Californian parents and their CCP masters.
>>
File: livebench-2024-09-30.png (932 KB, 3294x1894)
Llama 405B bros... what went wrong?
>>
>>102707396
>Llama 405B bros
They don't exist.
>>
>>102707396
Meta decided to keep some world knowledge in their model, chinks went the Phi route.
>>
>>102707396
72B is nowhere near as good as 405B. This benchmark is borked.
>>
>>102707396
>openai's 8b model leaves open sores 405 model in the dust
and people say there's no moats
>>
>>102706871
That RP dataset everyone's been using? Yes, I understand the issue.

I've been trying to deviate from it with:

- the Unslop initiative
- collaborating with the Gutenberg guy
- synthesizing my own non-Claude datasets
- peppering most of my finetunes with a large human-written instruct dataset

I'd like to think that the last one is what sets apart Rocinante from the other Nemo models. I can't overdo it though since human data is dirty as fuck, so I'm forced to keep it subtle.

(If you've ever had a stranger walk up to you and char fucking in the alleyway, and decide to jerk off to the scene, then you have this dataset to thank for it.)

>>102707026
Playing it safe for now. It should be similar to Cydonia v1.
>>
>>102707425
The moat is giving 0 fucks about copyright and not applying filters to the pre-training dataset.
>>
>>102707427
i downloaded new dawn the last week, what am i in for?
>>
>>102707427
Exchange currency for advertisement space once more
>>
>>102703531
>>102703533
>>102703614
well it seems switching to numbers as words made it not jump to nearly zero loss within the hour like last training session.
>>
>>102707427
Take out a loan and pay for advertising.
>>
>>102707464
>>102707481
Currency spent on advertisements is currency not spent on compute.
>>
>>102707396
>Llama 405B bros... what went wrong?
405b "turbo" ie. gimped
>>102707409
lol. keep telling yourself that
>>
>>102706792
>>102706582
Not a single coherent argument was made.

Sorry babbies, but your shitty 8k rigs barely hanging together with poor cable management and cooling? It gets mogged by a free to use website online.

See >>102706770
>>
>>102706899
He's a bit annoying, but I'd rather have anti-shill schizos than threads full of shills.
>>
>>102707438
>The moat is giving 0 fucks about copyright and not applying filters to the pre-training dataset.
This. To win you have to be the scummiest scumbag ever. Train on the dirtiest, vilest data, but in public plead for safety and regulation. To keep your model's performance combined with safety, apply safety only at the last stage of finetuning. Produce fake papers (who's gonna verify them, lol) claiming that "unsafe" data in the base model is harming performance. I have to applaud Sam for this one, he gimped his local competition a lot, and that takes some dirty talent (and judaism).
>>
File: 142140240420.png (97 KB, 640x626)
>>102706800
>>102707427
What's stopping you from making a qwen 2.5 finetune btw?

Is it just too censored to even bother trying? If any model could use your degeneracy, it's that one
>>
>>102707571
nah if anything I believe openai filters their training data the hardest
>>
>>102707563
Nice cope, I've used CAI since its launch and the current llamas mog it hard. Not like you'd know with the way you're prompting obviously
>>
>>102707723
>"promptchad"
ah yes, please tell us more about your opinions, they surely matter a lot.
>>
>>102703294
https://huggingface.co/spaces/NoCrypt/mikuTTS
>>
File: 1726708006054312.png (269 KB, 1507x870)
>>102708004
brainrot
>>
>>102708004
which one actually sounds like miku and not a generic anglo girl?
>>
File: 1711377405708574.png (1.74 MB, 1249x1077)
https://x.com/TheAITimeline/status/1842759118509002777
>>
>>102708077
Papersanon in this moment is euphoric
>>
>>102708077
none of these will end up mattering
>>
>>102708077
I wish I could tell at a glance which paper matters, can't really play catch-up with AI papers
>>
What is the current meta for RAG against custom data? Let's say the custom data is some companies you have in a database.

I'm thinking it's like this:
>Question from user
>Send question to LLM with some custom prompt "what companies are the user asking about?"
>Get answer from LLM
>Tell the LLM to make an API call using the companies you get from the answer
>Answer the question from the user using the data you got from the API calls (to your own REST API)

Do you agree? Or is it better to still use LangChain or LlamaIndex?
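That flow without any framework is only a handful of calls. Rough sketch below, assuming an OpenAI-compatible local server and your own REST API (all URLs, the model name and the JSON shapes are made-up placeholders, and you'd want to validate the extracted JSON in practice):

import json
import requests

LLM = "http://localhost:8080/v1/chat/completions"   # llama.cpp / vllm etc.
API = "http://localhost:9000/companies"             # your own REST API (placeholder)

def ask_llm(system, user):
    r = requests.post(LLM, json={
        "model": "local-model",
        "messages": [{"role": "system", "content": system},
                     {"role": "user", "content": user}],
        "temperature": 0,
    })
    return r.json()["choices"][0]["message"]["content"]

def answer(question):
    # 1) let the model extract which companies are being asked about
    names = json.loads(ask_llm(
        "List the company names the user asks about as a JSON array of strings. Output only the array.",
        question))
    # 2) fetch the real records yourself instead of letting the model guess
    records = [requests.get(API, params={"name": n}).json() for n in names]
    # 3) answer grounded in the retrieved data
    return ask_llm(
        "Answer the question using only the company data provided. Say so if the data is insufficient.",
        "Data: " + json.dumps(records) + "\n\nQuestion: " + question)

LangChain/LlamaIndex mostly just wrap these three steps; whether the extra dependency buys you anything is your call.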
>>
>>102708181
Just have a model summarize the papers and if one of them sounds interesting then read it in more detail.
>>
>>102708207
i dont know what any of these words mean
>>
>>102708223
Ask the AI
>>
>>102708207
I'm not working for free bozo
>>
File: 1723597857019486.gif (2.76 MB, 600x336)
Any reason to use {{char}} in the card instead of just {{Name}}? Looks like using {{char}} fucks up group cards.
>>
>>102708220
That's what the abstract is for, you brainrotted ignorant retard.
>>
Different models have different strengths and weaknesses. It's cool that actually right now there are different open weights models that can match different aspects of the closed models with the exception of Claude 3.5's coding performance, o1's test-time scaling on various subject areas with some exceptions, 4o's voice (though for fun tasks only when you JB it and are lucky enough to not get caught by the output filters), and Gemini's context length, although there is also no closed model with all those capabilities in the same model, so if one criticizes open models, they should also criticize closed models that don't have the one special feature they care about.

Also up until now there have not been any open models that had something closed models didn't, but there is something now, with Molmo, which has the capability to plop points onto an image, something no closed or other open model can do yet. So it does seem like the open weights category is doing extremely well especially compared to a year ago where not a single aspect of open weights models matched or exceeded closed weights models.

With that said there are likely still some things that current benchmarks have not been able to capture that some closed weights models do better, like Claude's RP performance, so it's not like open weights models have completely matched the status of closed models. It's just a lot better/closer today than it once was.
>>
>>102708261
'{{char}}' is just a variable in Tavern that immediately gets replaced with the name in the card by Tavern itself. If you write {{char}} and the character card is named "Miku", {{char}} will be replaced with "Miku". If your character card is named "Retarded Dickface Faggot" then {{char}} will be replaced with "Retarded Dickface Faggot". The model has nothing to do with it and will never even see '{{char}}'.
{{name}} is not a variable and you're just manually filling in the name with brackets that the model is not expecting.
>>
>>102708359
I see. So what should I put in then, just Miku without brackets?
>>
>>102708359
{{name}} is a variable that works in instruct sequences only and pulls the name used for that specific message instead of that of the character card or the username. Yes, this is not documented and it's retarded.
>>
>>102706899
at first it was annoying but now i think it's funny, except when he tells me to buy an ad

ngl been saying it other places now
>>
>>102708452
Yes, it's the same as manually writing out the exact name of your card. You can test how it behaves by just manually editing a message in Tavern and writing '{{char}}' in it. It'll be replaced with the card's name with no traces of '{{char}}' if you try to edit the message again.
I guess it's also worth noting that using {{char}} in your prompt is probably not ideal if you're running one of the shitty chub cards that are often called something retarded like "Hatsune Miku - Your ex-vocaloid office lady neighbour who is very sexually frustrated and obsessed with you". At best, it's a waste of tokens.
>>102708522
I wasn't aware of this. That's really retarded.
>>
https://x.com/nisten/status/1842987442556764636
>>
>>102708697
Thanks again Anon. It's become pretty rare to actually get good advice from this thread.
>>
File: fira results.png (109 KB, 891x352)
https://arxiv.org/pdf/2410.01623
This new pre-training method seems interesting: it uses as much memory as GaLore and less than a standard LoRA, but achieves the same results as full-rank pre-training.
There was another paper that lowered bandwidth between devices by 20x; a lot of this could be used for decentralized training soon. Would this thread be able to organize to train a model?
>>
>>102708865
Eh, I still doubt it'll be a thing. And even if it was finally possible, none of us has the huge pretraining dataset, and training the model would still take ages.
>>
File: 1702902928295584.png (42 KB, 661x413)
>>102708865
>Would this thread be able to organize to train a model?
This will be the most cucked and unbased model ever, it will rival gemini & other slop. You know the usual "Someone's faulty training node injects bad data in training flow" theory.
>>
>>102708900
>Someone's faulty training node injects bad data in training flow
I don't think that'd be the issue. Honestly I think there could be solutions to that. The problem is >>102708898
>>
File: just step on me.png (124 KB, 500x500)
It feels like nothing worthwhile is happening in the 70-72b range in terms of usable finetunes. I still find myself going back to hermes 2 for fuck's sake. Nothing else has been able to just "get" characters or stick to the story context of a long conversation like it can without schizoing out. I've tried qwen 2.5, its uncensored variants, magnum, donnager, euryale, chronos, storyteller. What have I missed? What else is there to try?
>>
>>102708951
this may apply to you as well:
>>102707464
>>
>>102708898
The thing with distributed training is that we can all have 300-400GB of data (or TBs upon TBs on the cloud) and use it for the gradient descent or simply run a few layers. If we got 30 people we could have 10TB of data, which is approximately 5T tokens, nothing to scoff at. I agree that the main problem would be >>102708900 but that can be resolved by person n having his 350gb of data and person n+1 training on that data, every once in a while checking that there isn't anything weird getting in.
>>
>>102708965
Yes but WHERE is that data coming from? A lot of the open datasets are garbage, and there's no way we're going to be able to train a model for long enough to overcome the hit in intelligence that comes from training on the unfiltered internet. We can't be Anthropic.
>>
>>102709039
What if every anon contributes 2 pieces of human-written Q&A pairs?
>>
>>102708951
Nothing really, I'm still using the old Command-R because new models are ass for non-corpo usecases.
>>
>>102709039
When I see the official HF small datasets (<10K) for specialized tasks (summarization, emotions...) full of scraping errors/written like shit, I don't have much hope for these GB of data.
Honestly it'd be miles better if they bothered to run some heavy cleaning scripts instead of trying to add everything they could.
>>
File: 1710263831214490.png (493 KB, 639x581)
>>102709039
This is the posting on /lmg/ now?
Holy fuck, I knew aicg was aids but jesus.
>>
File: rombo.png (24 KB, 483x248)
>>102702631
>>
>>102709169
nobody cares; fuck off
>>
>>102709169
Glad to be in the same scene as him.
>>
>>102709169
The fuck lmao
>>
>>102709127
Hit me with one dataset you would like to see cleaned up, I always wanted to attempt something like this.
>>
>>102709197
Will you finetune Qwen 2.5 32b?
>>
So Elon will share the grok 2 weights in 4 months, after llama 4 is released?
>>
>>102702631
You'll never be a woman, and plus, buy an ad.
>>
>>102706899
No, he's annoying, especially when you realize he's the same schizo who's been shitting up every AI thread on the site.
>>
>>102709408
Yes. Four more months. Trust the plan.
>>
>>102709428
buy a meds
>>
>>102698948
Hi all, me and my team have recently taken an interest into this new and dynamic field of local models.

We love all of the energy and innovation but at the same time this ecosystem is very complex and there are so many models to choose from!

Isn't there a state-of-the-art model that we could use to drive down costs and create value for our customers?
>>
>>102709506
>>
>>102709039
fineweb data (15T tokens, enough for us) is pretty good to start with, and we can add whatever RP data we want.
if you want all unfiltered data the CC has petabytes of crawled 100% uncensored unfiltered kosher goyslop internet data
>>
>>102709506
I can write to your customers for 2 hours a day in exchange of letting me say nigger to a custome once in a while, I choose the costumer btw
>>
>>102709197
finetune an image model and I might click your ads
>>
>>102709506
Post a Miku
>>
>>102709551
>I choose the costumer btw
What did the costumer ever do to you?
>>
>>102709373
Well, there is this one then: https://huggingface.co/datasets/dair-ai/emotion
>>
>>102709551
>I choose the costumer btw
It's still too early for costumes.
>>
>>102709544
CC? Where can I take a look at that?
>>
What's the cheapest platform to rent a 4090?
>>
>>102706800
Curious, what's the learning rate you've tried
>>
>>102709672
vast
>>
Darn, I just saw this: https://openai.com/index/api-prompt-caching/

I guess it's officially over, OpenAI, once and for all, won.
>>
>>102709753
r-rude, I've started dieting...
>>
>>102703736
I also thought it was that guy LMAO
>>
>>102709810
buy an ad
>>
>>102706834
I swear to god I will bomb a hospital if I see another post of someone thinking counting letters on a word is a benchmark for llms
>>
>>102708181
Just train an AI to identify the good ones.
Take the contents of each paper from up to, say, mid 2023 and then pair them with a score based on how much they mattered as of today. Train a model on this and have it predict the score of newer papers to see which will be nothingburgers.

Challenge: Do the same but only the abstract + author list instead of the entire contents and see if the model can learn what people - or what flavor of names - lead to useful research vs trash.
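Bare-bones version of the abstract-only variant (sketch; assumes you've already scraped abstracts and hand-assigned "how much it mattered" scores, which is the actual hard part):

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import Ridge
from sklearn.model_selection import train_test_split

def train_impact_model(abstracts, scores):
    # abstracts: list of strings, scores: list of floats, both assumed prepared
    X_tr, X_te, y_tr, y_te = train_test_split(abstracts, scores, test_size=0.2, random_state=0)
    vec = TfidfVectorizer(max_features=50000, ngram_range=(1, 2), stop_words="english")
    reg = Ridge().fit(vec.fit_transform(X_tr), y_tr)
    print("R^2 on held-out papers:", reg.score(vec.transform(X_te), y_te))
    return vec, reg

def predict_impact(vec, reg, abstract):
    return reg.predict(vec.transform([abstract]))[0]

Swap the abstracts for author lists and you've got the second, spicier experiment.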
>>
>>102710194
Are you in the IDF?
>>
>>102706800
best 12/22b finetune you did? samplers for said finetune?
>>
>>102710239
Underrated
>>
>>102709810
Waow a 50% discount on 10% of my tokerinos.
>>
>>102710679
>>102710679
>>102710679
>>
>>102710227
Man you'd need at least 5-10K samples to make a good classifier, good luck reading through all that shit + trying to figure out if someone did something of it.
>>
>>102710735
That classifier would be worth a billion dollars. Companies could use it to pick out which papers to implement next. It would be like predicting the future.
>>
>>102710735
>look at title of paper
>is it a llama.cpp feature today?
>if yes: "good"
>if no: "shit"
might even be able to automate that tbqh


