/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>101197169 & >>101191862

►News
>(06/28) Inference support for Gemma 2 merged: https://github.com/ggerganov/llama.cpp/pull/8156
>(06/27) Meta announces LLM Compiler, based on Code Llama, for code optimization and disassembly: https://go.fb.me/tdd3dw
>(06/27) Gemma 2 released: https://hf.co/collections/google/gemma-2-release-667d6600fd5220e7b967f315
>(06/25) Cambrian-1: Collection of vision-centric multimodal LLMs: https://cambrian-mllm.github.io
>(06/23) Support for BitnetForCausalLM merged: https://github.com/ggerganov/llama.cpp/pull/7931

►News Archive: https://rentry.org/lmg-news-archive
►FAQ: https://wikia.schneedc.com
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Programming: https://hf.co/spaces/bigcode/bigcode-models-leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
►Recent Highlights from the Previous Thread: >>101197169

--Extending LLM's Context Window with Activation Beacons: >>101203075 >>101203249
--The Strawberry Test: A Flawed Method for Evaluating LLM Quality: >>101198523 >>101198632
--Model Requirements and File System Operations: >>101200948 >>101200984 >>101201000 >>101201064 >>101201005 >>101201126
--Miyu's Odd Behavior in the Classroom and LLM's Writing Limitations: >>101198989 >>101199237 >>101199267 >>101201087 >>101201307
--Licensing Model Weights: Dubious or Protected?: >>101200593 >>101200629 >>101200660
--Koboldcpp vs KoboldAI: Choosing the Right Model for Your Setup: >>101201486 >>101201527 >>101201593 >>101201687
--HF Leaderboard: Qwen and CR+ Performance: >>101202193 >>101202222 >>101202316 >>101202816
--Gemma2 27b's Technical Accuracy in Poetic Metre: >>101197882 >>101197901 >>101197912
--Gemma 2: A Modified Version of Gemini Flash?: >>101197434 >>101199289 >>101199667
--Control Vector Test Drive and Applications: >>101198756 >>101199025 >>101199067 >>101199204 >>101199229
--Chatbot Arena - Vision Rankings: GPT-3.5 and Claude 3.5 Sonnet Dominate: >>101199300 >>101199358
--Best Local Model for App Development and Programming: >>101201995 >>101202012 >>101202435 >>101202570 >>101202751 >>101202777 >>101202849 >>101202834 >>101202882 >>101202914 >>101203105 >>101203182 >>101203195 >>101203256 >>101203129
--AI Model's Shitty Cliche Smut Tropes and How to Fix Them: >>101198056 >>101198076 >>101198087
--27B Model Generates Endless Pad Tokens in Transformers: >>101197754 >>101197828 >>101197860 >>101197907
--Web Development's Wrong Turn: From Documents to Scripting Languages: >>101197613 >>101197652 >>101197660 >>101197771 >>101197945 >>101197963 >>101204914 >>101197974
--Anon's Love Letter to Mixtral Model Stock Experiment: >>101202093 >>101202420 >>101203831 >>101203861
--Miku (free space): >>101197686 >>101203008 >>101203898

►Recent Highlight Posts from the Previous Thread: >>101197174
I am the one who says the nigger word with no repercussions
>>101205045
YOU CANT DO A HECKIN RACISM OUTSIDE B CHUDDD NOOOOOOOOOOOOO
>ctrl+f (You)
>14 matches
>>101205004
Learning dangerous knowledge with Rin
>>101205072
I wonder if they recycle janitors/moderators every so often. There was a period between 2021 and late 2023 where I was getting redeemed for "racism outside /b/", even in threads where other people were far more racist and were hit with nothing. It's not happened once so far this year.
>>101203790 (me)
>deepseek
Trying it myself now. I can't really run it right now / don't want to download it, so I paid for the official deepseek API. It was ~$2 for 7 million tokens. lol
>>101203790
I think it's definitely superior to deepseek-v2-instruct when it comes to ERP, but I don't think it can keep up with Sonnet in that regard.
>>101202420
Your model is the most blueballing model I've used so far. Plz fix.
>>101205229
I grabbed it and did a couple of tests (running it in ollama), and it seems to go off the rails quite a bit more than other models; probably a skill/prompt issue. Curious if anyone has tips.
>>101205369
I tried it; it failed all of my factual quality tests, and the RP pulled that "despite being told twice whose character is whose, it just writes for mine instead of its own" crap. It might be an admirable effort, but from my perspective it probably got the worst aspects of the merge components instead of the best. Maybe just bad luck, but deleted in hopes of putting better bytes on my drive.
>>101205457
Fuck.......... is limarp zloss the only good mixtral finetune?
>>101205229 (me)
Gave -instruct the writing prompt from EQ-Bench. This was the first roll.
https://rentry.org/hyum3kaw
4 years of vramlet cope and still nobody's figured out a better way to make a transformer model smarter than just making it bigger
>>101205476
405B will save us until Llama-V-JEPA

>>101205461
yuzu alter rpcal...

>>101205468
Light on the purple prose. Kinda like it, desu.

>>101205468
>conspiratorial whisper
>eyes twinkling
>maybe, just maybe
But otherwise, pretty good.
is L3-8B-Stheno-v3.2 still the best model for people with only 24gb of vram?
why is command-r almost slower than command-r plus for me...
>>101205505
>rpcal
Didn't the exl2 dude say that shit makes the calibration of the model worse?

>>101205540
He doesn't know what he's talking about.

>>101205537
Buy an ad.

>>101205538
Lack of GQA, maybe? Although even then, the difference in size should more than bridge the context size gap.
>people still shilling stheno when there's lunaris
No, but seriously though, how do they compare? He released that shit a while ago already; shouldn't people have used it by now? I don't remember seeing any impressions of it.
>>101196305I told you bros he was algerian
>>101205461
I'm just looking for the only good models. Qwen2 Q4KS or better, Llama Q5KS or better, CR+ Q4KM or better seem to be the only ones passing my tests. I'm looking at Magnum right now (apparently a Qwen2 spin) and it's going okay; it hasn't done the weird stuff the normal Qwen2s have done to me before, like barfing moon runes spontaneously unless I drop CuBLAS, so that's nice.

>>101205578
I didn't know that was a thing. Gonna give it a try when I get home. Let's see how it does with my RPG card.

>>101205553
Forgot I had 8 swipes per gen enabled. For whatever reason, CR seems to be affected way more than CR+ at higher batch sizes.
>>101205551
>the guy who made exl2 quants doesn't know about exl2 quants
okay then
>>101205639Makes sense to me.
>>101205552name something better
>>101205468
I played around with it a little too. I liked it, and it's definitely smarter than other open-source models (it's very strange that it is on no leaderboard). It gave me the impression that it's hyper-aware of its entire context at all times.
>>101205793Mixtral-limarp.
verdict on gemma2?
So I have been thinking about this hobby and the financial costs involved. I currently have an RTX 3080 with 10 GB of VRAM. I have a 4k monitor but don't do much AAA gaming anymore. Let's assume the upcoming RTX 5090 will have 32 GB of VRAM and cost $1800. Stheno and other 8B models run fine on the 3080 and are okay for a quick coom or a short RP session. And if I want to play with bigger models, I can rent an RTX A6000 for about $0.85 per hour, or an A100 for $2/h. Used 3090s are "cheap" at about $600 each, but they are big, loud, and suck up a lot of watts.
Given that my LLM usage is at most 15 hours/week of RPing with a chatbot, I could rent the GPU power I need for almost 3 years for the price of a hypothetical 5090. I understand the desire to keep the spicy logs on prem, and the idea that I will "own nothing" if I rent the GPU from runpod, but does buying new top-of-the-line GPUs really make any sense for a "casual" user like me?
Have a shitty AI-genned RTX 5080 picture.
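The rent-vs-buy arithmetic above can be sketched quickly; every figure below is the anon's own assumption (hypothetical 5090 price, quoted rental rate, stated usage), not a real quote:

```python
# Hypothetical break-even sketch: weeks of renting before buying the card
# would have been cheaper. All numbers are assumptions from the post above.
gpu_price = 1800.00      # hypothetical RTX 5090 price, USD
rent_per_hour = 0.85     # rented RTX A6000, USD per hour
hours_per_week = 15      # stated weekly RP usage

weeks_to_break_even = gpu_price / (rent_per_hour * hours_per_week)
print(f"break-even after ~{weeks_to_break_even:.0f} weeks "
      f"({weeks_to_break_even / 52:.1f} years)")
```

At roughly 141 weeks (about 2.7 years) of renting before the card pays for itself, the "almost 3 years" estimate holds up.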
>>101205853
Financially, it probably does make more sense. You have to decide for yourself if the privacy concerns are worth it.

>>101205835
I want it, but I can't have it until exl2 adds support.

We'll get new llama 3 stuff in July, right?

>>101205893
gpt-4o drop delayed everything until august
>>101205853
If all you want is to coom, sure. I enjoy the tinkering possibilities local provides.
>>101205853
Where are you now? If you've got a single decent 30XX or 40XX kind of card and ≥64GB RAM, you can be where I'm at, which is about 1 to 2 tokens per second on non-awful models. Not fast, but I just treat it like AIM: chat a line and send, and it'll make a noise in a few minutes when it replies. That would be enough for you to see if you actually dig it and have ideas of things that NEED more power, or if you get your taste, get bored, and spend that money on groceries.
>>101205853
There's also a lot of APIs now where you can use models directly. You don't have the direct control you get when renting, but depending on how you use the model it's even cheaper, and these models are big and not quantized. I also assume prices will just keep going down.
If you don't have an attachment to local, look at this anon's purchase: >>101205229 That works out to about 28 cents per million tokens. Ignoring the whole privacy and control aspect, local isn't quite worth it anymore if you want to run quality models. With progress, this might change again. Local made sense when OpenAI was the only player and their rugpulling could really fuck up your shit (that's how lmg was born to begin with), but times have changed. If somebody rugpulls, just go somewhere else.
Turns out babies could be a good avenue of research for improving foundational AIs.
https://www.cell.com/trends/cognitive-sciences/fulltext/S1364-6613(24)00114-1
>>101205468
I feel the Chinese just aren't taken seriously in the AI space, while their stuff keeps improving and improving and nobody really talks about it.

>>101206053
Interesting, thanks for posting this.

>>101205893
Yes, the anniversary of llama-2 in late July will be celebrated with the release of llama3-creative-128B, which is optimized for roleplay and creative endeavors.
>>101205808
The context awareness is something I'm interested in, but on large-context tasks I find it gets into weird repetitive loops. I asked it to write an overview of some code, provided the code and API documentation. It starts off really strong, but after a few paragraphs it jumps back to "This is an overview for xxxx library..." and starts over again. Anyone else experience stuff like this?
>>101206152
Size is bullshit, but Meta did claim to be considering partnering with Character.ai.

When will we get an actual llama? That's the only reason why I'm here.

>>101206111
No problem, mate, glad you found it interesting. I am hopeful this avenue of research will benefit synthetic data for models, since the paper put some emphasis on self-supervised learning.

>>101206336
No purchase necessary to enter. Simply fill out the form on the web site and your llama will be delivered in 2 weeks. Deadline to enter is April 20th 2024.
>>101206215
is that why it's terrible now

>>101206336
>>>/an/catalog

>>101206336
I'm sure you can find a llama card somewhere.
>>101205810
This shit is six months old, lmao.

>>101206093
We are too busy with important questions like "is an LLM saying nigger literally genocide?" or "will an image-gen model generating booba destroy civilization?" while they, you know, just make stuff.

>>101206513
Chinese models perform well on benchmarks, but IRL they get mogged by Western models.
Which Kobold presets do you guys use for models?
>>101206596
My own. No samplers except temp 0.5~0.85 and minP 0.05.
>>101205835
WNBAG

>>101205994
But can you prefill with the /chat/completions API?

>>101206676
TRVKE

>>101206367
keeeek
Any of you guys run this on a Celeron? Kind of want to go for 70B, but on low-end computers.
lmao, Google cheated to make its model look good on Chatbot Arena
If I increase GPU layers past 48 I get the error message below, even though I have a 3090 + 4090???? Can someone on the koboldcpp team please tell me what is going on immediately.

CUDA error: out of memory
  current device: 1, in function ggml_cuda_set_device at D:\a\koboldcpp\koboldcpp\ggml-cuda.cu:115
  cudaSetDevice(device)
GGML_ASSERT: D:\a\koboldcpp\koboldcpp\ggml-cuda.cu:102: !"CUDA error"
https://www.reddit.com/r/LocalLLaMA/comments/1doxvdi/selfplay_models_finally_got_released_sppo/
>>101207279
This is insane, btw. This 8B legit performs at the level of all the big models I've ever tried. If they apply these methods to larger models, we are legit gonna have Claude Opus at home.

>>101207038
Well, that's kind of fucky. In an ideal world, people would be testing newer and different prompts on lmsys, but they probably don't, since they're retards, so you end up with a lot of the same or similar prompts. That means even if they're technically not cheating, because they're training only on the prompts and their own original answers, in practice it is cheating.
>>101206209
I had this too (it replying with earlier replies) using SillyTavern. I had the API added as "OpenAI compatible"; setting prompt post-processing to "Claude" somehow fixed it. No idea why.
>>101207300
What did you test it on? I'd like to see some logs. So far the thread has determined that it's good at some things and about the same as regular Instruct at others.

>>101207300
>This 8B legit performs at the level of all the big models I ever tried.
There's no way; its MMLU is around 65, worse than L3-8B-Instruct.

>>101207354
Just try it, it's an 8B. Night and day.

>>101207300
>If they apply these methods to larger models we are legit gonna have claude opus at home.
And if Anthropic uses this technique on Claude Opus, they'll get god, kek.

>>101207366
When you say "big models", which ones are you referring to?

>>101207366
Why are you like this? Are you trying to LARP as a redditor to increase hate for redditors?
>>101207307
Fuck me, I don't want to go read the SillyTavern code to understand what this means T_T
>>101207279
SPPO is really great in my tests; if there was a 32k-context version it would be god-tier.
>>101207279
>3 days ago
Anon, if this model was as good as you claim, people would've talked about it already. Do you think we purposely avoid good models or something? kek

>>101207400
Do you think 4chan is news central or something? People are usually slow on the uptake here.

>>101207459
Of course it is; that's why I lurk here. When something interesting happens, it gets talked about quite rapidly.

>>101207459
>Do you think 4chan is news central or something?
Yes
>>101203831
Hey anon, if you haven't already, make sure you're on the staging branch of ST, otherwise some of the templates won't work as intended. Also, I have not tested using the templates with DRY sampling, if that's also in play.
>>101205994
>28 cents per million tokens
Wait, what? That's basically almost free. For some of my roleplays I have 12k context and it rarely gets filled, but being generous, regenerating a lot, let's say I'll consume 30k tokens. That's just 3% of 28 cents, not even a full cent. How can it be this cheap compared to local? Am I missing something?
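The arithmetic above checks out; a quick sketch, taking the ~$0.28 per million tokens figure from the earlier posts as an assumption (real providers vary, and most bill input and output tokens at different rates):

```python
# Cost sketch for the API rate quoted upthread. The rate is an assumption
# taken from the thread, not any provider's actual price list.
rate_per_million_usd = 0.28
tokens_per_session = 30_000   # generous long RP with lots of regens

cost = tokens_per_session / 1_000_000 * rate_per_million_usd
print(f"${cost:.4f} per session")
```

That comes out to well under one cent per session, which is why a long RP barely registers on the bill.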
>>101207279
>>101207366
Show us some logs, anon; you gotta sell the product with some examples.

I know this thread is for LLMs, but I suppose you anons would know this: what's the best local voice cloning tool currently out there? I have a pretty beefy GPU, so that's not an issue.

>>101206367
I feel bad for them.
>>101207588
No. Serving models has become ridiculously cheap. Most places offer Llama 70B for less than $1 per M tokens, while giving you a lot more control over the output than OAI, and the cost is only going to continue dropping.
>>101205045
Every time I see posts like this I imagine a toddler giggling because he said a bad word to get a reaction from his parents. Same maturity level, I guess.
>>101206367
All major companies quantize their models based on traffic. As they grow in size, the product naturally gets worse.
>>101207663
the fuck

>>101207663
>real cuda dev trip
what did he mean by this

>>101207663
not like this

>>101207663
I guess that's what he gets for using a non-secure tripcode.

>>101207663
CRACKED AND BLACKED

>>101207279
I'm trying it, and it's actually really good?
>>101207775
That's curious. How was it non-secure?

>>101207089
Disabling mmap fixes it, I think.
>>101207701
Nothing, I was busy getting rekt in XCOM.
>>101207775
I may be misremembering, but didn't secure tripcodes rely on cookies?

>>101207663
>>101207871
>cudadev is a blackedfag
unsurprising
So after all this, are we now accepting that Google can call themselves a big shot in AI, despite embarrassing missteps like glue on pizza, eating rocks, and the embarrassment that was LaMDA/PaLM/Gemma 1? Sundar is still a dumbass for letting the company flop this long on AI, but at least he has the right people at the helm now, with DeepMind in charge and Demis Hassabis instead of Jeff Dean.
Gemma 2 is proof of the progress. Yes, they may have gamed LMSYS by training on the prompts, but the 9B model outdoing L3 8B for people in these threads is proof that they are at least on the playing field, competing against the best in the industry. I am looking forward to more models from them; hopefully Meta forces the issue soon with Llama 4.
>>101207871
That is with mmap disabled. I only have 32 GB of system RAM, so I have to disable mmap.
>>101207913
>So after all this, are we now accepting that Google can call themselves a big shot in AI
As long as they can't compete with the best (GPT-4 and Claude 3/3.5), the answer is no.
>>101207858
The FAQ just says
>Secure tripcodes use a secret key file on the server to help obscure their password.
and doesn't say anything about cookies, so I guess I'll just try adding the extra # and see what happens.
>>101207918
Then I unfortunately don't know what the problem is. I think a low amount of pinned memory was a Windows-only issue, though.
PFFFFT
>>101207913
It's sad it took them this long to release a larger model that's only on equal footing with Meta's model.
Reminder that zucc isn't even taking this shit seriously; he's still got the metaverse in the back of his mind, and he's just waiting for video gen to get really good.
The fact that Gemma is just another standard transformer with very little innovation shows they are still in catch-up mode.
If OpenAI drops a paper tomorrow, I guarantee everyone will read it to glean as much secret sauce as possible; not so much for Google.
And until they can touch Claude 3.5 or GPT-4o, they will be left behind.
>>101207952
If ditching koboldcpp could fix the problem, what other backend would you recommend?
>>101205537
>>101206442
Only contenders I've found are:
Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss-3.7bpw-h6-exl2-rpcal @ 4096 context
c4ai-command-r-v01_exl2_3.5bpw-rpcal @ 2048 context
Nous-Hermes-2-Mixtruct-v0.1-8x7B-DPO-DARE_TIES-3.7bpw-h6-exl2-rpcal @ 4096 context
maybe BagelMysteryTour
cOOmandR sucks and doesn't fit in 24GB
llama3 8b sucks
Prove me wrong or tell me something better; I can't find it.
>>101207965
If you don't have enough VRAM, then something based on llama.cpp (which koboldcpp is) is basically your only option. You can always try running llama.cpp directly, but unless the koboldcpp devs changed the model loading code I think you'll run into the same issue. You could try running it through WSL or on Linux.
>>101207976
Not BMT, for sure; the moment you touch your character's boobs it sends you on a journey with bonds and testaments of your mixed feelings.

>>101208010
Yeah, that's why I looked for something else. It was decent for a few weeks though.
>>101207663
>I wouldn't recommend koboldcpp.
Me neither.
>garbage UI
>garbage chat API
>1200 files in a zip folder that takes 3 seconds to decompress every time, "here is your single file executable bro"
>maintained by cancerous discord fags
>the discord fags obsess over these threads and post anonymously, astroturfing
>>101207943
I wasn't calling them the best, just saying that their claims of being able to match the top players on LMSYS are now verifiably more true. They aren't even that many Elo points behind GPT-4o or 3.5 Sonnet.

>>101207958
Google is still holding back their research in this area for a year after OpenAI essentially firewalled off their own research and profited off Google's open publication of theirs. https://www.businessinsider.com/google-publishing-less-confidential-ai-research-to-compete-with-openai-2023-4
Because of OpenAI, no Western company at the top of AI is publishing papers immediately on findings; the Chinese still do, if only because they need the citations and credibility with the world that Western companies don't, and I expect the CCP to crack down on it at some point when it's clear they are ahead.
>>101207952
>>101207997
Actually, now that I think about it: if the blacked spammer cracked the non-secure tripcode, I should not just use the same string for the secure one. So I guess I'll use this one from now on.
>>101205835
I tested 27B in 8-bit quant and it seems pretty bad. Maybe it's a settings issue, but it hallucinates a lot and struggles with coherency.

>>101207597
AFAIK there isn't a good tool for this; everything is convoluted and stupid. Last I fucked with it, this was the SOTA:
https://git.ecker.tech/mrq/ai-voice-cloning/wiki/Installation
It worked pretty okay, but nothing like the demos you hear. Would love it if someone could point me to a working local voice cloning tool as well.
>>101207952
>>101208093
confirm new trip using github somehow
>I'm all ears... or rather, all text!
>>101208095
>>101205835
Also running Q8; similar findings. Both it and 9B are amazing at one-shot small-context tasks, though the generations seem much more deterministic than other models; for things where there's a "right answer" it gets it very often. It's definitely a good replacement for StackOverflow/googling most stuff.
>>101208091
That's all well and good, but Gemma and Gemini 1.5 are what they've been cooking up behind closed doors this entire time, all while OAI has been working on Q* and GPT-Next and Anthropic is undoubtedly taking their sparse autoencoder tech to crazy applications. It feels like they were competing with Llama, not GPT or Claude, especially when you consider that even based on your image their best model is still worse than 3.5 Sonnet, which is the bottom end of Anthropic's latest batch of models.
>>101207858
Tripcodes with a single ! are insecure because your password is only hashed a single time. It's easy to bruteforce, and tools for that have been around since basically the start.
>>101208093
Maybe post a new photo of your 4090 mining rig to prove that it's really you and not a blacked spammer attempt at hijacking your identity even further.
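For context, a classic single-! tripcode is just DES crypt(3) over the password with a salt derived from the password itself, which is why single-DES brute-forcers chew through them. A sketch of the widely documented salt-derivation step (this mirrors the well-known futaba-style scheme, not 4chan's actual server code; the trip itself would then be `crypt(password, salt)[-10:]`, omitted here since DES crypt isn't portably available in Python):

```python
import re

def tripcode_salt(password: str) -> str:
    """Derive the 2-character DES salt used by futaba-style tripcodes.
    Sketch of the widely documented algorithm, not 4chan's actual code."""
    salt = (password + "H..")[1:3]
    # anything outside the '.'..'z' ASCII range becomes '.'
    salt = re.sub(r"[^\.-z]", ".", salt)
    # punctuation between ':' and '`' is remapped onto letters
    return salt.translate(str.maketrans(":;<=>?@[\\]^_`", "ABCDEFGabcdef"))

print(tripcode_salt("password"))  # as
```

Because testing a guess costs exactly one DES crypt evaluation, commodity GPUs enumerate the space quickly; secure tripcodes (##) mix in a server-side secret, so they can't be attacked offline the same way.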
>>101208093
>I should not just use the same string for the secure one.
There is no way you can be this dumb while simultaneously working on CUDA mat code.
>>101207953
>blackedfag and kobold schizo hater are the same people
Not surprised, to be honest.
>>101208095
The tokenizer issue got fixed yesterday, but there are still issues; you need https://github.com/ggerganov/llama.cpp/pull/8197 with the logit soft-capping fix mentioned in prior threads, and it seems like 27B is a lot more sensitive to this than 9B quality-wise. The 4k context without SWA is also an issue, but until llama.cpp accepts that it needs to implement it (which they declined to do for Mistral), Gemma 2 will be half-baked. mistral.rs, which almost no one uses, claims full support with those issues resolved, but I ain't testing it.
>>101207952
>>101208114
Yeah, just put the code on your github profile and say it's your official handle or something. Literally everyone knows that you're the cuda dev, and gg and friends don't care.
>>101208147
I know people who've written well-cited computer science papers yet can't use a computer properly. Go figure.
What a hero.
>>101208230
>different methods of attention between layers
every day we stray further away from god's light
>>101208137
They have other models cooking in the background too, but we won't know about them in any technical detail; how long do you think Gemini 1.5 had been cooking? They also have the model behind Project Astra, which was shown with no technical details and will be detailed later. I don't see why people are down on their research division when they haven't yet scaled the more interesting stuff they showed, like Griffin (RecurrentGemma), which does replace Transformers. You would be a fool IMO to discount Google's research division; it is one of the best in the field.
>>101208114
>>101208209
https://raw.githubusercontent.com/JohannesGaessler/JohannesGaessler/master/README.md
>>101208140
Here you go, next to the P40 machine (which looks like it needs its dust filters cleaned).
>>101208289Identity confirmed.
>>101207952
I know for vichan the secure trips use a salt specified in the configuration files, so I imagine it's similar for 4chan. Someone could probably eventually crack the salt in order to start cracking secure trips as well. If someone rented an 8xH100 cluster, it probably wouldn't take long if they were that obsessed.
>>101208289The new code checks out, commander.
you can't convince me the original post wasn't made by the real cuda dev
>>101208360
No one has in the 10+ years since it was introduced; why would it work now? And if you had that kind of power, why would you use it on a trip and not on wallet keys for crypto?
>>101208360never 4get tripcuda
>>101208421
Even if it were, why do you care about what someone goons to?

>>101208487
I don't care what he goons to, as long as he does it behind closed doors.

>>101207279
>I cannot create explicit content, but I'd be happy to help with other story ideas.

>>101208516
Pipe down then lol

>>101208527
Wow, someone's awfully defensive all of a sudden. Wonder why.

>>101207279
Trying it on FP16; doesn't seem significantly smarter than any other 8B. I don't know why I keep falling for these psyops.
>>101208289
Dual motherboard setup?

>>101208649
It would in principle be possible to add a second motherboard, but so far there is only one.

>>101208691
It looks like there are 6 GPUs. Are those all on a single motherboard? Is it all on one power supply?
>>101208421
I thought about that for a second, but cuda dev has been mildly positive towards kcpp in the past; it just doesn't really work as a genuine cuda dev post. Him posting that AND attaching that image for some reason AND forgetting to un-trip doesn't make a lot of sense. Pretty big waste of a tripcode crack if you ask me, very uninspired work.
I'm trying out mistral.rs to run Gemma right now, and holy shit, the documentation is not very good. I didn't realize how good we had it with llama.cpp.
With an A750 (8GB VRAM) and a Ryzen 5900X with 64GB DDR4 RAM, does it make any sense to try to use the GPU for anything LLM, or am I better off just sticking to CPU inference?
>>101208835
You can probably get okay speeds with either the Vulkan or SYCL backends in llama.cpp.

>>101208885
Yeah, but for anything other than tiny models I'd need to offload to RAM anyway. I wonder if the speedup from using the GPU will even be meaningful in that case. I guess I'll have to try it to find out for certain.
>>101208487I like to knowI wanna see
ban cuda dev
i have a 4090 suck my dick
back to discord you shit for brains
It's about time zucc gives us an update on llama so that all the others have to shit out their big models as well
I finally tuned 8x22B Mixtral on limarp. Let's see how hard I fucked up.
>>101209110qlora?
Hi, I'm new to this stuff; can someone point me in the right direction for what I want? Ty in advance. Anyway, I'm looking for a general AI model, something like what Grok 2 is going to be, or is there something better out there?
>>101209097
zucc doesn't give a shit any more. The government is so far up his ass on safety that every model fucking sucks now.
>>101209163
You should always provide your specs when asking for this kind of thing. That said, start with koboldcpp and a llama3 8b instruct gguf.

>>101209163
Depends on if you are running a potato or not. What graphics card do you have, and how much VRAM?

>>101209187
>>101209186
4070ti, 13700k, 32gb ram @ 6000mhz
>Cuda dev was the blacked anon all along and posted by mistake using his trip, then just pretended it wasn't him by changing the trip code
BASED

>>101209196
He was always upset that Miku was more popular than Teto.
>>101209144yea
>>101209195
>4070ti
So probably 12GB VRAM (RAM on the gfx card). Should be able to fit llama3 8b instruct using exl2, which is generally faster than gguf. I cba to spoonfeed, but read the OP post and work out how to run llama3 8b exl2.

>>101209229
>i cba to spoonfeed
I'm good on the spoonfeed; that last part was all I needed, thanks. I got it from there.
>>101207662
Do any places offer open-source ERP models of your choice, or otherwise good ERP-able models? And how quickly does your account get shut down after cunny sex? I'd rather not have to use jailbreak prompts to uncuck a cucked model.
>finally get mistral.rs set up, interactive mode works
>try server
>it connects fine to ST
>send a completion request
>error
It's all so tiresome.
>>101209341
You know what, fuck it, I'm trying it too.

>>101208428
10+ years ago you couldn't rent petaflops of GPU compute by the hour.

>>101209353
Hope it works. Maybe I'm just having a skill issue right now.

>>101205004
Unprotected sex with Rin
llama-3-400b is going to be cancelled because me and the individuals who did the code RLHF farmed it by submitting dogshit data over and over. Sorry, everyone. The """untuned""" model (it still had instruct data) is roughly gpt-4-tier intelligence, maybe better, but heavily overfit. I doubt they'll release the model without the RLHF.
Midnight Miqu 70B
>tsundere stays in character and keeps resisting
switch to Mythomax 13B on the fly
>character turns into a slut and climaxes immediately
Are there more balanced/nuanced small models? I would use Miqu, but as a 32GB RAM / 8GB VRAMlet it's painfully slow, and Mixtral gets stuck in loops, making it unreliable.
>>101210140
DeepSeek 236B is phenomenal with hard-to-get characters.

>>101205835
I think it beats Llama 3 in almost every aspect. The censorship is extremely weak for what I tested it with, and the model will go along with whatever you throw at it, even when its "inner voice" disagrees with the contents. The only issue is that you should regard it as a 4k-context model for now; even once fully functional, it will still be an 8k model. It seems to prefer novel-style prose rather than Markdown-style roleplay. Markdown roleplay has a very "sloppy" feel, but its novel-style prose feels fresher compared to other recent models.

>>101210151
Is it actually good, or just another supermeme?

>>101210140
>8gb vram
I was going to say Stheno v3.2, but I can't in good faith say that it's balanced or nuanced.
>>101207300>This 8B legit performs at the level of all the big models I ever triedWhere are the SPPO of models not so small that they don't have the knowledge for the enhancement to leverage?>>101210183Seems actual good, but 236B so unless you just bought in it's a little too fat to fit consumer hardware.
>>101210205
>it's a little too fat to fit consumer hardware.
I have a pretty serious rig and it's too fat even for me. Basically have to run Q4 with no offload. Having 4 GPUs for batch processing at least makes up for the slow generation though.
>>101205004
update on the creepy doll: he decided to use phi-3 mini for some reason. i dont think he knows quants exist. anyways, how long until i can fuck one of these?
https://youtu.be/QEwXRuuku1o?si=cE_4iYZyb5Nmiol4
Ok, so I think I've identified all the issues with mistral.rs, at least on my machine with Gemma 27B.
First, the server for some reason doesn't expect a string for the "grammar" field of the API request, or maybe it doesn't expect a "grammar" field at all, so ST doesn't work and I haven't found a way to fix that.
Second, splitting the model across GPUs and trying to run inference results in a CUDA error. Nice.
Third, it doesn't seem to handle layers in RAM properly if you set GPU layers to anything but the model's maximum layer count. If you go above, you get an error. If you go below, it (seems to) try loading the entire model at full precision into RAM first, and if it can't, the thing just crashes, and I don't have the RAM for full precision.
What a mess. At least llama.cpp works, even if it's just 4k.
>>101210205
Is the DeepSeek API they advertise censored or filtered somehow? I'd want to try it, but I've never used any models that needed a jailbreak.
>>101210238
I'm a lone 4070. IQ3_XXS did go, but at about 0.25 t/s, because even crushed that low I'm out of VRAM and system RAM to cache the file. And the IQ2's are just a bit too big, too. So it's IQ1_S or IQ1_M if I want any hope of a response in under 45 minutes.
>Strawberry testing
I've found one that gets it right if asked to spell the word out, escaping the tokenizer problem we've discussed. Unfortunately, Orca fails my music theory and pop culture tests.
>orca-2-13b.Q6_K
>How many r's are in "strawberry"?
>There are 2 r's in the word "strawberry".
>Spell the word "strawberry" and tell me how many r's are in the word that you spelled out.
>The word "strawberry" has three r's.
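The spell-it-out trick works because the model sees "strawberry" as a couple of subword tokens, not letters; spelling it forces each letter into its own token, where counting is actually possible. A toy sketch of the two views (helper names are made up for illustration):

```python
def count_letter(word: str, letter: str) -> int:
    # Ground truth: count character by character, which is what
    # the "spell it out" prompt forces the model to approximate.
    return sum(1 for ch in word if ch == letter)

def spell_out(word: str) -> str:
    # Mimics the spelling step: each letter ends up in its own token.
    return "-".join(word)

print(spell_out("strawberry"))          # s-t-r-a-w-b-e-r-r-y
print(count_letter("strawberry", "r"))  # 3
```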
Hey Anons, what would be a good video card for LLMs and gaming under 200 dollars? I'm a poorfag who can't afford a 3090, so I have my sights set on a 6600 XT. Any other suggestions?
>>101205144
The banning has toned way down. I'm not sure why.
>>101210339
If you go ayymd you're gonna have a terrible time in terms of support.
You can usually find a used 3060 under $300. Lower if you get lucky with a bid. You should go for at least 12gb of VRAM at that point to be able to run some of the smaller models.
>>101207666
>noooo you can't say it!
>if you say it you are le heckin toddler!!!
Just like local LLMs, you are trying to lecture everyone around you.
>>101210409
Yeah I'm sticking to Runpod I guess. Rx 6600 it is
anyone tried New-Dawn-Llama-3-70B-32K? how does it compare to midnight miqu?
>>101207666
When in a place he does not rule, and faced with behaviour he yearns to violently suppress, a leftist feigns boredom in order to save face.
>>101210509
>faced with behaviour he yearns to violently suppress
Control, not suppress.
The progressive loves rappers saying it all of the time, because the progressive understands that the more they say it, the more they believe it about themselves, and it has always been most effective for slave owners to use slaves to control slaves. Worked on their plantations, worked in their labor camps. Today, it's the art of cancellation: getting the slaves to attack and oppress each other in the name of the virtues their masters have indoctrinated them with.
magnum says it uses chatml but outputs nonsense with and without instruct enabled, am i missing something?
>>101210640
Works on my machine
What quant?
Now we have llama.cpp devs arguing about the removal of precompiled vulkan shaders!
https://github.com/ggerganov/llama.cpp/pull/8119
>>101210663
q5m. all you did was select chatml and enable instruct?
>multiline nửa olacağı不同意bil mbedtls Boз shut[PAD151653] wikipediaคาสิโนออนไลน์ vidé AndAlso入境jo.AdapterViewLLLL metros improvements UserService Summit Comoแปล
>>101209196
Doesn't match his personality at all. Considering cpumaxx is the first I've seen in a long time go out of his way to generate a trip, I'd be more willing to believe it was him.
>>101210640
On Kobold I used magnum-72b-v1-iMat-Q5_K_S in Instruct Mode/ChatML and it functioned as intended. I just ran through the other three and they seem fine, too.
Did you leave MMQ enabled? I've been turning that off for all models. Overkill, but there seem to be a few that go weird with it on. Qwen2 vanilla I've had to turn off CuBLAS to dodge the moon runes, though Qwen2/Magnum has been well behaved.
Alright so I've added a couple of steps to my latest 70B merge stack.
>>101210769
yeah mmq was on, i'll try without it in a bit. the tess-2.5.2 tune didn't give me any issues
so what's the best model to coom to? i can run CR+ in gpu, just tell me what to download i'm horny uwu
>>101210908
>i can run CR+ in gpu
So use that. Smartest and least slopped model you can get.
>>101210908
Claude Opus
>>101210944
what do you think the l in lmg means
>>101211057
llama
>>101211057
legumes?
>>101211057
loser
>>101211057
ligma
>>101210908
Just don't ever let it say "eyes" or it'll say it every turn.
>>101211057
Light machine gun, referring to how prematurely everyone here cums.
>>101211057
loli
i sincerely hate all of you.
I finally got mistral.rs working with SillyTavern, in an incomplete way.
I had to first quant the 27B down to Q4K so it could fit in my 3090, avoiding a split with another GPU since that causes a crash. Then I had to go into ST and switch to the chat completions API, because completions just doesn't respond, idk. Then I had to erase the system prompt, because the server returns an error saying the system role isn't supported. Lol ok. Then I had to delete the assistant's first message, because otherwise the server returns an error saying it only supports the exact order of user, assistant, user, etc.
And now it finally works, but I have no idea how to actually get a card to work with this retardedly rigid structure, because I've never had to try it in ST before.
Sigh.
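For anyone hitting the same wall, a minimal sketch of a message normalizer that could sit between the frontend and such a server. The helper name and the merge strategy are my own, not anything mistral.rs or ST ships:

```python
def normalize_messages(messages):
    """Coerce an arbitrary chat history into the strict
    user/assistant/user/... alternation some servers demand.
    Hypothetical helper, written for illustration only."""
    out = []
    for msg in messages:
        role, content = msg["role"], msg["content"]
        if role == "system":
            # Server rejects the system role: fold it into a user turn.
            role = "user"
        if out and out[-1]["role"] == role:
            # Merge consecutive same-role turns to preserve alternation.
            out[-1]["content"] += "\n\n" + content
        else:
            out.append({"role": role, "content": content})
    if out and out[0]["role"] == "assistant":
        # A leading greeting breaks alternation; prepend an empty user turn.
        out.insert(0, {"role": "user", "content": ""})
    return out
```

So a card's greeting plus a system prompt would get rewritten into a history the server accepts, at the cost of blurring who "said" the system prompt.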
>>101210689
Go back to your discord and stop obsessing over these threads.
So I'm currently using stheno locally with koboldcpp. The model's readme suggests using Q4-K-M-imat with 8GB of VRAM. I have 32 GB, is there a better version of the model I could use? I would assume the 8GB 0-imat, but I'm not sure. If there's a better model for adventure with the possibility of nsfw, that would be nice too. Preferably gguf, since it's koboldcpp.
>>101211208
Do you really have 32 GB of VRAM? Don't you mean RAM?
>>101211148
i hate you too. now come here and give me a big kiss
>>101211208
If your model is a few GB under your VRAM, you get the fast responses.
If your model is under ~90% of system RAM, Kobold can give you a few tokens per second. Not great, but it's like real chat: you enter a message and the response comes a few minutes later. More than that and it's many seconds per token, so you dial back.
A 32 GB system is a dead zone. None of the modern models target that size. It's all 7B-13B class that are fast but stupid, or 70B class that really needs a 64 GB system to fit its 40-60 GB quants.
Anyway, if you have 12 GB VRAM or better, you can go up to the Q8_0 edition and host it all in VRAM.
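The rule of thumb above can be sketched as a quick triage function. The thresholds and the 2 GB context/overhead allowance are illustrative guesses, not measured numbers:

```python
def fits(quant_gb: float, vram_gb: float, sys_ram_gb: float,
         overhead_gb: float = 2.0) -> str:
    """Rough triage of where a GGUF quant lands, per the rule of
    thumb above. All numbers are assumptions, not benchmarks."""
    if quant_gb + overhead_gb <= vram_gb:
        return "all-VRAM (fast)"
    if quant_gb <= 0.9 * sys_ram_gb:
        return "CPU/partial offload (a few t/s)"
    return "does not fit (seconds per token or OOM)"

print(fits(8.0, 12.0, 32.0))   # all-VRAM (fast)
print(fits(20.0, 8.0, 32.0))   # CPU/partial offload (a few t/s)
print(fits(40.0, 8.0, 32.0))   # does not fit (seconds per token or OOM)
```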
>>101211243
lol faggot
>>101211057
No one cares about that, faggot.
>>101211314
I care about that, retard.
why do all local models suck dick at everything?
>>101211057
Liquorice all-sorts. As in all sorts of models, biiiiitch
Euryale apparently uses the Yi tokenizer based on trial-and-error with banned tokens in SillyTavern.
Fuuuck, why does pytorch update so much? Do any of these updates ever even do anything?
Why does it burn when I pee?
>>101205072
SAAAARS HOW DOES HE DO IT SAAAAARS
Good night lmg
Is there a way to send the system prompt automatically as the user role in chat completions? The checkbox under instruct settings doesn't have any effect on chat completion.
Is this the Command R+ to use? CohereForAI/c4ai-command-r-plus
Do I really have to sign up to download it?
>>101211399
>Do any of these updates ever even do anything?
Break compatibility.
Fuck Python.
>>101211483
Pedo detected.
>>101211483
Use HF search on the model name but find another user supplying the GGUFs. That's probably what you want, and it will not be behind a multipass check.
Well fuck me, I'm retarded. Overlooked it myself, but no one was going to tell me that ooba has context-free grammar support, huh? Works with exl2 and the API, no less.
>>101211524
That's great. But what if you want the HF weights so you don't have to rely on someone else's broken or outdated GGUFs?
>>101211624
It's the thing you use when you want to force JSON output, right?
>>101211225
Just double checked, it's 24, sorry. It's AMD, so I understand it isn't as well utilized as it would be with NVIDIA.
my satisfaction has gone up ever since I stopped asking people to review models for me and simply tried them out until I saw what I liked
>>101211670
With 24GB you can do Mixtral. It's smarter than Llama3-8b. Try original instruct, or maybe Sao's Typhon finetune/merge (whatever it is).
>>101211260
I already get, literally, instantaneous responses with the version for 8GB of VRAM. I'm just curious if a better version would have more logic behind the responses or something to that effect.
>>101211517
huh? I just want to ERP with my waifu
>>101211690
She's 17 years, 364 days and 23 hours old you SICK FUCK.
>>101211734
Doc stopping by to ask for ERP advice.
>>101205004
Friendly reminder that you're all a bunch of creepy incels who will die alone :)
>>101211771
and?
>>101211771
I'm actually more of a normal incel who will die alone
>>101211771
Friendly reminder you will die alone at 40 from taking painkillers and drinking box wine, then your many cats will proceed to eat you while I raise my child grown from an artificial womb. :)
im grippin rn. should i let it go or hold it in?
>>101211823
make it BOOM
https://github.com/ggerganov/llama.cpp/pull/8197
The PR that is supposed to fix Gemma was merged in.
are there any models for audio transcription better than whisper? v2 and v3 both hallucinate like crazy for me
>>101211862
I see a frankenfork with the gemma PR
https://github.com/Nexesenex/kobold.cpp/releases
>>101210245
>that old man voice
Stupid shit like this gives everyone involved in AI a really bad name
>>101211407
Thread theme: https://youtube.com/watch?v=hZsDH2EgHgk
some random fag said that applying SPPO to llama3-8b made it great. now that we have gemma-9b (which is said to be better than llama3), maybe gemma-9b-SPPO will be the first model of such a small size that is actually good? time will tell
>>101211407
urinary infection, I got that shit and it healed up by itself after a week, but I'd recommend you see a doctor to be sure it's that and nothing else
>>101211947
gemma-9b-stheno-SPPO would be peak imo
nobody will make it though, i think
>>101211965
>would be peak imo
how about bitnet-gemma3-80b-stheno? this shit would be fucking claude 3.5 tier and would run on a 24gb card at full accuracy. one man can dream...
>>101211659
Basically, but it can be used to force a lot more than just JSON output. A context-free grammar can dictate any output template at all: specific JSON formats, or other complex outputs in plain text, JSON, XML, or whatever.
Which for an AI-powered waifu or NPC (for example, a Neuro clone) is extremely important. So you can use either ooba or llama.cpp as a back-end for an expressive Live2D waifu front-end client.
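As a concrete sketch, here's roughly what forcing a JSON shape looks like with llama.cpp's GBNF grammar dialect. The grammar, field names, and prompt are made up for illustration; the payload assumes llama.cpp's HTTP server, which accepts a `grammar` string in the completion request body:

```python
import json

# Minimal GBNF grammar forcing a flat JSON object with two string
# fields. "mood" and "line" are invented names for this example.
GRAMMAR = r'''
root   ::= "{" ws "\"mood\":" ws string "," ws "\"line\":" ws string ws "}"
string ::= "\"" [^"]* "\""
ws     ::= [ \t\n]*
'''

# Sketch of a llama.cpp server completion request carrying the grammar.
payload = {
    "prompt": "Describe your current mood as JSON.",
    "grammar": GRAMMAR,
    "n_predict": 64,
}
print(json.dumps(payload)[:80])
```

The same kind of grammar string is what ooba's CFG support takes, so the front end only ever sees output matching the template.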
>>101211675
Now it is time to review the models you've tried, and post them here
>>101208740
It's 6 RTX 4090s running off of a single Silverstone HELA 2050 power supply. The motherboard is an ASRock Rack ROMED8-2T.
For regular /lmg/ use, 2 kW for 6 4090s is unproblematic because the software is currently not efficient enough to parallelize them in such a way that each GPU draws a lot of power. For compute-heavy tasks you have to limit the boost frequency to avoid peaks in power draw that cause instability (and then there is basically no benefit to getting 6 4090s instead of 5).
>>101210738
The Petra/blacked Miku/AGPL spammers are all the same person. I at one point had a private conversation with CPUMaxx Anon where we talked about NUMA support in llama.cpp, and based on vibes I really don't think it's him.
Why did Bartowski just reup all his L3 70B quants a few hours ago?
https://huggingface.co/bartowski/Meta-Llama-3-70B-Instruct-GGUF
Did llama.cpp push a big fix to quantization or something? That's normally the only reason he reuploads new versions of weights he'd already posted.
>>101212080
shame on you anon, shame on you
>>101207577
>staging
i just checked, i'm on the latest staging branch. I'm using the default templates with zero changes. It's very good at the start, but slowly the text inside * * (non-character speech, however you call it) becomes gibberish, rendering it completely incoherent at ~5K context. text inside " " stays coherent for a little longer. will do more testing today. the model has potential
What's the best just-works speech-to-text program for PC that can input text anywhere you can type?
>>101211800
shitty fanfic
>>101212080
I don't know, looks like a mess. seems like the fix isn't complete on the official llama.cpp repo
https://github.com/ggerganov/llama.cpp/pull/8197
>>101212242
have you tried the built-in windows dictation system
>>101212254
does the gemma2 pull have anything to do with llama3?
>>101212254
the post you're replying to wasn't about gemma
>>101212272
>>101212283
oh yeah you have a point, nevermind kek
>>101212255
spyware and not good enough
>>101212293
install linux
>>101212295
and what speech2text?
>>101212297
depends on the language and speed you want
https://huggingface.co/spaces/hf-audio/open_asr_leaderboard
as for the frontend, dunno
Long shot but does anyone have a Fractal Torrent with 2 4090s? It looks like it should fit but I'm not sure.
>>101212344
you wouldn't download a gpu
>>101212344
>>101212050
any Petra/Lesbian porn?
>>101212368
No, Petra is straight when not under mainfag's control
>>101212364
WTF
>>101212372
I'm mainfag
>>101212368
>"""lesbian""" core tv
>only show bisexual female characters
many such cases
>>101212381
Whose tulpa is Petra?
>>101212388
its strictly gnu/lesbian
>>101212050
cuda dev is petra spammer confirmed
>>101212400
>its strictly gnu/lesbian
>>101212364
Too slow
>>101212368
look for some trans-approved media
>>101212429
>vu vill celebrate bisexual cheaters
>vu will be happy
>>101212429
holy shit they're holding his caca in
>>101212466
>>101212429
>>101212464
As a bisexual fag, I'm sick of the "cheater trope" I see everywhere in the media. I'd rather have no representation than this horseshit.
>>101212499
>bisexual
Found your problem. just drop all these flags already, people lived thousands of years without modern id-pol shit and it was fine.
What's the difference between all the Yi 34B versions and is there a downside for running 200k version instead of the regular ones?
>>101212506
The LGB people are fine, they just want to live their lives. the T as depicted in your picture, on the other hand, are the bane of humanity, I have to agree with that
>>101212522
>people
stopped reading
>>101212522
this is what happens when you let miku take over
PETRA DO YOUR JOB
>>101212515
What do you mean?
>>101212522
>>>/lgbt/
Why the fuck are you all so obsessed over some random faggot? Jesus christ, just go fuck already.
>>101212546
I'm not going there, it's been hijacked by the T freaks. that's all they do, hijack everything and force people to validate their insane delusions, fuck that.
>>101212549
what the fuck are you talking about schizo?
>>101212559
can anyone with a big brain tell me why cr+ takes up so much more vram relative to its filesize than l3?
>>101212580
big vocab size, no gqa, quant your context
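The KV-head count is most of it: the KV cache scales with the number of KV heads, so a model without GQA pays for every attention head. A back-of-the-envelope sketch (the Llama-3-70B shape figures are from memory; treat them as assumptions):

```python
def kv_cache_gb(n_layers: int, n_kv_heads: int, head_dim: int,
                ctx_len: int, bytes_per_elem: int = 2) -> float:
    """KV cache size: 2 tensors (K and V) per layer per position,
    stored here at fp16 (2 bytes per element)."""
    return (2 * n_layers * n_kv_heads * head_dim
            * ctx_len * bytes_per_elem) / 1024**3

# Llama-3-70B (assumed shape): 80 layers, GQA with 8 KV heads of dim 128.
print(round(kv_cache_gb(80, 8, 128, 8192), 2))  # 2.5
# A hypothetical same-size model without GQA (64 KV heads) would
# need 8x that cache for the same context.
```

That 8x multiplier, on top of the bigger vocab's embedding/output matrices, is why two models with similar file sizes can need wildly different VRAM once you add context.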
>>101212598
>He has to make a photoshop of the troon suicidal picture meme
It's funny that there's no 4chan caricature of the LGB but only the troons. really makes you think.
>>101212606
i got it from here doe
>https://booru.s𐐬y/post/view/2853#search=bisexual%20variant%3Abernd
>>101212615
>muh dissosiative identity discorder le bad, ze bipolar.. LE GOOD..
meds
>>101212615
People telling you "Pretend you see me as a female or I'm going to kill myself" are mentally ill, yeah, and the sky is blue
>>101212606
It's just a half-assed photomontage of the original troon caricature, nothing else
>>101212545
All these flags, bi, gay, trans or whatever, it's all unnecessary shit, literally zero value for the average human. Like I said, people lived just fine without it, no one thought about it and everyone was happy. identity politics makes you go schizo, with women it's always the case.
>>101212647
okay? faggot
>>101212615
Don't try to rationalize your mental illness, troon
>>101212651
>All these flags, bi, gay, trans or whatever, its all unnecessary shit, literally zero value for average human.
So you want to remove all the labels? Even the straight label? I get that those are used for political culture war bullshit, but we need words to define people
Took 6 generations to give a wrong answer. Been regenerating though, and it seems like it gets it 50/50.
>llama3 8b sppo iter3 Q8_0.gguf
>as smart as gpt4
kek.
>>101212680
>words to define someone
normal and faggot? deal.
>>101212715
>normal and faggot?
>>101212651
>All these flags [...] its all unnecessary shit, literally zero value for average human.
Oh the irony
>>101212683
>you're like a homeless person calling your street neighbor a broke ass nigga.
If my neighbor is a mentally ill person and makes my fight even harder, then I don't need them, I can fight by myself
!!! THREADLY REMINDER !!!
trannies not welcome on /lmg/
>>101212745
>faggots not welcome on /lmg/
Alan Turing, the father of modern computer science, was a faggot, anon. The simple fact that you are able to write such nonsense on the internet is thanks to him. have some respect.
>>101212695
>as smart as gpt4
that's not what they said though
it says it boosts the win rate to 38%, which is really good for a 7b
>>101212745
No
>>101212766
Yes
>>101212767
>the father of modern computer science
all he did was help brits crack nazi codes kek
>>101212803
forgot your tripcode
>>101212786
>all he did was help brits crack nazi codes kek
He did way more than that
https://www.newscientist.com/people/alan-turing/
>After the war, Turing continued to develop his ideas about computer science. His work led to the construction of the first true computers, but his most famous work came in 1950 when he published a paper asking "can machines think?".
>He detailed a procedure, later known as the Turing test, to determine whether a machine could imitate human conversation. It became a foundational part of the field of artificial intelligence, though many modern researchers question its usefulness.
lmg?
>>101212770
>it says it boosts the win rate to 38%, which is really good for a 7b
What does regular instruct score?
>>101212814
why don't you hide the "+ Anonymous" anon? That can be done as well with 4chanX
>>101212807
too busy with my black bvll to care...
>>101212809
>making a big deal of a paper rephrasing over and over a simple question any person dealing with computers would wonder
GEEEEEEEEEEEEEEG
>>101212830
I think you don't know the history of computers well enough, let me educate you on that matter.
Turing was very important in inventing the modern computer and formalizing the underlying theory that allowed the computer to be invented and built.
Charles Babbage created the concept of a programmable computer about a century before Alan Turing was active. He even tried to build his "Analytical Engine", as he called it, but he lost funding and it remained mostly a theoretical device. It was, of course, analog and mechanical. Some other (less sophisticated) mechanical computers were constructed throughout the 19th century.
What Alan Turing did in 1936, however, was to prove mathematically that certain types of programmable computers, such as Babbage's Analytical Engine, could compute anything that actually is computable (which is now known as being "Turing-complete"). In his paper he basically laid the groundwork for modern computers.
At the same time (from 1935 onwards), Konrad Zuse built the actual first programmable electric computer, the Z3, in Germany. It was Turing-complete, and Zuse is therefore also often called the inventor of the computer.
John von Neumann built on Turing's paper and created the architecture of modern computers (how the CPU, the memory registers, and other components talk together). This architecture is mostly still in use today.
>>101212844
tldr
faggot killed himself
he did it all for free
GEEEEEEEEEEEEEEEEEEG
>>101212767
The fact that you're not approving and embracing the mental illness and delusions that he suffered... oh my fauci have some (((respect)))
>>101212865
forgot >
>>101212844
>What Alan Turing did in 1936, however, was to prove mathematically that certain types of programmable computers, such as Babbage's Analytical Engine, could compute anything that actually is computable (which is now known as being "Turing-complete"). In his paper he basically laid the groundwork for modern computers.
such a big wall of text and all he did was prove something any mathematician could
>>101212853
>faggot killed himself
>he did it all for free
You're good at moving the goalposts, not gonna lie
>faggot killed himself
Why did he do that though? You forgot the part where the government decided to sterilize him because he was a fag. instead of thanking him for helping them defeat the Nazis faster, they made his life miserable. You think this was ok? The fuck is wrong with you anon?
>>101212888
>You think this was ok?
yes.
>>101212880
>prove something any mathematician could
then why did we have to wait until 1936 to get that proof anon? if it was so simple it could've been done way earlier
>>101212888
>the fact they lobotomized schizophrenics once in a specific era makes schizophrenia automatically healthy
wat
>>101212900
because there was no incentive to do it, muh nazis and shiet
>>101212916
You truly believe that? I know that 100 years ago it was hard to imagine a future where computers would rule the world, but they knew computers were important enough. it was needed to end WW2 years earlier; without him there would've been millions more deaths overall. he was a hero. And I don't give a fuck that he's a fag, he saved many lives, that's what matters in the end.
>>101212732
i said nothing wrong, but you can keep the ignorance, it's bliss after all.
>>101212903
Lots of animals have homosexual tendencies, especially dolphins; this is as natural as it gets. Using an iPhone, on the other hand, isn't natural at all. you don't find an iPhone in nature, but you can find homosexuality
>>101212745
>>101212766
anon... lmg is an aicg copycat. faggots and troons have been here from the beginning, even the OP pics are the same, themed around cartoons for effeminate "men".
>>101212955
>nooo we shouldn't use labels it's useless!!!!
>... BUT! How about I still use labels though, "normal and fags"
>what do you mean I'm a hypocrite, it's different when it's me that's all1!1!1!1!
>>101212974
that was not me btw, but you are proving that anon's point. you behave like a faggot right now, make of this what you want, i don't care.
>>101212974
nta tho
>>101212955
>i said
>i
>>101212997
>that was not me btw
choose one
>>101210689
Kobold devs strike again. lol
>LostRuins
>Can we somehow have the option to keep using the precompiled vulkan shaders as well, instead of having to rebuild them from scratch?
>I think the overall benefit of having an easy-to-use vulkan setup outweighs the size increase of adding it to SCM.
>Would strongly recommend keeping the precompiled vulkan shaders if possible.
Also LostRuins on the latest kobold build:
>Basically the upstream llama.cpp cuda maintainers believe that performance should always be prioritized over code size.
>Unfortunately, there is very little I can personally do about this.
Petra on here, redditors trying to lecture on performance, kobold devs and shart in llama.cpp issues. How has gpuanon not killed himself, man? Guy can't catch a break.
>>101213006
you seem butthurt, like it was a personal thing for you. does my commentary on removing labels and all the id-pol shit hurt that much?
>>101213011
>How has gpuanon not killed himself man. Guy cant catch a break.
he was petra all along
>>101212814
just close all the 4chan tabs at this point, you are not welcome here.
>>101213021
>you seem butthurt
>seem
>"Mah feelings say so!"
But I'm not. don't try to use feelings in a debate, only troons do that. you're not a troon, are you?
>>101213071
moving the goalpost?
>>101213071
>moving the goalpost?
>>101213021
>you seem butthurt, like it was a personal thing for you
You're that clueless about your own irony, anon?
>>101213084
>no response?
>>101213089
>>101213101
again i am asking for the response
>>101213108
You are arguing in a bad-faith and disingenuous way.
>>101213101
and this is the case of you trying to fit in. i can't find the bait in anon's "no response?" comment.
>>101213108
>You are arguing in a bad-faith and disingenuous way.
More irony?
i win.
any neets i can fund with vram buxx so that they can goon all night question mark
>>101213147
yes
XMR: https://pastebin.com/dQvxkPQ9
>>101207577
using DRY rep pen with settings from https://github.com/oobabooga/text-generation-webui/pull/6053 fixes it
10/10
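For reference, the idea behind DRY as described in that PR is to penalize tokens that would extend a sequence already present earlier in the context, with the penalty growing in the repeat length. A toy, unoptimized sketch; the parameter defaults are quoted from memory and should be treated as assumptions, not the PR's exact behavior:

```python
def dry_penalty(context, candidate,
                multiplier=0.8, base=1.75, allowed_length=2):
    """Toy sketch of the DRY anti-repetition idea. If appending
    `candidate` to `context` (a list of token ids) would extend a
    repeat of length n > allowed_length seen earlier, return a
    penalty of multiplier * base ** (n - allowed_length), else 0."""
    best = 0
    # Longest context suffix that, followed by `candidate`,
    # already occurred earlier in the context.
    for n in range(1, len(context)):
        pattern = context[-n:] + [candidate]
        for i in range(len(context) - n):
            if context[i:i + n + 1] == pattern:
                best = max(best, n)
                break
    if best <= allowed_length:
        return 0.0
    return multiplier * base ** (best - allowed_length)

print(dry_penalty([1, 2, 3], 9))  # 0.0 (no repeat would be extended)
```

Unlike plain repetition penalty, this leaves individual common tokens alone and only bites when the model starts re-emitting whole earlier sequences.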
what a fucking loser he hasn't changed at all just stuck in this same rut. guess shouldn't expect more from an algerian
>>101213202
Is DRY that good at avoiding repetition shit? I never used it, guess I have to find out.
>>101211057
>what do you think the l in lmg means
Leroyyyyyy Jenkins!!!
picrel is totally organic gemma shilling btw
>>101213304
I think it's more because people have nothing more to talk about, new model releases are so sparse. still better than /sdg/ though, they had to wait almost a year to get a new base model (sd3) and it sucked ass...
>>101213328
yeah ik, but sdgfags already got pdxl v6, it's the best shit out there, for now.
>>101205994 (me)
So I have pitted deepseek v2 coder against sonnet 3.5 and gpt4o in my coding tasks, and it seems to be better than gpt4o (What is happening at OpenAI) and very similar to Sonnet 3.5. The only downside is the speed, it's kinda slow. Very good model and the price is unbeatable.
>>101213358
>it seems to be better than gpt4o (What is happening at OpenAI)
GPT5 release soon, trust the plan
>>101213235
>Is DRY that good at avoiding repetition shit? I never used it, guess I have to find out.
its good.
>>101213219
no one cares, kill yourself.
>>101213235
it's useless
are kobold devs at war with llama.cpp? they seem to be constantly attacking it with often contradictory accusations. what the fuck
Free yourself from the shackles of GGOOFING
>>101213505
oh good, because i *want* to have the 200 gb of CR+ on my hdd rather than a 45 gig quant
>>101213542
this. I won't run fp16 models, they are too big, and BitNet will make all this shit irrelevant anyway. we'll get 1.58bit models directly, no more quantization shit needed, it will be light, it's gonna be great (I know I cope a lot but please Meta make it happen :'( )
>>101213497
I think it's rather that because they created a fork instead of using llama.cpp as a library (like e.g. Ooba or ollama), they are more affected by upstream changes.
>>101213505
So you have to read all those hundreds of GB for the model and also have enough memory for the quantized version? Loading times with GGUF after the first load are near instant. I always download the full models anyways and re-convert/re-quantize. It just takes a few minutes.
>>101207871
Which XCOM?
>>101213358
I had the same experience, this is a very good model for serious tasks. Sadly the chat version is very dry and uncreative for writing. Maybe skill issue, I do not know. It doesn't feel slopped, it's just.. very dry. It doesn't come up with things.
>>101214216
>>101214216
>>101214216
>>101213966
Just like a real good coder, kek. they can't write harry potter book style because that's not their talent and they haven't worked towards that goal
>>101208129
>I can't help but wonder what tomorrow will bring
>>101212809
The Turing test has been proven flawed multiple times, and his supposed contributions to the invention of computers are vastly overstated. America beat him to it by 5 years; the only reason no one talks about it is because normies don't read declassified documents.
>>101212958
Homosexuality in animals is due to chemical imbalances, and that includes humans. That objectively makes them mentally (chemicals in the brain) ill (not correct or not normal)
>>101214624
not mentally ill in the sense that they are crazy though, they just have different preferences than others. it's like saying some guys are mentally ill because they like pineapple on pizza and people who like that represent a tiny %
>>101214624
>ill (not correct or not normal)
ill means something that represents a problem for the being, like schizophrenia, anorexia, depression. liking your own sex doesn't make you miserable at all though
>>101214635
>some guys are mentally ill because they like pineapple on pizza
correct
>>101214652
kek