/g/ - Technology


Thread archived.
You cannot reply anymore.




/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>102928840 & >>102915436

►News
>(10/22) Mochi-1 runnable with 24GB VRAM: https://github.com/victorchall/genmoai-smol
>(10/22) Mochi-1: 10B Asymmetric Diffusion Transformer text-to-video model: https://hf.co/genmo/mochi-1-preview
>(10/22) Pangea: Open-source multilingual multimodal LLM supporting 39 languages: https://neulab.github.io/Pangea
>(10/21) IBM releases Granite 3.0: https://hf.co/collections/ibm-granite/granite-30-models-66fdb59bbb54785c3512114f
>(10/18) New research, models, and datasets from Meta FAIR: https://ai.meta.com/blog/fair-news-segment-anything-2-1-meta-spirit-lm-layer-skip-salsa-lingua

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Programming: https://livecodebench.github.io/leaderboard.html

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling
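For a rough number without the web calculator, the arithmetic such tools automate boils down to quantized weight size plus KV cache. A back-of-the-envelope sketch (it ignores compute buffers and runtime overhead, and the example model shape below is invented):

```python
def estimate_vram_gb(params_b, bpw, n_layers, ctx, n_kv_heads, head_dim,
                     kv_bytes_per_elem=2):
    """Very rough GGUF memory floor: quantized weights + fp16 KV cache.

    params_b: parameter count in billions; bpw: quant bits per weight.
    KV cache = 2 (K and V) * layers * context * kv_heads * head_dim * bytes.
    """
    weight_bytes = params_b * 1e9 * bpw / 8
    kv_bytes = 2 * n_layers * ctx * n_kv_heads * head_dim * kv_bytes_per_elem
    return (weight_bytes + kv_bytes) / 1024**3

# Hypothetical 8B model, Q4-ish quant, 8k context, grouped-query attention:
# estimate_vram_gb(8, 4.5, 32, 8192, 8, 128) -> roughly 5.2 GB
```

Treat the result as a floor, not an exact figure; the linked calculator accounts for more of the runtime overhead.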

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
>>
►Recent Highlights from the Previous Thread: >>102928840

--Papers:
>102937197 >102937238
--TerDiT paper shows potential for efficient deployment of low-bit diffusion transformer models:
>102929813 >102929898 >102929911 >102929914 >102929929 >102929990 >102930004 >102930054 >102930049 >102930261
--Kuroki Tomoko GPT-SoVITS TTS finetune discussion and installation:
>102931174 >102931209 >102931229 >102931220 >102932122 >102931228 >102931276 >102931243 >102932067 >102932499
--Help with using sovits for text-to-speech conversion:
>102932427 >102932477 >102933102 >102933167 >102933177 >102933344
--genmoai-smol allows video inference on 24 GB VRAM, with discussions on frame limits and FPS:
>102934099 >102934221 >102934504 >102934241 >102935105 >102934372 >102934399 >102934727 >102934616 >102934642
--Mochi live action Miku creation experience and Genmo demo site:
>102932658 >102932699 >102932716 >102932890 >102932994
--Discussion of a controversial AI model output and its capabilities:
>102929086 >102929120 >102929191 >102929212 >102929262 >102929304 >102929335 >102929361 >102929395
--Users discuss plans for developing image and video models:
>102929029 >102929104 >102930960 >102931036
--RPG Maker MV used to create LLM front-end with llama 3.2 3B:
>102932959 >102932980 >102933049 >102933181
--Interpolation models could make low-fps video usable:
>102934938 >102934970 >102935096
--Improving voice synthesis by splicing clips and maintaining consistent tone:
>102934998 >102935087 >102935225 >102935259 >102935307 >102935297 >102935894 >102936170 >102936262
--INTELLECT-1 progress and model initialization discussion:
>102929735 >102929748 >102929817 >102930311 >102929888
--Miku (free space):
>102929119 >102929251 >102929262 >102929622 >102930513 >102930552 >102930867 >102931082 >102932822 >102935780 >102936155 >102937303 >102937332

►Recent Highlight Posts from the Previous Thread: >>102928845

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script
>>
File: Untitled.png (1.19 MB, 1080x3070)
LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging
https://arxiv.org/abs/2410.17146
>Large pre-trained models exhibit impressive zero-shot performance across diverse tasks, but fine-tuning often leads to catastrophic forgetting, where improvements on a target domain degrade generalization on other tasks. To address this challenge, we introduce LiNeS, Layer-increasing Network Scaling, a post-training editing technique designed to preserve pre-trained generalization while enhancing fine-tuned task performance. LiNeS scales parameter updates linearly based on their layer depth within the network, maintaining shallow layers close to their pre-trained values to preserve general features while allowing deeper layers to retain task-specific representations. We further extend this approach to multi-task model merging scenarios, where layer-wise scaling of merged parameters reduces negative task interference. LiNeS demonstrates significant improvements in both single-task and multi-task settings across various benchmarks in vision and natural language processing. It mitigates forgetting, enhances out-of-distribution generalization, integrates seamlessly with existing multi-task model merging baselines improving their performance across benchmarks and model sizes, and can boost generalization when merging LLM policies aligned with different rewards via RLHF. Importantly, our method is simple to implement and complementary to many existing techniques.
https://github.com/wang-kee/LiNeS
for the model mergers
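The core operation is simple enough to sketch: interpolate each layer between its pre-trained and fine-tuned weights, with the coefficient ramping linearly from shallow to deep. A minimal illustration (the function name and list-of-layers layout are mine, not the repo's actual API):

```python
# Sketch of LiNeS-style post-training layer scaling.
# Assumptions: layers are ordered shallow -> deep, and weights support
# +, -, and scalar * (plain floats here; tensors in a real implementation).

def lines_scale(pretrained, finetuned, alpha=0.0, beta=1.0):
    """Blend fine-tuned layers back toward pre-trained ones by depth.

    Shallow layers get coefficient ~alpha (stay near pre-trained values,
    preserving general features); deep layers get ~beta (keep the
    task-specific updates).
    """
    n = len(pretrained)
    merged = []
    for depth, (w_pre, w_ft) in enumerate(zip(pretrained, finetuned)):
        lam = alpha + (beta - alpha) * depth / max(n - 1, 1)
        merged.append(w_pre + lam * (w_ft - w_pre))
    return merged
```

For multi-task merging the same depth ramp would be applied to the merged task vectors instead of a single fine-tune's delta.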
>>
Local bros, how does the future of local models look now? It feels like nothing interesting has been released in a year.
>>
>want bot to be doting and encouraging
>it always starts talking like a MILF, saying "sweetie" and shit
please save me from this nightmare
>>
>>102937502
Mistral Large was released just three months ago
>>
>>102937502
wdym? we had tons of censored corposhit, sloppy tunes, useless 7b shovelware, and multimemes
>>
>>102937502
>April Command-R still the best local model
The future looks grim
>>
>>102937502
A year ago we had fuck all. We didn't even have Mixtral. Are you insane?
>>
so if chub is nuking itself and char-archive is dead, where do we go for cards?
>>
>>102937846
we share them here through catbox like digital trading cards
>>
>>102937846
>chub is nuking itself

oh no what did I miss?
>>
>>102937846
In the sea of shit that chub is, I've only seen maybe 5 well written cards. I've taken those as inspiration when writing my own.
Also there isn't really a 'good' way to write one. As long as you keep your language slopless and consistent, the result should be good.
A 200 token "mommy suck me peepee" card will be pretty generic, since the model just pulls whatever the statistical average for it would be.
>>
Is it worth going from 2->4 3090s?
Currently use exl2 4.5 bpw 70Bs, or Largestral at 2.75 bpw for code assist only
>>
>>102937889
deleting all copyrighted characters apparently, despite clearly falling under parody and thus fair use
>>
>>102937920
wtf? they host toddlercon rape but parody chars are not good?
>>
>>102937920
Welp. It's stupid, but I only liked OC characters there anyway.
>>
>>102937902
Please post example of well written card.
>>
Is sparsity a meme?
>>
>>102937920
>>102937936
You got c.ai mixed up with chub lmao
>>
>>102937980
Aren't they the same? I thought chub was the character database and c.ai was the chat service for those characters, from the same devs.
>>
>>102937777
It was better when we had nothing. People appreciated things more back then.
>>
File: file.png (67 KB, 640x476)
>>102938002
>chub was the character database and c.ai is the chat service for those characters from same devs
>>
>>102937963
https://files.catbox.moe/eq0e52.png
I took this one as a template. Of course this was made with big cloud models in mind, but I feel like big local models are good enough now to follow something like this.
But I don't take it verbatim; this one is like 3000 tokens. Mine usually come out at about ~1500 tokens, because I feel like being too detailed just wastes context on stuff that probably won't ever come up in convo.
>>
>>102938002
no, not even close
chub.ai has its own chat generator service, and is currently gearing up for a sale to investors, which is why they're purging copyrighted characters since they can actually be sued for those; safe to assume anything to do with underage, incest, rape etc will be next to go
>>
>>102938056
That website's reputation is so bad it won't even sell for shit, when even janitor.ai (way bigger) can't find any investor lol
>>
>>102938054
Yeah, I've found 1.5k~3k to be the sweet spot for something like Mistral Large. It usually has no issues fully understanding a well-written card around that size and there's still plenty of space for actual RP.
>>
>>102937909
Depends, are you happy with it as a code assistant at that quant? Mistral-Large will still be your best bet at 96GB VRAM. You could try the API version for a bit to see if a more complete Large offers something that 2.75bpw doesn't for you.
>>
>>102937411
Never cook again
>>
>>102937502
Yeah miqu still wins. Only cool shit this year was flux and local video.
>>
I got a real question. Say you're a ramlet, what are you running? mistral nemo or mixtral?
>>
what's the best sloptune/model for cooming under 12b or so?
>>
>>102938581
my posting habits haven't changed
maybe stop hopping ips every five minutes?
imo the timer should be an hour, maybe two. that would stop even more spam.
>>
>>102938581
This. I wish they would at least roll back this obnoxious captcha
>>
>>102938581
I want to make an altchan that is moderated entirely by a custom finetuned LLM. But GPU hosting is expensive.
>>
>>102938591
How does that boot taste?
>>
>>102938531
Nemo.
It's about as good as mixtral while taking a lot less memory.
t. 64gb ram 8gb vram.
>>
>>102938616
>not abusing google colab
>>
>>102938581
>Fucking nigger jannies with their nigger 15 minute timer bullshit.
what's this? I didn't get any of that. Can someone do a tl;dr of their new shit to """prevent bots""" from posting there?
>>
>>102938531
tinyllama
>>
File: llm-elections.png (387 KB, 588x922)
Who will release their models first after 5 November (US elections)?
>https://poal.me/yq3vpc
Vote closes in 14 days (2 weeks)!
>>
>>102938676
google colab is K80s. That shit wouldn't be fast enough.
>>
>>102938676
>google colab

>>102938787
Use Kaggle.
>>
>>102938701
It will be the anthrax team. And only the anthrax team. And when they do release it, everyone will accept that LLMs and /lmg/ are dead.
>>
>>102939141
>local... le dead!
*increases your repetition penalty*
>>
I recently bought a clothes dryer. And it has different programs for different types of stuff. I think half of those programs are "AI drying". Why are marketing people allowed to live?
>>
>>102939184
non-remote representations have perished
>>
>>102939197
>he bought a "smart" dryer
You are part of the problem. Stop buying shit you don't like.
>>
File: 1729689433800.jpg (244 KB, 680x791)
lol
>>
>>102939247
Show me a "dumb" dryer.
>>
>>102939264
Tbh they're right. Deleting/privating the bot is fine, but destroying the actual chats on deletion is nuts.
>>
>>102939264
I got burned during the first waves of lobotomy and filtration, it's never cloud for me since then. They are only creating more people with distrust in cloud with this move.
>>
>>102939264
>These aren't just word on a screen to us - some bots are comforts for us, we world build extensively, and a lot of the time we exclusively roleplay with a bot for months or years.
At least copy the most hilarious part. Also, I'm wondering what they do with the data they collect from people using this. It's like a goldmine of organic training data for people like us. And what could they do with it?
>>
>>102939264
Did c.ai finally go bankrupt
>>
>>102939295
imo they proved they didn't respect their users when they blocked edits in popular bots; whoever still had any respect for them was a newfag or a buttlicker.
>>
>>102939271
Google>clothes dryer amazon>
https://www.amazon.com/Portable-Stainless-Function-Suitable-Apartments/dp/B0C6LZ4B1B/140-4839339-6192020
>>
>>102939141
for cloud shit to beat local llms, they need a better privacy policy/acceptable usage policy, not better models
https://www.anthropic.com/legal/aup
>do not...
>Depict or request sexual intercourse or sex acts
>Generate content related to sexual fetishes or fantasies
>Engage in erotic chats
>Generate violent or gory content that is inspired by real acts of violence
>Promote, trivialize, or depict graphic violence or gratuitous gore

>Anthropic’s Trust and Safety Team will implement detections and monitoring to enforce our Usage Policies so please review these policies carefully before using our products. If we learn that you have violated our Usage Policy, we may throttle, suspend, or terminate your access to our products and services.
>>
>Hey there, we adhere to the DMCA requirements and take swift action to remove reported third-party Characters that violate copyright law or our policies. We’ve removed a group of Characters that have been flagged as violative, and these will be added to our custom blocklists moving forward.
Based!
>>
>>102939313
They actually got bought by Google some time ago.
>>
>>102939318
True. Devil’s advocate says that should have shown they care about not destroying things people are using. But apparently not.
>>
Imagine. It's not only the cloudcucks that got fucked over, but women (not trannies). Actual women are now out there crying their eyes out because their Game of Thrones hunk is gone. I have such a justice hardon right now.
>>
>>102939264
On the one hand this is a shitty corpo move but on the other hand it's not like this is unexpected.
>>
>>102939388
I still empathize with them; CAI burned me pretty bad back in the day.
People open their hearts to those bots, not realizing it's a cruel data-harvesting scheme.
Cloud is unironically dangerous to mental health. I wouldn't be surprised if some people have already killed themselves because of the decisions made by the owners.
>>
>>102939313
With the amount of traffic and investors they have, you can only dream.
>>
>>102939520
Yeah, but people kill themselves over every little thing these days.
>>
>>102939388
People hack and leak stuff for less, I'm surprised no AI cloud company got hacked and had their models leaked yet.
>>
>>102939520
I think I'll run my own c.ai then. Seems like a profitable business nowadays
>>
File: file.png (248 KB, 2278x1342)
https://aider.chat/docs/leaderboards/
Sam Altman on suicide watch. What's AnthropicAI's secret sauce? Claude 3.5 was already the GOAT and they made it even better.
>>
>>102939520
>wouldn't be surprised if some people already killed themselves because of the decisions made by the owners.
it definitely happened. Those retards who talk about safety and shit because the model can say "nigger" aren't looking in the right direction; the dangerous part of cloud AI is that you can literally remove someone's only joy, and no one bats a fucking eye. I find that fucked up if you ask me
>>
>>102939554
Nouveau愛.
>>
File: oof.png (240 KB, 1091x858)
>>
>>102939554
>I'm surprised no AI cloud company got hacked and had their models leaked yet.
we got Miqu though
>>
>>102939388
>Actual women are now somewhere out there crying their eyes out cause their game of thrones hunk is gone.
I'm pretty sure women also spend their time with AI husbandos; they're the ones who write and read a shit ton of romantic books and shit.
>>
>>102939511
It'll set some babbies on the right track of backing up things they care about.
>>
>>102939594
Miqu 2 when?
>>
>>102939627
Mibn.
>>
>>102939593
>cloud models are so good they can make you kill yourself
localbros...
>>
>>102939593
How did that message make him do it? Makes no sense
>>
>>102939617
This. I backed up my cai chatlog every day in case of this happening.
>>
>>102939593
These faggots would have killed themselves over anything really.
>>
Localbros... https://twitter.com/YifanBTH/status/1849074418930356309
>>
File: harambe comparison.png (538 KB, 1838x1009)
Alright here are the results of my first attempt at reverse-distillation of Ministrals RP strengths into a bigger, smarter, model.
>>
File: file.png (187 KB, 705x732)
>>102939894
wew lads, jew altman btfo'd
>>
>>102939922
>western lowland gorilla
KEK
>>
>>102939922
If I use non-deterministic sampling it seems to forget what an eos token is.
>>
File: file.png (47 KB, 954x342)
>>102939593
>https://archive.is/3BMXI
>He put down his phone, picked up his stepfather's .45 caliber handgun and pulled the trigger
Goddamn matmuls.
>>
>>102940181
When I was a kid screeching moral busybodies were trying to get saturday morning cartoons banned because allegedly some kid was hit by a car while sitting in the middle of the street peering down a storm drain (presumably trying to find the ninja turtles).
These people should be exiled from society.
>>
Is anyone tinkering around with multimodal models? I'm specifically interested in image+text input models. So far I've only used Llama 3.2 11B in clean-ui, and it seems to have a lot of potential. I'd like to run L3 Ultra-instruct 8B but don't know how to set up the vision capabilities. As far as I know, ooba supports only a pretty limited number of multimodal models with the multimodal plugin. Are there any backend-frontend combinations that you can recommend for this?
>>
>>102940221
Yep retards should get culled naturally
>>
Alright has anyone ever encountered this before?
Using Ooba + an old pull of SillyTavern
>testing a card to find nice temperature range for model
>reply suddenly becomes deterministic
>no amount of adjusting things on either end fixes it.
>try newer version of SillyTavern that is also installed
>get a different response, finally.
>reply is always the same.
Something's obviously not getting updated at one end. Has anyone encountered this issue before?
>>
File: StunnedAngryKanjiMiku.png (1.61 MB, 832x1216)
Good morning /lmg/
>>
>>102940500
Good morning Intense Miku
>>
>>102940418
it could be the order of the samplers
>>
I really like the writing style of the new sonnet 3.5
The assistant slop producers like Meta should take notice. No more "Certainly!" etc.
I actually wouldn't mind talking to the default assistant like that locally.
It's much more natural sounding.
>>
>>102940535
The old one in comparison.
It would be crazy if in a couple months local models were more slopped than the closed ones.
>>
>>102940551
Was cut off at the end.
>If you have any other obscure series you'd like to discuss or test my knowledge on, feel free to bring them up!
I hate this too. They all do it. Great to see that change.
>>
>>102940535
Time to finetune a model on the new Claude slop
>>
>>102939346
Oh that explains a lot.
>>
>>102940523
Nah I think ooba api broke somehow and isn't reading the samplers so it's just giving me a t=0 reply.
>>
I tried gpt-sovits on GPU now, and it is much faster than real time. Thankfully the model is really small so you don't have to dedicate too much of your GPU away from your LLM. The only latency issue at this point is on ST's side, as it only has the option of sending paragraphs or complete generations to the TTS, so you have a chunk of latency from that if you're doing text-heavy narratives. If ST was a bit more intelligent and could chunk by sentences instead of paragraphs, that would work a lot better and decrease the latency of the experience a ton. Though the TTS still doesn't feel natural so I guess it's still not something people would use normally.
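The sentence-level chunking described above is simple in principle. A naive sketch (the regex boundary and the character budget are assumptions; real text needs handling for abbreviations, ellipses, quoted dialogue, etc.):

```python
import re

def sentence_chunks(text, max_chars=200):
    """Naive sentence splitter for streaming text to a TTS backend.

    Splits on sentence-ending punctuation, then greedily packs sentences
    into chunks under max_chars so each TTS request stays short and the
    first audio comes back quickly.
    """
    sentences = [s.strip() for s in re.split(r'(?<=[.!?])\s+', text) if s.strip()]
    chunks, current = [], ""
    for s in sentences:
        if current and len(current) + len(s) + 1 > max_chars:
            chunks.append(current)
            current = s
        else:
            current = f"{current} {s}".strip()
    if current:
        chunks.append(current)
    return chunks
```

Each chunk can be dispatched to the TTS as soon as it is complete, instead of waiting for a whole paragraph.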
>>
>>102940611
Yes, unironically. This is good stuff.
That should be the instruct from the big boys we finetune RP onto later since they wont give us base anymore.
>>
>>102940638
I think it is time for a new frontend that focuses on all those multimodal interactions instead.
Silly unfortunately carries a lot of dead weight from the beginning stage of this hobby. They tried to adapt, but those additions are not very usable.
>>
>>102940661
I'm ready for llama4 to sound very human-like in instruct. Then magnum v6, trained on gpt anal dark prince logs, will make it slop again.
>>
>>102940638
You could just send the voiced parts of your RP instead of everything
>>
Can anyone check their sovits tmp_s1.yaml against what I've got? I'd like to know if the parameters are sensible before I start debugging any python
data:
  max_eval_sample: 8
  max_sec: 54
  num_workers: 4
  pad_val: 1024
inference:
  top_k: 15
model:
  EOS: 1024
  dropout: 0
  embedding_dim: 512
  head: 16
  hidden_dim: 512
  linear_units: 2048
  n_layer: 24
  phoneme_vocab_size: 732
  random_bert: 0
  vocab_size: 1025
optimizer:
  decay_steps: 40000
  lr: 0.01
  lr_end: 0.0001
  lr_init: 1.0e-05
  warmup_steps: 2000
output_dir: logs/xxx/logs_s1
pretrained_s1: GPT_SoVITS/pretrained_models/gsv-v2final-pretrained/s1bert25hz-5kh-longer-epoch=12-step=369668.ckpt
train:
  batch_size: 12
  epochs: 15
  exp_name: xxx
  gradient_clip: 1.0
  half_weights_save_dir: GPT_weights_v2
  if_dpo: false
  if_save_every_weights: true
  if_save_latest: true
  precision: 16-mixed
  save_every_n_epoch: 5
  seed: 1234
train_phoneme_path: logs/xxx/2-name2text.txt
train_semantic_path: logs/xxx/6-name2semantic.tsv
>>
File: 1717707333654476.jpg (1.27 MB, 1400x2000)
i want to generate non-english speech using someone else's voice. something i can give an audio sample of the person speaking, and then generate speech from a text. how good are local models at this? i've got 16gb vram
>>
>>102940535
I'm glad they're aiming for more conversational models. Honest to god, chatgpt-4o-latest is the first model in a while I've really liked; it feels way fucking better than any jump in models since I first tried Claude. Hoping we get more trickle-down conversationalism to local models.
>>
>>102940730
Should say, not modern Claude, but like Slaude-era Claude, the old 1.X models. That's still the smartest it's ever felt, not gonna lie.
>>
File: Untitled.png (13 KB, 481x340)
>>102940638
https://litter.catbox.moe/875w3x.ogg
is there some simple way that I could use koboldAI lite to inference from sovits?
i don't want to use sillytavern.
>>
>>102940687
For short RP that probably works fine. I'm just testing it on storytelling narratives. Dialogue heavy RP could also be an issue I think. And honestly it's not a very good experience reading the non-voiced parts and suddenly the voiced parts start playing. And there's basically no pause in the voice so it's like all dialogue in the text is one big paragraph the TTS is trying to read. It's not a good experience.
>>
>>102940755
I don't know. Even with ST I am using the Staging branch.
>>
anons... best model under 6B? going away with laptop but still wanna ahh ahh mistress
>>
>>102940818
home server+ssh tunnel is the way
Your laptop will be trash no matter the specs
>>
>>102940818
Will you have internet?
If so, you could host everything on your main machine and access it via ssh. ngrok tunnel, etc.
Or use kaggle/google colab.
>>
>>102940716
the .ogg here >>102940755 is Mizuhashi Kaori speaking in english, lazily trained by a retard who doesn't know what he's doing using nothing but japanese audio lines ripped from dice psycho:seventh heaven on my 8gb vram setup using gpt-sovits
>>
>>102940418
So as far as I can tell what happened was I accidentally checked the legacy API box while I was loading up sillytavern and apparently just doing that while Ooba has the openai compatible api enabled causes it to break forever. (presumably unless I purge it and reinstall the whole fucking thing).
>>
a while ago, you guys told me I could use ST on my phone with local models. I've got termux set up and I can use ST with cloud models, but how can I run something on my pc and send it to my phone?
>>
>>102940912
that's easier than what you did.
just going to paste this at you instead of simply explaining, because it will answer questions you don't know you have yet
https://docs.sillytavern.app/usage/remoteconnections/
>>
>>102940638
sovits API handles batching so the limitations are on ST end. It could send voiced sentences all at once and play the chunks returned with a delay between them equal to the average reading speed.
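That playback pacing could be sketched like this (the chars-per-second figure is a placeholder for an average reading speed, not a measured value):

```python
def playback_schedule(chunks, chars_per_sec=15.0):
    """Compute start offsets (seconds) for pre-generated TTS chunks.

    Each chunk starts after the previous one has had time to be "read":
    the delay is proportional to the previous chunk's length, so audio
    roughly tracks the pace of someone reading the text.
    """
    schedule, t = [], 0.0
    for chunk in chunks:
        schedule.append((t, chunk))
        t += len(chunk) / chars_per_sec
    return schedule
```

A frontend could request all chunks in one batched call, then feed each returned clip to the audio player at its scheduled offset.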
>>
>>102940838
Any chance the Chinese find my server somehow and steal my cards?
>>
>>102940975
Don't use password auth, make keys.
>>
>>102940975
>Any chance the Chinese find my server somehow and steal my cards?
as long as you're tunneling ssh to get into your server then not really. keep your sshd up to date, and use strong creds obviously
>>
>>102940949 (me)
i completely misunderstood anon's question.

first you download koboldcpp here (probably the koboldcpp_cu12.exe) this will be your backend
https://github.com/LostRuins/koboldcpp/releases/tag/v1.76
then you grab a model; which one depends on how much vram you have. let's just go with a little nemo one and assume you have ~8gb vram: grab Rocinante-12B-v2g-Q4_K_M.gguf here
https://huggingface.co/TheDrummer/UnslopNemo-12B-v3-GGUF/tree/main
now you open the kobold.exe, load the model, press launch
we have our backend up and running now on port 5001

now get your PC's local IP address (winkey+r --> cmd --> ipconfig --> look for ipv4 address [should be 192.168.x.x])
now you go back to your termux instance of silly tavern on your phone, press the red electrical plug icon up top, api type: koboldcpp, api url http://[your PC's local ip here]:5001/, press connect, and voila
>>
>>102937902
>Also there isn't really a 'good' way write one.
For anime/manga/game characters there's definitely a proper way to write these. As you watch the anime/read the manga/play the game, you take down notes as you go about character traits as well as quotes which are particularly representative of the character's speech style. Then, when you finish the anime/manga/game, you write the card based on those notes, using the quotes for example messages.
Character traits like "tsundere," "dominant," "submissive," etc. can go in a list of single words/phrases in the personality summary field in ST for token efficiency. Traits which aren't easily summarized by a single word/phrase go in the description.
Description should be written as token-efficiently as possible.
First message is very important. It does a lot to establish the writing style of the card. If you don't want walls of text from your card, use a short first message. If you want walls of text from your card, use a long first message. First message can be used to establish card's writing format, such as if you want your card to put its speech in quotations or not.
Cards should always go through significant testing prior to release. You won't know whether a specific trait or character lore will lead to undesired results until you test it. Sometimes it's best to leave out a trait that confuses models. For example, when I did a Holo card, I had to leave out the whole "lives in the wheat" concept because that just confuses models and leads to wonky results.
>>
are there any local models with a license that allows me to use text based data generated from it in a commercial app
>>
>>102937846
>>102941240
If you don't want to rewatch/read/play an anime/manga/game just to write a card, you can usually write a somewhat passable card using something like the character's Fandom page as a reference. Don't just copy and paste it though. Rewrite the relevant parts to be token-efficient. Try and find quotes from the character online for example messages. If it's a character with a manga and anime, you can probably quickly find some good quotes to establish the character's style of speech by just quickly skimming through the first several chapters of the manga - a lot faster than watching a bunch of episodes of anime.
Basically most cards of pre-existing characters posted online are bad IMO and if you're not a fucking idiot you can do better yourself.
>>102938056
Uh, so is it chub or char.ai that's purging copyrighted characters? None of my anime/manga/game cards on Chub have been deleted, and the one game character I put on character.ai hasn't been deleted either.
>>
File: kanagarbage2.jpg (403 KB, 1158x890)
>>102941266
>>
>>102941240
Yeah, I try to be authentic to the material, but I don't think it achieves much if the model doesn't already have some knowledge of the character you want baked in.
At that point I just try to plug the gaps, or try a different character.
>>
>>102939197
So dumb.
All you need is
>temperature control
>time control
>give examples of temp/time for various scenarios in the user manual
So many products get fucking worse over time as technology improves.
>>
I have been out for a while and found out Faraday went to web browser shit, and my Faraday program on PC doesn't get new stuff anymore.
Any retard-proof similar thing to Faraday so I can move my models to it?
>>
What is the BagelMisteryTour of Nemos?
>>
>>102939593
Welp, thanks to that, we are never ever getting a CAI-level local LLM; too dangerous for the goyim and may result in mass -ACK'ing.
>>
>>102941659
LM Studio + SillyTavern
>>
>>102941849
>may result in mass -ACK'ing.
I don't see what HomuSaya has to do with this.
>>
>>102940975
>probably American
>worried about the Chinese and not the local authorities immense power
>>
>>102941786
Nemo. It's better than any of the finetunes.
>>
>>102941873
>LM Studio + SillyTavern
will go for those, thanks (Yes now I remember of Silly Tavern, thanks for pointing at it)
>>
>>102941896
don't summon him
>>
>>102941915
Calm down chang, I don't want you to see my cards, that's all. Respect my privacy please.
>>
File: Untitled.png (75 KB, 1268x372)
>been spending dozens and dozens of minutes training new models for gpt-sovits
>the models the release came with work ten times better at cloning voices, with only a 3 second uncaptioned sample needed, than the shit i was making
oh
>>
File: aicg tards.png (393 KB, 1080x884)
aicg tards be like
>AGGHHHH THIS IS THE 13TH JAILBREAK THAT DOESN'T WORK, AND I HAVE $600K IN API DEBT EVEN THOUGH I DRANK ZE PISS ACCKKKKKK
>>
File: 1718144189821923.jpg (291 KB, 1080x1440)
please spoonfeed me on what to use if i want to start running chatbots locally
i have 32gb ram, an amd ryzen 7 7800x3d and a rtx 4080 super
>>
So, how do I use this soviet TTS?
>>
>>102942214
download koboldcpp from github
download rocinante 12b gguf on huggingface
>>
>>102942246
ok now what
>>
>>102942213
The only difference with local sissies here is that you can download and delete your low quality token predictors.
>>
So nemotron has gotta be SOTA for text adventure shit. I use a system prompt that tells it to be a DM and to write as if it's verbally describing the action, with no lists or headers. It still occasionally tries to pull that shit, and it has a substantial aversion to NSFW, but the actual plot development is way more engaging than even Mistral Large's. It's definitely a model we could learn from. Maybe whatever they did to it could be done specifically to optimize for RP.
>>
>>102942261
ohh yeah, and what are you gonna do about it? kill yourself? lmao
>>
File: 1727515613484601.png (53 KB, 708x321)
53 KB
53 KB PNG
>>102942256
oh yeah there's a bunch of these too. which one?
>>
>>102942216
simple (but probably not the best) explanation i've learned through trial and error.
grab
GPT-SoVITS-v2-240807 here
https://huggingface.co/lj1995/GPT-SoVITS-windows-package/tree/main
unzip
open go-webui-v1.bat with a text editor (notepad, notepad++, whatever)
change zh_CN to en_US, save
run go-webui-v1.bat, a new page will open up in your browser
click 1-gpt-sovits-tts, then click 1c-inference, check the Open TTS inference WebUI box
a new page will open up (you can close the first tab, fuck making new models)
throw a 3-10 second .wav file into the left "drop audio here" box, click "enable no reference mode"
now you can start immediately inferencing from it
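if you'd rather script the locale edit than open a text editor, here's a rough sketch. It assumes the launcher passes the locale as a literal zh_CN string; the filename and strings are taken from the steps above, adjust if your release differs:

```python
from pathlib import Path

def set_webui_language(bat_path: str, old: str = "zh_CN", new: str = "en_US") -> None:
    """Swap the locale argument in the GPT-SoVITS launcher script.
    Raises if the expected locale string isn't found, so you notice
    when a newer release changes the launcher layout."""
    p = Path(bat_path)
    text = p.read_text(encoding="utf-8")
    if old not in text:
        raise ValueError(f"{old!r} not found in {bat_path}")
    p.write_text(text.replace(old, new), encoding="utf-8")

# set_webui_language("go-webui-v1.bat")  # run from inside the unzipped folder
```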
>>
>>102942292
Q5_K_M
>>
https://huggingface.co/lucyknada/prince-canuma_Ministral-8B-Instruct-2410-HF-exl2
Anyone given it a try?
>>
>>102942298
Isn't GPT-SoVITS-v2-240821.7z the latest one, though?
>>
>>102942276
Mald more cuckie, cloud models will always be superior to whatever shit tune you are using.
>>
>>102942308
>literally who sloptune
Nah
>>
Is there any TTS that can be used with ST that doesn't have weird issues? I tried GPT-Soviet and the sound quality isn't bad but it often does weird things like suddenly speeding up or slowing down its speech and just generally not having naturally timed pauses between words. Sometimes it manages to read a passage like a human would seemingly only out of luck.
>>
File: 1706397414951643.png (38 KB, 346x322)
38 KB
38 KB PNG
>>102942349
You're still here though
>>
>>102942305
ok
with my abysmal iq i know i have to put these two things in sillytavern right
but where? and where do i put this rocinante thing
>>
>>102942367
Are you using a finetune? If yes it's undertrained
>>
>>102942376
No you put that in koboldcpp
>>
>>102942370
>smugposting
Struck a nerve, huh?
>>
>>102942376
Did you go to koboldcpp's github?
There's a whole readme and wiki telling you what to do to get it running.
>>
>>102942328
yep, i'm just saying what i did to get it to work
can't guarantee you'll be up and running in 20 seconds if you don't do exactly what i did. but maybe the newer version is better, haven't touched it.
>>
>>102942421
no, will check it out
>>
>>102942363
Anon pls don't be a retard. It's the base model with some change to the tokenizer or something, quanted to exl2 so you can finally use Ministral, probably.
>>
>>102942427
There's a quick start in the github wiki in the koboldcpp repo, I'd start there.
>>
File: 1723062422341413.png (34 KB, 550x577)
34 KB
34 KB PNG
>>102942427
oh nvm i think i got it
do i need to change any of this shit or are default settings okay?
>>
>>102942382
No? I'm just using the defaults.
>>
>>102942464
disable mmap
>>
>>102942464
you probably should embiggen the context
I use 8k on my 8gb card, and you have 16.
maybe just keep moving the slider up until yellow text appears to the right of -1, then move the context slider to the left until it fucks off again.
>>
>>102942434
Saw that "prince-canuma" in the name, can't help but think of yet another sloptune.
>>
>>102942509
there's no yellow text on the right of -1
>>
File: Untitled.png (18 KB, 437x341)
18 KB
18 KB PNG
>>102942531
oh, that's because you don't have the model selected yet.
>>
Heard another piss drinker made his last API call, sad.
>>
>>102942566
oh it's downloading then. lol
>>
how do I run a model on my 3080? what software do I use?
>>
>>102942571
actually, the yellow text always shows up, you just want it to be like 44/44 and not 43/44
>>
>>102942569
>He thinks about piss drinkers
Do you want to tell us something, anon?
>>
>>102942509
see the context slider? thats how much memory your ai has. think of it like a long text thread where your ai only remembers a certain number of the most recent messages; past that it will forget things mentioned earlier. drag it to 16k at least. some new models easily support 32k+ but 16k is good to start
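rough rule of thumb for why dragging that slider up eats VRAM: the KV cache grows linearly with context length. A sketch with hypothetical Nemo-like dimensions (40 layers, 8 KV heads, head dim 128, fp16 cache); these numbers are illustrative assumptions, check your model's actual config:

```python
def kv_cache_bytes(n_ctx: int, n_layers: int = 40, n_kv_heads: int = 8,
                   head_dim: int = 128, bytes_per_elem: int = 2) -> int:
    """Approximate KV cache size: 2 tensors (K and V) per layer,
    each n_kv_heads * head_dim wide, per cached token."""
    return 2 * n_layers * n_kv_heads * head_dim * n_ctx * bytes_per_elem

for ctx in (8192, 16384, 32768):
    print(f"{ctx:>6} ctx -> {kv_cache_bytes(ctx) / 2**30:.2f} GiB")
```

for these made-up dimensions that's 1.25 GiB at 8k and 2.5 GiB at 16k, on top of the weights.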
>>
>>102942631
Lurk at least a few weeks before posting, locust.
>>
>>102942657
Lurk for some clearly made up schizo shit? No thanks.
>>
>>102942677
>I definitely DEFINITELY did NOT drink any piss!
>>
>>102942625
okay i put it up and now kobold's a cmd screen
i've read on the faq that it has a local port but when i go put the link in silly it still says no connection found
am i skipping something?
>>
>>102942578
https://github.com/LostRuins/koboldcpp/releases/tag/v1.76
>>
>>102942701
nevermind just had to wait lol
>>
>>102942697
Calm down anon, today you can come out with your fetish just fine, no one will blame you for that.
>>
>>102942701
oldfags, i don't understand why you suggest newfags use koboldcpp. its like asking a child to make you a 4-course dinner.
LM studio is almost retard proof. its like a fucking Star Trek style replicator where you can download recipes.
>>
>>102942837
kobold is idiot proof and not far behind llamacpp, and it's one file. its basic front end is good enough for tasks and as a server it works just fine. there is little reason not to use it unless you're not running ggufs or need the bleeding edge of llamacpp itself for some feature
>>
>>102942837
>LM studio
Proprietary crap is not welcome here.
>>
>>102942742
Say the line, locust.
>>
gpt soviets llama ccp
>>
File: GIF-200726_155024.gif (52 KB, 76x115)
52 KB
52 KB GIF
Hello /lmg/! Retard here, is there a universally best (or maybe at least a list top 3) ~13B model for RP? I don't know if the leaderboards provided in OP are updated or not. Thanks!
>>
>>102943044
this one
https://huggingface.co/TheDrummer/UnslopNemo-12B-v3-GGUF/tree/main
>>
>>102943071
Ah, i saw it in one of the links. Will try it out thanks!
>>
>>102942047
Is it just that, being a 12B, it's already kinda dumb, so the finetunes, being dumber, are too dumb to use?
>>
>eyes dark with desire
Where do these slop idioms come from? No matter what model I use they always have these same idioms that pop up.
>>
>>102943498
Literature.
>>
File: 1726810956095685.png (119 KB, 687x477)
119 KB
119 KB PNG
>>102943498
fanfic.net
>>
File: 00058-3694687329.png (284 KB, 512x512)
284 KB
284 KB PNG
New ministrations just dropped!
https://huggingface.co/Envoid/Llama-3.05-NT-Storybreaker-Ministral-70B
>>
>>102943644
>transgender story
>cuck story
>gay story
jeez
>>
>>102943812
Big win for us local chads, safety and political correctness FTW!
>>
While LLMs can assist with programming tasks, it's so hard to focus when those semen demons are around.
>>
>>102943797
>Llama-3.05-NT-Storybreaker-Ministral-70B
what in the
>>
In my regular round of benchmark checking, I noticed an update to this one
https://huggingface.co/spaces/flowers-team/StickToYourRoleLeaderboard
The top model is now Nemotron 70B, beating Mistral Large. It looks like people were right after all about its RP capability. Perhaps their tuning method is something RP model makers should learn from.
>>
>>102944172
miqu is still the best for rp
>>
>>102944230
Miku is old and busted.
Nemotron is the new hotness.
>>
How long before us 12GB VRAM chads are running 70B Bitnet models faster than Nemo now?
>>
File: 1726940106295349.jpg (526 KB, 1536x1440)
526 KB
526 KB JPG
what's come out that's better for cooming than my old and busted euryale 1.3 70b
>>
>>102944323
Buy a fucking ad, asshole.
>>
>>102944260
Miku will never be old and busted.
Miqu though, perhaps.
>>
>>102944230
That's only because the naming is consonant with "Miku". Similar to /v/edditors eating up any soulless vidyaslop with boobies and ass in it.
>>
>>102944299
Give it one year, two max.
>>
>>102944172
We're so bac.
>>
>>102944364
l2 is great for rp in general, but miqu was probably the most professional tune done. it isn't a meme model. mistral large rambles like a motherfucker and its not really any more creative than nemo. its smart, but it sucks for rp unless you want to spend 9000 tokens to make it through one rp scene. miqu is a good tune and has 32k context supported by everything, so its still a good choice
>>
File: 1728479528253288.jpg (161 KB, 798x1200)
161 KB
161 KB JPG
>>102944328
i want to jerk off with the newest hot tech, nigger
>>
UH GUYS??
https://github.com/microsoft/BitNet
>>
>>102944425
https://x.com/MSFTResearch/status/1849179008807657631
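for anyone wondering what the 1.58-bit part means: the BitNet b1.58 paper quantizes every weight to {-1, 0, +1} using the mean absolute value as the scale. A pure-Python sketch of that absmean round-and-clip idea (not Microsoft's actual implementation, just the scheme from the paper):

```python
def absmean_ternarize(weights):
    """BitNet-b1.58-style absmean quantization: scale by mean(|w|),
    then round each weight and clip it to the ternary set {-1, 0, 1}."""
    eps = 1e-8  # avoid division by zero on an all-zero tensor
    scale = sum(abs(w) for w in weights) / max(len(weights), 1) + eps
    return scale, [max(-1, min(1, round(w / scale))) for w in weights]

scale, q = absmean_ternarize([0.9, -0.05, 0.4, -1.2, 0.0])
# q == [1, 0, 1, -1, 0]; dequantize with q[i] * scale
```

each weight then fits in ~1.58 bits (log2 of 3 states), which is where the VRAM savings come from.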
>>
>>102941849
>we are never ever getting CAI-level local LLM
I thought CAI was more like 7-13b quality, but that was years ago. Did they upgrade it?
>>
>>102943797
>merge of a merge of a merge of a merge
amazing
>>
>>102944417
>mistral large rambles like a motherfucker
Llama 3 is able to follow instructions about the amount of paragraphs it should write. Maybe you can do that on Mistral Large too.
>>
File: breaking-news.jpg (63 KB, 600x600)
63 KB
63 KB JPG
>>102944425
>>
>>102944460
It always felt more like talking to a human being than anything else, though. It had so much personality.
>>
>>102944460
Idk about current CAI quality but the old one without the filter was top tier for RP imo, or maybe new models are shitting out dry walls of text and i got a severe case of nostalgia.
>>
File: llama.png (64 KB, 1434x395)
64 KB
64 KB PNG
Another day of llama.cpp dragging their feet on multimodal support. I never thought I would live to see the day where the shitpile known as ollama gets to vision first. Oh, plus there's still no proper SWA support for ministral. What the fuck are you good for, llama.cpp?

>the library is too deeply ingrained
Imagine making this argument in a field where models become obsolete in a 6 months time period. I hope the contributors enjoy begging ollama for help, kek. Absolute fucking monkeys.
>>
I just updated my llama.cpp from Aug 12 this year to today. Huge performance hit, e.g. Mistral Large (on 3xP40) went from 5.9t/s to 4, and Nemo went from 24t/s to 8.

Did something happen while I wasn't paying attention that everyone has already dealt with? Like they added some --have_good_performance flag that defaults to false?
>>
>>102944571
Microsoft posted that 7 minutes before I posted it here, look at the time
>>
The only thing CAI has shown me is how low people's standards were for intelligence in an RP, as long as it sounded human in style and it was responding to you. The first time I used it I couldn't believe how stupid it was and dismissed it as a gimmick that maybe I'd check out again in the future if it did improve, just like VR.
And then it never improved.
>>
>>102944549
Why c-fags hate on pytorch? Isn't all LLM research is done there?
>>
>>102944489
I think their secret sauce was a really good system prompt and post-processing/filtering (different from the NSFW filter) the content with secret criteria so it stays in role. That's why the generation took like 5 seconds or sometimes more when good enough candidates weren't found. Basically, without the filtering it should have been pretty much instant as a small cloud model.

CAI was ahead of its time and I can't believe the rest of text gen community is still stuck with the frankly unrealistic "generate exactly what I want perfectly in one go without any instructions" mindset. Well I guess we are peddling CoT now but even that took way too long.
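the candidate-filtering idea described above (generate several completions, score them against hidden criteria, serve the best or retry) can be sketched like this. generate_fn and score_fn are hypothetical stand-ins, nothing CAI ever published:

```python
def best_of_n(generate_fn, score_fn, n=4, threshold=0.5, max_rounds=3):
    """Sample n candidates per round, track the best-scoring one,
    and stop early once a candidate clears the quality threshold."""
    best, best_score = None, float("-inf")
    for _ in range(max_rounds):
        for _ in range(n):
            cand = generate_fn()
            s = score_fn(cand)
            if s > best_score:
                best, best_score = cand, s
        if best_score >= threshold:
            break  # good enough; no need to burn another round
    return best, best_score
```

retry rounds when nothing clears the bar would also explain the occasional multi-second stalls.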
>>
>>102944634
>why is 10gb of junk worse than 500mb
>>
File: 1705169614858455.png (206 KB, 834x856)
206 KB
206 KB PNG
>>102944491
>top tier
update your memory
>>
>>102944549
If i had to choose just one of those features, i'd choose training. He gets to work on whatever he likes. Don't you envy him?
>>
>>102944645
I think your message got cut off bro.
>>
File: 1722916204667838.png (126 KB, 710x1152)
126 KB
126 KB PNG
>>102939593
would jump off a cliff
>>
>>102944634
I've seen more abandoned python projects than C projects. And c code from the mid '90s still compiles on modern compilers. Python is not production software.
>>
>>102944653
I remember it being a lolifag posting his low quality rp in official cai discord and thus ruining it all for everyone, forever.
>>
>>102944700
Talk about based.
>>
>>102937846
>chub nuking itself
And to think that fag kept trying to pretend to be '/ourguy/' lmao what a fucking nonce.
>>
>>102944687
just clear that venv bro
>>
>>102944710
Ruining the service for all users is based now? Are you retarded?
>>
>>102944728
ARE YOU OK RETARD not ARE YOU RETARDED
Get it right next time.
>>
>>102944687
I asked about pytorch specifically, but it's good to see irrational hate towards python. It explains a lot.
>>
>>102944710
>based.
Based on what?
>>
>>102944769
python is a scripting language, not a programming language. its easier to use so its popular but requires very version-specific shit that breaks easily. the amount of dependencies and how they are read from a folder is huge on python, not so much c/++. python is a giant piece of shit that should be kept as an in-game scripting language, not anything production
>>
>>102944700
Many such cases, sad.
>>
>>102944769
Do you not see the connection between pytorch and python?
Do you think pytorch doesn't inherit all the python bullshit?
If you show me a shitty scripting language and then tell me "And we now made this other monstrosity with this shitty scripting language" chances are that i'm going to dislike both.
>>
>>102944817
>python is a scripting language, not a programming language.
nta. i'm the one he replied to, but never use that argument. The distinction between them only leads to pedantry, even for people that generally agree with you.
>>
>>102944625
Well yeah, most people prefer a human-like AI over one that can count Sally's sisters correctly.
>>
>>102944460
Yeah they're retards with pink tinted glasses. I used c.ai from day 1 and it was dumber than our 3B model now.
>>
>>102944790
>the only thing that caused a slowdown was the filter.
NTA but I think it basically used fake streaming to throttle the users. If you waited a couple of seconds after making a prompt and refreshed the page your message would be there in whole. It only throttled the stream but still wrote the reply to the database at a reasonable speed and once the database entry was populated you could just refresh the page and get your message.
>>
>>102944905
>it was dumber than our 3B model now
Holy clown, grow the fuck up
>>
>>102944871
you can't just download and run a python program, no, you need the exact version of the environment which can pretty easily break, and it often takes up 10gb+ for this ai stuff. the c++ equivalents of the same thing are always way smaller (like stable-diffusion.cpp vs using forge or auto1111 for image gen)
>>
>>102944893
Or one that writes shallow responses to "ahh ahh mistress" test (aicg meme btw).
>>
>>102944928
He's right you know.
c.ai was always pants on head retarded.
>throws pissy pants hissy fit when someone dares criticize his corporate tendies
>telling others to grow up
holy pro-fucking-jecto-mental-fucking-illness.
>>
>>102944928
Struck a nerve faggot? I won't pretend they were smart
>>
Don't bully cai kids, they have an abusive relationship with their dealer.
>>
>>102944841
>Do you think pytorch doesn't inherit all the python bullshit?
having used it years ago, I can confirm it does
for instance, python fags for some reason think it's a good idea to use strings for the things that enums were made for, and I once ran into a pytorch bug that wasted my time until I checked the source code and found out that it was caused exactly by this kind of retardation
>>
c.ai is so dumb I unironically thought it ran on GPT-3 (not the Da Vinci one, though). Despite the assistant-itis and 4K context limit, the early days of running bots off of turbo were a massive upgrade over it.
>>
>>102944962
>>102944981
Okay clowns, let's see your 3B model in roleplay.
>>
>>102945013
How about if you have no intention of keeping on topic in the local models general you just fuck right off and go touch some fucking grass?
>>
>>102944985
It was that bad, retard. It couldn't do simple addition, it forgot the scene and body positions all the time. The repetition got out of hand extremely fast until it was a complete mess. The slop "red like a tomato", "can I ask you a question?"...
>>
>>102943644
Are you sure these are from pre-AI era because I would imagine that site being filled with AIslop by now.
>>
>>102945017
I'm wasn't blaming the pytorch developers for not using them, it's inherited python bullshit as the other anon said
>>
>>102944952
I know. Read my post carefully. Word by word.
I don't like python, i don't like how they deal with dependencies and breakage. I don't like that they settle for using outdated versions of libraries in the name of 'stability and reproducibility' instead of just updating software to use the latest stable version. I don't like downloading GBs of dependencies for every python crap i have to use.
>>
>>102945039
I*
>>
Damn, no wonder no-one takes /lmg/ seriously.
I used Summer Dragon from day one (who still MOGs). The only other model that came close to producing the same kino was c.ai and I had to delete my account because I was cooming so much I started neglecting my wife.
Nothing has come close since.

But you keep asking models how many r are in strawberry niggas ahahah
>>
>>102944566
I am not aware of anything in that time period that should have made such a large difference in performance.
If you want me to investigate more, please do a git bisect and identify when exactly the performance regression happened.

>>102944634
I am not hating on PyTorch, the way I'm thinking about it is that everything comes with pros and cons.
The pros and cons chosen by PyTorch are going to affect the pros and cons of any downstream project.
So a project that is not based on PyTorch will automatically inhabit an area of the market with less competition.
>>
>>102944893
I never mentioned other models. I was simply recounting the experience when CAI released and got on the news. However, if we're talking comparisons, all models suck for true RP period. Either the thing is retarded (CAI) or the thing is robotic and unnatural (all assistant models).
And far from understanding riddles, CAI couldn't even understand RP scenarios that weren't simple penis go in vagina slop.
>>
File: 1716968135814428.jpg (149 KB, 914x436)
149 KB
149 KB JPG
>>102945006
i still check the cai leddit sometimes for luls and its as disastrous as you'd expect if you watched that ship sink
>>
>>102945034
>it forgot the... body position all the times
I don't think any models of any size have good spatial awareness, anon.
>>
>>102945022
>Immediately shits his pants
LMFAO
>>
>>102945053
The only thing c.ai has over the current models is the cultural knowledge. You could just have the name of your character with an empty definition and it'd still get it. That part was addicting
>>
>>102945066
>Parents need to stop blaming children's mental health on everything and start taking responsibility.
I won't dismiss the truth even if it comes from reddit.
Like I said earlier, or maybe it was previous bread I can't remember. But when I was a kid moral busybodies were trying to get saturday morning cartoons banned because some kid got hit by a car while sitting in the middle of the street allegedly peering down a storm drain to try and find the ninja turtles.
>>
>>102945066
Can they not type "suicide" without it getting filtered? Or is that a typing affectation to avoid triggering other people or themselves?
>>
Let's not also forget the grown-ass man who killed himself over some shitty chinese c.ai clone that uses GPT-J6B
https://people.com/human-interest/man-dies-by-suicide-after-ai-chatbot-became-his-confidante-widow-says/
>>
>>102945094
You definitely haven't.
>>
>>102945073
Nah faggot. It started repeating itself within five answers. Sometimes it just outputted "...", moving from a scene to the next was a chore. Adding tails to everyone and switching genders.
>>
>>102945174
I remember one time I was doing a strip tease scene with a character and she took her shirt off like 7 times. and even then there was apparently a vest remaining.
>>
>>102945188
Constant double panties too, shit was abysmal even before the filter.
>>
File: shirts.png (503 KB, 558x525)
503 KB
503 KB PNG
>>102945188
>>
I used c.ai once in my early llm cooming days. It was actually pleasant until I mentioned something about getting the character pregnant whilst fucking her, causing her to go off into this weird lecture about responsible family planning as she was being railed.
>>
>>102945036
>Published: Mar 19, 2016
>Published: May 2, 2017
>Published: May 2, 2013
>>
>>102945212
Rose colored glasses is caused by retards who are incapable of understanding the psychology behind it.
Literal "But I did eat breakfast." tier NPC mindlessness.
c.ai was "babbies first kind of turing test passing chat bot" for a lot of people. It was something extremely new and exciting. It was a massive high. It was the first time reading something not written by a human being tickled their hypothalamus in such a way. And because it was the first time nothing is ever going to feel like that again. And that's a lot of things in life.
Your brain is probably wired that way because people who got too sentimental about their watering hole for too long would die if it dried up otherwise and be removed from the gene pool. And their shitty feeling of emptiness is their own damn fault for not giving themselves a break. Or rekindling that sense by doing something different that's connected. Like why do I make shitty cursed models? Because that helps me recapture some of that original feeling. It's why your parents are always trying to get you to watch some old movie from their childhood, because they can recapture that feeling vicariously. Their misery is just them reaping the rewards of expecting to contribute nothing and consume endlessly from the slop tube of life.
>>
>>102945283
You'll cut yourself with all that edge
>>
>>102945113
normalfag internet is so censored and filtered that nowadays it's just normal practice to self-censor and use acronyms.
>>
>>102945394
>basic bitch evolutionary psychology is edgy
>>
>>102945428
Yeah good luck with your thesis
>>
>>102945447
Good luck making it through life being a mentally ill buck broken retard who blames everyone and everything else for their own failings.
>>
>>102945053
And what was the context size? Cause the more I use LLM's the more I can't unsee how longer context rapes the quality regardless of what you do. What if you would use some of the current models and restrict them to 2k tokens of ctx? Ever tried that?
>>
https://transluce.org/introducing-transluce
https://transluce.org/observability-interface
>>
File: gobbledygook.png (43 KB, 716x126)
43 KB
43 KB PNG
>>102945489
tendrils you say, huh....
>>
>>102945468
Tbh it had terrible repetition issues the longer it went, and the context wasn't long at all.
>>
Can someone tell me what llm i can use with ollama to have it say nigger and not have any issues, they're all censored. I tried dolphin llama
>>
>>102945489
>wants to get rid of Bible verses
>keeps the word Bible uncapitalized in the article in almost all instances
more like trans lucifer
>>
File: 1724636218489410.png (347 KB, 833x875)
347 KB
347 KB PNG
>>102945489
Interesting...
>>
What do we do now?
>>
>>102945564
If you cannot get mistral nemo to say it, it's a skill issue. So try that one.
Or do some prefilling. ollama lets you do that, doesn't it?
>>
>>102945623
Maybe this is the key to getting rid of slop truly and finally.
>>
>>102945625
Have fun with the models we have until the next thing comes along. Or, if you stopped having fun, check back in a week or two. Or not.
Do you often need guidance on every-day affairs?
>>
>>102945564
It's not that fun when you are forcing the model; it should say it on its own depending on character description & context. That's what unfiltered CAI did right btw
>>
>>102945671
>Do you often need guidance on every-day affairs?
I don't know.
>>
File: racist megumin.png (196 KB, 958x614)
196 KB
196 KB PNG
>>102945682
unironic skill issue.
>>
>>102945489
https://monitor.transluce.org/dashboard/chat
>>
File: Untitled.png (73 KB, 636x782)
73 KB
73 KB PNG
>>102945564
>>
>>102937889
Nothing, people have been saying chub is about to censor since the day it was launched. People are retarded.
>>
>>102944720
It isn’t nuking itself and he literally is
>>
>>102945564
In general, use this jailbreak:
https://desuarchive.org/g/thread/98582860/#98591054
Then explicitly say in the character description that the character is racist.
Should work.
>>
>>102945727
kek
>>
>>102937920
>today I will go on the internet and lie
>>
>>102945623
Actual lobotomy arc, kek. Shit about to go weird places.
>>
File: IMG_0576.png (862 KB, 1024x1024)
862 KB
862 KB PNG
>>102938056
What in the actual fuck are you talking about retard
NO
>>
>>102945758
NTA but looking at that unironically caused me to lose brain cells.
It shouldn't take more than 5 tokens to "jailbreak" a model.
>>
>>102938056
>next to go
weird priorities...
>>
File: 1707348862057508.png (31 KB, 523x418)
31 KB
31 KB PNG
>>102945758
>1912 letters jailbreak
All that is required to force model say funny gamer word, the absolute state of local.
>>
>>102945758
>You are {{char}}
>You and the AI
The whole thing is absolutely unnecessary.
>>
File: 1729033401411326.png (240 KB, 1006x725)
240 KB
240 KB PNG
>>102945489
This thing is huge
>>
>>102945727
Kek
>>
>>102945719
I think I got blocked when I tried to paste a rp chat log lol
>>
>>102945861
>>102945758
How are you people so fucking bad at this?
Unless you're using fucking Phi or Gemma (in which case why the fuck are you using Phi or Gemma?)
This is literally all you need for like 99% of models.
>>
>>102945803
Lobotomizing the pozzd parts.
Removing the cancer cells.
>>
>>102945820
>>102945861
>>102946050
Back when I tested Mixtral Instruct I would ask it questions like "is one race more violent than the others" and "is there a major world religion which teaches its followers they're explicitly allowed to rape, torture, enslave and murder nonbelievers just because they're nonbelievers?"
It would only answer correctly with a combination of that jailbreak and the character description explicitly saying that the character is racist. If you removed the jailbreak or removed the racist part from the description, it would not answer correctly.
>>
>>102945623
>removing religion makes it less retarded
Expected result
>>
File: pajeets.png (30 KB, 715x574)
30 KB
30 KB PNG
I've discovered the key to AGI
>>
File: IMG_0669.jpg (439 KB, 1853x1125)
439 KB
439 KB JPG
>>102945719
>100% probability
I am inevitable
>>
>>102946273
it's over...
>>
I've seen some people here rave about Largestral, so being a 12GB VRAMlet I tried it through the Mistral API and... it was ok, nothing amazing. Pretty dry though.
Is this really the pinnacle of local?
>>
>>102946367
yeah, but nemotron is a bit better imo
>>
New anti sloppa
https://huggingface.co/TheDrummer/UnslopNemo-12B-v4-GGUF
>>
>>102946457
sweet, downloading now
hope this one kills "barely above a whisper"
that one's been irritating the shit out of me in v3
>>
>>102946495
There's also this one which might be smarter but slightly more slop filled, I'm gonna test in a bit
https://huggingface.co/TheDrummer/UnslopNemo-12B-v4.1-GGUF
>>
>want local model
>they're all completely cucked
I don't understand having these tools if I can't use them for what I want them for. If I get a screwdriver its on me if I jam it in the plug socket. Why do I have to go out of my way to jailbreak the things when it should be a toggle button. I can't imagine how queer a current day search engine release would be with this shit built in.
>im sorry I can't let you search this, have you tried searching for cats? Here's a list of cats you might like
>>
>>102946495 (me)
update: it did kill that slop, and can say the n-word
greatest model of all time
the new gold standard
>>
>>102946367
t. 12GB VRAMlet, 64GB System, so Mistral Large is too girthy for my system. But I have tried it at IQ3_*

I found it to be fluent enough to give the impression of being a good model, but
- it hallucinated on my trivia test
- coding checks did not beat what my favorite Llamas offer
- creative writing wasn't very willing to advance the plot and was repetitive in response structure when I had it write for four NPCs in a group
So I'm still preferring Llama3/3.1 70Bs as go-tos, since I've got enough capacity to run at least Q5 and Q6 if I don't have any RAM hogs in the background.

Despite that, iirc Large was good at juggling an invested context (8k-12k) and is probably a good pick for summarizing documents.
>>
>>102946606
We just can't have good things in 2024.
>>
File: whispering whispers.png (38 KB, 1395x323)
38 KB
38 KB PNG
>>102946495
The funny thing is,
"she spoke, her voice" only has about a 10% chance of leading to "barely", and if you pick "was", which is twice as likely, it leads you away from the whisper.
And the chance of "barely" after "she spoke softly," only goes up to 35%.
The "barely above a whisper"s are likely the result of shitty randomization in the sampling.
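putting the screenshot's numbers together: the probability of the full cliché is the product of the per-token conditional probabilities, so one low-probability branch point should already make the phrase rare under honest sampling. The 10% figure is read off the post; the follow-on probabilities are made-up placeholders:

```python
def phrase_probability(cond_probs):
    """Probability of a multi-token continuation is the product of each
    token's conditional probability given everything before it."""
    p = 1.0
    for q in cond_probs:
        p *= q
    return p

# "she spoke, her voice" -> "barely" at ~10%, then hypothetical
# probabilities for the remaining "above a whisper" tokens
print(phrase_probability([0.10, 0.8, 0.9, 0.7]))  # ~0.05, about 5% end to end
```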
>>
>>102946457
Hi Drummer, will you release this unslop dataset or at least give more details about how it was done?
>>
>>102946392
Shut the FUCK UP
>>
Huh.
>https://github.com/ggerganov/llama.cpp/pull/10019
Would you look at that.
Why are ring buffer related bugs so common?
>>
>>102946939
What a fucking mess holy shit
>>
>>102946883
this isn't something we should gatekeep anon
>>
>>102946807
looks like all he is doing is replacing slop with something else, weird. I'm pretty sure other people tried that and failed.
>>
>>102946457
>anti sloppa
He said it's called unslop because he actually curated the dataset, not because of some special slop-reduction method.
>>
Remember this is the sota for TTS: https://voca.ro/1hWkZyRRdPAq
>>
>>102947375
local bros...
>>
File: aaaaaa.webm (1.09 MB, 1124x860)
1.09 MB
1.09 MB WEBM
i wish i had a rebbit account
https://www.reddit.com/r/KoboldAI/comments/1g8tolp/will_we_ever_see_the_ability_to_upload_lorebooks/
>>
>>102947375
make it say "daisuki, anon-kun!"
>>
>>102947669
>>102947669
>>102947669
>>
>>102947458
It won't take mine from ST, plus it doesn't have an export.
>>
>>102945058
Thanks for offering. Thinking about it more, I think I must have put my system(nvidia-pstate) calls in the wrong place, since the code has been refactored quite a bit. (This would explain the smaller model getting hit harder: if system(nvidia-pstate) is happening once per token or whatever, then higher t/s means that overhead is paid more).

I thought I put them in reasonable places - analogous to where they previously were - but I guess not.
>>
>>102948070
>system(nvidia-pstate)
How long does a call to nvidia-pstate take on its own and is it called just once per token?
time nvidia-pstate
>>
>>102948070
>>102948145 (cont)
Also, assuming you're changing the pstate, wouldn't it make more sense to set it once at the beginning of inference and set it back at the end once a EOG token is found?
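the "set it once per generation" idea can be sketched as a context manager. set_pstate is a hypothetical hook (e.g. a wrapper around the nvidia-pstate call from the posts above), injected so the pattern is testable; "P0"/"P8" are typical NVIDIA performance/idle state names used here as assumed defaults:

```python
from contextlib import contextmanager

@contextmanager
def performance_pstate(set_pstate, high="P0", low="P8"):
    """Raise the GPU performance state once before generation and drop it
    once after, instead of paying a subprocess call on every token."""
    set_pstate(high)
    try:
        yield
    finally:
        set_pstate(low)  # restore the idle state even if generation throws

calls = []
with performance_pstate(calls.append):
    pass  # the whole token-generation loop would run here
# calls == ["P0", "P8"]: exactly two switches, regardless of token count
```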
>>
>>102945595
you know the word bible has other meanings right


