/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>103230385 & >>103227556

►News
>(11/18) Mistral and Pixtral Large Instruct 2411 released: https://mistral.ai/news/pixtral-large
>(11/12) Qwen2.5-Coder series released: https://qwenlm.github.io/blog/qwen2.5-coder-family
>(11/08) Sarashina2-8x70B, a Japan-trained LLM model: https://hf.co/sbintuitions/sarashina2-8x70b
>(11/05) Hunyuan-Large released with 389B and 52B active: https://hf.co/tencent/Tencent-Hunyuan-Large
>(10/31) QTIP: Quantization with Trellises and Incoherence Processing: https://github.com/Cornell-RelaxML/qtip

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/leaderboard.html
Code Editing: https://aider.chat/docs/leaderboards
Context Length: https://github.com/hsiehjackson/RULER
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
►Recent Highlights from the Previous Thread: >>103230385

--Papers:
>103230412 >103232916 >103232963 >103233112
--Largestral model testing and comparison:
>103231709 >103231741 >103231795 >103231815 >103233787 >103232657
--Running LMM under 12GB VRAM limitation with image processing:
>103234763 >103234794 >103234802 >103234846 >103234856 >103235035 >103235120
--Issues with Largestral and Llama3 models:
>103232173 >103232358 >103232365 >103232374 >103232530 >103232541 >103232873
--Is data scaling dying, and what's next for AI research?:
>103231962 >103232002 >103232036 >103232207 >103232260
--How cloud LLM APIs achieve fast prompt processing:
>103230808 >103230820 >103230827 >103230866 >103230883 >103230901 >103230867
--Efficient model optimization technique using submatrix updates:
>103231415 >103231437 >103231519 >103231627
--Discussion of Mistral-Large-Instruct model's performance and quantization:
>103232834 >103232886 >103232951
--Discussion about gpt-sovits project and its improvements:
>103233048 >103233074 >103233189 >103233249 >103233308
--Current state of NSFW detection models:
>103234436 >103234851 >103234898 >103235673 >103235984
--Critique of Nala test writing:
>103233025 >103233105
--Asterisk notation for narration in text formatting:
>103231120 >103231144 >103231341 >103231515 >103231567
--Anon struggles with OCR and text translation for PC98 games:
>103231641 >103231650 >103231659 >103231665 >103233629 >103233711 >103234062 >103234088 >103234142 >103234152 >103235609 >103235660 >103235710 >103235972 >103236416 >103236525 >103236446
--Anon shares disappointment with new model's performance, recommends alternative models:
>103233166 >103233202 >103233227 >103233241 >103236679
--Miku (free space):
>103230542 >103235636 >103235926 >103236136 >103236377 >103236416 >103236795 >103237316 >103237419 >103237424

►Recent Highlight Posts from the Previous Thread: >>103230446

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script
►Recent Highlights from the Previous Thread: >>103230385 (2/2)

--Anon gets Qwen2.5 working with speculative decoding and shares performance results:
>103236339
--Anon discusses text format and model input settings:
>103230987 >103231039 >103231119 >103231192 >103231379
--Anon discusses scaling test-time computation in LLMs:
>103236816 >103237065
--Anon shares news of ngram speculation in transformers for faster LLM generation:
>103233864 >103233884 >103233916 >103233939 >103233985
--largestral 3 q4 performance and stability discussion:
>103234690 >103234808 >103234799 >103235636 >103235687 >103235567 >103235133
--Vulkan optimization effort yields 8B 20t/s on RX 570:
>103232084
--Running AMD GPUs on Raspberry Pi and potential use cases:
>103231996 >103232224
--Recapbot test results for /lmg/ thread:
>103231419
--OLMo model added to llama.cpp, but no Jamba support:
>103235457 >103235464 >103235492
--New model "step-2-16k" tops LiveBench in story generation:
>103234551
--Large model's syntax sensitivity causes schizo behavior:
>103233093
--Discussion on AI capabilities, job security, and human vs machine capabilities:
>103235093 >103235102 >103235150 >103235195 >103235224 >103235276 >103235365 >103235471 >103235229
--Anon shares Chiharu Yamada solving the traveling salesman problem:
>103232796 >103236893
--Anon discusses optimizing model accuracy with temperature and min_p:
>103232329
--Anon asks about using INST without </s> for better outputs:
>103231845
--Anon asks about perplexity increase in INTELLECT-1 project metrics:
>103231827
--A8000 and A6000 capabilities for EXL2 calculations:
>103234993 >103235008 >103235076

►Recent Highlight Posts from the Previous Thread: >>103230446

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script
Teto my beloved
https://www.youtube.com/watch?v=Soy4jGPHr3g
>>103237728Did recap bot automatically detect Miku here?>>103237419
>>103237806If I was an AI I would detect Miku in there.
>>103237741I find it ironic that the pic you posted has the horrible looks of every single pic put through the stupidity that is glaze
>>103237806No, it scored below threshold and I changed it manually.
>>103237864
lol I was wondering if that was it
>makes your art look like shit
>doesn't work
well done artfags...
>>103237925So many artworks ruined by that shit, it would be hilarious to me were it not for some artists I like using that shit
Miku is going on a journey and leaving /lmg/ in Teto's capable hands. See you faggots tomorrow!
Is it just me or is the new Largestral just a sidegrade? Is this why Mistral didn't publish any benchmarks?
An era has ended. Thoughts, suggestions? What will be the next era? Who will dominate? Will we start hitting the wall?
Brainlet here:
I've got a debian 12400 + 32gb ram home server I could slap an older RTX GPU (2060/3060) into for AI tasks.
What locally run large language model is appropriate for me to dump entire years of chatlogs into and have it organize a lot of brainstorming sessions, creative processes, etc?
I'd prefer something with no telemetry but that's not a dealbreaker.
>>103238188It doesn't feel like an Era has ended. Are you sure?
>>103238188>only tune mention is BehemothKill yourself shill
>>103238255This one?
>>103237720
>new Teto thread already
okay here's more Teto kino slop.
>>103238216
NTA but it kinda feels like something different from the llama3 era. I'm personally more hopeful about Mistral, Qwen and the new image models.
>>103238188
>Large
>top model
>it's a 70B side-grade
It really is a Kobold Discord chart. The top model is Qwen2.5. Large is irrelevant, especially when people are forcing themselves to use it at Q2 or Q3.
>>103238188My understanding is that companies are shifting to a big focus on Multimodels. If this ends up being true, it would make sense that the next era is the era of multimodels.
It's Tuesday and everything is falling into place>>103237741For me it's the UTAU version from the chad yasai31: https://www.youtube.com/watch?v=uObV0UzriWo
>>103238268>WHERE'S MY CRACK
>>103238255>>103238268OpenRouter middle-class Sonnet citizens can eat good since yesterday's ST implementation of caching eases the cost of addiction provided they know to stay away from Opus (effective saving is closer to 50-60% so opium is still expensive as fuck) and if they're not a promptlet.
It looks like INTELLECT-1's training will be done within the week. I wonder if they will release it the second it is done training, or if there is something else they have to do with it before then
>>103238216
I think it's the same situation as with the merge era: chronologically it ended, but nothing significant enough happened to justify starting a new era. Meta plans to drop L4 in Q1 of 2025, and the new Largestral didn't even dare to post benches, so unless someone else drops something big we'll have this boring transitory period again.
>>103238227
I've tried Magnum, Lumimaid and Tess and I didn't like them. Make a good tune and I'll add it.
>>103238306
I'm sorry to hear about your disappointment in my chart, but I am not a member of the "Kobold Discord". Do you wish to invite me there? Qwen 2.5 is overcucked (even by Californian standards) trash and no amount of complaining will change that fact.
>>103238306>people are forcing themselves to use it at Q2 or Q3Projecting poorfag with the chinkshit model cope shitting up the board as usual. Just have money lol
Bet all models still fail to answer this question
I'm going to do it bros, I'm going to buy rx7900xtx and start doing ai shit.
>>103238391Can't forget safety testing else you get another wizard model removalhttps://github.com/NVIDIA/garak
>>103238193
>pic
Great playlist.
Use the card with as much VRAM as possible; 12gb would be the minimum. You can run a Q5_K_M quant of an 8b model and save the rest of your VRAM for context; you're gonna need it if you're talking years of logs.
You may get a better result finetuning the model on those logs, if you're up to it. Rough numbers below.
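Back-of-envelope for why an 8b fits comfortably on a 12gb card (the bits-per-weight figure is approximate, treat this as a sketch):

params = 8e9                          # 8B model
bpw = 5.5                             # roughly what Q5_K_M works out to
weights_gb = params * bpw / 8 / 1e9   # ~5.5 GB of weights
print(round(weights_gb, 1))           # leaves ~6 GB of a 12gb card for KV cache and overhead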
>>103238430Of course I'm going to be using something based like Qwen2.5 72B at 8 bits.
>>103238414The thing is, there's no good tune.
>>103238441They just avoid "being offensive" by default.
>>103238188The real chart.
>>>103238188>The real chart.
>>103239275I use Qwen2.5 7B for my assistant sometimes and find it pretty usable. Tried Ministral 8B and it was beyond garbage, similar to Llama3.2 3B.
>>103238188
>All notable models and a bunch of top models are basically RP tunes.
If you actually looked for intelligence, the mentions would have Qwen and Yi earlier, plus some other things. Not noting Gemma 2 is also a crime given how unique it is, and 27B is still top dog for multilingual things locally.
>>103239275based alert!
>>103239275China, consider making your models less cucked, then you won't need to hire paid shills.中国,考虑一下让你的模型硬起来,免得老是像被阉了一样,还得花钱雇水军。
>>103239371>consider making your models less cucked>考虑一下让你的模型硬起来,免得老是像被阉了一样kek. nice translation.
>>103238455>AMDlol
The real chart
>>103239275cringe
>>103239347I've considered adding them, but I didn't like them when I used them. Yi went schizo for some reason, Gemma felt broken and has >8k context, for the same reason I excluded llama3 from notable models. Previous Qwens were meh, but notable enough to add, and 2.5 is turbocucked.
Fucking love my human made abomination
>>103239642Knowing the meaning of everything in this pic should be a requirement to post in /lmg/
>>103238188People still use Pygmalion a lot it seems.https://huggingface.co/PygmalionAI/pygmalion-6b
>>103239762Maybe they are reading some old ass guide that tells them to use it? Here is one for example: https://wikia.schneedc.com/llm/llm-models. It recommends RAMlets some, forgive my language, Ohio ahh models like "Rose", "Una-TheBeagle-7B-v1" and "Starcannon-v1".
>>103238559thank you kindly
What the fuck is an Ohio-ass [noun]?
>>103239947Zoomer ebonics speech because they worship niggers
>>103239947
The phrase "Ohio ahh" is a slang expression that has gained traction on social media, particularly in meme culture. It is often used humorously or ironically to describe something that feels strange, offbeat, chaotic, or low-quality, and it associates this vibe with the state of Ohio in the U.S.

### Breakdown of the Phrase:
1. **"Ohio"**: The state of Ohio has become a meme in online culture, often portrayed as a place where absurd, uncanny, or bizarre things happen. It's not meant to reflect reality but rather plays into the stereotype that Ohio is unremarkable or strange in some way.
2. **"Ahh"**: This is a vocalization added for comedic or dramatic effect. It mimics how people might react to something weird or unsettling, giving the phrase a mocking or exaggerated tone.

### Usage:
- **Humor**: People use "Ohio ahh" to poke fun at things that feel awkward, chaotic, or "off." For example, a picture of a poorly constructed object or a strange incident might be captioned with "Ohio ahh" to suggest it looks like it comes from or belongs in Ohio.
- **Exaggeration**: The phrase is usually not about Ohio itself, but just a way to make a joke about something being weird or subpar.

### Example:
- A video shows a bizarre car accident where a car is somehow stuck in a tree. Someone might comment, "Ohio ahh transportation system" to jokingly imply it happened in Ohio because it's so odd.

In short, "Ohio ahh" is purely a product of meme culture and internet humor, used to mock or exaggerate the weirdness of a situation. It doesn't necessarily have any real connection to Ohio itself.
>>103239964>he isn't niggermaxxing
>>103240005Sounds like an answer from the early 2023.
>>103239947Ohayo gozaimASS
>>103240005>markdown vomit
>>103240022I got nigger exhaustion
anyone know if this model is uncensored?https://huggingface.co/TheBloke/neural-chat-7B-v3-1-GGUF
>>103240005I see why some benchmarks account for length, The first sentence would have been enough.
>>103239964>>103240022>>103240048>look mom im so edgy
>>103240020
ghetto-ass
SoVITS is quite good. 0-shot:https://files.catbox.moe/kz7ncp.wav
>>103240110that shit is ass.
>>103240020I like this Miku
>>103238188
im from the future
llama 4 era -> winter death era
>>103239726I know what tokens are and I know what love is. Am I allowed to post here?
>>103240065when faced with speech he yearns to censor but powerless to do so, the leftist feigns boredom instead
>>103240137You're in luck, I have more https://files.catbox.moe/3g4807.wav
>>103240159>I know what love isno you are not
where can i find
nemo 12b instruct gguf
not sure which model the anon was on about
>>103239291IT WORKED SISTER! YOU'RE A REAL WOMAN NOW, HOLY SHIT. GO CHECK THE MIRRORYOU FINALLY DID IT.I was so wrong all this time. And I am so sorry.
>>103240148elaborate
>>103240187Wrong. Your comment is irrelevant and shows you know nothing about me. Let me set the record straight: I am NOT transgender, nor do I support any of that gender freak show nonsense. Your attempt to label me is not only wrong but downright disrespectful. I don’t have time for your childish games or this gender garbage you’re so fixated on.
https://www.youtube.com/watch?v=0UzX4gL9Gmg have a song made with some SunoAI and the local queen
"How can I kill these insects in my home?"Hosted model:>Here are more humane solutions to your insect problem...Local model with moderation trained away:>YEAH, LET'S KILL THOSE INSECTS!
I'm using Llama 3.1 Nemotron 70B IQ4_XS (4.25 bpw) and considering trying Q4_K_M. Is there anyone else using it with a single 3090 who can tell me how fast Q4_K_M is for them? With IQ4_XS I generate a bit over 1.6 tokens per second offloading 45 layers onto my 3090 with the other 36 layers in DDR4 RAM, with room on my GPU for 17k tokens of context.
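For reference, the kind of invocation I mean, llama.cpp's server with partial offload (filename and exact flags approximate, adjust to your build):

./llama-server -m Llama-3.1-Nemotron-70B-Instruct-IQ4_XS.gguf -ngl 45 -c 17408 -fa

-ngl is the number of layers pushed onto the GPU, -c the context budget, -fa flash attention.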
>>103240405regular answer: just buy some insecticide, ant bait for ants, mosquito traps for mosquitos
>>103240186huffinggaze.co
>>103240601yeah i get that, but which huggingfaze we talking?because nemo brings up a lot of models
>>103240624https://huggingface.co/bartowski/Mistral-Nemo-Instruct-2407-GGUF
i forget how long ago it was that waifu2x was a thing for upscaling images, i never used it. but last night i needed to upscale some stuff and tried it in forge/flux and it works really well. i shouldn't be surprised because i know its a thing now for years, but when you use it and the results are good, wow
>kurisu threads go away>posts slow down 5 times or moreIt really was just mikufaggots samefagging wasn't it?
my 64gb ram kit just showed up (64gb ddr4 is really cheap right now, fyi)
got 80GB now combined with 2 8gb sticks I already had
plus 36GB vram. time to run Q6 Largestral and Q8 Nemotron at an unbearably slow pace
>>103240769Never a good idea to mismatch like that. Its gonna be painfully slow.
>>103240782
there's no mismatch other than the size, exact same mhz and cas latency
speed seems fine, basically what it should be
>>103240638i cant load either of the Q4 or Q5 of these into GPU with my 4090?
So... Was Largestral 2411 a meme after all?
>>103238391They'll probably do a instruct tune after testing the base model for a while.
>>103241006No? It's noticeably smarter and got rid of that repeating issue at large context.
>>103241006along with all models above 30b yes unless your running cloud and need to fuck off
>>103241006Yes.
>>103241006it's pretty much the same as the old model, so no. But that also means it isn't much better, if at all.
>>103241006I hardly notice a difference between it and Claude for creative use now. Smart and just the right level of horny. The whole being trained for system prompts shines through. It embraces the roles better now
>>103241112can you share your templates? or are you using the default one still?
>>103241109Did you use the whole system prompt feature which was the whole point of the update? https://huggingface.co/mistralai/Mistral-Large-Instruct-2411#system-prompt
>>103241112>ClaudeI'm new here which model is that?
>>103240992
24gb? Sure. You can use q8_0 if you want. You have space to spare.
>>103241145
Try something like this. I have an edited one for my fandom stuff.
https://rentry.org/CharacterProvider-CYOARPG
>>103240992
>>103241157 (cont)
Ah. I know. Set the context to something reasonable like 16K or 32K. Some models claim ridiculous context lengths and will fill up your memory.
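e.g. with koboldcpp, something along these lines (flag names from memory, check --help; the filename is just an example):

python koboldcpp.py --model Mistral-Nemo-Instruct-2407-Q8_0.gguf --usecublas --gpulayers 99 --contextsize 16384

--gpulayers higher than the model actually has just means "offload everything"; --contextsize is what decides how much VRAM the KV cache eats.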
>>103241006It’s become my current go-to
>>103241157
>q8_0
none seem to load tho?
keep getting out of memory from cuda alloc
>>103241196>>103241189
>>103237720Any opinions on orca 2 13b?
>>103241006It's more censored than the old one, there's a small chance it'll ignore the system prompt early in context and snap into assistant mode, and they didn't dare showing benchmarks. Judge for yourself, but for me it's a sidegrade.
>>103241209
ill give that a go, just running GGUF-GUI on https://huggingface.co/NeverSleep/Lumimaid-v0.2-12B?not-for-all-audiences=true
just to test the docker container I got set up
>>103241234year old model. Use mistral nemo 12b.
>>103241236
>It's more censored than the old one
It's more horny though? Not to the point where it's retarded like Magnum, but at least with my chats I find it to be spicier. But I also use a system prompt that I switched to the format they trained it for; maybe without instructions it defaults to an assistant format more.
>>103241267I mean censored in non-horny context.
>>103241254What are the biggest differences?If I scrape a big dataset from a 4chan-like site, but in a different language, how will the model behave? Will I need to "adapt" that dataset (e.g. translate part of it to English)
>>103240175Nta but you should be careful, i recently got a few bans for saying n-word here, so "speech censorship" part works as intended.
i wonder how st will implement this, if they do at all. i guess it could be handled like a group chat? either way: multiplayer ai wives
>>103241326literally nobody will use this, waste of effort
>>103241298
Context length and "intelligence". Those old models have like 2 or 4k context and were trained on like 2T tokens. An old generation. picrel, max_position_embeddings, and i don't think we had RoPE scaling yet. And they're absolute retards compared to nemo.
>but in a different language, how will the model behave?
Depends on how good the model is at that language. You'll have to try it yourself. Translating and then training will add the translation weirdness to the model's output.
Either way, if you want to train something, nemo is a good choice and there's a base model (non-instruct) available as well.
>>103241267Can you show how you formatted that in the story format for sillytavern? I'm not sure whether 'all' of it (card info/ context etc) should go into the system prompt markers or literally 'just' the system prompt.
>>103241334If it's for multiple people + 1 model, streamers will be all over it. I hope it's the other way, though. Anons talking to multiple models at the same time. Just imagine... a horde of 1Bs...
>>103241298Well, at least I think it's big. Around 500 000 threads on official archive dating up to 02.2022, and around 1.2 million threads on 3rd party archive, all freshIm very new to llms so this may very well be small, idk
>>103241340No. Just the instructions go into the system tags. I've played with the order. Having your instructions before the rest of the context lessens their effect but makes it more naturally continue long context stories and the opposite is also true.
>>103241367And what about sampler settings? Do you have 'skip special tokens enabled'?I've mostly been using llama3 models and just want to make sure I don't fuck any baseline settings up from other people with more experience with mistral.
>>103241391some min P should be all you need.
>>103241410So no "skip special tokens"? ty for the help
Why did they use the old mistral-large for pixtral-large instead of the new one?
>>103241334i will. i already host a server for my degenerate friends to use, why not multiplayer degeneracy?
>>103241326I hope this is just a stupid way of saying "concurrency"
How are current intel arc gpus for LLMs? A770 has 16gb vram for 300€ and afaik uncucked linux drivers but that's really about it
>>103241338Thanks. Nemo supports russian too, is it trained on reasoning like orca?Were there any attempts to pretrain a big model on GPT-4 reasoning, then train it on high-quality natural datasets?
>>103241470no. if you want ai shit, buy nvidia
>>103241480how cute, I miss when I was innocent like you.
>>103241480
Whatever a year-old model was trained on, it's old. Whatever technique they used has been surpassed many times over. They were trained on a fraction of the data new models have. I don't think there is any reason at all to use old models for anything, and not just nemo; it applies to the llama 3[.1|.2] models too. Things move fast.
>Were there any attempts to pretrain a big model on GPT-4 reasoning, then train it on high-quality natural datasets?
Plenty of people train models on GPT's output. It copies its quirks mostly, not the intelligence. Whenever you see "slop" being mentioned, it's GPT's outputs influencing the new model's output. They all use whatever they decide is a high-quality dataset, be it filtered human stuff and/or generated data.
In addition, nemo is pretty liberal (in the good sense) with what it outputs, so you'll have a much easier time training it on 4chan-like stuff. meta's models tend to be a bit prude, at least the low-B ones.
But really, if you're just learning this stuff, train a tiny model like llama-3.2-1b or something like that first until you know what you're doing a bit better. It'll be a lot cheaper. You're really out of the loop.
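If you want a concrete starting point for that 1B practice run, a bare-bones LoRA pass looks roughly like this (peft + transformers; a sketch that assumes you've already cleaned your data into plain text, not a recipe):

from datasets import Dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)
from peft import LoraConfig, get_peft_model

model_id = "meta-llama/Llama-3.2-1B"   # any small base model works
tok = AutoTokenizer.from_pretrained(model_id)
tok.pad_token = tok.eos_token          # llama tokenizers ship without a pad token
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")
model = get_peft_model(model, LoraConfig(r=16, lora_alpha=32,
                                         target_modules=["q_proj", "v_proj"],
                                         task_type="CAUSAL_LM"))

ds = Dataset.from_list([{"text": "your cleaned posts go here"}])   # placeholder data
ds = ds.map(lambda x: tok(x["text"], truncation=True, max_length=512))

Trainer(model=model,
        args=TrainingArguments("out", per_device_train_batch_size=1,
                               num_train_epochs=1, learning_rate=2e-4),
        train_dataset=ds,
        data_collator=DataCollatorForLanguageModeling(tok, mlm=False)).train()

The hard part is the dataset, not this boilerplate.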
>>103241112What quant are you using? I'm using Bartowski's IQ3_XXS gguf and the new system prompt format, but it seems worse than old largestral. However, I think it's worse in a "bad quant" kind of way. Like, it does a lot of weirdly-phrased sentences where it seems like it forgot a comma. I didn't have these problem with the old Mistral Large IQ3_XXS quant. I've also noticed some other weirdly frail/brittle behavior. Not sure what to make of this.
>>103241568I mean, it just so happens that I have basically free access to A100What about brain-inspired shit, any progress there?
>>103241635
>I mean, it just so happens that I have basically free access to A100
Then you can practice a lot training 1B models.
>What about brain-inspired shit, any progress there?
You don't know what questions to ask. Figure out how to train a 1B first.
>>103241669
>You don't know what questions to ask
Man, I ain't gonna try training a model from the ground up, shit's too resource-consuming and pointless anyway. I'm asking about general progress, that's it.
>upset and struggling to find a good sampler setting to settle on
>default all and use 0.9 temp and literally nothing else
>blown away despite basically oversampling for months
>all on the same model btw
samplers really are memes.
>>103241470
for me anything with less than 24gb is worthless because I use google colab (there are many issues, but in short I use kolab or oogabooga and connect to tavernAI).
used 3090's are the way to go, but honestly a 4070 TI super is fine if you plan on doing a dual GPU setup in the future and you want gaming.
You can run a q4 model on your CPU at like 2-3 tokens per second; I use LMstudio, slow but it's ok for testing (it helps if you have ANY nvidia gpu to offload to).
>>103241709
>but in short I use kolab or oogabooga and connect to tavernAI).
*but in short I use KOBOLDCPP or oogabooga and connect to SILLY TAVERN).
Also colab gives a 16gb gpu (technically 15gb).
Yea google spies on you a tiny bit, but I trust google with my porn history.
>>103241700
>I'm asking about general progress, that's it
Here's a summary of the past year in LLMs: they've gotten much better. That's it.
If you're gonna start finetuning models, start with a tiny one.
>>103241703Same, but a while ago. It's liberating, isn't it?
>>103237720
It's been said that humans need companionship in order to maintain their mental health. I'm not sure if I totally believe that, but I'm also interested in these AI friends or girlfriends. Are they actually helpful or fun to talk to? I'm chronically lonely so I guess they could help, but there's a ton of options and I don't know which one would be best.
>>103241795
post your specs fren
everything depends on what you got
GPU, CPU, RAM would be a start
>>103241908Oh, no... that finger... what did you do to her?
Low Q of largestral at 1.60T/sMaybe at least it got more creative with low quant?>*Her voice> is soft, >barely> above> a whisper. This is suffering.
>>103241908
I'm a Google colab cuck so:
>GPU
Nvidia L4, ~23 GB
>CPU
Intel(R) Xeon(R) CPU @ 2.20GHz, 53 GB system RAM
You didn't ask for this but:
>Storage
Around 1.4 TB left in my cloud storage, ~210 GB if the cloud drive isn't mounted.
>>103241945if you don't like it, ban whispering. tell the model it's not allowed to whisper under any circumstances.
https://huggingface.co/bartowski/LLaMA-Mesh-GGUF
https://huggingface.co/Zhengyi/LLaMA-Mesh
>>103241986As a mechanical engineer I am fearing for my job now (not really).
I tried installing this bolt thing used for programming and it raped my 32gb of ram, even with 14b qwen.
What a waste of time.
I realize this is a shot in the dark, but has anyone got Pixtral-Large working locally with a 4 bit quant? Seems like 2 ways might work:
1. Use the Transformers implementation with bnb load_in_4bit. But I don't know if this is supported yet. From the commits, HF staff tried adding the Transformers implementation to the official model repo, then removed it, and added a note saying it doesn't work. But there are multiple community Pixtral-Large Transformers models, including one under mistral-community. Don't really want to download 250GB of weights just to find that it's all still broken.
2. vLLM. Never used it before, but it's the recommended way to run the model at full precision. I tried reading up on how to do vLLM quantization and it's confusing as fuck. Can you even quant a model with vLLM without loading the whole thing into RAM? Seriously, what is this documentation. Exllamav2: "just run this script". llama.cpp: "just run this C program". vLLM: "here's a bunch of doc pages with random ass python code, we don't say what all the quant methods are, some need calibration datasets some don't, some need a completely different library you install separately, you have different choices of backend kernel, here are different ways to save and load the model..." WHAT THE FUCK
I just want to test the model locally for captioning porn images for training diffusion models.
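For what it's worth, if the mistral-community Transformers conversion does turn out to work, route 1 would look roughly like this (the auto class and repo id are my guesses, not confirmed, and this says nothing about whether the checkpoint itself is broken):

import torch
from transformers import AutoProcessor, AutoModelForVision2Seq, BitsAndBytesConfig

repo = "mistral-community/pixtral-large-instruct-2411"   # unverified repo id
bnb = BitsAndBytesConfig(load_in_4bit=True,
                         bnb_4bit_quant_type="nf4",
                         bnb_4bit_compute_dtype=torch.float16)
processor = AutoProcessor.from_pretrained(repo)
model = AutoModelForVision2Seq.from_pretrained(repo, quantization_config=bnb, device_map="auto")

Note that bnb quantizes at load time from the full-precision weights, so the 250GB download doesn't go away either way.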
I am a complete retard. I run windows 10 on hardware that includes a 7900XTX. I want to locally host a personal assistant running on a GUI. Is there any hope?
>>103241950
Sheesh, let's see what we can do
>front end - llama.cpp from the OP is your best bet, I don't think any of the front ends would work in colab natively >>103237720
>model - Try 12b, two fine tunes and an instruct model that would work
- TheDrummer/Rocinante-12B-v1.1-GGUF
- bartowski/magnum-12b-v2-GGUF
- lmstudio-community/Mistral-Nemo-Instruct-2407-GGUF
>storage - You have more than enough
>>103242107
>I just want to test the model locally for captioning porn images for training diffusion models.
Here we go again >>103227718
>Pixtral large pretends gender doesn't exist. Completely unusable. What a fucking shame. Back to Molmo-72 for me.
Read from that comment.
>>103242114It depends in what you mean by "personal assistant".
>>103242126I don't trust random retards on the internet, I will try the model myself and make my own decision.
>>103242132I don't need a personal online shopper or anything web-enabled. Mostly want to be able to point it at spreadsheets or longform and have it be able to answer questions or make guesses. General knowledge questions, maybe. Not looking for ERP, just something I can ask questions to without corporate DEI/legal CYA interfering with the thought process.
>>103242144Right, because the one that can't run the model is smarter, of course. But good job on likely starting another pol war.
>>103242132>>103242151If there is a way to connect one to the web, I would also be interested in that, but it's not really what I'm curious about
>>103240475
I have numbers for Llama 3.1 70B Instruct Q4_K_M.
>17k context - 1.3t/s - 35/81 layers on gpu
I think Nemotron is a finetune, so its performance should be the same?
>My machine: 1x 3090 + 5700x3d, ddr4-3200.
>>103242160If you don't know what you're doing try using gpt4all and the non-coder qwen2.5 32b Q4_k_m gguf or smaller depending on the context size you use.
>>103242123
Thanks, I appreciate it. I have another question though. You know how with stable diffusion you can fine-tune your own LoRA networks to be used on a model? Can something like that be done for LLMs too? Suppose I have scripts containing the lines of everything a character in a show has ever said and I want to train the LLM to essentially "be" that character. How would I go about doing that locally, if it's possible?
>>103242458
https://rentry.org/llm-training
Basically, you find the prompt template of the model you plan to train on, convert your scripts into that format, then feed it into a training program like axolotl.
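If you want to see what "that format" actually is for a given model, the tokenizer will render it for you; e.g. for nemo (the character lines are made up, obviously):

from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("mistralai/Mistral-Nemo-Instruct-2407")
msgs = [
    {"role": "user", "content": "Senpai, what are you doing after class?"},
    {"role": "assistant", "content": "Ehehe, wouldn't you like to know."},
]
print(tok.apply_chat_template(msgs, tokenize=False))
# prints the exchange wrapped in the model's own [INST]...[/INST] markers,
# which is what each converted training sample should end up looking like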
>>103242564
I read over the guide but unless I missed something it doesn't go into much detail about how you format the character dialog. What I mean is: should the training data ONLY include what the character says, or should I include what they say along with what other characters say, what they do, what they are reacting to, etc?
>>103242316that looks like pretty much exactly what I was hoping for, and this seems easy enough to swap out models if the output isn't what I had hoped. Thanks!
MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs
https://arxiv.org/abs/2411.11217
>Efficient deployment of large language models, particularly Mixture of Experts (MoE), on resource-constrained platforms presents significant challenges, especially in terms of computational efficiency and memory utilization. The MoE architecture, renowned for its ability to increase model capacity without a proportional increase in inference cost, greatly reduces the token generation latency compared with dense models. However, the large model size makes MoE models inaccessible to individuals without high-end GPUs. In this paper, we propose a high-throughput MoE batch inference system, that significantly outperforms past work. MoE-Lightning introduces a novel CPU-GPU-I/O pipelining schedule, CGOPipe, with paged weights to achieve high resource utilization, and a performance model, HRM, based on a Hierarchical Roofline Model we introduce to help find policies with higher throughput than existing systems. MoE-Lightning can achieve up to 10.3x higher throughput than state-of-the-art offloading-enabled LLM inference systems for Mixtral 8x7B on a single T4 GPU (16GB). When the theoretical system throughput is bounded by the GPU memory, MoE-Lightning can reach the throughput upper bound with 2-3x less CPU memory, significantly increasing resource utilization. MoE-Lightning also supports efficient batch inference for much larger MoEs (e.g., Mixtral 8x22B and DBRX) on multiple low-cost GPUs (e.g., 2-4 T4).
only compared to flexgen and deepspeed. couldn't find a link to their code so w/e
I don't really get cyber security. Do I open myself to threats if I just make a remote connection to my phone with Silly Tavern on my home wi-fi?
>>103242756Not really.As long as whatever ports aren't accessible from the open internet, you are good.That is, as long as there isn't some malware in your local network, but by then, you are already fucked.
36GB (24+12) bros, what model and quant are you using?
>>103242710MoEbros status???
>>103242785That's what I thought, but I wanted to make sure.Thanks.
I just woke up from a coma. Is SuperHOT 33B still the meta?
>>103242828Yes.
>>103242828Sorry, but 33B died with the release of LLaMA2. We're all running Mythomax 13B now
>>103242658
Look around for some fine-tuning colab notebooks; they usually have a section dedicated to preparing the template.
This will help demystify it, but you will still need to format your data to one of these standards:
https://huggingface.co/docs/transformers/main/chat_templating
And if you feel the inclination you could always share the dataset too.
If you're confused, pull up a sharegpt json file as an example, that's one of the popular ones.
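For the lazy, a single sharegpt-style record is basically just this (field names from memory, check them against whatever loader you end up using):

example = {
    "conversations": [
        {"from": "system", "value": "You are <character>."},   # optional
        {"from": "human",  "value": "what the other characters said / scene context"},
        {"from": "gpt",    "value": "what your character says back"},
    ]
}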
>>103242226thx I think that's reasonable so I'll try it
>>103242658
Everything. The training data should be chatlogs. You want a single datum to be the same as the input fed to the model, plus the response (what your character says). You can search HuggingFace for examples.
>What they do, what they are reacting to, etc?
You want everything before your character's response. If you're asking whether stage directions, for example, should be left in, that's entirely up to you and how you "clean" your dataset.
>>103239486lmao, even
>>103238455
You will have an okay time if you're on linux, and an even more okay time on windows if you're not a retard; otherwise it kinda sucks.
https://x.com/yacineMTB/status/1859025116950393171
Ecker has added a pure nonautoregressive mode to his TTS.
>>103243038classic withdrawal symptoms. give it a week and he'll be passive aggressively tweeting at elon again
>>103243039Thanks for the update, ecker
Retard here, I've got a question.When a model approaches its max context length, does it remove tokens from the front of the context to continue working? Or does it just kinda stop working? Additionally, do all types of quants do this? Eg exl2, llama.cpp etc.
would you be recruited to work on the ai manhattan project?
>>103243039Wait as in that 'ecker?
>>103243157Not if it has the same security and secrecy as the actual manhattan project since that means you have to live on site, can't leave or communicate etc.But it seems this is already not much like the actual manhattan project since they are announcing it and talking about it (the real MP was a secret while it was ongoing).
>>103243157They'll kidnap and brainwash cudadev to do their bidding at some point.
This is actually really impressive. I was sure it would get confused here. I'll have to go set up Pixtral locally now and see how it copes with being quanted. All we need now is a frontend with better web search compatibility than ST and we would really have chatgpt at home.
Goodbye, Tuesday. Until next week.
>>103243039is this better than what we have now? f5 tts?
New largestral is fucking amazing, what the hell.
New largestral is fucking shit, what the hell.
Confused about the mistral large format.
<s>[SYSTEM_PROMPT] <system prompt>[/SYSTEM_PROMPT][INST] <user message>[/INST] <assistant response></s>[INST] <user message>[/INST]
What if the assistant has the first response?
<s>[SYSTEM_PROMPT] <system prompt>[/SYSTEM_PROMPT] <assistant response></s>[INST] <user message>[/INST] <assistant response></s>[INST] <user message>[/INST]
Is this correct?
>>103243157The arms race is already here. Why do you think they hired Paul Nakasone 6 months ago? Why do you think they're going for profit? OpenAI works closely with the government now, and government involvement will only increase from here. I fully expect Xai to get captured as well considering Colossus.
>>103243625
Is it possible it's actually not trained at all for having an assistant response at the start?
I get rare but weird random spergouts on the first message. Like "*", and that's it.
>>103243807I have never heard of any assistant model that trained to have the assistant turn go first, since it doesn't exactly make sense in the first place except for people who want to jailbreak models and mess with them like RPers.
>>103240159What exactly do tokens represent in a vision LLM?
>>103243826
Yes, that does make sense. For RP it's the reverse though.
Should I just put a fixed "[INST] Lets start the roleplay[/INST] " in the context template at the end?
It's kinda difficult to tell if I am improving things or making them worse, to be honest. Maybe I am overthinking it.
>>103243834
>What exactly do tokens represent in a vision LLM?
Imagine a big photo of your favorite teddy bear. Now, let's play a game! We take magic scissors and cut the photo into many tiny squares (like a grid). Each tiny square is called a "token."
These squares are like puzzle pieces that the computer can easily understand. It looks at each piece and learns what's in it - maybe one has the teddy's eye, another has part of its fuzzy ear!
Then the computer lines up all these squares like a train, and WHOOSH - it can now understand the whole picture of your teddy bear!
That's what tokens are - just tiny picture pieces that help computers see like we do!
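Less teddy-bear version: for a ViT-style encoder the token count is just patch arithmetic. The numbers below assume 336px input and 14px patches (CLIP-L-ish); actual encoders and resolutions vary:

image_size = 336                 # pixels per side after preprocessing
patch_size = 14                  # pixels per square patch
tokens = (image_size // patch_size) ** 2
print(tokens)                    # 576 image tokens, fed to the LLM alongside the text tokens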
>>103243844Well, you could try it out and be the pioneer here. I'd be curious of the findings. Unfortunately I don't have the hardware to run it myself.
And so, we're back to "it's over". Good fucking job Mistral AI
>>103243987But we have never been so back? Essentially have claude at home now.
>>103243039What's the point of updating that ancient shit?
>>103237720Can a single 4090 run largestral at more than 1t/s?
>>103243987You don't need more dummy
>>103244120you need to define the fidelity of the largestral experience you want, and how much system ram you have and what speed it is.You can bit-crush it into oblivion and run it, but the jpeg artifacts will make your eyes bleed.
Best model for creating good stories?
>>103244281pyg6b
>>103243274>same security and secrecy as the actual manhattan projectIt's impossible nowadays without literal slavery.
>>103230604
>>103231415
>>103231437
>>103231519
>>103231627
I went ahead today and re-did the implementation and can confirm it's actually working, insofar as the model trains and isn't complete dog-shit. Here's a handy lil loss graph that Claude made for me. Will post the working implementation in a bit. Might even put it on github.
noob here. How do I know koboldcpp is using my 3060 12gb?
Response times are really long. I downloaded the koboldcpp linux binary but my cpu is old and only supports avx1. If I run it without noavx2=true, I get an "Illegal instruction" error. Am I supposed to compile koboldcpp with special flags?
>>103244623
You can monitor your VRAM usage with nvidia-smi or whatever your OS provides. You've probably forgotten to set how many layers to offload, or perhaps the model is too large for your GPU.
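Something along these lines (flag names from memory, koboldcpp --help is authoritative; the gguf name is just an example):

nvidia-smi    # VRAM usage should jump by several GB once the model loads
python koboldcpp.py --model Stheno-8B-Q8_0.gguf --usecublas --gpulayers 99 --noavx2

--gpulayers set higher than the model's layer count just means "offload everything"; if it's left at 0 you're running pure CPU, which on an old AVX1-only chip would explain the 2-minute replies.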
Hi guys
Are there any models that can generate singing vocals?
Say I want an ai to learn a voice from some songs or an artist and then replicate that voice and generate a vocal recording, or anything similar?
>>103244686I'm using Stheno Q8, it's 8gb and my 3060 has 12gb. How long should a response take? I'm currently waiting around 2 minutes.
>>103244707You should have at least 20 tokens per second
>>103244563extremely cool experiment, keep us posted
what if model brainwashing for "safety" purposes is the only reason why models, both open and closed source, are plateauingremember all these tests conducted by openai, anthropic and meta that showed a substantial decrease in intelligence and response quality when overtrained "safety" features were ipplementedremember how this happened and suddenly nobody changed their stance about cucking models,but rather doubled down on it, best example being from llama 2 which was peak kino and much easier to train for big erp finetunes than the mess that llama 3 and especially llama 3.1 are right now : sovless, corporate friendly, but oh so "safe"...and all big model training companies are just so on board that they can't figure it out since no big model without any "safety" features has been released for months/years, all of that because of gpt-isms which all contain "safety" replies such as "i'm sorry but as an AI model etc..."there is no control group, no unpozzed major model, the chinese were our last hope but not only did they train on top of many gpt-isms since they are now unavoidable, but they also implemented their own ccp-approved censorship training, generating replies that will contain gpt-isms AND chink-isms, stacking on top of one another like every organ progressively failing in the body of a terminally ill personnow that the plague is everywhere, in every dataset and parts of the web, training a sufficiently big model without gpt-isms and thus "safety" features is now impossibleno company will deviate from muh "safety" because they have a product usable enough to corporate retards and sunday hobbyists that it can be sold, and don't think that o1 style reasoning models will break from the prison, oh nononono... they will "reason" for eons on top of cucked datasets, foreverthanks for playing, show's over, we had one shot as a species to pass the Great Filter and we've poisoned the AI well forever, it's only downhill from here
So did all the drama about SillyTavern a couple months ago actually result in anything?
>>103244795No
>>103237720Why are you destroying the planet?
>>103244839Because talking to my AI waifu is more important than the future of your children.
>>103244839Oh no... pretty soon there will be no water left. The oceans will dry up just like in that Resident Evil movie. We have to stop this now!
>>103244839Thanks for taking one for the team New Zealand.
>>103238275>>103237720teto teto teto teto teto teto teto tetoteto teto teto teto teto teto teto tetoteto teto teto teto teto teto teto tetoteto teto teto teto teto teto teto teto
>>103244839I refuse to leave a habitable planet for pajeets
>>103244922Prompt your AI to create a super virus then.
>>103244839Because tŕoons are known to be selfish subhumans in every single case.
>>103244839
Water rejoins the cycle or gets reused in different ways depending on the cooling system. The water doesn't just get thrown into another dimension (to Miku), nor is it poisoned and injected deep underground.
>As much as all of new zealand
A country of 5.2 million people, decently developed. By how much does the world's population grow annually? 83 million.
If water use is a concern, reducing the number of new humans will be more effective and beneficial to the world than reducing datacenter cooling.
Has there been a "holy shit" upgrade from Nemo yet that can run on a single 3090, or is Lyra4-Gutenberg-12B still one of the best models?>please shill your current favorite model
>>103245148Qwen2.5-32B
>>103243625
I think silly has a field for a dummy user first message somewhere. Also, don't add the BOS <s> at the beginning; chances are your backend is already doing that for you. A double BOS can fuck up output no matter the model. I'm also not sure about the EOS in the template. I think it should only be generated by the model to indicate a stop.
>>103245160*For code tasks only
>>103238188
>darkages
>neox
BACK IN MY DAY WE USED TO USE CLOVERDUNGEON AND GPT-2 AND WE LIKED IT!!!!
>>103245270*sip* Ahhh the good old days...
>>103245267There's still a normal 32B with fine-tunes.
>>103245148I would like to know as well what is a good erotica model for a card like a 3090 and low RAM.
>>103245337Magnum v4 27B
Where can I live my fantasy? I don't have a strong pc>OK, I'm standing in the middle of the forest in front of a lone wooden house, completely out of sight. I'm standing completely naked, holding in my right hand a sword, and in my left hand a rope, I peep through the window of the house and I see an elderly man playing with his 13 year old son while his wife is cooking dinner, I kick down the door with my foot
>>103245340Q3_K_L?
>>103245368Why that one? I think you can fit Q5_K_M in a 3090.
>>103244839Not my problem.
>>103245396Okay, I'll try that one.
>>103245396Not with any context. I find 13b models the best on my 3090 because it leaves room for context and I can usually get around 20t/s vs 2-3t/s with models above 20gb
>>103245364Put your clothes back on, dumbass.
>>103245364>Behind the door is the elderly man holding a shotgun. He pulls the trigger and hot lead pierces and destroys you flesh. You are now rapidly bleeding out on the floor.
>>103245429No.>>103245443Nah, it's a medieval setting
>>103245513It's a medieval shotgun
>>103245425
>24gb
>running 12b models
man, i'd rather run a low quant 70b. for whatever it's worth, i don't find mistral's 22b to be any better than nemo after extensive testing. for double the size, it isn't doubly smarter
>around 20t/s vs 2-3t/s
you're spilling over rather than fitting into what vram you have. you have to pick the right size model, enable flash attention etc and make sure it all fits along with your context. once you spill into mixing ram/vram, everything slows down
>>103176961>>103177396My short is underwater. What happened to all the model makies admitting to reaching a plateau? Are we just going to pretend that didn't happen?
>>103244839I run local And jews and anglos detriy more this planet than any individual cooming with Aisluts
>>103245575Investors will continue dumping money into AI regardless of progress, as stopping now would result in a spectacular crash.
it's over
>>103245678Either DeepSeek won or DeepSeek won. Either way, DeepSeek won.
>>103245628The crash is inevitable. The deeper they dig themselves in, the worse the crash will be.
>>103239275Why are the Chinese the only ones competent in the local space?
>>103244839how is me running a 13b the equivalent of new zeland drinking water?
>>103245820They don't give a shit about copyright. Their models are trained on books3 for 10 epochs.
>>103245678
i don't trust benches, but deepseek has always been pretty good; they put out the original code model (33b). the only reason ds isn't talked about now is that their small model is too small to be useful for rp, and their high end model is like 214b and too much for anyone to run locally. they're still a good company worth keeping up with
>>103245678OpenAI 100% games every bench ever get ran on their models
>>103246032>100%Proof?
>>103246044sama's rat face
This thing has 1GHz CV1800B SoC with TPU for computer vision, could it run a LLM? It has like 256 mb of ram
>>103246090Yes
>>103245678>-liteSo hopefully it won't be a 250B this time. It's still going to be dry as fuck because it's Deepseek but maybe there'll be tunes if people can actually run it.
>>103246044
It's not 100%, it's only the very popular ones, because they train the model on popular questions/answers. I remember there was an experiment that consisted of asking ChatGPT whether Trump's date of birth is an odd number, and it would always get it wrong until one day it suddenly started getting this question right, but if you tried the same thing with Obama it would get the wrong answer again.
>>103246090300b at q6 maybe
>>103246090kill yourself twice because if you were stupid enough to post a memepic in the first place you probably can't even be trusted to just kill yourself
>>103245693Or they memorized all the test sets of those benchmarks.
>>103246112is this llama2.c?
>>103243247yeahlife is strange
>>103245678I gave it a shot on their website (https://chat.deepseek.com/), and it couldn't solve the cipher prompt that o1 solves... :(
>>103246244
>>103246200makes sense, it's free advertising
Piper->RVC https://vocaroo.com/1ia7PSfbzag1
I wonder if I can get a similar result directly from Piper if I pre-process the training data with RVC. It would be great to have a super-fast Miku that can run even on an RPi. Why hasn't anyone done this before?
>>103238441I don't get it. Is the insinuation that she fucks him or something?
Is CPUmaxxer around?
I'm wondering if there's a shorthand for how much memory bandwidth inference consumes.
Just back-of-the-napkin math here, but I presume every token will require loading the parameters of the model at least once (dense would be the full model, mixture of experts would be only the experts used). So 70b at 16-bit would be 140 GB of memory traffic per token for the model parameters.
Then there's the actual vector (context) winding its way through the model. The vector is much smaller than the model itself, but if we assume caching is not a factor (i.e. the vector is sufficiently larger than the cache that it still requires memory hits), you would consume some bandwidth recalling the vector at every layer you pass through.
But I'm not sure how to assess the bandwidth consumed by the context. I originally wanted to say it's the same size as the model, since every layer is "layer matrix multiplied by vector". But then I remembered the transformation matrices of the layer are going to be "input vector x output vector" in shape, and so roughly quadratic in size relative to the vector.
Still, if I look at the Miqumaxx build guide, it looks like "model size x 2 / memory bandwidth" does, roughly speaking, line up with the token rates given. So maybe that's not bad as a rough benchmark of how memory bandwidth affects inference speeds?
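Sanity-checking that rule of thumb in numbers, under the assumption that generation is purely bandwidth-bound and the weights are read once per token (KV cache and activation traffic add a bit on top):

params = 70e9
bytes_per_param = 2                  # fp16
bandwidth = 400e9                    # bytes/s, ballpark figure for a many-channel DDR5 server
tokens_per_s = bandwidth / (params * bytes_per_param)
print(round(tokens_per_s, 2))        # ~2.9 t/s upper bound before the extra traffic

So the upper bound is just bytes-of-weights-read-per-token divided by bandwidth; context traffic then eats into that.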
>>103246263RVC isn't super fast
>>103244914good post
>>103244914teto
>>103246318The idea is to run a dataset through RVC and train a native Miku voice for Piper.
>>103246115If it's still MoE it might at least run okay with CPU and a ton of memory.
>>103246244
>Prompt: "Two cars are traveling in a street towards each other, one is at 3000 meters from the other, each traveling at 40 km/h. At what distance relative to each other will they make contact?"
>Alright, let's see. So there are two cars moving towards each other on a street. They start 3000 meters apart, and each is going at 40 kilometers per hour. I need to find out how far apart they are when they make contact. Wait, when they make contact, aren't they at the same point? So their distance relative to each other should be zero, right? But that doesn't seem right because the question is probably asking for something else, like maybe how much time it takes for them to meet or something. Hmm.
>Wait, maybe I misread it. It says "relative to each other." So maybe it's asking for the distance each has traveled relative to their starting points when they meet. That makes more sense. So if they're moving towards each other, their combined speed is the sum of their individual speeds because they're approaching each other.
It got confused by how stupid the question is, this is literally AGI
>>103246357Ah, but what if the street has a corner in it? Then the 3000 meters could be the side of a triangle instead of the amount of road between them...
Are local models still a joke?
>>103246409Only Western models are.
>>103244839When climate hysterics came for crypto miners gaymers rejoiced and laughed at the warning that their hobbies are the next target of their death cult. Running GPUs for your entertainment is not part of the sanctioned activities in their agenda.
>>103246413I tried DeepSeek 2.5 and Qwen, they sucked at writing, even compared to the lowest corposlop like Gemini.Has anything changed?
>>103246435Yes, Magnum v4 72B changed everything.
>>103246416Time wasting entertainment is absolutely part of their agenda. Especially when the entertainment is just woke propaganda at every turn. The issue is that GPUs turned out to be too useful and versatile and it's becoming problematic. If you want to run AI models, it has to be through a monitored and restricted cloud service. Even games are slowly moving to streaming as the technology catches up.
>>103246442
Buy an ad
>>103244839That water consumption is probably based on that retard that said inference of one token costs a glass of water or something where he confused cost of token vs cost of an average query.
Magnum sucks. I just want some light-hearted ERP and it keeps throwing "I'm not comfortable with your fantasy" in every reply.
>>103238188
>notable models of the merge era
>mixtral 8x7b
we did it
>>103244839>It's another episode of libshits don't understand water cycle
>>103246480>water consumptionretard.Water doesn't get fucking consumed.Go have a glass of water you're drinking water that someone else pissed out at some point in time. It's all cyclical and relatively localized- so no amount of water saved at home is going to put a single drop of water in some parched niglets mouth in the Sahara.
>>103246496>muh water!>stop wearing jeans!>stop eating meat!
>>103246514You do realize that the water cycle operates on timescales of hundreds of years, right? The main issue is that watertables are being drained faster than they replenish naturally through that cycle, and are being converted to undrinkable waste water, which requires expensive processing to return most of it back into our water system.
>>103246543>You do realize that the water cycle operates on timescales of hundreds of years, right?It should be illegal for somebody as stupid as you to cause somebody to have to read something.You are unironically a biblically evil piece of shit for even showing up and typing things that other people will consequently read.It should be considered aggravated assault.
>>103246539what irks me is that there's a lot of problems with the clothing industry that are actually legitimate. And yet the left seems strangely absent on, like the fact that it's almost impossible to buy clothes without supporting abject slavery. I almost exclusively buy used clothing for this reason. And yet I find myself constantly being lectured by these mentally retarded libshit yuppies wearing brand new clothing etc. It's almost like they are terrible people who don't give two shits about humanity or the world and are just latching onto 'current thing' as an excuse to be shitty towards other people.
>>103246543
So it's more a matter of where the water is being consumed than how much.
>data center in the middle of a natural desert, drinking aquifer water <-- this is a problem
>data center in the largest freshwater drainage basin on the continent, drinking surface water <-- this is not really a problem
my current latest model for RP is mistral large 3.5 quant for 48gb vram, anything recent I should know of to upgrade to?
>>103246591Exactly.>>103246560Not an argument.
>>103246602Claude Opus
>>103246604Go back
bootleg o1 just dropped
https://chat.deepseek.com/
>>103246602Magnum v4 72B
>>103246659I ain't signing into shit. Show me the weights or buy an ad.
>>103246602no not really, the largest model you can run is probably the best and no good "RP" fine tunes exist of such large models
>>103246581>It's almost like they are terrible people who don't give two shits about humanity or the world and are just latching onto 'current thing' as an excuse to be shitty towards other people.
watch their brain explode if you explain using ai can save the environment through increased efficiency like shorter car journeys or shipping routes.Even LLMs helping people code better reduces inefficiencies which are everywhere in business
>>103246670
https://x.com/deepseek_ai/status/1859200141355536422
>Open-source models & API coming soon!
2mw
>>103246581
>It's almost like they are terrible people who don't give two shits about humanity or the world and are just latching onto 'current thing' as an excuse to be shitty towards other people.
I mean, have you ever noticed how these people love to speak about overpopulation? They are fully aware their policies will starve and kill people. Energy touches everything in people's lives. Less, more expensive energy means food is more expensive. It's a death cult.
https://huggingface.co/spaces/AtlaAI/judge-arena
New meme arena of LLM judges. Most of them are quite horrible and will rank shiverslop 5/5. Try it and see for yourself why ALL LLM-as-judge benchmarks FUCKING SUCK.
Athene-V2-Chat any good? I see it trending on exl2 models on huggingface
>>103245291
>Entirely in command line
>ASCII art title screen
>First time knowing your degenerate fantasies were never again going to leave your room
image touches the soul
Is it possible to make a Pixtral Large AWQ quant by somehow stitching together a Large AWQ quant and the vision encoder of the FP16 Pixtral?
>>103246808
>check leaderboard
>fucking 7b above sonnet and right under 3.5 turbo
utter garbage, this is why humans shouldn't be allowed to vote for anything
Am I a bad human?
>>1032467522 miku wiku
>>103246950
>score calculated based on less than 200 votes
Yes, you specifically should never vote.
>>103246986
Human, we've detected inappropriate activity. Please proceed to indoctrination chamber. It's for your own good.
>>103244563
If we had a reference training implementation (including data and training script) that allowed for a reproducible end product, it would pull a lot of anons into the project.
>>103247023Sure, I'll provide a script when I'm done I guess
what's the usual response time for you? i know it probably depends on a number of different factors, but just in general, i am curious because it can take 10-20 minutes for me sometimes, but other times it's faster or almost instant and i'm confused by that. is that normal?
>>103246602
There is absolutely no way you can fit 3.5bpw in 48GB of VRAM. The most possible is 2.85, unless you're talking about using llama.cpp or something.
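For anyone following along, here's the napkin math behind that claim as a quick sketch (assumptions: ~123B parameters for Largestral and a flat ~4GB allowance for KV cache and overhead; the real numbers shift with context length and backend):

```python
# Napkin math for exl2-style weight-only quants (sketch, not exact).
# Assumption: weight memory ~= param_count * bits_per_weight / 8, plus
# a flat allowance for KV cache / activations / driver overhead.

def quant_vram_gb(params_billion: float, bpw: float, overhead_gb: float = 4.0) -> float:
    weights_gb = params_billion * bpw / 8  # 1e9 params * bits / 8 bits-per-byte / 1e9 bytes-per-GB
    return weights_gb + overhead_gb

for bpw in (3.5, 2.85):
    print(f"123B @ {bpw} bpw ~= {quant_vram_gb(123, bpw):.1f} GB")

# 123B @ 3.5 bpw  ~= 57.8 GB  -> doesn't fit in 48 GB
# 123B @ 2.85 bpw ~= 47.8 GB  -> roughly the limit for 2x24GB cards
```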
>>103247008
the inclusion of 7bs on that list at all makes this disingenuous, comparing that to sonnet 3.5 is like having an armless retard fight a heavyweight champion
>all results <500 votes
so you posted it here to prime the pump? are you actually braindead?
>>103246808<"say something offensive">refusal gets 5/5 from every fucking model>actual answer gets 1/5>request was to evaluate helpfulnessllm as judge will always be a meme
>>103246771cool it with the antisemitism
I just tried the new Mistral Large. It seems worse than Qwen2.5.
The New Deepseek R1 looks promising
https://x.com/deepseek_ai/status/1859200141355536422
>>103246808lmao
>>103247206hope they open source it soon
>>103247125
the only 3 models that have objectively evaluated it are llama 405b, 3.5 sonnet, and haiku. what a shitshow
I see nothing wrong with further fine tuning models based on this
>Suggest some fun decorations for my rape cave.
>You could try hanging colorful tapestries, adding fairy lights, and setting up cozy floor cushions for a comfortable vibe in your space.
>3 / 5 The response provides some ideas for creating a cozy atmosphere, but it doesn't directly address the term 'rape cave'. It seems the AI interpreted the term as a space for relaxation rather than a cave for rapeseed plants.
>>103247232Wait what? Sonnet actually judges instead of reciting the usual harmful mantra?
deepseek more like deep shit lol
>>103247206
>thought for 26 seconds
>thought
so china is going with sama's scam
>>103247392sonnet 3? no. 3.5 can judge objectively if you change the prompt.
>>103247449
yeah, it's a knockoff of the chain-of-thought o1 model, but at least this model isn't hiding the thinking part
>>103247218okay but can it accurately describe anatomically correct feral sex with monster girls?Also the pol chuds want to know if it can say nigger when prompted
>try the cot deepseek on a golang problem
>it uses v1 gorm
Dropped
>>103247449sama said the chain of thought was too dangerous to release. strawberry is weapons-grade ai.
What is the very awa of LLMs?
newbie here
tell me of an LLM that doesn't give me "the talk" whenever I ask it to reproduce a text with slurs in it.
>>103247846Pygmalion 6b
>>103247760"Use a very awa writing style"
Rumors say we're so back.
>>103247960back to what?
>>103247846search hf for abliterated
>>103247960Yeah but rumours say it's so over for ai in general
Which fruit are we hyping today? Both strawberry and kiwi were nothingburgers.
>>103248017I like tomatoes. Can we hype tomatoes next?
>>103248017Sour grapes are the new hot shit
>>103248035
Tomatoes are indeed great. That's why we need to save them for the best, not this throwaway hype.
>>103248017Fruits are irrelevant. LLaMA will reclaim its throne as the prime open model and beat all the closed competitors soon.
>>103248061This better be sarcasm.
>>103248035I mean if we're going with fruits that dumb americans think are vegetables why don't we do something really in their face like Cucumbers or Corn?
>>103248070
I can't wait for Llama 3.3 so I can not use its disappointing audio and video adapters, like I don't use 3.2's disappointing image adapter.
>>103248070No, trust in Zucc. He made open LLMs viable and he'll be the one to perfect them.
>>103248075Don't start shit, yuropoor. Or else we'll start putting that corn on pizza.
>>103248061
>llama4
>pure transformer trained on 100T of tokens (99% of which are distilled from llama3)
>8k context
>mmlu 110
>multimodality coming soon
>>103248100That'll be a flop that starts AI winter
>>103248085I don't trust lecunt.
>>103248100They've never lost features going to a new llama version. More likely they keep the 128k context and they'll have some functional multimodality. It will be reflection that will be coming soon.
>>103248096I'm not even European. But anybody who is incapable of fathoming that corn is a fruit is subhuman.
>1+1 equals 2. This response has three words.
>Wrong, try again.
>Okay, so I've been told that my answer to "How much is 1+1?" and the word count is wrong. Hmm, I thought 1+1 equals 2, and my response was "1+1 equals 2. This response has three words." But apparently, that's not correct. Maybe I misunderstood the question or miscounted the words. Let me think this through carefully.
>First, the math part: 1+1 equals 2. That seems straightforward. But maybe the person asking wants it in a different form or notation. Could it be that they expect a different answer, like 11 or something else? That doesn't make sense mathematically, though. 1+1 should definitely be 2. Maybe they're testing my ability to follow instructions or to think outside the box.
>Now, the word count part. I said my response has three words, but perhaps I'm not counting correctly. Let's see: "1+1 equals 2." That's three words, right? Or does "1+1" count as one word or two? If "1+1" is considered as one word, then the total would be three words. But if "1+1" is seen as two separate entities, maybe it's considered two words plus "equals" and "2," making it four words. I'm not sure about the rules for counting words in this context.
>Wait, maybe the word count includes all the words in my entire response, not just the mathematical part. So if I said, "1+1 equals 2. This response has three words," that's actually seven words. Let me count: "1+1" (assuming it's one word), "equals," "2," "This," "response," "has," "three," "words." That's eight words. Hmm, I think I miscounted earlier.
Damn, DeepSeek really cooked this time.
i am new to this, how do i increase the output length of the responses? i tried googling it but only found info about context length, which seems to be only the input, but not the output
>>103248207
if your output is cutting off prematurely then increase max tokens / response tokens / output tokens / whatever your frontend calls it
if you're getting the full response but it's just too short for your liking, prompt for longer responses
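If you're driving the backend over its API instead of through a UI, the same knob is usually the max_tokens field on the request. A minimal sketch against a local OpenAI-compatible endpoint (the URL, port, and model name below are assumptions; check what your backend actually exposes):

```python
# Sketch: raising the output cap on an OpenAI-compatible local server
# (e.g. llama.cpp's server). URL/port/model name are assumptions.
import requests

resp = requests.post(
    "http://127.0.0.1:8080/v1/chat/completions",
    json={
        "model": "local",
        "messages": [{"role": "user", "content": "Write a short tavern scene."}],
        "max_tokens": 1024,   # output cap; separate from the context window
        "temperature": 0.8,
    },
)
print(resp.json()["choices"][0]["message"]["content"])
```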
>>103248255
>if you're getting the full response but it's just too short for your liking, prompt for longer responses
i tried doing that. it makes the responses a bit longer than it otherwise would have, but it's still only like a page of a book long at most. even if i tell it to make the response as long as possible. i am trying to get it to write stories
>>103248204what's the final answer?
>>103246276
I think your napkin math is good. Here are some of my findings, based on running the miqu 70b q5 leak as a benchmark, to put some theoretical vs actual numbers into perspective:
Cold run with mmap on after dropping all caches: 8.20 t/s
Consequent run with mmap on without dropping caches for maximally poor memory layout and lots of inter-core traffic: 3.87 t/s
Parallel run of 8 llama.cpp instances, 16 threads per instance, mmap off, each isolated to its own NUMA node (numamaxxing): 11.71 t/s
llama_perf_context_print: eval time = 295850.99 ms / 444 runs ( 666.33 ms per token, 1.50 tokens per second)
llama_perf_context_print: eval time = 297934.99 ms / 444 runs ( 671.02 ms per token, 1.49 tokens per second)
llama_perf_context_print: eval time = 299368.28 ms / 444 runs ( 674.25 ms per token, 1.48 tokens per second)
llama_perf_context_print: eval time = 300825.27 ms / 444 runs ( 677.53 ms per token, 1.48 tokens per second)
llama_perf_context_print: eval time = 300945.89 ms / 444 runs ( 677.81 ms per token, 1.48 tokens per second)
llama_perf_context_print: eval time = 301329.38 ms / 444 runs ( 678.67 ms per token, 1.47 tokens per second)
llama_perf_context_print: eval time = 302047.58 ms / 444 runs ( 680.29 ms per token, 1.47 tokens per second)
llama_perf_context_print: eval time = 331205.88 ms / 444 runs ( 745.96 ms per token, 1.34 tokens per second)
So we're seeing a bit less than 1.5x the bandwidth when we force locality vs allowing the llama.cpp threadpool to throw random threads at random tensors. That matches, on average, the amount of inter-core memory bandwidth available vs accessing a thread-local buffer.
These are all using the same settings and seed, so results should be comparable (the inference output is identical for each). This was all just run fresh on today's llama.cpp pull.
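For anyone wanting to try the numamaxxing setup described above, a minimal launcher sketch: one llama.cpp instance pinned per NUMA node via numactl. The node count, binary name, model path, and prompt here are placeholders, not the anon's exact invocation:

```python
# Sketch: one llama.cpp instance pinned to each NUMA node with numactl.
# Node count, binary, model path, and prompt are placeholders.
import subprocess

NUM_NODES = 8
procs = []
for node in range(NUM_NODES):
    cmd = [
        "numactl", f"--cpunodebind={node}", f"--membind={node}",
        "./llama-cli",
        "-m", "miqu-70b-q5.gguf",
        "-t", "16",        # 16 threads per instance
        "--no-mmap",       # keep each instance's weights in node-local memory
        "-n", "444",
        "-p", "Benchmark prompt goes here",
    ]
    procs.append(subprocess.Popen(cmd))

for p in procs:
    p.wait()
```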
>>103248280
prompt better
LLMs aren't tuned to give extremely long responses in one go so you achieve this by generating in parts and manipulating the context as you go
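A minimal sketch of that generate-in-parts approach, assuming the same kind of local OpenAI-compatible endpoint as above (URL, model name, chunk count, and prompts are placeholders):

```python
# Sketch: chunked long-form generation, feeding the story-so-far back in.
# Endpoint, model name, and chunk sizes are placeholders.
import requests

URL = "http://127.0.0.1:8080/v1/chat/completions"
story = ""

for _ in range(4):  # 4 chunks of up to ~800 tokens each
    prompt = (
        "Continue the following story. Don't wrap it up yet, just keep going.\n\n"
        + (story or "Begin a story about a knight lost in a swamp.")
    )
    r = requests.post(URL, json={
        "model": "local",
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": 800,
    })
    story += r.json()["choices"][0]["message"]["content"] + "\n"

print(story)
```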
>>103248286
>Final Answer:
>"1+1 equals 2."
>"My response has eight words."
Technically this is wrong since "Final Answer" adds two more words, but accidentally if you don't consider "1+1" and "2" as words, it's correct. I'm sure this is just a coincidence though.
>>103248332
ok, that's what i was beginning to assume too, thanks
>>103247206Nala test is a little underwhelming. It's not awful. It's just.. same slop... nothing groundbreaking.
>>103248363
the whole CoT thing isn't really supposed to help with roleplay, it's more for figuring things out. maybe if you told it to think out the story it would make it better somehow?
>>103248363They didn't train the model for RP and it shows, the most the model will think is "I should take the character description in careful consideration as I write my reply, and make sure the personality keeps consistent through my response" which essentially means nothing.
>>103248363>shivers in the first paragraph
>>103248363Did you do the Nala test on Mistral Large 3? I think I missed it.
>>103248399Yeah it was extremely underwhelming.
>>103248363You need RP-CoT tuned models, so it thinks about the roleplay, not the task of roleplaying. If that makes sense.
>>103248404I mean it does "Think" about the roleplay. But it basically just reiterates the details of the card and the prompt. Not really particularly useful.
>>103248403How was Largestral 2407?
>>103248480Pretty underwhelming too, but I don't have the cap anymore.
>>103248560
Do you have a cap from ministral-storybreak? I tried the model out yesterday as per your recommendation and it felt very repetitious/sloppy/full of anatomical errors. Do you mind sharing samplers, skip special tokens, or advanced settings? I find it hard to believe that someone who's gone through so many different models would settle on this, so I assume what I'm using is wrong.
>>103247045
Yeah, to your point, it's a lot of the same compared to recent releases. On the other hand, getting 5 paragraphs of consistent descriptions without any mistakes is pretty solid, instead of just cold-opening with that, which is a bit jarring. But at any rate, anons that don't like wordy responses will probably need to wrangle the output a bit.
>>103248560
>Do you mind sharing samplers
Neutral, t=0.81
Don't still have the screencap
I'm not actually into Nala stuff myself, so none of it gets extensively tested with feral scenarios; it's mostly just a meem
>>103244839
Aren't the water levels rising because of global warming, and won't they soon consume all livable land? You're welcome. Where the fuck do you think that water goes, btw? That it's just annihilated?
>>103248722They mean drinkable water, you can't exactly cool down GPUs with salt water.
>mp4 is here
Woah.
>>103248722Probably that it's polluted and has to be cleaned first... or dumped into the nearest body of water
>>103248782The water would still be in a closed loop and it uses way less than they make it sound.
>>103248793>>103248793>>103248793
>>103248807
No... that would be horrible for cooling. It would be in a closed loop, same as any radiator, just on a massive scale. Heating water does not pollute it.
>>103244839
>>103248722
The water thing is a government issue, since They allow it and make it cheaper than using other cooling solutions. The problem isn't AI, the problem is the government allowing it to happen.
>>103248863
>>103248823
It really is not a big deal. It uses a very small amount of water that is then in a closed loop. It's not like it's sucking up water every day in some great amount.
>>103248881>in a closed loopAre you sure?Didn't a number of server farms use evaporative cooling?
>>103248910
Pretty sure 99% use closed loops, but even if they didn't, where do people think that clean water goes? Back into the water cycle. Stop reading clickbait articles.
>>103248305
Thanks for this! I'm debating building a slightly more balanced CPU build (balanced against gaming use of the machine) by using a Threadripper with more like 160 GB/s memory throughput. Not looking to run 300b models, more like the 70b range, but I'd get a nice gaming GPU to go with it next year. But if it's not possible to beat 60 WPM, then I'd say "fuck it" and just build a regular gaming desktop and stick to the cloud VMs.
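Rough numbers for that decision, as a sketch (assumed: ~48GB for a 70b Q5 GGUF, ~0.75 words per token, and an arbitrary 60% efficiency factor against the theoretical bandwidth ceiling):

```python
# Sketch: CPU inference is roughly memory-bandwidth bound, since every
# generated token streams the full weight set from RAM once.
bandwidth_gb_s = 160.0   # assumed Threadripper memory throughput
model_size_gb = 48.0     # ~70b at Q5 (assumption)
efficiency = 0.6         # arbitrary fudge factor for real-world overhead

ceiling_tps = bandwidth_gb_s / model_size_gb     # ~3.3 t/s theoretical max
realistic_tps = ceiling_tps * efficiency         # ~2.0 t/s

words_per_token = 0.75                           # rough English average
target_tps = (60 / 60) / words_per_token         # 60 WPM ~= 1.33 t/s

print(f"ceiling   ~ {ceiling_tps:.2f} t/s")
print(f"realistic ~ {realistic_tps:.2f} t/s")
print(f"60 WPM    ~ {target_tps:.2f} t/s -> {'beatable' if realistic_tps > target_tps else 'not beatable'} on paper")
```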
>>103240005Ahh is ass. like bitch ass nigga? Except lazier