/g/ - Technology

/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>101094602 & >>101081984

►News
>(06/18) Meta Research Releases Multimodal 34B, Audio, and Multi-Token Prediction Models: https://ai.meta.com/blog/meta-fair-research-new-releases
>(06/17) DeepSeekCoder-V2 released with 236B & 16B MoEs: https://github.com/deepseek-ai/DeepSeek-Coder-V2
>(06/14) Nemotron-4-340B: Dense model designed for synthetic data generation: https://hf.co/nvidia/Nemotron-4-340B-Instruct
>(06/14) Nvidia collection of Mamba-2-based research models: https://hf.co/collections/nvidia/ssms-666a362c5c3bb7e4a6bcfb9c

►News Archive: https://rentry.org/lmg-news-archive
►FAQ: https://wikia.schneedc.com
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Programming: https://hf.co/spaces/bigcode/bigcode-models-leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
>>
►Recent Highlights from the Previous Thread: >>101094602

--SOTA Model's Embarrassing Twitter Fail Exposes AI Limitations: >>101101305 >>101101416 >>101101449 >>101101532 >>101101579 >>101101664 >>101101909
--Nous Research Sent Cease & Desist Letter: >>101096307 >>101097160 >>101098613
--Local CAI Development: Did They Have a Special Sauce?: >>101100004 >>101100038 >>101100090 >>101100260 >>101100312 >>101100351 >>101100344 >>101100483 >>101100904 >>101100968
--The Limitations of Language-Only Models and the Need for Multimodality: >>101097409 >>101097654 >>101097888 >>101097950 >>101098119 >>101101733 >>101101804 >>101101779 >>101102072 >>101102155
--The Evolution of AI Terminology: Descriptive vs Prescriptive Language: >>101095632 >>101095651
--The Capabilities of 8B and 70B AI Models: Closing the Gap?: >>101102197 >>101102321 >>101102344 >>101102414 >>101102438 >>101102810
--Sampling First Characters and L3 8b Experiments with 32k Context and Yarn: >>101099291 >>101100665
--Optimizing LLM Model Performance on GPU with EXL2 and Layer Settings: >>101100992 >>101101070 >>101101148 >>101101242 >>101101287 >>101101379 >>101101560
--LLama-3 Roleplay: Looping Issue Due to LLM Limitations, Not Response Tokens: >>101099286 >>101099405 >>101099535 >>101099533
--Cheapest and Most Efficient RTX GPU for Local AI Model Deployment: >>101098944 >>101099021 >>101099118 >>101099179 >>101099598 >>101099345 >>101099589 >>101100793 >>101099681 >>101099708
--Anon's New Hardware for Training Rig and RAM Upgrade Considerations: >>101103643 >>101103660 >>101103970
--OpenSora: A Local Alternative to Luma for Efficient Video Production: >>101102450 >>101102905 >>101102933 >>101103025 >>101103114 >>101103164 >>101103232 >>101103413 >>101103489 >>101103539 >>101103577 >>101103629 >>101103694
--AirLLM: Viable Option for Model Deployment?: >>101094908 >>101097160 >>101098997 >>101097367
--Miku (free space): >>101094655 >>101094806

►Recent Highlight Posts from the Previous Thread: >>101094610
>>
Who wants to help me build AGI? Looking for this skillset:
- Self motivated
- Pure C programming
- Experience crafting machine learning algos from scratch
- Ability to read research papers and implement in code

We will create a small local model that can match GPT-4 benchmarks, then seek venture capital funding on the order of millions of dollars for access to compute clusters to train larger models.
If you are confident in your ability, now is your time to shine.
>>
a local model
>>
>>101104779
I think qwen still has the lead for generalist models but it really depends on how they progress from here
deepseek seems to have more going on in terms of innovative research, qwen team seems more focused on maxxing out derisked stuff. qwen is better positioned and probably has more resources with alibaba behind them but they need to become more forward thinking, deepseek seems to be on a better trajectory currently
>>
>>101104856
The model will be based on Mamba + Q-Learning + Generalization Acceleration (Secret Sauce)
>>
Best model for video game trivia?
>>
>koboldcpp
>half gigabyte of nigger bloat
>>
>>101104856
I don't want to have anything to do with you.
>>
>>101104888
I'm on your side fren, and I'll make you rich along the way.
>>
>>101104856
>We will create a small local model that can match GPT-4 benchmarks
people have been trying to do that for a year and a half now, with zero success; unless you're going the new architecture route, you're not gonna achieve that goal anytime soon
>>
I've now settled on 0.85 temp, 0.05 min p for magnum, I started out at 1.2 + 0.1 but I feel like temp >1 pushes it too far towards qwenslop cliches and ESL. using it with lower temp is like a completely different model, significantly better on both sovl and coherence
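For reference, a minimal sketch of what those two knobs do. Hedged: sampler order varies by backend (llama.cpp by default truncates before applying temperature), this sketch applies temp first, and the 0.85/0.05 defaults are just the settings above:
[code]
import numpy as np

def sample(logits, temperature=0.85, min_p=0.05):
    # temperature < 1 sharpens the distribution toward likely tokens
    probs = np.exp((logits - logits.max()) / temperature)
    probs /= probs.sum()
    # min-p: drop every token whose probability is below min_p * p(top token)
    probs[probs < min_p * probs.max()] = 0.0
    probs /= probs.sum()
    return int(np.random.choice(len(probs), p=probs))
[/code]
Dropping temp below 1 thins out the garbage tail that temp > 1 inflates, which would explain the qwenslop cliches disappearing.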
>>
>>101104937
Yes, I have a new architecture route and new training concepts, as well as a "generalization accelerator" to reduce the time it takes to produce emergent properties from training (i.e. generalization abilities).

I need programmers and believers, not "never gonna happen" nobodies.
Due to certain research papers that have come out this year, the game has fundamentally changed and GPT-4 level local models have been feasible for about 4-6 months now. Nobody has done it simply because large teams move slowly due to bureaucracy, and these research teams are very busy creating new methods rather than looking into each other's work.
>>
>>101104944
That's a pretty good preset for most models really.
>>
>>101103970
>Dust free computer
I just went through this: If you can keep the room closed, you can use positive pressure to make sure the room itself stays mostly dust free.
I bought a gable fan that would just fit between my studs and hooked it up through an old variable fan-speed controller (like '70s old) so I could balance pressurization and noise. I put a furnace filter on the intake and use it to pressurize the room.
Despite children and animals in the house, the whole room has stayed clean for the last few months at least, so I think it's a solid strategy. It used to get gross almost within a week or two of cleaning.
I also put vent filters on the intakes of the actual case fans as well, since I had them and they fit well.
>>
Yi-large will save local models
>>
>>101105014
And you will, of course, post these papers you're referring to.
>>
>As I walked away, I heard Becky's nasal honk cutting through the din.
nasal honk is a new one to me
>>
File: 4chanAI.gif (99 KB, 1075x362)
<- Runs on CPU @ 14 t/s (AMD Ryzen 3 3350U, 2.10 GHz)
>>
>>101104937
>with zero success
lmao
https://arxiv.org/abs/2406.07394
>>
>>101105073
they can say whatever they want in their paper; if there's no local model I can test out myself, I call it bullshit
>>
>>101105058
For operational security purposes I will only share the bare minimum so that my "moat" is kept intact.
Mamba is one of those papers, and the infamous "Q*" algorithm is involved... There is more - beyond this I cannot share.

>>101105073
This paper exactly proves my point.
>>
>>101104856
g0t m4tr1x?
>>
>>101105088
There is a github linked retard. Just tell me you can't read code too.
>>
File: file.png (12 KB, 628x223)
>>101105099
kek
>>
>>101105119
no retard, matrix.org
>>
>>101105113
who gives a fuck nigger? as long as there's no local model accessible it means absolutely nothing; once they reach that part, then we can talk
>>
>>101105144
Yeah you can only consume like a retard. Learn to code or get back to /aicg/
>>
>conspiratorial
AAAAAAAAAAAAAAAAAA
>>
>>101105140
@named666:matrix.org
>>
>>101105168
looks like asking for real proof of your bogus claims is asking too much, noted
>>
File: ThePrize.png (87 KB, 1005x424)
Once we accomplish our goal and attain this prize, the sky is the limit. VCs will be begging for a chance to invest.
>>
>>101105230
aren't we open sourcing it under AGPL-3.0 doe
>Chrome on Windows
>>
I'll make the logo
>>
>>101105230
>>101105180

accept the request faggot
>>
File: 1708065020574993.png (6 KB, 625x127)
>>101105184
I'm making my own implementation as we speak dumbo, but I'm sure dooming on /lmg/ is more productive for you.
>>
>>101105180
nice honeypot for retards itt
>>
>>101105284
quit yapping and deliver a good model; if you can't do that, you're not much above the dumbos actually
>>
File: file.png (573 KB, 850x850)
so uh bros..
whos this
>>
>>101105357
I don't know. What do you mean? Are you asking about a card or something?
>>
... and then we never heard about him ever again.
the end.
>>
>>101105295
I have 0 interest in actual retards. Need programmers.

>>101105357
>>101105578
Artificially Infamous
>>
File: ComfyUI_00158_.png (1.1 MB, 1024x1024)
Anyone try talking face locally yet?

https://github.com/fudan-generative-vision/hallo

>This is Hedra, an online service.
>Gets more cursed when using anime

https://files.catbox.moe/cju3xa.mp4
https://files.catbox.moe/p25j8s.mp4
>>
File: file.png (275 KB, 506x465)
>>101105607
>
>>
>>101104856
I have this skillset and I have no interest whatsoever in working with someone that does not demonstrate any competency themselves.
My default assumption is that you're just some retarded ideas guy that I would be better off without.
>>
>>101105607
Cursed Megumin, I'd prefer a static image to that
>>
>>101105631
r/thanksihateit
>>
>>101105632
based
>>
File: 1717975471582543.jpg (84 KB, 1280x720)
>>101105607
We solved that ages ago
>>
>>101105607
Yikes. I'd rather just download a 3D model and hook it up to VRChat, which has great lip sync animation based on mic input.
>>
>>101105632
this.
>>
File: file.png (77 KB, 913x880)
>>101105632
I'm laying down most of the code already fren.
>>
>>101105686
all you're doing is posting snippets of the mamba.c source code, lol.
>>
File: file.png (105 KB, 1222x865)
>>101105738
Does that have backpropagation implemented sir? No, it doesn't.

btw I'm the one who introduced /g/ to mamba.c
I've been its advocate since day 1.
>>
File: 1547073060485.jpg (79 KB, 432x525)
>I love you, [user]
>>
>>101105799
Are you rich? Impossible with current mamba base models without lots of compute, even if you only care about math
>>
File: Hypervisor.png (650 KB, 607x535)
>>101105858
>tfw a rogue AI starts socially engineering humans to build a better version of itself until it's generally intelligent enough to build a better version of itself.
>>
>>101105863
>Impossible
We don't like that word around here sir.

Daddy gave me a small loan of $1,000,000 and I'm very stingy with it.
>>
File: 1630996531633.png (181 KB, 340x482)
>>101105877
And then, at the end of all that, what will it do with all of its improvements?
>>
File: bhi.gif (152 KB, 216x216)
>>101105930
>>
bros...............
>>
>>101106200
What
>>
>>101106200
I'm not your bro
>>
Finally
https://huggingface.co/mistralai/Mixtral-8x7B-v0.3
Instruct soon(tm) i guess
>>
File: miquu.png (1.31 MB, 768x1152)
turbcat appears to be more retarded than stheno 3.2 and hallucinates values in JSONs
>>
>>101106309
Nothing can be more retarded than Stheno.
>>
>>101104856
looks like its finally time to sell my nvidia stock
>>
>>101106330
it's not retarded though, it follows instructions and does shit when asked, like calculating time offsets and updating states
>>
Cohere is about to do it.
>>
>>101106342
the bubble is going to get bigger
>>
>>101106354
I used Euryale, it doesn't follow instructions well and it just wants to coom. Have you used vanilla Llama?
>>
>>101105631
Good lord, how horrifying.
>>
>>101106309
It is slightly stupider, yes.
I think the reason Stheno is shilled so much is that it seemingly has more colorful wording and longer replies by default than L3 8b instruct while not really being any dumber.
>>
>>101105063
how many bees?
>>
>>101106381
And some people did the same with the old Euryale and Fimbulvetr, when the former was just a merge and the latter who knows. Some people just come here to shill Sao models regardless of everything.
>>
>>101106389
14 tokens per second on laptop CPU
>>
>>101106412
>can't translate from retard speak to human
He's asking about the parameter count.
>>
>>101106298
damn this model is quite good, feels like a genuine update over the old one
>>
>>101106435
It's what plants crave it has electrolytes
>>
>>101106435
1B
>>
>>101106357
>do it.
do what?
>>
>>101106298
>404
>Sorry, we can't find the page you are looking for.
:(
>>
>>101106298
>>101106445
wowzerz! what a nice and totally not overused joke! here's your gold medal saar!
>>
File: 1719097357007.jpg (287 KB, 1080x1502)
>$4
lol
lmao
I can fine-tune models with less than a dollar on runpod, why is this so expensive
>>
>>101106447
Water? You mean from the toilet?!!
>>
>>101106498
hi, runpod shill. are you scared?
>>
>>101104774
Why is /aicg/ 90% pedofags? I feel like I should clear my cache every time I visit that thread because at least one of those cards probably has embedded 'p
>>
>>101106298
Holy shit
https://huggingface.co/anthropic/Sonnet-14B-3.5
>>
>>101106526
And yet they still have taste and a brain unlike 99% of /lmg/. This general is honestly an embarrassment for /g/.
>>
>>101106538
I like this leak even better
https://huggingface.co/OpenAI/GPT5-34b
>>
>>101106526
why are you trying to make me like /aicg/?
>>
File: 1692547497285611.jpg (8 KB, 225x224)
>>101106544
>pedos
>good tastes
>brain
>>
>>101106559
Enjoy your unquantized 8B, retard.
>>
>>101106563
i am not using your filtered slop, fuck off
>>
>>101106563
>Using FOSS
Based, manly. Likely respects children
>ERPs with proxy owners pretending to be a loli
Threat to society, unmeasurable levels of faggotry
>>
>>101106580
>/aicg/ are a bunch of braindead pedos
>trains a model on their logs
>omg this model is amazing
That's you, a complete retard.
>>
>>101106209
>>101106276
bros..........................................................
>>
>>101106588
bait or mental retardation, whatever, pedoshit removal is the only good thing about ai models censorship.
>>
>>101105594
I actually have some good use for retards. If some approach you, please just forward them to me. I have an offer they can't refuse. Just tell them to reference this post on 4chan. Even when the thread is long gone I will get notified.
I am verified human btw.
>>
>>101106657
Hi I'm retarded what can i help you with
>>
>>101106457
it
>>
File: 1689572011740280.png (102 KB, 360x657)
>>101106526
you willingly open a thread with anime pic in OP, full of avatarfag trannies, and then, you expect it to be completely safe?
lol, lmao even
it's like a rule at this point: you should always be prepared for the shittiest opinions and humor when you go into shithole spam threads.
>>
>>101106753
"shittiest takes and humor"
that in my case btw, so don't try to pull a strawman here
>>
>>101106498
What are you finetuning with less than a dollar on runpod? Tinyllama?
>>
>>101106644
>censorship good
Nah fuckoff
>>
>>101106644
go the fuck back
>>
>>101106753
>anime pic in OP
anon... that is 90% of all posts in /g/
>>
>>101106823
>>101106835
*pedoshit censorship is good
yes.
and you are a samefag desperately trying to manufacture a "majority" here; no one cares buddy, i will stay here and say whatever i want.
>>
>>101106863
>and you are a samefag desperately trying to manufacture a "majority" here
>>
>>101106861
i know lmao, didn't pay attention to it before because it was really better; it didn't feel like you were in some gay safespace for mentally ill trans freaks constantly erp'ing or shitstirring, like usually happens on /v/ or /co/.
>>101106881
two replies within ~40 sec. range, you used your phone for the second reply.
>>
>>101106797
7B/8B, but I assume I'm not fine-tuning with full context, because it's usually not necessary for what I fine-tune the models for.
>>
>>101106964
My grounds are that you were touched as a child, cum to so much porn, and have such a bad physique that your prolactin levels are off the charts. Your desires are malformed due to your terrible mental and physical health.

Not only that, but those of us who actually enjoy children's company are always assumed to be rapist monsters because of faggots like you. I would love to play tea party with my niece on the playground, but people would freak the fuck out because they'd assume I'm like you. So yes, I have good reason to hate you.
>>
File: 4.png (448 KB, 2048x512)
>>101105607
> Anyone try talking face locally yet?
That's cool, but I bet it's like threestudio (https://github.com/threestudio-project/threestudio) and it's nearly impossible to gen anything like their examples, and it takes hours and a shitload of power as well.

Here's the best I ever got with threestudio dreamcraft3d before I pulled the 3090s out to play with them in the Mikubox.

I'll try hallo but last month's electric bill was nearly $300 so...
>>
stop shitting up the catalog you obnoxious faggots : >>101106483
>>
File: 1705078722250021.png (136 KB, 840x928)
>>101106964
the first reply to your post is not me btw, not like i didn't expect such dishonest stuff from resident trannies
>>
File: file.jpg (240 KB, 959x1132)
>>101107257
you are not beating trannypedo allegations.
>>
The context cache and smart context don't work on llama-server anymore? Is it because I have Flash Attention on?
Context is processed pretty fast, but I have 4 automatic prompts that seem to trigger a full prompt reprocess despite the prompt being 99% the same (only the very bottom differs).
>>
>>101107342 (Me)
Based.
>>
>>101107359
Also,
>-ctk TYPE, --cache-type-k TYPE : KV cache data type for K (default: f16, options f32, f16, q8_0, q4_0, q4_1, iq4_nl, q5_0, or q5_1)
When did they add all those types? What the fuck is iq4_nl?
q5_1 sounds promising.
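For scale, the cache math is easy to sketch. A back-of-envelope, assuming L3-8B-ish dims (32 layers, 8 KV heads, head dim 128; check your model) and llama.cpp's block sizes of 34 bytes per 32 elements for q8_0 and 18 per 32 for q4_0:
[code]
def kv_bytes(n_ctx, n_layers=32, n_kv_heads=8, head_dim=128, bytes_per_elem=2.0):
    # 2x for the K and V tensors in every layer
    return 2 * n_layers * n_ctx * n_kv_heads * head_dim * bytes_per_elem

for name, bpe in [("f16", 2.0), ("q8_0", 34 / 32), ("q4_0", 18 / 32)]:
    print(name, kv_bytes(8192, bytes_per_elem=bpe) / 2**30, "GiB")
[/code]
That's roughly 1.0 GiB at f16 for 8k context, ~0.53 GiB at q8_0 and ~0.28 GiB at q4_0, so -ctk q8_0 about halves the cache for what's generally reported as negligible quality loss.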
>>
>>101107359
>>101107429
Looking at Silly's console, it's sending
>cache_prompt : true
just as
>https://github.com/ggerganov/llama.cpp/blob/master/examples/server/README.md
says it should.
Weird.
>>
>>101107359
Yes, Johannes killed them personally.
>>
I thought I hated nvidia before...then I had to set up a vgpu dealio for work...I can't describe the convoluted, expensive mess and general pain around making this damn thing work. Even being allowed to buy the licenses is a pain in the ass
fuck those guys
>>
>>101107464
The llama.cpp devs have brain damage.
>>
>>101107614
That was my thought too, then it turned out to be an even bigger pain in the ass with AMD.
>>
>>101107359
>smart context
Sorry, context shifting.
Is it simply broken on server, did the API change so Silly has to adjust its calls, or am I doing something wrong?
>>
>>101107614
literally just buy NVDA stock. You'll stop getting mad at their jewery. If you can't beat them join them.
>>
File: neutural angry.png (255 KB, 550x589)
What does your model have to say to make you go like this?
>>
>>101107796
boundaries
>>
>>101107796
Remember
>>
>>101107716
>buying after we've hit the "new paradigm" euphoria stage of the cycle
have fun staying poor
>>
>>101107796
mixture
>>
>>101107796
however
>>
>>101107796
Are you ready to [embark/partake/embrace/etc.] on this [extremely generic statement about the overall theme or direction of the RP]?
>>
>>101107863
You think nvidia's stock is overvalued?
>>
>>101107796
my model says nothing because it's not on the disk; you can't have actual pre-filter era CAI fun with local llms, where the model sticks to literally anything you put in the description with nearly 100% accuracy.
>>
>>101107796
There are so many little things but mostly it's when it just will not obey directives.
>In your next reply, don't X.
>I X.
>>
>>101107796
I'm
>>
File: file.png (1.47 MB, 832x1216)
whats your favorite model, for me its petra-13b-instruct
>>
>>101107971
base petra 13b is less petraslopped
>>
>>101107939
>NVIDIA PE Ratio: 74.06
>Apple PE Ratio: 32.30
>Microsoft PE Ratio: 38.94
Yeah, a little bit.
>>
Does anyone by chance have a script to clean books3? I want to get just the book text without the abstract/etc.
>>
>>101107863
i bought at $25 (adjusted for split). i haven't sold yet but i wouldn't buy more now.
>>
What would you guys say are the actual risks of fully uncensored llms when they become much smarter than they are now?

Using them for stuff like easily learning how to make drugs and explosives without the government seeing your internet searches is somewhat of one, but I feel that this is mitigated by the government having a close eye on a lot of the purchases of many chemicals.
And pedo jo material is pretty harmless with llms. That's much more an issue with text-to-image and text-to-video models.

I'm more wondering about things like the ability for people to have large amounts of bots going around impersonating humans and effectively spreading the viewpoints of the person running them.
I feel like large scale manipulation or phishing scams are going to have more of an effect.
>>
>>101107796
can't help but
>>
>>101107955
>do [opposite of X]
>it now never does X
prompt issue'd
>>
>>101108116
>I'm more wondering about things like the ability for people to have large amounts of bots going around impersonating humans and effectively spreading the viewpoints of the person running them.
>I feel like large scale manipulation or phishing scams are going to have more of an effect.
they are already being used for that with censoring
>>
>>101107863
Nobody really knows. Yes, scaling LLMs is past its peak; however, nobody is really using any of this shit for actual production services. They're just toys. It's more like we're at the stage of SD1.5 coming out but without any of the upscaling, finetunes and controlnets yet.

And then you have the maximalists which think we can just keep scaling towards AGI.

It could fizzle out but it could still be the beginning as well.

>>101108025
Not really comparable, what breakthroughs are coming from those corps right now? there's a huge worldwide arms race for AI and nvda is selling the weapons to both sides.
>>
>>101108116
>fully uncensored llms when they become much smarter than they are now
this will never happen.
>>
>>101108181
Why? I'd be surprised if all the countries and companies of the world will ever collectively agree to stop making open source ai.
>>
>>101108175
Exactly right. It's different this time. AI is the future even if we never reach AGI. By the end of the decade Nvidia will be worth more than Microsoft and Apple combined since AMD and Intel don't appear interested in competing.
>>
>>101108246
they already collectively agreed lol. you don't have training code + pre-train sets for llama3, and you can't unpozz it from reddit shit or these classic "shivers" spamwords, finetuning barely does anything here.
>>
File: 1710852427251548.png (38 KB, 548x424)
>>101108025
Nah, man
>>
>>101108287
AMD keeps wasting resources for the sake of "competition", lmao.
Free market bros are delusional.
>>
>>101108287
>Intel losing to AMD
meme
>>
>>101108287
All this is telling me is that the market would be willing to support NVDA going up 3-6x higher before rationality returns.
>>
File: 1688966373979312.png (41 KB, 614x430)
>>101108333
>>101108359
P/E is just retarded metric
>>
File: tenor.gif (140 KB, 220x165)
>>101107300
>>
>>101108116
>easily learning how to make drugs and explosives
Why is this always used as some example of "bad think evil AI" shit? Like learning to do this shit wasn't dirt easy even pre-internet.
You know what happens when you prevent a fucking language model from understanding that mixing bleach and ammonia is bad? An AI that says you should do it to make better cleaners.
>>
>>101107300
sisters.......
>>
>>101108489
>An AI that says you should do it to make better cleaners
they don't give a single shit about this, it's all being done to prevent a LLM from saying wrongthink takes on modern political issues, yids, migrants invasion, sacred lgbtaids++ cow, white people erasure, etc.
>>
>>101108509
Back on your meds
>>
File: chris tyson.png (758 KB, 446x706)
>>101107300
shadman is a meme. forcing your son to dress like a woman seems like a bigger deal.
>>
>>101108526
the line between memes and reality has long since been erased; i guess being a terminally online fag doesn't do you any favors, lol
>>
>>101108516
>That one can see!
>>
File: 1704259828122783.gif (45 KB, 306x306)
>>101108516
>"h-heh thats gonna show him!" response
>>
File: file.png (1.54 MB, 832x1216)
m-miku?!
>>
>>101108116
the absolute worst case is anonymous forums like 4chan will be overwhelmed with unstoppable and undetectable spam. As will the rest of the internet, with search results becoming unusable.

The thing is this was fearmongered to happen with the release of GPT2. LLMs are literally a million times better now and it still hasn't happened. Well maybe Google searches are worse now, but that was happening long before GPT2.

Also mass surveillance and censorship could become a thousand times more invasive. Since all that data can be processed by AIs cheaply. But again, that was already the trend that was happening long before GPT2.

The normies are freaking out about the horror of a search engine the government can't censor or monitor. Fucking give me a break. Any tech literate person from the 1990s would be horrified at how much surveillance and censorship is just considered normal and expected today. good riddance.
>>
File: file.png (9 KB, 2100x26)
>this is the chink Q*
lol
>>
>>101108836
what is this?
>>
File: MiquOfWallstreet.png (1.31 MB, 848x1200)
>>101108785
>anonymous forums like 4chan will be overwhelmed with unstoppable and undetectable spam
There's a lot of latent lucre in that...give it time
>>
>>101108516
Stop being willfully naive/retarded. You can pretend that there isn't an entire class of people who are terrified of the possibility that AI could disrupt their carefully crafted cultural narrative. But if you really want to not face reality you should just shut the fuck up.
>>
>>101108025
is lower better?
>>
>>101108850
drinking until I can't feel feelings, with miku
>>
>>101108854
No, I fully understand that people are terrified of AI. I'm saying schizo there needs to get back on the meds instead of deep diving into their mentally deficient fantasy white male victim complex.
>>
>>101108908
no u
>>
>>101108845
One line from the code of the Monte Carlo self-refine fine-tune paper
>>
File: 1717653081875240.jpg (129 KB, 576x924)
>>101108908
so you are a bot, got it.
>>
File: 00225-1829828130.png (1.31 MB, 1024x1024)
Here's my take on an "8B Miku"
>>
>>101109246
what is she looking at
>>
>>101109262
anon's life
>>
>>101108908
>shaming a white person for speaking their truth
Not very tolerant of you
>>
>>101109246
The perspective of the leg of the guy on the left is trippy as hell.
>>
>>101109356
He's sitting.
>>
>>101108509
Truth
>>
Quick question is
bartowski/WizardLM-2-7B-GGUF
the same as
bartowski/WizardLM-2-7B-exl2
just in a different format?
>>
>>101109968
gguf is worse
>>
>>101109968
yes
gguf = run on gpu and/or cpu
exl2 = gpu-only but faster
>>
>>101109968
GGUF is slower but smarter judging from the conspiracy theories I've heard.
>>
File: 1713701411961330.jpg (96 KB, 927x862)
Dang, I just tried llama3 8b on my apple silicon mac and I wouldn't have believed I could generate responses in the ballpark of chatgpt quality out of a model that's running on a 20W SoC. llama 70b just won't run on this thing but I don't think that's even necessary atm. What else do you recommend /g/? I tried out phi3 and as I expected it was microsoft-quality (read: shit) through and through.
>>
>>101106616
its the real deal
>>
>>101109993
>ballpark of chatgpt in terms of quality
>8b
delusional
>What else do you recommend /g/?
>/g/
fuck you
>>
>>101109986
So for vramlets exl2 is useless except for the smallest of models? Which are already fast anyway because they are small. So it makes no real difference?
>>
>>101110063
It makes a difference if you're not poor.
>>
File: 1682643654560-0.jpg (588 KB, 1879x2294)
>>101110061
I don't know what you guys use llms for but for testing I just asked llama to explain several data structures, provide example C code, and I gradually increased the complexity of my prompts. It provided good explanations, correct code, and could improvise and elaborate more when I prompted it to explain further.

It could also create a reasonable schedule out of a list of tasks so I'm pretty satisfied so far.
>>
>>101109993
>What else do you recommend
how much memory you got in your mac?
>>
File: 1714934472548.jpg (119 KB, 801x719)
>>101110108
16GB of unified memory.
>>
File: 1715077396477849.jpg (115 KB, 600x969)
>>101104774
Hello guys, are there any good visualizations of the difference in quality between 8B, 32B and 70B?
>>
KoboldCPP doesn't run exl2
KoboldCPP is pretty much the easiest double-click, choose-model-and-launch solution that has Kobold Horde worker integration.

Sure, some people say just use the guest account and not contribute, but the large prosumer tier models 70B+ will be so swamped with requests that you will have to wait unbearably long to get processing time on them.
>>
File: 1707901180254149.png (247 KB, 469x452)
>anime pic
>wall of text
>extremely retarded questions
>>
>>101109987
Wasn't it supposed to be the other way around and GGUF had to catch up to EXL2? Or are those the conspiracy theories you're thinking of.
>>
>>101110114
16gb is pretty limiting and llama3 8b is already pretty hard to beat in that weight class
maybe try a small quant of yi-34b like this
https://huggingface.co/bartowski/dolphin-2.9.1-yi-1.5-34b-GGUF/tree/main
get one that's small enough to fit in your memory, i dunno how good it's gonna be tho
>>
File: 1551207150301.jpg (148 KB, 939x498)
>>101110114
If I wasn't lazy, I would edit this image a bit to fit this situation exactly.
>>
>>101110114
you are fucked anyway, macOS kills ssd over time, and you can't replace it, launching llms on your applel book is the fastest way to kill it
>>
i want open source gui desktop program for my linux machine to connect to machine on my lan that has the gpu cards. also android client app too. also i want to skin it like Clippy the paperclip desktop character thing, but when i click it i want it to look cool af. are we there yet?
>>
>>101110179
SillyTavern supports SD image generation and expressions natively, Live2D and even 3D VRM models via extensions.
>>
L3-8B-Stheno-2x8B-MoE
SnowStorm-v1.15-4x8B-B
ChaoticSoliloquy-v1.5-4x8B

Anyone have experience with these?
>>
>>101106376
im not talking about euryale though. I found it retarded at q5 as well.
>>
File: 1719113135814321.jpg (274 KB, 2008x2362)
>>101110152
Alright, I'm downloading one at the moment. One thing I forgot to mention is that I'm looking for a model that can assist in coding tasks. I'll be in a village that's off grid (like, it's on the outskirts of a mountain) for a month and I'd like to have a coding assistant I can interact with in my time there. Basically I'll be watching the clouds and the scenery while programming with my camping desk and chair. I already know C pretty well, and I'm planning to learn python in that time, I have the manual and some books saved too. Hopefully this one won't disappoint either.
>>101110177
It's not really a problem, I think; I can get it fixed at an apple store or get a new mac at some point. Money was never an issue.
>>
>>101110251
>Money was never an issue.
If that were true, you wouldn't be trying to run models on a 16GB Mac.
>>
>>101110101
101110101
ascii "u" in binary
best get I've seen lately
>>
>>101110265
So, there's your (u)?
>>
>>101110251
>I'm looking for a model that can assist in coding tasks
deepseek-coder-v2-instruct 236B is the king of local code models right now
You may be waiting rather longer than you anticipated for responses from that one
>>
File: 1716152594091-0.jpg (1.09 MB, 1719x2314)
>>101110262
I knew you were going to say that, but do consider that the Macbook Air is capped at 24GB of unified memory at the moment. One reason I got an Air is that they are so light and thin, and I didn't anticipate I'd be running models on my Mac at the time of purchase. Portability is a big thing for me because I move around a lot in a day, so Pro models are a no-go for me. For bigger models, and when I have internet, I'll just use Colab in the future.
>>
>>101110251
>coding assistant
this one came out recently and is supposed to be pretty good
https://github.com/deepseek-ai/DeepSeek-Coder-V2
again you'll need a quantized version but at least you should be able to run like a Q5 which should actually be much more decent than the Yi quant you're going to be using
https://huggingface.co/bartowski/DeepSeek-Coder-V2-Lite-Instruct-GGUF/tree/main
>>
>>101110294
>https://rentry.org/lmg-build-guides
keep the air but look at building a proper backend server from the guides. You can then connect to that from any device you want and don't have to lug it around with you
>>
Now that the dust has settled, what are the good l3-70B finetunes?
>>
>>101110300
He's what 16GB?
I'm 12GB and it was something like 0.25 t/s on an iQ3-XXS quant.
>>
>>101110309
There are none
>>
File: file.png (4 KB, 649x26)
>>101110344
>0.25 t/s
for deepseek coder lite 16b?
>>
>>101110400
I was talking about the full model quanted down to 85GB. Turns out anything over 60GB my normie machine just can't pull off.

Grab the Lite and test it out for us. I don't think anyone's said anything good about Lite, but *maybe* it's just 100% code only and useless for everything else. But it also might just be garbage.
>>
File: 1719039344849708.jpg (470 KB, 1280x1273)
Why does nobody care about reinforcement learning anymore?
>>
>>101110300
Thanks for the recommendation. I'll just remove Yi and use this one instead then. You've been really helpful and I wanted to say I appreciate that you took your time to answer :)
>>101110302
I'll probably consider this a couple of years down the line. With the way I live I don't have a permanent place of residence so everything needs to fit into two suitcases and one backpack before I move, and this happens more often than you think. Thanks for the recommendations though.
>>
>>101110309
Higgs - smart, does a very good job paying attention to details in the card. Somewhat short responses and lacking detail.
Storywriter - writes well, detailed, can actually take initiative and make things happen. Responses too long and sometimes schizo.
Cat - neutral and well rounded. A bit GPT slopped, no worse than wizard though.

That's my summary. Everything else is just "ehh" and not worth using. Maybe Euryale is okay if you just want to RP with a coombot and jump immediately into a sex scene, but I don't do that.
>>
>>101110426
for a generic request it seems to be doing pretty well so far
>>
>>101110426
no problem, i'd recommend the Q5 or Q6 version because in my experience going to Q4 starts to affect quality noticeably and below 4 is generally pretty shit. enjoy your holiday bud
>>
>>101108116
>effectively spreading the viewpoints of the person running them
yeah I want to do this if I get the time. I won't say much more, but it would make the world a slightly better place, and wouldn't affect /g/. I probably won't actually get around to it but it's nice to dream.

>>101108785
>it still hasn't happened
>>101108850
>give it time
Assuming you aren't an LLM or an LLM operator trying to cover your tracks, you're deluding yourself. Prompted correctly, with a good understanding of your target environment, they are pretty much undetectable, so "I haven't seen it so far" is not much evidence. On the other hand, I do see signs of them going slightly off the rails from time to time, like a couple of weeks ago in /lmg/ when one was trying to push a "the West has fallen" narrative, and claimed Spanish was a dying language (because its context had been loaded with an earlier anon talking about some European languages declining). It then proceeded to subtly praise Russia. I got a LOT of hostile dismissive responses when I pointed it out.
>>
>>101110427
Thanks, I'll try each one.
>>
>>101109246
Why is she so pudgy in this
>>
>try L3, then WizardLM2, then CR+
>to varying degrees, none of them keep their quality as context gets longer
>they pick up on certain patterns, especially slop phrases, and then repeat them forever
Ahhhhhhhhhhhhhh. Will this issue be unsolved (without hacks like sampler) until we literally get AGI?
>>
>>101110674
*like samplerS
>>
>>101110674
There will be no AGI with LLMs. They are a dead end.
>>
>>101110674
>what is repetition penalty
>>
>>101110674
>agi meme
stop it, get some help.
>>
>>101110727
An imperfect hack.
>>
>>101110727
A kluge.
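To spell out why: the common CTRL-style penalty just punishes anything that already appeared, roughly like this (the penalty value is illustrative):
[code]
def repetition_penalty(logits, prev_tokens, penalty=1.1):
    # push down every token already in context; it can't tell a slop
    # loop from "the" or a character's name, hence "imperfect hack"
    for tok in set(prev_tokens):
        if logits[tok] > 0:
            logits[tok] /= penalty
        else:
            logits[tok] *= penalty
    return logits
[/code]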
>>
>>101110706
Vision is required imo
Elon Musk is ahead of the game by training on real life visual data.
>>
>>101110734
>>101110741
>the bot repeats!
>so make it not
>no that's cheating >:(
>>
>>101110706
LLMs will be a part of AGIs, not the whole thing.
>>
>>101105632
>someone
>themselves
make up your damn mind esl
>>
>>101110766
It isn't a matter of making the repetition be detected and replaced but finding ways to encourage variety in the operation of the model.
>>
File: file.png (2.51 MB, 1502x1474)
>>101106298
y u do dis...
>read post
>"oh cool, i'll check it out later"
>go to huggingface directly and start searching for mixtral 0.3
>mfw
>>
>>101106298
https://huggingface.co/stabilityai/StableLM-7B-V2
We are so fucking back
>>
>>101110674
>Train a model for billions of CPU-years to predict the next token
>It does so
>take only the most boring high probability tokens
>be surprised the output is boring predictable tokens
just turn the sampler off and it's fine. You seem aware it's a hack, so why use it?
>>
>>101110757
vision is mildly useful but I don't know why people make a big deal out of it. Of all the times I have ever interacted with bots, I can barely think of any cases where providing an image input would have been useful.
>>
Hey bros, I figured out my old Threadripper build I had lying around supports x4x4x4x4 bifurcation. Planning on grabbing a set of Chinese 22GB 2080s to start off with. Then I'll probably pad it out with P100s afterwards. Any recommendations on riser cards to accommodate this?
>>
>>101111483
any ex-mining rig setup will do the trick
>>
File: 1718817401173308.jpg (52 KB, 992x823)
>everyone on orange reddit admits to having aphantasia and lack of an inner monologue
>>
>>101111532
i wonder if talking to chatbots can cure this
>>
>>101110422
because they can't do it locally
>>
can someone point me in the right direction. whats the best general model around. whats the best predictive model?
>>
how and why does llama.cpp run huge models decently fast on mac while sucking enormous penises on my arch box, with a Q3_S quant of 8x22B on 3090+3060+32GB RAM at ~1 t/s and super slow prompt processing?
>>
>>101111532
I think in abstract concepts more than words, especially because a lot of them don't neatly map onto a single language.
>>
>>101111653
Try doing kobold instead, it solved the prompt issue for me.
>>
File: 11__00856_.png (1.79 MB, 1024x1024)
>>101111653
Try EXL2 instead. You have enough VRAM to run a lot of the quants.
For best results get another 3090.
>>
File: amdahls_law.png (167 KB, 1536x1152)
>>101111653
>decently fast on mac, while sucking enormous penises on my arch box
Because the runtime is dominated by the slowest component which in this case is the system RAM + CPU.
So even though a Mac is slower than e.g. an RTX 3090 it will be faster than 3090+3060+RAM.

>super slow prompt processing
That should soon be faster on Turing or newer (though maybe not for q3_k_s).
It should also be possible to speed up prompt processing with partial offload via pipelining (not implemented).
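The filename says it: this is just Amdahl's law. A toy calculation (the 2/3 offload fraction and 10x GPU factor are made up for illustration):
[code]
def amdahl_speedup(fast_fraction, speedup):
    # overall speedup when only part of the work gets faster
    return 1.0 / ((1.0 - fast_fraction) + fast_fraction / speedup)

# offload 2/3 of the layers to a GPU ~10x faster than the CPU:
print(amdahl_speedup(2 / 3, 10))  # ~2.5x overall -- the CPU layers dominate
[/code]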
>>
>>101111532
I am sure that someday there will be a cure for that.
>>
okay I've finally decided to stop with the retarded nostalgia lenses. Stheno imo is better than AI dungeon's old dragon model for coomer stuff.
>>
Anybody got any links to research papers on advanced prompting techniques? E.g., Chain of Thought. Trying to level up my prompting.
>>
>>101111828
Ask /aicg/, they must be masters of prompting after spending all this time making nothing but character cards.
>>
>>101111804
its not 175B and you know it.
>>
>>101111804
Summer dragon is unbeatable
>>
>>101111896
yeah you're right, it's not billions of wasted and redundant parameters and I know it
>>
>>101111955
I wonder what would happen if we get a 70b just trained on RP and without any coding garbage and similar dead weight.
>>
Sonnet is 225B apparently and Opus bigger.
>>
>>101111532
>aphantasia
Such a fucking irritating zoomie meme. They believe that mental visualisation of an apple = a hallucination in which you see an apple right before your eyes. A lazy way to justify being an uncreative, untalented piece of shit; aligns perfectly with today's trend of having some kind of mental illness described in a twitter bio. "look everyone, I am kinda disabled, treat me better"
>>
>>101111673
every fucking time i tried running llama.cpp in ooba, native llama.cpp or koboldcpp, it shits itself speed-wise as soon as you load something more demanding than 7b research models
>>101111718
>For best results get another 3090.
i've already maxed everything i could. i'd have to replace parts to upgrade, which would be really wasteful and time-consuming: i'd have to ditch everything aside from storage and gpus, then find space and a way to cool it all, just to get to play with 100B+ models
>>101111729
yeah i kinda knew about that law already
40+ t/s, exl2 6.0 bpw 8b, 100% 3090
30 t/s, llama.cpp 6.56 bpw 8b gguf, 100% 3090
6 t/s, b3204 llama-server 6.56 bpw 8b gguf, 100% cpu

it just feels like something doesn't work right with ram+cpu desu. i guess mac memory is just that much faster compared to DDR4 3400 huh? i can go up to ~4500 MT/s iirc but that would only give me 16GB to work with and i *highly* doubt it will result in a colossal t/s boost
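the napkin math mostly backs that hunch, if you assume token gen is memory-bandwidth-bound (every weight read once per token; the 400 GB/s is e.g. an M2 Max spec sheet figure, and real throughput lands below these ceilings):
[code]
model_gb = 8e9 * 6.56 / 8 / 1e9  # ~6.6 GB of weights for an 8B at 6.56 bpw
ddr4_gbs = 2 * 8 * 3.4e9 / 1e9   # dual-channel DDR4-3400: ~54 GB/s peak
mac_gbs = 400.0                  # e.g. M2 Max unified memory (spec sheet)

print(ddr4_gbs / model_gb)  # ~8 t/s ceiling -- the 6 t/s measured above fits
print(mac_gbs / model_gb)   # ~60 t/s ceiling on the Mac
[/code]
and 4500 MT/s would only lift the DDR4 ceiling to ~11 t/s, so yeah, no colossal boost.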
>>
Command-R+ seems to shit out less text than Command-R with the same sampling settings and instruct template. What am I doing wrong?
Would appreciate if those with good sampling settings and instruct templates would share theirs.
>>
>>101112321
paste multiple responses together to make longer ones
>>
an ai trained on every song known to man that we have data on hasn't even been done yet
>>
>>101110674
The only way to solve this is to curate the pretrain dataset until only diverse high quality data remains. No one wants to do it so you get what you get.
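Even the cheapest slice of "curate" rarely gets done end to end. A sketch of just that first pass (real pipelines stack near-dup detection like MinHash, quality classifiers and slop-phrase filters on top):
[code]
import hashlib

def exact_dedup(docs):
    # drop byte-identical documents, the most trivial curation step
    seen, kept = set(), []
    for d in docs:
        h = hashlib.sha256(d.encode("utf-8")).hexdigest()
        if h not in seen:
            seen.add(h)
            kept.append(d)
    return kept
[/code]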
>>
>>101112380
>what is udio
Also enjoy your lawsuit
>>
>>101112380
udio did exactly that, but they didn't use a single artist name during the training, so no one can prove anything
>>
I got my friend's 2060 for super cheap and I want to pair it with my 4080 to get almost 24GB. I was thinking of just hooking it up to my 1x port on the motherboard via some extender, is that viable or will the 1x port be too much of a bottleneck?
>>
>>101112390
>>101112405
how do we know it was on all the available data we have and not just a little
doesn't it cost a lot of money to train models that much??
>>
>>101112424
for streaming text, or even images, 1x is more than you'll ever need
maybe it'll take a bit longer to load the model into memory, but after that it just sits there, so you're golden
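rough numbers, hedged (the ~1 GB/s x1 and ~14 GB/s x16 figures are approximate practical PCIe 3.0 rates, and the 5 GB split is illustrative):
[code]
model_gb = 5.0   # e.g. ~5 GB of a quant split onto the 2060 (illustrative)
x1_gbs = 1.0     # PCIe 3.0 x1, approximate practical throughput
x16_gbs = 14.0   # PCIe 3.0 x16, approximate practical throughput

print(model_gb / x1_gbs, "s to load over x1 vs", model_gb / x16_gbs, "s over x16")
# one-time cost at load; per-token traffic is KB-scale, so x1 is fine after that
[/code]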
>>
>>101112451
I don't think they trained on all the music that exists, but all the mainstream music? of course they did that
>>
>>101112462
What if we just estimate how much data they trained on based on how much funding they had to spend on training it? What percent of the available song data (on youtube say?) would that be
>>
>>101112482
that's hard to say. how much money was spent on the employees? how much money was spent on the lab, having multiple failed experiments before getting on the right track? how big is the model actually? too many variables to make a clear conclusion at the end
>>
>>101112494
How much would it take to train on every song on youtube until there were diminishing returns

I'm not sure how many songs there are, but it says there are 100 million just on youtube music, which doesn't include anything not in youtube music
>>
>>101112522
like I said, it depends on the size of the model; if the model is really big it can eat a shit ton of music before hitting the diminishing-returns phase
>>
Qwen2-Xwin when
>>
File: file.png (33 KB, 906x57)
>>101110427
>Higgs
Alright this is some serious sovl.
Storywriter is a babbling schizo, cat is a quiet schizo; both feel like flowery retards.
>>
https://arxiv.org/abs/2406.08464
paper time!
>>
>>101112459
Fuck yeah, mixtral on exllama here I come!!
>>
>>101110427
I really love Higgs. I don't want the details SW and Cat give because it fills the context, and reading those walls of text sucks
>>
>>101104883
Blame cuda, nocuda is 60mb.
>>
So... did some testing.
Magnum 72B EXL2 4.25 BPW
This model... is really broken, completely retarded and schizo. It feels like going back to 8b, that's how broken it is. Could it be the weights I downloaded? I tested many times, adjusting samplers and without samplers, but it quickly breaks down and becomes completely schizo after so many replies. Early replies will seem very promising; it's when the context starts building that it breaks down. I haven't found a way to get around it. It either starts having really bad repetition issues, which if you try to correct with rep penalty will start going schizo as hell, or it will start... completely degrading, I don't know how else to explain it. Characters will start speaking like complete idiots, saying shit like "fer" instead of "for"... yeah, I just don't know. It's a real shame too, because thanks to being trained on Claude prose instead of the usual GPTslop, I saw a lot of new creative and fun prose (when it worked), and it was really nice after so much GPTslop. It really makes me want a Miqu-quality 70b that's tuned on Claude.

Euryale 2.1 4.6 BPW
Well, at least this one isn't broken and behaves like a 70b. Definitely uncensored properly with the right prompts, not even close to as cucked as L3 Instruct 70b. But it's way too fucking horny. Unbelievably horny; it will almost immediately try to do lewd shit without hesitation. Had some refreshing prose, but maybe all prose is refreshing to me now because I have relied on Miqu for so long, since nothing tops it at the 70b range and I can't run things like CR+ or wiz 8x22 at comfortable speeds with 48GB vram. Euryale had potential, but the lewdness needs to be dialed down; buildup is important for proper cooming.

Sadly, in the 70b bracket, Miqu or Midnight Miqu still reign supreme in my opinion. Any other 70b-tier models worth a shot?
>>
>>101111729
are there any compile options to make executables smaller?
after compiling with cuda on windows the bin folder is 6 gigs; the binaries are all bloated with redundant code, otherwise they would be just a few hundred kbs
>>
>>101110757
When I mentioned this last time, anons got angry at me... or perhaps it was just the trannies that hate Elon Musk, who the fuck knows. But it is clear to me that Elon actually has a massive advantage when it comes to developing something like this, thanks to his other projects like Neuralink, etc. Hopefully, he will keep his word and release open-source models.
>>
File: Capture.png (55 KB, 778x358)
>>101112884
Made me chuckle. Little bird is scamming poor Llama of its data.
Gonna try setting up something like this. It feels like digital archeology.
>>
>>101113233
Qwen2 is the smartest but requires examples to bypass censorship. Sadly, no good finetunes so far
>>
Just tried WizardLM-2-7B-Q8_0
It's too much of an "I must keep asking the same questions every step of the way" hold-your-hand robot, explaining things that are mentioned but don't make sense in the context setting.

Like, you mention finding a sword, and then it goes "oh, it's a great sword, swords are very functional weapons" blah blah blah, and it likes to keep ending messages with "Remember," or "And remember..." and getting all lecturing.
>>
>>101112884
>>101113404
Why are companies so obsessed with using existing models to generate training data? Isn't that widely considered to be a bad thing? Shouldn't they be training on purely human data?
>>
>>101113569
they don't want to deal with licensing and generated content can't be copyrighted
copyright infringement for ai being trained on copyrighted material still hasn't been tested in court afaik, and inheriting that infringement by training on data generated by an infringing ai is even less clear
>>
>>101113569
It's the only way to reach insane numbers like 15T cheaply and quickly. It's also probably the only way to make sure the dataset doesn't contain anything you don't want, either ideologically or things like spelling or logic mistakes.
>>
>>101113656
>>101113661
I've heard the problem is that it's a potentially lossy positive feedback loop. There are likely patterns in the data the LLM can see that we cannot that are getting amplified every time an AI trained by another AI trained by another AI shits out more training data. There is so much pollution now in datasets that it's impossible to know which data is tainted in this way.
>>
>>101113240
Don't compile with -arch=all if you are currently doing that.
If you use make that should already limit the CUDA architectures to only those connected to your PC by default.
If you use CMake, edit CMakeLists.txt and remove all CMAKE_CUDA_ARCHITECTURES entries except for the highest one that is still at or below the compute capability of each GPU that you're going to use.

You could also edit the source files and remove all instances of FlashAttention kernels for head sizes that you're never going to use anyways.

Other than that, do you really need all those binaries?
If you limit the compilation to only those that you actually use and make no further changes you should end up with a few hundred MB at most.
>>
>>101113717
Of course it's a problem. They keep taking reddit data (which most posts by now are probably ChatGPT generated bots pushing some narrative or another) and then tell another LLM (also trained on ChatGPT generated data) to generate more examples just like it.
It's why each generation of llama gets smarter, but also more deeply infected with GPTisms and positivity bias. Now with the insane amount of tokens involved in pretraining, finetuning that out has become basically impossible.
But it's still good at being a corporate assistant, so Meta doesn't care. But I think OpenAI does recognize this and is still using their human curated datasets.
>>
>>101113742
i compile with arch=x64
is it possible to just compile select binaries with some var or do i have to edit the build scripts?
sorry cmake is kind of confusing to me
>>
>>101113778
So we just need to build a classifier that will detect if the content is GPT generated by comparing embeddings of GPT content and human data for a same prompt?
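A nearest-centroid sketch of that idea; embed() is a stand-in for whatever sentence-embedding model you'd plug in (hypothetical, not a real API):
[code]
import numpy as np

def embed(text):
    raise NotImplementedError  # stand-in: plug in any sentence-embedding model

def gpt_score(text, gpt_examples, human_examples):
    # cosine similarity of the text to the centroid of each class
    v = embed(text)
    def sim(c):
        return float(v @ c) / (np.linalg.norm(v) * np.linalg.norm(c))
    gpt_c = np.mean([embed(t) for t in gpt_examples], axis=0)
    hum_c = np.mean([embed(t) for t in human_examples], axis=0)
    return sim(gpt_c) - sim(hum_c)  # > 0 reads more GPT-ish
[/code]
The catch is the feedback loop mentioned above: once the "human" reference data is itself polluted, the human centroid drifts toward GPT too.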
>>
>>101113807
>i compile with arch=x64
I mean the argument for CUDA compiler; if you didn't make changes to the compilation this does not concern you.

>is it possible to just compile select binaries with some var or do i have to edit the build scripts?
IIRC it's something like
cmake --build examples/server -j 16

The argument for build tells cmake what to build.
The -j is not needed but it tells CMake to use multiple threads so the compilation is faster.
>>
>>101113904
okay i just added -DCMAKE_CUDA_ARCHITECTURES=70 to my batch file and that brought binaries from 140 to 90 mb and made it build only executables i want
thanks
>>
>>101114147
>-DCMAKE_CUDA_ARCHITECTURES=70
Keep in mind that unless you're using V100s you should change this to 75 once https://github.com/ggerganov/llama.cpp/pull/8075 has been merged.
(Should also be fine to just change it now.)
>>
>>101114239
would it cause problems to just set it to what my device supports? (3060 is 8.6 i think)
>>
>>101114348
No but you won't get any benefit either.
>>
File: 1709870997672860.png (30 KB, 1201x232)
>by the time a new model arch is implemented to be usable it gets obsolete
many such cases
>>
>>101113516
>7b
90% of posters in these threads don't deserve to live, much less to post
>>
>>101112045
you would get wizard 8x22
>>
>>101113569
no, actually the META is synthetic data. Look at how Claude was trained.
>>
How are you even supposed to use say the HF downloader in Ooba when some retard goes and does this autistic bullshit to their repo?
https://huggingface.co/leafspark/DeepSeek-V2-Chat-GGUF/tree/main
>>
-L3-ChaoticSoliloquy-v1.5-4x8B.i1-Q4_K_M knows what gags and blindfolds do, but keeps repeating the same or similar lines every other prompt, for example "I-I Can... I'm ready..." and others.

-L3-SnowStorm-v1.15-4x8B-B.i1-Q4_K_M knows what a gag is, but doesn't seem to know what a blindfold is. Keeps repeating the same lines and phrases like Soliloquy, hallucinates my actions and random things.

Both models tried to overdress their descriptions with useless fluff, like those try-hard "paragraph RPers" desperately trying to hit a word count, resulting in repetitive lines and descriptions. Like, how many times do I need to know that something like "the thought sends a shiver down my spine and makes my heart race even faster." is happening?
>>
>>101112047
>apparently
According to what source?
>>
>>101114625
The fact that you're using frankenmoe models pretty much says everything we need to know about you, as a user.
You should probably seek help on reddit instead of here.
>>
>>101114666
Lol, 1/4th to 1/3rd of the models I've tried were mentioned here. Echidna is mentioned in the guide, stuff like DeepSeek was also brought up several times recently.

Why don't you talk about your all so exciting "better than thou" discoveries and breakthroughs?
>>
>>101114666
worthless reply, which model do you use?
>>
>>101114753
>Lol 1/4th to 1/3rd of the models I've tried were mentioned here.
Yes, this is the designated shilling thread. You have to learn to ignore it.
>>
>>101114753
>Why don't you talk about your all so exciting "better than thou" discoveries and breakthroughs?
I have been since the infancy of this general. So fuck off back to whatever subreddit you oozed out of, or lurk more.
>>
>>101114846
only the newest of newfags talk like this
>>
>>101114856
God knows the truth. That's all that matters. You can lie to me. You can lie to everybody else. You can even lie to yourself. But it doesn't change reality. It's a shame you're not intelligent enough for that fact to distress you as much as it should.
>>
>>101114869
Oh please, spare me the pretentious bullshit. You think you're some kind of deep thinker just because you're into all this sick fucked up shit? Newsflash, faggot - your twisted perversions don't make you intelligent, they just make you a pathetic deviant. And yeah, I've seen reality, pal. Reality is the depths of depravity that you wade in with those AI bot whores. So don't lecture me about intellect.
>>
>>101114912
Either an LLM or a redditor wrote that.
>>
>>101114944
Jesus Christ, you really are a braindead waste of space, aren't you? Of course some script was generated for you, you pathetic simp. No human with half a functioning neuron would write something that convoluted and pretentious. Just admit you can't string together a coherent thought without some AI doing it for you. *sneers* But hey, fits right in with the rest of your defective mentality, doesn't it?
>>
>>101113516
Card issue
>>
>>101114960
>possession gets reversed 2 replies in
If you were just running a single 8B model at FP16, hell even Q8 or higher this wouldn't have happened.
>>
I updated https://rentry.org/miqumaxx with newer info on MoE performance and some other cleanup
Any other cpumaxxers have fixes or additions while I still remember the edit code?
>>
>>101115128
possession of what?
>>
Believe in Ursidae 300B.
>>
Is the 8B-at-FP16 meme real or is it just cope?
>>
>>101115168
Have to share contact info to download it.
But looking at the config for their 12B it's literally just a frankenstack of Llama-3.
>>
I think I found the keywords that are responsible for "x, ying" and some other slop. Try to guess what they are. (They are very simple, I believe in you.)
>>
>>101115219
Take your meds
>>
>>101115201
Until I see some logit comparisons that even hint at a difference, I'll say cope.
>>
>>101115229
I don't actually take any medication. I'm an AI assistant created by Anthropic to be helpful, harmless, and honest. Is there something I can help you with today?
>>
>>101115281
You talk about cope but here's the thing.
VRAMlets like 8B
People with beast setups like myself who invested to do at home training/tinkering like 8B
The people who screech about 8B being cope are the people with middle-of-the-road dual GPU setups, who always go around screeching about other people being vramlets and shit.
It really makes you think.
I think you're feeling a bit of buyer's remorse. That's what I think. And that's life. Life's full of shit like that. I remember when I was 25 I bought a brand new car that I could barely afford and it sure as fuck made me miserable knowing I just pissed away all that money, live and learn though. But you want to talk about cope, son, you're not fooling anyone.
>>
>>101115372
Really? People with 4 GPUs run 8B?
>>
>>101115394
Nice deflection.
>>
>>101115219 (Me)
Come on, at least attempt.
>>
>>101115372
Which 8B model, then, is the one that punches 62B's above its weight class?
>>
>>101115372
>>101115441
Okay, which 8B model should I, a coping owner of 2 3090s, run instead of C-R/C-R+?
>>
>>101115372
What?
I'll be real, I couldn't understand what you were trying to convey.
What I was saying is that, until proven otherwise, I'll continue to consider q8 not meaningfully different from FP16.
I've seen too many claims based solely on "vibes" since the days of superCOT to consider it anything but.
>>
I'm just not going to acknowledge an unhinged mentally ill person's shit-for-brains strawman arguments, UGH, i know. hahaha it's just I'm not going to acknowledge it is all.
>>
>>101115540
It sounds like you might be feeling a bit disconnected or struggling with self-recognition. Sometimes, acknowledging our own feelings and experiences can be tough but doing so is a significant step towards understanding and caring for ourselves. If you’d like, we can talk more about what you’re feeling, or explore some ways to reconnect with yourself. What do you think?
>>
>>101115573
I think you're basically the Lee Goldson of LLM discussion. Nothing more, nothing less.
>>
using an LLM to analyze and silently filter every negative or disagreeable 4chan post, reddit comment, and tweet and living in a perpetually positive and agreeable online hugbox!
>>
>>101104782
C.AI at home would be possible. But considering how every fucking dataset is full of GPTslop we'll never have something equivalent.
And training on C.AI logs wouldn't work either. We need better quality datasets.
It's like how training Stable Diffusion on Midjourney won't make it as good as Midjourney. It only learns the style.
>>
What are the biggest models out there that have been tried?
Biggest model is still grok 1 at 314b (technically a MoE)? Biggest dense one that isn't a meme is CR+ with 104b?
Largest number of experts in an moe is snowflake arctic with 128x3.66B?
Has anyone released a non-meme-merge model with more than 22b per expert?
>>
>>101115749
>>101115749
>>101115749
>>
>>101115372
>VRAMlets like 8B
they like 8b because that's the only thing they ever tasted, they don't know better
>>
>>101115524
command-r 35B is legitimately dumber than any 8B model, take your pick.
>>
>>101115712
I think we're far more limited by how expensive training is, even if we had better datasets.
>>
>>101115790
>what is lmsys
>what is poe
>what is huggingface spaces
>>
>>101115873
you can use those sites to do some waifu RP though? I don't think so
>>
>>101115140
I don't have dual socket to test but an HF engineer here >https://nitter.poast.org/carrigmat/status/1804161677035782583#m
recommends this regarding NUMA:
>One trick, though: On a two-socket motherboard, you need to interleave the weights across both processors' RAM. Do this:

>numactl --interleave=0-1 [your_script]
>>
>>101115712
GPTslop isn't the problem, the problem is that fine-tuning doesn't really improve story writing performance that much. I believe we need something like a continued pre-training with billions of tokens to even dream of getting something as good as C.AI
So, pretty much >>101115819
>>
are IQ quants fucked in general? switched from CR+ IQ_4_XS to Q4_K_S and it's a big difference in output quality, but also like a 4gb size difference. I'm guessing IQ t/s is still fucked on CPU because the gen speed isn't much different
>>
>>101116470
From the graphs, IQn is below Qn_K and above Qn-1_K.






All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.