/g/ - Technology


File: lmg mood.jpg (139 KB, 1216x832)
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>103256272 & >>103248793

►News
>(11/21) Tülu3: Instruct finetunes on top of Llama 3.1 base: https://hf.co/collections/allenai/tulu-3-models-673b8e0dc3512e30e7dc54f5
>(11/20) LLaMA-Mesh weights released: https://hf.co/Zhengyi/LLaMA-Mesh
>(11/18) Mistral and Pixtral Large Instruct 2411 released: https://mistral.ai/news/pixtral-large
>(11/12) Qwen2.5-Coder series released: https://qwenlm.github.io/blog/qwen2.5-coder-family
>(11/08) Sarashina2-8x70B, a Japan-trained LLM model: https://hf.co/sbintuitions/sarashina2-8x70b

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/leaderboard.html
Code Editing: https://aider.chat/docs/leaderboards
Context Length: https://github.com/hsiehjackson/RULER
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
>>
File: GcLLp06aIAAEBJU.jpg (368 KB, 2048x2048)
►Recent Highlights from the Previous Thread: >>103256272

--Paper: Hymba: A Hybrid-head Architecture for Small Language Models:
>103264040 >103265024
--Papers:
>103264142 >103264271
--Debate on hybrid models vs separate models for AI tasks:
>103264086 >103264117 >103264132 >103264382 >103264396 >103264458 >103264550
--Critique of quantization benchmark chart and discussion of optimal quantization levels:
>103260714 >103260772 >103260883 >103260927 >103260881 >103262523 >103262630
--Unsloth adds vision model support with reduced VRAM usage:
>103261067
--R1 finds serialization problem in large codebase:
>103259678
--OpenAI's deleted evidence in copyright lawsuit sparks skepticism and negligence concerns:
>103257257 >103258483 >103258547 >103258594
--NVIDIA kvpress: 80% compression ratio without significant losses:
>103261925 >103261982 >103262008 >103262600 >103262698
--Local AI transcription tools for English speech:
>103256528 >103256545 >103257215
--Local AI girlfriend setup and conversation limitations:
>103257768 >103258014 >103258042 >103258110 >103258065 >103258157 >103258450
--Anons discuss Tulu 3 Models, a new instruct finetune series:
>103259680 >103259735 >103260672 >103262111 >103262312 >103262391
--Anon tries to adjust Dell 3090 fan speed:
>103259508 >103259624 >103259677 >103259766 >103259810 >103259900 >103259898
--Anon struggles to prevent Nemotron 70B from misusing ellipses, finds solution in token banning:
>103259994 >103260008 >103260047 >103260176 >103263273
--Anon asks about LS3 and Nvidia GPU fan control issues:
>103259803 >103259840 >103259885 >103259915 >103259958 >103260001 >103260019
--AI model responses to a question about making Sharo squirt:
>103256682 >103256751 >103256761 >103256872 >103256989 >103262833 >103257397
--Miku (free space):
>103259181 >103259966 >103260220 >103260446 >103261147 >103265119

►Recent Highlight Posts from the Previous Thread: >>103256368

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script
>>
>>103265207
>UOH
ToT
>>
thoughts on the sana release ?
i haven't tried it, but looking at the license i think the whole promise of efficient inference is a lie, or it got completely gimped last minute. the model also uses more memory than it should: the 0.6b model uses 8gb and the 1.6b 16gb of vram. though they did say it will go down with quanting, so here's hoping they trained the model in fp64 or fp128
>>
>>103265207
So are you just shilling these tulu models or what
>>
what does it mean when your model keeps repeating the same thing again and again regardless of your prompts? How do you fix that?
>>
>>103265656
It means you touch yourself at night.
>>
>>103265656
it means your setup is completely broken and you're not actually passing your inputs to the model
>>
>>103265958
I see, thanks
>>
fuck is tulu?
>>
ai isn't real, it's just word association and statistics
>>
https://www.reddit.com/r/LocalLLaMA/comments/1gwyuyg/beware_of_broken_tokenizers_learned_of_this_while/
>How can you tell?
>A model's tokenizer is the tokenizer.json file, and you can tell if a tokenizer is borked by transformers by seeing if its size is double the base model's tokenizer size.
>This can happen to any model, I have seen this on many finetunes or merges of Llama, Mistral or Qwen models. So if you are having issues with a model, be sure to check if the tokenizer is broken or not.
>How to fix this?
>Easy. Just copy over the base model's non-broken tokenizer.
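If you want to automate the check, a minimal sketch (the paths are made up, point them at your own downloads):

import os

base = "Mistral-Nemo-Base-2407/tokenizer.json"  # hypothetical local paths
tune = "UnslopNemo-v3/tokenizer.json"
base_size = os.path.getsize(base)
tune_size = os.path.getsize(tune)
print(f"base {base_size / 1e6:.1f}MB vs tune {tune_size / 1e6:.1f}MB")
# per the post above, roughly double the base size means borked
if tune_size > 1.8 * base_size:
    print("tokenizer looks borked, copy the base one over")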
>>
File: trans-case-1.jpg (433 KB, 1008x1538)
Holy f**k level 2 reasoner strawberry o1
>>
>>103266637
Mistral-Nemo-Base-2407: correct 9,3MB
Mistral-Nemo-Instruct-2407: correct 9,3MB

Rocinante-v1.1: correct 9,3MB
UnslopNemo-v1 & V2: correct 9,3MB
Nemomix-Unleashed: correct 9,3MB
MN-12B-Mag-Mell-R1: correct 9,3MB
Crestf411_nemo-sunfall-v0.6.1: correct 9,3MB

UnslopNemo-v3, 4, 4.1: INCORRECT 17,1MB
Crestf411_MN-Slush: INCORRECT 17,1MB
Results from a few Mistral Nemo tunes whose weights I had downloaded.
>>
Any worthwhile model I can run on my M4 Pro with 48GBs?
>>
>>103266637
How does this affect me as a regular llamacpp user? I never download anything other than the gguf file(s), do they already contain the tokenizer?
>>
https://huggingface.co/AIDC-AI/Marco-o1
https://arxiv.org/pdf/2411.14405
>>
>>103266853
>do they already contain the tokenizer?
Yes, and possibly the broken ones

>Ollama uses GGUF file type. So it depends on which tokenizer was used when the model was converted to GGUF.
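If you want to check what's actually baked into your gguf, the gguf python package from the llama.cpp repo can read the metadata. Rough sketch, and I'm assuming the usual tokenizer.ggml.* key names and field layout here:

from gguf import GGUFReader  # pip install gguf

reader = GGUFReader("model.gguf")  # hypothetical filename
tokens = reader.fields.get("tokenizer.ggml.tokens")
if tokens is not None:
    # for array fields, data holds one index per element
    print("embedded vocab size:", len(tokens.data))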
>>
>>103266857
Just read that as well, this ain't good
Is there a way to merge a gguf with a fixed tokenizer?
>>
>>103266885
I think the easiest is to redo the gguf with a "fixed" base tokenizer. I don't know if you can edit the tokenizer metadata in the gguf to the level you'd need.
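Something like this should be the path of least resistance; a sketch assuming you have the HF-format finetune on disk and a llama.cpp checkout (the script name matches recent llama.cpp, but check your version):

import shutil, subprocess

# hypothetical paths: finetune with the bloated tokenizer, base with the good one
shutil.copy("Mistral-Nemo-Base-2407/tokenizer.json", "UnslopNemo-v3/tokenizer.json")
# redo the conversion so the new gguf picks up the fixed tokenizer
subprocess.run(["python", "llama.cpp/convert_hf_to_gguf.py", "UnslopNemo-v3",
                "--outfile", "unslopnemo-v3-fixed.gguf"], check=True)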
>>
Why is it that every few months a major (and in hindsight quite obvious) bug in the AI ecosystem gets unearthed? And why is it usually the tokenizer?
>>
>>103267004
>usually
More like every time
>>
>>103266637
>>103266757
I just checked Mistral Large
>2407: Tokenizer 1.96MB
>2411: Tokenizer 3.96MB
aren't these supposed to be the same when it's just a minor refresh?
>>
>>103267063
Nah, New large has the instruct tags, right?
>>
>>103267004
AI developers are bad programmers, that's why most of them use python
>>
>>103267080
>AI developers are bad programmers, that's why most of them use python
as a data scientist, I confirm
>>
>>103267074
>>103267063
>instruct tags
That and the 2411 tokenizer is called v7
>https://github.com/LostRuins/koboldcpp/pull/1224
>Create Mistral-V7.json #1224
So I'd say it makes sense they're very different.
>>
File: 1707085483936539.png (36 KB, 798x957)
>>103266637
What exactly is it that 'breaks' inside a tokenizer? The json is just a long textfile that lists all tokens + some other stuff. I don't see what can go wrong here.
>>103267074
>>103267098
I just checked both of them. Somehow the 2411 tokenizer has about three times the lines (90k vs 280k or so) because the "merges" section now looks very different, with a lot more spacing. Left is the 2407 one and right is the 2411 one. Both are basically the same up until the "merges" section. No idea what bloating it up like that would accomplish though.
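If anyone wants to reproduce, a quick sketch; my guess (unconfirmed) is that a newer tokenizers library writes merges as ["a", "b"] pairs instead of "a b" strings, which would triple the pretty-printed line count without changing anything functionally:

import json

# hypothetical filenames for the two downloaded tokenizers
old = json.load(open("tokenizer-2407.json", encoding="utf-8"))
new = json.load(open("tokenizer-2411.json", encoding="utf-8"))
for name, tok in (("2407", old), ("2411", new)):
    merges = tok["model"]["merges"]
    # entries are either "a b" strings or ["a", "b"] lists depending on
    # which tokenizers version wrote the file
    print(name, len(merges), "merges, first entry:", repr(merges[0]))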
>>
File: heyheyhey2.png (1.06 MB, 1024x1024)
HEY HEY HEY! WASO WASO WASO WASO WASO WASUP BITCONNEET
>>
https://www.reddit.com/r/LocalLLaMA/comments/1gx6qyh/open_source_llm_intellect1_finished_training/

Kek at comments
>The first ever OPEN SOURCE model, not open weights but OPEN SOURCE!
>>
>>103267331
>not open weights but OPEN SOURCE
it shouldn't be something to be celebrated, you have to hide your dataset so that you can train your model on good quality data, and not on some copyright free slop, those fucking ledditors...
>>
>>103267346
More importantly, it's far from the first open source model, there was K2
>LLM360 has released K2 65b, a fully reproducible open source LLM matching Llama 2 70b
And quite a few older ones as well that showed their data.
>>
>>103267359
>K2
i remember the cope like "you can uncuck it" and stuff, and literally nothing came of it lmao
>>
>>103266732
woah, this is very interesting, will they release the weights?
>>
>>103267442
it's already here? >>103266856
>>
>>103266856
>by fine-tuning Qwen2-7B-Instruct with a combination of the filtered Open-O1 CoT dataset, Marco-o1 CoT dataset, and Marco-o1 Instruction dataset, Marco-o1 improved its handling of complex tasks.
>Qwen2-7B-Instruct
...
>>
>>103267450
thanks
>>
>>103267476
>Even the Chinese forget Qwen 2.5 exists
>>
>>103266856
Yeah this is no Deepseek
>>
is quadro rtx 8000 worth it?
>>
File: translation.jpg (822 KB, 2256x2038)
>>103266856
>>
>>103267627
Short answer:
>no
Long answer:
>noooooooooooooooooooo
>>
>>103267685
but I can't find cheap 3090s
>>
File: file.png (72 KB, 772x458)
>>103267331
>The first ever OPEN SOURCE model, not open weights but OPEN SOURCE!
KEK
>>
>>103267701
You heard it here first: if you build your oss software on multiple machines it's even MORE open!
>>
>>103267697
Wait for the 5090 to come out and hope that gaymers will sell their 3090s.
>>
>>103267697
Define cheap?
A Quadro RTX 8000 costs 3 times as much as a 3090.
>>
>>103267697
Fb market has them for 700-900
>>
>>103267751
cheap as in freshly fallen from the delivery truck.
>>
>>103267346
>it shouldn't be something to be celebrated
true, but it's not like they had any other choice in this case, the dataset has to be public for decentralized training
>>
Now that distributed training worked fine it's time to make distributed inference so that we GPU poors can get some gibs
>>
>>103267966
Already exists, for a while too
>>
>>103267331
>All that effort and time
>For a model trained on 1T tokens
It's really over isn't it
>>
>>103267987
I hope somebody pays you to do this all fucking day.
Because if you do it for free you are the most miserable fucking sub-human pile of flesh to ever escape the abortion process.
>>
>>103267978
Kobold horde you mean?
>>
>>103267990
>I hope somebody pays you to do this all fucking day.
>Because if you do it for free you are the most miserable fucking sub-human pile of flesh to ever escape the abortion process.
>>
>>103267993
No
https://github.com/bigscience-workshop/petals
Llama.cpp RPC
>>101582942
>vLLM distributed inference actually worked...
>I got 15 T/s with Mistral Large with 2 PCs with 2x3090 each.
To name a few options.
>>
I'm getting 1.40 tokens/s with Cydonia-22B-v2q-Q3_K_M.gguf on a 3060 with 12gb, are my settings fucked or is this normal?
>>
>>103268060
That seems rather low, yeah
Did you limit the context size to something like 32k? Flash attention? Other programs hogging your gpu?
>>
>>103268024
>Petals
Weren't those the people who made BLOOM
>>
>>103268078
I have flash attention and context length is 8192. I'm retarded, I started using LLMs on my machine less than 24 hours ago and don't know what I'm doing
>>
>>103267990
Anon, if you want to treat 30 people pooling GPUs to train a 10B model on 1T tokens like it's the second coming of Christ, then feel free
But as far as breaking the chain of corpo dependency goes, there's still a ways to go
>>
how do I make macos not send any telemetry so that I can enjoy both high token generation/s and power efficiency with privacy?
>>
>>103268112
Shut the fuck up you retarded piece of shit.
>>
>>103268106
You should get about 5 t/s with this kind of context. Maybe you're offloading too many layers into RAM.
>>
>>103268127
Sorry samsja, didn't mean to offend you
>>
>>103268127
Trvke We Need To Support The First Ever OPEN SOURCE model, Not Open Weights But OPEN SOURCE Y'all!!!!
>>
>>103268060
If you are using Windows: check that the driver setting where VRAM is swapped into RAM is disabled (I forgot what it's called).
If you did not manually set the number of GPU layers, your frontend may be trying to set the value automatically; I know that KoboldCpp and ollama have logic like this, and to my knowledge the estimates tend to be too conservative.
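If you want to take the guesswork out, set the layer count yourself. A sketch with llama-cpp-python; a 22B at Q3_K_M should come close to fitting entirely in 12GB at 8k context, but the exact numbers are something you tune per model:

from llama_cpp import Llama  # pip install llama-cpp-python

llm = Llama(
    model_path="Cydonia-22B-v2q-Q3_K_M.gguf",
    n_gpu_layers=-1,  # -1 offloads every layer; lower it if you run out of VRAM
    n_ctx=8192,
    flash_attn=True,
)
out = llm("Once upon a time", max_tokens=64)
print(out["choices"][0]["text"])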
>>
>>103268124
>apple
>no telemetry
lol. You could unplug the power cable and encase it in concrete.
>>
Ultra censored LLM from applel is coming btw https://x.com/MacRumors/status/1859707331392757812
>>
File: 142142352157469.png (40 KB, 651x1065)
>>
>>103268362
but it uses emojis more now! great tradeoff
>>
>>103268175
It's called "CUDA - Sysmem Fallback Policy" in the nvidia control panel
>>
>>103268362
>mistral large shits the bed in every category
grim
>>
>>103268362
I mean yeah it's worse but at least they got to top the LMSYS leaderboa-
>>
File: MikuDoesntWantToGetUp.png (1.58 MB, 1232x816)
Morning, /lmg/...
>>
>>103268540
it literally does better than a recent 4o release
>>
>>103268606
Good morning to you, Miku
>>
>Ask Project Euler question to write code to solve the problem
>Get this
Thanks DeepSeek
>>
Holy fuck someone buy this https://www.ebay.com/itm/276743259844
>>
File: 1713635130526489.jpg (306 KB, 1672x854)
>5 minutes to train GPT-2
Are we back?
>>
>>103268750
This is literally benchmaxxing, the condition for a completed training run is just achieving a specific eval score.
>>
>>103268737
Wtf is that real
>>
>>103268727
SOUL
>>
>>103268606
gm betufel
>>
>>103268737
Someone take the plunge
>>
>>103268362
Where is Largestral 2411?
>>
>>103268769
>Must not modify the train or validation data pipelines.
It's the same dataset and parameter size, though.
>>
>>103268784
I got one before they nuked the listing. Any bets as to whether I get it?
>>
>>103268831
Honestly, I doubt it. Looks like a pricing error from the other stuff they sell.
>>
>>103268831
You'll get a box. You'll have some GPU in it if you are lucky.
>>
>>103268831
Interesting. So it was probably legit and the guy accidentally a decimal place.
>>
>>103268831
I don't know how UK law handles it specifically, but under German law, if they mistyped the price they would now have a legal obligation to actually sell you the item at that price (though they may refuse and you'll need to take them to court).
If it was a scam you'll never get anything.
>>
>>103268362
Qwen that high?
So running it at q4 is the reason why I get so many dumb coding errors…
>>103268727
Lmao qwen did something similar yesterday for me, it was too lazy to write a section of code and just made it a “suggestion”.
>>
>>103268926
No, Qwen and other chinkshit just do everything to look good on benchmarks
>>
>>103268866
They offered paypal so it should be pretty easy to get your money back if it's a scam.
>>
>>103268945
Regardless of whether it is better or worse, it is still the best coder model that I can run.
I just want to know how to squeeze more performance out of it.
>>
Expect the first two weeks of December to be crazy for local models.
>>
>>103268866
This is false, that law only applies to retail shops
>>
>>103269118
https://www.rechtsindex.de/internetrecht/4542-bgh-urteil-viii-zr-42-14-ein-fahrzeug-fuer-1-euro-schnaeppchenpreis-bei-einer-ebay-auktion
>>
>>103268362
wtf is OpenAI doing, they're getting destroyed by the competition, they can't even beat themselves anymore lmao
>>
>>103269137
einen gutes offen modell zu dir auch guten herren
>>
>>103268737
holy fucking shit
>Located in: ShenZhen, China
I smell hogwash
>>
>>103269217
All the talent left and they are hitting the wall of diminishing returns while racking up debt. They made a fucking CoT tune and marketed it as innovation. A fucking CoT tune.
>>
File: itsalive.png (342 KB, 748x977)
no one seriously believes these things are alive, do they?
>>
>>103269328
My boomer parents do.
>>
>>103269328
Some people think the earth is flat, others think that you can sustain yourself with just sunlight. Believing that some enhanced text prediction model is sentient is one of the less egregious cases of retardation
>>
>>103269328
jewlywood portraying ai this way is to blame.
>>
>>103269328
I had this illusion during the early days of c.ai, but it was quickly gone.
>>
File: 16.png (75 KB, 920x798)
Looks like INTELLECT-1 is finally done training. From what I can glean, it should be released in a week and is currently going through post-training with something called Arcee AI.
>>
>>103269217
It doesn't exactly help that their talent left (actually it might even make it worse, since all their investors are probably looking to see if they can recover from their exodus lmao)
Still kinda crazy to see how far their lead is slipping. I still remember when OpenAI had GPT-3 and all we plebians had was GPT-fucking-Neo-2.7B
Now DALL-E 3 is mogged by Black Forest Labs, GPT-4o is mogged by Claude in the intelligence department and Gemini in the human preference department, o1 already has a competitor, and Sora is basically MIA
>>
>>103269402
>Arcee AI
That's mergetkit people with Charles O. Goddard
https://github.com/arcee-ai/mergekit
>>
>>103268540
?
Did you misread it thinking they are sorted top to bottom? It beats many top corporate models.
>>
Which local model if I want to try out those new meme IDEs?
>>
>>103269402
Who's going to do the red teaming and rhlf?
>>
>>103269455
Arcee
> Arcee AI empowers businesses to train, deploy, and continuously improve proprietary, specialized, secure, and scalable small language models (SLMs) within their own environments, revolutionizing data privacy and security.

>Their all-in-one system enables pre-training, aligning, and continuous adaptation of small language models.

>This ensures security, compliance, and enhances model relevance and accuracy.
>>
>>103269466
>aligning
kek they're gonna cuck the model, it's ova
>>
>>103269466
>no, goys, you can't have the base model, that's too unsafe for you
>here's (((aligned))) instruct
>>
>>103269402
>alignment
It's DOA.
>>
>>103269466
kek @ whoever paid for an aligned model
>>
>>103269419
>the guy who's responsible for the shitty merge era is now offering alignment services
This guy is a grifter and a net negative for local
>>
>>103269483
Y'all love censored models though, from all the shilling I've seen here.
>>
>>103269466
>>103269402
>1st ever fully open source model
>aligned to fuck and we don't even get the base model
scam
>>
>>103269328
llama 3 7b is sentient
>>
>>103269328
Claude is the closest to have that 'ghost in the shell' feel
>>
>>103269508
>turkish rapebaby tranny balkanoid does his low effort trolling again
hi petr*
>>
>retards freaking out over the word "alignment"
>>
>>103269514
Safety and tolerance are the most basic values of Open Source and its community.
>>
File: file.png (49 KB, 799x531)
>>103269466
>Arcee
They sure got big tho
Others include
>AWS and Intel
>>
>>103269541
Right, like that hasn't meant practically only one thing since the word became commonly used.
>>
>>103269571
Releasing an unsafe and offensive model would only hurt the image of open source.
>>
>>103269550
Yes, xister! Free software should be replaced by ethical software to own le chuds! #RemoveStallman
>>
>>103269550
That's actually true, considering /g/'s love for establishment and queer e-celebs.
>>
File: file.png (55 KB, 903x922)
>>103269586
Correct
>>
>>103269550
2/10 ragebait
>>
>>103269571
you haven't realized that every single big release pays lip service to the concept of alignment regardless of how censored they end up being
>>
>>103269616
>>103269603
But sure, if you want to be hyped for something that'll 100% be ultra-corpo safe go ahead.
>>
AI isn't real and everybody who's making money off this field knows this but pretends otherwise. Get that bag and gtfo before the bubble blows up. It's okay to be a bystander who just wants a local smut autocomplete, a bunch of h100s will be liquidated.
>>
God I hope R1's weights are actually released. This model is legit better than closed source.
>>
>>103269627
Bro you don't understand, sama has made GPT5 smarted than a human, it's fully multimodal AGI or even ASI! Please invest.
>>
>>103269642
It's pretty entertaining to see its thoughts, but it just badly fucked up a coding problem that even free chatgpt solved for me. And its coding knowledge seems to be really outdated.
>>
>>103269666
? It's the only model that got some stuff that only claude 3.5 and qwen2.5 32B coder got before. Maybe you got a bad "reasoning" roll?
>>
>>103269688
lol, no way. reasoning doesn't do shit for coding abilities.
>>
>>103269714
And why wouldn't it?
>>
File: 1708365016268048.png (67 KB, 865x182)
Speaking of alignment, I wonder what this last line is supposed to be about?
I don't have anything about flags or alignment in my prompt or the card description.
>>
>>103269740
I'd say it's the name "Naomi" pulling all kinds of CoTs / jbs in the garbage logs your model was finetuned on.
>>
>>103267649
>feeling of stepping on feces
>>
>>103269804
I'm happy to see that your relationship with tranx qwxxn is going great, keep us updated.
>>
File: upset.jpg (20 KB, 600x600)
>>103269550
Fuck that shit.
>>
https://x.com/ltxstudio/status/1859964100203430280
Local video generation now down to a single 4090
>>
>>103266757
All of Drummer's Small tunes seem to have the bloated tokenizer.
>>
>>103269898
*except Cydonia v1.0
>>
File: 1716129419410379.webm (2.24 MB, 768x512)
>>103269883
yeah, can confirm, this shit is pretty good, and it's really fast
>25 fps, 129 frames (5 seconds), 50 steps
>01:10<00:00, 1.42s/it
>13gb VRAM peak (during the vae decoding)
>RTX 3090
>>
>>103269883
>Local video generation now down to a single 4090
We've had that since Mochi 1
>>
>>103269989
but with mochi you had to wait 30 min to get a single 5 second video; with this one it's only 1 min because they managed to efficiently compress the VAE
>>
>>103269883
>https://github.com/Lightricks/ComfyUI-LTXVideo
>https://github.com/Lightricks/LTX-Video
> first commit 6 hours ago
yeah this is an obvious shill using anons as guinea pigs
>>
File: 1730858495297125.png (578 KB, 512x512)
the chinks will save us all
>>
I wonder what kind of AI models Aliens use.
>>
>Product Security Engineer @ Red Hat- AI Security, Safety and Trustworthiness
>https://huggingface.co/posts/huzaifas-sidhpurwala/601513758334151
>As AI models become more widespread, it is essential to address their potential risks and vulnerabilities. Open-source AI is poised to be a driving force behind tomorrow's innovations in this field. This paper examines the current landscape of security and safety in open-source AI models and outlines concrete measures to monitor and mitigate associated risks effectively.

>https://huggingface.co/papers/2411.12275
We need more of this! Much more!
>>
>>103270001
Near real time on a 4090. First video model worth using because of it. Prepare to start seeing porn finetunes of it.
>>
File: 1704377820835720.webm (1.96 MB, 768x512)
>>103269981
I like that one
>>
>>103270112
https://arxiv.org/abs/2411.12275
>>
File: Base model.png (21 KB, 593x237)
>>103269514
Looks like they actually will be releasing the base model, as well as the post trained model.
>>
>>103270150
sounds like they aren't as retarded as I thought, that's cool
>>
>>103270150
oh wow lmg was dooming over nothing who would have thought
>>
>>103270150
who cares, the chance of this model being better than llama2 7B is slim.
>>
>>103270176
for once /lmg/'s doomerism was wrong, usually we get fucked in the ass pretty hard
>>
Update on sarashina2 8x70b... it's pretty unhinged with good temp/minp. I'd say it's the jap ERP king after doing some completion on existing chats. Super spicy.
The initial release had a busted tokenizer_config.json, but after requanting it works properly.
>>
>>103270185
maybe you do
>>
What is the best oobabooga preset (or parameter values like temperature, min p, etc) to use for the best Roleplay(mainly erotic but I care very much about characters following the scenario and not going OOC) experience in Mistral Nemo 12B finetunes?
>Use DRY
I don't have that yet, will get around to that.
>>
>>103270274
Depends heavily on the model, but i like to start with temp 2.6 and minp 0.008 and then back off until I get an amount of insanity that's appropriate for what I'm trying to achieve.
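If you script your testing, the same knobs can be set per request instead of in the UI. Sketch against KoboldCpp's generate endpoint; the field names are what I remember from its KoboldAI-style API, so double check:

import requests

payload = {
    "prompt": "The tavern door creaks open and",
    "max_length": 200,
    "temperature": 2.6,  # start high...
    "min_p": 0.008,      # ...and let min-p prune the garbage tail
}
r = requests.post("http://localhost:5001/api/v1/generate", json=payload)
print(r.json()["results"][0]["text"])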
>>
>>103269714
Kek. Not sure what level of coding you've done, but I have to assume you either: (a) are just starting out and have somehow Dunning-Kruger'd yourself into thinking you're an expert, or (b) the only coding you've done has been via prompting an LLM
Reason I say this is because, unless you're truly doing basic toy shit, you generally don't get very far before you get fucking steamrolled (or, on the off chance it does work, write horrifically inefficient code vomit) if you don't know what you're doing
If you disagree, I invite you to check out TAOCP, Concrete Mathematics, and Algorithm Design by Kleinberg and Tardos
>>103269688
r1 has some pretty heavy variance. It generally ranks below o1 in some of my tests of programming / algorithm problems, though it definitely often comes a lot closer than Claude and Qwen. I don't think it would fully replace o1-preview for the people that use it, but it would make the ridiculous prices OpenAI charges at the moment quite a bit more questionable
>>
>>103270322
Top p 1, top k 0, typical p 1, right?
And repetition penalty at?
>>
>>103270347
>urrr durrr skill issue
Stop being stupid anon, I'm talking about the kind of reasoning that these LLMs do. If you think I'm wrong, why does o1 gets mogged by Claude 3.5?
>>
>>103270434
It genuinely doesn't though, it's just better at the easier stuff. If you disagree, you can test it yourself. Here's the problem: https://atcoder.jp/contests/dp/tasks/dp_j
Pop that into Claude and see what it gives you. Here's the (correct) o1 solution for reference, which was its first attempt
>>
>>103270542
Claude test for reference (got murdered by a division by zero)
>>
repetition penalty should be deprecated
literally just exists as a newfag filter at this point, way too easy to go wrong and use retarded values that turn your output into adjective/adverb spam because every glue word got penalized into nonexistence
>>
what's the current best method to have a chatbot that 1. can "read" images, so if i post an image it can describe it (within current model limits of course), and 2 (optional) i can tell it to prompt and gen an image?
using sillytavern/koboldcpp backend currently, but not sure how i'd go about it there.
in other words, i want to chat with the ai about images, and if i post one it can talk about it and suggest prompts
>>
>>103270696
>remove feature with occasionally niche value because retards don't understand it and use it in the wrong way
No, that's the spirit of proprietary software, not open source
OSS does mean footguns for newfags sometimes but that's a price worth paying. Fuck outta here with your dumbing-down suggestions
>>
>>103270886
Open webui if you don't mind it raping your RAM.
>>
>>103270992
what niche value does it have over presence / frequency penalty (the same thing but with sane scales and less retarded implementations) or more advanced repetition samplers like DRY or w/e? it's just a super primitive and very poor sampler that sucks ass at its job. it's bad. there are NO pros to it. rip that shit out.
backends can keep it for compatibility's sake but frontends should not be putting that garbage in front of a user's face unless they very specifically request it for some deluded reason
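for reference, the difference spelled out: rep pen scales the logit of every seen token, while presence/frequency subtract a flat amount plus a per-occurrence amount, which degrades way more gracefully. minimal sketch of the latter (OpenAI-style formula):

from collections import Counter

def apply_penalties(logits, generated_ids, presence=0.2, frequency=0.1):
    # logits: dict of token id -> raw logit
    # generated_ids: token ids produced so far
    counts = Counter(generated_ids)
    penalized = dict(logits)
    for tok, count in counts.items():
        if tok in penalized:
            # flat hit for appearing at all, plus a hit per occurrence
            penalized[tok] -= presence + frequency * count
    return penalized

# toy example: token 42 appeared 5 times so it gets dinged hardest
print(apply_penalties({42: 3.0, 7: 2.5}, [42] * 5 + [7]))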
>>
>>103271048
NTA but simply updating ST's default presets would solve this.
>>
>>103270365
I keep top p and typical p at 1, but I crank top k all the way up to 200.
I don't use rep-pen. If a model is to repetitious in a way I don't like I just don't use it.
>>
anyone tried out the vision support in exllama2?
>>
>>103271200
exllama supports vision now?
>>
>>103271102
Thanks anon I will see if it works well for me.
>>
i've also given up on rep penalty stuff, xtc, dry. sure they can help reduce overused slop but they also introduce errors when the model wants to say a shirt is red, but can't, so it picks another color which is wrong. so out of the choice of more slop or inaccuracies, i'll deal with the slop. low min p + adjusting temp is all i use these days
>>
>>103271276
I concur with this assessment. At first I thought Largestral wasn't that good, until I realized XTC was making it retarded and never went back.
>>
Where do you reckon the tech will be in five years? ten years?
>>
>>103271236
I saw this in tabby
https://github.com/theroyallab/tabbyAPI/pull/249
>>
>>103271276
I agree on rep pen and especially XTC (that can REALLY make a model retarded...turns out lower probability tokens are lower probability for a good reason)
but I find DRY is basically risk-free regarding the model's intelligence as long as you're not applying it to single tokens (so allowed length of 2 or higher)
>>
>>103271011
i have 128gb ram and 24gb vram, does that work?
>>
>>103271331
By then nvidia will release $300 24gb cards finally, and we will be able to run local o1 on it.
>>
>>103271345
NTA but
>(so allowed length of 2 or higher)
This should be at least 5 or it starts banning uncommon names. Also add {{user}} and {{char}} to the sequence breakers (persona and character names should consist of the first name only).
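for anyone wondering what DRY actually does: it looks for the longest suffix of the context that a candidate token would turn into a repeat of something seen earlier, and penalizes exponentially in that length. naive sketch of the idea, glossing over sequence breakers, and nothing like the optimized real implementation:

def dry_penalty(context, candidate, allowed_length=2, multiplier=0.8, base=1.75):
    # context: list of token ids so far; candidate: token id considered next
    match = 0
    for k in range(1, len(context)):
        pattern = context[-k:] + [candidate]
        found = any(context[i:i + len(pattern)] == pattern
                    for i in range(len(context) - len(pattern) + 1))
        if not found:
            break  # longer suffixes can't match if this one doesn't
        match = k
    if match < allowed_length:
        return 0.0
    return multiplier * base ** (match - allowed_length)

# candidate 3 would extend the repeated [1, 2] sequence, so it gets penalized
print(dry_penalty([1, 2, 3, 4, 1, 2], 3))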
>>
i think the site died
>>
>>103271647
ayy finally, after like 4 attempts

>>103271345
out of the three (rep pen, dry, xtc) i liked xtc the least. it just seems like a horrible idea to chop off the top tokens because that token could be a name, color, any kind of detail. dry seemed ok but could also introduce errors.

>>103271473
is that something you setup already or just speculating on? i'd try it
>>
>>103270176
You think they were using good training data? Only copyrighted data is good training data, and them being open source means they will have zero of that.
>>
>>103271712
nice unrelated pivot
>>
>>103271712
Issue isn't the quality, it's the count
>>
>>103271678
>is that something you setup already or just speculating on? i'd try it
With --debugmode on in KCpp, I saw it trigger a lot on first syllables of non-English names. I guess I'm speculating a bit here: those names would pop up if necessary ("He was born in _") but otherwise not.
>>
Any threestral finetunes yet?
>>
>>103271871
when i was using dry i noticed it fucked up jap names a lot. like it'd get through half a name and then just go nuts. (tsukino becomes tsukAKAK)
i thought it was the model at first since i never saw the same issue with normal english names, but it went away when i turned dry off.
>>
discord.gg/aicg/
although we are chatbot focused, we have many channels meant for prompting and ai art, including dall-e, flux, stable diffusion, pix art; someone even hosts a proxy. come join us!
no lurking
>>
>GPT-4o no longer topping any leaderboard, tried to top Google only to get smacked back down
>China is about to take away what little value o1 had
>OpenAI ran headlong into a "fuck you" sized scaling wall that's turning any further upgrades into side grades (the more recent GPT-4o release meant to top the LMSYS leaderboard is worse)
>Anyone with any competence to save OpenAI from itself has long since left
>Musk has power now and is out for Altman blood
>OpenAI still dealing with a fuckton of lawsuits from NYT and Pajeets, "accidentally" deleted their datasets which makes them look more liable
>Still billions of dollars in the hole with investors getting antsy for a return on their buck
It's like watching a train wreck
>>
>>103272323
>faggot noises
lol no
>>
>>103272363
>Anyone with any competence to save OpenAI from itself has long since left
The inertia is spent. altman is finally paying for his hubris.
>>
>>103272363
was about fucking time, I alaways hated this fucker, his fear mongering of AI has done a lot of damage to the community
>>
>>103272363
i don't even consider openai to be relevant at this point. claude passed them months ago and it's remained the same. now local has gotten so close on benches like coding, which is amazing given the assumed size difference (qwen 32b vs whatever the fuck 8x+ monster gpt/claude is). openai's reign was over months ago, it's just taking people a while to realize it
>>
>>103272417
Have we really started to take Chinese model benchmarks at face value? Come on now.
>>
>>103272363
Trust the plan Altman says AGI is coming in 2025. Strawberry is going to blow us away.
>>
>>103272363
Musk recently said he would make AGI by 2026 and xAI is building a massive server farm at an unprecedented pace.
>>
>>103272363
OpenAI still has branding and first-mover's advantage. For most people AI = ChatGPT.
>>
>>103272435
i wouldn't even mention a benchmark if i didn't use it myself. i know how they game shit, and especially china, they lie and steal everything. but yeah, it's a good model, it's the first one to not shout at me in chinese halfway through a message. not just qwen though, nemotron is also very good. hell, even codestral is amazing for its size. local is eating well and the gap has shrunk insanely in the last year.
>>
>>103272449
America Online also had branding and first mover's advantage
>>
>>103272449
>AI = ChatGPT
That's why I mentioned inertia and referenced the brain-drain quote. You can only keep first-mover advantage if you stay close enough to the state of the art to remain relevant before people discover superior services.
It's a serious advantage, but it must be defended, especially since AI is in its infancy in the public imagination.
>>
File: MGS6.jpg (271 KB, 1529x857)
>>103264382
To some extent the human brain contains dedicated centers, but those centers have extremely wide interfaces pumping a shitload of data between them. Bandwidth is the limiting factor for virtually every interesting computation. So what you describe will never work with English text or tokens or whatever as the shared language for the centers: the interface is too narrow, so it'd at best be ultra inefficient.

It also just won't work well with the Von Neumann architecture.
>>
>103272323
What a sad end
>>
>>103272488
back in high school, when aol was sending billions of cds to everyone (you could even find them at burger king), we used to shove hundreds of them through the vents of lockers so when you'd open it, 500 cds would spill out
great fun until it's you that opens the locker
>>
>>103272499
This. First mover's advantage isn't going to save you if your services are inferior / more expensive than competitors (like, say, charging $0.50 per o1 query). It might delay your fall into obscurity, but eventually people will move on if you don't have something good enough to offer them. Unfortunately for them, OpenAI also happens to be in the position where it's costing a lot more than it's bringing in and it needs a plan to turn a profit fast.
>>
>>103272581
>people will move on if you don't have something good enough to offer them
my co-worker moved to anthropic a while back (understandable since he's in IT), but my kid's friends are already moving to perplexity, so they couldn't give a rat's ass what's on the back end. And this is in a rural area without any kind of tech sector presence.
Also probably a majority of companies are using copilot branding via their MS EA, so the name brand is already severely diluted for knowledge workers.
>>
>>103270886
>sillytavern
Just use the attach button, at least with the custom OpenAI API and using vLLM as a backend, it just works. I suppose now tabbyAPI can be used for that too.
>>
>>103266622
>artificial intelligence is artificial
Oh wow.
Everyone
This guy is so smart
Holy shit
>>
>>103266622
>ai isn't real, it's just word association and statistics
and what are we? our brain just works thanks to a bunch of little electric shocks
>>
>>103272323
Probably a troll honeypot server. So naturally I'm going to join out of morbid curiosity
>>
>>103272807
electricity shocks powered by God
>>
>>103272840
Aww it's a fake link. No friends for me :(
>>
>>103272842
if God created the world, then he also created the AI, CHECKMATE
>>
>not real intelligence
>But it knows about the hallway birds
>>
Is there a model that's free to use commercially? It doesn't have to be gpt level, just needs to string a few sentences of text together.
>>
>>103272976
A ton of openly licensed models can do that.
>>
q2 behemoth or 5km midnight miqu?
>>
>>103272886
those are just government drones that don't fly
>>
>>103272995
Thanks I'm retarded, I'll use T5.
>>
>>103272997
miqu for slop, q2 for tardation
>>
>>103272363
>Musk has power now
My hobby didn't deserve it. It is a good thing it is dead anyway.
>>
>>103272997
Go back to the Kobold Discord.
>>
>>103269586
I'd say fuck your optics, concern troll, but what you say isn't even remotely true. There needs to be space for hobbyists and tinkerers to collaborate on uncensored models. If that is not allowed, then you can be sure you don't live in a free society.
>>
File: 1705186488029659.png (1.9 MB, 1024x1024)
>>103273159
based
>>
>>103267649
Ah so it's completely useless for translating visual novels because it avoids anything offensive or adult in nature.
>>
>>103273094
dunno what that is
>>
I hate Qwen. Largestral is too big. Nemo is too retarded. Nemotron wants to give me lists instead of being normal. What am I supposed to use?
>>
>>103273292
money
>>
>>103273292
Magnum v4 72B
>>
the google colab hag gives me a hard on every time.....
>>
File: 1731529268244600.webm (1.36 MB, 576x566)
>>103268024
>Wanna host? Request access to weights (huggingface login), then run huggingface-cli login in the terminal

iirc this was the issue. It's like saying "Run your media bittorrent-style. Provide your Netflix login to get started!"
>>
>>103272363
He is about to get what he fucking deserves (I will never forgive him for withholding GPT-3 and forcing people to eat shit for 2 fucking years).
>>
>cheap radeon pro v620 32gb on ebay
worth it?
>>
>>103273292
Wait 2 more weeks
>>
>>103274158
More like 2-4 months unless deepseek drops R1.
>>
>>103265207
>https://rentry.org/lmg-lazy-getting-started-guide

I followed this guide and it's still censored.

koboldcpp/Mistral-Nemo-12B-Instruct-2407-Q4_K_M
koboldcpp backend
mistral v3 tekken context and instruct
etc etc, won't do anything uncensored. Should I get a different model, is that what I did wrong?
>>
File: file.png (1.89 MB, 1500x2060)
One day, after the AI goldrush dies completely, some guy at one of the big companies will have too much free compute; he'll just drop some discord rp dataset into the main training, ease off the censoring a little, and we will get a 7B that just gets everything.
>>
>>103274275
Spare compute for sure, but I don't know about that mythical rp dataset, chief
>>
>>103274275
>>103274320
You don't really need it. Poe AI with a good enough prompt on GPT3.5 did some ungodly nasty things with me.
>>
>>103274275
I mean, no that's never ever going to happen.
reasons for this:
1. the dataset when training a model really, really matters. so much so that anthropic created Constitutional AI, which uses another AI to create the dataset. Having a junk dataset really harms the output.
2. Many models have already done this already, take a look at ArliAI, they're not perfect, not by any means.
3. the goldrush will get replaced by something better LLMs are step one, there will likely be better shit in two more weeks.
>>
>>103274366
>ArliAI
>Training Duration: Approximately 3 days on 2x3090Ti
>Epochs: 1 epoch training for minimized repetition sickness
Ah so you just don't know what you are talking about.
>>
>>103274386
yeah and you're the expert clearly, dumbass.
>>
>>103274366
>anthropic's constitutional AI
>clear improvement on the same model every new checkpoint
>meta's SPIN
>benches keep maxxing yet nobody can tell any difference
>>
>>103269328
they literally are. simulating thoughts are thoughts because thoughts are simulation
>>
I am getting a ton of 404s on HF for model cards that were there last week.

Was there a purge or was I just looking in the wrong place?
>>
>>103274623
Yes. That one was removed, but not the other ones. Except that other one.
Just post the fucking links if you want someone checking them for you, retard.
>>
>>103274652
I included links for everything I wanted checked.

ohhhh... I forgot to include links.

Please see below.
>>
>>103274486
>yet nobody can tell any difference
Filtered
>>
https://github.com/danny-avila/LibreChat
has anyone used this?
I'm just looking for a lightweight interface for chatgpt, claude and others
>>
>>103274785
I never heard of it. Most people use SillyTavern.
>>
in ST, how do I set up an author's note that won't force a large portion of my context to be processed again when I either edit it or it gets inserted? currently I've got it at depth 0, insertion frequency 4, and for some reason it's going 6k deep and completely defeating the purpose I'm using it for (summarizing a very long chat that will take over 2 hours to process since I'm on cpu)
>>
File: 58265577_p1.jpg (76 KB, 694x1000)
Hey Magnum anon here?

Thank you for giving me the pointers yesterday. I had problems with Magnum 70b occasionally acting up and sometimes defaulting back to Qwen's assistant behavior, outputting slop or gibberish; then I copied the system prompt exactly from the hugging face page and that changed everything completely.

Apparently having the same system prompt the model was tuned with MASSIVELY reinforces the tuning and makes it abundantly clear to the AI that it is not a helpful and polite AI assistant anymore. Copying and pasting the system prompt from the tuner's page completely changed the model's behavior and erased all traces of the pozzed censorship, bias or purple prose; now the waifus are coherent and wild as fuck.
>>
>>103274902
I think sillytavern only works with API keys
I want to use my regular accounts but with a UI that doesn't take up 700mb of memory for a fresh tab like Claude
to be fair maybe I should look into using APIs directly but I imagine it's more expensive than regular premium for a power user
>>
File: 3625656456.png (2.91 MB, 1280x1418)
>>103275049
Do ask the model tuners to provide the exact prompt and format they were tuning with, it's incredibly powerful.

And if the model tuners are here: attach your system prompts to your model page.
>>
>Qwen2.5
>DeepSeek R1
>Marco-o1
WE BUILD FOR CHINA
>>
>>103275122
China will release them open in order to undermine US companies as long as the US has the lead. The moment China is in the lead they will go closed source.
>>
>>103275139
>The moment China is in the lead they will go closed source.

Then the west would go open. Isn't competition a beautiful thing?

The Cold War was the reason technology developed so quickly back in the 20th century; no competition = no progress.
>>
bfloat16 is a meme
https://arxiv.org/abs//2411.13476
>>
any igpu enjoyer here?
https://www.reddit.com/r/LocalLLaMA/comments/1gheslj/testing_llamacpp_with_intels_xe2_igpu_core_ultra/

should I go for intel or amd?
>>
>>103275056
>regular accounts
At least for Claude, I think people use this:
https://gitgud.io/ahsk/clewd
Which makes a custom proxy that forwards the API calls to the web app. People used a similar method for Slack in the past.
>>
>>103275254
Do the modern integrated GPUs have limited ram allocation?

Can i have a "tpu at home" if i bought a cpu with an igpu and allocated 120 gb of ram to it?
>>
>>103274785
Can it be used with local?
If not then fuck off.
>>
>>103275312
>Can it be used with local?
>If not then fuck off.
>>
>>103275287
afaik ryzen apu can address up to 64gb
https://www.reddit.com/r/LocalLLaMA/comments/1efhqol/comment/lg24yh5/?utm_source=share&utm_medium=mweb3x&utm_name=mweb3xcss&utm_term=1&utm_content=share_button

not sure about intel
>>
>>103275287
As far as I know it doesn't work with AMD.
>>
>>103275312
yes, you gigantic retard, it can be used with local models
I tried running it with docker but had errors with mongodb and i cba
doesn't look bad though
>>
>>103275312
Yeah it can connect to any openai-compatible API which most local LLM servers can do.
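e.g. with the openai python client, all you override is the base url (the port and model name here are assumptions, match them to your backend):

from openai import OpenAI

# koboldcpp, llama.cpp server, tabbyAPI, vllm... anything serving /v1
client = OpenAI(base_url="http://localhost:5001/v1", api_key="not-needed")
resp = client.chat.completions.create(
    model="local-model",  # most local servers ignore or loosely match this
    messages=[{"role": "user", "content": "say hi"}],
)
print(resp.choices[0].message.content)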
>>
>>103275312
God you're so painfully retarded
Do you even listen to yourself?
>>
The "local models are finally very good, but painfully slow" era is much more annoying than I thought it'd be
>>
>>103275413
Xe can't cuz cloud stuff lives rent free in xis head.
>>
I'm getting extremely similar, near-deterministic outputs on every reroll with mistral large 2411 even with 1.2 temp. I have not touched any other samplers/params, it's all vanilla.

Any idea how to fix it?
>>
>>103275602
are you using an old llamacpp variant you didn't update for a while
iirc there was a brief period where sampling wasn't working properly unless you were using the HF loader, so it would act deterministic with any settings
>>
>>103275627
Latest koboldcpp with sillytavern, with default settings (other than temp).
I have been out for quite a while so everything is freshly downloaded.
>>
what's the best gguf of Midnight-Miqu for 24gb? i'm using 70B-v1.5.i1-IQ4_XS (34.6GB) and it's about one word/sec on a 3090ti
can i go to one of the smaller quants (3M/3S/3X/3XXS) without making it crap out too much? or is there a better alternative? i find this model to be smart enough to keep a casual conversation going for a while
>>
>>103275653
>>103275627
Also Q4_K_S from here:
https://huggingface.co/bartowski/Mistral-Large-Instruct-2411-GGUF
>>
>>103275653
do you get gibberish/word salad if you crank the temp to 2.0 with all other samplers off
that's the easiest way to test if sampling is actually working or not (gibberish means it is working)
>>
>>103275602
Frequency Penalty 0.13 and Presence Penalty 0.2 seem to be working for me. I was just cranking both up to 1/1.5 and halving them until Largestral 2411 became less repetitive.
>>
File: working.mp4 (405 KB, 406x468)
>>103275697
Yes, I am getting word salad at 2.
I guess I'll just have to find some better sampler settings.
>>
>>103275763
temp 5 topK 3
>>
>>103275518
Accelerate
>>
>>103275518
Cloud models are smaller than you'd think. Current local models are just too big to justify their performance levels
>>
>>103276223
I think that's true in some cases, but Claude Opus (which most coomers think is the best coom model) is clearly a genuine behemoth based on its slow token generation rate.
>>
>>103276250
You'll never know with cloud models
>Studies show that users associate a lower token generation rate with a higher perceived intelligence of the model
>>
>>103276361
Yeah but in this case Sonnet 3.5 has been their flagship "smart" model for half a year at this point.
>>
File: NorthKoreanMikuKnockoff.png (1.4 MB, 1248x800)
good night, /lmg/
>>
>>103276557
Good night, Miku and friends
>>
File: 1674516896750610.jpg (56 KB, 800x533)
>>103276557
>>
>How could I possibly relax when my body is still humming from what just happened?
>hum
Bros I want to know what the fuck the context is from the data poisoning source. "Shitty erotica" yeah I know but who what when why how exactly is it used when written by a human?
>>
>>103276727
there's a large market for commissioned smut, particularly among furries (who have notoriously high levels of disposable income), and a lot of 'authors' just mass-produce that slop by copying and pasting chunks together and using find-and-replace to add names and pronouns in afterwards; I'm betting a lot of that made it into datasets, along with all the commercial erotica that's probably produced in a similar manner
we need a model trained exclusively on Ao3, ff.net, and maybe some of the quest forums (sb, sv, qq, etc)
>>
File: 1702576289666593.jpg (67 KB, 640x701)
>Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models
> Hallucinations in large language models are a widespread problem, yet the mechanisms behind whether models will hallucinate are poorly understood, limiting our ability to solve this problem. Using sparse autoencoders as an interpretability tool, we discover that a key part of these mechanisms is entity recognition, where the model detects if an entity is one it can recall facts about. Sparse autoencoders uncover meaningful directions in the representation space, these detect whether the model recognizes an entity, e.g. detecting it doesn't know about an athlete or a movie. This suggests that models can have self-knowledge: internal representations about their own capabilities. These directions are causally relevant: capable of steering the model to refuse to answer questions about known entities, or to hallucinate attributes of unknown entities when it would otherwise refuse. We demonstrate that despite the sparse autoencoders being trained on the base model, these directions have a causal effect on the chat model's refusal behavior, suggesting that chat finetuning has repurposed this existing mechanism. Furthermore, we provide an initial exploration into the mechanistic role of these directions in the model, finding that they disrupt the attention of downstream heads that typically move entity attributes to the final token.
https://arxiv.org/abs/2411.14257

>>103276557
>>
>>103274275
7b is coping will need at least a 34b
>>
I got bored with my suno credits and made this with mostly Suno V4 (and some post)
https://voca.ro/1n6LYL5sb8GU
Dedicated to you guys. UwU
>>
File: 1712519454715559.jpg (182 KB, 850x1274)
Poorfag here.

I have a laptop with a Ryzen 5, 8 GB of RAM and no dedicated GPU. I could upgrade the RAM up to 32 GB though. Is that enough to run a local model (for coom reasons) or would it be too slow to be useable?

Pic unrelated.
>>
>none of the local models know about the "bakery" fat ass joke
Does everyone just train on the same CommonCrawl from 2 years ago?
>>
>>103276957
I'm running Cydonia on an R5 5600 and 32GB of DDR4, get about 1t/s until getting really deep into context, would definitely recommend DDR5 if you can get it.
>>
>>103276927
kek
>>
>>103276984
Nothing wrong with that
>>
what's a good, small and performant model (preferably uncensored)?
>>
The age of rasperry starts now. R1-lite is only the first step.
>>
Went back through my recent models, testing each one. I think 70b Hanami is my favorite.
>>
>>103275161
>isn't the competition a beautiful thing?
yes anon, it's really beautiful, without that we wouldn't advance at all
>>
>>103278046
>R1-lite is only the first step
do we know what the size of that thing will be? and are we sure they'll release it locally?
>>
>>103278069
And I think you're piece of shit that only came here to spam Sao's models. Go fuck yourself, asshole.
>>
>>103278167
you think something that's wrong then
I just coomed to it and wanted to share the positivity, schizo
>>
>>103278193
Go buy a fucking ad, asshole. I know you're just about to start spamming that model because you're a fucking shill.
>>
>>103278209
Specifically, I compared it to Nemotron, Magnum, Gemmasutra, and EVA-Qwen. Each of those made frequent errors which demonstrated that they didn't "understand" what was going on. Hanami, on the other hand, would write nice, long progressions of the scene that even made anatomical sense. Not that it was perfect, but I'm definitely going to keep using it for now.
>>
Used CLIP to organize my 4chan and porn folders; the Tkinter GUI and troubleshooting were handled by Qwen-2.5 Coder 32B. I love local models
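for anyone wanting to do the same, the CLIP part is like 20 lines with transformers. rough sketch doing zero-shot labels (the folder names and labels are just examples):

import pathlib, shutil
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")
labels = ["a meme screenshot", "an anime drawing", "a photo of computer hardware"]

for path in pathlib.Path("unsorted").glob("*.jpg"):
    image = Image.open(path).convert("RGB")
    inputs = processor(text=labels, images=image, return_tensors="pt", padding=True)
    with torch.no_grad():
        probs = model(**inputs).logits_per_image.softmax(dim=-1)
    dest = pathlib.Path("sorted") / labels[int(probs.argmax())]
    dest.mkdir(parents=True, exist_ok=True)
    shutil.move(str(path), str(dest / path.name))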
>>
>>103278249
Made up crap that only serves as an excuse to shill because that's how you make money. When the next thread is 90% filled with your shills we're supposed to think it was organic word of mouth, right? Go fuck yourself.
>>
>>103278268
My previous favorite was Euryale. I'd also been using Magnum v2 and v3 off and on. Magnum v4 just sucks every time I try it.
>>
>>103278280
What? Do you want another excuse to keep shilling? Go ahead. Reply to this post. You're leaving money on the table if you don't.
>>
File: facepalm2.jpg (404 KB, 1022x1080)
>DRY just causes the model to intentionally misspell words so it can keep repeating them
>>
>>103278069
Going to try this model, thanks for sharing :)
>>
>>103278305
Eat it up goy.
>>
>>103278292
I'm getting rich here, yeah. Also, I got noticeably fewer llama-isms. Shivers down my spine, breath hot on my ear, eyes gleaming with ____, voices barely above a whisper.
>>
>>103278316
That's good to know. Reply to this post again to tell me more about it.
>>
>>103278159
>and are we sure they'll release it locally?
I mean, they stated they will, and that's the second most reassuring thing they could do
>>
File: Screenshot_215242354.png (19 KB, 300x136)
>>103278321
I'm really running out of things to say, though. Let's see... pic-related is my current system prompt. The warning part was for when Nemotron started being faggy, but I'm sure I could take it out now.
>>
>>103278350
Why am I supposed to care about your system prompt?
>>
>>103278361
It might affect model outputs? I actually haven't tested changing it with Hanami, so I can't be sure. Mostly I was just looking for things to say, which I already mentioned.
>>
>>103278384
So what you're saying is that you would rather do anything else rather than showing how the model actually writes? That's quite concerning...
>>
>>103278394
yeah, if he doesn't want to show the output that means that the model is ass, that's probably a shill
>>
>>103278394
It's a pain in the ass to show logs. I tend to use my own name, which I'd want to change. I also tend to tweak things to fit my fetishes, so a lot of the final replies aren't pure machine output (more like 95% model, 5% human).
>>
>>103278410
>schizo bambling
yeah definitely a shill
>>
>>103278423
bambling isn't a word
>>
>>103278427
That's your opinion, shill.
>>
>>103278435
kek
>>
>>103278410
This is the way to use LLMs. Stronger models will lift heavier, but in the end there's not a single one that can give you what you want perfectly. Back when I used Claude Opus I had to wrangle pretty hard too
>>
>>103278435
no, that's objective
You seem to have trouble thinking logically. Like tranny-tier in that words don't have meanings except in their use as rhetoric. Are you a tranny by chance?
>>
>>103278462
>You seem to have trouble thinking logically.
says the shill who wants us to try his model based on "trust me bro" evidence
>>
>>103278441
You basically have to narrate at least some of the other character's actions to steer them in the right direction or simply make things make logical sense. You can do so in a hinting, indirect way sometimes and it has the intended effect.
>>
>>103278467
>who want us to try his model based on a "trust me bro" evidence
This is certified /lmg/ hood classic.
>>
>>103278468
A problem I keep running into is female characters gradually becoming more aroused from an activity that does not involve genital stimulation, until they eventually just magically orgasm out of nowhere.
Like, no, it doesn't work that way.
I have to narrate "{{char}}'s hand finds its way into her panties" or something to make the whole thing make sense.
>>
>>103278441
>I don't know how to prompt and I have to cope by writing my own outputs
>>
>>103278503
That's a sloptune issue, or prompt issue, or both desu. The current sloptune datasets are like 70% smut, most of which were generated from sex-mode jailbroken claude
>>
>>103278515
>I don't know how to cope -
Right.
>>
Holy fuck, the buy-an-ad schizo is having a meltie
>>
>>103278497
true, I remember the L2 era with the endless finetunes, downloaded so many models it destroyed my SSD :'(
>>
>>103278525
Mistral Nemo Instruct does it.
>>
>>103278525
>>103278598
Mixtral Instruct also did it.
>>
>>103278503
Also, if it's a femdom character, she'll just order me to cum while my cock is not being stimulated in any way.
Like, no, it doesn't work that way.
>>
>>103278503
You have no idea how women work.
>>
>>103278619
Neither do you.
>>
>>103278441
I have like five roleplays that haven't progressed in months. I pick a model and run it through each of them to see what it says, then focus on one to autistically iterate on until I'm finished. Repeat with the next model.
>>
>>103278619
A great many women factually can't even orgasm WITH genital stimulation, let alone without it.
>>
Fixed my gpu crashing, we're so back... to running quanted garbage because 24gb isn't worth shit in this vram-inflated llm economy
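For perspective on why 24GB feels worthless now, napkin math on GGUF weight sizes. The bits-per-weight numbers are the rough figures usually quoted for llama.cpp quants, and this ignores KV cache and runtime overhead entirely:

[code]
# Approximate bits per weight for common llama.cpp quant types
BPW = {"Q8_0": 8.5, "Q6_K": 6.6, "Q5_K_M": 5.7, "Q4_K_M": 4.85, "Q2_K": 2.6}

def weight_gib(params_billions: float, quant: str) -> float:
    """Rough size of the weights alone, in GiB."""
    return params_billions * 1e9 * BPW[quant] / 8 / 2**30

for quant in BPW:
    print(f"70B @ {quant}: {weight_gib(70, quant):5.1f} GiB")
# 70B lands near 40 GiB at Q4_K_M and only dips under 24 GiB
# around Q2_K, i.e. the "quanted garbage" territory. A ~30B model
# at Q4_K_M (~18 GiB) is what actually fits with room for context.
[/code]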
>>
>>103278646
it'll get better during the 5090 era, I'm surprised that Nvidia went for 32GB, that's a lot when you know how stingy they are with their VRAM
>>
>>103278267
>folders
accept hydrus tags as your lord and savior
>>
>>103278467
I ignore people who post logs. It's always some 2-message garbage where the AI character says and does like ten things without any input from the player. They're totally useless as evidence of how it will perform, which speaks to the intellect of those who post them and/or want them.
>>
>>103278658
The devil is in the details. The 5090 will be like 4-5 slots and eat 600W, with maybe the option of power-limiting it to 450W without losing too much performance. All of that at $2k+ most likely.
Even building a small 96GB VRAM rig with three of them is going to be a pain in the ass and scaling them beyond that will be even harder.
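Napkin math on that rig, taking the speculated numbers above at face value (none of this is confirmed spec):

[code]
cards, vram_gb = 3, 32     # 32 GB per card is the rumored 5090 spec
tdp_w, limited_w = 600, 450

print(cards * vram_gb)     # 96   -> total VRAM in GB
print(cards * tdp_w)       # 1800 -> stock draw in W, GPUs alone
print(cards * limited_w)   # 1350 -> power-limited draw in W

# 1800 W of GPUs already equals the full rating of a standard US
# 15 A / 120 V circuit, before counting CPU, drives, or PSU losses,
# which is why power-limiting stops being optional past two cards.
[/code]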
>>
>>103278658
Yeah but just like >>103278703 said, it'll be overpriced, power-hungry garbage
I'm a student, not a consoomer
>>
>>103278810
>>103278810
>>103278810
>>
>>103269445
I'm just looking at mistral large 2407 in the screencap, it almost loses to qwen 32B
>>
>>103278687
I can look for more complex concepts with CLIP, I just wish there was a bigger model (1-2B instead of the 400M OpenAI CLIP)
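If anyone else wants to try it, concept search over an image folder is short with the Hugging Face transformers CLIP classes. A minimal sketch, assuming the standard CLIPModel/CLIPProcessor API; the embed_images/search helper names are my own, and clip-vit-large-patch14 is the ~400M model in question (a larger open_clip checkpoint slots in the same way):

[code]
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

NAME = "openai/clip-vit-large-patch14"  # the ~400M-param OpenAI CLIP
model = CLIPModel.from_pretrained(NAME).eval()
processor = CLIPProcessor.from_pretrained(NAME)

@torch.no_grad()
def embed_images(paths: list[str]) -> torch.Tensor:
    """Embed image files into unit-norm CLIP vectors."""
    images = [Image.open(p).convert("RGB") for p in paths]
    inputs = processor(images=images, return_tensors="pt")
    feats = model.get_image_features(**inputs)
    return feats / feats.norm(dim=-1, keepdim=True)

@torch.no_grad()
def search(query: str, paths: list[str], feats: torch.Tensor, k: int = 5):
    """Rank images by cosine similarity to a free-text concept."""
    inputs = processor(text=[query], return_tensors="pt", padding=True)
    q = model.get_text_features(**inputs)
    q = q / q.norm(dim=-1, keepdim=True)
    scores = (feats @ q.T).squeeze(-1)
    top = scores.topk(min(k, len(paths)))
    return [(paths[int(i)], float(s)) for s, i in zip(top.values, top.indices)]
[/code]

For a big folder you'd embed once, cache the tensor, and only run the text side per query.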
>>
>>103278867
Yea, qwen2.5 is really smart. Mistral large writes better though.
>>
>>103278658
I really look forward to AMD's next cards.
If they keep putting more VRAM on them, someone is going to make them work for AI eventually.
Nvidia has the advantage for now, but it won't be long.
>>
File: 1731887883476718.png (996 KB, 1760x746)
996 KB
996 KB PNG
>>103279374
>I really look forward to AMD next cards.
anon, AMD was a company made just so that Nvidia wouldn't be sued for being an antitrust monopoly
>>
>>103279411
How the fuck is corporate collusion a sign that we're living in a simulation?
Fuck Twitter for giving double-digit midwits a soapbox to speak on
And fuck you for posting it here and making me read it
>>
>>103279452
keep coping retard, AMD isn't gonna save you, its only role is to save Nvidia
>>
>>103272363
The only thing he needed to do was keep open-sourcing GPT models. That would prevent others from wasting billions on training new models and allow for improvements to the GPT models, guaranteeing a monopoly.
For a jew, he is a massive retard.
>>
>>103279535
Your idea is worse. If he open-sourced GPT-3 and GPT-4, their competitors would just take and finetune their models and provide cheaper alternative platforms since they did not have to invest in training their own models.
>>
>>103279563
this, OpenAI managed to hold a monopoly for almost 2 years because they decided to keep the secret sauce to themselves, but that's over now, other companies can train their models better than them, oh well, RIP in peace bozo you won't be missed
>>
>>103279563
That's the point. His competitors would stay on GPT and wait for OpenAI to release new GPTs.
Effectively murdering any competition.
Today, there would be no Claude or Gemini. A few years of monopoly is nothing in the long run, and they could have licensed them the same way Epic licenses Unreal Engine, making billions easily without even running their models and wasting a shit ton of money on that as they do now.
>>
>>103279601
>That's the point. His competitors would stay on GPT and wait for OpenAI to release new GPTs.
>Effectively murdering any competition.
why? their competitors would continue the pretraining or finetune their GPT models in a way that beats OpenAI; doing that would even make it easier for them
>>
>>103279615
>why? their competitors would continue the pretraining or finetune their GPT models in a way that beats OpenAI; doing that would even make it easier for them
And? They would be forced to open-source their models and pay money to OpenAI after x amount of revenue.
OpenAI's massive losses don't come from training models, they come from running their models.
>>
>>103279631
>They would be forced to open-source their models and pay money to OpenAI after x amount of revenue.
If they make the license too restrictive, it's the same as keeping them closed source. Their competitors will be forced to train their own models. All open-sourcing them would do is make us happy and make it easier for their competition to catch up, because they can just look at what OpenAI did in their latest models and use the same techniques themselves.
>OpenAI's massive losses don't come from training models, they come from running their models.
Bullshit.
>>
>>103279652
>If they make the license too restrictive, it's the same as keeping them closed source
There’s nothing restrictive about requiring people to pay after a certain point. Companies would gladly spend tens of millions of dollars on OpenAI’s GPTs rather than billions to train and operate their own models, which would cost even more.
Why do you think Microsoft or Apple aren’t spending billions to develop their own models? It’s because they essentially own OpenAI’s models. However, if competition overtakes OpenAI, they could easily turn to Claude or Google instead, and that would be the end of OAI.

Dominance over the market should always come first.
>>
>>103279740
>Why do you think Microsoft or Apple aren’t spending billions to develop their own models?
>>103268360
>It’s because they essentially own OpenAI’s models.
Microsoft* essentially owns OpenAI's models. Apple had to rely on OpenAI because they had nothing of their own. They recognize this is a problem, and are planning to train their own by next year.
Which is what everyone would do if OpenAI licensed their model weights to everyone with fees for corporate usage.
They would just be giving their competition a stop-gap until they had their own models ready.
>>
>>103276927
heh
>>
>>103279803
>They recognize this is a problem, and are planning to train their own by next year.
They already trained smaller models and they performed terribly. It will take a few years before they reach anything similar to the current level of OAI. They don't even have the infrastructure for it.
At best, they will use upcoming llamas, and at worst continue using OAI for some time and then switch away.
>>
>>103279535
There's a certain irony to the fact that his antics are likely in part what led to our current era of French and Chinese models and the west basically eating shit
Remember he didn't just close off the weights - he closed off the research after GPT-3 instruct too. There's a lot of shit we could have learned about much earlier than we did. Instead, he decided to burn everyone to try to get a slight lead in a race that was always going to be his to lose anyway
>>
>>103279944
I don't think he cares at this point, he made 40 billion by changing his company's structure kek


