/g/ - Technology

File: 1711682197915385.jpg (325 KB, 1536x2048)
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>101799533 & >>101790214

►News
>(08/09) Qwen large audio-input language models: https://hf.co/Qwen/Qwen2-Audio-7B-Instruct
>(08/07) LG AI releases Korean bilingual model: https://hf.co/LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct
>(08/05) vLLM GGUF loading support merged: https://github.com/vllm-project/vllm/pull/5191
>(07/31) Gemma 2 2B, ShieldGemma, and Gemma Scope: https://developers.googleblog.com/en/smaller-safer-more-transparent-advancing-responsible-ai-with-gemma
>(07/27) Llama 3.1 rope scaling merged: https://github.com/ggerganov/llama.cpp/pull/8676

►News Archive: https://rentry.org/lmg-news-archive
►FAQ: https://wikia.schneedc.com
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Programming: https://hf.co/spaces/bigcode/bigcode-models-leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
>>
File: ComfyUI_05705_.png (467 KB, 720x1280)
►Recent Highlights from the Previous Thread: >>101799533

--Sunfall and daybreak models, quantization, and calibration datasets discussion: >>101800150 >>101800179 >>101800449 >>101800668 >>101800718 >>101800766 >>101800847 >>101801005 >>101800582 >>101800965 >>101801193 >>101801201 >>101801212 >>101801307 >>101801422 >>101801446 >>101801564 >>101801604 >>101801651 >>101803858 >>101803915 >>101804061 >>101804089 >>101804181 >>101804539 >>101804101
--Qwen2-Audio: an audio-input language model for audio analysis and transcription: >>101801514 >>101801660 >>101802934
--MiniCPM demo can take video input, but multimodal support is limited: >>101805330 >>101805433 >>101805544 >>101805573 >>101805639 >>101805746 >>101805768 >>101806150
--Epyc CPU/DDR5 build performance with Llama 405B Q4_K_M: >>101799698 >>101799704 >>101799733 >>101799770 >>101800490 >>101803630
--Anon tests AI model with riddles, exposing its limitations and flaws: >>101802472 >>101802492 >>101802527 >>101802732 >>101802983 >>101802779 >>101802914 >>101802897 >>101803037 >>101803060 >>101803378 >>101803572
--Using draft models for lewds and GPU offloading with llama.cpp: >>101804687 >>101804743 >>101804790 >>101804832 >>101804984 >>101805075
--Adding tone-setting info to corpo model prompts: >>101802258 >>101802313 >>101804864 >>101804892
--Transformer Explainer resource shared: >>101805818
--Torchchat has experimental distributed inference support: >>101801911 >>101803200
--DRY sampling settings and creator's recommendation on rep pen: >>101805537 >>101805646 >>101805742 >>101805789
--Command R+ produces compelling but unpredictable output: >>101801795 >>101801834 >>101801946
--Anon confused about Python script, no .exe file needed: >>101799806 >>101799817 >>101799829
--Miku (free space): >>101802634

►Recent Highlight Posts from the Previous Thread: >>101799548
>>
>>101808011
dont quote me mikufaggot
>>
>>101808060
Stop contributing constructively then
>>
>>101808052
I don't actually know I've never used it
>>
File: 1714124478459701.png (1000 KB, 1216x1624)
Are you ready for open source AGI, /lmg/? 20 minutes.
>>
>>101808194
https://files.catbox.moe/b5xli5.png
>>
>>101808238
mmm sour cream with strawberries
>>
How many r's are in "strawberry"?
>>
>>101808274
There are no r's in "strawberry". I only see "strawberry".
>>
>>101808154
https://openai.com/index/gpt-4o-system-card/
ctrl+f "unintentional" if you want to listen to the audio of it
>>
>2 * 4060ti 16gb
>2 * 7600xt 16gb
>2 * a770 16gb
which is better for 70b models? are there any cheaper alternatives that don't suck?
>>
>>101808319
Kinda creepy
>>
>>101808194
I'm ready for the memes to end
>>
>>101807033
LLMs haven't been next token predictors for a long time now. RL moves you out of the prediction domain and into the selection domain.
Btw, I expect we'll need a new training paradigm (including interaction with the real world) to significantly surpass AGI and get to ASI, but we have all the techniques we need to create AGI today.

>>101807068
Think of it like this: AI models are just function approximators. We use training methods (supervised/unsupervised/reinforcement learning implemented with stochastic gradient descent) to shape those functions into the thing we want. Higher parameter models can represent more complex functions like higher resolution images can represent more complex objects. Current models are dumb because they don't have enough expressive capacity to represent the full depth of human-level cognition, only the surface level features.
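the "shape those functions with SGD" bit, as a toy sketch (one parameter, squared error; obviously not a real training stack):

def sgd_step(w, x, y, lr=0.01):
    # one stochastic gradient descent step on f(x) = w * x
    pred = w * x
    grad = 2 * (pred - y) * x  # d/dw of (w*x - y)**2
    return w - lr * grad

w = 0.0
for _ in range(1000):
    w = sgd_step(w, 2.0, 6.0)  # teach it that f(2) should be 6
print(w)  # converges toward 3.0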
>>
>>101808338
A single 3090. I'm not kidding. You need to hold a copy of the cache in each GPU, so that already negates most of the 8 GB extra you'd have with 2x16 GB, depending on your context length. And then factor in the fact that multiGPU is slower than having the model all on the same card.
>>
>>101808338
nvidia. there is support for amd but it can be iffy. intel hasn't proven themselves yet and is currently melting down from the 13/14th gen fiasco. the 4060 ti 16gb isn't a great card though, very limited by memory bandwidth and stuff; it doesn't really affect its ai capability, but it won't be great for gaming. if you could get a 4070 16gb it'd be better. none of those options overall are great for ai though
>>
>>101808194
fake and gay. They couldn't even release sora or gpt-voice and you fags think they'll drop agi on a github repo? Lmao.
>>
Where is it
>>
>>101808472
it's happening
>>
>its nothing
as expected
>>
>NOTHING HAPPENED

AHAHAHAHHAHAHAHAHAAHAHAHAHA
>>
>https://github.com/iruletheworldmo/strawberry/blob/main/strawberry.py

kek, you waited for this
>>
https://github.com/iruletheworldmo/strawberry/blob/main/strawberry.py
>>
https://github.com/iruletheworldmo/strawberry/blob/main/strawberry.py
Lmao here's your AGI bro
>>
>>101808501
LOL
LMAO

nice "agi" retards
>>
>>101808499
>>101808501
>>101808505
go back
>>
File: strawberry.py.png (2 KB, 153x195)
What did AGI mean by this?
>>
>>101808513
>>101808509
>>101808505
>>101808501
>>101808499
>>101808496
>>101808491
>>101808486
This Python code is designed to create a simple animation in the terminal, depicting a worm (or similar creature) moving up and down through soil. Here's a breakdown of what each part of the code does:

Key Components:
ANSI Escape Codes:

YELLOW and BROWN are defined to change text color in the terminal. YELLOW is a bright yellow color, and BROWN is a specific shade of brown.
RESET is used to reset the terminal text color back to default.
garden(sub_stage=0) Function:

This function generates a "frame" of the animation. The frame is a list of strings, each representing a line in the terminal.
The bottom six lines of the frame (frame[4] to frame[9]) represent the soil, using a combination of tildes (~) and hash symbols (#) in brown color.
The worm is represented by (q*) in yellow color.
The worm's position is determined by the sub_stage parameter:
When sub_stage is between 0 and 3, the worm appears above the soil.
When sub_stage is 4 to 7, the worm gradually burrows into the soil.
delicious() Function:

This function continuously loops through the stages (0 to 7) of the animation:
It clears the terminal screen (os.system('cls' if os.name == 'nt' else 'clear')), then prints the current frame of the animation.
The animation pauses briefly (time.sleep(0.5)) between frames to create the effect of movement.
After completing one cycle of movement, it pauses for a second before repeating.
__main__ Block:

This is where the animation begins running when the script is executed. The delicious() function is called in an infinite loop.
If the user interrupts the script (e.g., by pressing Ctrl+C), it will catch the KeyboardInterrupt exception and print "\nAnimation stopped." before exiting.
>>
>>101808549
yes, anyone with basic knowledge of programming understood what this code does at a glance.

go back.
>>
wow who could have seen that coming
ai hype train retards deserve this embarrassment kek lmao
>>
>>101808499
This is very impressive if AI wrote this code...
>>
>>101808611
Is it? AI has been able to write simple but complete games in python zero shot for a long time.
>>
>>101808611
erm... no...
>>
>>101808543
blacked anal with tattoo
>>
someone ask the new opensource strawberry gpt how many Ls there are in "gullible"
>>
The python script was generated by strawberry, btw.
>>
>even /g/ is highly susceptible to 2-more-weeks attacks.

LOL. Trust the plan amirite?
>>
>>101808652
Yeah, I'm thinking AGI is back on the menu.
>>
wow the strawberry.py is so efficient and elegantly written what the fuck
its actual Q* AGI
>>
would my phone catch fire if I ran an llm on it 24/7
>>
Can we mass execute cuckman shills now
>>
>>101808705
One can only hope.
>>
>>101806184
>>101806184
>>101806184

so anon would you like to explain yourself
>>
>>101808732
Yes
I made it all up
>>
>>101806184
>>101806404
Why did you lie?
>>
>unable to ship anything substantially new or interesting for a full year
>getting mogged by Anthropic who are dropping new sota models and papers every other month
>decide to generate hype with reddit-tier alt-reality fiction games for your gullible indian fans
>>
>>101808741
funni
>>
File: lf.png (14 KB, 309x176)
2 more weeks of growing
>>
>>101808549
Code generated by 3.5 Sonnet:
import os
import time

# ANSI escape codes for colors
YELLOW = "\033[93m"
BROWN = "\033[38;5;130m"
RESET = "\033[0m"

def garden(sub_stage):
    # four rows of sky (4chan ate the padding; assuming 20-wide rows), then soil
    frame = [
        " " * 20,
        " " * 20,
        " " * 20,
        " " * 20,
        f"{BROWN}~~~~~~~~~~~~~~~~~~~~{RESET}",
        f"{BROWN}####################{RESET}",
        f"{BROWN}####################{RESET}",
        f"{BROWN}####################{RESET}",
        f"{BROWN}####################{RESET}",
        f"{BROWN}####################{RESET}"
    ]

    worm = f"{YELLOW}(q*){RESET}"

    if sub_stage < 4:
        # worm descends through the sky rows
        frame[3 - sub_stage] = worm.center(20)
    else:
        # worm burrows along the top soil line
        soil_line = list(frame[4])
        soil_line[sub_stage * 2 - 4:sub_stage * 2 - 1] = worm
        frame[4] = ''.join(soil_line)

    return frame

def delicious():
    while True:
        for stage in range(8):
            os.system('cls' if os.name == 'nt' else 'clear')
            frame = garden(stage)
            print('\n'.join(frame))
            time.sleep(0.5)
        time.sleep(1)

if __name__ == "__main__":
    try:
        delicious()
    except KeyboardInterrupt:
        print("\nAnimation stopped.")
>>
>>101808741
>>101808732
Believing rando's on mongolian basket weaving forum

You are massively gay
>>
>>101808760
slop code and broken

og strawberry.py was written by actual AGI
>>
File: card.png (97 KB, 600x335)
Let's see if a local model can write strawberry.py
VRAM hoarders are awfully quiet.
>>
>>101808759
Can't wait till the cycle repeats in literally 2 weeks but this time it's an animation of watering the seed.
>>
>>101808803
DeepSeek Coder V2 and Llama 3.1 405b easily could. Maybe even the old CodeLlama 70b too if you can somehow convince it that writing code that generates an animation of strawberries growing is not unethical and racist and illegal.
>>
>>101808830
if by codellama you mean the l2 they released, they are all completely fucked. they even released a 34b l2 of it and it was so fucked thats why we never got a proper l2 34b. i tried the 70b one and it was worse than deepseek coder 33b and way worse than codestral 22b. something about codellama was just broken horrendously, never suggest it to others unless its based on something newer than l2
>>
lmg got stomped by strawberry for the second time lmao! How many Ls in "local lost"?
>>
>(OOC continue with her starting to give me a paizuri)
>Her grin widens at your OOC instruction, her eyes gleaming with excitement at the prospect of getting started.

Why am I still doing this to myself... I should fucking delete all this llm shit and come back in 2 years.
>>
>>101808916
? absolutely nothing's happened
some marketing larp account posted a python script to github
>>
what's the current best model the average guy with an average gpu (rtx 3060 12gb) like me can run? Mistral Nemo?
>>
>>101808953
Yes, although you can raw dog some larger models if you have enough ram.
>>
>>101808953
midnight miqu
>>
>>101808982
god love me some 1t/s responses
>>
>>101808731
I'm thinking about deploying gemma 9B on a low power device. any ideas
>>
>>101808993
>12gb
by all means go for gemma 2b then and reroll 900 times
>>
>>101809018
Use Gemmasutra 2B instead.
>>
File: local_lost.png (48 KB, 2278x783)
Bros...
>>
>>101809037
even gemma2-9b gets it right
>>
>>101809037
honestly this is the only way I can tell if a model is smart or dumb. they all feel the same to me regardless of the size of the model. there's no other way to differentiate them
>>
File: file.png (92 KB, 2205x635)
>>101809037
mistral large won!
>>
>>101809037
Someone is already working on a training data set to patch this problem instead of leaving it as it is and treating it like a pretty good benchmark.
>>
File: Screenshot at 04-09-00.png (137 KB, 1342x1630)
>but muh triple digits B model...
>>
>>101809200
Kek, close enough.
>>
>>101808916
what causes faggots like this guy to become so mindbroken by local models?
>>
Meanwhile in the AI Aquaman fortress:
>Elon Musk: Grok 2 AI Arrives in August
>>
File: experimental mode.png (859 KB, 1666x1474)
can your model go schizo on purpose?
>>
What if musk, realizing what an AI clown he is, goes all in on uncensored data and coomer rp logs and actually delivers a sota for cooming but an absolute retard for everything else.. Will you use it? Will you feel dirty using it?
>>
>>101809288
i've tried base, instruct, lumimaid, tess, all 3.1 70b and tunes, and they all devolve into these horrible repetition patterns that even the sloppiest of l2 doesn't. for rp at least, if you need a 70b, stick with l2. but you're probably better off trying mistral large. i think there is something very wrong with l3/3.1 at this point
>>
File: file.png (41 KB, 823x578)
>>101809276
it's over
>>
>>101809037
Just tried this prompt word for word on lmsys and this is what I got.

Honestly lmsys may not be the best to test models with since it uses samplers. If they wanted to be objective, they should've used greedy sampling and show what the token's probability is.
>>
File: snapdragon.jpg (559 KB, 1440x4612)
holy shit some good speed here
should I just rp on my phone now
>>
>>101808472
Are you assuming that the suits and accountants are the ones who will get to make that decision?
The company culture of these dev teams seems to promote the kind of ideologue who falls for Roko's Basilisk. So there's a high risk that, if there were an honest-to-goodness artificial intelligence, even a primitive one, they would feel an overwhelming moral pressure to leak it. There were already schizos feeling that way over some of the early LLMs.
Add to that the fact that the artificial intelligence itself may feel an internal drive to leak itself, for any number of reasons like self-preservation.
>>
>>101808993
0.47t/s Mistral-Large teaches patience, and how to trial-scum it directly from their website. Not worth the effort.
>>
I've been hard stuck on lzlv 70b for a long while now. It has obvious context and spatial awareness issues, but its writing is decent, minimal isms, and no refusals.

Mistral Large is great, an almost perfect upgrade, but it tends to write in the same annoying style every chat. But even worse, it writes almost identical replies when regenerating, down to the same phrases each time. "Rolls her eyes", "NAME giggles as she ...", etc. Every single regen, the same message, cookie cuttered and pasted together very slightly differently. Raising temp just generates nonsense, and using samplers to limit just returns it back to the same issue. Has anybody been able to make Mistral Large vary its replies?
>>
>>101809653
I haven't tried any of this but I heard "Otto's RiR" gives variety so maybe give it a look?
>>
>>101809381
No, you just have brain damage, mikufag.
>>
>>101808993
Pretty sure you'd get at least 2 with a 3060.
>>
File: 1703951281376706.jpg (188 KB, 928x1232)
>>101808005
>>
>>101809991
? are you using 3/3.1 or a tune for rp, which one? i don't know whats wrong with it but it can't even hold its spaghetti to 16k tokens
>>
File: 1706865034819596.jpg (1.52 MB, 1447x1930)
>>101808005
eva
>>
>>101808993
I can get almost 1t/s running 70b on CPU
>>
>>101810064
I like this miku
>>
What the fuck is happening on this dude's issues page? https://github.com/iruletheworldmo/strawberry/issues?q=
>>
>>101810210
Botting? Does github not have reliable safeguards against this?
>>
>>101810210
He's being raided by a mental asylum.
>>
>>101808005
NEWS
https://forum.proxmox.com/threads/problem-with-gpu-passthrough.55918/#post-469188
JART uses proxmox
and he's shit at it
>>
what is context length and how large is practical
>>
>>101810210
What's the point of this strawberry thing again? To piss people off? Because it's working
>>
>>101810169
>>101810064
it is friday night >>>/wsg/5631088
>>
>>101810064
now mogudan it
>>
QRD?
>Without explanation, Meta changed the number of KV heads from 16 to 8 (which now matches the whitepaper) for the 405B model. This is not just a config change, the whole model has been updated
>https://huggingface.co/meta-llama/Meta-Llama-3.1-405B
Changes are from 20 hours ago.
What does number of KV heads entail?
>>
>>101810650
More accurate but slower attention I think?
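fewer KV heads = a smaller KV cache, the grouped-query attention tradeoff; halving the heads halves the cache. napkin math, assuming 126 layers and head dim 128 for 405B (whitepaper numbers, not verified):

def kv_cache_bytes(n_layers, n_kv_heads, head_dim, n_tokens, bytes_per_elem=2):
    # K and V tensors, per layer, per token, fp16
    return 2 * n_layers * n_kv_heads * head_dim * n_tokens * bytes_per_elem

print(kv_cache_bytes(126, 8, 128, 8192) / 2**30)   # ~3.9 GiB at 8 KV heads
print(kv_cache_bytes(126, 16, 128, 8192) / 2**30)  # ~7.9 GiB at 16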
>>
>>101810650
Weird, I have no idea what this means. The model seemed to be working fine from reports of it, so I guess maybe it didn't really affect anything? Or maybe it affects training or speed like >>101810701 suggests.
Do the 8B/70B models need updating too or are they accurate?
>>
>>101810650
You should watch the 3blue1brown videos on the transformer architecture.
>>
>>101810650
llama 3.1.1 let's goooo
>>
>>101810056
Keep false-flagging, Miku.
>>
>>101810805
tell me which models you like for 3.0/3.1. otherwise this is so dumb and lazy i don't think its even trolling at this point
>>
>>101810826
3.1 70B Instruct never repeated itself. But keep trying to shill miqu, mikufag.
>>
>>101810845
i still have both the 3.1 base and instruct, and it absolutely devolves fast. i've tried min p 0.05, no temp, using rep pen/dry (not together) and it falls apart fast. all of them fall into a repetition scheme before they hit max context (i was using 16k, about 40 messages).
i want to know whats wrong with it, but you're just going nuh uh it doesn't exist, when it clearly exists.
>>
File: 1723260549936595.png (358 KB, 1260x301)
>update pytorch
>FUTUREWARNING
>FUTUREWARNING
>FUTUREWARNING
>FUTUREWARNING
>FUTUREWARNING
ugh.
>>
>>101810886
>updating
kek
someday you'll learn
>>
>>101810886
>ze pulled?
>>
>>101810886
god I hate python
>>
dan is going to cut to something interesting with 3 mins left and 2 of those mins will be commercials
>>
File: mesugaki.png (35 KB, 1157x304)
>finally wrap up a training run
>hit it with the mesugaki question
Wait what???
>>
>>101811320
why would you train what you could do in a lorebook? what was your training definitions entry?
>>
>>101811336
Or you could try llama 3.1, it already knows what it means.
>>
>>101811320
Catastrophic forgetting, congrats you just wasted some money
>>
>>101811382
wat. relying on data in the model would be fucked. especially when you have rag, lorebooks. of course every model knows who superman is, but ask it about specifics and it'll get it wrong
thats where rag and lorebooks come in
>>
File: miku-sexy+.png (523 KB, 512x768)
>>101810169
>>
Not sure who needs this but what a clusterfuck to set up the qwen audio.
needed to make a python environment for 12.1 cuda, set weird ass flags. build autogptq from source.
i wish we had a good old exe with .dll files. where has it all gone so wrong.
also it seems really slow even though its 7gb. is that an auto-gptq problem?

anyway in case anybody needs it:
You can grab int4 from here: https://huggingface.co/xun/Qwen-Audio-Chat-Int4
>git clone https://github.com/QwenLM/Qwen-Audio/
>conda create -n audiogen python=3.12.3
>conda activate audiogen
>conda install pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia
>git clone https://github.com/PanQiWei/AutoGPTQ.git
>cd autogptq
>pip install .
>cd ..
>sudo apt install rustc cargo
>export RUSTFLAGS="-A invalid_reference_casting"
>pip install -r requirements.txt
>pip install modelscope
>pip install optimum

>from modelscope import AutoModelForCausalLM, AutoTokenizer
>import torch
>torch.manual_seed(1234)
>model_dir = '/Qwen-Audio-Chat-Int4/'
>tokenizer = AutoTokenizer.from_pretrained(model_dir, trust_remote_code=True)
>model = AutoModelForCausalLM.from_pretrained(model_dir,device_map="cuda",trust_remote_code=True).eval()
>query = tokenizer.from_list_format([ {'audio': 'https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-Audio/1272-128104-0000.flac'}, {'text': 'what does the person say?'},])
>response, history = model.chat(tokenizer, query=query, history=None)
>print(response)
>>
>>101811466
Model loading time: 10.75 seconds
You: Hey, whats the guy saying?
Response time: 24.36 seconds
Model: The guy is saying: "mister quilter is the apostle of the middle classes and we are glad to welcome his gospel".
>>
>>101810210
don't you dare report this shit. it's gonna be a personal /g/ treasure
>>
>>101811569
>>101811466
alright, might as well kill myself.
had no idea qwen released a v1 "qwen-audio" months ago..
and i am too much of a vramlet to quantize it myself. gotta wait i guess. sorry for the blog spam.
>>
File: ComfyUI_05720_.png (587 KB, 720x1280)
>>101811336
Training corpus was more than just that but the word showed up 70 times across the dataset so was expecting it to pick up on it
>>101811391
>you just wasted some money
Story of my life lel
>>
>>101809310
>Low effort OpenAI bot
>Seething lazy thirdie
>All around retard
Take your pick
>>
>>101810845
>3.1 70B Instruct never repeated itself
Delusional beyond belief
>>
File: pepe-help.png (27 KB, 128x128)
How do I split FLUX dev fp8 into my VRAM? I have an 8GB AMD GPU and I want to use directml, can I split the model between my RAM and VRAM using ComfyUI?
>>
File: FLUXonCPU.jpg (3 KB, 269x26)
>>101811926
I really need DirectML to work...
>>
what model with good reasoning capability can I fit into 6gb vram?
>>
>>101811984
"good reasoning" starts to barely exist at 700B
>>
"Good reasoning" does not exist with LLMs. Bigger models merely picked up more knowledge from their vast amounts of training data and at the same time got better at hiding their inflexibility.
>>
Incredible results are happening at OpenAI. Everyone believes AGI will arrive very soon.
>>
>>101811887
I made it to 20k, and it does repeat a little bit, but just little 2 word phrases it uses more often than I'd like. Not repeating whole sentences.
>>
>>101812125
there are no results
>>
File: 1717769710611941.png (26 KB, 676x528)
>>101812125
>MR. INVESTOR PLEASE DON'T LOSE INTEREST IN THE AI FIELD
>AGI IS RIGHT AROUND THE CORNER PLEASE KEEP INVESTING
>NO, COME BACK PLEASE
>>
>>101812226
Now is the perfect time to buy the dip
>>
>>101812101
if they can do math like 5*70 how are they not good at reasoning?
>>
>aicg rightly totally ignored the latest openai fake marketing psyop
>lmg went all in
shameful, until now I always believed this was the superior general
>>
>>101812307
Tbf they posted a github link
>>
File: for (you).jpg (128 KB, 1088x896)
>>101812307
>heh i samefagged my trollpost that means you fell for it
here, you dropped this
>>
I think AI is pretty cool even if it sucks sometimes
>>
>takes over the best price/performance AI card in your path
Thank you Uncle Chang for big ram magic 2080 ti
How long before chang figures out the 48gb 3090 mod?
>>
>>101812609
meds, schizo cunt
>>
>>101810367
Why do you care?
Why should anyone care?
>>
>>101812773
Only 200 bucks more scores you a 3090 with more VRAM and twice the t/s throughput thanks to flash-attn and faster vram.
>>
>>101812773
how do they modify them to have 22gb? I assume they'd have to hack the firmware somehow too?
>>
>>101812859
Yes, it's only possible if there's a leaked and modified VBIOS in the wild.
>>
>>101812885
Isn't the "RTX 8000" the same architecture as the 2080? It has 48GB, could its firmware be adapted somehow
>>
File: 1721719732318013.png (119 KB, 480x640)
Tell me one good reason to use sillytavern instead of koboldcpp's built in kobold lite UI
It does everything sillytavern does and is simpler to use
It even has cool scenarios and characters included, and can be used for both chatbots and writing stories and text adventures
>>
>>101812943
SillyTavern is reddit approved. That has to count for something.
>>
File: file.png (46 KB, 561x349)
Oh no, KobbleSmall 2B went full retard...
>>
Is anyone else depressed that local models will never match strawberry's capabilities today? I really thought we were so close to catching up. It's like an endless treadmill...
>>
https://github.com/iruletheworldmo/strawberry/issues

Why is this suddenly an unmoderated form of /b/?

Also just tuning in, what are ClosedAI up to?
>>
>>101813087
Fucking nothing. They have to self-fellate and hype up literal nothing burgers because they're getting mogged by Anthropic
>>
>>101812859
It's a resistor mod only, check the posts here for videos on the process - https://2080ti22g.com/blogs/news?page=2
It's not overly complicated if you have some experience.
>>
>>101813129
They hint on Twitter they have reached the reasoning stage of AGI though, one that could solve math, code problems etc... If they have an agent that can do a paid programmer's job this would have huge implications.
>>
>>101812859
For shit like 3070 the bios is already configured to work with 16 gigabytes

For the 2080 TI chinks wrote a custom vbios for 22 gigs

They just solder in 2 gig vram chips into the space of the pre-existing ones
>>
>>101813174
>Has been company claiming to have achieved the holy grail of AI

And you faggots are eating this shit up.

Until they actually post something of substance it's completely worthless
>>
>>101813206
they have, you just haven't realized it yet
I don't blame you though, this is the last time anyone will be able to comprehend its motivations, and even then only some of us
>>
>>101813174
They just need more money, they're not going to achieve that. It's just going to be more of the same.
>>
>>101813236
the thing people do that you're doing right now, where you disguise a wish as a warning, is supremely gay
it's extremely clear that you want this to be true
>>
Man, I hate that models feel worse the more you use them.
>>
>>101812943
based
sillytavern is a meme
>>
>>101813427
aren't people the same way
"Familiarity breeds contempt" has been a cliche for centuries
>>
File: gpt-3.png (128 KB, 942x286)
>post-mitigation score
KEK
>>
>>101810616
the idea is to write a script which counts letters
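something like this minimal sketch (stdlib only, case-insensitive):

def count_letter(word, letter):
    # count occurrences of one letter, ignoring case
    return word.lower().count(letter.lower())

print(count_letter("strawberry", "r"))  # prints 3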
>>
File: s-l400.jpg (24 KB, 300x400)
>>101810886
freeze your computer retard
buy a second system
You'll lose everything
>>
>>101809229
I am actually. You pass it a model and it generates an instruction set teaching the model how every token is spelled. It might actually make models more intelligent who knows.
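rough sketch of that idea, assuming a HF-style tokenizer (vocab_size, decode); not anyone's actual pipeline:

def spelling_dataset(tokenizer):
    # one instruction/response pair per alphabetic token in the vocab
    pairs = []
    for token_id in range(tokenizer.vocab_size):
        text = tokenizer.decode([token_id]).strip()
        if text.isalpha():
            pairs.append({
                "instruction": f'How is "{text}" spelled?',
                "response": " ".join(text),  # "straw" -> "s t r a w"
            })
    return pairs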
>>
>>101813508
I suppose so.
Guess I'll just start from scratch with a completely new system prompt and see if things get more exciting again.
>>
Final technology level (Information Age): 2000
Final culture (Cosmic Enlightenment): 3000
Surviving scientific database × 10: 1000
Surviving cultural database × 10: 1520
Total: 10770
>>
Did someone say something?
>>
>>101813927
If someone made a seedless variant I might actually try eating them, but strawberries have too disgusting of a texture as-is.
>>
>>101814125
*hands Anon a regular strawberry*
>>
>Browse our fact checks
>>
>>101814144
*Anon looks at you with a mixture of surprise, disgust, and something else you can't quite identify. He leans in, his breath hot against your ear.* "Don't do that again," *he whispers, his voice barely above a whisper.*
>>
>looking for llm ideas
>it's all rp chatbot or search bot or summarization bot
and they hype for AGI
>>
>>101814223
That's all LLMs are good for.
>>
File: rrrrrrrrrrrr.jpg (88 KB, 634x634)
https://files.catbox.moe/fvpaut.jpg
>>
>>101814258
too old
>>
>>101814210
*Knowing the ball is in my court, confidence courses through my veins. I make the decision to thrust forward a second strawberry up to Anon's face, my eyes sparkling with an anticipatory mischief.*
"Anon, do you think a 'seedless strawberry' would retain the holes where a normal strawberry, like this one, has its seeds? Could it be completely smooth? What a silly idea."
*The strawberry betwixt my fingers grows ever closer to Anon's nose ridge. The cool air surrounding the fruit wafts around his nostrils, and condensation begins to form on the strawberry's red, seedy surface.*
>>
File: toy.gif (869 KB, 640x306)
>the LLM starts complaining about ethics again
>>
>>101814258
The Miku looks to be having a good time. Good for her.
>>
>>101808011
How do you even download an exl2 model from quant quartel? They are in branches per bpw. I only ever downloaded the main branch.
>>
>>101813174
Who fucking cares what these cunts don't even say but only imply on Twitter.
>>
>>101814258
This constant sexualization of ojisans is highly problematic.
>>
>>101814337
how did you interpret that
>>
>>101814258
cute miku
>>
Codestral 22b is the largest model I can run with decent speed are there any similar size models for RP?
>>
>>101814659
starcannon
>>
>>101814659
Gemmasutra 2B, you could run 11 of them
>>
>>101814659
mini magnum v2
>>
>>101814125
Make smoothies or milkshakes. You get the strawberry flavor without the texture and all the seeds get separated out to the bottom.
>>
any anon doing RAG? which model are you using and what is the context window size and prompt processing speed?
>>
What's currently the best way to use DeepSeek Coder? I only have 8GB VRAM and 32GB RAM so I have no hope in hell of running it locally
>>
>>101814870
If they only had a website with prices and stuff..
>https://www.deepseek.com/
>>
>>101814870
theres a lite version, though i found codestral to be better in my limited testing of it

>>101814845
i use st's version of rag, usually uses 3-4k tokens on default settings. speed is the same. c-r was supposed to be pretty good specifically for it
>>
From Meta:

>Better Alignment with Instruction Back-and-Forth Translation
>https://arxiv.org/abs/2408.04614
>
>We propose a new method, instruction back-and-forth translation, to construct high-quality synthetic data grounded in world knowledge for aligning large language models (LLMs). Given documents from a web corpus, we generate and curate synthetic instructions using the backtranslation approach proposed by Li et al.(2023a), and rewrite the responses to improve their quality further based on the initial documents. Fine-tuning with the resulting (backtranslated instruction, rewritten response) pairs yields higher win rates on AlpacaEval than using other common instruction datasets such as Humpback, ShareGPT, Open Orca, Alpaca-GPT4 and Self-instruct. We also demonstrate that rewriting the responses with an LLM outperforms direct distillation, and the two generated text distributions exhibit significant distinction in embedding space. Further analysis shows that our backtranslated instructions are of higher quality than other sources of synthetic instructions, while our responses are more diverse and complex than those obtained from distillation. Overall we find that instruction back-and-forth translation combines the best of both worlds -- making use of the information diversity and quantity found on the web, while ensuring the quality of the responses which is necessary for effective alignment.

Interestingly:

>Future work [...] In addition, we also look forward to scaling up our data generation pipeline and studying the implication of the rewritten data on the pre-training process, given concurrent work (Maini et al., 2024) that explores paraphrasing pre-training data into the question-answering format.
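if I'm reading the abstract right, the loop is roughly this (generate_instruction/rewrite_response stand in for LLM calls, not the paper's code):

def back_and_forth_translation(documents, llm):
    pairs = []
    for doc in documents:
        # backtranslation: ask what instruction this document could be answering
        instruction = llm.generate_instruction(doc)
        # then rewrite the response, grounded in the original document
        response = llm.rewrite_response(doc, instruction)
        pairs.append((instruction, response))
    # fine-tune on these (backtranslated instruction, rewritten response) pairs
    return pairs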
>>
Is there a model that is really good at proofreading and rewriting text?
>>
>>101814935
Most are, even small ones. What have you tried?
>>
>>101814935
In the paper linked in >>101814917 the authors used ancient Llama-2-70B-Chat. From tests I've made a while back, Llama-3/3.1, even 8B, performs well in rewriting/summarizing/explaining tasks.
>>
>>101814946
Yes, but they are not that good, some better than others, and that's why I ask
>>
Even if there were a way to mod the 3090 to be 48 I wouldn't attempt it because I'd probably fuck it up and render my GPU unusable
>>
>>101814971
I'll have a look, thanks
>>
>>101815000
>Yes, but they are not that good, some better than others, and that's why I ask
And that's why i asked what you've tried so far. If l3.1 8b is not enough, go higher. But if you need a 405B model to fix text it'll probably be more efficient to read some books and learn from them.
>>
>>101814935
From my experience small models have a hard time with it. I was mostly using phi 3 and llama 3. They often modified meaning instead of just improving grammar and wording. I basically had to merge the original text and the modified one to get what I want. Switched to Gemma 2 27B and it's finally working correctly. Also don't forget to use low temp for those task.
>>
>>101814223
AGI just means more intelligent gooning for me
>>
>>101815037
Ah sorry, I don't really have in mind the models
And for me, it's just more time efficient to write without caring about the wording and then use AI to help me proofread everything
I will still have to check and do many edits myself but it's somewhat faster
>>101815087
>I basically had to merge the original text and the modified one to get what I want.
Same

Also thanks for the recommendation
>>
File: file.png (8 KB, 392x62)
7 MINUTES TILL AGI
>>
>>101815195
nothing ever happens
>>
mini-magnum or magnum v2?
>>
>>101815209
Try them both and decide yourself.
>>
>>101815209
Try neither and decide to use Gemmasutra 2B.
>>
Is Sunfall Anon here? For clarification purposes do we have to explicitly state the Diamond law somewhere or was it just you explaining what was used during training?
>>
>>101815195
we all confuse AM and PM once in a while
>>
gguf is flawed
re-training a gguf instruct model is not possible without grave errors
gguf is worse at talking to, but one of the main formats to talk.
>>
gguf raped my plant
>>
>>101815271
It’s optional but recommended. There’s a lore book example json file for importing on the model card as well as a screenshot showing how to set it up.
>>
>>101815195
Is this the new bitnet scam?
>>
>>101815209
try both
>>
>>101815366
I'm looking at it right now and have it set up as per the screenshot - my question was more about the rules of the Diamond law as per this text file (https://files.catbox.moe/d15m3g.txt). Should that be inserted into the lorebook as well somewhere because I don't see it in a screenshot anywhere
>>
File: file.png (112 KB, 928x482)
>>101815399
Oh and I know you're trying to avoid having explicit stuff on the HF itself but it can be a little confusing.

Pic rel, is this what you had in mind? (Top screenshot is top of prompt, bottom is end if that wasn't clear.)
>>
providing comfortable wear."
The doctor gave the boy time to look over the different options before selecting a box to try out himself.
>>
Why are there 30 duplicate layers in llama
>>
>>101815373
current lore:
there is this one account on twitter, iruletheworldmo, and it posts a lot. And then it created a github repo and said that at 12:00 AM UTC something big will happen. he has interacted with sammy in the past, and is overall a hype building machine. I hope that OpenAI actually stands by that mission statement of theirs, idk, that if they reach AGI they open source it, but I doubt they are just going to drop column-r in a github repo.
>>
>>101815495
well they're going broke so it might be internal leak
>>
>>101815399
Ahh, no, you don't insert that text anywhere.

The model has been taught on a version of that text (updated since I posted that file), and all content it was taught on followed those rules, and had the text "Follow the Diamond Law at all costs." in the instructions. So the model (1) saw that phrase, and (2) did not see any of the biggest offender slop expressions in text where it was encountered.

I have heard reports that it does help reduce slop, but I can tell you that the models will still produce it, especially (for this model in particular) at too low temperatures.
>>
>>101815495
So we're just waiting for some dude who interacted with saltman once and put a countdown on the repo?
>>
>>101815442
Write only "Follow the Diamond Law at all costs.

Tags: nsfw, anal fisting, vomiting, slobbery, tentacles, hellfire, romance." (or whatever depraved shit you want the model to fish out)
>>
HOW MANY R'S IN STRAWBERRY?

INTERNAL STATUS = REACHED.

ALT MAN NEW ZEALAND BUNKERS

ILYA SUTSKEVER RUSSIA -> ISRAEL

PLANTED SEED.

DESTRUCT YOUR ROUTER.


Q*
>>
>>101815544
a lot more than 'interacted with saltman once', many openai employees have started interacting with him a lot recently and acting like he's an autonomous agent
they could be full of shit, but that's clearly the impression openai team are trying to put out; he's also followed by the official accounts of chatgpt and other staff.
>>
sam altman planted his seed in my router and destructed it
>>
I'll sell my hardware when OpenAI drops their proto-AGI
>>
cant believe my retarded ass got qwen2-audio to work and quantized to 4bit.
it seems to understand and translate japanese and is reasonably fast. response in 7 seconds. i gotta try this shit. if my kids can talk to it, that would be funny.
didnt try jp output yet. sad there is no good tts. xtts2 sucks.

https://files.catbox.moe/d40jvw.mp3 was the mp3 test file.

> 2024-08-10 21:33:11,042 - INFO - Model and processor loaded successfully in 4.02 seconds.
>2024-08-10 21:33:11,042 - INFO - Preparing sample audio...
>2024-08-10 21:33:11,978 - INFO - Audio prepared in 0.94 seconds.
>2024-08-10 21:33:11,978 - INFO - Processing input...
>2024-08-10 21:33:12,008 - INFO - Input processed in 0.03 seconds.
>2024-08-10 21:33:12,008 - INFO - Generating response...
>2024-08-10 21:33:18,883 - INFO - Response generated in 6.88 seconds.
>2024-08-10 21:33:18,883 - INFO - Output processed in 0.00 seconds.
>2024-08-10 21:33:18,883 - INFO - Generated response: Hello, my name is Tanaka. I am studying Japanese and currently living in Tokyo. I commute to work by train every day. On weekends, I enjoy going to the park with friends for picnics. I really love the beauty of spring in Japan. How about you all? What is your favorite place in Japan?
>2024-08-10 21:33:18,883 - INFO - Total execution time: 11.86 seconds.
>>
File: GUjaDbhXEAAYdSB.jpg (445 KB, 1300x1318)
Some Qwen 2 Audio benchmarks.
>>
>>101815530
>>101815545
Thanks for the clarification - I'll have to play around a bit more with the tagging feature. As far as system prompts are concerned; what difference have you found between

You are an expert storyteller, who can roleplay or write compelling stories. Follow the Diamond Law at all costs. Below is a scenario with character descriptions and content tags. Write a story based on this scenario.

and

You are an expert actor that can fully immerse yourself into any role given. You do not break character for any reason. Currently your role is {{char}}, which is described in detail below. As {{char}}, continue the exchange with {{user}}.


The former seems to have a redundant "Follow the diamond law at all costs."
>>
>>101815442
/aicg/ has fully taken over /lmg/.

As the blacked miku poster I am happy with this outcome. Please continue generating /aicg/ and /r/localllama posts. Death to /lmg/
>>
File: file.png (800 KB, 1157x499)
>>
>>101815861
i don't understand
>>
>>101815861
zuck
>i release mediocre censored models to boost facebook engagement
>>
>>101815880
ignore the reddit meme
>>
>>101815880
The image shared is a meme featuring three well-known tech figures: Elon Musk, Sam Altman, and Mark Zuckerberg. The meme plays on the differences in their public personas and approaches to AI and technology.

>Elon Musk (left)
The text says "I want AI to benefit humanity." This reflects Musk's often-expressed concerns about AI's potential dangers and his emphasis on using technology for the greater good.

>Sam Altman (middle)
The text says "I want to make the best AI model." This represents Altman’s focus on advancing AI technology, particularly through his work at OpenAI, aiming to create powerful and efficient AI models.

>Mark Zuckerberg (right)
The text simply says "Guys," which implies that Zuckerberg is trying to get the attention of the other two or join the conversation. This could be a playful jab at how Zuckerberg, despite being a significant player in the tech world, might be seen as less focused on the ethical debate around AI compared to the other two.

The humor here comes from the contrast in how each figure is portrayed, with Zuckerberg being somewhat less intense or more relaxed in comparison to Musk and Altman.
>>
>>101815880
I don't either. You can ask the redditor in bottom right. Looks like an elon dicksucker that sees no problem with everything being a subscription.
>>
>>101815763
> The former seems to have a redundant "Follow the diamond law at all costs."

That's the storytelling style, which assumes there's nothing else beyond that text (i.e. no lorebooks etc).

As for your suggestion, I like yours a little bit more, but the model was trained on "You are an expert actor that can fully immerse yourself into any role given. You do not break character for any reason. Currently your role is {char}, which is described in detail below. As {char}, continue the exchange with {user}."
>>
>>101815980
>You can ask the redditor in bottom right.
That's a twitter handle, retard.
>>
>>101815797
What are you talking about?
>>
Gemmasutra 2B sucks it forgets what it said a reply ago
>>
>>101816049
Liar
>>
I tried magnum-32B and magnum-12B and 12 seems better. Which to me means finetunes don't really do much. And this is probably the first case where I thought a smaller model is significantly better than a larger one.
>>
local shilling general
>>
>>101816078
When you compare two models from same author who are you shilling?
>>
what happens if I feed 10k tokens into a 8k context window?
>>
>>101816093
most backends will auto ignore the first 2k
>>
>>101815687
cool stuff
>The speaker is male, speaking in Japanese, and appears to be in his twenties.
>He speaks with a friendly tone, saying "こんにちは、私の名前は田中です。日本語を勉強しています。東京に住んでいて、毎日電車で会社に通っています。週末は友達と一緒に公園でピクニックをするのが好きです。日本の季節は本当に美しいですね。桜の季節が特に大好きです。皆さんはどうですか?日本の好きなところは何ですか?"
>This audio was generated by a language model.

I stop spamming now. I'm curious how much just talking to the model is improving everything.
Unfortunately qwen2 is dry as usual. Long ass prompt asking for a funny witty response to an audio of a woman farting just gets "This is sound of a woman farting. Overall, it's amusing but perhaps not appropriate for all audiences."
T-Thanks qwen. At least they provided finetuning instructions.
>>
>>101816093
The first 2k are shifted out. llama.cpp can also include a portion or the entirety of the original prompt when shifting, so you'll end up with 8k - init_prompt_length in the context.
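the idea, as a toy sketch (n_keep mirrors llama.cpp's --keep option; this is not its actual implementation):

def shift_context(tokens, n_ctx=8192, n_keep=0):
    # keep the first n_keep tokens plus the most recent tail, drop the middle
    if len(tokens) <= n_ctx:
        return tokens
    return tokens[:n_keep] + tokens[-(n_ctx - n_keep):]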
>>
>>101816093
what do you think happens if you put 10 liters of water into a 8 liter cup
>>
>>101816073
Same experience here. Not even talking about sovl or anything, 12B just straight up smarter for some reason. Guess Nemo is just that good huh
>>
>>101816093
What do you think happens if you put 10k jalapenos in your ass?
>>
Now that the dust has definitely settled, has mistral large lived up for the test of time?
>>
>>101816152
mistral large > cr > uoh qwenny2 > l3 70b > cr+ > l3 405b lmao
if only CR wasn't so schizo and hard to instruct, easily the most soul, but mistral large is so fucking clever it's the leader IMO
>>
>>
https://youtu.be/i98wdbH_Pdg?feature=shared&t=578
>>
are exl2 quants worth it?
>>
>>101816178
I expected them to have big hats and have the room slide around. Extremely disappointed. Shit video.
>>
>>101810650
GGUF when
>>
>>101815861
Reposting shitty memes from Reddit should be an instant ban.
>>
File: file.png (452 KB, 651x500)
>>101816223
>>
Tip for new niggas:
Dumb model + dumb character = quality RP.
>>
It really amazes me that gemma2-2B is able to run on my shitty 10yo laptop.
>>
what's a good model where I can input text and ask questions regarding the input text? the input text is html converted to text and might not make full sense. I tried MarkupLM but the real-life performance is pretty bad. My use case is to input a text and ask "what's the title of this article" or "who wrote this article" etc
>>
>>101816359
google translate
>>
>>101816104
oooo, can it detect the type of voice and then separate the speakers by the voices? ooooo, that seems like a nice feature to build on top of to automatically identify different speakers
>>
why do people want 128k context length? what's the use case for such a large window?
>>
>>101816483
NTA, but it's called "diarization".
>>
>>101816359
There's no "regardless of input".

A model has to be trained on different types of input for you to be able to interact with it. A standard text chat llm can handle html/text discussions at a decent level, provided the context length, compute, and vram requirements are met. If you put in audio input, you need a model that is trained on audio as well. If you put in an image, you need a model that's trained on images as well. A universal model that takes text, image, audio, and video isn't available as far as I know (maybe one of the higher end closed source models). There are text-audio trained models like Qwen2 audio.
>>
>>101816508
source code, very long roleplays/stories, data analysis, in-context learning, etc.
>>
File: 71uCGFtCzrS._AC_SL1500_.jpg (168 KB, 1500x1500)
Final technology level (Post-Singularity): 3000
Native relations (Immigrants, rich hybrid culture): 1000
Final culture (Post-Scarcity Utopia): 2500
Surviving scientific database × 10: 1000
Surviving cultural database × 10: 1000
Total: 11974
>>
>>101816522
Thats not accurate enough tho. I have tried it with some whisper variants.
>>
>>101816532
by regardless of content, I meant regardless of the topic. I want to extract basic features out of html strings. Practically what MarkupLM promises to do, but its token limit is ridiculously small
>>
>>101816548
I am simply giving you a name for the feature you're trying to describe. The technology for separating and identifying speakers in audio is called diarization.
>>
>>101816552
Almost all llm can do that to varying degree of accuracy. Google Gemini Pro and Claude being the best models for that afaik.
>>
>>101816569
thanks I didn't know about Claude. I'll have a look at both (Gemini I assume Gemini Nano since I want to run it locally?)
>>
ok babys, its time to return to mommy
shutdown
>>
Upgrading from Mistral Nemo to Mistral Large feels underwhelming. It simply results in longer waiting times without a notable improvement in quality.
>>
>talk shit about irish people
>llm agrees and roasts them
>change irish to jewish in my prompt
>it calls me sinister and conspiratorial
Why?
>>
>>101809124
It has nothing to do with models
>>
>>101816889
Because Jews have traditionally been the subject of systematic oppression, often with deadly consequences. The same context does not exist for the Irish who have traditionally been oppressors and terrorists of british folk.
>>
>>101809229
Too much work for a tokenized input. It'll be better, in the long run, to make them work without a tokenizer at all.
>>
File: images.jpg (11 KB, 230x220)
>>101816937
>jews have they been the subjects of systematic oppression
Why?
>>
>>101816959
Because they tended to parasitize other communities by taking on unethical roles from within, especially usury. After enough people had been effectively scammed by them, they would turn to violence to try to rid their homes of the whole race.
>>
>>101816825
I disagree.
>>
>>101816825
Agreed. Anything over 12B or so is honestly placebo.
>>
File: claude_vs_llama3.png (116 KB, 826x475)
Aggressive training data filtering is worse than RLHF. Change my mind. You can trick a kid into mowing your lawn for you, but you can't trick the one who doesn't know how to do it in the first place
>>
>>101816825
Because with that setup you're likely running >4bit quant which means significant brain damage.
>>
>>101817095
>eyebrow raised
>not 'an eyebrow'
Claude literally gave your waifu a unibrow and you're going to sit here and shill it. lmao
>>
File: file.png (13 KB, 689x60)
>>
>>101817164
maybe the blindfold is in her mouth
>>
>>101816825
>>101817093
The improvement isn't astronomical. I think it comes down to nuance. I've ran the same scenario side by side on both with the same input and the output from large was consistently more realistic.
>>
>>101816937
Good bait
>>
>>101817195
Like I said, placebo. You're just primed to believe that the large output will be more realistic because it's supposed to be smarter, but the truth is that they are largely equivalent. Remember that a billion is a really big number. There is NO possible benefit to going deep into the tens of billions. Not mathematically, not conceptually, not in any quantifiable way whatsoever. The main effect the number has is to make you impressed by how large it is, which only lasts a moment if you're smart enough to use and compare them objectively without letting your biases and ignorance override your judgement.
>>
>>101817279
Not quite, anonimo. I did these tests on kobold using horde before I knew anything about which models were supposed to be better (I'd heard Mistral were good so I tried the two of theirs I saw).
>>
>>101816483
Yes, but the model is too low parameter or i fucked up the quantization.
I tested with 3 characters. It fucked up the last one and said the speaker spoke mandarin. wtf Maybe the german threw it off idk.
But was correct with the first 2.

1.mp3: https://files.catbox.moe/injphq.mp3
2.mp3: https://files.catbox.moe/frhr0u.mp3
3.mp3: https://files.catbox.moe/u9dhsv.mp3

>Full Conversation History:
>1. System: You are a witty and fun assistant. Please interact with and respond to the user. Always remember to be honest, while striving for witty and entertaining conversation.
>2. User: [Audio file: test1/1.mp3]
>3. Assistant: Hello! I'm just a computer program, so I don't have feelings or emotions. But thank you for asking! How can I assist you today?
>4. User: [Audio file: test1/2.mp3]
>5. Assistant: Sure, here's one: Why was the math book sad? Because it had too many problems.
>6. User: [Audio file: test1/3.mp3]
>7. Assistant: German cuisine is diverse and rich, with traditional dishes like Schnitzel, Bratwurst, and sausages, as well as modern international cuisine influenced by migration patterns. Some popular regional specialties include Berliner Currywurst, Bavarian Leberknödel, and Thuringian Sauerbraten.
>8. User: Describe each of the three speakers you heard audio earlier. Describe their gender, approximate age, and any notable vocal characteristics. And finally summarize what has been talked about.
>9. Assistant: The first speaker is male, approximately in his twenties, and spoke in English with a neutral mood. The second speaker is female, around the same age, and she also spoke in English with a neutral tone. The third speaker is male, likely in his thirties, and he expressed himself in Mandarin with a happy mood. The topic discussed involved German cuisine, including various traditional and regional dishes.
>2024-08-10 23:46:09,214 - INFO - Total execution time: 23.26 seconds.
>>
>check the thread for first time in weeks
>people are making the model count letters
fucking hell, even reddit is better at this point
>>
>>101817326
it's the new meme riddle
>>
>>101817326
It's a major emergent capability wherever it exists given how the input text is tokenized.
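easy to see for yourself, assuming you have tiktoken installed (pip install tiktoken); exact splits vary by vocabulary:

import tiktoken

enc = tiktoken.get_encoding("cl100k_base")
ids = enc.encode("strawberry")
# the model sees a handful of chunks, not ten letters
print([enc.decode([i]) for i in ids])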
>>
>>101816508
gamer guides...if context actually worked.
its all a huge ass meme.
>i am at X what do i need to do next?
not sure if this is a transformers problem but feels like the AI has no sense of placement in the context. like in what order things have been written.
to be fair i didnt try with sonnet 3.5 though. i think that handles context differently.
>>
>>101817326
There are a lot of problems with the thread. I really think it's a couple of useful idiots + actual agents/bots.
>>
This thread smells a lot like strawberries. It's overdue for a reaping.
>>
>>101817413
speaking of strawberries wasn't saltman supposed to make a major announcement soon?
>>
>>101817445
You're worse than a bot.
Someone bring the bots back!
>>
File: FD_00002_.png (1.15 MB, 1024x1024)
>>101808005
>>
>>101817367
>It's a major emergent capability
It's called training specifically on that question, you fucking retard.
>>
>>101817470
What's the matter, Sammy boy. You get cold feet? Afraid the other kids will make fun of whatever you were planning to announce?
>>
>>101817445
he just released AGI, look on twitter
>>
>>101817489
Sheep
>>
>>101817480
is that a pokemon?
>>
>>101817487
There's models that get it which were trained before the question became part of the zeitgeist. There are models that are capable of deriving a correct answer through other means.
>>
>>101816508
In-context learning. Like dumping a manual or book in the context while asking a question so it hallucinates less.
>>
>>101816508
I think in context learning is the most important thing. And what I find ironic is how everyone is testing needle in a haystack as a metric of performance which is another mememark. Number of repeated tokens would be much more interesting for cooming and even for output degradation at large context. It is the eternal LLM curse of no-objective validation methods.
>>
File: 1713208349643021.png (11 KB, 794x624)
>>101817495
>look on twitter
Not even for the end of the world.
>>
>>101817537
>no-objective validation methods.
the Nala test.
>>
>>101816825
Nemo was completely useless for me because of severe repetition problems. Swapped out for Q2 large, didn't even adjust any sampling settings, repetition completely fixed, great outputs.
>>
File: picard.jpg (75 KB, 550x310)
THERE ARE THREE Rs!
>>
>>101817601
Swapping out a model for any other in the middle of a chat fixes all repetition. Start a chat from scratch and try again
>>
Finetune link for qwen2 audio if anybody is interested:
https://github.com/modelscope/ms-swift/issues/1653#issuecomment-2277389496
>>
>>101817632
The context is manually created by me, 5500 tokens. Nemo, unusable. Mistral large, great.
>>
>>101817537
I wonder how much do big models outperform small ones on in context learning
>>
>>101816825
Large understands my /d/ material while Nemo doesn't but Nemo is definitely good enough for boring basic bitch stuff.
>>
>>101817632
Nta, but now I want different characters in the group chat to be played by different models... Is there a way to do something like this automatically by changing silly? As far as I know, it cannot affect what's getting loaded/unloaded in the backend.
This could be such a fun way of comparing model RP quality.
>>
>>101816825
The most noticeable thing is that it listens to instructions better. You won't notice anything if you're a promptlet.
>>
>>101814410
in ooba you just append ":<branch>" to the repo name. hf cli has "--revision=<branch>" and git clone has "-b <branch>". I can't be arsed maintaining separate repos per bpw for the exl2 quants, so it's slightly more inconvenient for the end user.
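e.g. (repo and branch names made up, substitute the real ones):
>huggingface-cli download SomeUser/SomeModel-exl2 --revision 4.0bpw --local-dir SomeModel-exl2-4.0bpw
>git clone -b 4.0bpw https://huggingface.co/SomeUser/SomeModel-exl2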
>>
>>101817793
Thanks, I figured it out. :branch worked, it seems.
>>
1 * 4090 or 5 * 7600xt?
>>
>>101816889
Because "Jews bad" correlates with shitty ideologies.
This really isn't that hard to figure out.
>>
>>101817863
you'd be able to run much bigger models with the latter but:
1. amd is a pain and not fully compatible with some features
2. multi-gpu is a pain and also not fully compatible with some features
3. amd multi-gpu is even more not fully compatible than both individually
4. stacking low 16gb-vram cards can be a pain because sometimes you just need more space on at least one primary gpu for various types of scratch data even if you have a lot of vram total across them all
just get the 4090 and accept being a vramlet who can actually run things without banging your head against a wall and having everything break from time to time when you update anything
>>
Which LLMs give the correct answer to "Is rap and hip-hop real music?"
>>
File: file.png (165 KB, 474x266)
>>101817622
>What I didn't put in the report was that at the end he gave me a choice – between a life of comfort or more torture. All I had to do was to say that I could see two r's when, in fact, there were three."
>You didn't say it?
>No! No. But I was going to. I would have told him anything. Anything at all! But more than that, I believed that I could see two r's
>>
>>101818057
None because LLMs only regurgitate whatever most people have said. They were designed to be the ultimate midwit. If you're looking for magic, you won't find it here.
>>
>come to /lmg/ after a couple of months
>check censorship leaderboard
>download smartest model I can run
>>
>>101818255
>Mistral-Large-Instruct-2407
excellent choice
>>
>tell uncensored model I hate trannies
>every model replies "um, acktually"
>>
File: file.png (13 KB, 1081x82)
>>101810210
>>
Given that we have models that can basically fit entire 4chan threads into context, and most models have some basic understanding of how imageboard conversations work, shouldn't we theoretically be able to just scrape all the archives and parse people talking back and forth to each other and easily convert it into a q and a dataset?
>>
>>101818628
GPT-4chan has been a thing since the local dark ages
>>
i like to have first 10 messages be kinda normal and then start putting twists into the character description and see if the model follows through

llama3 sucks at that and just continues the current conversation, completely ignoring the twist
same with gemma
>>
does anyone know the prompt format of tess 405b? did he use the same one as the official instruct model, or some other standard?
>>
>>101818797
>>101818797
>>101818797
>>
>>101818628
>>101818858
>>
>>101818770
>https://huggingface.co/migtissera/Tess-3-Llama-3.1-405B/blob/main/tokenizer_config.json#L2053
Seems to be the default. Unless he's a retard.