/g/ - Technology




File: y-u-no-bake.png (840 KB, 832x1216)
/ldg/ - Local Diffusion General

Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102610271

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/kohya-ss/sd-scripts/tree/sd3

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
Blessed thread of frenship
>>
i can't believe you've done this
>>
I can't believe you used my 1girl as the OP. I did not allow that.
>>
>>102622526
Fair use. I'm going to use it in my 48B model. Any suggested tags?
>>
>
>>
>>102623030
fkey, as that was the lora I used
>>
are there any good non shit ui's out yet? reforge starts bugging out for me during long inpainting sprees and comfyui is comfyui
>>
>>102623379
Invoke
>>
File: 2388284649.png (851 KB, 1152x896)
>>
New model
https://github.com/THUDM/CogView3
>>
>>102623591
no ComfyUI implementation yet desu
>>
File: 3884501246.png (3.71 MB, 1920x1536)
>>
>>102623700
why not today
>>
>pixart pride month over and no new pixart
>>
>>102623591
Quite possibly the worst instructions I've seen (in English, at least) by any recent devs.
Words do not mean the objects they are described as representing.
The Cog group needs to get its shit together, be more organised, and have someone proofread their docs. Skill issue.
>>
>>102623591
>https://github.com/THUDM/CogView3
Chicom spyware
>>
File: 0.jpg (283 KB, 1024x1024)
>>
File: 0.jpg (334 KB, 1024x1024)
>>
File: file.png (1.2 MB, 720x720)
here's an emu3 image I found on /lmg/ kek
https://github.com/baaivision/Emu3
>>
https://github.com/ToTheBeginning/PuLID/commit/eb4004cfcc4c7611c9cc56a68c190754b9d03df1
>release PuLID-FLUX-v0.9.1 model in 2024.10
we'll get a better pulid model for flux soon, nice, but we still can't make it work on ComfyUI goddamn :(
>>
>>102624182
kek
>>
How to run Flux.1-dev Upscaler ControlNet on comfy-ui?
>>
>
>>
>>102624435
what are you stuck on?
>>
>>102622356
A rare no collage edition
>>
File: bComfyUI_114620_.jpg (1.37 MB, 3072x1536)
there a mech lora yet?
>>
Genned 1500 or so images overnight. Let's go see if any are worth posting
>>
>>
bigma status?
>>
File: bComfyUI_122235_.jpg (305 KB, 768x1152)
>>102625486
post the shit ones too sometimes they are funny
>>
>>102625535
Our time will come a-gain
>>
>>102625446
that looks pretty good desu
>>
>>102625536
ok
>>
File: bComfyUI_114626_.jpg (1.44 MB, 3072x1536)
>>102625651
yeah but it's like 1 out of 10 gens that'll look alright and not have fucked up proportions or 20+ weapons on it making it look goofy
>>
>>102625659
Neat
>>
>>102625659
kinda looks like a mannequin
>>
>>102625446
>>102625690
Kino as heck my nigger
>>
>>
>>102625530
booooba
>>
I wonder if Next Token with VQ VAE with Transformers is the real future. Get off this diffusion noise bus.
>>
>>102626417
does this next token architecture thing means we won't need a text encoder anymore because it's inside the architecture already?
>>
>>102626437
No, you would still use the text encoder; the difference is that you predict images as tokens instead of denoising. You also get multi-modal stuff built in, like "change this from blue to red" edits or "describe this image" visual captioning, without major changes to the architecture, because everything is done in tokens.
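A minimal sketch of that prediction loop (hypothetical module names, greedy decoding for brevity; real samplers use top-k/temperature, and this is not any particular model's code):

import torch

def generate_image_tokens(transformer, text_tokens, n_image_tokens=1024):
    # text tokens condition the sequence; image tokens are predicted one
    # at a time, then a VQ decoder turns them back into pixels
    seq = text_tokens  # (1, T) conditioning prefix
    for _ in range(n_image_tokens):
        logits = transformer(seq)  # (1, len(seq), vocab_size)
        next_tok = logits[:, -1].argmax(-1, keepdim=True)  # greedy pick
        seq = torch.cat([seq, next_tok], dim=1)
    return seq[:, text_tokens.shape[1]:]  # keep only the image tokens

# image = vq_decoder(generate_image_tokens(model, encoded_prompt))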
>>
>>102626475
>you predict images by tokens instead of denoising.
I think this is a big deal when you think about it, the denoising process always was some hack because we don't know how to solve the diffusion equation problem, going for tokens resolves that issue and we won't have to deal with millions of cope samplers kek
>>
>>102626519
> won't have to deal with millions of cope samplers kek
>tfw ive been stockholm-syndromed into enjoying the choice of different samplers
>>
File: 0.jpg (420 KB, 1024x1024)
>>
>>102626406
so true
>>
Any new text-to-video processes that are better than Comfy+AnimateDiff? They're so short for me, can't seem to get more than 16-64 frames in my current workflow.
>>
>>102627225
nice

>>102627035
good shit man
>>
>>102622356
shoulder to hip ratio says "cock and balls behind the curtain"
>>
>>102627413
that makes it even better
>>
>>102627463
It gives it a twist, but I dunno what to think about those tits now
>>
>>102627413
I just jerked off too...
>>
>>102627252
CogVideoX?
>>
>>102627505
PS OR, how about
>fake, ceremonial tits that adhere to skin under the bra etc
The priestess is a man, b/c god requires a bride to run the temple or something. Not far from some bullshit that actually happened in Syria.
>>
>>102627529
haven't tried it, loading it up now, thanks!
>>
File: 0.jpg (188 KB, 1024x1024)
>>102627337
>>
slow monday
>>
>>102628101
post some gens
>>
>>102628119
You
>>
File: file.png (204 KB, 3571x837)
https://huggingface.co/alibaba-pai/CogVideoX-Fun-V1.1-5b-Pose/blob/main/README_en.md
looks like they updated CogVideoX-Fun
>>
>>102628122
i'll do it if u do it
>>
>>102628155
Good 4 u
>>
im scared
>>
>>102628172
why
>>
>>102628228
aliens are talking to me
>>
>>102628414
Tell 'em to STFU.
>>
File: 0.jpg (136 KB, 1024x1024)
>>
>>
>no new toys for anon
>>
>>102629146
he barely played with the toys he currently has, spoiled little shit
>>
Why doesn't llama.cpp statically link rocm? And why isn't there a binary version so I don't have to shit up my local install with amd's notoriously vile amdgpu scripts?
>>
>>102629478
sorry, wrong thread
>>
/lmg/ won
>>
>ComfyUI uses a web interface
>browsers are vram pigs
>>
>>102629511
can't you disable that?
>>
File: catbox_rmia1w.jpg (1.28 MB, 1728x1344)
>>102629146
illustriousxl
https://files.catbox.moe/2vtome.png
>>
>>102629723
that's really nice ty for catbox
>>
>>102629734
https://civitai.com/models/811067/illustrious-xl-smoothft
the specific checkpoint
and this lora seems to generically increase quality at low power but im not married to it
https://civitai.com/models/798443/some-style-for-illustrious-xl
>>
>>102629511
I've found the manager node (essential really) kills FPS as well. It's bullshit.
>>
>>102629797
What's the point of the new model if the base quality is so bad that you need a lora to fix it?
>>
File: 72601.jpg (490 KB, 1944x1504)
something something big bara tiddies
>>
>>102629882
enhance the bulge
>>
>>102629865
the base model is underbaked but only just so, and it's new enough that we're in the early schizo phase of plugging in different tech sorcery to see what works
>>
File: pic.jpg (970 KB, 1430x1800)
>>
>>102629949
what happened
>>
>>102627225
What model/lora(s) is this?
>>
>>102630143
flux dev nf4.

here's the workflow:
https://files.catbox.moe/mhpv01.png
and this is the image I used for img2img:
https://files.catbox.moe/q9iu2m.png

and this post explains the workflow somewhat: >>102610322
>>
>>102630204
can you post more pics from your batch you did last night
>>
>>102630242
ok maybe a few more
>>
>>
>>102630204
So good
>>
>>
>>
>>102629954
My cat ran over my keyboard.
>>
>>102630295
>>102630287
>>102630274
cool stuff man, some of em give me a nostalgic vibe from msn messenger webcam days
>>
anyway now I'm revisiting an old prompt idea that never quite turns out the way I want it to—but the idea itself is stuck in my head, so occasionally I return to it. Someday I'll figure out how to prompt what I'm looking for
>>
honk mimimimimi...... hoonk... mimimimimi
>>
File: 0.jpg (200 KB, 1024x1024)
>>
File: 0.jpg (277 KB, 1024x1024)
>>
>>102629949
I've seen her before
>>
>still like two years from doing this locally
Why live
>>
>>102630922
erm, what the sigma?
>>
>>102630922
nice, catbox?
>>
>>102630922
anon
>>
>>
>>
how do we make stuff similar to Nijijourney using local? I see all these images on X and it makes me so jealous
>>
>>102630331
I never thought of it as nostalgia, but maybe I'm just 100 years old now and out of touch. I guess most people's images haven't been that shitty in over a decade.
>>
>>102631223
for some reason it just reminded me of the shitty webcams of that period. the good times in msn add threads on /b/ from 2005-2008 that were popular back then.
>>
>>102622356
What model is this? Flux? Catbox?
>>
>>102631181
Loras are the closest you'll get and they're still piss poor imitations. Iktf though anon, iktf
>>
>>102631181
Ikr, Midjourney Niji can make beautiful drawings, I love their shadows; that's the model that can truly measure up to good artists and doesn't look like AI slop
>>
File: file.png (1.54 MB, 1024x1024)
https://reddit.com/r/StableDiffusion/comments/1ft9kjw/cogview3plus3b_really_great_prompt_comprehension/
>CogView-3Plus-3B
Yeah... it doesn't look great at all
>>
>>102630988
https://files.catbox.moe/uoq7kk.jpeg
>>102631044
Ye?
>>
>>102631644
>https://files.catbox.moe/uoq7kk.jpeg
kek i should have expected that
>>
>>102631538
pitiful
>>
>>102631538
>that plastic look
the chinks trained their model with a shit ton of AI pictures innit?
>>
File: ComfyUI_21616_.png (1.82 MB, 1040x1520)
SD3 kind ok sometimes
no bumchin in site at least
>>
>>102631764
wow how did i misspell sight lol oops
>>
>>102631538
where do you even actually download this shit
>>
>>102631710
it kinda looks like maybe it needs a different sampler. Hunyuan is like that, it's absolutely horrendous dogshit at photographic gens with like Euler SGM Uniform and shit, but if you switch to SDE-based stuff it's 1000x better
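In ComfyUI that's just the sampler dropdown on the KSampler node; for reference, the equivalent swap in diffusers would look something like this (a sketch only; the model id is just an example, and the SDE scheduler needs torchsde installed):

from diffusers import StableDiffusionXLPipeline, DPMSolverSDEScheduler

pipe = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0")  # example model id
# swap the default scheduler for an SDE-based one
pipe.scheduler = DPMSolverSDEScheduler.from_config(pipe.scheduler.config)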
>>
>>102631788
https://github.com/THUDM/CogView3
>>
So where's the demo?
>>
File: file.png (1.66 MB, 850x1170)
>>102631764
>kind ok sometimes
>>
>>102631826
there's none, and now it's obvious why, it looks like shit
>>
>>102631811
definitely operator error. the examples on the github aren't anything to write home about but they're not as bad as that post
>>
>>102631844
>the examples on the github
maybe they're from the regular base model and not the "Plus" one? Idk what's the difference between the 2 of them lol
>>
>>102629506
In thread faggotry competition.
>>
>>102631827
I think this is my fault DESU, I had the denoise too high during hi-res fix; looking at it, the original gen didn't have the double finger. I didn't even catch that though lol
>>
File: 00337.png (553 KB, 576x1024)
>>
File: 00343.png (452 KB, 576x1024)
>>
>>102630204
>enormous boobs titcow. freshman year!! my friend min jee park, ugh she is the worst roommate so messy, but her body wow! ugly butterface korean. I want make sex to her chubby cleavage and huge fat booty we go to ucla together, hourglass figure hotness holy cow. first year at UCLA in the Dykstra Hall dorm. She's probably going to get into kappa delta which is honestly perfect for her. the tube top is aerie and the cheeky shorts are abercrombie
kek
>>
File: IMG_0340.jpg (1.03 MB, 1125x1367)
>>102631684
I have no idea what else you could have been asking for
>>
>>102632232
i was just joking but it gave me a laugh, nice cock bro

>>102632214
so this is prompt engineering
>>
File: 00350.png (473 KB, 576x1024)
>>
>>102632232
Prompt?
>>
right now I'm getting some very fake-looking women because my prompt is too ambiguous and I can see it yo-yoing back and forth from artstation slop style to anime to 3D digital art to photograph etc as it gens, usually eventually settling on a photograph but sometimes ending up some kind of photoshopped hybrid thing

>>102632214
the thing about a good base model is that you can say a few things that you know work and then spend the other 80% of the prompt trying shit that probably doesn't do much, but you hope it will—in other words you can just have fun with it, and sometimes it just werks

>>102632178
nice, we love some plump homely gals don't we
>>
>>102632645
to be clear, I don't "like" this style and wasn't aiming for it, but I do find it neat as a natural consequence of denoising being performed in steps with the prompt's ambiguity being resolved at each step, each time potentially in a different direction than before.
>>
File: 0.jpg (148 KB, 1024x1024)
>>
>>
>>102632822
face is a little munged, but i like the style
>>
File: 00353.png (628 KB, 768x1024)
>>102632645
>plump homely gals
Indeed
>>
>>102632546
>a hospital tray on a grubby tile counter. on the tray, a staten island charcuterie board.
(It’s a real photo sorry)
>>
>>102632853
the face only looks funny because of the video game lighting on the nose. It's a quirk of the style. Other than that it's too small to see if there's anything wrong with it, I think it looks basically fine
>>
File: 00356.png (644 KB, 768x1024)
>>
>>
File: file.png (117 KB, 256x256)
All I do is train now. Now I want to try training a VQ-VAE. Also training a 1B Pixart 16-channel VAE model. Now that the rumor is the 5090 will have 32 GB of VRAM (maybe 48 GB for a Titan RTX AI), I'm pretty hyped.
>>
>>102631764
it can look better than that even
>>
File: 00370.png (463 KB, 1024x768)
>>
>>102633221
>VQ-VAE
What's that?
>>
dead general
>>
hibernation mode
>>
>>102630922
>two years
Too optimistic
>>
File: 00379.png (539 KB, 576x1024)
>>
File: 00380.png (567 KB, 576x1024)
>>
Is there anything for comfy that will let me view checkpoint / lora info saved in a txt document? It's annoying trying to figure out which loras have activation texts and what those are
>>
>>102634191
I just keep a list in Obsidian.
>>
https://pytorch.org/blog/pytorch-native-architecture-optimization/
will this work on gguf quants? that looks interesting
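From that blog, usage looks roughly like this (a hedged sketch; exact names may shift between torchao versions). It rewrites regular PyTorch weights in place, which is also why it can't consume GGUF files as-is:

import torch
from torchao.quantization import quantize_, int8_weight_only

model = torch.nn.Sequential(torch.nn.Linear(4096, 4096)).cuda()
quantize_(model, int8_weight_only())  # swaps Linear weights to int8 in place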
>>
Official pixart bigma waiting room
>>
5090 waiting room
>>
I'm calling it now. The 5090 will draw 1200 watts.
>>
To purchase a 5090, you will require a notarized letter from a master electrician attesting your intent to comply with local, state, and federal guidelines on the installation of industrial equipment.
>>
To purchase a 5090, you will be required to read the nda aloud, while the ai assesses the authenticity of your performance.
>>
>>102634588
there was a leak that was saying that the 5090 would be 32gb + 600W
>>
>>102634618
Sounds underpowered, hopefully that's the 5080
>>
File: 00002-2584379894.png (1.52 MB, 1152x1632)
>>
cough
>>
>>102634466
This is basically bitsandbytes made by Pytorch with NIH syndrome. Since that doesn't play nice with GGUFs, there is no reason this will either until someone works on it.
>>
The 5090 will be the first card designed during covid, won't it be?
>>
so are we gonna migrate back to /sdg/ or what? this is a little too dead for me
>>
>>
>>102635048
0/10, better luck next time.
>>
>>102635048
I'd rather one post per month than deal with the insufferable avatarfags desu
>>
>>102635048
I downloaded a bunch of pornstar loras. I'm saying Flux is so back.
>>
>>102635127
true
>>
Anyone made a local chatbot for prompting? I'm running Mistral-Small-Instruct-2409-IQ3_M and I'm trying to make it as versatile as possible. Struggles with booru tags.
>>
>>102635496
What are you trying to do?
Prompt expansion?
Mistral might not have seen the tags during training.
>>
did comfy break in the last 10 hours while I was asleep and you all fixed it immediately?
>>
nvm, pip broke for some fucking reason
>>
Is it possible to gen with zluda on rx570/580 4gb? How scuffed will the experience be? Is it much better than directml? Thinking about trying fooocus, will it be friendly with vram usage? 4gb is probably atrociously low for ai slop.
>>
>>102636647
quick search on google and browsing the github/reddit links that popped up says you can gen on it. if you can get it working, good luck man.
>>
I'm trying to train a style (artist) lora for SDXL. I've tried some settings from guides, tried some presets made for kohya_ss, also tried to use the same settings from existing working lora metadata.
But the results look suspiciously similar - lora samples are horribly distorted, without any artist-specific features. Loss graphs look similar too, with a quick drop to 0.07-0.06 and then just fluctuating there the whole time (with random spikes).
I just don't understand what the fuck I'm doing wrong...
>>
>>102636027
Just something I can bounce ideas and prompts off. It can be steered pretty well with a starting message, and prompt knowledge can be expanded with SillyTavern's world lore settings. It knows the structure and what words to avoid, but it repeats itself way too much
>>
>>102636928
Did you try it with basic Prodigy settings? Did you use tags? You better make it save every X epochs so you can properly test if it works or not with the actual checkpoint
>>
>>102637339
>but It repeats itself way too much
i think that happens when you got the instruct format wrong. mistral did some really confusing changes to the whitespaces around [INST] [/INST], even i'm not sure if i have the right instruct format or not, try asking >>>/g/lmg/
>>
just came back from pooping, did anything interesting happen while i was gone?
>>
>>102637548
Gemma2 might be better for it
>>
>>102637545
Yes, I tried Prodigy too. It worked pretty much the same.
I use captions, I use WD14 with removal of low level tags like bow, underwear, etc. and also prune artist specific tags.
I already save it every epoch and generate samples every 200 iters.

Here's a note - almost all guides say you don't need 20-50 epoch trains and you can get working lora in about 2000-4000 iters. Right now I'm trying to train on a dataset of 200 images with 2 iterations for 50 epochs to test if it's underfitting.
>>
>>102637685
If you want to train just style you could try training without captions
>>
>>102633627
It's a VAE that outputs tokens, which could be useful for transformer-based models that predict tokens rather than denoising.
>>
desu waiting for the next habbening
>>
>>102637685
Quick test with prodigy: lower the dataset to 50-60 images, 800 steps, batch 2, gradient 2, save every 5 epochs
>>
File: bComfyUI_119104_.jpg (218 KB, 1024x1024)
>>
aCHOO *sniff*
>>
File: ComfyUI_temp_zuxtq_00007_.png (3.18 MB, 1704x1280)
>>
>>102638656
impressive
>>
File: ComfyUI_temp_zuxtq_00016_.png (3.44 MB, 1704x1280)
>>
>>102635048
Anon in cryosleep waiting for Bigma
>>
haven't been here for about a month, any good Flux news? new controlnets, finetunes, anything?
mainly looking for:
>inpaint finetune
>upscaling that actually remains consistent vs. typical tiling
>facial image prompting so i don't have to train loras and curate a dataset for specific people
>>
Someone tell them to stop making giant loras
https://www.reddit.com/r/StableDiffusion/comments/1ftmapd/ultrarealistic_lora_project_flux/
>>
trying to switch to raw sd-scripts training and i've been cleaning up the script i'm working off.

Traceback (most recent call last):
File "F:\sd-scripts\sdxl_train_network.py", line 185, in <module>
trainer.train(args)
File "F:\sd-scripts\train_network.py", line 512, in train
accelerator.print("running training / \u5b66\u7fd2\u958b\u59cb")
File "F:\sd-scripts\venv\lib\site-packages\accelerate\accelerator.py", line 1086, in print
self.state.print(*args, **kwargs)
File "F:\sd-scripts\venv\lib\site-packages\accelerate\state.py", line 970, in print
PartialState().print(*args, **kwargs)
File "F:\sd-scripts\venv\lib\site-packages\accelerate\state.py", line 696, in print
print(*args, **kwargs)
File "C:\Program Files\Python310\lib\encodings\cp1252.py", line 19, in encode
return codecs.charmap_encode(input,self.errors,encoding_table)[0]
UnicodeEncodeError: 'charmap' codec can't encode characters in position 19-22: character maps to <undefined>


i googled it and i guess the file isn't in utf-8 which is what sd-scripts wants, but i feel like just converting the file encoding is a shitty solution, anyone else run into this?
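One workaround (a sketch, not the sd-scripts-sanctioned fix) is to force Python's stdout/stderr to UTF-8 before the prints happen, instead of editing the file:

import sys

# reconfigure the standard streams to UTF-8 (Python 3.7+)
sys.stdout.reconfigure(encoding="utf-8", errors="replace")
sys.stderr.reconfigure(encoding="utf-8", errors="replace")

# or, before launching the script, set PYTHONUTF8=1 (or
# PYTHONIOENCODING=utf-8) in the environment so the Windows
# console stops defaulting to cp1252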
>>
>>102639032
>giant model
>not giantess
>>
>>102639120
actually i just saved it in utf-8 in notepad so i dunno lol
>>
>>102638993
no
>>
>>102639120
The cp1252 tells me there is some Windows involved ;) you could pass an encoding to the open(), but if you didn't write it yourself you might have a less bad time just converting the file tho.
>>
>>102639120
\u5b66\u7fd2\u958b\u59cb
yummy
>>
>>102640129
yeah i did some more looking at there are a bunch of info messages it's trying to print in jp which causes it to bug out for some fuckin reason
>>
>>102640190
because you don't have jap installed, just delete them
>>
How do I use loss masks with sd-scripts? Like, do I put the masks in their own folder, name them the same as the training images, then specify that folder in the dataset config or something? What's the command for it? The readme on kohya just talks about controlnet and I'm too stupid to get what it means.
>>
>>102640208
good idea, the message it was trying to print was "starting training," lol
>>
>>102640208
it shouldn't need to be installed for utf to work, my guess is his string (internally utf-8) gets converted into win-1252 somewhere, which obviously fails.
>>
>>102640390
win10 issue maybe
>>
File: ComfyUI_34194_.png (1.46 MB, 848x1024)
>>
File: ComfyUI_34218_.png (1.31 MB, 848x1024)
>>
>>102640190
does this thing log?
>>
File: ComfyUI_34219_.png (1.09 MB, 848x1024)
>>
>>102640390
it's a python bug because the terminal doesn't support the string; it's not the "code", it's the terminal throwing the error
>>
>>102640419
i didn't see it. the scripts have en messages and jp messages listed for each thing it tries, so i imagine there's something early on that's supposed to detect the system lang and print it in the appropriate one, but didn't, so it defaulted to the creator's lang? anyway, just deleting the jp text like anon suggested fixed it.
>>
>>102640400
>>102640415
>>102640423
good stuff
>>
>>102640465
>so i imagine there's something early on that's supposed to detect the system lang and print it in the appropriate one
Nope, it just prints both
>>
>>102640462
or it's this, i'm no programmer
>>
>>102640399
python 3 has funky ideas about text, you see: internally it's utf, output is shits and giggles if it's windows. Old python would just ignore it and move on; with 3 it throws.
>>
>>102640462
I'd say the bug is in the code; python just did what it was told. If it can't map a character it has to react, and the default is to throw
>>
>>102640612
RETARD IF THE TERMINAL THROWS AN ERROR THAT IS PYTHON'S PROBLEM
kys
I don't know why it's hard for faggots like you
>>
>>102638724
nice
>>
>>102640677
the terminal displays what python throws, yes ;)
>>
When you load the SapianF Flux finetune with fp16 T5, you get bluescreened in your face.
>>
>>102640998
it's time for memtest, bucko
>>
>>102641015
My mem is fine, kthxbai
>>
>>102640998
>SapianF
>The dataset for males now contains 175 images, and the female dataset now consists of 75 images
>>
>>102641096
Finally some decent dicks.
>>
>>102638287
Tried this, 50 images with captions, batch 2, grad accumulation 2, 800 iters total.
Result is pretty much the same...
>>
File: 00564-920720096.jpg (1.01 MB, 1620x2160)
>>102641393
If settings are good then it has to be the dataset. Prodigy overfits if anything
>>
>>102641827
too flat
>>
File: file.png (519 KB, 638x803)
it really is crazy watching an AI learn how to make colors, there really should be a study about how it learns compared to biology, because it always starts with blobs and darks and lights and slowly things get coherent, color gets added, details, etc
>>
>>102636928
>lora samples are horribly distorted

Too much or too little variation in the dataset, too high learning rate, too high output resolution, and too high or too low network dimension can cause this problem.

My first suggestion is to add noise or blurriness to your images. There's a setting to add noise called "Noise offset" but I usually do it manually.
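If you do it manually, a sketch of what that could look like (hypothetical paths, sigma is something you'd tune; as I understand it, the trainer's built-in noise offset perturbs the latent noise during training, whereas this perturbs the source pixels):

import numpy as np
from PIL import Image

def add_noise(src, dst, sigma=8.0):
    # add mild gaussian noise to a training image before training on it
    img = np.asarray(Image.open(src).convert("RGB"), dtype=np.float32)
    noisy = img + np.random.normal(0.0, sigma, img.shape)
    Image.fromarray(np.clip(noisy, 0, 255).astype(np.uint8)).save(dst)

# add_noise("dataset/0001.png", "dataset_noisy/0001.png")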
>>
>>102642001
>Too much or too little variation in the dataset
Is there any specific requirements for style datasets?
>>
File: 00007-736784397.png (265 KB, 512x512)
Illustrious is pretty bad so far. Doesn't seem to even compare to Pony V6
>>
>>102641953
That is pretty interesting. PixArt?

>>102642092
512x512 might be too low resolution for it
>>
>>102640218
in case anyone else needs this in the future, /hdg/ gave me: https://rentry.org/d2ckzxmq
>>
>>102642225
That's the VQ-VAE learning image reconstruction
>>
>>102642063
No, as far as I know. You have to figure it out yourself.
>>
>>102642416
there is no checkpoint, just vae alone is doing that?
>>
>>102642542
Yeah, it's a from-scratch VQ-VAE being trained on raw images, but really all it is is a neural network compression system. It learns how to compress an image down into tokens and how to turn those tokens back into an image. Give those tokens to a transformers network and you do next-token prediction conditioned on caption tokens, and now you have a text-to-image model. Also, since you're working with image tokens, you can do interesting image-to-image stuff, because we're just working with tokens and trying to predict with tokens. So you could give it something like <img_tokens> "Change their shirt blue" <txt_tokens> as a prompt and it would be able to do it.

One of the hard parts is getting the VAE trained though and apparently VQ-VAEs can be temperamental and are actually quite old. I'm starting with 20m parameters with cross and multi-head attention on 256px patches and seeing how it goes because it needs to more or less produce 99% accurate reconstructions.
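For the curious, the VQ bottleneck itself is small; a minimal sketch with the straight-through estimator (hypothetical sizes; the real model described above adds the encoder/decoder, attention, and the codebook/EMA loss terms):

import torch
import torch.nn.functional as F

class VectorQuantizer(torch.nn.Module):
    def __init__(self, n_codes=8192, dim=256):
        super().__init__()
        self.codebook = torch.nn.Embedding(n_codes, dim)

    def forward(self, z):  # z: (B, N, dim) encoder output
        flat = z.reshape(-1, z.size(-1))                   # (B*N, dim)
        idx = torch.cdist(flat, self.codebook.weight).argmin(-1)
        idx = idx.view(z.shape[:-1])                       # (B, N) token ids
        q = self.codebook(idx)                             # quantized vectors
        commit = F.mse_loss(z, q.detach())                 # commitment loss
        q = z + (q - z).detach()                           # straight-through grad
        return q, idx, commit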
>>
>>102630791
Kirsten Dunst in "Interview With The Vampire"
>>
>>102642675
So it's the same tech that's being used with this https://github.com/buaacyw/MeshAnything ? Cool stuff, can't wrap my head around it.
>>
>>102638724
this is nuts. Flux?
>>
>>102642840
Ultimately it's just turning things into matrices and running some predefined formulas, it's really just really temperamental cooking.
>>
Pulid on Flux seems to be working on ComfyUi now, let's test that out:
https://github.com/balazik/ComfyUI-PuLID-Flux
>>
>>102643100
how's the testing going?
>>
File: file.png (712 KB, 3840x1739)
>>102643204
not good lol, I'll try the regular workflow he provided to see if it works his way
>>
File: file.png (1.8 MB, 3388x1563)
>>102643204
>>102643100
Ok ok, it's working, here's my workflow for those interested
https://files.catbox.moe/raseau.png
>>
>>102643328
>https://files.catbox.moe/raseau.png
TY anon
>>
File: file.png (1.91 MB, 1024x1024)
>>102643100
I'm not gonna lie that's pretty good, too bad it doesn't seem to work with cfg > 1 on that node
>>
>>102643100
>V0.1.0: Working node with weight, start_at, end_et support (attn_mask not working)
>attn_mask not working
What does that mean?
>>
>>102643490
>doesn't seem to work with cfg > 1
DOA
>>
>>102643581
it's supposed to be working at CFG > 1; I guess this node is just a proof of concept, he'll improve on that for sure
>>
File: file.png (1.5 MB, 1024x1024)
>>102643100
kek, for a setup that has an early version of PuLID (we'll get a new version in October), without mask attention and with only CFG = 1, that's not that bad
>>
Are there any good UIs that can talk to a backend API and don't install CUDA etc? I'm using AMD on Windows (yes, I know...)
I've got great performance using stable-diffusion.cpp and ROCm, but it looks like every UI out there is like "And now install pytorch" which I can't do...
Surely someone has made something that just calls an API endpoint instead of needing to run everything locally?
Worst case I can implement the StableHorde API and pretend to be a horde of 1, but that looks painful compared to running a /txt2img endpoint.

Btw I tried DirectML and it's dog slow compared to ROCm
>>
File: file.png (1.58 MB, 1024x1024)
>>102643490
>>
>>102643708
have you tried a web browser?
>>
>>102643708
The reason we have these problems is that none of the programmers own AMD cards, and they barely care about them. AMD has very minor market share.

AMD cards are vastly more powerful than nvidia ones, however.
>>
I only need 3 gaming computers.

1. for Flux, genning all the time
2. llm for someone to talk to about my day and stuff
3. for gaming while I think of something to say
>>
File: file.png (1.14 MB, 1024x1024)
>>102643733
it also works fine with loras, for example I went for Wind Waker Lora here
>>
File: file.png (951 KB, 1024x1024)
>>102643809
>>
File: file.png (2.52 MB, 1024x1024)
>>102643877
>>
>>102643737
My AMD GPU is in my desktop gaming PC which is running windows. I don't see how a web browser helps here.
>Put your gpu in another computer
>Dual boot linux
No.

>>102643753
In my experience my 7900XTX underperforms a 4090 but at a much better price point - I'm getting ~1.10 it/s on SDXL, for reference. But having a "cheap" 24GB card lets me just about run SDXL and a quantised 8b LLM model at the same time which is pretty cool.

But I'm not really complaining "Wah AMD no worky" - AMD works fine for the actual image generation, it's just frustrating that all the UIs seem to want to bundle the entire stable diffusion ecosystem and CUDA into their backend instead of being able to use an API endpoint running somewhere else (i.e. my own jury-rigged setup)

Anyone know of anything or do I have to build it myself / rip out the internals of A1111 and replace with my own implementation? I'm lazy so I'd rather not
>>
File: 0.jpg (206 KB, 1024x1024)
>>
File: file.png (2.15 MB, 1024x1024)
>>102643907
>>
>>102643544
have faith in man from china
>>
File: file.png (2.03 MB, 1024x1024)
>>102643947
when using PuLID it kinda destroys the text abilities though, can't even write "OWN" correctly kek
>>
>>102643914
>In my experience my 7900XTX underperforms a 4090
Yeah, but it's going through a translation layer. Basically, the xtx is emulating the 4090, which is crazy.
>>
>>102644075
It's not emulating it, I think? HIP/ROCm just provide a very similar (but legally different) API to CUDA, see src/ggml-cuda/vendors/hip.h in the ggml source code (used by stable-diffusion.cpp). So it is running natively, but I think the compute kernels just aren't as optimised as the ones in CUDA and the 7900XTX doesn't have as many cores as a 4090 either.
>>
>>102643914
>I just want an API (for a hosted AI)
That's what web browsers are for.
>>
>>102644122
My understanding is some vram is used by rocm to translate what it can't do natively. Was I wrong?
>>
>>102644042
Just wait for the 5090 and Titan AI releases and you're going to see a lot small but excellent 1B models.
>>
>>102644195
I'll tell biden-harris to ban the 5090 because it's used by sexists to emulate abuse of women.

:^)
>>
>>102644206
The powers that be like AI given Nancy Pelosi was for blocking the anti-AI bill in CA. So I think I'll be fine.
>>
>>102644122
HIP is the runtime that actually does the compute, ROCm is the stack. Nvidia calls everything CUDA which is the difference. Intel has a similar naming philosophy to AMD, they call their stack oneAPI but their actual runtime is using SYCL or their HIP equivalent which is Level-Zero. And yes, you are spot on.
>>
DON'T vote for Trump - vote for ME - I will hold public lynchings of furries before every football game.
>>
>>102644214
:^)

she has no idea what any of this is. kammy is a Swiftie
>>
>>102644236
Kammy will do what the powers that be want, you fucking retard. AI is cheap labor so it will be allowed; in case you missed the memo, the Uniparty doesn't care about anything except grifting the populace.
>>
>>102644178
That's not what I said. I *have* an API that wraps stable-diffusion.cpp and provides the /txt2img API endpoint of A1111 - What I am looking for is a nice piece of software that can use that API, that I can use to play around with various parameters and models, without having to keep track of everything myself and calling the API manually/via a script.

Yeah, I'd love to use a web browser to connect to a web-based UI that uses my API as its backend, but as far as I can tell no such UI exists. At least, that's the question I'm asking - Does such a UI exist?

I'm currently using SillyTavern for this because it is *somewhat* serviceable, but the limitations (1 image at a time, doesn't record the seed or the full prompts) are beginning to be a pain.
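For what it's worth, a sketch of a tiny client against an A1111-style /sdapi/v1/txt2img endpoint that records seeds; the payload/response fields follow A1111's API, so your wrapper may differ:

import base64, json, requests

def txt2img(prompt, url="http://127.0.0.1:7860"):
    payload = {"prompt": prompt, "steps": 20, "seed": -1,
               "width": 1024, "height": 1024}
    r = requests.post(url + "/sdapi/v1/txt2img", json=payload, timeout=600)
    r.raise_for_status()
    data = r.json()
    info = json.loads(data["info"])  # A1111 puts the seed/params in here
    for i, b64 in enumerate(data["images"]):
        with open("gen_%s_%d.png" % (info.get("seed", "x"), i), "wb") as f:
            f.write(base64.b64decode(b64))

# txt2img("a cozy cabin in the woods, 35mm photo")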
>>
File: 0.jpg (143 KB, 1024x1024)
>>
File: file.png (1.84 MB, 1024x1024)
>>102643490
kek
>>
>>102640415
Nice
>>
File: bComfyUI_124109_.jpg (703 KB, 1440x1080)
>>
>>102644220
I don't think so, I think that amd has to translate lots of commands, because of how the code works.
>>
>>102644254
heheheheheheheheheh

she's gonna ban profits dude
>>
>>102623700
period only 3 months out of the year? hmmm
>>
File: linux003.png (292 KB, 2640x1036)
>>102645099
It's quite easy to see what their stack is doing, the code is open source and they have architecture diagrams. Pic related. There is no translation involved if you aren't using CUDA/HIP translation. If you are, then yes, but only because you aren't running the ML code optimally.
>>
>>102645404
Is hip too confusing without rocm help?
>>
>>102645404
It's not, it's just being used as a qualifier here. As I said in >>102644220, ROCm is just being used as a way to indicate that the package belongs there; but unlike with CUDA, where the runtime and stack aren't really differentiated, here they are.
>>
File: file.png (2.21 MB, 1024x1024)
>>
Has anyone benchmarked rocm on nvidia hardware for a comparison?
>>
>>102645568
do one with the penis arm lora
>>
>>102645621
kek, go for it anon, everyone has access to this thing
>>
If you trained schnell on https://huggingface.co/datasets/nyanko7/danbooru2023 it would basically be ponyxl right?
>>
>>102645652
it would be way better, schnell is a way better base model than SDXL
>>
File: file.png (2.31 MB, 1024x1024)
>>102644456
>>
File: file.png (1.86 MB, 1024x1024)
>>102643100
it works better if you go for the celebrity's name instead of going for "A man" or "A woman" because flux knows them a bit so it helps
>A man juggles a virus with his tennis racket, wears a hat that says “Novax”, and wears a t-shirt that says “DjoCovid”.
>>
https://youtu.be/cPVGs0_fu1U?t=213
this is insane, the chinks are getting too stronk
>>
>>102645729
>examples

lmao sucke
>>
>>102645729
>closed source
>>
>>102645729
>stronk
Go back from which you came
>>
>>102645729
>>
File: 0.jpg (236 KB, 1024x1024)
>>
File: file.png (1.87 MB, 1024x1024)
>>102643100
bruh
>>
>>102645877
Is this the behind the scenes from Tom and Jerry banned episodes?
>>
>>102645898
more like behind the scenes from Diddy's party kek
https://www.youtube.com/watch?v=i2CMpVf_e4o
>>
File: file.png (1.9 MB, 1024x1024)
>>102645877
>>
File: file.png (1.2 MB, 1024x1024)
>>102643100
Now that we can do Celebrities in an instant, Flux is fun again, yayyy
>>
File: file.png (1.56 MB, 1024x1024)
>>102643100
kek
>>
File: file.png (1.93 MB, 1024x1024)
>>102645987
>>
File: file.png (1.19 MB, 1024x1024)
>>102645946
>>
https://xcancel.com/LumaLabsAI/status/1840820602296320083
>Welcome to the era of Hyperfast video generation: with 10x faster inference, you can now generate full-quality Dream Machine v1.6 clips in under 20 seconds. No "turbo" or "distilled" models - just uncompromised quality.
How did they do that? If we had this secret sauce we wouldn't have to wait minutes for CogVideoX kek
>>
Next Bred:

>>102646216
>>102646216
>>102646216


