He Thinks He Knows Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106715652

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>I have to edit a yaml file to add a model route
>In a way it's not specified.
>Comfy.exe doesn't let you select a custom drive to install
This is shit THIS IS SHIT THIS IS SHIIIIIT
Context Window edition
>>106719283hahaha, yeah...
>>106719264
yeah i'm using a few loras,
https://civitai.com/models/1648982/wan-nsfw-posing-nude
https://civitai.com/models/1343431/bouncing-boobs-wan-14b
and my lightning is set to 1, maybe it's a lora strength imbalance? i'm on wan 2.1 720p 6K. maybe it's also a promptlet issue.. well, just knowing it's a model censorship issue brings me some kind of relief that i'm mostly doing this right.
the man in image1 is wearing the outfit of the man on the right in image2.
then I added a hat that it missed.
>goose ball run
>>106719283API nodes don't have this problem
>comfyui >kijainodes
>>106719303it has no care for the style of the original image, like we're not asking for much, just for the outfit to not be anime when applying to a realistic character lol
>>106719257
Test and tell us back anon, I suspect that context length = total length is basically because of the dual sampler thing going on with 2.2.
After all, how would the node even work if the rolling context is instantiated in both high and low noise?
>the memory leaks are fixed guys!
>infinitely worse leaking and crashing issues than before
>>106719198Yeah and this doesn't seem to work with their default i2v workflow, it just errors.
>>106719311*sniff*
>>106719319well it's a different sampler which gets those latents. i'm wildly assuming it will just work fine. but yeah i'll post results when done
>>106719327It's broken then, maybe worth testing in t2v.
do i look like i know what a context window is
>>106719324wait, this might be why anon is having issues with wan. fucking cumfart, I swear he doesn't fucking test anything
>>106719324
don't use my misery for your spamming faggotry, you schizo
>>106719286
>t2v only
>>106719316you have to do it in two steps, covert the style then blend the images
>nigbo
should just take the API node pill. no issues whatsoever
>>106719324actually fixed it for me, I don't use those lora power loaders though
>>106719336
Yeah but how does it make sense?
You have 32 frames total, and a context of 16 (let's ignore overlap)
So it means the first sampler will:
- gen 16
- gen the next 16
Then it sends 32 frames to the second sampler?
How can it only send 16?
I hate what this general has become because of this shit
>>106719377why would it send anything? it's an entirely new process, like a refiner.
>>106719387The solution is to remove the SaaShit from the OP
>>106719389
For me the process is linear: you either send all the latents of the 32 frames to the next sampler, or it doesn't make sense node-wise.
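fwiw the window scheduling itself is easy to sketch. Hypothetical helper below (names are illustrative, not the actual node's API): it just slices a frame range into overlapping windows. If the high- and low-noise samplers each run the same schedule over the same full-length latent, then all 32 frames' latents do get passed between them, which would square with "context length = total length" behaving differently.

```python
# Hypothetical sketch of rolling-context window scheduling; not the real
# node's code, just the indexing logic being discussed.
def context_windows(total_frames: int, context: int, overlap: int):
    """Return (start, end) pairs covering total_frames with the given overlap."""
    assert overlap < context, "overlap must be smaller than the context size"
    stride = context - overlap
    windows = []
    start = 0
    while start < total_frames:
        end = min(start + context, total_frames)
        windows.append((start, end))
        if end == total_frames:
            break
        start += stride
    return windows

# 32 frames, context 16, no overlap -> two disjoint windows
print(context_windows(32, 16, 0))  # [(0, 16), (16, 32)]
# with overlap 4 the windows share frames at the seams
print(context_windows(32, 16, 4))  # [(0, 16), (12, 28), (24, 32)]
```

each sampler would sweep these windows over its own copy of the full latent, so nothing smaller than the full 32-frame latent ever moves between the two.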
>>106719403the solution is to use API nodes silly
goddamn, so many filtered messages already. eu times are so much better.
anyway thanks to the anon for posting that context window link earlier, i really was a fucking retard with the settings i had. might finally get longer gens working without breaking into mashed potatoes.
>>106719387karma
>>106719414No problem anon, but apparently it's broken for i2v, and maybe this explains why it doesn't have much chatter online.
this is already looking far better than the absolute rot i had it set to
>should just take the API node pill. no issues whatsoever
>>106719451your average 1girl genner
>>106719451plappable
>>106719311Where's Lara Croft?
Hunyuan 3.0 80b predictions
>NOOOO WHAT IS THIS?? ITS TOO BIG! I BOUGHT 2 5090s AND STILL CANT RUN IT
>UGH WHEN WILL THESE CHINKS LEARN TO OPTIMIZE
>IT CANT EVEN FOLLOW THE PROMPT RIGHT. IM RUNNING 1BIT_Q1_LIGHTNING QUANT WHICH SHOULD BE INDISTINGUISHABLE FROM FP16
>8 MINUTES PER IMAGE BUT THATS FINE, I DONT MIND WAITING AT ALLL HEHHHHHHH
meanwhile
>Hunyuan 3 great, better prompt following than GPT-5! I'm running it through ComfyCloud right now perfectly
>Crazy how good this model is with API Nodes, I get a 2k image back in 8 seconds
>Diffusing Hunyuan 3.0 locally with ComfyUI API Nodes, fully uncensored the greatest local model of all time!
>>106719273
I tried 49 frames, 720x1248, 8 block swap. Ran out of VRAM quick.
Tried again with 16 block swap - RAM was maxxed out and VRAM hit 95%, but it did complete. Took about 100 seconds longer than a 480p video with 101 frames, so not worth it unless I can get the RAM/VRAM down, assuming that's what's bottlenecking and that the render time isn't just the expected difference between the two.
>your average 1girl genner
>106719515ranfaggot looks like this
>>106719503I feel sad for understanding that reference
I wish I was gamer enough to game.
>>106719515now do a miku genner
Crazy how underrated Krea is
>>106719511
qwen image is 20b and gens (with lightning) in ~4 seconds for me. 80b is so fat, the idea of stacking 4x qwen image is insane to me. let's not even address how qwen does not remotely achieve the quality of even FLUX, which is what, 12b?
anyway, the size isn't important so much as capability. if you've been paying attention to the output gens (the bears + mooncake junk slop they genned), it's pretty nifty how each elemental bear has that elemental text character on it and an associated mooncake. in other words: the model is smarter. 80b smarter? fuck no. it's probably gonna need a ton of training to be remotely viable, and to me, that's where the pain is: can we train this in any reasonable way or is it gonna be, effectively, just a closed box model due to sheer girth?
further to this, it's just no longer viable for raw t2i these days. we need image models to do i2i, editing, multiple reference images, style/subject extraction/separation, etc.
this thing better bang.
>>106719294
https://civitai.com/models/1602000/innie-pussy
>>106719434Is that t2v?
>>106719568yes
>>106719571And did it work?
>>106719543
>I feel sad for understanding that reference
You mean, you just know.
>>106719548
>>106719590Classic Lara is still the best
>>106719597miku anon is looking rough
>>106719584yes and no, there is very obvious ghosting where the context ends so i'm trying more settings
>>106719609U're beign y'allsphobic
>>106719285
Surely, there must be a way to set the length to something wild like 500 frames without OOMing. Gotta be some kind of node that optimizes the memory (rather than completely offloading). So instead of it taking, let's say, 10 minutes to generate, it will take 15 or 20 because it optimizes the memory.
>>106719561
>wan2.2 loras
anyway, i figured out what my problem was: while shift 8 is recommended for 720p, i set my shift down to 4.5 and magically everything is working as intended. i'm not even gonna question this logic that defies the clear instructions, just gonna roll with it and set a queue of 8 and hope cumfartyUI decides to let me roll this one.
>>106719511
It should be able to be run locally at Q8 with offloading to the CPU. Just prepare for 30 minute generation times kek.
>>106719625he represents the best of /ldg/
>>106719642
Cont...
I found these, does anyone know if these would help in generating longer vids?
>>106719618I mean if it works it's great, even with the ghosting, which hopefully is just a parameter issue.
>>106719642
>>106719674
2 methods:
- using the last image of the preceding gen to make the next x frames of the next gen. Issues:
1- it has no idea of the motion going on
2- deterioration of the image over time because of the vae
- using a rolling context window to generate long videos. Issues:
1- it doesn't work with i2v for now
2- no possibility to change the prompt between videos
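the first method (last frame as next start image) is just a loop, sketched below as toy code. `gen_clip` is a stand-in for an i2v call, not a real API; real pipelines decode/re-encode through the vae at every seam, which is exactly where the cumulative quality loss comes from.

```python
# Toy sketch of "last frame as the next gen's start image" chaining.
# Frames are plain numbers here; gen_clip stands in for an i2v call that
# continues from a given start frame. No real model or vae involved.
def gen_clip(start_frame, length):
    return [start_frame + i for i in range(length)]

def chain_clips(first_frame, clip_len, n_clips):
    frames = gen_clip(first_frame, clip_len)
    for _ in range(n_clips - 1):
        nxt = gen_clip(frames[-1], clip_len)
        frames += nxt[1:]  # drop the duplicated seam frame when stitching
    return frames

print(len(chain_clips(0, 81, 3)))  # 241 (81 + 80 + 80)
```

note the motion problem from issue 1 above: `gen_clip` only ever sees one frame, so velocity/direction at the seam is lost no matter how clean the stitch is.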
>>106719643what exactly does shift do?
>>106719549iykyk
>>106719732
I love it, but it's blurry
>>106719714increases the urge to subscribe to API nodes
>>106719753please answer if you know. else take the trolling attempt back to sdg
>>106719714
decent blog on it that explains it better than i can: https://replicate.com/blog/wan-21-parameter-sweep
as i said, higher shift is usually recommended when you gen at 720p's recommended res, but in the case of using a lightning lora at 4 steps you want it pretty low so it doesn't go overboard/fry. in my case, inflated tumor-looking pussies and heads rolling 360 degrees.
(also comfy failed to do a 4 gen queue.)
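for the curious, the shift knob is just a remap of the noise schedule. A common SD3-style form (which, assuming wan's samplers follow the same convention, is what the UI exposes) is t' = s*t / (1 + (s-1)*t):

```python
# SD3-style timestep/sigma shift: higher s pushes the schedule toward high
# noise, so more of the step budget goes to composition. At 4 lightning steps
# that's easy to overdo - hence dropping shift from 8 to ~4.5 helping here.
def shift_sigma(t: float, s: float) -> float:
    return s * t / (1 + (s - 1) * t)

for s in (1.0, 4.5, 8.0):
    print(s, round(shift_sigma(0.5, s), 3))
```

shift=1 is the identity; at the schedule midpoint, shift 8 has already pushed the effective noise level near 0.9.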
Forge doesn't have this problem btw
>>106719549
>Crazy how underrated Krea is
It's all right, but so far, only Seedream is capable of making the photo look like it wasn't taken in a professional studio or something like that.
>>106719713
Actually, the sloppy last frame technique could work, considering we now have the context nodes. Did a lot of that with 2.1, wasn't great, but with these nodes there has to be a way to feed the context alongside the last frame.
Was there a way to change the order of these?
>>106719779thank you!
>>106719801
>feed the context alongside the last frame
The whole idea is to only work in latent space; if you use the output frame, you go through a vae, and you deteriorate the result.
>only Seedream is capable of making the photo look like it wasn't taken in a professional studio or something like that.
Alright that one made me kek
>>106719750the grain can get kinda out of hand sometimes but i'll take it
>>106719871
I've never gotten gens looking this sharp and good
So glad I saved my ComfyCloud API tokens for Hunyuan 3. Going to be fun mogging openkeks who cant even fit it on a 5090
>>106719952ranfag
>>106719872
https://pastebin.com/JgZEs7QQ
here's an oversimplified version of my workflow
>>106719795
Nano Banana looks closer to high-quality CGI than a photo here (also Nano Banana doesn't even seem to be as good as Imagen 4 Ultra for straight text-to-image in the first place). Both Seedream and Krea look precisely like professional studio photography though IMO, just with different lighting approaches.
Either way, at the very least there's not really any good reason to use regular Flux Dev anymore when Flux Krea exists; Krea is just a lot better, not only for realism but also prompt-adherence wise and as far as understanding of certain stylistic concepts.
>>106720012the buildup and payoff here is fucking crazy man. are you using that extended context thingamajigger people were talking about last thread?
>>106719872tip: never ever ever ever ever ever ever LOWER the guidance with Flux Krea like people insisted on doing with regular Flux, it will do nothing but reduce color range, detail, and coherency. Instead RAISING it to around 4.5 instead of 3.5 gives the best results with Krea I find in terms of detail and coherency, without having any negative impact on realism or anything like that. This is with Euler Beta, that's generally what I use (unless I'm using one of the RES4LYF custom samplers).
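for anyone unsure what that number actually touches: guidance is just the classifier-free guidance blend applied at every step. A minimal sketch (plain Python, illustrative values, treating predictions as flat lists):

```python
# Classifier-free guidance: extrapolate from the unconditional prediction
# toward the conditional one. Raising the scale (3.5 -> 4.5) pushes further
# along that direction each step; nothing about the model itself changes.
def cfg_combine(uncond, cond, scale):
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]

uncond = [0.0, 0.2]
cond = [1.0, 0.6]
print([round(x, 3) for x in cfg_combine(uncond, cond, 3.5)])  # [3.5, 1.6]
```

which is also why going too low washes things out: at scale 1.0 you get the conditional prediction with no extrapolation at all, and below that you start blending back toward the unconditional one.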
>>106720012
10 sec?! Teach me, Master!
>>106719795
Want to test the intelligence of SaaS vs. Local models?
Character reclining in the armchair, his back leaning against one armrest while both legs rest on the other armrest.
Did you know that wan2.2 is better than wan2.1?
>>106719872the fuq is up with her fingers?
>>106720083
>>106719991
if anyone actually uses this, i'd really like to see results before i share it elsewhere
>>106720127
no hand refiner in place, since i never look at them full res anyway
>>106720129it looks good but that's a really big sofa
>>106720148for you
>>106720178I'm not a manlet so my sofa can't be big, sorry you are one of those
>>106720148>that's a really big sofa...what?
>>106720072it's probably just WAN 2.5 straight output
>>106720263gen made in Chroma btw
>>106720263So that's the power of a 80b model...
>>106719267Troon posting is no different from shitting up the board with scat. It should be a bannable offense.
>>106720291>t. troon
>>106719691
been genning with more settings, still get ghosting but it's getting better. i'm still a tard so i don't know which fuse method is best for these settings though. closed loop is off because if the last context window is quite different it just blends into the new context instead of smoothly transitioning.
>>106720263That's you in the middle, isn't it?
>>106701482nice style, catbox?
did a five-way comparison with each model at its recommended default resolution instead of trying to artificially match them.
Prompt (which I think is a pretty legitimately tricky one to get fully correct) was:
```
A dusty, forgotten attic workshop in the late 1970s, captured on grainy, high-ISO Kodachrome film with a slight, warm color shift and subtle light leaks in the upper right corner. The main light source is a single, low-hanging bare bulb just out of frame to the left, casting long, soft shadows. In the center of a cluttered wooden workbench sits a bizarre, handmade device: a series of five nested brass rings, each inscribed with intricate, non-terrestrial constellations, levitating around a central, softly glowing, milky quartz sphere. To the immediate left of this device is an open, leather-bound journal, its pages filled with frantic, handwritten cursive ink notes and complex geometric diagrams. The right-hand page of the journal, facing the viewer, must clearly and legibly display the handwritten text: "The resonance is not a frequency, but a location. It remembers the space it used to occupy." In the near foreground, slightly out of focus due to a shallow depth of field, rests a soldering iron with a wisp of cold smoke rising from its tip. In the background, hanging on a pegboard, various well-worn tools are silhouetted against the dusty light.
```
Only key takeaways are probably that HiDream is objectively the worst (extreme JPEG artifacting in the native output, total failure to output text at all) and also that none of the images really did an accurate job of capturing what a Kodachrome photo from the 70s actually looks like.
>>106720372i was fucking with you on the closed loop lol, i wouldn't bother with anything that has looped in the name with what you're trying to do
>>106720240so sad if true
Anyone that's run this wan node, how the hell does the temporal mask work?
>>106720372I think I'll just wait for a version that gets support for i2v anyway, and more people trying stuff with it.
Wan 2.5 will never be open source btw. The whole be nice and we might release it thing is just a way to smooth over the discourse as they transition to full SaaS. Anyone saying otherwise should be laughed at for the clowns they are.
>>106720489
I mean that's one of the major improvements in 2.5: proper support for true 1080p and up to 10 seconds at all resolutions (480, 720, or 1080)
>>106720510
IDK man, if you run the model through TensorArt at least (assuming you have an old enough account there to not be impacted by their current NSFW crackdown as far as what the generator allows promptwise), you'll see that it's as uncensored as 2.2 and 2.1 were. It's also not THAT good in the grand scheme of things: the coherency and retention of likeness and so on are great, but stuff like the lipsyncing is nowhere close to Veo 3, and the prompt adherence seems noticeably worse than Kling's new 2.5 Pro Turbo, so I don't think Alibaba really have quite a product yet where going full SaaS right now would pay off.
>>106720546
>Kling's new 2.5 Pro Turbo
They do have great video models, but they will never share them, nor allow any nsfw.
the man with the blue shirt and black jacket in image1 is wearing the outfit of the anime character in image2.
then
give the man wearing a purple shirt in image1 a brown cowboy hat. keep his expression the same.
>>106720559
I mean that's not the point though, my observation was about that model's overall prompt adherence versus WAN 2.5's
I installed Forge Neo but it made my waifus butt smaller compared to Forge. What the fuck is this???????
>>106720274were you using like Euler Simple and no negative whatsoever or something?
>>106720012Fuck yeah!!
the man wearing the blue shirt and beige pants in image1 is wearing the outfit of the man in image2.
then
add a blue hat like the man in image2 to the man in the center in image1.
Is qwen image edit 2509 censored?
Flux Krea recreation of an old modded Skyrim screenshot of mine with a prompt from Gemini, composition came out pretty close to the original
>>106719590
almost topcow vibes :3
>>106719515
>>106719597
do we HAVE to post selfies here? this is the ai thread (local only)
>>106719871
>>106719991
love the hair <3
>>106720638
yes, if it detects the word "breast" in your prompt it IMMEDIATELY phones home directly to Xi Jinping, who then promptly arranges for your assassination.
(actual answer: not any more than original Qwen Edit AFAIK)
>>106720638
Yes, but at least for nudes there is a lora that helps. For anything suggestive or sexual, it will need loras to understand them.
>>106720662
It doesn't seem like it is, but I feel it is stupid sometimes and you have to reiterate the prompt
>>106720631the man in the blue shirt is riding a brown horse that is galloping towards the camera. keep his appearance the same.
I swear it's the same fucking guy every time asking if X model is censored. Then his next question is always "can it do pissing?"
>>106720510
nta
They eventually do, when the next version is ready.
Look, we are not even close to 30 sec of continuous action scripted in the prompt. Once this is achieved, dear lord...
>>106720695
>nta
There was no need for that. I wasn't replying to anyone.
>>106720695
>Look, we are not even close to 30 sec of continuous action scripted in the prompt. Once this is achieved, dear lord...
Work needs to be done on the node for that, and it seems no one is interested.
>>106720692
it's more than one guy, and they're probably the same guys who repeatedly claim that CivitAI "banned NSFW" when they did not actually do anything more than crack down on nonconsensual deepfakes and extremely niche controversial fetishes (and only once their hand was forced by payment processors).
two edits but it still works. one for the hand up, then the action.
the man in the blue shirt is holding his left arm in the air, the plane behind him is engulfed in flames, and fire scorches the skies. a pillar of fire from the sky hits the plane. At the top of the image is a blue rectangular textbox the width of the image, with "FIRAGA" in white text in the center. keep his appearance the same.
>>106720662>IMMEDIATAELY phones home directly to Xi JinpingThat phone is going to be busy
>>106720755the man in the blue shirt is holding his left arm in the air, the plane behind him is frozen solid in ice, and a blizzard of snow is in the sky. the plane is encased in thick ice. The ground is covered in snow and ice. At the top of the image is a blue rectangular textbox the width of the image, with "Blizzaga" in white text in the center. keep his appearance the same.
Can I fart on a turtle with SD3
I don't know what to generate.
I fucking knew it was fake.
>>106720873
!!!
>>106720012 Hint Hint!!
>>106720881are you a moron? all movies are fake
the man is holding a vanilla ice cream cone in an ice cream shop like Baskin Robbins. A neon sign in the back of the shop says "BIG GUYS" in neon lighting. Many flavors of ice cream are visible on display. keep his expression the same.
a big scoop, for you
>>106720012Impressive face consistency throughout the video
howdy lads, VRAMlet here.
Has anyone heard of this new higgsfield video model?
Idk the details yet, but if/when it gets local support we should run some tests.
>>106720873
cute anime scenes? ;3
steal some homework! >>106720951
>we live in the timeline of skin-tight workout shorts+lace chones
there is a lot of complaining going on, but realistically women have never looked more adorable at any point in human history
>REQUESTIN CATB0X SIR
>>106720012
>10sec video
I WANT IT AHHHHHH
>>106721009
theory:
>create 5 second video
>use endframe of 5.3 second video as starting frame for another 5 second video
>stitch together
neat :D
>still no pony v7