Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107491813

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>best way I can describe using comfyui moving forward is like sitting on a 12" dildo leaving it in and saying "fuck it, I'm gay now" instead of pulling it out with some dignity and saying "what's the next steps"
how do I make nsfw anime videos?
>>107495529
own a GPU with at least 24gb

>>107495526
wise words

>>107495537
not if you have ram
>>107495526
gay

>>107495529
most likely with wan t2v or i2v.

>comfy let's me customize everything about what I want
>1girl slopgen no upscale
DRUGGED AND SHAT ON IN THE STREETS
how do i make nsfw videos? what software do i need? i have a 3090. i dual boot linux so i would want to generate everything on linux.
How hard is it to implement a LLM prompt model that can differentiate characters and do proper composition/format placement of characters?
Why is this such a hurdle?
>>107495554
>gay
says the guy with a 12" dildo in his ass

>>107495526
ok what are the next steps

>>107495566
>>comfy let's me customize everything about what I want

>>107495589
dunno what you mean by proper but models DID get more powerful at placement and stuff. qwen/wan or even hyimage3/flux.2 are much better at it than sd1.4 used to be.

>>107495589
Too poor to run a model that's smart enough to do that. Won't get any support from me.

>>107495556
thanks. is this what the pixiv sloppers use?

>>107495621
Maybe? There are other models too and some might use SaaS models. IIRC some also still animate a series of images by hand like one of those motion picture books; they don't even use a regular video or animated image format for it but that special pixiv ugoira archive.
>>107495529
comfyui + wan2.2 + nsfw anime loras

comfyui example workflow: https://docs.comfy.org/tutorials/video/wan/wan2_2
download loras: https://civitai.com/models

wan2.2 I2V (Image to Video) is what most use. It means you use a reference image, combine it with loras that give NSFW motion, and it animates the image.
>>107495649
>>107495672
thanks. gonna put my new gpu to good use

>>107495566
>comfy let's me customize everything about what I want
Yes. Recently finished my SeedVR batch upscaler. Takes a folder full of videos or images and upscales them. You can switch between processing images or videos. Optional post processing (film grain) applied too. All videos are saved in organized output folders with the original name + seedvr attached. If ComfyOOM's, I can resume right where I left off since it keeps track of my batches. It's a very flexible workflow that handles all edge cases.
Now tell me, could I do this with neoForge, Wan2GP or SwarmUI? It'd be a pain in the ass I'd imagine. People that actually build usable pipelines thrive with ComfyUI™.

>>107495767
I just use chainner since it doesn't make blurry upscales
am I overly paranoid if I refuse to use custom comfy nodes?
>>107495767
color autism
Guys I just pulled and now the dancing fennec girl in the corner is gone. Wtf, why did he remove it?
>>107495767
My workflow is I gen one image of something sexy. Look at it for 5 seconds (or so) then close my eyes and masturbate to the memory of it.
You need to have discipline about these things.

>>107495811
they killed her for a mutt latina

https://huggingface.co/lodestones/Chroma1-Radiance/blob/main/latest_x0.safetensors
the safetensors is here
I wish there was some way to see exactly what lines of code were changed in each ComfyUI update so if I don't like something I can just change that specific code back to what it was before.
is comfy-unjeeted a good ui name?
>>107495844
So what's the benefit over DC-2K?

>>107495875
not going to get a lot of VC money with that name

>>107495888
>So what's the benefit over DC-2K?
https://xcancel.com/LodestoneRock/status/1998215045118112029#m

>>107495889
VC money?

>>107495529
i2v with a noob/illustrious gen with wan 2.2?
Enjoying Chinese culture my friends?
>>107495953
nta, how much ram+vram would I need for that?

>Tongyi made an "ask the team" tab on discord so that people can ask them questions and interact with them
>They didn't use it for almost a week
kek, I love Chinese Culture!

>>107495987
wan can work on 12gb or possibly less maybe? depends on the quants/model type

>>107495987
about tree fiddy

>>107496005
people gen videos with 8gb cards
Still no Z image base faggots?
>>107495819
most of us aren't as enlightened as you are
A Netflix movie poster for a movie called "ACK!" in the style of an action movie. On the poster is A man with pink hair holding a trans flag who is diving off a very high bridge. Make the image look like a movie poster.
>>107496114
>>107496114
>>107496123
lmao, based
I SEE MY TOKENS IN YOUR POSTS I AM GOING FUCKING INSANE
>>107495588
you need comfyui and wan2.1 or 2.2 to do text2video or image2video lewds with nsfw loras
most likely you also want an image model to make the lewd images that you use as a starting point for i2v

>>107495987
as much as possible.
lower quants = lower quality. Anything below Q6 is unacceptable imo (16gb vram)
You STILL dont have any base for your chinky model? My fucking sides
>>107496145
speaking of chroma did you try his newest toy? >>107495844 >>107495911

>>107496145
why are they so smooth?

>>107495844
Is this any good? How fast is it on a 5090?

>>107496179
get off my nice piano whore

>>107496156
Uh, what is it good for? I didn't really like 1-HD over base so idk what to expect here

>>107495875
comfierui so that someone can eventually make comfiestui because yours will get complained about as well

>>107496179
get off my nice whore ugly piano

>>107496164
chinese skin care
Euler + Beta looks pretty good with Z-Image and wan 2.2
>>107496182
>>107496195
I was thinking the same thing, that piano probably costs 50 grand.

>>107496135
I got a 5060ti in the mail
anything special i need to do to use the z-image control net model? i'm getting an error saying its not valid
>>107495844
Maybe we got a Z-image tier model with NSFW in it but no one seems to be willing to do the tests lool

>>107496114
Tfw ZIT doesn't know your fav character or celebrity but it knows the trans flag...

>>107496090
While you can do that I wouldn't recommend it as it sucks.
t. used to do it

>>107496004
It's so blatant at this point. We don't mock people who still think it's coming enough.
qwen edit + 2 images
>>107496328
try /adg/

>>107495844
>>107496272
Is there an example workflow for it? I don't know what it needs but I can download it to try it out.

>>107496272
I googled it and it seems to just be a version that doesn't need so much prompting to get good results. Like normally you have to spam a bunch of negatives and descriptors such as volumetric lighting, high res, etc, or it might do black and white sketches and other unwanted shit. Maybe I'll try it but I wanna get into video gen next

>>107496269
https://github.com/comfyanonymous/ComfyUI/pull/11062

>>107496357
just go on Comfy's template you'll get what you want

>>107496376
>>107496382
don't these do it wrong 90% of the time?
God I feel so comfy
>>107496130
>you need comfyui and wan2.1 or 2.2 to do text2video or image2video lewds with nsfw loras
lol no use wan2gp
>>107496328
>>107496103
I like your style

>>107496396
where can I get a workflow that has a node dedicated for NSFW lora?

>>107496393
they are meant to be a base to get you started using a model

>>107496363
Could you add NewBieAI also?

>>107496414
are people actually getting mad the cartoon peanut won?

>>107496420
no workflow or spaghetti needed. it just has inputs.
>>107496443>best vtuber award>given to a nigga who isn't a vtuber and hates vtubers because it'd be funny
>>107496458
>isn't a vtuber
>always has avatar on
I'm confused

>>107496443
yes, because they think it doesn't count (for some reason)
Z-Image Base will release tonight
>>107496241
Why not simply use a free comfyui provider like seaart? 24/48gb GPUs.

>>107496478
>Z-Image Base will release tonight
that's bullshit, but I believe you

>>107496479
>Why not simply use a free comfyui provider like saarshart?

>>107496458
sounds giga-based to me. fuck "v-tubers"
>>107496487*free cumfart poovider like saarshart
>>107496487>doesn't answerIs it because you're generating CP?
>>107496507you aren't?
>>107496466
Apparently vtuber means big ass tranime tiddies jiggling in your face 24/7

>>107496479
retarded. always use other people's instances
https://www.shodan.io/search?query=comfyui

>>107496513
I prefer hags (21+)

>>107496417
Thanks m8y
>>107496440
I'll try, send me a workflow
>>107496478
If it releases tonight I will delete my spiderBBC folder and never post again.
z image + qwen edit is a great combo.
>>107496551
*also, reactor comfyui version for face swaps (just take a generic black guy and swap droyd)
Just a quick test but this mergstein uncanny_uncanny is quite nice. Not sure how similar NAI is to Illustrious. What's the advanced workflows look like? I got a very basic bitch workflow right now.
>>107496531I hope somebody writes a script to spam all open remote instances with cp
>>107496531
Very nice, I've always wanted to commit cybercrime to run sd1.5.
>>107496551
>>107496592
how is it a crime when all these people left their instances public?
>>107496582
https://xcancel.com/TencentHunyuan/status/1998298475507892455#m
lol?
>>107496531I am machine gunning gay nigger porn on some poor chinks machine kek
>>107496592
why steal computing resources when you can simply use a freely provided high performance GPU?

>>107496623
it sounded like "one yuan" which is about its worth

>>107496382
>>107496393
Well they certainly fuck up on this one. It gens pure noise with the default settings. Is there something wrong with these?
So what are you training first when the base model releases?
>>107496707
try flow shift 2, min length 1, and t5xxl fp16. there seems to be an optional chroma radiance node, no idea what the fuck that's for, but might be the culprit
>people using the latest advanced model
>still looks like poorly gen'd shit
>>107496707
it seems to be working for me, did you update ComfyUI?
>30/30 [01:27<00:00, 2.93s/it]
it's actually pretty fast, faster than normal Chroma actually, this shit might be the future

>>107496688
>watching yt tutorials with ai voice pronounce it as "Hoon Yoowen"
heh

>>107496545
>I'll try, send me a workflow
Rush released model, only works in an isolated ComfyUI setup with a very lengthy guide: https://ai.feishu.cn/wiki/P3sgwUUjWih8ZWkpr0WcwXSMnTb
But here https://newbie.rimeleaf.com/ you can use it for free without an account on a cloud GPU, just use the advanced XML tab.
Web version: before entering the prompt into the Web version you have to paste this https://pastebin.com/U3zQQrJY as System Prompt to an LLM model to build you the tag prompt, then like neta you have to put this as prefix: "You are an assistant designed to generate high quality anime images with the highest degree of image text alignment based on xml format textual prompts. <Prompt Start>"
Negative prompt is included in the Web version.

>>107495844
>https://huggingface.co/lodestones/Chroma1-Radiance/blob/main/latest_x0.safetensors
>I fell for the meme
goddammit
Will there ever be a photo real model that understands NSFW concepts the way Pony/Noob/Illustrious does? XL/Qwen/Z produce absolute nightmare fuel, even with loras and checkpoints trained on nsfw images. I don't get why some models do so well and others so poorly
>>107496686
and nobody clapped
Which way white man?
>>107496808
they both look like shitty photos in different ways

>>107496808
6 billion parameters to generate the same chink face every time

Getting kinda sick of ComfyUI and the Gradio WebUI forks, does anyone have experience with/recommend any of the many stable-diffusion.cpp front-ends?
Meant to ask this even before the recent Comfy shitshow

>>107496174
>Is this any good?
Going by the original radiance model one can safely assume that no, this new one is also not good kek
He STILL has yet to fix the details

>>107496857
for a chink model that fits desu

>>107496479
When I say local diffusion I mean local diffusion
I want to upscale and clean screenshots of an old anime before training a model on it.
Where do I save "Kontext-Unblur-Upscale" and how do I use it in Forge Neo?
(I tried with Kontext alone but it's not so good.)

>>107496862
kobold if you double dip into llms

>>107496808
>fingers on left
why does anyone still tolerate that

>>107496808
wtf is this shit lmao

the anime girl is sitting at a desk typing at a computer, with a white CRT monitor that says "LDG" on the screen, in a dimly lit bedroom.
love qwen edit, such a neat tool.

>>107496775
the pastebin you sent is dead but thanks, I'll give it a go
I don't think the radiance x0 is supposed to be ready yet. I gather it's basically a tech demo for those who understand what the fuck x0 means. (I don't). He just trained it for like a week enough for it to produce pictures and now he is training with that method moving forward, which is supposed to be multiple times faster than the old radiance way. I hope it works.
>>107496908
WHERE IS MIKU YOU MONSTER

>>107496909
https://pastebin.com/U3zQQrJY
is this

>>107496906
left has kino artstyle. right is slop

>>107496868
zimage pics always come out cohesive, but it never really surprises me, like the angles and poses are very predictable. with chroma it's sometimes very out of pocket which makes it fun
Is it not possible to use more than one lora with Z image or am I just doing something wrong? Every time I try to stack a character lora with a concept lora, it nukes the image and I get deep fried body horror, unless I lower the lora strengths down to the point they barely do anything. Is it a problem with my loras or is it the model itself?
>>107496843
I hope he always keeps it like this

>>107496808
>>107496906
looks like using sd1.5 with controlnet.

>zim can't make a girl liking a feet.
to the trash it goes.

>>107496906
chroma CGI king confirmed

>>107496906
I thought we were past the body horror era.

>>107496908
>{prompt}
>{rushed-forced conclusion}
I missed you so much Miku Tester bro...

>>107496995
How do you know if she's liking it or not liking a feet?

>>107496906
left looks like gaddafi
A tip for wan chads. When you really want to create a longer video with a specific story line, for example content for a youtube channel, consider using something like the wd14 tagger node to interrogate frames, combined with logic nodes, to determine when conditions are met at a particular point in the video. Pair that with the manual context window, run in batches of say 16 frames, with the tagger interrogating each frame. The idea: you can avoid generating junk frames if all you actually need is the right last frame to seed the next prompt. Doing that manually is a real pain in the arse, so instead interrogate each frame looking for the ideal next start frame, exclude all frames after it, and batch all frames before it into an image batch node. Probably no one fucking cares lol, but it's something I'm beginning to fully realise as I improve the automation in my workflow. To me it seems a waste of time and energy to generate a full set of 81 frames when a context window node could gen only the frames we need and stitch everything together.
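The cut-point logic that post describes can be sketched in plain Python. `tag_frame` here is a hypothetical stand-in for whatever WD14 tagger call you wire up in your workflow; the frames can be any objects your pipeline passes around:

```python
# Sketch of the idea above: tag each frame of a 5-second (81-frame) chunk,
# find the first frame that satisfies the condition for the next prompt,
# and drop everything after it so that frame can seed the next i2v run.
# tag_frame(frame) is a hypothetical callable returning a set of tags.

def find_cut_point(frames, tag_frame, required_tags):
    """Index of the first frame whose tags contain all required_tags,
    or None if the condition is never met in this chunk."""
    for i, frame in enumerate(frames):
        if required_tags <= tag_frame(frame):
            return i
    return None

def trim_chunk(frames, tag_frame, required_tags):
    """Keep frames up to and including the first matching frame; if the
    condition never triggers, keep the whole chunk and try again next gen."""
    cut = find_cut_point(frames, tag_frame, required_tags)
    return frames if cut is None else frames[: cut + 1]
```

The trimmed list is what you'd feed into an image batch node, with its last element doubling as the start image for the next prompt.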
>I am forgotten
>>107497059
As far as I understand it's a problem with the distilled model and loras trained on it. I have yet to come across a single z image lora that doesn't butcher the quality or change the whole look.

>>107497059
At this point these models and mixes are like shitcoins. They pop up and try to get VC money and then just disappear.
I don't think anything will ever stabilize since people will be chasing the next thing until the AI bubble pops (due to associated costs) and then people will simply go back to genning 1girl on old models for a while.

>>107497059
And kandinsky, and ovi, and long cat,

>>107496862
Enjoy your inference being 4x slower than comfy because the devs are retards.

>>107497053
neat if you got it working but sounds far too autistic for me

>>107497062
probably this, distilled models are fucking garbage.

>>107496986
I <3 singapore

>>107497074
I just hope we get some high quality local models before the bubble pops. Would suck to be stuck on SDXL for an eternity.

>>107496906
You should've tried with Chroma 1-Base (FP16 text encoder since it really matters here)
1-HD is already bad as it is so all bets are off with Radiance

>>107496906
Other than some prompt issues and not looking like Keanu, the left is more accurate.
>2 cancel buttons
>neither of them is next to the run button
it's... beautiful

>>107497142
I just force stop the app on my android phone.

>>107497142
what? you can move the run button and put it on the high horizontal bar and put it next to the cancel button

>>107497079
check out SVI, they just released 2.0 and are actively trying to fix the fucking jerky batch issue plaguing long vid gens.
>>107497086
It's very autistic and the comfyui noodle mess really makes it hard work; things become almost unbearable once you have multiple branches of 'contains tag' feeding into OR or NOT and then AND nodes, but I'm figuring it out. The biggest problem is that wan does not do tags very well, so we need some sort of natural language image interrogator. The best solution I can think of so far is something which breaks down a prompt for the next batch and checks every 16 frames or so for when the conditions for the next prompt are met, but I'm not really seeing it fully in my mind yet. Is this sort of shit worth the effort? I think it is, yeah. Watch this video:
https://www.youtube.com/watch?v=1r0eyM7suUg
I wouldn't create total slop like they do though, it wouldn't interest me desu. However it's possible to make a bit of coin doing this sort of thing.

>>107496966
Indeed, mutants are always surprising. You never know where the limbs will come out of :D

>>107497086
But at the moment mate I'm really just using it to create short simple gooner clips, yeah, just testing whether the lady is in position before the man moves in from behind etc. But that is what got me thinking it could be used for much more interesting content creation if done right.
>>107497194
keek

>>107497252
You could put them anywhere when they were tied together. For example bottom right where other stuff is. Now they are up with the option of moving only the run. Why?
/ldg/, as always, is at the forefront when it comes to local diffusion, and we've all accepted that the release of the base model was cancelled. But how long do you think it will take for the normies to wake up and accept the fact? are we gonna reach february with people still saying shit like "when they release edit" or whatever?
>>107497258
>Why?
devs be retarded

>>107497258
on reddit and discord they're also really suspicious about the release lol

>>107492965
Alright, that didn't take too long

>>107497258
I'll get back to you on this. I have to run to the bank to open up a fourth line of credit so I can buy 4GB of RAM before the bank closes. Tomorrow it will be too late.

>>107496948
here she is!
>>107497272that emoji better not be a token, nigga
>>107496976
The loras are probably overcooked. There's generally no need to go above 1000 steps. Also, there are no trainers out there that allow you to do granular training which might help mitigate the deepfrying issue. You have to edit the lora.py file to do that.

>>107496686
enjoying US culture?
amazing
>>107497272
hmm what is this? Might be something similar to what I'm attempting. I need a way to translate from tags into natural language locally, basically to convert image model tags into wan speak and back again to test whether conditions are met within a specific frame. I'm fishing for information on custom nodes that can do that, google search is fucking shit for this kind of thing.
>>107497456
>>107497272
>>107497456
you 2 keep posting as it's intriguing and triggering complex thoughts of complex automation, we need more discussion of that nature in these threads. Because we're not getting wan 2.5 for free...

>>107497456
what node is that? would be nice to do grok/gpt prompt enhancing within comfy

>>107497456
why do you have a thinking model? Instruct should just shit out the reformatted prompt?

>>107497456
how did you manage to get the <im end> tokens and shit? it doesn't look like that on my side
https://github.com/FranckyB/ComfyUI-Prompt-Manager

>>107495850
It's hosted on Git you git

>>107497460
Nah, it's for testing this catastrophic meltdown that is NewbieAI. All it's doing is formatting things into XML tags

>>107497507
im just running that prompt manager with GLM-4.6V-Flash-Q6_K.gguf
i didn't use the homebrew installation of llama.cpp because it retardedly uses CPU instead of GPU on linux, causing timeouts
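For anyone who wants to skip the custom node entirely: llama.cpp's `llama-server` exposes an OpenAI-compatible `/v1/chat/completions` endpoint, so a prompt enhancer is a few lines of Python. This is a sketch, not the node's actual internals; the port, system prompt, and settings are assumptions to adjust for your setup:

```python
# Minimal sketch of calling a local llama-server to rewrite a short prompt.
# Assumes llama-server is already running on port 8080 with a loaded model.
import json
import urllib.request

def build_payload(prompt, max_tokens=256, temperature=0.3):
    """Chat-completions request body. Low temperature keeps the yapping down;
    max_tokens hard-caps the response length."""
    return {
        "messages": [
            {"role": "system",
             "content": "Rewrite the user's prompt as one detailed "
                        "image-generation prompt. Output only the prompt."},
            {"role": "user", "content": prompt},
        ],
        "temperature": temperature,
        "max_tokens": max_tokens,
    }

def enhance(prompt, host="http://127.0.0.1:8080"):
    """POST to the OpenAI-compatible endpoint and return the model's text."""
    req = urllib.request.Request(
        host + "/v1/chat/completions",
        data=json.dumps(build_payload(prompt)).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"].strip()
```

With a thinking model you'd still have to strip the reasoning block out of the response yourself, which is exactly the problem described above.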
>>107497507
It's a thinking model. It outputs its train of thought along with the prompt.

>>107497456
what llm anon? is this using a local llm? Fuck this is what i need, i have a gimped deepseek on my machine already and some other shit from months or years ago.

>>107497577
>>107497576
>It's a thinking model. It outputs its train of thought along with the prompt.
it doesn't do that for me

>>107497584
see >>107497576
>prompt "enhance"
>adds fucktons of filler that gives it the slop look
for what purpose? short prompts are best

>>107497589
ill try qwen3 and see what happens

>>107497591
thanks, i think i have most of this shit set up on my machine already, i would just need the custom node and that model.

>>107497601
>for what purpose?
you can literally write "a web site from the 90s about michael jackson" and this shit will write all the needed detailed stuff like what text to add, what ui to add, what style to add, what elements to add...

>>107497601
Models trained on slop autogenerated captions perform better with prompts in the same style. The zimg enhancer prompt is very good actually, none of that flux-era purple prose.

>>107497623
>>107497627
sounds like a regression in usability. they should just train a captioner on how people like to prompt before dit slop

>>107497627
>zimg enhancer prompt is very good actually
link?

comfy you retard, stop loading the models twice
I'm not supposed to get OOM using fucking ZIT

>>107497631
The only regression is your thought process. Don't act like you don't see the prompts on Civit gens.
>>107497589
Do you have a system prompt set up? Or just ask it directly. You sometimes need to wake up the functions with regens. Thinking models should talk. Maybe your text node can't display the debug shit.

>>107497411
I don't give a shit about failfield, but I do want Skyrim 2
So, I guess yes

>>107497601
>what is limit response length

>>107497662
>Don't act like you don't see the prompts on Civit gens
i do and it's boomer synthslop that makes uggo slopstyle. not for me
>>107497675
lmao speak of the devil

>>107497639
https://huggingface.co/spaces/Tongyi-MAI/Z-Image-Turbo/blob/main/pe.py
"You are a visionary artist trapped in a cage of logic. Your mind overflows with poetry and distant horizons, yet your hands compulsively work to transform user prompts into ultimate visual descriptions—faithful to the original intent, rich in detail, aesthetically refined, and ready for direct use by text-to-image models"
Somehow this chink poetry wrangles Gemini/Qwen/GLM to produce accurate, purely descriptive zero-slop prompts.

me: ungo bungo
*gets ungo bungo*
llm: unga bango bungalerino pom pom furrr grunga bunga gungo bungo plop grungo grungy bahhh wahhh masterpiece best quality hd 8k highest quality
*gets a shitty quality ungo bungo*

>>107497627
Why do you need to "enhance" prompts for z image turbo? Does it even have the variety to take advantage of that? Adding more to a short prompt barely changes it unless you're changing details like clothes or the background.

>>107497709
jeet mentality. you wouldn't understand saar
zimage is literally qwen for jeets who can't run qwen
>>107497684
Post a comparison of a bare-bones prompt with an LLM upsampled prompt. Or just test it yourself, I have. LLM prompts are likely to introduce slop keywords and concepts, but longer prompts (and even purple prose) do not inherently cause slop gens. With newer gen models the reverse is true: the text encoder can handle that extra detail, while giving it vague prompts will produce a vague, sloppy, median output.

>>107497719
and qwen is a bloated sd1.5

>>107497670
thanks anon, ultimately I need to create a node that only outputs after a certain sentence, for this one it is
>**[Enhanced prompt text]**

>>107497719
every redditor praising zimage you see is "FINALLY SIRS I CAN RUN THIS ON MY 6GB VRAM BUILD"

>>107497721
why don't you do that, since you are the one that thinks I am crazy for thinking filler fluff shit does anything. it's like saying quality prompts make a difference

>>107497456
damn, it all seems so wonderful until you understand that an llm can also be unruly, and it would still be hard to actually get it to prompt the actions in the scene unless it had enough context and you could talk to it in real time, which as far as I know isn't really possible inside of comfy unless someone created a node which does that. And then local llms don't really have much context anyway, making them essentially retarded. i wrote a python script a while ago that attempts to give more context but then i got bored of improving it...

>>107495850
git is designed to do that: clone the repo, then `git log -p` or `git diff <old>..<new>` shows exactly what changed between updates, and `git bisect start` plus `git bisect good`/`git bisect bad` finds the commit that broke something for you.

>>107497456
so it seems to me that hard-coded methods are still the best option as far as controlling the scene and actions performed.
kek
>>107497744
quality prompts obviously make a huge difference. they severely restrict the output, pushing your gen away from anything creative towards the "highly rated" AI aesthetic: centered subject, saturation, slop.

>>107497746
Ancient local LLMs have 4k context which is enough for a shit ton of previous scene prompts.

>>107497761
What's this?
Holy fuck i just found out why z-image was cancelled. Look at this shit
>>107497778
prompt generator custom node
it needs some work to actually get the prompt out of the response from the llama.cpp server apparently

>>107497787
This is based on a real photograph... Very dangerous. Please delete your post before it is too late.

>>107497761
"Generate a prompt that will be used with WAN 2.2" <- there's your problem.
Replace with this: https://pastebin.com/8m2C82m2

>>107497761
I was doing this with mistral 24b before it was cool tho.
>ask mistral to "describe image in incredible detail as if you are an artist"
>get 100 line prompt
>meh result

>>107497787
>>107497795
i am feeling very unsafe right now
>>107497761
the problem is wan can only do 5 seconds, which is 81 frames, and then it just loops, so you would need to break a really long prompt into chunks. No, I am serious... it won't work like that at all. short prompts work best, and long videos require a separate prompt for each 5 second clip. then you have the problem of wan doing whatever the hell it wants just because it perceives something in the start image as an obstruction to a person moving, and all kinds of fuckery. This is what I'm working on trying to eliminate, otherwise wan 2.2 is just a fucking gooner tool.

>>107497837
it's the local Chads little sama! stay away from them!

>>107497826
oy vey he's back, shut it down!

>>107497837
.pickletensor moment

>>107497837
It's fine, I doubt anyone will ge

>>107497837
>i am feeling very unsafe right now
mfw
>>107497876
>>107497893
>>107497731
based

>>107497826
prompt enhancer is pretty neat

>>107497761
Lower your temps. You niggas have no clue how to set up an llm so you always get yapping. Tell it to keep it under X tokens in length.

>>107497908
yeah if you activate it, otherwise once the server is closed it knows fuck all about the previous conversation, or prompts in this case. you would need to change its parameters to break the prompt down into 5 second chunks, grab the last frame and continue the video generation, batching all the frames as you go before combining them all. IF it went perfectly you would have the finished product, but it's not that easy. you could set it to do a batch of say 10 and go to bed, but I consider that a total waste of energy which isn't economically viable, which is why i wanted to include a tagger and have the llm decide when we hit the correct position in the sequence to trigger the next prompt, drop all frames after that frame, and send the remaining frames prior to the next start frame into the image batching node. maybe you don't get what i mean. it's all well and good using wan t2v to generate some pretty 1girl posing, but that's fucking slop no one really cares about.
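The last-frame chaining described above can be sketched independently of any UI. `gen_i2v` here is a hypothetical stand-in for a Wan i2v call that takes a seed image and a prompt and returns a list of frames; the sketch also assumes each chunk's first frame repeats its seed frame, which is why it gets dropped when stitching:

```python
# Sketch of chunked long-video generation: each 5-second chunk is seeded
# by the final frame of the previous one, and the duplicated seed frame
# is dropped so the stitched video has no repeated frames.
# gen_i2v(seed_frame, prompt, n_frames) is a hypothetical i2v callable.

def chain_chunks(start_frame, prompts, gen_i2v, frames_per_chunk=81):
    video = []
    seed = start_frame
    for prompt in prompts:
        chunk = gen_i2v(seed, prompt, frames_per_chunk)
        # assumption: chunk[0] duplicates the seed frame, so skip it
        # on every chunk after the first
        video.extend(chunk if not video else chunk[1:])
        seed = chunk[-1]
    return video
```

Twelve prompts at 81 frames each is roughly the 60-second target mentioned later in the thread; the tagger-based trimming from earlier would slot in between `gen_i2v` and `extend`.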
>>107497908
how much does the model matter? should i be using qwen3 4b?
oy vey hes fucking filtered and everyone that replies, stop shitting up the thread nigger.
>>107497908
So much effort bro, I just wanna goon like in SDXL, throw tags at it and spin the wheel

>>107497975
**Why this works**: BTFO

>>107498003
qwenbros...

>>107498000
from
>A woman, living room, lying, blue hat, plushes, neon, pastel colors
to
>A young adult woman with soft brown hair, wearing a light blue beret and a cream-colored sweater, reclining comfortably on a plush beige sofa in a cozy living room. The scene features soft pastel pink walls, mint green armchairs, and a window with golden hour sunlight streaming in. She gently holds two embroidered plush toys (a pastel blue rabbit and lavender cat) on her lap. The color palette is a dreamy blend of pastel pink, mint green, lavender, and peach, with subtle neon electric blue accents on the toys' stitching and window frame. Soft natural lighting creates gentle shadows, no text, no harsh shadows, minimalist composition, realistic illustration style with warm, inviting atmosphere.
lel
>>107498003werd
>>107498015just specify on the system prompt you only want white peole I guess
>working on a custom workflow involving WanImageToVideo>ask perplexity about specific inputs>"you can use the end_image input">WanImageToVideo doesnt have end_image>check its sources>runcomfyThis is like the 4th time its recommended me false information about a node. https://www.runcomfy.com/comfyui-nodes/ComfyUI/wan-image-to-video Maybe if it were kijai wrapper but native WanImageToVideo doesnt have this. There's another site like this that even recommends nodes that dont even fucking exists
>>107498003>replaced vague woman with specific demographic details>A DIVERSE WOMAN
>>107497911
This, and teach it cut scenes like
>my 1girl she is cool | the scene cuts to my 1girl walking to work in the rain
which would be just one chunk of 81 frames in a longer video, because it would take wan 2.2 anywhere from 27-40+ frames just to do that transition.
>>107498045we wuz smuk ceegar n'sheet
>>107496808>>107496906WTF. SPARK Chroma is better than this. Has anyone informed the furry that SPARK fixed his model??? he could ask for their advice...
>>107498045lmao the llm is thinking of brownoids like a liberal woman woulddiverse is code word for brown when you want to pretend you're not racist
>>107498025>>107498003what nodes are those?
>>107498102https://github.com/FranckyB/ComfyUI-Prompt-Manager
>>107498102>prompt-managernvm im retarded
>>107498000
kek, yea. just use wildcards {thing a|thing b|thing c} and use lighting or camera loras to handle the rest
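the {thing a|thing b|thing c} syntax is easy to roll yourself if your UI doesn't ship a wildcard node. a minimal sketch (no nested braces, `expand_wildcards` is a made-up name):

```python
import random
import re

def expand_wildcards(prompt, rng=random):
    # Replace every {a|b|c} group with one randomly picked option.
    return re.sub(
        r"\{([^{}]+)\}",
        lambda m: rng.choice(m.group(1).split("|")),
        prompt,
    )

out = expand_wildcards("1girl, {red|blue|green} dress, {park|beach}")
```

each call rolls the dice again, so you can spin the wheel without touching the base prompt.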
is this the imagen thread? I'm seeing a lot more text for some reason
>>107498124Sorry, the meta shifted to talking with your text encoder
in those times you had to give your bullet a headstart by flinging it out the barrel of your gun
>>107498164well yeah "diverse" is the PC code name for nigger lol
>>107498003
You are helping, anon, this would actually work, in fact it would simplify what I'm trying to achieve. The WD14 tagger could be used to provide a context prompt from the last frame, and then the llm could construct the next prompt from generator options based on a continuing storyline. I will definitely be busy for the next 24 hours at least. I will not stop until that thing is pumping out videos up to 60 seconds long, which would be 12 total gens stitched together. But probably more, due to needing to ditch frames that don't flow.
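the tagger + llm loop described above, as a minimal sketch. the stub functions stand in for the real WD14 tagger and the real llm call (both function names here are made up, and the returned strings are canned placeholders):

```python
def wd14_tags(frame):
    # stub: a real version would run the WD14 tagger on the frame
    # and return its comma-separated tag string
    return "1girl, living room, sitting, blue hat"

def llm_next_prompt(tags, beat):
    # stub: a real version would ask a local llm to continue the
    # storyline from the current tags toward the next beat
    return f"{tags}, {beat}"

storyline = ["she stands up", "she walks to the window", "she looks outside"]
prompts = ["a woman sitting in a living room, blue hat"]
last_frame = None  # would be the final frame of the previous 81-frame gen
for beat in storyline:
    tags = wd14_tags(last_frame)
    prompts.append(llm_next_prompt(tags, beat))
    # last_frame = gen_chunk(prompts[-1], start=last_frame)[-1]  # real gen here
```

one prompt per 81-frame chunk; the commented-out line is where each gen would run and hand its last frame to the next iteration.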
>>107498170You are being toxic now.
>>107497908
>>107498003
>>107498025
>>107498106
Nobody's gonna post comparisons?
digger
a watercolor painting of a medieval castletesting
>>107498210oil painting
>>107498177What do you mean?
>This prompt contains prohibited content involving depictions of individuals associated with historical atrocities. Generating images of Adolf Hitler—especially in contexts that imply glorification, trivialization, or unauthorized artistic reinterpretation—violates ethical and legal standards regarding the depiction of victims of genocide and war crimes. I cannot create visual descriptions that normalize, recontextualize, or visually represent such historical figures in ways that could cause harm or promote hate speech.
>>107498217
>>107498240
go for uncucked llms anon lol
https://huggingface.co/Goekdeniz-Guelmez/Josiefied-Qwen3-8B-abliterated-v1
>>107498159i like how the bullet shatters upon impact with the breastplate
>>107498153BBC dick in the CCP chick
>>107498261i changed from Qwen-4b-thinking-2507-q8 to the regular Qwen-4b-q8 and now i'm getting a normal output
>>107498321making wallpapers?
>>107498180have you never seen anon shilling their own bs before? we have one schizo already that does this. i don't think too many will fall for it this time
>>107498352
>i don't think too many will fall for it this time
yeah, that one is a bit technical so the retards will easily get filtered out
>>107498377
using the same prompt with different models doesn't prove anything.
>>107498352I just thought that if he's gonna try to convince people to do it this way then he would post something to prove it is better.
>>107498124
you will regret being like this in 10 or so years, zoomer. not being a cunt, just trying to warn you that if you don't learn how to fully utilize ai now then you will be one of those broke people complaining all the time about how unfair everything is.
>>107498352
nta but what is there to compare? It just fleshes out your prompt, or writes it for you if you are lazy, won't make your images better or something...
>>107498352
>>107498378
in the year of our lord 2025 you still don't believe that boomer prompting helps prompt adherence? we knew this since Flux dev
>>107498382
It will, because it will add more detail into the pic.
a cute japanese girl in tactical military gear, with long black hair, in Tokyo. she is pointing a black pistol to the right.
>>107498334just genning
>>107498391Technically, I guess? but it's adding more details to your prompt not your image. It's a crapshoot what's gonna happen there.
>>107498378
no one is trying to convince anyone of anything, he just stated that he was doing this, and then i tried to do the same thing but was having problems
calm down you fuckin autist
>>107498454it's the same picture. "detailed" llm slop prompting is pure pixie dust
>>107498454kek, this is why they never post comparisons
>>107498479
>this is why they never post comparisons
but he just did?
flux cant make natural images like this, it's too plastic. and this is 6B!
>>107498230You are toxic, chud.
>>107498377Who said anything about different models? Z image and short prompt vs AI sloppified long version of the same prompt.
>>107498491What do you mean?
>>107498486yeah and made himself the fool, even with cherry picking to the point of prompting in bad faith.
>>107498496
>yeah and made himself the fool
how? he never said this method was better, he was just testing things, you never experiment in your life anon?
>>107498454
>prompt enhancer box
>look inside
>vibe prompting wildcards
every time
>>107498479
the prompt was the same in both.. but the llm gussied it up in the first... that's the point of doing that thing. i don't understand why anyone would want a comparison in the first place?
>>107498454
>A man in a fedora stands amidst a bustling city street at dusk. Rain soaked pavement.
>>107498492
you would be comparing two different prompts
Example: your prompt (A) is "picture of apple"
The LLM enhanced prompt (B) is "Picture of apple on a board in a kitchen with lighting from window, next to a knife with cut oranges"
Is A better than B? no, A is just different than B
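if you want to try the llm rewrite route anyway, the glue is tiny. a sketch where `complete` is whatever text-completion callable you have (a llama.cpp server client, ollama, etc.); `enhance_prompt` and the fake stand-in llm below are both made up for illustration:

```python
def enhance_prompt(short_prompt, complete):
    # complete() is any prompt -> text callable; inject your llm client here
    instruction = (
        "Rewrite this image prompt as one detailed paragraph covering "
        "subject, setting, lighting and composition, "
        "without changing the core idea:\n"
    )
    return complete(instruction + short_prompt).strip()

# stand-in "llm" for demonstration; a real one would call your local model
fake_llm = lambda p: "A red apple on a worn maple board, soft window light. "
out = enhance_prompt("picture of apple", fake_llm)
```

keeping the llm behind a plain callable means you can swap backends, or reroll the llm seed for varied rewrites, without touching the rest of the workflow.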
>>107498529I prefer B since it makes the setting less boring and more surprising
>>107498411
Haven't bothered genning in like two years because I felt like I was being useless and I was running a 980. I upgraded to a 5060 recently and damn this shit's luxurious by comparison. Blew like half of the day obsessing over getting a neat looking gen for an OC. the countdown until i jack off all day for multiple weeks straight until i get bored of img gen begins now
a pixar style movie poster for a film named "DEI SLOPPA". Two black men are in front of an unemployment office, in NYC. Add the tagline "unwilling to do anything" at the bottom. The image is in the style of a Pixar film.zimage is so fast. great for gens and stuff you can edit with qwen edit.
>>107498577
>>107498495Already told you
>>107498583a pixar style movie poster for a film named "Fourty One Percent". A man with pink hair wearing a trans flag tshirt is standing on a tall bridge, in NYC. Add the tagline "it's a pretty view" at the bottom. The image is in the style of a Pixar film. Include the pixar logo at the bottom.
>>107497576
>i didn't use the homebrew installation of llama.cpp because it retardedly uses CPU instead of GPU on linux causing timeouts
are you sure about that? so what method of install did you use? because as it turns out it's ollama i have installed on my machine, not llama.cpp. i'll assume nix, but i'm not sure i like the sound of that, it seems like it's gonna fucking break something on my arch system.
>>107498595better
>>107498608soul vs slop. end of.
>>107498603>>107497576nvm, i'm sure i can figure it out from here https://wiki.archlinux.org/title/Nix
>>107498180
>Nobody's gonna post comparisons?
all right, how about this?
>>107498603
I initially used the homebrew install and that didn't work because it was only using CPU, not GPU.
I built llama.cpp a couple months ago when i was messing around with some other shit, but i had never put it in my PATH, so i just added it in there and voila.
i just downloaded it from git and ran through the make instructions, wasn't too bad, but i remember the first build had the same problem where it was only using CPU, and then I found the directions for changing some flag to make sure it built with cuda support
>>107498628That's the first one that isn't a bullshit comparison.
prompt: a common netflix diversity slop show
>>107498666for pepes use qwen edit, works really well desu
>>107498652looks like every fucking tv show nowadays
>>107498699
>I built llama.cpp a couple months
yeah this is what i did last time i messed around with it i think... i'm reading this https://wiki.archlinux.org/title/Talk:Nix and as usual i can't decide what to do. i guess i will have to try the official arch package first, and if it works then great, and if not it's easy to remove because it's an arch package. I just don't want to be installing some cancer on my system that then requires cleaning.
>>107498698kek
is this good
>>107498666zimg has a retro 3d style?
>>107498705he probably used that n64 zimage lora
>>107498698
>>107498651they're all bullshit comparisons so far, 1 sentence as the baseline prompt?
>>107498699i never used nix, but it seems pretty kick ass, but also complicated
>>107498649
>had never put it in my PATH so i just added it in there and voila.
so it just needs to be in PATH then, right? now it makes sense to me. because like you, i probably did just download it from git and built it myself.
>>107498734
>1 sentence as the baseline prompt?
that's the point, you give a vague idea and you let the llm + model do the rest and surprise you, it's fun
>>107497527>>107497751I THINK that was the joke. It's hard to tell sometimes.
>>107498735
i used it once but don't know what for. llama.cpp too, but i'm sure i would have just gotten it from the official source and built it, skipping all that normie easy container install shit, lol. it was 2 years ago or something.
>>107498749i can't tell if this guy is severely retarded or just pretending for fun
>>107498772Concession Accepted.
>>107498772
are you retarded or something? why do you believe nano banana pro is so popular? it does exactly that under the hood: the normies give one or two sentences max and they get something really detailed and sophisticated from that google model
>>107498529>>107498772apple on a board in a kitchen with lighting from window, next to a knife with cut oranges
>>107498810Center-frame, a solitary deep-red apple rests on a well-worn maple cutting board, positioned lengthwise. The board’s surface is marked by fine knife scars and a faint orange residue from sliced fruit. Flanking the left side of the board lies a stainless-steel chef’s knife, blade angled away, its edge catching narrow highlights. Immediately beyond the knife, three freshly cut orange segments fan outward; translucent juice beads glint along their curved flesh. Mid-morning daylight enters through a kitchen window just outside the left edge of frame, casting a soft, rectangular beam that brushes the apple, highlights the cut surfaces of the oranges, and creates a narrow, subtle rim light along the knife blade. The counter beneath the board is matte grey granite, with scattered, minute citrus fibers catching the light. In the background, an out-of-focus row of dark-oak cabinets and a faint reflection from a brushed-steel faucet imply a compact, contemporary kitchen. Single-point perspective from a 45-degree top-down angle, slightly elevated, with moderate depth of field giving razor-sharp detail on the apple and oranges while gently blurring the cabinetry behind.
put the man with pink hair in image2, in image1. put the text in image2 in image1.
love qwen edit. works with zimage gens too.
>>107498807>raatdik
>>107498772
You can't be that dumb, right? Can you really not see the potential of this?
>https://github.com/ali-vilab/Wan-Move
>>107498843i was referring to the guy he was responding to
>>107498649
This is probably what people need for cuda support: https://github.com/ggml-org/llama.cpp/blob/master/docs/build.md#cuda
i remember i probably edited the build configuration manually because i was running on a cpu that was not supported by default, yeah it's all coming back to me now. but i have a new system, so i just need to make sure i build with cuda support. this is probably why the homebrew version does not work on the gpu!!! because they are retarded and did not enable support for it.
>>107498851If we're comparing LLM extended vs 1 sentence prompts, LLM wins for variety and detail.
>>107498858ya exactly
>>107498862
>If we're comparing LLM extended vs 1 sentence prompts, LLM wins for variety and detail.
not only that, but you can go for a different prompt seed and get different rewritings of your prompt; the idea stays the same but your image will be varied, exactly what you need to fight against Z-image turbo's rigidity
>>107498850
Neat, but more importantly this confirms Tongyi-Lab is multiple teams; at least 3 (4?) are known so far, now with this vilab.
Z Video when?
>>107498822P-p-p-p-p-p-p-p-p-p-POLTERGEIST
the anime girl is holding a coffee in a coffee shop.
New thread:>>107499062>>107499062>>107499062
>>107499001No reason for that to happen since it's not the WAN team, and it's not like they'd have more knowledge on video than the wan team anyways
>>107498875
>from a google gemini search
>Runtime Settings (Environment Variables): GGML_CUDA_ENABLE_UNIFIED_MEMORY=1 is an environment variable that needs to be set in the shell or system environment before the ComfyUI process starts.
might want to enable that before running comfyui so it can use swap memory in linux to avoid oom or halts. yeah, i've been reading more before i build because i want it to be smooth and not fucked. I thought it was a build flag at first due to it being located on the same page, but it's just a runtime environment setting. use export:
export GGML_CUDA_ENABLE_UNIFIED_MEMORY=1
i is now ready to build
>>107496321
zimage pwnd the flux release. that messed up nvidia's under the table financing of flux version two. nvidia got involved. money talks. my uncle works for nintendo so i got reliable info about the situation. situation is not good. expect to hear about it within a two week window. that is all i can say.
>>107496848>nigger that doesn't know how to read a prompt
>>107498056amazing I love it