/g/ - Technology


Thread archived.
You cannot reply anymore.




File: 1765815295790257.jpg (2.7 MB, 3126x2407)
Discussion of Free and Open Source Diffusion Models

Prev: >>107805470

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg
>>
This sounds pretty good: https://github.com/Saganaki22/ComfyUI-AudioSR

For processing low-quality LTX2 audio output into better quality.
>>
>>107801257
Asking on this one as well for more nanobanana prompts for making datasets
>>
>schizobake
>>
>>107809444
fuck off with your jewtx64gbram
>>
Whether you like it or not AniStudio should be in OP
>>
>>107809444
good idea

>>107809490
what? this runs on most gpu people have here, doesn't it?
>>
>>107809497
feel free to make your own thread. you can put whatever you want in the OP
>>
Anyone using z-image finetunes? Or is everyone running the vanilla original? Somehow I can't decide whether any finetune is actually better than the original.
>>
>>107809513
they are not finetunes, they are just loras merged with the z-image model
>>
>>107809490
meds
>>
>>107809513
only for lora training, not for genning
>>
>>107809539
Are they good for anything?
>>
lmao the image + sound workflow kijai posted is amazing, just provide audio and prompt "talking" for example: deus ex sound clip source

https://files.catbox.moe/tt1obv.mp4
>>
>>107809556

https://files.catbox.moe/oio2rr.mp4
>>
>>107809364
wtf is that
>>
File: comfyui-portable.png (351 KB, 2008x2080)
Hey, can anyone help here?

I am using ComfyUI portable for Wan as per the rentry guide, and it had been working fine up to this point. However, I recently updated it along with all the nodes, and now a bunch of nodes are broken, including the important VHS_Combine node, which is required for generating videos.

Does anyone know exactly what has gone wrong here? Something related to numpy? If so, is there a simple way to downgrade without breaking something critical?
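Before downgrading numpy blindly, you can list which installed packages actually pin it and spot the conflict. A stdlib-only sketch (run it with the portable build's embedded python; the output depends entirely on your install):

```python
import re
from importlib.metadata import distributions

def numpy_pins():
    """Return (package, requirement) pairs for every dist that depends on numpy."""
    pins = []
    for dist in distributions():
        for req in dist.requires or []:
            # \b avoids false hits on e.g. "numpydoc"
            if re.match(r"numpy\b", req, re.IGNORECASE):
                pins.append((dist.metadata["Name"], req))
    return pins

# Conflicting specifiers (one node wanting >=2, another <2) show up side by side here.
for name, req in numpy_pins():
    print(f"{name} wants {req}")
```

Whichever side of the v1/v2 split has fewer dependents is usually the one to reinstall.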
>>
>>107809588
>he pulled
>>
>update comfy
>nothing works anymore
why is cumfartui so shit?
>>
>>107809588
I've been using comfy for years now, and while it can go to shit, it's never this bad, so I wonder if it's a case of the windows install being particularly bad since I'm on ubuntu, or if you guys just install every custom node no matter how unmaintained it is.
You should read the error too: it's a numpy version mismatch, something wants v2.x but something else wants v1.x and it's breaking stuff.
>>
>>107809614
I have literally never had this problem. It's always a simple git pull and requirements install. You must be using the retard portable version or something.
>>
>>107809614
he hates custom nodes and wants you to use the native ones, or wait until they exist
>>
>>107809556
I already lost track of all these workflows. Is it the workflow with Mel-Band RoFormer that can be used for singing? I think https://github.com/RageCat73/RCWorkflows/blob/main/LTX2-Audio-Input-FP8-Distilled.json is even a little improved.

>>107809588
That's a node I like a lot too, though technically you could do without it on a recent comfyui (with fewer options for how to save the video).

IDK what went wrong but I'd generally just uv pip install -r requirements.txt with the venv activated and then do uv pip install <stuff> manually as-needed.
>>
lmao, cia guy does music, and it works: kijai audio workflow

https://files.catbox.moe/t64fx4.mp4
>>
https://files.catbox.moe/apg37g.mp4
>>
im dying cause I didnt prompt to add the black guy. I just prompted "the man sings".

https://files.catbox.moe/ach182.mp4
>>
miku audio:

https://files.catbox.moe/e4ei3p.mp4
>>
I have test anon fatigue.
>>
>>107809701
better, last one but the workflow does in fact work well.

https://files.catbox.moe/jsfa9q.mp4
>>
>>107809710
it's set in deep during kontext "text"
>>
Reads don't mess up an ssd like writes do, right? Doing --cache-none on comfy to reduce ram usage on Wan, but it's doing like 30gb of reads every gen.
>>
>>107809743
only write
>>
>>107809614
he pulled?

>>107809638
of course he does because its not making him any money. i updated recently and found 3 nodes didnt work
>>
Some fun with a LTX default workflow:
https://litter.catbox.moe/9iug9bw879i6o9hb.mp4

>>107809670
I think it actually summons people based on voice(s) at times

>>107809712
looks very good
>>
lmao, Hitler reborn as Floyd (speech audio, floyd image 2 video)

it is SO expressive. I love ltx2.

https://files.catbox.moe/p1gjoi.mp4
>>
the highlight of my day is seeing if my gen made it into the op
>>
>>107809786
slightly longer speech (240 frames)

https://files.catbox.moe/1l3b74.mp4
>>
File: ZImageTurbo + 2x upscaler.png (3.45 MB, 1248x1824)
https://files.catbox.moe/xf5zuk.png
>>
File: 1737409640854075.png (2.28 MB, 1632x928)
>>
File: ZImageTurbo-_0005.png (1.06 MB, 832x1216)
Upscaler screwed up the lettering of this one
https://files.catbox.moe/7lu3so.png
>>
okay last one. this is the best speech by floyd.

https://files.catbox.moe/yfo4xs.mp4
>>
it's apparently a warning
https://litter.catbox.moe/vw0occ677epqlq78.mp4
>>
File: 1756458140114154.png (2.34 MB, 1632x928)
>>
File: 1750313320090174.png (1.86 MB, 1632x928)
>>
File: 1754014357767103.jpg (299 KB, 1352x1058)
>>107809385
Okay my 5070ti just arrived
Image gen massively increased in performance. From 5 to 10 minutes generating 2k image with my old GTX 1080 into 5 to 10 SECONDS with my 5070ti

I think im gonna try Video generation next.
Im gonna get addicted to this. This is my fentanyl
>>
File: images[1].jpg (6 KB, 275x183)
>>107809588
https://files.catbox.moe/p1s7wj.mp4
>>
File: file.png (77 KB, 1024x1024)
horse man
>>
Anon, tell me chroma model to use for realism and anatomy. I love sdxl but hands are always weird
>>
>>107809870
does he speak normal words?
>>
>>107809893
neigh
>>
>>107809385
Say what you want about Flux.2, it has already been proven it has more sovl and LoRAs are way better than ZiT versions

https://civitai.com/models/2212121?modelVersionId=2511510

This is 32B vs. 6B anyways, it's an indisputable fact.
>>
does wan 2.2 i2v degrade the quality when the resolution is too high (e.g. above 1280x720)? just curious how high the resolution can be without fucking up the quality
>>
>>107809900
flux 2 is pretty cool, it may be 32B but it gens faster for me than qwen for example, the offloading works really well
probably not great for NSFW but it's a cool model
>>
>the girl is sitting on the on top of the table next to the keyboard
>the girl ends up sitting in the chair AT the keyboard and half a person sticks in the table
sometimes z-img is as dumb as SDXL. I don't get it.
>>
>>107809761
I think it can be corralled with a good prompt
https://files.catbox.moe/wyzd80.webm
>>
>>107809922
>sitting on the on
>blaming the model
>>
>>107809943
made that typo here.
>>
File: LTX-2.mp4 (492 KB, 704x704)
>>107809854
tried to prompt a video with your image
> she dies

>>107809875
you might want to start with z-image-turbo
chroma suggestions are in the previous thread and the one before

qwen, flux2 and wan (1 frame for an image) also can do hands quite well

>>107809859
congrats, you certainly could do video now
>>
ok when I put another person IN the chair AND say it's viewed from the side, the girl ends up ON the table...
>>
>>107809977
why does she turn into a horse at the end
>>
>>107810013
i really do not know. it was just "she dies" in the prompt. that's probably how it works? haven't died yet.
>>
>>107809977
holy lmao
>>
>>107809977
>congrats, you certainly could do video now
Where do i start ?
>>
File: 9072534.png (1.77 MB, 1024x1536)
>>
File: 1744162931754915.mp4 (1.73 MB, 1264x720)
>>107809813
>>
File: 1758836420362253.png (2.32 MB, 1408x1152)
>>107810065
hot
wan or ltx2?
>>
Hi guys, I'm running some Wan 2.2 high / low noise model video gens. Image to video. The workflow seems to just ignore the inserted image and create a clip from the prompt only, any help?
>>
>>107810038
for ltx probably with these workflows https://github.com/Lightricks/ComfyUI-LTXVideo/tree/master/example_workflows
or https://github.com/RageCat73/RCWorkflows/blob/main/LTX2-Audio-Input-FP8-Distilled.json for audio-matched singing/talking perhaps, there are also workflows in the templates

wan has a lot of workflows but perhaps start with the basic templates / https://comfyanonymous.github.io/ComfyUI_examples/wan22/ ?


>>107810037
another one, the sound is just silly too
https://litter.catbox.moe/6pqc49695ucazl09.mp4
>>
>>107810073
maybe you are using the wrong checkpoints or lora, hard to tell without seeing the workflow tho
>>
>960x960x240 = 221 million pixels total, fine
>720x720x408 = 211 million pixels total, oom
what gives
>>
>>107810069
WAN still. I haven't had time to setup LTX2 yet.
>>
>>107810092
you're naive, calculations aren't just about pixels, especially when we're relating previous frames to later frames.
>>
the future of animation is LTX2:

https://files.catbox.moe/kj8io2.mp4
>>
>>107810092
are you counting the temporal resolution as pixels? i don't think it's as simple as that, there are probably attention mechanisms that scale worse than linearly that need to attend to the whole video at once
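you can sketch the latent-token math; the compression factors here (8x spatial VAE, 2x2 patchify, 4x temporal) are assumed Wan-like values, not exact, and under them the two runs actually land close in token count, so per-frame activation buffers probably matter as much as raw sequence length:

```python
# Rough latent-token math: memory isn't driven by raw pixels but by token count,
# and full self-attention scales with its square. The 8x spatial VAE, 2x2
# patchify and 4x temporal compression below are assumptions, not exact values.

def latent_tokens(w, h, frames, spatial=8 * 2, temporal=4):
    # spatial: VAE 8x downsample then 2x2 patchify; temporal: 4x frame compression
    return (w // spatial) * (h // spatial) * (frames // temporal + 1)

for w, h, f in [(960, 960, 240), (720, 720, 408)]:
    n = latent_tokens(w, h, f)
    print(f"{w}x{h}x{f}: {w * h * f / 1e6:.0f}M pixels, {n} tokens, "
          f"{n * n / 1e9:.1f}B attention pairs")
```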
>>
>>107810106
holy shit, this iteration it invented new characters.

https://files.catbox.moe/6ggc31.mp4
>>
File: 078245654.png (1.83 MB, 1024x1536)
>>
wan sisters please help. how the fuck do i stop "Diffusion Model Loader KJ" from offloading every other generation? takes about 5 - 8 minutes to load everytime. i can queue up 10 gens and every few gens it will offload the models and i have nothing extra running
>>
File: 1759975299844544.png (2.29 MB, 1152x1312)
>>107810123
nice pussy
>>
does ltx2 work for goon stuff
>>
>>107810127
>he doesnt have enough ram
lmao
>>
>>107810130
lol
lmao
>>
>>107810130
it has made me coom, but I'm a freak who loves roleplay more than explicit stuff
>>
close enough
>>
File: ZImageTurbo-_0031.png (2.95 MB, 1824x1248)
https://files.catbox.moe/5yt6cq.png
>>
File: 1741971397716464.png (2.12 MB, 1632x928)
>>107810166
>supreence
just reroll it bwo
>>
File: Capture.png (1.1 MB, 3463x1707)
>>107810085
I'll post the result when it's done, but I'm not hopeful, I'm not sure why it just ignores the input image.
>>
File: ltx.mp4 (982 KB, 1280x704)
>>107810130
sometimes you can continue from lewd i2v for a bit which might be good enough for some, it can talk dirty, some people's fetishes might be supported, and if you squint hard enough it supports (usually deformed) naked boobs t2v with sufficient attempts

but really: not very well
>>
>>107810196
looks like the high noise model is labelled t2v, not i2v
>>
>>107810132
its doing it every gen now, what are some -- settings to stop this from happening?
>>
>>107810205
probably, I think I just fixed that...
Restarted.
>>
>>107810123
zimage?
>>
>>107810130
wait for nsfw loras
>>
dubs man with some insight:

https://files.catbox.moe/0l09xv.mp4
>>
>>107810232
ya
>>
>>
File: ZImageTurbo-_0037.png (2.55 MB, 2304x960)
https://files.catbox.moe/nd73sa.png
>>
File: ltx_hallucination.mp4 (2.13 MB, 1280x704)
>>107810224
just to be clear, it's not just the filename, you need the i2v model for both high and low

also "r" eloading node definitions should make new models show up without restarting
>>
>>
Is there any point in using a specific qwen 3 4b variant over the normal one for z-image? Maybe allowing less censored output?
If so which one?
>>
>>
>>107810266
I downloaded (I think) all of the models so yeah, it should just be a drop down select for me. Thanks for clarifying though, most people just ASSUME you know! thank you for looking and pointing it out!
>>
>>107810252
thanks
>>
>>
>>107810268
no. you can use various quants PROBABLY with no ill effect most of the time, but it doesn't unlock much

things may change if the nsfw trainings happen against abliterated versions of qwen or w/e, maybe then it starts to matter?
>>
>>107810292
The nose need inpainting, otherwise I like it
>>
>>107810310
OK, I was hoping it would help, but it makes sense.
>>
>>107810315
it's a boogie
>>
>>107810268
isn't qwen just for tokenizing the prompt/image inputs? abliteration wouldn't do anything useful unless qwen was actually used to do any inference. but even then, qwen is actually pretty uncensored by default.
>>
>>107810335
It's the text encoder, yes.
>>
>>107810335
some workflows already use the fact that it's a VL model to also do image-to-prompt at some point

but yes abliteration doesn't even add knowledge on either the image model or the text model, it just stops the retarded refusals
>>
File: file.png (38 KB, 633x386)
wher basuuu
>>
>>107810370
soonâ„¢
>>
Does --highvram work to keep models from offloading?
>>
>>107810198
Wtf bro that looks awful.
>>
Where do i even start with this
>>
>>107810198

https://files.catbox.moe/g4jm4z.mp4
>>
>>107810398
vram_group.add_argument("--highvram", action="store_true", help="By default models will be unloaded to CPU memory after being used. This option keeps them in GPU memory.")
>>
File: 1747606581557314.png (2.15 MB, 2159x1536)
>>
>>107810370
100 more weeks, don't worry :3
>>
>>107810407
>This option keeps them in GPU memory
Sounds good, I'll give that a try. Never had to bother with high low vram settings until this week. Current settings are --windows-standalone-build --use-sage-attention --fast fp16_accumulation --disable-api-nodes
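for the portable build that would look something like this (a sketch based on the stock run script plus the flags quoted above; the path and layout are assumptions, adjust to your install):

```shell
# run_nvidia_gpu.bat equivalent with --highvram added alongside the existing flags
./python_embeded/python.exe -s ComfyUI/main.py --windows-standalone-build --highvram --use-sage-attention --fast fp16_accumulation --disable-api-nodes
```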
>>
>>107808950
Thanks
>>
>>107810205
I think you got me, that output followed the input image
>>
>>
File: ZImageTurbo-_0055.jpg (593 KB, 1248x1824)
>>107810446
Just FYI, I switched prompts to a less retarded one. The 4-step one has a fuckton of upscaler artifacts, the one below has a very persistent camo-like pattern (latent noise artifacts, I think) on the image I can't remove no matter what. So pick your poison

https://files.catbox.moe/8llwr6.png
>>
>>107810598
* I switched the workflow
>>
how the fuck are you prompting someone's groin in zimage?
>groin, crotch, between the legs, pubis
only seem to vaguely address the region
>pussy, vagina etc
often draw genitals over the clothes
>genitals
ignored somehow
how do you prompt someone holding her hand in front of her crotch?
>>
>>107810598
fucking nice
>>
>>107810406
excellent. did you generate the audio externally or is that all LTX?

>>107810527
great, easy fix then
>>
Can LTX2 generate Audio only for a video?
>>
>>107810629
use simplified Chinese translation of the area
lower midriff
>>
>>107810370
i ate it and now my belly is all big and round... mmmmfffgh
>>
this helped me get it running finally
https://www.reddit.com/r/StableDiffusion/comments/1q7klzo/i_followed_this_video_to_get_ltx2_to_work_with/
>>
2000 steps
prolly should bake some more
>>
>>107810912
No one asked, avatarfag.
>>
>>
>>107810921
:P
>>
File: ZImageTurbo-_0089.png (2.42 MB, 2304x960)
>>107810912
Keep at it, loving it
https://files.catbox.moe/6kv4i9.png
>>
LTX2 is such an emotive model desu:

https://files.catbox.moe/gts1j8.mp4
>>
>>107810788
I was wondering the same thing and anon said
>>107800860
>>
>>107810948
thanks
>>
File: ZImageTurbo-_0092.png (1.72 MB, 2304x960)
colors a tad too washed out, like someone is watching the movie with a monitor with too high brightness and low contrast setting. Kinda weird glitch
https://files.catbox.moe/ofgsd0.png
>>
https://files.catbox.moe/x7wfck.mp4
>>
>>
catbox is way too slow for this shit
>>
>>
File: ComfyUI_01316_.jpg (3.61 MB, 2880x1440)
Holy shit I've been wondering for days why my ZiT images suddenly look like noisy fucking garbage after a pull. It's because a commit made ZImage default to fp16 dtype instead of bf16.
There's no option to select bf16 in the default diffusion loader so you have to use the "ModelComputeDtype" node
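the practical difference between the two dtypes is range vs precision, which is easy to demo with stdlib struct (a sketch of the number formats themselves, nothing ComfyUI-specific; bf16 is simulated here by truncating a float32, real conversion rounds):

```python
import struct

def to_fp16(x):
    """Round-trip a float through IEEE half precision (5 exp / 10 mantissa bits)."""
    try:
        return struct.unpack('e', struct.pack('e', x))[0]
    except OverflowError:
        return float('inf')  # magnitude beyond fp16's ~65504 max

def to_bf16(x):
    """Approximate bfloat16 (8 exp / 7 mantissa bits) by truncating a float32."""
    (bits,) = struct.unpack('<I', struct.pack('<f', x))
    return struct.unpack('<f', struct.pack('<I', bits & 0xFFFF0000))[0]

# bf16 keeps float32's exponent range, so big activations stay finite;
# fp16 trades range for mantissa precision and overflows instead.
print(to_fp16(70000.0))  # inf
print(to_bf16(70000.0))  # 69632.0 (coarse but finite)
```

a model trained with bf16 activations can blow past fp16's range, which is consistent with the noise appearing only after the dtype default changed.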
>>
>>107811022
the fix
>>
>>107810770
That was all LTX.
>>
>>107811022
you mean this commit?
https://github.com/Comfy-Org/ComfyUI/pull/11057

I was wondering how to "fix" it. thanks for that
>>
https://files.catbox.moe/ge3e3f.mov
>>
>>107811022
>>107811028

Had the same issue, thanks a lot, man
>>
>>107811052
I believe that's the one yeah
>>
File: 1742326097810428.png (1.15 MB, 1048x992)
>.mov
>>
File: ComfyUI_00012_.png (1.14 MB, 1024x1024)
>>
>>107811052
So, Cumanonymous ruined something on purpose? This is nasty.
>>
>>107810912
The only time I've not needed to bake it for 4,000 steps is when it was of a person and they were already a form of asian. Even then more steps were better than fewer desu.
Really nice gens.
>>
the ltx commits have dried up, is this the final state for it in comfy?
>>
>https://files.catbox.moe/kr3svc.json

To any anon that got this workflow working with the 2K-DC blockwise checkpoint working, can you share your Comfy flags?
It runs but I get a bunch of noise
>>
>>107811083
i clicked it it's fine
>>
File: ZImageTurbo-_0114.png (1.71 MB, 2304x960)
>>107811022
>>107811028
This workflow uses your fix. Thanks once again
https://files.catbox.moe/0275tt.png
>>
>>107811112
use LTX workflows NOT comfys, they are much better >>107810891
Know that I2V is sort of buggy and needs 48 fps to work well. They said they are gonna fix it within a month or two with ltx2.1 then 2.5
>>
>>
>>
nudity is fixed with abliterated gemma btw, the model itself does not seem censored
https://civitai.com/models/2292336/ltx-2-nsfw-text-encoder-gemma-3-12b-abliterated?modelVersionId=2579572
>>
>>107811188
obviously genitals are not detailed, but a lora will fix that fast
>>
File: 00025-4192047237.png (1.42 MB, 1368x856)
>>
File: zimage1.png (1.21 MB, 1536x1024)
>>
cozy bread
>>
>>107811207
hey thats my line you thief
>>
File: image-w1280.jpg (460 KB, 1280x720)
>>107811135
original has more sovl
>>
>>107811231
Whose line is it anyway?
>>
>>107811031 >>107811093
it certainly has potential:
https://litter.catbox.moe/2kuse1bpjgofrolw.mp4
https://litter.catbox.moe/yx2lxondp7xravrc.mp4
>>
>>107811234
Mission improbable: your only son becomes a tranny
>>
File: 00013-848792884.png (1.38 MB, 952x1152)
>>
>>107811254
did you prompt it to be australian
>>
>>107811232
>>
>>107811408
yes. and yes, Indian works too.
>>
>>107811428
Weird I tried australian before but couldn't get it. I'll give it another try
>>
what happens if you include a tag in parenthesis during training? I see boorutageditor has weight options that does this (1.0, 1.1, 1.2 etc..)

for example
1girl, blonde hair, (hair between eyes), blue eyes,
in the txt file. does it make the ai train really hard on 'hair between eyes'? is it a good idea to use that on a tag that is not super common but something you really want the lora to learn? such as trapezius muscles, which is probably something that is not really tagged at all in the general models or in the danbooru tags.
>>
>>107811022
its the same picture
>>
>>107811442
The trainer needs to support that. You need to read the fine details of whatever you're using.
>>
a base model just flew over my house
>>
>>107811455
kohya?
>>
>>107811451
Compressing into jpg to fit into 4mb covers some of the detail loss but just look at the green background and edges of the red circle it's significantly noisier on fp16
>>
>>107811434
it worked most times, wasn't a lucky roll
>>
>>107811463
Yes, Kohya_SS has that option.
>>
>>107811479
what is it called specifically so I know what to look for?
>>
>>107811005
yeah it's annoying, even more when
>click embed
>wait 20s
>another floyd or hitler or trump
all i want are pretty girls fuck that ugly shit
>>
>>107811470
hmmm yeah maybe
i will test this myself when i'm back home
>>
>>107811496
https://github.com/kohya-ss/sd-scripts/pull/336
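that PR adds --weighted_captions, which accepts A1111-style (tag:1.2) syntax inside the caption txt. A toy illustration of how such captions break down — this is a sketch of the syntax, not kohya's actual parser, and the 1.1 default for bare parens is the A1111 convention:

```python
import re

# Matches "(tag:weight)", then bare "(tag)", then plain comma-separated tags.
TAG = re.compile(r'\(([^:()]+):([\d.]+)\)|\(([^()]+)\)|([^,()]+)')

def parse_caption(caption):
    """Illustrative breakdown of a weighted caption into (tag, weight) pairs."""
    out = []
    for m in TAG.finditer(caption):
        if m.group(1):                          # (tag:weight)
            out.append((m.group(1).strip(), float(m.group(2))))
        elif m.group(3):                        # (tag) -> 1.1 by A1111 convention
            out.append((m.group(3).strip(), 1.1))
        elif m.group(4) and m.group(4).strip(): # plain tag -> 1.0
            out.append((m.group(4).strip(), 1.0))
    return out

print(parse_caption("1girl, blonde hair, (hair between eyes), (trapezius:1.2)"))
```

so yes, with the flag enabled a parenthesized tag gets its loss weighted up during training; without the flag the parens are just literal characters in the caption.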
>>
https://ltx.io/model/model-blog/prompting-guide-for-ltx-2
>>
>>107811005
>>107811506
Just look up vidya dump threads on /wsg/ (or /gif/ for spicy stuff), then post links
>>
What kind of system prompt is recommended to properly describe images in natural language?
>>
Abliterated gemma 3 is snake oil then?
>>
>>107811599
no, it for sure makes a difference.
>>
>>107811586
joycaption?
>>
>>107811599
It will give you different results just cause it's a modified weight, ltx won't suddenly become uncensored/nsfw or something lol
>>
>>107811599
Abliteration removes refusals, but it also dumbs down the model, making it hallucinate harder. Usually it means it gets progressively more retarded in chats (especially when ERPing) so I'm not sure how it affects short one-time prompts. There's also this https://huggingface.co/Nabbers1999/gemma-3-12b-it-abliterated-refined-novis/tree/main that's further trained, and another "less censored" version called heretic, but I'm not sure how to make it work with ltx
>>
What is the base noobai model called specifically that you train on? is it just illustrious?
>>
>>107811697
illustrious 0.1 is the base for pretty much everything current
training on noobai eps 0.5 or vpred 1.0 is reasonable though
>>
>>107811701
Why not illustriousv1.1 or v2
>>
>>107811709
They were partially trained on 1.5 megapixel images which was a very bad idea for sdxl finetune
>>
>>107811709
They are failed models with poorly implemented hires training
>>
>>107811648
it will output entirely different things because it's becoming another model
>>
>>107811697
Training on naked noob is fine.
>>
Am I retarded for trying to run chroma-unlocked-v50-annealed on a 9060 xt with 16 gb vram and 32 gb ram? A 512x512 gen took 3 minutes, 768x768 9 minutes, and I'm not even going to try a 1024x1024. Although it's not showing any ram offloading, so it's kind of weird.
I tried a sdnq quant with int8 and it genned a 1024x1024 in 2 minutes, which still isn't exactly fast, but the gen was pretty sharp. Is this just amd/rocm sucking or is chroma really just that heavy?
>>
>>107811701
>>107811736
i'm just using the civit lora trainer atm. I have not yet been able to figure out how to get local training to work on my 5070ti.
what exactly is it called so I can search for it on the civitai list?
I'll move over to the other thread if I have more questions, I realize this isn't relevant to the local thread.
>>
>>107811783
comfy used to have a bug with chroma that made it super slow, it's now as fast as flux, faster with the distill lora. Also use the 2k res versions, not the 512 res ones.
Use this https://huggingface.co/silveroxides/Chroma-Misc-Models/blob/main/Chroma-2K-QC/Chroma-2K-QC-fp8mixed-blockwise.safetensors

With this:
https://github.com/silveroxides/ComfyUI-QuantOps
>>
no base general
>>
>>107811790
https://civitai.com/models/833294?modelVersionId=1190596
if you want to train more than one lora consider spending time on figuring out local training, using civitai when you have a 5070ti is a waste
>>
>>107811804
Alright,thank you, will try
>>
Anyone get the ltx2 vid2vid working? I almost got something decent from a poor quality input video, made up some interesting details, but it shits the bed with motion and falls apart near the end.
>>
>>107811857
I2V is bugged and needs 48 fps, they said 2.1 would work on this
>>
>femanon, look at this cool lora i made of my face
>so you just asked ai to make you muscular with a full head of hair?
>n-no its more than that
>>
>>107811804
I couldn't make it work, just get noisy image.
>>
you now remember that despite being the same size as turbo it will take longer to gen
>>
finally got ltx working
https://files.catbox.moe/67cy3a.mp4
>>
>>107811866
Not using the I2V though? using their V2V: https://github.com/Lightricks/ComfyUI-LTXVideo/blob/master/example_workflows/LTX-2_V2V_Detailer.json
>>
File: file.png (34 KB, 150x182)
>>107811883
did you prompt him to sound like an italian mafioso?
>>
File: vram.jpg (117 KB, 912x604)
>>107811823
thanks.
>using civitai when you have a 5070ti is a waste
there's something critical i'm missing. I can't get the vram usage down in kohya no matter how many setting I turn down.
>>
>>107811892
same difference, the issue is that their temporal compression is too aggressive, more fps lessens the effect of this and fixes the issues during movement. They said they plan to refine the temporal compression more as their focus was speed
>>
>>107811883
Oh thank god. A gen than isn't George Floyd, the CIA agent.
>>
>>107811909
was there an ETA on 2.1?
>>
Why does LTX2 keep giving me Indians
>>
>>107811934
You either get the Indian model or the Chinese model; this is the cost of America outsourcing its tech talent and industrial capacity.
>>
>>107811931
In a month or so. They said they would try for their 2.5 in Q1 which has major improvements.

>>107811934
its biased to india and china unprompted cause most of the world's content is from those places.
>>
>>107811934
Describe the ethnicity explicitly, saar
>>
File: 1750108113249903.png (12 KB, 760x60)
>>107811902
im somewhat of a prompt engineer myself
>>
the kijai sound + image workflow is great.

https://files.catbox.moe/s5c9ke.mp4
>>
LTX-2 lora anons, are you training on the dev model or the distilled model? Tried making a lora on the dev model and it sucks, can't even tell if it's doing anything.
>>
File: sdgsdfasdfsd.png (49 KB, 1106x346)
>>107811931
>>
https://files.catbox.moe/bb38ps.mp4
>>
>>107811904
OneTrainer has default configs for different memory capacities but everyone swears by their own trainer. It's worked well for me. Noob specifically needs a manual edit to the config though.
>>
>>107811950
people made reddit posts. Or id ask people who made some on civitia
>>
>>107811953
Guess I'll stop fighting with it and hope that 2.1 or 2.5 yields something stable. Was actually impressed with the few frames of compression garbage that it managed to make something from.
>>
>>107811959
i'll check it out. could you share your configs?
>>
>Wan is so much better than LTX
>The Wan videos in question are basically static shots of a woman slapping her hips against a dick for 5 seconds.
>>
you are gonna see so many mobile game ads made with this lol
https://files.catbox.moe/z5dz25.webm
>>
>>107811996
I hate how conservative it makes the underwear.
>>
>>107811956
what did you prompt? wow game + camera prompts?
>>
>>107811934
Indian content is huge, it's just basically unknown to the rest of the world because it's not that interesting outside of the occasional bollywood thing.
Just write the ethnicity you want, or do i2v like sane people.
>>
>>107812019
the grass on the hills and trees sway gently in the breeze, a blue 2017 Toyota Prius drives along the dirt road in front of the player, the camera shifts view to follow the car as it drives through the scene. Epic MMORPG music plays. A panicked voices says "Chat! Chat! Are you seeing this?"

Just a lazy prompt and a WoW screenshot.
>>
>>107812044
looks neat, so much variety with the gens.

https://ltx.io/model/model-blog/prompting-guide-for-ltx-2

ltx has a lot of prompting ideas im skimming through this
>>
>>107811996
crazy how the lora mimicked the game's soundtrack perfectly
>>
>>107811804
>fp8
get the fuck out
>>
File: 1765248625110865.png (1.96 MB, 1584x1056)
ltx2 is fun but I havent forgot about you qwen edit 2511:
>>
>>107811996
prompt? the transition was neat
>>
https://files.catbox.moe/7ha8zw.mp4
>>
>>107812112
kino, sometimes i just want to get comfy with a 12 boar asses game
>>
>>107812112
>collect 12 boar asses for king gobblecock
award
1 silver
>>
https://files.catbox.moe/z6zk65.mp4
>>
https://files.catbox.moe/m8y4t0.mp4
>>
>>107812184
lmao
>>
>>107812219
sora 2 could never. Now hopefully 2.5 indeed does make the compression better, still fast but better quality
>>
>>107812112
used a deus ex screenshot:

https://files.catbox.moe/nzgdxp.mp4
>>
>>107810912
Interesting how it looks like modern Tumblr commissions instead of Schiele.
>>
how do you do a camera rotation prompt? are the loras needed?
>>
Is ComfyUI still the best or should I switch to something else?

The other ones still can't do video gens right?
>>
>>107812377
nothing comes close to comfyui but neoforge is probably a good, easier-to-use alternative for newbies
>>
>>107812383
>alternative for newbies
Ooops, I meant "alternative for people that aren't shit-eating cuckolds". imagine being forced to use an interface as bad as cumrag
>>
>>107812399
sure
>>
File: 1739720005183748.jpg (593 KB, 1784x2600)
>>107812377
comfyui sucks only nerds use it
>>
lmao, the loras work well for camera movement. dolly out lora test:

https://files.catbox.moe/e0585a.mp4
>>
>>107812404
what do you use?
>>
If Comfyui is so good, how come there's no comfyui 2?
>>
Any AniStudio news? Ani has been committing new stuff to the main repo, seems big. Why isn't AniStudio in the op anyway? It seems to be pretty competitive nowadays and it actually might be faster than cumfart from what I'm seeing
>>
>>107812464
very organic ani
>>
>>107812464
Buy an ad.
>>
>>107812464
I don't like vibecoded slop
>>
ran meltie
>>
I want to run the new ltx on my 3090. which one do I want, ltx-2-19b-distilled-fp8.safetensors? is fp4 usable? is there some optimized workflow for such setup or is the official one fine (LTX-2_I2V_Distilled_wLora.json)?
>>
>>107812503
Use the FP8, the FP4 doesn't look that good and is optimized for 5000s GPUs where it's twice as fast as FP8.
>>
>>107812503
kijai is working on a better WF with split files:
https://huggingface.co/Kijai/LTXV2_comfy
https://files.catbox.moe/jftiwc.mp4
But until then apparently the LTX ones are better
>>
meeting 1girl irl
https://files.catbox.moe/hvyx5q.mp4
>>
>>107812529
oh and he got GGUFs working btw, far better quality than fp8 imo
>>
>>107812524
>>
>>107812415
forge neo
>>
File: 1752838564918204.jpg (1.43 MB, 1248x1824)
1.43 MB
1.43 MB JPG
>>107812377
Swarmui is the best of both worlds. It's still comfy but there's a very useful base interface on top.
>>
>>107812550
>but there's a very useful base interface on top.
how so
>>
>>107812570
All the base functions you would ever need, including the best inpainting interface I've seen thus far. Built-in regional prompting, seed variation, rembg, you name it.
The best sorting interface and metadata viewer; no need to fuck around with shit like diffusion toolkit.
Can pull model info from civit together with preview images.
Just a very comfy interface, especially for a newfag.
>>
yea, it looks like GGUF is far better than FP8. This is Q6: https://files.catbox.moe/hzov9n.mp4
He says he is doing Q8 as well, which should look much better than fp8
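For picking a quant that fits in VRAM, a back-of-the-envelope size estimate is enough. The bits-per-weight figures below are approximate llama.cpp values (Q8_0 is ~8.5 bpw because of per-block scales, Q6_K ~6.56), and the 19B parameter count comes from the model name; treat the results as ballpark only:

```python
# Rough weight-file / VRAM size for a 19B model at different precisions.
# Bits-per-weight are approximate llama.cpp figures; GGUF quants carry
# per-block scale overhead, which is why Q8_0 is slightly larger than FP8.

BITS_PER_WEIGHT = {"fp16": 16.0, "q8_0": 8.5, "fp8": 8.0, "q6_k": 6.56, "q4_0": 4.5}

def model_gib(n_params: float, quant: str) -> float:
    """Approximate weight size in GiB for n_params parameters."""
    return n_params * BITS_PER_WEIGHT[quant] / 8 / 2**30

for q in ("fp16", "q8_0", "fp8", "q6_k", "q4_0"):
    print(f"{q}: {model_gib(19e9, q):.1f} GiB")
```

On a 24 GB 3090 this is why Q6/Q8 and FP8 are all viable while FP16 is not, before even counting the text encoder, VAE, and activations.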
>>
>>107812541
add tan
>>
>>107812529
>>107812503
Ok, I'm currently getting filtered: https://huggingface.co/Lightricks/LTX-2/tree/main tells me to get gemma-3-12b-it-qat-q4_0-unquantized, but it's gated and there is no reupload. And the text_encoder folder in the LTX HF repo contains weights for a different model. I'm lost
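Gated just means you have to click "accept license" on the repo page once and then download with an authenticated huggingface-cli. A sketch of the command, where the `google/` org prefix and the `--local-dir` target are my assumptions (the command is printed here rather than executed, so run it yourself after logging in):

```shell
# Sketch: fetching the gated Gemma text encoder from Hugging Face.
# Assumptions: the repo sits under the google org, you have accepted the
# license on its page, and you are logged in (huggingface-cli login / HF_TOKEN).
REPO="google/gemma-3-12b-it-qat-q4_0-unquantized"
CMD="huggingface-cli download ${REPO} --local-dir models/text_encoders/gemma-3"
echo "${CMD}"  # printed instead of executed; run it once authenticated
```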
>>
death to subgraphs
>>
>>107812590
The tan is already there, but it's pretty subtle. I hate niggers
>>
>>107812541
unbelievably..shite. welcome to 2026, lmao
>>107812587
show me the inpainting interface please, curious now
t. actual comfy user (with the old interface of course)
>>
>>107812587
How does it compare to Invoke? I tried that but didn't like it. And can it outpaint? Is it easy to upscale images over and over with it?
>>
>>107812601
I don't hate subgraphs, but they are being wildly misused desu.
They should only be used for helper functions, like math for resizing images and things that generally aren't touched but cause a lot of clutter, and anything inside them that might be changed should be exposed in the subgraph's settings. Instead people are putting everything but the fucking prompt window in there.
>>
>>107812541
Very good stuff
>>
>>107812653
Personally I use subgraphs to hide any settings that I don't use day to day. I want everything visible in the main view to be an actionable setting relevant to genning.
>>
lmao the dolly in/out loras are good

dolly in:

https://files.catbox.moe/g1iahr.mp4
>>
https://github.com/Comfy-Org/ComfyUI/pull/11741

latent2rgb previews coming for ltx2
>>
Can you feed in a song and change the lyrics while having it sung the same way?
>>
File: autism2.png (1.01 MB, 1280x720)
1.01 MB
1.01 MB PNG
>>
ok now we are talking.

https://files.catbox.moe/q3s4jp.mp4
>>
where is pussy lora
>>
>>107812685
Why deleted? You're right there's no tan
Gyarus without tan are shite
>>
okay, now we have a winner. dolly zoom in lora used:

https://files.catbox.moe/2g7kh6.mp4
>>
File: ComfyUI_00077_.png (1 MB, 1280x720)
1 MB
1 MB PNG
>>
>>107812730
>>107812730
>>107812730
fresh
>>
>>107812739
Anatomy loras never work well
>>
new thread
>>107812673
>>107812673
>>107812673

>>107812772
duplicate, please remove
>>
>>107812780
>extremely early bake
Sorry that's spamming/flooding
>>
>>107812772
thanks for baking anon
>>107812780
Kill yourself shitposting subhuman
>>
actual thread
>>107812800
>>107812800
>>107812800
>>
File: 1755229201199045.png (173 KB, 809x504)
173 KB
173 KB PNG
>>107812635
>show me the inpainting interface please, curious now
Here you go
Can handle masking and layers, can outpaint, has auto segment functionality.
>>
>>107813072
lmao looks like fucking shit unironically
>>
File: 1766699283673799.png (1.65 MB, 1632x928)
1.65 MB
1.65 MB PNG
moo
>>
File: autism4.png (1.83 MB, 1920x1080)
1.83 MB
1.83 MB PNG
>>107813239
hello there
>>
>>107813252
it's time to play uno?
>>
File: 1757245005541318.png (2.28 MB, 1280x1280)
2.28 MB
2.28 MB PNG
sad life
>>
Has anyone experimented with using the abliterated Gemma 3 with LTX2? I couldn't find a single-file version of it, so I made one from the repo here https://huggingface.co/mlabonne/gemma-3-12b-it-abliterated with a python script chatgpt wrote. However, when I try to use it I get invalid-tokenizer errors, and chatgpt ain't that useful about a solution. Is there a single-file version floating around somewhere that works as a tokenizer for comfyui? My ultimate goal is to use the abliterated version to train a NSFW lora, which seems straightforward, but I assume I also need to use it during inference as well.
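For what it's worth, a merge script like that mostly just walks the repo's `model.safetensors.index.json`: its `weight_map` maps each tensor name to the shard file holding it, and the script loads each shard and writes one combined file (the tokenizer files are separate, which is a common cause of "invalid tokenizer" errors if they aren't copied alongside). A sketch that only builds the shard plan, with toy file/tensor names of my own invention and no safetensors dependency:

```python
import json

# Sketch: HF sharded checkpoints ship model.safetensors.index.json whose
# "weight_map" maps tensor name -> shard file. A merge script groups tensors
# by shard, loads each shard, and writes one combined file; here we only
# build the plan (the tensor I/O itself would need the safetensors library).

def shard_plan(index: dict) -> dict[str, list[str]]:
    """Group tensor names by the shard file that contains them."""
    plan: dict[str, list[str]] = {}
    for tensor, shard in index["weight_map"].items():
        plan.setdefault(shard, []).append(tensor)
    return plan

# Toy index in the same shape as a real one (names are made up):
toy = {"weight_map": {
    "model.embed_tokens.weight": "model-00001-of-00002.safetensors",
    "model.layers.0.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
    "lm_head.weight": "model-00002-of-00002.safetensors",
}}
print(json.dumps(shard_plan(toy), indent=2))
```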
>>
File: 1767379820375934.png (1.98 MB, 1280x1280)
1.98 MB
1.98 MB PNG
>>107813854
don't waste your time, the model hasn't been trained with the ablit model so it won't know the concepts anyway
>>
>>107813854
https://huggingface.co/FusionCow/Gemma-3-12b-Abliterated-LTX2

but what for ATM? It's not like you will get better results


