/g/ - Technology


Thread archived.
You cannot reply anymore.




File: 1759291721152619.jpg (1.41 MB, 2421x2253)
Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106884374

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
When is ostrix gonna add ramtorch to other models?
>>
File: 1757752679819672.jpg (475 KB, 2016x1152)
>>
>>106891139
what happened to the 16 channel vae for sdxl?
>>
>>106891143
best gen in literal months
>>
>>106891150
he has the lodestone syndrome, he can't stay on a single project for too long, once a new shiny thing appears, he jumps from the previous boat to this one
>>
File: ComfyUI.webm (3.06 MB, 1280x682)
New UI is going to be sick.
ComfyUI truly is the best. Nothing else comes close. Such talented individuals involved. I kneel.
>>
>>106891176
is there anything ComfyUI can't do!?
>>
>>106891182
be a good ui is one
>>
>>106891176
lipstick on a pig vibes
>>
tinker tranis out in full force I see
>>
>>106891176
>hides the CPU and memory usage
hmmmmm
>>
>>106891176
I think we should start posting more Comfy news every thread from now on. Everyone uses it, so it's important information pertaining to local diffusion.
>>
>>106891223
is it some sort of humiliation ritual or something? I'd rather not see the impending enshitification fall straight from the asshole
>>
>>106891219
That is not part of core.
>>106891241
Don't reply to it. Anyone using the term "local diffusoin" is a troll
>>
File: 00009-2491276575.png (1.56 MB, 1400x792)
~desu THIS
>>
This will blow your mind... to smithereens.
>>
File: 1716575713105840.jpg (37 KB, 948x699)
>Anyone using the term "local diffusoin" is a troll
>anon misspelled it for some reason
>/ldg/ - Local Diffusion General
>>
nobody calls genning diffusion, but nice try
>>
>>106891277
chat what did they mean by this
are they implying our entire general is a troll
>>
File: 00010-1239134936.png (1.4 MB, 1400x792)
>>106891276
4got my image
>>
File: ComfyUI_05965_.png (1.04 MB, 1192x872)
>>106891277
>>
File: 1741220983371820.mp4 (1.83 MB, 480x704)
so I set the 2.2 lora to 3 strength and the 2.1 lora to 3 strength and got chaos.
>>
>>106891337
damn, poor George got turned into a pile of ashes kek
>>
>>106884426
https://files.catbox.moe/dkr9yn.mp4
>>
File: 1729373224341429.mp4 (2.81 MB, 480x704)
>>106891337
6 strength for both!

okay, too much.
>>
if all my .safetensors files are saved on an NTFS drive, can I access them via Linux and still gen at the same speed? Or will it be shot and slow as all hell
>>
File: 1750946545631845.mp4 (1.68 MB, 480x704)
>>106891373
1 strength high (2.2), 1 low:

the anime girl picks up the black man on the floor and throws him into the sky.
>>
>>106891366
He was begging for it with that gen lel
>>
File: ComfyUI_0216.jpg (2.8 MB, 1664x2432)
>>
File: ComfyUI_05967_.png (1012 KB, 1176x880)
>>
>>106891448
very aesthetic
>>
>>106891447
What is it? Doesn't load for me.
>>
>>106891493
Catbox is down out of nowhere, will probably be back soon. But I bet you can imagine what it is
>>
update comfyui to insta cancel video gens
>>
>>106891554
we can thank that anon that shared the insta cancel custom node, it forced comfy to add it officially as well (even though it's a basic feature that should've been added years ago but heh... that's another story)
>>
File: 1740588384187609.mp4 (527 KB, 640x640)
>>
>>106891575

Does that fox asshole lurk here still? T-thanks, I guess.
>>
>>106891453
missing a finger.
>>
>>106891606
retard. fuck off
>>
File: homer demands.jpg (61 KB, 1157x949)
>>106891606
wat?
>>
repost.

AAAAHAHAHAHAHAHAHAHAH

https://github.com/leejet/stable-diffusion.cpp/issues/396

>closed

>not fixed

stable-diffusion.cpp does NOT use clip_l.

>>106891601
lmao great acting.
>>
>>106891631
Yeah, there it is again, the fingers, man. Freaky.
>>
This means I can't use stable-diffusion.cpp. I rely on being able to prooompt clip_l
>>
File: 1749783543459858.mp4 (758 KB, 640x640)
the anime girl fires her pistol, causing the man on the right to fall down. she waves hello.

warning shot! get out sam.
>>
AHHHHHH I HATE THE ANTICHRIST (comfyui)
>>
>>106891670
did you see his interview with Tucker? the dude looks emaciated, sleep-deprived and seriously disturbed. I don't touch chatgpt
>>
File: 1655783628694.jpg (375 KB, 1248x1868)
What is the proper way of doing video continuation without being an independently glued i2v mess with no proper continuity?
My mind is ripe with degenerate goon ideas for genning multi-staged stuff where each section of the video has its own prompt and lora. It has potential to make me abandon real porn forever but its sucking ass the way I'm doing on wan2gp
>>
File: 1741557127685841.mp4 (874 KB, 640x640)
>>106891670
>>
>>106891730
it's obvious the guy is a sociopath, but I think all big CEOs are, you have to sell your soul to get to that position, and some are more than ok to do that, since they didn't have much soul to begin with
>>
File: 1756887181787576.mp4 (879 KB, 640x640)
miku kick!
>>
>>106891782
That's that Mike guy with the blue hair.
>>
File: 00012-2107503266.png (1.65 MB, 1400x792)
[high and squeaky voice]
>"I'm here to kick ass and fuck bitches!"
>>
>>106891176
I don't understand? I don't see anything that's actually different in terms of ease of use.

Will the inpaint workflow feature have canvas functionality with layers and bounding boxes like Krita or InvokeAI? Or can I use multiple adetailers more intuitively, or multiple ControlNets without bloating with nodes and wires?

Or is this only good news for people who spam MikuTests, Radiance shills, or those who just gen "1girl, crunching, pointing at viewer" Netaslop? So they can have a prettier, more attractive UI to press the generate button and spam more?
>>
AHHHHH FORGIVE ME FATHER FOR I HAVE INSTALLED COMFYUI
>>
File: 1760257662320702.mp4 (735 KB, 640x640)
the man holds up a game box saying "SKYRIM 2" with a knight in armor holding a sword on the cover, and smiles.
>>
>>106891176
>log out
Huh?
>>
File: 1732025024478559.mp4 (575 KB, 640x640)
>>106891840
>>
>>106891554
and various enhancements to amd gpus
>>
Wansisters, do we finally have long video? Safetensors loras seem to be already available. I can't try them now, gotta go work

https://github.com/vita-epfl/Stable-Video-Infinity
https://huggingface.co/vita-video-gen/svi-model/tree/main/version-1.0
>>
>>106891647
simpsons never had 5 fingers
>>
File: ComfyUI_07249_.png (2.25 MB, 1152x1152)
>>106889520
The only one who needs to be in charge of every model ever created is Lodestone. Literally just takes one guy with limited compute to destroy every image model out there, and he did that on a dated Flux architecture with censorship on top of that. Imagine if you gave him unlimited compute like some of these huge corps have, we'd have uncensored video models that would make us laugh at Sora.
>>
>>106891915
>gotta go work
fuck off sphere earth shill
>>
>>106891919
Sounds demonic.

>>106891923
He's about to fall lol
>>
>>106891636
>>closed
you can see the merge before the close
>>
>>106891923
lol is this a bait or something? there's no way someone can glaze lodememe that much right?
>>
>>106891935
it was created by a freemason
>>
>>106891820
none of that.
>>
>>106891939
psst

(it's him)
>>
File: ComfyUI_07254_.png (2.34 MB, 1152x1152)
>>106891923
Now that the dust has settled, why did Chroma get so much hate? There is literally no better photoreal model for the details. It's like every hating Plebbitor was acting like those SDXL photoreal models were SOTA or could follow prompts or something.
>>
>>106891926
Another attention trip seeker to block, thanks
>>
The retards at comfyui made an installer that doesn't actually work with amd.

How are they so retarded? They have amd in the installer, but it doesn't work.
>>
>>106892025
this. it doesn't work. They are retards.

pip install comfy-cli
comfy install
>>
>>106892025
>>106892033
lmfao
>>
File: ComfyUI_03341_.png (1.72 MB, 1024x1024)
>>106891923
>>106891990
>why did Chroma get so much hate
People who need/want it to fail are upsetti spaghetti
>>
File: 1730750675935105.png (783 KB, 1176x888)
the anime girl is sitting at a computer desk with a white CRT monitor.

qwen edit + osaka
>>
>>106892053
Ever try it?
>>
File: 1735553909920568.png (752 KB, 1176x888)
>>106892085
the anime girl is sitting at a computer desk with a white CRT monitor. keep her expression the same.
>>
I would like a straight diffuser. I'm not a homosexual.
>>
>>106891990
I don't hate it and think it's the best NSFW realism model, but it's not without its flaws and they might've been avoidable if the furry had listened to what some of the more knowledgeable guys were saying.
>>
Reinstalling torch. This will work >:|

>>106892123
Chroma is easily the best painterly paintings generator yet.
>>
>>106891915
>unique prompt for multi minutes video
Am I reading that right, it's one prompt for like 8min Tom & Jerry cartoon?

>wan i2v
Good, tired of everything made for t2v only.

>wan 2.1
Hope it works with 2.2.
>>
File: 1753086747002206.png (1.12 MB, 1360x768)
change the text in the white box to "what the fuck happened to Pokemon, man?"
>>
>>106892164
>what the fuck happened to Pokemon, man?
they make turds and their games still sell really well, you have to blame the consoomers, they're the ones tolerating this, they vote with their wallet and by buying those games en masse they send the message that it's all right
>>
File: 1730558303408641.png (1 MB, 1360x768)
>>106892164
change the text in the white box to "We make any shit and it sells, that's why the games are bad.". change the location to a grass field.

kek
>>
>>106892164
Why is she dressed like a hooker?
>>
>>106892182
How's life in Afghanistan?
>>
File: 1741114155205091.png (1020 KB, 1360x768)
change the text in the white box to "The devs are really bad, they never even try to improve.". change the location to an office in japan.

holy shit they should just use AI to make their backgrounds.
>>
pip is stupid. It's redownloading the same file.
>>
File: jay-yuipeace-2828841999.jpg (465 KB, 1920x1440)
> mfw he's been doing this for a month straight
> not even creative prompts
> literally the same slop every thread
> "guys look what qwen can do"
> yeah we know, you've posted it 500 times
> realize he's just shilling qwen edit
>probably works for alibaba
Thanks /ldg/ containment thread for corpo ads.
>>
File: ComfyUI_05985_.png (1.2 MB, 1024x1024)
>>
>>106892187
Fertile. How's your "life", in terms of tfr?
>>
>>106892226
use uv
>>
>>106892234
I don't know what tfr is, but I'm pretty happy.
>>
>>106892227
god forbid people experiment and have fun with local models in a local model general
>>
>>106892227
Missing the other two musketeers, the Radiance and Neta shill and we have the bingo!
>>
>>106892241
I'll ask your grandchildren.
>>
>>106892227
qwen image edit, huh? That sounds like hot shit.
>>
>>106892247
there's experimenting and there's spamming 20 times slight variations of the original image, unfortunately for turboautists like you, you can't see the difference, and I can't blame you, you are born with that brain you can't unwire that shit
>>
I prefer someone shilling a local model they like with gens to complaining about local models. Thread space is unlimited. You have a scroll wheel. Use it. Comment on gens you do like.
>>
>>106892257
Sure.
>>
>>106892272
>I don't care about the quality of the posts
this mindset is what made /sdg/ what it is today btw
>>
>>106892269
it's better than the nth drama about meta stuff, like we are doing right now, or the same schizo rant about comfyui eating babies and torturing kittens
>>
>>106892283
Or, I have a different metric of "quality" than you do. As in, my metric of quality is not "things I personally like" and is instead "furthers a discussion on local models in general." You are not the quality police. You are not a mod. You are an anon, the same as everyone else, and I bet you there are anons who think your gens are also not High Quality. Learn to play nice.
>>
File: 1745015123787572.png (673 KB, 1024x1024)
my doro is augmented.
>>
>>106892299
>furthers a discussion on local models in general.
oh yeah, spamming the same image 20 times sure furthers a discussion on local models in general

Take your (You) kind sir, you deserved it
>>
File: test.webm (1.15 MB, 768x1202)
I am... disappointed so far. The older lightx2v seems to be better. Maybe need to balance strengths and find a sweet spot. Needs more testing.
>>
>>106892307
>yeah, spamming the same image 20 times sure furthers a discussion on local models in general
Significantly more than any of your posts in this conversation, yes.
>>
>>106892311
hard to say how 0 can beat 0 but go on king
>>
>>106892310
You're correct in terms of which animation is actually better, but that lower punch is genuinely hilarious.
>>
>>106892310
yeah, it's definitely a bust, I'm really disappointed
>>
>>106892319
he only used 0.0001% of his power on it, it was enough.
>>
>>106892326
that's one punch man, if he went for more power her head would've left her body
>>
>>106891907
:O
>>
File: 1753899281818322.png (1.03 MB, 1024x1024)
>>106892301
>>
File: ComfyUI_07275_.png (1.68 MB, 1152x1152)
>>106892123
Don't get me wrong, it's not perfect, but neither is any other model we have. This is perfectly fine for cooming even with its imperfections. Well, I guess a lot of people can't really infer it at decent speeds to get different seeds which is why they complain about issues.
>>
File: QwenEdit_00058_.png (894 KB, 1024x1024)
>>106891277
>>
>>106892273
They said they don't exist lmao

>life

hue hue hue
>>
>>106892310
what strengths on the high and low for the older lora?
>>
>comfyui is 2x as fast as stable-diffusion.cpp
uh
>>
Anyone with a 5090 can try 4/4 steps wan in 720p/129 frames and time it?
I can gen in 11-12min using res2m.
>>
>>106892339
I think what probably set people off is the perceived lost potential. In a way, it's kind of like the reaction to GPT-5, where the hype of what it could be made the reveal of what it actually was that much more disappointing. Obviously OpenAI continued to shoot themselves in the foot after that, but if they hadn't promised so much ("AGI is Here!") then there would be less backlash. Likewise, as the previous anon said, there were multiple moments where choice A vs B made Chroma less than what it could be.

Personally, I loved the aesthetics, even if as a ramlet it took me ages to make an image. But if that image is completely schizo, I can't really do much but play seed gacha or feed it to IPAdapter/i2i.
>>
File: 1730278510526953.png (1018 KB, 1024x1024)
>>106892333
>>
>>106892365
High 3
Low 1
>>
what causes the first frame in wan to be super blurry (then the rest is fine) in i2v?
>>
>>106892372
>I think they finally moved on from transformers
if google has found another better architecture, we're soo back, Sora 2 at home soon babyyyyyyyy
>>
>>106892375
I thought vulkan would be worse. that's good news
>>
>>106892375
false hope. bad wf.
>>
>>106892332
>>106891907
>update for amd enhancement
>s/it increases
now this is very cool
>>
>>106892372
can i finally make my indie game now as a no-coder?
>>
>>
>>106892428
rip
>>
File: 1757522883709141.png (1.11 MB, 1024x1024)
the man is wearing the blue hat

pretty cool
>>
File: ComfyUI_00001_.png (1.63 MB, 1024x1024)
>>106892421
Or... :O

it really is 2x as fast???

11 seconds / it with res multistep 1024x1024, cfg >1

Normally this would be 20 seconds, I think...

welp
>>
>>106892458
is there an official comfyui wf for qwen image edit?

I AM NOT GAY
>>
>>106892479
yes there is even a 2509 specific one just search qwen in templates
>>
File: ComfyUI_07295_.png (2.31 MB, 1152x1152)
>>106892070
I guess some people just can't have fun.

>>106892384
Makes sense as well. However, the model continues to exceed my expectations with what it can do. Though someone with different needs might see it differently. Hopefully the bigASP dev could further refine the model.
>>
>>106891990
It really is capable of producing top tier results, but you really have to investigate it and learn its quirks, otherwise it happily spits out slop, not to mention the best version (2k) was dumped on Civitai with no elaboration. It's a PR problem
>>
>>106892508
If only we could prompt things like deformed hands like that.
>>
>>106892508
>Hopefully the bigASP dev could further refine the model.
Every model is kinda bad when it first comes out, right? I'm hoping this is a more "SDXL" situation than an "SD2/3" situation. You can clearly see tons of potential when it works right.
>>
>>106892510
2k???

also
>>
>>106892508
>pircel
I guess some people just can't have standards.
>>
File: ComfyUI_00002_.png (1.19 MB, 1024x1024)
>>106892528
lmao this wf
>>
>>106892539
I'm naming this style "slopcore"
>>
>>106892310
you are using the older lora for both low and high?
>>
>>106892553
Yes.
>>
File: ComfyUI_00003_.png (1.44 MB, 1024x1024)
>>106892539
Euler being nice and normal.
>>
fwiw, I don't think stable-diffusion.cpp needs to support clip_l, since Chroma doesn't use it. But it should change its name to Chroma.cpp
>>
File: ComfyUI_05968_.png (1.18 MB, 872x1192)
>>
>>106892572
https://github.com/leejet/stable-diffusion.cpp/pull/397
fixed it for u
>>
>https://zhuang2002.github.io/FlashVSR/
Non snake oil super resolution is here.
>>
File: ComfyUI_00004_.png (1.42 MB, 1024x1024)
>>106892567
So Chemo never learned how to do middle fingers?
>>
File: ComfyUI_05970_.png (1.29 MB, 880x1184)
>>
what scheduler should i be using with chroma flash? currently using simple
>>
>>106892580
kek
>>
>>106892595
Now you just have to fix Chemo to support clip_l again.
>>
>>106892620
I took chemo for u but now i dying :(
>>
>>106892609
I'm using beta...

>>106892608
he passes, tbqfwy
>>
File: ComfyUI_05963_.png (1.24 MB, 1016x1032)
>>
>>106892625
Can I have your computer when you die?
>>
File: ComfyUI_05981_.png (1.24 MB, 1024x1016)
>>
File: ComfyUI_00005_.png (1.41 MB, 1024x1024)
>>106892602
1girls rise again
>>
>>
File: ComfyUI_05980_.png (1.17 MB, 1232x840)
>>
File: 1745846293951469.png (1.25 MB, 952x1096)
the character in image1 is wearing the outfit of the anime girl in image2. keep their expression the same.

neat
>>
File: QwenEdit_00062_.png (1.67 MB, 1024x1024)
>>106892508
>>
File: ComfyUI_07300_.png (1.65 MB, 1152x1152)
>>106892510
>not to mention the best version (2k) was dumped on Civitai with no elaboration.

Eh? HD Flash is still best for me. But I guess there is a slight learning curve.
>>
>>106892664
put on some socks
>>
File: ComfyUI_05991_.png (1.35 MB, 1224x848)
>>
File: ComfyUI_00006_.png (1.34 MB, 1024x1024)
>>106892650
>>
File deleted.
>>
File: wan22___0114.png (1.61 MB, 720x1280)
chroma is the best realism model?
>>
File: ComfyUI_05993_.png (1.32 MB, 760x1360)
>>
File: 1743623665396585.png (1.22 MB, 1080x906)
>>106892687
chroma pre v30 is the best realism model
>>
File: ComfyUI_00007_.png (1.34 MB, 1024x1024)
>>106892679
>>
File: QwenEdit_00063_.png (1.6 MB, 1024x1024)
>>106892667
>>
File: ComfyUI_07302_.png (1.96 MB, 1152x1152)
>>106892692
I already told you anon, this is a skill issue. Chroma HD is the best Chroma model, and HD Flash best refinement of that (albeit losing some style variety, but it might be my shit prompting).
>>
>>106892692
Are there non-quant ggufs of those?
>>
>>106892704
thanks
>>
File: wan22.png (15 KB, 468x233)
>>106892146
Fuck knows, says 2.2 is todo. Looks like 4 loras? I can't try it out until later, some brave anon will have to test them, surprised no one else is talking about it

>>106892580
lmao, gold
>>
>>106892710
and I already told you anon, your brain is a skill issue, you can't admit that lodestone has slopped the model when he decided to make it work for fewer steps starting from v30, but one day you'll get out of this cult, I believe it, I don't think you're that dumb
>>
File: radiance.png (3.18 MB, 864x1488)
>>106892690
doing this in 3d turned out pretty nice
>>
File: ComfyUI_00009_.png (1.52 MB, 1024x1024)
Yeah, I don't think we can expect any more big speedups for my 6950xt. I'm at max watt utilization
>>
File: wan22___0119.png (1.69 MB, 832x1216)
>>106892710
i think i agree with you anon. nothing better than chroma HD
>>
File: smoking_sanna.png (1.24 MB, 888x1176)
sanna, no!
>>
>>106892710
>Chroma HD
using
Chroma1-HD-dc-fl-BF16.gguf

works with cfg=1
>>
File: ComfyUI_00010_.png (1.34 MB, 1024x1024)
>>106892776
>>
File: 00187-294463782.png (1.73 MB, 1824x1248)
>>
File: l0l.png (998 KB, 1352x768)
Kek
>>
>>106892801
air niggas
>>
File: flux_0002.png (1.44 MB, 832x1216)
i think i also agree, that chroma hd is just clearly the only good local model for realism
>>
File: ComfyUI_06004_.png (1.19 MB, 856x1216)
>>106892822
lol
>>
File: radiance.png (3.35 MB, 864x1488)
>>106892301
nice augmented doro

>>106892601
samples look pretty good, yes. did you use it on your own videos yet?
>>
File: qwen___0001.png (1.32 MB, 832x1216)
we talking about realism? oh chroma HD is the only one worth even trying
>>
File: definitely not a cult.png (836 KB, 862x575)
>>106892710
>Chroma HD is the best Chroma mode
>>106892783
>nothing better than chroma HD
>>106892860
>chroma HD is the only one worth even trying
>>
File: flux.krea___0001.png (1.51 MB, 832x1216)
yeah i wouldn't even dream about using anything other than Chroma HD (tm) for generating realism with local diffusion
>>
File: ComfyUI_07306_.png (1.75 MB, 1152x1152)
>>106892664
Not bad, Qwen Edit is getting close.

>>106892783
Wan gets coherence better, but y'know anon, Chroma can do some cool stuff.
>>
File: test2.webm (3.68 MB, 768x1202)
Idle
Attack 1
Attack 2
Run
Guard
Evade
Taking Damage
At Low HP
Incapacitated
>Triumph
Flourish

The new LoRa seems to do occlusion better and has higher object consistency at the expense of fast movements. Increasing strength beyond 1 on high noise introduces weird fogging, so sadly balancing high/low isn't an option.
>>
>>106892882
what about adding one first step without the light lora on high?
>>
File: 1759783911944347.mp4 (970 KB, 704x480)
>>106892882
I tried the 2.2 kijai lora on high and low, 1 str for both:

the man holding the bag jumps out the window of the office.
>>
File: chroma___0001.png (1.35 MB, 832x1216)
>>106892878
my point (if you can't tell from the sarcasm) is that it very clearly is not the best model.

>>106892872
this is the only actual chroma gen i'm posting and it only really works as security cam footage
>>
>>106892899
next ill try with rCM low. pretty sure 2.2 high isnt meant to be used with low, but it somewhat works.
>>
would a 3070 8gb be trash for this stuff?
I just remembered I have one laying around.
>>
>>106892907
you can use sdxl, not much else
>>
>>106891493
>>106891447
>>106884426

>>106891366 up again
>>
File: 1747337695988321.mp4 (831 KB, 704x480)
>>106892906
and with rCM low

yep, this is a winning combo.
>>
File: radiance.png (2.42 MB, 864x1488)
>>106892907
it's not ideal but it'll work for most stuff

very slow for most videogen but you can even do that (idk, run some while you do other stuff)
>>
new wan lightx2v high lora weight 1
rcm wan low lora weight ???
>>
File: radiance.png (2.87 MB, 864x1488)
>>106892915
you can use almost all models if you can trade system RAM

it'll raise your imagegen times to other people's times to do 5s WAN videos or w/e but it can work pretty well
>>
File: 1754298303888091.mp4 (723 KB, 704x480)
>>106892919
>ACK
>>
>>106892940
beautiful dancer
>>
>>106892924
1 and 1 seems fine
>>
>>106892922
>>106892940
could I run qwen edit with it?
I'm curious how it would compare to my 16gb amd card
>>
File: 1742826753583288.mp4 (930 KB, 704x480)
>>106892946
a nuclear explosion hits the buildings in the background outside the office. the characters fall on the ground.

cinematic.
>>
i still get slow motion with the new high lora, like at the end of this vid
or maybe it's not slow and it's just me?
>>
>>106892954
thanks anon
>>
>>106892940
working a few days until you can buy a decent card would be a much better use of time
>>
File: ComfyUI_06008_.png (1.25 MB, 1176x888)
>>
File: 1756968881757336.mp4 (680 KB, 704x480)
the man in the blue shirt walks to the computer on the left, sits in the chair, and starts typing.

neat, it worked.

setup: 2.2 kijai lora (from today), rCM lora from kijai (same huggingface), 1 str for both, 6 steps (3/3).
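
For anyone wiring this up, here is a rough sketch of how a "6 steps (3/3)" split could map onto two sampler passes. It assumes the usual two-stage Wan 2.2 layout where the high-noise model runs the first steps and the low-noise model finishes the rest. The name `split_steps` is made up for illustration, and the `(start, end)` pairs are only meant to resemble the start/end step inputs of an advanced sampler node, so check them against your own workflow.

```python
def split_steps(total_steps, high_steps):
    """Split a step budget between a high-noise pass and a low-noise pass.

    Returns (start, end) step ranges for each pass. Hypothetical helper,
    not part of ComfyUI or any Wan tooling.
    """
    if not 0 <= high_steps <= total_steps:
        raise ValueError("high_steps must be between 0 and total_steps")
    return {
        "high": (0, high_steps),           # high-noise model handles the first steps
        "low": (high_steps, total_steps),  # low-noise model finishes the rest
    }


if __name__ == "__main__":
    # "6 steps (3/3)" from the post above
    print(split_steps(6, 3))
```

The point is just that both passes share one step schedule: the second pass starts where the first one stopped instead of restarting from step 0.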
>>
File: ComfyUI_07307_.png (2.32 MB, 1152x1152)
>>106892905
We can both agree that Chroma is not perfect, but none of the models you posted are uncensored, nor is their photorealism anywhere near Chroma.
>>
>>106892999
use image2 node with a meme image and say "change the faces in image1 to the appearance of image2"
>>
File: ComfyUI_06015_.png (1.1 MB, 1032x1008)
>>
File: radiance.png (2.85 MB, 864x1488)
>>106892951
yea. chroma radiance absolutely has good 1girl aesthetics - though of course it still has flaws it shows why everyone should train the boorus.

>>106892960
i haven't practically tried it with your hardware, but i believe so.

e.g. use the gguf q8 quant with the comfyui-multigpu *gguf*distorch2* model loader where you can tell the node to offload 13GB or whatever is exactly needed to system RAM.
or even just use a smaller quant.
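
A back-of-the-envelope sketch for the "13GB or whatever is exactly needed" part. Everything here is an assumption: the helper name `estimate_offload_gb` is hypothetical, the 20 GB model size and 1 GB reserve are illustrative numbers only, and real usage also depends on resolution, text encoder and VAE.

```python
def estimate_offload_gb(model_gb, vram_gb, reserve_gb=1.0):
    """Rough amount of model weights (in GB) that would not fit in VRAM.

    model_gb   : size of the quantized model file (assumed, check your file)
    vram_gb    : total VRAM on the card
    reserve_gb : VRAM kept free for activations and the desktop (a guess)
    """
    usable = max(0.0, vram_gb - reserve_gb)
    return max(0.0, model_gb - usable)


if __name__ == "__main__":
    # Illustrative numbers: a ~20 GB quant on an 8 GB card with 1 GB held back
    print(estimate_offload_gb(20.0, 8.0, 1.0))
```

If the result is 0 the model should fit without offloading; otherwise treat it as a starting value and nudge it up if you still hit out-of-memory errors.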
>>
Undressing lora for wan:
https://civitai.com/models/2044832/wan2-undressing?modelVersionId=2314347

Get it before it gets banned.
>>
File: soymall.png (565 KB, 3145x2141)
>>106893013
I think it just struggles with all the characters and mental illness involved. Picrel original if you want to have a shot at it.
>>
File: 00200-1275829035.png (1.91 MB, 1248x1824)
>>
>>
>>106892823
we. wuz
>>
>>106892831
saved lol
>>
File: ComfyUI_00616_.png (1.42 MB, 1024x1024)
My setup was working fine. Then I installed ComfyUI_Comfyroll_CustomNodes. Now startup doesn't finish. Comfy opens a blank screen, and the loading circle rotates endlessly.

The log doesn't say anything out of the ordinary, and does not give an import failed error.

As soon as I delete the ComfyUI_Comfyroll_CustomNodes folder, my setup goes back to working.

Any ideas?
>>
File: 1759455098505614.mp4 (1.35 MB, 704x480)
oh shit.
>>
File: QwenEdit_00069_.png (1.13 MB, 1016x1024)
>>
>>106892918
Thanks anon. Looks nice.
>>
File: qwen___0002.png (1.35 MB, 832x1216)
>>106893009
you got me there. no way to generate nsfw content with any of these models.

https://files.catbox.moe/27k6tn.png
https://files.catbox.moe/2ce3mq.png
https://files.catbox.moe/7ovjsk.png
>>
>>106893067
Ensure ComfyRoll does not have additional install requirements on its github page. If it does, follow the additional instructions. If it doesn't, try switching versions.
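
A quick way to do the first check, as a sketch only: it assumes the node pack follows the common convention of shipping a `requirements.txt` in its folder (not every pack does), and the helper name `list_requirements` and the example path are made up.

```python
from pathlib import Path


def list_requirements(node_dir):
    """Return the non-comment lines of <node_dir>/requirements.txt, or [] if absent."""
    req_file = Path(node_dir) / "requirements.txt"
    if not req_file.is_file():
        return []
    entries = []
    for raw in req_file.read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        # skip blank lines and comments
        if line and not line.startswith("#"):
            entries.append(line)
    return entries


if __name__ == "__main__":
    # Hypothetical path; point it at your own custom_nodes folder
    for entry in list_requirements("custom_nodes/ComfyUI_Comfyroll_CustomNodes"):
        print(entry)
```

Anything it prints is a package the node pack expects to find in the same Python environment ComfyUI runs in. An empty result only means no `requirements.txt` was found, so the github page is still worth reading.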
>>
>>106893106
>ComfyRoll
what is it?
>>
>>106893131
nigger get some friends
>>
File: radiance.png (3.42 MB, 864x1488)
>>106892983
just live twice as good as the average one wage / one housewife household in the 1900s and take the other 8x+ per capita productivity gains for gpu, it's that easy
>>
>>106893135
I have made a lot of progress in this area. I'm trying to choose a quant.
>>
>>106893131
bloat nodes
>>
File: ComfyUI_00015_.png (1.21 MB, 1024x1024)
>>106892801
>>
File: 1745729379389557.mp4 (1.08 MB, 640x480)
the white car drifts around a corner in Tokyo, creating lots of smoke on the tires.
>>
File: radiance.png (2.37 MB, 864x1488)
>>
File: radiance.png (3.42 MB, 864x1488)
>>106893157
doesn't drift hard, but doros hard
>>
File: radiance.png (3.05 MB, 864x1488)
>>
File: test3.webm (3.82 MB, 768x1202)
The new LoRas are cucked with less gore. I think I'll conclude my personal testing. It's not worth it.
>>
>>106891606
kek
>>
File: ComfyUI_00018_.png (1.36 MB, 1024x1024)
I'll fix the hands in post
>>
File: 00010-1537513496.png (2.03 MB, 1248x1824)
>>
File: 1751136964295552.webm (589 KB, 736x560)
me when the project page for new thing doesn't have a comfyui workflow json on it
>>
>>106893281
>comfyui
stay asleep
>>
File: chroma___0012.png (1.43 MB, 832x1216)
>>106893009
i will concede that if you fuck around with chroma for long enough (and add loras) it can be really creative
>>
File: RaMu ZoOm.webm (3.92 MB, 1280x852)
>>106892990
This is gonna be the Simpsons in 30 years.

>>106893211
Have you tried adjusting the strength to match the old one? A little strength seems to go a long way with the new one, it might actually add that blood splash back in.
>>
File: ComfyUI_01644_.png (1010 KB, 904x1144)
qwen image edit...
>>
I'm trying to gen on Fedora KDE with my 10GB of VRAM, Qwen is crashing on VAEdecode every time. It was working fine on Win10. What do
>>
>>106893507
>Qwen is crashing on VAEdecode every time.
go for VAE Decode (Tiled)
>>
>>106893517
does that take a lot longer? im thinking it might be wayland causing GPU conflicts. Grok suggests I should use an older nvidia driver for the 30-series. Sigh
>>
>>106893527
>does that take a lot longer?
nah it's not that much longer
>>
>>106893507
>i'm trying anything on Fedora
found your issue
>>
>>106893556
wtf am i supposed to do then? I am not making a windows account
>>
>>106893563
Why would you do that?
>>
>>106893586
Now put a silly hat on her
>>
>>106891820
>"1girl, crunching, pointing at viewer" Netaslop
thats my jam
>>
Apparently Grok Imagine videogen can be jailbroken, as I have seen people making undress videos of celebs through I2V by only prompting for pasties or censor bars
Not gonna lie, the motion and consistency from that shit is better than what we currently got with Wan. Imagine the possibilities if it was open...
I wonder if Wan 2.5 (which is uncertain if it will be open-sourced) is comparable to it
>>
>>106893563
>I am not making a windows account
you dont have to, you dumb fucking monkey.
>>
>>106893721
you will eventually
>>
>>106893740
>you will eventually
i assume so as well, but you dont have to right now. just keep a partition with windows so you can gen if you cant figure it out on linux.
>>
Can I take a group of nodes and cosmetically turn them into a single node to clean up the workflow?
>>
>>106893863
Ctrl+left click on the nodes you want to merge, right-click on one of them and select [Convert to Group Node].
>>
>>106893740
you will with that attitude
>>
>>106893884
Oh wow, it retains all the parameters, very useful. Is it possible to just make it cosmetic to save space/make it less chaotic for a new user?
>>
Well I got Qwen to work on Linux but only after changing desktop settings to 1080p/60hz. Sad
>>
File: 20250818_082304.png (195 KB, 478x500)
Been using sd.webui for a bit now and it does all that I need.
I'm kinda stuck on what to do when it freezes. It'll occasionally do that and I can't seem to fix it besides closing out the CMD window and hitting run.bat again.

Refreshing just puts any new gens "In Queue" but it'll never finish.
Hitting "Restart WebUI" turns off the WebUI but it doesn't restart...

The main issue is I access WebUI remotely so when it freezes I have to screenshare to my PC to do all this
>>
>>106891601
Pool's Closed
>>
File: QwenEdit_00071_.png (1.14 MB, 1160x896)
babe open wide its time for your slop
>>
File: ComfyUI_00004_.png (2.59 MB, 1696x1296)
Welp I found a permanent fix: add --cpu-vae so the CPU takes care of business instead. I won't have to go back to Windows after all. Such relief
>>
>>106893920
Click on the grey dot/circle next to the node title, it will minimize the node and make it as small as possible.
>>
>>106891915
Needs kjboss integration into comfy, looks promising though
>>
>>106894023
These seem to already be safetensors, they would need further converting? https://huggingface.co/vita-video-gen/svi-model/tree/main/version-1.0
>>
>>106894215
There's inference code on the github, it's not just a plug n play lora. It'll need a custom node.
>>
>>106894228
No indication of vram requirements and how it scales with length either. Does it work in batches, keeping a number of previous frames memory for context so there's no abrupt motion shifts? Fuck knows
>>
>>106894228
Hmm, suppose it's just a case of if kj or anyone else will. There's not much buzz about it, so can't see it happening anytime soon. Similar to how (I think) nvidia released a long video model or something a few weeks ago?
>>
>>106894263
>Does it work in batches, keeping a number of previous frames memory for context so there's no abrupt motion shifts?
Seems to be precisely what it does with this:
>--num_motion_frames
You can set how many previous frames to use as context for the next clip it generates after 81 frames. I guess there'd be a sweet spot somewhere that keeps the motion coherent between segments without bloating VRAM too high, and it seems to snip off the same amount of frames at the end. Set it to use 16 frames for context, the video it generates is 65 frames, and it keeps chaining them together. It looks like you can set individual prompts per segment too.
>>
>>106894478
>Chunk 1:
>Input: A single static image.
>Generation: pipe() runs and generates a full 81 frames.
>Stitching: The code executes video_list += video[:-16]. This takes the first 65 frames (Frame 1 to 65) and adds them to our final video.
>Next Context: The code executes rand_ref_frame_final = video[-16:]. The last 16 frames (Frame 66 to 81) are saved to be used as the input for the next chunk.

>Chunk 2:
>Input: The 16-frame clip from the end of Chunk 1 (Frames 66-81).
>Generation: pipe() takes this motion context and generates a new, full-length clip of 81 frames that seamlessly continues the action.
>Stitching: The code again executes video_list += video[:-16]. It appends the first 65 frames of this new clip to our final video.
>Next Context: The last 16 frames of the new clip are saved for Chunk 3.
The VRAM requirements should be the same as regular WAN, since it's always generating 81 frames. It should be compatible with LoRAs too. Seems almost too good to be true
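
The chunk logic quoted above can be sketched in a few lines of Python. This is a rough sketch and not the repo's actual inference code: `stitch_chunks` and `fake_pipe` are made-up names, `fake_pipe` just returns frame labels in place of a real pipe() call, and the 81/16 numbers are the ones from the quote.

```python
from typing import Callable, List, Tuple

Frame = Tuple[int, int]  # (chunk index, frame index): placeholder for real image data


def fake_pipe(chunk_idx: int, context: list, chunk_len: int = 81) -> List[Frame]:
    """Hypothetical stand-in for pipe(): ignores the context and returns labels."""
    return [(chunk_idx, i) for i in range(chunk_len)]


def stitch_chunks(
    generate: Callable[[int, list], List[Frame]],
    first_input: list,
    num_chunks: int,
    overlap: int = 16,
) -> Tuple[List[Frame], list]:
    video_list: List[Frame] = []
    context = first_input  # chunk 1 starts from the single static image
    for chunk_idx in range(num_chunks):
        video = generate(chunk_idx, context)  # always a full-length clip
        video_list += video[:-overlap]        # keep everything except the tail
        context = video[-overlap:]            # the tail becomes the next chunk's input
    return video_list, context


frames, tail = stitch_chunks(fake_pipe, ["static image"], num_chunks=3)
print(len(frames), len(tail))  # 195 16
```

Each chunk adds 81 - 16 = 65 new frames, so three chunks give 195 stitched frames. In this sketch the last 16 frames of the final chunk only come back as context; whether the real script appends them at the end isn't something the quoted snippet shows.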
>>
>genning porn while animating 3d porn for a living

What a time to be alive. Only so much gooning I can do in between breaks.
>>
B-bros, w-where you at?
>>
>>106894945
im here but im busy watching the latest animeslop during paid work hours (wfh chad).
have this korbo
>>
i cant believe the first gen in comfyui still doesnt allocate vram properly and takes 3x as long to finish because of it before continuing normally from then on
>>
>>106895004
The reason the first gen takes longer is because it takes time to load the model. Then when the model is loaded all subsequent images are fast.
>>
>>106895004
There are also remnants left in VRAM even if you try to unload. You need to restart Comfy to fully purge it. Really noticeable when switching between two high VRAM requirement workflows: you can get OOMs, while restarting between each makes them load and run perfectly.
>>
>>106895040
no nigger i specifically said allocation, it overallocates like double the memory over 24gb and spills into 16gb extra ram before on the next gen it takes 18gb to gen the entire next video
>>
>tfw going from 720p to 800p fixed the broken faces completely
>oom at 860p
>>
File: 1756246025979395.png (3 KB, 170x89)
>>106895064
>24gb
(vram)

i have 24+128gb and a dynamic pagefile, which comfy uses to allocate an additional 100+gb when genning videos btw. i have to set blockswap to 31 for it to not oom during the first q8 wan 2.2 gen anyway, and then change it after the first gen finishes

picrel is what happens and was talked about here before: the first gen overfills vram and spills into ram despite having enough space to fit, it takes a long time to finish, and then the next gen allocates the proper amount of vram and works from then on
>>
>>106895108
offload some of the model to the ram dude
https://github.com/pollockjj/ComfyUI-MultiGPU
>>
>>106895045
yeah, the unload all models node fixes some of those but there is still a problem
>>
>>106892682
tummy
>>
>>106895109
why would it need the pagefile with 128GB of RAM available? Wan isn't that big.
>>
>>106895179
because comfy memory allocation is a meme
>>
>>106895109
3090?
>>
>>106895197
yes
>>
>>106895139
Already am, my dude. I am stocked up on loras.
>>
>>106894952
Korbo?
Isn’t it Holo?
>>
>>106893454
fuck now i have to go watch the annotated series again
>>
>>106895164
an ugly, child one
i prefer adult somewhat muscular with lineae semilunaris visible
>>
File: file.jpg (65 KB, 1280x720)
>>106895216
>he doesnt know
>>
>>106895208
If you are using --fast flag, try --fast fp16_accumulation fp8_matrix_mult cublas_ops instead. There was a commit a month ago that added convolution autotuning but it fucks up on our 3090s holding VRAM during the first run causing either a hang or extreme slowdown on WanImageToVideo node or VAE Decode.
>>
>>106895231
kys faggot
>>
any way to fix the face in wanimate? looks slopped, not really like the reference image face
>>
>>106895257
yeah i know about that too, i got fucked by that update which made this problem x10 worse, but now with just fp16 accumulation or without --fast at all im still getting this problem, which is older than that addition to --fast
>>
>>106895322
bro just buy a blackwell gpu, you arent poor, right?
>>
>>106895333
which also wont be used to its fullest since comfyui will overallocate no matter what
>>
>>106895339
i mean yeah my rtx 6000 pro has plenty of vram free, so it doesnt matter if it overallocates. stop being poor
>>
File: IMG_2507.jpg (140 KB, 1018x768)
>>106895245
This steered me to a Tumblr PowerPoint
Fking hell

It’s Horo btw
>>
>>106895355
it's holo the wise bitch wolf, korbo is the meme name since she has garbage calligraphy
>>
File: file.png (13 KB, 285x68)
>>106895355
>inteGIRL
(X) Doubt
>>
>>106895348
>buys a component to be cucked out of using it fully
good goy
>>
>>106895361
True, but it’s still Horo
L = R
Forever +1 no takebacksies no reset even when you go home
>>
File: IMG_2212.jpg (1.36 MB, 2160x2160)
Horo
>>
>>106895887
>>106895887
>>106895887
>>
>>106895355
who writes "r" as just a vertical line? It clearly says Holo.
>>
Is there any way to get a NovelAI-like system of giving individual characters a tag, as well as the scene with its own tags? I love the system NovelAI has but I'm a broke-ass bum and can only use local models like Stable Diffusion.