[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106833514

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
comfy should be dragged out on the street and shot
>>
Blessed thread of frenship
>>
how to train neta lumina 2 in ai-toolkit
>>
>>106834466
>>106834472
>>106834565
anyone figure out wtf we're supposed to do with this?

the schizo OP keeps replying to people saying his clip L doesn't do anything, but it's the only one that's usable with normal noob models? And the clip G requires a finetuned noob model he hasn't released?
>>
>>106839537
>anyone figure out wtf we're supposed to do with this?
ignore it because the guy is a retard kek
>>
File: sure.jpg (163 KB, 503x747)
163 KB
163 KB JPG
>>106839360
>You can actually, for the first time in local model
>>
>>106839617
yeah, I don't know what this retard is talking about, I don't think Qwen Image's prompt understanding is on another level compared to Flux, both models are really similar, and it means it's humiliating for Qwen because it's almost twice as big as Flux
>>
>t. can't run qwen
>>
>>106839640
no anon, the new narrative is that HunyuanSlop 3.0 is a good model because no one can run it to test it out, that's obvious enough, big model = good model, what's not clicking people???
>>
>>106839662
you must have meant to reply to another post and the conversation is about qwen
>>
>>106839666
>is about qwen
no, it's about "t. can't run qwen" = "you criticize that model but because it's a big one I will assume with conviction that you couldn't run it and test it out by yourself, thefore I deem your argument irrelevant", that strawman narrative fits well on either Qwen or HunyuanSlop, I hope that helped
>>
>>106839714
no need to get upset mr vramlet
>>
>>106839421
Properly captioned giant datasets and training on everything including porn.
>>
>>106839727
>t. has a gtx 1060
>>
>no u
everytyme
>>
File: 1730554021452869.png (1.18 MB, 1646x1437)
1.18 MB
1.18 MB PNG
>other/don't like
looooooool
>>
>>106839749
no need to get upset mr vramlet
>>
>knowing me
>knowing you
>there is nothing we can do
>>
>>106839753
Interesting, I wonder if some api have access that could disable some of them like you can with gemini.
>>
https://civitai.com/models/2029640/disney-simulator?modelVersionId=2297034
I'll never understand people posting things with almost zero explanation.
>>
>>106839753
>ee = A || (A= {})
Fuck I hate javascript faggots so fucking much it's unreal.
>>
>>106839753
>other dont like
kek
>>
>>106839455
midna my wife
>>
File: 1740994872585474.png (317 KB, 1024x818)
317 KB
317 KB PNG
>>106839859
javascript being the official code of the internet was one of the biggest mistake of humanity, even the creator of javascript apologized for that kek
>>
File: qwen___0001.png (1.22 MB, 832x1216)
1.22 MB
1.22 MB PNG
>>106839629
it makes pretty good images when you prompt it well but the output needs so much detailing it's kinda useless to me. it is not leaps and bounds ahead of anything.

that said if you have the hardware and time to run this big ass model for zero benefit, more power to you
>>
can we train i2v wan (any version) locally on the ostris/ramtorch update with 16gb yet?
>>
>>106839488
haven't tried it but there is a lumina examplee config
https://github.com/ostris/ai-toolkit/blob/main/config/examples/train_lora_lumina.yaml
>>
>>106839188
Res_5s takes 5x as long as Euler, but you're only using it for like literally 2 steps. I highly recommend watching the videos by one of the creators of that node pack, I genuinely learned a lot, specifically these two:
https://youtube.com/watch?v=A6CXfW4XaKs
https://youtube.com/watch?v=905eOl0ImrQ

I have
Res_5s - 2 steps
Res_3s - 4 steps
Res_2m - 8 steps
Cfg: 4.3
Beta57 scheduler

As far as the prompts, I'm not sure which specific words are doing the heavy lifting as I havent bothered with trying to figure that out, but once I used the following (which I stole from some good gens on civilian), the results went from slop to not-slop. The positive is in a separate clip text encode and concat'd with the main prompt. From past experimentation I know that this DOES affect the gen versus putting it all in a single clip text encode.

Positive:
candid amateur photo, model, casual framing, slightly uneven household lighting, natural skin pores, real imperfect textures, soft focus transitions, unfiltered atmosphere of an intimate capture, looks like taken on an iphone

Negative:
anime, fat, ugly, wrinkly, bad hands, pixelated, drawing, CGI, ai, stable diffusion, action figure, 2d, cartoon, sketch, render, 3d, painting, digital art, fan art, 2d art, wax, doll-like, perfect skin, 2.5d, smooth skin, jewelry, hyperrealistic, ultrarealistic, ugly, extra fingers, extra limbs, extra hands, makeup
extra body,

Ultimately I think that the prompting matters a LOT for chroma and that there's probably certain words that will insta-slop your gen if present or not...
>>
File: 00013-1746520255.png (1.9 MB, 1024x1280)
1.9 MB
1.9 MB PNG
>>
>>106839991
IDK if it works with the config files but it's not in the webUI yet.

I assume ostris hasn't quite finished doing the qwen implementation either.
>>
File: foxgrapes.jpg (237 KB, 640x1333)
237 KB
237 KB JPG
>>106839617
>Are you serious? Have you tried prompting 3 characters in various outfit, hair colors, and build doing various stuff in various pose in a random background with 3/4 important additional details in Flux-dev 1.0?

You can't do it in Flux, moron. Qwen can. Easily. End of fucking discussion. It's like you're trying very hard to ignore the reality. Try Qwen for like three hard prompts that Flux absolute shit the bed with.

Honestly this reeks of some of the worst pearl clutching, fox and the grape thinking. Flux-dev 1.0 is widely, and with good reasons, recognized as one of the worst non-clip model for prompt understanding, people have tried for two years to improve its prompt understanding, and utterly failed. Qwen arrives with one degree of magnitude better prompt understanding, and all you can say is

>Well it's not THAT good when I prompt one character under a tree
>The image is kind of bad
>People will never be able to make it gen better image
>Why I can't use it so it's bad
>>
>>106840211
The PR for ramtorch is only for qwen and nothing else yet.
>>
Why the fuck chroma te is so huge, flux legacy?
>>
>>106840247
ok that's more definite. just qwen then.
>>
File: ace step.jpg (293 KB, 1796x782)
293 KB
293 KB JPG
any tips with ace step? everything it produces sounds like garbage. like it cant even keep a consistent rythm.
>>
File: ComfyUI_00214_.png (3.87 MB, 1280x1920)
3.87 MB
3.87 MB PNG
>>
>>106840257
chroma isn't huge. have you seen the other newer models? qwen, hunyuanimage, hidream and so on? there's other model types that are also larger (but i just misremembered, not it wasn't neta yume lumina)
>>
File: 00015-1906436612.png (2.39 MB, 1024x1280)
2.39 MB
2.39 MB PNG
>>
>>106840276
TE, text encoder.
>>
>>106840246
not reading all that
>>
>>106840257
>Why the fuck chroma te is so huge
it has to, you won't get good prompt comprehension with a tiny ass te
>>
>>106840303 >>106840257
same thing, most of them have larger text encoders too

qwen's TE for example is 16.6GB, of course most people use the quant'd version but chroma's isn't even big for today's standards
>>
>>106840330
> qwen's TE
LLM?
>>
>>106839421
>Sora 2
>rendering actual humans
>shows potato face as proof

two more week till China fixes it for you

P.S. both examples can be easily genned with Wan Animate
>>
File: 1759065222783304.jpg (66 KB, 618x767)
66 KB
66 KB JPG
Retard vramlet here, why are the next level models do much bigger compared to XL? XL and variants hover around 6-8GB including Text encoders and stuff, Flux and derivates are like 22GB+, why are there no intermediate sized models? Would something that's eg 10GB not offer enough improvement over XL to bother?
>>
>>106840398
because no one managed to train many of the features people wanted on the smaller models and "most" managed to train them on the bigger ones.

"edit" models like qwen image edit and flux kontext with various builtin features but also the prompt comprehension, perspective control, multiple characters and so on. and multilingual support and other stuff too
>>
>>106840428
btw you can add pony v7 based on auraflow to the growing evidence pile that we just can't train smaller models THAT well with current model design/training methods.

it'll probably work much better on qwen (the next plan for pony).
>>
File: 00019-1523721866.jpg (1.24 MB, 2048x2560)
1.24 MB
1.24 MB JPG
>>
>>106840398
BTW the intermediate size models are essentially Neta Yume Lumina or Chroma Radiance.

Because the higher end local is now Qwen and the like.

I doubt anyone will do a "further in-between" unless someone figures out a nicer model architecture. Might happen but it's not currently obviously there.
>>
File: IMG_2311.jpg (74 KB, 934x2000)
74 KB
74 KB JPG
>>
File: ChromaGiger_00113_.jpg (1.16 MB, 1304x1672)
1.16 MB
1.16 MB JPG
>>
I don't get saving images of people that make you mad and endlessly seethe online. just turn the screen off lol. go outside lmao
>>
>>106840690
mental illness is hard to understand
>>
File: IMG_1487.jpg (1.61 MB, 1808x3216)
1.61 MB
1.61 MB JPG
>>
>>106840698
>>106840690
thanks nick for this insight
>>
>>106840584
>>106840700
who is this guy?
>>
File: radiance.png (2.49 MB, 848x1488)
2.49 MB
2.49 MB PNG
>>106840579 >>106840398
there were/are variants of this - the auraflow pony v7, pixart sigma, cosmos, janus pro 7b and others.

you can try them. many are better than SDXL in a lot of ways but most most likely they just won't get major finetunes now.
>>
File: 00022-382901949.jpg (1.75 MB, 2048x2560)
1.75 MB
1.75 MB JPG
>>
File: file.png (3.07 MB, 1496x2016)
3.07 MB
3.07 MB PNG
>>
File: radiance.png (2.4 MB, 848x1488)
2.4 MB
2.4 MB PNG
>>
File: radiance.png (2.62 MB, 848x1488)
2.62 MB
2.62 MB PNG
>>106840679
cool, you trained giger tentacles?
>>
cozy thursday bread mhm
>>
>>106840128
yes but whats the expected caption? tags + nl?
>>
>>106840823
tags+nl or just tags, yes.
https://www.neta.art/blog/neta_lumina_prompt_book/
>>
>>
File: ChromaGiger_00129_.jpg (843 KB, 1304x1672)
843 KB
843 KB JPG
>>106840807
yeah. almost everything he made, including sculptures
>>
File: ComfyUI_temp_nytma_00016_.png (2.27 MB, 1024x1344)
2.27 MB
2.27 MB PNG
>>106840876
Could you post the lora?
>>
File: i3098.jpg (429 KB, 1024x1024)
429 KB
429 KB JPG
fr
>>
>>106840584
literally my face when i realize, that local video will no longer evolve
>>
File: ComfyUI_00174_.png (1.26 MB, 864x1536)
1.26 MB
1.26 MB PNG
>>
>>106839891

https://files.catbox.moe/z6pl57.mp4
>>
>>106840786
I didn't know I liked hairy butts, new fetish unlocked, thanks!
>>
Why?
>>
>>106840918
>I didn't know I liked hairy butts
https://youtu.be/Zd8vzIRQLLM?t=7
>>
>>106840945
spaghet
>>
>>106840945
>v45
Lol? Also are you double quanting it? Use a Q8 like a normal person.
Also pretty sure the vae is wrong. you need AE
>>
>>106840945
preview in sampler is crap, and it never worked for me
>>
Please, stop using Chroma, it's a flawed model.
>>
>>106840889
it's there
>>
>>106840998
Do you need to trigger the Beksinski lora with the exact polish name?
>>
>>106841023
no trigger
>>
File: msedge_PL6jCrYACr.png (195 KB, 1329x466)
195 KB
195 KB PNG
Spooky season! Post your spooky workflows!
>>
>>106840965
Fp8, I've tried Q8. It's the same vae.

>>106840978
Output is the same.
>>
>>106840945
>Why
UncomfyUI sucks
>>
>>106840996
>Please, keep using Chroma, it's a fun model.

I agree!
>>
>>106841061
Try to use weight type default.
>>
File: 1755333332246290.png (195 KB, 2004x845)
195 KB
195 KB PNG
>>106840996
>Please, stop using Chroma, it's a flawed model.
even the pony fags admits it
https://www.reddit.com/r/StableDiffusion/comments/1o0v232/comment/nij9lk3/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button
>>
when idling in comfy, per the little GPU graph, my GPU revs up to 25% - 50% for a second at a regular interval, almost like heavy breathing.

Why does it do this? Is it exhausted, do I work it too hard?
>>
File: radiance.png (2.79 MB, 848x1488)
2.79 MB
2.79 MB PNG
>>106840876
very cool. so you can do, like, walls with alien statues and mechanical tentacles and all that stuff now? (i'm guessing yes from the alien i saw before).
>>
>>106841107
That's just the miner. It only turns on when you're not genning, nothing to worry about.
>>
>>106841097
AntiChromaSchizo (me) was right all along, check RadianceSchizo's gens for proof
>>
File: file.png (2.66 MB, 848x1488)
2.66 MB
2.66 MB PNG
>>106840996
> a flawed model
and the unflawed model with good anime 1girls nsfw and so on is... nothing else, even if illlustrious/noob/neta-yume lumina and so on area also pretty cool models they're all flawed.

like the original chroma, radiance has learned a lot of cool shit.
>>
File: ChromaGiger_00135_.jpg (729 KB, 1304x1672)
729 KB
729 KB JPG
>>106841125
should be yeah but you still need long descriptive prompts
>>
>>106841166
radiance has garbage details
>>
There's no reason to console war over free models. Each and every one has a place in someone's workflow. Redirect that energy into clowning on the creators, like Ponyfag's commitment to artist censorship to use them privately.
>>
>>106841107
try disabling all custom nodes and seeing if it happens.
>>
>>106841211
this, he still hasn't learnt a thing, the reason why Illustrious has dethroned pony v6 is because the former model has artist tags
>>
>>106841211
Yes but let us be clear, Chroma is a failed model, but anons who test it do so for fun and support because it is a local model.
Now let us continue posting gens, but always keeping this in mind
>>
File: radiance.png (2.3 MB, 848x1488)
2.3 MB
2.3 MB PNG
>>106841182
usually a bunch of booru tags and a short description of what you think might be difficult to express as booru tags also work.
>>
>>106841252
oh ran...
>>
>>106841096
Doesn't work.
>>
>>106841274
How about UNET loader or unified chroma loader?
>>
>>106841259
Where is her arm? did you prompt to have 2 tails?
>>
File: 00029-3690700950.jpg (1.27 MB, 2048x2560)
1.27 MB
1.27 MB JPG
>>
>>106841199
to me >>106841125 looks like one of the more detailed results apart form like hunyuanimage3 or something. but that has no anime nsfw finetuning, is aesthetically shit and barely runs anywhere.
>>
>>106840584
>>106841301
>>
>>106840998
NTA but based ty
>>
>>106841133
I honestly feel like it is a miner, not necessarily from comfy, but it does seem odd...

>>106841211
Based

>>106841228
I'll try this when I get home, thanks
>>
>>106841297
Log says "Requested to load PixArtTEModel_" despite I selected "chroma" for TE loader.
>>
>>106841303
you're blind. necklace, buttons and belt are all fucked up, they're all fucking smudgy
>>106841166
here its even worse, the gold details on the armor blend and smudge.
>>
>>106841301
Miku love
>>
>>106841334
>>106840700
>>
>>106841310
it isn't a miner, people here just hate comfy and troll. It is always better to watch your gpu power usage, not the % usage. Likely it is just desktop overhead, in those moments the gpu is clocked very low with lower power use so the 'in use' percentage is high even though nothing is happening. If you actually have miner you'll see high power use
>>
>>106841311
No that's fine. Have you tried the template chroma workflow from comfy?
>>
File: radiance.png (1.92 MB, 848x1488)
1.92 MB
1.92 MB PNG
>>106841299
i did not prompt a specific amount of tails, so it's 1 or more by booru standards

her arm is that arm transformation, or missing, IDK. both happen in the training data or it might be a mistake of sorts in the diffusion.
>>
>>106841310
With ComfyOS (coming son) we only use your rig for mining 11% of the time
>>
>>106841301
thats an interesting style. is it prompt or lora?
>>
>>106841372
>>106840700
>>
>>106841350
Okay well that's good to hear. So other people probably see the "breathing" too then? I'll check out the actual power consumption when I get home

Yeah I'm aware of the hate for comfy, dont understand it though. I love making overly complex workflows just as much as genning desu
>>
>>106841361
Memes aside, I'm actually looking forward to ComfyOS. Gonna setup a VM for it with GPU passthrough and then i'll never have to worry about malicious custom nodes infecting my host.
>>
>>106841351
Yes, same output.
>>
>>106841390
Do you have the AE vae? The model isn't 1:1 to flux.
>>
>>106841381
You can do that right now, bozo

I just have a dedicated rig on a separate VLAN
>>
File: 1760035407542269.png (19 KB, 669x323)
19 KB
19 KB PNG
Why does updating NeoForge require verification now?
>>
>>106841397
Yes. Also tried all TEs.
>>
>>106841419
What is "vlan"? I don't use Mac.
>>
>>106841321
idk what you mean, do you zoom in to 1:1 and notice it doesn't look like vector art or something?
>>
>>106841529
you said details are fine, but they're not. If you're implying other models also do shit details, yes but not as fucking garbage as whatever chroma radiance produces.
now go kys faggot
>>
>>106841419
I'm aware but I see no reason to bother doing that when ComfyOS will be available next year or so. I also already have Comfy in a docker.
>>
File: ComfyUI_00186_.png (1.02 MB, 864x1536)
1.02 MB
1.02 MB PNG
>>
>>106841352

https://files.catbox.moe/7bspgx.mp4
>>
File: ComfyUI_0026-urd.png (3.62 MB, 1296x1728)
3.62 MB
3.62 MB PNG
>>
File: file.png (3 MB, 1530x1120)
3 MB
3 MB PNG
>>106841554
no, that's just your standards. it's fine and just getting better, also zero tricks so far to make "highres fix" type images.

99.99% of photos are worse than this one too (actual camera sensor) and it's looking nowhere near chroma radiance at 1:1.
>>
File: file.png (2.24 MB, 848x1488)
2.24 MB
2.24 MB PNG
>>106841666
i wonder why this model likes orange hands, but it's cool as fuck.
>>
>>106841097
If they encountered the same issue with both Flux and AuraFlow, maybe the issue is them and not the models? There's a good chance they'll add Qwen to that list too when it doesn't magically work like they want it to.
>>
File: 00030-3525737182.png (2.51 MB, 1024x1280)
2.51 MB
2.51 MB PNG
>>
File: ChromaGiger_00147_.jpg (673 KB, 1304x1672)
673 KB
673 KB JPG
>>
>>106841097
The most important takeaway from his writeup at https://civitai.com/articles/19986 is that T5 can't do style-content separation. Anons in the thread have confirmed it, too. It's never been so over, a year of hopes and dreams wasted. Now the question is, do llms share this with T5?
>>
>>106841817
I mean they're gonna finetune Qwen Image next and that one doesn't have T5 as a text encoder
>>
>>106841826
It has a pure llm, which may or may not be even worse. Vllms have abysmal style understanding, substituting it with meaningless slop almost always.
>>
>Flux: Can't do lap nursing handjobs or small penis blowjobs

>Chroma: Can't do lap nursing handjobs or small penis blowjobs

>Qwen: Can't do lap nursing handjobs or small penis blowjobs

>HiDream: Can't do lap nursing handjobs or small penis blowjobs


Guess I'll stick to Illustrious for another few years.
>>
>>106841666
>horizontal shoulders
kek now that's a troon
>>
>>106840906
damn that's pretty sharp
>>
File: bewba.jpg (129 KB, 832x1216)
129 KB
129 KB JPG
>>106839477
>>106839473
>the duality of man! ;3
blessed zones of frems!!
>>
File: radiance.png (2.57 MB, 848x1488)
2.57 MB
2.57 MB PNG
>>106841741
for styles maybe, but let's not put too much weight on that

the failure probably most people regret is how a lot of nsfw and nsfw-adjacent stuff (such as prompting multiple characters with various details and poses - lewd or not) doesn't work well. it would be seen as a great success by most otherwise *even if* styles didn't work without lora. just a less great success than if styles also worked.
>>
File: ComfyUI_00190_.png (1.12 MB, 880x1184)
1.12 MB
1.12 MB PNG
>>106840786
>>
>>106841919
>>106841922
>>106841923
fuck, marry, kill.. GO!
>>
>>106841623
what model is this?
>>
>>106841097
>>106841741
>>106841817
>confuses the model with a convoluted "styles cluster" shit instead of simply giving it more nuance with real artist tags
>is surprised why styles don't work well on his model
do not take this clown's words seriously, he's the problem, not T5
>>
I havent been here since biglust 16 came out and allowed me to gen excellent deepfakes. Are people still basically just using illustrious and NoobAI? Or has something better come out since then?
>>
>>106840181
thanks, I'll take a look
>>
>>106841923
marry
>>106841922
kill
>>106841919
fuck
>>
>>106841919
kill>>106841929
>>
>>106841947
Neta Yume, Chroma, Qwen
>>
i will fuck and marry anyone\anything if you kill only postcard
>>
>>106841981
Thanks I'll look into them. Do they have specific strengths versus illustrious and noob? I like how illustrious easily has SDXL lora support for anime styles and already has a ton baked in, and for realism and deepfakes its better than flux in getting the appearance right and looking realistic, it just sucks dick at adhering to prompts.
>>
File: ComfyUI_00191_.png (1.17 MB, 880x1184)
1.17 MB
1.17 MB PNG
>>
>>106841994
>filename
>>
File: 00032-3337549059.png (2.24 MB, 1024x1024)
2.24 MB
2.24 MB PNG
>>
>>106841930
it's just fluxerino with gta 6 lora (https://civitai.com/models/688840?modelVersionId=1775782)
testin new lunatix installation
>>
File: ComfyUI_00195_.png (1.02 MB, 880x1184)
1.02 MB
1.02 MB PNG
>>
Could I please have the original full size image of that cockatiel.
>>
File: cockbird.png (54 KB, 548x803)
54 KB
54 KB PNG
>>106842116
pic related is the original
>>
File: 00033-2790987070.png (2.01 MB, 1024x1280)
2.01 MB
2.01 MB PNG
>>
>>106842129
:^)
>>
>>106842129
Thanks anon.
>>
>>106842138
Cool
>>
File: 1743988744660869.mp4 (1.45 MB, 720x960)
1.45 MB
1.45 MB MP4
>>106841923
>>
File: lilbullieshehe.mp4 (1.29 MB, 726x1158)
1.29 MB
1.29 MB MP4
see ya later!!! <3
also NO kill!
NO HIT!
byee!

>chckpnt: wan 2.1
>lora: https://tensor.art/models/906123123449986583
>>
>>106842169
zamn
>>
>>106842169
>>106841923
>t. smokes camel turkish silvers exclusively
;3
>>
>>106840181
wait Res_2m and Res_3m are the same speed, why not use Res_3m all the time then?
>>
>>106840998
...where?
>>
File: garfield_thunberg.png (1.05 MB, 1024x1280)
1.05 MB
1.05 MB PNG
>>106840574
>>
why wouldn't t5 let you do content-style separation?
>>
>>106842247
probably tensor washback
>>
>>106842169
got dam
>>
File: x.webm (1.96 MB, 1120x1504)
1.96 MB
1.96 MB WEBM
>>106842009
>>
File: ComfyUI_00194_.png (3.3 MB, 1496x2016)
3.3 MB
3.3 MB PNG
>>
File: 00441-2999025034.png (360 KB, 512x640)
360 KB
360 KB PNG
>>
>>106842336
lols pretty good, even fixed the 3rd arm
>>
File: 00443-501995628.png (1.04 MB, 767x927)
1.04 MB
1.04 MB PNG
>>
File: ComfyUI_00375_.mp4 (624 KB, 640x832)
624 KB
624 KB MP4
>>106842231
>>
>>106842483
I love that the hat is also alive
great video
>>
File: newwhip.png (1.74 MB, 1680x864)
1.74 MB
1.74 MB PNG
my new car is coming to life
>>
File: mynewcar.png (463 KB, 1672x856)
463 KB
463 KB PNG
>>106842504
made from some anons shitpost
>>
>>106842504
that's pretty pope
>>
File: bibis_gaza_adventure.png (1.21 MB, 856x1216)
1.21 MB
1.21 MB PNG
>>106842528
I have one of those too
>>
>>106842504
listening to city pope
>>
File: ComfyUI_00377_.mp4 (1.17 MB, 640x832)
1.17 MB
1.17 MB MP4
>>106842390
>>
Ia that all? All you can conjure anons?
>>
>>
>>106842569
requesting a vramKING to make them bounce while squatting
>>
>>
>>106842504
>>106842518
kek, when the car you drew as a kid turns up
>>
File: ComfyUI_00379_.mp4 (989 KB, 640x832)
989 KB
989 KB MP4
>>106842569
how about while sitting?
>>
>>
>>106842594
Nice!
>>
>>
File: ComfyUI_00380_.mp4 (1.62 MB, 720x1280)
1.62 MB
1.62 MB MP4
>>
File: file.png (1001 KB, 848x1060)
1001 KB
1001 KB PNG
>>106841301
And stop staring at me with them big ol' eyes!
>>
File: IMG_20251010_012043.png (1.34 MB, 768x1280)
1.34 MB
1.34 MB PNG
>>106842594
Here is your reward!
>>
erm.... anonie?
>>
File: ComfyUI_00232_.png (3.1 MB, 1496x2016)
3.1 MB
3.1 MB PNG
>>
File: ComfyUI_00381_.mp4 (1010 KB, 720x1280)
1010 KB
1010 KB MP4
>>
>>106842642
cursed
>>
>>106842642
This is new ComfyUI mascot.
>>
>>106842569
crossed legs conjure me
>>
File: x.mp4 (1.82 MB, 1120x544)
1.82 MB
1.82 MB MP4
>>
>>106842691
lols

is this wan2.2?
>>
>>106842691
BASED BASED BASED BASED
>>
File: rq.png (1.9 MB, 768x1352)
1.9 MB
1.9 MB PNG
>>106842467
>>
Gradual change of schema to v3 currently on latest comfy commits, I do not recommend to update to latest nightly for a few days.
>>
>>106842695
yes, quick gen with wan after qwen image edit
>>
comfy is such a piece of shit.. all the noodles on my shit just keep disappearing for no reason.. shit's still connected, but the connectors just disappear
>>
You now remember FramePack
>>
>>106842711
this happened to me after I hadn't restarted firefox for a few days
>>
File: c?ckmobile.png (2.11 MB, 1424x728)
2.11 MB
2.11 MB PNG
>>106842708
basically if you didn't do this thing with low step count it might look better yet
>>
File: ComfyUI_00384_.mp4 (875 KB, 640x832)
875 KB
875 KB MP4
>>
File: rq.png (2.04 MB, 768x1352)
2.04 MB
2.04 MB PNG
>>106842714
i remember it but i had already preferred hyvid or wan
>>
>>106842616
Sweet! Get your reward.
I
>>
File: ComfyUI_00385_.mp4 (918 KB, 640x832)
918 KB
918 KB MP4
>>
File: AniStudio_output-00052.png (919 KB, 1344x1088)
919 KB
919 KB PNG
>>106842736
how did you cure her down syndrome?
>>
File: IMG_20251010_014308.png (1.4 MB, 768x1280)
1.4 MB
1.4 MB PNG
>>106842789
Here
>>
File: ComfyUI_00387_.mp4 (1.15 MB, 640x832)
1.15 MB
1.15 MB MP4
>>106842848
it did it by itself, i just prompted for her to stare and then blink once, and it was nah bruh we fixin this shit
>>
File: ComfyUI_00262_.png (3.56 MB, 1536x2048)
3.56 MB
3.56 MB PNG
>>
File: 00445-1432466528.png (1.16 MB, 768x960)
1.16 MB
1.16 MB PNG
>>
qwen edit doesn't really understand how to place a kippa which is based but also limiting
>>
>>106842934
>qwen edit doesn't really understand how to place a kippa which is based
shalom
>>
>>106842848
We don't need you.
>>
File: 00446-2561653272.png (1.07 MB, 768x960)
1.07 MB
1.07 MB PNG
>>
>>106842848
We need you.
>>
File: QwenEdit_00038_.png (908 KB, 768x960)
908 KB
908 KB PNG
>>106842960
Shalom!
>>
playing with subgraphs made it really easy to finally have a clear view of my noodle fest
>>
File: ComfyUI_00393_.mp4 (864 KB, 832x832)
864 KB
864 KB MP4
>>
File: under the rug.jpg (80 KB, 640x480)
80 KB
80 KB JPG
>>106843011
>>
>>106843086
in a way yes, but it's for stuff I never modify in any way, I only care about input/output for them, so it doesn't matter how noodly it is in the subgraph
>>
>>106843096
seems like you are butthurt
>>
File: ComfyUI_00394_.mp4 (654 KB, 832x832)
654 KB
654 KB MP4
>>
>>106843101
not at all but feel free to imagine whatever you want
>>
File: ComfyUI_00395_.mp4 (727 KB, 832x832)
727 KB
727 KB MP4
>>106843101
you seem retarded desu
>>
File: 00448-2794823676.png (991 KB, 960x768)
991 KB
991 KB PNG
>>
File: ComfyUI_00396_.mp4 (510 KB, 832x832)
510 KB
510 KB MP4
filth
>>
File: q.png (1.95 MB, 768x1352)
1.95 MB
1.95 MB PNG
>>106842792 >>106842858
>>
>>106843198
Insectile...
>>
File: sit.mp4 (1.21 MB, 1056x1856)
1.21 MB
1.21 MB MP4
>>
>>106843215
got dam
>>
File: 00449-4062480162.png (980 KB, 960x768)
980 KB
980 KB PNG
>>
File: y.mp4 (1.45 MB, 1056x1856)
1.45 MB
1.45 MB MP4
>>
File: 1744082936480043.jpg (656 KB, 1989x1127)
656 KB
656 KB JPG
I feel so fucking dumb. I'm trying to get ComfyUI to run. I have 24GB of VRAM and I'm trying to do a simple small image to video test, but I keep getting Disconnected error, but nothing shows in the logs.
I see I have 2 of these nodes loading the same model, should I disable one of them? what's the purpose of having 2?
>>
>>106843261
>I see I have 2 of these nodes loading the same model, should I disable one of them? what's the purpose of having 2?
typically you need a high and a low wan22 model
but I don't see if that's how it's set up here. maybe you do have it wrong tho and still need the high model
>>
File: 00450-3186642153.png (1011 KB, 960x768)
1011 KB
1011 KB PNG
>>
File: bounce.mp4 (1.94 MB, 1120x1856)
1.94 MB
1.94 MB MP4
>>106842600
using the seat cushion to bounce
>>
>>106843261
>keep getting Disconnected error
never seen that. Are you using the default wan template? It has the lightning loras built in and should be plug and play. Unless you are being silly and have a 7900xt thinking your 24gb of ram means something.
>>
File: ComfyUI_00401_.mp4 (734 KB, 640x832)
734 KB
734 KB MP4
>>106842722
well that's gay af, but it worked.. restarting ff seems to have fixed it
>>
>>106843294
heh, cute
>>
File: 1759367913856391.jpg (126 KB, 1634x190)
126 KB
126 KB JPG
>>106843313
Just followed the guide on anime ldg, using the portable version of comfy.
The last thing it gets written to the log is this, is there a way to verbose the logs in portable?
>>
Neta LoRA status?
>>
how much vram do I need to run qwen without the lightning loras at reasonable gen times?
>>
>>106843356
about 3fiddy
>>
File: ComfyUI_00407_.mp4 (735 KB, 640x832)
735 KB
735 KB MP4
>>
>>106843334
try the default comfy wan template, you can at least see if it works. Literally all you have to do is load the template and get the linked models (maybe already have them).
>>
>>106843356
get a 4090/5090
>>
File: dept_ch39_00042_.png (2.97 MB, 1664x1112)
2.97 MB
2.97 MB PNG
>>106843317
I thought the second pair of hands would have been the chickens. now I dont know what to think
>>
>>106843411
holy sexo
>>
File: ComfyUI_00200_.png (1.22 MB, 1024x1024)
1.22 MB
1.22 MB PNG
>>
>>106843401
>$3500 canadian communism tokens
sheeeeeeeit
if one of my shitcoins pops ill do it, AMD is twisting my balls with rocm
>>
Today's experience with using comfyui.

>miss click node and drag with mouse select entire webui like a fucking webpage.
>prompt nodes boxes now lag when you merely click inside of them to edit the prompt.
>prompt nodes randomly losing the cursor.
>prompt nodes not showing cursor in correct location within the prompt text.
>hit F5 to fix prompt issues, takes ages to load, also fucks with the load from output directory node even when you've set automatic refresh off.
>for what ever reason the UI lags randomly from time to time, like it just hangs on ksampler for ages from some minor change to prompt.
>never frees system ram, starts off fast but always memory leaks until your need to restart it unless you disable caching, free nodes cache what ever that button is does fuck all...

Is this guy a fucking retard or what? Because over the last year it has gotten worse and worse, what a piece of a fucking shite. There is nothing worse in a production environment than having to fucking fight against the fucking tool being a laggy piece of bloatY fucking shite.

GOD FUCKING DAMN IT, I SHOULD NOT BE LAGGING JUST BECAUSE I SWITCH FROM ONE SDXL MODEL TO ANOTHER.
>>
>>106843489
works perfectly fine on my machine
>>
>>106843489
doesn't work on my machine either
>>
>>106843513
just wait till you use a big workflow retard. And here is the thing is also works on my machine but over the last year its gotten slower and slower and more annoying and fucking useless bloat added that no one wanted.
>>
File: f.mp4 (1.93 MB, 1120x1856)
1.93 MB
1.93 MB MP4
>>106843317 >>106843381
excellent

>>106843411
it's a nice plot twist
>>
>>106843541
what do you use to upscale? looks nice.
>>
>two fav loras are fighting
>>
Wan vace 2.2 released, no mention in thread?
>>
i dont and wont care about wan until i can do more than 5 seconds.
>>
still not upgrading from my double 3090 setup, costs less than one 4090 and runs LLMs better
>>
>>106843600
link?
>>
>>106843600
Source? I don't see anything on their huggingface
>>
File: 1756093988760355.jpg (1.15 MB, 1416x2120)
1.15 MB
1.15 MB JPG
>>
File: f.mp4 (750 KB, 1056x1856)
750 KB
750 KB MP4
>>106843590
just simple fast lanczos in this case, i went for the speedier gens
>>
>>106843541
gud one
>>
>>106843614
>>106843615
I got an email about it, but when I clicked it, it was gone. Guess the dude accidentally released the video too soon.
>>
File: sit.mp4 (1.02 MB, 1056x1856)
1.02 MB
1.02 MB MP4
>>106843655
ty, have another sitting
>>
File: 1755997138156207.jpg (916 KB, 1416x2120)
916 KB
916 KB JPG
>>
File: ComfyUI_05755_.png (769 KB, 888x1176)
769 KB
769 KB PNG
>>
File: ComfyUI_05797_.png (930 KB, 736x1408)
930 KB
930 KB PNG
>>
>>106843731
Very smug!
>>
>>106843731
smugliest son of a bitch
>>
>>106843215
was it radiance pic?
>>
File: sit.mp4 (1.94 MB, 1056x1856)
1.94 MB
1.94 MB MP4
>>106843731
i need a 4x strategy game with this race

>>106843738
what a cultured museum

>>106843805
yes
>>
>>106840858
based
>>
File: ComfyUI_00413_.mp4 (1.78 MB, 1024x1024)
1.78 MB
1.78 MB MP4
>>
>>106843743
>>106843751
It looks more like extreme disgust to me, kind of how my face looks whenever I browse through /pol/
>>
File: rw_g_1b_00028_.jpg (1.8 MB, 1344x1728)
1.8 MB
1.8 MB JPG
>>
>>106841437
Anyone???
>>
>>106843411
>>106843440
>holy sexo
Which one?
>>
github requires a login to clone and pull ding dong
>>
>>106841437
>>106843997
You damn well know why.
>>
>>106839753
>other_dont_like
Literally me
>>
File: 1746138512851815.jpg (833 KB, 1416x2120)
833 KB
833 KB JPG
>>
>>106844065
Why?
>>
>https://pbihao.github.io/projects/DreamOmni2/index.html
https://www.youtube.com/watch?v=8xpoiRK57uU

Qwen Edit about to get mogged.
>>
File: 00451-1880026191.png (1.13 MB, 768x960)
1.13 MB
1.13 MB PNG
>>
>>106844100
cool style
>>
>>106840398
nunchaku flux/qwen/chroma
>>
>>106844100
legit scary
>>
>>106844070
i hate women with these faces
>>
>>106844207
>>106844207
>>106844207
>>106844207
>>106844207
>>
>>106842632
catbox?



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.