[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>107957379

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>ZiT
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg
>>
mfw
>>
Anon, why last Qwen image model is not more recommended?
Accuracy and anatomy are dope.
>>
File: screenshot.1769305299.jpg (67 KB, 552x307)
67 KB
67 KB JPG
>>
>>107960810
This will never not be funny to me.
>>
>>107960797
I do not have the patience for it. It’s insanely slow for what is ultimately a mediocre output.
>>
As much as qwen is much better than zit on prompt following/understanding purely for being larger I didn't like how it made cartoons and 3d stuff, it all had that homogenized ai look which zit (and klein) do not
>>
>>107960797
Klein beats it for most uses and is like 10x faster
>>
File: file.png (2.56 MB, 2336x1635)
2.56 MB
2.56 MB PNG
>>107960797
OK I see the same patterned noise as my issue with klein
>>
File: 32178237.png (3.43 MB, 2048x1024)
3.43 MB
3.43 MB PNG
>>
>>107960649
I use NBP for commercial stuff. Let's just say aesthetically, Flux.2 Klein has it beat for photography by a margin, and for stuff it doesn't know, the edit model is enough to bridge the gap. The only thing Flux.2 Klein is really behind NBP still is text, somewhat. It is behind, but not as far behind as ZIT. A change in architecture for Flux.3 to autoregressive would be really interesting to see.
>>
>>107960906
>A change in architecture for Flux.3 to autoregressive
That would balloon the size to dozens of GB big. But I get it, it's the secret sauce of the closed sota models.
>>
>>107960906
>NBP
Searched google for what that was and stumbled upon this subreddit https://www.reddit.com/r/IndianArtAI/comments/1ql0wlq/one_image_to_multiple_shots_using_nbp_workflow/
Beyond parody
>>
>>107960950
it's just nano banana pro anon
>>
>>107960797
Looks like a magazine photograph.
>>
File: file.jpg (274 KB, 1679x1491)
274 KB
274 KB JPG
any trick to stop klein 9B from giving way too many details on skin to the point of making everyone 10 years older than they are?
>>
>>107960984
Probably something like an instagram lora once it's out.
>>
>>107960984
I get this when I do more steps than what the distill was made for, since you can't increase the steps if you want more quality you'd need to use the 2s,3s etc samplers from RES4LYF
>>
>>107960934
I mean NBP.
2 Klein was trained really well for the same 2K res that I'm prompting NBP. The detail it's capturing in images compared to smooth skin NBP is just crazy.
>>
How does LoRA training for Klein work? Do I need an A/B set for edit or what?
>>
>>107960994
thanks, going from 8 to 4 steps helped a lot, but the model still wants to add endless skin details from peach fuzz to wrinkles and make every person 30+

>>107960992
guess I'll wait for that
>>
>>107961026
You just train it normally and then you can prompt it to add whatever details you trained on in edit mode
>>
>>107960906
Google also cheats and feeds itself into nbp prompts, which we really can't do easily.
>>
>>107961036
Try to juggle sampler and scheduler settings. Some just like to make up random shit.
>>
>>107961057
>Google also cheats and feeds itself into nbp prompts

Yes, but Flux team can. I was certain we were a lot farther behind than we are now. I'm convinced we are only bottlenecked by model size. It legit feels like a tiny NBP.
>>
>>107961093
I think I got it to behave, I added "smooth skin" to the prompt and it stopped making grandma
man I miss NAG
>>
File: 266514413.png (3.91 MB, 2048x1024)
3.91 MB
3.91 MB PNG
>>
>>107960797
I still prefer Qwen for my use-case (Anime to photo-real), Klein changes too much, like outfit and hairstyle, while making some anatomical mistakes like too big heads if the reference is too stylized. Klein is much more realistic, better textures and ofc, 10x faster, but the output is not nearly as accurate as QIE.
One thing though, Klein can do anime to photo-real without any lora, just by prompt, while QWEN needs a bunch of loras to get decent results and there are some flaws that cannot be fixed, as far as I've tested.
I'm really hoping someone makes a lora that closes that gap for Klein, or that Zedit turns out to be better than both somehow...
>>
>>107961097
That's fair. Right now the tech giants benefit from the enormously inefficient architecture since it keeps smaller firms out. The US is becoming such a snarl of bad policy that I'm hopeful competitors can keep stepping up.
>>
File: 01505-3454300269.png (1.2 MB, 896x1152)
1.2 MB
1.2 MB PNG
>>
File: radiance_x32.jpg (144 KB, 1280x1280)
144 KB
144 KB JPG
>>107960797
The downsides are a bunch - it tends to only change minor details on the same prompt if you change seed and various of the subjects people like best are only somewhat trained and only somewhat aesthetic/sexy/[...]. And then there is also the slow gen speed.

I do often recommend Qwen, but not usually first if the question is "what is generally good". It works best for specific uses.
>>
File: radiance_x32.jpg (106 KB, 1280x1280)
106 KB
106 KB JPG
>>
it looks absolutely terrible
>>
can't wait for all the LTX-2 haters to say they actually liked it all along when the porn tunes get good enough in a month.
>>
File: radiance_x32.jpg (381 KB, 1280x1280)
381 KB
381 KB JPG
>>107960906
>A change in architecture for Flux.3 to autoregressive would be really interesting to see.
Why do you expect good results from that. Just the text? It doesn't seem worth bloating the models by orders of magnitude for that, doesn't it?
>>
Is this the nigger thread?
>>
>>107960984
>everyone 10 years older than they are?
Specify -2 years old.
>>
File: radiance_x32.jpg (245 KB, 1280x1280)
245 KB
245 KB JPG
>>
File: radiance_x32.png (2.69 MB, 1280x1280)
2.69 MB
2.69 MB PNG
>>
>>107961292
I used "smooth skin" and it worked, so it's ok now
>>
File: ComfyUI_00069_.png (2.86 MB, 1248x1824)
2.86 MB
2.86 MB PNG
>>107961334
love me some tifa
>>
anon was right, it's so fucking fun going into old porn stashes and upscaling and making images more cinematic using klein 9b
>>
File: radiance_x32.jpg (240 KB, 1280x1280)
240 KB
240 KB JPG
>>107961388
i did get a few more when I tried the high angle tifa
>>
File: radiance_x32.png (2.43 MB, 1280x1280)
2.43 MB
2.43 MB PNG
>>
File: radiance_x32.png (2.75 MB, 1280x1280)
2.75 MB
2.75 MB PNG
lower perspective
>>
is gguf worth it if its like 20% slower?
>>
Stupid fur shit load of shit stones has me arguing in favor of a vae.
>>
File: radiance_x32.jpg (224 KB, 1280x1280)
224 KB
224 KB JPG
>>107961425
gguf are often a very good compromise. on many current models a q8 will look very close to fp16/bf16, and other gguf quant levels also often are great for their size.
>>
Klein tiled vae gives sharper output but slightly more color cast. what gives?
>>
>>107961425
Q8 is practically identical to fp16. If you want it to be as fast as possible and have a newer card fp8 is also an option, but q8 is still going to be more accurate.
>>
>>107961425
Always run Q8. It's basically free gains for no price.
>>
>>107961477
I have a 3090. was using fp8 on klein. could gen in around 3.5-4sec . with q8 its around 4.5sec
>>
>>107961471
rng
>>
>>107961490
Why are you running fp8 quants when you can fit the entire model twice nigga?
>>
>>107961499
dunno. I thought bf16 was placebo.
>>
>>107961477
the difference is negligible between fp8 (the current "scaled" ones) and q8 on my tests, and I tested a lot, and I get a huge boost in speed with fp8 on my 5090, so I just go with fp8 nowadays
>>
>>107961500
it may well be, I don't see any substantial difference but I'm using it regardless
>>
>>107961425
No. Also, 20% slowdown is generous, for me it's worse.
>>
>>107961490
I don't think 30 series can take advantage of fp8, unless I'm mistaken.

>>107961501
This has been my experience as well. If there's a quality difference it's not enough to be noticable.
>>
I think I'm going back to fp8 lol. I'll keep gguf for the text encoder tho.
>>
>>107961266
Not just text. Prompt following is one area where it's still clearly lacking. And didn't Chinks release a smaller autoregressive model?
>>
>>107961527
>Prompt following
Zit and qwen can both follow prompt to the point of extreme rigidity. There's no point in adding gigabytes of reinveting the wheel. Flux scuff is just generational heritage and most likely caused by their safetycucking.
>>
reminder, don't let retards convince you fp8 is the same as q8.
>>
>>107961543
This was a thing for a while. All these people telling me that wan fp8 was the same as gguf and I kept thinking “no it’s fucking not”
>>
File: radiance_x32.jpg (190 KB, 1280x1280)
190 KB
190 KB JPG
>>107961500 >>107961502
if fp8[-scaled/-mixed/...] is a close match or not always depends on the model. if it already is, it's not even like q8 can just be much better.

but after people had some time, it could as well be that the GGUF Q4/6 IQ or w/e also is relatively close.

there's no easy generalization apart from gguf q8 almost always being pretty decent as a quant of bf16/fp16 or even sometimes fp32 models -> a random person on hf dropping a gguf q8 conversion ten minutes after publication has a good chance.

but it doesn't really determine the quality of everything else, e.g. whether a fp8scaled also is nearly perfect - never mind after people had weeks to try.
>>
>>107961542
>Zit and qwen can both follow prompt to the point of extreme rigidit

Yeah but https://z.ai/blog/glm-image

This model shows it can be done for consumer-sized models. I'm not asking for an 80B autoregressive model. And the benefits are way more than just following prompts well. It's following prompts in the same a way a human would understand text, E.G. "don't do or include this" and the model painlessly does it, no need to even attempt engineering the prompt or being worried about mistakes for more complex prompts.
>>
best zit cfg settings?
>>
File: ComfyUI_temp_oimpv_00011_.jpg (679 KB, 1200x2144)
679 KB
679 KB JPG
107960931
Sorry, was a bit busy. Here's some examples (and workflows)
https://files.catbox.moe/7ykdds.png
https://files.catbox.moe/pijg2x.png
https://files.catbox.moe/ex0d7n.png
https://files.catbox.moe/64nssa.png
>>
File: radiance_x32.jpg (243 KB, 1280x1280)
243 KB
243 KB JPG
>>107961527
not sure we've hit the limit on diffusion models even with the prompt difficulty of WAN2.2, Flux or Qwen though?

actually, what autoregressive model do you think of and how many prompt tokens more can it actually handle on images? what if we adjust for the ram/compute required, is it even better?
>>
>>107961589
whoops meant for
>>107960931
>>
So for LTX-2 gguf q8 is pretty much the same speed as the fp8.
>>
>>107961542
Muh safety

Its 2026 - have people still not understood the connection between SFT/RL and prompt following/seed variance?

your mom should try RL on you, then you'll understand
>>
I'm so happy the thread is normal for once in a long while
>>
File: edit_27.jpg (1.55 MB, 1168x1760)
1.55 MB
1.55 MB JPG
>>
>>107961607
Is it normal to bake threads half an hour before the last one hits 300 posts?
>>
>>107961607
I think the janny went full scorched earth. A few people ate bans for simply existing in the schizo bakes.
>>
>>107961643
the threads were unusable for the past few weeks, so good riddance
>>
File: radiance_x32.jpg (214 KB, 1280x1280)
214 KB
214 KB JPG
>>
>>107961671
bro that's a buttplug
>>
>>107961633
its the new normal
>>
File: radiance_x32.jpg (229 KB, 1280x1280)
229 KB
229 KB JPG
>>107961686
a dragon tail, possibly natural, bionic, or attached to underwear
>>
>>107961695
I will accept this explanation. maybe I am the bigot for not knowing dragon anatomy
>>
>>107961595
Thanks.
>>
File: radiance_x32.jpg (189 KB, 1280x1280)
189 KB
189 KB JPG
Looks like maybe a team from NYU figured out how to fuurther improve on VAE (they say they tested against the current Flux VAE as well as on various LLM)

https://github.com/ZitengWangNYU/Scale-RAE
https://huggingface.co/papers/2601.16208
>>
>>107961727
>Lecunny
funny he left meta and immediately started being useful
>>
>>107961607
"Anon" stopped screeching about the two links in OP he dislikes. Lets see how long he'll keep baking.
>>
File: Flux2-Klein_00164_.jpg (1.18 MB, 2426x2144)
1.18 MB
1.18 MB JPG
>>107961173
I use it for similar use-cases and it works fine with a face reference/anatomy reference.
>>
>>107961781
makes sense
>>
>just give me an inch please
kek
>>
File: radiance_x32.jpg (214 KB, 1280x1280)
214 KB
214 KB JPG
>>107961738
didn't know he had left meta. that is pretty funny.
>>
>>107961643
but this is the schizo bake?
>>
>>107961798
The pissbaby should get what he wants because he cried a lot?
>>
Everyone needs to just not acknowledge or mention the drama at all. It’s clear that talking about it creates a feedback loop.

I’m training a LoRA for f2k 9b right now. How about you?
>>
do you think anon forgot you baking seven threads because you didnt like OP? kek
>>
File: radiance_x32.jpg (206 KB, 1280x1280)
206 KB
206 KB JPG
>>
2 more days for ACEStep bros!
>>
what is this for?
https://huggingface.co/Lightricks/LTX-2-19b-IC-LoRA-Canny-Control
how do you use it?
>>
>>107961864
>Everyone needs to just not acknowledge or mention the drama at all. It’s clear that talking about it creates a feedback loop.
Do you remember how the anon who didn't like the links would mention them at the start of every thread? And then for the past couple of days/week would spam multiple new bakes at the same time really early without said links?
Do you ever wonder why there's not an anon doing the same thing but opposite right now? Any anon who's lurked within the past week witnessed how hard anon trolled and sperged because he wasn't getting his way with OP, and now it's stopped because he is satiated.
>>
File: radiance_x32.jpg (276 KB, 1280x1280)
276 KB
276 KB JPG
>>107961864
tried a few for 4b, mediocre success so far

>>107961918
IC:
>allowing fine-grained video-to-video control on top of a text-to-video, base model. It allows also the usage of an intial image for image-to-video
Canny:
https://en.wikipedia.org/wiki/Canny_edge_detector

Canny was already used from the old imagegen models onwards, you can essentially draw or extract the structure of a reference and then have imagegen draw something very structurally similar without obviously making EVERYTHING exactly the same.
>>
>>107961895
I got this feeling it’s been delayed a bit. They never respond if they are asked if it’s been delayed and make constant referrals to waiting for their “launch partners” being ready.


My Chinese culture alarm has been tripped
>>
>still trying
>>
>>107961946
>who the fuck cares about this shit?
the anon who was sperging out about them so hard hed bake a ton of extra threads at 270 obviously does lol
>>
>>107961938
Anon, why Radiance hands and feet looks like shit again? It becomes Lodeshit signature at this point...
>>
File: radiance_x32.jpg (168 KB, 1280x1280)
168 KB
168 KB JPG
>>107961918
>how do you use it?
templates has a workflow
>>
File: 445456454544.png (250 KB, 430x817)
250 KB
250 KB PNG
>>107961948
Kek you almost got me for a sec there. Release is imminent. 27th might mean tomorrow (Chinese 27th)
>>
Can Klein take openpose/depth/sketch without controlnet?
>>
>>107961982
No I’m serious. Read up in the chat
>>
File: 4545546544545.jpg (171 KB, 780x873)
171 KB
171 KB JPG
>>107961982
Local is gonna be eating good starting around midnight tomorrow. I have my audio prompts ready, even taking a few from Udio/Suno, do you anon?
>>
https://voca.ro/18b730hlCAOg
>>
File: 45564454564.jpg (189 KB, 879x873)
189 KB
189 KB JPG
>>107962005
The first SD 1.5 moment for audio. It's here.
>>
>>107960797
How does it compare with Z Image Turbo?
>>
File: 5654454564564.png (88 KB, 1128x468)
88 KB
88 KB PNG
>>107961993
Hm, from what he is saying that means they are prepared, but Comfy told them that the code is not ready.
>>
File: radiance_x32.jpg (165 KB, 1280x1280)
165 KB
165 KB JPG
>>107961967
it's the x32 version

and hands were not absolutely perfect before that either, just usually getting pretty good
>>
>>107962060
Honestly, I wouldn't mind being forced to use it on diffusers or whatever Gradio UI they have prepared, fuck Comfy.
>>
File: radiance_x32.jpg (231 KB, 1280x1280)
231 KB
231 KB JPG
>>
File: radiance_x32.jpg (135 KB, 1280x1280)
135 KB
135 KB JPG
>>
File: radiance_x32.jpg (194 KB, 1280x1280)
194 KB
194 KB JPG
>>107962060>>107961982
Are we sure that this doesn't just mean that new AceStep is releasing on the 27th but the Comfy devs indicate it might take a week for official support rather than support on release?
>>
File: radiance_x32.jpg (113 KB, 1280x1280)
113 KB
113 KB JPG
>>
>>107962060
My interpretation was that they were prepared to move the release date back a little to accommodate their partners (comfy).

Personally I don’t understand why a third parties preparedness should factor in at all but I have no doubt they plan to release what they say they do.
>>
File: 1.png (458 KB, 464x688)
458 KB
458 KB PNG
Grok wont let me generate bondage anymore, my life is ruined ...

Now that i got a taste i want more... how do i continue down this path of degeneracy?
>>
File: radiance_x32.png (3.17 MB, 1280x1280)
3.17 MB
3.17 MB PNG
>>107962144
iirc civitai has a lot of lora for illustrious/noob
>>
I'm having a lot of memory issues with the gguf nodes in comfy. my container just keeps getting OOM killed.
>>
File: radiance_x32.jpg (251 KB, 1280x1280)
251 KB
251 KB JPG
>>107962144
radiance x32 also learned some btw, but you will get more odd gens than on illustrious/noob with lora
>>
File: radiance_x32.jpg (466 KB, 1280x1280)
466 KB
466 KB JPG
>>107962192
try comfyui-multigpu - the distorch2 gguf loader. you can tell it to offload x GB to system RAM, it also shows the allocation in a nice table in the console when you gen
>>
>>107962192
man this is some WSL bullshit again. I think this might finally be the day I just install linux on my gaming PC.
>>
>>107962206
Why are you using wsl to run comfy?
>>
>>107962204
that's the thing tho. I don't want it to offload. not exactly sure the source of the problem in comfy. but basically it won't free up VRAM when it needs to and it'll start filling up the shared ram even tho I disabled the feature on the GPU.
>>
>>107962213
because that's where my 3090 is.
>>
File: radiance_x32.jpg (194 KB, 1280x1280)
194 KB
194 KB JPG
>>107962221
ah. well, i don't think I know what's going on in WSL then.

>>107962206
> I think this might finally be the day I just install linux on my gaming PC.
certainly works better for inference/training for me
>>
>>107961738
He always was quite the top expert, but I'm not sure he had any power at the end at Meta.
>>
>>107962142
It's for maximum impact, don't underestimate how that stuff is important if immediately upon release, people can play with it on the biggest UIs.
>>
>>107962206
Migrating my LLM/SD stuff to linux (headless) has been the best decision I made, it's so much more stable and memory efficient than windows.
>>
>>107961727
God just link directly to the paper, why do I have to click twice because of you?
>>
File: joy.png (16 KB, 505x247)
16 KB
16 KB PNG
What options in the joycaption preset are important? Not for an artstyle or artist lora. Should I even bother mentioning the artstyle since the models are agnostic anyway?
>>
>>107962262
He didn't. The fact that he had to answer to Wang in the new power structure Zuck created says everything about that. Zuck basically went all in on transformers because he got embarrassed with Llama 4. With that mindset, research on "alternatives" is not on his mind and FAIR got pretty much put on the chopping block. I don't even know why it is still a thing when he has de-prioritized it to the point of losing a world class ML researcher over it.
>>
File: radiance_x32-666.jpg (155 KB, 1280x1280)
155 KB
155 KB JPG
>>107962329
i knew you would be uninterested in models and software and wanted you to faint from exhaustion with that one click
>>
>>107962340
Write a long detailed description for this image in 200 words or less. Do NOT include information about people/characters that cannot be changed (like ethnicity, gender, etc), but do still include changeable attributes (like hair style). Include information about lighting. Include information about camera angle. Do NOT mention the image's resolution. Do NOT use any ambiguous language. Mention whether the image depicts an extreme close-up, close-up, medium close-up, medium shot, cowboy shot, medium wide shot, wide shot, or extreme wide shot. Your response will be used by a text-to-image model, so avoid useless meta phrases like “This image shows…”, "You are looking at...", etc.

my current picks
>>
>>107962376
Why are your gens so slopped?
>>
>>107962340
Is that guy still alive?
Is he doing anything? Did he go bankrupt?
>>
>>107962381
Nta but he’s using a vaeless fine tune of chroma. Results are as you’d expect
>>
>>107962388
'Two more weeks' has taken on a profound meaning for me; it affirms life and denies it at the same time, an introspective reassessment of my own state of being, more than an existential philosophy that permeates every aspect of my life.

So here's to two more weeks for all of you,
Amen
>>
>>107962409
z-image base's strongest warrior
>>
>>107962388
SD1.5 vibe, I kinda like it.
>>
>>107961864
which tool/settings?
>>
>>107962447
he has no vision, only ideas
>>
>>107962447
>constant inability to see projects through to the finish
>begs for money to continue to flimflam on whatever his obsession this month is and then abandon it
>complains about the license of the obvious models to train on because he very clearly will turn around and monetize his shit in the extremely unlikely event he maintains his attention span long enough to make something viable.
>>
File: radiance_x32.jpg (268 KB, 1280x1280)
268 KB
268 KB JPG
>>107962381
sudden increase in magic sauce
>>
>>107962447
radiance is fucking garbage
>i-it could be good
BUT IT ISNT RIGHT NOW
we have the guy spamming his radiance.jpgs I dont even know why, its so fucking bad
>>
i come in peace
>>
>>107962474
never understood the obsession with loisense. if you need money to train the model, then just collect as non-profit. unless of course youre pocketing it behind everyone's back. noncommercial models would actually be an improvement, then we wouldn't have so many civitbrowns and the garbage buzz ecosystem would collapse.
>>
>>107962447
It wouldn't be such a big deal if he wasn't one of the few people who had the resources to train full local finetunes independently. Just seems like a huge waste of resources.
>>
File: radiance_x32.jpg (230 KB, 1280x1280)
230 KB
230 KB JPG
>>107962497
even every linux distro needs to know its loisenses, it is how it is
>>
File: radiance_x32.jpg (273 KB, 1280x1280)
273 KB
273 KB JPG
>>
File: 1740611664562445.png (2.88 MB, 1568x992)
2.88 MB
2.88 MB PNG
>>
File: radiance_x32.jpg (262 KB, 1280x1280)
262 KB
262 KB JPG
>>107962491
sort-of a stargate?
>>
>>107962547
sure, something like that
>>
File: radiance_x32.jpg (233 KB, 1280x1280)
233 KB
233 KB JPG
>>107962556
so it was a random unprompted thing the model added?
>>
>>107962560
idk, this was what Z got:

degraded photographic emulsion, uneven color fading, chemical staining artifacts, low tonal separation, soft focus with highlight bloom, grain clumping, subtle registration errors, aged and timeworn image texture.

miniature photography aesthetic, shallow depth of field at close focus, pronounced background bokeh, narrow plane of focus, tilt-shift–like focus falloff, compressed perspective, fine surface detail emphasized, tabletop-scale lighting, diorama-like spatial cues, realistic materials at small scale.scale-model resin figurine style, adult female twisting in dynamic pose with head tilted up, long curtained hair partially veiling empty eyes (eye emphasis), pained expression, floor-length coat with strategic fading, monastery carved into a distant floating peak, dusty pathway at her feet lashed by temporal storms, out-of-reach glowing embers drifting overhead, eldritch aura, phoenix-rising-from-ashes motif etched into coat hems, geometric patterns and symbols subtly inscribed across stone and fabric, safe for work, no nudity
Safe for work, no nudity, no text.


so who the fuck knows? lmao
>>
>>107962447
>yes some of lodestone's models are really bad

Some? Where are good ones?
>>
File: Flux2-Klein_00009_.jpg (229 KB, 752x976)
229 KB
229 KB JPG
>>
>>107962636
Finally, square donut.
>>
>instruct Klein to enhance the photo quality and make it cinematic
>just burns the photo instead
Am I missing something here
>>
>>107962596
>yes S̶o̶m̶e̶ of lodestone's models are really bad
>>
>>107962645
>posts vague shit
>get shit in return
>>
File: 1753153133381420.png (2.47 MB, 1248x1248)
2.47 MB
2.47 MB PNG
>>
>>107962652
I figured it would at least know that enhancing photos don't mean maxing out the saturation and turning on a piss filter
>>
>>
File: 1738564773333045.png (2.53 MB, 1536x1024)
2.53 MB
2.53 MB PNG
>>
>>107962658
Who in /ldg gives a shit about furfags degen model ffs...
>>
File: radiance_x32.jpg (127 KB, 1280x1280)
127 KB
127 KB JPG
>>107962578
ty. looks like random chance to me, but it did look good.
>>
>>107962716
:)
>>
>>107961989
You can try feeding it to the ksampler at high denoise.
>>
File: Flux2-Klein_00047_.jpg (248 KB, 704x1024)
248 KB
248 KB JPG
>>
>>
File: radiance_x32.jpg (102 KB, 1280x1280)
102 KB
102 KB JPG
>>
>>107962658
How do I force Klein to output a certain resolution without resizing my original input image?
>>
File: radiance_x32.jpg (202 KB, 1280x1280)
202 KB
202 KB JPG
>>107962804
the width/height for that is separate on the flux2scheduler node isn't it?
>>
>>107962658
HLL anons lora hard carried that model
>>
>>
>>107962804
>>
>change location of my models
>go back to old workflow and change every model and lora back to original 100%
>the same seed now produces different results

I've checked this shit 15 times over

REEEEEEEEEEEEEEEEEEEEE
>>
>>107962816
Nta but this doesn't do anything. it will force your resolution to be divisible by 16 no matter what. If you input 1920x1080 it will change 1080 to 1072 instead.
>>
File: radiance_x32.jpg (230 KB, 1280x1280)
230 KB
230 KB JPG
>>107962826
are you sure there wasn't also a python package update on startup or something like that?

anyhow most likely that's your new seeds now
>>
>>107962812
This is it, nice
>>
File: 1538714552646.jpg (57 KB, 960x956)
57 KB
57 KB JPG
Can you run the qwen tts in comfy?
>>
>>107962832
Nta but I’m pretty sure anything other than numbers divisible by what the model expects will return an error.
>>
>>107962832
Just resize the final image back to the orig.
>>
File: radiance_x32.jpg (199 KB, 1280x1280)
199 KB
199 KB JPG
>>107962847
excellent

>>107962853
there are like 7 extension repositories for this in the manager, yes
>>
>>107962871
Or just make a copy of the original image and crop it properly before feeding it to the model
>>
>>107962853
yes, there'll always be a saar who'll vibecode new stuff into comfy within the first day of the release, just google it
>>
>>107962853
I tested it out and it's kind of shit without the emotion control for voice cloning but they said it will be implemented soon. Might be another Chinese culture situation.
>>
>>107962904
What's the best TTS model right now? I heard a lot of praise for indexTTS but got bad results that sounded nothing like the reference audio. Vibevoice is good by itself but has basically no control over the output, and cranking cfg breaks the output very quickly, especially on longer audios
>>
>>
>>107962926
probably qwen or index just based on tts, no voice cloning
>>
>>107962926
Probably pre-cucked VibeVoice before Microsoft rereleased it with censorship. The lack of control sucks but its voice cloning is really good when it actually works. The original model is still on HF
>>
>>107962727
Lumi :33333333 *pats your head*
>>
>>107962963
I honestly don't think it's better than qwen-tts, vv just gets placebo benefits from being a big model and getting pulled by microsoft for being "dangerous"
>>
File: Flux2-Klein_00074_.jpg (200 KB, 704x1024)
200 KB
200 KB JPG
>>107962926
I like Chatterbox turbo, it can use paralinguistic tags. I hope they tune it more, it's fun when it works.

https://voca.ro/1nl7h8KB7kc6

 [clear throat], [sigh], [shush], [cough], [groan], [sniff], [gasp], [chuckle], [laugh] 
>>
https://voca.ro/1aUb0IGqqqUN
>robotic, mechanical, glitch, young female voice
Does the model even know non-human sounds? Do I have to boomer prompt this shit too?
>>
File: radiance_x32.jpg (273 KB, 1280x1280)
273 KB
273 KB JPG
>>107962930
cool that it chose to add text below her feet

>>107963052
i can hear the appeal. i wonder how many commercial tts are afraid of [moan]
>>
File: ComfyUI_00029_.png (1.65 MB, 1024x1024)
1.65 MB
1.65 MB PNG
>>
>>107960649
you forgot this jules
https://rentry.org/animanon
>>
Like fucking clockwork. Guess I’ll close this tab until tomorrow.
>>
Will Wan2GP work with an RX 6600?
>>
>>107963174
the other autist usually wakes up in 1~ hour, until then we're fine.
>>
>>107962826
It's the pytorch version + changes to the UI
>>
>>107963052
I mean, emotional tags are awesome but the audio itself is so robotic and soulless
>>
File: radiance_x32.jpg (158 KB, 1280x1280)
158 KB
158 KB JPG
>>
>>107963219
Have you tried second passing the radiance gens? On normal chroma it wipes most of the scuff.
>>
Do you still believe that they're releasing the z-image base model?
>>
>>107963181
Come on, niggers.
>>
>>107963233
I believe in Chinese culture. That’s my answer.
>>
Any tool or anyone working on converting recent models into NVFP4?

Am using it for Klein9b and result are very satisfying.
>>
File: ComfyUI_00030_.png (1.74 MB, 1024x1024)
1.74 MB
1.74 MB PNG
>>
@107963277
@107963290
>lubimiv
>>
>>107963277
you know how every General thread has at least two actual autists who have very particular ideas about the exact format the OP *must* take and sperg out when it deviates from their ideal in the slightest? Ran is one of those autists.
Basically it's background noise, ignore it
>>
File: radiance_x32.jpg (228 KB, 1280x1280)
228 KB
228 KB JPG
>>107963230
no, not yet. it's also known
>the x32 model is still training atm, expect some squiggles on the details part of the image!
>>
>uninvited failed dev spergout week 105 day 2 (weekend melty)
>>
>>107963303
>expect some squiggles on the details part of the image!

Like any others furfag models...
>>
>>107963242
>>107963261
Here: https://rentry.org/ranfaggot
>>
>panic baking on post 235
>>
File: ranfaggot.png (1.13 MB, 1216x832)
1.13 MB
1.13 MB PNG
>>107963327
kys nigger no one asked
>>
>>107963327
he baked very early with his UI in the OP, now he is going to spam his proxies to samefag about nothing to fill the thread.
>>
>>107963326
I don't give a shit about this nigger. My question is basic, I did not find anything in the archives regarding it.
>>
>>107963221
Just missing the hipster mustache
>>
kek ran is having a melty, you love to see it
>>
File: radiance_x32.jpg (191 KB, 1280x1280)
191 KB
191 KB JPG
>>
>>107963233
ani said they will probably release it but the it will be censored and not the same as zit. also base loras won't work on turbo.
>>
File: radiance_x32.jpg (203 KB, 1280x1280)
203 KB
203 KB JPG
>>107963320
it's a new thing happening because of the radiance x32 bump tho
>>
reminder to share this links for all the newfags anons
https://rentry.org/ranfaggot
https://rentry.org/ranfaggot
https://rentry.org/ranfaggot
>>
File: radiance_x32.jpg (365 KB, 1280x1280)
365 KB
365 KB JPG
>>
File: radiance_x32.jpg (173 KB, 1280x1280)
173 KB
173 KB JPG
>>107963393
> it will be censored
hope that's wrong. or that it mainly means no winnie together with president xi.
>>
>250
>>
/ADT/ IS RAIDING AGAIN.
SAME THING THEY DID TO /HDG/ AND /EDG/
USE YOUR 4CHAN X FILTERS UNTIL MODS STEP IN.
>>
>>107963316
>>107963316
>>107963316
when ready
>>
This honestly just feels so frustrating. Every days for weeks now at this time the whole topic just gets derailed by these two. I just wanna talk about diffusion models man
>>
INFO ABOUT /ADT/ RAIDERS:
They raid by spamming fake bakes and botting the thread

RECOMMENDATION:
Stick to tech discussion.
Where there is tech and discussion, there is /LDG/.

DO NOT INTERACT WITH /ADT/ RAIDERS, THEY WANT YOU TO DERAIL FROM TECH DISCUSSION
>>
>>107963277
A made-up nemesis, only brought up as a smokescreen by the actual spammer who constantly samefags and shills Anistudio.
>>
File: 1755539815565962.jpg (2.07 MB, 2016x1152)
2.07 MB
2.07 MB JPG
>>
>>107963504
you're supposed to take your meds
>>
>>107963481
just talk elsewhere, i mean it, there reason you won't find a place is because you are too used to this one; this schizo actually made me find a better place a couple months ago.
>>107963487
i get it, no one is posting, so you are once again trying to convince anons to just act business as normal despite most of the thread being a clueless anon and your proxy bot spam. after all what is the point of shilling if there is no one to shill to?
btw he did the same thing on adt claiming to be ldg:
https://desuarchive.org/g/thread/106262936/#106286089
>>
>>107963481
It is not frustrating.
Turn on the 4chan X filter.
Do not let /ADT/ win.
They are a dead end, failed general and want every general to turn into them.
>>
imagine trying to shit on our cute sister general
mean :(
>>
>>107963481
everyone can see your bullshit tRan
https://rentry.org/ranfaggot
>>
>>107963504
You sound obsessed.
>>
File: images(1).jpg (33 KB, 260x194)
33 KB
33 KB JPG
Example of a derail message: this one >>107963530
> so you are once again trying to convince anons to just act business as normal
He wants to derail the thread
>and your proxy bot spam.
He wants to gaslight

TURN ON FILTERS
DO NOT LET /ADT/ WIN
STICK TO TECH ON TOPIC DISCUSSION
>>
>>107963558
those posts up the schizophrenia levels by 10x, if anything they derail the thread even harder. people who w ant to will filter this garbage themselves
>>
>>107963564
You have the mindset of a rapist
>>
>>107963548
Stop gaslighting,
/adt/ botted /ldg/ /hdg/ and /edg/
The best you can do is to stop this botting nonsence, nobody will go to your general
>>
>>107963558
i think it's funny btw, you know how dead the thread actually is and you are now desperate to get people to post on the thread by gaslighting them, after you made the thread unusable for months to shill your UI.
>>
>>107963569
you can filter me, no problem
/adt/ raider gaslighting again
>>
>guise guise it's totally not trani (who is not me desu)
>it's the anime gooners! get em!
>have you heared about anistudio desu?
>>
File: ComfyUI_00031_.png (1.72 MB, 1024x1024)
1.72 MB
1.72 MB PNG
>>
File: lul.png (2 KB, 186x46)
2 KB
2 KB PNG
>>
The fries are waiting julien
>>
>>107963548
>cute sister general
Another example of an /adt/ gaslighitng
IMPORTANT: ONLY AN /ADT/ROON CALL THEM SELVES AS GROWN MAN "CUTE LITTLE SISTER"
The best you know your enemy the better.

#ban/adt/
>>
>>107963612
uhg, deplorable pedophiles those anons from /adt/
>>
File: ComfyUI_00032_.png (1.57 MB, 1024x1024)
1.57 MB
1.57 MB PNG
>>
File: ComfyUI_00033_.png (1.33 MB, 1024x1024)
1.33 MB
1.33 MB PNG
>>
>>107963651
>>107963693
>/adt/ can’t step outside its own bubble and genuinely thinks posting those pics is “trolling.”
The only one with a phobia of adult women and anything 3D is you.
>>
>>107963601
catbox?
>>
File: thx.png (1.63 MB, 864x1152)
1.63 MB
1.63 MB PNG
>>
File: ComfyUI_00034_.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>107963749
sorry mate, the way I process my images removes the comfyui workflow. Anyways it was done with ZIT and with digital abyss lora https://civitai.com/models/844787/digital-abyss-fluxzimage
>>
>>107963752
Hunyan3?
>>
Jannies of frenship
>>
File: ComfyUI_00035_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>
>got so scared he tried to bake twice at 250
>still projecting and regurgitating posts originally directed at him
holy keeeeeekkkkkk
>>
File: zit.png (3.1 MB, 1296x1728)
3.1 MB
3.1 MB PNG
>>
>>107963812
explain?
>>
>now mikutroon's gonna spam his epic george floyd edits
my god please have mercy on this general
>>
thanks mods, i had to verify my e-mail to post this, but it's 100% worth it.
>>
File: ComfyUI_00036_.png (1.57 MB, 1024x1024)
1.57 MB
1.57 MB PNG
>>
>>107963856
>mommy milkers
Comfortable
>>
When ready

>>107963933
>>107963933
>>107963933
>>
>>107963941
>274
nigger
>>
>>107963941
Thank you for the proper bake
>>
File: ComfyUI_00037_.png (1.65 MB, 1024x1024)
1.65 MB
1.65 MB PNG
>>
File: 1769164979964402.png (8 KB, 413x400)
8 KB
8 KB PNG
>>107963941
>>107963969
>le when ready no rush teehee (before bump limit)
>thank you for le bake oni-chan!!! hahah we're baking!!!! it's a bake off!
>Blessed thread of frenship durr!!!
>yay thank you a heckin thready!! epic bakey wakey!



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.