[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (1.02 MB, 3264x3264)
1.02 MB
1.02 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>101957363

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/g/sdg
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
>>101959699
>I made it into the collage
My life is now complete
>>
>>101959721
why am I not in the collage tho
>>
>>101959761
what happened to your fingers bro
>>
>>101959761
Why are you samefagging in the other thread schizo?
>>
File: ComfyUI_20685_.png (1.59 MB, 1280x760)
1.59 MB
1.59 MB PNG
>>
>>Maintain thread quality
>https://rentry.org/debo
>>
The new rf variant for dpmpp_2s_ancestral sampler in comfy is pretty good. Of course it's also half the speed of euler, which kinda sucks.
>>
>>101959788
>Posts malware in his news posts for weeks and tries to shift the blame onto others for calling him out for it, under a changed filename but forgets to change his style
god damn
>>
File: 1699650368005503.jpg (100 KB, 1280x720)
100 KB
100 KB JPG
>there's a webcam node
Does this work with animatediff?
>>
>>101959725
funnily, adding tonemap made the speed inference faster, went from 7s/it to 5s/it, more speed is always welcomed
>>
>>101959799
If each step takes twice the time, then it needs half the steps overall
>>
>>101959823
ok nvm it went back to 7s/it :(
>>
>>101959823
There was a reddit link with a bunch of guides, including using tonemap. Is it in the OP somewhere?
>>
how do I change the guidance number in forge
>>
>>101959721
I made it 2 times in a row, yeah baby!
>>
this shit https://civitai.com/models/657422/flux-paizuri ALMOST works https://files.catbox.moe/j0ml4n.png
>>
Don't skip leg day, fellas.
https://civitai.com/images/23837590
>>
>>101959836
this?
https://new.reddit.com/user/Total-Resort-3120/submitted/
>>
>>101959836
https://old.reddit.com/user/Total-Resort-3120/submitted/

One day this'll be in a rentry and I wont have to link to a reddit account
>>
>>101959825
Maybe the lesson learned from this is "just double your steps for better results".
>>
>>101959799
try the sampler called "supreme" I think it might be the best
https://github.com/Clybius/ComfyUI-Extra-Samplers
>>
>>101959799
>Of course it's also half the speed of euler, which kinda sucks.
that's why I use Deis, it's better than euler but the same speed
>>
>>101959850
SD3 lora, porn included
>>
>>101959855
>>101959856
Thanks.
>>
>>101959862
Is it slower?
>>
File: the-office-ryan-howard.gif (2.66 MB, 498x280)
2.66 MB
2.66 MB GIF
>>101959862
>noted to my google chrome bookmark, so much shit to learn goddam
>>
>>101959888
I don't think so but I'll have to double check.
>>
File: ComfyUI_20681_.png (1.57 MB, 1280x760)
1.57 MB
1.57 MB PNG
>>
File: 2024-08-18_00241_.png (1.65 MB, 1280x720)
1.65 MB
1.65 MB PNG
>>
File: 1414559388.png (1.47 MB, 1152x896)
1.47 MB
1.47 MB PNG
>>
>>101959862
>>101959875
>>101959895
So much to learn. I exclusively use Euler A lately since i've been fixated on Pony, would love a faster sampler that achieves the same whatever the fuck Pony needs from Euler..
>>
>>101959888
okay my bad it is 50% slower than euler.
(euler 14 seconds, supreme 21 seconds)
>>
File: 00156-3215099015.png (1.02 MB, 832x1216)
1.02 MB
1.02 MB PNG
The homicide detectives looked inside her phone, among the thousands of selfie images she had taken the last one she took stood out to them.

A vital clue to the case!
>>
>>101959953
>I exclusively use Euler A lately since i've been fixated on Pony,
Euler A doesn't work on Flux... it was my favorite sampler
>>
>>101959973
What did you settle on? Also schnell or dev?
>>
>>101959725
>Also tonemap seems to be compatible and adds more detail on top of the PAG gen. This is at
tonemap 0.42.
It's also compatible with Dynamic Thresholding, noice!
>>
>>101959973
Good, flux is forcing forward progress for better faster stronger samplers. That makes me even more optimistic for when 2D is further improved.
>>
File: 1704128719535122.png (604 KB, 594x610)
604 KB
604 KB PNG
https://x.com/realDonaldTrump/status/1825138139502878806
fluxgods...
>>
>>101959979
dev
>>
Why has Comfy have no option to pause or save the queue so you can do something else and resume later?
>>
>this is what that chicken gen looks like in diffusion toolkit
whaaaaaaaaaaaattt thee fuu-我爱北京天安门,
天安门上太阳升;
伟大领袖毛主席,
指引我们向前进。
>>
>>101960002
at least, if orange man will be elected, he won't for sure make AI deepfake illegal kek
>>
>>101960006
But what sampler did you settle on?
>>101960002
American politics is so weird for someone from Europe, where an actually nuanced spectrum exists.
>>
>>101960002
omg someone could think this was real
>>
File: 2024-08-18_00247_.png (1.55 MB, 1280x720)
1.55 MB
1.55 MB PNG
>>
File: 00158-3215099017.png (1.24 MB, 832x1216)
1.24 MB
1.24 MB PNG
Another blonde victim which matched the previous victim's description and job (influencer) also had a similar selfie photo found on her phone, could this be the beginnings of a serial killer case?
>>
>>101960021
>But what sampler did you settle on?
oh, Euler, and from time to time Deis, the others are or else shit or else too slow
>>
>>101960021
To help explain it better, american politics is like choosing your football team, once you choose sides the nuances don't matter, you win or lose. Everyone else is the ENEMY.
>>
File: 4183855434.png (1.53 MB, 1152x896)
1.53 MB
1.53 MB PNG
>>
>>101960021
yes in europe you get to pick from 50 shades of socialism instead...
>>
>>101960019
He did have a moment where he was targeting violent video games...so you never know.
>>
>>101960117
Not the hecking moment?!?! He is fascism y'alls
>>
File: ComfyUI_temp_zhnam_00053_.png (2.31 MB, 1024x1400)
2.31 MB
2.31 MB PNG
>>
>>101960110
This was my point. Everything in the US looks like different flavors of neoliberalism (with or without religion) from here. Off topic tho.
>>
>>101960008
I thought of a pretty good idea recently where you'd have two queues: a queue for exploration and a 2nd queue for upscales.
So whenever you find something good, you could queue an upscale of it, but this upscale queue would only run during downtime (like when you're writing prompts or alt-tabbed).

It would basically always be running automatically during any downtime. And so it should also pause/interrupt whenever you run your main exploration queue.
>>
File: ComfyUI_20740_.png (1.03 MB, 1280x720)
1.03 MB
1.03 MB PNG
>>
File: 2024-08-18_00256_.png (3.56 MB, 1920x1080)
3.56 MB
3.56 MB PNG
>>
File: 00160-3311128505.png (1.03 MB, 832x1216)
1.03 MB
1.03 MB PNG
>>101960031
>>101959966
The third incident ended with the death of the killer by the hands of his victim, this case ended without becoming a serial killing case which the higher up were very happy about.


>>101960177
Very nice
>>
>>101960221
nasty coom cave
>>
>>101960243
So he wore dreadlocks to try and decieve the detectives into thinking he was black.
>>
>>101959875
what settings do you use for DEIS? It's waaaaaaaaaaaay different from Euler. All my gens are weird and kinda deepfried with it, regardless of cfg scale.
>>
n slur
>>
>>101960297
>what settings do you use for DEIS?
what do you mean? I just chose Deis and I don't have other options
>>
>>101960002
Based.
>>
File: srfhgybsrfhgybsr.png (604 KB, 429x867)
604 KB
604 KB PNG
>>101960322
>I don't have other options
>>
>>
>>101960335
deis normal 30+ steps I guess, beta + deis gave me weird shit yeah
https://reddit.com/r/StableDiffusion/comments/1em7zy3/testing_the_samplers_schedulers_with_a_xy_plot/
>>
File: file.png (2.31 MB, 1024x1024)
2.31 MB
2.31 MB PNG
>>
File: Capture.jpg (279 KB, 1939x1444)
279 KB
279 KB JPG
Ok, seems like PAG can be tammed, you have to crank up the scale to fucking 30 at least kek
>>
File: wtf.png (867 KB, 775x887)
867 KB
867 KB PNG
>>101960348
I think a lot of my samplers might just be broken, given i tried those identical settings to start with and it wasn't great.
DPMPP3MSDE has been broken since i started using comfyUI, always worked in forge.
>>
>>
>>101960373
anon, I hope you're adding Dynamic Threshold or Tonemap in addition to CFG = 6 right?
>>
>>101960277
You'd think so but in fact it was a "skinny whi boi" white body-suit!
>>
>>101960393
never been told i needed either of those things when i asked a few days ago
show me a node setup so i can do it right pls
>>
>>101960407
Here's a tutorial for Dynamic Thresholding (good for anime, not so much for realistic shit)
https://reddit.com/r/StableDiffusion/comments/1ekgiw6/heres_a_hack_to_make_flux_better_at_prompt/

And here's a tutorial for Tonemap (Great for realistic shit but not as good as Dynamic Thresholding for anime)
https://reddit.com/r/StableDiffusion/comments/1estj69/remove_the_blur_on_photos_with_tonemap_an/
>>
File: file.png (63 KB, 769x658)
63 KB
63 KB PNG
https://github.com/comfyanonymous/ComfyUI/discussions/4457

You guys are downloading the latest standalone release, aren't you anons? You only have yourself to blame if performance is bad.
>>
>>101960429
well in this case its a funny amalgamation of Pony re-utilizing a realistic style, usually Euler A is fine but my old better gens on forge were with DPMPP3MSDE, with a higher cfg scale. Normally i work with 3.0/3.5 and Euler A.
Ill have to check that out for the normal workflow.

Now im back to fixated on why 3MSDE isnt working.
>>
>>101960402
That's what they thought, but the autopsy revealed a Jamaican man underneath the white body suit.
>>
>>101960443
he says download the latest standalone AND THEN UDPATE.
This is just so people are using a clean repo when reporting issues.
>>
File: ComfyUI_00748_.png (863 KB, 1024x1024)
863 KB
863 KB PNG
>>
if you have enough VRAM to fit the flux fp8 checkpoint version, would nf4 or q4 or whatever still offer better throughput? got 2.24s/it on the fp8 checkpoint version. i also have a q8 quant which at first glance has a lot worse quality than the fp8 checkpoint and is slower too
>>
File: ComfyUI_Flux_119.png (1.73 MB, 1344x768)
1.73 MB
1.73 MB PNG
https://civitai.com/models/659438/xenomorph-egg-flux
>>
File: 2024-08-18_00257_.png (3.1 MB, 1080x1920)
3.1 MB
3.1 MB PNG
>>101960262
ya Beksinski lora makes good slime
>>
>>101960491
Kek
>>
>>101960491
LMAO
>>
>>101960483
>if you have enough VRAM to fit the flux fp8 checkpoint version, would nf4 or q4 or whatever still offer better throughput?
why not go for Q5 or Q6 instead?
>>
Have you found a consistent way to prompt for young girls? If I go with "young girl", they're often WAY too young. And I if say something like "20 year old girl", or worse yet, "woman", they look in their 30-40s.
>>
>>101960491
lmao
>>
>>101960528
Try Young woman
>>
File: 1199545413.jpg (2.05 MB, 2304x1792)
2.05 MB
2.05 MB JPG
>>
>>101960511
ok sure, but can i expect better compute throughput from a reduced model or only if it helps meet VRAM compatibility?
>>
>>101960491
ok, now that's funny.
>>
File: ComfyUI_00622_.png (1.65 MB, 1024x1024)
1.65 MB
1.65 MB PNG
>>
>>101960539
Huh. I'm so used to SD1 understanding age ranges so well that I had not thought about this combo.
>>
>>101960574
Flux sure does look good, but it's spatial understanding is garbage.
>>
File: ComfyUI_00570_.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>
File: FLUX_00024_.png (1.35 MB, 896x1152)
1.35 MB
1.35 MB PNG
>>
>>101960594
I know, it's like the last one I tried too.
>>
File: 00122-388835679.png (1.33 MB, 960x1280)
1.33 MB
1.33 MB PNG
>>
File: 00168-427747338.png (1.04 MB, 832x1216)
1.04 MB
1.04 MB PNG
>>
File: ComfyUI8.18.2024__00102_.png (2.97 MB, 1248x1824)
2.97 MB
2.97 MB PNG
>>
File: ComfyUI8.17.2024__00140_.png (1.7 MB, 1248x1824)
1.7 MB
1.7 MB PNG
>>
File: grid-0342.jpg (286 KB, 2304x1792)
286 KB
286 KB JPG
>>
File: grid-0344.jpg (287 KB, 2304x1792)
287 KB
287 KB JPG
>>
First time lora training attempt, what does "steps" actually mean? If I'm training 5000 steps @ rank 64 on, say, 20k images with a batch size of 1, does that mean it won't even look at 15k of those? I guess raising batch size could help but I've heard raising the batch size lowers quality...
If it matters, I'm not trying to capture a general style, but rather specific items and their associated details. Currently Flux just generates a melted/distorted mess when trying to generate those details.
>>
File: kl.png (1.32 MB, 832x1216)
1.32 MB
1.32 MB PNG
>>
>>101960741
>>101960753
what's this guy's problem?
>>
>>101960528
yeah its tricky. flux extremely sensitive to prompt. hyper-sensitive. might end up with a toddler. "cute naked (cough) girl on a bed", I got a baby.
>>101960719
sick
>>
File: file.png (339 KB, 1834x793)
339 KB
339 KB PNG
Okay, I'm doing some tests to see what Comfy parameters give me the best performance, and I set up this simple workflow. But something weird seems to have happened here. What am I missing? Why is there a dude here instead of the bottle?
>>101960767
Some things about this model are very German.
>>
File: 00171-205423702.png (841 KB, 832x1216)
841 KB
841 KB PNG
thanks for OOT lora anaon
>>
>>101960612
Flux is BFL's SD 1.5 moment, and it only gets better from here for future iterations of this stuff. And it's actually being done by a competent team for once.
>>
>>101960794
I used the Force/Set Device to move the VAE and CLIP to the CPU and it's producing a bottle instead. How bizzarre.
>>
>>101960766
Working at Mcdonalds for so long does things to a peron.
>>
https://youtu.be/PrPY_USgngg?si=2upQLT9Es6loGyLd&t=202

uh-oh, niko is being a disgusting ai-bro again and stealing artists jobs
>>
File: file.png (531 KB, 512x512)
531 KB
531 KB PNG
>>101960818
What does "BFL" mean?
>>
Can i run flux on 1080ti?
>>
>>101960835
Big Fucking Loser

or maybe Black Forest Labs instead
>>
>>101960835
Brown Fluid Lavatory
>>
>>101960845
Ah, right. I'm only vaguely aware of things. They used to work on the original SD, right?
>only gets better from here
That's exciting. I might end up buying a new PSU and a 3090 towards year's end.
>>
>>101960794
you are missing the model sampling flux node or am I blind? don't ignore it. thats why it doesn't pick up your idea. also the guidance is an important value, too high will of course fry the image but it needs to be constantly adjusted.
what are you tring to imply about us here? son.
>>
File: file.png (485 KB, 2082x859)
485 KB
485 KB PNG
>>101960873
>model sampling flux node
I just use the default KSampler. It works (pic related). But I don't understand why it rendered a man instead of the bottle in that first picture. Other than moving CLIP and VAE to CPU, it's exactly the same setup.
>>
File: aaaa.png (93 KB, 1262x689)
93 KB
93 KB PNG
>>101960901
just squeeze the "model sampling flux" node between unet loader and ksampler and try it out. and of course you need the image width and height for that.
>>
>>101960860
>Ah, right. I'm only vaguely aware of things. They used to work on the original SD, right?
yes, a lot of the BFL crew are former SAI.

As for a 3090, that's a pretty good purchase for 24GB vram at a decent speed for image generation. Just make sure you pc case has good cooling as these cards get toasty.
>>
File: 00175-1128017310.png (1.1 MB, 832x1216)
1.1 MB
1.1 MB PNG
>>
File: 2024-08-18_00274_.png (2.5 MB, 1080x1920)
2.5 MB
2.5 MB PNG
>>
https://enragedantelope.github.io/Styles-FluxDev/
>600 styles test
>they all look the same
:(
>>
>>101960999
https://civitai.com/images/23199531

>Style Test Workflow
>Not using any of the negative guidance hacks

Now there's your problem
>>
>>101960971
What's that for?
>>
File: 0.jpg (646 KB, 1024x1408)
646 KB
646 KB JPG
>>
File: 00178-658671670.png (1.02 MB, 832x1216)
1.02 MB
1.02 MB PNG
Mexican weather lady using OOT lora
>>
>>101960971
What is ModelSamplingFlux for? It adds 2 seconds to my generation time and it generates the exact same image if I don't use it.
>>
>>101960760
A step is essentially the batch size number of images processed. So 5000 steps with 15,000 images does mean 10,000 images aren't looked at. The performance gains of batch size far outweighs any quality loss (made up with more steps or slightly higher learning rate). With Flux you can't do more than batch size 1 on consumer GPUs. Generally you can expect to do 2-3 epochs to get the results you want.
>>
File: ComfyUI_20753_.png (1.31 MB, 1280x720)
1.31 MB
1.31 MB PNG
>>
File: 2024-08-18_00280_.png (2.3 MB, 1080x1576)
2.3 MB
2.3 MB PNG
>>
File: ComfyUI_20851_.png (1.13 MB, 1280x720)
1.13 MB
1.13 MB PNG
>>
>>101961060
would play
>>
File: ComfyUI_20810_.png (1.13 MB, 1280x720)
1.13 MB
1.13 MB PNG
>>
>>101961032
>>101961094
copy pasted straight from reddit but very much relevant:
"base shift is a small, consistent adjustment that stabilizes the image generation process, while max shift is the maximum allowable change to the latent vectors, preventing extreme deviations in the output. Together, they balance stability and flexibility in the image generation.
Using a dog as an example:

Increasing Base Shift: If you increase the base shift, the generated image may become more consistent and closer to the intended form (a clear image of a dog) with less variation or noise. The dog might appear more stable, with well-defined features, but it could also lose some subtle details or become slightly repetitive in texture.

Decreasing Base Shift: Reducing the base shift could introduce more variability, allowing for finer details or more nuanced textures to emerge. However, it might also make the image slightly less stable, potentially introducing minor artifacts or inconsistencies.

Increasing Max Shift: By increasing the max shift, the model has more freedom to explore the latent space, potentially leading to more creative or exaggerated interpretations of the dog. The dog could end up with more exaggerated features or a more stylized appearance, but it might also risk deviating too much from a realistic representation.

Decreasing Max Shift: Lowering the max shift would constrain the model, leading to a more controlled and realistic depiction of the dog. The image would likely remain close to a typical dog appearance with fewer unexpected variations, but it might lack some creative elements or subtle uniqueness."
tldr "artistic freedom slider"
>>
File: ComfyUI_03376_.png (1.97 MB, 1152x896)
1.97 MB
1.97 MB PNG
>>101960999
>>101961030

>Abstract Painting Winston Churchill looking at iPhone., Painting in abstract styles, often using non-representational shapes, colors, and forms.

People really need to stop sleeping on CFG 6~ and high Guidance Neg.

This entire test needs to be redone with new parameters lol
>>
downloading these quants at a mere 8MB/s just to compare s/it and schizo interpret render quality is annoying, especially when they all just seem worse in every way than the regular fp8 checkpoint
>>
>>101960794
Don't bother with gguf
>>
File: ComfyUI_00757_.png (1.42 MB, 1024x1024)
1.42 MB
1.42 MB PNG
>>
File: file.png (581 KB, 512x512)
581 KB
581 KB PNG
>>101961183
I can't use anything else.
>>
gu gu gu gu
>>
>>101961174
this dude complaining about having 80Mbs, I have 20Mbs at best and it drops at night.
>>
>>101961200
no, huggingface is only giving me that much
>>
>>101961172
>CFG 6
Gen time is ~3 times slower on my poorfag gpu.
>>
>>101961210
then you're an even worse of a complainer than i thought
>>
File: file.png (1.55 MB, 1152x896)
1.55 MB
1.55 MB PNG
>>101960999
>>101961172
Compare that to the output from the test (pic rel), which doesn't use any guidance neg - and you can clearly see that there's so much potential in Flux that is wasted by not using negative guidance.
>>
File: ComfyUI_20697_.png (1.79 MB, 1080x1920)
1.79 MB
1.79 MB PNG
>>
>>101961219
yeah, for artistic style, Dynamic Thresholding + CFG 6 is really good, I wish DT would be as good on realistic images though, Tonemap has the opposite problem, imagine if there exist a method that would take the best of both worlds, would be fucking perfect
>>
I've been out of the loop for a while, its forge abandoned now? Do I need to switch back to automatic1111?
>>
>>101961239
woah
>>
>>101961214
Yeah it's double for everyone at the very least since it generates another noise latent to compare against at every step, so the negative attributes can be avoided. (At least that's how I believe it works).

In related news I just updated to Pytorch 124 nightly and my gen times on Flux reduced by 10 seconds lmao.
>>
>>101961239
The original Matrix is so post-processed to shit that Flux images look as if they were from the film.
>>
>>101961183
q8 gguf is almost indistinguishable from fp16, unlike fp8...i've gotten better results and better text when I started using q8
>>
>>101961217
not my fault you're trapped in some godforsaken hellhole like australia
>>
>>101961274
Same here
>>
>>101961266
If you guys wanna try that yourselves the command I used on my python_embedded folder is

.\python.exe .\Scripts\pip.exe install --pre --upgrade torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/cu124
>>
>>101961274
>>101961286

Not the schnell one
>>
File: ComfyUI_20696_.png (1.77 MB, 1080x1920)
1.77 MB
1.77 MB PNG
>>101961270

Lol true
>>
>>101961296
I'm the second anon. I'm using schnell.
>>
>>101961274
>q8 gguf is almost indistinguishable from fp16

thats bs and please dont post the miku with dreadlocks image as an answer
>>
>>101961219
with all due respect.. the first image looks like dogshit. its not abstract at all. and the 2nd one is just fried garbage. maybe try a different approach?
>>101961239
fuck
>>
>>101961319
nta but this is widely known and the reason quants are a thing at all. 16 -> 8 is indistinguishable
>>
>>101961317
Try this and compare it with gguf schnell q8

https://huggingface.co/drbaph/FLUX.1-schnell-dev-merged-fp8-4step
>>
g guff
>>
>>101961249
please respond
>>
File: 00190-4213548808.png (451 KB, 496x768)
451 KB
451 KB PNG
Pretty crazy how well flux does with low resolution, this kind of low resolution before would have totally messed up faces that I would have had to inpaint over.
>>
>>101961324
>the first image looks like dogshit. its not abstract at all. and the 2nd one is just fried garbage. maybe try a different approach?
you can't do anything about it; flux just doesn't know that concept
>>
>>101961337
>schnell-dev-merged
>>
>>101961363
No Forge dev is who was actually what is giving everything a boost, making everything work easier for everyone with different GPUs and it all works with Forge.

so it's better than ever right now, A1111 is pretty much dead (for now)
>>
File: 116775245467567.png (1.11 MB, 896x1152)
1.11 MB
1.11 MB PNG
guys help pls

Im getting all slop results on forge, I try and follow all prompts from popular images on civit but my results dont match at all!

My gens are always yellowish, noisy, grainy, foggy and got some artifacts, not sharp at all, the results here and on civit look sooo fuckin good!

Im on forge, using the NF4 version of Flux

3070 and 32G RAM
>>
>>101961332
>this is widely known
that may apply to text LLM models, we're just discovering image quants models, I tried the Q_8 model after some posts stating that it was closer to fp16 than fp8 and all it gave me was some shitty results compared to f8
quants model seems like some vramlets cope so they can boast about smaller models without able to test properly against bigger models because they don't have the proper hardware to start with
>>
File: ComfyUI_02685_.png (1.87 MB, 1024x1024)
1.87 MB
1.87 MB PNG
>>101961324
With all due respect, if you actually read the posts then you'd realize that the second 'fried garbage' image isn't mine - it's from this "Styles Test" another anon posted.

https://enragedantelope.github.io/Styles-FluxDev/

I was using the same prompt, and quickly popped it into my workflow.

My gen was used as an example that base Flux has a lot more potential in it's ability to output unique styles that most people aren't aware of.

Cheers.
>>
>>101961391
show screenshot of forge page
>>
>>101961391
nf4 and gguf are slop
>>
>>101961374
try "abstract art", "painting", I dunno.. "modern art", whatever and an abstract artist in the CLIP_L box
>>
>>101961363
Have you tried using your eyes.
https://github.com/lllyasviel/stable-diffusion-webui-forge
>>
File: ffffoorge.png (177 KB, 1870x1057)
177 KB
177 KB PNG
>>101961416
here
>>101961423
ahhh damnn
>>
>>101960794
Looks like it's fucked with safetensors too on latest comfy, the text encoder just outputs garbage.
>>
>>
>>101961454
Have you tried Euler with Beta and without highres fix?
>>
>>101961480
bitch
>>
File: file.png (665 KB, 1024x1024)
665 KB
665 KB PNG
>>
>>101961485
Ive tried euler with and without hires, but not beta, thats something I gotta run once

Ill post a gen

is there no hope really for 8G plebs?
>>
>>101961426
but what about
https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/801 ?
it says to go back to the main branch which links to automatic1111
>>
>>101961516

I think you need to download ae.safetensors and put it in the vae folder then load it at the top.

https://huggingface.co/black-forest-labs/FLUX.1-dev/tree/main
>>
File: 00005-1183917609.png (1.05 MB, 896x1152)
1.05 MB
1.05 MB PNG
Reminder
>>
>>101961319
Are you using dev or schnell? All I know is that q8 dev is virtually lossless compared to fp16 dev. This is probably different with schnell though, being much less steps.
>>
File: 3454545677456456.png (1.33 MB, 896x1152)
1.33 MB
1.33 MB PNG
>>101961530
ohhh I didnt know this, I thought flux had no VAE

Forge allows it?

>>101961485
picrel is euler beta 35 steps no hires, 1min40s
>>
>>101961556
take the comfypill
>>
>>101960443
This dumb motherfucker
>Even 4090 doesn't have enough vram for that
YEAH NO SHIT SHERLOCK
FUCKING IMPLEMENT BETTER MULTIPLE GPU SUPPORT YOU FUCKING WANKER
>>
I got 2 GPUs, can I choose to load flux on the 3090 and load the VAE on the 3060 using comfy? Everytime I generate an image it loads flux (takes about a minute or two) and then has to unload it to load the VAE in the same GPU, so it takes at least 3 minutes to generate an image
>>
>>101961556
>35 steps
10's enough for most cases unless you like the gen
>>
>>101961538
is this that flux thing all the kids have been talking about in school?
>>
>>101961520
Even the latest update in that announcement is before flux release. He has been committing tons of code each day for a while now. I don't know if he will pause again for whatever reason.

There is a forge fork that might be active if he does.
https://github.com/Panchovix/stable-diffusion-webui-reForge
>>
>>101961124
I could run training with batch size 4 on 12GB (at 512x512)
>>
>>101961596
Things are in motion everything is in flux.
>>
i have a folder full of pics with genned captions and want to create a lora from them
since training seems to be in such a state of flux (heh), what's the DEFINITIVE latest&greatest way, as of today, to train with best results?
>>
>>101961582
I did, but for some odd reason its slower than forge, but its much faster for XL PDXL and 1.5

>>101961591
hmm understood.

what about cfg and distilled cfg? should it always be at 1 and 3.5?
>>
>>101961610
frfr
>>
File: ComfyUI_00446_.png (615 KB, 512x768)
615 KB
615 KB PNG
>>101961556
This looks fine
>>
>>101961616
I used civitai's trainer, it's like 2 dollars to train a lora on flux and it turned out pretty good for the first time in my case. I could use my own GPU but Im too lazy tbqh
>>
>>101961617
>>101961591
>>101961582
also anons is there some magic prompt format for Flux? Im a SAI promptschool graduate, know fuckall about Flux

>>101961633
damn anon this is so much better! Is this from the main model? the non nerfed nf4?
>>
>>101961640
>damn anon this is so much better! Is this from the main model? the non nerfed nf4?
looks like pap you blind mongoloid smdh
>>
What's a GPU weight slider in updated Forge? Couldn't find anything about it.
>>
>>101961636
meh, i've got a 3090 and the patience to run it locally, just don't want to pick the wrong tool to start with only to find out 10h later that something else was better
>>
>>101961640
>also anons is there some magic prompt format for Flux?
Just use this, if you're not into writing essays https://huggingface.co/spaces/gokaygokay/FLUX-Prompt-Generator
>>
>>101961585
there was a custom node for that, mikufag linked it before
>>
>>101961656
rude
>>
File: file.png (3.4 MB, 1080x1893)
3.4 MB
3.4 MB PNG
>>101960348
Here's a comparison of the samplers using the simple scheduler. heuncpp2 converges the fastest but is also the slowest. But other samplers start pulling ahead at the 10 step mark. I find also and tested that ipndm gives the best from 15-20 steps if you look at the pie and background details before deis takes over at 20-30 steps looking at the pie again and then every scheduler converges above that step count. Mostly using ipndm at 15 steps for that reason for my gens and deis if I want something better. I like ddim_uniform better so I am using that as a scheduler instead of sgm or simple.
>>
tell me, why should I use Forge and why will I get better results with it?
>>
File: 00198-1346045329.png (1.18 MB, 832x1216)
1.18 MB
1.18 MB PNG
>>
>>101961667
In that case SimpleTuner will do the job, are 24GB of VRAM enough to train though?
>>
>>101961668
thanks, if someone can respond about cfg and distilled cfg, and comfy being slower than forge, it'd be great..

also am I missing anything else I should know about?
>>
>>101961683
gross, furries should navigate to /trash/
>>
>>101961667
you won't get it right on the first try
be prepared to spend a lot of time on it, refining and improving
>>
>>101961686
Tell me what did you eat today and why?
>>
>>101961669
where?
>>
>>101961703
I had some hotdogs and I just felt like it
it's a nice sunday so a bit of unhealthy food is fine
>>
File: 1698538146581380.png (2.95 MB, 2048x2048)
2.95 MB
2.95 MB PNG
For AMD users on older cards that don't support WMMA, I made a ComfyUI node that implement flash attention in triton.
It's sadly slower than sub quadratic or split cross attention, but it uses way less VRAM, might be useful for some.
https://github.com/ardfork/ComfyUI-flash-attention-triton
>>
>>101961696
Maybe you need the T5 text thing too

look at this thread it should help

https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/1050
>>
>>101961505
what the duck?
>>
>>101961717
Sounds real good.
>>
why should I use Forge and why will I get better results with it?
>>
>>
File: 11677524926354939342.png (1002 KB, 896x1152)
1002 KB
1002 KB PNG
>>101961633
>>101961556
>>101961530
>>101961668

ok so this is with ae.sft
euler beta and prompt from the generator.

is there a way to run that prompt generator locally?
>>
File: ComfyUI_01049_.png (1.77 MB, 832x1216)
1.77 MB
1.77 MB PNG
>>101961696
>have to link to reddit now
>https://old.reddit.com/r/StableDiffusion/comments/1en5eb3/flux_realism_lora_comparison_onoff_and_different/

You can ignore CFG for now.

>Distilled cfg
Lower (1.6-1.8) for realistic images and higher 2.75-3.5 for normal ones. Higher guidance (dcfg) also means less errors/artifacts
>>
>>101961690
think so, i've seen others have success with 24GB
supposedly it's the bare minimum though so not optimal

>>101961701
eh, i'm mainly just concerned about getting the method right first try, since flux is relatively new and it feels like there's new discoveries, tools and tricks being posted every day
dataset quality/config is another matter and i don't mind having some trial and error with that, hopefully if i curate it a bit beforehand it shouldn't be too bad though
>>
>>101961798
thanks anon, Ill go thru this
>>101961734
I thought the t5 thingy was for the main models only and not for the nf4 ones
>>
>>101961716
dude needs a rentry
https://old.reddit.com/r/StableDiffusion/comments/1el79h3/flux_can_be_run_on_a_multigpu_configuration/
>>
File: 2571825505.png (1.61 MB, 896x1152)
1.61 MB
1.61 MB PNG
>>
>>101961789
>is there a way to run that prompt generator locally?
https://github.com/Ling-APE/ComfyUI-All-in-One-FluxDev-Workflow get this workflow if you want everything in one place and read the page for instructions
>>
File: 1694507672472531.jpg (63 KB, 984x984)
63 KB
63 KB JPG
>>
>>101961912
their husband will be furious
>>
>>101961912
delete this
>>
>>101961921
Funnier than what i was gonna say lmao
>>
>>101961835
afaik, nf4 has that and clip_l baked into the file but you can also supply them separately
>>
>>101961959
that would increase vram usage Im guessing, I think Ill get an oom error
>>
>perhaps it's not perfect
well only one way to find out :^)
https://civitai.com/models/659046/mistress-of-the-flux
>>
File: ComfyUI_00112_.png (532 KB, 512x512)
532 KB
532 KB PNG
>>
>>101961977
no idea, i just select t5 and clip separately and have no issue running it on my 3070, i think it just loads in the weights for the main model only in that case
>>
File: 1721883861834909.jpg (430 KB, 722x881)
430 KB
430 KB JPG
>>101959823
>funnily
Remove this word from your lexicon please, my inner dialog finds it's sound offensive.

Humorously, interesting or counter intuitively would have have been better options.
>>
File: 1717033123322220.png (494 KB, 768x768)
494 KB
494 KB PNG
>>101961912
>>
>>101962020
fffunnaallaayyy
>>
>>101962015
would you mind catboxing any of your gens?
>>
>>101960491
https://www.youtube.com/watch?v=gopKHVBlJJQ
>>
>>101959769
>schizo post deleted
The rentry is working remember to follow the advice
>>
>flash attention, xformers
with modern pytorch, relevant how?
>>
>>101962039
How about one of us sexy and handsome anons come over to your room and you know...show you the basics, help you get set up juuuust the right way, get that forge up and running so we can gen aaaaaall night together?
>>
>>101962098
PREDATOR!
>>
>>101962098
sure, seems like youre up for it

didnt know sdg fags moved over right as SAI got btfo'd
>>
>>101960635
very cool aesthetic anon, nicely done
>>
File: 3294857110.png (1.64 MB, 896x1152)
1.64 MB
1.64 MB PNG
>>
>>101962113
sdg was never usable honestly. Even /b/'s general, for how much they ignore normal convo or questions was more useful.
>>
File: 00207-3347989897.png (1.14 MB, 832x1216)
1.14 MB
1.14 MB PNG
My boss at work is annoyed with me again, I should just quit this job, she's so annoying always telling me how I can't do basic things like printing (the printer is so confusing)
>>
>>101962127
exactly, ldg is better, more open and helpful.. at least I thought so until someone has to ruin it for all like how they ruined sdg with all the nonsensical drama everyday
>>
>>101962084
I made new installs of forge and comfy for flux. For them i don't specify anything and let them run whatever it chooses.

On older installs for sdxl and sd1.5 i explicitly set xformers since it's what i've used and i think it was marginally faster than sdp, or whatever that was, when i tested it.
>>
>>101962121
ty friend
>>
File: download.jpg (87 KB, 800x800)
87 KB
87 KB JPG
>>101961172
Oh noes, it's got a signature. This must be plagiarism and copyright theft. I'm sure the artist who actually painted that would be upset to see how it was stolen by you.
>>
someone pls post the .gguf nodes for comfyui
im going in dry.
>>
>20 seconds and 10gb vram to caption 1 image
fuuck, I thought this would be the easy part
>>
File: 2024-08-18_00316_.png (1.86 MB, 1280x1024)
1.86 MB
1.86 MB PNG
>>101961683
lol the fuck does lcm do on 100 steps?
>>
File: 116775249263549396.png (1.15 MB, 896x1152)
1.15 MB
1.15 MB PNG
>>101962015
ok this is with t5 and clip, didnt get any ooms, but quality defintely seems better
>>
>>101962204
still better than manual
>>
File: 221085716.png (812 KB, 896x1152)
812 KB
812 KB PNG
>>
Coomer here. So I'm currently at epoch 130 of training a flux lora on ~2000 images of a NSFW concept. Holy fucking shitballs it just werks. There are a couple of subconcepts it hasn't learned well (probably because T5 doesn't understand the tag I used at all), but for the most part it is completely mogging literal months of iterative R&D I did with SDXL models on this dataset. And I'm sure I'm not training it optimally at all, I just used the default settings Kohya recommended in the readme. A good full finetune on a few tens of thousands of NSFW images, and this thing will absolutely BTFO every single SDXL porn model. We are actually so back.
>>
File: 49263549399.png (1.18 MB, 896x1152)
1.18 MB
1.18 MB PNG
>>101962252
Finally some good gens from forge, thanks anons, really appreciate all the help!
>>
>>
>>101962339
oh no no "flux doesn't do art styles" sisters we're losing!!
>>
>>
>>101960491
Prompt? Did you write something to add the grain or is this a lora effect?
>>
File: 2024-08-18_00321_.png (1.76 MB, 1280x1024)
1.76 MB
1.76 MB PNG
>>
>>101960002
For this tech to be at the fingertips of idiots has really bad implications for open source.
>>
>>101962384
Thinking logically about it, the only reason Elon would give them a completely uncensored model is he wants a scandal and regulation built around it.
>>
File: 2478753670.png (893 KB, 896x1152)
893 KB
893 KB PNG
>>
>>101962204
>>101962326

I also find that Flux LoRa werks even with dataset made for PonyXL in danbooru tagging styles. The tags become a long ass activation token it seems like. Pairs that with a long winded english description, you can get accurate gens out the gate with old datasets. However, Flux autistically follow English prompts. Random traits that PonyXL "just knows" when you train need be mentioned in plain language. Which can be hard cause tags are just difficult to describe.
>>
>>101962384
idiots can also light themselves on fire with a lighter. also, lemme take a pencil and draw a topless harris OMG
>>101962402
that model is far from being completely uncensored.
>>
>>101962384
You could apply this same argument to literally any technological progress ever made.

"Ugg think fire is scary when used by other tribe"
>>
>>101961214
Adaptive Guidance node speeds it up significantly.
>>
>>101962402
>logic
>elon
choose one
>>
File: ComfyUI_Flux_140.png (1.83 MB, 1344x768)
1.83 MB
1.83 MB PNG
>>101962366
Nah, just a slop prompt
>A dramatic and atmospheric image of Donald Trump and kamala harris standing in a deep, dark cave reminiscent of the one from the movie Alien (1979). The cave should have an eerie, otherworldly ambiance with dim, unsettling lighting and a rugged, uneven surface. Surround the couple with numerous Xenomorph eggs, adding to the ominous atmosphere. They are both wearing their usual business suits, and Trump is giving one of the eggs to kamala harris with a serious face. There is a speech bubble coming from him with the text "You have to eat ALL the eggs." Kamala Harris is gazing at the egg in his hands intently, her expression reflecting a mix of fear and curiosity. The scene should capture the tension and mystery typical of the Alien franchise.
You do need both Kamala and xenomorph loras
Have a kamonald trumpis
>>
>>101962416
catbox this hot mamasita please hermano
>>
>>101962331
Nice, good to know it's sorted. Sad I can't come over to help you now. Maybe next time.
>>
File: 1167752492635494003-SD.png (1.11 MB, 896x1152)
1.11 MB
1.11 MB PNG
>>
>>101962463
everything is publicity for xitter
not one day out of the news for like 5 years now
>>
>>101962441
Well, I guess the only assurance is that everyone on X accurately says "Grok made this". Even if it's Flux, this is what Elon wants Grok to make after all.
>>
>>101962486
finally some good shit
>>
>>101962402
or he actually believes AI should be uncensored and it's going to die on the first amendment hill making it precedent that the outputs are all that matters
>>
>>101962484
all good, thanks again!
>>
>>101962512
I was so mad when he was chestpounding that it was all GROK made lmao

>>101962507
thanks, its been a while.. flux is a massive jump from prompting realism in pony!
>>
>>101962430
Yes, flux not learning certain tags even with lots of training is kinda annoying. I need to do more experiments, but I think that if T5 has no idea what the tag is when processed in isolation, it doesn't generate a reasonable embedding for those tokens, so the DiT struggles to learn it. I'm thinking about writing a script with a huge list of manual rules to convert the tags into basic natural language descriptions that T5 should understand better. That might work. Luckily with my dataset the set of tags is of relatively small size so it should be doable.
>>
>Q4 gguf is 29s/it VS the original 300 or so s that i remember
honestly, huge improvement, but i'm gonna wait another 2 more weeks kek
>>
>>101962486
what a gen BRO. solid 11/10. ima light myself on fire for the occasion
>>
>>101962545
>>101962510
reply meant here
>>
>>101962486
prompt and lora?
>>
>>101962460
Thanks, i'll look into it
>>
File: ComfyUI_03385_.png (2.01 MB, 1152x896)
2.01 MB
2.01 MB PNG
>>101962441
>>101962442

mfw anon makes the exact same point with the same example 2 seconds before me
>>
>>101962557
still trying to understand flux, really amazed by how well it understands prompts
>>
Hi guys, total noob here. I download latest Forge and flux1-dev-bnb-nf4-v2.safetensors

For dimensions 896x1152 I'm getting ~30s/it for GTX 1070 Ti, is it ok speed or abnormal? Could I make it any lower? btw I tried shared memory/async thing, but it was always crashing on those
>>
>>101962597
are you serious? im using a gtx 1080 and getting this on 1024x comfyui >>101962554
fuck this game really is rigged from the start
>>
>>101962583
prompt was just
>bath too narrow blurry FEET woman's sexo legs black and white 9mm
used skypaintcumfluxdev.art website
>>
>>101962586
>>
>>101962615
Sorry I don't fully get it. Is it good speed?
>>
File: img__00004_.png (1.08 MB, 832x1216)
1.08 MB
1.08 MB PNG
>>101962597
>is it ok speed or abnormal?
sounds about right
>>
>>101962632
of course its pretty decent, even compared to my gpu that should be crushing yours but im getting worse speeds on a lower resolution.
>>
>>101962583
couple of rngens to get right one, but prompt goes something like ark atmosphere, grainy, black and white photo, Hot babe in a bath tub, shot from behind, dark wet hair, photo realistic, polaroid photo

with that realism lora and ancient shadows lora
>>
>>101962637
>>101962643
oh dang, thanks. Yes I agree GTX 1080 should have worked better.
>>
>>101962597
seems fine. ppl come in here and mention 7 minutes per gen, so there is that.
>>
File: 00211-391963375.png (1.22 MB, 832x1216)
1.22 MB
1.22 MB PNG
I can never concentrate when I'm at my work meeting.
>>
>>101962159
thanks anon
According to pytorch and huggingface there should be no reason to use xformers at least on modern torch
>>
>>101962349
It doesn't. Try doing impressionism or non realism.
>>
>>101962084
pytorch have flash attention implemented since 2.2
>>
File: ComfyUI_01030_.png (1.85 MB, 1344x768)
1.85 MB
1.85 MB PNG
>>101962689
Are you using a quant or fp?
>>
>>101962717
Also if you can't see the AI slop in the outputs, rather than properly stylized ones (that can only be fixed with a LoRA or finetune), that is on you.
>>
>>101962672
I see, thank you
>>
File: file.jpg (122 KB, 896x1152)
122 KB
122 KB JPG
>>101962616
you lied to me
>>
Come and get your own loaf of...
>>101962774
>>101962774
>>101962774
>>
>>101962739
NTA but I have modern Torch installed and comfy had has that "not compiled with flash attention" anyway. Not sure why.
>>
>>101962786
It obviously doesn't work on windows.
>>
>>101962778
sorry that was the metal toenails shaving rash one
>>
>>101962741
I'm using Q8_0.gguf
>>
>>101962786
Same



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.