/g/ - Technology


Thread archived.
You cannot reply anymore.




File: tmp.jpg (1.25 MB, 3264x3264)
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>101764165

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Kolors
https://gokaygokay-kolors.hf.space
Nodes: https://github.com/kijai/ComfyUI-KwaiKolorsWrapper

>AuraFlow
https://fal.ai/models/fal-ai/aura-flow
https://huggingface.co/fal/AuraFlows

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
Blessed thread of frenship
>>
File: delux_flebo_00011_.png (1.25 MB, 1216x832)
>mfw
>>
File: 1701906796808711.png (1.35 MB, 768x1280)
>>
File: ComfyUI_00023_.png (904 KB, 1024x1024)
>>
File: 2024-08-07_00373_.png (1.57 MB, 1024x1024)
>>
>>101770367
Next challenge - realistic looking cum. Impossible.
>>
File: 1712512836427429.png (1.36 MB, 768x1280)
>>101770422
lol give me a few hours let's see what is possible. with schizo prompting, you can do anything
>>
File: 2024-08-07_00384_.png (1.52 MB, 1024x1024)
>>
File: ComfyUI_Flux_90.png (1.23 MB, 1344x768)
>>101770437
Good luck. No amount of ambiguous words like "cream", "goo", "semi-transparent liquid" and so on helped me.
>>
File: Doomer.jpg (45 KB, 720x720)
>>101769863
Update on this.

Nothing works anymore, im getting errors out of the ass ive never even seen before. Guess i need to reinstall it all again.

I dont even blame AMD for this one, never update kings.
>>
File: 1716657633555375.png (1.72 MB, 768x1280)
>>101770512
sorry for the dogshit artifacts I merged sd3 and flux
>>
File: grid-0087-3014128512.jpg (656 KB, 2688x3600)
>>101770512
try:
(snot), (pus), (messy eater), (white industrial goo)
>>
File: Flux_00230_.png (1.22 MB, 1024x1024)
just woke up, what did I miss?
>>
>>101770721
awesome
>>
File: 1706408760004430.png (1.3 MB, 768x1280)
>>
>>101770683
>I merged sd3 and flux
abomination
>>
>>101770683
why does it have AI face
Aaaaah!!!
>>
File: Capture.jpg (427 KB, 3097x1497)
>>101770721
>what did I miss?
We can crank up the cfg up to 8 without any issues now
>>
File: Flux_00231_.png (1.17 MB, 1024x1024)
>>101770738
>>
>>101770606
The HIP SDK should have that file, going by a cursory search. If you search for that DLL in the folder where you installed the SDK, do you find it? Also, for missing DLL dependencies, you should be using Dependency Walker (https://www.dependencywalker.com/) to check that every missing DLL is accounted for.
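The "search the SDK folder for the DLL" step can be scripted. A minimal sketch, assuming the SDK lives under `C:\Program Files\AMD\ROCm` (a guess, point it at wherever yours was installed); `find_dll` is a made-up helper name, and `hiprtc0507.dll` is the file named later in the thread:

```python
from pathlib import Path

def find_dll(root, dll_name):
    """Return every file under root whose name matches dll_name (case-insensitive)."""
    root = Path(root)
    if not root.is_dir():
        return []
    wanted = dll_name.lower()
    return sorted(p for p in root.rglob("*") if p.is_file() and p.name.lower() == wanted)

if __name__ == "__main__":
    # Hypothetical install location, adjust to your machine.
    sdk_root = r"C:\Program Files\AMD\ROCm"
    hits = find_dll(sdk_root, "hiprtc0507.dll")
    print(hits if hits else "not found under " + sdk_root)
```

An empty result means the DLL really isn't in that SDK install, which narrows it down to a version mismatch rather than a PATH problem.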
>>
File: Flux_00289_.png (1.19 MB, 1024x1024)
>>101770820
how?
>>
>>101770820
>speed plummets down into oblivion
>"without issues"
>>
>>101770838
>how?
DynamicThresholdingFull
https://files.catbox.moe/haqdtd.png
>>
File: ComfyUI_02282_.png (890 KB, 768x1024)
>>101770422
>>
>>101770845
cfg had always the effect of speed decrease, are you new? every people who used negative prompt before used cfg and therefore had their speed halved, no one complained before
>>
File: Flux_00276_.png (1.51 MB, 1024x1024)
>>101770846
whats all this disconnected stuff?
>XYZ Plot
>select Node inputs
>>
>>101770935
some testing shit nodes, you can remove them
>>
File: grid-0101-3014128512.jpg (219 KB, 1792x2400)
>>
>>101770860
What words did you use? Is it consistent?
>>
>>101770838
I saw you post this pic yesterday I actually saved it, can you maybe gimme the prompt? it goes so hard
>>
File: ComfyUI_00036_.png (1.17 MB, 1024x1024)
Drink Flux Cola
even if the bottles aren't always the same size
>>
File: ComfyUI_02314_.png (786 KB, 768x1024)
>>101770964
Not really consistent, a lot of them look like candle wax. I have 2 4090s, offloaded the clip model to one of them and have the fp16 model on the other, so i bulk generate 12 images at this resolution at a time about every 1 minute or so. Maybe 10-15% of the images are passable.

prompt is

"A photo of a blonde girl, she has watery and runny white liquid jizz on her face."
>>
File: 2024-08-07_00385_.png (1.4 MB, 1024x1024)
>>
File: ComfyUI_01267_.png (1.5 MB, 1024x1024)
>>101770947
but damn cfg at 8 generates at a snails pace.
shit took like 4 minutes for this image kek.

>>101771034
here bro
https://files.catbox.moe/anxy7y.png
>>
File: Flux_00336_.png (1023 KB, 1024x1024)
>>
File: 34567564374.png (33 KB, 650x371)
>>101770833
Ehh?????
hiprtc0507.dll is not there
>>
>>101771071
thanks man
>>
>>101771071
>but damn cfg at 8 generates at a snails pace.
you don't need to go that far, if you can get the same result with lower cfg go for it
>>
>>101771077
kek
>>
File: Flux_00216_.png (1.15 MB, 1024x1024)
>>101771109
so where is the sweet spot?
>>
File: 1350104.png (2.44 MB, 1072x1072)
>>
>>101771152
I have no idea, I'm actually making a XY plot between guidance and CFG to see where the magic starts
>>
File: 2024-08-07_00375_.png (1.62 MB, 1024x1024)
>>101771171
thanks for offering your gpu cycles for science anon! looking forward to the results
>>
File: Flux_00310_.png (1.2 MB, 1024x1024)
>>101771201
this, doing gods work
>>
>>101771050
cool
>>
>>101771201
>>101771207
oh sheesh
>>
File: ComfyUI_00029_.png (1.3 MB, 1024x1024)
Nice to meet you, Mr. Kermit.
>>
File: ComfyUI_temp_fkxqv_00223_.png (1.29 MB, 1024x1024)
1.29 MB
1.29 MB PNG
>>101770512
Mayo kinda works.
>>
>>101770863
we use CFG even without a negative prompt
>>
File: Flux_00312_.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>101771255
kek
>>
File: Flux_00340_.png (856 KB, 1024x1024)
856 KB
856 KB PNG
>>
>>101771257
cfg when different from 1 halves the speed whether or not a negative prompt is added.
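Why the speed halves can be sketched in a few lines. This is a toy illustration of classifier-free guidance, not ComfyUI's actual sampler code; `cfg_step` and `toy_model` are made-up names and the "model" is a stand-in function:

```python
def cfg_step(model, x, cond, uncond, cfg_scale):
    """One guided prediction. Returns (prediction, number_of_model_calls)."""
    pred_cond = model(x, cond)
    if cfg_scale == 1.0:
        # uncond + 1.0 * (cond - uncond) == cond, so the second pass can be skipped
        return pred_cond, 1
    # Any other scale needs the unconditional (negative/empty prompt) prediction too
    pred_uncond = model(x, uncond)
    return pred_uncond + cfg_scale * (pred_cond - pred_uncond), 2

def toy_model(x, conditioning):
    # Stand-in for a real denoiser: any deterministic function works for the demo
    return 0.5 * x + conditioning

if __name__ == "__main__":
    _, calls_at_1 = cfg_step(toy_model, 0.0, 1.0, 0.0, 1.0)
    _, calls_at_8 = cfg_step(toy_model, 0.0, 1.0, 0.0, 8.0)
    print(calls_at_1, calls_at_8)  # 1 2
```

The second model call happens for any scale other than 1, and an empty negative prompt still counts as the unconditional input, so cfg 3 costs the same as cfg 8.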
>>
File: 00008-2036652110.jpg (527 KB, 1536x2304)
527 KB
527 KB JPG
>>
File: Flux_00313_.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>
how she driving like this
maybe a cool racing game
>>
>>101771078
I looked up the instructions. It says the following.
>IMPORTANT: If your GPU is higher/newer than a RX6800, then skip Step 3
IF your GPU is below a RX 6800, you need to skip to Step 3.
>Install AMD HIP SDK 6.1.2 from here: https://www.amd.com/en/developer/resources/rocm-hub/hip-sdk.html
>If your GPU is below a RX 6800, you need to do the following steps:
>3.1 Install AMD HIP SDK 5.7.1 from here: https://www.amd.com/en/developer/resources/rocm-hub/hip-sdk.html
I'm guessing 1.) Your GPU is lower than a RX 6800 so you used the wrong HIP SDK, and 2.) when you upgraded your driver, the ROCm install stopped recognizing it. The second one is less likely, because based on what I know from Linux, ROCm really shouldn't be tied to the driver you use. Check 1 first, and only then try 2 by downloading the driver you had before.
>>
>>101771296
yes, that's what I said
you, with your ESL, implied it was only used when there is a negative prompt
>>
>>101771328
>you, with your ESL, implied it was only used when there is a negative prompt
where did I imply that?
>>
uhhhh
at least i figured out the prompts in filename thing
>>
>>101771318
conflicting tokens, whenever you stumble upon something like this, prompt around it. "from the side" or "head outside" and such
>>
>>101771339
>every people who used negative prompt before used cfg
>>
>>101771354
where's the lie? to use negative prompts you need to use cfg, yeah
>>
File: 2024-08-07_00416_.png (1.63 MB, 1024x1024)
1.63 MB
1.63 MB PNG
>>101771212
thank you
>>
>>101771364
I didn't say there was a lie...
oh ESL anonie, you're too dummy and cute to be mad at
*pats your head*
>>
>>101771387
?
>>
>>101771400
*pats your head even harder*
>>
File: ComfyUI_00031_.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
what do you think these guys are plotting?
>>
>>101771428
the protocols
>>
File: flux.png (1.84 MB, 1076x718)
1.84 MB
1.84 MB PNG
>>
File: file.png (30 KB, 122x236)
30 KB
30 KB PNG
>>101771442
this guys like "FUCK ITS HAPPENING"
>>
>>101771428
Are these guys like catholic clerics?
>>
>>101766903
thx I don't have a good graphics card
I have an old HD5670
>>
>>101771479
diff anon but those gens are really cool, how long does it take for each one?
>>
File: 2886757634.jpg (166 KB, 1280x720)
166 KB
166 KB JPG
>>101771322
I have a 7900 xtx and use windows.

And idk man it looks like its trying to call for the older version of HIP ver 5.7? But when i have that installed, it doesnt work at all.

I read the instructions correctly, because it worked before.
>>
File: Flux_00243_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>
>>101771490
infinity, he is not using a HD5670 to gen with any model, it just ain't possible
>>
>>101771506
the GPUs we would have if they won the war bros... 1488 gb vram...
>>
File: img_18.png (1.24 MB, 960x1360)
1.24 MB
1.24 MB PNG
This is my new flux gf guys, please say something nice about her.
>>
>>101771511
isn't it possible to gen cpu only though? or use a shit gpu with the cpu doing most of the work?
>>
File: Flux_00218_.png (1.15 MB, 1024x1024)
1.15 MB
1.15 MB PNG
>>101771520
>>
>>101771522
>please say something nice about her.
Hey babe, you look like belle delphine, can I drink your bathwater?
>>
File: 1695744778385046.jpg (2.38 MB, 2208x2064)
2.38 MB
2.38 MB JPG
Ipad Pro M4 running FLUX schnell, 4 steps, 100 secs per gen

Apple won, bigly
>>
File: up_0008.jpg (468 KB, 2752x5120)
468 KB
468 KB JPG
>>
File: Flux_00320_.png (1.4 MB, 1024x1024)
1.4 MB
1.4 MB PNG
>>
>>101771549
>25s/it
>>
>>101771543
Never mind ......
>>
>>101771550
dangerously based
>>
>>101771495
The only other thing is that your ZLUDA build is outdated. At https://github.com/lshqqytiger/ZLUDA/releases/, I am seeing a v3.8.1 from 3 weeks ago, and a v3.8 right below it that explicitly removes support for anything below ROCm 6. That may be the only reason why ROCm 5.7 is being sought after.
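A quick way to sanity-check which HIP version a build is asking for is to read it off the DLL name. A rough sketch, assuming the `hiprtcMMmm.dll` naming pattern (major then minor, two digits each), which is inferred from `hiprtc0507.dll` lining up with HIP 5.7 in this thread; both function names are made up:

```python
import re

def hiprtc_version(dll_name):
    """Parse 'hiprtc0507.dll' into (5, 7). Returns None if the name doesn't fit the assumed pattern."""
    m = re.fullmatch(r"hiprtc(\d{2})(\d{2})\.dll", dll_name.lower())
    return (int(m.group(1)), int(m.group(2))) if m else None

def matches_installed(dll_name, installed):
    """installed is a version string such as '6.1.2'; only major.minor is compared."""
    wanted = hiprtc_version(dll_name)
    if wanted is None:
        return False
    major, minor = (int(part) for part in installed.split(".")[:2])
    return wanted == (major, minor)

if __name__ == "__main__":
    print(hiprtc_version("hiprtc0507.dll"))              # (5, 7)
    print(matches_installed("hiprtc0507.dll", "6.1.2"))  # False
```

If the requested DLL and the installed SDK disagree like this, the thing doing the requesting (here, probably the ZLUDA build) is the piece to update.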
>>
>>101771511
>it just ain't possible
Says who?
>>
>>101771564
Faster than your RTX
>>
>>101771490
like 30 seconds with gradio webui (link from here) or a bit longer on huggingface
>>
>>101771455
neat
>>
What's the best FLUX model I can run on my 16GB card? My internet sucks so I don't want to download a huge model just for it not to fit
>>
File: ComfyUI_temp_fkxqv_00282_.png (702 KB, 1024x1024)
702 KB
702 KB PNG
>>101771522
>>
File: ComfyUI_00003_.png (1.71 MB, 832x1216)
1.71 MB
1.71 MB PNG
>>
>>101771836
There's a model with merged vae or something like that, should be easy to find
>>
File: file.png (1.5 MB, 1344x768)
1.5 MB
1.5 MB PNG
>>101771428
>>
File: ComfyUI_00006_.png (1.94 MB, 832x1216)
1.94 MB
1.94 MB PNG
>>
File: 342764637854783.png (11 KB, 967x134)
11 KB
11 KB PNG
>>101771603
Okay, well this is new.
At least its not exception errors.
Reinstalled ZLUDA and HIP SDK.

just for the record:
AMD Driver ver.24.7.1
Python ver.3.10.11
HIP ver.6.1.2
ZLUDA ver.3.8.1
>>
>>101771906
Based Karen
>>
>>101771857
>>101771918
I like the style
>>
>>101771882
I'll look around, no worries. I'm just glad it's not completely off the charts. Seeing such huge model sizes sent shivers down my spine. I run a 6GB SDXL model no problem, but FLUX is like 16GB at least. Scary.
>>
New to SDXL lora training, is there any consensus on what learning rates should be set to? I downloaded a few models off civitai to check their metadata, and they're all wildly different.
>>
>>101772011
read up on prodigy optimizer and try that one first, it just werks.
recommending this niggas settings for the optimizer:
https://civitai.com/articles/3105/essential-to-advanced-guide-to-training-a-lora
>>
File: 1712787885828837.png (1.37 MB, 768x1280)
1.37 MB
1.37 MB PNG
I merged sd3 and flux
>>
File: ComfyUI_30865_.png (1009 KB, 1024x1024)
1009 KB
1009 KB PNG
Can flux generate legible text in foreign scripts? I haven't had any luck yet with Japanese or Cyrillic.
>>
File: FD_00007_.png (428 KB, 512x512)
428 KB
428 KB PNG
>>101770820
Doesn't this make it slow as fuck?
I just woke up too.
>>
File: Flux_00035_.png (1.07 MB, 1024x1024)
1.07 MB
1.07 MB PNG
>>
>>101772047
Neither of them produce tits this big
>>
>>101772057
using cfg halves the speed, it was always like that with that parameter yeah
>>
I miss Ran
>>
>>101772062
>two cigs
>>
>>101771922
It found your APU at device id 1, probably on your Ryzen CPU based on the fact it's some weird RDNA2 variant on desktop. 7900 XTX is gfx1100, try using device-id 0 instead.
>>
File: 1711117617723924.png (1.23 MB, 768x1280)
1.23 MB
1.23 MB PNG
>>101772063
my man you have no idea who you are talking to
>>
>>101771606
Me.
>>101771609
bait
>>
File: ComfyUI_00011_.png (1.71 MB, 832x1216)
1.71 MB
1.71 MB PNG
>>101771935
>>
>>101771836
both flux dev and flux schnell are the same size and have the same vram requirements, you can run flux with 16GB VRAM if you load it in fp8
>>101772084
two lungs
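The fp8-on-16GB point is just bytes-per-weight arithmetic. A back-of-the-envelope sketch, assuming roughly 12 billion parameters for the Flux transformer (a figure not stated in this thread) and a made-up 2 GiB allowance for activations and everything else:

```python
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "bf16": 2, "fp8": 1}

def weight_gib(num_params, dtype):
    """Size of the weights alone, in GiB."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30

def fits(num_params, dtype, vram_gib, overhead_gib=2.0):
    """Rough check: weights plus an assumed fixed overhead must fit in VRAM."""
    return weight_gib(num_params, dtype) + overhead_gib <= vram_gib

if __name__ == "__main__":
    flux_params = 12e9  # assumption, see above
    print(round(weight_gib(flux_params, "fp16"), 1))  # 22.4
    print(round(weight_gib(flux_params, "fp8"), 1))   # 11.2
    print(fits(flux_params, "fp8", 16), fits(flux_params, "fp16", 16))  # True False
```

This ignores the text encoder and VAE, which also need memory unless they are offloaded, so treat it as a lower bound.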
>>
File: Flux_00039_.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
>>
File: 2024-08-07_00429_.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>
>>101772164
Several people in these threads have been running it with way less VRAM.. just takes a while
>>
File: ComfyUI_00012_.png (1.93 MB, 832x1216)
1.93 MB
1.93 MB PNG
>>
>>101772251
my shit crashes on comfy every time i try it
>>
File: 2024-08-07_00450_.png (1.57 MB, 1024x1024)
1.57 MB
1.57 MB PNG
>>
File: ComfyUI_30861_.png (1 MB, 1024x1024)
1 MB
1 MB PNG
>>
File: 0.jpg (233 KB, 1024x1024)
233 KB
233 KB JPG
>>101772072
Is it always half unless you use cfg 1.0? i.e is it the same using cfg 3.0 as it is using cfg 8.0?
>>
>>101772251
Yeah, because it falls back to system RAM when it runs out of VRAM, per the default CUDA sysmem fallback policy. I haven't tested Flux personally; I just downloaded it and have it ready to start testing tonight, since I was busy this entire week and last week until today. You can also reduce VRAM usage by using FP8 for the VAE, the text encoder and the unet with the options ComfyUI provides. I haven't tested this with Flux, but traditionally you could also use token merging with the tomesd node from the for_testing ComfyUI custom nodes: put it between the model loader and the sampler, and increase the ratio to use less VRAM. That worked not only for Stable Diffusion but for some other models too.
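For the launch-option route, here is a small sketch that assembles a ComfyUI command line. The flags `--lowvram`, `--fp8_e4m3fn-unet` and `--fp8_e4m3fn-text-enc` are written from memory of ComfyUI's argument list, so check them against `python main.py --help` on your install; `comfy_launch_args` is a made-up helper, and only the unet and text-encoder options are shown:

```python
def comfy_launch_args(lowvram=False, fp8_unet=False, fp8_text_encoder=False):
    """Build an argv list for launching ComfyUI with reduced-memory options."""
    args = ["python", "main.py"]
    if lowvram:
        args.append("--lowvram")
    if fp8_unet:
        # Assumed flag name: store the diffusion model weights as fp8 (e4m3fn)
        args.append("--fp8_e4m3fn-unet")
    if fp8_text_encoder:
        # Assumed flag name: store the text encoder weights as fp8 (e4m3fn)
        args.append("--fp8_e4m3fn-text-enc")
    return args

if __name__ == "__main__":
    print(" ".join(comfy_launch_args(lowvram=True, fp8_unet=True, fp8_text_encoder=True)))
```

The list can be handed to `subprocess.run` from the ComfyUI folder, or just copied into a terminal.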
>>
>>101772294
did you --lowvram
>>
File: 2024-08-07_00452_.png (1.75 MB, 1024x1024)
1.75 MB
1.75 MB PNG
>>101772369
ya speed just doubles regardless how high you set CFG
>>
https://new.reddit.com/r/StableDiffusion/comments/1emi1j9/opensource_amd_gpu_implementation_of_cuda_zluda/
>a based gentleman wanted to help AMD by making Cuda compatible with their cards
>AMD sent a ban notice to him
If that's not a sign that AMD is a controlled opposition, then I don't know what else to say
>>
>>101772397
>doubles
*halves
>>
>>101772392
would that even help
i don't even see anything get loaded into vram it crashes before that happens
>>
>>101772421
post workflow
>>
>>101772447
bruh literally the glass bottle workflow to load the schnell model
https://comfyanonymous.github.io/ComfyUI_examples/flux/
>>
File: ComfyUI_00015_.png (1.32 MB, 1344x768)
1.32 MB
1.32 MB PNG
>>
>>101772406
Why would you link to Reddit, you mouthbreather, especially when it's news and it's old news. Use the original link to the website that broke the news.
https://www.phoronix.com/news/AMD-ZLUDA-CUDA-Taken-Down
And actually get the fuck out of here, we don't need to see more of you vermin bottom-feeders here who can't contribute positively to the culture and posting atmosphere who just are going to repost shit there once something useful here is posted.
>>
>>101772505
zam
>>
File: 0 (6).jpg (185 KB, 1024x1024)
185 KB
185 KB JPG
>>101772397
Pretty shit, I can barely run it as it is on a 4080.
Interesting experiments but I don't feel like it's worth it for general purposes.
>>
>>101772474
might be a dumb question also but do you have the latest comfy commit?
also just try --lowvram just to see
>>
>>101772537
i updated today
ill try in lowvram just for you honey
>>
File: 2024-08-07_00455_.png (1.45 MB, 1024x1024)
1.45 MB
1.45 MB PNG
>>101772526
thank you
>>
File: Flux_00322_.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>101772172
>>101772062
>>
File: FLUX_00147_.png (863 KB, 1152x896)
863 KB
863 KB PNG
>>101772568
have you been at this for 12 hours?
>>
>>101772526
for my use case it's interesting, with cfg = 1 there's no way I can change Miku's skin, I thought flux was just unable to do something like that, turned out that cfg unleashes its prompt understanding to the max
>>
>>101772537
insta crashed.
even the terminal kills itself
>>
File: 112367437564.jpg (73 KB, 1000x1001)
73 KB
73 KB JPG
>>101772505
I doubt any open source dev would ever take legal action against open source code, but i am almost 100% certain you cannot just go backsies on open source license agreements?
>>
File: Flux_00290_.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
>>101772576
yes
>>
File: FLUX_00148_.png (885 KB, 1152x896)
885 KB
885 KB PNG
>>101772598
respect
I just woke up
>>
>>101772598
Thank you, king
>>
File: 00001-3065012206.png (772 KB, 1024x1024)
772 KB
772 KB PNG
>>101772090
I KNEEL
>>
>>101772582
that probably means you don't have enough RAM or pagefile/swap space. make your pagefile/swap bigger.
I get by on a 4GB card with 16GB RAM with Schnell. Blistering 125s per iteration.
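To turn a seconds-per-iteration figure into time per image, multiply by the step count, and double it if CFG is not 1. A tiny sketch using numbers from this thread (4 steps for Schnell, 125 s/it here, 25 s/it for the iPad post); `gen_seconds` is a made-up name, and it assumes the s/it figure was measured at cfg 1:

```python
def gen_seconds(steps, sec_per_it, cfg=1.0):
    """Estimated sampling time for one image; VAE decode and model loading are not counted."""
    passes_per_step = 1 if cfg == 1.0 else 2  # CFG != 1 needs a second model pass per step
    return steps * sec_per_it * passes_per_step

if __name__ == "__main__":
    print(gen_seconds(4, 125))         # 500
    print(gen_seconds(4, 25))          # 100
    print(gen_seconds(4, 125, cfg=8))  # 1000
```

The 4 x 25 = 100 case lines up with the "100 secs per gen" iPad figure posted earlier.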
>>
>>101772626
Should try amfetamine + vodka combo for ultimate art
>>
File: Flux_00295_.png (1.07 MB, 1024x1024)
1.07 MB
1.07 MB PNG
>>101772626
>>101772649
I can barely get any sleep since flux came out, I just wanna Proooompt.
feels like the early days of SD but better.
>>
>>101772672
(That's loaded as fp8 from checkpoints already converted to fp8. With fp16 checkpoints, even if you have the fp8 e4m3fn launch flags, it still has to load the full fp16 checkpoint before it takes up less RAM/VRAM.)
>>
third flux lora dropped on civitai, this time chinese style slop:
>https://civitai.com/models/629898/flux-lora-littletinies?modelVersionId=704235
>>
File: ComfyUI_temp_oabjb_00036_.png (2.99 MB, 2016x1536)
2.99 MB
2.99 MB PNG
>>
File: UXL.png (106 KB, 1240x506)
106 KB
106 KB PNG
>>101772584
Read the wording carefully.
>The code that was previously here has been taken down at AMD's request. The code was released with AMD's approval through an email. AMD's legal department now says it's not legally binding, hence the rollback.
This wasn't a legal threat. AMD's legal department had approved his release of the code as open source and then retracted that approval, so he lost the legal guarantees that came with it. Taking the code down was basically a preemptive move to prevent Nvidia from rolling out their own legal action against him once he lost AMD's legal protection, and there is no way he can publish the code any longer. Anyone else can pick it up and fork it since it was released with the original license, but they risk having Nvidia ram a DMCA in their mailbox.
But I am just going to level with you and everyone else regarding this topic. CUDA compatibility was always going to be a slippery slope because it's a proprietary standard from Nvidia and isn't open source, and thus is subject to their whims and desires per how Nvidia licenses it, even if it's for compatibility. While it's a great solution in the short to medium term, it can't be the long term solution. We actually need to see more software being written that isn't CUDA, even if Nvidia is far ahead of everyone else.
AMD's ROCm isn't a solution either because it also isn't a standard and is only open-source. The only solution that I see working with a slim chance and that I have tethered myself to is Intel's strategy with SYCL, a Khronos standard and the successor to OpenCL, and extending that to the UXL Foundation with trying to get everyone on board with it. Unsurprisingly, AMD and Nvidia are missing. It is my opinion AMD is hurting themselves not trying to write a frontend tying this to ROCm and I am hoping at some point AMD comes around to it.
>>
File: Flux_00048_.png (1.03 MB, 1024x1024)
1.03 MB
1.03 MB PNG
>>101772568
>>
>>101772714
cool, that's for flux Dev, I'm scared that they will all focus on Schnell because of the licence, I don't want that, Dev is the best model we have
>>
>>101772714
why that bird leaking bean juice?
>>
>>101772731
regular basic bitch lora makers won't care but anyone with real money won't use dev unless they're backed by donations
>>
File: Flux_00327_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>101772726
>>
>>101772731
It's ok I will train a coomer lora on an a6000ada when the tools are a little bit better.
>>
>>101772746
You didn't read the dev license.
>>
>>101772667
Glad to see you getting it to work, it seems. I haven't used ROCm in over a year but it seems like similar errors and etc. from when I was using it, albeit a lot less convoluted than the Linux errors I saw with Vega 64. AMD still has a long way to go, there is no reason why ROCm shouldn't work on Windows at the moment natively but it is what it is.
>>
File: Flux_00214_.png (1.08 MB, 1024x1024)
1.08 MB
1.08 MB PNG
>>101772793
based Coomer Chad
>>
File: Flux_00049_.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
>>101772749
>>
>>101772798
dev license is non-commercial use and you're in a grey area with some guy training models for fun and getting monthly Patreon donations
>>
File: ComfyUI_30871_.png (600 KB, 1024x768)
600 KB
600 KB PNG
>>
>>101772814
>>101770274
here's where the spam posts go
>>
>>101772829
>yuri
a man of culture I see
>>
>>
File: 00007-3179540968.png (359 KB, 512x512)
359 KB
359 KB PNG
>>101772090
>>101772667
>go through the effort of updating GPU to play BFV because of error
>errors out SD instead but can play
>spend all day fixing SD

>BFV errors out with earlier error, but SD works

Lessons learned;
>never update
>any product made by EA is actually garbage
>>
File: file.png (694 KB, 1024x1024)
694 KB
694 KB PNG
>>
>>101772867
>playing EA goy slop
deserved
>>
File: ComfyUI_Flux_00510_.png (2.03 MB, 1536x1024)
2.03 MB
2.03 MB PNG
>>101771039
poison drink
>>101772052
yes, it can make legible Japanese hiragana but I don't speak it so It's probably nonsense words. I haven't tried prompting something specific.
>>
File: ComfyUI_30872_.png (744 KB, 1024x768)
744 KB
744 KB PNG
>>
File: 1581588520895.jpg (125 KB, 1280x720)
125 KB
125 KB JPG
>>101772881
Honestly and BF4 actually works yet BF1 and V dont so you know what it was deserved
>>
>>101772882
>Our competitor's product is poison
Made me laugh
>>
>>101772867
>never update
Never update thinking it is a quick thing for alpha/beta software which is what ROCm is despite AMD saying otherwise. And yeah, EA's programming is kinda shit but also AMD's drivers too on Windows. Hopefully updating SD made it faster too so the effort was worth it.
>>
File: file.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>
File: kingwithkings.png (237 KB, 529x450)
237 KB
237 KB PNG
>>101772961
>updating SD made it faster
It did actually, thanks.
>>
>>101771857
prompt?
>>
File: ComfyUI_00007_.png (1.85 MB, 832x1216)
1.85 MB
1.85 MB PNG
>>101773009
it's just a new oil painting lora on civit
ponyXL
and a character lora
>>
>>101772793
>coomer lora
I assume the best we would get is "1girl, spread anus, hairy asshole", because for actually complex scenes the whole model would need to be re-trained
>>
File: 2024-08-07_00477_.png (1.04 MB, 1024x1024)
1.04 MB
1.04 MB PNG
>>101772714
yaaa.. well I tried it.. its shit, people obviously need to figure out how to make good loras for flux still, hardly can get it to make anything
>>
File: ComfyUI_00561_.png (1.58 MB, 1024x1024)
1.58 MB
1.58 MB PNG
I'm getting lots of booba without even asking for it, the weird half-censored nipples are really noticeable.
>>
File: wizard.png (910 KB, 1024x1024)
910 KB
910 KB PNG
>>101772974
>>
File: ComfyUI_00013_.png (3 KB, 1024x1024)
3 KB
3 KB PNG
>>101772672
>>101772700
got it to work
aaaaand its just a blank fucking image lol!
>>
File: ComfyUI_02622_.jpg (978 KB, 1792x2304)
978 KB
978 KB JPG
>>
>>101773181
Do you have image previews on? Was the image black the entire time or were there colors and shapes before it went black. With black images throughout the entire process, typically, I find a restart fixes things but I don't have a Nvidia GPU. If it is colors and shapes before it goes all black, then yeah, it's something in the generation process going wrong.
>>
File: ComfyUI_temp_oabjb_00049_.png (1.14 MB, 1024x1024)
1.14 MB
1.14 MB PNG
>>
File: ComfyUI_30876_.png (799 KB, 1024x768)
799 KB
799 KB PNG
>>
File: ComfyUI_00574_.png (1.31 MB, 1024x768)
1.31 MB
1.31 MB PNG
>>
File: ComfyUI_30878_.png (745 KB, 1024x768)
745 KB
745 KB PNG
>>
File: ComfyUI_00014_.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>101773233
got it to work
>>
File: 2024-08-07_00507_.png (1.52 MB, 1024x1024)
1.52 MB
1.52 MB PNG
>>101773318
grats .. now make us some gens
>>
File: 2024-08-07_00510_.png (1.59 MB, 1024x1024)
1.59 MB
1.59 MB PNG
>>
Inpainting in Flux is dramatically better.
>>
>>101773112
nofap is easy when you have a pussy
>>
>>101773407
I haven't bothered because inpainting in comfy is nightmarish
>>
File: ComfyUI_Flux_00605_.png (1.28 MB, 1024x1024)
1.28 MB
1.28 MB PNG
>>
File: ComfyUI_30881_.png (755 KB, 1024x768)
755 KB
755 KB PNG
>>
>>101773441
SwarmUI makes it a little better. But honestly even without inpainting the results can be great
>>
File: ComfyUI_00001_.png (1.13 MB, 1024x1024)
1.13 MB
1.13 MB PNG
>>101773376
will do
>>
File: ComfyUI_02628_.jpg (694 KB, 1792x2304)
694 KB
694 KB JPG
>>101773441
that hasn't been true for like a year now. You can draw a mask within a load image node and pass the image + mask straight to a sampler.
>>
does Flux have negative prompt?
>>
File: 00013-232147762.png (1.07 MB, 720x1344)
1.07 MB
1.07 MB PNG
>>
>>101773508
nta but the issue i have with it is that it doesn't consider what is behind the inpainted mask
so if i do it at half denoise i get a grey blob.
a1111 doesn't do this.
>>
File: 2024-08-07_00430_.png (1.47 MB, 1024x1024)
1.47 MB
1.47 MB PNG
>>101773517
when you use the DynamicThresholding node you can use negative prompts if you raise CFG above the recommended cfg == 1 and set mimic cfg == 1, but the generation time doubles then
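Roughly what the "mimic cfg" setting does, as far as I understand it: compute the guided result at the high CFG, then squash it so its spread matches what the mimic CFG would have produced. A much simplified sketch on flat lists of floats; the real node works on tensors per channel and has more options, and `dynamic_threshold_cfg` is a made-up name:

```python
from statistics import fmean

def dynamic_threshold_cfg(pred_cond, pred_uncond, cfg_scale, mimic_scale):
    """Guided prediction at cfg_scale, rescaled to the value range seen at mimic_scale."""
    high = [u + cfg_scale * (c - u) for c, u in zip(pred_cond, pred_uncond)]
    mimic = [u + mimic_scale * (c - u) for c, u in zip(pred_cond, pred_uncond)]

    high_mean, mimic_mean = fmean(high), fmean(mimic)
    high_peak = max(abs(v - high_mean) for v in high)
    mimic_peak = max(abs(v - mimic_mean) for v in mimic)
    if high_peak == 0:
        return high

    # Keep the direction of the high-CFG result, but shrink its deviation
    # from the mean to the peak deviation of the mimic result.
    ratio = mimic_peak / high_peak
    return [(v - high_mean) * ratio + high_mean for v in high]
```

Both guided results come from the same two model passes (positive and negative), so the cost is the usual CFG doubling and nothing extra for the mimic part.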
>>
File: 1722985155755211.jpg (177 KB, 671x682)
177 KB
177 KB JPG
>>
>>101773553
You're doing something wrong then, I've never had that issue
>>
File: ComfyUI_02407_.png (985 KB, 1024x1024)
985 KB
985 KB PNG
>>
File: 00040-232147761.png (1.15 MB, 720x1384)
1.15 MB
1.15 MB PNG
>>
File: ComfyUI_30882_.png (823 KB, 1024x768)
823 KB
823 KB PNG
>>
So, is Flux open source in the same way normal SD is? Like, are there people currently working on different checkpoint for porn (similar to pony)?
>>
>>101773606
can you provide an example workflow then?
>>
File: 1722985155755212.jpg (131 KB, 665x609)
131 KB
131 KB JPG
>>
is there any way i can get decent flux gen speed on a volta card?
>>
File: 2024-08-07_00360_.png (1.56 MB, 1024x1024)
1.56 MB
1.56 MB PNG
>>101773633
they are on the fence about that. the pony model maker is bitching about the license again cause he can't make money off it: while FLUX.schnell is completely free and open source, the superior model FLUX.dev is only free for non-commercial use and the master model FLUX.pro is closed source. ppl will still make loras for it (already are), but if and when we see a coomer finetune remains unknown. never underestimate the thirst of coomers for new toys I guess
>>
File: 2024-08-07_00521_.jpg (773 KB, 2560x1440)
773 KB
773 KB JPG
>>
oh shit i got nipplez on flux
i thought it wasnt possible
nvm they turned into fucking grapes
>>
File: ComfyUI_02346_.png (1.54 MB, 1024x1024)
1.54 MB
1.54 MB PNG
>>
>>101773640
just load an image, open image in mask editor, and then send to a sampler. I usually use a mask ksampler which is an impact node, but I think these ones are all default nodes
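What a latent noise mask boils down to, in spirit: the sampler only gets to change the masked area, and everything else is taken from the original. A toy sketch on flat lists of floats, assuming the mask is 1.0 where you painted and 0.0 elsewhere; `apply_noise_mask` is a made-up name, not a ComfyUI function:

```python
def apply_noise_mask(original, denoised, mask):
    """Blend per element: mask 1.0 keeps the new (denoised) value, mask 0.0 keeps the original."""
    return [m * d + (1.0 - m) * o for o, d, m in zip(original, denoised, mask)]

if __name__ == "__main__":
    original = [1.0, 1.0, 1.0]
    denoised = [5.0, 5.0, 5.0]
    mask = [0.0, 0.5, 1.0]  # untouched, feathered edge, fully inpainted
    print(apply_noise_mask(original, denoised, mask))  # [1.0, 3.0, 5.0]
```

Because the original latent stays underneath the mask, a partial denoise starts from what was already in the picture instead of from a blank region.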
>>
File: ComfyUI_02433_.png (998 KB, 1280x1024)
998 KB
998 KB PNG
>>
File: 2024-08-08_00003_.png (1.27 MB, 720x1280)
1.27 MB
1.27 MB PNG
>>
File: ComfyUI_30883_.png (769 KB, 1024x768)
769 KB
769 KB PNG
>>
File: 00013-2296647356.png (2.21 MB, 1248x1824)
2.21 MB
2.21 MB PNG
>>
>>101773824
>set latent noise mask
the node that has been under my nose this entire time
thanks anon
>>
File: Flux_00333_.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
okok I know I already beat this prompt to death but just this last one ok
>>
File: FLUX_00169_.png (1.33 MB, 1152x1280)
1.33 MB
1.33 MB PNG
is this a gibson SG?
>>
>>101773928
more or less
the shape is right
>>
File: Capture.jpg (738 KB, 3453x1305)
738 KB
738 KB JPG
Fucking finally...

https://files.catbox.moe/hsi2zn.png
Now that we can increase the CFG value on flux, see here:
https://reddit.com/r/StableDiffusion/comments/1ekgiw6/heres_a_hack_to_make_flux_better_at_prompt/?sort=confidence
It's time to look for the sweet spot between the Guidance scale and the CFG, and for that we'll use this prompt:
>A drawing of Hatsune Miku with dreadlocks and a black skin showing her fists on the street
The goal here is to check whether the image shows Miku with black skin + dreadlocks; if neither is present, the image is eliminated.
Here are my observations:
- CFG = 1 won't change Miku's skin or add any dreadlocks. This is one of the reasons why it's important to have a higher CFG if you want Flux to perform as well as possible in terms of prompt understanding.
- The sweet spot seems to be at CFG = 5, that's the minimum CFG where we got the most success.
- Having a low guidance doesn't work well with a high CFG, the images get broken.
- High guidance seems to make the model worse at prompt understanding, for example at Guidance = 5.1, there hasn't been a single image that was successful.
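For anyone who wants to redo this kind of XY plot, the bookkeeping can be sketched like this. The grid values in the example are only the ones mentioned in this thread (CFG 1, 5, 8 and guidance 3.5, 5.1), not the full grid from the plot; generating and judging the images is left out, and `sweep` and `success_rate` are made-up names:

```python
from itertools import product

def sweep(cfg_values, guidance_values):
    """Every (cfg, guidance) combination to render, one dict per grid cell."""
    return [{"cfg": c, "guidance": g} for c, g in product(cfg_values, guidance_values)]

def success_rate(results):
    """results maps (cfg, guidance) to a list of bools, one per image (True = prompt followed)."""
    return {cell: sum(hits) / len(hits) for cell, hits in results.items() if hits}

if __name__ == "__main__":
    grid = sweep([1, 5, 8], [3.5, 5.1])
    print(len(grid))  # 6
    # Hand-labelled outcomes would go here; these two entries are placeholders.
    print(success_rate({(5, 3.5): [True, False], (1, 3.5): [False, False]}))
```

Running several seeds per cell and comparing the rates makes the "sweet spot" less dependent on one lucky image.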
>>
>>101773916
np, if you have/get the Impact pack of nodes you can just use its mask detailer sampler, which makes it a bit simpler to set up. It also has face detailer nodes you can use with detection models to auto mask faces/people/hands

https://github.com/ltdrdata/ComfyUI-Impact-Pack
>>
File: 00037-750171258.png (3.02 MB, 1280x1920)
3.02 MB
3.02 MB PNG
>>
>>101773683
Late but I thought the pony person was still in the process of making a v7 of the SDXL finetune and was waiting on the Open Model Initiative to put a model out. Flux wasn't planned, it was just a coincidence the creators had it ready last week and marketed it at the right time after the SD3 fiasco. I get he probably wanted to use Flux since it is here right now but he's going to have to wait to make money.
>>
File: 2024-08-08_00011_.png (1.35 MB, 720x1280)
1.35 MB
1.35 MB PNG
>>101773933
very interesting results, high cfg is working nicely it seems (when using half cosine) .. pic related is cfg = 8, guidance = 3.5 .. which seems to be a sweet spot

one of my observations to add: high CFG works better for landscape/cityscapes/surrealist/anime/cartoon, for photorealistic human gens I wouldn't go above 5 .. maybe even less
>>
>>101773992
Sup
>>
File: 00039-2055272249.png (3.01 MB, 1280x1920)
3.01 MB
3.01 MB PNG
>>
>>101773917
I will continue to goon thank you very much
>>
>the preview looks absolutely perfect
>oh God, please let it be good, please, please
>VAE decode bar is filling, doki doki, waku waku
>image is saved
>it's awful
I want to cry.
>>
>>101774010
>still in the process of making a v7 of the SDXL finetune
no one knows, he had a serious fight with SAI, not sure if he still wants to do SDXL, last he blabbered was he wants to make an AuraFlow finetune
>>
File: FLUX_00170_.png (1.43 MB, 1152x1280)
1.43 MB
1.43 MB PNG
and the strat
>>
File: ComfyUI_30888_.png (796 KB, 1024x768)
796 KB
796 KB PNG
>>
File: 2024-08-07_00418_.png (1.84 MB, 1024x1024)
1.84 MB
1.84 MB PNG
>>101774061
7 string strat.. fancy
>>
>>101774010
I hate to break it to you but Pony is a fraud and what you got already was a lightning in a bottle strike where someone incompetent sometimes accidentally makes something good. Pony basically is the YandereDev.
>>
>>101774019
>one of my observations to add: high CFG works better for landscape/cityscapes/surrealist/anime/cartoon, for photorealistic human gens I wouldn't go above 5 .. maybe even less
yeah I have the same conclusion, desu having a cfg of 5 is enough to make flux as good as possible on prompt understanding, no need to overkill yeah

>high cfg is working nicely it seems (when using half cosine)
yeah, using Half Cosine Up on mimic_mode and cfg_mode was the last thing needed to make DynamicThresholding completely viable, glad I found that mf through a XY plot
https://files.catbox.moe/b4hdh0.png
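Rough idea of what a "half cosine up" ramp could look like, as a sketch only. It assumes the schedule is 1 - cos(frac) with frac going from 0 to 1 over the steps; I have not checked it against the DynamicThresholding source, and `half_cosine_up` and its parameters are hypothetical names.

```python
import math


def half_cosine_up(scale, minimum, step, max_steps):
    """Ramp a value up from `minimum` toward `scale` as sampling progresses.

    Assumed form: only the first radian of the cosine is used, so the ramp
    starts at `minimum` and stops short of the full `scale`.
    Requires max_steps > 1.
    """
    frac = step / (max_steps - 1)
    return minimum + (scale - minimum) * (1.0 - math.cos(frac))


# Example: target 8, floor 1, over 20 steps.
ramp = [half_cosine_up(8.0, 1.0, s, 20) for s in range(20)]
```

Under that assumption the early steps stay close to the floor value and the push only grows late in sampling.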
>>
does this flux schnell safetensor run on A1111? Or is it a comfy-only thing for now?
>>
>>101773508
>that hasn't been true for like a year now. You can draw a mask within a load image node and pass the image + mask straight to a sampler.
comfy inpainting gradually fries the image to shit if you do a large number of repeated inpaints on your results, as the VAE encodes/decodes the image over and over
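One common way to limit this, sketched under assumptions: after each inpaint, paste only the masked region back onto the untouched original pixels, so everything outside the mask never goes through the VAE round trip. Flat lists of floats stand in for images here, and `composite` is a hypothetical helper, not a ComfyUI node.

```python
def composite(original, inpainted, mask):
    """Keep original pixels where mask is 0 and inpainted pixels where mask
    is 1; values in between blend linearly (soft mask edges)."""
    return [o * (1.0 - m) + i * m for o, i, m in zip(original, inpainted, mask)]


# Toy 3-pixel "image": the VAE round trip nudged every pixel a little,
# but only the middle pixel was actually masked for inpainting.
original = [10.0, 20.0, 30.0]
inpainted = [11.0, 99.0, 29.0]
mask = [0.0, 1.0, 0.0]

result = composite(original, inpainted, mask)
```

Only the masked pixel changes; the drift on the other two is thrown away, so it can't pile up over repeated passes.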
>>
File: 2024-08-08_00017_.png (1.32 MB, 720x1280)
1.32 MB
1.32 MB PNG
>>101774169
>A1111?
no, two more weeks
>comfy
yes

>>101774155
anime gens really like high cfg tho, this is cfg 8, on 5 the sword was borked and the sakura pattern on the kimono was not present
>>
>>101770020
Does boomer prompting work because the t5 tokenizer has a max sequence length of 512, whereas the CLIP tokenizer has a max sequence length of 77? Does flux use both or just one or the other at a given time?
>>
File: Flux_00277_.png (924 KB, 1024x1024)
924 KB
924 KB PNG
>>101774169
just use comfyUI bro.
it's literally better in every way.
>>
File: ComfyUI_30889_.png (1.72 MB, 1632x1224)
1.72 MB
1.72 MB PNG
>>
>>101774211
>anime gens really like high cfg tho, this is cfg 8, on 5 the sword was borked and the sakura pattern on the kimono was not present
oh cool, I'll keep that in mind when making anime gens then
>>
File: 2024-08-07_00366_.png (1.63 MB, 1024x1024)
1.63 MB
1.63 MB PNG
>>101774220
by default, if you feed it a simple prompt, it uses both with the same prompt. you can use the CLIPTextEncodeFlux node tho, then you can send CLIP and T5 each their own prompt, which may or may not be a good idea depending on your prompt and gen subject
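To picture the 77 vs 512 limits asked about above, a toy sketch. It assumes plain truncation (some UIs chunk long CLIP prompts instead) and uses whitespace splitting as a stand-in for the real subword tokenizers, so the counts are only illustrative. `fake_tokenize` and `truncate` are made-up helpers.

```python
CLIP_MAX_TOKENS = 77   # limit quoted in the thread
T5_MAX_TOKENS = 512    # limit quoted in the thread


def fake_tokenize(prompt):
    # Stand-in only: real tokenizers split into subwords, not on whitespace.
    return prompt.split()


def truncate(prompt, max_tokens):
    """Return the tokens an encoder with this limit would actually see."""
    return fake_tokenize(prompt)[:max_tokens]


# A 200-"token" boomer prompt fed to both encoders.
long_prompt = " ".join(f"word{i}" for i in range(200))
clip_sees = truncate(long_prompt, CLIP_MAX_TOKENS)
t5_sees = truncate(long_prompt, T5_MAX_TOKENS)
```

With the same long prompt going to both, the CLIP side loses everything past its limit while the T5 side still sees all of it.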
>>
What the flux?
Is this broken : https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell

Acts like it's working but then "error".
>>
File: ComfyUI_02644_.jpg (1.14 MB, 2048x2048)
1.14 MB
1.14 MB JPG
>>
>>101774274
works for me
>>
File: ComfyUI_30890_.png (1.78 MB, 1632x1224)
1.78 MB
1.78 MB PNG
>>
>>101774211
>on 5 the sword was borked and the sakura pattern on the kimono was not present
I guess that on cfg 1 it was even worse?
>>
>>101774274
>https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
it was bronked 4me, the dev one in spaces is working but gives like 5 gens
>>
>>101774274
>>
File: ComfyUI_30891_.png (1.6 MB, 1632x1224)
1.6 MB
1.6 MB PNG
>>
>>101772047
recipes?
>>
>>101772047
Ah that's why it looks like shit
>>
>>101773627
This isn't Flux is it?
>>
File: 2024-08-08_00025_.png (1.17 MB, 720x1280)
1.17 MB
1.17 MB PNG
>>101774326
pic related at cfg 1, didn't get the style at all
>simple anime style, flat colors, cell shaded, thick line art, style by Leiji Matsumoto
also the kimono is borked and the swords .. well are suddenly swords
>>
File: 1722985155755253.jpg (157 KB, 701x683)
157 KB
157 KB JPG
>>
Next bread is ready and waiting...
>>101774288
>>101774288
>>101774288
>>
no collage in the next one, rebake
>>
>>101774058
>he had a serious fight with SAI
Stupid drama or something that makes sense?
Didn't even know Pony dude discussed with them, I thought most model makers are silent about what people do to add back porn.
>>
>>101774382
every block .5 except the last, final block 1.0 (all Flux)
>>
>>101774468
only saw it mentioned in his dev update about pony development on civitai .. seems there was a discord drama involving SAI employees and the SD3 license debate, I don't much care, but it's the kinda popcorn drama you see so often these days
>>
>>101774465
5 API credits has been deposited to your BFL acc
>>
>>101774040
redo it and save at earlier steps.
>>
>>101773407
Going with the one on the left.

On a side note, can Flux do transparent backgrounds? If so, I'd wager it could make an amazing sprite set for Sillytavern.
>>
>>101774580
No, but it's easy enough to remove a background. It's even built into Windows now.
>>
File: fac001.jpg (316 KB, 1024x1024)
316 KB
316 KB JPG
>>
>>101774058
I was just wondering if there was any update after
https://civitai.com/articles/5069/towards-pony-diffusion-v7 since that was written right when the SD3 debacle started. Any news source for the Auraflow thing or is this Discord talk?
>>101774122
That's probably not in question, given his past remarks on not knowing things that should be in 101s for anyone doing this semi-seriously. But codemonkeys do occasionally still do good things and he had retard levels amount of money and enough clout and given most finetuners have been hired away from the community and no one else wants to do NSFW tunes, he would be one of the people to watch.
>>
>>101774580
Not that I know of
>>
>>101774630
The ponyfaggot is sticking to his guns on Pony-AF even though AF is dogshit.
>>
>>101774630
>Any news source for the Auraflow thing or is this Discord talk?
someone posted a twitter? or other social media link/screenshot some threads back here
>>
>>101774580
there are nodes to accomplish this very easily in comfy
"layer diffuse"
>>
>>101774730
Okay, I'll dive for it. AuraFlow does make sense given his timeframe, if he wants to pump something out to keep the moolah going because of the license. But I would think he would try to request early access from the guy and maybe spend money to speed up the creation of the model, yet apparently he's just waiting. What a leech.
>>
>>101774463
>sees first post
Time to add the pastebin already?
>>
>>101774122
What kind of resources would it hypothetically take to finetune a flux model? Do we even know?
>>
>>101773826
>>101773626
catbox?
>>
>>101774964
Probably a serious cluster of H100s to do anything like Pony did. But it depends how easy the model is to teach, SDXL was particularly bad.
>>
>>101773407
The one in the center is getting her zen on. I like that. I'll go with her for that reason alone.




All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.