/g/ - Technology






File: tmp.jpg (956 KB, 3264x3264)
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>101917856

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
i think ai images SUCK
>>
>>101920566
AI thinks you suck, Anon.
>>
>>101920549
NTA but last night I went to bed at 2 AM trying to figure this out. I think something's borked with Forge's "automatic memory management".
>>
God I hope Comfy implements better support for multiple GPUs
It's annoying as fuck to fiddle with the workflow for loading LoRAs
>>
I spent the past 5 hours generating and fixing boomer prompts for my lora dataset and I'm not even a quarter through all the images

I have 500+ datasets with anywhere from 80 to 1000 images in each. Save me ldg my autistic hyper focus won't let me stop, I'm scared for my future
>>
>>101920607
Only if you consider memory leaks malware.
>>
File: 1711737890115968.png (1.48 MB, 1260x3740)
internvl2 sucks dick i fucking hate benchmarks
>>
>>101920615
>AI thinks
think again
>>
>>101920686
>chinks lying and cheating their way to "success"
NO WAY!
>>
File: ComfyUI_Flux_16.png (1.15 MB, 1216x832)
>>101920566
>>101920699
Apparently Impact is a "Striking, dramatic font"
>>
File: 1716929191354503.png (36 KB, 333x695)
OH IT DEFAULTS TO A TINY MODEL, WRETCHED SETTINGS, AND A CHINESE FUCKING PROMPT

MAYBE THAT HAS SOMETHING TO DO WITH IT BUT I CAN'T CHECK BECAUSE IT ERRORS WHENEVER I TRY TO USE THE 26B
>>
>>101920766
are you ok anon? :(
>>
>>101920781
i want good thing and world keep giving me bad thing instead
>>
so Q8 model is like fp16 but more efficient?
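Roughly, as a back-of-the-envelope answer: Q8_0 is a lossy 8-bit block quantization, so the weights take about half the space of fp16 rather than being the same thing. A minimal sketch of the arithmetic, assuming the standard GGML block layouts (32 weights per block plus fp16 scale metadata) and the ~12B parameter count quoted for Flux elsewhere in this thread. `weight_size_gib` is a hypothetical helper, and real GGUF files may keep some tensors at higher precision, so actual file sizes will differ somewhat:

```python
# Approximate bits per weight for each storage format.
# Assumption: standard GGML block layouts, 32 weights per block.
BITS_PER_WEIGHT = {
    "fp16": 16.0,
    "fp8": 8.0,
    "Q8_0": 34 * 8 / 32,  # 32 int8 weights + one fp16 scale = 34 bytes per block
    "Q5_1": 24 * 8 / 32,  # 32 5-bit weights + fp16 scale + fp16 min = 24 bytes
    "Q4_0": 18 * 8 / 32,  # 32 4-bit weights + one fp16 scale = 18 bytes
}


def weight_size_gib(num_params: float, fmt: str) -> float:
    """Rough size of the model weights alone, in GiB (hypothetical helper)."""
    return num_params * BITS_PER_WEIGHT[fmt] / 8 / 2**30


if __name__ == "__main__":
    params = 12e9  # assumption: ~12B parameters, the figure quoted in this thread
    for fmt in BITS_PER_WEIGHT:
        print(f"{fmt:>5}: {weight_size_gib(params, fmt):5.1f} GiB")
```

This only counts the weights. Text encoders, the VAE and activations need memory on top, and the numbers say nothing about image quality, which still has to be judged by eye.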
>>
File: Tfmlk6XJob.png (104 KB, 1473x514)
>>101920444
autocaption as comparison...
https://github.com/Z-L-D/Autocaption
>>
>>101920686
Alright but you are judging 26B which is the worst open version they have, not 76B or Pro version.
>>
>>101920820
dude that's a 7B he's comparing it to
>>
>>101920808
man that's limp

>>101920820
no i was judging 8B because that's what the space defaulted to and it errors whenever I want to try 26B

but joycaption
>>101920414
>>101920444
is also 8B and it blows that shit out of the water
>>
>>101920808
Complete, concise, exhaustive. Local won.
>>
>>101920823
See >>101920808
In that case it was 8B, even worse than 26B
>>
>>101920636
i see. yea, i'm seeing a lot of other people are having the same issue, which is somewhat comforting. hopefully lllyasviel finds a fix for it. for now, back to pony/SDXL i guess.
>>
that lad who got 1.5k steps of lora training on 8gb vram in under 5 hours, how did it go?
>>
>>101920851
died in a housefire
>>
Flux can't do sweaty women like DALL-E 3 can
>>
>>101920849
I think Forge is a prime example of a hobbyist developer who goes parabolic, churns out a ton of low quality code really fast, and the project ends up grinding to a halt or collapsing after the technical debt gets too large. Many such cases.
>>101920869
Or like SD 1.5 can.
>>
have they finetuned soul into flux yet?
>>
File: FLUX__00004_.png (974 KB, 896x1152)
>>101920869
I couldn't get it to do any real degree of moisture
>>
>>101920876
1.5 can't really
>>
>>101920851
I think you might mean me (I don't know, if there's some other vramlet anon who posted about this I'm sorry). I didn't test the loras since they had no boomer prompt captioning, working on that then will run training again.
>>
>>101920895
what were your settings?
>>
Blessed thread of frenship
>>
File: file.png (1.08 MB, 1024x1024)
>>101920876
indeed
>>
What comfyui workflow do you guys use for flux?
>>
THREAD THEME: make something cool in the sky
>>
>>101920900
https://desuarchive.org/g/thread/101908455/#q101910682
based on the low vram use I saw I'm guessing I could try to push the settings a bit more, that was just based off kohya's recommendations for vramlets
>>
>>101920935
thanks, i'm gonna give it a stab tomorrow too
>>
>>101920869
https://civitai.com/models/642235/sweaty-shirt-armpit-sweat-pit-stains-wet-spots-flux-dev1
>>
>>101920880
For anime these LoRAs look interesting, everything else meh
https://civitai.com/models/649031/inksketch-flux?modelVersionId=726131

https://civitai.com/models/640405/flux1-dev-modern-anime?modelVersionId=719943

https://civitai.com/models/647940/flux-atilessence-lora-test?modelVersionId=724910

https://civitai.com/models/648623/ascii-art-flux?modelVersionId=725667
>>
>>101920869
all bodily secretions or excretions are forbidden.
>>
>>101920907
Dual CLIP loader for text encoders.
Force model to device to force CLIP and VAE to CPU.
Bootleg GGUF loader to load Q4_0.
VAE loader for ae.safetensors.
Then everything else as normal.

But it's really slow. I'm sticking to 1.5 until I get a new GPU.
>>101920931
I'm at work REEEE
>>
>>101920946
>skin remains perfectly dry
>>
>>101920941
good luck anon, may the vramlet gods smile down upon us both
>>
>>101920957
Thanks
>>
File: ComfyUI_02544_.png (1.12 MB, 1344x768)
>>
File: Capture.jpg (239 KB, 2405x1292)
>>101920959
>>101920950
>>101920904
>>101920869
Works on my machine
>>
>>101921001
Her shoulder looks like a salted pretzel
>>
>>101921001
mpox-coded
>>
>>101921001
it's the fucking monkeypox
>>
>>101921001
bitch lookin like a hamburger bun egg wash and sesame seed ass ho nigga
>>
>>101921001
she has fungus growing on her
>>
>>101921001
Why does flux love the shitjourney wax look?
>>
File: ifx33.png (1.6 MB, 1024x1024)
>>
you guys just ended his whole genning career
>>
File: 1722958819_S1.png (3 MB, 1024x1024)
>>101920869
dalle is king

>>101921001
looks...bad
>>
>>101921001
congratulations, you made flux look like shit
>>
File: 1722959949_S1.png (3 MB, 1024x1024)
>>101921091
but anyway.. not a dalle thread
>>
File: ifx54.png (1.65 MB, 1024x1024)
>>
File: FD_u_00006_.jpg (556 KB, 1536x2432)
Can't believe I'm saying this but I'm bored of booba. Time to explore the latent space for gen ideas.
>>
File: Capture.jpg (371 KB, 3305x1660)
>>101921008
>>101921009
>>101921010
>>101921016
>>101921031
>>101921032
>>101921091
>>101921093
kek, what about that one?
>>
File: FD_u_00060_.jpg (627 KB, 1536x2432)
>>
File: fp120.jpg (266 KB, 1216x832)
>>
>>101921164
god man, call me back once flux can gen actual humans and not one of those stupid fucking realistic wax figures. fucking SHIT! i HATE THIS! FUCK!
>>
File: 34789563451324.png (1.15 MB, 720x1280)
>>
File: FD_u_00061_.jpg (719 KB, 1536x2432)
When is the LoRA to de-chin butt coming out?
>>
Anyone here making LoRAs yet? I can't get over how they come out almost perfect all the time.
>>
Despite the occasional meme chairman Trump gen I'm starting to see how Flux is visually stunning, but really limited.
>>
>>101921192
I would love to. I made a bunch of LoRAs for XL, would love to re-do them for flux.
I am smooth brained with a beefy gpu though so need to be spoonfed a training script for Kohya.
>>
File: ComfyUI_00061_.png (1.3 MB, 832x1216)
>>101921174
refreshing
>>101921182
she seems to like her job
>>
>>101921219
https://github.com/ostris/ai-toolkit
It cannot get more simple than this. In fact, I found it simpler than Kohya. I will switch when Kohya is fully cooking again though.
>>
>>101921174
god I wished I could get the Loras working
>>
File: 00000-3070375651.jpg (261 KB, 600x800)
>>101920270
The thing is, I'm still able to run the original dev/schnell model with fp8 weights. Even with t5 fp16 it still runs without OOM, even though I only have 16 GB of RAM. It's probably just a problem with the nf4/gguf node implementations.
>>
File: Capture.jpg (344 KB, 3056x1440)
>>101921100
>>101921091
kek
>>
File: t_1.jpg (447 KB, 688x1216)
>>101921192
It's crazy how well the model generalizes. You can include a dozen well-captioned nudity shots in the dataset and it would just work. In early SDXL days restoring anatomy knowledge was way way harder.
>>
>>101920567
I get "AssertionError: You do not have CLIP state dict!" error and if I pick clip_l in the VAE options I get "AssertionError: You do not have T5 state dict!"
What else should I add to make the flux Q5_1 work in forge?
>>
>>101921241
I am just used to kohya, my problem is I am a retard with the settings, I don't know what to set any of the shit to, so that's the bit I need spoonfed.
>>
>>101921289
The future is looking bright. I didn't even need to caption my shit and it restored anatomy.
>>
>>101921164
lower your cfg to 1.5 - 2.0 and try again
3.5 gives too much of a waxy feel
add some additional detail to the prompt
also i'm pretty sure that negative prompt doesn't do anything because flux
>>
>>101921306
>also i'm pretty sure that negative prompt doesn't do anything because flux
it does, it removed the sand she had on her shoulder on the first render >>101921001
it can even remove blur if you want https://imgsli.com/Mjg3OTYw
>>
File: FD_u_00063_.jpg (579 KB, 1536x2432)
>>
Are loras working with gguf on comfy?
>>
>>101921330
That is a man
>>
>>101921338
Here, now you don't need to ask anymore.
https://civitai.com/search/models?sortBy=models_v9&query=gguf
>>
What is the difference between this place and /sdg/? They're both primarily Flux threads.
>>
>>101921365
Didn't even answer the guy's question and linked to some checkpoints the guy probably already has. Useless retard.

>>101921338
Not as of yet.
>>
>>101921383
Go to /sdg/ and ctrlf "PW" and see why we avoid it like the cyst it is.
>>
>>101921362
nobody asked
>>
File: Capture.jpg (351 KB, 3011x1484)
>>101921306
I went for GuidancePos = 3, going lower gave me weird artifacts, so far that's the best one I got
>>
File: 184858_00001_.png (1.29 MB, 1024x1024)
>>101920268

how "vintage" are we talking?
>>
>>101921402
>Didn't even answer the guy's question and linked to some checkpoints the guy probably already has. Useless retard.
I linked him to the place where LoRAs will appear for gguf when they start appearing numbnuts, so he doesn't have to ask here literally every thread like he has been doing.
>>
File: 1194762365.png (1.42 MB, 896x1152)
>>
>>101921451
>where LoRAs will appear for gguf
It's very likely that LoRAs can be used without the being specifically for GGUF, stop jumping the gun. You have fucking downs syndrome and nobody has the heart to tell you you fucking retard.
>>
>>101921430
your multiple sweat prompts caused it. it's doing what you told it to.
>>
>>101921468
First of all, it's Down syndrome, not Downs syndrome, and you should know that. Also speaking of Down syndrome, it's impossible for flux to generate a person with it. We need a retard LoRA.
>>
>>101921508
really? it can't generate mongolians?
>>
File: 3849765897346.png (1.09 MB, 720x1280)
>>101921231
Of course, it's pretty cushy just triaging to the people who do the real work and have to actually take their job seriously
>>
File: FD_00154_.png (1.6 MB, 1024x1024)
>>101921570
Maybe there's some magical prompt that can do it but I haven't found it and it sucks because I want to make tard memes.
>>
Can't get comfyui to run with zluda on windows
>>
File: FD_00160_.png (1.53 MB, 1024x1024)
the aftermath of yjk
>>
File: Capture.jpg (357 KB, 3272x1526)
>>101921496
Ok, fair enough. I tried it again and used the word "sweat" only once here
>>
File: 1702325647305483.jpg (83 KB, 738x1292)
>>
>>101921647
I look like that
>>
>>101921658
Rich pipo looklike that
>>
>>101921668
In my country poor people look like that. Rich people can afford vegetables.
>>
>>101921658
Can you see your own dick ?
>>
>>101921680
In my country poor people eat vegetable and rich people can afford meat
>>
>>101921455
Prompt for this aesthetic?
>>
>>101921696
The physique is achieved by eating exclusively fried chicken and burgers because they cost as much as a single capsicum.
>>
File: 1843810418.png (1.31 MB, 896x1152)
>>101921716
That's just the default look, I threw in some song lyrics.
>>
File: FD_00170_.png (1.25 MB, 1024x1024)
>>101921612
How DARE you
>>
File: FD_00171_.png (1.56 MB, 1024x1024)
This says a lot about society
>>
>>101921642
looks 100% better. drop the cfg even more, try 2.0
>>
File: 3796061871.png (1.18 MB, 889x906)
>>
File: ComfyUI_04209_.png (1.12 MB, 1024x1024)
>>
>>101921817
This thread is for AI images only, no real photos allowed.
>>
>>101921812
by "cfg" you mean the actual "cfg" or the GuidancePositive?
>>
>>101921828
can't you see the 'AI earrings' ?
>>
File: ComfyUI_02546_.png (1.42 MB, 1024x1024)
>>
>>101921874
I don't know Anon, my wife has all kinds of retarded earrings, I can believe them.
>>
>>101921901
LUV ME NODES
LUV ME SPAGHETTI
LUV ME 1GIRLS
ATE WEBUI
ATE GRADIO
SIMPLE AS
>>
File: ifx56.png (1.3 MB, 1024x1024)
>>101921908
I look at too many AI images kek
>>
File: FD_00182_.png (1.43 MB, 1024x1024)
This is harder than it should be. She is literally in water and bone dry
>>
>>101921944
what are you trying to achieve?
>>
>>101921957
wetness
I did manage to achieve a consistent person though, but I want to make a person wet. Flux can't into wet people.
>>
>>101921977
>Flux can't into wet people.
yes it can, just look at a few posts higher >>101921824
>>101921642
>>101921430
>>
>>101921855
not sure what it is in comfy, but it looks like what you changed last time (to 3.0), so probably that one.
>>
File: ComfyUI_04212_.png (1.09 MB, 1024x1024)
>>
File: FD_00192_.png (1.2 MB, 1024x1024)
>>101921988
I am not trying to achieve a sweaty person, I am trying to achieve a wet person.
This is the closest so far. Gonna try dropping the guidance, see if it makes a difference.
>>
>>101922008
give me your prompt and I'll see what I can achieve
>>
File: ComfyUI_02555_.png (1.74 MB, 832x1216)
>>
>>101921824
>>101922002
>hair is dry
>>
File: ifx55.png (1.66 MB, 1024x1024)
>>101922023
noice
>>
>>101921989
>not sure what it is in comfy, but it looks like what you changed last time (to 3.0), so probably that one.
it's the GuidancePositive, it's the equivalent to "distilled cfg" on Forge, and like I said I can't put this lower I got fucked up results, this is my limit
>>
File: ComfyUI_04213_.png (1.17 MB, 1024x1024)
>>101922024
>>
>>101921988
>>101922022
NTA, but I don't think you can get it. I tried every word for wet and still didn't get it. And while you're at it, try getting some soap bubbles too.
>>
>>101922022
A woman standing inside a waterfall with water pouring on her head and over her body, her skin is shiny and glistening from the water, and her hair is dripping with water.
>>
>>101922080
>NTA, but I don't think you can get it. I tried every word for wet and still didn't get it. And while you're at it, try getting some soap bubbles too.
If you haven't tried Tonemap then it means you haven't tried enough
https://reddit.com/r/StableDiffusion/comments/1estj69/remove_the_blur_on_photos_with_tonemap_an/
>>
File: FD_00201_.png (1.19 MB, 1024x1024)
>>101922070
She has a hydrophobic coating
>>
what is the status of finetune rn?
my dataset is waiting for the 8xh100 :>
>>
File: 2153540605.png (1.18 MB, 896x1152)
>>101921828
Sorry, won't happen again.
>>
File: ComfyUI_02563_.png (1.74 MB, 832x1216)
anyone going deis/ddim_uniform? getting good results with it

>>101922099
does

>her skin shines, slick with a thin layer of water.

help your gen?
>>
File: Capture.jpg (383 KB, 3281x1488)
>>101922086
This is my first attempt, I think I can do better
>>
File: Capture.jpg (345 KB, 3109x1389)
>>101922086
>>101922164
ok that one is good, never touching GuidancePos ever again kek
>>
File: FD_00206_.png (1.38 MB, 1024x1024)
>>101922153
Interesting. I used "a thin layer of crystal clear gel"
>>
>>101922200
goddamn that looks good
>>
>>101922203
It would be better with real nipples.
https://files.catbox.moe/hvrecd.png
>>
>>101922203
wut lol no it doesn't, she's got the mpox or some pond scum on her, her skin looks like wax, and the edges of the tan lines look way too sharp.
>>
>>101922220
just give it a low-denoise img2img pass in one of the SDXL models.
>>
File: 1709366863057397.jpg (77 KB, 862x1048)
>>101922093
>you didnt tried this trick
You can also inpaint it using SDXL models
>>
>>101921828
anon... they literally have the ai same face
>>
File: FD_00207_.png (1.25 MB, 1024x1024)
>>101922153
>her skin shines, slick with a thin layer of water.
Helps a little
>>101922238
I have a full SDXL nippling workflow for flux. But in this case I am not interested in the nudity, not even interested in the woman really, I just don't like when I have a hard time achieving something, especially when the model is so capable in most other areas.
It's a personal challenge to make a wet person now.
https://files.catbox.moe/h8avmw.png
>>
>>101922255
why should I need inpaint, Flux can do it on its own
>>
File: 0.jpg (118 KB, 1024x1024)
>>
>>101921231
brap
>>
File: ComfyUI_04220_.png (1.33 MB, 1024x1024)
>>
>>101922305
Just looks crispy
>>
>Change strength of LoRA in workflow
>Entire model reloads on to vram from scratch

What the fuck is causing that?
>>
>>101922261
give me the workflow of your picture, I'll try to improve it
>>
Pro can do it. Dev limitation.
>>
>>101922473
posted >>101922220
>>
>>101922447
Idiotic devs
>>
>>101922220
bruh your workflow doesn't have a seed how can I replicate that in the first place?
>>
>>101922447
Same issue on sd.next
>>
>>101922260
They’re siblings, you bigot.
>>
>>101922476
Pro vs Dev
>>
File: 2446606958.png (1.5 MB, 896x1152)
>>
>>101922533
Now that dev has functional and extremely effective LoRA support and at home fine tunes on the way, I really won't give any shits for a few months until this model has truly been tapped
>>
>>101922533
other way around, or is Dev actually better?
>>
>>101922545
wait im retarded and can't into read. esl sirs must understand
>>
>>101922545
nta, but I think Dev is objectively better like 60% of the time. Oftentimes it's a complete tossup though.
>>
>>101922533
desu dev can be closer to pro with CFGmaxxing and finetunes will improve this shit further, the future is looking great
>>
>>101922540
I agree, I just hate hitting these kinds of limitations. I know for sure someone will do a wet LoRA. But it's less about the actual wetness itself and more the fact that I hit a limitation. Still, nothing like SD3 where day 1 we were struggling with a woman laying on grass.
>>101922545
Pro is left Dev is right how can you think right looks better? Left is clearly wetter.
>>
>>101922570
sorry sirs
>>101922551
>>
File: ComfyUI_02574_.png (1.63 MB, 832x1216)
average spanish tourist trap
>>
>>101922447
Comfy should hang for this bullshit. We have had enough, bros.
>>
File: fp121.jpg (269 KB, 1344x768)
>>
>>101922598
Beautiful. Would live there.
>>
File: FD_00005_.png (1.36 MB, 1024x1024)
>>101922586
>>
Anyone tried this?

https://www.reddit.com/r/StableDiffusion/comments/1et0mlj/guide_to_use_flux_on_forge_with_amd_gpus_v20/
>>
File: ComfyUI_02585_.png (1.48 MB, 832x1216)
>>101922622
why is she holding the sign backwards? is she stupid?
>>
File: fuckinghands.png (2.09 MB, 1434x717)
>compose with flux
>upscale with SDXL
>it fucks up the hands

every fucking time
>>
>>101922664
denoise is too high matey
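For context on why denoise matters here: in img2img the denoise strength decides how much of the noise schedule gets re-run on top of the input image, so a high value lets the second model redraw details like hands. A rough sketch, assuming the diffusers-style convention where the number of steps actually run is `int(steps * denoise)`. Other UIs (ComfyUI included) may slice the schedule differently, and `img2img_steps` is a hypothetical helper, not any UI's API:

```python
def img2img_steps(total_steps: int, denoise: float) -> int:
    """Steps that actually run for a given denoise strength.

    Assumption: steps_run = int(total_steps * denoise), capped at total_steps.
    A denoise of 0 leaves the input untouched; 1.0 regenerates from pure noise.
    """
    if total_steps < 0:
        raise ValueError("total_steps must be non-negative")
    if not 0.0 <= denoise <= 1.0:
        raise ValueError("denoise must be between 0 and 1")
    return min(int(total_steps * denoise), total_steps)


if __name__ == "__main__":
    # Compare a gentle refinement pass with a heavy one on a 20-step schedule.
    for strength in (0.25, 0.75):
        print(strength, img2img_steps(20, strength))
```

Under that assumption a low-denoise upscale only runs the last few, low-noise steps, which leaves the Flux composition mostly intact.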
>>
>>101922656
>tourists go home
>refuge welcome
>is she stupid
yes
>>
Is it true that Flux is "censored"?
>>
File: ComfyUI_02581_.png (1.55 MB, 832x1216)
>>101922683
nah mate look at these nips, wheyyy
>>
>>101922664
Don't upscale with SDXL. Just alter specific parts with masks.
>>
>>101922683
I doubt there was any porn in the training data, so I would say no
>>
>>101922691
>can't make a wet woman
>can make a wet obese bald man
what did they mean by this?
>>
File deleted.
>>101922691
>>101922694
Not porn but controversial prompts. Can it do what grok does?
>>
File: FD_00178_.png (1.97 MB, 1024x1024)
>>101922683
Depends on what you mean by censored.
>>
>>101922693
The problem is Flux looks kind of fuzzy and low resolution with that awful DoF, so I have to upscale one way or another. Perhaps it's because I'm using the low-VRAM version
>>
File: ComfyUI_02566_.png (1.72 MB, 832x1216)
>>101922713
Mate. Grok IS Flux.
https://x.ai/blog/grok-2

>In collaboration with Black Forest Labs, we are experimenting with their FLUX.1 model to expand Grok’s capabilities on X
>>
>>101922683
Depends what you mean by censored. It appears to have like 0 pornographic material in the training data, but there were clearly some tasteful nudes in the set.
Even the most basic of LoRA training puts high quality nudity right back in too.
So it's not really censored so much as lacking excessive NSFW content. But model seems more than happy to learn NSFW stuff in short order.
>>
>>101922713
You're gonna get a warning for that. I did one of Trump shooting up a school and got the janny stink on me, all the while some retard was posting literal cp.
>>
>>101922713
>Can it do what grok does?
>>
Give me the low down on Grok?
I don't think anything could make me subscribe to a phone app. I don't login to anything ever, rather suffer with janky ComfyUI and waste hours of my time.
Elon's gonna make it open source soon and then we will all be making hardcore Grok gens with ComfyUI... right? Gens we've never dreamed of. I CAN'T WAIT
>>
>>101922731
I bet SAI are fucking FUMING
>>
File: GU_CWHwWcAAx18X.png (738 KB, 897x685)
>>101922736
I deleted the file but like that

>>101922731
Interesting
>>
>>101922772
Bait used to be believable.
>>
>>101922772
It's literally just a straight dump of Flux.1 Pro with no guard rails.
>>
File: hqdefault-439547110.jpg (13 KB, 480x360)
>>101922772
>>
>>101922773
They will be fine. They are trying to make SD extra safe. When they achieve it, they will be raking infinite money. The AI game is all about safety.
>>
>>101922784
kinda goes hard lad ngl
>>
>>101922772
Too obvious, but it's pretty astounding how many people I've seen legitimately compare Flux with Grok and not realize they're basically comparing the same model.
>>
>>101922810
isn't it weird how bing is just as good as dalle3
>>
>>101922814
Totally mind blowing.
>>
File: FD_00014_.png (1.55 MB, 1024x1024)
>>101922664
Thanks for the prompt idea Anon
>>
I think what's interesting about BFL is that they've actually managed to make a model that front ends can service to their customers and people actually like to use.
SAI was never able to do that. Their API was only ever used begrudgingly in the absence of the open weights.
>>
File: ComfyUI_04237_.png (1.14 MB, 1024x1024)
>cfg 1
>"A sweaty woman"
>Gives me a woman with blood on her face
wtf flux?
>>
File: Capture.jpg (309 KB, 3108x1372)
>>101922839
how can flux think sweat is blood?
>>
>>101922810
I really like the fact that Grok and Musk are getting blamed for all the edgy gens and not Flux kek
>>
>>101922848
Maybe she has hippo dna
https://en.wikipedia.org/wiki/Hipposudoric_acid
>>
>>101922848
is there a point to doing 30 steps instead of 20?
>>
>>101922868
They're mainly seething because twitter released a competent product.
>>
>>101922884
>is there a point to doing 30 steps instead of 20?
it's for more consistency in quality
https://reddit.com/r/StableDiffusion/comments/1er3wt7/if_you_want_a_good_compromise_between_quality_and/
>>
>>101922848
We need better datasets, man. Can't help but feel we aren't even close to taking advantage of the 12B parameters.
>>
File: GVAx7X5aMAAdThx.jpg (88 KB, 767x767)
>>101922772
It still has limited instances, can't generate unlimited images

>>101922848
How much time it takes to generate it?
>>
File: Capture.jpg (38 KB, 2760x175)
>>101922902
>How much time it takes to generate it?
>>
File: fp123.jpg (204 KB, 1024x1024)
>>
>>101922810
They are clearly different models. Maybe based on the same underlying architecture, but they give similar yet different outputs. Here is a comparison I made.
https://www.reddit.com/r/StableDiffusion/comments/1etjfbr/flux_reply_to_grok_2/
>>
File: FD_00008_.png (736 KB, 1024x1024)
>>101922800
>The AI game is all about safety
>>
>>101922943
they probably finetuned pro to make it better?
>>
File: 1946940357.png (1.01 MB, 896x1152)
>>
>>101922943
>They are clearly different models.
I am not sure they are. Aren't user prompts fed through grok then to BFL? It's possible the prompts grok feeds to the API are not the same ones the user typed in.
I'd imagine they just use flux pro. Is it stated anywhere they have any other models?
>>
https://civitai.com/models/650637/liarinchief?modelVersionId=727931
>>
>>101923022
a Lora trump? lol, flux can render Trump just fine on its own
>>
>>101923022
>Dude is just sitting there shouting at phantoms about how he won't back down to bullying.
>>
>>101923036
Why are you defending the liar in chief, bootlicker? Is that boot tasting good?
>>
forge dev is a fucking numbskull
too lazy to scroll up, are other people complaining about bugs in the latest forge too?
>>
>>101923022
>Discussion is turned off for this model.
just wanted to praise his choice of a sensible rank size
>>
File: streetlamp lemoose.png (1.25 MB, 1536x640)
>>101922943
>>101922893
>>101922641
>>101922093

omg, you do reddit? THE NARWHALS BACON AT MIDNIGHT! haha, upvotes to the left fellas.

YOU SIRS, WIN THE INTERNET ON THIS BLESSED DAY
>>
>>101923070
Imagine sitting there waiting 7 minutes just to see this.
>>
>>101923095
shit i forgot a line of my post
this is forge failing to generate a grid
>>
File: file.png (88 KB, 540x296)
>>101923070
>filename

kek
>>
File: Untitled.png (298 KB, 380x379)
>>101923067
>This is the guy calling you Liar in Chief.
>>
>>101923121
lmaooo, they really all have the same face isn't it?
>>
>>
File: FD_00033_.png (1.24 MB, 1024x1024)
>>101923022
>>
>>101922893
all that shows is a cartoon miku example
i think for realistic looking people 20 steps looks more natural, but that might depend on the rest of your setup too ofc
>>
File: ComfyUI_02611_.png (1.56 MB, 832x1216)
>>101923177
>>
File: 0.jpg (188 KB, 1024x1024)
>>
File: FD_00035_.png (1.11 MB, 1024x1024)
>>101923232
>>
File: 2841246117.png (970 KB, 896x1152)
>>
if you guys are gonna post trump, do "trump doing Gangnam Style on the lawn of the whitehouse"

>(improve prompt as you will)
>>
>update forge
>now VAEs don't show up
what is this guy doing? really?
>>
>I can't wait for pony dev to make pony flux
>150 upvotes
>>
>>101923300
What's the workflow for this? All I get is slop on flux comfyui, takes 7 minutes to generate a 1024x1024 too
>>
File: ComfyUI_02615_.png (1.24 MB, 1216x832)
>>
>>101923345
>flux
>cfg 8
>karras
come on now
>>
>>101920957
>slow
Q4_0 and Q8_0 are really slow for some reason, but the dev_fp8 version works flawlessly, with some adjustment of course. generation time can vary between 1:30min~5:30min
>>
>>101923367
Fantastic gen, Anon
>>
File: stablediffusion13.jpg (337 KB, 1552x1200)
>>
>>101922045
Is that dalle or this:
https://civitai.com/models/646686/japanese-photo-1980s-style-1980

?
>>
>>101923394
How do you get it down from 5 to 1 minute? Nothing I do on my 3060 seems to work.
>>
>>101923394
>generation time can vary between 1:30min~5:30min
It offloads to your RAM or, even worse, pagefile.
>>
>>101920931
I missed this. I miss thread themes too. Next time post this with an example gen of the theme you want to see.
>>
>>101920553
WHAT THE FUCK THERE ISN'T ANY FORCE/SET CLIP DEVICE NODE I CHECKED EVERYTHING FUCKING DISGRACE PIECE OF SHIT
>>
https://civitai.com/models/651004/giger?modelVersionId=728323
Finally some actual kino
>>
>>101923490
>what is a custom node
>>
File: 748103618.png (1.09 MB, 896x1152)
>>101923345
I'm on forge, it just works
>>
>>101921189
prompt japanese or korean, they don't suffer from it
>>
>>101923490
it's here
https://reddit.com/r/StableDiffusion/comments/1el79h3/flux_can_be_run_on_a_multigpu_configuration/
>>
>>101923490
ComfyBootlegOffload.py
>>
>>101923536
I don't like Asian people
>>
>>101923490
>https://www.reddit.com/r/StableDiffusion/comments/1el79h3/flux_can_be_run_on_a_multigpu_configuration/
>>
>>101923345
>flux
>karras
>cfg > 1
>negative prompt
What are you doing?
>>101923526
Forge is slower and buggier than any other UI out there.
>>
>>101923345
>>101923561
Worst of all it's in light mode
>>
>>101923345
you can't put a CFG > 1 without adding either Dynamic Thresholding or Tonemap, what are you doing anon?
https://reddit.com/r/StableDiffusion/comments/1ekgiw6/heres_a_hack_to_make_flux_better_at_prompt/
https://reddit.com/r/StableDiffusion/comments/1estj69/remove_the_blur_on_photos_with_tonemap_an/
>>
link to the Q4_0 flux model?
>>
>>101923345
Since nobody actually explained and just gave you shit, Flux only works on cfg of 1, and because of this negative prompting does not work.
Set your cfg to 1.
Remove your negative prompt text (keep the node)
Change your scheduler to "Simple"
Run again, it will be faster and good.
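Rough sketch of that checklist as code. The dict keys and the function name are made up for illustration; this is not ComfyUI's actual settings schema, just the three checks from the steps above:

```python
# Hypothetical settings checker; key names are illustrative, not a real ComfyUI schema.
def flux_setting_problems(settings):
    """Return a list of fixes for settings the advice above says to avoid."""
    problems = []
    if settings.get("cfg", 1.0) != 1.0:
        problems.append("set cfg to 1")
    if settings.get("negative_prompt", "").strip():
        problems.append("clear the negative prompt text")
    if settings.get("scheduler", "simple") != "simple":
        problems.append("switch the scheduler to simple")
    return problems


bad = {"cfg": 8.0, "negative_prompt": "blurry", "scheduler": "karras"}
good = {"cfg": 1.0, "negative_prompt": "", "scheduler": "simple"}
print(flux_setting_problems(bad))
print(flux_setting_problems(good))
```

An empty list means the settings match the advice; anything else is what to change before running again.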
>>
>>101923467
>Next time post this with an example gen of the theme you want to see.
i'm making it vague on purpose so people can interpret it however they like
>>
>>101923606
https://huggingface.co/city96/FLUX.1-dev-gguf/tree/main
>>
File: F52l3XEW0AAEU43.png (58 KB, 144x204)
>>101923540
THAT ISN'T A FUCKING WEBSITE YOU PIECE OF SHIT JUST SEND ME A FUCKING DOWNLOAD URL OR WHATEVER

FOLLOWED A YOUTUBE TUTORIAL FOR 4 HOURS JUST FOR THIS SHIT TO NOT WORK OUT OF THE BOX

WHO WOULD DESIGN IT LIKE THIS. I JUST WANT TO DOWNLOAD IT WITHOUT HAVING TO DO THIS WEIRD SHIT, WHY ISNT THERE JUST A .EXE I CAN INSTALL??
>>
>>101923616
I usually only read prompts that contain gens.
>>
File: FLUX_00029_.png (1.32 MB, 896x1152)
>>
File: 1718311011437270.jpg (201 KB, 1149x1170)
>>101923618
>>
>>101923614
he can put a cfg > 1 and use negative prompt if he uses DT or Tonemap though
>>
>>101923614
sgm uniform is the best scheduler
>>
>>101923629
about to try comfyui for the first time in ages, please catbox this so i can replicate it (nude) (this made me horny)
>>
Any recommended optimizations for the 3060 please?
I am stuck with 5 minutes per gen no matter what I do. I redownloaded the models, reinstalled comfyui, updated the gpu drivers, but nothing works
>>
>>101922598
Home...
>>
What UI do I need to use to use flux GGUF? It doesn't work on comfyui
>>
File: dd.jpg (352 KB, 1024x1024)
>dark theme
>>
>>101923657
Buy a better GPU.
>>101923665
Are you a zoomer? Literally use a search engine.
>>
>>101923665
https://github.com/city96/ComfyUI-GGUF

come home white man
>>
File: Comparison_all_quants.jpg (3.84 MB, 7961x2897)
>>101923657
Force the text encoder to your cpu with this
https://www.reddit.com/r/StableDiffusion/comments/1el79h3/flux_can_be_run_on_a_multigpu_configuration/
And with your 12gb of vram, run a model that asks for less than 12gb, go for Q4_0 or Q4_1 for example
>>
>>101923685
go back to your containment thread d*bo
>>
>try a workflow after several minutes of searching for a basic fucking txt2img because comfyui users are linux adhd spergs that need 20 billion controls
>run it
>realize this bitch is doing 2 extra levels of upscaling in such a way that it blew up a 1536 image to twice the size
>didnt even run out of vram
okay, credit where it's due, that's kinda rad. But how do i set this up so i can CHOOSE if i want gens to be upscaled?
>>
>>101923698
It's amusing how every general on this site has a resident boogeyman, and everyone I don't like is that boogeyman.
>>
>>101923635
He is already retarded please don't confuse him. Keep it simple
>>
>>101923706
shit my bad i thought i linked this at the end
https://comfyworkflows.com/workflows/e6c1c436-f878-4cc3-be0a-43ee96864467

>>101923711
to be fair it is true that every general has at minimum 1 schizo
>>
File: file.png (3 KB, 535x37)
>>
>>101923713
true true, fair enough
>>
>>101923697
I did that already, forced the clip to cpu, and for the model I am using flux1-dev-fp8, it's what I saw users reporting 2 minutes or so per gen by using it.
>>
>>101923721
That's a bit overkill. Why not just do this? >>101920957
>>
>>101923757
brother i have no clue what's overkill, this shit isn't intuitive and there's no real handholding, download a preset workflow and pray it works.
also im not even using flux its just sdxl.
i want to try flux after i get the hang of comfy again though.
>>
File: Capture.jpg (10 KB, 440x160)
>>101923743
>I am using flux1-dev-fp8, it's what I saw users reporting 2 minutes or so per gen by using it.
fp8 asks for 14-15 GB of VRAM during inference, I'm actually running it right now, pic related
Your 3060 isn't enough, it's offloading the 2-3 GB surplus to your RAM, and that shit makes everything slower
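Back-of-the-envelope version, using only the numbers from this thread (about 14 to 15 GB needed by fp8, 12 GB of VRAM on the 3060). The function name is made up:

```python
def vram_overflow_gb(required_gb, vram_gb):
    """How many GB spill out of VRAM (0.0 if everything fits)."""
    return max(0.0, required_gb - vram_gb)


for required in (14.0, 15.0):
    spill = vram_overflow_gb(required, vram_gb=12.0)
    print(f"needs {required} GB on a 12 GB card -> {spill} GB offloaded to system RAM")
```

Anything above zero ends up in system RAM, which is where the slowdown comes from.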
>>
File: FLUX_00030_.png (1.16 MB, 896x1152)
>>
>>101923628 oookayyy

THREAD THEME: delicious cats
>>
>>101923777
Wait, seriously? The Kijai one?
>>
Is there any way to fit q4_0 in 8GB of VRAM? I'm already putting clip and vae on the CPU, but Comfy still gives me OOM errors unless I go below 512x512, and it's slow as hell anyway.
>>
>>101923789
I hope we get a finetune that fixes the Madame Tussauds syndrome.
>>
>>101923818
yes
>>
>>101923821
did you activate the system fallback in the Nvidia Control Panel?
>>
>>101923789
She's got MY vote.
>>
>>101923841
I'm using AMD on Linux.
>inb4 amd
>>
File: ComfyUI_00010_.png (422 KB, 512x512)
>>101923391
>>101923561

Saw others use cfg 8

>>101923526
I'll try forge next, takes too much to generate now

>>101923587
>>101923614
Can flux do anything other than 512x512 and 1024x1024 properly?
>>
>>101923850
and MY axe!
>>
>>101923853
>Can flux do anything other than 512x512 and 1024x1024 properly?
yes, it's really resilient to resolution changes, you can have fun with it without many issues
>>
>>101923821
Werks for me. Restart your pc, turn off all the bloatware and try aga-
>>101923851
Nevermind, my condolences anon.
>>
File: FD_u_00005_.jpg (743 KB, 2048x2048)
>>101923853
Yes
>>
File: 48945123654533.jpg (208 KB, 410x550)
>>101923825
I will try the Q4_0 then, I will seriously go crazy if this turns out to be true.
>>
File: Untitled.jpg (369 KB, 1920x1267)
>>101923853
This is better but still has that slop look compared to 1.5
>>
>>101923851
>AMD
>on LINUX
quit while you're ahead
>>
Can flux be uncucked?
When will the nsfw models come out?
>>
>>101923878
go for Q4_1, it also fits in your VRAM and has better quality
>>
>>101923878
Try the 5 or 4_1 quant, they should fit just fine
>>
Bread so fresh you can smell it...
>>101923884
>>101923884
>>101923884
>>
>>101923888
I am NOT ahead.
>>
>>101923853
>Saw others use cfg 8
they use Dynamic Thresholding or Tonemap to make it work >>101923587
>>
File: ComfyUI_00556_.png (2.05 MB, 1536x1152)
>>101923853
their paper says up to 2mp, i've done 1408x1408 fine as well as higher res common aspect ratios like 4:3 1152x1536. Just keep it divisible by 8
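Quick sketch for picking a size under those two rules (up to about 2 MP, both sides divisible by 8). Reading "2mp" as 2 * 1024 * 1024 pixels is my assumption, and the helper names are made up:

```python
MAX_PIXELS = 2 * 1024 * 1024  # assumption: "2mp" read as 2 * 1024 * 1024 pixels


def snap_to_multiple(value, multiple=8):
    """Round down to the nearest multiple (never below one multiple)."""
    return max(multiple, (value // multiple) * multiple)


def check_resolution(width, height):
    """Return (snapped_width, snapped_height, within_budget)."""
    w, h = snap_to_multiple(width), snap_to_multiple(height)
    return w, h, w * h <= MAX_PIXELS


print(check_resolution(1408, 1408))
print(check_resolution(1152, 1536))
print(check_resolution(1925, 1085))
```

The first two are the sizes mentioned above; the third shows an odd size getting rounded down to the nearest multiple of 8 on each side.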
>>
File: FD_00043_.png (107 KB, 256x256)
>>101923853
>>101923877
And yes again
>Can flux do anything other than 512x512 and 1024x1024 properly?
>>
>>101923886
You can't un-slop schnell, try dev
>>
>>101923928
Those niggers only want to post their slop. Not talk about technical stuff. Nigger.
>>
>>101923945
>bu..but...
>>
File: tits.png (204 KB, 1149x774)
>>101923539
>>101923540
>>101923559
Thank you, nice white men

>>101923514
Fuck you
>>
File: ComfyUI_00012_.png (1.44 MB, 864x1080)
>>101923918
Takes too long to generate 1024

>>101923929
That's slower
>>
>>101923977
>t5xxl_fp8
>>
>>101923990
stop using those fucking brackets in the prompt
it won't solve anything but they annoy me
>>
File: ComfyUI_00557_.png (1.96 MB, 1536x1152)
>>101923990
>Takes too long to generate 1024
I have a 4080, but I noticed my it/s didn't really change as much as you'd expect from jumping the resolution. Maybe like 20 seconds longer per gen, these take me about 75 seconds a gen for 30 steps
>>
File: fp16.png (175 KB, 1059x699)
>>101923993
>sir the t5xxl_fp16 tits you ordered
flux1-dev-F16.gguf crashes. should i load the t5 on the cpu? is that it? i hate being a 32ramlet so much
>>
>>101924128
>should i load the t5 on the cpu?
yes you should do that
>>
>>101924128
>should i load the t5 on the cpu? is that it?
yes
>>
>>101924128
>>101924143
>>101924148

is that not what's already happening in the screenshot with the Force/Set CLIP Device node?
>>
>>101924159
and which device is set in that node, retard anonie?
>>
>>101924159
>cuda: 0
Anon, your cpu/ram are not a cuda device.
>>
>>101924159
no, it's putting your text encoder on "cuda:0" a.k.a. your gpu, you must change this value to "cpu"
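To make the distinction concrete, here is a tiny sketch that classifies PyTorch-style device strings like "cuda:0" and "cpu". It is plain Python and does not touch the node or torch itself; the function name is made up:

```python
def where_does_it_run(device):
    """Classify a PyTorch-style device string like 'cuda:0' or 'cpu'."""
    kind, _, index = device.strip().lower().partition(":")
    index = index.strip()
    if kind == "cuda":
        label = f"GPU {index}" if index else "default GPU"
        return f"{label} (uses VRAM)"
    if kind == "cpu":
        return "CPU (uses system RAM)"
    return f"unknown device type: {kind}"


print(where_does_it_run("cuda:0"))
print(where_does_it_run("cpu"))
```

So anything starting with "cuda" still eats VRAM; only "cpu" moves the text encoder into system RAM.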
>>
File: crashed.jpg (363 KB, 1617x903)
>>101924143
>>101924148
>>101924159
>>101924165
>>101924167
Still crashed :,,-(
>>
>>101924199
the fp16 is just too big, go for Q8_0, it has the same quality while being 2 times lighter (12gb) >>101923697
>>
>>101924199
Is it literally just impossible to run t5xxl_fp16 and flux1-dev-F16 as a 32GB ramlet?
>>
>>101924227
FP16 should fit in 24GB just fine
>>
>>101924365
I have a 24gb card and I OOM if I don't close google chrome
>>
File: ComfyUI_temp_bsxmx_00003_.png (761 KB, 1024x1024)
>>101924365
>>101924388
please give a secure link to download more ram
>>
>>101924439
THE MORE YOU BUY
THE MORE YOU SAVE
>>
>>101924443
that is like soooo truuue
>>
>>101924388
use your igpu for the desktop
>>
File: evolvingpepe.png (1.4 MB, 1018x1018)
>>
File: ComfyUI_temp_bsxmx_00008_.png (1.08 MB, 1024x1024)
>>101924519
How do i do that? is it just changing the display port to the one on the motherboard?
>>
>>101924568
yeah, VRAM utilization should show only a couple of megabytes after that
>>
File: 00048-AYAKON_12404478.png (3.32 MB, 1536x2560)
vi
>>
ComfyUI gguf lora yet?
>>
>>101923822
realism lora from XLabs
>>
>>101921182
are you inpainting the text? having a hard time generating readable text of that size.
>>
File: flux_00159_.png (909 KB, 1024x768)
strange thing happened to me in these threads,
I started hoarding pictures I like.
As if I will not see them again
Probably will never have time to open them, weird.
>>
>>101924817
yes, a couple of the initial gens were pretty close to that though. but yeah don't expect 5% of the area of your image to be able to sustain good text or hands etc, which i'm sure you know hence the question
>>
File: 2.png (1.11 MB, 1024x1024)
Is not bad bad nudes, but the model still need better fine tunes, I tried Asuka nsfw cosplayer and is good.
https://files.catbox.moe/8fyte0.png
>>
>>101925108
mama mia thats a spicie meatball
>>
>>101925108
go away Teebs
>>
File: 2024-08-16_00026_.png (1.26 MB, 720x1280)
>>101925108
she looks a bit young, please don't post that at least without a disclaimer on the catbox. cheers.
>>
>>101925538
Asuka literraly has 14 years... Your asuka is too old.
>>
File: img__00004_.png (1.08 MB, 832x1216)
>>
>>101925571
mine is explicitly not asuka since she 100% looks like an asian girl wearing a wig, not a half german girl; she is an older girl merely cosplaying asuka. and this must be respected as something with its own appeal. But also i live in a country where i could get fined or worse for accessing this site if someone posts clearly underage simulated photographic stuff :(
>>
File: file.png (856 KB, 1024x1024)
>>101923164
>ESL
>>
>>101925986
at least I'm not a fat bold democrat
>>
>>101926012
you're not even american
>>
File: img__00006_.png (1.1 MB, 832x1216)


