[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (738 KB, 3264x3264)
738 KB
738 KB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>101795805

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Kolors
https://gokaygokay-kolors.hf.space
Nodes: https://github.com/kijai/ComfyUI-KwaiKolorsWrapper

>AuraFlow
https://fal.ai/models/fal-ai/aura-flow
https://huggingface.co/fal/AuraFlows

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
File: 1709810167099524.png (38 KB, 1101x458)
38 KB
38 KB PNG
Why do we need two models for clip for flux?
>>
File: ComfyUI_01216_.png (1.32 MB, 1344x768)
1.32 MB
1.32 MB PNG
>>
>>101799537
One to send telemetry back to BFL
>>
File: ComfyUI_01138_.png (1.78 MB, 1024x1024)
1.78 MB
1.78 MB PNG
>>101799537
Your entire post as the prompt
>>
File: ComfyUI_01220_.png (1.31 MB, 1344x768)
1.31 MB
1.31 MB PNG
>>
>>101799537
It was trained on both clip and t5 is why
>>
File: ComfyUI_temp_rrpzr_00020_.png (1.45 MB, 1024x1144)
1.45 MB
1.45 MB PNG
flux is pretty fucking great at img2img/inpainting actually
>>
>>101799609
workflow?
>>
>>101799465
hello friends. Do you believe me that I have been gooning for at least 5 hours yesterday
>>
File: 1719716172988468.png (974 KB, 1024x1024)
974 KB
974 KB PNG
halo: AI evolved
>>
>>101799598
ok
>>
>>101799621
https://files.catbox.moe/jetxy1.png
>>
File: ComfyUI_01223_.png (1.35 MB, 1344x768)
1.35 MB
1.35 MB PNG
>>
>>101799640
Can you explain what you do? I see you you use two models and not just flux but beyond that I'm not sure I get it.
>>
File: ComfyUI_temp_rrpzr_00021_.png (1.49 MB, 1024x1144)
1.49 MB
1.49 MB PNG
>>101799669
sure, I'll explain.
first I create some image normally in a pony based model, just use whatever SDXL format model you like. then I use my self written nsfw recog based node and impact stuff to detect nsfw parts, make masks out of them and invert them.
then I just use inpainting with flux, I found the DPMPP_2M_SDE sampler with heun and an eta around 0.01-0.03 best for inpainting/img2img with flux. then I just display the image, the last part is just using the mask to censor like picrel lol

I'm a noob at comfyui, I don't know if there are better ways to detect NSFW so I quickly written my own node, maybe but its overkill, but when I tried to load the nsfwrecog model with the ONNXDetectorProvider from Impact Pack it failed
>>
>>101799588
Lul
>>
>>101799763
I was about to check where the NSFW checker node was but I see now you made it yourself.
>>
File: ComfyUI_00979_.png (1.72 MB, 832x1216)
1.72 MB
1.72 MB PNG
>>
>>101799763
It (whatever you're spamming) just ends up looking very obviously like SDXL slop though

Just use SDXL and spare yourself the trouble
>>
>>101800037
you're a hateful chud
>>
File: 1697417922585736.png (1007 KB, 1024x1024)
1007 KB
1007 KB PNG
>>
do you guys like shorks?
>>
here is another one
>>
>>101800057
I'd love to try your workflow, but without the NSFW checker going into the segs node it's kind of difficult to use.
>>
File: mayhem.png (2.01 MB, 1024x1024)
2.01 MB
2.01 MB PNG
Good afternoon
>>
>>101800145
hmmm let me make a burner github
>>
File: 132.png (1.82 MB, 1024x1024)
1.82 MB
1.82 MB PNG
>>
>>101800142
flux? proompt?
>>
>>101800204
NTA but it looks like pony to me.
>>
File: 1704599494492174.jpg (104 KB, 800x1170)
104 KB
104 KB JPG
>my 2 post in op
Nice
>>
>>101800218
thanks for your useless speculation, anon
>>
File: file.png (652 KB, 604x886)
652 KB
652 KB PNG
>>101800057
>y-you're a chud!

I just don't get what you're trying to do here. SDXL without any other enhancements (not on Comfy even) gives you arguably better results, adheres to your prompt better and doesn't have to go through a million nodes. What's the objective of that workflow again?
>>
>>101800204
Using imgnAI, FurXl model. Prompt:
>pixel art, smug, shark girl, cute face, teasing, young shark girl, character, blue hair style, petite, full body, short, blue white striped panties, game cg, retro art
>>
>>101800270
thanks anon, that style looks great
>>
>>101800234
I guess my speculation wasn't so useless after all
>>101800270
>>
>>101800322
it actually was
>>
>>101800360
Actually, it wasn't
>>
File: ComfyUI_02760_.png (2.86 MB, 2048x2048)
2.86 MB
2.86 MB PNG
>>
File: 1704755341105843.png (705 KB, 1024x1024)
705 KB
705 KB PNG
anime girl uses comfyui:
>>
File: ComfyUI_01197_.png (1.21 MB, 832x1216)
1.21 MB
1.21 MB PNG
>>
>>101799629
if I have images like this, I'd goon for 10 hours minimum
>>
>>101800145
https://github.com/goburiin/nsfwrecog-comfyui
hopefully it works for you, I'm not sure about the requirements.
>>101800255
>I just don't get what you're trying to do here.
obviously not creating gooner AI sloppa, the nunslop is a toy example for an automatic nsfw detection and inpainting workflow with flux. If you don't see any value in that then I don't know what to tell you.
>>
File: 133.png (1.66 MB, 1024x1024)
1.66 MB
1.66 MB PNG
>>
>>101800532
>obviously not creating gooner AI sloppa
>spammed the exact prompt over and over to get attention
Also, if you're making a workflow for nsfw detection and inpainting over that, what other applications does that have other than to create coomer slop? lol
>>
>>101800578
whatever kill yourself
>>
>>101800585
right
Show me a usecase of your workflow for something that is not coomer slop and I'll believe you
>>
>>101800606
Hmm well I guess the idea is coomer oriented but that doesn't change the fact that it's an interesting idea
>>
File: 1709718081679067.png (728 KB, 1024x1024)
728 KB
728 KB PNG
>>
File: _tmp_06.jpg (284 KB, 1400x1400)
284 KB
284 KB JPG
>>101800227
Nice
>>
File: Flux_00214_.png (2.21 MB, 1024x1024)
2.21 MB
2.21 MB PNG
fuck i hate these schizo workflows turn up cfg set negative empty guidance 100 dynamic thresholding number tweaks all for 2x longer gen times
>>
>>101800160
Mayhem peruttu
>>
File: ComfyUI_02766_.png (1.2 MB, 1152x896)
1.2 MB
1.2 MB PNG
>>
>>101800801
but muh negatives tho
>>
File: 1711353515537587.png (991 KB, 1024x1024)
991 KB
991 KB PNG
now we can do generations with generations within the generations:
>>
File: 134.png (692 KB, 1024x1024)
692 KB
692 KB PNG
>>
File: Flux_00260_.png (1.94 MB, 768x1344)
1.94 MB
1.94 MB PNG
>>
File: Wu.png (1017 KB, 1024x1024)
1017 KB
1017 KB PNG
>>
>>101800801
>>101800969
Just don't use it you niggers, are you for real complaining about anon trying out things and sharing his findings?
>>
File: 1718299911134195.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>101801006
>>
>>101801204
you should consider jumping off a bridge
>>
Flux + NaturalVisionXL makes great photos
>>
>>101801200
>types too slowly
>dragon roars
>>
>>101801219
just report and ignore he craves the attention he lacks IRL
>>
>>101801204
I don't care about the kid, how the fuck did you get such a long, coherent text?
>>
File: 260.png (1.57 MB, 1024x1024)
1.57 MB
1.57 MB PNG
>>
been away from this for more than a year.
what's the state of local img generation now? can someone give a qrd, and also I'm curious is flux or sd3 worth it? thanks.
>>
File: file.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
>>101801398
with the release of flux, local img generation is as good as DALL-E or better in some cases
no NSFW yet though
>>
>>101801398
SD3 is absolutely NOT worth it. Flux does everything SD3 was supposed to.
>>
>>101801527
>SD3 is absolutely NOT worth it
SD3 (beta) is not worth it, the 8B model thats API only isnt bad but its not local so who cares
>>
File: 1383898123.png (1.71 MB, 1024x1024)
1.71 MB
1.71 MB PNG
>>101801398
Flux without a doubt, Flux mogs sd3 easy
>>
File: fs_0042.jpg (67 KB, 1024x584)
67 KB
67 KB JPG
>>
File: ComfyUI_02784_.png (1.13 MB, 1152x896)
1.13 MB
1.13 MB PNG
>>
>>101801657
>>101801673
>>101801686
go away fag
>>
>>101801719
Idea. If you're brave enough to risk the backlash of a rejected report, you potentially have the power to be rid of him.
>>
>>101801629
Prompt please and what model and LoRA did you use?
>>
Wake up jannie
>>
>>101801809
let them sleep they deserve the rest
>>
File: 1698652373173156.png (1.07 MB, 1024x1024)
1.07 MB
1.07 MB PNG
>>
>>101801841
We get it. No one is more deserving of a rest than our overworked jannies. All the effort would have gone to waste, until their hour came again.
The right janny in the wrong place can make all the difference in the world, so they needed to wake up.
>>
File: ComfyUI_00101_.png (1.55 MB, 1024x1024)
1.55 MB
1.55 MB PNG
Long live Flux! o/
>>
File: Flux_00309_.png (1.62 MB, 1024x1024)
1.62 MB
1.62 MB PNG
>>
File: ComfyUI_00086_.png (1.15 MB, 1024x1024)
1.15 MB
1.15 MB PNG
Finally a Model that gets Swastikas right. NatSoc Propaganda Dream <3
>>
>>101801966
Maybe show it to /pol/
>>
I have a 6GB card, and tried to load realvisxlV40_v40LightningBakedvae.safetensors but I can't do anything because I'm out of vram. I tried stetting with PYTORCH_CUDA_ALLOC_CONF=max_split_size_mb:25 but even with chunks this small I still can't render anything
>>
>>101801983
/pol/ is too stupid to use it
>>
>>101802009
Then be their propagandist. Charge them money for your prompt engineering service or something. If you're gonna make nazis, at least get paid.
>>
>>101802002
Forgot to mention, I'm not new but just deleted everything and I'm coming back after a while, I remember there was like a 4GB model or that would allow me generate. Is there any other good model I can use?
>>
>>101799582
nice
>>
>>101801227
>>
>>101802002
>using XL models on vramlet cards
oof
>>
>>101802057
could be worse he could be using flux on a 1080 like that one anon god bless him
>>
>>101802057
I know, my setup needs an upgrade, was cool 6 years ago when I built it tho
>>
File: abs002.jpg (528 KB, 1233x1230)
528 KB
528 KB JPG
I love abstract art only when it's cyberpunk
>>
File: fs_0048.jpg (79 KB, 1024x584)
79 KB
79 KB JPG
>>
File: ComfyUI_02771_.png (1.21 MB, 1152x896)
1.21 MB
1.21 MB PNG
>>101801779
just Flux
>>
File: Flux_00326_.png (1.71 MB, 1024x1024)
1.71 MB
1.71 MB PNG
>>
File: Flux_00321_.png (1.96 MB, 1024x1024)
1.96 MB
1.96 MB PNG
>>
WTF happened to /sdg/
>>
>>101802223
Would a 6gb 2060 be enough for FP8? Or would I actually need a 16gb card?
>>
>>101802284
/sdg/ is a chat room for avatar fags
>Hellooooo cumbot
>Hiiii fago, how was your day
>Oh you know me, hey has anyone seen dikdik today?
>>
File: 00009-2525470654.png (2.94 MB, 1280x1920)
2.94 MB
2.94 MB PNG
>>101802284
It's now a designated containment general for one special individual
>>
File: 344.png (39 KB, 684x513)
39 KB
39 KB PNG
Let's say I'm at 74 tokens, would adding a multi-token tag like "looking at viewer" cause issues, since it's split between the two token batches?
>>
File: ComfyUI_02714_.png (1015 KB, 1152x896)
1015 KB
1015 KB PNG
>>101802304
I think 8gb is that minimum to be able to do anything with Flux, but I haven't really looked into it. Just seen some posts in passing.
>>
>>101801227
pretty good
>>
>>101802304
It would be extremely painful.
>>
File: ComfyUI_00718_.png (2.95 MB, 1824x1248)
2.95 MB
2.95 MB PNG
>>101802339
cute
>>
>>101802338
Yeah probably I doubt there is any intelligent parsing going on
>>
>>101802339
Dang, getting a 4070 ti super soon so guess I could wait

>>101802353
kek
>>
>>101802338
yes
>>
>>101802391
>>101802401
Dang, I hate it. Wish there was an extension that highlighted the prompt according to the token count. Like, you hover the mouse cursor over the token count and it highlights the prompt with different colors according to which token group they belong to.
>>
File: ComfyUI-Flux_00015_.png (1.92 MB, 1024x1024)
1.92 MB
1.92 MB PNG
>>101802057
>>
>>101802505
i hate 1girl posters but this gen is cool.
>>
>>101802074
>Prompt executed in 716.61 seconds
>>
>>101802535
>wait 10 years for a gen
>it fucks up the hands

must be painful
>>
File: ComfyUI_00407_.png (1.48 MB, 832x1216)
1.48 MB
1.48 MB PNG
>>101802535
>hands

oh no no fluxanons
it's over.
>>
>>101802535
uh, what happened to her arms and hands?
>>
File: mech07.jpg (221 KB, 793x799)
221 KB
221 KB JPG
I have yet to harness the full power of sdxl.
>>
>>101802574
>>101802585
>>101802591
everybody is allowed to make mistakes every once in a while
nobody is perfect
how the fuck is she holding the one on the right anyways?
>>
>>101802351
Thanks
>>
>>101802647
yea its impressive anyway that a 1080ti can even run this at all
>>
>>101802052
nice is this flux to sdxl workflow? or viceversa
>>
>>101802734
Yeah Flux -> NaturalVisionXL with some inpainting. Just flex (pic rel) is pretty good too
>>
File: ComfyUI_00013_.png (1.24 MB, 832x1216)
1.24 MB
1.24 MB PNG
>>
File: ComfyUI_00015_.png (1.4 MB, 1344x768)
1.4 MB
1.4 MB PNG
>>
File: CatJak_00184.png (2.29 MB, 832x1344)
2.29 MB
2.29 MB PNG
>>
File: ComfyUI_Flux_5991.jpg (172 KB, 1024x768)
172 KB
172 KB JPG
>>101803057
nice one anon. I'm interested in the artist and style tags
>>
File: 0_0 (3).jpg (288 KB, 1024x1024)
288 KB
288 KB JPG
Okay, it finally feels worthwhile to set up an image generator locally now that flux is out, at least for me. My current experience is exclusively with MidJourney

Does Flux work with existing Stable Diffusion UIs? Or is it it's own seperate thing? Would it be easy for me to set up?

Also, I remember back when AI was first on the upswing there were rumors that you could fry your GPU by generating locally. I assume that was just retarded fear mongering, right? Or should I actually be careful with my usage?
[spoiler]And how good is it with celebrities and fictional characters? As good as peak DALLE?[/spoiler]
>>
File: 00037-802237211.png (3.36 MB, 1280x1920)
3.36 MB
3.36 MB PNG
>>
File: CatJak_00180.png (2.04 MB, 832x1344)
2.04 MB
2.04 MB PNG
>>101803209
I don't share due to a thread schizo having a multi year vendetta against me and has no problem impersonating me and threatening to doxx ect
>>
>>101799598
>It was trained on both clip and t5 is why
do we know why? they felt like t5 wasn't enough on its own?
>>
>>101803272
that just sounds even more schizo anon!
>>
>>101803272
>I don't share
if you want to gate-keep prompts feel free to post in >>>/g/sdg
>>
File: 00039-835195585.png (2.96 MB, 1280x1920)
2.96 MB
2.96 MB PNG
>>
>>101803426
>>101803394
You must be new here
>>
File: file.png (2.02 MB, 1267x1260)
2.02 MB
2.02 MB PNG
>>101803272
>>
>>101799763
Thanks for the explanation anon, I like the results, I think if flux knew more poses it would be better to start with it.
>>
>>101803272
>I'm a victim!
tranny behavior
>>
>>101803491
i am not, ran.
>>
>>101803597
oh, it's the ben10 guy? lmao
>>
File: 1696942342214990.jpg (131 KB, 1024x1024)
131 KB
131 KB JPG
Can you make stuff like cartoon style picrel (from dalle), mainly the "wet skin/hair" effect?
I know the pose itself flux probably has no idea what it is.
>>
File: CatJak_00162.png (1.92 MB, 832x1280)
1.92 MB
1.92 MB PNG
>>101803597
>>101803577
>>101803426
>>101803394
Since you're doing pretty bad today here's a pity image
https://files.catbox.moe/ee3frg.png
>>
>>101803518
>>101803533
Anon?
>>
File: ComfyUI_02765_.png (1.19 MB, 1152x896)
1.19 MB
1.19 MB PNG
>>101803666
deep-fried flux, my favorite
>>
>>101803666
that wasn't so hard now was it, ran? next time someone asks for a box give it or leave. PLEASE keep your avatarfag nonsense out of here.
>>
>>101803272
>I don't share due to a thread schizo having a multi year vendetta against me and has no problem impersonating me and threatening to doxx ect
You realize that if this is true it's only because you're a faggot that doesn't share, right?
>>
>>101803710
it has no metadata in it kek
>>
holy nogen
>>
>>101803733
ranfag...
>>
>>101803718
said anon has impersonated multiple other anons before, and it's the reason no one shares anymore. but then again you just might be him seething that your thread is dying.
>>
>>101803761
>clearly a mentally ill schizo
Haha I believe you.
>>
>>101803761
>but then again you just might be him
meds can help you with the paranoia, you'll need therapy for everything else though
>>
You are not entitled to a catbox.
>>
>>101803781
then why even bother posting the image if nothing can be learned from it?
>>
File: 00291-2024-06-23.jpg (729 KB, 1024x1280)
729 KB
729 KB JPG
Thanks for the free bumps to our general
>>
>>101803761
>impersonated multiple other anons
the high iq avoid this issue by not avatarfagging
>>
Give it straight to me, bros, is it possible for a regular person to make money with generative AI? Has anyone here managed to monetize their generations?
>>
>>101803250
>Does Flux work with existing Stable Diffusion UIs
Works with comfyui, check OP for linl

>Would it be easy for me to set up?
If you have basic knowledge about git installing, yes. Otherwise, learn and follow a guide.
https://medium.com/@yushantripleseven/installing-comfyui-linux-windows-b59a57af61b6

>I assume that was just retarded fear mongering
Yes, unless you have a crappy driver or card, never heard of people frying their card in generating stuff.
You can cap your card though, if you have a 3090 for example, 275-300W instead of 350W would give you almost the same perf for a big reduction in power usage.

>how good is it with celebrities and fictional characters
Doesn't know most characters, sometime you get them back by describing them. The dataset was probably cleaned beforehand.
Same for celebs.
On that front Dalle is way better, as long as you don't get dogged.
>>
>>101803808
It is there for your viewing pleasure, nothing more.
>>
>>101803817
>by not avatarfagging
the problem is that avatarfaggots only manage to get 1 "good" prompt done and then they post endless variations of them, they usually dont have the creativity to keep coming up with new ideas.

the real solution would be to have a dedicated AI board on 4chan and a containment thread for these subhumans.
>>
>>101803808
have you forgotten how to describe images with words?
you don't want to learn, you just want to copy
>>
>>101803848
>containment thread for these subhumans
We have one, it's now called /sdg/.
>>
>>101803853
>you just want to copy
who gives a shit?
why does it bother you to begin with?
>>
>>101803848
>Same posters can make good work with different models
>They continue to excel year after year
>names become synonymous with good gens
You sound salty
>>
File: 736154248.png (1.54 MB, 1536x640)
1.54 MB
1.54 MB PNG
>>
>>101803865
evidentially you because you're bitching people aren't bending over backwards for you
>>
>>101803867
>names
I cant even name one and I dont care lol
>>
>>101803822
do you get multiple you's by people liking your gens/asking for catbox everytime you post?
if not reconsider or get better
>>
>>101803867
>names become synonymous with good gens
examples?
>>
>>101803877
go to the containment thread, you arent welcome here if you dont contribute anything.
>>
>please say me
>>
>>101803822
No.
>>
>>101803893
>nogen
If you think begging for prompts is contributing you're mentally ill
>>
File: 455237260.png (1.64 MB, 1536x640)
1.64 MB
1.64 MB PNG
>>
>>101803893
this is chang's thread, if you don't shill for the glorious leader, you don't belong here
>>
>>101803894
fuck you that's my line
>>
>>101803907
no, being mentally ill is this >>101803272
>>
>>101803924
the biggest fear of these faggots is that someone "steals" their prompt and then makes money with it somehow (never going to happen anyway).
kek
>>
>>101803924
You always get random paranoid anons in these types of threads, just ignore them.
>>
>>101803911
1chang has thousand times more creativity than all avatarfags under heaven and earth put together
>>
File: Flux_00587_.png (834 KB, 1024x1024)
834 KB
834 KB PNG
>>
File: Flux_00544_.png (1.46 MB, 1024x1024)
1.46 MB
1.46 MB PNG
>>
Lazy prompt beggars are 10x worse than avatarfags and you can't disprove it
>>
>>101803272
I've noticed it's generally the mentally ill who always think and claim people are trying to doxx them. Pure schizo behavior.
>>
nogen melty
>>
>>101799465
Can I use Stable Diffusion XL with just8 GB of VRAM? I'm using Stable Diffusion 1.5 right now.
>>
/sdg/ refugees ruined this general
>>
File: 1685902363.png (1.73 MB, 1024x1024)
1.73 MB
1.73 MB PNG
>>
>>101803982
Which ones?
>>
>>101804003
the ones that act really feminine
>>
>>101803648
>wet skin/hair
For some reason it doesn't seem to have this concept, maybe "wet" was scrambled when it was in relation to a person/character.
>>
DALL-E 3 still the best, you should all be focusing your brain power in having it leak.
>>
File: disney pizdar.png (816 KB, 1019x912)
816 KB
816 KB PNG
>>101803999
checked, also these always turn our beautiful.
(ot: old render)
>>
File: ComfyUI_Flux_6011.jpg (178 KB, 1024x768)
178 KB
178 KB JPG
>>
>>101804036
>in having it leak
they will never leak this shit
>>
>>101804059
usually leaks are against the owner's wishes
>>
>>101804036
>focusing your brain power in having it leak
It will never happen anon, even getting an unfiltered access to the API is crazy hard.
>>
>>101803933
it's not paranoia, it's annoyance from low effort leeches
imagine being given an infinitely creative tool and the first thing you do is type in "Messi in the style of Picasso" and worse, get mad that someone who typed that in doesn't catbox for you.
fuck, the whole point of this shit is exploring and all you do is type in 1girl
>>
File: ComfyUI_Flux_6017.jpg (181 KB, 1024x768)
181 KB
181 KB JPG
>>
>>101804036
I'd rather see people focus on adding back the good stuff to flux, way more realistic than leaking a model more well guarded than some state secrets.
>>
File: ComfyUI_Flux_6019.jpg (230 KB, 1024x768)
230 KB
230 KB JPG
>>
File: 1092769092.png (1.96 MB, 1024x1024)
1.96 MB
1.96 MB PNG
>>
>>101804090
>the whole point of this shit is exploring
And sharing nice stuff, so that others can improve on it, share back etc.
I've learned a lot just by looking at anons posting workflows here.

But if you don't want to share, it's up to you, just don't make it some moral statement about it.
>>
>>101804150
Notice how he won't post anything. Nobody is here to spoonfeed you schizo
>>
>>101803955
Now we're talking
>>
>>101804150
you haven't shared shit and I'll be honest, everything here is amateur, like a 14 year old kid showing off his crusty game dev code like he made a masterpiece, you aren't that special
wow you made a comfyui workflow using someone else's plugins! wow that's crazy anon I bet you figured it all yourself and didn't watch a youtube video
>>
File: ComfyUI_Flux_6031.jpg (223 KB, 1024x768)
223 KB
223 KB JPG
>>
>>101804058
>>101804098
>>101804119
>>101804198
Gentlemanly
>>
File: Capture.jpg (290 KB, 3060x1442)
290 KB
290 KB JPG
>>101803648
>Can you make stuff like cartoon style picrel (from dalle), mainly the "wet skin/hair" effect?
I got something like this, what do you think?
>>
>>101803272

Frank Frazetta is the artist, Im guessing
>>
>>101804036
>DALL-E 3 still the best
99.99% of my prompts dont work in DALL-E 3 so there's literally zero proof of this.
But I'll take you word for it.
>>
File: 631803697.png (1.18 MB, 1024x1024)
1.18 MB
1.18 MB PNG
>>
File: ComfyUI_Flux_6037.jpg (218 KB, 1024x768)
218 KB
218 KB JPG
>>101804204
suffering from success
>>
>>101804224
>>101803648
that face is disgusting
>>
>>101799465
Flux is puritan garbage.
>>
File: 448098258.png (1.77 MB, 1024x1024)
1.77 MB
1.77 MB PNG
>>
>>101804270
you mean get filtered by GPT or the NSFW check afterwards, DALL-E 3 itself is more than happy to generate smut.
>>
File: 2111174816.png (1.79 MB, 1024x1024)
1.79 MB
1.79 MB PNG
>>
File: Capture.jpg (284 KB, 3026x1471)
284 KB
284 KB JPG
>>101804286
>that face is disgusting
Yeah your're right, I improved it a bit, I think she looks beautiful now
>>
>>101804369
unironically much better
>>
File: ComfyUI_02827_.png (1.28 MB, 1152x896)
1.28 MB
1.28 MB PNG
>>
>>101804369
>not that weirdo who puts tampons in boys bathrooms and calls himself a general
missed opportunity. still an improvement though!
>>
>>101804406
Boys can have periods too, chud.
>>
File: file.png (936 KB, 907x1182)
936 KB
936 KB PNG
>Me: "Draw woman carrying a man "
>AI: "Here's a man carrying a woman"
Are there any non-sexist models out there?
>>
File: 1462526522.png (1.47 MB, 1024x1024)
1.47 MB
1.47 MB PNG
>>
File: 76352037.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>
>>101804432
>schnell
>wow flux doesn't understand my prompt
>>
>>101804432
"A woman carrying a much smaller man. The woman is much taller than the man. The man who is being carried is short like a midget."

Try that.
>>
File: CatJak_00207.png (2.12 MB, 832x1280)
2.12 MB
2.12 MB PNG
Enter!
>>
>>101804487
nice. catbox?
>>
you are not entitled to one
>>
>>101804224
The idea is here yeah.
I've tried getting wet skin so many times, the model doesn't give a shit.
Maybe it's the fact that you asked for her to be in the beach.
>>
>>101804487
prompt?
>>
Debo needs a wellness check
>>
>>101803648
dunno, but try
negative: figurine, repeling water, big drop of water
positive: (realistic skin:0.2)
>>
>>101804514
>I've tried getting wet skin so many times, the model doesn't give a shit.
>Maybe it's the fact that you asked for her to be in the beach.
No, it has more to do with this method that reduces flux's bias greatly
https://reddit.com/r/StableDiffusion/comments/1enm9og/discovered_by_accident_a_trick_to_make_flux/
>>
>>101804567
is prompt weighting working now? it wasn't before
>>
>>101804592
No it doesn't, it's noise.
>>
>>101804406
kek'ed
>>
File: ComfyUI_171248_.png (940 KB, 1024x1024)
940 KB
940 KB PNG
https://huggingface.co/comfyanonymous/Freeway_Animation_Hunyuan_Demo_ComfyUI_Converted
>>
>>101803272
haha
>>
>>101804611
ni hao
>>
>>101803648
or try oil and change hair types
>>
>phones home to CHYNA
>>
>>101804574
I'm playing with this, it seems to be able to make her look more "oily/shiny", but at the cost of making the cartoon style go too far.
The fun thing being that I can see in latent that at low steps it gets the style right, but at some point it goes caricature instead of cool sexy pixar girl.
>>
>>101804762
You should tinker with the GuidanceNegative value, it's at 10, maybe it's too high for what you want to achieve
>>
File: ComfyUI_00004_.png (1.77 MB, 1024x1024)
1.77 MB
1.77 MB PNG
>>
>>101804689
I'll try to add that.

>>101804786
There was nothing in my test.
Basically the cfg trick works, but at the cost of amplifying everything.
Wet -> you get that shiny look
Pixar style -> way too cartoony

So testing at cfg 2, I get proper style but not wet, and testing at higher cfg, for example 6, I get wet skin but the generation goes too far style wise.
Maybe there is a way to amplify only the parts you want with this trick.
>>
File: 00009-1681725957.png (2.86 MB, 1280x1920)
2.86 MB
2.86 MB PNG
>>
>>101804895
you're talking about cfg, but did you change the GuidanceNeg value too? maybe that's the one that can help, there's even a third solution, "AdaptiveGuidance threshold", basically it puts cfg = 1 at the very last steps, and you get to choose at what steps it starts to shift from CFG > 1 to CFG = 1
https://reddit.com/r/StableDiffusion/comments/1enxcek/improve_the_inference_speed_by_25_at_cfg_1_for/
>>
File: FD_00268_.png (931 KB, 1024x1024)
931 KB
931 KB PNG
I just woke up. Did I miss any Flux developments in the last 8 hours?
Fuck my weak body needing sleep
>>
>>101804994
>Did I miss any Flux developments in the last 8 hours?
you can improve the speed of your render if you're using CFG > 1 now >>101804993
>>
>>101804993
>>101805004
>Download this modified "dynthres_comfyui.py" script
No. I am too dumb to know that this does.
>>
>>101805037
I wish someone could make a PR on the Dynamic Threshold repo so that the owner can officially implement the modification
>>
I hate catjak now and will dedicate my life to impersonating him sheerly because he wont share his prompts.
>>
>>101804994
from the moment I understood the weakness of my flesh...
>>
>>101804611
Shouldn't you be called comfypseudonymous?
>>
>>101804993
Yeah it's at 10.
I thought this was just a speedup thing, I'll try it, too bad it's not modifying the node itself in the git, would be easier to manage.
>>
File: ComfyUI_00194_.png (815 KB, 832x1216)
815 KB
815 KB PNG
>>
>>101805081
>I thought this was just a speedup thing
no, GuidanceNeg is the secret sauce that help flux to stop being biased towards generic slop styles and actually follow your prompts
https://reddit.com/r/StableDiffusion/comments/1enm9og/discovered_by_accident_a_trick_to_make_flux/
>>
File: 00045-2271918151.png (3.6 MB, 1280x1920)
3.6 MB
3.6 MB PNG
>>
catjak here, im gay
>>
dogjak here, Im heterosexual
>>
File: ComfyUI_02083_.png (1.92 MB, 1024x1024)
1.92 MB
1.92 MB PNG
>>
birdjak here, they're both gay heterosexuals
>>
>>101805205
dogjak is out to get me. Help! AAAAAH THE VOICES. Rapeman where are you?!
>>
Debojak here I'm schizo
>>
humanjak here, I'm jaking off
>>
File: walz.jpg (1.07 MB, 3000x2000)
1.07 MB
1.07 MB JPG
horsejak here. yall got nothing
>>
>>101805142
>>101805205
>>101805219
>>101805227
>>101805242
>>101805250
>>101805252
please, call me ranfag
>>
Schizojak here I'm debo
>>
File: 00048-1741593344.png (3.85 MB, 1280x1920)
3.85 MB
3.85 MB PNG
>>
>>101804994
>Fuck my weak body needing sleep
No. Gen more.
>>
niggerjak here, im violent
>>
Julienjak here, where the kids at?
>>
sonicjak here, i'm gooning
>>
File: ComfyUI_00014_.png (1.91 MB, 1024x1024)
1.91 MB
1.91 MB PNG
>>
File: FD_00008_.png (1.18 MB, 1024x1024)
1.18 MB
1.18 MB PNG
>>101805054
>>101805294
>>
>debo is crying
>>
>>101805328
seems like you came to the right thread lmao
>>
>>101805054
Is Mechanicus good? I completely forgot about it but that quote just reminded me.
>>
>>101805386
nice
>>
File: 00051-256943957.jpg (487 KB, 1280x1920)
487 KB
487 KB JPG
>>
File: ComfyUI_00128_.png (1.98 MB, 1024x1600)
1.98 MB
1.98 MB PNG
>>
File: Improvement.jpg (3.12 MB, 6599x2623)
3.12 MB
3.12 MB JPG
Look at the improvement we've made in just a few days: https://files.catbox.moe/njz7qq.png
To make the workflow work, use this tutorial:
https://reddit.com/r/StableDiffusion/comments/1enxcek/improve_the_inference_speed_by_25_at_cfg_1_for/
>>
>>101804993
I wouldn't call this a 25% speed improvement, it's just making the last 5 steps quicker.
Now, those last 5 steps do go quick. Can you somehow apply this to every step? Then we would be talking about significant improvements.
>>
File: ComfyUI_02859_.png (1.2 MB, 1152x896)
1.2 MB
1.2 MB PNG
>>
File: 00055-3874895449.jpg (454 KB, 1280x1920)
454 KB
454 KB JPG
>>
File: Capture.jpg (31 KB, 840x457)
31 KB
31 KB JPG
>>101805642
>Can you somehow apply this to every step?
if you apply on every step it's the same thing as just having CFG = 1, and yes you can improve the number of steps by decreasing the Adaptive Threshold value
>>
>>101805642
Turning CFG over 1 slows down generation time. Anon's script modification starts with a higher CFG, then sets it to 1 once it matters less to the image. You can lower the threshold on the AdaptiveGuider node to switch sooner.
>>
>>101805718
>>101805730
Alright so the last steps where it is using adaptive threshold are going at normal gen speed, which for me is about 1.19s/it, using dynamic thresholding each step is about 2.30s/it. It's about 12% faster, not 25%.
>>
>>101805855
it depends on the pictures, for realistic shit, it can switch to CFG 1 pretty quickly
>>
>>101805718
Isn't uncond zero scale > 0 supposed to improve the last steps when cfg is back at 1?
>>
>>101805855
>It's about 12% faster, not 25%.
it's not just about the speed though, it removes the glitches the high CFG is doing during the begining of the inference steps, look at the second to last and last picture there, the last picture is cleaner and there's no the weird textures you can see on the second to last picture >>101805619
>>
>>101805897
Tbh I have no idea what it does kek
>>
>>101805902
>it's not just about the speed though, it removes the glitches the high CFG
Interesting, there's a specific crispy gen I have I want to test that on
>>
File: fs_0144.jpg (257 KB, 2560x1824)
257 KB
257 KB JPG
>>
Is he trying to nuke this general
>>
File: flux_00057_.png (1008 KB, 1344x768)
1008 KB
1008 KB PNG
why is it inpossible to generate something far away? Especially without depth of field
>>
>pedophile has evolved into hebephile
>>
>>101806020
Do you consider that good or bad?
>>
>>101805962
>>101805902
Hmm, not that much difference. Needs more testing
>>
>>101806049
decrease the adaptive threshold value even more
>>
>>101806059
>>101805977
Die
>>
File: ComfyUI_00043_.png (2.01 MB, 1024x1024)
2.01 MB
2.01 MB PNG
>>
Original DT gen always on the left, Adaptive always on the right. Will post more examples.
>>
>>101806092
Actually this one doesn't count because the right gen was 20 steps and the left was 50
Proper comparison picrel
>>
>>101806092
You mean on the left it's CFG > 1 + DynamicThreshold and on the right it's CFG > 1 + DynamicThreshold + Adaptive Threshold?
>>
>>101806130
Yes
>>
>>101806127
How many steps did it ignore (CFG = 1) on those 50? And what's your Adaptive Threshold value?
>>
>>101806174
11. AT is set at your default of 0.994, I am not going to change it for these tests.
>>
>>101806203
>I am not going to change it for these tests.
you should, if you want something less cooked/burned, you can decrease that value and let more steps at CFG 1
>>
>>101801227
are you doing flux first and natvis after? gimme workflow bro
>>
>>101802223
pls post workflow
>>
>>101806217
It's definitely less cooked from the 2 examples I have done, maybe it can be less fried by changing it but that's not my intention, not yet at least. Once I find my crispiest gen I might start playing with that and see what happens.
>>
>>101806317
>It's definitely less cooked from the 2 examples I have done,
Nice. AdaptiveGuidance is a really cool node, it makes things faster and fix the overcooking made by the high CFG
>>
File: ComfyUI_Flux_6059.jpg (311 KB, 1216x832)
311 KB
311 KB JPG
>>
File: ComfyUI_HunyuanDiT_00041_.png (1.17 MB, 1024x1024)
1.17 MB
1.17 MB PNG
>Freeway demo is here
First tries. Hunyuan bros we are so back
https://files.catbox.moe/3fj3n7.png
https://files.catbox.moe/sz3rul.png
>>
Extra hard to tell on this one,
>>
File: 1721843691507982.jpg (343 KB, 768x1024)
343 KB
343 KB JPG
>>101800765
Yes i can
>>
>>101805619
What does 0.993 even mean?
>>
>>101806410
pixartfags in shambles
>>
>>101806428
its a number
>>
>>101806092
Is this overcooked look on purpose?
>>
>>101806436
thanks anon.
>>
File: ComfyUI_HunyuanDiT_00040_.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
>>101806410
https://files.catbox.moe/l5e4ac.png
>>
>>101806428
thredhold = 1 means it won't leave any steps at cfg = 1; the lower this value, the more sensitive it will be to changes, and the more last steps there will be at CFG = 1.

Basically in simple terms, if you want less overcook from the high CFG, decrease that adaptive threshold value
>>
>>101806474
So it's relative to the number of steps?
Wouldn't it be simpler to have a value corresponding to the nth step it changes cfg to 1?
>>
>>101806507
that's not how it works, every image is different and have their last steps act differently, AdaptiveGuidance looks at the difference between 2 steps and see if it's bigger than the threshold or not, it's a better way to get something that works for every kind of pictures
>>
>>101806524
Oh ok.
>>
>>101799629
I'm having a really hard time genning anthro girls, PLEASE share a catbox patrician
>>
File: ComfyUI_HunyuanDiT_00039_.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
>>101806433
RIP. Maybe when Bigma comes around it can be this good kek
https://files.catbox.moe/ex6e8o.png
https://files.catbox.moe/on9avi.png
>>
>>101806410
I really hope this mf will also finetune Flux now
>>
>>101806443
No. It's just what happens when you use a high CFG on flux. The Adaptive Thresholding is intended to run the last steps at CFG 1 like flux is designed to do, but still keep the style adherence of using high CFGs.
What I am trying to do is get the same result but less fried.
>>
>>101806611
With what money
>>
>>101806648
Yeah I've noticed it, it changes everything to the extreme, it's annoying.
>>
>>101806680
he wasted that money on huyuan instead of flux, what a retard...
>>
>>101806708
MOOOODS
>>
File: rapemanfanclub01.png (1.18 MB, 1024x1024)
1.18 MB
1.18 MB PNG
for the anon earlier today
>>
>>101806708
can you do a loli but with a huge rack?
>>
File: ComfyUI_HunyuanDiT_00035_.png (1.04 MB, 1024x1024)
1.04 MB
1.04 MB PNG
>>101806611
https://files.catbox.moe/6iz2h6.png
https://files.catbox.moe/eg9y9x.png
I'm sure we'll get something for Flux eventually
>>
File: rapemanfanclub02.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
>>
>>101806763
don't feed the troll
>>
>>101806696
>he wasted that money on huyuan instead of flux, what a retard...
Anon Hunyuan is cheaper to tune, was here first and I'm pretty sure the source of this tune is Chinese. Just thank God we got something that is open more than anything, and Hunyuan is absolutely a great base model I have shown many times. The only reason it's not Dalle/Flux tier is because of prompt following, the dataset used to train it was top tier.
>>
>>101806779
>can you do a loli but with a huge rack?
unironically probably not. this is a problem with flux being "too smart", it knows what a child should look like too well. but i can try to make some hebes with big tits sure

has anyone gotten really big booba on flux, adult or otherwise? what prompt did you use if so?
>>
>>101806829
>Hunyuan is absolutely a great base model
How is it for nsfw? artists? characters?
I'm just getting back since I saw Flux and I'm a little lost with all these models.
>>
At AT of 0.950 you stop seeing any differences.
0.980 seems to be the point where it stops following your style so closely and more Flux kicks in. 0.990 seems to be the sweet spot for prompt adherence and deepfrying.
>>
>>101806575
Damn that first catbox is cool
>>
>>101806891
thanks for the experiment anon, I'll keep that value in mind. How much steps did it ignore at 0.990?
>>
File: file.png (512 KB, 512x512)
512 KB
512 KB PNG
buncha nerds and weirdos
>>
>>101806943
oh fuck
>>
>>101799629
post the catbox I beg you
>>
>>101806880
Artists and characters it knows quite a bit. Styles it knows quite a few, my fav is manga obviously. For nsfw we just got this finetune.

Prompt following somewhere between sdxl and flux.
I recommend you take a look at
https://imgur.com/a/hunyuandit-0vrZEn0
And https://www.shakker.ai/modelinfo/87e2cc2169934523a2ff82fb12e7206b?from=search

The first are Hunyuan gens I have posted here, or just search archive for "HunyuanDiT" filename. second is finetune model technical info.
>>
File: ComfyUI_Flux_6103.jpg (273 KB, 1216x832)
273 KB
273 KB JPG
>>
>>101799629
Put this in the next collage
>>
>>101807128
Put this in the next collage
>>
File: Capture.jpg (443 KB, 3272x1507)
443 KB
443 KB JPG
Doesn't really look like MJ but I like that style
>>
File: FD_00039_.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>101806891
same prompt and seed with no DT or DT+AT at CFG 1.0 (i.e a normal Flux.1 dev fp8 gen) for the sake of comparison.
I have to keep restarting comfy when using DT and not using DT otherwise it just hangs and locks up my PC which is extremely annoying and makes it hard to test.
Clearly this works, but there is still improvements to be made to the workflow and/or nodes.
Going back to base flux now.
>>101806932
Flicked over after step 37, so 13/50

Also, someone should bake.
>>
File: flux_00070_.png (860 KB, 1344x768)
860 KB
860 KB PNG
>>
>>101807198
>Clearly this works, but there is still improvements to be made to the workflow and/or nodes.
If I find something else that fix the overcooking I'll post it there and on leddit as usual kek
>>
How is flux at creating ancient chinese interiors? Like, xianxia/wuxia shop/inn/palace/whatever.

Should I bother or would Hunyuan or some SDXL model be better suited for this niche?
>>
File: image (4).png (1.87 MB, 1024x1024)
1.87 MB
1.87 MB PNG
>>101807265
It doesn't really understand the 'ancient' distinction, maybe poor prompting on my part
>>
>>101807310
this looks like a japanese antique shop where everything was made in china
>>
>>101807198
Left to right, only changing cfg, 1.5,2.0,2.5,3.0
>>
>>101807051
I'll take a look, thanks
>>
File: FD_00045_.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>101807265
>>101807310
>>
File: ComfyUI_02099_.png (1.05 MB, 1024x1024)
1.05 MB
1.05 MB PNG
>>
File: FD_00046_.png (1.61 MB, 1024x1024)
1.61 MB
1.61 MB PNG
>>101807381
>>
File: ComfyUI_Flux_6143.jpg (272 KB, 832x1216)
272 KB
272 KB JPG
>>
File: 1702388474847415.png (7 KB, 369x161)
7 KB
7 KB PNG
>>101807051
Where can you download it?
And why is it a "demo"?
>>
File: FD_00047_.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>
Alright we re about to die. Is anyone baking because if not I'm gonna
>>
I have baked and made le collage finna press post real quick, once I have checked the OP is fine
>>
File: Capture.jpg (20 KB, 854x286)
20 KB
20 KB JPG
Adaptive Threshold is a good way to keep the style and remove the glitches we got on high CFG, that's cool
https://imgsli.com/Mjg2MDc4
>>
>>101807392
This one is more along the lines of what I'm after, thanks anon! Do you mind sharing your prompt?

I had some buzz on shitvitai so tested some of the models on there (rather than randomly download and delete, pic rel), and holy crap the clarify on flux's details is night and day in comparison. Maybe just the models I tried, but kek SDXL-san get your shit together.
>>
>>101807456
do it, the guy below you is a larp from /b/
>>
>>101807500
put your name back on Teebs
>>
>>101807506
go back to your containment niggerlas
>>
File: ComfyUI_HunyuanDiT_00055_.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
>>101807265
>Hunyuan
Hunyuan interiors don't look that realistic by default, it would need a tune, here's
>这是一间传统的日本茶室(Chashitsu),用于进行茶道仪式。茶室的设计和布置非常讲究,旨在营造一种宁静和谐的氛围。
>>
File: ComfyUI_HunyuanDiT_00054_.png (784 KB, 1024x1024)
784 KB
784 KB PNG
>>101807425
It's a demo because they had a demo up for a while before they released it on Huggingface, Hunyuan by default is .pt so comfy converted it to safetensors you can download it here https://huggingface.co/comfyanonymous/Freeway_Animation_Hunyuan_Demo_ComfyUI_Converted/tree/main
>>
File: FD_00056_.png (331 KB, 512x512)
331 KB
331 KB PNG
>>101807548
>>101807548
>>101807548
>>101807548
>>
>>101807567
Nice, thanks!



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.