/g/ - Technology


Thread archived.
You cannot reply anymore.




File: ComfyUI_00399_.png (848 KB, 1344x768)
Discussion of free and open source text-to-image models

Emergency bake edition

Previous /ldg/ bread: >>101725030

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Kolors
https://gokaygokay-kolors.hf.space
Nodes: https://github.com/kijai/ComfyUI-KwaiKolorsWrapper

>AuraFlow
https://fal.ai/models/fal-ai/aura-flow
https://huggingface.co/fal/AuraFlows

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
File: ComfyUI_01109_.png (1.17 MB, 1024x1024)
finally
>>
File: b_00064_.png (2 MB, 1920x1080)
Flux is awesome
>>
File: 0.jpg (150 KB, 1024x1024)
>>
File: 1722818498.png (1.36 MB, 1024x1024)
>>
File: Sigma_12350_.png (3.45 MB, 2048x2048)
ty baker!
>>
File: ComfyUI_00143_.png (757 KB, 1024x1024)
>>
File: file.png (1.17 MB, 1024x1024)
>>
File: ComfyUI_30754_.png (1.31 MB, 1024x1024)
>>
File: FD_00338_.png (1.38 MB, 1024x1024)
>>
File: ComfyUI_30756_.png (1.09 MB, 1024x1024)
>>
File: 73.jpg (506 KB, 1472x1472)
ack
>>
I find the more I understand flux's prompting style and just what it can do the more I love it.
Like the limitations put on me by previous models are slowly being dissolved
>>
File: ComfyUI_30760_.png (849 KB, 1024x1024)
>>
File: Flux_00250_.png (1.35 MB, 1024x1024)
>>
>>101726656
Flux is not finetunable. We have been over this many times.

You will never see a massive finetune of flux and a flourishing finetune-mix ecosystem around flux. No NAI or Pony diffusion will ever come out.
At the very best, some super small niche loras, but even that is at this very point impossible, as the tech does not exist and BFL will not be helping. They actually unironically think it's impossible too.

What you see is what you will get. 10 years from now and flux will be at this same stage.
>>
>>101727499
teach me how to prompt flux effectively
>>
File: ComfyUI_30761_.png (1.01 MB, 1024x1024)
>>
File: ComfyUI_30763_.png (1.25 MB, 1024x1024)
>>
File: ComfyUI_00027_.png (1.06 MB, 1024x1024)
>>
File: ComfyUI_30764_.png (1.06 MB, 1024x1024)
>>
I'm promplet, I will just use wildcards.
>>
>>101727513
Someone did a LoRA on 24GB https://github.com/bghira/SimpleTuner/pull/622#issuecomment-2267624531

And there's this https://x.com/ostrisai/status/1820219528254488942
>>
File: ComfyUI_00029_.png (751 KB, 1024x1024)
>>
File: ComfyUI_30765_.png (1.1 MB, 1024x1024)
>>
>>101727538
>Someone did a LoRA on 24GB
Oh my.
>>
File: ComfyUI_01111_.png (1016 KB, 1024x1024)
>>
File: FD_00283_.png (1.4 MB, 1024x1024)
>>101727514
Just describe the scene in natural language
>>
>>101727499
Agreed. I think I still have a lot to learn, but I'm finding it much easier as time goes on.
>>101727514
Unironically ask an LLM to help you modify your prompt and learn from what it does. I guess because Flux is captioned by LLMs they seem to "speak the same language." It's not a silver bullet but it helps. I imagine some LLMs work better than others. I've been using whatever OAI slop Copilot uses these days. Most LLMs use OAI slop to make synthetic data so I imagine they all prompt pretty similarly anyway.
>>
File: ComfyUI_01112_.png (1.19 MB, 1024x1024)
>>
File: 621528.png (2.51 MB, 1472x1472)
>>101727514
think like a vllm
>>
File: ComfyUI_30766_.png (1.14 MB, 1024x1024)
>>
>>101727513
Imagine being wrong.
https://github.com/bghira/SimpleTuner?tab=readme-ov-file#flux1
This is the same nigger who said it was impossible. Fuck off back to your den, Emad, don't you have 40 million dollars to launder this month?
>>
File: 1722820648.png (1.15 MB, 1024x1024)
>>
File: ComfyUI_01116_.png (1.09 MB, 1024x1024)
>>
>>101727594
fp4 mayhaps?
how shit would that be
>>
>>101727594
NTA but nice! I didn't think it would be possible for a while longer.
>>
File: ComfyUI_temp_ayrsf_00018_.png (3.44 MB, 1680x1680)
>>
File: ComfyUI_01119_.png (1.34 MB, 1024x1024)
>>
File: ComfyUI_30768_.png (965 KB, 1024x1024)
>>
File: ComfyUI_00037_.png (785 KB, 1024x1024)
>>
>>101727584
>>101727656
The feels.
>>
>>101727513
no, it's finetuneable. just not easily on consumer hardware. if good loras can be trained on 24gb then the potential for a flux ecosystem is there. it needs to fit within the max consumer card really, as the next tier up is like $10k more.
an actual finetune is another story and will be impossible except by big teams. which was always kind of the case anyway as you can count on one hand the amount of actual relevant finetunes we've ever had.
>>
>>101727669
Why can't we run flux/SD on multi GPUs anyway?
>>
>>101727669
Nobody is buying $10k GPUs to train LoRAs, they rent them for a few bucks.
>>
>>101727594
Well you will see. Distilled model will not respond to training the same way. There will be issues. It's not as simple as it is with some of the other models, some which supply training code and examples.
Flux is not it and will never be it.

Oh, and where is your negative prompts? Worry about that first, before you dream about massive finetunes with billions of images.
>>
File: ComfyUI_00038_.png (844 KB, 1024x1024)
>>101727657
>>
File: ComfyUI_30769_.png (877 KB, 1024x1024)
>>
What prompt do they use with a vision model to describe images? Is it just as simple as "Please describe the following image." Or is it more complex? The closer we can get to how the original captions in the dataset were worded, the better.
>>
File: ComfyUI_01076_.png (1.08 MB, 1024x1024)
>>
>>101727714
We may never know
>>
>>101727681
> Well you will see. Distilled model will not respond to training the same way. There will be issues. It's not as simple as it is with some of the other models, some which supply training code and examples.
> Flux is not it and will never be it.
Source?
>>
File: Sigma_12357_.jpg (2.16 MB, 2048x2048)
>>
>>101727714
Copy an image into ChatGPT, ask it to describe it, and then prompt like it would.
>>
>>101727677
probably because image models never needed more than like 10gb until now, so it was never a priority. there's a lot of scrambling after flux dropped without notice and BTFO all previous models in 90% of use-cases.
>>
File: ComfyUI_00408_.png (878 KB, 1344x768)
>>
>>101727538
Looks like schnell will be the most worked on.
Can the dev profit from that or are they too different anyway?
>>
>>101727729
Me. You can quote me.
>>
Why doesn't flux have a negative prompt? It's such a pain in the ass to see something undesirable over and over again and have to find some indirect way of discouraging it when you could just put one word in the negative prompt. I wonder if it is something that can be added later?
>>
>>101727681
>Oh, and where is your negative prompts
Wait, flux doesn't support negative prompts?
>>
>>101727755
Yeah, you know nothing. Distilled models vs trained models are exactly the same, it's just a difference in how they are made.
>>
File: FD_00402_.png (1.06 MB, 1024x1024)
>>101727681
>here is evidence that it's trainable
>you: picrel
Seriously just shut the fuck up. You were wrong, deal with it. The community has massively taken to this model.
>>
>>101727763
the flux team said you don't need it and just need to prompt better instead
>>
>>101727678
I expect some price crash soon on the rent market too, once Nvidia has sold to everyone and a bunch of overeager companies realize they actually don't need all these fancy A100/H100s and dump them on the market.
>>
>>101727750
It's the difference between SDXL and SDXL Turbo
>>
>>
File: ComfyUI_30772_.png (959 KB, 1024x1024)
>>
File: ComfyUI_00274_.png (1.45 MB, 1024x1024)
>>
File: Sigma_12362_.jpg (1.93 MB, 2048x2048)
>>
>>101727750
I don't see any noticeable difference between schnell and dev. People say schnell can't do text, but I was able to prompt for text.
>>
File: thistheonlyworkfloworwhat.jpg (358 KB, 3488x1695)
how can I improve here?
>>
>>101727787
Flux pro has negative prompts.
That statement of theirs alone should signal to you something about their mission.
>>
File: FD_00454_.png (1.12 MB, 1024x1024)
>>101727814
I have had limited phasing too. Woman inside the table
>>
>>101727574
That looks nothing like the prompt in the filename, the sad truth is that all flux gens look the same
>>
>>101727831
It's understanding long prompts. The limit is still high in schnell (256 tokens) but 512 is pretty magic on dev
>>
>>101727848
They never said that though, he made it up
>>
File: flux1-schnell_00013_.jpg (110 KB, 512x512)
>>101727846
Improve what?
>>
File: 1722822309.png (1.19 MB, 1024x1024)
>>
File: ComfyUI_01135_.png (1.03 MB, 1024x1024)
>>101727869
the workflow
>>
File: FD_00376_.png (1.25 MB, 1024x1024)
>>101727846
Be more verbose. Put your prompt into an LLM and ask it to make it more eloquent.
>>
>>101727862
>its not real because I choose not to believe it
trump voter?
>>
>>101727856
Yeah, I should cut Orientalism out because it really doesn't understand it. I'll acknowledge that even though I know you're trolling.
>>
File: ComfyUI_30774_.png (1.2 MB, 1024x1024)
>>
File: FD_00403_.png (1.17 MB, 1024x1024)
>>101727878
>the workflow
What are you trying to achieve?
>A large, rotund gorilla sits at a computer desk in a thinking pose. The gorilla's massive frame fills the chair, with its hand resting thoughtfully on its chin. Its deep, intelligent eyes are focused on the computer screen. The gorilla's fur is dark and thick, contrasting with the modern, sleek design of the computer. Papers and books are scattered around the desk, suggesting a scene of intense contemplation. The background features a cozy, well-lit room with a window showing a lush jungle outside. The scene captures the unexpected blend of primal strength and intellectual curiosity.
>>
File: Sigma_12364_.jpg (2.67 MB, 2048x2048)
>>
>>101727875
wtf is that
>>
>>101727846
There's a separate CLIP, separate T5, CFG adjusting and neg prompt (not currently working) workflow. That should be the default that ships with ComfyUI, but for some reason he gives this extremely simplified one.
>>
File: ComfyUI_00231_.png (1.09 MB, 1024x1024)
>>
>>101727908
Honey
>>
File: ComfyUI_01137_.png (1.12 MB, 1024x1024)
>>101727912
is there a picture of what it's supposed to look like so I can attempt to reconstruct it?
>>101727899
>>101727883
so basically the longer the prompt the better?
>>
>>101727927
>so basically the longer the prompt the better?
the more descriptive the better
>>
People don't quite grasp that this model is not like the research drops. This is a model from a company with a mission to make money, not enable basement coomers. These "open model" versions were not made so that you can finetune them properly.
It's not in BFL's best interest that some other company could take their schnell model and finetune it to make it better than Pro and then sell access to that model. It would be disastrous.

I trust BFL engineers when they say it can't be done. Not in the scale that people here hope. You will never have schnell perform better than current Pro.
>>
File: ComfyUI_30775_.png (654 KB, 1024x1024)
>>
File: ComfyUI_00242_.png (1.16 MB, 1024x1024)
>>
File: ComfyUI_01138_.png (1.33 MB, 1024x1024)
>>101727935
does it make a difference if you write like whole sentences compared to just words separated by commas that describe the scene?
>>
File: ComfyUI_00210_.png (1.34 MB, 1024x1024)
I don't think this model is that good at generating pictures of women. Either that or I'm doing something wrong.
>>
>>101727912
All I have done to the default workflow so far is add an optional upscaler.
Figuring out what the limitations of this model are first, and I am not hitting too many walls where I feel like I need to alter the workflow.
>>
>>101727927
https://files.catbox.moe/efb6nn.png

Note the dev has said lowering or increasing CFG 1-3 may help with following styles for longer prompts (default is 4), though that may just be the model's creativity going up a bit depending on how it interprets your prompt.
>>
>>101727642
>>101727476
kek
>>
File: Sigma_12370_.jpg (3.03 MB, 2048x2048)
>>
File: ComfyUI_00221_.png (1.37 MB, 1024x1024)
Where I wish I was, but can't be because I am no longer as free as I was when I was 24.
>>
File: FD_00419_.png (1.36 MB, 1024x1024)
>>101727952
It uses both clip and t5 so you can do either. But it understands things like "A blue ghost wearing a red hat on the left and a black woman wearing a green dress on the right" which you can't do with the old way.
>>
File: ComfyUI_00163_.png (885 KB, 1024x1024)
>>101727968
I was pretty proud of that one
>>
File: file.png (2.05 MB, 1216x832)
>>
File: ComfyUI_00133_.png (875 KB, 1024x1024)
>>
File: FLUX__00034_.png (1.02 MB, 1024x1024)
>>
File: ComfyUI_00230_.png (1.43 MB, 1024x1024)
>>101728046
kek
>>
File: ComfyUI_00136_.png (885 KB, 1024x1024)
>>
File: ComfyUI_01141_.png (1.05 MB, 1024x1024)
>>101727966
>Note the dev has said lowering or increasing CFG 1-3 may help with following styles for longer prompts (default is 4)
how to do that?
>>101728010
>It uses both clip and t5 so you can do either.
can you do both?
>>
File: ComfyUI_30778_.png (2.35 MB, 1536x1536)
>>
>>101727956
Flux has a default style for women, and it doesn't look very realistic. You have to play around with the prompt until you find something that works for you. Also increase your steps (I always do 40). Once you've figured out the sweet spot, it's easily the best model for genning women; it just takes a while to find the right prompt (depends what you're going for).
>>
File: FD_00074_.png (1.63 MB, 1024x1024)
>>101728046
hehe
>>
>>101727956
ai training on ai moment
>>
File: FD_00428_.png (1.31 MB, 1024x1024)
>>101728065
>can you do both?
yes, but I just put the same prompt in both and use the t5 style
>>
>>101728065
>how to do that?
Check the workflow I linked.
>>
>>101728076
I'm not sure if I even know what I want, but maybe something like this. It doesn't have epic lighting, it's just a girl, you can see some noise, her skin imperfections, scenery around her instead of a blurry background, the lighting isn't perfect. Any advice on that?
>>
>imagination is the only limiting factor
I'm fucked. It was nice knowing you.
>>
File: 1721456712754753.png (1 KB, 242x21)
Every day I get closer to buying a dedicated proompt machine
>>
>>101728132
>boring Snapchat photo circa 2015
>>
File: ComfyUI_00242_.png (1.63 MB, 1024x1024)
>>101728136
Where do you want to go? Gen that. For me, it is the mountains and the interior of a deep abandoned and long forgotten mine. Places I'll never go to.
>>
>>101728145
Yikes, if you have the money then just do it. I tried scraping by with a 1080 for like a year and it was torture.
>>
File: funky guy.png (732 KB, 1024x1024)
>>101727444
checked!

>>101726868
Does this even work? I don't understand funkopops.
>>
>>101728152
Another one I use "Snapchat photo of X, The photo is taken with bad lighting."
>>
File: ComfyUI_00250_.png (1.06 MB, 1024x1024)
>>101728152
Kek it works!
>>101728145
>>101728164
People who gen with small GPUs or even just with the CPU are based. You are fighting with the tools you have. I was like that in 2022.
>>
File: ComfyUI_00254_.png (812 KB, 1024x1024)
>>
File: 1710825443281104.png (666 KB, 768x576)
>>101728164
I can do a 1024x1024 SDXL image roughly every 30 seconds, a Flux dev-fp8 1024x1024 image roughly every 6 minutes. I'm trying an anon's fancy ComfyUI Flux prompt right now which is taking extra long. My 2060 12GB was great for 768x576 SD 1.4 outputs like picrel, but showing its age now. At the same time, I want to hold out for cheaper 24GB cards, I feel like a 16GB upgrade won't future-proof me for long.
>>
>>101726935
This is what I get from:
>The page appears authentic and ancient, and evokes a sense of indescribable dread.
My love of books means I will be genning an outrageous # of books for really no reason.
>>
File: 826.png (26 KB, 600x800)
>>101728226
>6 minutes
>>
File: ComfyUI_00263_.png (950 KB, 1024x1024)
>>
File: Lunatic Book.png (1.92 MB, 1024x1024)
>>101728233
this book
>>
File: 1703408289513934.png (912 KB, 768x576)
>>101728190
Thanks bro
>>101728248
What do you run and how much faster for you?
>>
>>
>>101728260
How does it get fingers right?
>>
are hand adetailers just gambling for a good one or is there a lora i can't find due to my own retardation
>>
Does anyone here dare to run Flux on pure CPU? What's it like, an hour for a gen?
>>
>>101728273
You wouldn't have that problem with Flux.
>>
>>101728284
i also wouldn't have 1girl loli:1.5 bending over spread anus gaping anus style_anime either on flux
>>
File: 1697731296415571.png (885 KB, 1024x1024)
Thanks to the anon who posted catbox earlier, first time I've gotten ComfyUI to werk. Going to have a lot of fun messing with this one.
>>
>>101728226
>>101728248
>1070 8GB
>10 minutes
Bros... I don't want to work again... I want to stay a NEET...
>>
>>101728266
My 6950xt (all are 16gb) gets me just a little faster than you, as I have it configured, with fapper-eight.

I may be able to try some things to get way better performance, though.
>>
File: ComfyUI_00170_.png (1.34 MB, 1024x1024)
>>
>>101728272
By not being a coward like Stability AI and training on images of real people. SAI was so worried that people would use the model to generate images of children that they didn't release a single good model after SD 1.5 in 2022. They let Jewish financers and academics pressure them.
>>
File: ComfyUI_00278_.png (882 KB, 1024x1024)
>>
File: 1701429258347482.png (966 KB, 768x576)
>>101728301
Maybe the AI VC bubble will burst and a lot of start-ups will sell off A30's/A100's at bargain bin prices.
>>101728310
Good to know, thanks
>>
>>101728314
SD did train on real people. You can look into the laion dataset yourself.
>>
File: ComfyUI__09946.jpg (391 KB, 1600x2000)
>>101728076
its because its trained on sdxl images, looks like the 1girl from juggernaut

>>101727891
i'm not trolling, once you have seen one flux gen, you've seen them all. look on reddit, facebook sd groups, whatever: all flux gens look the same, especially the movie poster ones. also every gen posted has been cherry picked (unlike sd3). so far it feels like another new ai generation company offering their new product, which in the end is just a game of playing gacha. the less control you have, the more it feels like a slot machine

thats why i rate a comfy sd workflow over flux for now,
>>
>>101728325
Imagine if this scene was like that peeping cartoon where the peeper gets killed when the peeped turns out to be a horrific monster, haha
>>
File: tes library.png (1.78 MB, 1024x1024)
>>101728325
It's excellent, it only makes some little mistakes, like it doesn't know where exactly doorknobs go.

Here, it's excellent, but I never mentioned any picnic tables.
>>
How do you see what's in the prompt of a png without getting the whole workflow when you import the image in comfy?
>>
>>101728332
Yes, that's why SD 1.5 was able to generate images of real women and remove clothes with inpainting. SD 2.1 and SDXL were heavily filtered and not nearly as exciting as 1.5. The community spent hundreds of thousands of finetuning in total only to get meh-tier models like pony.
>>
File: ComfyUI_00173_.png (1.15 MB, 1024x1024)
>>
>>101728344
open it in notepad
>>
>>101728314
lol flux didnt train on real people, it used mostly artificial images, thats why all the women look like a mix of sdxl and dalle
>>
>>101728359
Oh it's in clear text! Thanks anon
Any way to get it without having to change the \n to newlines myself?
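
A minimal sketch of one way to do that, assuming ComfyUI saved the prompt graph as JSON in a PNG tEXt chunk under the keyword "prompt" (that is what the clear text in notepad suggests, but treat the keyword and the node layout as assumptions). Parsing the JSON turns the escaped \n back into real newlines. The function names are made up for illustration, and only plain tEXt chunks are handled.

```python
import json
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_text_chunks(data: bytes) -> dict:
    """Return {keyword: text} for every tEXt chunk in a PNG byte string."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    chunks = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        # Each chunk: 4-byte big-endian length, 4-byte type, body, 4-byte CRC.
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        if ctype == b"tEXt":
            # tEXt body is "keyword", a NUL byte, then Latin-1 text.
            keyword, _, text = body.partition(b"\x00")
            chunks[keyword.decode("latin-1")] = text.decode("latin-1")
        pos += 12 + length
        if ctype == b"IEND":
            break
    return chunks


def prompt_strings(prompt_json: str) -> list:
    """Collect every string-valued node input from the prompt graph.

    Assumes the graph is a dict of nodes, each with an "inputs" dict.
    This also returns non-prompt strings such as sampler or file names.
    """
    graph = json.loads(prompt_json)
    found = []
    for node in graph.values():
        for value in node.get("inputs", {}).values():
            if isinstance(value, str):
                found.append(value)
    return found
```

Usage would look something like `prompt_strings(read_text_chunks(open("some_gen.png", "rb").read())["prompt"])` (the file name is a placeholder), then print each string.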
>>
File: FD_00477_.png (1.09 MB, 1024x1024)
>>101728260
For some reason that pic reminds me of this
>>
File: 0.jpg (296 KB, 1024x1024)
>>
How do negative prompts work with natural text?
You write a sentence or just tags?
>>
>>101728376
another anon.

notepad++ iirc
>>
>>101728393
Guess I'll use that.
>>
File: ComfyUI__00466.jpg (233 KB, 768x1344)
>>
>>101728386
Nice
>>
File: 00056-1745624667.png (2.15 MB, 1080x1440)
>>
File: FD_00493_.png (1013 KB, 1024x1024)
I'm gonna catfish the shit out of dudes on tinder
>>
burger king security camera footage
>>
File: FLUX__00042_.png (1.09 MB, 896x1152)
sorry.
>>
>>101728410
I like him.
>>
>>101728416
Quite hard when you can only generate one image per girl.
>>
>>101728416
Hard to catfish without working loras to produce a consistent face, soon.
>>
File: FD_00492_.png (1.17 MB, 1024x1024)
Anyone else have this checkered pattern appear occasionally with Flux?
>>
>>101728436
>>101728435
see >>101727538
>>
>>101728438
With low guidance.
>>
>>101728406
your tits, ma'am
your milkers, madam
your breasts, m'lady
your mammaries, miss
>>
File: ComfyUI_temp_uvzbi_00014_.png (2.18 MB, 1120x1440)
>>101728438
yes, its the safety watermark it pastes into your gen, or did you think you were getting all these images for free? mwahaha
>>
>>101728448
Get training then, I'll wait.
>>
>>101728455
For me it's her narrow waist, her pelvic lines, her cute blushing face, and her tits
>>
>>101728448
just because you trained an untested lora, that doesnt mean consistency anon
>>
>>101728460
how many sinks and mirrors does that bathroom need
>>
File: ComfyUI_00175_.png (1.26 MB, 1024x1024)
>>
File: ComfyUI_00416_.png (919 KB, 768x1344)
>>
>>101728390
If you mean Flux, negative prompts don't work.
>>
File: 1722825733.png (1.03 MB, 1024x1024)
>>
File: ComfyUI_00418_.png (1001 KB, 768x1344)
>>
>>101728468
>>101728461
the same retard who said you can't train it is the one who added the training into his app. Jesus christ you SDJeets are pathetic.
>>
>>101728469
His, hers, and Jamal's.
>>
>>101728474
lol
>>
>>101728471
:^) we are having trouble getting a good Kamala.
>>
File: FD_00512_.png (1003 KB, 1024x1024)
>>101728454
Ah yep, that did it. Was on 2.5 and getting it every few images, 3.5 is fine.
>>101728460
I don't care
>>
File: ComfyUI_00141_.png (721 KB, 1024x1024)
>>
>>101728502
Go read what the fucker is writing. He is still very much unsure if it can work.

He is not sure you can "improve" the model. He says it's possible to change the model without completely ruining it. No one has yet demonstrated a true finetune and no one knows if it's possible. Not even the guy that made the finetuner.
>>
>>101728539
>No one has yet demonstrated a true finetune
3 days anon...
>>
Everyone screeching about Loras and fine-tuning.
There's not a single controlnet solution on the horizon. Controlnet is the only thing that has kept stable diffusion useful as a tool for so long. It's just a stock art generator until then.
>>
File: 1722826129.png (1.04 MB, 1024x1024)
>>
>>101728547
Yeah, the race is on, people with awesome gear are tinkering. I want one that focuses on buildings.
>>
>>101728555
Checked.

Explain what controlnet is.
>>
>>101728557
I want one that focuses on big boobies and naked ladies.
>>
>>101728555
Also, no negative prompts.
No negatives is a deal breaker for me.
>>
Sometimes an LLM just makes it worse
Original prompt :
Inside a kitchen, Trump is standing with a cutout of Hatsune Miku. Behind them, there is a window. From outside, Joe Biden is peeking inside through the window.

Response :
In this scene, an anthropomorphic Hatsune Miku cutout stands tall in a modern kitchen setting. The character exhibits a human-like posture and expression as it is positioned between Trump and the window. Outside, Joe Biden secretly observes through the glass pane, his curiosity piqued by the unusual sight within. The kitchen features sleek design elements, contrasting with the whimsical presence of Hatsune Miku's cutout.

Model : Dolphin-Llama3:8b
>>
>>101728565
Actually, I would love one that could reliably avoid unsafe gens.

How about adult nudity, no sex. That would be rather handy. ie art nudes.
>>
File: 1722826592.png (1.12 MB, 1024x1024)
>>
>>101728539
>IT'S IMPOSSIBLE TO TRAIN
>OK YOU CAN TRAIN IT BUT IT'S VERY HARD
>OK IT'S NOT THAT HARD BUT YOU CAN'T IMPROVE IT
YOU ARE HERE
>OK YOU CAN IMPROVE IT BUT IT CAN'T DO CONSISTENCY
>OK IT CAN DO CONSISTENCY BUT IT CAN'T DO NUDITY
>OK IT CAN DO NUDITY BUT IT CAN'T DO PORN
Seriously you need to actually genuinely shut the fuck up.
>>
>>101728569
I hold out hope that someone will discover how to return the negatives (imo there are baked-in negative prompts).
>>
>>101728583
Honestly creepy.
>>
So I'm trying to understand rectified linear flow here
Does the k-th rectified flow refer to a k-th training epoch?
>>
File: FD_00523_.png (1.39 MB, 1024x1024)
>>
File: out-0 (1).jpg (228 KB, 1024x1024)
>>101728590
>>
File: FD_00145_.png (1.1 MB, 1024x1024)
>>101728582
the more explicit a model is capable of the more artistic you can make your nudes. Just look at Pony
That said, Flux is pretty OK already at safe artistic nudes.
>>
File: FLUX__00047_.png (1.29 MB, 1152x896)
>>
File: 1720874850203050.jpg (350 KB, 1894x1221)
>>101728595
No idea if that works, worth a try I guess.
>>
>>101728661
current century scrooge mcduck
>>
File: FD_00532_.png (804 KB, 1024x1024)
>>101728650
>>
>>101728685
Broccoli shota
>>
>>101728669
this is nonsense
>>
File: 1694764375361413.png (1.02 MB, 1024x1024)
>dalle wont allow this prompt
however...
>>
File: 1712803448298774.png (1.09 MB, 1024x1024)
>>101728299
>>
File: 1706872124927943.png (677 KB, 1024x1024)
>>101728712
pixel art prompt at the start makes some neat stuff

pixel art, resident evil in game screenshot, miku hatsune is the main character, she has a black beret and gun.
>>
>>101728564
Controlnets let you guide the image in a very specific way. You can place line art of something in a controlnet and the AI will color it for example. You can put a specific style into the controlnet and it will replicate it. You can put a pose in a controlnet and the subject of that image will copy the pose. It's extremely useful.
>>
>>101728706
giwtwm
>>
>>101728590
kek, show me the image2video timeline too please. yeah, it has been 3 days, so stop acting like it's the 2nd coming of jesus, calm your panties anon. every new ai toy that has been released in the last 3 years had this same hype moment, and then people get tired of its limitations

this anon knows
>>101727936
>>
File: ComfyUI_00179_.png (1.11 MB, 1344x768)
>>
File: 1695166348317750.png (730 KB, 1024x1024)
>>101728731
>>
>>101728590
which ai model or company can generate true consistency? that's the holy grail of ai generation, and flux isn't there either
>>
File: ComfyUI_00014_.png (1.09 MB, 1024x1024)
>limited porn abilities
I HATE THIS TIMELINE
>>
>>101728742
I don't get why you guys want to be peasants again
>>
File: 1713767451654220.jpg (26 KB, 512x512)
>>101728693
Just tried and it kinda worked on my quick test.
Prompt : Image of the sky
Negative : Blue sky

And I got consistent red skies.
Without the negative blue skies again.
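
For context on why a negative can work in one workflow and silently do nothing in another: in older SD setups the negative prompt usually goes in through classifier-free guidance, where the sampler starts from the negative prediction and pushes away from it towards the positive one. A rough sketch of that mixing step follows, assuming the workflow uses this standard scheme (whether a given Flux workflow does is an assumption, and the numbers are toy values, not model outputs).

```python
def cfg_combine(pos, neg, scale):
    """Classifier-free guidance mix of two noise predictions.

    pos:   prediction conditioned on the positive prompt
    neg:   prediction conditioned on the negative (or empty) prompt
    scale: the CFG value
    """
    return [n + scale * (p - n) for p, n in zip(pos, neg)]


pos = [1.0, 2.0]
neg = [0.5, 4.0]

# At CFG 1 the negative cancels out: the result is just the positive prediction.
at_one = cfg_combine(pos, neg, 1.0)

# Above 1 the result is pushed further away from the negative prediction.
at_three = cfg_combine(pos, neg, 3.0)
```

So a negative prompt only has an effect when the sampler actually runs the second (negative) pass with CFG above 1, which also roughly doubles the time per step.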
>>
>>101728768
I want nsfw of her.
>>
File: FD_00547_.png (1.07 MB, 1024x1024)
Reminder that all the retards that said you can't train flux are nogens and vramlets who are mad they can't enjoy the model.
There's literally no point engaging with them.
>>
File: kk.jpg (11 KB, 204x151)
Could I create a LoRA with these images and then expect usable outputs?
>>
>>101728770
>it kinda worked
post the XY comparisons
>>
>>101728788
I can't train it but I'm still having fun
>>101728789
Probably. You'd want to split them into individual icons and blow them up to 512x512 first. I've made a Diablo 1 inventory LoRa with sprites only slightly larger that came out reasonably well.
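
A rough sketch of the split-and-blow-up step, assuming the sheet is a regular grid and the pixels are already loaded as a list of rows (loading and saving with an image library is left out). Nearest-neighbour scaling is my choice here because it keeps sprite edges hard; the function names are made up for illustration.

```python
def split_grid(pixels, tile_w, tile_h):
    """Cut a 2D pixel grid (list of rows) into tiles, in row-major order.

    Partial tiles at the right or bottom edge are dropped.
    """
    rows, cols = len(pixels), len(pixels[0])
    tiles = []
    for top in range(0, rows - tile_h + 1, tile_h):
        for left in range(0, cols - tile_w + 1, tile_w):
            tiles.append([row[left:left + tile_w]
                          for row in pixels[top:top + tile_h]])
    return tiles


def upscale_nearest(tile, factor):
    """Nearest-neighbour upscale: repeat each pixel `factor` times on both axes."""
    out = []
    for row in tile:
        wide = [px for px in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out
```

For a 32x32 icon, `upscale_nearest(tile, 16)` would give 512x512.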
>>
>>101728777
I tried.
The prompt was her bending over with her heart-shaped ass facing the camera.
Instead, I get smugly denied.
>>
>>101728788
>vrimlet
>>
File: 1722827939.png (1.08 MB, 1024x1024)
>>
>>101728789
I havent made loras with multiple instance prompts yet, but if you gave each one a unique name and images in individual folders, it should work fine

ive used kohya to make a character lora but that was just for one character. if each icon had an instance prompt and image associated with it, it should work?
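
Something like this is what the one-folder-per-icon layout could look like, assuming the kohya-style naming of `<repeats>_<instance name> <class word>` for each image folder. The icon names, repeat count and class word below are placeholders, not recommendations, so check the trainer's own docs before relying on it.

```python
from pathlib import Path


def concept_folders(root, instance_names, repeats=10, class_word="icon"):
    """Build one folder path per instance prompt, without touching the disk.

    Folder names follow the assumed "<repeats>_<instance> <class>" pattern.
    """
    return [Path(root) / f"{repeats}_{name} {class_word}"
            for name in instance_names]


# Hypothetical icon names, one unique instance prompt per icon.
folders = concept_folders("train/img", ["redpotion", "ironsword"], repeats=5)
```

Calling `folder.mkdir(parents=True, exist_ok=True)` on each result creates the layout, then each icon's images go in its own folder.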
>>
>>101728801
I want mostly suggestive too, but I'll wait for the fabled magical finetune I guess.
>>
>>101728789
I wouldn't do any training with less than 40 well-tagged examples. as far as the resolution though small is fine, no problems
>>
>>101728801
Try bathers renaissance painting
>>
>>101728769
it's just a meme, I wanted to make it jeb bush but flux didn't seem to know too well what jeb looks like
>>
>>101728769
US peasants? hAHAHAHAHAHAHAHAH PEASANT!
>>
what the fuck is "variance explosion" and what does it imply for the result of the model?
>>
>>101728798
Yes, I wanted to split them anyway.
>>101728819
There are 20 in total.
>>
File: file.jpg (39 KB, 794x456)
>>101728791
Not sure what you want, but it's easy to replicate.
Picrel is literally with and without negative.
It's pretty consistent for this at least.
>>
>>101728789
It will work if the model has a basic concept of the images. Otherwise, it'll either gen something that's vaguely similar but unusable or 100% likeness but completely unusable.
>>
>>101728564
>give model an image
>it will arrange your gen along that image
doesn't have to be an image, for example there are controlnets that set the camera position in your gen according to XYZ coordinates you give
>>
File: 1697903256370496.jpg (1.25 MB, 3840x2190)
>>101728849
No way to know without trying; you can get away with 20 when training for a single person/face/object, might work with a single icon type as well.
>>
>>101728868
Wow you can control the camera?

How long before you can do a proper video?
>>
>>101728555
Not a problem. Controlnets are easy to make and the concept is extremely simple. Let it progress one step at a time.
>>
>>101728882
You need a temporally consistent video model for that.
>>
>>101728849
>20
between a rock and a hard place
cooking too long gets overfit (gens adhere too much to examples), cooking too little gets underfit (too much variation, not consistently getting what you want)
can you make more?
(depending on use case, you might be able to commission a few more)
>>
>>101728868
You can make a controlnet for anything really, so long as you have an input image dataset and an expected output dataset.

I've always wanted an Unwrapped UV controlnet that creates textures based on UV maps
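The "input dataset plus expected output dataset" idea boils down to paired files. A minimal sketch, assuming each conditioning image (a UV map, say) and its target image (the texture) share a filename stem; `pair_by_stem` and the paths are hypothetical, and this only builds the pairing, not the training itself.

```python
from pathlib import PurePath


def pair_by_stem(condition_files, target_files):
    """Pair conditioning images with target images that share a filename stem.
    Conditioning files without a matching target are dropped."""
    targets = {PurePath(p).stem: p for p in target_files}
    pairs = []
    for cond in sorted(condition_files):
        stem = PurePath(cond).stem
        if stem in targets:
            pairs.append((cond, targets[stem]))
    return pairs


if __name__ == "__main__":
    uv_maps = ["uv/b.png", "uv/a.png", "uv/c.png"]
    textures = ["tex/a.jpg", "tex/b.jpg"]
    # uv/c.png has no texture, so it is left out of the training pairs.
    print(pair_by_stem(uv_maps, textures))
```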
>>
>>101728890
I prefer images anyway. So you can set a background, but adjust the camera angle, and then place figures and objects?
>>
>>101728849
>>101728897
that being said, you can always give it a try. if you end up with something that works for your purpose then all's well. I'm just saying that I myself wouldn't
>>
Requesting Mario saying NIGGER
>>
File: ComfyUI_00429_.png (1020 KB, 768x1344)
>>
File: ComfyUI_00181_.png (1.12 MB, 1024x1024)
>>
>>101728940
why are you so obsessed with them
>>
File: 1711291588041685.png (1.11 MB, 1024x1024)
female olympic boxing:
>>
>>101728911
I've only seen one controlnet of that type, there's another one on civitai to control light position, although it's spotty

not sure why this hasn't caught on, probably because popular frontends only have images as inputs for controlnets

you can do all kinds of scene manipulation with CN: placing objects, setting exact poses, face expressions and character proportions, making scenes from depthmaps, style transfer, up to full-fledged neural rendering (there are a few rigs for Maya/Blender aimed at that)
>>
File: FLUX__00041_.png (1.08 MB, 896x1152)
>>
File: ComfyUI_00183_.png (1019 KB, 1024x1024)
>>101728940
>>
File: FD_00463_.png (1.21 MB, 1024x1024)
>>101728802
>>
What's the best way to get rid of jpeg artifacts in a gen?
>>
File: 1722269509513553.png (1.03 MB, 1024x1024)
>>101728731
Yeah I'm really loving it
>>
I'm starting to think when the prompt has nsfw context it goes spaghetti.

> 'Slave attire, medieval bedroom setting, a single woman, displaying feet with detailed toes and soles, neat toenails, exquisite hands, five dexterous toes, lengthy brown tresses framing hazel eyes and smooth complexion, indicative of a 25-year-old, perky teardrop breasts, a petite yet athletic physique, bound by rope bondage while her arms are confined behind her back.'
>>
>>101729028
Is nintendo okay with this?
>>
>>101729095
Lack of training data, so yeah it goes to shit.
>>
>>101728740
giwtnwm
>>
>>101729095
>Anti-footfag measures
Holy chads
>>
File: FLUX_00005_.png (1.01 MB, 896x1152)
>>
>>101729049
click queue prompt and wait until a gen comes with no artifacts
>>
File: 1692283694682548.png (1.05 MB, 1024x1024)
>>101729051
>>
>>101729049
"jpeg artifacts" in negatives
depending on subject matter you might try "vector graphic" in the prompt with a low weight
>>
File: ComfyUI_00187_.png (838 KB, 1024x1024)
>>
>>101729110
Here's another spaghetti. I don't think the prompt is nsfw enough, but then I noticed "her legs spreading", this must be the forbidden keyword!

> In this scene, a beautiful woman with Twintails, green eyes, and parted lips sits on the ground in a forest during daylight. She has white hair, long eyelashes, and wears a beautiful necklace. Her expression is serene, and she appears shy. From the front view, high skin detail, detailed skin texture, goosebumps, and a slight spread of her legs convey a sense of intimacy. The surroundings feature a gradient background, with birds, clouds, and butterflies adding to the fantasy theme. The painting style focuses on impasto, and the overall aesthetic is one of high quality and beautiful color details.
>>
File: 20240805T040124Z_00001_.jpg (553 KB, 1024x1024)
>>101728940
Not condoning such language so here is Mario getting ready to rob some place with a brown Yoshi.
>>
>>101729104
>rectangle of coloured points resulting from a math equation
not their problem
>>
>>101729124
>the fand/hoot fan has entered chat
>>
File: chap in red hat.png (856 KB, 1024x1024)
>>101729229
>>
File: 1708282339691882.png (1003 KB, 1024x1024)
>>101729173
>>
File: FLUX_00012_.png (1.21 MB, 1152x896)
>>
File: FD_00548_.png (1.57 MB, 1024x1024)
>>
File: file.png (1.33 MB, 1024x1024)
>>101729366
>>
No need to wait over an hour for the next bread because it's already here...

>>101729379
>>101729379
>>101729379
>>
File: 2024-08-04_00684_.png (1.87 MB, 1280x1280)
>>101729095
rope bondage and the like just isn't in the data set, also "slave attire" is very vague and the model probably doesn't have a single picture tagged as such, so ofc it goes haywire

also I am still sad FLUX doesnt know what shibari is
>>
Running with an empty prompt in flux tends to give much more coherent results than an empty prompt with SD/SDXL. As in: the flux gens appear to be the product of much shorter, simpler prompts. What do you think this implies?
>>
>>101729452
>Running with an empty prompt in flux tends to give much more coherent result
that's cause dev/schnell are distilled models, they will go and converge to something, no matter what
>>
>>101728214
There's something about this that really gets me going.
>>
File: FD_00564_.png (2.66 MB, 1024x1536)
>>101729366
>>
File: 1714891824951249.png (1.09 MB, 1024x1024)
>>101729280
>>
File: FD_00577_.png (2.15 MB, 1024x1536)
>>101729419
Try this
https://huggingface.co/spaces/TencentARC/PhotoMaker-V2
>>
>>101729489
kek
>>
File: ComfyUI_01525_.png (1.59 MB, 1024x1024)
>>
>>101729681
If the 'VIVE' said 'VIVEK' it would be perfect
>>
>>101729681
prompt?
>>
File: 2024-08-04_00241_.png (1.4 MB, 1024x1024)
>>
>>101730095
I too wish I had 50 caliber lever action M14s


