[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (1.26 MB, 3264x3264)
1.26 MB
1.26 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102052110

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
Blessed thread of frenship
>>
>>102055040
Anything that requires more than 3 seconds of focused attention is scary, weird and alien for a zoomer.
>>
>>102055040
It's just a carry-over word from SD 1.5 where people would write descriptive sentences in a model that didn't understand it. We didn't have t5 encoding back then so it was a dumb way to do it.
>>
>>102054997
I recommend just going to Kohya at this point AI-toolkit was good for how fast it was, but Kohya is a much cleaner, fast and memory efficient implementation.

>>102055056
Personally, my dream prompting method would be a combination of booru tags with natural language to guide the composition.
>>
>>102055035
ty baker
>>
Boomer prompting = "please generate a wonderful image full of whimsy of a woman standing in an empty field. make sure to include minute details that entice the viewer to look further. there is an air of mystery and mystique to the image composition"

Regular prompting = "a woman standing in an empty field"
>>
>>102055094
You forgot the "Thank you" at the end
>>
>computer please draw for me
>>
>>102055094
that's not regular prompting, that's brainlet prompting, unless your goal is to have a boring sterile image
>>
File: ComfyUI_02089_.png (1.08 MB, 768x1024)
1.08 MB
1.08 MB PNG
>>
>>102055103
He is rude youngster, forgive him
>>
It is just me or does stacking LoRAs make gens take 10x longer?
>>
>>102055056
You jest, but the average 20 year old reels at the sight of a paragraph with more than 3 lines of text. Their dopamine receptors BURN with very literal pain.
>>
>>102055122
Shouldn't. It's possible your vram is being offloaded.
>>
>>102055113
The intrigue comes from additional descriptors, not "pls gib pretty pretty image ty".
>>
>>102055094
See? "Boomer" means "long and too hard to read for my zoomi brainie"
>>
>>102055094
Well, that boomer prompt takes it to the extreme, but I've used Dall-E 3 enough to know that these types of models like bit of verbosity in the prompts. That is the type of text you get out from the vision models that handle the image tagging. But the "please generate" at the start is dumb.
>>
File: FD_00057_.png (1.64 MB, 768x1344)
1.64 MB
1.64 MB PNG
>>102055128
It is always, even with 16gb.
>>
I always hated that tags didn't let me describe two characters differently or something more specific for background or setting
>>
File: FD_00058_.png (678 KB, 768x1344)
678 KB
678 KB PNG
This is coming along better than I expected, it's really struggling with her tattoos though
>>
>>102055113
>brainlet
its brainlet to rely only on the prompt to get interesting outputs
>>
>>102055156
He is packing
>>
>>102055117
I just wish I would not need to see his puke face everytime I come here. Can't you start a reddit for that shit?
>>
I've updated my llama.cpp install and want to try writing prompts, sending them to llama.cpp (with Llama3 8B or whatever) to elaborate on them, then piping the output into stable-diffusion.cpp as the prompt for Flux.
What should my instruction to the LLM be for it to spit out the kind of boomerprompt that Flux is highly responsive to?
Also, are there any LLMs that are particularly well-trained on art history and architectural details?
>>
>>102055117
'em
>>
>>102055122
if you're using Q quants then yes, the loras have to be constantly re-applied because they modify the dequant weights and the weights are only dequant on-the-fly, you can't apply loras in one pass like you can with the non-Q models
>>
File: FD_00061_.png (1.06 MB, 1344x768)
1.06 MB
1.06 MB PNG
>>102055160
Packing heat
>>
>>102055185
Looks really good.
>>
>>102055181
I see, that's why it takes so long between gens.
>>
>>102055181
yea this .. it seems its just what it is, so for heavy lora usage Q's are not an option .. you could like merge the loras into default and then quant your merged model .... but rofl I guess that would take ages as prestep
>>
>>102055113
that's the oposite of brainlet, you don't give the model much direction so it has the freedom to add nice touches on top of it
>>
File: FD_00059_.png (841 KB, 768x1344)
841 KB
841 KB PNG
>>102055197
No it doesn't, it's only eopch 6 but it's the first one that seems to understand the concept.
>>
>>102055134
>I like to write convoluted sentences when you can have the same effect for a sentense 10 times shorter
https://www.youtube.com/watch?v=3RMAPFH75AU
>>
>>102055208
Maybe, I haven't seen the show is a long time so I can't recall exactly what's off about the image.
>>
>>102055204
Using a single LoRA it's fine, it only adds half a second per it to my gens, but stacking them fucks it.
>>
>>102055217
her tattoos are wrong
>>
>>102055204
would it be possible to have a node to quant a lora to our desired quant? could that allow it to applied directly?
>>
>>102055012
Every optimization is a trade-off between speed, usability and support for additional features. Optimizations are becoming a hard requirement as newer models increase parameter count to compensate for their architecture and datasets.
>>
You people are exaggerating the difference between Q8 and FP8.
>>
>>102055247
you don't have trade off if you give more options to the user, it's not the 90's anymore when you couldn't even chose between mono and stereo in N64 games, nowdays a PC game has the option feature and a shit ton of shit to choose, as it should
>>
I simply do not care for character loras. Styles and concepts only, please.
>>
>>102055080
I train under 24 gb on ai-toolkit and get about 2-3 s per it on the default flux preset. What numbers does kohya get?
I tried both and found ai-toolkit much more straightforward, but other than that they seem to be doing pretty much the same thing.
>>
File: FD_00062_.png (1.21 MB, 1024x1536)
1.21 MB
1.21 MB PNG
>zabecca
>>
>>102055225
yep, that is what I see to
>>102055236
I don't think that would work, its still an operation you have to do on every weight, probably would be even slower. The quanting is a trick to make the 8bit weight closer to what it would be to the 16bit weight then by just truncating it, right?

>>102055257
there indeed are.. q8 is very close to fp16 in some gens, while fp8 fucks em up.. for some generic nature shot it doesnt matter, but especially on text etc. its kinda bad
>>
>>102055265
Why not character lora? It makes your prompts simple when you don't have to describe every little detail to have similar character across gens.
>>
>>102055257
Q8 and FP8 are not similar and Q8 is a much better approximation of what FP16 is compared to FP8
>>
>>102055257
no we're not, Q8 is really close to fp16 wheras fp8 is not at all
>>
You people really are exaggerating the difference between Q8 and FP8.
>>
File: fp16-vs-q8-vs-fp8.jpg (740 KB, 3648x1260)
740 KB
740 KB JPG
>>102055257
I did this test yesterday (with half a dozen other pictures to) that demonstrates it nicely on a bit more complex prompt.

left fp16, middle q8, right fp8_e4
>>
>>102055275
I do not care for your silly homosexual OC. Refrain from training such.
>>
File: 1032567688.png (1.41 MB, 896x1152)
1.41 MB
1.41 MB PNG
>>
>>102055169
You should ask most of that to /lmg/. But it's easy to come up with a prompt just with trial and error.
>>
>>102055133
yes, you can describe an image better in a proper way without acting like you're pleading, begging and flirting with the AI. Seen far too many people prompt basic shit and be surprised they end up generating basic shit

>>102055205
if a model is adding "nice touches" unprompted then it's a fucked up model, that is unless there's a simple, straightforward way to choose between it strictly following the prompt or being more creative, CFG is ok but not ideal
>>
>>102055292
>OC
How does that work? I generate 20 images of some custom character and then train lora on those gens to "save" the OC character?
I would never use such loras.
>>
>>102055288
that picture sums up nicely the feeling I had towards Q8 and fp8, Q8 gives almost the same picture as fp16 with some little difference in details, fp8 can make completely different poses and fuck up some shit that Q8 won't
>>
>>102055264
City hasn't explained it well. You're expecting an extra option that loads the lora to gpu to solve the problem, if that's not already the case it would be a marginal improvement at best. The trade-off is that loras need to be applied on-the-fly due to quantization/dequantization.
>>
>>102055312
>if a model is adding "nice touches" unprompted then it's a fucked up mode
no it's not, if the model follow your simple prompt and you never specified to not add more stuff to it, that's your fault, not the model
>>
>>102055320
>if that's not already the case it would be a marginal improvement at best.
we would never know if we don't try, I don't like suppositions, I like to see it with my own eyes
>>
>>102055337
then start coding
>>
>>102055343
it's not my repo, it's yours city, if you're a lazy ass and you don't want people to enjoy decent speed on loras + gguf quants, just say it, don't waste our time by finding excuses
>>
>>102055326
>a woman standing in an empty field, DO NOT ADD GRASS, DO NOT ADD ANY KIND OF VEGETATION, THERE IS NO SKY AND NO CLOUDS, THERE IS NO SUN, ONLY BARREN GROUND
can't wait to prompt like this, it will be truly the model of all models
>>
>>102055349
I accept your concession, codelet.
>>
>>102055303
Your LoRA got fucked.
>>
>>102055359
>codelet
says the man who won't do anything either, and therefore can be categorized as a codelet too, ironic.
>>
File: lewd.jpg (219 KB, 1024x1536)
219 KB
219 KB JPG
I should have put more nudes in the data set. V2 maybe
>>
do llms have the same struggle with loras? i doubt anyone runs a non quant llm
>>
shits too complicated now take me back to 1.5
>>
/sdg/ is now just full of women and troons. I wonder if any of them are hot.
>>
>>102055358
>what is a negative prompt?
>>
>>102055371
So uh... what were your LoRA settings?
>>
>>102055380
I am not a woman. I'm a tranny
>>
>>102055380
wait you'd fuck troons? that's kinda gay if you ask me
>>
>>102055380
>I wonder if any of them are hot.
Only the troons will care to look feminine and pretty.
>>
>>102055374
Shit has never been so good, what are you talking about?
>>
>>102055386
Whatever civit set, but with repeats and epochs set to 10 and size set to 1024
>>
>>102055399
this, if the price to pay is a little more complex setup to get kino pictures, I'm all for it
>>
File: bildbeschreibungen.png (1.07 MB, 1280x2911)
1.07 MB
1.07 MB PNG
Screenshotted Anon's research from a few days ago. Great work.
If you're still around, what prompt did you use to ask these LLMs for the description? I noticed that simply “describe this image in detail” isn't enough if you want more information about the artstyle.
>>
>>102055384
>pos: woman standing in an empty field
>neg: tree, fence, bench, picnic blanket, bicycle, dog, bird, cloud, barn, haystack, flowers, path, windmill, kite, distant mountains, farmhouse, grass, shrubs, butterfly, rocking chair, watering can, lamp post, puddle, shovel, scarecrow, street sign, abandoned car, bicycle, gazebo, mailbox, apple tree, stone wall, playground equipment, tent, lantern, birdhouse, deer, rustic gate, garden gnome, wooden crate, picnic basket
>>
File: 176592752.png (1.33 MB, 896x1152)
1.33 MB
1.33 MB PNG
>>102055365
How so?
>>
>>102055374
Funny thing. Yesterday I went to look at the old 1.4 and 1.5 gens folder. Even the NAI finetune shit looks so horrible in comparison. I'm glad it's over.
I can't believe that I thought the shit was good looking.
>>
>>102055384
>what is doubling the gen time?
>>
>>102055421
this is unironically how i prompt desu
>>
>>102055421
find something better than that, I'm all for having as little words as possible on my prompt
>>
>>102055372
vector databases/RAG/3rd things are all hot topics. Thebloke has hundreds of LLM GGUFs.

>>102055406
you aren't though. The upgrade is text. It will be better, it is still base model days.
>>
>>102055428
the thing is, CFG was created to improve adherance prompt, not just to have a negative prompt, so that double gen time is worth it if you want Flux to actually follow your prompts for good
>>
do you think it is possible to teach the model to handle either a "negative" section in the prompt or a separate conditioning and avoid having to sample twice to get a usable negative prompt?
>>
File: flux_cyber-env11.jpg (3.03 MB, 2080x2720)
3.03 MB
3.03 MB JPG
>>
File: FD_00071_.png (1.8 MB, 1024x1536)
1.8 MB
1.8 MB PNG
>>
>>102055443
i pray to the lord everyday for this to be true
>>
>>102055396
>Only the troons will care to look feminine and pretty.
In waht world do you live
>>
>>102055443
a lot of researsh has been done to get rid of CFG, but it turns out those solutions were even slower, like PerpNeg which is 3x slower than CFG = 1 kek
>>
>>102055433
>still base model days
I don't see Flux picking up in the way SDXL did. Everyone seems to go through the same phase of being wowed by easy kino, hitting its thematic limitations and 1girl-itis, making a lora, realizing xl is still better for what they want (usually lewds) and abandoning Flux. Plus it's higher requirements.

Screenshot this post. In a year's time, Flux D 1 will be dead.
>>
>>102055462
Flux is doing much better right now than SDXL did its first couple weeks. Flux might be replaced in a year, but its heyday is far from today.
>>
>>102055417
nta, atleast for JoyCaption I can give you the answer: none .. you just paste the picture and it spits out the prompt
>>
>>102055094
>>102055138
dalle is funny with what triggers it sometimes in it's own revised prompts

>Enter prompt:
>a japanese woman laying in the grass!
>Private Client
>Your request was rejected as a result of our safety system. The revised prompt may contain text that is not >allowed by our safety system.
>Revised prompt rejected.
>A Japanese woman with traditional kimono is lying leisurely in a lush, green grass field, basking under the >calming sun. Her eyes are closed, and she breathes in the serene nature around her. In her vicinity, blooming >flowers of varying colors add to the overall tranquility of the setting, while majestic trees provide a slight shade >from the afternoon sun. Her relaxed pose conveys the tranquility of the environment, making for a peaceful and >serene scene.
>>
There is no paper on flux, but if its at all similar process to Dall-E 3, then it would do good to have bit more verbose prompt.

Read the Dall-E 3 paper and look at all the examples and training examples. They used like 90% of AI tagging and the vision model was very verbose.
I think same applies to flux and what works with Dall-E 3, will work with flux.
As far as I know, only Bing implementation of Dalle does not auto force prompt rewriting by AI. ChatGPT always rewrites, which is ass.
>>
>>102055472
video card tech has moved on. It would have done this well if 16GB wasn't so common.
>>
>>102055478
>Enter prompt:
>a man who doesn't know how to green text
>Revised prompt:
>An image of a confused middle-aged South Asian man, sitting at a computer. The screen of the computer is focused on a text-based application filled with neatly organized rows of text, but one line of text remains visibly black while the others are highlighted in green, indicating his struggle to use the 'green text' feature.
>>
>>102055417
I think his challenge was to check how these image descriptors fare without any further hints
>>
>>102055454
the real one, unfortunately
>>
>>102055458
PerpNeg is just a different way to calculate CFG, you're still doing two passes.
I'm talking about building the ability to handle negative prompts during training.
>>
>>102055508
>the real one, unfortunately
I felt that
>>
File: f.png (155 KB, 832x1152)
155 KB
155 KB PNG
>>
>>102055482
you're way late, anon, we figured out Flux was trained on LLM slopa on day one
>>
>>102055511
>I'm talking about building the ability to handle negative prompts during training.
that's a good question, but even if it's possible, higher CFG always improve prompt adherance, so it's not like CFG will magically becomes useless if there was a way to get negative prompt at CFG = 1
>>
>>102055524
>we figured out Flux was trained on LLM slopa on day one
I mean that's obvious no? Pretraining an image model requires billions of pictures, who's gonna do that by hand?
>>
>>102055431
that is how I often had to prompt on 1.5

>>102055432
if we had something like a creativity slider and it was set to 0 creativity, it should only do the bare minimum of concepts you prompted, set it to maximum creativity and you'll get a bunch of fluff and random objects of interest and still end up looking great, that is where you can lock in the seed and add in things like "dog, kite" into the negs if you don't want those in particular. like I said we have CFG but it doesn't quite work like that, at least not with the models I've used, and then you've got all the side effects of your CFG being too low or too high
Flux isn't perfect but from my experience it's the best one so far at following simple prompts, if it knows the things you're prompting
>>
>>102055530
>Flux isn't perfect but from my experience it's the best one so far at following simple prompts, if it knows the things you're prompting
100% agree with that take
>>
File: file.png (2.21 MB, 896x1152)
2.21 MB
2.21 MB PNG
>>102055472
Maybe, but the first few days /ldg/ maxed on pics per thread. People posted absolute kino non-stop. Nowadays all you see is arguing, some anons trying to train loras with varied success, and civitai doing an enshittification speedrun. I don't want to doom, but idk. It feels off.
>>
>>102055544
do again, but put the rowboys into canoes
>>
>>102055524
>LLM slopa
Though if you read the Dalle paper, you would have noticed the very important finding: AI captions improved the model significantly.

They figured out that the benefit was maximal around 90% ai and 10% human. So 90% of AI captioned images and 10% human captioned was better than 80% AI and 20% human and so on.

Read the paper. It was very significant and showed that synthetic data is relevant and can improve performance.
>>
File: file.png (17 KB, 149x99)
17 KB
17 KB PNG
>>102055035
why did ForgeUI suddenly start having TWO images in the output?
They're perfectly identical and it's not actually generating two images, it's generating just the one and then it gets shows as two identical outputs.

Wtf is going on?
>>
File: file.png (2.31 MB, 896x1152)
2.31 MB
2.31 MB PNG
>>102055549
>This is a movie poster for a film from 1994. The film is set in a cyberpunk dystopia, and features robots, literature scholars, and painters. The style of the poster is realistic, gritty, and intended for mature audiences, making heavy use of professional, high quality photography. The title of the film is predominantly displayed in the bottom part of the poster: "Flopx". Underneath the title, a subtitle reads: "Doom and Gloom".

Do it yourself. Here's the prompt:
>This is a movie poster for a Steven Spielberg film from 1994. The film is set in the wild west, and features robots and cowboys. The title of the film is predominantly displayed in the bottom part of the poster: "ROWBOYS".
>>
>>102055544
all we need is a good full finetune and we're back
>>
>>102055552
>AI captions improved the model significantly.
compared to full human's captions that have a 100% accuracy caption?
>>
File: file.png (2.22 MB, 1024x1024)
2.22 MB
2.22 MB PNG
>>
File: file.png (1.89 MB, 896x1152)
1.89 MB
1.89 MB PNG
>>102055564
I get that, but how difficult will it be to tune this model properly, given it's size and the quality of the original dataset?
Don't get me wrong, I got my 3090 specifically to proompt all day with it, and I don't regret it. Like I said, it just feels off right now.
>>
>>102055565
Read the paper on the matter.

https://cdn.openai.com/papers/dall-e-3.pdf
>>
Official pixart bigma hype begins in 8 days
>>
>>102055565
>full human's captions that have a 100% accuracy
anon, this has never been try for anything.
>>
>>102055583
how is that possible though? no AI captions will be 100% accurate, humans can make perfect captions though, so I suspect they hired shit pajeet captionners or some shit
>>
File: file.png (1.98 MB, 896x1152)
1.98 MB
1.98 MB PNG
By the way, does anybody else use ipndm/sgm_uniform? I find it gives excellent results.
>>
File: 2024-08-24_00158_.png (1.52 MB, 720x1280)
1.52 MB
1.52 MB PNG
>>102055562
Thanks. Changed it abit
>This is a movie poster for a Steven Spielberg film from 1994. The film is set in the wild west, and features robots and cowboys. The title of the film is predominantly displayed in the bottom part of the poster: "ROWBOYS". The cowboys on the movie poster are in canoes that are on a wild river. They seem joyful and full of energy as they paddle thru the water. On the side of the river robots with revolvers shoot on the cowboys.
>>
>>102055596
>ipndm
this is twice as slow as euler though?
>>
File: file.png (2.17 MB, 896x1152)
2.17 MB
2.17 MB PNG
>>102055601
Fucking kino
>>
>>102055601
>robots are shooting each other instead of the rowing cowboys
SO CLOSE
>>
File: file.png (2.09 MB, 896x1152)
2.09 MB
2.09 MB PNG
>>102055604
Really? Pic related is the same as >>102055605
with euler. It took the same in both cases. (and they look the same damn)
>>
>>102055508
Apparantly not
>>
File: 00002-2048854268.jpg (222 KB, 1192x880)
222 KB
222 KB JPG
>>
How come there is paper(s) on Dalle3 but not flux, even though flux is the "open"?
>>
File: 00105-2221437000.png (1.48 MB, 1440x1248)
1.48 MB
1.48 MB PNG
>>
>>102055631
maybe they found a secret sauce and they don't want to share it to everyone, and I won't pretend that OpenAI's paper is geuine, maybe they misled us with some exagerations or some lies
>>
File: file.png (1.8 MB, 896x1152)
1.8 MB
1.8 MB PNG
>>102055618
In some cases it looks very different
(>>102055578)
>>
File: t.png (436 KB, 832x1152)
436 KB
436 KB PNG
>>
File: igx182w.jpg (521 KB, 1600x1600)
521 KB
521 KB JPG
>>
>>102055664
a studio ghibli style lora?
>>
>>102055664
Niiice
>>
>>102055664
Very miazaki
>>
File: 2024-08-24_00162_.png (1.59 MB, 832x1216)
1.59 MB
1.59 MB PNG
>>102055610
ya I'm not native English .. I guess it needs to be "are shooting at" not "shoot on" , I noticed t5 can be a fucking grammar nazi sometimes, for example describe something with an animal and don't call it "it" .. instead call it he or she and it just doesnt get.

>This is a movie poster for a Steven Spielberg film from 1994. The film is set in the wild west, and features robots and cowboys. The title of the film is predominantly displayed in the bottom part of the poster: "ROWBOYS". The cowboys on the movie poster are in canoes that are on a wild river. They seem joyful and full of energy as they paddle through the water. Am Flussufer stehen Roboter die auf die Cowboys schießen.
>>
>>102055640
I'm gonna level with you.
With "secret sauce", I would imagine model of this size to be 10x better than SDXL.

I think it all comes from scale and better captioning and using literal LLM instead of clip. No secret.
>>
File: ifx183.png (1.38 MB, 1024x1024)
1.38 MB
1.38 MB PNG
>>102055680
that's just ImageFX
>>
File: file.png (1.83 MB, 1024x1024)
1.83 MB
1.83 MB PNG
https://civitai.com/models/677362/mario-strikers-style-flux?modelVersionId=758237
>>
File: file.png (2.42 MB, 896x1152)
2.42 MB
2.42 MB PNG
>>102055686
I like the title font on that one.
>This is a movie poster for a Steven Spielberg film from 1994. The film is set in the wild west, and features robots and cowboys. The cowboys on the movie poster paddle in canoes down the rapids of a wild river. They look joyful and full of energy as they paddle intensely through the water. On the banks of the river, robots with revolvers shoot at the cowboys from the distance.
>The artwork on the poster uses a painterly style that highlights the adventurous tone of the film, reminiscent of Lucas Films.
>The title of the film is predominantly displayed in the bottom part of the poster: "ROWBOYS".
>Underneath the title, a subtitle can be read: "Paddling of destiny".
>>
>fooled by SaaS gen again
>>
>>102055695
>https://civitai.com/models/677362/mario-strikers-style-flux?modelVersionId=758237
funny, looks like "The World Ends With You" artwork .. is Mario Strikers and TWEWY the same artist?
>>
>>102055690
>I would imagine model of this size to be 10x better than SDXL.
1) SDXL is a 3.5b model, so it's a 3x scale, not 10x
2) SD3-8b isn't even close to Flux even though it's size is quite close
>>
File: file.png (2.98 MB, 960x1280)
2.98 MB
2.98 MB PNG
g2g catch you later boys
>steven spoilberg
lmao
>>
>>102055664
>when even the most cucked GAFAM of them all (google) has miyazaki style in its model, you know that Flux did something really wrong
>>
>>102055700
Awesome!
>>
>>102055723
>GAFAM
?
>>
>>102055710
>spoilberg
he spoiled us this new "Row Boys" movies before the official announcement :'(
>>
>>102055700
>that cooperative 1-hand each paddling on the front boat
>>
File: 4step_up_00022_.png (3.52 MB, 1536x1536)
3.52 MB
3.52 MB PNG
>>
File: images.jpg (12 KB, 296x170)
12 KB
12 KB JPG
>>102055664
>STUDIO GHIBLI STYLE
>>
>>102055707
That is the point. It should be better than just basic scaling.
>>
>>102055552
>Though if you read the Dalle paper, you would have noticed the very important finding: AI captions improved the model significantly.
Yes but OpenAI has a much better caption model.
>>
>>102055730
https://en.wikipedia.org/wiki/Big_Tech
>Alphabet, Amazon, Apple, Meta, and Microsoft are known as the Big Five tech companies. They were known as GAFAM before Facebook changed its name to Meta in 2021
Alphabet is Google
>>
File: FD_00073_.png (1.39 MB, 704x1472)
1.39 MB
1.39 MB PNG
Still struggling with the tat
>>
File: fp123.jpg (204 KB, 1024x1024)
204 KB
204 KB JPG
>>102055738
>>
>>102055740
yeah, SAI fucked it up big, they scaled up their model up to 8b and didn't have the level of Flux, that's a huge failure
>>
>>102055743
>Yes but OpenAI has a much better caption model.
maybe Flux also used GPT4V caption model
>>
>>102055765
we don't know if DALL-E 3 used GPT-4V
>>
>>102055743
I have not researched that. DE3 was trained early 2023 and the research done maybe late 2022.

Are you sure they had access to better vision models than current SOTA open source? I really don't know, but I think they have improved quite a bit.
>>
>>102055771
>Are you sure they had access to better vision models than current SOTA open source?
Most likely, they trained their own with their own image caption pair data set made by humans.
It's in the paper.
is current SOTA using high volume quality human made captions?
>>
>>102055748
I'll see myself out.
>>
File: 4step_up_00023_.png (3.65 MB, 1536x1536)
3.65 MB
3.65 MB PNG
>>102055686
T5 knows German and French too, prompt does not have to be in english
>>
>>102055787
I don't know. Last time I checked, the model from Meta was best, or maybe some finetune of it.
>>
File: file.png (1.86 MB, 1024x1024)
1.86 MB
1.86 MB PNG
>>102055695
Really impressive Lora
>>
How "locked" are peoples PC's when using kohya and making Loras? I have a 13500 and 16GB card and ddr5 and would like to be able to browse and shit while the lora is being made.
Are there any settings to limit GPU/CPU use during training, or even, other tools run locally?
>>
File: e.png (829 KB, 832x1152)
829 KB
829 KB PNG
>>
>>102055804
Yes I know, but it tends to give different results if you prompt in German. Also I think more in English than in German these days.. especially when working with AI.
>>
File: 10205503545344.png (2.17 MB, 1024x1024)
2.17 MB
2.17 MB PNG
>>102055035
>>
Making some really nasty stuff atm
>>
GIVE ME THE LORA
>>
Is there an option in Forge to keep the text encoding on a separate GPU?
>>
>>102055901
>>102055117
Meant for this guy lol
>>
>>102055905
nope, that's why I'm stuck on comfyUi
>>
>>102055915
Just sub to his patreon, it's only $5
>>
>>102055901
>>102055915
he just annoys us here posting pics of the grifter not sharing the lora.. by now I think its actually him doing viral marketing
>>
File: Capture.png (596 B, 96x26)
596 B
596 B PNG
Now were cooking with CFG
>>
>>102055955
same
>>
File: file.png (1.89 MB, 1024x1280)
1.89 MB
1.89 MB PNG
>>102054020
>balthus
what an interesting artist
https://resources.metmuseum.org/resources/metpublications/pdf/Balthuss_Thereses_The_Metropolitan_Museum_Journal_v_33_1998.pdf
>>
>>102055917
That's what I figured, no need to wedge this 2080 Ti in then, thanks anon.
>>
File: bComfyUI_108758_.jpg (858 KB, 2048x1088)
858 KB
858 KB JPG
>>
File: FD_00074_.png (1.33 MB, 768x1344)
1.33 MB
1.33 MB PNG
LoRA done. Once I get enough civit sheckles I will re-do it. with more nudes. Will post it soon.
>>
>>102055975
>Balthus even had the Rola arms embroidered onto many of his kimono, in the style of a Japanese kamon.
it's always the anime fans
>>
File: 1719318662886651.png (551 KB, 512x512)
551 KB
551 KB PNG
>>
is there a node that makes a sound when the gen is over? don't give me this, this shit also changes the whole Ui, I hate that
https://github.com/pythongosssss/ComfyUI-Custom-Scripts?tab=readme-ov-file#play-sound
>>
File: 4step_up_00024_.png (3.47 MB, 1536x1536)
3.47 MB
3.47 MB PNG
>>
File: file.png (2.17 MB, 1024x1024)
2.17 MB
2.17 MB PNG
https://www.youtube.com/watch?v=FuJBwu_03r8
>>
File: 4step_up_00025_.png (3.65 MB, 1536x1536)
3.65 MB
3.65 MB PNG
>>102056092
I like the font on this one
>>
File: bComfyUI_108785_.jpg (741 KB, 2048x1088)
741 KB
741 KB JPG
>>
File: a.png (1.32 MB, 832x1152)
1.32 MB
1.32 MB PNG
>>
>>102055748
I prefer FAGMA
>>
File: 00115-3221897838.png (1.18 MB, 1440x1248)
1.18 MB
1.18 MB PNG
>>
>>102056053
the nose knows
>>
>>102055847
I use ai-toolkit, but to answer your question I think it doesn’t matter. My 3090 is left with just enough memory to browse with one tab open and I see graphical glitches from time to time. Also I have to close stuff before starting or else it won’t. This on Linux and KDE, which is significantly lighter than Windows.
>>
File: file.png (2.13 MB, 1024x1024)
2.13 MB
2.13 MB PNG
>>102055695
>striker0s, Mario Striker art style,
>Hatsune Miku as a sleek, robotic samurai in chrome armor is slicing through waves of sushi rolls flying through the air. Each slice sends colorful sparks flying. Behind her, a giant koi fish swims through the sky as if it were water, creating ripples of light, text at the bottom: "Sushi Master."
Holy fuck it nailed that shit perfectly
>>
>>102056146
Young scully lewd lora when?
>>
>>102055578
Moreover there’s the licensing issue. Flux is not going to take off like SDXL did. This is as far as it’s going to get unless some autist is autistic enough to devote a ton of money and time for a proper tune.
>>
File: file.png (2.34 MB, 1024x1024)
2.34 MB
2.34 MB PNG
>>102055695
>>
File: 4step_up_00028_.png (3.48 MB, 1536x1536)
3.48 MB
3.48 MB PNG
>>102056174
>>
File: 00124-3362575104.png (975 KB, 1360x1024)
975 KB
975 KB PNG
>>102056186
When there's a porn finetune probably
>>
>>102056337
nice
>>
What learning rate are you all using for your LoRA training?
>>
>>102056200
Licensing is something that's only really being screeched about by a few big names. Let them die. Flux will take over when people learn to game the system again.
It's like everyone just forgot every model was merged with stolen NAI weights for like a year.
>>
File: ComfyUI_05485_.png (1.59 MB, 1024x1024)
1.59 MB
1.59 MB PNG
>>
File: file.png (2.26 MB, 1024x1024)
2.26 MB
2.26 MB PNG
>>102056433
>>
>>102056370
NAI chose to ignore that because it drove business to them, they could have easily shut it down if they wanted to, there would have just been no benefit in doing so.
The situation with BFL is different. There's no gaming the system. If anyone releases a full finetune and attempts to profit from it BFL will not stand for it. They wouldn't have included the terms if they had no intention of enforcing them, it's part of their business plan.
>>
>>102056169
thanks, ubuntu and gnome here ill look at aitoolkit, i guess it's 512 16gb vramlets?
>>
File: 00041-4239731267.png (789 KB, 832x1216)
789 KB
789 KB PNG
>>102056200
I feel like the guys making porno models did not care about the license.
>>
File: file.png (2.1 MB, 1024x1024)
2.1 MB
2.1 MB PNG
>>
File: 2024-08-24_00207_.png (1.12 MB, 832x1216)
1.12 MB
1.12 MB PNG
>>
>>102056545
that pony lover sure does, he chose sdxl cause he can cash in on his finetune and start fighting and argueing with SAI when SD3 did have a "non-commercial" license .. just like Flux.dev, and now he wont even consider Flux.dev cause its non-commercial only
>>
>>102056545
I wanna see if BFL takes down all the non-schnell models that "generate revenue".

NAI never did that, even though their model was leaked and it was literally almost over for them. As far as I know, they never tried to take down any model or mix. The whole 1.4 and 1.5 ecosystem relied on the original NAI leak.
>>
File: ComfyUI_Flux_Dev_00343_.png (1.74 MB, 1024x1024)
1.74 MB
1.74 MB PNG
Any interesting news in the past week or so?
>>
File: FD_00111_.png (681 KB, 1024x1024)
681 KB
681 KB PNG
Alright, I am not really happy with this LoRA but it's there if you want it.
https://civitai.com/models/680433
>>
File: bComfyUI_108806_.jpg (1.05 MB, 2048x1088)
1.05 MB
1.05 MB JPG
>>
File: 2024-08-24_00215_.png (1.24 MB, 832x1216)
1.24 MB
1.24 MB PNG
>>
File: 2024-08-24_00214_.png (1.19 MB, 832x1216)
1.19 MB
1.19 MB PNG
>>102056627
Thanks, I don't really care for Rebecca, but I'll give you a thumbs up in civitai for your work.
>>
>>102056645
What lora is this?
>>
>>102056467
We have yet to see that and until then, you're talking just as much shit as I am.
>>
File: 2024-08-24_00216_.png (1.16 MB, 832x1216)
1.16 MB
1.16 MB PNG
>>102056669
none, I am just writing abit boomer prosa atm, that was
>A picture in a flat anime style of a human wizard in the forest picking mushrooms. The scenario is sinister and evil. The mushrooms are black with green slime dripping of them.
>The wizards eyes glow with a dark menance, his robe is embroided with arcane symbols. He is a vile necromancer. Behind the wizard stands a skeleton servant holding a black weave basket filled with the same mushrooms the wizard is picking. Its nighttime and there is a mysterious glowing red moon in the sky.

still workin on it tho, the skeleton doesnt wanna hold the basket
>>
>>102055954
yea he doesnt seem to clown on him too hard. You'd think there would be some of him getting his ass fucked by a demon or something by now. I think you are right.
>>
another day, another hundred celebrity loras uploaded to civit
>>
File: file.png (817 KB, 856x850)
817 KB
817 KB PNG
fuck flux, when the next sota dropping?
>>
>>102056736
any onlyfans girls?
>>
File: ComfyUI_02096_.png (1.13 MB, 1024x768)
1.13 MB
1.13 MB PNG
>>102056714
Interesting theory, but wrong
>>
File: bComfyUI_108817_.jpg (803 KB, 2048x1088)
803 KB
803 KB JPG
>>
>>102056755
then stop posting the shit or make him fuck a demon or post the lora
>>
>>102056755
it's completely the grifter, not a single time he got clowned and has the same style as the real one doing his own loras
>>
File: 2024-08-24_00222_.png (1.21 MB, 832x1216)
1.21 MB
1.21 MB PNG
damnit the skeleton just does not want to hold the basket, it is mocking me
>>
File: 00014-2710870355.png (1.06 MB, 1152x896)
1.06 MB
1.06 MB PNG
>>102056594
If you're not in it for the love of the game, I will reject your fine-tune on principle.
>>
>>102056750
I dunno, aren't all of them?
it's not as if OF is an exclusive club with a high barrier to entry, anyone can sign up
>>
>>102056755
then show us a catbox of fuckface fucking himself in the ass or s t o p.
>>102056736
anything nice?
>>
File: 2024-08-24_00226_.png (1.37 MB, 832x1216)
1.37 MB
1.37 MB PNG
... damn lazy skeleton.
>>
File: 1695254425138922.png (1.73 MB, 1024x1024)
1.73 MB
1.73 MB PNG
>>
What's this Flux Atilessence Lora test
?

https://civitai.com/models/647940
>>
File: bComfyUI_108936_.jpg (530 KB, 2048x1088)
530 KB
530 KB JPG
>>
File: ComfyUI_05488_.png (1.42 MB, 1024x1024)
1.42 MB
1.42 MB PNG
>>102056823
Sigh... if you want better prompt adherance you have to go for other samplers than euler, but the one that work well are 2 times slower, fuck off
https://imgsli.com/MjkwNjE1
>striker0s, Mario Striker art style,
>A joyful woman with tears of happiness streaming down her face is holding a goat high in the air. The goat is wearing a golden crown adorned with jewels, and the word ‘Flux’ is elegantly written on the crown. The woman has a speech bubble next to her that exclaims, ‘THAT’S WHY HE’S THE GOAT!!’ The background is a vibrant, celebratory scene with confetti falling from the sky and a crowd of people cheering in the distance. The woman is dressed in casual, colorful clothing, and the goat looks proud and majestic with its crown.
>>
>>102056792
he looks like he knows exactly which mushrooms to pick lulz
>>102056601
a lot is happening but we're slowly settling in, for now. lora flood phase. nice gen
>>
File: 1716609748221962.png (1.76 MB, 1024x1024)
1.76 MB
1.76 MB PNG
>>
File: ComfyUI_00635_.png (579 KB, 512x768)
579 KB
579 KB PNG
Does anyone have an example comfy workflow for generating an image in SD1.5 and then piping the latent into Flux for upscaling & refinement?
Or, incidentally, the other way round?
>>
>>102056852
Why sd 1.5? Why not some sdxl stuff?
>>
>>102056852
can you replicate a flow based on an image? dont wanna share directly.
>>
>>102056767
So fucking stupid that I have to be bullied into posting this half baked LoRA.

https://gofile.io/d/7RjVgs
>>
>>102056874
Exquisite Details is too good and stuck on SD1.5

>>102056890
Sure, that'd be great for learning actually.
>>
>>102056893
I believe the activation phrase is Cerfukin
>>
>>102056893
based, what's the trigger word?
>>
>>102056893
early and often
>>
>>102056914
Cerfukin
>>
File: 1699610027443295.png (1.81 MB, 1024x1024)
1.81 MB
1.81 MB PNG
>>
Anyone using joycaption locally? I can't figure it out even with translate
>>
File: up.png (249 KB, 2552x606)
249 KB
249 KB PNG
>>102056900
sleepy but ask away. need some custom nodes. power perlin noise, rgthree, KJnodes, nothing fancy. don't get confused by those set/get nodes.
>>
>>102056944
what about the worst remixes?
>>
>>102056967
Turn on youtube and boom you got em
>>
File: ComfyUI_01355_.png (720 KB, 1024x768)
720 KB
720 KB PNG
>>
File: ComfyUI_01372_.png (758 KB, 1024x768)
758 KB
758 KB PNG
>>102056984
>>
File: ComfyUI_01373_.png (768 KB, 1024x768)
768 KB
768 KB PNG
>>102056994
>>
>>102053192
>JC Denton
Seconding this
(I just woke up)
>>
File: 0.jpg (101 KB, 1024x1024)
101 KB
101 KB JPG
>>
>>102057015
Why not a whole deus ex lora?
>>
>>102052849
TV's Frank
>>
File: ComfyUI_01379_.png (760 KB, 1024x768)
760 KB
760 KB PNG
>>102057010
>>
>>102056792
That's a trustworthy fellow if I ever saw one.

>>102057019
Yeah, that would be really cool. I'd do one, but I ran into an error no one on the internet has. I'm waiting a few weeks to see if updates solve it.
>>
>>102057019
>>102057035
>>102057015
I might do this next, the Rebecca LoRA pissed me off with how bad it is. Just need buzz. Should be able to farm it up in a couple of days.
>>
File: ComfyUI_01380_.png (730 KB, 1024x768)
730 KB
730 KB PNG
>>102057032
>>
Model train you!
>>
>>102057046
The Rebecca LoRA looked good.
>>
File: ComfyUI_01383_.png (702 KB, 1024x768)
702 KB
702 KB PNG
>>102057047
>>
File: flux_00739_.png (1.86 MB, 1024x1320)
1.86 MB
1.86 MB PNG
>>102057046
Bless you, anon. I tried one of my Deus Ex prompts and was disappointed that flux didn't recognize it.
>>
Man SDXL was way better, most loras so far have been shit for my flux creations. It seems and maybe im wrong here that SDXL took way less effort to create stuff that was presentable. If you have even 1 shit dataset in your flux it will fucking ruin everything, with SDXL, you could maybe have a few in there but it wouldn't shit the bed completely. What am I doing wrong? The captions are fine, im using openai to interrogate the images, using 5000 steps, but if you include a dataset that might be even a slightly bit different than everything else, even though it has all the same features, your end result will be shit.
>>
File: ComfyUI_01384_.png (793 KB, 1024x768)
793 KB
793 KB PNG
>>102057063
>>
File: FD_00100_.png (941 KB, 1024x1024)
941 KB
941 KB PNG
>>102057060
It's very cherry picked. Have to describe her pretty exactly otherwise you just get random women with her tattoos and eyes.
>>
>>102057071
>l
What parameters are you training at? People say Flux isn't easy to overtrain, but I find it can easily go off the rails if your settings are too strong.
>>
>>102055975
wtf why is it always the french?
>>
I love you rebeccaaaaa
>>
>>102057046
>>102057060
>>102057077
Where's the Rebecca LoRA?
>>
File: file.png (2.4 MB, 1024x1024)
2.4 MB
2.4 MB PNG
>>102057076
>>
>>102057077
Oh, okay, now I see.
>>
File: FD_00110_.png (1 MB, 1024x1024)
1 MB
1 MB PNG
>>102057090
https://civitai.com/models/680433
>>
>>102057085

https://pastebin.com/m0FPqQFP
>>
File: ComfyUI_01387_.png (770 KB, 1024x768)
770 KB
770 KB PNG
>>102057093
>>
>>102055349
Ignore the retard, I'll try to find a clean(ish) way to do that when I get home. It's slightly faster if you have the vram but I didn't want to add separate LoRA/CN/etc nodes just for gguf shit but a "force patch vram" node or sth should work even if it's hack.
>>
File: ComfyUI_01392_.png (741 KB, 1024x768)
741 KB
741 KB PNG
>>102057093
Cursed
>>
File: 00044-2573187592.jpg (966 KB, 2224x1248)
966 KB
966 KB JPG
goomorn
today is, indeed, latina caturday.
https://youtu.be/M8qZT4BqZ6E?si=8SvP2v-XElvoQNV4
>>
>>102057106
>I didn't want to add separate LoRA/CN/etc nodes just for gguf shit but a "force patch vram" node or sth should work even if it's hack.
why can't you add a button on your GGUF model loader, like "load the lora on vram" ON/OFF, that could do the trick?
>>
File: 2024-08-24_00231_.png (1.47 MB, 832x1216)
1.47 MB
1.47 MB PNG
>>102057035
>>102056833
ya you can trust em with your loved ones and best kept secrets..

also I gave em fucking A and B signifiers and then said the skeleton with the B hold the basket.. nothing.. by now I think there is magic at play
>>
why are you (You)ing yourself anon
>>102057071
I dunno, I dont make loras. but what I can say is flux loras, so far, are fucking all over the place and testing them/dialing them in is painful AF. patience wearing thin desu.
>>102057098
wild!
>>
File: ComfyUI_01394_.png (807 KB, 1024x768)
807 KB
807 KB PNG
>>102057119
>>
>>102057119
no Sailor, it's still not a Hamburguaa :(
>>
File: 2024-08-24_00232_.png (1.38 MB, 832x1216)
1.38 MB
1.38 MB PNG
gngnrrnnfn...
>>
File: ComfyUI_01397_.png (955 KB, 1024x768)
955 KB
955 KB PNG
>>102057137
>>
>>102057070
>I tried one of my Deus Ex prompts
Here's one from way back, maybe from an SD1.5 finetune. I really like the aesthetics, except the guy for some reason turned green.
>>
>>102057130
>I dunno, I dont make loras. but what I can say is flux loras, so far, are fucking all over the place and testing them/dialing them in is painful AF. patience wearing thin desu.

That's just it though, dialing it in takes forever and then finding a prompt where you get variances is also hard to find. With SDXL you just changed a few words and you would get totally different setting, lighting, etc. With Flux its almost always the same fucking gen
>>
>>102057165
>the guy for some reason turned green.
I distinctly remember all art from that game having that green hue. If anything it's the woman that's not matching the aesthetic.
>>
File: ComfyUI_01398_.png (963 KB, 1024x768)
963 KB
963 KB PNG
>>102057159
>>
>>102057127
flux got a will of its own. (certainly) guidance, step amount, the max/base shift and jesus all play a part in this.
>>
>>102057170
>With Flux its almost always the same fucking gen
Doesn't that usually mean it's overtrained?
>>
File: ComfyUI_01406_.png (928 KB, 768x1024)
928 KB
928 KB PNG
>>102057186
It does, I am not even sure what it's trying to tell me with this chinese text
>>
File: 2024-08-24_00233_.png (1.49 MB, 832x1216)
1.49 MB
1.49 MB PNG
>>102057186
mostly Jesus .. now the skeleton is A .. guidance is 3.5 .. I guess ill try lowering that
>>
File: file.png (2.27 MB, 1024x1024)
2.27 MB
2.27 MB PNG
>>
>>102057212
What's the prompt?
>>
File: 4086247269.png (1.4 MB, 896x1152)
1.4 MB
1.4 MB PNG
Looks like training on 512px works but introduces the old familiar problems like mangled hands and such. Or it could be something else with the dataset, what do you think?
>>
>>102057196

It doesn't matter a whole lot, tried 1000 steps to 5000 steps, its cool to try dont get me wrong, but to actually do work with, its a nightmare. It just has potential but that's it. Its over hyped bullshit
>>
>>102057227
holy fucking christ now they are holding it together looking mockingly at me...

ah.. the prompt here:

>A cinematic shot in a photorealistic style of a human wizard. He is wearing a long black hooded robe and has a long white beard. His robe is embroided with arcane symbols. He is bald and his face is grumpled and old. He has a sinister grin on his face. His left eye is blind. He is a vile necromancer. On the wizard's forehead there is a tattoo that is the simple bold letter A.

>He is in the forest picking mushrooms. The scenario is sinister and evil. The mushrooms are black. Green slime covers the mushrooms.

>Behind the wizard stands a skeleton servant with glowing eyes. On the skeletons skull a is a simple bold letter B is painted in red.

>The skeleton with B on on its skull is holding a basket. The basket is filled with mushrooms. Its nighttime and there is a mysterious glowing blue moon in the sky.
>>
File: 0.jpg (140 KB, 1024x1024)
140 KB
140 KB JPG
>>
File: 00058-195349927.png (810 KB, 1152x896)
810 KB
810 KB PNG
>>102057186
FLUX is the LLaMA 3 of image models (high quality but very censored).
>>
File: 2024-08-24_00235_.png (1.35 MB, 832x1216)
1.35 MB
1.35 MB PNG
>>
>>102057247
ablation when
>>
>>102057260
>ablation
I had to look that word up.
>>
File: ComfyUI_01274_.png (934 KB, 1280x720)
934 KB
934 KB PNG
>>
Ready to roll with the a fresh loaf of...
>>102057280
>>102057280
>>102057280
>>
>>102057211
well there is the number 33 in there so thats a good thing. I like numbers.
>>102057212
I mean it certainly does listen. just go like "4 panel grid, 4x4, panel 1: bla, panel 2:jesus, ..." and it will do that
>>102057246
super
>>102057247
well I distinctly remember fucking a llama-3 powered characters brains out so that can be jailbreaked. assistent assistant. nice face there
>>
File: ifx185.png (1.69 MB, 1024x1024)
1.69 MB
1.69 MB PNG
>>
>>102056832
it seems to me like both failed, could you try it without the lora? these types of comparisons are best on the base model.
>>
>>102055870
pretty slick artstyle
flux lora?
>>
>>102057170
Not with schnell, as ong as you're not after photos. Schnell got range.
>>
>>102056953
nta but thanks a lot, also those are some great custom nodes.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.