[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


3-Year duration 4chan Passes are now available for $45

[Advertise on 4chan]


File: tmp.jpg (776 KB, 3264x3264)
776 KB
776 KB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102162827

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
>26 minutes ago
>>
File: frame_00012.png (1.32 MB, 1920x1080)
1.32 MB
1.32 MB PNG
I had to check the catalog to find the new one.
>>
File: 2024-08-31_00135_.png (1.31 MB, 832x1216)
1.31 MB
1.31 MB PNG
>>102166301
ty baker
>>
File: FD_00209_.png (1.9 MB, 1024x1024)
1.9 MB
1.9 MB PNG
Thanks for pinching off a fresh loaf
>>
thx mods for removing the pedo slop
>>
Nobody posting gens so here's how my LoRA is progressing.

I guess it makes sense with over 1300 images it needs more time in the oven but I hope the outputs aren't all this shitty on inference.
>>
>>102166872
A lot of the images have 'cloudy' output.
>>
>>102166872
Why so many images for a LoRA? Your character looks simple (idk who the fuck that is). You should be able to do it with 20-50.
>>
File: 2024-08-31_00152_.jpg (707 KB, 2496x3648)
707 KB
707 KB JPG
>>102166872
her basic features are recognizable as Senjogahara, the style is not tho .. yea it needs more baking
>>
>>102166887
It's a style LoRA

>>102166896
I have the training set to 7000 steps and it will probably need it. I should have opted for epochs instead for ease of tracking. This one just popped out right after the new epoch began.
>>
File: 2024-08-31_00158_.png (1.43 MB, 832x1216)
1.43 MB
1.43 MB PNG
>>102166919
you making a general monogatari style lora or just a character lora? if its the later >>102166887 is right, 1300 images is to many .. if its a style lora, its fine
>>
>>102166919
I did a style LoRA with only 103 images. 1300 seems extremely excessive.
>>
>>102166933
Style LoRA.
The images are tagged with the characters, but that was secondary to the overall style. It's doing an okay job of the background imho, here's a much earlier example. It's stopped giving me wrongly proportioned bodies so far.
>>
>>102166936
I don't think it's excessive, but more training than usual is necessary to get an acceptable result.
>>
>>102166949
what data set you using? screencaps?
>>
>>102166872
what base model?
>>
>>102166966
I downloaded videos of all the character PVs throughout the series and made a script that extracted an image every 0.5 seconds from the video, cleaned up the overly samey images and tagged them.
I think the nature of the videos kind of dooms the LoRA to having compression artifacts baked into the LoRA.though.
>>
>>102166966
Flux
>>
>my lora: 0 buzz
>some pics some dude made with my lora: 120 buzz
I'm not bitter, no sir
>>
File: 2024-08-31_00163_.jpg (1.08 MB, 2496x3648)
1.08 MB
1.08 MB JPG
>>102166991
>compression artifacts baked into the LoRA.though
ya that was my thought to, did you use the blu rays or some other source? cause especially streams are pretty bad for making loras
>>
File: 2024-08-31_00016_.png (1.55 MB, 832x1216)
1.55 MB
1.55 MB PNG
>>102167038
die in a fire
>>
>>102167034
Did he at least post the pic to your lora page?
>>
>>102167041
Nope, it's about as shitty of a image scrape as you could hope for.
At least when it's finished we can all take a good look at how shitty or/ actually okay it turns out.
>>
>I was going to stop
No you weren't lol. Why would you tell such an obviously false lie?
>>
File: 2024-08-31_00166_.png (1.24 MB, 832x1216)
1.24 MB
1.24 MB PNG
why do diffusion generals attract the most horrible kind of persons
>>
>>102167109
because you keep talking about them and feeding them
>>
>>102167109
its not just diffusion generals, you can find such a schizo on every board.
>>
>>102167144
Im sorry, ill stop.
>>
>>102166991
yeah, flux picks up on the small details way more, im noticing too

on doing 7000 steps, assuming your 1300 images have no repeats set, you're looking at a bit over 5 epochs (an epoch is one complete trip through all the images)

i would say go with epochs next time, set it to 16 epochs and save each one, can cancel training if it seems good by epoch 6 or 8. It's also possible to resume from a completed epoch (trying that now)

batch size also factors in.. if you've got the memory i always trust batch size 4, too high batch size can reduce quality because it is reducing steps
>>
>>102167155
I generally keep a batch size of 3 at 512 but also do a second pass on the same dataset at 1024 with a batch size of 1 with less repeats. Idk, I just think mixing a few 1024x1024 images into the mix might be good for output. I have no proof of that though.
>>
I'm gonna do it. I'm gonna pull comfy.
I can't stand having to clear my model cache manually every time I change the prompt.
Surely this is fixed in a new pull, right?
>>
>>102167290
no. It is doing everything properly. An update won't stop it from doing what it is doing. You can't seem to get what is actually happening through that thick skull.
>>
>>102167301
What is happening, Anon?
>>
>>102167309
I have explained it twice at this point.
>>
>>102167320
didn't ask for your life story
>>
>>102167320
You have not. You claimed it's something that doesn't happen on gguf, which I am using so it isn't that. This is all you have said on the matter (providing that was even you) because I have brought it up exactly once before this.
>>
>>102167329
I assumed you were also the guy asking for optimizations and started up with that guy who was acting as city.
>>
hello 1girls
>>
Oh no
>>
>>102167351
Not me. I don't pretend to be anyone, not even myself.
It only started happening recently. Prior to this I was using wildcards with no issue. Now if I change a single word the I go OOM unless I clear the model cache.
>>
>>102167301
Calm down Comfy
>>
>>102167368
I look like that
>>
>>102167392
alright, short version, you are probably still fucked. Although I hear the new update isn't bricking shit so there is that.

>>102167395
as comfy I have a confession. lllyasviel is a better programmer than I and after some soul searching I wanted everyone to know.
>>
>>102167290
I hope for you they fixed the OOM issue with multiple loras, I had to go back to an older release after I pulled two days ago
>>
>>102167448
Multiple LoRAs is fine, that was fixed a few days ago. My issue is something else.
I pulled and updated the GGUF nodes and still the same problem so I don't know what's happening.
>>
>>102167460
Changing to Q6_K over Q8_0 seems fine. I guess I will just take the minor quality hit for convenience. Still annoying when it was all working perfectly less than a week ago.
>>
>>102167460
ookay .. ill pull then and try. Was very annoying that it would just OOM on my 4090 randomly on big loras
>>
>>102167497
Just update the gguf nodes, that's where the fix is. Update comfy too if you want I guess but it's the nodes that matter.
>>
>>102167501
nah I am using fp16 .. was a problem regardless
>>
File: FD_00276_.png (920 KB, 1024x1024)
920 KB
920 KB PNG
Why is the thread so dead? Yanks asleep?
>>
>>102167519
Oh then I don't know what your issue is. City was in here a few days ago saying he fixed the LoRA issue and he did. Yours must be something else.
>>
>>102167523
nothing interesting going on, everybody either waiting for a good flux finetune or another model to come out
>>
File: Untitled-1.jpg (1.91 MB, 3768x2512)
1.91 MB
1.91 MB JPG
>>
File: ComfyUI_00125_.png (800 KB, 1024x1024)
800 KB
800 KB PNG
flux is fucking insane
it's easily better than DALL-E 3
i've been coming up with dozens of items for my RPG
even on a Q4 quant, CFG1, 20 steps, i am getting good shit 90% of the time

i think this is the model that's going to wind up causing congress to push legislation and regulation
>>
>>102167535
It's better without so much rampant faggotry desu. Just the normal amount of faggotry from the usual suspects.
>>
File: 2024-08-31_00175_.jpg (1.07 MB, 2496x3648)
1.07 MB
1.07 MB JPG
>>102167531
I don't know what it was either.. but its fixed, can load 4 big loras again now with fp16 and no OOM.
>>
>>102167553
>i think this is the model that's going to wind up causing congress to push legislation and regulation
And what will this even do? Cuck America in the AI race? The world is bigger than the USA.
>>
>>102167565
>And what will this even do? Cuck America in the AI race? The world is bigger than the USA.
this, if the US doesn't want to improve AI, others countries will gladly do it instead
>>
>>102167553
what kind of RPG has gamer girl bathwater?
>>
Whatever the model unloading issue is, it seems to be related to the quant models. Removing them from the equation and now it's not happening.
Sucks because the quality is so much better on the Q8 than FP8.
>>
Getting a little better at 4700 steps
>>
>>102167693
A meme game.
But I will guess it's an e-girl visual novel.
>>
>>102167109
And Dunning-Kruger sub 70 IQ redditor morons.
>>
>>102167726
I will have you know my IQ is in the 4th percentile.
>>
File: city.jpg (74 KB, 960x540)
74 KB
74 KB JPG
>>102167713
shitty code
>>
>>102167109
the ai venn diagram overlaps alot with the nft/crypto space, that and it tends to attract alot of lonely gooner types, like me! my nuts hurt!
>>
>>102167746
>le funny uboat farming comment
>>
>>102167713
>Whatever the model unloading issue is, it seems to be related to the quant models. Removing them from the equation and now it's not happening.
>Sucks because the quality is so much better on the Q8 than FP8.
yeah, Q8 (or any GGUF quant) has some bugs that FP8 doesn't, let's hope city will fix them
https://github.com/city96/ComfyUI-GGUF/issues/84
>>
>>102167786
you've linked that before, it was 404 then and it is still 404
>>
>>102167797
wtf you're right, why is that 404 when I log off?
>>
>>102167563
grrr.. was to happy to soon.. it still goes OOM back to 9230f658232fd94d0beeddb94aed093a1eca82b5 it is
>>
File: FD_00299_.png (1.8 MB, 1024x1024)
1.8 MB
1.8 MB PNG
>>102167775
>le funny uboat farming
>>
File: file.png (147 KB, 1778x1145)
147 KB
147 KB PNG
>>102167786
>>102167797
the fuck is goin on? I tried to make this issue 3 times but no one can see it but me?
>>
>>102167838
city96 doesn't like you
>>
>>102167847
but I never talked to him on Github ;_;
>>
>>102167838
NTA but I can't view your profile at all, I'm not logged in though. Does Github shadowban people?
>>
>>
>>102167838
probably moderator approval mode .. the comfy team is full of weirdos
>>
>>
>>102167869
it's not a comfyorg repo
>>
>>102167863
why would I be shadowbanned? I absolutely did nothing special, I created this account a week ago and made 2 regular issues, that's all lol
>>
>>102167838
I hope your issue gets approved cause this seems to be exactly my problem
>>102167448
>>102167563
>>102167823
anything past the 92... release will randomly crash into OOM with big loras or multiple loras, while the 92.. release works perfectly fine regardless of how many loras I load
>>
>>102167838
I just submitted an issue and yours aren't there. Skips #87 and #88
>>
>>102167939
fuck, how am I gonna reach to city now?
>>
>>102167955
city96 could be anywhere
>>
File: FD_00288_.png (1.03 MB, 1024x1024)
1.03 MB
1.03 MB PNG
>>102167892
One of the changes he made was an attempt to fix the LoRA reloading issue, maybe a regression bug
>>
File: shamiko1.png (2.8 MB, 2048x2048)
2.8 MB
2.8 MB PNG
>>
>>102167957
city, if you're reading this thread, can you please fix this?
https://files.catbox.moe/ox8hip.txt
>>
File: shamiko2.png (3.24 MB, 1792x2304)
3.24 MB
3.24 MB PNG
>>
File: shamiko3.png (3.05 MB, 1792x2304)
3.05 MB
3.05 MB PNG
>>
File: shamiko4.png (2.48 MB, 1664x2432)
2.48 MB
2.48 MB PNG
>>
>>>/e/edg is over there
>>
>>102167971
>>102167976
>>102167985
>>102167991
You post the same shit every time. Does it nor bore you?
>>
File: chie2.png (2.77 MB, 2048x2048)
2.77 MB
2.77 MB PNG
>>102168018
>You post the same shit every time.
Nah, that's wrong.
Sometimes I post armpits.
>>
>>102168018
Go in to /sdg/. People have been prompting the same shit over and over for two years now. Actual mental illness. I even saw the angry hamburger guy here earlier today.
>>
foot fetishists can post their porn everywhere they go with no consequences but i can't?
>>
File: FD_00311_.png (1.88 MB, 1024x1024)
1.88 MB
1.88 MB PNG
>>102168028
>Go in to /sdg/.
No.
>>
Fuck yeah
>>
>>102168031
correct.
>>
>>102166301
>https://imgsys.org/rankings
No AuraFlow, no Kolors, what is this, an advert for Flux Pro?
>>
File: 1167752492635494163-SD.png (1.24 MB, 896x1152)
1.24 MB
1.24 MB PNG
whats the current best flux finetune anons?
>>
File: FD_00316_.png (1.05 MB, 1024x1024)
1.05 MB
1.05 MB PNG
Oh shit this cunt knows transformers
>>
>>102168101
is there any REAL finetunes of Flux yet?
>>
>>102168101
None, they are all shit
>>102168098
>you don't list every random model available
wow what a load of shills
Here's why
https://civitai.com/search/models?baseModel=AuraFlow&baseModel=Kolors&sortBy=models_v9
>>
>>102168126
The Ponyfucker is gonna make Aura Flow relevant.
>>
File: 1167752492635494131-SD.png (1.94 MB, 896x1152)
1.94 MB
1.94 MB PNG
>>102168120
I dont know, hence the q

>>102168126
thought so
>>
>>102168126
>Here's why
nta but i don't get what you were trying to prove with that link, they all look pretty good.
>>
>>102168145
>The Ponyfucker is gonna make Aura Flow relevant.
I'm not sure of that, recently there was a giant Hunyuan hentai finetune and no one gave a fuck

>>102168156
I think the point is that there's only ~10 contributions on civitai about Auraflow, that's what Flux does in one hour, no one give a fuck about that model
>>
File: FD_00320_.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
>>102168145
And someone smarter is going to mog him on Flux.
>>102168156
>25 models combined
>of those only 9 are LoRAs
You're right it proves nothing. Very good models. Very popular.
>>
>>102168168
>no one give a fuck about that model
this mentality is poisonous, one of the reasons /ldg/ split from /sdg/ is to avoid that train of thought
>>
>>102168184
the reality is here, if Auraflow was a hype model there would be a lot of loras on civitai, there's nothing because there are better toys to play with, like Flux
>>
File: FD_00303_.png (1.06 MB, 1024x1024)
1.06 MB
1.06 MB PNG
>>102168184
Sorry Anon but Aura Flow had potential and got completely shit on by the release of Flux. That's just how the cookie crumbles sometimes.
Sigma had potential too, so did Lumia, but they don't hold a candle to Flux.
>>
SDXL was fairly irrelevant compared to SD 1.5 until Pony came out.
>>
>>102168101
the flux version of copax is alright... but the bar is pretty low for finetunes currently, https://civitai.com/models/118111?modelVersionId=778112
>>
>>102168201
XL had a shit load more LoRAs and fine tunes before Pony too. More than 1 in 2 months.
If Aura Flow becomes relevant it will go into the OP. But it's just a waste of space right now. Bitching about it is dumb.
>>
>>102168126
>Here's why
AuraFlow and Kolors are outperforming Flux around here, except in text and prompt adherence.
Is that it? People value so much text and where things are in a picture? I beg people to send the same prompt to Flux and Kolors and compare the outputs, getting better text and adherence to Kolors would have gotten us to where we want to go faster than creating a Lora for every single thing Kolors can't do.
>>
>>102168242
Text is one of the least important things to me as it's so easily inserted with an image editor.
>>
File: FD_00315_.png (727 KB, 1024x1024)
727 KB
727 KB PNG
>>102168242
Go and start training for them then, Anon. Nobody but you gives a fuck.
>>
>>102168242
>AuraFlow and Kolors are outperforming Flux around here, except in text and prompt adherence.
that's bait
>>
>>102168253
And the funny thing is, with Flux around how bad the other models were at text doesn't matter, you just generate with them and inpaint the text with Flux.
>>
>>102168253
text in context of a picture is not easily inserted with an image editor. sure a meme caption is easy peasy, but think of painted letters on a diagonal wall that seamlessly integrate with the image composition, you'll spend a lot of time in photoshop adding that
>>
>>102168272
It's probably easier and faster to shoop in text and inpaint over it with a faster model at a low denoising strength. Plus get to choose your font that way.
I guess Flux's text is useful in certain scenarios, like you want text in a very specific style, like a specific team's jersey or something.
>>
>>102168258
is this a lora? link pls
>>
>>102168253
Flux can unironically do some beautiful and stylized text with inpainting.
>>
>>102168258
this
>>
File: 2024-08-31_00210_.jpg (857 KB, 3840x2160)
857 KB
857 KB JPG
>>
I don't give a fuck about text because it doesn't help me coom.
>>
File: KolorsVFlux.png (2.6 MB, 2048x1024)
2.6 MB
2.6 MB PNG
>>102168258
I want to understand what is going on, though.
Here's Kolors Vs. Flux with the prompt "Hatsune Miku", the thing with Flux is that, once you saw this gen of her, you've seen them all, you'll get a variation of this pic no matter what seed.
Kolors is on the left and seems like a better, more creative successor of Stable Diffusion legacy, and I don't even know if Flux could recreate that pic, but if it can I'm sure it'd require a prompt that is several paragraph long and a bunch of connected nodes to make the prompt interesting, and maybe throw a Lora or 2 in them...
Flux was a mistake.
>>
File: 2024-08-31_00213_.png (1.47 MB, 1280x720)
1.47 MB
1.47 MB PNG
I'm sorry.
>>
File: FD_00142_.png (1.05 MB, 1024x1024)
1.05 MB
1.05 MB PNG
>>102168315
>is this a lora?
Yes, it's mine.
https://civitai.com/models/702801
>>
why don't people realize that you can just use multiple models? they all each have their strengths and weaknesses. the fuck's up with this "this model won dumbass" mentality? it's all opensource there's no "winner" here, more models the better, use what you want
>>
>>102168364
>Flux was a mistake.
the fuck? Flux deserves to exist as much as Kolors or Auraflow, if some people prefer one or not the other, what's the issue there? Go for your favorite model, that's the beauty of opensource, they're not gonna take those models back
>>
File: FD_00238_.png (1.69 MB, 1024x1024)
1.69 MB
1.69 MB PNG
>>102168364
So Kolors makes some random shit you didn't ask for and Flux sticks to the prompt it was given. That's what I'm seeing there. Flux is 100% capable of producing the left image.
>>
>>102168383
the goal is to have the one god level model that doesn't need loras for nudity/porn and wasn't finetuned to the point of forgetting everything it learned in training
>>
>>102168411
>and wasn't finetuned to the point of forgetting everything it learned in training
instead of finetuning the model, what if we continue the pretraining instead?
>>
File: _1.png (19 KB, 768x512)
19 KB
19 KB PNG
>>
>>102168262
No, I guess you're so used to sending extremely long prompts that hide Flux's lack of creativity that you've forgotten how good artistic models look like.
Flux is so bad at this that last time I complained the answer was 'just use an LLM to enhance your prompt'...
>>
>>102168424
you got the original or a diverse enough dataset?
>>
>>102168411
imagine how powerful we would be if we had a god level finetune of flux, auraflow, pixart, hunyuan and whatever else model, we would be unstoppable. sass shivers at this thought.
>>
>>102168383
>the fuck's up with this "this model won dumbass" mentality?
My mentality was "they didn't even test the superior artistic models and we have their ranking on the OP."
>>
>>102168427
no I agree with you, Flux lacks a lot of styles, but that can be fixed with a finetune or loras, Flux has the best qualities, great anatomy/realism, great prompt adherance, great text
>>
>>102168410
not capable of reproducing the sovl, where'd it go?
>>
File: _2.png (22 KB, 768x512)
22 KB
22 KB PNG
>>102168377
nta but very cool. i tipped you some buzz
>>
>>102168439
>you got the original or a diverse enough dataset?
we can't use Laion-5b? it came back yesterday
>>
>>102168476
alt text ain't enough, you'd have to caption the images properly
>>
>>102168457
That's not how this works though. Windows isn't the best operating system yet it has 75% of the market.
I am not saying AF or Kolors are better than flux, I am saying nobody gives a shit if they are or aren't because everyone has already mass adopted flux.
>>
File: 2024-08-31_00218_.png (1.43 MB, 1280x720)
1.43 MB
1.43 MB PNG
>>
>>102168424
>what if we trained the model instead of training the model
>>
>>102168490
the advantage of laion is that there's some "captions" in it, you could feed it into a model captioner (image + caption text) so that it can make more accurate LLM captions
>>
File: FD_00218_.png (1.17 MB, 1024x1024)
1.17 MB
1.17 MB PNG
>>102168473
No need to tip, just post some gens to the page. It's free and I get free buzz from that.
>>
>>102168500
finetuning and continuing the pretraining aren't the same process at all anon, and I hope you know that right?
>>
>>102168497
Now do it without the nonsensical blurring.
>>
File: KolorsVFluxSlop.png (383 KB, 708x291)
383 KB
383 KB PNG
>>102168410
Are you blind? You just proved my point, the face on your Flux output is horrendous.
And this is what people prefer because of better text and location of things in the picture.
>>
>>102168512
yeah but you need a proper model, not the 8B shit, and it is 5000000000 images
a fuck load of them are offline now tho
>>
>>102168410
miku please, fix your posture!
>>
>>102166301
test
>>
>>102168494
>I am not saying AF or Kolors are better than flux, I am saying nobody gives a shit if they are or aren't because everyone has already mass adopted flux.
not true, Auraflox and Kolors were made months before Flux even existed, and back then no one gave a fuck aswell, let's face it anon, if a model is really good, people would react to that, you think we decided to ignore them for whatever reason? no, we talked about it and we thought it wasn't the huge step up we were waiting for since the release of SDXL
>>
File: FD_00215_.png (2.33 MB, 1024x1024)
2.33 MB
2.33 MB PNG
>>102168473
>>102168520
It's appreciated though, thanks. Sorry not meaning to sound ungrateful.
>>
File: annaanyaface.jpg (1.25 MB, 2048x2048)
1.25 MB
1.25 MB JPG
Still my favorite LORA.
>>
>>102168537
I never said we should caption the 5b pictures, Flux has been trained enough, we just need the last boost
>>
>>102168523
It's the same process.
>>
File: FD_00217_.png (2.04 MB, 1024x1024)
2.04 MB
2.04 MB PNG
>>102168526
It followed my prompt you dumb idiot. I didn't go out to re-create the left image I just posted an old gen I made.
>>
File: 2024-08-31_00220_.png (1.39 MB, 1280x720)
1.39 MB
1.39 MB PNG
>>102168524
>>
>>102168448
A god level finetune would be able to give you any output from any other model, you'd say "by Dalle 3", or "by Dreamshaper 8" or "by RealVisXL" or even "by Flux" and it'd do its best to mimic the style of those models in the same way other models mimic artist styles.
I've seen the amazing things SD1.5 models have been able to do and I'm sure 12B parameters are enough for such a godlike finetune.
You'd truly have Dalle 3 at home because the model purposely mimics it at the prompt level in the same way photographic styles are imitated.
>>
File: ComfyUI_02696_.png (897 KB, 768x1024)
897 KB
897 KB PNG
If you prompt meme text, it will also replicate it accurately
>>
>>102168572
man, i like flux but the base model makes some of the most soulless outputs imaginable
>>
>>102168589
KEK
>>
>>102168476
Really? Did they manage to find the only CSAM picture in their entire dataset and removed it or what happened?
>>
File: FD_00308_.png (876 KB, 768x768)
876 KB
876 KB PNG
>>102168592
Is this better?
>>
>>102168589
this is 2 seconds in any image editor
this >>102168576 or >>102168497 you will spend quite some time with, Id rather gen that in context
>>
>>102168602
yes
https://laion.ai/blog/relaion-5b/
>>
>>102168592
>soulless outputs
Soulless models are the perfect vessel for injecting soul into.
>>
>>102168602
2236 links allegedly
>>
>>102168602
>the only CSAM picture
There were 1008
>>
>>102168603
*licks xer teeth* hey princess, forgot to brush your teeth? don't worry, i'll clean that up for you *kisses you*
>>
laion? yikes
>>
>>102168592
those are drawbacks that can be easily fixable with loras or a finetune
>>
File: ComfyUI_02620_.png (1.38 MB, 1024x1024)
1.38 MB
1.38 MB PNG
>>102168608
Yeah, but if I can type "Meme text on the bottom that reads "Pathetic"", it kind of saves me a whole step in the process
>>
File: FD_00289_.png (1.17 MB, 1024x1024)
1.17 MB
1.17 MB PNG
>>102168626
I hate everything about the things you choose to be.
>>
File: 00007-78889941.png (472 KB, 512x688)
472 KB
472 KB PNG
>>
>>102168524
I've been paying attention to the TV series I've been watching and noticed that, unless they're a sitcom where everything is zoomed out, scenes show this blurriness in the background.
So it's a real thing, and cameras are to blame, but I also noticed that the best SD1.5 models at realism also have bokeh by default.
What happened is that you got used to AI generations that have crisp foregrounds and backgrounds and thought that's the way it should be, but if you search for images of what you are generating, real images, chances are they'll be as blurry as what Flux generates.
>>
nick fe moment
>>
>>102168638
ya you are right.. just trying to prove a point to those "ill just edit that in an image editor" crowd
>>
>>102168549
Okay, sure, so Flux was this big step people were waiting for, I ask again, was it just "THIS MODEL MAKES PERFECT TEXT AND WHEN I ASK IT FOR A CAT ON THE RIGHT OF THE PIC IT PUTS IT THERE!!!"
?
>>
File: FD_00002_.png (2.46 MB, 1024x1536)
2.46 MB
2.46 MB PNG
>>102168657
I would like to see them edit this in post with a speed run timer. WR is the 35 seconds it took me to gen.
>>
>>102168572
Did you ask for mismatched eyes where one had random orange tint for some reason?
>>
>>102168679
Flux isn't just that, you're trolling right?
>>
>>102168691
now make every other letter a different color
>>
File: FD_00216_.png (2.06 MB, 1024x1024)
2.06 MB
2.06 MB PNG
>>102168693
It's just burn from the CFG experimentation I was doing at the time.
>>
>>102168609
>potential CSAM
>We have no idea if the links or pictures were fine or not, we removed them just in case.
>>
>>102168679
>was it just "THIS MODEL MAKES PERFECT TEXT AND WHEN I ASK IT FOR A CAT ON THE RIGHT OF THE PIC IT PUTS IT THERE!!!"
if this was enough to make a model good, then SD3 would be hyped, not clowned, because SD3 is also fine on text and can put a cat on the right pic aswell
>>
>>102168694
No, I want to know what is about it that is more than that, explain like I'm 5.
I'm not being obtuse, people literally haven't said what is it.
>>
>>102168723
Nope, I've seen it do it by default some time, I suspect the dataset was poisoned by AI generations with mismatches eyes and Flux learned that anime girls now and then have bad eyes like that.
>>
>>102168758
I don't get it anon, you have the right to prefer Auraflow than Flux, and the vast majority of people have the right to prefer Flux to Auraflow. There's no need to convince anyone, go for the local model you want, that's the beauty of open source, complete freedom
>>
>>102168771
I suspect you're retarded.
>>
>>102168777
>I don't get it anon
That I want to understand what this big jump was about, people keep claiming it's more than what I mentioned but aren't able to put it into words.
>>
File: ComfyUI_temp_ikkkz_00012_.png (3.8 MB, 1600x1600)
3.8 MB
3.8 MB PNG
Why is this thread so autistic all of a sudden?
>>
File: lemonanyaface.jpg (979 KB, 2048x2048)
979 KB
979 KB JPG
Can Flux do anime girls who are this smug?
>>
>>102168786
Can you define x?
>The reason Flux was adopted in mass by the community was because of the text it can generate, its prompt adherence and x.
>>
>>102168798
spillover from the containment thread
>>
>>102168758
nta but it's mainly the 16ch vae and it's vast knowledge of random shit like games and what not, impresses normies by being a good meme creator. it being a transformer model also makes it superior to the old unets. about it.
>>
File: 00021-3093869841.png (1005 KB, 688x1008)
1005 KB
1005 KB PNG
>>102168798
no glue, sorry meant to type clue
>>
>>102168812
>random shit like games
Like 2 games
>>
>>102168609
>After removing LAION-5B dataset from the public web in late December last year, 19.12.2023,
Wait, is this for real? I thought once something was on the web, it was impossible to take down, and yet nobody made a mirror of LAION-5B?
>>
File: file.png (1.04 MB, 1024x1024)
1.04 MB
1.04 MB PNG
>>102168797
You'll get different answers to that, it depends from people to people, but to me it's more like Flux doesn't really have any big weakness, it can do everything well, yes, even styles if you go for a Lora, I went for Monty Python lora for that picture and Flux just nailed this masterpiece first try. It was exactly what I had in mind, there's something special about that model, and a lot of people noticed it aswell, hence the hype, and I'm sad you can't understand it aswell but like I said you are entilted to your opinion, if you find Auraflow to be a better choice then go for it anon, you don't need our permission to enjoy that model aswell
>>
>>102168821
doesn't really matter, it was impressive out of the box and that was enough to grab the attentions of redditors. that was the 'big jump' aesthetic wise the base model is a huge step back for now atleast.
>>
File: 2024-09-01T000557.257.jpg (221 KB, 1024x1024)
221 KB
221 KB JPG
>>102168704
>>
>>102168831
there is plenty, and last i heard it was 6B? did they take others down too?
>>
>>102168838
no no, the same image as before but with every other letter with a different color
>>
>>102168838
>Britfag
>>
>>102168806
Why would I? I never said that.
The reason Flux was adopted en masse is because it's an extremely competent and capable model thats insanely easy to train.
It has nothing to do with text, text is just a neat feature.
>>
>>102168831
>yet nobody made a mirror of LAION-5B?
How many people do you know who did anything with the dataset?
>>
>>102168812
Thanks, it wasn't so hard to explain after all, huh?
So it turns out even with the absence of a lot of artistic styles and mediums and characters and even NSFW capabilities Flux is still far ahead of other open source models in what it can draw and deliver doing so.
>>
File: FD_00335_.png (1.38 MB, 1024x1024)
1.38 MB
1.38 MB PNG
>>102168845
No I've already entertained you. I'm going back to genning Liliths.
You're welcome to do it yourself. The prompt is >The text "Learn how to prompt dumb faggot"
>>
>>102168843
it was 6B, they removed 1B CP links
no, it was always 5B
>>
File: ComfyUI_temp_ikkkz_00027_.png (3.56 MB, 1600x1600)
3.56 MB
3.56 MB PNG
>>
>>102168862
based
>>
>>102168862
You run out of patience when then task gets impossible.
>>
Don't respond to d*bo
>>
>>102168862
wow shut up
>>
>>102168833
>I'm sad you can't understand it aswell
Thanks for helping me understand. It's possible it's a (me) problem because after being unable to get what I wanted with Stable Diffusion they became sour grapes and I just wanted to see what it did with prompts, and it's still more creative with them than Flux, when you have nothing specific in mind. I appreciate insightful comments like yours that help me understand how other people perceive all this.
I guess this could work:
>The reason Flux was adopted in mass by the community was because of the text it can generate, its prompt adherence and THAT IT CAN ACTUALLY DRAW WHAT YOU HAVE IN MIND, UNLIKE OTHER IMAGE MODELS.
>>
>>102168874
>noooo you have to give in to my every whim nooo
My response
https://files.catbox.moe/gstdcj.jpeg
>>
>>102168888
Absolute waste
>>
>>102168916
or you could understand the point of the request instead of throwing a baby tantrum
>>
>>102168860
Frankly, I think we'd be far ahead by now if LAION never existed, if Stable Diffusion was forced to use a better dataset from the get-go, SD1.5 would have been at the level of SDXL from the start.
>Clit Eastwood
>>
>>102168874
>Frankly, I think we'd be far ahead by now if LAION never existed, if Stable Diffusion was forced to use a better dataset from the get-go,
Such as?
>>
File: 00023-1805374539.png (1.17 MB, 688x1008)
1.17 MB
1.17 MB PNG
is this the level of thread quality we are trying to maintain
>>
>>102168946
wrong comment
>>102168948
thread quality would be 1000% higher if every retard didn't try to pretend to be smarter than me
not debo btw
>>
>>102168927
The point of the request is to get me to waste my time finding the prompt and seed that will do it, like I am somehow implying there's no limitations to Flux.
Meanwhile you haven't shown me your speed run of creating the image in photoshop.
In short, kys.
>>
File: ComfyUI_temp_ikkkz_00033_.png (3.22 MB, 1600x1600)
3.22 MB
3.22 MB PNG
>>
>>102168965
>The point of the request is to get me to waste my time finding the prompt and seed that will do it
this is a very autistic take, anon, hope you realize that
>>
>>102168964
>wrong comment
I accept your concession debo.
>>
>>102168939
Stable Diffusion wouldn't exist without LAION. The origin story is literally that Emad joined LAION's discord and offered compute to train a LDM on LAION-5b
>>
>>102168946
Dalle 2's dataset, I heard at some point it was for sale for $15 million, and Stability AI had raised $180 million at that point, but instead of spending the money on a critical part of the model, they decided to go for something that was free (maybe they were stupid and assumed LAION was good).
>>
>>102168993
>Dalle 2's dataset, I heard at some point it was for sale for $15 million
Can you buy a dataset containing a lot of copyrighted shit in there? Is that this simple?
>>
>>102168965
>Meanwhile you haven't shown me your speed run of creating the image in photoshop.
Assume I could create it in 15 minutes if you paid me to do it, now what?
>>
>>102169009
You can do anything you want as long as nobody stops you.
>>
File: ComfyUI_05813_.png (904 KB, 1024x1024)
904 KB
904 KB PNG
>>102169010
why would I pay you anything if Flux can do it in less than a mn for free, that's the point innit?
>>
>>102169029
you can't do it in less than a minute with the specifications given
>>
>>102168923
checkem
>>
File: 2024-08-31_00243_.jpg (817 KB, 2160x3840)
817 KB
817 KB JPG
>>
>>102168989
The alternative world theory is that Emad wanted to do that anyway, if LAION didn't exist he'd have gone to the discord of someone else and trained LDM with a different dataset.
In fact, Stable Diffusion became the thing that was accepted and the community nurtured it and made it the best thing around for more than a year, but the technology was already there, so the alternative world theory is that if it never existed we'd have gotten a different model to nurture and we'd have been better off eventually, but SD was good enough so that's the basket where all the people put their eggs, like Flux now.
>>
File: ComfyUI_temp_ikkkz_00042_.jpg (1.8 MB, 1600x1600)
1.8 MB
1.8 MB JPG
>>
>>102169053
>Debo's asshole
>>
File: annaanyaface.jpg (1.11 MB, 2048x2048)
1.11 MB
1.11 MB JPG
>>102168559
>tfw you don't realize a gen is flawed until after you post it
Meh. Was using a slop character LORA that couldn't gen the collar correctly.
I am a serious man who is serious about his smugs being on-model and canon.
>>
File: 2024-08-31_00246_.png (1.27 MB, 720x1280)
1.27 MB
1.27 MB PNG
>>
>>102169059
god level finetune of any model you desire is in there, would you go inside?
>>
How is everyone's loss during LoRA training? Not the number, but the drop over time? I find mine goes down, but so slowly most the change gets cut off by the rounding in the command window.
>>
>>102169009
The details were never disclosed, perhaps building and tagging the dataset cost them $15 million, the point is Stability AI had the money so they could have used it for that instead of using LAION.
>>
>>102169082
I feel like no, because no matter how good the fine tune is, in a year or so at least, the base model will be rendered obsolete by something newer. It would have to be the perfect model forever that only I had access to and could run on my hardware for me to consider braving whatever is in there.
>>
File: 2024-08-31_00249_.jpg (1.23 MB, 2160x3840)
1.23 MB
1.23 MB JPG
>>
>>102169136
>It would have to be the perfect model forever that only I had access to and could run on my hardware for me to consider braving whatever is in there.
you can go for quants though, some of them retains the quality of the original Flux
>>
File: file.png (1.59 MB, 1024x1024)
1.59 MB
1.59 MB PNG
>>102169082
If I have a finetune of Flux that knows every single celebrities/anime characters, you bet I would jump into it!
>>
File: w20.jpg (175 KB, 1024x1024)
175 KB
175 KB JPG
>>
File: FD_00007_.png (1.78 MB, 1024x1024)
1.78 MB
1.78 MB PNG
>>102169159
It will never know Blitzball and Messi
>>
>>102169169
That's a deep cut inside joke right there.
>>
>>102169169
then fuck it I'm not going in there :(
>>
File: gremlinanyaface.jpg (1.13 MB, 2048x2048)
1.13 MB
1.13 MB JPG
>>
>>102169136
okay what about this then, imagine that there's a magic stone that will grant you the godtune of any model you desire from the past, present and future. only catch? the magic stone is actually one of debo's kidney stones and you'll have to suck it out, would you do it?
>>
>>102169136
>in a year or so at least
Nobody knows what's coming, what if someone comes with an algorithm that with 1GB RAM on CPU can create any image you want in 2 seconds? What if some Chatbot gets so smart that it could tell you how to code such a thing with ease? Something that makes today's Flux looks like Craiyon in comparison? Something that makes Lora training look like neanderthals painting in a cave? Midjourney goes bankrupt because you can do much better things than it without prompt enhancement?
Someone needs to make a big list of things leading to THE DREAM and put a YOU ARE HERE in the road to it.
>>
File: 2024-08-31_00252_.jpg (1.37 MB, 3840x2160)
1.37 MB
1.37 MB JPG
>>
File: FD_00012_.png (1.18 MB, 1024x1024)
1.18 MB
1.18 MB PNG
I fucking love how it made Lilith into handsome Squidward for no reason lmao
>>
>>102169176
Well, one really needs to consider what that implies. The perfect image model FROM THE FUTURE.
You can prompt 47th president of the united states and it will show you. You could probably ask for even more granular information that that if it is THE BEST MODEL EVER. You can bet on that. Companies would line up in the billions for the weights. At that point, it's no longer a cooming tool, it's basically a future predicting weapon.
>>
>>102169197
>Something that makes today's Flux looks like Craiyon in comparison?
But Craiyon has 16 times the detail of Flux.
>>
>>102169169
Imagine a finetune of flux that allows you to depict a face and it draws it in the same way victims describe a face and artists are able to draw the perpetrator?
Then you'd just describe Messi's face and get it, if only there was a way to describe Blitzball that way...
>>
am I the only one who doesn't want a model that can do absolutely everything because their true joy lies in training loras?
I just wish lora tech would improve so they'd not rape the model so much. If I could get perfect fingers in my loras as often as I get perfect fingers with base flux I'd be happy, even
>>
>>102169227
16 times the detail thanks to its 16 channel VAE kek
https://www.youtube.com/watch?v=r3rXKCT_STM
>>
>>102169218
yes but you'll have to swivel debo's kidney stone around in your mouth like one of your baby teeth just came out
>>
>>102169239
>If I could get perfect fingers in my loras as often as I get perfect fingers with base flux I'd be happy, even
that's why we want finetunes, because it injects the concepts onto the model without raping the weights and destroying the anatomy capabilities
>>
>>102169253
but that defeats the part where my happiness comes from training loras...
>>
File: FD_00016_.png (1.15 MB, 1344x768)
1.15 MB
1.15 MB PNG
>>102169253
No that's cheating, the base model needs to understand every concept known and unknown to man.
>>
hi anons, what's the 8gb options for flux rn?
i see something about split fp8 and also a fp4?
i'd rather stay in comfy
>>
>>102169227
It's ironic because I was the anon posting pictures to prove it, and apparently I became a meme, but it turned out Flux can reach that level of detail if you make it gen at 256x256 as well.
>>
>>102169239
>am I the only one who doesn't want a model that can do absolutely everything because their true joy lies in training loras?
I find no joy on loras, you have to find the good one (if it exists), download it, everytime you use one the model unload/reload, it's slower, eats more vram, rape the weights so you lose prompt adherance or some anatomy, if you stack multiple loras the chance of everything working out is slim... I can go on an on, I fucking hate loras
>>
>>102169114
yeah same, and usually in the 0.2 or 0.3 range
>>
>>102169269
use gguf, look up city96 in archives or scroll up thread for the git repo
>>
>>102169270
>I was the anon posting pictures to prove it
you spammed like 10 of them, 1 or 2 would have been enough
>>
>>102169265
>the base model needs to understand every https://www.youtube.com/watch?v=vAUOPHqx5Gs
>>
>>102169270
you're still so very confused
>>
>>102169277
that's using loras, not training them.. training them is fulfilling and addicting, I don't even gen anymore outside of testing my creations
>>
File: 1707224138434065.png (2.33 MB, 1024x1024)
2.33 MB
2.33 MB PNG
>>
>>102169307
you've fallen into a deep rabbit hole anon kek, there's no point working hard on something that is just bad at use, maybe you should go for finetuning the model instead, you'll get the thrill of training and it'll be really useful
>>
>>102169249
>Suck out a kidney stone and swivel it around for a bit in exchange for the power to predict the future in the form a set of weights worth probably more than anything on earth?

What would you do?
>>
>>102169307
Yeah training loras is most fun. Just takes so much time
>>
>>102169277
So, how was the model trained and why can't people continue the training and pretend they're finetuning it as if the new images were part of the original dataset?
You know, if instead of dumbly training loras and such for SD1.5 people sit to figure it out we'd have a way to finetune models and then we could have a master model and people finetuning it cumulatively, as if all existing Loras could be put into a model because they could never hurt it.
>>
File: w31.jpg (263 KB, 1792x1024)
263 KB
263 KB JPG
>>
>>102169288
People weren't getting what I was saying, finally found one that showed what I meant, people got it, and that's why I dropped the subject.
But it keeps coming now and then with people making fun of it.
>>
>check bounties on civit
>95% porn requests
This is it, isn't it. The only reason diffusion is popular is because it's build off the backs of degenerates.
>>
>>102167833
underrated
>>
Does this look over trained or under trained?
>>
>>102169321
>maybe you should go for finetuning the model instead,
sadly I'm not smart, patient or resourceful enough for such a thing. maybe if I won a lottery and could throw endless compute and money at it
>>
>>102169296
Gen a 1024x1024 picture, now a 256x256 one with the same prompt, downscale the 1024x1024 to 256x256 and put them side by side.
Note how the one made natively at 256x256 has much more detail.
>>
>>102169377
I don't think it's that expensive to rent a gpu to do some finetuning innit?
>>
>>102169285
thanks. is
>flux1-dev-Q4_K_S.gguf
the one to go for? seems to rank highest in the divergence benchmark while being under 8gigs
>>
File: 00028-2369843824.png (669 KB, 1152x896)
669 KB
669 KB PNG
remember the beaver-wooden chair leg challenge
>>
>>102169376
over but I never trust the preview images. I don't even see a point of using preview images while training because they're so inaccurate to the actual thing. unless you're genning with the latest epoch to test
>>
File: FD_00031_.png (981 KB, 1024x1024)
981 KB
981 KB PNG
>>102169394
Just try multiple until you find the right fit for you. Flux spills into RAM and it's normally fine
>>
>>102169334
Why don't people ever do these ones?:
>You get to have sex with any person in the world in any way you want PLUS you get the power to predict the future in the form a set of weights worth probably more than anything on earth
It's the mindset that a great sacrifice must be made to grant a big wish, but magic is all about getting what you want for free.
>/x/ crosspost
>>
>>102169417
will do tyvm
>>
>>102169407
That's what I was thinking. A lot of the previews of training I've done look just right and when I load them up in comfy they're like half the strength I would have expected. Something is screwy on one end.
>>
File: ifx324.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>
>>102169394
I use the 4-bit Q4_0 at 12gb (there's ggufs of the t5 as well that might help), not at PC right now to double check how much VRAM it takes but you can probably run it with 8
basically try to use the biggest gguf you can get away with since it doesn't change gen speed at all, just helps you not oom
>>
>>102169365
95% of people are degenerates and would rather die than having their fetish revealed to the public.
That's what happens when the only way to procreate is to have sex. Most people that aren't degenerates can't pass their genes.
>>
>>102169423
Because the point of these hypotheticals is to have a counter to the good thing. Otherwise you will always say yes. You need to make a hard decision otherwise it's just
>would you rather have a billion dollars or become a quadriplegic?
>>
>>102169454
Wait... is that?
It can't be.
>>
>>102169470
maybe i need to fix the nets xd
>>
>>102169454
far from a real blitzball game but nice try anon kek
https://www.youtube.com/watch?v=BURjzwpnlTA
>>
>>102169461
Sure but for everything other than pizza the fetish is freely available already. Why do we need to gen images of it?
Maybe when we get good local video diffusion I would be able to understand, since you can easily just insert whatever subject you want to see shitting on you or whatever, but images? So vanilla, so boring.
>>
File: Blitzball.png (1.18 MB, 1024x1024)
1.18 MB
1.18 MB PNG
>>
File: 1720763012202082.png (1.86 MB, 1280x1280)
1.86 MB
1.86 MB PNG
Good morning, remember to be nice to your cat girl maid.
>>
>be me
>havent updated comfy forever
>"oh, an update all button!"
>press it
>various nodes do not work anymore
>two pages of broken nodes
>logfile looks like random LLM garbage complaining about 100 different things at once
>try to fix it
>"just do this if you use windows and portable version: <20+ steps>"
>cant find anything for linux
>sad
>...
>revert to backup of the machine i made before
>everything works
feels good man
is that a normal thing that everything breaks when you update? didnt really feel the need to update that often
>>
>>102169451
personally, my previews are complete messes and then the resulting lora works mostly fine?? i just dont bother even turning them on at this point
>>
here we go again
>>
>>102169467
>Otherwise you will always say yes
But here you always get a no because nobody would do the bad thing for anything offered.
>>
>>102169451
it's actually wild how insanely different they can be. also, I'm personally finding flux gives better anatomy and saner outputs with lower image count datasets, while still adhering to the trained style. testing my own loras, my 150~ image dataset bakes are outperforming my 500~ image dataset bakes. though that may be that there were less "quality" images in the 500 dataset so it muddied the waters, I can't say for certain. either way the point I'm trying to make is that if you decide to retry your lora, give a smol dataset bake a shot too. forcing a form of caption drop out by using --wildcard arg while leaving in an empty paragraph space in my captions helped overall lora sanity, too, more of a difference than dropping LR even, if you want to try that in the future
>>
File: trogon.png (1.59 MB, 1024x1024)
1.59 MB
1.59 MB PNG
>>
The next bread...
>>102169548
>>102169548
>>102169548
>>
File: 000000_17103_.png (1.95 MB, 952x1587)
1.95 MB
1.95 MB PNG
>>
>>102169540
>uses linux
>can't fix his own shit
>>
>>102169540
I mean, if you havent updated in a while and the custom nodes youre using havent been updated in a while, yeah probably new comfy updates probably going to break somethings.
>>
File: ifx325.png (1.29 MB, 1024x1024)
1.29 MB
1.29 MB PNG
>>102169487
hence my comment
>>
>>102169540
>is that a normal thing that everything breaks when you update?
Yes, comfy is a talentless hack and frequently shits out updates that break everything with minimal improvement, memory leaks or flat out regression. If literally anyone, and I mean anyone else, made a node based system he'd be blown out of the water. no one does though because a node based system is retarded, we're just stuck with comfy since the alternatives have no real customization



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.