[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (1 MB, 3264x3264)
1 MB
1 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>101972156

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/g/sdg
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
Blessed thread of frenship
>>
>>101975777
jesus christ why is ai still so fucking bad... it's been 3 years and shit still looks like glossy slop
>>
>>101975777
>all of the retards training Lora’s and fine tunes on stable diffusion outputs
Literally why, you’re ruining the flux model with shitty deformed airbrushed AI slop
>>
>>101975810
>Zooey
Yes.
>>
>>101975824
this, 100% this, I hate that AI imbreeding is being normalised this bad
>>
File: file.png (587 KB, 1263x682)
587 KB
587 KB PNG
>>101975808
>>
File: ComfyUI_32165_.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>
File: ComfyUI_temp_vdqxh_00010_.png (1.33 MB, 1024x1024)
1.33 MB
1.33 MB PNG
>>
File: image.jpg (95 KB, 1536x1280)
95 KB
95 KB JPG
i made it into the collage with that image? ok guess its good enough
>>
What's the criteria for the making the collage
>>
>>101975824
anon you can't ruin this ugly piece of shit any further. this whole place is just retarded coomers and tinker trannies
>>
>>101975847
Why it always adds necklaces or chokers if you prompt for "goth"
>>
File: ComfyUI_32168_.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>
>>101975824
How else am I supposed to train my OC that I made with AI?
>>
>>101975852
It is good enough for /ldg/, lol.
>>101975868
Fried te.
>>
File: ComfyUI_32174_.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
>>
>>
File: ComfyUI_32172_.png (1.56 MB, 1024x1024)
1.56 MB
1.56 MB PNG
>>
>>101975888
>896x512
get a job
>>
File: ComfyUI_02601_.png (1.74 MB, 832x1216)
1.74 MB
1.74 MB PNG
>>101975862
just gen an image the collage maker wants to put in there

simple as
>>
>>101975888
Revert the size
>>
>>101975897
hahah I'm using fastflux.ai I can't use my computer right now
>>
>>101975875
these are flux pixel art? neat, i didn't realise. and i mean that in a positive way, i thought this was pony+lora, i'm guessing the occasional vectory outputs like >>101975890 are the model making mistakes when asked for pixel art?
>>
>>101975922
You can't "revert the size" (swap dimensions) with fastflux. It's just a tech demo.
She's wearing a thick pair of grandma long stockings anyway.
>>
File: ComfyUI_01119_.png (1.62 MB, 1152x1728)
1.62 MB
1.62 MB PNG
ah man, I'm too much of a promptlet to make her part her thighs
>>
>>101975955
What's with this severe artifacting on her hair? The whole gen looks mangled...
>>
>>101975955
spread legs
legs apart
>>
>>101975955
>she is spreading her legs
not rocket science
>>
>>101975816
This model has a default look that is very glossy in previous models you had to ask for it. They probably don't want it to be photorealistic out of the box. At least that's my hope. Otherwise it would mean that they have terrible taste.
>>
>>101975998
You think grok devs asked them to not make it photorealistic?
>>
File: 00008-1396033759.png (3.68 MB, 1280x1920)
3.68 MB
3.68 MB PNG
>>
File: ComfyUI_32203_.png (1.43 MB, 1024x1024)
1.43 MB
1.43 MB PNG
>>101975940
Correct.
>>
>>101975969
that's the Flux grid seeping through
>>
File: ComfyUI_00798_.png (1 MB, 1024x1024)
1 MB
1 MB PNG
>>
>>101976023
This looks real
>>
File: file.png (350 KB, 650x663)
350 KB
350 KB PNG
The keywords for different ages seem to be a combination of young/(nothing)/old and girl/female/woman.
young girl: child
young female: teenager, early 20s
girl/young woman: 30s
old girl: 60s
old woman/old female: 60s+

The main take away right now is that "female" makes women more feminine, and never results in children. Also, there seems to be no reliable way to generate hebe and teenagers. There's an age gap from 10 to 20 year olds.
>>101976074
My new card <3
>>
File: 00010-748567061.png (3.01 MB, 1280x1920)
3.01 MB
3.01 MB PNG
>>
>>101976087
>>101976023
Use same seed until you get one with the hands fixed
>>
>>101975907
Norf FC spanish holiday
>>
File: ComfyUI_32206_.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
>>
>>101975868
>necklaces or chokers if you prompt for "goth"
training data, but also the best part of goth girls
(tattoos are the worst part)

>>101975888
i've never been able to make a skirt lift with panties showing

>>101975955
ime prompting for legs spread doesn't get you super good results unless you full on make them do the splits and even then flux isn't too good at it

>>101976086
>there seems to be no reliable way to generate hebe and teenagers. There's an age gap from 10 to 20 year olds.
i've had success (like 30% consistency) with "young teen girl" or "tween girl", still often goes below 10 though but 11-14 is twice as easy to get as 14-17 I've found
>>
File: FluxDev_01970_.jpg (142 KB, 1024x1024)
142 KB
142 KB JPG
>>
File: image.jpg (96 KB, 1536x1280)
96 KB
96 KB JPG
whats worse is sometimes flux combines old and young features so you get uncanny anachronistic faces
>>
when will Comfy stop unloading the model with every lora change?
>>
File: ComfyUI_32205_.png (1.12 MB, 1024x1024)
1.12 MB
1.12 MB PNG
>>
File: ComfyUI_32207_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>
File: ComfyUI_32211_.png (1.15 MB, 1024x1024)
1.15 MB
1.15 MB PNG
>>
>>101976128
>tween
I'm not English native, and "tween" for me is something you did in Macromedia Flash. I didn't know it meant "preteen".
>14-17
That's were the weird gap seems to be. There has to be a keyword for it, but I don't want to research it non-locally because I'll end up on a list. My interest is purely curiosity, mind you.
>>101976170
They look very... Olympic? Like straight out of Asgard or something.
>>
File: ComfyUI_32212_.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
>>
File: 1720319915188896.jpg (391 KB, 1216x832)
391 KB
391 KB JPG
>>101976086
Guess which one is this
>>
File: ComfyUI_01122_.png (1.04 MB, 832x1216)
1.04 MB
1.04 MB PNG
>>101975969
no idea. maybe because of the higher res
>>101975978
>>101975997
no give. the finetunes can't come fast enough
>>
File: 00014-3996208760_cleanup.png (3.11 MB, 1280x1920)
3.11 MB
3.11 MB PNG
>>
File: ComfyUI_32213_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
>>
>>101976182
I don't know and that's annoying, it shouldn't work that way, you could simply keep in memory the original weights being modified by the lora and then bring them back for every change instead of destroying the whole model and recreating a new one
>>
>>101976245
loras affect the whole model
>>
>>101976253
no, it only affects a tony % of the weights, or else a lora would be the same size of the model, and you can see it's not the case, it's only 20-30 mb wheras the whole model is 23gb
>>
>>101976245
just reverse the merge using the lora weights
>>
>>101976222
Young girl? But that hair and other details means there's more elements to that prompt, so I don't know.
The androgyny of children creeps me the fuck out. Creepy fuckers.
>>
>>101976264
I beg you go read how lora works before you make more a fool of yourself.
>>
>>101976275
that's funny because that's exactly the same reaction I got by reading your retarded take
>>
File: overridemodeldevice.png (213 KB, 1901x864)
213 KB
213 KB PNG
trying to load this WF from here, manager can't find the "overridemodeldevice" node. can someone link me it?
>>
File: ComfyUI_32215_.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>
>>101976295
anon please, pull up the paper or an implementation and check your assumptions
>>
File: image.jpg (107 KB, 1536x1280)
107 KB
107 KB JPG
>>101976205
>There has to be a keyword for it,
the keyword for it is "teenager" or "teen", which should by definition get you a 13-19 year old. there is literally no other term other than "jailbait" to describe that age range and i seriously doubt the caption model used for flux used that word

>>101976205
>They look very... Olympic? Like straight out of Asgard or something.
it's probably the fact that the prompt is prompting for "armor" instead of making them have cybernetic bodies, since that's how joycaption captioned my reference image
the goal is to make a bunch of cyberpunk doll-like girls for a music video

>>101976222
my bet is on "young female" because of the hair
>>
>>101976298
I'm guessing its this https://www.reddit.com/r/StableDiffusion/comments/1el79h3/flux_can_be_run_on_a_multigpu_configuration/
>>
>>101976313
now you have to explain to me how a lora can have all the weights while being 1000x lighter than the real model
>>
File: 00017-2454746390_cleanup.png (3.32 MB, 1280x1920)
3.32 MB
3.32 MB PNG
>>
>>101976264
>or else a lora would be the same size of the model,
I don't know myself whether it's technically viable to swap lora weights around without reloading the entire model, but this claim is not backed up by anything so I'm inclined to believe >>101976275 is likely correct.

Out of interest, which UI currently allows you to swap lora weights without a model reload?
>>
>>101976330
that's what the rank size controls
please, I bed you, go read an implementation
>>
>>101976356
*I beg you
but also I'll bed you if you don't do what I'm asking you to do
>>
File: ComfyUI_32217_.png (1.21 MB, 1024x1024)
1.21 MB
1.21 MB PNG
>>
>>101976356
I'll ask the question again, like a LLM maybe I'll get a good seed and a good answer, how many % of the model's weights a Lora is changing
>>
>>101976341
>Out of interest, which UI currently allows you to swap lora weights without a model reload?
I think none? Forge does the same shit
>>
>>101976379
all of them, please, anon, google it, read the paper, read an implementation
>>
>>101976379
Why don't you go read how it's implemented instead?
>>
>>101976318
>the keyword for it is "teenager"
In my experience, if you prompt "teenager", you get a 20-something year old, which is how teenagers are portrayed in TV and film, incidentally. There seems to be no way to prompt for actual awkward teenagers. They all look like 20+ Ukranian supermodels.
>>
given how good flux is, give me one good reason why I shouldn't delete all my sd1.5, sdxl and, given that I don't do anime, my realistic pony models and loras?
>>
File: 2024-08-19_00209_.jpg (791 KB, 2048x2560)
791 KB
791 KB JPG
>>
>>101976397
>>101976412
>all of them,
And... you lost
https://gitlostmurali.com/machine-learning/data-science/lora-qlora
>his is the benefit of using LoRA. Just store the matrices into your disk, which would be maybe 1% of the original model weights
>>
>>101976414
Unless the answer is in using the CLIP text encoder and use good old SD1 tags. I have not tried that.
>>
>>101976272
>>101976318
It's 'girl', but the entire prompt seems take into account as well. Like if the environment is a nightclub even with 'girls' it will gen teen+
>>
>>101976397
>>101976412
he is almost certainly too stupid to understand and was either trolling and wasting time from the start, or has decided to start wasting your time now after feeling embarrassed about having posted something wrong online. please don't bother replying to him again <3
>>
>>101976469
and, you also lost >>101976429
>>
>>101976379
Don't bother. It doesn't matter that you're correct, others will believe whoever shouts loudest. He can't explain how https://huggingface.co/XLabs-AI/flux-RealismLora/tree/main?show_file_info=lora.safetensors which only contains lora weights for the double blocks, and only certain parts of the double blocks, affects the whole model. He's stuck on the rank aspect which does reduce the size compared to the full weights. Explain all you want, ask him to elaborate, it won't work.
>>
Is everything alright in here?
>>
>>101976417
Flux is niggerlicious
>>
>>101976505
butthurt sai shills like usual
>>
>>101976487
I mean, you must be a special kind of retard to believe a Lora has the 12 billions weights inside of it while only being a 30 Mo file, but yeah, I'm done educating those retards, it's ok to be a low IQ, but not being a bad faith low IQ
>>
>>101975843
Cute and accurate rendering of a blessed thread of frenship
>>
I will come back when AI can do this
>>
>>101976579
I guess that's a real picture right?
>>
>>101976487
he is not correct as shown by the sentence he cherry picked from the article here>>101976429, he doesn't understand how it works otherwise he wouldn't have quoted that
you want to get technical, sure, loras don't touch all the weights but the loras all touch the same weights which is what matters to the discussion
the loras he sees on civitai all touch the same weights
>>101976553
see, he doesn't fucking get it
anon, I beg you again, go read up on how lora works
you can make it kilobytes large and still touch all parameters
>>
>>101976462
I see. Makes sense, since this model does not work with individual tags. It's more abstract.
>>101976222
What is the actual prompt here, if I may ask?
>>101976579
You can make a LoRA for something like that, with amateur pictures. SD1 can do it.
>>
>>101976379
>I'll ask the question again, like a LLM maybe I'll get a good seed and a good answer, how many % of the model's weights a Lora is changing
>>101976397
>all of them
>>101976592
>sure, loras don't touch all the weights
oops...
>>
>>101976233
Damn, what are you using for this? dev 50 steps?
>>
>>101976649
That looks like good old SD 1.5
>>
File: Capture.jpg (113 KB, 1340x1448)
113 KB
113 KB JPG
>>101976609
https://www.entrypointai.com/blog/lora-fine-tuning/
lawl, unless you go for a 8,192+ Rank Lora (who does that?), you can see it in the 1% of total weights max
>>
>>101976649
kek, flux newfags
>>
>>101976592
What matters to the discussion is whether the entire model needs to be destroyed and reloaded. It does not because loras do not affect the entire model, loras are in fact configurable to only affect certain types of module and only certain layers.
>>
File: 124935872352.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
Not that I'm upset with this view, but how do I get a from_below view without being able to booru tag prompt? have tried various permutations of "looking down at the viewer" and "worm's eye view" and "view from below"/"view from the ground". 1girl in her room shots are heavily biased towards shots from above, I guess, and I just need to have her in a throne or something.
>>
File: 00018-3991584975.png (2.81 MB, 1280x1920)
2.81 MB
2.81 MB PNG
>>101976649
SD1.5
>>
>>101976737
low perspective
>>
File: 2024-08-19_00218_.jpg (775 KB, 2048x2560)
775 KB
775 KB JPG
>>
>>101976736
And when you look at this board you can see it's not even affecting 1% of the weights when you go for a Lora Rank 512, so yeah, there's absolutely no point of destroying the model just because 1% of it got changed, the 1% that will be modified should stay in memory and be used to return the model back to its original form, that would be a way less retarded thing to do rather than doing this loading/unloading monkey retarded shit
>>
>>101976684
>>101976709
>>101976744
So why does flux not look as good then?
>>
>>101976779
SD1.5 is king when it comes to normal, ordinary looking pics
Flux is impressive, but still in its early days imo
>>
>>101976816
SDXL is also very flexible. I think the secret sauce with SD1 was the wildcard dataset. We will never see a model like that again.
>>
>>101976779
loras, hires fix, cherrypicking, outright model finetunes toward specific goals, and base models being trained with somewhat "safe" datasets that make them worse at usually those same specific goals. the fair comparison is base sd1.5 -> sdxl -> flux, not some realism finetune that only does asian girls in egirl clothing taking selfies where the same splayed-fingers-around-a-phone block of picture is in nearly every gen. Not to disparage that anon's image at all, it's just that he's probably making it with a set of tools built towards creating exactly that, and from the resolution clearly has a big workflow set up around it boosting it up in size (I'd have assumed it was sdxl based to be honest)
>>
>>101976737
Damn
>>101976751
I've been wanting to know too
>>
>>101976816
>>101976832
both SD15 and SDXL are RNG shitshows cause CLIP doesn't understand shit you tell it, ofc you can enforce what you want with a lore or finetune, but you don't have real creative freedom to prompt what you actually want
>>
>>101976244
That's a REALLY consistent pixel art image. Is that from Flux?
>>
>>101976862
>I'd have assumed it was sdxl based to be honest
We SD1 vramlets know it well. All the people SD can do are the same person with minor plastic surgery.
>>101976897
That's not true. It just works differently. If you have ever used NovelAI's v3 model you know how powerful a well curated tag dataset can be. In many ways I prefer it to Flux's natural language because it's more flexible and predictible.
>>
File: 2024-08-19_00221_.jpg (826 KB, 2048x2560)
826 KB
826 KB JPG
>>
File: 00309-4113405124.png (2.74 MB, 1280x1920)
2.74 MB
2.74 MB PNG
>>101976832
a big part of it is how loras can affect lighting. It is pretty much impossible to do stuff like this in XL or Flux. You can get dramatic lighting but it is way more limited compared to the contrast/lighting control loras in 1.5
>>
can someone verify that loras work with Q4_0? I just want to test what I've trained
>>
File: ComfyUI_32169_.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
>>101976908
Yeah.
>>
>>101976609
anon you're still not getting it
>>101976708
>>101976773
that is not a % of weights touched, that is just a size comparison
let me explain this to you
you have 5 parameters
[0, 1, 2, 3, 4]
you want to modify them by some delta
you could do 5 parameters
[-1, 2, -3, 1, 5] giving you [-1, 3, -1, 4, 9]
or you could do just one parameter applied to all of them
[3] giving you [3, 4, 5, 6, 7]
one has 5 parameters, that's 100% of the original weights
the other has 1 parameter, that's 20% of the original weights
this percentage is what you're seeing in that table
one is five times larger than the other, that's how one lora can be several gigabytes and another just 8 megabytes
but in this example BOTH ARE TOUCHING 100% OF THE PARAMETERS
>>
File: ComfyUI_32170_.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>
File: ComfyUI_32171_.png (1.22 MB, 1024x1024)
1.22 MB
1.22 MB PNG
>>
File: ComfyUI_32178_.png (1.53 MB, 1024x1024)
1.53 MB
1.53 MB PNG
>>
File: 00121-2538579639.jpg (175 KB, 1280x1920)
175 KB
175 KB JPG
>>101976779
flux is cool
>>
>>101976609
>>>101976397
>>all of them
>>>101976592
>>sure, loras don't touch all the weights
>>101976945
>100% OF THE PARAMETERS
>>
File: 1273685234.png (930 KB, 768x1280)
930 KB
930 KB PNG
>>101976751
thanks, that's working a little better. it avoids the trap of sounding in isolation like it could be referring to the camera or to the subject like "from below" and "looking down" could, while not being as weird/rare as worm's eye view. not quite working but is at least focusing the bottom quarter of the image on legs with some foreshortening effect which is obviously along the lines of what i was asking it for. unfortunately all the obese dominatrixes in the dataset are making her displeasingly chunky now in 75% of gens.
>>101976923
they do on comfy and i believe worked on forge before comfy too
>>101976914
NAIv3 and Pony both have very strong adherence to tags so you don't even need to do the magic NAI did, although it's nice to not do the extra stupid things affecting Pony.
>>
File: ComfyUI_00487_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
>>101976969
yes
>>
>>101976945
That's a nice hypothetical situation that doesn't apply to the real world. In reality you have something like
>double_blocks.0->19
>single_blocks.0->38
each containing
>some_other_bullshit
>more_bullshit
>another_load_of_shit
and the lora contains
>double_blocks.0->19
each containing
>more_bullshit
As you can see, the lora does not affect the entire model.
>>
File: maxresdefault.jpg (161 KB, 1280x720)
161 KB
161 KB JPG
>>101976945
>100% OF THE PARAMETERS
and 16 times the details
>>
>>101976996
>>101976773
>And when you look at this board you can see it's not even affecting 1% of the weights
I'm addressing this, anon, you misunderstood what that table was telling you
>>101977008
that's why I said "in this example", see above
>>
>can train loras on the full size model on 12GB vram with -fp8, split_mode
>can't load the full size model to gen with it on 12GB vram with fp8, putting vae on other gpu, model on main gpu, and clip on cpu, get immediate oom

Why?
>>
File: ComfyUI_00483_.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
>>
Damn, Flux Q8 is giving me better results than the unquantized version
>>
>>101976999
>they do on comfy and i believe worked on forge before comfy too
thank you, thought I saw someone say only q8 worked for loras for some reason. will DL baby quant for testing
>>
>>101977036
how much does your speed drop when you enable a lora?
>>
>>101977020
>you misunderstood
You're the one who were claiming claiming that loras affect all the weights when it's a complete lie, don't forget that, you're full of shit anon >>101976609
>>
File: ComfyUI_32179_.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
>>
>>101977025
>Why?
because the dev who coded the training shit is talented, Comfy isn't, simple as that
>>
File: ComfyUI_32180_.png (1.52 MB, 1024x1024)
1.52 MB
1.52 MB PNG
>>
File: ComfyUI_32181_.png (1.45 MB, 1024x1024)
1.45 MB
1.45 MB PNG
>>
>>101977020
Yes, we all know how matrix multiplication works, you've clearly grasped the concept of lora rank and how that makes the lora weights smaller. However, you are failing to understand that loras simply do not affect every module of every layer, that fact is why the entire model does not need to be destroyed and reloaded.
>>
File: ComfyUI_32182_.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
>>
>>101977056
and you misunderstood that table, as I showed you
you admit to misunderstanding that table and I'll admit to any mistakes I made, deal?
>>
File: ComfyUI_32184_.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
>>
>>101977098
can that retard not reference kohya's work or is he only capable of using other's code when its to spite illya?
>>
File: ComfyUI_32194_.png (1.58 MB, 1024x1024)
1.58 MB
1.58 MB PNG
>>
>>101976999
>they do on comfy and i believe worked on forge before comfy too
Why are you lying? Forge just copy comfyui gguf nodes, they haven't contributed to making gguf work with flux at all.
>>
File: ComfyUI_32197_.png (1.28 MB, 1024x1024)
1.28 MB
1.28 MB PNG
>>
>>101977136
>you admit to misunderstanding that table and I'll admit to any mistakes I made, deal?
I won't accept anything from you, you were grandstanding on your high horse on the lora subject by asking me to "read some documentations" as if you were some kind of expert when at the end you were saying some big stinky horseshit lies like "it affect all the weights", you're the one who needs to read the documentation anon, stop acting like a smartass when you can't even back it up with some true factual shit
>>
>>101977153
Comfy copied Forge's nf4 implementation
>>
>>101977055
Not much, from 1.5s/it to 1.7s/it
>>
Does anyone know what the best site/resource to find the best (preferably local) ai tools? There'sAnAIForThat doesn't seem to search well and GitHub is alright but sometimes repos don't label it as ai, there's choice paralysis and it's hard to tell what the best tool is compared to others.
>>
>>101977153
I'm not lying, I literally did believe that.
>>
>>101977205
>inability to admit fault is a nasty trait, anon.
and pretending to be an expert and grandstanding on others when at the end you were completely wrong is a much more nasty trait, anon
>>
>>101977131
>However, you are failing to understand that loras simply do not affect every module of every layer
again, that's why I said "in this example", in the example all parameters are modified to make it obvious
I'm addressing something the other anon said that was wrong and I know you know what I'm talking about
>>101977171
inability to admit fault is a nasty trait, anon. you were looking for something supporting your argument and you completely misread it to make it fit. If you don't believe me ask the anon above. That table does not show what percentage of the weights are modified by lora rank.
>>
>>101976923
They work, but not so well with schnell.
>>101976918
>pretty much impossible to do stuff like this in Flux
It's still early to tell isn't it?
>>
File: 00035-1569712495.jpg (211 KB, 1344x1600)
211 KB
211 KB JPG
>>101977004
bro, share your flux microbikini lora, the one on civitai sucks ass
>>
File: ComfyUI_24623225657_.png (1.38 MB, 1024x1024)
1.38 MB
1.38 MB PNG
I'm scare what have I done
>>
>>101977223
>inability to admit fault is a nasty trait, anon.
and pretending to be an expert while grandstanding on others like a fucking faggot smartass when at the end you were completely wrong is a much more nasty trait, anon
>>
>>101977240
then be the bigger man and admit you misunderstood that article you linked, or do you believe you are correct in your interpretation of that table?
cause you're not
>>
>>101977163
>i yearn for things that don't exist aaaah
>>
File: 00036-200704195.jpg (168 KB, 1344x1600)
168 KB
168 KB JPG
>>
>>101972120
>>101972103
Ok thx I'll try that when i get home
>>
>>101977233
You set guidance too low
>>
As a long time A1111 user, who switched to ComfyUI to use flux since it was the "official" way, I just installed forge. Holy fuck it's so much better if all you're doing is basic shit. Like I get that if you have a complicated workflow that needs to run fully automated end-to-end, comfy is your only choice and it's fine for that. But the spaghetti node bullshit is fucking retarded if you just need to do basic text2img and img2img things, which is all I do. Forge just werks, is exactly like a1111, seems to have everything I need built in, it's just perfect. Shame the dev is a cocky autistic sperglord (tbf comfy is kinda the same way).
>>
File: ComfyUI_32223_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
>>
>>101977276
I used a WF from here that had guidancepositive at 3.5 and negative at 7.0, what are the proper amounts?
>>
File: ComfyUI_32227_.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>
>>101977255
Did you admit your behavior was wrong by getting on your high horse and not actually backing up with some true statements? Doesn't seem like it
>>
File: file.png (710 KB, 719x717)
710 KB
710 KB PNG
>hello is this thing on?
>>
File: ComfyUI_32233_.png (1.18 MB, 1024x1024)
1.18 MB
1.18 MB PNG
>>
>>101976318
if you leave out age descriptors for the subject and only reference age-associated events, locations, or relationships, the model will usually intuit and fill in where there's either no token or a "negative" token. for instance, an iconic teen bedroom with people in it or referring to classmates might create groups of people of similar ages. watching a pg-13 movie with a childhood friend, etc.
I use this technique a lot to get around really strongly weighted tokens from training materials with a lack of variety. since you can't negate associated tokens in flux, you just have to avoid certain terms and hope the model can figure it out, which from what I've tested, it can in a lot of cases without LoRA. also, try using short sentences that chain or reinforce tangential aspects of the prompt instead of using weighting.
>>
>>101976414
why don't you guys use prompt editing to blend the words? it's always been the way to have control over this. [little girl|teen] or [little girl|little girl|teen] etc. should blend it into whatever age you're looking for. i don't know if this works in c*mfy though.
>>
File: file.png (534 KB, 724x715)
534 KB
534 KB PNG
Schizoprompting gives you neat things sometimes
>japanese 1990 kodak scan low definition grainy biomechanical dark lolita fashion
>>101977306
Ah, sorry, I don't do that, so I don't know now it affects the gen. The image you posted looks the way it comes out when you use a guidance of 1.0 with a regular workflow. That's why I said that.
>>
>>101977314
so you're not the bigger man then, you will not admit you were hastily googling for something supporting your point, you won't admit you latched on to a sentence that kinda looked like it proved you right but you actually did not know enough about the subject to realize it was talking about something else.
you won't admit that
>>
File: file.png (547 KB, 717x712)
547 KB
547 KB PNG
>>
>>101977369
So you don't want to admit your smartass attitude was wrong, I thought you were the bigger man than that
>>
>>101977227
in 1.5 you can train loras to affect contrast noise which allows for all those backlight/directional light control loras (that actually work). Based on my understanding it is impossible to do this in XL or Flux because you can't interact with that layer of the model.
>>
>>101977364
ah shit, I'll try the default comfy WF and see if that fixes it then. thanks for your help!
>>
>>101977369
>, you won't admit you latched on to a sentence that kinda looked like it proved you right but you actually did not know enough about the subject to realize it was talking about something else.
>>101977136
>I'll admit to any mistakes I made, deal?
Oh no, he forgot to take his schizo pills again
>>
>>101977304
India?
>>
what should i gen?
>>
>>101977383
nasty trait, anon
>>101977400
what's the issue there, anonie, you having problems understanding basic English?
he admits he rushed for a gotcha and misunderstood what the article was talking about and I'll admit to any mistakes I made
>>
>>101976940
cleans up really nicely with extensions too https://imgsli.com/Mjg5MTI3/0/1 mostly whatever you're prompting seems to generate pixel patch sizes of 4x4 whereas the sakuemonq pony lora I use deliberately generates 8x8, making it hard to compare directly, but it seems to do almost as good of a job of trying to avoid generating overly detailed sections.
>>
File: nerd.gif (501 KB, 220x181)
501 KB
501 KB GIF
>>101977426
>what's the issue there, anonie, you having problems understanding basic English?
>>
>>101977421
Two retards arguing
>>
>>101977448
not the nerd gif!
>>
>>101977421
pikacho being bbqed by mario
>>
File: image.jpg (102 KB, 1536x1280)
102 KB
102 KB JPG
>>101977354
this works well for the real world like bedrooms or school settings but its tougher in novel settings like the futuristic ones im going for

>also, try using short sentences
tough when im also trying to get a very particular look for the outfit
but thank you anon, your instructions would be a useful prompt to enhance a prompt with an LLM for flux
>>
File: two retards fighting.png (469 KB, 581x411)
469 KB
469 KB PNG
>>101977466
exactly picrel
>>
File: 00115-3896670292.png (2.81 MB, 1280x1920)
2.81 MB
2.81 MB PNG
>>
>>101977466
but make me the bigger retard
>>
>>101977481
It's fine, God already did.
>>
File: 75948.gif (1.37 MB, 320x240)
1.37 MB
1.37 MB GIF
>>101977485
Ohohohoh
>>
>>101977485
God isn't real.
>>
>>101977471
throw this image into joycaption desu, i bet flux could generate this really well including the youtube page
>>
File: 00112-420565960.png (2.9 MB, 1280x1920)
2.9 MB
2.9 MB PNG
>>
>>101977364
it's certainly something unique. weird reso tho
>>
>>101977510
Sure we can't physically see God, but we can physically see your fucking gunt (put it away)
>>
>>101977510
Then how did the universe got created? Can't make shit out of nothing right?
>>
>>101977510
says you
>>
File: file.png (1.5 MB, 832x1216)
1.5 MB
1.5 MB PNG
>>
>>101977510
god invented women, show some respect
>>
>>101977530
>that hand
Quality LoRAs
>>
>>101977531
without women we would still live in fucking Eden, FUCK
>>
>>101977562
damn
>>
File: 00002-2264910668.png (2.87 MB, 1280x1920)
2.87 MB
2.87 MB PNG
>>
>>101977469
What about the clip text encoder? That one works with tags, right? Doesn't that work with Flux?
>>
>>101977518
no you can't
>>101977520
okay then, god is that which was always existed and from what the universe was born... but you can't give it any more attributes now
>>101977521
yes
>>101977531
you're right, I do love my transsisters
>>
File: myFile_20_4.0_035.png (325 KB, 512x512)
325 KB
325 KB PNG
Apparently this is Two Retards Fighting according to cyberrealistic(1.5)
>>
File: myFile_20_4.0_039.png (689 KB, 768x768)
689 KB
689 KB PNG
The Gospel of Two Retards Fighting according to epicrealismXL_v8Kiss
>>
File: 2024-08-19_00239_.jpg (917 KB, 2048x2560)
917 KB
917 KB JPG
>>
>>101977590
conjoined at the arm, the ultitard
>>
File: 00022-763268806_cleanup.png (2.94 MB, 1280x1920)
2.94 MB
2.94 MB PNG
>>
File: file.png (1.56 MB, 832x1216)
1.56 MB
1.56 MB PNG
>>101977561
have one more
>>
>>101977678
the hand is rather distorted otherwise niec
>>
Is there some insane finetune being cooked on flux or something? Did someone announce everything of that level?
>>
>>101977700
basically the opposite
>>
File: file.png (440 KB, 512x512)
440 KB
440 KB PNG
>>101977680
Thanks
>>
>>101977700
the gays are making one that can generate perfect dicks and gay sex positions
>>
>>101977710
feelsbad to know only the horse fuckers and the faggots are more motivated than the regular coomer simps on model training :(
>>
>>101977721
the solution is to broaden your tastes, why only get hard for 50% of the human population? there is a whole world out there.
>>
File: file.png (436 KB, 512x512)
436 KB
436 KB PNG
>>
File: 00023-3002410243_cleanup.png (2.96 MB, 1280x1920)
2.96 MB
2.96 MB PNG
>>101977697
Ty
>>
>>101977721
it's a tough time to be a straight nigga
>>
>>101977739
box
>>
Im using Q8 but it's still taking >22GB of VRAM, why is that? I can't load the controlnet like this
>>
File: image.jpg (107 KB, 1536x1280)
107 KB
107 KB JPG
>>101977577
>What about the clip text encoder? That one works with tags, right? Doesn't that work with Flux?
these are great questions for someone who knows what they're doing. i just want consistent late-teen robot dollfaced cyberpunk girls

>>101977721
>horse fuckers and the faggots are more motivated than the regular coomer
this has always been true in AI and to a lesser extent in erotica etc, because if you have a fetish that you can't do in real life or find real content for (e.g. furry) you're much more motivated to create solutions
>>
>>101977732
can't change your sexuality anon, desu it would be fun to like men, I understand my type way more than fucking women that's for sure kek
https://www.youtube.com/watch?v=S2ktZHBjkqE
>>
>>101977749
>late-teen robot dollfaced cyberpunk girls
Looks like you got it.
>>
File: myFile_20_4.0_043.png (2.21 MB, 1536x1536)
2.21 MB
2.21 MB PNG
The Gospel of Two Retards Fighting according to realismengine_SDXLv30VAE (Bumped up base res and cfg scale 7)
POV: You and your bro get into a fistfight over Halo 2 LAN circa 2003
>>
>>101977766
These guys faces reminded me of that one video of the nurse jacking off that retard tied to a wheelchair
>>
>>101977746
I can't
>>
>>101974410
>>101977631
anyone got a catbox pls?
>>
>>101977783
>the nurse jacking off that retard tied to a wheelchair
kek
>>
>>101977783
lmao
>>
File: ComfyUI_32242_.png (1.75 MB, 1024x1024)
1.75 MB
1.75 MB PNG
>two retards fighting
>>
File: 2024-08-19_00248_.jpg (940 KB, 2048x2560)
940 KB
940 KB JPG
>>101977577
>What about the clip text encoder? That one works with tags, right? Doesn't that work with Flux?
not even Black Forest Anon that was here in the thread could answer me this question. In my observation its that it works with certain topics. Especially 2D content works okaish with tags in FLUX. Photorealism you want natural language prompts in t5. Also if you use text inputs for clip and t5 seperate, t5 mostly overides what you put into CLIP.
>>
File: ComfyUI_32244_.png (1.75 MB, 1024x1024)
1.75 MB
1.75 MB PNG
>>
>>101977828
desu the difference is so subtle I don't bother with it at all
>>
File: 00024-757673544_cleanup.png (3.02 MB, 1280x1920)
3.02 MB
3.02 MB PNG
>>
File: image.jpg (107 KB, 1536x1280)
107 KB
107 KB JPG
>>101977762
>Looks like you got it.
thanks but a lot of the gens are too stylized, not photoreal enough to be consistently used in a piece of art together
>>
>>101977739
How the fuck is this not real
>>
>>101977835
well if she wasn't retarded before he showed up she sure is now
>>
File: flux_tom_1.jpg (1.64 MB, 1920x1080)
1.64 MB
1.64 MB JPG
Looks like flux can learn weirder styles pretty well
>>
>>101977846
I like how these Flux gens are easy to inpaint.
>>
>>101977917
Beautiful
>>
>>101977917
woow, nice style
>>
>civitai's downloader is down
>file not found
>>
>>101977941
It's not Flux though
>>
>>101977973
so what is it
>>
>>101977917
lora?
>>
>>101977973
Better yet, no need for extra steps
>>
It's over
>>
We're so back
>>
I hasn't even started
>>
File: ComfyUI_Flux_9371.jpg (104 KB, 1024x1024)
104 KB
104 KB JPG
hoping we will see a pixelwave finetune of flux
>>
>>101977978
something more advanced
>>
>>101978019
Pixelwave sucks desu
>>
>>101978005 so quick? >>101978009
>>
File: ComfyUI_03657_.png (1.59 MB, 832x1216)
1.59 MB
1.59 MB PNG
>ONE GIRL
>I DON'T KNOW WHY
>>
File: ComfyUI_Flux_9331.jpg (159 KB, 1024x1024)
159 KB
159 KB JPG
>>101978046
pixelwave is one of few models that gives unique outputs. other models will give you nearly the same images
>>
File: myFile_20_4.0_049.png (2.59 MB, 1536x1536)
2.59 MB
2.59 MB PNG
>>101978063
>TWO RETARDS
>I DON'T KNOW WHY
>IT DOESN'T EVEN MATTER HOW HARD I GEN
>KEEP THAT IN MIND I DESIGNED THIS NODE SETUP TO EXPLAIN IN DUE TIME
>ALL
>I
>DON'T KNOW
>>
File: FLUX_00027_.png (1.04 MB, 896x1152)
1.04 MB
1.04 MB PNG
>>101955940
this is more of what I was aiming for
>>
File: ComfyUI_32254_.png (1.84 MB, 1024x1024)
1.84 MB
1.84 MB PNG
>>
anyone got flux dev to successfully convert to tensorrt?
There seems to be some implied workingness
https://github.com/comfyanonymous/ComfyUI_TensorRT

If so, what are your specs and workflow nodes?
>>
>>101977917
Nice, the patterning on the spiral is interesting. Last time I remember seeing something similar was in Kamen Rider. https://youtu.be/FLFV-itlSzc?t=28
>>
File: ComfyUI_00388_.png (1.45 MB, 1024x1024)
1.45 MB
1.45 MB PNG
>>101977230
you mean the one I used for pic related?
>>
File: 00025-3295957142_cleanup.png (3.37 MB, 1280x1920)
3.37 MB
3.37 MB PNG
>>101977978
SD1.5
>>
>>101978149
i'm asking what model
>>
>>101978165
Brav5
>>
File: file.png (771 KB, 512x768)
771 KB
771 KB PNG
>>101978063
Sorry SD can't do hands
>>
>>101978124
>anyone got flux dev to successfully convert to tensorrt?
that would be fp16 right? you can't make quants out of tensorrt I think
>>
so what's the best clip l? openai vit-l-14?
>>
>>101978170
which one specifically
>>
>>101978146
please more celebs in microbikinis
>>
>>101978191
yes
>>
File: ComfyUI_32258_.png (1.49 MB, 1024x1024)
1.49 MB
1.49 MB PNG
>>
File: ComfyUI_Flux_9105.jpg (201 KB, 1024x1024)
201 KB
201 KB JPG
would there be any benefit in using this fp16 gguf variant of flux, compared to the regular fp16 .sft?
https://huggingface.co/city96/FLUX.1-dev-gguf/blob/main/flux1-dev-F16.gguf
>>
File: file.png (753 KB, 512x768)
753 KB
753 KB PNG
>>101978170
>brav5
Holy shit the whole model is just that girl
>>
>>101978205
>yuri
>>
File: Untitled.png (124 KB, 2002x1461)
124 KB
124 KB PNG
>>101978201
where 2 cop 336 version?
>>
>>101978212
no
>>
>>101978222
maybe it's this one?
https://huggingface.co/openai/clip-vit-large-patch14-336
>>
File: 00002-1271968127.png (1.27 MB, 896x1152)
1.27 MB
1.27 MB PNG
>>
>>101978146
yes pls
>>
File: 00077-2170662465.jpg (144 KB, 1344x1600)
144 KB
144 KB JPG
>>101978146
yes or its inpainting?
>>
>>101978232
will comfy just load a .bin or .h5?
>>
>>101978222
you can't just swap the clip l for something different, the model won't understand the embeddings
you need OpenAI's ViT-L-14
>>
File: ComfyUI_Flux_23.png (1.18 MB, 1216x832)
1.18 MB
1.18 MB PNG
Not really feeling comfy today, bros
>>
File: 00078-4169123338.jpg (191 KB, 1344x1600)
191 KB
191 KB JPG
>>
>>101978232
found it
https://github.com/openai/CLIP/issues/199
>>
>>101978257
What went wrong?
>>
>>101978253
if i dont fuck around then ill never fiind out
>>
File: ComfyUI_32260_.png (1.81 MB, 1024x1024)
1.81 MB
1.81 MB PNG
>>
>>101978190
I asked chatgpt4o it seemed quite happy to say that it can and included general suggestions of methods. I can't say if they are true or not.
>>
>>101978277
the finding out can be painful and with permanent injuries
>>
>>101978277
this
https://www.youtube.com/watch?v=Of_8YG8b760
>>
SD just gave me the perfect image and I beat it so hard my balls hurt
>>
Is there any benefit of using the hires fix on flux?
>>
>>101978257
What's wrong with her arm?
>>
File: 2024-08-19_00252_.jpg (759 KB, 2048x2560)
759 KB
759 KB JPG
>>101978295
yes, use ultimate upscale to, pic related
>>
>>101978323
nothing, women have weird muscle attachments in the elbow, a degree of double jointedness is common in women.
>>
File: 00084-359097865.jpg (186 KB, 1344x1600)
186 KB
186 KB JPG
>>
File: 00028-1706116032.png (3.02 MB, 1280x1920)
3.02 MB
3.02 MB PNG
>>101978193
https://civitai.com/models/25494?modelVersionId=63786
>>
File: 2024-08-19_00255_.png (2.43 MB, 1280x1280)
2.43 MB
2.43 MB PNG
>>
>>101978384
thanks
>>
File: ifx117.png (1.54 MB, 1024x1024)
1.54 MB
1.54 MB PNG
>>
File: ComfyUI_08008_.png (814 KB, 1200x768)
814 KB
814 KB PNG
Flux.1 is getting pretty outdated at this point, Flux.2 when?
>>
>>101978384
kek'd at Stan Lee in the gallery
>>
File: ComfyUI_32267_.png (1.57 MB, 1024x1024)
1.57 MB
1.57 MB PNG
>>
>>101978412
right after SD3.1 drops
>>
What forge version is best for pony and sdxl, or does it even matter?
>>
File: image.jpg (73 KB, 1536x1024)
73 KB
73 KB JPG
>>101978412
>Flux.2 when?
when video flux drops + 8 months
>>
File: file.png (771 KB, 512x768)
771 KB
771 KB PNG
>write some prompts in my air conditioned home office
>refine until perfect
>queue 100 images
>go make some fresh gazpacho with 100% homegrown ingredients while they cook
>come back to a perfectly targeted porn set with your particular fetish
Is this what the Romans felt like just before their empire went to shit?
>>
>>101978481
yeah
>>
File: MarkuryFLUX_00106_.png (1.81 MB, 832x1216)
1.81 MB
1.81 MB PNG
>>101978412
Black Forest Labs they were only created 4 months ago, so in 4 months they managed to create Flux.1

It seems they were also working on the text to video model too. So I'm guess we will get video flux in 2 months and then 6 month after there might be a flux 2.
>>
>>101978489
Except they had actual human slaves to fuck instead of masturbating to blue light I guess.
>>
File: 00030-528228139.png (2.89 MB, 1280x1920)
2.89 MB
2.89 MB PNG
>>
>>101978492
Lol, lmao even
>>
File: 2024-08-19_00260_.png (1.86 MB, 1024x1280)
1.86 MB
1.86 MB PNG
>>101978477
>>101978492
you are all wrong, its just two more weeks as always
>>
File: file.png (1.7 MB, 768x1152)
1.7 MB
1.7 MB PNG
>>101978481
This thing can't do teeth
>>
File: ComfyUI_32270_.png (1.62 MB, 1024x1024)
1.62 MB
1.62 MB PNG
>>
>>101978498
right
>>
File: 00021-897495043.png (2.93 MB, 1280x1920)
2.93 MB
2.93 MB PNG
>>
>>101978498
A handful of them, the rest were poor fuckers working the field or deployed to bumfuck nowhere doing mandatory military service. Not that being deployed would be all bad, just fuck the local girls but there's a chance some German fucker would ambush and kill you.
>>
File: ComfyUI_Flux_26.png (1.31 MB, 1216x832)
1.31 MB
1.31 MB PNG
>>101978323
she's just a bit tired
>>
>>101978510
>This thing can't do teeth
those look like very normal and real asian teeth anon
the real test is seeing if it can make kids with their front baby teeth missing, which i am not going to attempt for obvious reasons but also because flux can probably do it no problem
>>
Let's roll...
>>101978563
>>101978563
>>101978563
>>
ive done some AB and cant see a difference on outputs between this clip_l linked by comfy
https://huggingface.co/comfyanonymous/flux_text_encoders/tree/main
and the extra gigabyte and change on this one
https://huggingface.co/openai/clip-vit-large-patch14/tree/main
>>
File: file.png (1.11 MB, 768x768)
1.11 MB
1.11 MB PNG
>>
>>101978237
its the microbikini LoRa from civi tho
>>
File: ComfyUI_temp_mkpud_00005_.png (2.27 MB, 1280x1600)
2.27 MB
2.27 MB PNG



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.