[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (1.01 MB, 3264x3264)
1.01 MB
1.01 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102181685

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
Let me see a beautiful girl, a sweet summer rose...
>>
>>102187084
shit taste
>>
File: 00163-4221760619.jpg (1.06 MB, 1613x2150)
1.06 MB
1.06 MB JPG
>>102187117
one from a project
>>
File: 2024-09-01_00244_.jpg (898 KB, 2496x3648)
898 KB
898 KB JPG
>>102187084
ty baker
>>
File: file.png (1015 KB, 908x909)
1015 KB
1015 KB PNG
https://drive.google.com/drive/folders/1eGrbstWLGOlinNL_d7WzaYcOzpSp5a9x
That's a cool way to find which styles actually work on flux dev
>>
Blessed thread of frenship
>>
>>102187144
yeah
>>
File: file.png (1.72 MB, 1280x896)
1.72 MB
1.72 MB PNG
>>
https://huggingface.co/leejet/FLUX.1-dev-gguf/tree/main
>Q8_0 -> 12.8gb
https://huggingface.co/city96/FLUX.1-dev-gguf/tree/main
>Q8_0 -> 12.7gb
I know that city's quants are a bit deprecated because he forgot to add some F32 shit on some weights, maybe the 12.8 one is the "real official" one expected from a real Q8_0 quant
>>
File: file.png (264 KB, 771x615)
264 KB
264 KB PNG
Emad doing some revisionism on his takedown request on SD1.5 in 2022 kek
>>
File: 1705942042745393.jpg (65 KB, 800x1170)
65 KB
65 KB JPG
>>
File: flux0208.jpg (1.87 MB, 2304x1792)
1.87 MB
1.87 MB JPG
>>
File: 2024-09-01_00255_.png (103 KB, 128x1024)
103 KB
103 KB PNG
>>
File: 1695136821822720.jpg (72 KB, 800x1170)
72 KB
72 KB JPG
>>
File: 1698658295095074.jpg (101 KB, 1170x800)
101 KB
101 KB JPG
Corporate memphis
>>
File: FLUX_02257_.png (1.11 MB, 896x1152)
1.11 MB
1.11 MB PNG
Trying to run FLUX controlnet for the first time on ComfyUI.


I do not understand in which folder should I put my controlent safternsors (flux-depth-controlnet-v3).
>>
File: 1710286942659905.jpg (129 KB, 1170x800)
129 KB
129 KB JPG
>>
File: 2024-09-01_00263_.png (935 KB, 384x3072)
935 KB
935 KB PNG
>>
>>102187370
The censorship really started on day 1 and ever since they've cared more about censorship than even making a functional model let alone giving me the ability to finetune it. SAI never once released real training tools and so far the only group that has is Pixart.
>>
File: 1693936336832408.jpg (129 KB, 800x1170)
129 KB
129 KB JPG
>>
>>102187445
Emad got lucky Runway released the uncensored version of SD1.5 + Nai's leak, without that his cucked shit would've never taken off the way it did
>>
bigma status?
any chance it'll mog flux or no?
>>
File: 2024-09-01_00267_.png (2.36 MB, 768x4608)
2.36 MB
2.36 MB PNG
>>
File: 00064-3029397221.png (421 KB, 344x1088)
421 KB
421 KB PNG
>>102187479
nice lanky felines
>>
>>102187476
>We will try to let our model out in Sep.
>let our model out
It'll be a monster.
>>
>>102187095
yeah, using a few different ones I trained
>>
>>102187476
>>102187504
imo they'll delay that release, everything worse than flux dev will be discarded anyway, I'm glad they don't have much choice but to release something actually good now
>>
>>102187476
For raw power nothing is going to mog Flux so smaller models can only compete on trainability on consumer hardware. The next Pixart model will probably be on par with SD3 but without the anatomical monsters.
>>
pixartsexuals will rise again
>>
File: 00000-4007433951.jpg (310 KB, 1576x1120)
310 KB
310 KB JPG
>>
>>102187514
you will release them on civitai at some point?
>>
File: memphis.png (2.43 MB, 3264x2029)
2.43 MB
2.43 MB PNG
>>102187372
>>102187408
Could be more exaggerated, that's where its charm comes from.
>>
>>102187517
>>102187519
The fact that flux is a guidance distilled model is really annoying, and the more I use flux, training loras on it etc, the more I'm convinced that maybe it's not fixable. When training a complex concept lora on thousands of images, you basically have to use CFG to get good comprehension of the new things it learned. But it still hasn't lost the guidance distilled nature of it, so you have to use hacks like dynamic thresholding, automatic CFG and such to not fry the image.

Basically what I'm saying is a model slightly worse than Flux but without this distillation bullshit might take off due to that fact alone. And you probably could fine tune such a model for much longer and not end up in this weird middle ground halfway between a normal model and a guidance distilled model like you do when finetuning flux.
>>
>>102187635
My theory is that you can get rid of that guidance bullshit if you make a giant finetune that will make Flux not burn at CFG > 1, I'm sure that's possible, and yeah I agree this shit is annoying as fuck, CFG is king and they made a mistake by adding this distilled guidance bullshit
>>
File: 1717570818804581.jpg (89 KB, 1292x738)
89 KB
89 KB JPG
>>102187611
My gens are just prompts .
Need to lora for more exaggerated limbs i guess .
>>
>>102187635
If Pixart is 3B and supports negative prompts, it wins. I also don't think Flux fully uses its 12B parameters and things like buttchin smells of overtraining.
>>
>>102187380
>>102187443
>>102187479
bongbat is bong
>>
File: 00066-1901601229.png (291 KB, 1008x256)
291 KB
291 KB PNG
>>
>>102187727
>If Pixart is 3B and supports negative prompts, it wins.
you only have one way to support negative prompt, it's to go for CFG > 1, so it will always be twice as slow
>>
>>102187757
negative prompts on Flux looks to be a janky hack
>>
>>102187757
but at 3B with CFG it will still run faster than Flux
>>
>>102187770
I know I know, it's one thing to make CFG > 1 work on Flux, but negative prompt seems to be working at 20% of the time (and I'm being nice there)
>>
>>102187783
>but at 3B with CFG it will still run faster than Flux
we already have a 3b model it's SDXL (3.5b), and desu I don't mind to wait longer to get the Flux quality, I'm not going back to smaller models (unless they managed somehow to get that quality with such a small size)
>>
>>102187809
SDXL isn't 3.5B because SAI are a bunch of liars. We're talking strictly about the weights of the core model, not pretending the T5 and VAE should be counted.
>>
>>102187824
I think SDXL works with clip only, and that shit is small (300 mb), so SDXL is really a 3.5b model
>>
>XL sized
>16ch VAE
>Better dataset
>>
I will not post in the thread
>>
>>102187863
There won't be a better dataset, only Musk has the balls to make a completely uncensored model that has all the celebrities, characters and NFSW in there
>>
>>102187839
They include the VAE parameters.
>>
it's pixart pride month
>>
>>102187874
Isn't grok 2 based on flux?
>>
File: 000000_17180_.png (2.25 MB, 1508x1032)
2.25 MB
2.25 MB PNG
>>
File: file.png (2.87 MB, 1290x2064)
2.87 MB
2.87 MB PNG
>>102187878
You're right, the unet model is 2.6b
>>
>>102187903
It was also poorly trained because Pixart Sigma with 600m is on par if not better than SDXL.
>>
>>102187889
I think it's a finetune of flux pro or something like that yeah, I said that Musk has balls, not that he's talented enough to make an actual good model
>>
>>102187903
wow a thot
>>
>>102187914
as much as I despise SAI, SDXL is a unet model wheras Pixart is a DiT, the architecture difference helped Pixart to perform aswell with such a small size imo
>>
>>102187915
>finetune of flux pro
Not a finetune.
>>
if you guys could wish for the perfect pixart model what parameter size would you want it to be?
>>
>>102187928
2B or 3B. Whatever is largest that can be feasibly full fine tuned on 24 GB of VRAM.
>>
>>102187928
I wanted to say 12b is fine (because you can run Q8 on it and the quality is almost on par with fp16) but then I remember we can't really finetune a 12b model with our current gpu's so...
>>
>>102187809
Flux could easily be 8B
>>102187903
>multiple text encoders is a GOOD THING
fuck Lykon, dropkick a Lykon
>>
>>102187926
SAI actively made sure finetuning was difficult as a censorship/safety strategy. "Loras are all you need".
>>
>>102187961
Huh, this sounds so familiar...
>>
>>102187937
that's just sad, Nvdia is nerfind everyone with their low VRAM gpu releases, if we had 48gb we wouldn't even question anything and go for giant model and locals would be really good
>>
>>102187975
I'll happily take an 8B Pixart model that fully supported Fairscale which allows you to fairly efficiency swap train on smaller GPUs.
>>
File: kde.jpg (24 KB, 610x542)
24 KB
24 KB JPG
Anyone using Forge? I was able to gen a few images the other day but now trying anything just causes it to eat all RAM and trigger oom killer. Even with the exact same prompt as before.
>>
>>102187975
A 48 GB VRAM model would make the poorfags seethe. They're already frothing with 24 GB models.
>>
I have some good news, I found some alternative Q8 that are closer to fp8 than the regular Q8_0, and it works fine on the GGUF node
https://imgsli.com/MjkzMTU0
https://huggingface.co/mo137/FLUX.1-dev_Q8-fp16-fp32-mix_8-to-32-bpw_gguf/tree/main
>>
>>102187903
SAIkeks flexing with 1girls will never not be funny
>>
>>102188024
and then anon would complain about NVIDIA not making 80GB VRAM consumer cards
there is no satisfying the retards that think bigger models are always better
>>
File: file.png (825 KB, 630x740)
825 KB
825 KB PNG
>>102188050
People would be investing in 40/80 GB cards if we didn't know they'd be deprecated in 2 years. Think about how much people were spending on PCs in the 80s.
>>
>>102188042
Q8 is superior to fp8 though, did you mean fp16?
>>
anyone know if there's an /e/ bake going on for flux? juice isnt worth the squeeze fussing with base flux and style loras. im willing to cough up a donation for server time.
>>
>>102188050
debo, why are you ok with Nvdia staying with 24gb for more than 6 years at this point, their RTXTitan (24gb) was made in 2018, we have to advance like we always done on the computer ecosystem, if we listen to you we would've stayed on 1gb from the 2000's pc because "hurdur that's enough for you goys!"
>>
File: 1724560544371292.png (1.38 MB, 1024x1024)
1.38 MB
1.38 MB PNG
i will not post in the thread
>>
>>102188083
oh yeah I meant fp16 my b
>>
File: ComfyUI_01560_.jpg (958 KB, 1440x1920)
958 KB
958 KB JPG
>>102187609
I wasn't planning to, I only just started training loras
>>
>>102188111
what lora did you use for THOSE
>>
>>102188024
well fuck them because I'm not paying 10000 dollars to get a 48gb card
>>
>>102188070
Very few people were buying those.
What do you mean by "deprecated", are A100s "deprecated"?
>>102188099
I'm not debo.
>>
>>102188140
>I'm not debo.
yes you are, responding to multiple people on the same post + having retarded takes is debo's best signature
>>
>>102188140
>I'm not debo.
just means "retard" desu
slang
>>
@102188140
d*bo
>>
where's bigma
>>
I keep seeing the same fucking name plastered over the threads
spamming about thread personalities is just as cancerous as them
>>
>>102188140
I don't care, people spent way more on computers 30 years ago then you will spend today. Guess what anon, Earth to anon, computers were $1000 in 1990. Do you need an inflation calculator? Yes anon, H100s by far excel over A100s. Let me know if that confuses you.
>>
>>102188139
okay, but how does that harm me?
>>
>>102188209
that harms you because we can't finetune Flux at the moment, our 24gb cards aren't enough, and we can't do multi gpu training innit?
>>
>>102188226
yeah I'm sure you're really a big contributor
>>
>>102188192
> Yes anon, H100s by far excel over A100s
That doesn't deprecate the A100s, anon, does it?
>>
"Gamers" are okay with 24GB cards
>these are not gaming cards
>jewvidia is an AI company
You're right, which is why the 32GB+ cards are marketed to those companies.
>>
>>102187974
thinking of flux?
I'm working on it
>>
>>102188235
So you're ok with zero flux finetune? Because I don't, debo
>>
>>102188236
I'm not getting in an autistic semantics debate with a poorfag
>>
>>102188138
https://civitai.com/models/656458/big-boobs-flux?modelVersionId=734465
https://civitai.com/models/661287/jk-perfect-breasts-for-flux-perky-torpedo-tits?modelVersionId=740030
>>
>>102188250
I accept your concession.
>>
File: 3041267228.png (1.14 MB, 832x1216)
1.14 MB
1.14 MB PNG
>>
If Anon didn't figure out how to make Flux run on cards like the 1080 then it would be even more of a fuck up with far far less adoption.
>>
>>102188244
I'm sure someone that isn't you is going to finetune it. Don't worry anon, hope and pray that's what you like to do.
>>
>>102188260
A100s are still, why haven't you bought one again?
>>
>>102188291
Are still what?
You didn't want to go into an autistic semantics debate but you do want to go into a non sequitur debate?
>>
File: file.png (1.56 MB, 1280x896)
1.56 MB
1.56 MB PNG
>>
im a varamlet and im proud
>>
>>102188018
change this to automatic (fp16) so the loras stop getting rebuit all the time
>>
File: 1725109771078193.jpg (314 KB, 1170x1142)
314 KB
314 KB JPG
can A1111 use Flux yet
>>
>>102188369
just use forge. it's literally an updated a1111
>>
File: file.png (668 KB, 874x695)
668 KB
668 KB PNG
When will this fucker die
>>
>>102188361
I already set it to that, plus I wasn't using any loras.
Seems to be this issue https://github.com/lllyasviel/stable-diffusion-webui-forge/issues/343 except mine happens immediately.
>>
File: 00068-3449512665.png (1.26 MB, 1256x896)
1.26 MB
1.26 MB PNG
>>
File: 00253.png (1.85 MB, 832x1152)
1.85 MB
1.85 MB PNG
>>
File: 1715104055200440.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
>>102188396
i'd much rather see 1girl than his bug face and unkempt hair
>>
File: file.png (1.63 MB, 1024x1024)
1.63 MB
1.63 MB PNG
>>102188396
>>
File: 967610945.png (2.1 MB, 1152x896)
2.1 MB
2.1 MB PNG
>>102188396
Does it even look better? It's different, I'll give him that.
>>
i won
>>
2young nick
>>
File: 00000-3749492016.png (1.05 MB, 896x1152)
1.05 MB
1.05 MB PNG
>>
>>102187915
i would say the opposite actually, he clearly has the people since grok2 is actually a competitive llm with the top dogs now, but making an uncensored image model is probably too much of a liability even for him
>>
File: ComfyUI_00200_.png (1.07 MB, 1024x1024)
1.07 MB
1.07 MB PNG
>>
File: ComfyUI_Flux_56.png (1.15 MB, 1280x720)
1.15 MB
1.15 MB PNG
>>102188111
>>
File: 000000_17183_.png (2.51 MB, 1508x1032)
2.51 MB
2.51 MB PNG
>>
>>102188580
cool
>>
File: 00007-2511165097.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>
File: _0023.png (864 KB, 1024x1024)
864 KB
864 KB PNG
finally.. finished my final outputs test between all the different bakes/epochs, once I'm done sorting these I can decide on one and move the fuck on to new loras. I want to make some character ones but I'll probably wait til kohya releases layer training so I can try that..

how many layers does flux have? maybe I can spend the remainder of my runpod funds testing what some of them do before I move to vast
>>
>>102188445
Very nice
>>
File: 918739255.png (1.07 MB, 1216x832)
1.07 MB
1.07 MB PNG
>>
File: 00073-2748758278.png (773 KB, 888x616)
773 KB
773 KB PNG
endless things to x/y compare
>>
File: 1021377166.png (1.5 MB, 832x1216)
1.5 MB
1.5 MB PNG
>>102188615
thx
>>
File: 000000_17185_.png (2.2 MB, 1508x1032)
2.2 MB
2.2 MB PNG
>>102188602
Ty, Flux1.dev_Q8_0, with a clipvision load image.
>>
>>102188826
Model?
>>
comfortable
>>
>>102188396
I wouldn't even be bothered if it wasn't for him just plastering the same fucking face every time
>>
>>102188042
https://imgsli.com/MjkzMTYz
wtf, why the 11bpw one is the worse? it's the biggest one of them all
>>
File: 2024-09-01_00269_.png (1.47 MB, 1024x1024)
1.47 MB
1.47 MB PNG
>>
>>102189227
wonder if it knows of lykoi cats
>>
File: 2024-09-01_00271_.png (1.6 MB, 1024x1024)
1.6 MB
1.6 MB PNG
>>
>>102189255
>lykoi cats
ya looks like one you are right.. did not know em .. I just wanted a "rough charcoal sketch of a cat"
>>
>>102187146
Hmmm... could be prettier.
>>
do you think training only specific layers will help reduce finger rapeage for loras?
>>
>>102189344
Theoretically yes because the layer in charge of rendering the fingers should be untouched.
>>
>>102189344
the less weights raped, the better
>>
File: 00077-2408760732.png (459 KB, 544x688)
459 KB
459 KB PNG
less weight, the better, agreed
>>
what's a layer?
>>
>>102189464
Ukraine will lose, unless the West donates cannon fodder.
>>
File: ComfyUI_00142_.png (751 KB, 488x1064)
751 KB
751 KB PNG
I can't get it to say "I drew myself". Is it because flux reads text in tokens and doesn't know how to spell some of them?
>>
>>102184029
Fucking based
>>
File: th-1277584323.jpg (78 KB, 474x579)
78 KB
78 KB JPG
amazing how far we've come
>>
>>102184589
>>102184735
Once per user if it were once per image you would be able to game the buzz system with a 2nd account
>>
File: 2027741117.png (2.19 MB, 1152x896)
2.19 MB
2.19 MB PNG
>>102188939
Flux dev with this lora https://civitai.com/models/651715?modelVersionId=750685
>>
File: file.png (16 KB, 540x89)
16 KB
16 KB PNG
1.09B
>>
File: delux_sg_00103_.png (2.01 MB, 1536x968)
2.01 MB
2.01 MB PNG
>>102188340
I see that debo hiding in the back there

>>102188341
>varamlet

>>102188396
I still don't know who this is

>>102189503
jackets, sweaters, maybe scarfs

>>102189522
what are the ramifications on local image generation?
>>
File: 1725186795.png (1.73 MB, 1024x1024)
1.73 MB
1.73 MB PNG
>>
>>102188490
They just have nanny ai that runs fast after gens to check against tiddies, in addition to the usual policing of language. There's absolutely no reason to go back, because all of the capacities and elements will be here soon. The basic roadmap is:

>every prompt, a lora

then

>every inpaint, a lora

rn there aren't that many loras, but eventually it will be seamless.
>>
>>102189401
>>102189435
really want to test on style loras, it'll be more of a challenge to get good fingers still in the correct style I'm guessing
>>
>>102189593
fake. Spacebar's on the wrong side.
>>
File: FD_00039_.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
>>102189593
A cornographic image? On a BLUE board?
>>102189564
Speaking of, upload some gens to the LoRA so I can recoup some of the buzz.
https://civitai.com/models/709157
>>
File: 00170-4221760619.jpg (1.11 MB, 1613x2150)
1.11 MB
1.11 MB JPG
>>102189325
>>
>>102189625
>Speaking of, upload some gens to the LoRA so I can recoup some of the buzz.
no
>>
>>102189590
>what are the ramifications on local image generation?
More refugee mathematicians, I guess.
>>
>>102189576
Thank you!
>>
>>102189401
>the layer in charge of rendering the fingers
no such thing
>>
File: 000000_17190_.png (2.56 MB, 1508x1032)
2.56 MB
2.56 MB PNG
>>
File: 013.jpg (187 KB, 1288x1288)
187 KB
187 KB JPG
>>
File: file.png (430 KB, 1024x1024)
430 KB
430 KB PNG
>>
File: 00008-2246758491.jpg (2.15 MB, 2048x2048)
2.15 MB
2.15 MB JPG
>>102189722
Nice
>>
File: Sigma_13289_.png (1.64 MB, 1024x1024)
1.64 MB
1.64 MB PNG
>>
File: Sigma_13240_.png (1.22 MB, 1024x1024)
1.22 MB
1.22 MB PNG
>>
>>102189738
>beam me up!
1.8 guidance is nice.
>>
>>102189752
What
>>
File: FD_00049_.png (651 KB, 1024x1024)
651 KB
651 KB PNG
>>102189644
>>
>>102189778
>>
>>102189822
Who asked
>>
>>102189829
Try it and post the result to see if that person is cool or not
>>
>>102189840
No thanks
>>
I don't know if this has any use to anyone who wants to test blocks, but I checked the state_dic of flux and got this:
https://files.catbox.moe/i1cetj.txt
>>
File: file.png (2.03 MB, 1024x1024)
2.03 MB
2.03 MB PNG
>>
>>102189887
What is happening here
>>
File: 1711823131985222.jpg (54 KB, 950x1067)
54 KB
54 KB JPG
I think ideogram 2.0 is best image generator right if we dont forget about its closed
>>
>>102189887
Sometimes when everyone has a nice day the answer is to smash a bee hive. Except it's a sub-conscious compulsion.
>>
>>102188396
>Downvote
Why this one turkjeet makes /ldg/ seeth so much ?
>>
File: 1714038463298415.png (9 KB, 435x115)
9 KB
9 KB PNG
>>102189910
>ideogram 2.0
No thx
>>
>>102189910
what makes you believe that? do you have some image examples that show how better it is compared to the rest?
>>
File: 2024-09-01_00288_.jpg (1.94 MB, 3072x3072)
1.94 MB
1.94 MB JPG
>>
File: 1721705563455704.jpg (121 KB, 1230x768)
121 KB
121 KB JPG
>>102189929
Picrel might be the best example in one prompt it chreated 4 different characters with every details i asked and style
>>
File: 000000_17192_.png (2.24 MB, 1508x1032)
2.24 MB
2.24 MB PNG
>>
>>102189958
Wow you prompted characters all from the same franchise?
>>
File: 1721462056589845.jpg (107 KB, 984x984)
107 KB
107 KB JPG
Bros we got nearly everything sd had for flux in 2 weeks
>>
>>102189987
Except for soul.
>>
>>102189625
>>102189794
Does this LoRA require special settings/requirements for Flux? It seems to hang on my computer
>>
File: file.png (2.12 MB, 1024x1024)
2.12 MB
2.12 MB PNG
>>
>>102189987
*4 weeks .. it was released a month ago
>>
File: 1712397217417367.png (15 KB, 957x136)
15 KB
15 KB PNG
>>102190003
>>
File: delux_sg_00106_.png (1.84 MB, 1536x968)
1.84 MB
1.84 MB PNG
>>102190000
checked and true
>>
>>102189986
Vector art illustrations of four major Norse gods in a single row, utilizing a flat design style. From left to right: Odin (an older man with a long white beard and one eye, wearing a blue tunic with brown leather accents and a winged helmet, holding a spear with a serious expression); Thor (a muscular figure with flowing red hair and beard, wearing a metal breastplate and red cape, wielding his hammer Mjolnir with a determined look); Loki (a slender figure with a green and gold tunic and a mischievous, cunning expression, holding a staff with distinctive green accents on his clothing); and Hel (a pale-skinned woman with long black hair and a stern expression, wearing a black and white dress, with half of her face appearing skeletal, holding a staff topped with a skull). Each god should be set against a simple white background. Maintain a minimalist style with clean lines, resembling character designs for a mobile game.
>>
File: file.png (979 KB, 1024x1024)
979 KB
979 KB PNG
>>
File: 1709651416549558.jpg (110 KB, 800x1198)
110 KB
110 KB JPG
>>
File: 2024-09-01_00294_.jpg (1.38 MB, 3072x3072)
1.38 MB
1.38 MB JPG
heck I love the crazy cool typefaces FLUX comes up with
>>
>>102190003
>>102190027
nvm restarted and it seems to be working now, think I might have accidentally used run_cpu.bat, kek
>>
File: 1701584142976.jpg (483 KB, 1024x1024)
483 KB
483 KB JPG
>>
File: 1701594526626516.jpg (126 KB, 800x1198)
126 KB
126 KB JPG
How many charctors flux knows ? I only know about migu and trump
>>
>>102189926
>>102189910
I unironically canceled once I found out SDXL existed (and I had my GPU on the way).

Coincidentally, Flux came out right as I built my machine.

Flux is so good, it's gonna be like interior decorating levels of skill needed. Like "um, I want xyz right there"
>>
>>102190134
lots of comic book characters with varying likeness strengths
and Meghan Markle
>>
File: SoWeird.jpg (3.34 MB, 7961x1975)
3.34 MB
3.34 MB JPG
This is so weird...
https://imgsli.com/MjkzMTc1
https://huggingface.co/mo137/FLUX.1-dev_Q8-fp16-fp32-mix_8-to-32-bpw_gguf/tree/main
>>
File: ComfyUI_33294_.png (1.49 MB, 1280x720)
1.49 MB
1.49 MB PNG
>>
>>102190134
>https://drive.google.com/drive/folders/1eGrbstWLGOlinNL_d7WzaYcOzpSp5a9x
troll gonna troll I guess
>>
File: 00171-4256486162.jpg (555 KB, 1210x1920)
555 KB
555 KB JPG
>>
>>102190170
he asked for characters and you give him styles, are you retarded or something?
>>
File: 2024-09-01_00020_.png (1.34 MB, 832x1216)
1.34 MB
1.34 MB PNG
>>
>>102190159
It's functionally identical
>>
>>102190197
yeah, white skin is identical to black skin, you just solved racism
>>
File: 2024-09-01_00298_.jpg (1.44 MB, 3072x3072)
1.44 MB
1.44 MB JPG
>>102190134
check an anon made some research last thread
>https://mega.nz/folder/a2Ri0b4Z#SNKrUAChFFeovXJZT3V5SA
Knows many super heroes, vidya and comic characters .. but got some glaring holes in the knowledge as you will see
>>
>>102190134
I'm running character tests, here's where I'm at so far
https://mega.nz/folder/a2Ri0b4Z#SNKrUAChFFeovXJZT3V5SA
>>
>>102190240
it know useless shit like western superheroes, I want it to know my japanese waifus instead :(
>>
>>102190260
traint it on em.. its so easy and takes like an hour for a character lora
>>
File: 1704587123311288.jpg (67 KB, 1170x800)
67 KB
67 KB JPG
>>102190240
>>102190245
>>102190170
Thanks bros
>>
Favorite flux checkpoints???
>>
File: 00174-1319252815.jpg (426 KB, 960x1280)
426 KB
426 KB JPG
>>102190321
flux1-dev-bnb-nf4-v2
>>
>>102190321
Flux.1 Dev
>>
>>102190159
So basically? The 9.341 bpw one is the closest to fp16 and not the bigger ones? That's weird indeed.
>>
>>102190321
AbyssOrangeFlux2_hard
>>
>>102190321
FLUX.1 [pro]
>>
File: 1702000934353.jpg (731 KB, 1024x1024)
731 KB
731 KB JPG
>>
>>102190333
why going for nf4 when you can go for Q4_0, it has the same size but better quality
>>
File: file.png (2.43 MB, 1024x1024)
2.43 MB
2.43 MB PNG
Heard you people are talking shit about me on here
>>
looks like inspire_pack has the ability to isolate layers in already trained loras https://www.reddit.com/r/comfyui/comments/1f6bymd/lora_block_weight_for_flux_inspire_pack_in/

might help in deciphering what block weights to use for testing training of loras

maybe this is old news but I hadn't know about it yet
>>
>>102190355
I can't say anything about you without a patreon link
>>
File: 1704069083835768.png (873 KB, 1024x768)
873 KB
873 KB PNG
>>102190098
>>
File: 1720686932123.jpg (749 KB, 1024x1024)
749 KB
749 KB JPG
>>
>>102190339
>So basically? The 9.341 bpw one is the closest to fp16 and not the bigger ones? That's weird indeed.
yeah, if you have a 24gb vram card, might aswell go for that one, it's the new closest quant to fp16
https://imgsli.com/MjkzMTgw
>>
File: 1721259462766087.png (406 KB, 1024x768)
406 KB
406 KB PNG
>>102190372
>Shake it. Bitch.
>>
>>102190351
I completely forgot it exists, have to dl the .gguf one. Ty man
>>
>>102190372
>>102190403
Glad it's working properly took me some fuckery to find the triggers. I'm not entirely happy with it so I might try again with different settings.
>>
>>102190195
Nice shot
>>102190321
flux1-dev-q8_0.gguf
>>
>>102190351
doesn't Q4_K_S have better quality than Q4_0 despite being the same size?
>>
File: file.png (66 KB, 963x762)
66 KB
66 KB PNG
>>102190432
you're right
https://github.com/ggerganov/llama.cpp/pull/1684#issuecomment-1579252501
>>
File: delux_sg_00107_.png (1.82 MB, 1536x968)
1.82 MB
1.82 MB PNG
>>102190240
someone (not me) should publish this as a website

>>102190355
this guy must have the coolest tinder profile at this point
>>
>>102190321
Q8_0-fp32-09.341bpw >>102190398
>>
>>102190432
>>102190450
Where does Q4_1 fit in quality-wise? It's what I'm using now on my 12GB card
>>
https://aclanthology.org/2024.lt4hala-1.15.pdf

gpt-4 works with Latin, to some extent. What others work with Latin?
>>
>>102190477
you can see it on the image, it has worse quality than q4_k_s (16.3% error for q4_1 vs 13.2% error for q4_k_s)
>>
>>102190481
wrong thread anon
>>
>>102190477
use Q5_1 or Q6_K if you have 12GB, Q5_1 should be slightly faster with loras since it isn't a K quant
>>
>>102190492
>>102190533
Nice, I'll check all of these out
>>
>>102190533
I use Q8 with 12gb and it works fine, but probably a bit slower than those two. (around 4 something /it
>>
>>102190207
Oh shit you're right. I genuinely didn't even notice. I need to spend some time in /pol/
>>
>>102190502
oops lmao
>>
>>102190576
kek
>>
File: 1721222120927313.png (742 KB, 1024x768)
742 KB
742 KB PNG
>So Nicolette, tell me again that story of how you lost your virginity.
>>
File: ComfyUI_33308_.png (1.48 MB, 1280x720)
1.48 MB
1.48 MB PNG
>>
>>102190562
I thought quants don't actually speed up gen time, just prevent oom?
>>
>>102190606
What's he holding?looks like a packet of mayonnaise
>>
>>102190638
it speed up gen time if the quant is small enough to fit on the gpu, if it's too big, some of it end up on the cpu and it makes shit slow
>>
>>102190645
They're about to shoot one of those Japanese image videos where they don't actually have sex but just simulate it using white condiments
>>
>>102190663
That's a thing?
>>
>>102190638
they are slower than their equivalent in fp, but they are faster than a higher fp .. sooo:
(in matter of speed not quality fp16 is the slowest)
>fp16 -> q8 -> fp8 -> q6 ... etc.
>>
>>102190675
Yes. Most of it is basically gravure with nipples and buttholes, the sex scenes are usually the final scene of the set.
>>
File: ComfyUI_33309_.png (1018 KB, 1280x720)
1018 KB
1018 KB PNG
>>
File: 1703747530224.jpg (993 KB, 1024x1024)
993 KB
993 KB JPG
>>
>>102190647
ohh derp, of course. thanks anon
>>
File: 2024-09-01_00320_.jpg (1.55 MB, 3072x3072)
1.55 MB
1.55 MB JPG
>>102190715
ooph what a pun.. nice gen tho
>>
File: 1725160070.png (1.71 MB, 1024x1024)
1.71 MB
1.71 MB PNG
>>
>>102190741
Every time I do female robots it always gives them nips
>>
File: ogog.png (24 KB, 1158x102)
24 KB
24 KB PNG
is this the correct stuff?
>>
File: 1715240600407.jpg (757 KB, 1024x1024)
757 KB
757 KB JPG
>>
>>102190785
you should go for Q4_K_S instead anon >>102190450
>>
So i hear Cog has an img2vid model, but they have no date for release and if it is EVER released it will not be open source.

I hereby name 1st Sept as "ghey devs" day.
>>
File: 1694753617441046.png (31 KB, 351x466)
31 KB
31 KB PNG
What's a good way to "fix" noise on an image so it upscales better?
I like to tweak my gens to perfection and that usually means a LOT of manual drawing, inpainting, merging, photoshop edits and more. Unfortunately the processing sometimes leaves some parts looking too "smooth".
Even manually adding some uniform/gaussian noise in Photoshop and running the whole image through with low denoise doesn't fix it, it leaves the added noise in and you can clearly see what parts were edited (picrel).
What's a good way to fix this? I just want to add some actual (uniform and consistent) texture to the fabrics so they upscale nicely.
>>
>>102190797
meh, fuck them, BFL will deliver with their video model like they did with flux
>>
File: 2024-09-01_00324_.jpg (1.57 MB, 3072x3072)
1.57 MB
1.57 MB JPG
>>102190782
these are merely buttons, she is a coffee machine, the left button is for sugar, the right for milk
>>
File: 00082-1791074334.png (844 KB, 888x808)
844 KB
844 KB PNG
>>102190785
same set up as me
i have 'automatic (fp16 lora)' swap method: queue swap location: cpu and gpu weights setting at like 3400. no idea of those are good but it works for me currently
>>
>>102190801
get an different upscaler.. some radically remove noise, some smooth it out
>>
File: 00272.png (1.25 MB, 896x1152)
1.25 MB
1.25 MB PNG
>>
>>102190808
All our eggs in one basket again, and not by choice anon, that's the worst of it.
No idea when BFL will release either, it's been 1 month to the day since BFL launched.
I'm getting angsty for a good local img2vid model with coherency to come out.
I hope they give ys something with high coherence, Flux has been fairly solid even though it has limitations as is.
>>
>>102190782
prominent nipples are a defining characteristic of women (not troons)
>>
>>102190887
Robots aren't women
>>
File: 002.jpg (174 KB, 1288x1288)
174 KB
174 KB JPG
>>
>>102190880
I don't see a reason why we should doubt them, this team left SAI because they were sick of their cucked policies and managed to make and release a fairly uncensored (AND GOOD) model just like that, now they have some ties with Musk they probably have more money now, so it's more likely it'll be a good video model, now I hope we'll be able to run it but if we don't, the GGUFs are gonna save us
>>
>>102190900
that robot identified as a woman you bigot!
>>
>>102190794
How is this compared to the nf4 model
>>
>>102190924
I hope so anon, the pain of SD3 still hurts.
>>
File: 2024-09-01_00330_.jpg (1.79 MB, 3072x3072)
1.79 MB
1.79 MB JPG
>>102190936
no it identifies as coffee machine
>>
File: 1702718536493.jpg (662 KB, 1024x1024)
662 KB
662 KB JPG
>>
>>102190946
Looks like this: Q4_K_S > Q4_0 > nf4
>>
File: 00004-2130684045.jpg (492 KB, 1080x1440)
492 KB
492 KB JPG
>>102190841
yea i guess this works. 3060 lacks horsepower
>>
File: ComfyUI_00022_.png (1.04 MB, 1280x720)
1.04 MB
1.04 MB PNG
Finally got flux working somewhat reasonably. On a 3080ti with flux1-dev-Q4_K_S.gguf . I'm getting ~2s/it , or about 60 seconds per image. Is that within the ballpark of the performance I should expect or is something weird happening? Using essentially the canonical flux workflow with the gguf extension. I'm also running on a 7yo intel i7-8700 fwiw; I still don't have a good understanding to what gguf actually ends up offloading from the gpu.
>>
File: 1714385214513588.png (1.99 MB, 1024x1536)
1.99 MB
1.99 MB PNG
Q4_1 on top, Q4_K_S on bottom. Q4_1 actually looks a bit better to me.
>>
>>102191049
Is that a remote control for a vibrator?
>>
>>102191049
KS deleted the women with green pants. What a racist.
>>
>>102190987
Thank you
>>
>>102191049
that's not how quant comparisons work, the only way to know which one is better is to know which one is the cloest to fp16
>>
>>102191071
I think one of those lovely latina ladies dropped a hairclip at yoga, if I return it she might marry me
>>
>>102190987
Do you know where to get the other stuff for the Q models besides the main model?
>>
>>102191021
> I'm getting ~2s/it , or about 60 seconds per image
at 1024x1024? ya that is about right .. a 3090 is ~1.5s/it, a 4090 ~0.75s/it
>>
>>102191099
what do you mean exactly?
>>
File: 2024-09-01_00334_.png (1.86 MB, 1024x1024)
1.86 MB
1.86 MB PNG
>>
>>102191110
>>102190785
This stuff
>>
File: 1712003884197196.png (1.95 MB, 1024x1536)
1.95 MB
1.95 MB PNG
>>102191085
I've proompted 8 pictures and in nearly all of them, the Q4_K_S has issues like clothing straps that just disappear or in some cases outright deformities.
>>
>>102191118
Awww
>>
>>102191134
K quants are wonky right now, they seem to perform worse with loras
>>
File: 1710344531186700.png (2.13 MB, 1024x1536)
2.13 MB
2.13 MB PNG
>>102191134
Here's Q5_1 (top) vs Q6_K (bottom) as well
>>
>>102191120
I'm not sure I understand your needs but I'm gonna try anyway:

The VAE:
https://huggingface.co/black-forest-labs/FLUX.1-schnell/blob/main/ae.safetensors

The text encoders:
https://huggingface.co/comfyanonymous/flux_text_encoders/tree/main
>>
>>102191093
Oh ok maybe I watched too much JAV in the past, but I thought that was a scene where he controls all the women that have a vibrator inside them with his remote control
>>
>>102191134
if it looks better, go with it, dont waste a bunch of time on it
>>
>>102191170
Thank you so so much
>>
>>102191120
https://huggingface.co/city96/FLUX.1-dev-gguf/tree/main
https://huggingface.co/city96/t5-v1_1-xxl-encoder-gguf
>>
File: ComfyUI_00029_.png (1022 KB, 1280x720)
1022 KB
1022 KB PNG
>>102191100
ok, thanks. A minute per image is bearable for the quality; If there's different quantization working reasonably at like 15s/image for some quality loss, I think I'd prefer it though.
>>
>>102191175
It's already been tested anyway. Each time the model gets smaller the further away from fp16 it gets. It's very linear.
>>
>>102191166
could you try this Q5_1 and Q6_K comparison without loras?
>>
>>102191134
I have a theory that the K quants are too "irregular" in terms of weights precision differences, and the image models doesn't seem to like that somehow, that can explain why a 11bpw perform worse than a 8.5bpw >>102190159
>>
>>102191193
A tip, you can gen at lower resolutions and still get excellent quality with flux. I get 4it/s on a 4080 using fp8 at 512x512. If you're just testing concepts drop the resolution, then crank it up when you've mostly dialled in
>>
>>102190785
What is the link for this model
>>
File: 1709035848204.jpg (644 KB, 1024x1024)
644 KB
644 KB JPG
>>
File: 2024-09-01_00335_.png (1.94 MB, 1024x1024)
1.94 MB
1.94 MB PNG
>>102191142
ty
>>
Come and get your own loaf of...
>>102191214
>>102191214
>>102191214
>>
>>102191251
the worst collage so far, gj
>>
File: 1712130611283938.png (875 KB, 1024x768)
875 KB
875 KB PNG
>>102191162
>>102191203
That does seem to gel with what I've seen.
>>102191202
In a bit, after I finish gooning to Deus Ex gym babes.
>>
File: ifx312.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>
>>102191269
original collage baker was best frfr
>>
>>102191398
sad but true
>>
>>102191398
>>102191423
How are you guys generating these collages?
>>
>>102191539
https://www.befunky.com/
>>
>>102191294
>/ldg/ - Local Diffusion General
take your imagefx cloudshit and fuck off
>>
File: ifx313.png (872 KB, 1024x1024)
872 KB
872 KB PNG
>>102191549
no lady ;)
>>
File: ifx334.png (1.33 MB, 1024x1024)
1.33 MB
1.33 MB PNG
1 more? or 29??
>>
>>102191566
https://www.youtube.com/watch?v=ebnYbhU9ukA
>>
File: 00245.png (2 MB, 832x1152)
2 MB
2 MB PNG
>>
File: ifx331.png (1.83 MB, 1024x1024)
1.83 MB
1.83 MB PNG
>>
File: 00010-4050325329.png (1.21 MB, 1024x1024)
1.21 MB
1.21 MB PNG
to slavery for the landed white men
>>
File: 00013-1410593377.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>
File: monster16.jpg (1.44 MB, 1792x2304)
1.44 MB
1.44 MB JPG
>>102191860
Sweet
>>
>>102192716
The color scheme here is so tasty it made me forget for a second that Monster already has a black cherry flavor



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.