/g/ - Technology


Thread archived.
You cannot reply anymore.




File: the longest dick general.jpg (2.5 MB, 3264x2013)
Discussion of free and open source text-to-image models

Previous /ldg/ bred : >>102893094

Difficult, Slow, Clunky, and with an Aesthetic Trapped Somewhere Inside the Early 2000s Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://tensor.art/models
https://liblib.art
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
https://huggingface.co/jimmycarter/LibreFLUX
All right, who's gonna test this?
>>
BFL ruined local image generation
>>
nutella on my tongue edition
>>
let's get things started with a titty elf
>>
>>
>>102909045
Mind if I get a few more of this?
>>
>>102909100
just started genning a few minutes ago don't have much to post yet
>>
>>102909110
They are great, a few more blonde or white haired elves would be nice!
>>
File: ComfyUI_02203_.png (1.27 MB, 1024x1024)
>>
>>102909156
I'll post em as I get em
>>
>>102909218
I appreciate them very much anon, what flux model are you using for these? What prompt?
>>
>>102909234
dev nf4

Each prompt was slightly different, I was adjusting it as I went. But at one point it was:
>The Sorcerer And The Elf \(1971\), Kubrick's high fantasy masterpiece. pictured: Sara Hussein as Nimreth the forlorn, a beautiful high elf girl bereft of her love, wandering the far hills in darkness as the tears fall on her huge breasts. Her long pointed ears and circlet of twigs and her very long hair pulled back are the outward signs of her elven heritage. The actress had some big'uns, holy cow. shameless display of cleavage. her circlet is inlaid with a magnificent citrine piece.
>>
>>102908995
>I started training for over a month on a 5x 3090s and about 500,000 images.
>For de-distillation of schnell I think you probably need a lot more than 1500 H100-equivalent hours. I am very tired of training FLUX and am looking forward to a better model with less parameters.
that's weird, because dedistill dev is perfectly undistilled and it needed fewer pictures to do so
https://huggingface.co/nyanko7/flux-dev-de-distill
>Train: 150K Unsplash images, 1024px square, 6k steps with global batch size 32, frozen teacher model, approx 12 hours due to limited compute.
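For scale, the figures in that model card quote can be multiplied out. This is only arithmetic on the numbers quoted above (150K images, 6k steps, global batch size 32), not a measurement of either training run:

```python
# Figures copied from the quoted model card line; nothing here is measured.
dataset_images = 150_000
steps = 6_000
global_batch_size = 32

# Each optimizer step consumes one global batch of images.
samples_seen = steps * global_batch_size

# How many times the run walks over the full dataset.
epochs = samples_seen / dataset_images

print(f"samples seen: {samples_seen:,}")
print(f"passes over the dataset: {epochs:.2f}")
```

So the dev de-distill run saw each image a little more than once, which is the contrast the reply is pointing at next to "over a month" on 500,000 images.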
>>
File: blurred_social (16).png (552 KB, 934x1635)
>>102909282
also it's img2img on blurry starting images that look like picrel, 0.92 denoise, and guidance is 1.5
>>
>>102909299
I dont get it either
>>
>>
>>102909358
kek, there's a Gigachad Lora now,
>>
>>102909299
>dedistill
>no PuLID support
dam...
>>
File: ComfyUI_06913_.png (1.43 MB, 1024x1024)
>>102909421
what? PuLID works fine on dedistill
>>
>>102909438
 Boolean value of Tensor with more than one value is ambiguous 

updated a couple days ago
>>
>>102909450
I got that error as well at the beginning of PuLID, but it got fixed 3 weeks ago, are you sure you're using the right repository?
https://github.com/balazik/ComfyUI-PuLID-Flux/issues/7#issuecomment-2389574681
>>
>>102909438
what does PuLID do?
>>
File: ComfyUI_06680_.png (1.43 MB, 1024x1024)
>>102909469
you put as an input an image of someone, and with PuLID you can make any picture you want with that person
https://github.com/balazik/ComfyUI-PuLID-Flux
https://www.youtube.com/watch?v=GM-e46xdcUo
>>
File: ad.jpg (30 KB, 441x490)
>>102908985
Anyone with experience admotion director in comfy? they quietly updated their nodes and now nothing works properly. I have to use "queue trigger" and this just creates more queues and fires off other workflows
>>
File: 1715709231819723.png (1014 KB, 832x1152)
>>
File: gobbo.png (480 KB, 520x760)
>>
>>102909157
You got any more officer mayli pics? Or any mayli pics you can catbox?
>>
File: file.jpg (703 KB, 4096x3072)
On the left is Flux on the right is real, you really notice how far from reality AI is when you compare the both of them like that
>>
>>102909934
did you describe the lighting? it looks really flat
>>
File: file.png (391 KB, 640x480)
>>102910077
the lighting is flat, the skin is too smooth, the setting is too clean, it's fitting for McDonalds, they love to advertise their burger in a flux way and then the real burgers look nothing like it
>>
File: 00124-1114288180.png (2.34 MB, 1248x1824)
>>
File: file.jpg (1.23 MB, 3909x1824)
https://huggingface.co/alimama-creative/FLUX.1-Turbo-Alpha
Has anyone tried this?
>>
File: look who I just met.png (1.77 MB, 1600x1024)
>>102907353
I did, it's very easy and feels awesome.
>>
>>102909358
Literally me
>>
>>102910129
there's a reddit thread here and people are wondering how it compares to schnell:
https://www.reddit.com/r/StableDiffusion/comments/1g26pim/fast_flux1_dev_in_8_steps_turbo_alpha_first/

>Compared this to a flux schnell distillation lora that works in 8 steps that I already have, and not only is it 2 seconds faster than this, the quality is also better. Not sure this is an improvement.
i wonder what that "flux schnell distillation lora" he has is
>>
File: 202410_19-43.jpg (162 KB, 1504x1064)
>>102910129
>>102910193
How big is it? Ive been stuck on nf4 since I'm on an 8gb card
>>
>>102910213
>How big is it?
it's a 700 MB LoRA
>>
>>102910330
>I have more Mayli pics than Sperm in my Balls
can't blame you anon, she's such a beautiful woman
>>
>>102910148
You covering with laughing man is kudos anon
>>
File: Haru.png (1.45 MB, 1157x1157)
What's vibe transfer under the hood?

I want to get the local equivalent if possible.
>>
>>102910746
>What's vibe transfer under the hood?
the what?
>>
File: file.png (22 KB, 392x209)
>>102910753
NovelAI has a feature where you can insert a multitude of images and it'll instantly mimic its properties; style, coloring, composition sometimes, subject info, and it alters your generations instantly.
>>
>>102910773
You really like the word instantly.
>>
File: file.png (948 KB, 1247x509)
>>102910786
I just instantly woke up, instantly, sorry bro.

but yeah here's a sample image, maybe a visual helps.
>>
>>102910886
https://github.com/logtd/ComfyUI-Fluxtapoz
there's this but I don't find it very good
>>
File: 1709958794556452.jpg (1.61 MB, 1857x3261)
>>102910746
>I want to get the local equivalent if possible.
There are many style transfer techniques.

https://style-aligned-gen.github.io/
https://instantstyle.github.io/
https://instantstyle-plus.github.io/
https://csgo-gen.github.io/
https://rb-modulation.github.io/
https://rf-inversion.github.io/

https://github.com/brianfitzgerald/style_aligned_comfy
https://github.com/cubiq/ComfyUI_IPAdapter_plus
https://github.com/smthemex/ComfyUI_CSGO_Wrapper
https://github.com/logtd/ComfyUI-Fluxtapoz

Pic related was done using style aligned comfy back in the day. Left side was the original pic.
I think I only have tried IPAdapter besides that one, but I don't remember what I did with it.
>>
File: ComfyUI_02466_.png (1.65 MB, 1024x1024)
>>
File: ComfyUI_02471_.png (1.04 MB, 1024x1024)
>>
File: 00209-1341567353.png (1.11 MB, 896x1152)
>>
File: 00216-4206173043.png (1.62 MB, 896x1152)
>>
>>102910977
She loves the poultry
>>
>https://blog.comfy.org/comfyui-v1-release/
>The electron app is a simple wrapper around the existing ComfyUI web application
Pretty damn smart actually. Too bad it's only for closed beta
>>
>>102911671
>another wrapper
>no improvements
>>
>>102911684
>another wrapper
Zero good ones so far
>>
Is there a new Cog model coming? I could swear i saw an announcement that the current model was going to be replaced but i can't find it anywhere now.
It's possible i've confused myself with another video model?
>>
>>102911740
you must be thinking of that pyramid model, they used an sd3 base and said they were planning on making a new base from scratch
>>
>>102911772
Thanks, that was indeed it.
>>
>>102911979
>We are training Pyramid Flow from scratch to fix human structure issues related to the currently adopted SD3 initialization and hope to release it in the next few days.

Update README.md
feifeiobama pushed 1 commit to main • dddc38e…d149be3 • <<<8 days ago>>>

It's never coming out anon. It's over.
>>
File: 00017_.jpg (378 KB, 1664x2432)
>>102911671
>electron app
>Pretty damn smart actually
No. It is a step down. It is normally a step down. Electron is a hack and I don't even know where to start on how the security/telemetry is going to work. With mcmokeypoop's affinity for languages it is going to be filled with C# trash and it is going to get even more buggy.

>>102911772
that was a while back I thought. Two more weeks should be up by now.
>>
>>102911706
Shit is shit no matter how you wrap it
>>
>>102911671
I wonder if cumfart is aware of the security implications of electron apps + loading and running arbitrary garbage code from the web
>>
>>102911051
nice
>>
>>102910330
TJD
>>
>>102909901
I have not been around for a while. So currently going back to all the threads and collecting the Mayli pics I missed.

They are so good.


>>102909110
>>102909045
those first two elves look very good. Like a screenshot.

What did you use to make them?
>>
File: file.png (515 KB, 512x512)
>>
>>102909218
>>102909110
>>102909282
post some petite 18 year old elves pleasw
>>
>>102912190
>BFL gave us a turd but at least we get to polish it
>>
>>102912104
>that was a while back I thought. Two more weeks should be up by now.
since they are training their own base from scratch i expect it to take half a year minimum, unless the train has been ongoing for a while before the pyramid release
>>
>>102911671
starting to get greedy I see, whats the use of this? comfyorg members want money of course
>>
Damn ComfyUI is going pro, the UI looks pretty good, I can't believe we finally have an open-source AI tool that isn't some clunky 2000s-tier slop or worse, something built with gradio.
>>
File: 0.jpg (169 KB, 1024x1024)
>>
File: 0.jpg (130 KB, 1024x512)
>>
So these jew niggas are actually not gonna release dev 1.1?
lmao
>>
I wonder if the Pixart architecture can be changed to just use Linear Attention and drop the positional embeddings, so you'd get an easier to train model.
>>
>>102913839
you really thought they were? retard
>>
>>102913839
They've gone completely radio silent. I don't know how in the modern era devs haven't learned the basics of at least posting some dev logs once a month or something.
>>
Why does /sdg/ still exist? It's not like both generals are fast enough to warrant the split, and it's not like proprietary model users are really much interested in posting their pictures on /g/. Stability is effectively dead, and whoever still wants to post sd pics, posts here.
Debo posts here whenever he likes so "containment" doesn't really work. Why is the split still there?
>>
ok anon
>>
>>102913992
/sdg/ is just a circle jerk image dump. These threads actually have discussion
>>
I'm back to /g/ and I see Flux DeDistilled Quants.
What do these versions do? Are they of lower quality than my dev-fp8 flux model?
>>
>>102914092
dedistilled is generally lower quality right now because the weights are quasi broken
>>
0/10
>>
>>102914128
yet somehow anon still falls for it every time
>>
>>102912329
Telemetry thing is already bad enough.
Cumfart is a toy tier developer high on his own farts.
>>
>>102913853
Time to train the 1B Pixart 16 channel model with Linear Attention and no positional embeddings while waiting for Sana.
>>
>>102912727
>What did you use to make them?
Flux dev nf4, more details here >>102909282 >>102909306

>>102912811
Was asleep, sorry. If I made them any more young and petite they wouldn't be titty elves anymore.
>>
>>102909606
very nice
>>
File: ComfyUI_03757_.png (1001 KB, 1024x1024)
>>
File: ComfyUI_03779_.png (1.24 MB, 1024x1024)
>>
>>
>>
>>
File: ComfyUI_03806_.png (1.3 MB, 1024x1024)
>>
>>
just one more after this, don't want to spam
>>
this one specially for >>102909156
>>
File: ComfyUI_03821_.png (1.61 MB, 1024x1024)
Elf thread , then
>>
need big tit elf gf badly
>>
File: ComfyUI_03827_.png (1.44 MB, 1024x1024)
>>
>>102911057
nice
>>
File: ComfyUI_03831_.png (1.48 MB, 1024x1024)
>>
File: file.jpg (223 KB, 1024x1024)
>>
I lied, one more

>>102915869
Who would ever refuse an elf thread? It's tradition.

>>102915979
I don't think it's possible to have an elf gf. An elf lover, ok, but there's something vulgar and human and familiar about being a gf
>>
File: ComfyUI_03840_.png (1.49 MB, 1024x1024)
>>
>>102914170
He used to say the gradio telemetry was untrustworthy. Clearly easy dev is more important than the user. Or maybe it's the excuse for a paid version. I have no idea how third party nodes are going to be integrated in electron. Unless the ecosystem is going to be locked up.
>>
>>
File: ComfyUI_03852_.png (1.57 MB, 1024x1024)
>>
>>102914170
is there a way to turn off this telemetry?
>>
File: ComfyUI_03871_.png (1.43 MB, 1024x1024)
>>
File: ComfyUI_03875_.png (1.11 MB, 1024x1024)
>>
ITS UP

https://github.com/NVlabs/Sana

https://huggingface.co/collections/mit-han-lab/dc-ae-670085b9400ad7197bb1009b

https://ea13ab4f5bd9c74f93.gradio.live/
>>
>>102916839
>20 times smaller and 100+ times faster than flux
>>
>>102916839
Damn!
>>
File: file.png (1.59 MB, 1024x1024)
>>102916839
>>
>>102916839
lewds?
>>
File: image.jpg (207 KB, 1024x1024)
Same prompt as the previous flux image...
>>
File: image.jpg (406 KB, 1024x1024)
>>102916839
>can't even do text
>>
>>102916839
It's fucking garbage
>>
>>102917146
>can't even train the model
I hope you like what the distilled model can do lmao
>>
File: file.png (284 KB, 1024x1024)
>>102917108
i think the demo censors anything lewd, at least this is what i get with "1girl, anime, large breasts"
>>
>>102916839
>a woman holding a sign that says /ldg/
>bad text
>bad hands
It's over, no buttchin tho
>>
>>102917146
this is the cutest anime girl we've had in a long time and it's your throwaway gen to prove this new model can't do text?
>>
File: 1705210383742909.jpg (98 KB, 1024x1024)
>>102917174
Forgot the image like a retard
>>
File: file.png (1.11 MB, 1024x1024)
at least it can do anime
>>
>>102917172
This but with "blackface" or "nigger"
>>
what are we waiting for now
>>
What's the latest in img2img face swapping? I'm kind of stuck on using a1111, but if there's a standalone local app for it I'm down to try.
>>
File: file.png (1.77 MB, 1317x1324)
The compression kills it a little but I think the trainability is what will be the saving grace. I'm going to take a stab and say they didn't train the model the best they could have and 1.6B is limited in knowledge so you'll need to split the model down photorealism and anime.
>>
File: file.png (936 KB, 1024x1024)
>>102917146
>>102917174
picrel
>>102917216
comfyui support
>>
>>102917126
>>102917146
>>102917155
>>102917210
It's not great but there is a lot of seemingly solid work here. The paper is essentially a basket of optimization tricks, except for replacing t5 with gemma as the text encoder (although I think that also ends up being cheaper, that is not necessarily the point). As long as these optimizations don't always make the output worse then it should be good in the long run. That is, this paper seems to contribute a lot to the diffusion literature, even if it isn't as good as sota in fidelity. Particularly if the new autoencoder is actually good, that will be a huge boon. It's hard to tell, given that there are so many new things being done in sana.
>>102917216
sana 2
>>
>>102917234
There's no local model yet, it's just the demo.
>>
>>102917172
reminds me of the chat filter in an online game I played ages ago that replaced any bad words with LOVE
>>
>>102917197
>>102917210
>>102917226
for a base model this is amazing.
>>
we took another L today...
>>
>>102917238
Doesn't https://huggingface.co/collections/mit-han-lab/dc-ae-670085b9400ad7197bb1009b contain the weights? Or do we need some nontrivial inference code as well?
>>
>>102917263
That's the VAE. So that would be like seeing sdxl-vae.safetensors
>>
File: 1703055319390617.jpg (671 KB, 1024x1024)
The fabled woman lying in grass test
>>
>>102917241
uhhh it looks flux schnell to me
>>
File: 1699437224056864.jpg (477 KB, 1024x1024)
>>102917275
It's fucking garbage
>>
>>102917275
it failed on the fucking grass but not as badly on the woman, this model is so cooked
>>
>>102917282
Well that's good news because the weights are 10x smaller.
>>
>>102917275
big penis
>>
>>102917291
"it looks like flux schnell" is my shorthand way of saying something looks like tiny model garbage
>>
Show me booba
>>
File: 1700388272519982.jpg (559 KB, 1024x1024)
>>102917289
But then I get a not so bad one anatomy wise. The grass still looks terrible though
>>
File: file.png (1.22 MB, 768x768)
>>102917305
Then you're a fucking retard because that's a proper way to say anything.
>>
>>102917305
I am sure he understood what you meant. The "this new model is shit" stuff has far exceeded every release so far.
>>
>>102917235
maybe the encoder would be better, but it needs less compression
like instead of trying to match sdxl level with 2x more compression, why not try to match flux vae level with 2x less compression compared to sdxl instead of 4x less (so something like f8c8)
>>
>>102917312
you already know it can't. It's 2024, sexualizing a woman is illegal
>>
>>102917326
Keep waiting for a Flux finetune, any day now.
>>
File: file.png (584 KB, 1024x1024)
>>102917324
nice
>>
File: file.png (2.34 MB, 1248x1248)
This model will likely need to be downscaled so the AE doesn't gobble up the details.
>>
>>102917372
Me in the back
>>
Anons said SDXL was shit. I ignored them and got to work demonstrating its power.

Anons said WDXL was shit. I ignored them and got to work demonstrating its power.

Anons said FLUX was shit. I ignored them and got to work demonstrating its power.

Anons are saying Sana is shit. I am going to see for myself...

(I have deliberately left out the many other times anons were right)
>>
File: test.jpg (278 KB, 1024x1024)
it's sure fast and understands natural prompts, but we do need the offline download
>>
File: new controlnet.jpg (1.45 MB, 2120x2488)
Well, since nobody likes sana I also found this in the comfy issue tracker. Could be the new controlnet.

pic related is from some guy who tested it https://github.com/comfyanonymous/ComfyUI/issues/5314

project page
https://github.com/xyfJASON/ctrlora
>>
File: 1719149808201273.jpg (165 KB, 1024x1024)
>>
File: file.png (1.52 MB, 1024x1024)
>>102917155
>>102917174
>>102917282
>>102917289
>>102917290
>>102917305
>>102917398
meds now
>>
File: file.png (2.07 MB, 1248x1248)
>>102917398
Flux is shit because you can't train it. You're not going to see any real nsfw checkpoints on it for a long, long time because the minimum hardware requirement is well beyond consumer hardware. SDXL was slow to train because we didn't have proper training tools and didn't have all the optimizations we have today.
>>
>>102917420
>>102917431
What's with the garbled eyes, even in these closeups
>>
>>102917447
The compression is too much. I don't think the model is going to be usable without downsampling, my guess is you'll do 4K and downscale to 2K or 1K.
>>
>>102917447
the vae causes that in sdxl, but in sana its going to be even worse because the compression is higher
>>
told you it would be shit. micro models are useless fucking garbage. pay no attention to any model below 3b. baffling how 2022 Midjourney still absolutely mogs local models. local bakers are incompetent cowards afraid to train on real data.
>>
File: file.png (2.16 MB, 1248x1248)
>>102917457
I will also say that it's not done baking yet, the model on the demo is 1024px, things fall apart if you try to go much higher. I have a feeling they're still training the 1024 model and then you still have to do the 2K and 4K one.

>>102917488
You're still not going to have a Flux fine tune. Have fun with your toy box.
>>
>>102917488
>afraid to train on real data.
>"b-but it converges quicker if I train on slop!!"
>>
The model is uncensored to some degree, just got some tits.
>>
File: 1717937552561764.jpg (2.2 MB, 4096x4096)
>>102917495
Do NOT try to max out the resolution. Got two things that kind of looked like nipples in this nightmare flesh heap though, censored to protect janny's eyes of course
>>
a sad day for local... is this really "bigma"? is there anything left to wait for besides a quick and painless death?
>>
>>102917538
Make an 8x compression model with their architecture 20x faster. Also we haven't seen the 2K / 4K results which will benefit a lot more from the compression and not show the flaws as much or at all.
>>
File: file.png (220 KB, 1024x1024)
>>102917415
>>
File: file.png (596 KB, 1248x1248)
>>
File: file.png (1.8 MB, 1248x1248)
>>
>>102917556
2k 4k is completely retarded, even if they have the training data for it, you would have to restrict your dataset EXTREMELY if you wanted to keep it able to produce high res results like that after your own shitty lora or finetune
also it defeats the purpose of the model and its compression since itll make the latents just as big or bigger than sdxls
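The latent-size point can be checked with a few lines. A rough sketch only: the downsample factors and channel counts below are assumptions, not taken from this thread (8x spatial / 4 channels for the SDXL VAE, 32x spatial / 32 channels for Sana's DC-AE), and `latent_shape` is a made-up helper, not part of any library:

```python
def latent_shape(image_px: int, downsample: int, channels: int) -> tuple:
    """Shape (channels, height, width) of the latent for a square image."""
    side = image_px // downsample
    return (channels, side, side)


def num_elements(shape: tuple) -> int:
    """Total number of values stored in a latent of the given shape."""
    c, h, w = shape
    return c * h * w


# (image size in px, spatial downsample factor, latent channels) -- assumed values
configs = {
    "SDXL VAE @ 1024px (f8, c4)": (1024, 8, 4),
    "Sana DC-AE @ 1024px (f32, c32)": (1024, 32, 32),
    "Sana DC-AE @ 4096px (f32, c32)": (4096, 32, 32),
}

for name, (px, factor, ch) in configs.items():
    shape = latent_shape(px, factor, ch)
    print(f"{name}: latent {shape}, {num_elements(shape):,} values")
```

Under those assumptions a 4096px Sana latent has the same 128x128 spatial grid as a 1024px SDXL latent, with more channels on top, which is what "just as big or bigger" is getting at.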
>>
File: file.png (1.51 MB, 1248x1248)
>>102917602
I'm not saying I would've made some of their decisions. Given eyes are so small you can't have that much compression and frankly 1K is good enough for 99% of all use cases.
>>
>>102917616
We'll know we're fucked when people start training SD3
>>
File: file.png (893 KB, 1024x1024)
>>102917588
how2soul
>>
File: file.png (2.29 MB, 1024x1024)
>>
File: file.png (1.19 MB, 1024x1024)
its fun
>>
>>102911671
what's the point of a wrapper?
>>
>>102917753
>wahhh python dependencies save me
>>
>>102917768
No I'm serious I'm a bit of a noob when it comes to that stuff, what's a wrapper anyway? Will it make it better or something?
>>
>>102917781
Meaning it runs inside of a shell program instead of in your browser.
>>
>>102917781
it'll steal your prompts
>>
File: file.png (1.06 MB, 1280x1024)
>>
File: file.png (1.58 MB, 1024x1024)
>>102916839
ehh I expected worse, it doesn't know how to do dreadlocks though
>>
>>102917431
the only finetunes I ever cared about were the ones that were almost base models in their own way—pony, wdxl. The millions of shovelware finetunes never had my attention at all, and LoRAs seldom interest me. For those who like those sorts of things, boo hoo, but when did they ever make a good gen that way?
>>
>>102917165
>I hope you like what the distilled model can do lmao
we can try flux now, they've undistilled them
>>
>>102917876
You will never see a Pony model on Flux because it's $50k+ in compute and most if not all cloud providers prohibit it, so you need someone who *locally* owns H100s
>>
File: file.png (1.61 MB, 1280x1024)
>>
>>102917289
wtf is this shit, how the fuck is it unable to do grass? it's a concept you see in almost every picture
>>
File: file.png (105 KB, 1346x1279)
>>102917360
>Keep waiting for a Flux finetune, any day now.
it'll be soon, unironically
https://huggingface.co/SG161222/Verus_Vision_1.0b
>>
File: file.png (1.02 MB, 1248x1248)
>>102917913
>image model with no examples
Grim
>>
>>102917398
>Anons said FLUX was shit.
we never said that, the first day it came out it was hyped like no other
>>
>>102917431
>Flux is shit because you can't train it. You're not going to see any real nsfw checkpoints on it for a long, long time because the minimum hardware requirement is well beyond consumer hardware.
we'll literally get something serious in a few days lool >>102917913

>>102917927
that's because he hasn't finished his model, he'll do that once the weights are released
>>
>>102917781
pytorch (and other things) run C/C++ code to make images. Python is used to call the C/C++ code. Python is used to call the Python calling the C/C++ code.

The (flawed) argument is that using Electron to call the python code to call the C/C++ code will be faster.
>>
File: file.png (1.32 MB, 1248x1248)
>>102917942
What you linked is a bunch of empty promises. I seem to remember a model called AuraFlow that was that cocky too. I wonder where that went? Seems to me you just posted a grifter model, I hope you weren't stupid enough to donate.
>>
File: file.png (2.08 MB, 1280x1024)
>>102917905
turn down PAG guidance and it gets a little better
once we have it locally im sure we'll figure out the correct settings
>>
>>102916839
that's the first time they released a demo without the weights, maybe they want us to try it out a bit and see where it could be improved before giving the final model?
>>
File: file.png (1018 KB, 1248x1248)
>>102917955
It's definitely not done training.
>>
>>102917953
>What you linked is a bunch of empty promises.
the dude who's finetuning Flux rn had made Realistic Vision, he's far from a noname
https://civitai.com/models/4201/realistic-vision-v60-b1
>>
>>102917980
>slop Reddit model
>cursed gallery
STOP THE PRESSES
THAT MAN IS TRAINING A FLUX MODEL
>>
>>102917947
>The (flawed) argument is that using Electron to call the python code to call the C/C++ code will be faster.
desu I don't really see the point, his Ui is fast enough, I never thought to myself "damn when I clicked on that button it took some time to be executed"
>>
>>102917980
>The man who introduced butt chins to sd
Oh good
>>
File: file.png (23 KB, 1029x230)
>>102917994
>Oh no, that random doomer anon said it's shit, pack it up guys, it's over!
>>
>>102917954
>PAG guidance
they're using PAG? Interesting, I guess they can go for fancy CFG shit because their model is hella fast or something
>>
File: file.png (1.48 MB, 1280x1024)
>>102917602
i put your post into sana and this is what it gave me
>>
File: file.png (1.45 MB, 1248x1248)
>>102918009
No one said Reddit wasn't popular you retard. Maybe this place isn't to your speed. Realistic Vision is as sloppy as Dreamshaper, I really hope you don't have high hopes and it was funny too because it never got good as NSFW. One of the frankly worst SDXL models and is only consider okay because sometimes it produces images that aren't completely made of wax.
>>
>>102918009
The entire economy of Civitai is farming buzz through giving/receiving updoots and reactions. Their ratings cannot be trusted
>>
>>102918032
>No one said Reddit wasn't popular you retard.
civitai isn't reddit you low IQ anon
>>
>>102918030
good i hope they read it and make a model that isnt complete dogshit and doesnt rely on stupid memes
>>
>>102918083
>make a model that isnt complete dogshit
it's a 1.6b model, I don't know what we're expecting there, too small to be good
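For a sense of what 1.6b means on disk, here is a back-of-the-envelope estimate of weight size by precision. It assumes 12B parameters for Flux dev (not stated in this thread), treats nf4 as roughly half a byte per parameter with quantization metadata ignored, and leaves text encoders and the VAE out. `weight_size_gb` is an illustrative helper, not a real API:

```python
# Approximate storage cost per parameter for each format (assumed values).
BYTES_PER_PARAM = {
    "fp16/bf16": 2.0,
    "fp8": 1.0,
    "nf4 (approx.)": 0.5,
}


def weight_size_gb(params_billion: float, bytes_per_param: float) -> float:
    """Weight size in decimal GB: (params_billion * 1e9 * bytes) / 1e9."""
    return params_billion * bytes_per_param


models = {
    "Sana (1.6B, from the thread)": 1.6,
    "Flux dev (12B, assumed)": 12.0,
}

for model, params in models.items():
    for fmt, nbytes in BYTES_PER_PARAM.items():
        print(f"{model} @ {fmt}: ~{weight_size_gb(params, nbytes):.1f} GB")
```

On these assumptions the 16-bit weights differ by about 7.5x, and nf4 Flux dev comes to about 6 GB, which fits the earlier post about being stuck on nf4 with an 8gb card.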
>>
i love sana
>>
File: file.png (1.08 MB, 1248x1248)
>>102918058
Yes it is, in fact it's even worse because it's like 80% Pajeet spam. I bet you have brown skin too.
>>
>>102918094
yeah im just very disappointed and laughing at the resident pixart shiller
>>
>>102918130
I know you can't stop talking about Midjourney
>>
>>102918121
do the needful and giv buzz to trane new emma watson lora saaar
>>
>>102918130
there's at least three of us
>>
File: file.png (316 KB, 425x431)
>>102918121
nigga, your gens are ass, you have no right to talk about quality
>>
File: file.png (1.83 MB, 1248x1248)
>>102918149
lmao now the seethe is full
>>
File: file.png (1021 KB, 1280x1024)
that horizon line hurts kek
>>
>>102918146
>there's at least three of us
yeah, three people are actually thrilled about that sana demo right now, probably sana devs kek
>>
>>102918146
yes i was one as well until they showed what its going to be like
>>
If sana gets as much love as sigma did (in terms of tunes) I will be happy
>>
File: file.png (319 KB, 404x441)
>>102918157
>>
>>102918163
Can't wait to hear more about how much you like Midjourney. I don't even know why you post here when you'd fit right into their Discord.

>>102918177
Yeah I know you're a seething Jeet, you'll be spamming all night now.
>>
>>102918186
>Can't wait to hear more about how much you like Midjourney.
why are you talking about MJ schizo? No one mentioned that name but you
>>
>flux shills stopped posting images
curious
>>
>>102918139
thats ironic considering sana and all the pixarts are clearly trained on midjourney
>>
>>102918196
I see plenty
>>
>>102918196
>sana shills stopped posting demo images
curious
>>
Chang save us from western menace
>>
File: file.png (1.38 MB, 1248x1248)
>>102918211
sdg is the other place
keep holding breath anon, I'm sure Realistic Vision Flux edition won't look like SDXL images llmmmaooo
>>
>>102917997
some people on the comfy team don't like lighthouse or whatever the framework is called. It has some limitations that cause issues with large variables and transferring. Any claims to performance are bullshit.

I don't think they should be moving, but if they are, they're probably moving just to move. The choices are kinda grim. Easy choice is to actually do some web dev, but time and skill will be an issue. Outside electron, there's wxWidgets (or Qt, should they break their full MIT licensing). There was an anon doing a re-write in react, but I haven't seen anything about that in a while.
>>
>>102918224
>keep holding breath anon
why should I keep holding breath for a 1.6b model?
>>
>>102918236
that's assuming you were ever here in good faith
>>
>>102918235
>Easy choice is to actually do some web dev, but time and skill will be an issue.
yeah but that's the best long term choice, at least you have full control of what you're doing and you have zero limitations if you put some time in it
>>
stop using the demo its really slow for me now
>>
File: file.png (1.65 MB, 1024x1024)
>>102916839
I got too spoiled with Flux dev, can't accept three legs and unreadable text anymore
>>
>>102918295
im done with it, the slop is all yours now
>>
>>102918297
defend this mr proper grammar pixart shiller
>>
Pretending that anyone expected Sana to "be as good as Flux" is retarded.
Do we really need eleventy billion params tho?
>>
>>102918310
No one is going to fine tune Flux and your SDXL version of Flux isn't going to save you.
>>
>>102918323
about 3.50b would prob be the optimal
>>
>>102918306
ty anon sir
>>
>>102918328
thank you appreciate it
really showed me a whole new perspective about sana
>>
>>102918363
You obviously can't be reasoned with because you're incapable of understanding how to compare apples and oranges. You're like a faggot that's like "I only play AAA games I don't know why anyone plays those garbage indie games, they look like shit". Just shows you're a maximum 80 IQ Pajeet slopper. Probably why you evoked an appeal of authority, I know how much you brown skins love hierachies.
>>
>>102918378
>(schizo:1.5)
>>
>>102918378
oh i see, very reasonable points, please keep going, you are blowing my mind here
>>
>>102918399
>>102918422
samefag
>>
>>102918428
wow, that's crazy
>>
>>102918378
the fuck is wrong with you? did you invest in the sana company or something? take deep breaths and touch some grass Jesus...
>>
>>102918323
>Pretending that anyone expected Sana to "be as good as Flux" is retarded.
why did they go for something this small though? a 4 to 6b model would've been the sweet spot and it would be good enough to replace Flux
>>
>>102918428
you caught me... im sorry, but now i have to live in your walls
>>
>>102918328
>b-but Flux!
Your whataboutism isn't gonna work, Sana looks like shit and nothing is gonna change that
>>
>>102918323
there have been zero tech improvements that contribute to quality besides adding more parameters. dataset, params, and training time are about all that matter. 4-5b would be a good upgrade. 1.6 is too small, remember "2b is all you need?", yeah that shit can fuck off
>>
>>102918500
>Sana looks like shit
Why are those sana researchers so retarded? They were literally here to witness the hype of Flux and the only conclusion they went to it is: "Oh I get it! People like small models that produce images that look like shit, that's it we cracked the code!"
>>
Hey catpiss anon, stay in your own thread.
>>
listen anon im just happy someone is releasing a new model chill out
one day we will get perfect 1girl big booba 16ch VAE non sloppa fast model just hang in there
>>
>>102918513
>4-5b would be a good upgrade.
this, would be big enough to rival flux, and small and fast enough to be run by everyone
>>
**** bored again
>>
>>102918513
i wonder how it's so hard for these people to see this? a 4-6b model would be such a huge improvement, both for making better loras and for making more complex images, yet we only go backwards.
>>
when's bfl suppose to release t2v
>>
>>102918653
>when's bfl suppose to release t2v
never, they didn't make a dev/schnell 1.1 when they went for Flux pro, they don't need us anymore anon, they got the twitter deal, let it go it's over
>>
>>102918686
what is there left to wait for
>>
flux 2 never ever
sana 2 never ever
flux finetune never ever
local models *especially* never ever ever
You will enjoy 1girlslop
>>
>>102918698
>what is there left to wait for
for the moment? nothing, but that was also the case during the SD3M fiasco, we had nothing and then flux appeared out of thin air
>>
helpless fags waiting for someone to save them
that's why local is doomed
>>
>>102918753
>helpless fags waiting for someone to save them
we're waiting for your multimillion dollar model anon :(
>>
>>102918763
why would I release a model for you faggots
I don't like 80% of you
>>
>>102918772
as if you'll be able to do it in the first place, you won't do shit nigger
>>
what would cefurkan say in these trying times
>>
>>102918793
please stop this thread already sucks enough
>>
KYS NIGGERS KYS NIGGERS KYS NIGGERS KYS NIGGERS KYS NIGGERS
>>
>/sdg/ is cozy
>/ldg/ is hostile
I know where I'm going
>>
>>102918825
>>102918834
>>102918799
>>102918834
oh brother, this thread stinks, is this the sana effect?
>>
BIG
MA
>>
DE
AD
>>
FAGGOTS FAGGOTS FAGGOTS FAGGOTS FAGGOTS FAGGOTS FAGGOTS FAGGOTS FAGGOTS FAGGOTS FAGGOTS FAGGOTS
>>
>>102918868
I'm not a big fan of jannies, but without them, 4chan would look like this schizo spam h24
>>
its okay anon take a breath
>>
>>102918297
NIGGER MIKU, LOOKS LIKE SHIT

/LDG/ QUALITY QUITE GOOD FOR A BUNCH OF NIGGER FAGGOTS
>>
How do you pronounce "Sana" btw?
>>
File: file.png (1.15 MB, 1024x1044)
>>102918868
>>102918913
are you sure you don't need help anon?
>>
>>102918698
salvation from this mortal coil
>>
File: 00020_.jpg (815 KB, 2496x3648)
>>102918699
if you insist

>>102918834
bye

>>102918919
I would assume like Santa without the "t"
>>
File: file.jpg (804 KB, 2048x2048)
>>102916839
>"Sana can do 4k"
https://www.youtube.com/watch?v=oHC1230OpOg
>>
>>102918970
hey

fuck you
>>
>>102918978
santa isn't going anywhere. Your christian values will have to take a back seat for now.
>>
>>102917431
Shit
>>102917495
>>102917531
>>102917873
>>102917927
Shit
>>102917953
>>102917977
>>102918032
Nigger aids
Imagine being proud of posting this 1girl slop
>>
>>102918978
>>102918913
>>102918868
I like to think this is a disgruntled sana employee who worked his ass off till midnight to produce this piece of shit and expected us to like it kek
>>
>>102919009
I am not sana

I'm just a random guy lol
>>
which is better for low vram(12gb) comfy or forge?
>>
>>102916839
>made Pixart Sigma
>promising model, the future is bright for them
>partnered with Nvidia
>made Sana, a complete piece of shit, this company is done
many such cases...
>>
>>102919029
really whatever you prefer
>>
it's never felt so good to be a sanasexual
look at all this seethe
>>
>>102918970
Having an opinion posting this Buttchin nose cancer slop
>>
>>102919060
>look at all this seethe
true, look at that
>>102918913
>/LDG/ QUALITY QUITE GOOD FOR A BUNCH OF NIGGER FAGGOTS
>>102918378
>You obviously can't be reasoned with because you're incapable of understanding how to compare apples and oranges. You're like a faggot that's like "I only play AAA games I don't know why anyone plays those garbage indie games, they look like shit".
>>
>>102919072
enlighten me. What was my opinion?
>>
>>102915736
Real thread
>>
>>102919087
I crave violence
>>
sana is unironically worse than SDXL
>>
Reserving judgement until I get my hands on a .sft any anyone who doesn't is gay and retarded
>>
>>102919241
what would be the difference with the demo? it's still the same fucking model
>>
>new model releases
>schizo anon is afraid ldg is going to become more popular again
>doesn't realize is just the same anons that come back whenever something relevant happens, instead of being terminal avatarfags
>>
>>102919248
>what's the difference with the demo of flux and having the local file
>>
>>102919263
yeah, that's what I'm asking you, the images won't magically look better because the same model is running on your pc, innit?
>>
File: ComfyUI_00827_.png (1.5 MB, 1280x1024)
>>
>>102919272
anon will have a chance to actually do something with it as opposed to being beholden to whatever gay shit they did with the demo
>so you're saying sana is actually not bad?
no
>the images won't magically look better
this is presuming whatever settings they have in the demo are the optimal which we all know is never the case. I'm sure you're not pretending that some extremely basic gradio interface is the same as whatever fucked up workflow anon can come up with
>>
File: file.png (101 KB, 1772x1034)
>>102919304
>this is presuming whatever settings they have in the demo are the optimal which we all know is never the case.
you can change the settings on the demo though
>>
>>102919310
>no sampler or scheduler settings
anon...
>>
File: file.png (14 KB, 675x236)
>>102919318
in their paper they made a sampler designed to be optimal only for that model, I'm not sure the other samplers will work on sana
>>
how bad is nvidia on linux?
thinking about swapping for linux since it uses less system resources
>>
>>102919330
I tried to run ComfyUI on WSL2, the experience was terrible, I had to wait 10 whole minutes to load a model, never again
>>
>>102919327
we won't know until we get our hands on the model. that's not even thinking about something completely new anon might come up with. I could be wrong! but I don't think I am. just wait until we can work with it. just like flux, XL, and 1.5
>>
>>102919355
>I could be wrong! but I don't think I am.
that's just cope, I have never seen an instance where a magical sampler fixes all the problems a base model has, if it looks like shit on one sampler, it won't suddenly look like Flux dev on another sampler
>>
>>102919337
i mean native linux
i hear nvidia drivers are terrible on linux... is it true?
>>
>>102919365
but we have seen, with essentially every model, that it gets better once anon has their hands on the file. again, I'm not saying it's actually Not Bad, just that I'm personally reserving my judgment and not taking the outputs of the demo as all it will ever be; which is perhaps too nuanced for 4chan
>>
>>102919381
I'm running on nvidia right now, no problem. I used an AMD card before, and nvidia is definitely an improvement. ROCm is a nightmare, and AMD should be forbidden from trying to maintain a repo.
>>
>>102919304
The online demo is definitely using the rewriting prompts feature of Gemma, that's the first thing to go.
>>
>>102919393
>which is perhaps too nuanced for 4chan
you have 0 nuance, you're claiming something without any explanation for it
>ohh it'll look better if I run that turd on my pc because... well because MAGIC DUH
no anon, you just sound like a retard here
>>
>>102919406
>The online demo is definitely using the rewriting prompts feature of Gemma
how can we be sure they're doing that?
>>
>>102919393
If it's just SDXL that trains 4 times faster and uses an actually good text encoder, that's all that matters.
>>
Fresh

>>102919427
>>102919427
>>102919427
>>
>>102919422
Pretty sure the hearts are from Gemma, probably something like "if it's an unsafe / nsfw prompt, write a prompt for a heart image". I don't know why they wouldn't use it since it's a big part of the paper.
>>
>>102919408
I don't believe it's worth continuing this conversation if you're going to ignore how every other model got better with time
>>
>>102919450
>every other model got better with time
like Stable Cascade, SD3M? oops...
>>
>>102919459
if you think sana is anywhere close to those you are retarded and should neck yourself
>>
>>102919474
Sana is as good as Stable Cascade... sorry bud.
>>
>>102919474
Stable Cascade produces way better pictures than this piece of turd, you're a 2 digit IQ monkey anon, sorry to tell you that
>>
>>102919492
Prove it lmao
>>
>>102919396
Have you used before with windows? Did you notice an increase in inference time?
>>
>>102919439
fair enough
>>
>>102919498
You're the one that made the quality claim first, therefore you're the one with the burden of proof, hope that helps. >>102919492
>if you think sana is anywhere close to those
>>
>>102919499
>Have you used before with windows?
No. There is a link in the op of this or from sdg that has a card ranking, and I think there might be a linux/windows comparison of similar systems. You may see a difference there, if there is one.


