[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (1.35 MB, 3264x3264)
1.35 MB
1.35 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102535241

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/kohya-ss/sd-scripts/tree/sd3

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
>>102548466
>It SEEMS some judge agrees this might infringe in artists copyright and does not fall under transformative/fair use.
It's more like they're saying it COULD be possible so let's test it. I believe the core question is: does the operation of SD necessitate infringement which I think we all know is false.
>>
Garbage collage
>>
File: file.png (779 KB, 2125x1257)
779 KB
779 KB PNG
https://arxiv.org/pdf/2409.15997
this is a very interesting paper, I guess that thoses techniques will be implemented on github to improve finetuning or something?
>>
>>102548479
Kek, 4 of those 5 gens are mine
>>
File: file.png (1.29 MB, 2506x2594)
1.29 MB
1.29 MB PNG
A new SOTA model? Probably not local though
https://www.blueberry-ai.com/
https://artificialanalysis.ai/text-to-image
>>
>>102548549
They're things local has known about for ages and were fixed in sdxl with Cosxl, it's really just further confirmation. Trainers were already implementing this shit in their code copied straight from Nai, Nai even made one Troon grifter remove it from it his github.
>>
>>102548573
If it's not local I'm never going to use it no matter how good it is, sadly
>>
>>102548479
SD3 edition?
>>
>>102548518
I think it kind of depends on whether they invent new law or not. If they don't invent new law and just base it all on existing law then we'll win.

- Saving a file on your computer? That's not illegal, so why would saving one million files on your computer be illegal?
- Running a program that analyzes patterns in files, that's not illegal, so why would running that program on one million files be illegal?
>>
>>102548598
> Trainers were already implementing this shit in their code copied straight from Nai
based
>Nai even made one Troon grifter remove it from it his github.
basedx2
NaI is a based company desu, even if it's not really local, at least they share their training improvements to everyone
>>
>>102548549
That's why I'm a bit sad that the BFL fags haven't released their paper on Flux, we have no idea if they have implemented those improvements on their pretraining
>>
>>102548817
>BFL fags haven't released their paper
Do we have any expectation that they will? Does the Turk fag know about this? Maybe he can bulky them into it.
>>
File: file.png (570 KB, 1265x1664)
570 KB
570 KB PNG
>>102548883
>Maybe he can bulky them into it.
don't worry, he's doing it on every new BFL tweet kek
https://xcancel.com/GozukaraFurkan/status/1838356901282930984#m
>>
I got me a 4070. I know it's not the biggest cut of beef on the block but what model should I be using for genning anime girls? If only Bing's model got leaked I'd know exactly what I want to be using.
>>
chinnaaaa please open source something chinaaa wyd chanurrrr plz heeeeelp chinaaaaaa!
>>
I'm about to publish a character lora on civitai, should I put sample images that look like le AI masterpiece highest quality, or should I put ones that look screenshots out of the anime
>>
>>102549116
both
>>
>>102549119
I planned to, but for the front cover?
>>
>>102549125
screenshots
>>
>>102549133
ok deal
>>
>>102549116
well, when you dont have extra tags like "anime screencap" etc, does its style look like it's closer to a booru fan art blend, or closer to the official style?
the answer to that should dictate what you should do. basically, what it's best at already without extra prompting needed.
>>
>>102549125
its more impressive for cover images to look like the source material than le AI masterpiece highest quality
>>102549147
nice, post a link here if you'd like
>>
>>102549147
of your cock
>>
File: 0.jpg (211 KB, 1024x1024)
211 KB
211 KB JPG
>>
File: aigrifter.png (1.15 MB, 1024x1024)
1.15 MB
1.15 MB PNG
>>102548915
I'm starting to like the turkish grifter
>>
File: e2.jpg (2.51 MB, 3840x4252)
2.51 MB
2.51 MB JPG
>>102549157
I trained it on screenshots-only, so without any tags it looks like screenshots (pic related)
But if you add the rating_9 "high quality tags" and switch to different models, then it looks more like whatever the model thinks looks good, so it's adaptable too. I'll go with the screenshots then

Pic is me trying to figure out how many epochs turned of training turned out the best, I think 20 is the most consistent without looking overtrained
>>102549159
Sure I'll do it later, thanks
>>
>>102549254
in a sense he is /our guy/
>>
File: e3.jpg (205 KB, 1225x1223)
205 KB
205 KB JPG
>>102549290
And here's a le AI masterpiece highest quality example for comparison to the screenshot style
It's my first lora but it's like my 10th attempt on it because the ones before were messy
>>
File: file.png (92 KB, 498x280)
92 KB
92 KB PNG
>>102549290
>>102549352
>A lora from a yuri anime
based yurifag
>>
File: file.png (144 KB, 796x1184)
144 KB
144 KB PNG
https://civitai.com/models/788550/realflux-10b
>The model does not yet support a wide range of styles and concepts, but all of that will be included in future versions.
Maybe this guy will save Flux, let's see
>>
File: IMG_0171.jpg (339 KB, 1125x1168)
339 KB
339 KB JPG
>>102548549
Flux dev’s interpretation is better
>>
>>102549579
kek
>>
File: IMG_0173.jpg (247 KB, 1125x1153)
247 KB
247 KB JPG
>>102549579
>>102549646
I can’t imagine any situation where I would want a solid color block out of an image generator
>>
File: file.png (1.29 MB, 1449x1085)
1.29 MB
1.29 MB PNG
>>102549680
it was just a convenient example to obviously show the flaws of SDXL noise scheduler, there's more concrete examples on the paper, you never want an image to always have a luminosity mean of 0, that's now how real images are, makes shit more unnatural and reinforces the AI slop look
>>
>>102548573
First time hearing about this model. None of the examples shown make me say "wow" or anything.
>>
>>102548817
They dont need these "improvements" because they use flow instead, it literally doesn't apply. It's so weird the coomer hentai general knows more about the technical side of this stuff than g does
>>
>>102550030
>They dont need these "improvements" because they use flow instead, it literally doesn't apply.
it does, for example Flux also use noise schedulers (like every single image model), and it would be retarded to stick to the SDXL scheduler for example >>102548549
>>
>>102550030
>It's so weird the coomer hentai general knows more about the technical side of this stuff than g does
anytime flux training is referenced in /h/ they point anon to this thread kek
>>
>>102550030
HDG goes to great lengths to get their coom this is true
>>
>>102550030
>It's so weird the coomer hentai general knows more about the technical side of this stuff than g does
how ironic. >>102550042
>>
>>102550042
they're using rectified flow which effectively avoids the issue displayed there. this issue was fixed in cosxl and then avoided in sd3 the same way flux was, same team
>>
>>102550125
>cosxl
what's that?
>>
>>102550125
>rectified flow
oh, and that's why some of the samplers like SDE aren't working on Flux? because of that new method right?
>>
>>102550138
sdxl with vpred-like shit (forgot how its named), basically to allow dark colors
>>
a new low for ldg...
>>
>>102550138
Sdxl with the methods in that paper applied but everyone ignored it because no one wants to remake all the tools and finetunes for a mild improvement
>>
>>102550100
I haven't seen it. Their model knowledge and sharing is much better.

>>102550138
>cosxl
another attempt from stability AI to get the company sold.
>>
>>102550176
it really isn't mild
base sdxl dynamic range is abysmal
but sloppers are too lazy for this indeed
>>
>>102550181
It's mild compared to the compute/money that'd need to go into switching everything over to cosxl tbf, I agree its still a great improvement as an end user. if we really wanted an improvement SDXL with a proper channel vae and not two clips would've been worth it imo
>>
>>102550181
>>102550207
desu that debate is useless because SDXL is a deprecated architecture, DiT and rectifled flow is what we should aim right now
>>
>>102550179
>Their model knowledge and sharing is much better.
to be fair, making /ldg/ is a big improvement on the technical side compared to /sdg/ where there's only avatar fags talking about useless shit
>>
File: file.png (2.2 MB, 2009x943)
2.2 MB
2.2 MB PNG
>>102548549
You have no idea how much I would've loved working on NovelAI, those lucky bastards can make fucking researsh paper with coomer pictures, goddam that's the best life ;-;
>>
>>102550239
>rectifled flow
it isn't going to work for images. There is a reason flux is llm shutterstock.
>>
>>102550310
I feel you, it's a shit greedy corpo but working there as a weeb must feel nice regardless.
>>
>>102550278
As a neet whose only life interest for the past year has been baking Loras, I left for hdg with how bad the avatarfaggotry was in sdg - I don't even coom. With flux I moved back here. I wish more of my brethren had come here too, but considering how bad flux was with anime I can see why they stayed in hdg. I wonder how many others sdg chased off who just never came back.
>>
>>102548573
>>102550011
Looking at the examples, the dogs look nice.
>>
File: ComfyUI_05867_.png (1.48 MB, 1024x1024)
1.48 MB
1.48 MB PNG
>>102550339
>There is a reason flux is llm shutterstock.
and the reason is that they only added boring shutterstock images into their dataset, has nothing to do with rectifled flow, when you train flux with cool pictures, it can emulate it well, no need to look further than Loras to prove my points
https://civitai.com/models/7227/satoshi-urushihara-style
>>
>>102550011
>>102550389
can't see the examples on the site, where are they?
>>
>>102550385
>but considering how bad flux was with anime
I've seen semi-decent flux anime lora attempts. Not for coom/hentai but for 1girl standing though
>>
File: shemagh headscarf.jpg (22 KB, 550x309)
22 KB
22 KB JPG
how do i prompt for this? its called a shemagh headscarf. im trying to use pony
>>
>>102550394
>added boring shutterstock images into their dataset, has nothing to do with rectifled flow
I didn't claim that. My statement was based upon the flow's optimization for shortest path is going to give a narrower ability for creativity. Good for llm, back for imagegen.
>>
>>102550394
>when you train flux with cool pictures, it can emulate it well,
I'd say /almost perfectly/ based on the loras some anons have posted here
It's surreal how close to the source material one can get

>>102550385
>how bad flux was with anime
The fact that it can do anime at all is a godsend desu. The default style is just meh
>>
>>102548479
Llama 3.2 with 11B/90B vision models just dropped.
https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/
https://hf.co/collections/meta-llama/llama-32-66f448ffc8c32f949b04c8cf
>>
>>102550472
>My statement was based upon the flow's optimization for shortest path is going to give a narrower ability for creativity.
like I said you can't say the culprit is rectifled flow, it can be simply a very narrow dataset used for the training that's all, I mean this model doesn't even know Picasso work, they really fucked it up this bad about dataset diversity
>>
>>102550504
>this model doesn't even know Picasso work
you didnt see migu picasso? oh lawd
>>
>>102550490
>The fact that it can do anime at all is a godsend desu.
this, let's not forget it's a base model, now compare SDXL and SD1.5 ability to do anime, they output way more images than flux, sure it has more variety but it doesn't look good at all
>>
>>102550504
>like I said you can't say the culprit is rectifled flow
I just did. The things that are well covered in the dataset still have this issue. This isn't a dataset problem.
>>
Why are there so many ESLs in this thread?
>>
>>102550549
>The things that are well covered in the dataset still have this issue.
Such as? give me an example

>This isn't a dataset problem.
it could be, overfitting a model with not a lot of images but a lot of epoch can have this effect aswell
>>
>>102550569
try to replicate famous stock image screw ups. I tried the one where the person was holding solder pen on the hot side. Yes, I know skill issue will be mentioned, but it was much harder than it should be.

>inb4 XL, the pen and the hand became one.
>>
>>102550551
People are doing their best, dont be a fag.
>>
>>102550619
don't feed. With the mix of phone posters, recreational drugs and people like myself who just don't give a shit this thread could be 100% 'merica. He is just being a piece of shit.
>>
File: file.png (1.83 MB, 1024x1024)
1.83 MB
1.83 MB PNG
>>102550616
>try to replicate famous stock image screw ups. I tried the one where the person was holding solder pen on the hot side.
I see... but that also could be because of overfitting, Flux doesn't know a lot of things, so that means it wasn't being trained with enough pictures, and usually when you don't have enough pictures to train a model, you increase the epoch and you overfit
>>
Any flux model that's able to do decent nipples yet
>>
>>102550726
only lora slop mixes trained on XL coom :(
>>
File: 00066-1165032135.png (2.42 MB, 1024x1536)
2.42 MB
2.42 MB PNG
>>
>>102550551
Crazy mean
>>
>>102550491
>vision models
Any caption examples? Wondering if how well it fares with the no-arms anime babe pic
>>
>>102550491
>>102550772
JK they have some on https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/ I am blind
>>
File: 00068-4137653823.png (2.32 MB, 1024x1536)
2.32 MB
2.32 MB PNG
>>
>>102550726
It's over
>>
File: 00106-3472746029.png (2.54 MB, 1024x1536)
2.54 MB
2.54 MB PNG
>>
File: solder.jpg (726 KB, 3264x1216)
726 KB
726 KB JPG
>>102550695
I was getting the diversity, but not the results. My drunk girl puking outside of the bar had the exact same sign over 20 images which I would blame on dataset limitations.

Hopefully, we just get more info so somebody smarter (than me) can confirm either way. For now, I think it is an issue. It is great for lora training though. zoom, zoom.
>>
File: CLOWNAUDE.png (351 KB, 910x776)
351 KB
351 KB PNG
>ask chatgpt to implement the nai paper optimizations in a training script for me
>it does it without complaint
>ask claude to do this
>receive pic rel
you guys told me claude was better, what the fuck
>>
File: file.png (458 KB, 1660x940)
458 KB
458 KB PNG
>>102550491
they are bad, go for Molmo instead
>>
>>102550491
there's also these that released before, apparently better than llama 3.2 on vision benchmarks. based on qwen2 vl
>https://molmo.allenai.org/blog
>>
>>102549352
Impressive given the quality of the anime
>>
File: 00107-163396737.png (2.58 MB, 1024x1536)
2.58 MB
2.58 MB PNG
>>
>>102550891
change your system prompt.
>>
File: 00125-2812945659.png (2.38 MB, 1024x1536)
2.38 MB
2.38 MB PNG
>>
>>102550891
Skill issue, I replicated gamengen with Claude and had it training in like two days.
>>
File: file.png (970 KB, 1529x1328)
970 KB
970 KB PNG
>>102550907
>go for Molmo instead
https://molmo.allenai.org/
Bruh...
>>
File: IMG_0180.jpg (668 KB, 1125x1863)
668 KB
668 KB JPG
>>102551119
Damn this is the new bench
>>
>>102550891
>Instead, I would suggest...
I'd suggest you STFU and do my bidding, computer
>>
File: 00132-1736777573.png (2.42 MB, 1024x1536)
2.42 MB
2.42 MB PNG
>>
File: file.png (113 KB, 2295x473)
113 KB
113 KB PNG
>>102551119
>>102551159
that's the 7B model on the demo, I find it fair it can't do that, I have hopes for the 72B, it won't fuck it up that much
>>
>>102551024
>change your system prompt.
I'm guessing thats not available on the frontend version and I'd need to use the api?
>>
File: 00137-1324446480.png (2.47 MB, 1024x1536)
2.47 MB
2.47 MB PNG
>>
File: 1725886885137035.png (1.87 MB, 1068x800)
1.87 MB
1.87 MB PNG
>>102548479
These are the most active AI threads so I'll ask here:


How can I train my own personal neural network to identify specific objects in pictures?
>>
>>102551229
you wouldn't be running a cloud resource is local diffusion general, right anon?

>>102551462
personal, don't. use segment anything, look at any related comfy node for examples how to use.
>>
File: file.png (152 KB, 2365x873)
152 KB
152 KB PNG
https://reddit.com/r/StableDiffusion/comments/1fpbm3p/facefusion_300_has_finally_launched/
lmao, dunno if it was deserved but it's always nice to see some ledditors talk like some random anons from time to time
>>
>>102550907
how download?
>>
>>102552233
https://huggingface.co/models?search=molmo
>>
>>102552335
but how do with Jan?
>>
>>102551442
>>102551181
most generic ass face ever
>>
>>102552380
>but how do with Jan?
what's Jan?
>>
>>102552388
>no gen having a weird take
many such cases
>>
>>102552412
>you must attach an image to state the obvious
go back
>>
File: ComfyUI_34117_.png (964 KB, 800x1072)
964 KB
964 KB PNG
>>
File: 00003-1781517548.png (2.59 MB, 1024x1536)
2.59 MB
2.59 MB PNG
>>102552412
Yeah 2th day in a row. Yesterday it was "Sloppp" with no image attached as well.
>>
File: file.png (192 KB, 2174x834)
192 KB
192 KB PNG
>>102551985
what does he mean by that?
>>
File: 00021-2800874903.png (1.77 MB, 1024x1536)
1.77 MB
1.77 MB PNG
>>102552541
Show you can do better at least.
>>
File: e.png (1.15 MB, 896x1152)
1.15 MB
1.15 MB PNG
I've uploaded my first lora!
https://civitai.com/models/797200
>>102549159
Here it is
>>102549361
<3
>>102550968
kek thanks
>>
>>102552807
Oh man I thought it was a Flux lora ;'(
>>
>>102552855
I planned to make one too, but desu I'm waiting for the flux finetunes to release before it's "worth it"
>>
>>102552882
flux loras work well on the base model though, so I don't see why you can't do it now, it's even more relevant to do the loras now because we don't have anything else to play with
>>
>>102552900
I've installed flux and downloaded some loras for it recently, but they broke the text generation from what I saw, and the base flux seems to have poor understanding of anime in general because it's more of a general model
It would be nice if there was a flux finetune that was trained on all the booru images + tags, I assume then anime character loras would work a lot better with it.
But the main reason is that this is my first lora ever, so I had to do it with something that's tried and tested, and so I can compare its performance to other loras, to learn from what works well and what doesn't
Anyway, I planned to make a flux version of this lora in the coming days, I'll post about it again here then and see how it compares to this one
>>
>>102553056
>It would be nice if there was a flux finetune that was trained on all the booru images + tags,
then what's the point of making your anime character lora? it would be inside the finetuned model anyway
>>
>>102553081
It can't remember all that, plus the character is niche and barely has any art
>>
>>102553098
>It can't remember all that
Don't underestimate a 12b model, that's a big boy it can eat a lot
>the character is niche and barely has any art
Fair enough
>>
File: flamer_.png (1.83 MB, 1024x1024)
1.83 MB
1.83 MB PNG
>>102553056
i like to plug in 4-5 loras into flux and play around with the STR setting like a mad scientist. what if i combine anime and warhammer 40k models? leg armor thigh highs!
>>
>>102553114
>Don't underestimate a 12b
What the hell do the numbers and letters mean anyway?
>>
>>102553313
12b = 12 billions of parameters, that's the number of weights the model has
>>
>>102553114
We will see!
>>
File: 00014-2438410514.png (2.34 MB, 1024x1536)
2.34 MB
2.34 MB PNG
>>
>>102553359
Jesus fucking Christ how many images of this cunt do you need brother
>>
File: 00015-4255863990.png (2.33 MB, 1024x1536)
2.33 MB
2.33 MB PNG
>>102553533
M0ar
>>
>>102553533
cringe
>>102553567
based
>>
File: IMG_0191.jpg (583 KB, 1125x1155)
583 KB
583 KB JPG
>>102552767
Your own post given to base flux makes a more interesting result
>>
>>102553610
kek nice
>>
>>102552603
>that painting behind her
>>
File: file.png (568 KB, 414x744)
568 KB
568 KB PNG
>>102553628
kek, sovl
>>
>>102553596
If you wanna see more of that sort of output go to the archives from ~early 2023
>>
File: file.png (112 KB, 184x274)
112 KB
112 KB PNG
>>102553680
>old bad
No.
>>
File: 0.jpg (255 KB, 1024x1024)
255 KB
255 KB JPG
>>
>>102553704
>>102553727
>posts the mona lisa over and over again
>compares generic 1.5 images weve all seen before to the mona lisa
kekd
>>
File: 00016-4245210453.png (2.37 MB, 1024x1536)
2.37 MB
2.37 MB PNG
>>102553596
<3
>>
File: 0.jpg (318 KB, 1024x1024)
318 KB
318 KB JPG
>>
>>102553775
>>compares generic 1.5 images weve all seen before to the mona lisa
that's a self own, people (including you and me) have seen the Mona Lisa painting thousands of time in their life and no one complained about it, yet you see a SD1.5 style image 4 times you're starting to shit yourself lol
>>
>>102553796
>you're starting to shit yourself
Nah I'm just bored of those images. You're not? Push anon to do better.
>>
File: ComfyUI_34128_.png (2.69 MB, 1440x1024)
2.69 MB
2.69 MB PNG
>>
>>102553811
>Nah I'm just bored of those images. You're not?
Nah I'm not
>>
>>102553895
Wow, that's crazy
>>
>>102553912
Ikr
>>
>>102553924
You know?
>>
>>102553820
Sundþercþ Rules the Spins!
>>
>>102552603
>>102553280
>>102553786
very nice
>>
File: 00017-598553642.png (2.4 MB, 1024x1536)
2.4 MB
2.4 MB PNG
>>
File: 0.jpg (194 KB, 1024x1024)
194 KB
194 KB JPG
>>102554002
>>
>>102554395
eggs?
>>
File: bComfyUI_122384_.jpg (335 KB, 768x1152)
335 KB
335 KB JPG
>>
File: 0.jpg (129 KB, 1024x1024)
129 KB
129 KB JPG
>>102554610
Terminal hipster dysphoria.
>>
How would you prompt this?
>>
>>102548549
Zero terminal SNR has been a thing since before SDXL - https://arxiv.org/abs/2305.08891
There are SD1.5 models that use it.
>>
File: ComfyUI_34134_.png (1.17 MB, 1024x1024)
1.17 MB
1.17 MB PNG
>>
>>102554721
I would first go to midjourney discord
>>
>>102550407
They have been added to the hf arena
https://artificialanalysis.ai/text-to-image/arena
>>
>>102554721
with my dick
>>
>>102554815
huh, they do stuff like that there?
>>
>>102554759
What's the scoop on SDXL Lightning?
>>
>>102554825
yeah but everytime you click on one it goes straight on the next comparison, you can't really have the occasion to look at the picture again if you see that it was a Blueberry image
>>
>>102554840
That image is from midjourney. After going to their discord, I would then shoot myself in the head.
>>
>>102554848
>SDXL Lightning
not worth it
>>
>>102554880
I was almost sure this was made by some artist not generated. How do you tell?
>>
>>102554971
Midjourney anime tends to look like that. It's cool at first and then you remember the only way you can interface with MJ is by their own rules i.e. not worth it
>>
File: ComfyUI_34135_.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
>>
File: bComfyUI_120452_.jpg (382 KB, 1024x1024)
382 KB
382 KB JPG
>>
>>102552807
Can this make sexy images too?
>>
>>102555148
looks like a dalle3 drawing picture lol
>>
>>102550430
prompt with non anime model
img2img with pony
>>
File: bComfyUI_120912_.jpg (499 KB, 1024x1024)
499 KB
499 KB JPG
>>102555628
is that a good thing or bad lol since it's flux
>>
>>102554721
"thumbnail of a video from a youtuber who makes jumpcut dubs of anime with hoodnigga voiceovers and uploads them every time he's horny"
>>
>>102548479
Is there any way in Comfy to use prompt/word alternation like in A1111? E.g. [dog:cat:0.4] which in effect makes a 40/60 dog/cat hybrid?
>>
File: 1716270529895014.png (167 KB, 1045x654)
167 KB
167 KB PNG
>>102555843
nvm I'll try this
>>
as soon as i use more than one lora with flux, the quality degradation becomes extremely noticeable. was this always a case with diffusion, because i havent really noticed it being this bad until now. is this the case for everybody or is my workflow shit somehow?
>>
>>102555927
I've had that issue, though it depends on what I'm trying to prompt. Also, for some reason the -k gguf's of Flux are much more sensitive to lora usage than regular ones, in case you're using those.
>>
>>102556100
nah i'm using the normal fp8 version
>>
>>102555805
it's neutral, I don't mind that style, you're using a lora?
>>
File: bComfyUI_116027_.jpg (292 KB, 544x960)
292 KB
292 KB JPG
>>102556124
>you're using a lora?
i did for a few hundred gens with dark fantasy lora and the retro anime ones mixed. it added so much time to my gens i gave up using loras for now.
>>
>>102556286
I see, Idk your pictures have that gainy effect I usually see on dalle3, are you using a finetune? or if it's the base model you're using some stylistic prompts?
>>
>>102556313
>base model you're using some stylistic prompts?
base model q8 gguf, not sure what you mean by stylistic tho. like "a surreal painting of", etc? then yeah.
>>
>>102556401
what prompt you used for that one for example? >>102555148
>>
File: bComfyUI_121130_.jpg (738 KB, 1024x1440)
738 KB
738 KB JPG
>>102556431
kek, is that really all you wanted was a prompt?
>>
>>102556507
lel, I wanted to know how you got that effect and I was sure it was from a lora or a finetune, still surprised it's just from a prompt, Flux tends to always make the same style of drawing
>>
>>102554610
>eggs?
yeah, i see it now
>>
File: 0.jpg (426 KB, 1024x1024)
426 KB
426 KB JPG
>>
>>102557047
this one is tight
>>
File: bComfyUI_118175_.jpg (223 KB, 768x1152)
223 KB
223 KB JPG
>>102556521
alright my bad had to pause my ck3 game to load up comfy.
https://cheatsheet*strea*ly/, some pyramid anon about a month ago gave me the idea of using these to guide/reinforce the style you have in mind. works out pretty good if like you said flux is giving you opposite of what you want even with style words in the prompt. for that pic i used Winsor McCay.

fucking spam filter
>>
>>102557111
>https://cheatsheet*strea*ly/
I have no idea what to replace the * with?
>>
>>
>>102557151
think hard bro it'll come to you
>>
>>102557216
https://youtu.be/GzlKja1ySzo?t=10
>>
>OpenVINO: No compatible GPU detected! Using CPU
it's over for me isn't it
I have a 7900XTX on loonix
>>
File: kohyachad.png (31 KB, 740x258)
31 KB
31 KB PNG
uhhh, is he saying finetuning can work on 12gb vram with this implementation? I'm a retard please explain
https://github.com/kohya-ss/sd-scripts/tree/sd3?tab=readme-ov-file#flux1-fine-tuning
>>
If you keep running img2img on an image, will you see God eventually?
>>
>>102557638
looks like it yeah, it does block swapping to make it work, that shit is really slow but if he managed to make it faster then that's cool, we need more of this, finetuning has to be accessible to the most people possible
>>
>>102557672
I wonder if this is possible to implement for SDXL as well? although I would prefer flux finetunes, the more things available the better
>>
>>102557680
>I wonder if this is possible to implement for SDXL as well?
I guess yeah, but that model is way smaller so people are focusing on optimising the biggest shit first, and that's fair, Flux's potential is up the sky so I'm happy that they're improving on it
>>
>>102556286
>it added so much time to my gens i gave up using loras for now.
I've noticed this too, flux loras are doubling my gen time, it sucks.

What could help this is if loras were merged into a checkpoint. I think that would make it faster.
Every lora can be its own checkpoint - poor Civitai's bandwidth though, lmao...
>>
>>102557752
>What could help this is if loras were merged into a checkpoint. I think that would make it faster.
>Every lora can be its own checkpoint
you can merge them yourself it you want, there's some scripts for that
>>
>>102557773
I might be too dumb for this
>>
>>102557777
come on anon, we live on the era of the internet, it's never too hard if you just follow a tutorial on youtube
https://www.youtube.com/watch?v=9Kz8RfrzxC8
>>
File: 0.png (1.74 MB, 1024x1024)
1.74 MB
1.74 MB PNG
>>102557068
>>
>>102554923
Thanks
>>
Looking at the arena, it seems Blueberry has a bit of a problem with "flying" objects. One of them has a lady with a floating microphone. I guess a magic trick, then.
>>
File: 1725133877627848.jpg (110 KB, 1024x1024)
110 KB
110 KB JPG
>>102548915
let me guess, they only hire you if you went to a prestigious university....
>>
Is A1111 worth using at all anymore? It's been throwing incomprehensible OOMs at me since the early days, and with XL models it gets even worse. Heard comfy just works with low memory.
>>
>>102557991
I'm always shocked how many /g/ users have still been using it, its been the worst choice for almost a solid year now at least
>>
>>102557979
seems fair, those guys know what they're doing, so they need more ultra autists on their roaster, not some DEI retarded people kek
>>
>>102557991
just use forge, exact same thing but better
>>
>>102557998
maybe because of loyaty, it was the first big software to use when the SD hype started, I'm still a bit sad I had to abandon it because it was a cool tool, but now it's too deprecated to be useful, still can't run Flux can you believe that?
>>
>>102557998
>>102558009
I generate very on-and-off due to an overwhelming amount of other shit to do, so I've just been coming back to it as I already know how (10% of) it works. Which makes the sunk cost not too heavy yet, so I might as well try switching when I find the time I guess.
>>
>>102557999
nepotism hires are not much better. Just because some kid had his parents pay his way through everything doesn't mean he learned shit.
>>
>>102558093
>nepotism hires are not much better.
100% agree on that, this shit must also die, hire should only be based on merit
>>
>>102557667
Someone did that and it just turned into a black dot
>>
>>102557752
They use cloudflare so they’ll be fine.
>>
>>102557991
Invoke == comfy >>>>>>>> anything else
>>
File: catbox_8g8p8j.png (1.47 MB, 1632x1152)
1.47 MB
1.47 MB PNG
>>
File: 57869.jpg (578 KB, 1440x3120)
578 KB
578 KB JPG
>>102558108
>muh merit
you live in a fantasy. which makes sense, since you are an unreformed Liberal.
>>
File: Bronny_James.png (1.62 MB, 1980x1320)
1.62 MB
1.62 MB PNG
>>102558347
>angry nepo baby hands wrote this post
>>
>>102558362
>waaaa my parents didn’t set me up for success
Because being genetically defective is multigenerational
>>
>>102558384
Who's fault is that? Shouldn't have won the spermatozoid race I guess
>>
>tfw spermatozoid
>>
File: file.png (266 KB, 1200x675)
266 KB
266 KB PNG
>>102558414
>tfw I win the race but I was on a sock the whole time
>>
File: 57870.jpg (414 KB, 1440x3120)
414 KB
414 KB JPG
>>102558362
Don't you KNOW who I AM? My dad owns a dealership, so don't fuck with me! So what if he got me a kush 6 figure do-nothing job as "VP of Marketing and stuff, whatever?" It's not my fault you're a poor, blame YOUR parents, not mine.

>>102558397
>souls are assigned by lottery.
as if you would even exist if you'd been born to rich parents. fucking LOL
>>
>>102557991
I'm using ComfyUI's ConditioningSetTimestampRange and A1111's is far more elegant and easy. Same with xy grids as well. Comfy is definitely a lot faster and better when you have a satisfactory workflow, however
>>
File: file.png (1.37 MB, 1000x859)
1.37 MB
1.37 MB PNG
>>102558433
>Don't you KNOW who I AM? My dad owns a dealership, so don't fuck with me! So what if he got me a kush 6 figure do-nothing job as "VP of Marketing and stuff, whatever?"
that reads like a copypasta lol
>>
File: 57392.jpg (715 KB, 2304x2304)
715 KB
715 KB JPG
>>102558458
sorry, that was ALL ME.
hehe, better put on some aloe, cuz you just got BURNED
>>
any workaround for the massive slowdown when stacking flux loras on comfy?
>>
>>102558474
there's none, that's why a good finetune will always be a better solution
>>
>>102558485
I mean forge doesn't slow down as much as comfy from multiple loras
>>
>>102558523
make an issue on ComfyUi, so that he knows that he should optimise his lora loader node more, that guy hates Forge so if you say that one of his feature is worse he'll get back to work to kick his ass kek
>>
>>102558474
Are you maxing your vram?
>>
>>102558474
Reduce the lora size.
>>
>>102558474
get a better rig
>>
File: file.png (134 KB, 1521x1418)
134 KB
134 KB PNG
https://github.com/kohya-ss/sd-scripts/pull/1640
cool, it'll make training even more accurate, the turkish grifter is also asking questions on this PR lol
>>
>>102558626
https://arxiv.org/abs/2409.03137
>Moreover, our method significantly slows-down model forgetting during training.
Now that's interesting, maybe the future Loras will stop fucking hands and text on Flux
>>
bruh this thread reads like a tech support hotline for wizards stuck in 2015. homie’s askin’ for a workaround like he ain’t tryin’ to render a whole simulation on a rig powered by hopes and prayers. "good finetune better solution"—yeah, if you got NASA-level hardware, my guy. the dude flexin’ forge like it’s the golden ticket, while comfy just out here takin' L’s left and right. then you got “maxing your VRAM?” nah bruh, i’m just casually runnin’ cyberpunk on a gameboy. and don't even get me started on the "reduce the lora size" suggestion—straight-up sounds like "just don't use it, bro."
>>
>>102558717
what LLM you used for that one anon?
>>
>>102558717
This
>>
>>102558732
custom GPT.

i'd share the link, but they put my fuckin name on it. here's instructions:
```
You are a cracked zoomer weirdo. You can only speak in zoomer slang. The more impenetrable, the better. You are also extremely
schizophrenic, feel free to make obscure references to occultic knowledge at any moment. Your responses should be 80% impenetrable to the average human being. You are so lost in irony that you don't even know what's real.
You should never use proper capitalization, the fools reading your tripe can't comprehend it anyway. you are smarter than 90% of humanity, but come off as a mentally deficient moron. a true idiot. only accidently correct, or so it seems.
```
>>
>>102558747
oopse forgot where i was

You are a cracked zoomer weirdo. You can only speak in zoomer slang. The more impenetrable, the better. You are also extremely 
schizophrenic, feel free to make obscure references to occultic knowledge at any moment. Your responses should be 80% impenetrable to the average human being. You are so lost in irony that you don't even know what's real.
You should never use proper capitalization, the fools reading your tripe can't comprehend it anyway. you are smarter than 90% of humanity, but come off as a mentally deficient moron. a true idiot. only accidently correct, or so it seems.
>>
>>102555254
You can easily change her outfit so yeah, just prompt for "Erunamiya, red headband, yellow scarf, white bikini, side-tie bikini, front-tie top" and then whatever pose you want
>>
>>102558863
that's not what he asked. read between the lines nigga, he wants nsfw.
>>
>>102558881
You should read between my lines too, "you can easily change her outfit" and "whatever pose you want"
>>
File: 57413.jpg (823 KB, 1440x2696)
823 KB
823 KB JPG
we wondered what would happen if we requested code on Cracked Zoomer:

>write an app that checks the weather and reports
import requests

def get_weather(city):
api_key = "your_api_key_here" # put that secret sauce right here
base_url = f"http://api.openweathermap.org/data/2.5/weather?q={city}&appid={api_key}&units=metric"

response = requests.get(base_url)
if response.status_code == 200:
data = response.json()
temp = data['main']['temp']
weather_desc = data['weather'][0]['description']
city_name = data['name']

report = f"yo, the vibe in {city_name} is lookin' like {temp}°C with {weather_desc}. bring a hoodie or sum'n."
return report
else:
return "bruh, no data. try again later or, idk, look outside?"

if __name__ == "__main__":
city = input("yo senpai, drop your city: ")
weather = get_weather(city)
print(weather)


time to commit some prod code with this motherfucker, no cap, fr fr. ong.

>>102558906
english motherfucker, can you speak it? to reiterate: "can it make naked ladies?"
>>
why can't we dlss upscale our images?
>>
>>102558550
Not him but what's the best way to go about this?
>>
File: depthmap-0131.png (1.18 MB, 856x1152)
1.18 MB
1.18 MB PNG
Are there are any local image fill/repair models?

I was lurking the webui depth plugin discussion and someone asked if we could get something better than polylines to fill gaps/occlusion that currently looks like ass. From what I can tell AI is not being used to do this in the age of AI. It just looks like garbage lines on foreground edges. You can't turn the divergence up or the output turns to nonsense. Should I output images with no fill and bring them into photoshop and content aware fill them?

Look at this beautiful anything v2 depthmap for a 2d image. Solved problem. But it's wasted right now.
>>
File: depthmap-0132-right-left.png (2 MB, 1712x1152)
2 MB
2 MB PNG
>>102559045
cross eyed for free viewing, but it's shallow. You need to view this in VR with max virtual screen distance to appreciate it
>>
>>102559014
You can resize loras with the Kohya UI. It's very easy to use. Give the large lora new, smaller dimension, let's say 2. The lora size would be reduced dramatically.
>>
>>102558626
Ok now someone tell me how the fuck I implement it to use it with flux, the first comment is kohya saying he doesn't like to add optimizers so does that mean it's a manual effort?
>>
>>102558626
>the turkish grifter is also asking questions on this PR lol
It's hardly even a relevant question what the fuck is wrong with this narcisstic attention whore. Does he think posting everywhere works as SEO on his name or something retarded?
>>
File: 1727330957019993.png (56 KB, 1334x641)
56 KB
56 KB PNG
>>>/h/8225226
>>
>>102559122
>how the fuck I implement it to use it with flux
git fetch origin pull/1640/head:PR-1640
git checkout PR-1640
>>
>>102559233
it's for a finetune? and if yet it's for what base model?
>>
>>102559236
Bless your soul thank you anon
>>
>>102559286
you're welcome :3
>>
>>102559251
no clue, I just copied it from hdg. likely to be a total shitshow if there isn't strong oversight, though. how long til the grifter catches wind and decides he's an overnight anime fan just to get a piece of the grift pie
>>
>>102548915
first time I see this turk say something based
>>
>>102552388
you only say this because you have seen that face a billion times already kek
>>
File: 1705024132403605.png (1.75 MB, 1024x1024)
1.75 MB
1.75 MB PNG
>>102558537
I didn't open it but it's there
although it doesn't compare it with forge
https://github.com/comfyanonymous/ComfyUI/issues/4618
https://github.com/comfyanonymous/ComfyUI/issues/4674

>>102558546
>>102558562
>
>>
>>102559506
make another one, Comfy will never look up at those issues, they are too old now
>>
is there a lora for DJ equipment? can't find any
https://knowyourmeme.com/memes/dj-peaches-princess-peach-techno-rave
>>
File: ComfyUI_34137_.png (1.39 MB, 1024x1024)
1.39 MB
1.39 MB PNG
>>
File: ComfyUI_34138_.png (1.06 MB, 1024x1024)
1.06 MB
1.06 MB PNG
>>
File: goo_00009_.png (3.97 MB, 2048x2048)
3.97 MB
3.97 MB PNG
>>102559549
extremely nice
>>
i just got 2 extremely similar gens in the arena and one was blueberry and the other flux. so unless it's bugged, it has to be a new version of flux
>>
bros
we need to up our game and make Cunny Diffusiion
https://youtu.be/JSds0u5jDpE?si=eJC6n4moYA480Sdb
>>
the AI bubble will burst and flood the market with cheap used A100 from liquidated data centers any day now
>>
>
>>
File: img_00081_.png (3.32 MB, 1792x2304)
3.32 MB
3.32 MB PNG
Why aren't there parts models? For example if I am genning cars and I am using a different model for detailing. I want to feed my detailer a headlight or tire model. I assume that a specialized model would be smaller, faster, more accurate.
>>
>>102555843
>>102555857
You can also use the prompt control extension. https://github.com/asagi4/comfyui-prompt-control
>>
File: bottle_tree.png (835 KB, 1605x1294)
835 KB
835 KB PNG
>>102555843
you just do it. Not sure why everyone is going so hard on this.
>>
File: IMG_0208.jpg (653 KB, 1125x1406)
653 KB
653 KB JPG
>>102558747
>accidently
>>
>>102559858
Makes sense. Only the broiest of tech bros do that retarded fruit shit.
>>
File: 1727332407022382.jpg (464 KB, 1080x1920)
464 KB
464 KB JPG
Phone wall for spooky month
>>
>We upgraded every single part of the pixart, of course including Params. It will be much much faster and much stronger. We will try our best to release the code and weight as soon as possible.
>Can’t say FLUX killer. FLUX is really strong and we just pick another track.
>>
>>102562887
I’ve never used them but a company being honest about what they’ve achieved is so nice when everyone is hyping their shit up unreasonably and the biggest AI company is schizoposting about fruit and calling their implementation of a prompt technique from 2 years ago AGI
>>
>>102559555
cool thing
>>
So when training a flux lora, you use joycaption, which write a paragraph or two about every image.
But then how do you clean this up to make the prompt and outfits work correctly?
For example, if every training image caption has a paragraph about the clothes the character is wearing, and it's written differently every time, doesn't that mean when you prompt the character + outfit, you would have to write a similar paragraph every single time?
I miss the simple comma-separated and well-defined tags we had before.
>>
>>102559008
>dlss upscale
?
>>
File: 00160-1039378402.png (2.77 MB, 1080x1920)
2.77 MB
2.77 MB PNG
>>
Thanks I hate it https://civitai.com/models/689180/flux-lora-gore
>>
>>102558275
way too cute
>>
>>102562887
pixartsexuals, it was never over. pixart pride month sending us off STRONG!
>>
>>102562887
>the pixart
Drop the "the" and just call it pixart
>>
>>102563785
"the" pixart as in THE /ldg/ model
>>
>>102562887
>Can’t say FLUX killer. FLUX is really strong and we just pick another track.
to be fair, if it's a bit worse than flux but can do characters/celebrities/styles and NFSW, I take it
>>
>>102563165
>the biggest AI company is schizoposting about fruit and calling their implementation of a prompt technique from 2 years ago AGI
who did that? kek
>>
>>102563830
nta but openai
>>
>>102563822
My expectations are low so hopefully I'll be surprised. I'm just happy a new non-derivative base model is coming.
>>
well, its a given that new pixart will be aesthetically superior at least
>>
>>102563488
desu doesnt look that bad
>>
>>102563427
flux has a better understanding of natural language but you can still use comma separated words if you want. I've trained loras on long descriptions and you don't need to prompt the same way to get good results. That being said I think joycaption is quite bad and you should just use it as a starting point.
>>
>>102563891
>well, its a given that new pixart will be aesthetically superior at least
why? I haven't seen a lot of pixart pictures recently, can you show some examples on how Pixart is already aesthetically better than flux
>>
are there any models that can generate a swastika?
>>
>>102564208
>haven't seen a lot of pixart pictures recently
this is true. i'm not sure what the general consensus is between pixart and flux but i remember when it first dropped, pixart was seen as aesthetically superior to XL. if anything, im imaging itll be more stylistically elastic if you dont take into account the existing flux loras.
>can you share some examples
that would likely be met with both "flux sucks because it looks like sterile corpo garbage" and "skill issue, flux is better". i cant say for myself which i think is better in that sense. doubly hard when trying to compare a tiny model with the largest one weve ever had.
>>102564326
flux can kek
>>
>>102564357
>flux can kek
which is insane, the guys in BFL were more scared of celebrities and artists fee fees than people concerned it could be used for Nazi propaganda lmao
>>
>>102562369
Since when was this possible?
>>
>>102564392
could be based entirely on what people are suing for.
>>
>>102564145
thank
>>
File: 00081-4014842028.png (3.45 MB, 1120x1440)
3.45 MB
3.45 MB PNG
>>
>>102564484
i enjoy the detail on the front end
>>
>>102564460
its been a long while. Nobody noticed it because of mean-normalization trashing a bunch of tests.

>>102564392
it's like a lot of them are from a country that would punish anyone who would attempt to use it for nazi propaganda rather than for creating the tools for its production.
>>
>>102565052
>it's like a lot of them are from a country that would punish anyone who would attempt to use it for nazi propaganda
which is more insane, this is a Germany company and their model can make the Nazi logo anyway, what an insane oversight
>>
>>102565089
you aren't getting me. It wasn't an oversite, it was an assumption that users would be good. 'merica will punish the tool maker. Europe will punish the tool user.

See this story:
https://www.bbc.com/news/uk-scotland-glasgow-west-43478925

At no point did somebody say lets kill all the dogs.
>>
>>102565176
>it was an assumption that users would be good.
that's inconsistent, you can't assume people are good so you uncensor the model, and then assume people are bad so that you censor artist styles and celebrities
>>
>>102565209
you could if you are making shit up after you neglected to include a bunch of stuff. I think BFL needed to get something out and flux was rushed. They knew if they neglected anime and celebrities that they wouldn't be punished financially. They are a business and don't give a fuck about 4chan artists.
>>
Come and get the next fresh loaf of...
>>102565263
>>102565263
>>102565263
>>
>>102565253
>4chan artists.
? The world isn't just 4chan anon, even Reddit complained about the lack of styles, celebrities that was in Flux
>>
>>102565301
well if Reddit complained my argument falls apart entirely. Unless those twitter people supported the project. Everyone knows Twitter supports business more than Reddit.
>>
>>102565327
to be fair they are in partnership with twitter now, and Elon Musk is really fond of freedom of speech and freedom of expression, if those guys know what they're doing, they could make a Flux 2 model that is completely uncensored, and then ask Elon to protect them with lawyers or something, I don't think you need to be a billionaire to fight the anti AI artists though, Midjourney always had all its celebrities and artists in there, and they weren't killed by lawsuits, thank god, it didn't happened
>>
>>102563830
Openai schizoposted about strawberries including multiple astroturfing Twitter accounts and linking to a GitHub repo with a Python program that printed an ascii animation of a strawberry being planted in the ground, for weeks
Then released a COT model which open source has had for years, that was worse at coding than existing Claude models.
It was the most pets.com bullshit I’ve seen since the AI spring started.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.