/g/ - Technology


Thread archived.




File: tmp.jpg (1.23 MB, 3264x3264)
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102036630

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
File: 2024-08-23_00187_.png (1.8 MB, 1024x1024)
>>102039916
thank you baker
>>
File: ComfyUI_00578_.png (2.51 MB, 1536x1536)
Can you guess what gguf I used for this?
>>
File: ifx176.png (1.58 MB, 1024x1024)
>>
It's okay to use schnell, sdxl, sd1.5
>>
File: ComfyUI_00579_.jpg (721 KB, 1624x2880)
>>102039953
>>
>>102039960
Why would I though
>>
>>102039968
it's okay regardless
>>
>>102039916
the "ramen sluts" one was better & ty
>>102039953
what I can say is that you fried the image, so hm.
>>102039958
schizomatic fantastic mode activated.
>>
File: ComfyUI_05386_.png (904 KB, 1024x1024)
>>102039916
>made it to the collage again
I can't help winning, can I?
>>
File: 00083-3184145119.png (1.89 MB, 1216x832)
>>
>>102040037
I made it twice in a row once (both times had two of my images)

Beat that.
>>
File: 4_methods-min.jpg (3.49 MB, 4780x4060)
That took me way too much time but here we are I guess?
>>
File: ComfyUI_00557_.png (2.37 MB, 1536x1536)
>>
what a boring lora.
>>102039958
added metal lizard thing with red eyes
>>102040047
not bad. 2 of mine in there now and happened in the past as well but not in a row. you win the grand prize of benis. next goal: 3
>>
if i gen like once a month and don't have the vram for flux, what's the best online place to deploy a comfy instance and gen?
>>
File: ComfyUI_05387_.png (832 KB, 1024x1024)
>>102040047
>>
>>102040019
NTA but could you point out which bit is "fried" in that image? havent proompted so heavily that i can spot the most minute tells, so while the image doesn't quite look photorealistic to me, i couldn't really point to clear diffusion model artifact signs other than maybe some faint chromatic aberration effect at the top of her cheeks with a bit of red/blue
captcha: DRM4D
>>
>>102040048
Does the cfg 6 increase gen time?
>>
>>102040068
he chilling
>>
>>102040084
non-1 should double it, yes
>>
>>102040084
CFG > 1 always doubles the time
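
Rough sketch of why that is, for anyone curious. With CFG the sampler needs both a conditional and an unconditional prediction each step, so it runs the model twice; at exactly 1 the formula collapses to the conditional prediction and the second pass can be skipped. `toy_model` and `cfg_step` are made-up names for illustration, not any UI's real code.

```python
# Sketch of classifier-free guidance (CFG) for a single sampling step.
# toy_model is a hypothetical stand-in for the diffusion model, not a real API.

def toy_model(latent, prompt):
    """Fake noise prediction: the empty prompt plays the unconditional pass."""
    bias = 0.0 if prompt == "" else 1.0
    return [x * 0.5 + bias for x in latent]

def cfg_step(model, latent, prompt, cfg_scale):
    """Return (prediction, number_of_model_calls) for one step."""
    cond = model(latent, prompt)
    if cfg_scale == 1.0:
        # uncond + 1.0 * (cond - uncond) == cond, so the second pass is skipped.
        return cond, 1
    uncond = model(latent, "")
    guided = [u + cfg_scale * (c - u) for c, u in zip(cond, uncond)]
    return guided, 2

if __name__ == "__main__":
    for scale in (1.0, 6.0):
        pred, calls = cfg_step(toy_model, [1.0, 2.0], "miku eating sushi", scale)
        print(f"cfg={scale}: {calls} model call(s) per step -> {pred}")
```

Two model calls per step instead of one is where the roughly doubled gen time comes from; the exact overhead depends on batching and offloading.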
>>
>>102040068
yeah my next goal is 3 in a row, I wish us all luck.
>>
File: ComfyUI_00575_.png (2.57 MB, 1536x1536)
>>102040080
I think it's the cheeks but I think it was because I increased the steps

I use the schnell dev merge, that's the best it can generate in 5 minutes without waiting 20 minutes like on dev
>>
>>102040077
Miku chan, you can beat me anyday
>>
File: 00088-26440195.png (1.21 MB, 1216x832)
>>102040107
>>
>>102040124
That's dev?
>>
File: 2024-08-23_00208_.png (1.64 MB, 1024x1024)
>>
>>102039474
The guy who forked the gist wrote there just now:
>I haven't messed with Comfy nodes enough to know if its even possible to change which GPU is selected for primary processing,

But ain't that what the earlier post explains? You can assign your models to whatever?
I need to try this for myself
>>
>>102040080
it is fried in a sense of too much contrast. look at the taylor swift image above, thats even further into fried territory. for comparison, the asian girl I posted above, with the metal lizard toy, thats slightly undercooked. just dial in the guidance scale, i find that things get hairy around 3.3 but other factors are in play there as well, like a lora, the style, etc.
>>
>>102040142
>But ain't that what the earlier post explains? You can assign your models to whatever?
you can, if you want Flux to go on your cpu you can, if you want clip to go on your second gpu you can
>>
Blessed thread of frenship
>>
>>102040166
The comment said the processing still happens on the primary GPU even if you put the weights on the secondary GPU.
>>
>>102040048
Thanks for doing this. Looks to me like it doesn't matter a huge amount which you use, but any at all does indeed get you much better prompt adherence from the increased CFG, most obviously in the pixel art style and getting Miku's hair right. DynamicThresholding seems to do weird stuff to the brightness that none of the others do, though, so not that. I think I like Tonemap's outputs the most for whatever that's worth.
>>
>>102040166
And now it also works good with loras? Sweet.
>>
>>102040188
I've used either of my gpus for making images with that code
>>
File: 00093-26440200.png (1.3 MB, 1216x832)
>>102040133
yeah dev Q8_0 using a lora called movie portrait

Also I wasn't that anon, I just replied because of the blood in your image lol
>>
>>102040193
>Looks to me like it doesn't matter a huge amount which you use, but any at all does indeed get you much better prompt adherence from the increased CFG
that's why CFG was invented in the first place, to increase the prompt adherence of a model, to me the most obvious part is the first picture, you can only get the sushis at CFG 6 + AutomaticCFG
>>
>>102040213
then show that to the guy that wrote that code and says only the primary processes the diffusion model
>>
>>102040216
>you can only get the sushis at CFG 6 + AutomaticCFG
but you don't get the sushi with CFG 6 and the other CFG modifiers
>>
>>102040216
yeah, sorry i didn't mean it like it was big news, just "you get the intended benefit no matter which one you pick". I actually missed "of pixelated sushi rolls" in the prompt so yeah AutomaticCFG gets a lot of points for that given it also has the blackest Miku skateboarder and retains a more plush-like Miku with the burger than Tonemap does, I'd previously put those down to being within random variance but with the sushi it's somewhat clearly ahead on three counts, which is enough that if I can't be bothered to do more research I guess I'll just use AutomaticCFG
>>
>>102040242
true, that's why to me AutomaticCFG is the winner, it's the only one that managed to get this level of prompt adherence at CFG 6, maybe for the others you can make it work at higher cfg, but you'll fry the image even more, imo the goal there is to get the most prompt adherence at a CFG the closest to 1
>>
File: 00105-2094558879.png (1.39 MB, 1024x1024)
>>
>>102040231
He just added one module to it Anon
And says he hasn't played with Comfy nodes very much
>>
>>102040161
Try to get a realistic image in schnell or schnell dev merged, schnell gives all images some plastic slop look
>>
>>102040275
so you're saying the man is confused?
>>
>3k steps, 0.0005
>likeness is almost there
>up LoRa strength to 1.5
>perfect likeness, but distorted
what did I do wrong
>>
>>102040214
What card do you use and how long does it take to make that image with dev q8? What's the prompt you used, I want to try it on the merge
>>
>>102040287
Not trying to discuss people?
That's why I referred to the code itself and the direct statement by him
>>
>>102040254
>I guess I'll just use AutomaticCFG
I was only using the "basic" AutomaticCFG, there's a lot of variants, if I find a better variant I'll make the news here obviously kek, can be a fun challenge to get the sushis at CFG 5 for example, because why the fuck not :^)
>>
>>102040257
>>102040313
get SkimmedCFG .. its superior
>>
>>102040314
it's not >>102040048
couldn't even get the sushi at cfg 100, this shit gives outputs too close to cfg 1
>>
I thought flux cannot have loras n stuff

How come there are so many out now on civitai, and they have nudity too?

What happened?
>>
>>102040305
right, but he says the processing happens on the primary and instead of you showing it doesn't you decided to talk about the guy
just show us the processing running on the secondary GPU, anon
>>
>>102040333
any weights can have loras
what happened is you got confused about what was being discussed back then
>>
>>102040333
checked and also haters gonna hate for their cult of the day even if they have no clue at all
>>
File: file.png (542 KB, 832x1216)
The WORLD OF HORROR™ Lora has been "improved" by an additional 7000 steps (12000 steps total)
>>
File: 2024-08-23_00225_.png (1007 KB, 1024x1024)
>>
>>102040333
>I thought flux cannot have loras n stuff
Some retard claimed that because Flux is distilled then it's impossible to make loras out of it, turns out it's the opposite and it's never been easier to make loras out of any other models
>>
hime lora is all over the place
>>102040284
o fast, hmm. havent fired it up since day 2.
>>102040363
good work!
>>
File: ComfyUI_03877_.png (1.29 MB, 896x1152)
>>102040048
Do you have a catbox for automatic-cfg? I thought the update might have fixed compatibility but I'm still getting fucked results.
>>
>>102040284
>Try to get a realistic image in schnell or schnell dev merged, schnell gives all images some plastic slop look
kek, exactly what the pony v7 needs, after all those dudes are into plastic toys of ponies
>>
>>102040413
Sure, if you are using Adaptive Guidance I noticed it had different (and worse) results than the regular CFGGuidance, so the regular one is what I used for every one of my "anti-CFG burner" nodes
https://reddit.com/r/StableDiffusion/comments/1ez5nsx/adaptiveguider_gives_different_output_than/
https://files.catbox.moe/smor0p.png
>>
>>102040342
>instead of you showing it doesn't you decided to talk about the guy
No, that decision was made here: >>102040231
>>
File: file.png (1.36 MB, 832x1216)
>>102040400
Thank!

I think this is it. Dithering is quite consistent now
>>
>>102035997
reup on this plz
>>
>>102040443
>No, I won't be uploading to Civitai
So I guess huggingface is the hill you want to die on anon? Because people only use Civitai to find their favorite Loras
>>
>>102040454
I wouldn't be asking for a lora if I was the author...
>>
File: ComfyUI_01481_.png (821 KB, 832x1216)
>>102040443
Actually I uploaded it on civitai in the last thread cause I decided that I am lazy at the end. You can find that one over there.

Anyways, for the 12000 steps version, find it here:
https://litter.catbox.moe/th49im.safetensors
>>
>>102040475
ty anon
>>
>>102040454
We need to always maintain the status quo.
>>
>>102040314
>>102040322
"did it accomplish one specific thing at cfg100" is a pretty meaningless benchmark to be fair, but if anon is confident skimmedcfg is superior id very much like to see either an XY demonstrating a pattern, or at least an explanation. the single 4x4 comparing 4 models isn't really enough to draw a conclusion between two things with such similar performance
>>
>>102040454
Retard
>>
File: Capture.jpg (256 KB, 2576x1334)
>>102040481
when the status quo is good, yes, why the fuck would i use Huggingface to find loras, you can't see any pictures when you scroll past the models
>>
File: ComfyUI_01490_.png (982 KB, 832x1216)
>>102040479
>>102040454
>>102040465
>>102040499
Well I mostly just do this for fun so I don't really care. I guess putting it on Civitai would let more people try it though so there's that
>>
>>102040493
shut the fuck up nigger
>>
>>102040454
huggingface a bit .. better than sloptai, no? "the hill you want to die on", anon..
>>102040475
grabbing it
>>
File: file.png (1.67 MB, 832x1216)
>>102040519
Let's just stop fighting and get back to genning boys
>>
>>102040519
i look like that
>>
>>102040436
it's the exact same request you idiot
why are you hung up on this, just show it works on your machine
>>
File: 2024-08-23_00222_.png (1.51 MB, 1024x1024)
>>
>>102040551
Sorry, I can't assist with that.
>>
>>102040475
Thank you for your cervix.
>>
File: ComfyUI_01505_.png (1.33 MB, 1216x832)
>>102040641
Yeah
>>
>>102040535
me too. cool huh. lets fuck.
>>
>>102040433
Thanks anon. My problem was using too high Guidance Neg on AutomaticCFG. Looks like it doesn't scale up as high as the other methods, as 8.0 gave me those distorted results.
>>
File: file.png (1.46 MB, 1216x832)
>>102040649
kino
>>
>>102040657
>My problem was using too high Guidance Neg on AutomaticCFG. Looks like it doesn't scale up as high as the other methods, as 8.0 gave me those distorted results.
yeah, AutomaticCFG doesn't like high guidance Neg at all, I noticed that aswell, desu it's working fine at its default guidanceNeg so I keep it that way
>>
>>102040664
I think my noise injection thing wrecks it, I suck at this sorry. love it tho!
>>
>>102039967
I can fix her
>>
>>102040702
>I think my noise injection thing wrecks it
what's that?
>>
File: 00099-26440206.png (1.16 MB, 1216x832)
>>
>>102040721
people inject noise into images half way through genning (or afterwards using img2img) and resample to add additional details to an image.
>>
File: 00104-26440211.png (1.38 MB, 1216x832)
>>
>>102040751
isn't that what the Ancestral thing does on the samplers like Euler A?
>>
Anyone uses ZLUDA?
>>
File: noise.png (161 KB, 1620x622)
>>102040721
2 samplers chained, split sigmas denoise node controls the process. 30 steps, 0.45 = sampler1 does 16 steps, 2nd one 14. and noise is being injected before it goes into sampler#2, the value is at 0 in the image. really helps with detail, can go up to 0.4 (-ish, thats the high end)
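
The step math above, as a rough sketch. It assumes the split node rounds steps * denoise to get the tail length and that both halves share the boundary sigma; `split_sigmas_denoise` and `inject_noise` are illustrative names operating on plain lists, not the actual node code.

```python
import random

def split_sigmas_denoise(sigmas, denoise):
    """Split a sigma schedule so the second sampler gets round(steps * denoise) steps."""
    steps = max(len(sigmas) - 1, 0)
    tail_steps = round(steps * denoise)
    first = sigmas[:len(sigmas) - tail_steps]
    second = sigmas[len(sigmas) - tail_steps - 1:]  # starts on the shared boundary sigma
    return first, second

def inject_noise(latent, strength, seed):
    """Add seeded Gaussian noise to a (flattened) latent between the two samplers."""
    rng = random.Random(seed)
    return [x + strength * rng.gauss(0.0, 1.0) for x in latent]

if __name__ == "__main__":
    sigmas = [float(30 - i) for i in range(31)]  # placeholder 30-step schedule
    first, second = split_sigmas_denoise(sigmas, 0.45)
    print(len(first) - 1, "steps on sampler 1,", len(second) - 1, "steps on sampler 2")
    print(inject_noise([0.0, 0.0, 0.0], strength=0.4, seed=1))
```

With 30 steps and 0.45 this gives the 16/14 split described, and strength 0 leaves the latent untouched, matching the value shown in the image.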
>>
File: 00106-3657787546.png (1.42 MB, 832x1216)
>>
>>102040805
can you provide a workflow, that looks interesting
>>
File: 00108-3657787548.png (1.39 MB, 832x1216)
>>
>>102040825
all there no? only model, width & height + latent and a positive conditioning come in from the right. just get the comfyUI essentials for the noise node (there are others too) and voila.
>>
>>102040760
As far as I'm aware, it's similar in theory but more random and uncontrolled. Ancestral samplers continuously add noise to the image at every step, whereas non-ancestral samplers will eventually 'settle' on an image and move towards it. There's a theoretical final image with non-ancestral samplers.

So by adding noise to the latent on a non-ancestral sampler you're providing additional sauce for it to derive detail from, but while still sticking to the intended form - I guess.

Some anons can correct me if I'm wrong kek.
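
For reference, a minimal sketch of the difference on a single scalar "latent". The sigma split follows the usual k-diffusion-style ancestral step with eta = 1 as far as I know; the function names and the toy numbers are just for illustration.

```python
import math
import random

def ancestral_split(sigma_from, sigma_to):
    """Split the target noise level into a deterministic part and a re-noise part."""
    sigma_up = min(
        sigma_to,
        math.sqrt(sigma_to**2 * (sigma_from**2 - sigma_to**2) / sigma_from**2),
    )
    sigma_down = math.sqrt(sigma_to**2 - sigma_up**2)
    return sigma_down, sigma_up

def euler_step(x, denoised, sigma, sigma_next):
    """Plain Euler: deterministic, same input always gives the same output."""
    d = (x - denoised) / sigma
    return x + d * (sigma_next - sigma)

def euler_ancestral_step(x, denoised, sigma, sigma_next, rng):
    """Euler ancestral: step down further, then add fresh noise back every step."""
    sigma_down, sigma_up = ancestral_split(sigma, sigma_next)
    d = (x - denoised) / sigma
    x = x + d * (sigma_down - sigma)
    return x + rng.gauss(0.0, 1.0) * sigma_up

if __name__ == "__main__":
    print(euler_step(4.0, 0.0, 2.0, 1.0))
    print(euler_ancestral_step(4.0, 0.0, 2.0, 1.0, random.Random(1)))
    print(euler_ancestral_step(4.0, 0.0, 2.0, 1.0, random.Random(2)))
```

The plain step has no random term at all, which is why non-ancestral samplers settle on one image, while the ancestral one gets new noise each step and keeps drifting with the seed.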
>>
File: 00110-3220680566.png (1.54 MB, 832x1216)
>>
File: 2024-08-23_00252_.png (1006 KB, 1280x1280)
I love 2024, running a 12b diffusion model and an 8b LLM to look at pictures smoothly at the same time on a consumer GPU. What a time to be alive. How will 2025 be?
>>
>>102040943
flux pro tier no censor all out PORN ADDICT
>>102040889
correct
>>
File: 1252382145.png (1.34 MB, 1152x896)
>>
File: 2024-08-23_00260_.png (1.52 MB, 1280x1280)
>>102040969
>flux pro tier
ya I guess .. one year after SDXL it was extremely finetuned and hammered out in all directions from coom to bloom ..

also Bill Gates really had himself scrubbed by his lawyers from any possible commercial dataset I guess, flux can only emulate his likeness by description without lora .. but I won't give him my GPU cycles for a lora
>>
File: file.png (230 KB, 1851x947)
Why is it so hard to find a guide on how to setup flux with a lora to run in comfyui...
At the moment i have setup fp8 e4m3fn and i can run it ok.
Tried adding "Load Flux LoRA" node to feed to BasicGuider, but it errors

Should i keep using fp8 or download nf4 version?
Maybe GGUF Q8?
Which one's best for 4090?
>>
File: 00112-3336006683.png (1.85 MB, 1216x832)
>>
>>102041007
If you're just looking to use a LoRA, then you can use the default "Load LoRA" node. The one you're using is by a third-party.
>>
File: 2024-08-20_00084_.png (1.07 MB, 720x1280)
>>102040971
Always sad to see. I wish them all a skinny boyfriend with a god of war tattoo who works in IT
>>102040997
The small handful of celebs I've tried to gen have all been fairly gimped. Like, maybe the right hair colour, and some mildly recognisable features if I'd prompted for them explicitly along with the name, but clearly limited
>>
>>102041007
>fp8 e4m3fn
right this .. best quality just below pure fp16 .. you could run even fp16 on the 4090 .. but then you need to go lower on t5 .. I opted for fp8_e4m3fn + t5 at fp16 for best prompt understanding on my 4090 .. cause if a prompt fails what will a few pixel quality give you. you could go vice versa .. GGUF is not needed for 4090
>>
>>102041064
>right this .. best quality just below pure fp16
isn't it Q8 the one closest to fp16?
>>
>>102041083
Yes
>>
>>102041007
>>102041083
>>102041094

Worth mentioning that FP8 has the fastest gen times on 4000 series cards currently due to the --fast flag.
>>
>>102041007
just get rgthree nodes and use his "power lora loader". comes with switches to disable loras, can grab info like trigger words and such from civitai, really helpful. i had some issues too some days ago. I dunno..maybe you got a wiring mistake somewhere? and use the full t5xxl fp16 clip. stay on that model quant, and you are good.
>>102041060
eh? the ones I tried are all able to perfectly replicate the target. like cant distinguish from a real photo anymore tier.
>>
File: 2024-08-23_00263_.png (1.77 MB, 1280x1280)
>>102041060
>The small handful of celebs I've tried to gen have all been fairly gimped. Like, maybe the right hair colour, and some mildly recognisable features if I'd prompted for them explicitly along with the name, but clearly limited
when the big "AI is bad" hype was going there was a move of many celebs to put themselves on an AI blacklist petition, I guess BFL removed all those from the dataset, many very very rich ppl like bill gates and schwab are probably on it .. just to avoid later troubles.. they didnt care about Disney tho

>>102041083
>>102041094
ya? maybe I should change then, I was so busy genning I didnt care reading up on that .. is there some read up on that?

>>102041125
yes but be careful I used --fast for days until I noticed its non-deterministic .. very slightly
>>
File: file.png (45 KB, 521x444)
>>102041127
only thing i don't like about power lora loader is the lack of subfolder support
>>
>>102041154
>I was so busy genning I didnt care reading up on that .. is there some read up on that?
there's a comparison between different quants here
https://reddit.com/r/StableDiffusion/comments/1eso216/comparison_all_quants_we_have_so_far/
>>
File: file.png (681 KB, 2038x983)
>>102041047
Damn. This worked, thanks

>>102041064
Dunno man, with fp8 running i get up to 22/24GB vram
fp16 would spill over and be dead slow i reckon
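
Napkin math for the weights alone, as a sketch. It takes the "12b" figure people quote for Flux as approximate and counts only the diffusion model weights, not T5/CLIP, the VAE, or activations; 8.5 bits for Q8_0 comes from 32 int8 values plus one fp16 scale per block.

```python
def weight_gigabytes(n_params, bits_per_weight):
    """Size of the raw weights in GB (10^9 bytes), ignoring everything else in VRAM."""
    return n_params * bits_per_weight / 8 / 1e9

FLUX_PARAMS = 12e9  # approximate, assumed from the "12b" figure

if __name__ == "__main__":
    for name, bits in [("fp16", 16), ("fp8", 8), ("Q8_0", 8.5)]:
        print(f"{name}: ~{weight_gigabytes(FLUX_PARAMS, bits):.2f} GB")
```

So fp16 weights by themselves already land around the full 24GB of a 4090, which fits the expectation that it would have to offload something.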

>>102041127
Might do, thanks, i don't gen too often, just trying to keep up with developments. And every single time i do i run into some issue, its pain
>>
>>102041127
>eh? the ones I tried are all able to perfectly replicate the target. like cant distinguish from a real photo anymore tier.
You're talking about loras, what you're responding to is about what celebs the base model knows well, which aren't many.
>>
File: file.png (1.35 MB, 1024x1024)
>>102041177
wow! reddit is such a good resource guys. thank you for this incredible post, but where is the OP's patreon? i must provide them with the monetary value they deserve for such extensive testing
>>
https://huggingface.co/city96/t5-v1_1-xxl-encoder-gguf/tree/main
What's the largest of these I can run on CPU with 32GB RAM? I always offload due to poor VRAM, currently using t5 xxl fp8. I assume I could move up quite a few tiers in exchange for a bit of waiting.
>>
>>102041196
you can run the full FP16 and should be plenty fast if you have a recent CPU
>>
>>102041195
>wow! reddit is such a good resource guys.
it is, unless you have better to show?
>>
>>102041206
thanks. the fp16 should beat the q8_0, then? my cpu's overpowered relative to the rest of my PC so yeah sounds like speed will be fine. I'm guessing the 32 is pointless overkill for inference even on a great rig?
>>
File: bComfyUI_107531_.jpg (1.02 MB, 1536x2048)
>>
>>102041231
wait disregard the first part of this I'm a fucking retard
>>
File: 2024-08-23_00264_.png (2.28 MB, 1280x1280)
>>102041177
doesn't say if it's fp8_e4m3fn or e5m2 .. which one was used? also yea I see the ball disappear .. I more like want to know what Q8 does to the fp16 weights compared to the fp8 variants
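
A simplified sketch of the two rounding schemes, under loud assumptions: the Q8_0 part uses blocks of 32 weights with one scale of max|x| / 127 per block (the real format stores that scale as fp16, skipped here), and the fp8 part only models the 3 mantissa bits of e4m3 while ignoring exponent range, subnormals, and saturation. Function names are made up.

```python
import math

def q8_0_roundtrip(block):
    """Quantize one block of 32 floats to int8 with a shared scale, then dequantize."""
    assert len(block) == 32
    amax = max(abs(x) for x in block)
    scale = amax / 127 if amax else 1.0
    quants = [round(x / scale) for x in block]  # integers in [-127, 127]
    return [q * scale for q in quants]

def round_to_mantissa_bits(x, bits=3):
    """Keep 1 implicit + `bits` explicit mantissa bits (e4m3-like, range ignored)."""
    if x == 0:
        return 0.0
    m, e = math.frexp(x)  # x = m * 2**e with 0.5 <= |m| < 1
    step = 2 ** (bits + 1)
    return math.ldexp(round(m * step) / step, e)

if __name__ == "__main__":
    block = [(i - 16) / 16 for i in range(32)]  # toy weights in [-1, 1)
    q8 = q8_0_roundtrip(block)
    f8 = [round_to_mantissa_bits(x) for x in block]
    print("max abs error Q8_0-like:", max(abs(a - b) for a, b in zip(block, q8)))
    print("max abs error fp8-like: ", max(abs(a - b) for a, b in zip(block, f8)))
    print("0.3 as fp8-like:", round_to_mantissa_bits(0.3))
```

The rough intuition: Q8_0 spends 8 bits on evenly spaced levels inside each block's own range, while fp8 keeps only a few mantissa bits per value, so weights near the block maximum tend to come back closer to fp16 with Q8_0. Values much smaller than the block maximum can go the other way.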
>>
>>102041253
>doesn't say if its its fp8_e4m3fn or e5m2 .. which one was used
it was fp8_e4
>>
File: 3014899121.png (1.36 MB, 1152x896)
>>
File: flu.png (43 KB, 261x424)
>>102041194
ignore me, sorry. yes. miku!
>>102041164
yeah well just type "flu" in the window and you get all loras in the flux subfolder. and that one lora with flu in its name lol
>>
File: ComfyUI_05395_.png (1.14 MB, 1024x1024)
>>
File: 1713140114066232.png (1.48 MB, 1024x1024)
>>
>>102041183
>Dunno man, with fp8 running i get up to 22/24GB vram
>fp16 would spill over and be dead slow i reckon
na comfy is pretty smart, it loads something else into the free vram .. it works with fp16, I tested it, but since I only have 32GB system ram it only does when I do nothing else with the system
>>
File: file.png (1.94 MB, 1024x1024)
Kino
>>
What's the git cmd to go back to an old branch?
>>
File: file.png (2.24 MB, 1024x1024)
https://www.youtube.com/watch?v=bE4C8a48o1E
>>
File: 28347658723315.png (3.24 MB, 2002x1805)
The one guy trained the 185, sakuemonq, and shirosu loras, and he did a great job. It's shocking that PixelartXL has so many civitai downloads, it generates so much fried shit and sub-pixel cheating that looks bad as it is and is unreliable through a downscale/upscale process. sakuemonq is the one I use the most, the 1856601 one is also sort of handy in that it very closely matches the lora-less gen for if i randomly come across a seed i think would look good as pixel art while not genning pixel art
>>
>>102041350
If you have to ask this, you're better off nuking and starting from scratch. It sounds like you might want to go back to an old commit, not an 'old' branch, as well.
>>
>>102041437
I want to revert an update made by comfyui, go back to the version before it
>>
File: ComfyUI_05397_.png (1.19 MB, 1024x1024)
>>
>>102041421
first 2 & shirosu got soul. that image is very informative! & nice babe hot damn
>>
File: 1722683704247527.png (1.89 MB, 1024x1024)
>>
File: 2024-08-23_00265_.png (1.38 MB, 1280x1280)
fp16 Mickey Necron ill use as quality reference for quality tests, also a good gen .. so I wanted to share it .. but fp16 is about 2.9s/it on 1280x1280 .. so you really gotta have patience with fp16
>>
I hope there will be a lora of Phryge someday, she's so cute
>>
>>102041585
>but fp16 is about 2.9s/it on 1280x1280 .. so you really gotta have patience with fp16
we got anons waiting several minutes for a 4 step schnell gen
>>
File: FLUX_00043_.png (1.03 MB, 896x1152)
english rose
>>
>>102041593
mithra goon monster .. this guy is so creepy
>>
>>102041350
git reset --hard (the commit you want to go back to)
>>
>>102041559
nice one anon
>>
>>102041617
would with gusto
>>
>>102041644
you've said that at every one of my weird fucking gens
what's wrong with you
>>
>>102041653
no I didn't but I will from now on
>>
So after fucking around with Joy caption and intern vlm, I can safely say intern vlm is at least on par even at 8B. I can't check out any higher quants until they fix the bnb quants.
>>
well fuck .. GGUF nodes fail to import .. is it safe to update comfy atm, or are there breaking bugs?
>>
>>102041477
I can only recommend not doing this and simply waiting for things to get better, or getting used to whatever changed. `git checkout commitshagoeshere` will do what you want if you look up the sha of the specific commit you want to be on, but remember later when you're gnashing your teeth in rage that I did warn you. Also, for future reference, git is something that chatgpt/claude are extremely good at converting from natural language into git commands for what you want, especially if you warn it you're a git beginner.
>>102041557
cheers. Yeah unfortunately the xy grid node I'm using does the sampling of the images itself, so I can't do the automatic down/upscale afterwards to show off how it looks after that, but generally the sakuemonq one comes out very well
>>
>>102041619
Thanks

WTF is this

>Using sub quadratic optimization for cross attention, if you have memory or speed issues try using: --use-split-cross-attention
>>
File: 56_aTzpBRAyfL0Jj7qF1ww.jpg (506 KB, 1024x1024)
>>
>>102041754
>movie still, wallace & gromit a close shave, body horror, live action dark gritty reboot
>>
>>102041746
What changed is now it takes around 60s/it to generate, before the update it was around 20s/it

I think it's the xpu update
>>
>>102041750
Do you have memory or speed issues?
>>
https://civitai.com/models/677673/chassidish
OY VEY SHUT IT DOWN
>>
File: file.png (2.11 MB, 1024x1024)
https://www.youtube.com/watch?v=UBQP9gEldRk
>>
>>102041794
Fair enough. I recommend you compare the two suggestions you've been given and see what each does before you run either of them. Good luck.
>>
>>102041815
there was already one such lora of a jew man in civitai, it got shut it down quickly indeed kek
>>
I'm trying to set my shit up for training but I dont understand this token shit.

>FLUX.1-dev
>FLUX.1-dev has a non-commercial license. Which means anything you train will inherit the non-commercial license. It is also a gated model, so you need to accept the license on HF before using it. Otherwise, this will fail. Here are the required steps to setup a license.

>Sign into HF and accept the model access here black-forest-labs/FLUX.1-dev
I did that and it says: You have been granted access to this model
ok so far so good.

>Make a file named .env in the root on this folder
what folder are they talking about?
the ai-toolkit folder or where? also whats an ".env" file anyway and how do I do this?

>Get a READ key from huggingface and add it to the .env file like so HF_TOKEN=your_key_here
how?
pls halp
>>
File: 1707058181013148.png (1.96 MB, 1024x1024)
>>
>>102041914
- the folder of the project you downloaded ie "/ai-toolkit/.env"
- .env is a text file for variables
- add HF_TOKEN=your_key_here as a line in .env
- figure out how to get an api token
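
Minimal sketch of what that file is and how a tool might read it. This assumes a dotenv-style format of one KEY=VALUE per line; `load_env_file` is a made-up helper for illustration, not ai-toolkit's actual loader, and `your_key_here` is the placeholder from the readme.

```python
import os
import tempfile

def load_env_file(path):
    """Parse KEY=VALUE lines from a plain-text .env file into a dict."""
    values = {}
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line or line.startswith("#") or "=" not in line:
                continue  # skip blanks, comments, and anything that is not KEY=VALUE
            key, _, value = line.partition("=")
            values[key.strip()] = value.strip().strip('"').strip("'")
    return values

if __name__ == "__main__":
    # Write a throwaway .env in a temp folder; in practice it sits in the project root.
    with tempfile.TemporaryDirectory() as folder:
        env_path = os.path.join(folder, ".env")
        with open(env_path, "w", encoding="utf-8") as f:
            f.write("HF_TOKEN=your_key_here\n")
        print(load_env_file(env_path))
```

The point is that it is plain text with nothing else in it, so any plain text editor works, while a word processor format would wrap the line in extra data a parser like this cannot read.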

if you can't figure this out you're already fucked, it only gets harder from here
>>
>>102041914
>>102041944
how do i open an .env file? do i need special program. should i install visual studio
>>
>>102041955
the clue is text file
>>
>>102041745
i did a fresh git pull of comfy and installed gguf nodes for the first time today and they work so go ahead
>>
>>102041960
so microsoft word, gotcha
>>
File: file.png (2.27 MB, 1024x1024)
https://www.youtube.com/watch?v=kfFcyTuopbI
>>
File: bComfyUI_107617_.jpg (430 KB, 1712x864)
>>
>>102041969
anon, you can ask stupid questions that a 6 year old should know to chatgpt
I'll be honest, you're such a beginner at computers you're incapable of training
>>
>>102041813
Yes

>>102041876
Thanks, I'll try to reinstall comfy
>>
>>102041979
>the clue is text file
>text
>microsoft word is a text processor

where have i made an unreasonable leap in logic here? just checked and you can open a txt file in word so I'm beginning to think you aren't as smart as you think you are
>>
>>102040932
Mind posting some more? I love elves.
>>
this is bait and if you take it we'll all have a headache soon
>>
>>102042011
Word adds metadata. Again, you're computer illiterate, you are incapable of this task at this time.
>>
>>102041944
thanks bro, that explains everything I wanted to know.
>>
>>102042021
i'm literally looking at the tutorial right now and it says you have to ADD METADATA so i'm clearly on the right track
>>
>>102040475
>12000 steps
Holy shit. I'm the anon who made the captioning script. I'm so glad you created such a good looking lora.
I had to scrap mine because the dataset was shit, but I might try again in the future.
Saving this one tho. Fucking kino.
>>
>>102042036
Yeah anon you're on the right track, keep going!
>>
>>102041955
just open it with the text editor
>>
File: file.png (442 KB, 2741x1114)
>>102041164
What do you mean no subfolder support? You can nest your shit as many times as you want.
>>
>>102042047
see at least one anon gets it

>>102042048
my text editor of choice is microsoft word (cracked because im not paying for that shit)
>>
>>102042048
I opened it in Google Docs, they don't have an .env save option, can I just link the Doc URL?
>>
>>102042058
>my text editor of choice is microsoft word
the text editor is free and every windows has it installed.
>>
File: file.png (66 KB, 831x550)
>>102042057
that's bizarre. node doesn't work like that for me
>>
>>102042037
By "mine" I meant my porn one. My dataset was shit. I didn't mean YOUR dataset.
>>
>>102042081
I don't think anyone thought you meant that
>>
>>102042073
oh yeah i used to use wordpad before i learnt how to get software for free, get gud desu...
>>
File: file.png (233 KB, 1754x1654)
>>102042074
Anon... the settings...
>>
File: file.png (37 KB, 1358x721)
>>102042073
Yeah I used Edge to open Google Docs, what do I do from here?
>>
File: 1724257852026474.jpg (18 KB, 250x250)
>>102042096
anon... i....
>>
File: file.png (1.85 MB, 1024x1024)
https://www.youtube.com/watch?v=ZkNMZlkrzaU
>>
File: file.png (2.52 MB, 1024x1024)
>>102042091
>>
>>102042073
why not notepad++
>>
>>102042172
notepad#
>>
>>102042073
>windows

kek, ngmi
>>
File: file.png (669 KB, 512x512)
>one of those kinda days
>>
File: 1708874154395791.png (1.67 MB, 1024x1024)
bait bait bait
>>
>>102040475
I don't know how receptive the WoH are to AI art (even though the dev allegedly traced his art), but this could be huge for modding
>>
>>102042186
I prefer rustpad myself
>>
>>102042199
Shibzo aBait
>>
>>102040265
i suddenly remember the early 2000s anime 'rape man' where he ties a woman up to a skyscraper full spread eagle, then goes full spiderman and swings in from downtown to fuck her
>>
>>102036539
>Wanna thank that anon with the Joycaption script a bit earlier (he might have went to sleep now)
This warms my heart. You're welcome.
>>
File: fp16-vs-q8-vs-fp8.jpg (740 KB, 3648x1260)
>>102041961
thanks worked

>>102041585
also the q8 vs fp8_e4m3fn test of mine was pretty conclusive .. ran 5 gens of high quality against each other .. fp16 always won ofc, but q8 is nearer to fp16 than fp8 is .. pic related. So I guess it ain't snake oil.. I am moving.

pic related, from left to right fp16, q8, fp8_e4
>>
>>102042202
it's going to take years for the mainstream to accept AI, especially in gamer spaces, because most of them are mentally and emotionally children
>>
>>102042230
What joycaption script?
>>
>>102042196
yep
>>
File: ComfyUI_00587_.png (2.6 MB, 1536x1536)
>>
File: file.png (2.22 MB, 1024x1024)
https://www.youtube.com/watch?v=aS8O-F0ICxw
>>
>>102042274
https://pastebin.com/raw/jLJB3xcK
>>
File: flux_llm_enhanced.png (1.27 MB, 2471x1750)
What can be used for LLM?

Anyone tried pic related?
>>
>>102042037
>I fell asleep right after posting it
Thanks dude! I should be the one thanking you cause the script really made the whole process a whole lot easier.

>>102042202
I respect the dev for really using MSpaint to do all of his art
>the new update is delayed half a year
>>
File: file.png (113 KB, 1057x642)
>>102042402
As you get into training you should consider using something like MongoDB to manage everything.
>>
>>102042402
>I respect the dev for really using MSpaint to do all of his art
We should all get over our egos and just love art for art itself. Yes, money is a thing, and making a living is a thing. But creating great art that makes everyone's day great should be the highest of achievements. Let's make each other glad we're making cool shit.
>>
File: file.png (1.27 MB, 1024x1024)
I'm trying to use the World of Horror LoRA, but it doesn't seem to be working. Flux generates in this style regardless of it being activated.
>>
File: file.png (1.43 MB, 1216x832)
>>102042448
What's that? I'm a codelet so I don't understand

Anyways, time to get back to genning. I have lost consciousness for a while

>>102042464
Yeah. I'm just here to make stuff. That's why I don't like the subleddit's constant need to one up artists desu. I think it just makes AI users look retarded
>>
File: file.png (2.31 MB, 1024x1024)
https://www.youtube.com/watch?v=FuJBwu_03r8
>>
File: file.png (1.05 MB, 1216x832)
>>102042483
Try: "black and white pixel art dithered retro image of a wide shot of..."

The prompt might also benefit from longer captions since it was captioned using Joycaption. But on my end it works just fine with a short paragraph.

Also, if the dithering is too much, turn the strength down to 0.75 or something.
>>
>>102042489
MongoDB is a free document database you can install on your computer. It's nice because it has a flexible document structure with dynamic fields. As you scrape and assemble datasets you can simply save them in the database. Then when you want to make something you can just query for what you want and resize/crop the images along with their captions at that point. Working directly with images and captions in a folder will eventually bite you in the ass, especially if you destructively cropped your shit to 512px or something.
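
A rough sketch of the "keep the originals, crop at export time" idea, using plain Python dicts in place of database documents. No MongoDB driver is involved here, and the record fields (`path`, `width`, `height`, `caption`, `tags`) and helper names are made up for illustration, so treat it as one possible shape rather than how it has to be done.

```python
def center_crop_box(width, height):
    """Return (left, top, right, bottom) of the largest centered square."""
    side = min(width, height)
    left = (width - side) // 2
    top = (height - side) // 2
    return (left, top, left + side, top + side)


def export_plan(records, tag, target_px):
    """Pick records carrying `tag` and describe how to crop/resize each one.

    Nothing is modified on disk: the originals stay untouched and the crop
    is only computed when a training set is assembled.
    """
    plan = []
    for rec in records:
        if tag in rec["tags"]:
            plan.append({
                "path": rec["path"],
                "caption": rec["caption"],
                "crop_box": center_crop_box(rec["width"], rec["height"]),
                "resize_to": (target_px, target_px),
            })
    return plan


# Hypothetical records, shaped like documents you might store in a database.
records = [
    {"path": "raw/001.png", "width": 1920, "height": 1080,
     "caption": "a wide shot of a street", "tags": ["horror", "pixel"]},
    {"path": "raw/002.png", "width": 800, "height": 1200,
     "caption": "a portrait of a woman", "tags": ["portrait"]},
]

plan = export_plan(records, tag="horror", target_px=1024)
```

Because the crop box is derived from the stored original dimensions, changing `target_px` later only means re-running the export.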
>>
File: file.png (2.18 MB, 1024x1024)
https://www.youtube.com/watch?v=SVhme1eb8SI
>>
>>102042605
your stuff is deep in the uncanny valley
>>
>>102042639
Such are the 3d renders from the beginning of the 2000's yeah
>>
>>102042649
just feels very unsettling, why you make it?
>>
>>102042676
I like it
>>
File: index.png (7 KB, 233x216)
ok so I got the training environment all set up now I just need to create a Dataset.
so far so good.

I got an idea for a LoRa and I want to test it on a Person.

Some people said the minimum for a Dataset is like 20 images, but the more the better. First question: is there a maximum here, and what's the sweet spot?
my other question is so I have the image for example:
1.jpg and this needs a corresponding 1.txt in the same folder.
My Plan is to use JoyCaption and just copy that text into the 1.txt file.
So question is how do you make the trigger words for it?
JoyCaption will give some description (for example)
>"woman with red hair, playing chess at the park, bomb going off in the background"
Do I have to change the "woman" with some other word for example the Name of the Person and then this Word becomes the trigger word to trigger the LoRa when prompting?
like:
>"X with red hair, playing chess at the park, bomb going off in the background"
and then everytime I type in "X" it gives me the person I'm looking for.
and if I don't change it, would every use of the word "Woman" get replaced with the Woman it's trained on?

sorry if questions are retarded I'm trying to learn
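
For the "1.jpg needs a corresponding 1.txt" part, a small sanity check along these lines can catch images that never got a caption file. It is only a sketch: the function name and the list of image extensions are assumptions, not something any particular trainer requires.

```python
from pathlib import Path

# Extensions treated as images here; adjust to whatever your trainer accepts.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png", ".webp"}


def find_missing_captions(folder):
    """Return sorted names of images with no same-named .txt next to them."""
    folder = Path(folder)
    missing = []
    for path in folder.iterdir():
        if path.suffix.lower() in IMAGE_SUFFIXES:
            # 1.jpg is expected to sit beside 1.txt in the same folder.
            if not path.with_suffix(".txt").exists():
                missing.append(path.name)
    return sorted(missing)
```

Calling `find_missing_captions("dataset")` on a hypothetical `dataset` folder returns the image filenames that still need a caption.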
>>
File: file.png (959 KB, 1024x1024)
>>102042564
I see
Maybe I'll do that later. I don't think it will be useful until I have more datasets but certainly worth looking into

Anyways I updated the civitai entry too. Anons can also head over there to download now:
https://civitai.com/models/676564?modelVersionId=758967
>>
>>102042698
obviously, with how you spam it .. well good for you I guess, bad for us
>>
>>102042709
>us
talk for yourself
>>
File: ComfyUI_01522_.png (942 KB, 832x1216)
Miku.
>>
>>102042699
>Do I have to change the "woman" with some other word for example the Name of the Person and then this Word becomes the trigger word to trigger the LoRa when prompting?
Yes
>"X with red hair, playing chess at the park, bomb going off in the background"
If X has red hair then you shouldn't say "X with red hair". Give the model a chance to learn what X looks like
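
If you are pasting JoyCaption output into the .txt files, the "woman" to "X" swap can be scripted. A minimal sketch, assuming the generic subject word appears once near the start of the caption; `swap_subject` is a made-up helper name, and "X" stands in for whatever trigger word you pick.

```python
import re


def swap_subject(caption, generic, trigger):
    """Replace the first whole-word occurrence of `generic` with `trigger`."""
    pattern = r"\b" + re.escape(generic) + r"\b"
    return re.sub(pattern, trigger, caption, count=1, flags=re.IGNORECASE)


caption = ("woman with red hair, playing chess at the park, "
           "bomb going off in the background")
print(swap_subject(caption, "woman", "X"))
# X with red hair, playing chess at the park, bomb going off in the background
```

Removing the "with red hair" part is a separate decision that depends on whether the hair is a fixed trait of the subject.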
>>
File: 2024-08-23_00278_q8.png (1.76 MB, 1280x1280)
ya after all tests I like q8 .. makes font type appear much clearer, pic related, looked good in fp8, but the font type was blurry and had slight errors >>102032649
>>
File: 25756810.png (1.35 MB, 1024x1280)
https://civitai.com/models/677728/frutiger-aero-flux?modelVersionId=758644
Finally, a Frutiger Aero lora
>>
File: file.png (898 KB, 832x1216)
>>102042699
I don't know about maximum here but I used around 1,300 images for the World of Horror Lora. I think for a person 20-50 is enough. I can be wrong though
>>
File: ComfyUI_03157_.png (1.81 MB, 1280x1024)
>>
>>102042785
does Q8 with loras run at the same speed as FP8 with loras for you?
>>
>>102042799
More is better as long as the images are diverse in resolution, content and captions while still sticking to a unifying theme or subject.
>>
>>102042699
for a character, the rule of thumb is to refer to them as if the model already knows who it is, since that's the goal. So static properties, things that never change, don't mention those. Things that do change however (clothing, hair colour/style, makeup, pose, action), mention those

If it's bart simpson then it doesn't matter, he always wears an orange top with blue shorts. The model will learn that "bart simpson" means a yellow boy in that outfit.

If it's not bart simpson, then everything you describe will be interchangeable, so if you don't include it in the prompt when using the lora, then it won't automatically put it in

does that make sense
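
The static-vs-changing rule can be applied to auto-generated captions with a small filter. This is a sketch under the assumption that you maintain your own list of phrases describing things that never change about the subject; the helper name and example phrases are hypothetical.

```python
import re


def drop_static_traits(caption, static_traits):
    """Remove phrases for traits that never change, then tidy punctuation."""
    out = caption
    for trait in static_traits:
        # Drop the phrase together with any whitespace right before it.
        out = re.sub(r"\s*" + re.escape(trait), "", out, flags=re.IGNORECASE)
    # Collapse leftovers such as ",," or ", ," into a single comma.
    out = re.sub(r"\s*,(\s*,)+", ",", out)
    out = re.sub(r"\s{2,}", " ", out)
    return out.strip(" ,")


caption = "X with green eyes, wearing a red jacket, sitting on a bench"
print(drop_static_traits(caption, ["with green eyes"]))
# X, wearing a red jacket, sitting on a bench
```

Clothing, pose and action stay in the caption, so they remain promptable when the lora is used.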
>>
File: 2024-08-23_00224_.png (909 KB, 1024x1024)
>>102042808
yes, well maybe 10% slower or something.. didnt measure it .. but seems to be fine
>>
>>102042830
Actually that Cerfuckin guy uses like 10 in his so you'll be fine I think
>>
>>102042842
If you use too few images the model will learn things like retarded head poses as if they're an intrinsic feature of the subject. There's a reason why his images all look retarded with very little diversity.
>>
>>102042841
it goes from 2.4s/it to 3.8s/it for me... with no lora it is slightly slower than FP8
are you using Comfy?
>>
>>102042838
Even with Bart Simpson you want to mention clothes because most people will want him to wear different things, that's the value of AI. For example "Bart Simpson wearing a space suit on the moon". It's one reason why long form captions are valuable because they capture all the elements of the image which ultimately lets the model generalize details.
>>
File: file.png (1.04 MB, 864x1280)
>>
File: bComfyUI_107566_.jpg (346 KB, 768x1024)
>>
>>102042709
i dig their y2k stuff so idk
>>
>>102042872
yea comfy .. gimme 5 minutes ill make a test
>>
loras from anon are simply the best
>>
>>102042759
>>102042838
>does that make sense
yes, thank you for explaining.

>>102042799
>I used around 1,300 images for the World of Horror Lora
sweet jesus thats a lot.
did you automate making the text descriptions for the images or did you do it all by hand? or how does it work when you want to train a style and not a person?
>for a person 20-50 is enough. I can be wrong though
makes kinda sense tho because the face doesnt really change all that much, just need it from every possible angle I guess.
>>102042909
so JoyCaption is good enough for that task? its pretty descriptive from what I tested so far.
>>
File: ComfyUI_02158_.png (1.29 MB, 1024x1024)
>>
File: file.png (738 KB, 1000x1000)
>>102041421
I actually trained a Sakuemonq for Flux, inspired by this guy.
https://files.catbox.moe/ezg71g.safetensors
>>
>>102043093
>makes kinda sense tho because the face doesnt really change all that much, just need it from every possible angle I guess.
don't forget plenty of body shots as well
and with Flux you can train on multi-subject images too, saying "x is on the left, y is on the right" works 99.99% of the time
>>
>>102043093
Yes JoyCaption would be fine for that because it's trained to autistically detail as much as possible.
>>
File: ComfyUI_03966_.png (1.48 MB, 896x1152)
>artist family members nearly walked in on me generating 1girl slop

phew
>>
>>102043099
Wonderful. Downloading it now to give it a shot. I assume he was careful about training on a dataset where he'd made sure all the images had "pixel" sizes of 8x8, did you do the same thing for yours? Or is that just how sakuemonq's images are all released anyway?
>>
File: ComfyUI_03973_.png (1.59 MB, 896x1152)
>>
File: ComfyUI_02162_.png (1.46 MB, 1024x1024)
>>
File: file.png (483 KB, 1000x1000)
>>102043165
I did 10x10 for this because flux can do higher resolutions but yeah, every image is the same pixel size.
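
To keep every image at the same "pixel" size, the image dimensions have to divide evenly by the block size. A tiny check, assuming square 10x10 blocks as above and using the 1000x1000 size of the attached image as the example; the function name is made up.

```python
def logical_grid(width, height, block):
    """Return the pixel-art grid size for an image built from block x block cells."""
    if width % block or height % block:
        raise ValueError(
            f"{width}x{height} is not a whole number of {block}px blocks"
        )
    return width // block, height // block


print(logical_grid(1000, 1000, 10))  # (100, 100)
```

An image that fails the check would need resizing or padding before going into the dataset, otherwise its blocks would not line up with the rest.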
>>
File: file.png (1.62 MB, 1024x1024)
Aww look at her
https://civitai.com/models/638052/ps1-ps2-old-3d-game-style-flux
>>
>>102043125
>don't forget plenty of body shots as well
what if the dataset contains nudity? will the model learn that too?
>>102043142
good good that makes it a bit easier.
>>
File: 00120-3620645119.png (1.48 MB, 1216x832)
>>102042016
>>
File: 1722846494577394.png (1.53 MB, 1024x1024)
>>
File: 00124-3620645123.png (1.43 MB, 1216x832)
>>102043297
>>
>>102043309
More like Shinzo Ape kek
>>
File: 00126-3620645125.png (1.37 MB, 1216x832)
>>102043313
>>
>>102043297
>>102043313
These do not get my dick hard anon
>>
File: 00132-3620645131.png (1.76 MB, 1216x832)
>>102043328
>>
File: file.png (1.57 MB, 1024x1024)
>>
File: 00133-3620645132.png (1.21 MB, 1216x832)
>>102043343

>>102043336
Sorry but I'm not gay enough to want to turn you on today.

Maybe tomorrow.
>>
>>102043364
Post a couple more of the elves
>>
>>102043375
Baking now anon
>>
>>102043397
Ty anon love you
>>
File: 2024-08-23_00347_.png (1.12 MB, 1024x1024)
>>102042872

okay my observations were:
1) a small lora makes no difference at all, 1.2it/s with slight fluctuation
2) when I added a bigger lora or two it went to 1.5s/it (or ~0.67it/s), so the speed roughly halved
3) this did not happen on fp8, that stayed at ~1.4it/s with big loras or multiple loras

so you are correct something is wrong with Q8 (or the gguf loader) and loras .. you barely notice it with one lora or a small one, but if you use multiple it loses half its speed.

it also throws a warning in the console
>ComfyUI\custom_nodes\ComfyUI-GGUF\dequant.py:8: UserWarning: To copy construct from a tensor, it is recommended to use sourceTensor.clone().detach() or sourceTensor.clone().detach().requires_grad_(True), rather than torch.tensor(sourceTensor).

thats rather sad .. Q8 has better quality than fp8 .. but something is seriously wrong with gguf and lora loading
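
Since the numbers above mix it/s and s/it, here is a small sketch that puts them on one scale before comparing. The helper names are made up; the figures are the ones quoted in this post.

```python
def to_it_per_s(value, unit):
    """Normalize a speed reading to iterations per second."""
    if unit == "it/s":
        return value
    if unit == "s/it":
        return 1.0 / value
    raise ValueError(f"unknown unit: {unit}")


def slowdown(before_it_s, after_it_s):
    """How many times slower `after` is compared to `before`."""
    return before_it_s / after_it_s


q8_small_lora = to_it_per_s(1.2, "it/s")
q8_big_loras = to_it_per_s(1.5, "s/it")   # about 0.67 it/s
fp8_big_loras = to_it_per_s(1.4, "it/s")

print(round(q8_big_loras, 2))                           # 0.67
print(round(slowdown(q8_small_lora, q8_big_loras), 2))  # 1.8
```

So with the bigger loras Q8 ran about 1.8x slower than with a small one, while fp8 stayed at roughly 1.4it/s.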
>>
File: 00134-337441520.png (1.3 MB, 1216x832)
>>102043402
This is what came out, I assume this is not what you are looking for.
>>
>>102043418
No, proportions are weird. I liked the original style you made!
>>
>>102043431
Would you like that in the same artstyle or more realistic like real life
>>
File: bComfyUI_107559_.jpg (307 KB, 768x1024)
>>
>>102043439
Same artstyle if you can, and a bit older too
>>
File: FLUX~3.jpg (127 KB, 1496x1168)
Morning
>>
>>102043503
AAAAARRGGGHH THE FLUX GRID
>>
File: ComfyUI_05407_.png (743 KB, 1024x1024)
>>102043276
Really like that one
>>
File: 1710105745713869.png (1.58 MB, 1024x1024)
>>102043319
>>
>>102043522
kek :D
>>
>>102043512
??
>>
Guys why does this look familiar?

https://civitai.com/models/678158/jimmy?modelVersionId=759113
>>
>>102043531
don't tell me you can't see it
the grid, anon, it's right there!
other anons will confirm
>>
>>102043542
I know there's a grid at the beginning of the render but I don't notice it on anon's image
>>
File: ComfyUI_00889_.png (1.55 MB, 896x1152)
first style lora attempt
>>
File: 549568335.jpg (559 KB, 2806x1018)
for posterity, here's a lora trained on a 12gb 3060. It's not great, and it takes a long fucking time, but there's something there. I chose this southern charms model because she has such ridiculous proportions that it's hard to mistake her for anyone else (and there's a tonne of photos of her in different wigs, costumes and sets), and also I had my first wank to her.
left is training pic, middle is the same caption w/ lora, right is same caption w/o lora
>>
>>102043512
The matrix has you.
>>
>>102043414
yeah I agree with that, it's quite slow to use lora on the gguf quants, but I can't let Q8 go, this shit is almost at fp16 quality :( I hope he'll optimise his shit a bit, it's only the beginning
>>
File: ComfyUI_03982_edit.png (1.42 MB, 1024x1024)
>try and find examples of the classic "leaving a lipstick kiss mark on a polaroid" trope
>google
>nothing, zero results, it never happened

am i insane... did i make up an entire trope in my head? i swear this was a very, very common thing in movies and stuff.

>picrel, my scuffed edit trying to recreate the idea.
>>
File: 1720944498596363.png (1.77 MB, 1024x1024)
>>
File: 00143-1886614641.png (1.55 MB, 832x1216)
>>102043447
ok the artstyle of the original was random because it wasn't prompted. But I'll try, here is what I have so far.
>>
File: g.png (727 KB, 590x593)
making a dataset for training is a lot of work I just noticed.
>>
File: bComfyUI_107575_.jpg (326 KB, 768x1024)
>>
>>102043696
you're making a Sam Hyde lora? kek
>>
>>102043696
it's fun, you get to be productively autistic
>>
>>102043093
>automate making the text descriptions for the images
>>102042364
>>
>>102043696
make a dataset for training a flawless image caption model and you'll only have to do it once
>>
>>102041914
huggingface-cli login
>>
File: FLUX_00074_.png (1.12 MB, 896x1152)
wahoo
>>
>>102043746
kinda looks like someone I know
I'd have sex with her, if you know what I mean
I mean my penis in her vagina, in and out, until I ejaculate
>>
>>102043717
kek good idea but actually no. its some other person.
>>102043725
interesting, gotta set that up ASAP, still gotta manually edit every description tho.
>>
am I using Joy Caption completely wrong or does changing the prompt have about zero effect on what it outputs?
>>
File: 00144-1886614642.png (1.68 MB, 832x1216)
>>
>>102043688
Lovely!! Thnx anon
>>
>>102043802
I too have noticed this phenomenon
I wanted to tweak it, to see if it could do my work for me, but no dice
>>
File: 00145-1886614643.png (1.44 MB, 832x1216)
>>102043819
No problem, I'd think up something more fun than 1girl but I'm pretty
>>
Delivery of bread, piping hot...
>>102043834
>>102043834
>>102043834
>>
>>102043817
>>102043830
These two are just what I was looking for. Couple more with the blonde if you don't mind?
>>
File: 00147-1886614645.png (1.6 MB, 832x1216)
>>102043830
But I'm pretty out of the zone at the moment.
>>
>>102043802
>>102043824
The default is descriptive mode, idk if the guy has finished adding the Training prompt mode yet, he said he was working on it (0-50-100?????) 3 days ago.
Try asking for a "Training caption of this image" instead, idk, I've not tested it, also he says the max tokens is 250 atm but flux will accept 512.
Pre-alpha atm, so will change.
>>
>>102041263
real
>>
>>102043512
it's there
>>
It's Friday. Time for 1girls.


