[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (1.15 MB, 3264x3264)
1.15 MB
1.15 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102242966

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/pol/uncensored+ai
>>
poop
>>
Blessed thread of frenship
>>
File: 2024-09-05_00019_.jpg (725 KB, 3840x2160)
725 KB
725 KB JPG
>>102247060
ty baker
>>
File: 1698929714553.jpg (332 KB, 1024x1024)
332 KB
332 KB JPG
>>
File: ComfyUI_Flux_12911.jpg (363 KB, 896x1152)
363 KB
363 KB JPG
>>
>>102247120
kick this kitten for $10000000000000000
>>
>>
File: 1722710794013.jpg (544 KB, 1024x1024)
544 KB
544 KB JPG
>>
>>102247141
noooooo!
>>
>>102246847
how did you prompt this? im having issues
>>
>>102247084
Want to see poop gens?
>>
>>102247161
canel
>>
>>102247179
yeah it's a shame, but it was better than the others.
>>
What's the Lora.FA training mode for? The GUIs describe it as a way to reduce memory usage by freezing one of two matrices in the model. But no matter how much I try it, memory remains unaffected, but the character reproducibility noticeably suffers and raising the weight burns the images much quicker than regular loras. Is it really just harmful with no upsides?
>>
>>102247178
no
>>
File: 2024-09-05_00408_.png (1.77 MB, 1280x720)
1.77 MB
1.77 MB PNG
>>
File: 01032-3937745515.jpg (704 KB, 1440x1920)
704 KB
704 KB JPG
gentlemen
>>
>>102247203
cool o.O
>>
>>102247195
Some settings are always useless or detrimental to the results.
>>
breasts?
>>
File: ComfyUI_Flux_12925.jpg (433 KB, 896x1152)
433 KB
433 KB JPG
>>
File: 2024-09-05_00413_.png (1.61 MB, 1280x720)
1.61 MB
1.61 MB PNG
>>102247218
ty
>>
I know this isn't the thread but I trust you chaps know your onions
what's the catch with hailuoai
>>
>>102247270
reminds me of le petit prince
>>
>>102247277
why would there be a catch, it's another AI company burning through money
>>
File: 1725546852773928.webm (2.43 MB, 1280x720)
2.43 MB
2.43 MB WEBM
>>102247277
there's no catch, it's a great video model and you don't need to make an account to generate videos, that's really cool
>>
File: 2024-09-05_00416_.png (1.54 MB, 1280x720)
1.54 MB
1.54 MB PNG
>>102247290
its Tapestry of Bayeux lora mixed with the Disgaea lora
>>
>>102247314
yeah but really tho
is it gonna ask me for a phone number, or to sign up to something
there's no such thing as a free meal, or compute
>>
>>102247327
no I'm serious, you just go to the site, you type a prompt, you enter generate and that's it, no account shit, no phone shit, nothing
>>
>>102247327
>>you don't need to make an account
can you read?
>>
>>102247327
it only asks for a phone number if you load the mobile version, your mobile browser can request the desktop version
>>
File: 2024-09-05_00418_.png (1.51 MB, 1280x720)
1.51 MB
1.51 MB PNG
>>
>>102243364
I'm sure it would nail the chin on the left.
>>
>>102247327
In the long term, no. But I had plenty of fun training for free during Leonardo's and Scenario's beta phases.
So make a use of it while it lasts.
>>
>>102247327
why so jaded and cynical? some people are just nice and let you have massive compute power for free, no strings attached
>>
File: 01063-3937745517.jpg (561 KB, 1440x1920)
561 KB
561 KB JPG
>>
File: 00134-2528571729.png (2.04 MB, 1024x1440)
2.04 MB
2.04 MB PNG
>>
File: ComfyUI_Flux_12931.jpg (484 KB, 896x1152)
484 KB
484 KB JPG
>>
File: 00061-55412642.png (2.32 MB, 896x1152)
2.32 MB
2.32 MB PNG
>>
>>102247406
This is impressive. LoRA? Finetune? PROMPT??
>>
File: workflow.jpg (512 KB, 2226x1189)
512 KB
512 KB JPG
i'm pretty happy with my workflow, but any tips on improving it?
i'm a VRAMlet
>>
>>102247341
>>
>>102247420
go for automaticCFG instead of Dynamic Thresholding, it burns the image less + gives you better prompt understanding
https://reddit.com/r/StableDiffusion/comments/1eza71h/four_methods_to_run_flux_at_cfg_1/
>>
>>102247417
https://civitai.com/models/721039/retro-anime-flux-style
>>
>>102247420
if you go for cfg > 1, use adaptive guider, that will make your gens faster
https://reddit.com/r/StableDiffusion/comments/1enxcek/improve_the_inference_speed_by_25_at_cfg_1_for/
>>
File: 1703456076516.jpg (624 KB, 1024x1024)
624 KB
624 KB JPG
>>102247439
>There are still 91 people ahead, so the wait is expected to be 5 minutes.
>>
>>102247450
>18k steps
I experienced this with my own attempts at loras, but it seems Flux requires higher amounts of steps than other models?
>>
File: dd_00013_.png (1.47 MB, 1024x1024)
1.47 MB
1.47 MB PNG
>>102247465
>>102247447
thanks!
>>
>>102247450
>1gb lora
fucking based
>>
File: file.png (2.03 MB, 1024x1024)
2.03 MB
2.03 MB PNG
>>102247513
>>
>>102247417
prompt was this
https://fluxpro.art/prompts/cm0p10b5g0co111m70q6rk0pw
>>
>>102247277
>what's the catch with hailuoai
Absolutely nothing, it's a great model, now I hope that BFL will release something at that level
>>
File: ComfyUI_Flux_12945.jpg (432 KB, 896x1152)
432 KB
432 KB JPG
>>
File: 01104-3937745514.jpg (581 KB, 1440x1920)
581 KB
581 KB JPG
>>
>>102247612
so I can just gen shit all day every day for free, maybe even write a script to gen random shit 24/7
I wouldn't, but some cunt would, and this is why we can't have nice things
>>
>>102247685
it sure won't last anon, they are doing this free shit as a giant advertisement, and once people are convinced by the quality, they'll have enough fame to make it pay for it, and that's completely fair lol, my advise would be to have some fun with it before it ends, you won't get this chance a second time
>>
File: 01105-3937745514.jpg (744 KB, 1344x1728)
744 KB
744 KB JPG
>>
File: out.webm (226 KB, 1280x720)
226 KB
226 KB WEBM
asdf
>>
>>102247835
that's impressive how realistic it is, the chinks really stepped up their game there
>>
>>102247706
You can do it locally

https://huggingface.co/THUDM/CogVideoX-5b
>>
File: ComfyUI_Flux_12965.jpg (483 KB, 832x1216)
483 KB
483 KB JPG
>>
>>102247903
:^)
>>
>>102247900
I can't webm, the mp4 is cleaner
>>
File: ComfyUI_Flux_12969.jpg (410 KB, 832x1216)
410 KB
410 KB JPG
>>
>>102247921
you need to change your settings to get higher quality
>>
https://github.com/PowerHouseMan/ComfyUI-AdvancedLivePortrait
>>
File: 00131-3453757705.png (968 KB, 832x1216)
968 KB
968 KB PNG
>>
>>102247612
kek, we definitely need a video AI thread now
>>
File: ComfyUI_Flux_12983.jpg (416 KB, 832x1216)
416 KB
416 KB JPG
>>
File: 01118-3320843819.jpg (808 KB, 1440x1080)
808 KB
808 KB JPG
>>
>>102248147
Damn, OpenAI really shoot themselves in the foot by not making Sora public sooner, now no one will care if they do it at the end, it's too late
>>
File: ComfyUI_00251_.jpg (863 KB, 1616x1184)
863 KB
863 KB JPG
>>
File: ComfyUI_Flux_12989.jpg (397 KB, 832x1216)
397 KB
397 KB JPG
>>
>>102247205
my sweet
>>
File: 01119-4220096088.jpg (845 KB, 1440x1080)
845 KB
845 KB JPG
>>
https://huggingface.co/datasets/bigdata-pw/TheSimpsons
frames from every episode and 3 florence-2-large captions, caption, detailed_caption and more_detailed_caption
unsurprisingly the captions are all shit
>>
File: The-Simpsons-S01E01-1241.jpg (120 KB, 1920x1072)
120 KB
120 KB JPG
>A couple of people that are standing next to each other.
>The image shows Homer Simpson and Marge Simpson from The Simpsons wearing Santa Claus outfits, standing in front of a backdrop of light poles and a starry night sky.
>The image is a still from the animated TV show, The Simpsons. It shows two characters, Homer Simpson and Marge Simpson, standing side by side in front of a blue background with white stars. Homer is wearing a red and white striped suit with a black belt and a black hat with a white pom-pom on top. Marge is also wearing a blue hoodie and a red Santa hat with white fur trim. They are both looking at Homer with a surprised expression on their faces.
>Homer Simpson and Marge Simpson
>>
File: 01122-1337927527.jpg (1.15 MB, 1110x1670)
1.15 MB
1.15 MB JPG
>>102248293
>florence-2-large
you used the base version, not finetune?
>>
File: ComfyUI_00003_.png (1.38 MB, 1024x1024)
1.38 MB
1.38 MB PNG
>>
>>102248346
you haven't tried the finetune versions, have you?
>>
>>102248375
oh are they worse?
>>
File: ComfyUI_Flux_13005.jpg (219 KB, 832x1216)
219 KB
219 KB JPG
>>102248331
>>
File: ComfyUI_Flux_13001.jpg (355 KB, 832x1216)
355 KB
355 KB JPG
>>
>>102248380
>oh are they worse?
they aren't good that's for sure >>102245900
>>
>>102248331
flux does img2img, and it's pretty good
>>
File: file.png (426 KB, 1496x602)
426 KB
426 KB PNG
>>102248380
yes
>This is an animated image. In this image we can see two persons standing. In the background there are street lights and sky.
>>
>>102248417
meant for >0000000000
>>
File: ComfyUI_00257_.jpg (959 KB, 1536x1584)
959 KB
959 KB JPG
>>
File: ComfyUI_Flux_13017.jpg (299 KB, 832x1216)
299 KB
299 KB JPG
>>
>>102248418
Yeah my bad it indeed is shit. Didn't even realize
>>
File: ComfyUI_33380_.png (628 KB, 1024x1024)
628 KB
628 KB PNG
>>
>>102247406
berserk, saint seiya, masterpiece
>>
File: ComfyUI_33381_.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
>>
well, I finally did it. I quit testing, nitpicking, rebaking, procrastinating and posted my first flux lora to shitvitai
feels kinda good desu
>>
File: ComfyUI_00263_.jpg (945 KB, 1536x1584)
945 KB
945 KB JPG
>>
>>102248858
link?
>>
File: 1715597150979.webm (2.17 MB, 1280x720)
2.17 MB
2.17 MB WEBM
>>
File: MarkuryFLUX_00022_.png (1.57 MB, 832x1216)
1.57 MB
1.57 MB PNG
>have sudden urge to look up a massive slut I knew from high school
>she took my virginity in 9th grade
>tfw she's been through 3 divorces by the age of 28, has 4 kids, and lives in some hick town in Missouri now

heh, guess life isn't so bad after all.
>>
>>102248944
Can it do anything besides dancing
>>
>>102248959
cool story didnt happen tho your life still sucks buddy
>>
>>102248962
check the pol thread or the hailuoai homepage
>>
File: MarkuryFLUX_00023_.png (1.64 MB, 832x1216)
1.64 MB
1.64 MB PNG
>>102248993
Don't care if you believe me, it has not impact on anything. It's just funny to how how she ended up.
>>
>>102249045
and that made you come to /ldg/ to tell us all?
>>
>>102248887
err.. please ignore my general faggotry..
https://civitai.com/models/724454
>>
>>102248995
it can do a lot of funny shit, you have to watch them on /pol/ though >>>/pol/480733043
>>
File: MarkuryFLUX_00024_.png (1.51 MB, 832x1216)
1.51 MB
1.51 MB PNG
>>102249052
I was posting images here throughout the day, and posted it along with an image because i thought it was funny. Seems to have made you really butthurt for some reason though.
>>
>>102249045
its obvious you're new to larping
>>
>>102249089
nobody asked bro
>>
>>102249089
yea but why would we care if a fake slut that you pretend took your virginity is divorced and has kids
>>
File: 00043-2200544077.jpg (763 KB, 1296x1728)
763 KB
763 KB JPG
>>
>>102249092
>>102249096
>>102249098
Oh, I get it... my mistake /ldg/ was definitely the wrong place to post this. It's full of angsty virgins.
>>
File: 1725563976408507.webm (606 KB, 1280x720)
606 KB
606 KB WEBM
This is insane, Holywood is dead
>>
>>102249118
stop projecting your sadness of still being a virgin onto is my guy
>>
>>102249121
it's still gacha, Hollywood dies when you can do img2img keyframes, however.
>>
>>102249132
It's funny because that wasn't even the main point of the story, seems to be what ticked you all off lmao
>>
File: ComfyUI_00068_.png (1.43 MB, 1072x1152)
1.43 MB
1.43 MB PNG
>>102249144
RAWWWWRRRRR..... make me angry again. I . Dare. You.
>>
File: 1725573858402275.webm (1.45 MB, 1280x720)
1.45 MB
1.45 MB WEBM
>>102249121
lmao, this shit is good
>>
People are still using the base model, right? I'm waiting for checkpoints.
>see checkpoint on civitai
>look inside
>it's just 33 loras merged with the base model
god dammit
>>
>>102249177
>People are still using the base model, right? I'm waiting for checkpoints.
no one made a real finetune of flux yet, it's asking for more than 24 gb of vram
>>
What to use from here?

https://huggingface.co/zer0int/CLIP-GmP-ViT-L-14/tree/main
>>
>>102249189
Gotcha. And I'm checking the right place, right? The entire civit page was just garbage.
>>
>>102249189
Impossible really to finetune it because none of the trainers even properly support it. Let's say you were willing to rent the servers required to train Flux, no trainer supports it out of the box.
>>
>>102249206
there's only 2 places to look at and it's civitai and huggingface yeah
>>
>>102249059
How did you upload so many images one by one to that lora?
>>
File: 27938931.png (889 KB, 1024x768)
889 KB
889 KB PNG
this shit is insane
>>
>>102249201
that one is the best
https://huggingface.co/zer0int/CLIP-GmP-ViT-L-14/blob/main/ViT-L-14-BEST-smooth-GmP-TE-only-HF-format.safetensors
>>
File: 00037-1858885819.jpg (255 KB, 1120x1440)
255 KB
255 KB JPG
which pytorch version is everybody using on comfy? I just realized im still using 2.3.0, also read that 2.4.0 sucks ass, should i upgrade to 2.5.0, im on winblows btw
>>
File: ComfyUI_33387_.png (1.24 MB, 848x1024)
1.24 MB
1.24 MB PNG
>>
>>102249248
Why TE only? And what's HF
>>
>>102249253
im using 2.1.0
>>
File: 00059-2200544078.png (2.49 MB, 1296x1728)
2.49 MB
2.49 MB PNG
>>
>>102249259
>Why TE only?
because we're only using the text encoder (TE), not the text decoder

>And what's HF
dunno what that means either
>>
>>102249238
painfully
>>
File: file.png (3.45 MB, 3185x1612)
3.45 MB
3.45 MB PNG
>>102249253
>which pytorch version is everybody using on comfy?
2.3.1

>>102249277
>im using 2.1.0
wtf that's an old one, why?
>>
>>102249297
no clue lmao i just never update it ima update to 2.4
>>
>>102249253
nice style, are you using a lora for that one?
>>
File: 1720160782820.jpg (380 KB, 1024x1024)
380 KB
380 KB JPG
>>
>>102249238
>>102249284
jokes aside, I just copy+pasted the upload URL and then did them one by one. you could probably get chatgpt to write you a quick script where it automates this process pretty easily using selenium webdriver or something desu
>>
>>102249284
Why even do that lmao, press create - post images and batch upload

>>102249280
Isn't flux using both?
>>
>>102249303
2.4 sucks, it gives fucked up pictures, 2.5.0 seems to have fixed it though, and it's faster too
>>
>>102249321
how do I get 2.5.0 all it seems i can get is 2.4.1
>>
>>102249319
>Isn't flux using both?
no diffusion models use the text decoder, it's for LLMs only
>>
>>102249319
>Why even do that lmao, press create - post images and batch upload
I didn't see a batch upload option lmao fuck me. I thought it was either post one image, or post 20 at once that go into some kind of mini album which I don't really like
>>
>>102249280
>not the text decoder
there is no text decoder in CLIP, but there is an image decoder which we don't use thus the file is Text Encoder only
>dunno what that means either
HuggingFace format, the tensors inside the file have a different layout and name. although I don't think it's an actual standard HF has created
>>
>>102249335
2.5.0 is the nightly version, you can do this:
1) Go on the ComfyUI_windows_portable\update folder
2) use this cmd command:
..\python_embeded\python.exe -s -m pip install --upgrade --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/cu121
>>
>>102249346
*an image encoder
I meant to write.
>>
>>102249336
I saw most people load this one

ViT-L-14-BEST-smooth-GmP-ft.safetensors
>>
>>102249335
do this but change
>https://download.pytorch.org/whl/nightly/cu121
to
>https://download.pytorch.org/whl/nightly/cu124
torch 2.5.0 works better with cuda 12.4
>>
>>102249374
that's the same thing, it's the bigger version that has some shit Flux will never use
>>
>>102249386
>torch 2.5.0 works better with cuda 12.4
oh yeah? like better quality images?
>>
File: fs_0075.jpg (328 KB, 1536x1536)
328 KB
328 KB JPG
>>
>>102249407
Oh, is it already time for your shift, Sergeant Johnson?
>>
File: ComfyUI_00091_.jpg (541 KB, 1792x2304)
541 KB
541 KB JPG
>>
>>102249400
nta and I haven't tested 1:1 myself, but someone posted some comparisons the other day and 2.5.0 with 12.4 seemed to be better quality. it was only like 3 different prompts iirc, but the difference was pretty noticeable to me. no idea if it changes performance speed at all between the 121 vs 124
>>
File: 00026-2490054030.jpg (245 KB, 1120x1440)
245 KB
245 KB JPG
>>102249307
yeah, i used <lora:anime_lora_comfy_converted:1> for that gen, I totally forgot about that lora lol


>>102249297
thanks, can't update for now because my ISP is down and im connecting thru my cellphone with shitty speeds T_T
>>
>>102249045
holy esl SAAR DO NOT REDEEM
>>
File: cu124vcu121.jpg (452 KB, 2060x1038)
452 KB
452 KB JPG
>>102249400
yes, better colors, and slight details are better, pic related, looks for details like that she is smiling, the pearl necklace and the TV details
>>
>fast delete of the glowie
based jannies
>>
>>102249500
oh cool, do you have other image comparisons like that? I wanna know if it's still a downgrade compared to torch 2.3.1 + cu121 or not
>>
>>102249459
><lora:anime_lora_comfy_converted:1>
what's that? can you provide a civitai link? I really dig that style
>>
>>102249526
not with 2.3.1 .. can't switch versions now and test, training a lora sorry .. but I am overall happy with 2.5.0+cu124 .. anatomy is correct and pretty much behaves like 2.3.1
>>
>>102249606
i'm on 2.5.0+cu124 as well and havent had any issues (3090ti)
>>
File: ComfyUI_33400_.png (577 KB, 768x1024)
577 KB
577 KB PNG
>>
>>102249687
leave some women for the rest of us
>>
>>102249390
So I did some testing

>So in order of best to worst of your clips, it's:

ViT-L-14-TEXT-detail-improved-hiT-GmP-TE-only-HF.safetensors

ViT-L-14-BEST-smooth-GmP-TE-only-HF-format.safetensors

ViT-L-14-GmP-ft-TE-only-HF-format.safetensors

Right?

I was using 3 anyway and found it a slight improvement over base CLIP, and in certain usecases a big improvement, so I'm keen to get the time to test 1 and 2

>That's how I would rate it, yes. 1. and 2. are about on par with regard to benchmarks (accuracy on zeroshot, for example). 1. is objectively better at text, over all. The rest is a bit of a subjective thing, but - yes, this would be my ranking. Albeit 2 can sometimes generate superior detail (non-text detail). It really depends on what you're prompting.

I prefer the smooth one, it adds more small details to the image, TEXT makes it more slop
>>
Is it just me, or are all the workflows on the sharing sites just dogshit?

I tried using ComfyUI launcher, and importing workflows through that, and even they either fail to run.
>>
>>102249716
I'm not a big fan of "detail improved", yeah it improves the text, but for the rest it just makes it worse than smooth
>>
>>102249721
They work fine, you need to go to manager and install missing custom nodes
>>
File: ComfyUI_00352_.jpg (104 KB, 976x1208)
104 KB
104 KB JPG
>>102249253
That is really nice lora aside do you have a catbox?
I promise to keep it sfw this time so the jannies don't get into a tizzy.
>>
I won't care about their text to video until they support image to video.
>>
>>102249828
see vidu
https://www.vidu.studio/
>>
>>102249828
>>102249836
or simply Luma
>>
>>102249836
But why are people posting videos from hailuo instead of vidu.studio? Are they any good?
>>
>>102249871
text to video is so much better
>>
>>102249847
I had to stop watching Luma videos because of the morphing, I started having dreams where I was watching videos that morphed like that and realized I must stop until there are good video generators out there.
>>
>>102249871
hailuo is probably the best one so far, maybe a bit behind Sora but Sora is DOA so...
>>102247314
>>102247612
>>102248147
>>102248408
>>102249121
>>
>>102249538
https://huggingface.co/XLabs-AI/flux-lora-collection/tree/main
>>
File: ComfyUI_00101_.png (2.63 MB, 1440x1920)
2.63 MB
2.63 MB PNG
>>
>>102249747
Yah, this is /g/ I'm not foolish enough to have done that already but fair. I keep getting weird issues like random seeds not working in Ksampler, even though I've obviously clicked the random button.

Importing Via image alone also yields strange results like 'NoneType' object has no attribute 'lower' efficency node
>>
File: ComfyUI_00020_.png (1.61 MB, 1024x1024)
1.61 MB
1.61 MB PNG
>>
File: ComfyUI_Flux_13143.jpg (406 KB, 832x1216)
406 KB
406 KB JPG
>>
Wait a minute, Germany beat the rest of the world in AI image generation? How did that happen?
>>
>>102250036
I don't want to simplify it too much, but it's really not that hard to make an image model. What's shocking is there's not more millionaire bankrolling models. You can legit make a decent 2B model for like $50k right now.
>>
>>102250036
a German invented latent diffusion
>>
>>102250005
Post the workflow
>>
>>102250036
German science is the best in the world.
>>
File: 15511531223.jpg (2.25 MB, 5184x3456)
2.25 MB
2.25 MB JPG
>>102250036
Not just AI image gen, this is not surprising desu.
>>
>>102250036
>>102250063
Isn't twitter using flux? Maybe Elon funded it
>>
File: ComfyUI_Flux_13155.jpg (409 KB, 832x1216)
409 KB
409 KB JPG
>>
File: ComfyUI_03136_.png (1.43 MB, 1024x1024)
1.43 MB
1.43 MB PNG
>>
File: 000000_17329_.png (1.93 MB, 1032x1508)
1.93 MB
1.93 MB PNG
>>
File: ComfyUI_Flux_13163.jpg (340 KB, 832x1216)
340 KB
340 KB JPG
>>
File: Untitled.png (36 KB, 1020x404)
36 KB
36 KB PNG
what does this do? can i preview it?
>>
File: ComfyUI_03140_.png (1008 KB, 1024x1024)
1008 KB
1008 KB PNG
>>
>>102250082
Nope, Elon just saw how good it was and how he couldn't make porn or nudity with it so it was safe to allow people to use it on twitter, and it was absorbed by Grok.
It was never disclosed how this happened but he probably made a deal with BFL that was profitable for both, this was the stolen we dream of Stability AI, "X's image generator is powered by Stable Diffusion" would have been incredible for them.
Fuck, Elon bought twitter, he could probably buy BFL too.
>>
>>102250168
Too bad SAI is now being used to AWS
>>
File: ComfyUI_03144_.png (1.38 MB, 1024x1024)
1.38 MB
1.38 MB PNG
Just found out that the bogdanoff LoRA I made is pretty good at bogging known characters,
>>
File: ComfyUI_03146_.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>
File: ComfyUI_33409_.png (633 KB, 768x1024)
633 KB
633 KB PNG
>>
File: 00105-1718306478.png (1021 KB, 832x1216)
1021 KB
1021 KB PNG
>>
>>102247205
yeah I'm gonna need uhhhhhhhhhh prompt and loras if any please
>>
File: 00009-4037424332.jpg (422 KB, 1664x2432)
422 KB
422 KB JPG
>>
I think the funny part of all this is the hentai spammer is going to force AI censorship but not in the way he thinks. The day of the AI janny comes ever closer.
>>
File: ComfyUI_03155_.png (1.66 MB, 1024x1024)
1.66 MB
1.66 MB PNG
>>
updated forge and it apparently no longer supports SDV so I'm looking for a new UI. probably also going to learn flux. is comfy it or is there something else going on these days?
>>
File: 0.jpg (451 KB, 1024x1024)
451 KB
451 KB JPG
>>
>>102250375
Flux on comfy
>>
File: ComfyUI_03160_.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>
File: ComfyUI_03163_.png (1.35 MB, 1024x1024)
1.35 MB
1.35 MB PNG
You ever seen a muppet get bogged?
>>
File: ComfyUI_Flux_13225.jpg (380 KB, 832x1216)
380 KB
380 KB JPG
>>
>>102250405
>>102250428
KEK
>>
File: ComfyUI_00114_.png (3.3 MB, 1728x2304)
3.3 MB
3.3 MB PNG
>>
>>102248995
No
>>
File: 00163-1009193522.png (1.55 MB, 720x960)
1.55 MB
1.55 MB PNG
>>
File: ComfyUI_03168_.png (1.29 MB, 1024x1024)
1.29 MB
1.29 MB PNG
https://gofile.io/d/6eKSIo

Here is the bogged LoRA. activation phrase is "igor bogdanoff"
>>
>>102250537
based
>>
File: 935.jpg (286 KB, 1024x1024)
286 KB
286 KB JPG
>>102250396

*thanks anon
>>
File: 0.png (1.55 MB, 1024x1024)
1.55 MB
1.55 MB PNG
>>
>>102250624
What lora is this?
>>
File: 1715528560627.jpg (398 KB, 1024x1024)
398 KB
398 KB JPG
>>
File: 0.jpg (144 KB, 1024x1024)
144 KB
144 KB JPG
>>
File: 1705851840942.jpg (553 KB, 1024x1024)
553 KB
553 KB JPG
>>
>>102250747
The speech bubble over the top is such a jarring break from the overall aesthetic that it borders on parody and loops back around to being funny.
>>
>>102250649

that's a raw bing output, sorry anon
>>
File: comfyhui.png (24 KB, 1355x1005)
24 KB
24 KB PNG
has comfyui become bloatware? I remember you opened up a new window, it instantly loaded, now it takes time to load up a new canvas
>>
>>102250851
Like half of the custom nodes in comfy are spyware.
>>
File: 1712120385469778.jpg (2.05 MB, 1224x2144)
2.05 MB
2.05 MB JPG
I will NOT post in the thread
>>
File: 0.jpg (212 KB, 1024x1024)
212 KB
212 KB JPG
>>
>>102250855
How?
>>
>>102250855
that's a pretty big accusation to swing around anon
>>
>>102250893
>>102250878
>Hey guys just download this node to use an LLM in comfy!
>Just plug it in there and don't even look at what it's doing
>>
>>102250893
not much else to do when other UIs fail to support the latest developments.
>>
File: ComfyUI_00126_.png (3.01 MB, 1440x1920)
3.01 MB
3.01 MB PNG
>>
After training a few LoRAs using grids as training images, I'm convinced it's actually a pretty good method. It functions similarly to batching, but actually runs faster and lets you train in 1024x1024
>>
>>102250909
kek that was deb* fault, he listed that shit in his news for a whole month, that idiot never tested anything
>>
File: ComfyUI_00131_.png (3.11 MB, 1440x1920)
3.11 MB
3.11 MB PNG
>>
>>102250920
>>102251057
YAWN
1 GIRL
YAWN
>>
>>102250933
>lets you train in 1024x1024

How would the VRAM requirement lower when you use grid images?
>>
>>102251057

it was much better when you were generating stuff like this >>102247205
you should at least go back to posting that if you're refusing to share prompt/lora for it
>>
>>102251096
It doesn't lower the requirements, but 1024 at a batch size of 1 gives you arguably better and faster results than a batch size of 4 at 512
>>
https://imgsli.com/Mjk0NTI1
Any guess for what LoRA I'm trying to train now?
>>
File: ComfyUI_00134_.png (2.82 MB, 1440x1920)
2.82 MB
2.82 MB PNG
>>102251115
That's not me, genius. He even has an auto/forge filename.
>>
File: 1725584237.png (1.4 MB, 1280x736)
1.4 MB
1.4 MB PNG
>>
>>102251150
nta but your shit looks like 1.5 slop
you could generate exactly the same stuff using way less compute and time
>>
What graphics card is good for local image gen that doesn't cost an arm and a leg
>>
>>102251163
probably a good roof desu
>>
File: ComfyUI_00139_.jpg (662 KB, 2064x2304)
662 KB
662 KB JPG
>>102251180
incredible insight, nogen
>>
>>102251297

a 4060 is practically mandatory for local gens at this point
you can get by with a 3060 for now but it won't stay that way for much longer
>>
>>102251362
NTA but >>102251180 is right. I feel uninspired just looking at your gens.
What are you even bringing to the table? I feel less confident in flux just by looking at your work.
>>
>>102251381
NTA but nogen opinions are worth less than debo replies
>>
>>102251120
When you was training at 512, did you use bucketing or manually resized your images to 512x512 squares? I wonder if the results are better because of the square training data instead of buckets that keep the original aspect ratio as long as the amount of pixels is less or equal to 262144 (512*512).
>>
File: Untitled.png (19 KB, 1733x174)
19 KB
19 KB PNG
>>102251390
>>
>>102251163
Needs some badgers.
>>
>>102251413
I rely on bucketing when doing 512, but slapping the images into a roughly grid shaped collages at 1024 is my go to method these days.
>>
>>102251416
nta but you radiate schizo-anon energy, >>102251362 has some nice upscaling quality, catbox?
>>
>>102251462
I am that anon and I think I'm justified in in criticizing boring 1 girl posts. The LoRA clearly is overtrained too looking at her nonsensical clothing.
>>
File: 1725586155.png (1.43 MB, 1280x736)
1.43 MB
1.43 MB PNG
>>
>nogens complaining about 1girl booba
pathetic
>>
>>102251485
you just sound salty that you can't upscale that well imo
>>
>>102251495
But where's the SNAAAAAAAAAAAAAKE?
>>
File: ComfyUI_00140.jpg (320 KB, 1280x1856)
320 KB
320 KB JPG
>>
File: 1725586356.png (1.32 MB, 1280x736)
1.32 MB
1.32 MB PNG
>>102251508
>>
File: ComfyUI_00144_.png (3.28 MB, 1720x1920)
3.28 MB
3.28 MB PNG
>>
>>102251533
Perfect!
>>
>>102251571
thanks
>>
https://civitai.com/models/714022/neonfantasyflux-style-lora?modelVersionId=798521
>>
https://civitai.com/models/715731/the-sims-1-style-f1d
>>
File: 0.jpg (72 KB, 1024x1024)
72 KB
72 KB JPG
>>
>>102251807
>.webm
Nope
>>
>nogen
Excuse me that's text-fag to you, anon
>>
File: ComfyUI_03201_.png (1.11 MB, 1024x1248)
1.11 MB
1.11 MB PNG
>>
File: ComfyUI_03202_.png (1.31 MB, 1024x1248)
1.31 MB
1.31 MB PNG
>>
What was that site to use for image to text prompt?
>>
File: Ultra Copium.png (970 KB, 1022x500)
970 KB
970 KB PNG
>>102247060
>Disinfo-Copium-Machine go BRRRRRRRR

What a silly thing to lie about lol

https://x.com/halphelt/status/1831316915551137918?t=Q7enYETzZ5jvJzeufY6TIA&s=19
>>
File: ComfyUI_03203_.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
>>102252194
This one?
>>
>>102249117
ooh
>>
File: ComfyUI_03205_.png (1.07 MB, 1280x768)
1.07 MB
1.07 MB PNG
>>102252261
https://huggingface.co/spaces/fancyfeast/joy-caption-pre-alpha

Forgot the link lol
>>
>>102250073
see attached
https://files.catbox.moe/21emc2.json
>>
>>102251839
Nice
>>
File: ComfyUI_03206_.png (1.09 MB, 1280x768)
1.09 MB
1.09 MB PNG
>>
>>102252321
forgot to say what the problem with this one is, the seed doesn't' randomize, and the Apply RUEnet is busted.
>>
File: ComfyUI_00148_.png (3.49 MB, 1720x1920)
3.49 MB
3.49 MB PNG
>>
File: 1705181905571.jpg (355 KB, 1024x1024)
355 KB
355 KB JPG
>>
>>102252260
That user is lying to discourage other artists from adopting AI tools, but he's actually using it himself.
>>
>>102252446

not bad
not bad at all
>>
File: ComfyUI_03212_.png (1.12 MB, 1280x768)
1.12 MB
1.12 MB PNG
>>
File: 1720584830485.jpg (418 KB, 1024x1024)
418 KB
418 KB JPG
vaguely inspired by kafka
>>
>>102252490
>>102252446
Why your images look like the chanel ones? Are you using the same lora?
>>
File: ComfyUI_03216_.png (1.15 MB, 1280x768)
1.15 MB
1.15 MB PNG
>>
>>102252503
just prompt, but there's nothing in common with the chanel prompts.
>>
File: 1699835321640.webm (1.12 MB, 1280x720)
1.12 MB
1.12 MB WEBM
hailuoai does not understand how typewriters work
>>
>>102252588
LOCAL diffusion general
>>
>>102252449
>but he's actually using it himself.
Citation needed
>>
>>102252588
it doesn't understand anything though
>>
I'm convinced that Dynamic Thresholding, AutomaticCFG, Skimmed CFG, and Adaptative Guidance are all a scam.
>>
File: ComfyUI_33408_.png (1.11 MB, 768x1024)
1.11 MB
1.11 MB PNG
>>
>>102252652
This
>>
File: 1711751687895.webm (1.55 MB, 1280x720)
1.55 MB
1.55 MB WEBM
once more with feeling
>>
File: scamcfg.png (52 KB, 901x427)
52 KB
52 KB PNG
>>102252652
Dynamic Thresholding isn't a scam, you just gotta read how to use it

AutomaticCFG and SkimmedCFG however are made by the same autistic dev who doesn't even detail anything how his stuff works, he just puts a vague description and some example without workflow or details, its just says "recommended, just trust me bro"
>>
fine I'll be the one to make the 300th post then
>>
>>102252652
How rude, they're just as effective as dev+schnell merges.
>>
>>102253182
>There's a full time Janny dedicated to watching this thread.
The absolute state of this place.
>>
Let's get some fresh bread up in here...
>>102253191
>>102253191
>>102253191
>>
>>102250036
undertraining
>>
agreed
>>
well of course, thanks
>>
really
>>
>>102253206
nice img
>>
>>102253130
thank you for your service
>>
hit it
>>
>>102253014
>AutomaticCFG and SkimmedCFG however are made by the same autistic dev who doesn't even detail anything how his stuff works
who gives a fuck? at the end his anti burner works better than dynamic thresholding
https://reddit.com/r/StableDiffusion/comments/1eza71h/four_methods_to_run_flux_at_cfg_1/



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.