/g/ - Technology


File: 1723932132744856.png (122 KB, 259x543)
Discussion of free and open source text-to-image models

Previous /ldg/ bread: >>101943936

>Beginner UI
EasyDiffusion: easydiffusion.github.io
Fooocus: github.com/lllyasviel/fooocus
Metastable: metastable.studio

>Advanced UI
Automatic1111: github.com/automatic1111/stable-diffusion-webui
ComfyUI: github.com/comfyanonymous/ComfyUI
Forge: github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: github.com/invoke-ai/InvokeAI
SD.Next: github.com/vladmandic/automatic
SwarmUI: github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
rentry.org/sdvae

>Model Ranking
imgsys.org/rankings

>Models, LoRAs & training
civitai.com
huggingface.co
aitracker.art
github.com/Nerogar/OneTrainer
github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
rentry.org/sdg-link
rentry.org/rentrysd

>GPU performance
vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: www.mage.space
img2img: huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
rentry.org/debo

>Related boards
>>>/g/sdg
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
File: delux_flebo_00035_.png (2.34 MB, 1152x1536)
>mfw
>>
File: ComfyUI_02741_.png (1.32 MB, 1024x1024)
>>101948121
>>
lmao thank you for using my horse meme image for the OP

>>101948103
fffuck, thanks anyway. Guess ill recreate it from scratch.
>>
What were you thinking?
>>
>>101948149
debo no!
>>
>>101948149
advertiser-sama don't look
>>
File: ComfyUI_00402_.png (1.45 MB, 1024x1024)
>>
File: 1722453528228895.jpg (132 KB, 768x1024)
>>101948149
>>
>>101948171
Why did Joe spread a bunch of icing on her body and face
>>
>>101948149
>try the penis lora prompting
why tho ani?
>>
fluxpro.art finally added seeds. Now I can test my gens against pro properly.
>>
>>101948184
prompt?
>>
File: 2024-08-18T102153.176.jpg (205 KB, 1024x1024)
>>101948149
>>
>>101948220
im sorry hombre that's a real image
>>
>>101948093
I noticed it the first time the other day doing Chika Komari gens. Noche's LORA would not ever get the ribbons right but Ibukimakisiko's LORA got them right surprisingly often.
>>
>>101948213
If you set safety to explicit, will it generate tiger balls?
>>
File: ComfyUI_Flux_9539.jpg (276 KB, 768x1344)
>>
File: ComfyUI_06374_.png (1.42 MB, 1200x768)
When do you guys think they will release their text to video model?
>>
We need new models like XL, sigma and black forest products every week.
>>
>>101948261
GOOD MORNING SIR
>>
>>101948234
baka art niggers always trying to pass of their real work as AI. put down the pencil and start prompting faggot
>>
>>101948261
Alright you start
>>
>>101948235
Im noticing the only good Stocking Anarchy lora on Civitai has a really hard time generating her iconic dress with the hair ribbon half the time, really considered just training it myself but that shit on 10 epochs will take almost 10 god damn hours..
fug, maybe i can use my civitai good boy bucks but im not sure...
>>
File: 0.jpg (221 KB, 1024x1024)
>>
File: 1719584345966348.png (97 KB, 1550x831)
umm, bros??????
>>
>>101948251
It will generate whatever it is capable of generating.
Unfortunately I don't think tiger balls are in the dataset.
>>
>open new bread
>see some chubby hairy dude's dick
gays need to be removed
>>
File: bogged matrix.jpg (17 KB, 200x232)
>>101948284
>He pulled?
>>
>>101948283
kino
>>
>>101948149
'ick on 'eck
>>
>>101948284
What's your prompt?
>>
>>101948311
can we have a Bogdanoff Lora?
>>
File: 1712422899478.png (359 KB, 669x669)
>>101948234
>>
File: ComfyUI_Flux_9545.jpg (256 KB, 768x1344)
>>
File: ComfyUI_00403_.png (1.44 MB, 1024x1024)
>>101948185
dunno
>>
>>101948335
wish granted
https://civitai.com/models/150094/bogdanoff-twins
>>
>>101948311
>>101948332
not a gen, just trying to train a lora
seems to be something caption-related, more specifically related to special characters (0xe7 being "ç")
do i REALLY have to manually clean all my thousands of auto-captioned pics of words like "façade" now?
>>
File: file.png (1.26 MB, 1024x1024)
>>101948149
>>
>>101948284
I'm having a shit load of unicode errors trying to run that too. I just gave up because I can't be fucked.
>>
>>101948373
>do i REALLY have to manually clean all my thousands of auto-captioned pics of words like "façade" now?
Write a script to convert to ASCII, not unicode or UTF8.
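
A minimal sketch of such a script, assuming the captions are .txt files in one folder and that the stray bytes are cp1252/latin-1 (where 0xe7 is "ç"). The function names and the folder argument are made up for illustration, and it overwrites files in place, so back up the dataset first:

```python
import unicodedata
from pathlib import Path


def to_ascii(text: str) -> str:
    """Decompose accented characters, then drop anything that is not ASCII."""
    decomposed = unicodedata.normalize("NFKD", text)
    return decomposed.encode("ascii", "ignore").decode("ascii")


def read_caption(path: Path) -> str:
    """Read a caption as UTF-8, falling back to cp1252 for stray bytes like 0xe7."""
    raw = path.read_bytes()
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        return raw.decode("cp1252", errors="replace")


def clean_captions(folder: str) -> int:
    """Rewrite every .txt caption in `folder` as plain ASCII; return files changed."""
    changed = 0
    for path in Path(folder).glob("*.txt"):
        original = read_caption(path)
        cleaned = to_ascii(original)
        if cleaned != original:
            path.write_text(cleaned, encoding="ascii")
            changed += 1
    return changed


# Example call (hypothetical folder name):
# clean_captions("dataset/captions")
```

This turns "façade" into "facade". Characters with no ASCII lookalike are simply dropped, so spot-check a few captions afterwards.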
>>
>>101948360
need it for Flux tho
>>
>>101948410
train it on civitai then
>>
>>101948376
Decided to try it again and now it fucking works for no reason.
I hate computers.
>>
>>101948415
It costs 2k buzz to train a flux lora on civit. So $2.
>>
File: ComfyUI_00409_.png (1.5 MB, 1024x1024)
>>
File: good morning saar.png (774 KB, 1024x1024)
>>
File: dpool(24).jpg (84 KB, 984x984)
>>101948121
He is /sdg/ jesus
>>
File: ComfyUI_00416_.png (1.32 MB, 1024x1024)
>>
File: 1694368295200976.png (1.12 MB, 1024x1024)
>>
>>101948662
This really says a lot about society.

Also, while I got training working, it seems to have frozen at picrel.

Trying to run this on 16GB 4080. Any of you other "low vram" (can't believe I am a vramlet now) manage to get this going? I know some of you have trained LoRAs on less than 24gb vram.
Would love some guidance.
>>
>>101948698
imagegen is fucking slow and i'm saying this as a 3090/24GB VRAM user
>>
File: file.png (1.19 MB, 768x1024)
>>101948440
>>
>>101948510
Amazing
>>
>>101948698
4070 ti Super 16gb here had success training a few loras on Kohya using this config.
https://github.com/bmaltais/kohya_ss/issues/2701#issuecomment-2294833735
>>
>>101948662
prompt for the ball gag?
>>
>its only 600 tokens to generate a good lora for Pony on civitai
>i have almost triple that
how the FUCK do people get away with making trash like nochenigger when its that inexpensive?
Shit i need to triple check how the tagging works to make sure this goes good i might just upload the results then.
Do Pony models absolutely need to follow booru tags, if i want to make totally sure it nails the exact outfit a character wears, should i make unique tokens to describe that outfit?
>>
>$ sign in prompt
>Comfy throws some EOF exception but continues with next prompt in queue anyway and hangs the system
All of this 1girl making business is held together with wishful thinking, spaghetti and dried cum for glue
>>
Flux or what it was got into the news because it's not censored and cucked enough. Is it still shit for anime style?
>>
>>101948770
>600 tokens
meant buzz. oops.
>>
>>101948770
Why do people hate on noche? His Loras are relatively decent for the average Lora.
>>
File: 1719110871319491.png (16 KB, 396x449)
>yfw 8gb vramlet
>>
File: 2d.png (764 KB, 768x1024)
>2d,anime,1girl,office_lady,1boy,sitting_on_face,facesitting,from_side,assertive_female
>>
File: ComfyUI_00380_.png (1.49 MB, 1024x1024)
>>101948763
https://civitai.com/models/651337/ballgag-flux
>>
>>101948753
Oh with kohya? Did you use gui?
>>
File: ComfyUI_Flux_9571.jpg (296 KB, 768x1344)
>>101948777
checked. there is a lora but it's bad lol. I'd wait for a proper anime finetune.
>>
>>101948829
Yeah
>>
>>101948833
Thanks, I will give it a go. How long did it take per lora?
>>
>>101948800
makes undeserved $ from them, most of his loras are not great or just average compared to competing loras.
in my case i get a special beef with him because his Stocking Anarchy one is pretty average to bad.
>>
>>101948824
it's the underscores, dummy
>>
>>101948844
>makes undeserved $ from them
who are you to say what is a deserved or undeserved amount? you're sounding very reddit right now.
>>
File: pos.jpg (388 KB, 832x1216)
>>
>>101948862
>criticism? must be from reddit!
>>
>>101948840
About 2 hours 40 minutes for 5000 steps
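
For anyone budgeting their own run, the arithmetic behind that figure can be sketched as below. Only the 2 h 40 min and 5000 steps come from the post; the helper names and the 3000-step example are made up for illustration:

```python
def seconds_per_step(hours: int, minutes: int, steps: int) -> float:
    """Average wall-clock seconds spent on each training step."""
    return (hours * 3600 + minutes * 60) / steps


def eta_hours(steps: int, sec_per_step: float) -> float:
    """Estimated total hours for a run of `steps` at a given pace."""
    return steps * sec_per_step / 3600


pace = seconds_per_step(2, 40, 5000)
print(f"{pace:.2f} s/step")
print(f"3000 steps at that pace: about {eta_hours(3000, pace):.1f} h")
```

That works out to roughly 1.9 seconds per step on that card. Other GPUs, resolutions, and batch sizes will differ.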
>>
If any of you boys has a folder of the Bogdanoff twins, I'll throw it on Civitai and train a flux lora for them + post it. (after my stocking lora is done anyway :) )

>>101948862
>Um did you just criticize the lora maker that's very reddit chungus
how about you make like a tree and get outta here, funny im making this response in the same post offering to make a free lora for a model i cant even run
>>
>>101948875
>>101948881
thinking you know what value things SHOULD be is what makes you reddit brained, you retards
>>
>>101948844
>undeserved $
So? People will make money off whatever they can. At least he's not selling generated images with tons of faults, he's not an e-whores who makes even more $ simply by flashing pussy to the camera either. If you truly cared, you'd duplicate his Loras but make them much better and sell them or release them for free to embarrass him.
>>
File: 1720453827692148.png (1.23 MB, 1024x1024)
ink sketch lora + basic miku prompt:
>>
>>101948888
>he holds no opinions about the world in fear that he might be wrong about them

kek
>>
>>101948888
>>101948891
why even cover him with these retarded arguments? Its perfectly normal to be critical of shit being sold as a product. What a waste of time.
>>
File: 1706897321285666.png (1.16 MB, 1024x1024)
>>101948905
>>
>>101948914
>>101948919
you people are brain dead
>>
>>101948931
>he cowers in the corner shitting and pissing himself as he types the words "you people are brain dead" in concession
you don't need to be like this.
>>
>>101948939
kill yourself, illiterate imbecile
>>
>>101948939
kek he struck your nerves
>>
>>101948947
>>101948952
all me btw
>>
>>101948817
the extra minutes it takes to gen allows you to do other stuff in the meantime :^)
>>
File: ComfyUI_00998_.png (1.14 MB, 832x1216)
>>
>>101948753
Running this script I hit this error
"network_module": network_module,
UnboundLocalError: local variable 'network_module' referenced before assignment
>>
File: 1919531684.png (1.23 MB, 896x1152)
>>
>>101948919
He makes good Loras, you try to stop him.
>>
is there a node or extension in comfy that works similar to civitai manager? for loras and default instance prompts, I mean.
>>
>>101949003
Are you on the sd3-flux.1 branch of Kohya?
>>
File: 2030596806.png (1.22 MB, 896x1152)
>>
>>101949040
Yes, I figured it out. The LoRA type was blank in the config. On to the next error
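
For anyone wondering how a blank field produces that exact error, here is a hypothetical stand-in (not Kohya's actual code; the type and module names are invented) showing the general pattern: a variable assigned only inside some branches, then used unconditionally.

```python
def build_config(lora_type: str) -> dict:
    # Hypothetical type names; the real GUI's options may differ.
    if lora_type == "TypeA":
        network_module = "example.module_a"
    elif lora_type == "TypeB":
        network_module = "example.module_b"
    # No else branch: a blank lora_type leaves network_module unassigned,
    # so the next line raises UnboundLocalError.
    return {"network_module": network_module}


def build_config_checked(lora_type: str) -> dict:
    """Same idea, but a blank or unknown type fails with a readable message."""
    modules = {"TypeA": "example.module_a", "TypeB": "example.module_b"}
    if lora_type not in modules:
        raise ValueError(f"LoRA type is blank or unknown: {lora_type!r}")
    return {"network_module": modules[lora_type]}
```

So when this error shows up, the first thing to check is whether a required dropdown or config field is empty.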
>>
File: flux_s_1.jpg (398 KB, 832x1216)
>>
File: 3449254677.png (1.05 MB, 896x1152)
>>
>>101949061
Should auto pick it once you enter your model directories. Make sure you haven't loaded it in the Dreambooth tab... I've done that a few times...
>>
can i run this on my chromebook
>>
>>101949144
yes
>>
Does Flux strictly require Comfy?
>>
>>101949159
Works on forge
>>
File: grid-0253.png (3.83 MB, 2304x1792)
>>
>>101949155
where do i start nigga
>>
>>101949159
no
>>
>>101949040
>Are you on the sd3-flux.1 branch of Kohya?
wait I'm not. Am I retarded or is there no branch of this? I can see sd3 scripts and that's all
>>
>>101949168
to clarify, he was joking.
>>
File: grid-0252.png (3.77 MB, 2304x1792)
>>
File: 0.jpg (508 KB, 1024x1024)
>>
File: ComfyUI_02_.jpg (1.38 MB, 2048x2048)
>>101949195
>>
>>101949186
kys
>>
File: 3559813457.png (1.06 MB, 896x1152)
>>
>>101949224
look buddy.. you don't wanna mess with me

lets just say im kind of a big deal around here
>>
>>101949238
delete this
>>101949239
teebs?
>>
>>101949239
Hello kind of a big deal around here, I'm Dad
>>
>>101949184
https://github.com/bmaltais/kohya_ss/tree/sd3-flux.1
It's the bmaltais fork, I forgot this wasn't the main one, shit.
>>
>>101949248
Oh god teebs is here too?
>>
>>101949255
oh fug, thanks
>>
File: ComfyUI8.17.2024__00078_.png (1.32 MB, 1248x1824)
>>
File: 00077-1838427454.png (997 KB, 896x1152)
>>101949206
>>
File: ComfyUI8.17.2024__00064_.png (2.06 MB, 1248x1824)
>>
im testing both guis, comfy is fine but sometimes in forge it will lag when unloading a model, how do you let it stay in memory, or unloading after each gen?
>>
File: ComfyUI8.17.2024__00381_.png (2.54 MB, 1248x1824)
>>
>>101949355
>Crystalsong Forest
comfy
>>
File: ComfyUI8.17.2024__00330_.png (3.3 MB, 1248x1824)
>>
>>101949314
I love it.

>>101949355
I love it.
>>
Fuck, I pulled and now lora are not working anymore with gguf.
AttributeError: 'GGMLTensor' object has no attribute 'tensor_shape'. Did you mean: 'tensor_split'?

Please send help!
>>
File: ComfyUI_Flux_9617.jpg (248 KB, 768x1344)
>>
File: ComfyUI8.17.2024__00154_.png (1.85 MB, 832x1216)
>>
File: 1699032454826863.png (1.45 MB, 1024x1024)
Miku but with a ghibli lora:
>>
File: 00070-4284174251.png (3.81 MB, 1344x1728)
>>101949405
Nice
>>
>>101948110
Do you think the success of Flux is a win or a loss for Stability AI? I've seen a lot of people saying that Flux killed Stability AI, but doesn't a rising tide lifts all boats? I think the future is bright for Stability AI. Flux will force them to release their better models.
>>
>>101949195
>>101949405
>>101949435
sick
>>
>>101949405
I love it.

>>101949435
I love it.

Real bangers in this thread.
This is the stuff of dreams my brothers. Imagine when these can all be animated with good quality.
>>
>>101949456
nah stability is toast.
>>
>>101949475
I like toast.
>>
>>
>>101949456
>doesn't a rising tide lifts all boats? I think the future is bright for Stability AI. Flux will force them to release their better models.
People have lost hope that Stability will actually release better models. They had to be aware that their former colleagues were about to release a competitor to SD3 and Stability hasn't seemed to do anything in response. The only way I see Stability surviving is if they can convince people to start training on their models rather than Flux
>>
Is there a Flux guide (including requirements) for retards (me)?
>>
Only way StabilityAI recovers is if they release SD3 8B, it beats Flux at what Flux does well and can do nudity out of the box.
So it is impossible.
>>
who's teebs
>>
me
>>
File: 1722878433163.png (1.26 MB, 1188x1070)
>>101949490
>The only way I see Stability surviving is if they can convince people to start training on their models rather than Flux
They will. Flux (or Flush the toilet, as I call it) is too VRAM hungry and isn't as trainable as SD. Flux (or Flush the toilet, as I call it) is competition for Midjourney. The open source community will inevitably come back to SD once they realize it's better suited for our needs.
>>
File: you just know.png (1.2 MB, 896x1152)
webcomic artist we coming for you next. Wait five, five months for the finetune and its a over.
>>
>>101949537
stop coping, Lykon
>>
>>101949537
nice bait
>>
>>101949537
nice meme
>>
why is everything so hard to run these days
>>
>>101949392
For anyone else having this error, revert comfyui to 14af129c5509d10504113a1520c45b0ebcf81f14, latest commit broke GGUF lora.
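
The revert is a plain `git checkout` of that hash. A minimal sketch of scripting it, assuming ComfyUI was installed with git clone; the helper names and the repo path in the example call are placeholders:

```python
import subprocess

# Commit hash quoted in the post above.
KNOWN_GOOD = "14af129c5509d10504113a1520c45b0ebcf81f14"


def checkout_command(commit: str) -> list[str]:
    """Build the git command that moves the working tree to `commit`."""
    return ["git", "checkout", commit]


def pin_comfyui(repo_dir: str, commit: str = KNOWN_GOOD) -> None:
    """Run the checkout inside the ComfyUI folder; raises if git reports an error."""
    subprocess.run(checkout_command(commit), cwd=repo_dir, check=True)


# Example call (hypothetical path):
# pin_comfyui("/path/to/ComfyUI")
```

This leaves the repo on a detached HEAD, so check out your usual branch again before pulling once a fix lands.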
>>
File: FluxPro37.jpg (309 KB, 1024x1024)
>>101949537
>>
>>101949578
>latest commit broke GGUF lora.
GOTTA LOVE UPDOOTING!
>>
>>101949567
Works on my machine
>>
when will people stop making loras that override all faces in the image?
>>
File: ComfyUI_00108_.png (991 KB, 832x1216)
>>
>>101949598
stfu
>>
>>101949608
really i think the text kills the realism for these gens, It looks too perfect and obviously stands out from the rest of the image. It'd have to actually be affected by the room's lighting and actually look like marker.
>>
File: 00084-962208020.jpg (450 KB, 1344x1728)
>>101949613
Kneel.
>>
When generating an image, leave a piece of your heart in it. You are approaching perfection, approaching the divine.
>>
File: works on my machine.png (31 KB, 200x193)
>>101949613
>>
File: file.png (173 KB, 599x400)
>>101949608
wow it really works!
>>
>>101949627
I'm making futa porn.
>>
File: ComfyUI_00127_.png (1020 KB, 832x1216)
>>101949617
some gens are better than others
>>
>>101949643
that's a whole lot better but still not quite there.
honestly Flux 2.0 could probably do it out of the box perfectly, two more weeks boys.
>>
File: ComfyUI_03196_.png (1.45 MB, 1152x1304)
>>
File: grid-0268.jpg (344 KB, 1536x2688)
>>
File: 1709353137399697.png (1.17 MB, 1024x1024)
another ghibli miku (lora)
>>
File: 1713761396599727.png (1.25 MB, 1024x1024)
>>101949674
>>
>>101949674
>>101949691
That's a pretty big pussy.
>>
File: ComfyUI_00492_.png (911 KB, 1280x832)
Guess the website. Prompt is the blurb they use to advertise one of their video updates
>>
File: 1693499509205771.png (1.21 MB, 1024x1024)
pixar lora miku, got a cute pout face in this gen

https://civitai.com/models/650251
>>
File: ComfyUI_00008_.png (701 KB, 832x1216)
>>
File: 1694179467085213.png (1.24 MB, 1024x1024)
>>101949733
>>
File: FluxDev_01715_.jpg (215 KB, 832x1216)
>>
What's the biggest flux GGUF you can fit in a 12GB gpu?
>>
>>101949775
lmao
>>
File: ComfyUI_00497_.png (933 KB, 1280x832)
>>
>>101949119
This is great, catbox?
>>
>>101949796
for me Q4_1 is the best, Q5 seemed to sometimes fit and sometimes not fit, pushing the limit too much
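
A rough way to see why Q5 is borderline: the older GGUF quant types store weights in blocks of 32 with a small per-block header, so the weight size can be estimated from bits per weight. This sketch assumes about 12 billion parameters and that every tensor uses the chosen quant, which real files only approximate:

```python
# Bytes used per block of 32 weights in the older GGUF quant formats
# (4/5/8-bit values plus an fp16 scale, and an fp16 minimum for the *_1 types).
BLOCK_BYTES = {"Q4_0": 18, "Q4_1": 20, "Q5_0": 22, "Q5_1": 24, "Q8_0": 34}
WEIGHTS_PER_BLOCK = 32

# Assumption: roughly 12 billion parameters, all stored in the chosen quant.
ASSUMED_PARAMS = 12e9


def bits_per_weight(quant: str) -> float:
    """Effective bits per weight, including the per-block header."""
    return BLOCK_BYTES[quant] * 8 / WEIGHTS_PER_BLOCK


def estimated_gb(quant: str, params: float = ASSUMED_PARAMS) -> float:
    """Estimated size of the weights alone, in GB (10^9 bytes)."""
    return params * bits_per_weight(quant) / 8 / 1e9


for name in BLOCK_BYTES:
    print(f"{name}: {bits_per_weight(name):.1f} bits/weight, ~{estimated_gb(name):.2f} GB")
```

Under those assumptions Q4_1 lands around 7.5 GB and Q5_0/Q5_1 around 8.25 to 9 GB. That is weights only, so the text encoders, VAE, and activations still have to fit in whatever is left of 12 GB.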
>>
File: x.png (1.59 MB, 1024x1024)
>>101949809
>>
>>101949811
what's your speed with and without loras?
>>
File: 00103-3557252780.jpg (548 KB, 1344x1728)
>>
>>101949832
~4.2 s/it on my 3060 without lora, seems to go up a tiny bit to 4.3 with
6 s/it when i use bigger flux models that can't all fit in my vram
>>
File: 2179191790.png (1.78 MB, 896x1152)
>>
File: ComfyUI_00626_.png (2.77 MB, 1408x1408)
>>
https://imgsys.org/rankings
flux got added to the rankings, and unsurprisingly flux-dev wins
>>
>>101949869
>Nude woman covered in magnetic tape from a VHS
>>
File: 0.jpg (127 KB, 1024x1024)
>>
File: Flux1_Q3_K_S.png (952 KB, 2360x851)
Oh quants are about to get so much more confusing once I write the rest of the kernels.
>>
File: 1722706917596222.png (1004 KB, 1024x1024)
>>
>>101949894
doing god's work anon
>>
>>101949435
fucking sick, how do you get that granulation effect?
>>
File: ComfyUI_03001_.png (1.69 MB, 1344x768)
>>101949894
LET'S GOO

ty city
>>
File: ComfyUI8.18.2024__00006_.png (2.65 MB, 1248x1824)
>>
File: ComfyUI_00627_.png (2.11 MB, 1536x1376)
>>101949894
city96 was always my favorite anon
>>
>>101949894
will there be a Q8_K?
>>
>>101949894
You fuckin' rock dude!
>>
>>101949894
nice, K quants are great. any chance for a K quant for the T5?
>>
File: Bonzi_Really.png (1.12 MB, 894x894)
>>101949894
oooooooo
>>
>>101949912
>illustration, (pointillism:1.3), (tight:1.1), ultra tight, super tight, very tight
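
For readers new to the `(text:weight)` notation in that prompt: several UIs treat it as per-phrase emphasis, with unmarked phrases weighted 1.0. Here is a minimal sketch of reading such a prompt, assuming flat comma-separated chunks with no nesting or escaping; it is an illustration, not any UI's actual parser:

```python
import re

# Matches a whole chunk of the form "(some text:1.3)".
WEIGHTED = re.compile(r"^\((?P<text>[^():]+):(?P<weight>\d+(?:\.\d+)?)\)$")


def parse_prompt(prompt: str) -> list[tuple[str, float]]:
    """Split on commas and return (text, weight) pairs; unweighted chunks get 1.0."""
    parsed = []
    for chunk in prompt.split(","):
        chunk = chunk.strip()
        if not chunk:
            continue
        match = WEIGHTED.match(chunk)
        if match:
            parsed.append((match["text"].strip(), float(match["weight"])))
        else:
            parsed.append((chunk, 1.0))
    return parsed


pairs = parse_prompt(
    "illustration, (pointillism:1.3), (tight:1.1), ultra tight, super tight, very tight"
)
for text, weight in pairs:
    print(f"{weight:.1f}  {text}")
```

In this prompt only "pointillism" and "tight" carry extra weight; the rest are plain tags.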
>>
>>101949931
Nice colours
>>
interesting, if I use q4/q8 models in forge, loras wont work properly, but they are fine in comfy. fp8 works in forge with loras, though.
>>
>>101949937
Probably not at first since that didn't come with a reference numpy kernel in gguf-py so I'll have to dig around the C++ logic to find out how it works, and there's more important tasks such as figuring out why the fuck the tensors still randomly fail when using LoRAs kek
>>101949947
Yes, comfy just added custom ops as an option for the text encoder + llama.cpp officially supports T5 so the quants will be standard too.
>>
File: 00014-2148446935.png (1.03 MB, 896x1152)
>>101949894
is there going to be a ranking comparison once this gets released.

Would be interesting which model produces the best outputs once the kernel gets out.
>>
>>101949973
see
>>101949578
>>
>>101949983
no comfy is fine, I mean forge isn't working with q4/q8 when I use a lora, not sure why. I added the vae and encoders in the dropdown too.
>>
>>101949982
I'll probably make one eventually though I'm on a shitty 10GB 3080 and I'm pretty sure miku guy would kill me if I made him compare all of them kek.
>>
>>101948753
Is there supposed to be a flux script in here?
>>
File: ComfyUI_00629_.png (2.27 MB, 1152x1536)
>>101949972
tyty
>>
>>101950029
pls catbox i want to utilize this style
>>
File: Capture.jpg (201 KB, 3807x903)
https://civitai.com/models/630820/flux-fusion-ds-nf4-fp4-fp8-fp16-4-steps-aio-and-unet-only?modelVersionId=705611
It's gonna be such a mess, I don't think it's a good idea to categorise different quants as if it was another version of the finetune, maybe putting multiple quant options on the download button would be the thing to do instead
>>
File: ComfyUI_03216_.png (1.63 MB, 1024x1024)
>>
>>101949909
She is so cute with BHA artstyle. Never thought i would want to see this mix.
>>
>>101950050
nice gen, catbox?
>>
>>101950008
I don't think so, I don't have one, I just loaded the config I didn't change any settings besides directing it to my folders and setting up the sample prompt. Make sure you're on the bmaltais fork >>101949255
>>
so, why does comfy work with q4 + clip models + lora, but forge doesn't apply the lora for q4, it only works with fp8. is forge not working with quant models yet for lora? I get output but no lora.
>>
>>101948110
What is this horse from >>101948110
>>
should i even bother with GGUF if i can run full bf16 in vram at 1.2s/it

or does it offer a speed boost?

i updated comfy to try GGUF after not doing so for a week and everything is broken (even after updating all nodes)
>>
I have 6GB of VRAM and 16GB CPU RAM on a laptop, how do you run FLUX on forge?
>>
File: high_res_o_00202_.png (743 KB, 768x1088)
>>
>>101949894
the model is 5gb, but it eats 6.8gb during inference, is that extra amount of vram consumption consistent?
>>
I'm trying to run the gguf flux models for the first time, are these the two clip models I need?
https://huggingface.co/openai/clip-vit-large-patch14/tree/main
https://huggingface.co/city96/t5-v1_1-xxl-encoder-bf16/tree/main
the 2nd one is nearly 10gb!!
>>
File: ComfyUI_03212_.png (1.65 MB, 1024x1024)
>>101950068
Sure.

https://files.catbox.moe/amb4iw.png

I found a new method using https://github.com/logtd/ComfyUI-SEGAttention for prompt adherence that's working out pretty well. It replaces the cfg entirely but you can still benefit from Adaptive Guider to disable SEG attention, like you can with the other method.
>>
>>101950092
I guess yeah, making the picture takes vram, especially at "high" resolutions like 1024x1024
>>
>>101950075
it's from this anon's gen >>101947888
>>
File: Capture.jpg (180 KB, 1744x831)
>>101950105
when I tried SEGAttention I thought it acted exactly like PerpNeg, it's 3x slower too right?
>>
>>101950069
Yeah definitely on this fork, ran the setup, pressed 1 to install gui, loaded script, set folders, pressed go, keep getting failures. This thing specifically
>>
>>101949993
I misread sry
>>
File: 1696012450616015.png (1021 KB, 896x1152)
>>101950055
yeah I just got a bunch of civitai loras to test various styles, turned out pretty good
>>
Someone save this for the next time someone asks how to use joy caption locally: (taken from /h/)

You can clone joycaption from hugging face the same way you clone from GitHub (you can even use github's desktop app to do it if you feel so inclined)
copy+paste your venv from forge or comfy or whatever you use into the cloned folder to save yourself having to build the venv and install all the shit its requirements.txt is missing
then edit the app.py and change the
>MODEL =
line from hugging face to a local LLM of your choosing
here's a quant'd version
https://huggingface.co/unsloth/Meta-Llama-3.1-8B-bnb-4bit/tree/main
though there are other uncensored ones that might be better suited when using for NSFW, it's worked well for me so far

you can write a .bat file to launch it if on windows (ask chatgpt how if you're not familiar)

any errors just ask chatgpt how to resolve them
>>
>>101950092
Yeah it's like context with LLMs, the unpacked weights/actual image data/hidden states needs somewhere to go and they're in FP16.
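
To get a feel for the sizes involved: an FP16 tensor takes 2 bytes per element. The shapes below are purely illustrative guesses, not read from the model, and they are not meant to add up to the 1.8 GB gap:

```python
FP16_BYTES = 2  # bytes per element in half precision


def fp16_mib(*shape: int) -> float:
    """Size in MiB of an FP16 tensor with the given shape."""
    elements = 1
    for dim in shape:
        elements *= dim
    return elements * FP16_BYTES / 2**20


# Hypothetical shapes, chosen only to show the order of magnitude.
print(f"one unpacked 3072 x 12288 weight matrix: {fp16_mib(3072, 12288):.0f} MiB")
print(f"hidden states for 4096 tokens x 3072:    {fp16_mib(4096, 3072):.0f} MiB")
```

A handful of unpacked weight matrices plus the intermediate tensors of a block add up quickly, and a VAE left in memory adds to it as well.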
>>
>>101950146
so what would be the equivalent to quanting the context cache? Do diffusion models have that?
>>
File: IMG_4622.gif (63 KB, 640x478)
>>101950050
>>101950105
>miku with black skin and dreadlocks skating
>>
File: ComfyUI_00018_.png (1.19 MB, 832x1216)
>>
>>101950165
Yeah that'd be it but no we don't have that. Pretty sure forcing those to be in fp8 would completely destroy the image quality. Some of the vram in my image is also probably from the vae being still loaded.
>>
Is there a comparison between T5 fp8 and fp16?
>>
>fp16
why don't we have fp32? or is it just diminishing returns past that point?
>>
>>101950123
To be honest I don't know what PerpNeg is. Gen times are about twice as slow now though. I like the results though. Tonemap seemed to lose some detail compared to DynamicThresholding, but DynamicThresholding tended to end up with more artifacts like noisy images and stuff.
>>
>>101950203
there was in the very first days of flux, someone made a comparison with a photo of a woman and the fp8 T5 one made her skin more green for some reason
>>
>>101949999
>10GB 3080
Lol, im using the same gpu. What are the odds. If you ever get to Q8_K if its ever possible that would probably be the most popular model for all us gpu lets.
>>
File: ComfyUI_00020_.png (1.13 MB, 832x1216)
>>
>>101950221
it's this: https://github.com/pamparamm/sd-perturbed-attention
basically it has the same role as SEGAttention, it's supposed to give good results at CFG = 1
>>
File: Plague(r).jpg (84 KB, 738x1292)
>>101950234
>Mining coal with a battle axe
Gee no wonder why he is sad
>>
>>101950231
you can't fit Q8 or even Q5 on 10gb tho
>>
>>101950140
Really thankful you stumbled upon that mix. It's so visually appealing to look at, I can't put my finger on it. This just makes Hatsune Miku 10x cuter
>>
File: ComfyUI_00631_.png (3.18 MB, 1152x1536)
>>
>>101950126
No idea then
>>
File: 1694944248528148.png (1.16 MB, 896x1152)
she is getting stronger!
>>
>>101950221
>Tonemap seemed to lose some detailed compared to DynamicThresholding, but DynamicThresholding tended to end up with more artifacts like noisy images and stuff.
True, I tried for days to find the right parameters to recreate the magic unslopping effect that CFG + GuidanceNeg 10 + DynamicThresholding does, without much success
https://reddit.com/r/StableDiffusion/comments/1euk2vw/some_xy_plot_i_made_to_try_to_understand_the_cfg/
Maybe a method that will have the DT effects without the artifacts exists and we haven't found it yet, or it still doesn't exist and we are searching in the void kek
>>
>>101950264
i managed to still use the models with forgeui. It just took me 4 to 9 minutes of wait time to gen a single image 1024 x 1024 last time i tried.

Just not really worth the wait time when i could do the same with Q4 faster.
>>
>>101948259
They said it's coming soon, I imagine the next month or so.
>>
>>101950249
well shit, you're right. just tested and got almost identical gens. thanks for letting me know.

>>101950293
So that's still your workflow then Miku-anon? RIP. I'm probably gonna go back to it because these double gen times suck
>>
>>101950307
desu I doubt we'll be able to run it on our machines, unless we use ultra optimized quants like Q6_K then maybe?
>>
>>101950252
yeah flux seems to think pickaxe = axe sometimes kek
>>101950307
man that sounds like it will have horrific gen times, but still sounds cool
>>
>>101950171
>insert autistic screeching about how he can't gen it without a hack
>>
>>101950080
It can be useful to run the Q8 or Q4 version if you want to keep some VRAM free for a local LLM to help write prompts
>>
File: ComfyUI_02670_.png (1.82 MB, 832x1216)
>>101950171
>>101950382
Funny thing is, I'm not even that anon. Just the combination of conflicting features on Miku's base model makes a good way to benchmark prompt-adherence.
>>
File: file.png (63 KB, 952x650)
>>101949868
>>101949796
i'm not sure why, but the full size models work on mine without any offloading at all, with 12gb vram ~3.7 iterations/second
>>
File: Ya feel me nigga.jpg (407 KB, 1080x1047)
Just to think.. In a year's progress, we'll be running audio visual and text all on one machine at perfect accuracy..

..You're huffing the same hopium too right?
>>
>>101950421
Even my 4090 can't run weight_dtype: default, lmao.
>>
>>101950421
>without any offloading at all
nah, comfy intelligently offloads when necessary and doesn't necessarily fill every last byte of VRAM first, i am also able to run fp16 on my 3060 like that
though for me it's more like 6 s/it, not sure why yours is faster
>>
File: Capture.jpg (29 KB, 846x517)
>>101950249
>>101950221
My b I gave you the wrong one, it's PerpNegGuider and it's native on ComfyUI
https://perp-neg.github.io/
https://civitai.com/models/625042?modelVersionId=706228
>>
This is a photograph capturing a man squatting on the ground in a bustling market street, with the iconic Taj Mahal in the background, shrouded in a light haze. The man, of South Asian descent, has a medium build and is dressed in traditional attire: a light beige kurta (long shirt) over a dark vest, with loose, beige trousers and flip-flops. He has short, dark hair and a neatly trimmed beard. His expression is relaxed, and he is smiling slightly.

The street is narrow and crowded with makeshift stalls and vendors, covered with blue and brown tarpaulins. The ground is dusty, with scattered debris and small objects, including a few plastic bags. The stalls are low, with a few motorcycles parked between them, adding to the chaotic yet lively atmosphere.

The Taj Mahal, a grand mausoleum with its distinctive domes and minarets, stands prominently in the background, its ivory-white marble glowing softly against the hazy sky. The surrounding buildings are a mix of modern and traditional architecture, with some high-rise structures visible in the distance, blending into the misty atmosphere. The overall scene is rich in cultural and historical context, capturing the essence of a busy market day in a historic city.
>>
>>101950421
I'm using q5.1 and I'm getting 5.2 s/it for 1024x1024 pics. What am I doing wrong?
>>
>>101950439
I have a 4070 Super and I can run the fp16 model and clip/T5 no problem with ComfyUI. I think it has something to do with ComfyUI's lowvram mode that allows it.
>>
>>101950458
im almost in tears at my stocking LORA from civitai not going very well on the 20th epoch, im gonna throw this guy's prompt into AutismMix confetti and see what it gives me.
>>
>>101950461
i have no idea, i'm extremely stupid and i just use the default settings from the noob guide http://comfyanonymous.github.io/ComfyUI_examples/flux
>>
>>101950434
yep, we will also get img to vid locally as well. Give it only a few years and people will be making full feature films from their own home.

No more hollywood or bollywood films polluting the medium with propaganda now that you can have them home baked.
>>
File: ComfyUI_00026_.png (900 KB, 832x1216)
Flux, that's not what I meant by dangling pair of legs...
>>
File: myFile_30_3.0_026.png (2.51 MB, 1536x1536)
>>101950458
what did it mean by this?
>>
File: ComfyUI_03225_.png (1.87 MB, 1024x1024)
>>101950464
Hm I guess the implementations are different. Sick results though, will try and tune this and see what I prefer.
>>
>>101950503
what prompt did you use for the movie poster?
>>
>>101950527
Oops wrong post
>>101950450
>>
>>101950421
>full fp16 model
>12gb vram
>~3.7 iterations/second
Excuse me? I hope you mean 3.7 s/it instead?
>>
File: ComfyUI_00027_.png (850 KB, 832x1216)
>>101950528
>A poster for an animated 3D Disney movie. The poster says "DISNEY Pixar presents EPSTEIN" at the bottom, with the stylized Disney and Pixar logos. The rest of the poster shows an animated depiction of a dark jail cell. Behind the bars, a man's legs can be seen dangling from the ceiling, subtly illuminated. The legs are clad in gray pants and black shoes. There is an empty bed in the corner of the cell, and the floor and walls are made of concrete while the bars are cold hard iron. The legs hang in the upper half of the view, and the rest of the hanging body is out of view. The image implies someone is hanging from a noose after committing suicide, but all that can be seen is the dangling legs.
>>
I dont get it, loras dont always work in forge with a quantized model but it's fine in comfy, fp8 always works in forge for loras. but I wanted q4/q8 cause I have 16GB.
>>
>>101950322
>>101950346
If it's good, people will find a way to bring down requirements and speed up generation, just like with Flux. Maybe they'll also release a distilled model like Schnell, if that's possible for video gen. I'm pretty optimistic based on the past few weeks, but we'll see.
>>
File: llamaquants.png (453 KB, 3000x2100)
>>101949982
Do you need it? You can draw a direct line from how these quants perform in LLMs to a fairly accurate prediction of how badly things degrade the smaller the quant you use. Based on the current image, I am willing to say that Q4_K_M is probably going to be the one with the fewest compromises and the one everyone shoots for, while Q6_K is probably going to be the threshold where it will still adhere to the prompt. I personally like Q5_K_M with LLMs, though it's possible there's going to be enough degradation of the diffusion model that people won't go for that quant.
>>
File: ComfyUI_00635_.png (2.32 MB, 1152x1536)
>>
>>101950557
Give it a couple more days before people settle on the right way to quantize the loras. Certain implementations will be better than others
>>
>>101950551
sorry, yes, 3.7sec/it. in my defense, i did say i was retarded
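
For anyone else mixing up the two units: s/it and it/s are reciprocals, so 3.7 s/it is roughly 0.27 it/s. A minimal sketch of the conversion using the speeds quoted in this thread. The function names and the 20-step count are assumptions for illustration, and the time estimate ignores model loading and VAE decode.

```python
def seconds_per_iteration_to_rate(sec_per_it: float) -> float:
    """Convert seconds per iteration (s/it) to iterations per second (it/s)."""
    if sec_per_it <= 0:
        raise ValueError("sec_per_it must be positive")
    return 1.0 / sec_per_it


def sampling_time(sec_per_it: float, steps: int) -> float:
    """Rough sampling time in seconds: only the denoising steps are counted."""
    return sec_per_it * steps


# Speeds quoted in the thread; the step count (20) is an assumed example value.
for label, s_it in [("12gb card, fp16", 3.7), ("q5.1", 5.2), ("3060, fp16", 6.0)]:
    rate = seconds_per_iteration_to_rate(s_it)
    total = sampling_time(s_it, 20)
    print(f"{label}: {rate:.2f} it/s, ~{total:.0f} s for 20 steps")
```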
>>
>>101950572
is this resampled through a pony model, or have flux LoRAs come this far this quickly?
>>
>>101950552
Thank you so much
>>
>>101950527
There's even PerpNegAdaptiveGuider if you want to combine PerpNeg with Adaptive Guidance (for the speed boost)
https://github.com/asagi4/ComfyUI-Adaptive-Guidance
>>
>>101950577
fp8 works fine but occasionally there will be lag when unloading a model, I want a toggle to keep it in memory

comfy is good but I need a tool like civitAI helper for default instance prompts, as it isnt always obvious.
>>
>>101950607
Oh nice, thank you.
>>
https://civitai.com/models/657252/fluxstanza?modelVersionId=735368

a costanza lora, we are in the new age of memes
>>
File: ComfyUI_00637_.png (2.22 MB, 1408x1408)
>>101950591
just flux w/ a lora
>>
>>101949894
Any chance of t5 quantization?
>>
>>101949894
Well, _K quants implemented. Thank god the llama.cpp guys had reference numpy code for most things.
Q6_K is for some reason faster and better on SD1.2 than Q8_0 lol, then again, it's SD1.2.
The actual c++ code doing the quantization is questionable and it's hard to find analogies to the keys they use, trying to figure that out before uploading them. Will also have to quant from FP32 because I didn't add BF16 support lol.
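
For readers who haven't looked at the formats: Q8_0 is the simplest one to picture. Weights are grouped into blocks of 32, each block stores one fp16 scale plus 32 int8 values, which works out to 34 bytes per 32 weights, or 8.5 bits per weight. Below is a rough pure-Python sketch of that scheme. It is not the code from the node or from llama.cpp, the function names are made up for illustration, and the _K quants use a more involved super-block layout that is not shown here.

```python
import struct

BLOCK_SIZE = 32  # Q8_0 groups weights into blocks of 32


def to_fp16(x: float) -> float:
    """Round a Python float to half precision, the way a stored scale would be."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def quantize_q8_0(weights):
    """Return a list of (scale, int8_values) blocks. Illustrative helper only."""
    if len(weights) % BLOCK_SIZE != 0:
        raise ValueError("length must be a multiple of 32")
    blocks = []
    for start in range(0, len(weights), BLOCK_SIZE):
        block = weights[start:start + BLOCK_SIZE]
        amax = max(abs(w) for w in block)
        scale = to_fp16(amax / 127.0)
        if scale == 0.0:
            quants = [0] * BLOCK_SIZE
        else:
            # Python's round() rounds halves to even, a small difference from
            # C's roundf(); the clamp guards against the fp16-rounded scale
            # pushing the largest value just past 127.
            quants = [max(-127, min(127, round(w / scale))) for w in block]
        blocks.append((scale, quants))
    return blocks


def dequantize_q8_0(blocks):
    """Rebuild approximate float weights from (scale, int8_values) blocks."""
    return [scale * q for scale, quants in blocks for q in quants]


weights = [(i - 16) / 16 for i in range(32)]  # toy data, one block
restored = dequantize_q8_0(quantize_q8_0(weights))
print(max(abs(a - b) for a, b in zip(weights, restored)))  # worst-case error
```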
>>
>>101950670
damn, very nice.
>>
File: Flux_01171_.png (1.15 MB, 768x1344)
No CFG used, close to getting the benchmark that people keep using
>>
>>101950572
That's a cute style, is that the LoRA?
>>
>>101950569
i guess only time will tell. Once we get these models on hand we will know for certain which gives the best gens for the least gpu usage.
>>
>>101950701
nice, care to give your workflow?
>>
>>101950701
Try enforcing an uncharacteristic style like "50s comic book" or something, that's where I ran into issues
>>
File: 00125-1763863892.png (2.9 MB, 1728x1344)
>>101950458
>>101950651
>>
>>101950624
>>101950527
what PerpNeg parameters did you use to get that insane result?
>>
File: LOL HAHAHHA.jpg (63 KB, 494x490)
>>101950728
fucking visionary over here
>>
>>101950719
https://files.catbox.moe/6djyrj.png
>>101950727
I'll try that out
>>
>>101950728
kek
>>
File: 00021-1572976536.png (1.12 MB, 896x1152)
>>
>>101950692
so it's normal for Q5 to be slower than Q4?
>>
>>101949894
>>101950692
I kneel.
>>
>>101950732
Default PerpNegGuider, cfg 1.0 and neg_scale 1.98. Seems really unstable though since I didn't prompt for the butterflies, just got lucky on the first gen. I can give you a box if you want
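
In case the cfg / neg_scale pair is unclear: the idea on the Perp-Neg page linked earlier is to only subtract the part of the negative prediction that is perpendicular to the positive one, so a negative prompt that overlaps with the positive doesn't cancel it out. A toy sketch on plain lists, assuming `uncond` is the empty-prompt prediction. This is not the ComfyUI node's actual code, and the function and argument names are hypothetical.

```python
def perp_neg_guidance(uncond, cond_pos, cond_neg, cfg=1.0, neg_scale=1.0):
    """Combine three noise predictions, pushing away only from the part of the
    negative direction that is perpendicular to the positive direction."""
    pos = [p - u for p, u in zip(cond_pos, uncond)]
    neg = [n - u for n, u in zip(cond_neg, uncond)]

    # Scalar projection coefficient of neg onto pos.
    pos_norm_sq = sum(p * p for p in pos)
    if pos_norm_sq == 0.0:
        proj = 0.0
    else:
        proj = sum(n * p for n, p in zip(neg, pos)) / pos_norm_sq

    # Remove the component of neg that is parallel to pos.
    perp = [n - proj * p for n, p in zip(neg, pos)]

    return [u + cfg * (p - neg_scale * q) for u, p, q in zip(uncond, pos, perp)]


# Toy 2D example with the settings from the post above (cfg 1.0, neg_scale 1.98).
print(perp_neg_guidance([0.0, 0.0], [1.0, 0.0], [1.0, 1.0], cfg=1.0, neg_scale=1.98))
# -> [1.0, -1.98]
```

With a negative that points the same way as the positive (say `[2.0, 0.0]` in the toy example), the perpendicular part is zero and the positive direction comes through untouched, which is the whole point compared to a plain negative prompt.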
>>
File: Capture.jpg (146 KB, 1175x1186)
>>101950748
>Boomer prompting
Many such cases with Flux :(
>>
>>101950774
I used an LLM to turn my non-boomer prompt into a boomer prompt. I hate that it works so well but I enjoy the end results.
>>
>>101950774
Is it really boomer prompting when it is just how it works? It's just describing the image.
>>
>>101950774
kek i am doing the same thing, write the prompt in tags in clip l, then ask gpt to turn my tags into a boomer prompt for the t5
>>
File: Q2_K_M.png (913 KB, 2055x845)
Q2_K is, as expected, completely retarded.

>>101950758
Yes I think my shitty tensor block access code was slowing it down, I tried to improve that as well but who knows if it did anything lol.
>>
>>101950728
kek
>>
>>101950794
Well, at least it can still spell!

(prompt that please)
>>
>>101950794
MsPaint Flux
>>
>>101950787
>It's just describing the image.
Adding both "Black African" and "dark skin" is equivalent to (dark skin:2), basically you're just spamming that token to force Flux to consider it, that's not natural at all
>>
>>101950794
Q2_K = soul
>>
>>101950769
>Default PerpNegGuider, cfg 1.0 and neg_scale 1.98. Seems really unstable though since I didn't prompt for the butterflies, just got lucky on the first gen.
yeah maybe neg_scale 1.98 is a bit too much, on the civitai example it's at 1.5, but yeah, seems promising indeed
>>
>>101950769
>I can give you a box if you want
Sure, I'm interested in that one
>>
>>101950804
>basically you're just spamming that token to force Flux to consider it, that's not natural at all
Is it any less natural than hacking back in CFG and using a negative prompt to "force" the model not to include it?
>>
File: 00115-3557252781.jpg (737 KB, 1728x1344)
>>
File: ComfyUI_13404_.png (1.05 MB, 1024x1024)
>>101950799
Wonder if I could whitelist some crap to make the quality degradation less horrible. A 3.8GB file outputting trash is less useful than a 4GB one that doesn't mangle the output.
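
To put rough numbers on that tradeoff, file size can be estimated from the per-block storage cost of each type. A back-of-the-envelope sketch: the block layouts below are the GGML ones as I understand them, the parameter counts are made-up example values (not taken from any specific model in this thread), and file metadata overhead is ignored.

```python
# (bytes per block, weights per block) for a few GGML tensor types
BLOCK_LAYOUT = {
    "F16": (2, 1),
    "Q8_0": (34, 32),   # 32 int8 values + one fp16 scale
    "Q4_0": (18, 32),   # 32 4-bit values + one fp16 scale
    "Q2_K": (84, 256),  # 256 2-bit values + 16 scale bytes + two fp16 values
}


def bits_per_weight(qtype: str) -> float:
    """Average storage cost of one weight in the given type."""
    nbytes, nweights = BLOCK_LAYOUT[qtype]
    return nbytes * 8 / nweights


def file_size_gb(param_counts: dict) -> float:
    """param_counts maps a type name to how many weights are stored in it."""
    total_bytes = sum(
        count * BLOCK_LAYOUT[qtype][0] / BLOCK_LAYOUT[qtype][1]
        for qtype, count in param_counts.items()
    )
    return total_bytes / 1e9


# Hypothetical 12e9-weight model: everything in Q2_K versus keeping an
# assumed 5% of the weights (the "whitelisted" tensors) in F16.
all_q2 = file_size_gb({"Q2_K": 12e9})
mixed = file_size_gb({"Q2_K": 11.4e9, "F16": 0.6e9})
print(f"all Q2_K: {all_q2:.2f} GB, with 5% kept in F16: {mixed:.2f} GB")
```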
>>
>>101950835
>me every day for the past week or so that i've been lurking these threads
>>
>>101950832
https://files.catbox.moe/qips4g.png
>>
>>101950839
unironically better than SD3 still
>>
>>101950839
honestly not too bad at all, still better than the equivalent SD lmao >>101950848
>>
>>101950833
I much prefer adding a CFG hack that works and then not touch it again than spending the rest of my life writing stuff like "A drawing of Miku as a black african, it means she has a black skin, black as is the absence or complete absorption of visible light... please flux consider that concept of blackness!!!"
>>
>>101950847
thanks anon
>>
File: 1714632271845532.png (1.27 MB, 896x1152)
>>
>>101950847
too bad you don't have a seed node so I can't reproduce your results
>>
File: ComfyUI_00639_.png (1.89 MB, 1408x1408)
>>
File: file.png (295 KB, 1478x1010)
>>101950926
Seed is:

438710575208934
>>
>>101950960
oh it was that red node on my side, kek, is there a reason you went for that custom node instead of the regular seed node in ComfyUI?
>>
>>101950839
Can you fix your node for the recent ComfyUI updates?
>https://github.com/comfyanonymous/ComfyUI/commit/bb222ce
breaks LoRAs with GGUF
>https://github.com/comfyanonymous/ComfyUI/commit/4f7a3cb
breaks model loading
>>
File: Capture.png (338 KB, 339x428)
>Here's your lora bro
Onsite training was a mistake
>>
File: 1717327948000962.png (1.17 MB, 1024x1024)
teal color power aura miku
>>
>>101950976
Does comfy have a standalone seed node that you can link to all your inputs and can randomize and fix on a button press? Honestly never bothered to look lol, rgthree was one of the first custom nodes I downloaded since he has a plethora of utility stuff

https://github.com/rgthree/rgthree-comfy
>>
File: ComfyUI_00641_.png (2.1 MB, 1408x1408)
>>
>>101950982
cant wait for the female brit lora. Man is making some progress.
>>
>>101950982
looks about right
>>
File: Capture.jpg (25 KB, 789x313)
>>101951004
>Does comfy have a standalone seed node that you can link to all your inputs and can randomize and fix on a button press?
no it's not that sophisticated kek, but desu if I want to change a seed I just do a +1 kek
>>
>>101950982
I block every civitai users that make retarded loras like that, made my life way easier as there aren't that many people making loras to begin with
>>
>>101951059
tbf im used to using the default KSampler nodes that just have a seed input


