[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: the longest dick general.jpg (2.96 MB, 2217x3264)
2.96 MB
2.96 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bred : >>102964600

Very Opposite Opinion Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://tensor.art/models
https://liblib.art
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3

>SD3.5
https://huggingface.co/stabilityai/stable-diffusion-3.5-large
https://replicate.com/stability-ai/stable-diffusion-3.5-large

>Sana
https://github.com/NVlabs/Sana
https://sana-gen.mit.edu

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
File: 2024-10-25_00024_.png (1.54 MB, 720x1280)
1.54 MB
1.54 MB PNG
>>102974667
with the Van Gogh style lora, and trigger.
>>
Blessed thread of frenship
>>
File: 2024-10-25_00028_.png (1.65 MB, 720x1280)
1.65 MB
1.65 MB PNG
>>102974909
LyingSigmaSampler experiments.
>>
>>102975105
You see a guy with a paintbrush, stick him with a knife.
>>
I'm an artist.
>>
Can someone explain how the denoise steps work? Those are done by the samplers?
>>
>>102975152
>LyingSigmaSampler
I've seen the examples where it appears to generate "more details" and even lessen the blur present in Flux but, it gives images an unpleasant appearance IMO.
>>
>>102975178
i think the same but haven't messed around with it myself yet
>>
>>102975152
another

>>102975178
imo this stuff will be basic to ai images soon enough. The guy who made it is cloning DemonDetailer of A1111 or whatever the're called. I don't yet understand it. The current version has simple settings. It's a multiplier that applies to a portion or all of steps. Rather obviously, it could be done in more sophisticated ways.
>>
File: 2024-10-25_00029_.png (1.61 MB, 720x1280)
1.61 MB
1.61 MB PNG
>>102975220
image.

>>102975220
um actually not a mult, an adder? idk, but it can't hit plus of 100%
>>
>I'm currently convinced they nuked all popular celebrities from their training sets to dodge controversy.
They used a VLM to erase the tags that mentioned the celebrity's names to replace them with tags that didn't, but I reckon they did it because they were retards, otherwise Trump would have been tagged as blond man, it can draw it because the VLM can recognize him.
So it wasn't intentional, they let there what the model could recognize, if it recognized your celebrity it'd be properly tagged and would be drawn fine, but they didn't care either way.
>>
>>
>>102975292
yep, the few ones it recognizes clearly shows it was never intentional
what was intentional was the erasure of all porn from the dataset (or as much as they could)
>>
I have ugly test results, but I'm not posting them, since that's only annoying.
>>
File: 02170.jpg (2.12 MB, 1792x2304)
2.12 MB
2.12 MB JPG
>>
File: 2024-10-25_00034_.png (1.62 MB, 720x1280)
1.62 MB
1.62 MB PNG
>>102975230
Using 2 LyingSigmaSamplers
>>
>>102975512
Enjoying the series.
>>
2:17:34 to generate a mochi video... that hurts
>>
File: killer7 comparison.png (1.63 MB, 1340x1418)
1.63 MB
1.63 MB PNG
I think even I could've done a better job than this AI upscale.
>>
>>102975626
Well, let's see it
>>
>>102975500
put them in a nice grid/diagram and post
>>
>>102975993
it's shit
>>
File: 02182.jpg (1.81 MB, 1792x2304)
1.81 MB
1.81 MB JPG
>>
>>102976005
We cross our fingers promise not to make fun of you.
>>
File: 02184.jpg (1.71 MB, 1792x2304)
1.71 MB
1.71 MB JPG
>>
File: 2024-10-25_00039_.png (761 KB, 720x1280)
761 KB
761 KB PNG
>>
I probably should play games some instead of just using Flux.
>>
Is there any way to get comfyui to utilize 2 gpus for batch-genning?
>>
>>102976123
brap
>>
File: 2024-10-25_00041_.png (1.21 MB, 720x1280)
1.21 MB
1.21 MB PNG
>>102976123
>>
File: 02186.jpg (2.05 MB, 1792x2304)
2.05 MB
2.05 MB JPG
>>
File: 2024-10-25_00042_.png (1.35 MB, 720x1280)
1.35 MB
1.35 MB PNG
>>102976193
>>
>>102976123
pov: I took my glasses off to read the wine list, a hooker finds out I won craps
>>
File: 2024-10-25_00043_.png (1.07 MB, 720x1280)
1.07 MB
1.07 MB PNG
>>102976207
>>
>>102976154
not in the way you want it to
>>
>>102976005
Now I want to see it even more
>>
File: 2024-10-25_00044_.png (1.15 MB, 720x1280)
1.15 MB
1.15 MB PNG
>>102976234
>>
File: 02190.jpg (1.77 MB, 1792x2304)
1.77 MB
1.77 MB JPG
>>
File: 2024-10-25_00045_.png (1.55 MB, 720x1280)
1.55 MB
1.55 MB PNG
>>102976343
I knew something interesting would happen with one of the settings.
>>
File: ComfyUI_temp_kkkrx_00013_.png (3.67 MB, 1344x1728)
3.67 MB
3.67 MB PNG
using this lora
https://civitai.com/models/889194/gigachad-and-poses-illustriousxl-and-noobai?modelVersionId=995038
>>
>>102976470
What is Noob?
>>
>>102976470
nice
>>102976498
https://civitai.com/models/833294?modelVersionId=968495
>>
>>102976498
https://civitai.com/models/833294/noobai-xl-nai-xl
further illustrious finetuning that has more h100s thrown at it
>>
>>102976470
Prompt?
>>
>>102976520
masterpiece, best quality, absurdres, kubo tite, 1boy, flying kick, midair, gigachad \(meme\), pinstripe suit, (beard:0.75), full body, black shoes, wristwatch, cityscape, solo

worst quality, censored, sketch, artist name, multiple views, lowres, flat color, (muted color:0), amputee, jpeg artifacts

https://files.catbox.moe/23odjc.png
>>
File: 02196.jpg (2.26 MB, 1792x2304)
2.26 MB
2.26 MB JPG
>>
>>102976507
>>102976503
Looks pretty cool. It's for 2D mostly?
>>
File: Untitled.png (36 KB, 1227x498)
36 KB
36 KB PNG
still don't know what the hell this thing is, am i doing it right
>>
>>102976590
I don't know of documentation stating the order.
>>
man, it's weird going back to sdxl. I tried a few new models yesterday and the prompt adherence is utter dogshit. Getting what you want requires 10 gens and a lot of luck
>>
does anybody know of a good comfy workflow for FLUX inplainting that lets me paint inside comfy and not upload an external mask?
>>
File: image (35).png (304 KB, 1024x1024)
304 KB
304 KB PNG
>>
>>102976751
right click an image node and select "open in mask editor"
>>
>>
>>
>>102976533
art
>>
>>102974813
retard here. I am new to this. I downloaded localai and bunch of models. My questions how do I generate porn from a set pictures I collected? I want to generate a specific person
>>
>>102976922
maybe try youtube something like "how to train a lora on civitai"
i think civitai supposedly makes this easy. i haven't tried it myself though
>>
File: emin_2957893b.jpg (56 KB, 620x387)
56 KB
56 KB JPG
All the kids in the last bred talking about
>ai isn't "real" art
>>
>
>>
https://civitai.com/articles/8322
Merge a Lora into Flux for better speed and quantize it.

It's a pretty short article but tell me if I'm making some obvious mistake here.
>>
File: grid.jpg (3.12 MB, 3584x4608)
3.12 MB
3.12 MB JPG
>>
>>102976901
giwtwm
>>
File: image (43).png (376 KB, 1024x1024)
376 KB
376 KB PNG
>>
File: image (46).png (304 KB, 1024x1024)
304 KB
304 KB PNG
>>
File: image (40).png (392 KB, 1024x1024)
392 KB
392 KB PNG
>>
File: image (44).png (383 KB, 1024x1024)
383 KB
383 KB PNG
>>
File: image (41).png (331 KB, 1024x1024)
331 KB
331 KB PNG
>>
File: image (45).png (417 KB, 1024x1024)
417 KB
417 KB PNG
>>
File: image (38).png (418 KB, 1024x1024)
418 KB
418 KB PNG
>>
>>
>>102977363
no ones figured out lora quants?
>>
>>102977659
Loras can already be very smol, why quant them?
>>
>>
>>
File: image (53).png (375 KB, 1024x1024)
375 KB
375 KB PNG
>>
File: image (49).png (358 KB, 1024x1024)
358 KB
358 KB PNG
>>
>>102977689
i guess it wouldnt make them load any quicker, negating the need to bake them in
>>
File: image (61).png (305 KB, 1024x1024)
305 KB
305 KB PNG
>>
File: image (50).png (427 KB, 1024x1024)
427 KB
427 KB PNG
>>
File: image (64).png (248 KB, 1024x1024)
248 KB
248 KB PNG
>>
File: image (36).png (211 KB, 1024x1024)
211 KB
211 KB PNG
>>
File: image (30).png (465 KB, 1024x1024)
465 KB
465 KB PNG
>>
Sloppa go brrrr
>>
File: image (42).png (282 KB, 1024x1024)
282 KB
282 KB PNG
>>102977883
>Sloppa
How? I think the quality is decent. Are you a luddite?
>>
>Under the ultraviolet pink and blue glow of holographic advertisements, a young Russian teenage model girl in a pink plastic latex outfit and pink knee high boots poses in front of a futuristic luxury sports car on a city street, filled with pulsing neon lights. The camera captures her full body and beautiful face

I guess I can work with late teens
>>
File: ComfyUI_Flux_63.png (959 KB, 1344x768)
959 KB
959 KB PNG
/ldg/ - 1girl general
>>
1girl is All You Need
>>
need 1gf
>>
>>102977925
>posts 1boy
>>
is the a GUI that supports dreambooth and Pixart Sigma at the moment?
I am aware of the script but cbf
>>
File: 1girl.webm (357 KB, 1968x1064)
357 KB
357 KB WEBM
Not bad, but 25 min to generate this.
>>
If I was a cool rapper and had 1russianteengirl and 1sportscar irl I wouldn't need to generate them
>>
>1girl
Only Sana can save this sinking ship
>>
File: image (54).png (414 KB, 1024x1024)
414 KB
414 KB PNG
>>102978523
Let me see the images your producing. Oh wait...
>>
in Flux, some loras don't like mixing. Not sure if I can make it work.
>>
If you're tired of 1girl, I might have some 2girls in the back
>>
File: image (17).png (306 KB, 1024x1024)
306 KB
306 KB PNG
>>
File: image (62).png (906 KB, 1024x1024)
906 KB
906 KB PNG
>>
File: image (67).png (564 KB, 1024x1024)
564 KB
564 KB PNG
>>
File: image (70).png (836 KB, 1024x1024)
836 KB
836 KB PNG
>>102978787
>>
File: image (72).png (793 KB, 1024x1024)
793 KB
793 KB PNG
>>
File: 2024-10-26_00006_.png (1.07 MB, 720x1280)
1.07 MB
1.07 MB PNG
>>
File: 2024-10-26_00007_.png (1.3 MB, 720x1280)
1.3 MB
1.3 MB PNG
>>102978827
Adjusting the LyingSigmaSampler.
>>
>>102978523
The sketch Flux lora is quite decent.
>>
File: 2024-10-26_00008_.png (1.41 MB, 720x1280)
1.41 MB
1.41 MB PNG
>>102978869
>>
Overwhelmed by the sheer number of loras. We eatin good tonight fluxbros
>>
>>102978803
She clearly needs glasses lol, look how blurry everything is!
>>
What is the conclusion about Sana? A failed launch?
>>
File: image (81).png (929 KB, 1024x1024)
929 KB
929 KB PNG
>>102979090
I agree. What kind of glasses should she wear?
>>
File: image (87).png (741 KB, 1024x1024)
741 KB
741 KB PNG
>>
File: image (84).png (805 KB, 1024x1024)
805 KB
805 KB PNG
>>
File: 2024-10-26_00010_.png (839 KB, 720x1280)
839 KB
839 KB PNG
>>102979017
>>
File: 2024-10-26_00011_.png (954 KB, 720x1280)
954 KB
954 KB PNG
>>102979131
99% human :^)
>>
>>102979108
How about blue blockers?
>>
File: image (88).png (983 KB, 1024x1024)
983 KB
983 KB PNG
>>102979181
You got it
>>
File: image (89).png (902 KB, 1024x1024)
902 KB
902 KB PNG
>>102979181
>>
>>102979216
>>102979225
VERY cute lol
>>
>>102977894
I agree
>>
File: 2024-10-26_00012_.png (1.05 MB, 720x1280)
1.05 MB
1.05 MB PNG
>>
File: tmp5ib0e894.png (1.13 MB, 896x1152)
1.13 MB
1.13 MB PNG
>>
>>102979345
I'm glad you liked them
>>
File: 2024-10-26_00013_.png (1.05 MB, 720x1280)
1.05 MB
1.05 MB PNG
>>102979374
Trying out loras
>>
File: image (86).png (1.01 MB, 1024x1024)
1.01 MB
1.01 MB PNG
>>
File: image (75).png (856 KB, 1024x1024)
856 KB
856 KB PNG
>>
File: 2024-10-26_00015_.png (1.14 MB, 720x1280)
1.14 MB
1.14 MB PNG
>>102979389
>>
>>102974813
Where's the nude models?
You jokers are constantly edging.
>>
>>102979533
/g/ is a blue board, you won't find nudes here. look in the "related boards" section in the OP
>>
>>102979533
You mean a woman in a flowing garment? Very tasteful. Good thinking anon.
>>
File: 2024-10-26_00018_.png (1.53 MB, 720x1280)
1.53 MB
1.53 MB PNG
>>
>>102974813
Is there anything like VLLM for imagegen, optimized bulk inference with continuous batching?
>>
File: ComfyUI_01615_.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>
>>102979701
Having fast inference with multiple GPUs is unsafe.
>>
>>102979896
…no…
>>
File: ComfyUI_01623_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
>>
File: Mochi_preview_00003.webm (2.17 MB, 856x480)
2.17 MB
2.17 MB WEBM
>>102976081
>>102976300
It was body horror I deleted, this is another one generated when I went to sleep.

163 frames
24 fps
848 x 480
200 steps because why not
>>
File: file.jpg (519 KB, 1401x870)
519 KB
519 KB JPG
are you splitting your sigmas?

>picrel
>>
i always see people commenting about depth maps for hands in blender, so you dont have to inpaint so much
how do i learn that?
>>
File: 00074-2236934357.jpg (741 KB, 1616x1280)
741 KB
741 KB JPG
>>
>>102975177
it depends on your scheduler. Two (very oversimplified) basic types of schedulers. Ones that will converge to a pic and an increase steps will eventually do nothing and ones that will give you a new pic depending on ranges of steps.

This is mostly a sampler video, but it should help.
https://www.youtube.com/watch?v=-GXJDz8i-Wo

>>102979104
lost in the flood of model release. I see it similar to turbo/lcm models which fell out of popularity since nobody wants to do a full second pass.

>>102980269
in blender? Post link.

Look into graphormer. It is okay.
>>
https://github.com/kijai/ComfyUI-MochiWrapper/commit/f29f7397078b988110b82b85f135acc932a4c7ee

so support cublas_ops with GGUF

pretty big speed boost on 4090 at least, needs this installed:
https://github.com/aredden/torch-cublas-hgemm

CUBLAS INFERENCE:
FLOPS: 274877906944
TFLOP/s: 305.801

TORCH INFERENCE:
FLOPS: 274877906944
TFLOP/s: 166.989

gpt4o suggests it's bf16 mochi model only but idk.
>>
>DEPRECATION: Loading egg at X is deprecated. pip 24.3 will enforce this behaviour change. A possible replacement is to use pip for package installation. Discussion can be found at https://github.com/pypa/pip/issues/12330

Can they please stop screwing with pip and fix actual issues. ahhhhhhhhh
>>
File: 2360193329.png (1.1 MB, 896x1152)
1.1 MB
1.1 MB PNG
>>
File: 575524562.png (1.72 MB, 896x1152)
1.72 MB
1.72 MB PNG
>>
>>102979377
very cool
>>
File: 00009-2469355154.png (1.01 MB, 896x1152)
1.01 MB
1.01 MB PNG
My Flux dedistilled gens are absolute garbage now and I don't know why, it's driving me insane.
>>
File: 00038-2469355154.png (1.05 MB, 896x1152)
1.05 MB
1.05 MB PNG
>>102980904
A few days ago for comparison. Please help me
>>
>>
>>102980909
You pull?
>>
Ok I'm mostly happy with the age ranges time to try and make them into robots with pieces from that one anons robot angel prompt
>>
>>102979104
They could come back with a less compressed AE
>>
File: 1827486498.png (1.35 MB, 896x1152)
1.35 MB
1.35 MB PNG
>>
>>102980879
catbox?
>>
>>102980934
I pulled...
>>
remember when the "trick" to giving Flux soul was to set negative guidance to 10?
>>
>download illustrious xl
>download example image with comfyui metadata
>drag into comfyui so the settings are the same
>queue prompt
>receive garbage
what am I doing wrong here
>>
surprisingly cohesive
>>
>Under the ultraviolet pink and blue glow of neon lights, two Russian teenage model girls with Cybernetic enhancements, machine made joints, mechanical limbs and blood vessels connected to tubes, wires and cables attaching to neck, wires and cables on head, science fiction, white knee high boots, walking together in an alley. The camera captures their full body and beautiful faces.

Early in the gens it looked like it was just making normal girls until the last few steps
>>
File: wiggle.webm (268 KB, 856x480)
268 KB
268 KB WEBM
I really shouldn't have laughed at this so much.
>>
>>102980230
What made it generate a grid?
>>
>>102981190
post catbox of the image (and a link to where you found it) so anon can see whats going on
>>
>>102980176
Nice, more physics tests please
>>
>>102981334
It's just the first image in illustrious xl's example gallery, the pink and blue miku
>>
File: 1281960042.png (1.38 MB, 768x1344)
1.38 MB
1.38 MB PNG
>>
>>102981263
>>102981289
Great stuff man
>>
File: 648876954.png (1.34 MB, 768x1344)
1.34 MB
1.34 MB PNG
>>
>>102980879
im so lonely bros...
>>
the voldy rentry was deleted. this makes me sad
>>
File: 2024-10-26_00026_.png (1.66 MB, 720x1280)
1.66 MB
1.66 MB PNG
>>
File: 1295324572.png (1.27 MB, 768x1344)
1.27 MB
1.27 MB PNG
>>
>>102981361
looks like he used auto or a fork, you wont get the same gen if you use comfy (without tweaking, if you can get close) unfortunately
>with comfyui metadata
comfy CAN load auto gens as workflows but the two programs operate different enough that the output wont be 1:1 outright
>>
>>102981853
Oh I didn't realize comfy would do that. Thanks
>>
>>102981872
this https://github.com/BlenderNeko/ComfyUI_ADV_CLIP_emb will get you closer to replicating auto gens but IMO you'd be wasting your time. might as well simply load up auto if you want that exact image
>>
>Tried using flux
>Crashes the CMD right away with a python.exe report

Damn, guess it's not meant to be.
>>
File: 2024-10-26_00027_.png (1.66 MB, 720x1280)
1.66 MB
1.66 MB PNG
>>102981780
>>
Cozy saturday
>>
File: 3999314.png (1.41 MB, 768x1344)
1.41 MB
1.41 MB PNG
>>
>>102977861
this one's cute
>>
>>102982029
a year ago this would have been incredibly difficult to do... upside down without it being all fucked up and shit? forget it
>>
>>102977371
god i want this white meat
>>
File: 3338798709.png (886 KB, 641x1030)
886 KB
886 KB PNG
>>102982061
true, things have improved a lot
>>
Should I tell them?
>>
>>102982155
tell them what
>>
>>102981391
>Great stuff man
thank you anon

>>102982078
>things have improved a lot
Stable diffusion is barely 2 years old
2 years from now video evidence will no longer be admissible in court
>>
File: 4188120860.png (1.54 MB, 1536x640)
1.54 MB
1.54 MB PNG
>>
>>102982219
>half-decent splits
Nice. Flux? If so it's way better than any of my brief attempts
>>
>>
File: ComfyUI_04584_.png (1.77 MB, 1280x1024)
1.77 MB
1.77 MB PNG
>>
File: ComfyUI_04580_.png (1.86 MB, 1280x1024)
1.86 MB
1.86 MB PNG
>>
>>102982193
The thumbs are on the wrong side.
>>
>>102982219
This one put the thumbs in the right place. idk hwat kind of demon toes those are tho
>>
>>102981821
These hands are correct.

flipped hands:
>>102981380
>>102982029
>>
Can I run 2 parallel instances of comfy, each one dedicated to a gpu?
How do I do that?
>>
File: 3501768682.jpg (2.46 MB, 2048x2048)
2.46 MB
2.46 MB JPG
>>102982331
yeah it's flux
>>102982564
huh, true
>>
>>102981263
>>102981289
Very cool.
>>
>>102982626
nevermind I think it's this :
--cuda-device x --port xx
>>
File: sk11.png (1.4 MB, 1536x1224)
1.4 MB
1.4 MB PNG
>>
>>102982200
everything will be signed by the hardware
>>
>>102982735
very nice
>>
File: 468586766.png (1.53 MB, 768x1344)
1.53 MB
1.53 MB PNG
>>102982901
thanks
>>
File: ComfyUI_04596_.png (1.62 MB, 1024x1024)
1.62 MB
1.62 MB PNG
A flower for your thoughts
>>
>he pulled
>>
>>102982471
>>102982331
Could you make them with more natural lighting? Would be interesting to see how it handles metal to skin tones
>>
File: ComfyUI_04609_.png (1.62 MB, 1024x1024)
1.62 MB
1.62 MB PNG
>>
huh, the 8B Flux @ 768px can be full finetuned in kohya without the block swap
this is interesting, faster per step than SDXL too
>>
>>102983118
I couldnt help myself
>>
>>102983290
meant for >>102983000
>>
File: ComfyUI_04618_.png (1.57 MB, 1024x1024)
1.57 MB
1.57 MB PNG
>>
>gen a couple hundred images
>realize im using the wrong vae
>>
File: ComfyUI_04624_.png (1.64 MB, 1024x1024)
1.64 MB
1.64 MB PNG
>>
File: ComfyUI_04625_.png (1.61 MB, 1024x1024)
1.61 MB
1.61 MB PNG
oh
>>
File: ComfyUI_04628_.png (1.67 MB, 1024x1024)
1.67 MB
1.67 MB PNG
"Any sufficiently advanced technology is indistinguishable from magic” - Arthur C . Clarke
>>
fp8 mochi 100 steps
https://files.catbox.moe/mfhvnr.mp4
>>
File: ComfyUI_245405_.png (1.47 MB, 1024x1600)
1.47 MB
1.47 MB PNG
Mochi support:
https://github.com/comfyanonymous/ComfyUI/commit/5cbb01bc
>>
>>102983118
>Could you make them with more natural lighting? Would be interesting to see how it handles metal to skin tones
Looks like the existence of robot body parts implies neon lights because I didn't prompt for it but it's still there
>Two Russian teenage model girls with Cybernetic enhancements, machine made joints, mechanical limbs and blood vessels connected to tubes, wires and cables attaching to neck, wires and cables on head, science fiction, white knee high boots, walking together in an alley. The camera captures their full body and beautiful faces.
I'll try a natural sunlight prompt to maybe force the lighting to be natural
>>
>>102983934
do quantized weights exist?
>>
>>102983859
>EmptyMochiLatentVideo node for the latent.
Does this mean vid2vid or img2vid?
>>
>>102984062
https://huggingface.co/Kijai/Mochi_preview_comfy/tree/main
(yes)
>>
File: ComfyUI_245422_.png (1.43 MB, 1024x1600)
1.43 MB
1.43 MB PNG
>>102984092
Not yet because they have not released the encoder part of the VAE yet.
>>
>>102984062
>do quantized weights exist?
Yeah fp8 and GGUF and even a 4bit
But that's for the 480p model. I'm using the websites model which is presumed to be the unreleased 720p model
>>
File: ComfyUI_Flux_14744.jpg (160 KB, 1024x1024)
160 KB
160 KB JPG
>>
>>102984137
Wouldn't that just be in the github repo somewhere of the model they have released?
I'm confused by my ignorance.
>>
File: ComfyUI_245425_.png (1.32 MB, 1024x1600)
1.32 MB
1.32 MB PNG
>>102984191
They released the decoder weights and code but nothing for the encoder. VAEs can both encode to latent space and decode from latent space, they are missing the encoder part so anything that requires the VAE Encode node will not work.
>>
>>102984229
ok, i get it, thanks.
>>
>>102984148
Looks pretty cool
>>
File: ComfyUI_Flux_14805.jpg (300 KB, 704x1472)
300 KB
300 KB JPG
new pixelwave flux checkpoint is out btw

https://civitai.com/models/141592/pixelwave?modelVersionId=992642
>>
>>102984418
What's the purpose?
>>
>>102984527
seems to be his first flux finetune thats not simply merging loras into a checkpoint

>I fine tuned version 03 from base FLUX.1-dev for over 5 weeks on my 4090. It is able to do different art styles, photography, and anime.
>>
>>102984541
But can't Flux already do those?
>>
>bright sunlight cyberpunk
kinda cursed desu
>>
File: ComfyUI_245549_.png (1.47 MB, 1024x1600)
1.47 MB
1.47 MB PNG
BTW mochi can do nsfw decently well.
>>
>>102984672
A big poo 4 u
>>
File: file.png (3.12 MB, 2291x1616)
3.12 MB
3.12 MB PNG
didn't know you can apply LUTs in comfy
>>
>>
>>102984672
sorry to ask, but how is this different from Kijai wrapper? better speeds?
>>
File: ComfyUI_245623_.png (1.52 MB, 1024x1600)
1.52 MB
1.52 MB PNG
>>102984734
Proper integration so you can use the regular sampler nodes, etc... with it.
>>
>>102984418
Booba status? Buttchin status?
>>
>>102984781
check the gens on civitai
>>
>>102984560
>>102984541
Genuinely asking, I never did the whole sd thing, I only recently got a good gpu (6950 xt, slow but 16gb vram).
>>
File: ComfyUI_Flux_14828.jpg (288 KB, 704x1472)
288 KB
288 KB JPG
>>
>>102984840
Can you do a dragon?
>>
>>102984776
ok, thank you sexy beast
>>
File: 2024-10-26_00034_.png (1.78 MB, 1280x720)
1.78 MB
1.78 MB PNG
>>
File: ComfyUI_Flux_14842.jpg (245 KB, 704x1472)
245 KB
245 KB JPG
>>102984938
>>
File: ComfyUI_Flux_14844.jpg (246 KB, 704x1472)
246 KB
246 KB JPG
>>102984938
i think i'll have to add some fur
>>
>>102985019
Neat!
>>
File: 00213-2125084715.jpg (1.06 MB, 1280x1720)
1.06 MB
1.06 MB JPG
>>
File: ComfyUI_Flux_14846.jpg (252 KB, 704x1472)
252 KB
252 KB JPG
>>102984938
>>
>>102984776
Drop a workflow dawg, I'm a retard
>>
>>102984952
oh damn
>>
File: ComfyUI_Flux_14864.jpg (185 KB, 704x1472)
185 KB
185 KB JPG
>>
>>102985186
>>
>>102984418
Was it trained with actual artist names though?
>>
>>102985420
wait, this is actually neat.

I thought it just made monstrous softcore.
>>
>>102985420
>>102985475
Any idea if my 6950xt can run it? previous gen AMD gpu, I got it because it's the best gaming value, if you don't use rt. It runs Flux dev about as fast as quants, though it can't fully load (I have 64gb of great system ram tho).
>>
>>102985468
if its anything like the previous versions it was trained on a ton on synth slop
>>
>>102984418
>Your actions, big or small, can create ripples of positivity. Let's make the world a better place, one kind gesture at a time. Thank you for your support! I hope you have a wonderful day!
Haha okay
>>
>>102985420
Hangs on vae decoding for me.
Mochi wrapper works with vae tiling
>>
File: ComfyUI_245665_.png (1.37 MB, 1024x1600)
1.37 MB
1.37 MB PNG
>>102985502
My ADA 6000 is struggling so if it even works on AMD it's going to be extremely slow.

>>102985591
yeah that needs to be optimized.
>>
>>102985617
Cute focks girl gens.
>>
File: ComfyUI_01630_.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
>>
>>102984672
Care to post an example?
>>
File: 1625439970454.webm (2.25 MB, 540x360)
2.25 MB
2.25 MB WEBM
does anyone here know how to do this?
>>
>>102985617
>ADA 6000
cool
>>
>>102985762
>dog looks like a deepdream dog
>>
File: 02246.jpg (2.8 MB, 1792x2304)
2.8 MB
2.8 MB JPG
>>
So when is Illustrious XL 0.2 coming out?
>>
File: 1704162165965983.png (994 KB, 1024x1024)
994 KB
994 KB PNG
>>
>>102986034
It's already here and they call it "NoobAI-XL"
>>
>>102979688
Nice, catbox?
>>
File: 02248.jpg (2.8 MB, 1792x2304)
2.8 MB
2.8 MB JPG
>>
>>102985762
inpainting, batch size > 1
>>
waiting room
>>
we're almost at the point where we need advances in compute (H100s at home) more than we need better models. almost.
>>
>>102985762
That's playground AI canvass I think.
Pretty sure comfyUI does it better
>>
File: NearTheGatesOfHeaven.png (434 KB, 896x1152)
434 KB
434 KB PNG
>>
File: rWGiEF9 (1).png (165 KB, 500x472)
165 KB
165 KB PNG
>Decide to have another go at SD after a long hiatus
>Whip out ye olde a111
>Would rather die than use comfy
>But flux doesn't support a111
>Nothing does
Fuck me, man. Does comfyUI have some kinda un-Appleslop button that can just make it look normal?
>>
>>102986733
A111 is all but dead use ForgeUI, it's a fork of A1111 and supports Flux
>>
>>102986767
Is there a type of Flux model I should be aware of if all models don't support Forge?
>>
>>102986787
I'm not really sure what you're asking but Forge should support all Flux models.
>>
>>102986855
Was asking if some models only work in Comfy. These files are pretty big. Thanks.
>>
>would rather die than use comfy
ok but why? It takes like an hour to learn
>>
>>102986910
Couldn't find a good tutorial for graph/nodes/flowchart.
>>
>>102986910
NTA but inpainting and adetailer
>>
>>102986957
Inpainting sucks on Comfy but it has an adetailer equivalent.
>>
>>102987003
it does but an a1111 type workflow makes alot more sense for refining gens than comfy
>>
>>102987003
adetailer is way superior than the detailer from impact pack, the detailer node forces the user to use its own ksampler which sucks ass and its outdated, specially if you are using a samplercustom node or workflow, you can't replicate the ksampler used in your workflow in the detailer node, I've seen some open issues about it in its github page but the dev just ignores it and just adds an "improvement" tag and doesn't give a shit, its like the dev of impact pack forces the user to use its own shit instead of native comfyui stuff that is already done.

adetailer just seamlessly add itself with every setting or workflow that you may have in webui/forge
>>
okay maybe i try again to get forge working
>>
>>102985750
https://files.catbox.moe/uu6ypi.webm

The weird colours are because the tiled vae sucks right now.
>>
File: stack.jpg (2.99 MB, 2304x4992)
2.99 MB
2.99 MB JPG
>>
Reminder than forge has some obfuscated code that sends your prompts to a chinese server: https://github.com/lllyasviel/stable-diffusion-webui-forge/pull/2151
>>
>inpainting
Fair enough. It just kind of sucks that a1111 and comfy handle prompts differently so even with the same settings and prompt and seed you get two different outputs

>>102987086
those tits are being subject to rollercoaster tycoon tier G-forces ouch
>>
>>102987109
Based. How can I set that up in comfy? You got a workflow?
>>
>>102983853
Man, even at 100 steps it's shimmering so much, kind of sad.

>>102986653
I'd be ok with the compute of a 4090 if it has the vram of an h100 lol.

>>102987086
>The weird colours are because the tiled vae sucks right now.
All my gens look like bad VHS because of that, it's annoying.
>>
>retards be like "nooo SD3.5 bad, only Flux is good, forever"

meanwhile 3.5 can straight up generate close-to-perfect lesbian porn that actually looks properly photographic out of the box, with only very minor anatomical issues:
https://files.catbox.moe/emmep6.jpg
https://files.catbox.moe/hx2z6l.jpg

Prompt was:
"A high-resolution professional photograph of two incredibly attractive young women kneeling on a bed and facing each other. Both women are completely nude, and they are kissing passionately. The woman on the left is Caucasian with blonde hair, while the woman on the right is African-American with brown hair. The lighting of the photograph is soft and appears designed to highlight the women's anatomy, suggesting the image is intended for use on an adult website."
>>
>>102987212
so did they stop caring about muh safety or are they incompetent
>no metadata
coward
>>
>>102987227
>so did they stop caring about muh safety or are they incompetent
They just censored less nudes vs whatever flux used.
Both probably cleaned most nsfw/porn from their dataset.
They still used a bad vlm so a lot of bad captioning led to most of anything artist, celeb or porn/pose related being completely unknown concepts for it.
>>
>>102987212
looking forward to sd3.5 medium
>>
>>102987212
>lesbian porn
>two nude women kissing
>>
>>102987212
Simply don't care until it can do higher resolutions than 1MP
>>
>>102987254
you can upscale with a tiled vae until someone figures out a way to fix it
>>
>>102987250
i mean do you define it as like, "only ultra-closeup shots of cunnilingus" or something lol
>>102987254
SD 3.5 Medium specifically supports 0.25 to 2MP according to them, I really think it's going to turn out to have expectedly worse complex prompt adherence but subjectively better "image quality" in the eyes of Average Joe Diffusionman
>>102987227
I genned it while playing around with CivitAI's new onsite support for SD 3.5 actually:
https://civitai.com/images/36765083
>>
>24hr thread
why did everyone leave?
>>
>>102987350
tech happenings make anon post
no tech happenings make anon lurk
>>
>>102984781
undefeated
>>
>>102987350
Mochi was a letdown
SD 3.5 was a letdown
We were so back for a little while but now it's back to being so over
>>
Sana2 soon
>>
wake me up when Sana3
>>
>why did everyone leave
Hardware requirements and gen times for SOTA models went up, and SOTA models are all censored right now too
People have also just gotten bored of GenAI over time unless they're working towards something larger or have an interest in a niche that can only be created with AI
>>
>>102987357
SD 3.5 is only a letdown if you've come to expect base models to behave exactly the same as [Overfit Finetune Of Your Choice]. Not being distilled across the board even is a major plus, like gen ANY woman with SD3.5 Large and then do the same with SD 3.5 Large Turbo on the same seed, you'll immediately see how what we've come to call "Flux Girl" is actually just "Distillation Girl".

100% of the things people don't like about how Flux looks are directly and specifically related to both Dev and Schnell distilled, anyone claiming otherwise is dumb
>>
>>102987385
but flux dedistill still makes buttchins anon
there is no escape from the buttchin
>KG0YJ0
>>
*being distilled, that is
>>
File: 02278.jpg (2.8 MB, 1152x3456)
2.8 MB
2.8 MB JPG
>>
>>102987393
yeah cause it's not a proper de-distill, which is essentially impossible. De-destills are *doable* in the same way upscaling an image with ESRGAN Model XYZ is, you might get pretty good results in some cases but it will always be quite lossy no matter what you do.
>>
File deleted.
I’m a 1girl post respecter btw.
>>
>>102987393
doubt all the "dedistills" are binary distilled/dedistilled poof switches
bfl probably trained on a larger dataset and its not like they are gonna recover the original model weights 100% from the distillation from training on a much smaller dataset and epochs/steps
>>
>>102987435
the 1girl is illusory
>>
>>102985617
what model are you using for those gens?
>trys to post
>forgets about timer
>timer resets
>repeat
damn
>>
I just want to know how did buttchins become so ubiquitous with ai women? A cleft chin is a pretty rare trait on a woman, where did they find all these pictures of Popeye lookin gals to train on?
>>
>>102987393
>but flux dedistill still makes buttchins anon
nta but I find that putting "chin" in the negative typically fixes the chins.
>>
>>102987357
>SD 3.5 was a letdown
Nope, people are waiting for the smaller SD3.5M version before choosing what version to focus on.
But I still think that the best architecture by far is OmniGen. I wouldn't be surprised if all SAAS become like OmniGen in a few months(unless they cant filter the input/output).
>>
>>KG0YJ0
>>
>>102987514
>smaller SD3.5M
Is it because it'll be smaller so it would run on more hardware, or just because it will have something special to it?
>>
>>102987512
it's only an issue with flux, and no idea why
>>
>>102987350
>why did everyone leave?
im glad youre here, anon
>>
>>102987560
You see it in a lot of SDXL fine tunes like Dreamshaper aswell
>>
>>102987512
Flux didn't release the base/foundation/pretrained model but only the post-trained model. And they happened to post-train it to give a certain "style" out of the box. That's the issue.

The image model field really needs to label their models better like they do in the LLM field.
>>
>>102987560
it's because all open-weight versions of Flux are distilled, like I said, literally the SD 3.5 Large Turbo default lady looks VERY much like Flux Woman, whereas this is not the case for 3.5 Large regular version.
>>
>>102987553
The architecture is different:
https://medium.com/diffusion-images/stable-diffusion-3-5-debuts-in-3-variants-large-turbo-and-medium-run-them-in-comfyui-ce760d7fab74
>>
>>102987600
So the distilled version sets a "style in stone", or makes the model be more rigid/autistic about what a typical concept like "woman" would look like?
Is this why it's nowhere near as varied as what I could get with the same prompts on DALLE?
>>
>>102987612
The interesting part seems to be behind a subscription wall... anyway, I'm hoping the model being 2.5B will not be too disappointing.
>>
>>102987615
>>102987600
I don't think that's how distilling has to work? The non-distilled, full Flux Pro model should be giving a default style as well. That's ideally HOW a final production model for casual users should work, since you want it to give aesthetic results without complex prompting or settings, just like how for LLMs, you want to expose the Instruct models to users, not the base model which gives more schizo, random outputs to casual unskilled user inputs.
>>
>>102987615
Distillation basically dumbs the model down to a "reasonable default" while drastically reducing output variety yeah, to allow for decent-enough outputs in a low number of steps. Presumably the overall dataset used by SAI and BFL had enough in common that this meant distillation lead to a very similar looking "default lady" in the distilled versions of both their models.
>>
>>102987560
>>
the example resolution in the comfy workflow for moch is 848x480, is the recommended resolution? or can I go 4/3 or 16/9 instead or phone style?
>>
>>102987643
I see, thanks for the explanation.

>>102987651
I wonder how the hell dalle did it then (outside of not filtering porn from the dataset).
>>
>>102987643
Flux Pro DOES generate "normal" looking images that have no plasticity or bumchin inherent, very similar to SD 3.5 Large non-Turbo outputs. Attached is a Pro 1.1 output from the following prompt:

"a photograph featuring a young woman seated outdoors at a dining table. She has long, wavy blonde hair cascading over her shoulders and is smiling warmly at the camera. Her skin is fair, and she has blue eyes. She is wearing a blue, sleeveless dress adorned with small white floral patterns and ruffled shoulder straps. Around her neck, she wears a delicate necklace with a small pendant. The background consists of a plain, light gray concrete wall with a green, neatly trimmed hedge running horizontally just below the wall, providing a natural contrast to the urban setting. The dining table is set with several clear wine glasses, some filled with white wine, and a few empty glasses in front of her. The table itself is dark-colored, possibly a dark blue or black fabric. The overall atmosphere suggests a casual, yet elegant dining experience, possibly in a trendy urban restaurant or café. The lighting is natural, indicating that the photo was taken during the day. The composition of the photograph centers the woman, making her the focal point, while the background elements provide context without overwhelming the image."
>>
>>102986056
It's mostly the Monet lora. You have to put this in front: a painting by Claude Monet depicts
>>
>>102987668
We barely know what the actual model they're running under everything really looks like, as far as Dalle goes. It's pretty overtly run through a heavy post-process pipeline to add that distinct cartoonish look that always tends to remind me of the over-the-top way ambient occlusion was implemented in Far Cry 3.
>>
>>102987651
Is that the standard method in the image gen world? In the text gen world, usually what people mean by distillation is still essentially a pretraining run on the full dataset but with less epochs basically since the distillation method (using the logit distribution of the parent model) allows it to learn faster.
>>
>>102987698
my explanation was a big oversimplification but it's an accurate summary of the approach and practical result i'd say, yeah
>>
New

>>102987712
>>102987712
>>102987712
>>
>>102987707
>>102987680
If that's the case, I wonder if Flux Pro can actually do other styles like Picasso significantly better. Logically I feel like they'd still want it to have a default style though for the reasons I described (reducing schizo outputs), but perhaps it does, just not to the degree of bias in Flux dev. So then maybe it should be called "partial" distillation. Things are quite different in LLM land.
>>
File: 02286.jpg (2.82 MB, 2048x2048)
2.82 MB
2.82 MB JPG
>>
File: ComfyUI_245710_.png (1.39 MB, 1024x1600)
1.39 MB
1.39 MB PNG
>>102987472
These fennecs are made with that noob vpred model.
>>
>>102987785
How hard is it to prompt noob?
>>
>>102987810
it's just danbooru tags + some e621, zero natural language though so you can only specify what's in the image, not where things are
>>
>>102987836
Is there a danbooru taxonomy? like a browsable tree?
>>
File: 02272.jpg (2.68 MB, 2304x1664)
2.68 MB
2.68 MB JPG
>>
>>102987880
there's the danbooru tag wiki and this https://danbooru.donmai.us/related_tag
>>
>>102987992
>https://danbooru.donmai.us/related_tag
NTA pretty useful link thank you



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.