[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>108310881

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
terrible collage, you cunt
>>
File: 882246376784778.png (1.99 MB, 1248x832)
1.99 MB
1.99 MB PNG
>>
Blessed thread of frenship
>>
File: 00048-580643718.jpg (292 KB, 1344x1728)
292 KB
292 KB JPG
>>
File: uh.png (68 KB, 321x159)
68 KB
68 KB PNG
>>108313992
>>
Bored
>>
>>108314008
Loras still don't work properly with zit
>>
okay updating cl.exe to a newer version helped with some of the errors but I still can't get triton to compile anything
>>
>mfw Resource news

03/06/2026

>Modular Diffusers - Composable Building Blocks for Diffusion Pipelines
https://huggingface.co/blog/modular-diffusers

>LTX-2.3-GGUF Using Unsloth Dynamic 2.0
https://huggingface.co/unsloth/LTX-2.3-GGUF

>RealWonder: Real-Time Physical Action-Conditioned Video Generation
https://liuwei283.github.io/RealWonder

>FaceCam: Portrait Video Camera Control via Scale-Aware Conditioning
https://weijielyu.github.io/FaceCam

>RelaxFlow: Text-Driven Amodal 3D Generation
https://github.com/viridityzhu/RelaxFlow

>Guiding Diffusion-based Reconstruction with Contrastive Signals for Balanced Visual Representation
https://github.com/boyuh/DCR

>MASQuant: Modality-Aware Smoothing Quantization for Multimodal Large Language Models
https://github.com/alibaba/EfficientAI

>VisionPangu: A Compact and Fine-Grained Multimodal Assistant with 1.7B Parameters
https://www.modelscope.cn/models/asdfgh007/visionpangu

>Rolling Sink: Bridging Limited-Horizon Training and Open-Ended Testing in Autoregressive Video Diffusion
https://rolling-sink.github.io

>MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing
https://xiaokunsun.github.io/MorphAny3D.github.io

>RLC Prompt Suite for ComfyUI: JSON-based prompt generation and seed management
https://github.com/efeerimoglu/ComfyUI-RLC-Prompt-Suite

>ComfyUI LoRA Optimizer
https://github.com/ethanfel/ComfyUI-LoRA-Optimizer

>ComfyUI Optical Realism Post-Processing Node
https://github.com/skatardude10/ComfyUI-Optical-Realism

03/05/2026

>LTX-2.3 Video Engine
https://ltx.io/model/ltx-2-3

>LTX Desktop: Fully local AI gen space with integrated video editor
https://ltx.io/ltx-desktop

>Z-Image Power Nodes
https://github.com/martin-rizzo/ComfyUI-ZImagePowerNodes

>Error as Signal: Stiffness-Aware Diffusion Sampling via Embedded Runge-Kutta Guidance
https://github.com/mlvlab/ERK-Guid

>Gaussian Wardrobe: Compositional 3D Gaussian Avatars for Free-Form Virtual Try-On
https://ait.ethz.ch/gaussianwardrobe
>>
File: 144021335527817.png (1.82 MB, 832x1248)
1.82 MB
1.82 MB PNG
>>
>mfw Research news

03/06/2026

>Accelerating Text-to-Video Generation with Calibrated Sparse Attention
https://arxiv.org/abs/2603.05503

>FC-VFI: Faithful and Consistent Video Frame Interpolation for High-FPS Slow Motion Video Generation
https://arxiv.org/abs/2603.04899

>Diffusion-Based sRGB Real Noise Generation via Prompt-Driven Noise Representation Learning
https://arxiv.org/abs/2603.04870

>Diff-ES: Stage-wise Structural Diffusion Pruning via Evolutionary Search
https://arxiv.org/abs/2603.05105

>Transformer-Based Inpainting for Real-Time 3D Streaming in Sparse Multi-Camera Setups
https://arxiv.org/abs/2603.05507

>Frequency-Aware Error-Bounded Caching for Accelerating Diffusion Transformers
https://arxiv.org/abs/2603.05315

>Axiomatic On-Manifold Shapley via Optimal Generative Flows
https://arxiv.org/abs/2603.05093

>How far have we gone in Generative Image Restoration? A study on its capability, limitations and evaluation practices
https://arxiv.org/abs/2603.05010

>Locality-Attending Vision Transformer
https://arxiv.org/abs/2603.04892

>FOZO: Forward-Only Zeroth-Order Prompt Optimization for Test-Time Adaptation
https://arxiv.org/abs/2603.04733

>HALP: Detecting Hallucinations in Vision-Language Models without Generating a Single Token
https://arxiv.org/abs/2603.05465

>Decoding the Pulse of Reasoning VLMs in Multi-Image Understanding Tasks
https://arxiv.org/abs/2603.04676

>A Simple Baseline for Unifying Understanding, Generation, and Editing via Vanilla Next-token Prediction
https://arxiv.org/abs/2603.04980

>CLIP-driven Zero-shot Learning with Ambiguous Labels
https://arxiv.org/abs/2603.05053

>Revisiting Shape from Polarization in the Era of Vision Foundation Models
https://arxiv.org/abs/2603.04817

>AI+HW 2035: Shaping the Next Decade
https://arxiv.org/abs/2603.05225
>>
>>108314042
>>108314048
filtered
>>
File: uni.jpg (3 MB, 2784x3013)
3 MB
3 MB JPG
>imagine what this empty room will look like, if if belongs to a very messy, anime-obsessed teenage gamer girl with a lot of posters and toys
neat
>>
>>108314061
which level of the backrooms is this
>>
>>108314061
what model?
>>
Can i replace porn actresses in my favorite scenes with abby shaprio yet or is that still a pipedream. Thank you
>>
>>108314066
nano banana
>>
>>108314066
Uni-1 https://lumalabs.ai/uni-1
>>
>>108314069
api cucks have their own dedicated thread please take your slop there, thank you
>>
File: 583092765358483.png (2.89 MB, 1968x1056)
2.89 MB
2.89 MB PNG
>>108314061
>>
File: deJS_zi_00044_.png (2.52 MB, 1832x1000)
2.52 MB
2.52 MB PNG
>>108314061
if you didn't ask it to imagine, what would have happened?
>>
>>108314082
literally my room, even the double computer desks
>>
>>108314102
does your room also go on forever?
>>
File: 308530789691336.png (3.82 MB, 1968x1056)
3.82 MB
3.82 MB PNG
>>108314102
>>108314110
That's a lot of real estate.
>>
>>108313912
The free tier in whatever copilot clone you use has no problems helping you compile, retard.
>>
>>108314121
creepy
>>
File: 46945360519559.jpg (597 KB, 1968x1056)
597 KB
597 KB JPG
>>108314127
yeah
>>
>>108314110
>>108314121
it's just a mirror bro, makes the room look bigger
>>
Any idea why certain noise schedules are just broken with Anima?

I tried using DPM++ 3M SDE with exponential like you're supposed to, it doesn't work. Beta works great, might be my favorite combination so far, but what the fuck?
>>
>>108314318
I've been using that flux2 scheduler some anon recommended soon after anima came out
>>
>>108314318
https://huggingface.co/circlestone-labs/Anima/blob/main/README.md
>>
alright nuked all my previous rocm shit, cleaned up my system environment variables, and reinstalled the latest hip sdk. triton finally compiles.
>>
>>108314423
Congrats. I'm on AMD too, but haven't gone to the trouble of attempting Triton so far.
>>
>>108314318
I think it's the SDE samplers, officially SDE does not work with rectified flow models.

exponential scheduler is extremely aggressive so that probably exacerbates whatever problems there are.
>>
File: kling motion comfy.png (1.1 MB, 805x1932)
1.1 MB
1.1 MB PNG
Insane new stuff coming to ComfyCloud recently
>>
who asked?
>>
>>108314457
nobody
I just blurt out answers to questions I think someone should ask
>>
jacked off 3 times today
>>
that's off-topic
>>
Is it just me or does the recent dynamic memory thingy comfy added in recent updates NOT work with Comfyui-GGUF when loading Wan 2.2?
It works with load diffusion model node (bf16, different model), it works with Int8 stuff (again Wan 2.2). It seems to work with Klein 9B gguf. It's still slow as shit and consumes shit ton of memory with GGUF Wan 2.2 models (q8, if that matters).
As a side node it feels like magic. My 32gb RAMlet setup finally feels usable for Wan 2.2 now. (As in usable without my system getting raped with excessive zram/swap use). At the risk of pissing off resident thread schizo, thank you Comfy.
>>108314318
I am using er_sde + simple/ddim uniform, ddim + ddim unifrom, occasionally dpmpp_2m + beta, and when I need to crank cfg euler ancestral + simple/ddim uniform.
>>
Are there any loras that let Flux2 Klein 4b edit and recognize NSFW and furry?
>>
>>108314430
>>108314423
NOPE
you need to set TORCHINDUCTOR_CACHE_DIR too else it will just delete its output before it can use it
>>
>>108314486
I seem to recall it not having GGUF support, at least back when it was first merged. Don't know if that's changed since then.

https://github.com/Comfy-Org/ComfyUI/pull/11845
>NOTE: This work does not have any GGUF integration and GGUF will not see any benefits yet.
>>
>>108314509
Strange that it seems to work with Klein, but that tracks for Wan. Thanks.
>>
I'm reading the 1girl rentry guide from the OP and following along. How important is it to run comfyui custom nodes in a container?

I just want to try out the impact pack since the guide goes into that for better hands. From what I seen the impact pack had a security issue back in 2024 but I haven't seen much since then.
>>
>>108314589
First of all, be aware that most rentries in the OP are varying degrees of outdated and unmaintained. That one should still be an ok intro though, I think (dunno been ages since I've read it.).
As for your question it is important. I am personally running Comfy under a docker container. The reason is that most custom nodes are essentially unreviewed code running on your computer. Both the possibility of intentional malware (happened once in the past, got caught quickly another time) and glaring security risks from "vibe coded" development (I think there is also at least one case of a security flaws from a node being exploited) are dangerous with them. That specific node might be ok but keep doing this for many nodes and that's asking for trouble down the line.
>>
>>108314589
my pc is the container. cyber security doesnt matter if ther is nothing of value on your pc
>>
>>108314494
a lora will not cover that sort of thing to a satisfactory degree. lodestone is training his own kedit finetune however
>>
>>108314673
Just found it. Chroma2-Kaleidoscope. I wonder how long until it's done. I wanna edit Milena Velba into furry images :|
>>
>>108314673
aaaand editing features are likely going to be removed or added later. RIP my dreams.
https://www.reddit.com/r/StableDiffusion/comments/1rlqmy7/will_chroma2_kaleidoscope_have_editing_features/
>>
File: 1755845481994.png (2.03 MB, 832x1152)
2.03 MB
2.03 MB PNG
What joycaption prompt presets (or simlar) are you using and how long captions are overkill? My current caption are currently around 120-150 tokens.
>>
>>108314673
From what I have seen it has already turned into a train wreck and collapsed quicker than original Chroma did.
>>108314685
I wonder if there would be an effect where it is easier to make the model remember how to do edits after major finetuning if the base model has edit capacity? Similar to how rebuilding lost muscles is easier than gaining them first time?
Though this doesn't matter in practice here since kekstone will just shit out another worthless schizobake.
Bless anima, bless tdrussell. We are still fucked for NSFW realism but at least we are no longer dependent on this clown for booru NSFW.
>>
>>108314486
Not using GGUF but i'm always having issues with memory desu, I have to constantly restart if not my pc slowly comes to a halt, minmizing the browser will take a whole minute, file explorer when trying to upload a file here for example lags a lot, etc... it's largely fixed by using --disable-pinned-memory ime
>>
>>108314704
Just use Gemini Flash 3. It's cheap. It's very noticeably better than Joy Caption for anything SFW, and even many NSFW images. You can caption 200 images or so for less than a half buck. (Been a while since I checked pricing but I doubt it changed much.)
>inb4 le SAAS shill
>>
>>108314765
>We are still fucked for NSFW realism
desu as much as chroma sucks isnt it pretty much the meta for realistic hardcore NSFW?
>>
>>108314793
I'm not sending my fetish datasets to google.
>>
>>108314802
i use sdxl for realistic shemales
>>
>>108314634
>be aware that most rentries in the OP are varying degrees of outdated and unmaintained.
Most of the fundamentals haven't really changed.
>>
>>108314783
I am having a laggy computer nowadays too but this has been going on well before I pulled the memory update. So I am probably still SOL in terms of finding root cause.
Sorry for your issues. I haven't noticed any change in lagginess since pulling dynamic memory. If anything it decreased because of far less swap use.
>>108314802
There are no good meta NSFW realism models. Anyone who says otherwise is coping imo.
>SDXL
Old, outdated, low quality. Low realism due to said low quality.
>Chroma
Vast knowledge of NSFW concepts and potential to make very realistic images but schizo anatomy and slow as shit
>Flux
Shitmixing bunch of shitty loras together sucks. Plus the plastic.
>>108314804
Well if so I would recommend using a concise instruction prompt. The longer and more conditional the prompt, the higher odds of hallucination in the caption. At least that's my experience in general.
Caption length depends on the model. But you want a moderately long paragraph typically. (<7-8 sentences?)
>>
what should i prompt
>>
>>108314952
asuka
>>
>>108314802
IDK why anon is seething so hard at chroma. It's the NSFW equivalent of flux.
>>
File: ComfyUI_14393.jpg (3.4 MB, 1440x2160)
3.4 MB
3.4 MB JPG
>>108313968
All images in existence have noise in them. That's how film, compression and dithering work.

>>108314704
I went two-step with Gemma 3; first I had it describe everything in the picture and then in a second pass on the output text using a different system prompt, refined that information into a 192 word/token caption. I had to use two discrete steps because LLMs are absolutely retarded and you have to keep them very focused to get them to function exactly how you want. 192 is pretty overkill for mostly simple images, but I made sure they were all completely NL with no comma-separated "lists" in them (modern models don't really do that anymore).

Fairly time consuming, but my LoRA is very flexible because of it, which is always my goal.

>>108314952
Tiny faces.
>>
>>108314972
Imagine the sexo with this particular jebby ommmggffff
>>
>>108314952
1girl, standing, large breasts, looking at viewer
>>
>>108314952
a wine glass filled to the brim with foreskins
>>
>>108314809
proof?
>>
File: 1742799889599529.png (2.94 MB, 1216x1824)
2.94 MB
2.94 MB PNG
>>
File: ComfyUI_00018_.png (3.03 MB, 1224x1824)
3.03 MB
3.03 MB PNG
>>108315062
quality post, thank you
>>
>>108315055
i think catbx is down but i put an image here >>>/aco/9145043
>>
>>108315091
nice skates
>>
Hibernation mode
>>
File: 1766719270015068.png (55 KB, 1467x286)
55 KB
55 KB PNG
>>108314685
why are they booing him he's right!!
https://www.youtube.com/watch?v=75GaqVWqEXU
>>
>>108315019
mazeltov!
>>108315374
AI is progressing in specific ways. If Chroma devs can't make a NSFW editing model someone else will.
>>
>>108315409
>someone else will
LOL! all local finetuners have done for the past 3 years is throw boorus at it. the real reason chroma wont have any editing capabilities is because there are no pre-existing booru edit datasets for him to use. localkeks are the laziest subhumans when it comes to datasets, they got incredibly lucky that boorus exist because without them there wouldn't even be any nsfw at all besides shitty flux nude loras.
>>
>>108315418
This is some very strange schizo rage. Very Jewish.
>>
How is LTX-2.3? Worth the git pull risk?
>>
>>108315418
>how dare that one guy didn't manually caption millions of pictures by himself!
>>
>>108315678
doesn't he have a full discord full of potential helpers?
>>
File: 1746237597346232.png (123 KB, 1024x1024)
123 KB
123 KB PNG
>>
File: 1743831788713772.jpg (971 KB, 1312x1632)
971 KB
971 KB JPG
>>
File: 1763220971070894.png (3.32 MB, 2016x1120)
3.32 MB
3.32 MB PNG
>>
>>108315750
Nice
>>
File: 1770135366387297.jpg (663 KB, 2016x1120)
663 KB
663 KB JPG
>>
File: image_680525.png (1.09 MB, 720x1280)
1.09 MB
1.09 MB PNG
holy fuck. i really hope qwen image 2.0 goes open source. It's such an improvement from the previous qwen image and editing models and makes klein and z image look like an absolute joke. Very close to nano banana pro photorealism and full uncensored editing is possible. Even way better than seedream 5.0.
https://litter.catbox.moe/lzp9gw.png
https://litter.catbox.moe/gctcgp.png
https://litter.catbox.moe/9eu0pn.png
https://litter.catbox.moe/cme6m8.png
https://litter.catbox.moe/137z57.png
>>
>>108315409
>someone else will
that's cope. local scene is 6 feet below
>>
>>108315813
this is bigasp tier
>>
>>108315813
local???
>>
File: 1751059039594477.jpg (464 KB, 2016x1120)
464 KB
464 KB JPG
>>
>>108315819
>projects cope
shut up faggot
>>
>>108315813
Didn't really give it a good test of anatomy did ya
>>
File: 1748908293038851.jpg (496 KB, 1536x1536)
496 KB
496 KB JPG
>>
>>108315833
delusional
>>
>>108314972
based knight of jen
>>
>>108313977
What is everyone using for realistic NSFW images these days? I want maximum prompt flexibility, maximum realism is good too but probably could use img2img with more realistic model with low denoising. So what is best for both prompt adherence and pure realism? I've mostly been just using anime stuff but kinda getting bored of that
>>
File: 1771466486807618.png (3.28 MB, 1242x2208)
3.28 MB
3.28 MB PNG
>>
is it me or has the quality of loras for zit and klein on civitai taken a huge nosedive recently? Is all the training still focused on sdxl?
>>
>>108316294
SDXL won. nobody wants to use 5+ loras just to render a plastic-looking boob. whichever model has better finetunes wins the day
>>
>>108315823
are you talking about the nudity or the quality?
>>
>photorealism
I want fur friends not artificial roasties
>>
>>108316134

Still the SDXL BigLust models for now. Anyone who says chroma is smoking crack. It’s unfortunate just how stale realistic NSFW currently is across the board.
>>
>>108316373
quality, that means it's not good
>>108316391
it's just the local in general. nobody does anything, we're all waiting for handouts
>>
>>108316403
Anima was pretty good handout, maybe we'll get more anime models, but probably not NSFW realism. Too bad PR because like a gun retards can misuse it, I hope I'm wrong partially because I'm not said retards and I just gonna goon
>>
>>108316417
But RedCraft ZiB Distilled DPO Veris looks okay and seems to be getting a lot of downloads, might be flexibility I want and I can just feel it into SDXL slop toon or detailers. Be a lot more practical when I get a NVIDIA GPU instead of slow ass AMD one I got. I just want more generalizations, because sometimes there isn't a LoRA for my cursed ideas believe it or not
>>
>>108314453
>local
>Cloud
Cumfart retards are really this stupid.
>>
>>108313977
I just want something like whisk meets SDXL, local. Why is it so hard?
>>
>>108316434
>seems to be getting a lot of downloads
if it's civitai we're talking about number of downloads means nothing. civitjeets are a flock of flies summoned to shit piles.
>see wai nsfw illustrious
>>
>>108316436
comfyui runs locally, and comfycloud lets you deploy your local workflows
>>
>>108316601
Nobody cares about local comfy anymore. All money is in saas, Comfy knows it, that's why he left you faggots
>>
we back
>>
>>108316613
lmao bitter
>>
>>108316613
comfy really should not be considered local anymore. if he keeps focusing on saas shit we might gonna have to remove this crap from the op
>>
File: file.png (100 KB, 778x434)
100 KB
100 KB PNG
>>108316791
nah, ani loves comfy, i trust ani
>>
>>108316808
Not after Comfy betrayed him.
>>
>>108316818
wdym by betrayed exactly
that never happened
>>
>>108316613
>>108316791
You are the only thing in this thread (i refuse to consider living being, much less a person) that knows how cumfart's cock and balls taste like
>>
>>108316823
comfy betrayed ani by having other friends even though ani thought they were supposed to be exclusive. comfy is a piece of shit backstabber for this. ani gave him a very generous offer to start comfyorg together and drop yoland and instead comfy chose to start it with yoland. he is a backstabber for this and deserves to be removed from the op
>>
>>108316838
yes, we get it, Julien, you have no friends
>>
im still going to use comfy
>>
File: ComfyUI_00534.jpg (156 KB, 1024x1024)
156 KB
156 KB JPG
>>108313977
/ldg/ ANCHOR:
>/ldg/ pixiv:
BakerAnon:https://www.pixiv.net/en/users/110320313
Nekotwins:https://www.pixiv.net/en/users/93453238
RSetsuu:https://www.pixiv.net/en/users/91156181
tamzaiy:https://www.pixiv.net/en/users/38224130
匂い:https://www.pixiv.net/en/users/76374114
Otto:https://www.pixiv.net/en/users/106407853
owphoenix:https://www.pixiv.net/en/users/1661492
cornholioanon:https://www.pixiv.net/en/users/1035594
kunitsune: https://www.pixiv.net/en/users/1154481
Encolpe : https://www.pixiv.net/en/users/1528323
Ravaged Cherry: https://civitai.com/user/RavagedCherry
fire.inc: https://www.pixiv.net/en/users/94978225

*Anchor your Pixiv and CivitAI here to add it in the next thread.

>/ldg/ CivitAI
https://civitai.com/user/Nyan666
https://civitai.com/models/1922023/obmirsak
>>
>>108316939
>>
is it possible to train a realistic model to create a realistic version of something unrealistic? i tried making a lora using cartoon dataset but when i use it with my realistic checkpoint it doesnt merge the two concepts, it gens one or the other there is no middle ground
>>
File: 1746024830207118.png (97 KB, 1038x619)
97 KB
97 KB PNG
>>108316955
Really?
Stock clip art?
You're genuinely the most retarded, subhuman, worthless faggot that has ever entered these threads
No wonder you have no friends and your wrapper is a failure
>>
>>108316838
>and instead comfy chose to start it with yoland.
Thank god comfy is smart and didnt start shit with the pedophile
>>
File: ComfyUI_00001_.png (2.47 MB, 1152x1728)
2.47 MB
2.47 MB PNG
@ComfyAnonymous,
For inpainting, please add a BBox option to the Load Image node alongside Image and Mask. This would let users control the edit area and the surrounding context separately, instead of relying on sloppy default mask expansion.
I want to decide both what changes and what visual context is shown to the model.
>>
>>108316955
I'm fan of cornholioanon's work. Truly impressive.
>>
>>108317117
Thank you for contacting us.
I hope you are fine - we must decline your request at this time because we are concentrating on other aspects of the development.
Kind Regards,
ComfyUI Support Team
>>
File: ComfyUI_00004_.png (2.03 MB, 1152x1728)
2.03 MB
2.03 MB PNG
@ComfyAnonymous,
Also don't forget to add an "Open in Explorer Folder" option when right clicking in Assets.
>>
>>108317148
SEXO
>>
>>108317014
my goto method has been to do a first pass with an unrealistic model then refine with a realistic model
>>
Is he still seething because his crush is actually successful?
No free comfybucks for him for "making a better anima"?
>>
>>108316441
What does this even mean
>>
>>108317266
if i had to guess, he wants the visual fidelity of whisk with the porn capabilities of sdxl
>>
Me during the day >>108317117, me during the night >>108317148
t. anonnete
>>
>>108317300
>visual fidelity of whisk
oxymoron kek looks like every other slop cloud model
>>
cozy breas
>>
why so dead
>>
>>108317014
Use Klein Edit
>>
>>108313977
I'm about to read the documentation you guys have but quick question which I assume is a no. Is anything able to run on a very old laptop GPU? (660m) I have one collecting dust that I can use for a local model
>>
File: RAUUUUUUGH.jpg (61 KB, 349x326)
61 KB
61 KB JPG
>father asks if I can install ai stuff for him to fix old photos when I arrive few hours before celebrating family birtbday
>dead tired but agree
>spend almost 2 hours trying to figure out why it's slow as fuck
>ragequit momentarily while a new install runs
>unmount my usbstick I had the 60gb or so of models to transfer
>gives me error
>still raging and unplug it anyway
>the install error beeps
>realize i had been installing comfy on the usb stick through a fucking usb2.0 bandwidth
>>
>>108317733
SD 1.5 might be possible. CUDA stopped supporting Kepler around v10. You might run into a bit of dependency hell trying to set anything up on that old version.
Honestly SD 1.5 is completely outdated and not worth it anyway, so the answer is effectively no.
>>
>>108317765
>fix old photos
he just wants to give himself huge tits
>>
>>108317765
>revealing your true power level
NGMI
>>
>>108317711
hours long site maintenance prevented the usual trolling and all other poasting
>>
>>108317765
What's the poorfag solution now, Klein4B? At least that's what more or less works for me with 8GB. This shit's Comfy version OOMed sadly, seemed like a safer approach https://github.com/microsoft/Bringing-Old-Photos-Back-to-Life
Using Seedvr2 is sometimes useful too.
>>
File: 1742421359286071.jpg (659 KB, 1536x1536)
659 KB
659 KB JPG
my elf waifeu
>>
https://www.reddit.com/r/StableDiffusion/comments/1rn3fjv/comment/o96zamx/
>>
>>108317877
I tested your ZiB -> ZiT refiner. Any sampler recs besides the defaults? I do ZiB for 28 steps, then ZiT for 8 at 0.5–0.6 denoise to keep the composition. Do you upscale when switching models, like a Hires Fix pass?
>>
>>108317901
Can run on 16gbVRAM?
>>
>>108317913
yes but you will need 64GB+ RAM
>>
Can we ban anons with more of 32ram?
>>
>>108317939
You mean less than 64GB? Sure, I say ban all third worlders period
>>
>>108317948
you make me feel insecure about myself
>>
File: ComfyUI_temp_rnaqv_00004_.png (1.63 MB, 1040x1440)
1.63 MB
1.63 MB PNG
>>
>>108317907
yeah I upscale, I start at 0.9MP->1.6x pixel space upscale using upscale model->0.6 denoise.
I think last time I shared the settings but again:
1st pass: 15 steps res2s beta57
2nd pass: 5 steps euler normal 0.6 denoise
for the 2nd pass it's EXTREMELY important that you do the number of steps relative to your denoise, otherwise you'll deepfry the image.
>>
File: ComfyUI_00094_.png (1.28 MB, 1024x1216)
1.28 MB
1.28 MB PNG
>>108313977
>Year: 2020 + 6
> Civitai.com still can't read any metadata within ComfyUI gens

It's one of then if not THE most used diffusion UI yet they pretend only A1111 and its forks exist....why? Also I Don't know if its my own machine that's the issue but their website never works properly anymore but works "fine" on Mobile, safari, and other browsers. (I say "Fine" because its still slow for no good reason as always)
>>
>>108317962
>deepfry the image.
It happened, how do I calculate that? Is that the Advanced K Sampler setting start at step?
>>
>>108317979
you're starting with a fully denoised output for the 2nd step so no. I'm sure there's a smart way to automatically pick the denoise value at sigma #5 from a full 9 step processing for zit, but I didn't dig into it and just did some 'vibe' testing.
>>
>>108317973
>> Civitai.com still can't read any metadata within ComfyUI gens
It reads prompts just fine? Pretty sure its ready CFG and steps for me before as well.
>>
File: Sigma.png (59 KB, 969x646)
59 KB
59 KB PNG
>>108317995
Asked Opus. Is this the “vibe” calculation you used?
>>
>>108317773
I see. And I'm guessing the same goes for making it my local LLM, right?
Well thanks for the info.
>>
File: ComfyUI_temp_rnaqv_00012_.png (2.94 MB, 1248x1728)
2.94 MB
2.94 MB PNG
>>
>>108318003
I use the default save image node but it never reads any data whatsoever.

Workflow: https://files.catbox.moe/t62u0j.json

Version 1.39.19 on MacOS
>>
File: 1763011405685709.png (120 KB, 1782x616)
120 KB
120 KB PNG
>>108318044
lmao my dude, you might be a bit rarted, but you made me do this, so you can see the exact denoise at each step for zit at shift 6 for the normal scheduler
>>
File: ComfyUI_temp_rnaqv_00014_.png (2.8 MB, 1248x1728)
2.8 MB
2.8 MB PNG
>>
File: ComfyUI_temp_rnaqv_00015_.png (2.14 MB, 1248x1728)
2.14 MB
2.14 MB PNG
>>
>>108318094
get her titties out
>>
I was about to make a complaint about the lack of unique posing then I realized that real women also take incredibly boring photos of themselves.
>>
File: ComfyUI_temp_rnaqv_00016_.png (2.05 MB, 1248x1728)
2.05 MB
2.05 MB PNG
>>
File: 1645616889894.png (1.46 MB, 1121x1121)
1.46 MB
1.46 MB PNG
Error running sage attention: PassManager::run failed, using pytorch attention instead.
>>
File: ComfyUI_00001_.jpg (674 KB, 1088x1920)
674 KB
674 KB JPG
I have becometh an aiArtist.
>>
File: ComfyUI_temp_rnaqv_00020_.png (2.04 MB, 1248x1728)
2.04 MB
2.04 MB PNG
>>
File: ComfyUI_temp_rnaqv_00022_.png (2.06 MB, 1248x1728)
2.06 MB
2.06 MB PNG
>>
>>108315750
What model is this?
>>
>>108318260
flux dev
>>
>>108318066
there is a node for that but i dont remember the name
>>
File: ComfyUI_00040_.jpg (738 KB, 1088x1920)
738 KB
738 KB JPG
>>
>>108318057
Depends on how much ram it has maybe you can run some quant of a small MOE like gpt-oss 20B on the CPU. Don't expect chatgpt level performance but you might find a use for that.
>>
>>108318082
Learned something new, thanks.
>>
>>108318143
help?
>>
>>108318143
>>108318378
I had this exact problem but forgot what i did to fix it.
>>
>>108318378
No clue. I assume you have the sage attention package installed.
I can recommend uninstalling sage, and then try to git clone + build yourself. It doesn't take long to combine.
>>
>>108318143
Don't know your UI but unistall everyhting, including python and all dependencies and start over
>>
>>108318082
Following your vibe calculation, which I have not tested yet, there is a gap at 0.5, then at 0.3, 0.2, and 0.1. I suspect I need more steps to achieve those denoise levels.
If so, it seems misleading that KSampler lets you set denoise and steps independently when denoise is tied to step count.
>>
>>108318301
>>108318189
patreon?
>>
>>108318512
it's just sage attention 1.0.6, that shouldn't need compiling I don't think. I'll think about trying to get 2 to work once this runs.

It's probably a triton error since torchcompile also fails with the same error.
>>
https://www.youtube.com/watch?v=zuIepC06LUg
Oh no, fellow localsissies, they're making fun of us...
>>
>>108318559
Maybe k samplers needs to be updated, we where using it since SD
>>
>>108318597
LTX sound is legit unsettling. I think some videos made with itare cool but the sound is always awful.
>>
>>108318570
triton? Is this AMD gpu? Never used sage with triton, dunno much about particularities of how that works.
>that shouldn't need compiling
The fp16 version (sage 1) doesn't need to, but Patch Sage Attention KJ and similar nodes still require compiled version to run it. If you are using a node besides passing sage attention as a launch argument, that might be the issue.
>>
>>108318611
yes, AMD. triton-windows should have support, and I got the test-script from the github to run correctly but in comfy it still errors out.
>>
what is it about realism that makes it look real? some models look like actual photos, but others have this weird clay look.
>>
>no alien doing pushups benchmark
>>
>>108318639
how much porn it's been trained on vs how much flickr blown out HDR mixed with anime it's been trained on
>>
>>108318639
Millions of year of pattern recognition specialization in the human brain?
>>
>>108318639
Cheap mass produced synthetic data vs real data that takes time to gather and caption mostly
>>
>>108317877
when ever i try to make fantasy stuff i get cosplayers with cheap plastic, like in that image it made her ears plastic
>>
What's the verdict on LTX 2.3? How does it handle porn? Is it worth using?
>>
sup niggas
if floyd miku nigga still around? he was lowkey the champ of the threads when i was here
also klein flux is pretty good. massive upgrade from zit and it allows edit and t2i at the same time with much more variety.
>>
File: 1753137550647529.jpg (459 KB, 1250x1566)
459 KB
459 KB JPG
>>108318639
the response is as simple as it gets, you want realism? then train your model with only real photos
>>
>>108316134
Klein 9B for NSFW edits.
ZIT for general NSFW
or Chroma, then I2I in ZIT.
or ZIT, then Chroma to inpaint genitalia.
OR ZIT, then Klein9B to fix genitalia using loras.

You will have to experiment what works best for you.
>>
You're still not tired of the ZiT look? Base does not have that problem.
>>
>>108318779
Base is not for inference.
>>
File: 8571549699.png (410 KB, 640x484)
410 KB
410 KB PNG
>>108317792
>>
>>108318779
>Base does not have that problem.
base looks a bit more slopped and has terrible details, anatomy and architecture so...
>>
>>108318830
ZiT is inherently slopped via distillation. Every gen from it looks the same, it is rigid.
>>
>>108318851
>Every gen from it looks the same
there's some methods to mitigate that (like adding some conditioning noise) but yeah it's a pretty boring model, makes great images, is really consistent... too consistent... if the image you make is bad you know you're fucked, you can't go for other seeds and see if you can get more lucky
>>
>>108318726
>What's the verdict on LTX 2.3
Sound is way better by default, they actually gave a shit and made it better, which is nice.

>How does it handle porn
It doesn't until people create loras as always.

>Is it worth using?
Probably yes, especially in a month or two with loras.
>>
>>108318830
>a bit more slopped
No.
>terrible details
No.
>anatomy
No.
>architecture
It's the same as ZiT kek.
>>108318868
>but yeah it's a pretty boring model
It's extremely boring to the point of not being worth it unless you are too impatient to wait for a large-step ZiB gen. But that doesn't really take long anyway.
>>
>>108318779
ZiT isn't creative and hits SDXL slop levels of rigidness, but base fucks up limbs since it's not finetuned. There's a correlation between creativity and stability, stabilized models lose creativity. That's why Civitai finetunes are sketchy. It's vibe tuning by clueless people destabilizing models for the worse. Porn Master Z finetunes are trash, same as everything on the roulette including anime.
Best choice is original models. Want something extra? Add it via LoRA.
>>
>>108318823
dad, no...
>>
>>108318639
>but others have this weird clay look.
be thankful you can spot this as some cannot
>>
>>108318933
>>architecture
>It's the same as ZiT kek.
I mean "architecture" as the house architecture, ask it to make a photo outside and you'll see the background houses and people are all smeared to shit
>>
i am using a1111 but i am finding it has a problem with doras which i want to use. what should i be using instead for txt2img and inpainting?
>>
>>108318972
>There's a correlation between creativity and stability
one day we'll manage to make something good and creative, and I think we have to get away from the DiT architecture to get that
>>
remember when it was only flux
>>
I felt so safe
>>
a true buttchin heaven too
>>
>>108318143
If you need a good cope, nowadays the native PyTorch scaled_dot_product_attention isn't *that* much slower than sage, flash, or xformers, and I find it to be more stable anyway.
>>
>>108318143
Do you have an old gpu anon?
https://github.com/Comfy-Org/ComfyUI/issues/6228#issuecomment-2562903511
>>
>>108319030
>doras
Desu not worth the effort
>>
>>108318910
>It doesn't until people create loras as always.
The problem is that LTX-2 has some basic porn loras but they aren't good and don't compare to using WAN. Feels like this will be no different. Seems they only improved the sound.

I think I'll just throw my WAN videos in LTX 2.3 and hope it produces decent NSFW audio. using it for video gen probably not worth it.
>>
>>108319031
>one day we'll
You? Lol, come on, "we"? Did you make ComfyUI? Did you make WAI? Did you make Noob? The only thing "we" did was nothing.
>>
File: hello.png (114 KB, 640x640)
114 KB
114 KB PNG
>>108319207
>Did you make ComfyUI?
Yes, I'm ComfyAnonymous.
>>
>>108319207
I made a C++ UI but I need more time and money.
>>
>make WAI
jej
>>
>>108319207
I did Anima, and no, I will not touch that model again.
>>
>>108319144
no, rx7600, not the very latest but should be up to date enough

and yes that is the only thread I can find that throws the exact same error in the same function
>>
>>108319222
I'd support this. We desperately need a real desktop UI for professional/power users. No Electron/browser bullshit. C/C++ for the backend and Python for extensions. Bonus points if current custom nodes will be able to work with it.
>>
>>108319207
>The only thing "we" did was nothing.
Speak for yourself I make kinosovl.
>>
File: luabutts.jpg (113 KB, 573x892)
113 KB
113 KB JPG
>>108319376
>Python for extensions
python is horrible language no matter the application
>>
>>108319207
The projection in this post is so bad that I feel bad second hand. Fuck.
>>
File: 1735082152733117.png (1.91 MB, 1000x1127)
1.91 MB
1.91 MB PNG
>Israel, at war, has released a new local AI model, unlike China, at peace, which has released nothing...
china sisters, own response?
>>
Why is /ldg/ so dead? And no, I don't want cheap rationalizations like "the website is this" or "the board is that" or "a website maintenance flew over my window" I want the pure unadulterated truth!
>>
>>108316134
Chroma and Klein as needed.
>>
>>108319457
a website maintenance flew over my window
>>
>>108319457
first time lurking between model releases?
>>
>>108319457
>I want the pure unadulterated truth!
simple, there's no groundbreaking new models at the moment, don't expect a Z-image turbo type release to happen often, let's hope it'll happen agaon though, we're still far from the best API models so there's still a lot of room for improvement
>>
>>108319457
1. nothing good released recently
2. mods have been cracking down on ani same fagging spam. he gets banned almost instantly now

number #2 is the real reason. that guy was absolutely flooding the threads 24/7
>>
>>108319485
>he gets banned almost instantly now
Oh, I missed that, I wasn't around last weeks but the threads were insane at some point. Didn't know something has been finally done.
>>
>>108319466
>>108319473
They are still groundbreaking, anon is simply used to them already.
>>
File: 1756801817131344.png (437 KB, 527x537)
437 KB
437 KB PNG
>>108319515
>They are still groundbreaking
wake me up when we'll get a local video model that is even close to Seedance 2.0
>>
wake me up before you go go
>>
>>108319524
You can't even run video models.
>>
>>108319457
one person was making 90% of the posts
>>
>>108319531
>strawman
>>
>>108319534
How many people were in his head. The threads were unusable most of the time.
>>
none of my 1girls are coming out any good :/
>>
>>108319534
>one person was making 90% of the posts
where is he btw? it's been a while he hasn't been schizoing here, don't get me wrong it's a good thing lol
>>
>>108319457
>I want the pure unadulterated truth!
/adt/ was the culprit.
>>
>>108319545
he still posts. he just cant use his bot farm anymore
>>
>>108319551
shouldn't this be like the one thing has been perfected by now
>>
>>108319598
proof?
>>
why the fucking is comfyui now running the model in the ram, when vram is only 20% utilized?
holy shit, never git pull, guys
>>
comfy bastard put the model in the bloody vram sar
>>
>>108319650
>>108319675
try to go for the --disable-dynamic-vram flag
>>
>>108319650
I feel like this shit was constantly happening in a1111 too, always some update dropped that killed performance, bloated memory usage leading to OOM's on the exact same settings that worked before the update

why do they do this?
>>
File: 80.png (1.75 MB, 1024x1024)
1.75 MB
1.75 MB PNG
>>
https://www.reddit.com/r/StableDiffusion/comments/1rnqowr/comment/o98lnf1/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button
fucking jeets
>>
>>108319716
artist? assuming anima
>>
>>108319825
umikochannart, I haven't been able to make it not include the signature in the gens o matter what I put in the negatives (even with perp neg), I always remove on photoshop before upscaling kek
>>
File: ComfyUI_temp_xjzqa_00051_.png (2.5 MB, 1248x1728)
2.5 MB
2.5 MB PNG
>>
File: ComfyUI_temp_xjzqa_00054_.png (3.69 MB, 1344x1920)
3.69 MB
3.69 MB PNG
>>
so proper comfy node when? https://github.com/hanjq17/Spectrum
>>
>>108320038
claude free tier took two tries on my end, pasted the link and prompted "generate a comfyui custom node from this github". first one didn't work so i just started a new chat with the same instructions, ymmv
>>
When the fuck will they release Z-image edit???
>>
>>108319990
I've a feeling if she were to stand up, then her tube top wouldn't be horizontally tubed anymore, unless the strap is literally holding them up
>>
File: ComfyUI_temp_nbmjc_00064_.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>
File: 1751825365378519.mp4 (3.56 MB, 720x1024)
3.56 MB
3.56 MB MP4
>>108320012
Is the thread so dead that we have to repost gens? thanks ran.

Might as well post this then that I genned before
>>
>>108320118
>Is the thread so dead that we have to repost gens?
that's ani's favorite tactics btw
>>
File: file.png (52 KB, 600x600)
52 KB
52 KB PNG
>>108320100
"If this big tiddy micro tube top-wearing woman stands up, would it look like this, this, or this?"
>>
>>108320124
no that was "ani" reposting gens from a parallel thread which is not what I'm talking about. kys
>>
>>108319845
In my experience writing something like "There is no watermark, no signature, no text in the image" in the positive prompt and cranking strength of that up a bit (:1.25) seems to help alongside stuff like english text, signature, watermark, patreon logo, web address, twitter username etc. in the negatives, but I couldn't find anything that works reliably neither.
>>
>>108320137
2
>>
File: ComfyUI_temp_xjzqa_00087_.png (2.21 MB, 1248x1728)
2.21 MB
2.21 MB PNG
>>108320118
who are you anyway? the thread sheriff or something? mind your own business, chud
>>
>>108320149
>it's ok with I do it
subhuman lolcow
>>
>>108320163
esl? consider more lessons
>>
>>108320176
consider taking a rope and a stool
>>
any tips on prompting wan to do a tacking shot of the subject walking or jogging towards the camera? it works great if i gen an image with a treadmill, but i cant get to to do it otherwise. specifically trying to keep her whole body in frame including feet
>>
File: ComfyUI_01612_.png (2.76 MB, 1280x1920)
2.76 MB
2.76 MB PNG
>>108320159
>asian asuka
disgusting
>>
>>108320038
https://gofile.io/d/RbwXoN
Someone alredy did it. Gofile because catbox seems dead.
>>
>>108320248
>Someone alredy did it.
it should be on github gaddamit
>>
>>108320248
well that was me, and i did exactly what >>108320057 suggested. it kinda works? but it's a buggy mess, that's why i asked for a real coder to implement it properly into comfyui
>>
>>108320282
>it kinda works? but it's a buggy mess
when that happen I ask for claude to add prints to the code so that it can see what's the problem
>>
>>108320282
The flux-specific node seems outright broken yeah, since it crashed after using 16+ gigs of ram more than normally on cpu mode and cuda was killed byoffloading. The lite node is okay I guess, but the default settings are mostly suggestions and might vary from model to model. No clue about the anima node.
>>
File: ComfyUI_temp_ipaor_00023_.png (3.85 MB, 1344x1920)
3.85 MB
3.85 MB PNG
>>
File: 1745474181382208.png (71 KB, 1907x346)
71 KB
71 KB PNG
>>108319823
>fucking jeets
lmao, nailed it
>>
File: ComfyUI_temp_hrmad_00046_.png (2.94 MB, 1152x2016)
2.94 MB
2.94 MB PNG
>>
File: nice.webm (844 KB, 704x1280)
844 KB
844 KB WEBM
>>108320012
>>108320159
>>108320247
>>108320356
>>108320391
the realism is impressive on these new models. but can they do futa? id really like to leave sdxl but new models just cant do what i need them to do
>>
File: comfy__85.jpg (1.05 MB, 1267x1267)
1.05 MB
1.05 MB JPG
>>
>tfw a video model can do better music than models specialized on just that like AceStep kek
https://youtu.be/BIhNKuo5m4E?t=256
>>
what is this hrmad model? the paid jeets on plebbit are still drooling about ltx
>>
>>108320533
>the paid jeets on plebbit are still drooling about ltx
yeah the astroturfing is hard on that one, you can tell there's dozen of employees trying to hype this piece of turd up, little do they know that a model will naturally be hyped up if it's a quality model, like Z-image turbo for example
>>
local is pretty stale. sad to see
>>
>>
>>108320587
mmm, z-image?
>>
File: 1763993382519238.png (2.77 MB, 1160x1744)
2.77 MB
2.77 MB PNG
>>
File: ComfyUI_temp_pdtgy_00095_.png (3.1 MB, 1440x1120)
3.1 MB
3.1 MB PNG
>>
>>108320557
dogs eat dog food
>>
>>108319845
cool thanks
i use iopaint for that
>>
When ready

>>108320614
>>108320614
>>108320614

When ready
>>
>>108317300
Being able to edit images, fix mistakes and prompt like on whisk.
Being able to have pretty SDXL pictures.
But yeah, pretty much spot on.
>>
>2026
>Still no consistent characters or faces
Is stable diffusion stale or are all local models the same?
>>
>>108320513
She looks just like john cena.
>>
>>108321951
kek at least try to come up with something plausible.
>>
>>108322012
>>>/gif/30358935



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.