[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Death to Anime Edition

Discussion of Free and Open Source Diffusion Models

Prev: >>108351283

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
https://xcancel.com/wildmindai/status/2032083436760920228#m
https://arxiv.org/abs/2603.10744
>Extensive experiments on the state-of-the-art FLUX.1-dev model demonstrate that JiT achieves up to a 7x speedup with nearly lossless performance, significantly outperforming existing acceleration methods and establishing a new and superior trade-off between inference speed and generation fidelity.
great if true
>>
thanks for another early bake catjak. we're gonna have lots of local diffusion discussion here for sure
>>
Crazy how when I was calling out the issues in comfyUI the same drones that are mad at him now were slobbering his entire shaft and balls
Comfy is addressing my issues and stopped being influenced by these people and is no longer picking fights and acting like these people so he's fine by me now
>>
File: 00234-2610570556.png (2.2 MB, 1080x1920)
2.2 MB
2.2 MB PNG
>>108357192
just in time if true
>>
>>108356854
>THERE IS NO ALTERN- ACK
https://www.youtube.com/watch?v=XogoQnkQUO8
>>
Pro tip: anyone asking for help while using Ksample Simple, just ignore them.
>>
>>108357216
>I ditched ComfyUi
>still uses nodes
is he fucking retarded?
>>
>>108357216
why use comfyui when you can use a comfyui clone?
>>
so is there a single foss project that allows you to point it to a folder of images and for it to auto tag them all with ai vision thats not using models from 2 years ago? is immich good for this nowadays? i want semantic image search that works
>>
>>108357260
Yeah comfy can do this
>>
>>108357269
with?
>>
>>108357276
whatever llm you want.
>batch load>llm "caption this slop">batch save
>>
holy shit that new flux klein is definitely fast. 5 second 1080p gens on my 5060 ti 16gb.
though flux klein is kinda poop dogshit, what did they even train this thing on? fifty million copies of the same couple of images? it has zero knowledge on anything.
>>
>>108357373
I-it can run at 12gb?
>>
Blessed thread of frenship
>>
>>108357382
sorry pal this shit hits 14gb even with sage attention. i think it's mostly the text encoder though.
>>
As expected, Anima Preview 2 is more stable aesthetically, but dumber and less creative than Preview 1.

Thanks tdrusell for catering to the slop masses.
>>
>>108357373
>it has zero knowledge on anything.
it's on purpose, BFL is a safety freak company so when they said they trained their images on the most boring copyright free imaginable, I believe them
>>
>>108357397
Waiting for a slop quant then.
>>
>>108357406
The fuck are you talking about?
The model has better prompt understanding
>>
>>108357423
you can offload some of the model to the ram though
https://github.com/pollockjj/ComfyUI-MultiGPU
>>
File: 1771514564246623.png (140 KB, 1150x312)
140 KB
140 KB PNG
>>108357425
>The fuck are you talking about?
it's ani fuding Anima, don't engage with that schizo
>>
>>108357425
Yeah? Prompt Miku chasing a seagull and the seagull’s carrying a drawing of Teto in its beak, good luck.
>>
>img2img seems to have been significantly censored
>nsfw gens i could turn realistic get enshittified almost seemingly on purpose
what in the fuck BFL
>>
>>108357459
wait nevermind false alarm, i'm retarded, i left on that really shitty "realisimifier" lora from civitai that breaks basically everything, didn't realize it was still on from a previous run.
that said one thing i'm definitely noticing more than before, realism-ifying literally anyone makes them age like 50 more years, and specifying age makes them too young.
>>
File: 1754062537751121.png (3.06 MB, 1792x1280)
3.06 MB
3.06 MB PNG
givem e ure power
>>
>>108357481
I thought that KV cache thing would not change the image relative to the older version
>>
>>108357484
so static... so ZiT sloppa
>>
File: goblin waifu.mp4 (2.55 MB, 704x1280)
2.55 MB
2.55 MB MP4
i found a secret prompt formula with my barely working lora
>>
>>108357493
I live for the sloppa
>>
>>108357493
>so static
it's an image anon
>>
https://github.com/Comfy-Org/ComfyUI/pull/12909
Comfy seems to have fixed the KV cache vram usage bug, now let's test out that new Klein and see how much faster it's supposed to be
>>
>>108357526
12gb status?
>>
>mfw Resource news

03/12/2026

>New FLUX.2 Klein 9b models
https://huggingface.co/black-forest-labs/FLUX.2-klein-9b-kv-fp8

>Too Vivid to Be Real? Benchmarking and Calibrating Generative Color Fidelity
https://github.com/ZhengyaoFang/CFM

>Geometric Autoencoder for Diffusion Models
https://github.com/freezing-index/Geometric-Autoencoder-for-Diffusion-Models

>Guiding Diffusion Models with Semantically Degraded Conditions
https://github.com/Ming-321/Classifier-Degradation-Guidance

>Layer Consistency Matters: Elegant Latent Transition Discrepancy for Generalizable Synthetic Image Detection
https://github.com/yywencs/LTD

>OmniVTON++: Training-Free Universal Virtual Try-On with Principal Pose Guidance
https://github.com/Jerome-Young/OmniVTON-PlusPlus

03/11/2026

>anima-preview2.safetensors
https://huggingface.co/circlestone-labs/Anima/tree/main/split_files/diffusion_models

>Reviving ConvNeXt for Efficient Convolutional Diffusion Models
https://github.com/star-kwon/FCDM

>BinaryAttention: One-Bit QK-Attention for Vision and Diffusion Transformers
https://github.com/EdwardChasel/BinaryAttention

>QUSR: Quality-Aware and Uncertainty-Guided Image Super-Resolution Diffusion Model
https://github.com/oTvTog/QUSR

>InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing
https://github.com/OpenGVLab/InternVL-U

>SD Forge Nvidia VFX: VideoSuperRes extension for Forge Neo implements
https://github.com/Haoming02/sd-forge-nvidia-vfx

>Nvidia_RTX_Nodes_ComfyUI: "RTX Video Super Resolution"
https://github.com/Comfy-Org/Nvidia_RTX_Nodes_ComfyUI

>Gemini Embedding 2: Natively multimodal embedding model
https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-embedding-2

03/10/2026

>HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising
https://jacky-hate.github.io/HiAR
>>
>mfw Research news

03/12/2026

>Just-in-Time: Training-Free Spatial Acceleration for Diffusion Transformers
https://arxiv.org/abs/2603.10744

>HanMoVLM: Large Vision-Language Models for Professional Artistic Painting Evaluation
https://arxiv.org/abs/2603.10814

>ID-LoRA: Identity-Driven Audio-Video Personalization with In-Context LoRA
https://arxiv.org/abs/2603.10256

>The Quadratic Geometry of Flow Matching: Semantic Granularity Alignment for Text-to-Image Synthesis
https://arxiv.org/abs/2603.10785

>Delta-K: Boosting Multi-Instance Generation via Cross-Attention Augmentation
https://arxiv.org/abs/2603.10210

>StyleGallery: Training-free and Semantic-aware Personalized Style Transfer from Arbitrary Image References
https://arxiv.org/abs/2603.10354

>A^{2}-Edit: Precise Reference-Guided Image Editing of Arbitrary Objects and Ambiguous Masks
https://arxiv.org/abs/2603.10685

>Variance-Aware Adaptive Weighting for Diffusion Model Training
https://arxiv.org/abs/2603.10391

>Unlearning the Unpromptable: Prompt-free Instance Unlearning in Diffusion Models
https://arxiv.org/abs/2603.10445

>The Orthogonal Vulnerabilities of Generative AI Watermarks: A Comparative Empirical Benchmark of Spatial and Latent Provenance
https://arxiv.org/abs/2603.10323

>Fighting Hallucinations with Counterfactuals: Diffusion-Guided Perturbations for LVLM Hallucination Suppression
https://hamidreza-dastmalchi.github.io/cipher-cvpr2026

>Does AI See like Art Historians? Interpreting How Vision Language Models Recognize Artistic Style
https://arxiv.org/abs/2603.11024

>Taming Score-Based Denoisers in ADMM: A Convergent Plug-and-Play Framework
https://arxiv.org/abs/2603.10281

>COMIC: Agentic Sketch Comedy Generation
https://susunghong.github.io/COMIC

>EmoStory: Emotion-Aware Story Generation
https://arxiv.org/abs/2603.10349

>LiTo: Surface Light Field Tokenization
https://apple.github.io/ml-lito

>V2M-Zero: Zero-Pair Time-Aligned Video-to-Music Generation
https://genjib.github.io/v2m_zero
>>
File: image(1).jpg (720 KB, 1536x2752)
720 KB
720 KB JPG
https://civitai.com/models/2461222?modelVersionId=2767330
Wow, the best anime aesthetic illustrious checkpoint I’ve seen so far, but there are one or two catches, in case you notice ;)
>>
>>108357608
>two catches
It's antiquated XL and only a LoRA?
>>
>>108357552
>>108357562
why is this schizo spamming his news slop on /ldg/ now?
https://rentry.org/debo
>>
>>108357651
For each character you have to use its dedicated Lora. If you look at all the demo images, each one is using its own character Lora.
>>
https://huggingface.co/GuangyuanSD/Z-Image-Distilled


Is this the new meta for z-image? It's a "turbo" z-image base, and it works with z-image base loras (the official turbo doesn't)
>>
File: zimage sissies.png (29 KB, 749x193)
29 KB
29 KB PNG
>>108357733
the absolute state of z-image enjoyers
>>
File: Anima0.2_00016+17.png (2.23 MB, 2048x1024)
2.23 MB
2.23 MB PNG
>>108357140
>>108357150
There was an attempt.

>>108357406
kys Ani
>>
>>108357733
what's the fucking point? if I wanted a turbo version of Z-image base there's already Z-image turbo
>>
>>108357749
20 looks better?
>>
>>108357748
Based
>>108357759
They fucked around with base a bit before releasing so turbo is technically a turbo version of a different base
>>
>>108357802
Yeah, and it was the same seed. I didn't use quality tags, so 50 may have opted for lower somehow.
>>
File: 1760383228697305.jpg (42 KB, 736x736)
42 KB
42 KB JPG
>a photograph of the woman in image 1, wearing the outfit on image 2
i havent visited ldg since z-image-base came out and didnt live up to my expectations, so i feel a bit out of the loop. what's the latest? i've been running klein 9b exclusively since then, and realized that i can never go back to a model without edit functionality.
>>
>>108357802
>>108357862
Depends on the model and sampler. Sure, some samplers like extra steps, but if the model was trained for ~30 steps, using more steps won’t help.
>>
File: 00022-3948370380.jpg (456 KB, 1440x2400)
456 KB
456 KB JPG
>>108357373
qwen image 2512 is superior for text2image.
>>
>>108357373
>it has zero knowledge on anything.
just input a reference image of whatever you want it to do. works with clothes, hairstyles, poses, nudes, whatever. especially if you combine it with a lora since it's so easy to train
>>
File: file.png (15 KB, 402x74)
15 KB
15 KB PNG
Thanks Comfy.
>>
>>108357966
Nothing too major for realistic stuff, I think. A version of Klein 9B with KV-caching just came out. Says it needs a 5090 or higher, not sure if accurate:
https://huggingface.co/black-forest-labs/FLUX.2-klein-9b-kv-fp8

On the anime side, a new small SDXL-killer candidate called Anima came out, and just got its second preview release:
https://huggingface.co/circlestone-labs/Anima
>>
What's a good place for anime generation request?
>>
>>108358018
>>>/b/degen
>>
>>108358018
the meta now is /hgg/, second /edg/
>>
>>108358033
It's not porn, but thanks.
>>108358036
Thanks fren, I'll check it out.
>>
>>108358011
>Says it needs a 5090 or higher, not sure if accurate:
>or higher
lmao. i guess it's back to the lodestone waiting room i guess
>>
>>108358018
Happy to help. We’re a bit short on genners right now, so if you can share your gen here, it’d be appreciated ^^
>>
>>108358018
>>>/r/
Here we don't fulfil any requests from poorfags who do not own GPUs
>>108358080
Cry more
>>
>>108358090
Why are you talking shit about us when the anon explicitly said he didn’t want to make requests here?
>>
>>108358098
Because he’s a retard from some dead general who’s been insulting genners here since last year.
>>
>>108358107
Before insulting /adt/, do you have any proof?
>>
https://huggingface.co/black-forest-labs/FLUX.2-klein-9b-kv-fp8

we're eating well boys. faster and better apparently.
>>
File: 00223-378632948.png (1.96 MB, 1608x1224)
1.96 MB
1.96 MB PNG
>>
>pull latest frontend
>still cant ctrl+v images from clipboard
fucking tards
>>
>>108358153
Like I said, the model is dumber. The seagull can’t grab the picture and it’s flying in the opposite direction.
Anima 1, and especially ZiT and ZiB can handle this simple prompt, but Klein 9B and 4B just slop it out and can’t do it either.
Thanks for testing and proving my point.
>>
>cant drag and select multiple from canvas anymore
are these niggers actually hit in the head? who the fuck is approving all these dogshit changes?
>>
>pull latest frontend
>still can’t view the metadata unless I download the entire asset and bloat ComfyUI with yet another workflow
>>
>pull latest frontend
>still can’t snap or magnetize the nodes between them, or adjust them at the same time to be the same size.
>>
>pull last frontend
>nodes 2.0 still in beta
>>
>pull latest frontend
> inpainting is still a pain in the ass, no custom bbox like invoke
>>
>pull down pants
>no penis
comffFFFYYYYYYYY!!!!!
>>
File: 1756520520125155.png (759 KB, 832x1216)
759 KB
759 KB PNG
Why is /g/ recommending Anima? Not only do images take longer to generate but also the quality is so much worse than something like WAI Illustrious
Prompt understanding can't save it if generates 3 arms like it's 2023
>>
>>108358211
we are naturally better at proompting desu
>>
File: nosey.mp4 (2.41 MB, 704x1280)
2.41 MB
2.41 MB MP4
>>
>>108358211
Illustrious is still the GOAT. We recommend it because we’re a Comfy biased general. More decentralized generals tend to ignore Anima completely.
>>
>>108358211
You're not tired of that style yet?
>>
>>108358187
just open the file in any text editor
>>
File: 1771801764853883.png (1.12 MB, 832x1216)
1.12 MB
1.12 MB PNG
>>108358229
gotcha
>>
>"comfy biased general"
>right after a flood of "why comfy suck" posts
what did anon mean by this
>>
>>108358132
>fp8retards
toy quant
>>
File: 1772522135284424.jpg (1.21 MB, 4654x1819)
1.21 MB
1.21 MB JPG
>>108358238
nope, I change styles often
these are my favs so far
>>
>>108358259
>I change styles often
unfortunate that they all have that ugly WAI sheen
>>
>>108358211
psychologically refreshing anon
>>
File: z-image_00638_.png (1.83 MB, 944x1280)
1.83 MB
1.83 MB PNG
>>
>>108358263
Slop is whatever strains the eyes. In a few years it’ll be "ugly Anima sheen"
>>
>>108358282
nah it comes from WAI being a jeetmix
>>
>>108358287
Noob pencil slop sheen, how about that?
>>
>>108358300
>pencil slop sheen
kekd good one
>>
>>108357957
you realize the original Klein 9B also said that right? Those HuggingFace hardware requirements listings are ALWAYS wildly, wildy overstated, IDK why exactly
>>
>>108358220
This. Remembering the anon recently who didn't understand that only prompting for the series name won't make the output look like an anime screen cap.
>>
>1.03it/s
>8 step 2 image klein 9b edit in a few seconds
>went from 20 seconds/8 steps to about 8 seconds
open source wins again
>>
>>108358371
I’m here, still sticking to my point. Dragon Ball is Dragon Ball style, not Akira Toriyama, not an anime screencap, not some masterpiece score 9. Dragon Ball, bro, it’s simple.
>>
>>108358258
How so?
>>
>>108358384
Open source and nogenners :^)
>>
File: 1 vs 2.jpg (249 KB, 1664x1216)
249 KB
249 KB JPG
>>108358211
Anima 2 seems to be an improvement tho
>>
>>108358384
what about the quality? is it worse?
>>
>>108358403
>>108358153
Aesthetic improvement, AKA coomer improvement, AKA jeet improvement, AKA sloppers improvement
>>
>Hardware
>The FLUX.2 [klein] 9B-KV model fits in ~29GB VRAM and is accessible on NVIDIA RTX 5090 and above.

I am running it fine on a 4080 (16gb). this is misleading, just run it.
>>
>>108358403
wtf is Anima 2
>>
>>108358417
His patch note states the opposite and I disagree with you thoughever
>>
>>108358055
bruh it's literally just a fucking KV-cache enabled version of regular Klein 9B, there's nothing to complain about, the model isn't physically larger or different otherwise
>>
>>108358421
https://huggingface.co/circlestone-labs/Anima/resolve/main/split_files/diffusion_models/anima-preview2.safetensors
>>
>>108358421
another intermediate checkpoint
>>
File: 1771271851288306.png (1.27 MB, 832x1248)
1.27 MB
1.27 MB PNG
>>108358416
seems fine to me?
>change the text to "CONCORD 4"
>prompt executed in 10 seconds (8 steps, default is 4)
>>
>>108358419
loras work just as well?
>>
>>108358419
runs fine on 2080
>>
>>108358384
ngl, the BFL fags are moving the local ecosystem forward, first they managed to double the inference speed, and they also showed a better way to train models
https://xcancel.com/pess_r/status/2031721036996128811#m
>>
File: 1761253418622376.png (44 KB, 504x324)
44 KB
44 KB PNG
>>108358435
havent tested any all ive done is added this to the workflow for the new kv model:
>>
>>108358444
>kv cache
is this an update comfyui situation or do i grab it from somewhere else?
>>
>>108358449
it's an update comfyui situation
>>
File: 1773121954053630.png (355 KB, 408x612)
355 KB
355 KB PNG
>>108358455
oh no... pray for me anons. here we go
>>
File: 53434.jpg (596 KB, 1332x759)
596 KB
596 KB JPG
>>108358211
It is what it is, but seems easier/take less tries to prompt multiple characters doing different actions with minimal bleeding in a single prompt, than doing it with wai or noob
too bad the details are still fucked
>>
File: 1744376521863938.png (1.44 MB, 832x1248)
1.44 MB
1.44 MB PNG
the asian girl is holding a sign in her left hand saying "flux 9b kv = fast edits"
>>108358449
update comfy to get the node (use update in the update folder, not within the app). didnt have it till I did that.

also I bumped the steps from 4 to 8 cause it's already so fast. 4 is fine for most edits, 8 is better for text gens.
>>
UPDATE: ok so I pulled and everything worked fine, thanks anons
>>
File: 637.png (5 KB, 295x79)
5 KB
5 KB PNG
ack
>>
File: 1757164080651235.png (1.43 MB, 992x1040)
1.43 MB
1.43 MB PNG
over the girls chest is a large yellow sign with black text saying "whoops! cant show that on a christian message board!"

I think this model is even better at longer strings of text. I dont think it's JUST speed. text edits work in general but sometimes you'd have to fix a word or two.
>>
File: 1767420880332717.png (1.56 MB, 992x1040)
1.56 MB
1.56 MB PNG
>>108358477
the girl is dressed in a brown bear costume.

this model is SO fast, holy shit. at least twice as fast as when I used the fp8 or Q8 9b distill model.
>>
>>108357430
Comfy already does this.
>>
File: 1748017998337041.png (1.31 MB, 784x1312)
1.31 MB
1.31 MB PNG
replace the outfit of the girl in image 1 with the outfit of the girl in image 2.

pretty good elegg desu
>>
bro do some floyd bro ive been waiting all day
>>
>>108358505
if I want to use that thug as a hello world test case then I will, thanks
>>
hes not a thug bro hes an inspiration to the society and may he rest in piece bro
>>
>>108358500
not manually
>>
>>108358500
Not this anon but comfy takes up all my vram to the point my whole system is taking a hit, even with reserve vram 1, while this node lets me choose how much I want to stick in the vram so it doesn't slow down everything
I can't run LTX2 nor klein with base gguf nodes, but it works fine with those
>>
File: 1745700275478012.png (1.37 MB, 1168x880)
1.37 MB
1.37 MB PNG
the man in image 1 and the girl in image 2 are shaking hands. keep their expressions the same.
>>
File: 1769386693142739.png (1.36 MB, 896x1152)
1.36 MB
1.36 MB PNG
replace the face of the man in image 1 with the face of the man in image 2.

literally fors
>>
>>108358458
IT DESTROYED ALL MY SUBGRAPHS NOOOO
>>
File: DOMP IT.png (823 KB, 1080x1080)
823 KB
823 KB PNG
>>108358566
>Mr Comfy, he pulled
>>
>new speedup
>does not change anything else about the model or its outputs
>mikuryanbanefag posting nearly identical gens from the first time he demod the model
>"pretty good desu"
Never change, /ldg/
>>
>>108358477
>whoops can't show that...
dude, the japanese are champions of censorship. some ecchi and hentai don't even have nipples. fuck you, amically
>>
File: 00173-1963271001.jpg (247 KB, 896x1152)
247 KB
247 KB JPG
>>
>>108358627
>the japanese are champions of censorship.
the US forced them to censor genitals after bombing them 2 times with an atomic bomb, if you wanna blame someone, blame the muricans
>>
can you generate seeds inside subgraphs now?
>>
File: 1747161527217419.png (1.27 MB, 1088x944)
1.27 MB
1.27 MB PNG
the new model is SO fast just add in the kv cache node after the model node.

the man in image 1 is standing beside the girl in image 2.

kek it made him anime
>>
File: 1746094336250546.png (1.21 MB, 832x1248)
1.21 MB
1.21 MB PNG
>>
File: 1742534315670079.png (1.19 MB, 832x1248)
1.19 MB
1.19 MB PNG
>>108358704
>>
>6 second 1500x1000 euler generation with a reference image
black magic
>>
File: Anima0.2_00018_.png (404 KB, 1024x768)
404 KB
404 KB PNG
>>108357455
>>108358153
>>108358180
NTA, but where there's a will...

I will say the composition was all over the place, so it took a lot of RNG, with the prompt tweak attempts not being really consistent. Anima preview 1 did not seem any smarter about it though. ZIT gave more consistently sensible composition, but was still stubborn about having the seagull always facing Miku despite my prompting attempts (plus it didn't know Teto). I did not try Z-Image non-Turbo or Klein.
>>
File: z-image-turbo_00003_.png (836 KB, 1024x768)
836 KB
836 KB PNG
>>108358749
ZIT tended to do this.
>>
the new KV version cannot do nipples anymore
>>
>>108358678
>>108358549
>>108358503
WELCOME BACK MIKU BRO
WE ARE BACK
>>
>>108358757
>>108358749
This prompt needs to be in the benchmarks. I'm trying to generate a volleyball serve or block in anima 2 but the character can't position the hands correctly, 100% goon model
>>
>>108358803
Yeah, cowboy shooters always win, slice of life Yuru Camp or DIY anime strugglers always lose with each finetune
>>
File: 1750744010987022.png (1.69 MB, 832x1248)
1.69 MB
1.69 MB PNG
>>108358795
indeed, the kv model + node makes edits like 2x faster or more, highly recommend
>>
>>108358816
>the kv model + node makes edits like 2x faster or more
I wonder if this can be applied to all the other models like Wan, LTX, Z-image... if the quality stays the same that's a huge deal
>>
>>108358816
>highly recommend
12gb VRAM?
>>
>>108358824
Nah, 16GB minimum is the classic case of "socialism for the rich" in the local scene
>>
kv model seems way way worse at anatomy. i'm getting extra arms in every image.
>>
File: ComfyUI_00066_.jpg (2.03 MB, 1856x2304)
2.03 MB
2.03 MB JPG
>>108358826
ty
>>
>>108358832
>i'm getting extra arms in every image.
How many, i'm interested
>>
>>108358824
maybe, works on 16 for me (4080)
>>
File: 1764590370707630.png (699 KB, 640x886)
699 KB
699 KB PNG
>>108358841
too many
>>
>>108358854
lol squiddly diddly
>>
>>108358854
that is peak jeet gooning material right there.
>most beauty vishnu do be showing bobs!
>>
File: ComfyUI_00071_.jpg (1.39 MB, 1856x2304)
1.39 MB
1.39 MB JPG
>>
12gb RTX 3060 BROS KLEIN KHHV WERKS AT SUPER SPEED
WE
ARE
BACK
>>
>>108359016
Also this >>108358854
>>
>>108359016
Why did they say you needed a 5090 lol?
Did they just have some AI hallucinate it?
>>
>>108359038
Stop reading everything. Most stuff is just bloat.
I never read anything and I'm fine.
>>
i gave up on KV. character likeness and anatomy is even worse than klein. might be good if you quickly want to remove a bird from a photograph of something.
>>
>>108359103
I'm having a good time substituting clothing and making scenes with reference characters. It seems like it captures the characters’ styles better and is more creative. Although I do admit that the issue with extremities is concerning.
>>
>>108359156
turns out speed comes at a price.
>>
>>108359156
Yes and increasing steps up to 20 does not help
>>
>>108359038
"they" don't want anons to try things
>>
>>108359038
the jews fears the mikutesters
>>
File: 1764113152520666.png (10 KB, 354x153)
10 KB
10 KB PNG
after updating comfy, my float nodes are incrementing by +1/-1, when before they were incrementing by +0.1/-0.1. i remember i changed this before somehow, but i cannot remember how and i cannot find any information about it either. does anyone know how to fix it?
>>
>>108359206
Custom node? Maybe ve to an AI the code
>>
>>108359233
no it's just the regular float node
>>
>>108359242
Oh soorry, i'm more of a boolean troon
>>
>>108359206
You didn't top up ComfyCoins. If you have zero coins, Comfyanonymous will disable random features until you buy a lootbox node.
>>
does this kv shit work with offloading? vanilla 9b worked on 8gb for me
>>
There's no way comfyui is this shit, he has to be breaking shit on purpose, my older version works flawlessly and this new installation it breaks on the most simple things like preview images not updating correctly
>>
>>108359438
btw the lootbox is a dozen API gens using your prompts that were logged by comfyui
>>
File: ComfyUI_00019_.png (504 KB, 1248x1664)
504 KB
504 KB PNG
>>
>>108358211
Try animayume.
>>
>>108359589
kek
>>
>>108359576
comfyui's own templates are broken as the seed node doesn't work correctly being in the subgraph, so I have no good expectations for comfyui
>>
File: 1769450716789894.png (3.33 MB, 1656x1232)
3.33 MB
3.33 MB PNG
>>
>>108359604
I know they had a chink guy Chenlei Hu working on their frontend, its likely cumfaggot fired him or something because there's no way this is the same guy, the quality drop is insane and don't get me started on their topbar that takes a fuckton of space for no reason
>>
File: Anima0.2_00021-24.png (1.54 MB, 2048x1536)
1.54 MB
1.54 MB PNG
>>108358749
Composition-wise I had a pretty good streak going for a while with this prompt, but then it started getting random again. Still felt like it had more success than some others I tried.
>1girl, @optionaltypo, hatsune miku, vocaloid, full body, running, beach, sand, ocean, from side. Hatsune Miku is running across the beach. She is chasing a small seagull that is flying away with a paper photo of Kasane Teto in its mouth.
>Neg prompt: chibi
Beak grasp is elusive, but might be improvable with some magic wording.
>>
>>108358823
To anything that is transformer, more than that, it should be possible to quantize the cache reducing it's size even further.
>>
>>108358854
how the fuck do you get bad anatomy like extra limbs on edits? it shouldn't be changing shit you don't tell it to if your prompt is non-retarded and you aren't doing something weird with the scaling / workflow
>>
>>108359669
Wait, may be it's not, since klein was finetuned for kv cache.
>>
>>108359206
just divide by 10 in the next node
boom
>>
>>108359177
20 steps is way too much for a distilled model. Are you running non-scaled FP8 or some shit (for either the model or the text encoder)? If so don't
>>
>>108359557 (me)
damn, on fp8 model single image ref works, adding second image - oom...
>>
File: 1743562496617122.png (1.64 MB, 1024x1536)
1.64 MB
1.64 MB PNG
this is way late but can I get a catbox or prompt for this pic?
>>108268638
>>
hello?
>>
keep it down, everyone's trying to sleep
>>
File: 1746124908994529.png (3.09 MB, 1824x1216)
3.09 MB
3.09 MB PNG
>>108360133
some sloppa for u
>>
Hibernation Mode
>>
Toriel and Lopunny fusion image, thinking about making a chatbot to accompany it on pephop.ai Should I try hard on trying to make the personality a perfect blend of the two or will the sheer sexual power of these two be enough for most gooners?
>>
should checkpoints just work with a basic text2img workflow? uncanny is the only thing that has decent results.
>>
File: 1768895564798630.png (2.39 MB, 1344x1728)
2.39 MB
2.39 MB PNG
https://www.reddit.com/r/StableDiffusion/comments/1rsfvxv/ultrareal_lora_for_klein_9b/
is this a joke? with the lora it's Flux 1's tier in terms of realism, completly plastic and the famous flux chin is here too
>>
File: file.png (2.66 MB, 1584x1312)
2.66 MB
2.66 MB PNG
looks like it's a bit more slopped with the kv cache thing
>>
>>108360296
People unironically love slopped images for some insane reason but luckily, you can filter those loras out. Usually, you want loras that are trained after some sort of camera that work but even with loras doing that like https://civitai.com/models/1662740/lenovo-ultrareal, you still see some biases. Better than nothing but it will only be a matter of time until someone grifts and uses the same training as your picture into these loras,
>>
>>108360315
>(slopped:1.25)
vs
>(slopped:1.27)
>>
>>108360209
"just working" is completely antithetical to cumfart's user experience idea
>>
File: 1761252804375406.png (188 KB, 1956x1161)
188 KB
188 KB PNG
Come on, do something...
>>
Anyone have a klein kv workflow?
>>
>>108360387
this. i miss when auto and forge were the meta. comfy is a tyrant
>>
>>108360387
>>108360436
all right ani you can stop samefagging now
>>
>people unironically running the fp8 instead of bf16
LMAO
>>
>>108360440
not ani. my post is only the second one. you really need to stop thinking about ani all the time. comfy hate is diffusion general culture that will never go away despite your desperate shilling
>>
>>108360410
its dead jim
>>
>>108360446
so what's today's FUD recipe, not ani?
>>
oh dear we have so many loyal comfy fans here... i wonder how did it get so many fans always ready to defend it at all times, and all for free? incredible
>>
>>108360469
35 stars status?
>>
File: v0.17.0.png (129 KB, 910x800)
129 KB
129 KB PNG
Huh, just noticed 0.17.0 came out hours ago with support for the KV cache Klein model.
>>
>>108360479
Nice, add this to the next news anchor please
>>108357552
>>108357562
>>
>>108358749
All local models are dogshit at prompt comprehension, nobody wants to admit it but it's true. I've tried SD 1.5, SDXL, more recently Flux, Zimage, Anima all of them are fucking dogshit and are way below my expectations for where they should be. Proprietary models still btfo, they're the only models can understand your fucking prompt even if it's something that hasn't been in training set 9,000 times, only models that are generalizing properly and developing some level of true understanding. Architecture change is needed I imagine to get anywhere with small local models I feel like, way we are doing it now clearly only works with dumb brute force scale and is just not optimal
>>
Apparently I am retarded because this external model folder yaml file ain't working.
>>
>>108360538
Anon, I think everyone here already knows that local is nothing more than a grift and just a way to attract attention for the big players. And they're slowly giving up on it, there's no money there and unless you actually love foss, there's no reason to invest in local. The only practical reason for using local models is porn.
>>
File: 1769147035482431.png (1.14 MB, 1935x1257)
1.14 MB
1.14 MB PNG
>>108360546
blatant issue of PEBKAC
>>
>>108359641
Good
Instead of mouth, try beak, that's what bird mouth are called.
>>
>>108360554
Fed it to chatgpt. What kind of fucking retard at comfy made that yaml file?

You have to remove basically 100% of the text and copypaste folders AND now how to not break YAML code structure.

Rape open source analy till it bleeds.
>>
>>108360554
>>108360575
Uh oh anime2real poster meltie
>>
>>108360575
>doesnt know how yaml needs to be formatted
>blames others
lol
but I agree yaml sucks, I prefer json
>>
>>108360553
I disagree, just way we are doing text encoding is clearly dogshit and doesn't give optimal results without insane amounts of scale. The models have decent level of understanding, but it falls the fuck apart when it comes to encoding intent to the model. Likely not a impossible thing to overcome, it very likely will eventually, but labs need to explore novel approaches more and at faster pace
>>
>>108360575
small multi-million corporation... please understand!!
>>
When can I actually use Nodes 2.0? They look way better and are more appealing than the regular ones but the DanBooru autocomplete doesn't work, same with a bunch of other stuff...
>>
>>108360575
>kikegpt and not his own local model
cringe
>>
So klein still sucks, but now it takes me 3seconds to see the failed results instead.
>>
>>108360575
You always have to feed everything to ChatGPT or use symlinks. Still seems bizarre that you can't even save assets outside of ComfyUI.
>>108360599
Local LLMs are shit. The gap in local LLMs is massive, nothing compared to diffusion.
>>
>>108360618
The gap with local LLMs are small, gap with diffusion is fucking massive. I don't know what drugs you are on, I want some
>>
>>108360618
>Local LLMs are shit. The gap in local LLMs is massive, nothing compared to diffusion.
can you fuck off to your comfyui discord with this fud bullshit?
>>
>>108360599
not his own local model(...) fed chatgpt slop logs by some random slopper
>>
>>108360622
>The gap with local LLMs are small
No local Opus for slowburn, fuck off
>>
>>108360616
>So klein still sucks
yeah... but this is the best that we have unless Alibaba decided to wake the fuck up and release Qwen Image 2.0 or Z-image edit
>>
>>108360629
Idk Qwen 27B seems pretty good, maybe just a skill issue or you are faggot with only 4GBs of VRAM. It just the job done for the most part while us diffusion tards are stuck with prompt adherence that feels like it's stuck in Dalle 3 days, maybe quality in on par but about it I care about it following my fucking prompt more
>>
>>108360133
Dead general
>>
File: 213123123123.jpg (387 KB, 2000x1582)
387 KB
387 KB JPG
Does regular klein loras work for this kv version?

I need less plastic skin.
>>
>>108360671
>I need less plastic skin.
you won't get that with klein lol
>>
>>108360671
Why is it the same photo 6 times with no change? Either way you can just img2img it with another model that gives less plastic skin if all else.
>>
>>108360654
Nah man, local models have like 3 personality archetypes and 2 3 ways characters react to and describe the same situation, with 0 speech pattern variety. For creative writing, Claude mogs local hard and also knows copyrighted characters way better. I've used both and the difference made me want to rope..
>>
>>108360696
people who believe we can reach 4.6 opus with a 27b model are delusional, blame Nvdia from not giving us cheap gpu with enough vram to run bigger models :(
>>
I was told there would be z-image base finetunes
>>
File: ComfyUI_00095_.jpg (343 KB, 1024x1024)
343 KB
343 KB JPG
>>
>>108360719
sir
>>
>>108360696
I use Opus for my BA students, it's the only model that can capture their individual quirks withput mass slopping.
>>
File: ComfyUI_00075_.jpg (131 KB, 1024x1024)
131 KB
131 KB JPG
>>
File: Flux2_Klein_9b_kv_00046_.png (1.34 MB, 1168x880)
1.34 MB
1.34 MB PNG
>>108360676
But it's like 19gb of a model.

>>108360690
I see what you did there.

I've been relying on zit i2i ever since it came out.
>>
>>108360719
BLOODY BENCHOD DO NOT REDEEEEEEEEEEEEEEEEMMMMMMMMM
>>
>>108360410
*becomes SaaS*
>>
>>108360728
OMGGG A BLUE HAIRED TETO!!!
>>
>>108360733
based
>>
>300MB+ of templates
damn! These niggas are retarded
>>
Setting klein kv to gen like 30 images in a minute will get you an ok result eventually.
>>
File: ComfyUI_00091_.jpg (264 KB, 1024x1024)
264 KB
264 KB JPG
>>
>>108360751
that's definitely a dalle3 image, but since it made me laugh I'll let that slide kek
>>
File: ComfyUI_06655_.jpg (1.22 MB, 1536x1024)
1.22 MB
1.22 MB JPG
>>108360753
its flux klein
>>
>>108360759
Needs more piss filter
>>
>tell klein to remove the black dress
>it does and adds areolas and nipples

I think it recognized it as a man since it's fairly flat chested and didn't have a overly female haircut. Tried replicating it with other images, didn't work.
Sexist.
>>
>>108360759
can't believe I used to say that this was good, you have to understand it was the very first decent edit model kek
>>
>>108360766
>areolas
to be fair, short haired feminists don't shave so...
>>
File: 00010-316588865.jpg (118 KB, 1024x1024)
118 KB
118 KB JPG
>>
File: ComfyUI_03243_.jpg (437 KB, 1024x1280)
437 KB
437 KB JPG
Wish I switched to comfyui earlier
>>
is civitai shitting the bed for everyone or just for me?
>>
Ok, I'm starting to get a hang of it.
>>
>>108360616
skill issue
>>
>>108360825
yes, for two or three days already
>>
File: mqdefault.jpg (15 KB, 320x180)
15 KB
15 KB JPG
Can a friendly anon help a nigga out?
i am trying to generate some design ideas for my combat robot (real life robot battler) but because this nigga uses screwed propulsion, see screw tanks, the image generators online cant fucking handle it.

picture fucking related.
>>
>>108360869
> online
we are local fuck off
>>
>>108360869
Maybe it knows about the Shagohod?
>>
>>108360895
api nodes is okay
>>
File: 1751283063963406.png (19 KB, 623x385)
19 KB
19 KB PNG
>>108360895
>>108360903
>>
>>108360831
Is that a photo of (You) on the right?
>>
>>108360924
just like lmg is llama cpp thread
ldg is comfyui thread
fuck off
>>
File: 1761214880857243.jpg (450 KB, 1536x1536)
450 KB
450 KB JPG
>>108360965
kys
>>
Why is the developer of the dead wrapper known as Tranustudio such a gaped, worthless retard?
>>108360965
Disregarding the fact that the general has local in its name, there's a subhuman faggot avatartroon shitstain that won't leave this place and keeps shitting on comfy and anima due to various reasons, each more utterly retarded than the last
So anon probably assumed you were that nuisance
>>
>>108360986
>No one can dislike my preferred thing for any reason in any thread or he's Ani
>>
Does the snapshot manager in comfy manager take the entire app of just the manager?
>>
File: 1756804083577404.png (2.95 MB, 1664x1312)
2.95 MB
2.95 MB PNG
>>
Oh god, why did I update..
>>
>>108361125
saar stop being racist
>>
>>108361125
good cattle
use comfyui because that's expected of you
update because that's what they tell you to do
disregard all your actual experiences, do not believe yourself, comfyui is good because nameless bots told you so. enjoy your existence as an obedient drone and don't forget to subscribe to comfy cloud retard
>>
>>108361125
>>108361129
>>108361139
Don't you get tired of being a worthless raped retard, Julien?
Your dogshit wrapper will never be anything but stillborn
>>
>108361143
>no you can't criticize bad design decisions becasue muh ani
ok retard
>>
>>108361143
7 minutes, that's a slow response. Act faster next time or your monthly allowance of comfy coins is going to be cut in half
>>
File: raped.jpg (353 KB, 800x800)
353 KB
353 KB JPG
>>108361153
>>108361157
That's nice, shame the opinions of subhuman cocksleeves for old rich men are worthless
>>
who the fuck are ani and julien? can you fucking faggots shut the fuck up?
>>
>saar my ui updatings are very powerful productivity!!!
>>
>>108361185
Hello, these are very important topics to this general as you can tell by the inclusion of these in the OP
>https://rentry.org/debo
>https://rentry.org/animanon
>>
>>108361185
>who the fuck are ani and julien? can you fucking faggots shut the fuck up?
Wrong question
"Who" is used to refer to people, not subhuman shitstains
>>
>>108361195
i don't give a fuck, hang yourself, fucking loser
>>
so let me get this straight
if you say anything bad about comfy, trolls are just going to derail the thread and call you their boogeyman and post some guy's dox?
how is this okay with anyone? doesn't anyone think it's very suspicious at the very least?
>>
>>108361211
That's literally what happens yes. Even if you tell them you hate Ani's shit too they still sperg out like a jeet being called a paki.
>>
>>108361125
pip install -I comfyui-frontend-package==1.24.4
>>
>>108361211
>how is this okay with anyone? doesn't anyone think it's very suspicious at the very least?
Becauae you are that much of an insufferable subhuman, Julien
And, please, follow your own advice (>>108361202)
>>
>>108361211
This is lord ranjak's personal chat room and blog and you show deference to him here by attacking his sworn enemies, ani and debo.
>>
>>108361211
It's just regular trolls mixing in. No way anyone would defend comfy's retarded UI updates outside of the jeets working on it.
>>
>>108361235
>>108361236
Was that japan trip that much of a failure?
Lmao
Suffer, Julien
>>
File: EG8.jpg (78 KB, 452x1071)
78 KB
78 KB JPG
>>108361251
Explain this UI change without attacking anybody, saar.
>>
>why yes I do watch kino casino and I also want to be a heckin epic AI alog like the intro guy, how could you tell?
>>
>>108361256
Why?
I just want Julien to suffer
>>
>>108361256
>blueprints
>apps
>templates
isnt all that the same thing
>>
File: 00030~01.jpg (100 KB, 447x488)
100 KB
100 KB JPG
Does the KV cache node depend on the position in the workflow? Because my gen times didn't change at all.
>>108361274
Blueprints are saved subgraphs. Templates are premade WFs, apps are the new shareable thing I think?
>>
File: ComfyUI_00003_.jpg (230 KB, 2000x2000)
230 KB
230 KB JPG
I'm so sick of /adt/ troll farms. Why don't they enjoy Anima instead of attacking this thread?
>>
A fashion designer today must be absolutely thrilled in the designs they can spew out these days. That's one market that will always need manual labor, the chinese sweatshops.
>>
>>108361301
How do you make the preview node display both pics?
>>
>>108361293
Anima is pretty okay for being preview 2, still some anatomy issues and it's pretty slow compared to SDXL, but it will probably eventually be the replacement for illustrious and noob.
(he will reply accusing me of being Ani for not saying it's perfect already)
>>
File: 1742377951999251.png (152 KB, 1121x827)
152 KB
152 KB PNG
>attempting to pass clipart off as a legitimate gen
>deflecting its shitposting onto the other general that rejected the dogshit wrapper (what a coincidence!)
totally not the worthless raped """dev""" guise
>>
>>108361309
> what is stitch images node
>>
>>108361309
>>
>>108361317
>>108361334
why can't you /adt/ trolls leave this general? we just want to discuss local diffusion in peace and leave ani out of it, he stopped using 4chan altogether because of faggots like you
>>
>>108361352
I've been posting in the image gen threads since before comfy ui existed and any of these retards started shitting up the board you fuckwit
>>
>>108361352
if only that were true
sadly, you keep sullying this place with your utter subhumanity, you worthless raped retard
>>
>>
>>108361388
yoooo, gib links to that kind of thot, legit my favorite dress style of thots
>>
>>108361388
whose sex did he offend?
>>
File: Code_YF99g7tR9r.jpg (53 KB, 331x436)
53 KB
53 KB JPG
How do I change the float so it can have the smallest increment in 0.01s? It only let's me do 0.1 now.
>>
File: 1756835294709976.png (21 KB, 1152x233)
21 KB
21 KB PNG
>>108361414
here brownie
>>
I don't remember the normal 9b klein being this good, is the KV doing other things than just speeding it up?

>>108361396
>>108361403
No idea, probably a tranny.
>>
>>108361417
thanks
>>
>>108361431
How big was the speedup for you because in my wf it does nothing. Is it just for editing?
>>
>>108361446
Cba to check with the default one, but I recall it being like 20s or so.
With kv it's like 2-3seconds, on a 5090.
>>
I remember the 9b being cucked, while 4b wasn't. At least trump works on 9b kv.
>>
Very nice.
>>
>>108361568
ai was a mistake
>>
New bread
>>108361595
>>108361595
>>108361595
>>108361595
>>
Ok, this isn't working out, can probably fix it with controlnets if that's available for klein.

>>108361600
Life is a mistake.
>>
>>108361610
> facing right
>>
>>108361623
Yeah it was my last attempt, seeing if it was messing up the directions, I had left previously.
>>
hey guyss!



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.