/g/ - Technology

The Perfect Curve Edition

Discussion of Free and Open Source Diffusion Models

Prev: >>108361595

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>mfw Resource news

03/13/2026

>ComfyUI-CapitanZiT-Scheduler
https://github.com/capitan01R/ComfyUI-CapitanZiT-Scheduler

>DVD: Deterministic Video Depth Estimation with Generative Priors
https://dvd-project.github.io

>InSpatio-WorldFM: An Open-Source Real-Time Generative Frame Model
https://inspatio.github.io/worldfm

>DyWeight: Dynamic Gradient Weighting for Few-Step Diffusion Sampling
https://github.com/Westlake-AGI-Lab/DyWeight

>Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation
https://firm-reward.github.io

>BiGain: Unified Token Compression for Joint Generation and Classification
https://github.com/Greenoso/BiGain

>EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation
https://silentview.github.io/EVATok

>GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing
https://grade-bench.github.io

>Detect Anything in Real Time: From Single-Prompt Segmentation to Multi-Class Detection
https://github.com/mkturkcan/DART

>MV-SAM3D: Adaptive Multi-View Fusion for Layout-Aware 3D Generation
https://github.com/devinli123/MV-SAM3D

>One Model, Many Budgets: Elastic Latent Interfaces for Diffusion Transformers
https://snap-research.github.io/elit

>CEI-3D: Collaborative Explicit-Implicit 3D Reconstruction for Realistic and Fine-Grained Object Editing
https://github.com/shiyue001/CEI-3D

>SceneAssistant: A Visual Feedback Agent for Open-Vocabulary 3D Scene Generation
https://github.com/ROUJINN/SceneAssistant

>Diffusion Blend: Inference-Time Multi-Preference Alignment for Diffusion Models
https://github.com/bluewoods127/DB-2025

>ZLD Nodes for ComfyUI
https://github.com/Z-L-D/comfyui-zld

03/12/2026

>New FLUX.2 Klein 9b models
https://huggingface.co/black-forest-labs/FLUX.2-klein-9b-kv-fp8

>Too Vivid to Be Real? Benchmarking and Calibrating Generative Color Fidelity
https://github.com/ZhengyaoFang/CFM
>>
>mfw Research news

03/13/2026

>OmniForcing: Unleashing Real-time Joint Audio-Visual Generation
https://omniforcing.com

>OSCBench: Benchmarking Object State Change in Text-to-Video Generation
https://hanxjing.github.io/OSCBench

>Coarse-Guided Visual Generation via Weighted h-Transform Sampling
https://arxiv.org/abs/2603.12057

>Noise-aware few-shot learning through bi-directional multi-view prompt alignment
https://arxiv.org/abs/2603.11617

>Enhancing Image Aesthetics with Dual-Conditioned Diffusion Models Guided by Multimodal Perception
https://arxiv.org/abs/2603.11556

>SoulX-LiveAct: Towards Hour-Scale Real-Time Human Animation with Neighbor Forcing and ConvKV Memory
https://arxiv.org/abs/2603.11746

>ForensicZip: More Tokens are Better but Not Necessary in Forensic Vision-Language Models
https://arxiv.org/abs/2603.12208

>FlashMotion: Few-Step Controllable Video Generation with Trajectory Guidance
https://arxiv.org/abs/2603.12146

>WeEdit: A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing
https://arxiv.org/abs/2603.11593

>Manifold-Optimal Guidance: A Unified Riemannian Control View of Diffusion Guidance
https://arxiv.org/abs/2603.11509

>OrthoEraser: Coupled-Neuron Orthogonal Projection for Concept Erasure
https://arxiv.org/abs/2603.11493

>ShotVerse: Advancing Cinematic Camera Control for Text-Driven Multi-Shot Video Creation
https://arxiv.org/abs/2603.11421

>UCAN: Unified Convolutional Attention Network for Expansive Receptive Fields in Lightweight Super-Resolution
https://arxiv.org/abs/2603.11680

>Locating Demographic Bias at the Attention-Head Level in CLIP’s Vision Encoder
https://arxiv.org/abs/2603.11793

>The Latent Color Subspace: Emergent Order in High-Dimensional Chaos
https://arxiv.org/abs/2603.12261

>Intrinsic Concept Extraction Based on Compositional Interpretability
https://arxiv.org/abs/2603.11795

>ZeroSense: How Vision matters in Long Context Compression
https://arxiv.org/abs/2603.11846
>>
File: ComfyUI_04948_.png (649 KB, 1024x1024)
>>
Sorbet desu
>>
how did tdrussell make anima so good?
>>
Been toying with Anima and enjoying it, but the way it handles style mixing is kind of weird. Did anyone else feel like that? I haven't tried loras yet.
>>
>>108367476
style mixing was pretty much an unintended feature of clip. pretty much all other TEs make it feel "weird"
>>
>>108367476
>but the way it handles style mixing is kind of weird
Can you post an example of what you mean?
>>
File: deBU_zi_00008_.png (3.32 MB, 2048x1170)
>>108367476
I'd love to see what you've been cooking with anima
>>
>>108367476
You're always trading aesthetics for intelligence. Loras in Anima work that way, they give you body consistency and character beauty by trading prompt adherence.
>>
>>108367476
dunno about styles in general but for artists its probably because like 90+% of images only have one artist, so like 90+% of the dataset was captioned with only one artist and now thats what the model expects
>>
>style mixing
no one has posted comparisons illustrating the supposed "better" style mixing of CLIP or how anima has "poor" mixing. how do you even measure that.
>>
File: the monstrosity.png (837 KB, 1948x1004)
>>108367368
This is why I'm creating this monstrosity. I want to be able to create a custom curve for every occasion.

Here it is! All hand-wired.
>>
File: ComfyUI_04959_.png (453 KB, 1024x1024)
huh somehow it always spells "winnin" as "winin" but when i change the prompt to "winning" it now spells it correctly
>>
File: ComfyUI_00330_.png (3.27 MB, 1824x1248)
>>108367476
>Awesome pics. Great size. Look thick. Solid. Tight. Keep us all posted on your continued progress with any new progress pics using anima. Show us what you got man. Wanna see how freakn' huge, solid, thick and tight you can get. Thanks for the anima motivation.
>>
>>108367507
that doesn't look very comfy
>>
>>108367476
Style mixing was a clip quirk sadly.
You might get to somewhere with prompt scheduling though.
>>
>>108367506
They are fucking something up and blaming the model same as always
>>
File: Flux2-Klein_00262_.png (1.49 MB, 1024x1024)
>>108367507
holy shit
>>
File: ComfyUI_15369.jpg (3.33 MB, 1500x2000)
>>108367507
Why not just create a curves interface (like you'd see in Photoshop) that writes the values to a particular step instead of trying to do it in the other direction (setting values to create a curve)?
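A minimal sketch of that idea, assuming the curve is a handful of (position, sigma) control points that get sampled once per step. The function name, the point format, and the example values are made up for illustration, and nothing here is a ComfyUI API:

```python
from bisect import bisect_right

def curve_to_sigmas(points, steps):
    """Sample a piecewise-linear curve at steps + 1 evenly spaced positions.

    points: (x, sigma) control points with x in [0, 1], sorted by x,
            starting at x = 0 and ending at x = 1 (hypothetical format).
    """
    xs = [x for x, _ in points]
    sigmas = []
    for i in range(steps + 1):
        t = i / steps
        # Find the segment [xs[k], xs[k + 1]] that contains t.
        k = min(bisect_right(xs, t) - 1, len(points) - 2)
        (x0, y0), (x1, y1) = points[k], points[k + 1]
        sigmas.append(y0 + (y1 - y0) * (t - x0) / (x1 - x0))
    return sigmas

# Hypothetical control points: hold high noise early, drop quickly late.
control_points = [(0.0, 1.0), (0.5, 0.8), (1.0, 0.0)]
schedule = curve_to_sigmas(control_points, steps=4)
```

Dragging a control point then just means editing one tuple and resampling, instead of rewiring a value per step.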
>>
>>108367506
Don't care about making experiments to prove anything to you, but in my experience anima will randomly decide to do one of the @styles in the prompt with at best vague influence from others rather than mixing different aspects from different styles like clip. Which style it lands on changes between seeds.
>>
File: the-univrse.jpg (957 KB, 1920x1080)
>>108367507
>>
>>108367543
never encountered that. its easy to lock into styles.
>Don't care about making experiments to prove anything
sounds like the issue doesnt exist, then.
>>
>>108367507
bruh
>>
>>108367562
nta but kys retard
>>
File: 1650871717180.jpg (121 KB, 627x733)
>>108367507
>>
>>108367562
nta but love yourself fren
>>
File: bad strategy.png (1.29 MB, 884x852)
I think these results are lying to me. The final images are far too burnt compared to what I would get normally.

I think I should try another strategy, something a bit more log n.
>>
>>108367586
you're overdoing it
each model will handle sigmas differently, so trying to "fit the curve" to the model is the goal, not fit the model to the curve
>>
File: 1754938855117673.png (2.98 MB, 1288x1728)
>>
>>108367615
>converse all stars, vintage 2004, intensifies
>>
>n*gbo
eww
>>
File: bbs-zit-2026-03-14_00070_.png (3.59 MB, 1920x1080)
>>
File: deBU_zi_00009_.png (3 MB, 2048x1170)
>>
File: the monstrosity part 2.png (607 KB, 2059x940)
My new technique: Gen 34 different images, each one stopping at a different step. This way, I can get images which accurately represent what's happening at each step!

Next goal: Duplicate this 33 times, one for each starting step.
>>
File: bbs-zit-2026-03-14_00083_.png (3.48 MB, 1920x1080)
>>108367956
>no output
this is edging with extra steps
>>
>>108367958
Each one of those subgroups outputs two images, one from the "denoised_output", and one from "output".
>>
>>108367433
>>108367438
Based.
>>
>>108367965
kys furryshit
>>
File: NAG anima.png (72 KB, 1091x573)
Any NAG node that supports Anima?
Preferably a sampler like this or model patch, rather than something that outputs guidance.
>>
>>108367979
Vibecode it
>>
File: bbs-zit-2026-03-14_00102_.png (3.68 MB, 1920x1080)
>>108367965
yeah ok and you already identified 18 was good enough so just do that
>>
>>108367983
NAG implementation differs from model to model, it's not as simple as copy-pasting another model's code and changing two parameters, AI would probably just hallucinate it.
It might have been doable if anyone had implemented it for the base model, Cosmos-Predict, though.
>>
>>108367993
Thank you for your wise and kind words.
>>
File: bbs-zit-2026-03-14_00108_.png (3.82 MB, 1920x1080)
>>108368018
that's what i'm here for :)
>>
>>108368014
I assure you, it is that easy
>>
File: deBU_zi_00010_.png (3.28 MB, 2048x1170)
>>
>>108368041
H-hello..
>>
>>108368039
>doubt
>>
File: deBU_zi_00011_.png (3.49 MB, 2048x1170)
>>108368060
hi
>>
File: the monstosity part 3.png (528 KB, 1457x1011)
>>108367956
All the nodes in that image have been put into a subgroup, and I've stacked as many of those subgroups up as I can. The interface is running at about 10 seconds per frame now.
>>
>>108368073
Cool. I wonder how easy this would be to script instead but comfyui scene graph is probably a nightmare (what else).
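The planning half is at least simple. A rough sketch, assuming each run is just a (start_step, end_step) pair that would be handed to a sampler accepting a start and an end step; the function names are hypothetical and no ComfyUI call is made:

```python
def stop_sweep(total_steps, start_step=0):
    """One run per stopping point: start at start_step, stop at each later step."""
    return [(start_step, end) for end in range(start_step + 1, total_steps + 1)]

def full_sweep(total_steps):
    """Repeat the stop sweep for every possible starting step."""
    return [pair
            for start in range(total_steps)
            for pair in stop_sweep(total_steps, start)]

runs = stop_sweep(34)       # the "34 images, each stopping at a different step" case
all_runs = full_sweep(34)   # every (start, end) pair with start < end
```

Each pair could then be queued one at a time instead of keeping 34 copies of the same node group on the canvas, which is probably what is dragging the interface down.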
>>
File: 1758980008577673.jpg (608 KB, 1536x1536)
nya
>>
File: bbs-zit-2026-03-14_00091_.png (3.28 MB, 1920x1080)
>>
>>108368039
catbox plox? my free claude is shitting itself
>>
File: bbs-zit-2026-03-14_00139_.png (3.3 MB, 1920x1080)
>>
File: bbs-zit-2026-03-14_00150_.png (3.75 MB, 1920x1080)
>>
File: deBU_zi_00013_.png (3.31 MB, 2048x1170)
>>
File: bbs-zit-2026-03-14_00159_.jpg (2.35 MB, 1920x1080)
>>
File: ComfyUI_00130_.png (1.44 MB, 1248x1824)
>>108368169
>https://files.catbox.moe/163oyo.py
sure bro. I'm all vibes over here, I feel like nag is solid, pag/seg do some wild shit, I am not even sure if those techniques should even work with cosmos, but fuck it, tokens are cheap and the output is interesting.
>>
>anima doesnt know adolf hitler
>>
File: bbs-zit-2026-03-14_00183_.jpg (2.02 MB, 1920x1080)
>>
>>108367885
nice
>>
>>108368381
ty anon
>>
>>108367507
Comfy is proud of you
>>
>>108367507
You can paste nodes with their connections using
ctrl + shift + v
>>
File: sexyknight.png (1.58 MB, 1500x1012)
>>
is there any new model after flux klein?
are the new chroma models ready?
>>
Where can i find a Workflow and Lora for undressing Animations ?
>>
>>108368974
sir
>>
Why did the update make the run button smaller? To clear up space for fucking what. And the buttons that change the number of gens are like 10 pixels wide now, you have to really squint to hit them
>>
Saar, do the needful.
>>
>>108369015
buetiful desi lady, open bobs
>>
can anyone recommend the best text-based editing model that can run on a 4070?
>>
>>108369015
jealous of real patriots being able to appreciate their women?
>>
>>108369010
unc needs glasses
>>
>>108369040
klein 4b probably, 9b might work quantized or for small images
>>
>>108369088
how much do quants hurt performance?
on LLMs Q5 is usually acceptable but idk how it would go for images
also can you recommend me a foolproof ui?
>>
>>108369099
I run 9B on fp8/q8 on 12GB just fine
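Back-of-the-envelope numbers for why that fits, as a sketch only. It assumes a nominal 9e9 parameters (taken from the "9B" name, not measured) and counts weights alone; the 8.5 bits/weight figure for Q8_0 comes from the GGUF block layout of 32 int8 weights plus one fp16 scale:

```python
def weight_gib(params, bits_per_weight):
    """Approximate size of the weights alone, in GiB."""
    return params * bits_per_weight / 8 / 2**30

PARAMS_9B = 9e9  # nominal parameter count, assumed from the "9B" name

sizes = {
    "bf16": weight_gib(PARAMS_9B, 16),
    "fp8": weight_gib(PARAMS_9B, 8),
    # GGUF Q8_0: 34 bytes per block of 32 weights = 8.5 bits per weight.
    "q8_0": weight_gib(PARAMS_9B, 8.5),
}

for name, gib in sizes.items():
    print(f"{name}: {gib:.2f} GiB")
```

The text encoder, VAE, and activations come on top of this, so the 8-bit variants leave some headroom on a 12GB card while bf16 does not fit without offloading.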
>>
>>108369099
q8 are pretty much lossless, after that it gets bad real fast.
>>
>>108369106
thanks
>>108369111
thanks and checked
>>
>>108369111
>q8 are pretty much lossless
I've found this doesn't hold true for the text encoders part sadly, bf16 -> Q8 produces wildly different images, while Q8 -> fp8 looks almost the same.
>>
so is there anything other than comfyui that 'just works' for prompt/chat based editing
>>
>>108369182
look at the op retard
or ask your favourite LLM
>>
>>108369182
no, nothing here "just werks" but your best bet is wangp or forge neo
>>
>>108367428
>Away for a month
>Rely on Nano Banana Pro's prompt understanding for professional photo editing work
>Flux 2 Klein comes and destroys it with its VAE, but it's just a tad bit behind in prompt understanding and is nowhere near in text
>Check news now
>Qwen Image 2
>Quality wise, it's practically Flux 2 Klein with all the flaws fixed.
>Destroys NBP/2.
>But it's not out yet

Are we getting Chinese cultured again? They have a 7B model better than NBP on their hands. This is significantly worse than Z base not releasing.
>>
>>108369252
Mind you, Flux 2 Klein is not bad as 2nd pass for NBP, or just prompt engineering the crap out of it (I'm surprised it understands some things so well), but its architecture was still a bit lackluster for prompt following. Hopefully BFL steps it up with Flux 3.
>>
File: 1772654900396823.jpg (36 KB, 1417x157)
>close bug reports
>dont fix them
COMFY YOU AND YOUR TEAM ARE ACTUAL FUCKING GORILLAS
https://github.com/Comfy-Org/ComfyUI/issues/12896
https://github.com/Comfy-Org/ComfyUI/issues/12918
>>
>>108369311
You can always use forge neo or sd.cpp
https://github.com/Haoming02/sd-webui-forge-classic
https://github.com/leejet/stable-diffusion.cpp
>>
>>108367507
This picture deserves to be in the highlights. This is what /ldg/ was created for. Thanks anon, your post gave me local models testosterone
>>
If I upgrade my 32gb ram to 64gb will I notice any difference when generating pics? ddr4 with 12gb vram if that matters
>>
>>108369380
No, but you can store more models in the RAM and do more complex workflows
>>
File: file.png (2.15 MB, 1200x1200)
>wow there's an official brightness contrast blueprint now that uses glsl
>can only turn up brightness
yes, very good, no one ever needs to make things darker

also
>no gamma node
come on, jack, get your shit together
>>
>>108369399
>do more complex workflows
Nice. Thanks, bro.
>>
>>108369402
oh nevermind, there's a levels blueprint. i take it all back.
>>
>>108368943
Klein 9B just got a version that supports KV caching for speed.

Anima for anime-style. A second preview release just came out.
>>
Just got a BSOD running comfy. Second time this happened now, first time was like a month ago. And I've been using it for over a year and everything was fine before that
>>
File: GMVEPNpawAA_ZVx.jpg (229 KB, 1073x1532)
Any anime bros here that could help me with a question? What's with this mental illness? Quasarcake clearly has real talent, yet spends his time and energy drawing secondary characters from that dungeon food slop anime and tons of Frieren and gacha slop girls that isn't even his OC.
What makes an artist stoop this low? Please tell me he's just doing it for money and has actual original work somewhere, right? Indeed, thanks to him I'm gonna make a lora for ZiB, what a faggot.
>>
>>108369470
it's this sort of shit that gets you #1 on the daily pixiv rankings
>>
>>108369470
Artists can only survive if they leech off of popular IPs. Nobody gives a shit about their OCs
>>
>>108369470
Draw popular thing -> more updoots
>>
File: 00133-4221361199.jpg (231 KB, 896x1152)
>>108367476
Prompt scheduling (turning artists on/off after x steps, or switching) seems to be OK.

@nanaken nana, [@sky-freedom,@shengtian::0.0625], [@timbougami:0.625], [realistic:0.875],

I see a strong influence of nanaken nana here and a subtle one of timbougami, very little of the others because muh wildcards switched them off almost immediately.
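For anyone unfamiliar with the bracket syntax: `[to:when]` adds a term after `when`, and `[from::when]` removes it after `when`. A rough sketch of how `when` maps to a step, assuming the A1111/Forge-style rule where a value below 1 is a fraction of the total steps and a value of 1 or more is an absolute step number. The function name is made up, the 32-step count is only an example, and the exact rounding in a given UI may differ:

```python
def switch_step(when, total_steps):
    """Step at which a prompt edit takes effect (assumed A1111-style rule)."""
    step = when * total_steps if when < 1 else when
    return min(total_steps, int(step))

total = 32  # example step count, not taken from the gen above
removed_at = switch_step(0.0625, total)   # [@sky-freedom,@shengtian::0.0625]
added_at = switch_step(0.625, total)      # [@timbougami:0.625]
realistic_at = switch_step(0.875, total)  # [realistic:0.875]
```

Under that rule the first two artists are gone after a couple of steps, which matches how little of them shows up in the result.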
>>
>>108369470
nobody gives a fuck about OCs and neither should you
>>
File: FWWJ4LmaMAAxgho.jpg (284 KB, 1191x1428)
>>108369477
>>108369485
>>108369490
As expected, he does this anime shit for money. Pic related is a vtuber, I imagine he probably hates this stuff and it's not what he enjoys drawing.
>>
>>108369510
Artists don't really care about the subject matter, it's not about who you draw it's about how you draw. Most artists have spent years drawing random sculptures for practice, they're unfazed by that shit
>>
File: FJsbChBaQAspEQn.jpg (376 KB, 1532x2118)
>>108369508
>and neither should you
Yes I should give a fuck, because if his anime slop commissions are this good, his OCs must be 100 times better, meaning a better dataset for my lora.
>>
so codex can just one shot nodes now huh
>>
File: xyz_grid-0003-4221361199.jpg (785 KB, 2464x2809)
fun with style mixing
>>
Is tiled diffusion not working with anima currently? I'm getting
>ValueError: too many values to unpack (expected 4)
So is USDU the only way to add details while upscaling for anima? I tried upscaling with another model like wai/chenkin, but they melt every face in the background and eyes in the foreground even though they were ok at first with anima
>>
grok is based? wtf
https://gofile.io/d/Zadtyn
>>
>>108369460
yeah it's totally comfy's fault that your computer is fucked
>>
>>108369656
nigga grok was generating full on animated porn at one point
>>
>>108369652
>ValueError: too many values to unpack (expected 4)
I'm guessing this is due to the vae as sdxl's is 4 channel but anima's is 16
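Worth noting that this exact message is what Python raises when a shape with more than four entries is unpacked into four names, so another hedged guess is that the node does something like `b, c, h, w = latent.shape` and the latent carries an extra dimension. A sketch under that assumption; the 5-entry shape below is hypothetical and not confirmed for Anima:

```python
def unpack_image_latent(shape):
    """Mimics code written for 4D image latents: batch, channels, height, width."""
    b, c, h, w = shape
    return b, c, h, w

# A 4D shape unpacks fine regardless of the channel count.
four_dim = unpack_image_latent((1, 16, 128, 128))

try:
    # Hypothetical 5D latent with an extra frame axis.
    unpack_image_latent((1, 16, 1, 128, 128))
    message = None
except ValueError as err:
    message = str(err)

print(message)
```

A channel count going from 4 to 16 keeps the shape at four entries, so on its own it would not trigger this particular unpacking error.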
>>
Heh, so CLIP was important for mixing artist styles after all? What's their solution to the CLIP problem? None?
If Anima can't mix artist styles and weights in 2026, it's DOA. A 2B-param model with only a 0.6B text encoder that's rigid on styles and isn't built for weighting? It's literally over.
We'll all watch it flop.
>>
>>108369664
he's fudding, ignore
>>
>>108369718
nai can't mix styles
>>
>>108369652
You need to decode and encode. I also got good results with Anima by increasing the CFG by 1.5x during hires fix. SDXL is also a good option for hires fix with Anima.
>>
>>108369650
Thanks tdrussell
>>
havent been here for weeks
has flux klein character lora training been fixed yet?
>>
>>108369718
As Juggernaut showed us, SDXL and CLIP were never the problem since Juggernaut handles multiple characters and complex scenes just fine. The real issue was always the slop surrounding it, overfitted 1girl dataset slop and especially dogshit captioning, with trial and error shitmixes being the cherry on top of the turd cake.
>>
>>108369781
>since Juggernaut handles multiple characters and complex scenes just fine
lmao?
>>
>>108369781
lol
>>
>>108369796
Juggernaut is a perfect example of a good for all non overfitted model. The face details are like any SDXL model, easily fixed with Adetailer.
>>
>>108369823
oh anon, you're such a joker
>>
>>108369311
>>108369460
Shame that the raped dev poisoned the well so much that posts like this don't get taken seriously
>>
>>108369311
comfyui has many problems right now. as i've already said, they only care about the api. it's over for real
>>
>>108369978
g-give me nodes 2.0
>>
>>108369664
Everything worked perfectly for a year, I didn't change my PC did I. I now rolled back to a version from October 2025 and it works perfectly again. I'm never touching a newer Comfy, I'm just sticking with this from now on. Nice to have all the buttons back too
>>
i'll just wait for nodes 3.0
>>
>>108370105
That could’ve been a software issue with your drivers/OS or whatever or your hardware could be failing
>>
>>108370126
he pulled
>>
File: 00226-1284552179.png (1.68 MB, 1080x1920)
>>108368384
Anima does, however, know Hugo Boss's finest.
>>
>>108369650
what do you think about fidelity and interpretation between tags and Danbooru images?
>>
>>108370126
>umm it's not comfy's fault that it can't work with up to date drivers and windows updates
Isn't it though
>>
>>108370213
>windows updates
unironically a problem here. stop updating wangblows
>>
>>108367428
Complete newbie here - just installed sdnext (the only UI that managed to install itself with rocm support) and experimenting with Flux2 klein base 4B.

I want to generate isometric spritesheets - do you have model/workflow recommendations? The results I'm getting haven't been very good so far.
>>
>>108367438
GO. AWAY
>>
>>108370212
I see Anima as a contemporary Illustrious, but it's still lacking. I can't match the texture, line work, colors I get with Noob yet, Noob is very organic.
>>
File: 4534534564.jpg (1.3 MB, 1810x1035)
>>108369677
I tried with an sdxl vae just for the sake of it, and yeah
>The size of tensor a (4) must match the size of tensor b (16) at non-singleton dimension 1
Seems like the repo has not been updated for a year, so it's pretty much dead for any cosmos support
>>108369740
Oh right, forgot about hiresfix
Hands are still terminally broken, but at this point I can only pray the final version does better with details.
>>
>>108370266
gooning to your gen rn
>>
>>108370266
4gb vram here, ooming seeing your gen
>>
>>108370229
I'd recommend the distill klein since base is kinda eh. As for workflow - if it's not editing then just prompt it. The newer models can understand complex prompts. There was some anon here who was doing spritesheets but idk what model he used. And civit maybe has a lora for it.
>>
Why is Klein such a snowflake that it needs its own KSampler?
>>
>>108370266
catbox?
>>
>>108370397
no?
>>
>>108370397
it does?
>>
File: 00017-108396027.png (3.86 MB, 1440x2560)
>>108370397
yes?
>>
>>108370457
>>108370464
>>108370475
Yes, it needs its own set of custom Flux 2 nodes, one for CFG, another for steps, another for the sampler, another for sigmas. It’s a snowflake model.
>>
new to video models. when using Wan i2v, what can cause the initial image to disappear and get replaced by a "straight t2v" version of the prompt? it doesn't persist and combine with the prompt as expected.
>>
>>108370497
nope, that's just a workflow default. you can use it with whatever Ksampler
>>
>>108370524
Are you using both models or just rawdoggin the high noise?
>>
i've tested about 9 different female characters of varying degrees of popularity, Anima got every single one of them nearly 1:1. Liru's titty belt thingy wouldn't gen right but that's it. Very impressive stuff for once again a model only in like v0.1 and 0.2.

>>108370497
>>108370531
i have no idea if it really needs all that or not. personally i just ran flux 2 with that bullshit included because i figured cumfart's default workflow was the way to go.
>>
>>108370565
I used only the high-noise version after seeing a few decent outputs from it. suspected that was the reason. will try to make loading and offloading work then, thanks
>>
>>108370596
Assuming you are using 2.2, it really doesn't like using a single model for I2V. T2V is fine. Also this should still work and is better than just chaining two samplers https://github.com/stduhpf/ComfyUI-WanMoeKSampler

I haven't used wan in comfy in a while tho so I can't guarantee it.
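The idea of switching models by noise level instead of a hard-coded step can be sketched like this, assuming the high-noise model handles every step whose sigma is at or above some boundary and the low-noise model takes the rest. The 0.85 boundary and the linear schedule are placeholders, not values taken from the linked repo:

```python
def split_by_boundary(sigmas, boundary):
    """Index of the first step that should go to the low-noise model."""
    for i, sigma in enumerate(sigmas):
        if sigma < boundary:
            return i
    return len(sigmas)

steps = 10
# Placeholder schedule: sigma falls linearly from 1.0 toward 0.1.
sigmas = [1 - i / steps for i in range(steps)]

switch = split_by_boundary(sigmas, boundary=0.85)
high_noise_steps = list(range(switch))         # handled by the high-noise model
low_noise_steps = list(range(switch, steps))   # handled by the low-noise model
```

With a different scheduler or shift the switch lands on a different step, which is the argument for deriving it from the sigmas rather than chaining two samplers with fixed step counts.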
>>
Why is Anima v2 garbage
>>
>>108370660
it's based on anima v1 which was garbage
>>
>>108370660
it's based on anime which is garbage
>>
>>108370660
35 stars status?
>>
File: Anima_00373_.png (1.31 MB, 832x1296)
>>
>>108370660
it's based which is garbage
>>
>>108370704
>>108370475
>>108370266
Can anima make true blacks? (not BBC)
>>
>>108370760
>not BBC
Why not >:^(
>>
File: wai_vs_anima1.png (3.01 MB, 1792x1152)
>see anon's post from yesterday about WAI vs Anima
>take an example prompt from the WAI Civit page
>masterpiece, best quality, Cartethyia \(wuthering waves\), Cartethyia teen style clothes, small blue and white crown of thorns, Cartethyia white dress, black forehead mark, 1girl, blonde hair, pointy ears, long hair, blue eyes, braid, cowboy shot, abstract background, water, blue theme
>left: WAIv16
>right: Anima Preview2
WAIsisters, our response?
>>
>>108370808
WAIt
>>
>>108370660
made by an incompetent hack who decided to spend most of his budget on shills instead of training the model. v1 was garbage to begin with, people are running low on copium and finally see how bad it really is
>>
>>108370808
I think left has better composition
>>
>>108370760
It can!
>the second i prompt a vanta black character it breaks the gay cuck 4mb file limit of this gay cuck site
many such cases.
>>
File: image.jpg (221 KB, 2496x832)
>>108370813
>>108370808
Testing background no humans.
Left Noob, right Anima
Animasister, our response?
>>
>>108370827
>Animasister, our response?
their response has always been to deflect and continue shilling
>>
>>108370808
Don't know the character, but both have different clothes, which is closer to the original?
>>
>>108370838
wai. anima gen is a literal who
>>
>>108370827
Why does Noob have so much sovl? Did you prompt that hole in the wall, or was it a blessing from Laxhar?
>>
>>108370822
composition in anima does suck. it's like a dumbed down wai with a better vae. simpler to prompt for, though.
and i'm a non-booru tag prompting chroma shill.
>>
File: 121050563.jpg (240 KB, 1280x720)
>>108370838
>Cartethyia \(wuthering waves\
Anima
>>
>ani is awake
here we go again
>>
>>108370793
he wants VantaBC
>>
>>108370848
Blessing from Laxhar. I'm hoping to never move away from Noob and just wait for a local SOTA editing model. Maybe it’ll be ZetaOmni? I don’t know, but Noob has too much SOVL to be discarded.
>>
>>108370851
The VAE is barely worth it considering how bad it is at upscaling. After regressions of 2nd preview I decided to go back to wai and noob, will be waiting for a better model.
>>
>>108370822
>>108370851
>"small blue and white crown of thorns"
>WAI makes the background a literal crown of thorns because CLIP bleeds every part of the prompt into every other part
>better composition
>>
>>108370808
Both are WAI faggot. tf are you talking?
>>
>>108370878
wai looks better, mr. number one anima fan
>>
How long until somebody drops the ultimate z-image-base finetune that destroys both the sdxl slop and anima?
>>
[unsubscribe]
>>
>>108370878
which isn't a feature unique to anima, at least. models are products of their times.
and if you've used anima, you know that composition is hard to control and kind of sucks. probably a third of my gens beyond a certain prompt length lose adherence, cutting off something i've explicitly prompted for, even when i add negatives to reinforce.
>>
>knowing characters and details doesn't actually matter
>WAI has better composition because it put some random shit in the background
>WAI looks better because, well, it just does, okay?
>the one on the right isn't even Anima, you're lying!
holy fucking cope
>>
>>108370827
Why anima so opaque?
>>
>>108370906
honestly it's appalling. there's no benefit to using an llm te when it's so fucking small. might as well stay on clip, at least it can mix artists
>>
>>108370915
>there's no benefit to using an llm te
This. Anima has been a huge disappointment so far.
>>
>>108370660
>>108370827
>>108370887
You are such a retarded rape victim
>>108370890
Hkw long until you hanh yourself, Julien?
>>
File: anima_00134_.png (884 KB, 1024x1024)
>>108370878
>>108370906
what anima thinks is a full body shot, with a neg of 'face out of frame'.
>>
>>108370934
What the fuck
>>
>>108370934
an issue of skill, perhaps?
>>
>>108370949
thisss you just need to prompt booru tags harder, anima can never be at fault, it is NOT a shitbake made by a retarded scammer
>>
File: chenkinsloppa.png (2.3 MB, 840x1256)
>>108370808
C-chenkin RF bross
>>
Still no comparisons of this supposed superior artist mixing? Shame.
>>
>>108370957
Take it out of the microwave, anon, it’s melting.
>>
Day 4 of Anima. As a regular user (idk how to train) I think I'm just going to drop it, at least for the WIP checkpoints. It has some interesting things it can do already, but it's also still missing a ton of things. Full version + controlnet might make it superior to Noob, but for now I'll stick with Noob personally, especially as a lot of its issues can be mitigated with CN and its pile of LoRAs.
>>
Okay thanks for keeping us updated 'anon'
>>
>update comfyui
>some workflows do not work anymore for some reason
>inference oom's all the time
>can't even goon properly
Why is code by the fennec faggot so bad? Does he not test his shit before it lands in the main branch?
>>
>>108370986
considering the fact that it's regressing i don't think it's worth it to wait. anima is a dead end with its licensing to begin with so its a good thing that people are throwing it into the trash
>>
>>108370959
no because anon is just trolling
>>
>>108371023
>anima is a dead end with its licensing to begin with
Huh, it's not like SD?
>>
totally not fudding btw
>>
>>108371010
so is this the daily FUD? yummy!
>>
qrd on 35 stars?
>>
File: funding_secured.jpg (807 KB, 1668x2505)
>>108371023
>>
>go look at my SD 1.5 gens again
>reminded that even after all this time and all these models, I still was never able to get back to that style
Damn maybe I really do just have to learn how to train.
>>
>>108371065
lold
>>
>>108370949
probably, but it shouldn't completely disregard composition tags like shot and angle because of complex prompting. the gens are pretty nice, but there's effectively no control.
>>
>>108367507
based
>>
>>108371065
i don't get it. is this some discord meme reference?
>>
>legitimate criticism is fud
pajeet shill behavior for pajeet model, truly
>>
>>108371094
Forgetroons are so alienated from their own gens that I feel deep and genuine pity for them.
>>
File: 35453456.jpg (698 KB, 651x1147)
>>108370808
To be fair anima quality tags aren't exactly the same as wai, and it seems that they really affect the result a lot
>>
Can comfy do x/y plots where you can switch both VAE and text encoder when switching checkpoints? Unfortunately, forge cannot.
>>
>>108371105
You don't know about GitHub? What are you doing on this board?
>>
>>108371085
you should learn how to train, it's not as complicated or time consuming as it seems.
there are no downsides, you get a better understanding of how the model works, you get away from generic 1girl gens, you get better control over your gens, and if you are genning or training on older models like sdxl you can do it locally in a few hours.
>>
>>108371085
This >>108371160 and also, training Anima is way cheaper than training other models because it’s part of the ComfyORG ecosystem. There are deals on ComfyCloud when you want to train models that are part of ComfyORG.
>>
at least put some effort into it faggot
>>
>>108370212
I would say it's more accurate, just different from what you may be used to if you've been doing SDXL anime for a bit.

It took me a lot of study to get similar results out of anima that I used to get with Noob vpred, and I didn't really like it at first. Blending artist styles is not quite the same, and BREAK keywords just don't work at all.
>>
not only is anima shit but its license is also garbage
why would anyone use it?
>>
The Raped and The Damned
>>
only jeets care about loicense howeverbeit
>>
>>108371112
Enjoy spending over 6 million hours dicking around with settings spaghetti, I guess
>>
>>108370873
Now that's a completely legit criticism. I just completely gave up on using anima for upscaling and still use Noob for the hires pass.
>>
I like how there's an influx of trolling here, it subsides, and then there's an influx of gens posted by one or two anons in SDG
>>
>>108371447
Thanks for pointing out /adt/ raider
>>
File: 1763987444999193.png (3.89 MB, 1216x1824)
>>108370873
>>108371338
>considering how bad it is at upscaling.
i disagree
>>
>>108371469
>>108370679
>>
>>108371513
>being baited by mr 'im gonna make apache anima XD'
lol, lmao even
>>
>>108371469
And why not some /sdg/ troll?
>>
File: aaa.png (173 KB, 1710x1107)
how do i apply a lora in image to video for wan in comfyui? i haven't found a simple answer on any website or youtube, they all want me to use their own workflow, which i'd have to pay for on patreon
>>
>>108370808
>WAIsisters, our response?
i fucking love women's thighs
>>
>>108371747
open the subgraph, copy paste the load lora node and just chain them together
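if it helps to see the wiring written out, here's a rough sketch of the same chain in ComfyUI's API-format json, built from python. assumptions: the node ids and the filenames are made up, and i'm using the core LoraLoaderModelOnly node (model in, model out). your template may use a different loader, so treat it as an illustration and not as the actual wan template.

```python
def chain_loras(workflow, model_ref, loras, first_id=100):
    """Append one LoraLoaderModelOnly node per LoRA, each fed by the previous one.

    workflow  : dict in ComfyUI API format ({node_id: {"class_type", "inputs"}})
    model_ref : [node_id, output_index] of the node that loads the base model
    loras     : list of (filename, strength) pairs
    Returns the reference the sampler's "model" input should point at.
    """
    ref = list(model_ref)
    for offset, (name, strength) in enumerate(loras):
        node_id = str(first_id + offset)
        workflow[node_id] = {
            "class_type": "LoraLoaderModelOnly",
            "inputs": {
                "model": ref,              # output of the previous node in the chain
                "lora_name": name,
                "strength_model": strength,
            },
        }
        ref = [node_id, 0]                 # the next node reads from this one
    return ref


# Hypothetical base-model node and LoRA filenames, for illustration only.
workflow = {
    "1": {
        "class_type": "UNETLoader",
        "inputs": {"unet_name": "wan_i2v.safetensors", "weight_dtype": "default"},
    }
}
final_ref = chain_loras(
    workflow,
    ["1", 0],
    [("hip_move.safetensors", 1.0), ("style.safetensors", 0.6)],
)
```

whatever used to take the model straight from the loader (usually the sampler) should take final_ref instead. in the graph editor that's just dragging the last lora node's model output into the sampler.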
>>
>>108371747
im so sorry anon
>>
>>108371747
>>108371737
>>108371710
Also I'd recommend right clicking the big subgraph node and clicking unpack. Comfy for some reason started hiding starter templates inside subgraphs.
>>
Very curious why this is only happening with Anima. Not with the other dozens of models in between with adjacent or worse license, or worse. Just anima.
>>
retard here
i'm retarded
>>
Why do my 1girls look disgusting
>>
>>108371752
wow. thank you friend. i have been trying to do this for days now
>>
>>108371814
no problems broski, be sure to post your gens here as a payment though
>>
File: 1772532134010689.jpg (760 KB, 1312x1632)
>>
File: 1758091080119014.jpg (692 KB, 2272x960)
>>
File: 1752743277081490.jpg (967 KB, 2016x1120)
>>
File: 1757192786395773.png (3.93 MB, 1312x1632)
my gay gf
>>
File: output.webm (644 KB, 320x640)
>>108371832
just testing the hip move lora. took me ages to convert the mp4 to webm
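for next time, a minimal sketch of the conversion as a python wrapper around ffmpeg. assumptions: ffmpeg is on your PATH and built with libvpx-vp9, the crf value is a guess you'll want to tune for the file size limit, and the function names are mine.

```python
import subprocess


def webm_command(src, dst, crf=32, width=None):
    """Build an ffmpeg command that re-encodes src to a VP9 webm without audio."""
    cmd = [
        "ffmpeg", "-y",
        "-i", src,
        "-c:v", "libvpx-vp9",
        "-crf", str(crf),   # quality target, lower means better quality and bigger file
        "-b:v", "0",        # constant-quality mode for libvpx-vp9
        "-an",              # drop the audio track
    ]
    if width is not None:
        # scale to the given width, keep aspect ratio, force an even height
        cmd += ["-vf", f"scale={width}:-2"]
    cmd.append(dst)
    return cmd


def convert(src, dst, **kwargs):
    """Run the conversion and raise if ffmpeg exits with an error."""
    subprocess.run(webm_command(src, dst, **kwargs), check=True)


# Example (not run here): convert("output.mp4", "output.webm", crf=36, width=320)
```

if the result is too big, raise crf or shrink the width before anything else.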
>>
>>108371649
the silence is deafening
>>
File: 1748162567758307.png (3.92 MB, 1728x1248)
pits
>>
File: 1746156376114258.jpg (775 KB, 1312x1792)
>>
>>108372252
fix toes >:^(
>>
File: Flux2-Klein9B_00026_.png (1.6 MB, 688x1520)
>>
>>108371105
It's petra.
>>
i have two finished images for my dataset so far, bet they still help a lot for the lora, really cuts down editing
>>
I have this bad feeling that a Klein 9B Chroma tune will be so similar to Flux-based Chroma that it will effectively be a nothingburger when it drops
>>
>>108370266
wait that image was genned with anima?
>>
>>108370838
kek
>>
File: 1762882371652786.png (277 KB, 1163x1602)
I've been cut off from so much shit, like chub.ai is blocked in Australia for example, and now this. Is there such a thing as a trustworthy vpn company?
>>
>>108373120
unless you need to hop around just make your own VPN with a cheap VPS
>>
>>108373120
none of them are trustworthy when it comes to handing over your logs on government request, but you can pirate over a vpn just fine
>>
>>108373120
The internet is so lame right now.
>>
cozy infinity
>>
what do you guys use to manage lora datasets and captions? i am using visual studio code, but there must be something made for this, like if i click on one caption tag it should show me all the images in the dataset that use it or something
>>
>>108373791
hydrus network maybe? that's what i see recommended a lot when it comes to image management
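if you only want the "click a tag, see the images" part without installing anything, here's a rough sketch. assumptions: kohya-style layout where each image has a sidecar .txt with comma-separated tags next to it, lowercase image extensions, and the function names are mine.

```python
from collections import defaultdict
from pathlib import Path

IMAGE_EXTS = (".png", ".jpg", ".jpeg", ".webp")


def find_image(caption_file):
    """Return the image that shares the caption file's stem, or None."""
    for ext in IMAGE_EXTS:
        candidate = caption_file.with_suffix(ext)
        if candidate.exists():
            return candidate
    return None


def build_tag_index(dataset_dir):
    """Map each tag to the list of images whose caption contains it."""
    index = defaultdict(list)
    for caption_file in sorted(Path(dataset_dir).glob("*.txt")):
        image = find_image(caption_file)
        if image is None:
            continue  # caption without a matching image, skip it
        text = caption_file.read_text(encoding="utf-8")
        # split on commas, trim whitespace, drop empties and duplicates
        tags = {t.strip() for t in text.split(",") if t.strip()}
        for tag in tags:
            index[tag].append(image)
    return dict(index)


# Example (not run here):
#   index = build_tag_index("dataset/")
#   for path in index.get("full body", []):
#       print(path)
```

sorting `index` by `len()` of each list also gives a quick view of which tags are rare or misspelled.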
>>
File: 1748467783594097.png (2 MB, 1216x1824)


