[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>108347136

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
anima preview2 status?
>>
>mfw Resource news

03/11/2026

>anima-preview2.safetensors
https://huggingface.co/circlestone-labs/Anima/tree/main/split_files/diffusion_models

>Reviving ConvNeXt for Efficient Convolutional Diffusion Models
https://github.com/star-kwon/FCDM

>BinaryAttention: One-Bit QK-Attention for Vision and Diffusion Transformers
https://github.com/EdwardChasel/BinaryAttention

>QUSR: Quality-Aware and Uncertainty-Guided Image Super-Resolution Diffusion Model
https://github.com/oTvTog/QUSR

>InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing
https://github.com/OpenGVLab/InternVL-U

>SD Forge Nvidia VFX: VideoSuperRes extension for Forge Neo implements
https://github.com/Haoming02/sd-forge-nvidia-vfx

>Nvidia_RTX_Nodes_ComfyUI: "RTX Video Super Resolution"
https://github.com/Comfy-Org/Nvidia_RTX_Nodes_ComfyUI

>Gemini Embedding 2: Natively multimodal embedding model
https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-embedding-2

03/10/2026

>HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising
https://jacky-hate.github.io/HiAR

>Scaling Test-Time Robustness of Vision-Language Models via Self-Critical Inference Framework
https://github.com/KaihuaTang/Self-Critical-Inference-Framework

>FVG-PT: Adaptive Foreground View-Guided Prompt Tuning for Vision-Language Models
https://github.com/JREion/FVG-PT

>Kaleidescope - Index, Search, Invoke Comfyui Workflow
https://github.com/svenhimmelvarg/kaleidoscope

>App Mode, App Builder, and ComfyHub
https://blog.comfy.org/p/from-workflow-to-app-introducing

03/09/2026

>DiffiT: Diffusion Vision Transformers for Image Generation
https://github.com/nvlabs/diffit

>Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis
https://bfl.ai/research/self-flow

>MatAnyone 2: Scaling Video Matting via a Learned Quality Evaluator
https://pq-yang.github.io/projects/MatAnyone2
>>
>mfw Research news

03/11/2026

>Streaming Autoregressive Video Generation via Diagonal Distillation
https://arxiv.org/abs/2603.09488

>Prompt-Driven Color Accessibility Evaluation in Diffusion-based Image Generation Models
https://arxiv.org/abs/2603.09832

>FrameDiT: Diffusion Transformer with Frame-Level Matrix Attention for Efficient Video Generation
https://arxiv.org/abs/2603.09721

>When to Lock Attention: Training-Free KV Control in Video Diffusion
https://arxiv.org/abs/2603.09657

>SODA: Sensitivity-Oriented Dynamic Acceleration for Diffusion Transformer
https://arxiv.org/abs/2603.07057

>CogBlender: Towards Continuous Cognitive Intervention in Text-to-Image Generation
https://arxiv.org/abs/2603.09286

>Component-Aware Sketch-to-Image Generation Using Self-Attention Encoding and Coordinate-Preserving Fusion
https://arxiv.org/abs/2603.09484

>TIDE: Text-Informed Dynamic Extrapolation with Step-Aware Temperature Control for Diffusion Transformers
https://arxiv.org/abs/2603.08928

>RubiCap: Rubric-Guided Reinforcement Learning for Dense Image Captioning
https://arxiv.org/abs/2603.09160

>Prune Redundancy, Preserve Essence: Vision Token Compression in VLMs via Synergistic Importance-Diversity
https://arxiv.org/abs/2603.09480

>The Coupling Within: Flow Matching via Distilled Normalizing Flows
https://arxiv.org/abs/2603.09014

>Training-Free Coverless Multi-Image Steganography with Access Control
https://arxiv.org/abs/2603.09390

>IntroSVG: Learning from Rendering Feedback for Text-to-SVG Generation via an Introspective Generator-Critic Framework
https://arxiv.org/abs/2603.09312

>Evolving Prompt Adaptation for Vision-Language Models
https://arxiv.org/abs/2603.09493

>Reasoning-Oriented Programming: Chaining Semantic Gadgets to Jailbreak Large Vision Language Models
https://arxiv.org/abs/2603.09246

>B-DENSE: Branching For Dense Ensemble Network Supervision Efficiency
https://arxiv.org/abs/2602.15971
>>
It was nice to have one non-drama thread but it seems those rentry links are just too important for catjack's ego
>>
I have my new 285k and 5080 I'm ready to party fellas. Do you guys containerize your ai workflows or just raw dog it on the host? Should I jump straight into comfyui
>>
>>108351290
Anon gave me the impression there was very little difference between versions but that does not seem to be the case.
>>
>>108351283
>tranime infested collage
>>
>>108351355
you're on anime website btw
>>
File: ComfyUI_temp_nrhbq_00001_.png (2.47 MB, 1152x1920)
2.47 MB
2.47 MB PNG
death to anime
>>
>>108351283
Thanks for restoring the OP.

>>108351336
I use venv/conda, but that's it.

>>108351290
>>108351338
>tfw can't test it today
Maybe tomorrow.
>>
>>108351371
What is conda?
>>
>>108351336
Don't know about all that container shit, I have a linux install i use for local ai and not much else, two venvs because qwen doesn't like sage or something i don't really understand. You might want to think about docker or something if you're using lots of custom nodes and shit on your everyday computer.
>>
>>108351386
another variant on a python venv type of thing

python package management almost does not work system or user account wide because too many pieces of python have incompatible dependencies and requirements. so people came up with methods to contain them one environment at a time.

personally (different anon) i'm mainly using uv to manage pip packages and the venv they get installed in
>>
File: ComfyUI_temp_qvfoq_00001_.png (2.95 MB, 1920x1200)
2.95 MB
2.95 MB PNG
>>
>>108351336
you could certainly set up a container with podman/docker [-compose] (or if you are that kind of a madman, k8s) that equally updates stuff from git, but i am just using venv
>>
>>108351427
I see,I've never bothered to learn about conda or docker or whatever. Seems too complicated.
I just use venv and some bash aliases most of the time. My bar isn't that high.
Would probably be a pain in the ass if I did some real dev stuff and managing multiple python versions and whatnot.
>>
File: Flux2-Klein_00014_.png (1.09 MB, 976x992)
1.09 MB
1.09 MB PNG
>>
File: Anima2_00042_.png (1.3 MB, 1152x896)
1.3 MB
1.3 MB PNG
>>
>>108351620
this is definitely seedream or some shit
>>
File: deBU_zi_00035_.png (2.11 MB, 1536x922)
2.11 MB
2.11 MB PNG
>>
how does one add loras to

>Wan2GP: https://github.com/deepbeepmeep/Wan2GP

it downloaded successfully and it does work, but i would like it to be uncensored if possible. thanks!
>>
File: what.png (42 KB, 580x460)
42 KB
42 KB PNG
>>108351283
When I have 4chanX activated on Tapermonkey, it's hiding ani's rentry somehow, it didn't happen before until yesterday lool
>>
great posts as expected of a ranbake
>>
File: ComfyUI_04723_.png (1.01 MB, 1024x1024)
1.01 MB
1.01 MB PNG
>>
>>108351792
>everyone I dislike is ran
obsessed
>>
I see the two rentry links at the end of the OP as well as ran's slop shit in the collage so I do declare with utter certainty that this is a ranbake
>>
>>108351812
me in the tub
>>
File: 1763916160475780.png (1.49 MB, 1351x943)
1.49 MB
1.49 MB PNG
https://huggingface.co/InternVL-U/InternVL-U
can't believe they thought it was a good cherry picked image lmao
>>
https://www.wired.com/story/nvidia-investing-26-billion-open-source-models/
Nvidia will save us!
>>
post fp16_accumulation X/Y/Zs
>>
>>108351913
i'm not your monkey
>>
>>108351873
it's going to be nothing but bloated underperforming LLMslop
>>
>>108351964
Surely they'll put another million into anima.
>>
>>108351964
Anima is good.
>>
>>108351964
>bloated
surely they learned from the z team.... right?..... RIGHT?
>>
File: tmpcg__jj1s.png (668 KB, 896x1152)
668 KB
668 KB PNG
>>108351984
>>
>>108351873
This is grim, actually. Nvidia is afraid of competition.
>>
>>108352003
kino
>>
File: tmpcy91b3rs.png (730 KB, 896x1152)
730 KB
730 KB PNG
>>108352018
Preview 2 is pretty good at spitting these out, just put your usual negative as the prompt + what you actually want the image to be and best quality etc as negative.
>>
anime model btw, good thing he included that ye pop dataset huh
https://files.catbox.moe/kp3d4r.jpg
https://files.catbox.moe/fzlsre.jpg
>>
>>108352090
I didn't know it could do realistic shit, it looks more realistic than Qwen Image lool
>>
File: 1769482285281913.png (3.04 MB, 1216x1824)
3.04 MB
3.04 MB PNG
>>108351290
pretty good
>>108352003
kewel gen anon
>>108352059
i love doing that picunrel. i bet using the deviantart tag would push it even further into kinosovl
>>
>>108352105
Is Anima preview 2 finally creating an atmosphere of slight glasnost?
>>
>>108352090
one good realism lora or light finetune and this thing curbstomps chroma using less than 1/3 the parameter count
what is kekstone even doing
>>
>>108352127
>what is kekstone even doing
gathering some more dog dicks drawings from e621 duh!!
>>
>>108352142
yeah chroma still mogs as long as it still knows farting_on_prey while anima doesn't
>>
>>108352157
pretty sure you can do that with anima
just... write
>>
File: ComfyUI_04739_.png (1.05 MB, 1024x1024)
1.05 MB
1.05 MB PNG
>>
File: 57394534.png (1.43 MB, 1024x1024)
1.43 MB
1.43 MB PNG
>>
File: ComfyUI_04746_.png (901 KB, 1024x1024)
901 KB
901 KB PNG
>shit accuracy
>best by far at sniffing out treasure
>also good at negotiating for some reason
take her or leave her?
>>
>>108352228
Most valuable party member, talks the final boss into leaving the country without fighting.
>>
>>108352228
i would put googly eyes on her
>>
File: 1745382358388356.png (170 KB, 1922x711)
170 KB
170 KB PNG
lul
>>
>>108352267
qrd?
>>
File: 1769839480717942.png (565 KB, 1680x1470)
565 KB
565 KB PNG
>>108352279
he felt into the ani's fabricated huggingface message lmao
>>
File: tmpzzwkzbfq.png (676 KB, 1024x1024)
676 KB
676 KB PNG
It seems to be struggling with kneeling and sitting on knees, getting lots of overly long torso short legs.
>>
>>108352321
nice SD1.5 gen anon
>>
File: tmpnecrbo9q.png (796 KB, 1024x1024)
796 KB
796 KB PNG
>>108352328
I was thinking it was an aspect ratio thing like sd1.5 would do but it happens 1:1 too, it's anima preview 2.
>>
kekstone had to have purged artist and character tags, right? anima is learning thousands of them with a fraction of the parameters while chroma still knows nothing.
>>
Asked claude why i now get errors in comfyui
"The root cause is clear from the traceback. In newer versions of ComfyUI, model_function now returns a tuple (tensor, ...) rather than a plain tensor."
Fuck you, comfy.
>>
>>108352380
now ask claude to fix the errors
>>
>>108352385
it told me
"comfyanon should be drug out into the streets and shot."
>>
>>108352388
fitting, since claude helped Trump get into war against Venzeula kek
>>
>>108352388
Shockingly based for a (((SAAS))) chatbot.
>>
>>108352377
>chroma still knows nothing.
his cope is that T5 is a bullshit text encoder to learn anime tags, I don't believe that at all, he just wants to do """ethical""" training without having the balls to say it, at least the other horse pony fucker didn't lie to our faces and said outright he wanted to cuck his models
>>
File: t2i_00001_ (2).png (1.54 MB, 1280x720)
1.54 MB
1.54 MB PNG
Soul.
>>
Damn, I expected 2026 to be a great year for local diffusion after the release of Z-image turbo, and so far, nothing groundbreaking happened... :(
>>
>>108352284
>the project is yours (but actually I own all of it and I'm not paying you lmao)
ani was right. also, why would Alibaba pay someone with a shit dataset to train a proprietary model? If ani is making a new one and it's better, wouldn't that be egg on their face?
>>
File: 1750375798490544.jpg (1.12 MB, 3506x2400)
1.12 MB
1.12 MB JPG
Is there a way to make ComfyUI make noises when you queue up images? Or when it's finished with one of them?
>>
>>108352185
is the inspiration for this gen from that dumb hoe talking about being naked in public isn't consent?
>>
>>108352321
isn't that the default body of a loli?
>>
>>108352284
I hope that furry feels very silly now.
>>
>>108352555
She's suppose to just be a chestlet not a loli, I did not prompt loli.
>>
File: misakifootball.png (973 KB, 1024x1024)
973 KB
973 KB PNG
>>
>>108352587
d-did you try loli in the negs
>>
File: tmpkbp5u11z.png (836 KB, 1024x1024)
836 KB
836 KB PNG
>>108352620
loli in negatives mostly just makes her eyes smaller but I did try that before.
>>
File: tmpa_r_6iyh.png (900 KB, 1024x1024)
900 KB
900 KB PNG
>>108352620
It does better anatomy with wariza but that isn't really the same pose. Maybe v3 will improve it.
>>
>>108352680
I posted the wrong one with the overly long leg of course, fucking firefox using ebussy's patented no thumbnail filepicker.
>>
File: thumbnail_firefox.png (26 KB, 959x538)
26 KB
26 KB PNG
>>108352693
??
>>
>>108352321
>>108352680
You ARE using "seiza" for sitting on knees, right?
>>
>>108352711
Yes, I also tried "on knees, sitting" but both give the same frequent long torso/short leg issues.
>>
File: motor3.png (1.55 MB, 1024x1024)
1.55 MB
1.55 MB PNG
>>
>>108352426
sovl

>>108352541
There used to be a custom "play sound" node.
>>
>>108351307
>>108351355
>>108351792
>>108351845
The raped
>>
>>108352680
i was really hype for anima v2 but after some testing i noticed wild regressions. he's seriously fucking something up. anatomy's worse, the styles are starting to get fried. i'm kinda afraid that the final release will be completely fucked
>>
>he's awake
>>
>>108352400
>his cope is that T5 is a bullshit text encoder to learn anime tags
Where did he say this? I mean, that's verifiably bullshit because there are anime models than use T5.
>>
>>108353111
nvm blonde moment. i had downgraded drivers to 576 branch
>>
>>108353010
>but after some testing i noticed wild regressions
Don't be alarmed, but you didn't regress, you've always been that retarded, you raped failure
>>
>>108353010
I was contemplating spinning up comfy in a container again to test it on that to see if it was just forge neo fucking it up so I guess if it's not just me having the issue I'll just have to accept the model's cooked.
>>
>>108353131
What you should contemplate is suicide, trani
>>
>>108353150
>>108351666 (You, Satan)
>>
>update comfyui
>cant paste images into the clipboard while focused on "load image" node
FUCKING RETARDS!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
>>
from the clipboard*
sorry i am fuming hard
>>
>its not just me
https://github.com/Comfy-Org/ComfyUI/issues/12896

god i hate monkeys and cumfart and people from africa. fucking hell. they literally just destroyed all productivity with this (((update)))
>>
Already mentioned in OP but what the hey.

https://github.com/Comfy-Org/Nvidia_RTX_Nodes_ComfyUI
>>
>>108353254
>Search RTX in the ComfyUI manager to install it.
>alright
>[Installation Errors]
>'ComfyUI_NVIDIA_RTX_Nodes': With the current security level configuration, only custom nodes from the "default channel" can be installed.
k....
>>
>>108353163
Congrats on making the most incomprehensible, schizophrenic post ITT
>>
>>108353254
this thing is so fast that compiling the video with video combine node takes longer than the upscale
>>
File: 1746751901969067.png (3.12 MB, 2016x1120)
3.12 MB
3.12 MB PNG
sexo
>>
>>108353292
do it the old fashioned git clone-way



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.