[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>108313977

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>mfw Resource news

03/07/2026

>Exiv: Modular and extensible open source GenAI engine
https://github.com/piyushK52/Exiv

03/06/2026

>Modular Diffusers - Composable Building Blocks for Diffusion Pipelines
https://huggingface.co/blog/modular-diffusers

>LTX-2.3-GGUF Using Unsloth Dynamic 2.0
https://huggingface.co/unsloth/LTX-2.3-GGUF

>RealWonder: Real-Time Physical Action-Conditioned Video Generation
https://liuwei283.github.io/RealWonder

>FaceCam: Portrait Video Camera Control via Scale-Aware Conditioning
https://weijielyu.github.io/FaceCam

>RelaxFlow: Text-Driven Amodal 3D Generation
https://github.com/viridityzhu/RelaxFlow

>Guiding Diffusion-based Reconstruction with Contrastive Signals for Balanced Visual Representation
https://github.com/boyuh/DCR

>MASQuant: Modality-Aware Smoothing Quantization for Multimodal Large Language Models
https://github.com/alibaba/EfficientAI

>VisionPangu: A Compact and Fine-Grained Multimodal Assistant with 1.7B Parameters
https://www.modelscope.cn/models/asdfgh007/visionpangu

>Rolling Sink: Bridging Limited-Horizon Training and Open-Ended Testing in Autoregressive Video Diffusion
https://rolling-sink.github.io

>MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing
https://xiaokunsun.github.io/MorphAny3D.github.io

>RLC Prompt Suite for ComfyUI: JSON-based prompt generation and seed management
https://github.com/efeerimoglu/ComfyUI-RLC-Prompt-Suite

>ComfyUI LoRA Optimizer
https://github.com/ethanfel/ComfyUI-LoRA-Optimizer

>ComfyUI Optical Realism Post-Processing Node
https://github.com/skatardude10/ComfyUI-Optical-Realism

03/05/2026

>LTX-2.3 Video Engine
https://ltx.io/model/ltx-2-3

>LTX Desktop: Fully local AI gen space with integrated video editor
https://ltx.io/ltx-desktop

>Z-Image Power Nodes
https://github.com/martin-rizzo/ComfyUI-ZImagePowerNodes

>Error as Signal: Stiffness-Aware Diffusion Sampling via Embedded Runge-Kutta Guidance
https://github.com/mlvlab/ERK-Guid
>>
>mfw Research news

03/07/2026

>CoShadow: Multi-Object Shadow Generation for Image Compositing via Diffusion Model
https://arxiv.org/abs/2603.02743

>HumanOrbit: 3D Human Reconstruction as 360° Orbit Generation
https://arxiv.org/abs/2602.24148

>FACE: A Face-based Autoregressive Representation for High-Fidelity and Efficient Mesh Generation
https://arxiv.org/abs/2603.01515

>Rethinking Representativeness and Diversity in Dynamic Data Selection
https://arxiv.org/abs/2603.04981

>MiM-DiT: MoE in MoE with Diffusion Transformers for All-in-One Image Restoration
https://arxiv.org/abs/2603.02710

>CAPT: Confusion-Aware Prompt Tuning for Reducing Vision-Language Misalignment
https://arxiv.org/abs/2603.02557

>Mask-aware inference with State-Space Models
https://arxiv.org/abs/2603.04568

>EvoPrune: Early-Stage Visual Token Pruning for Efficient MLLMs
https://arxiv.org/abs/2603.03681

>Accelerating Multi-Scale Deformable Attention Using Near-Memory-Processing Architecture
https://arxiv.org/abs/2603.00959

>DLEBench: Evaluating Small-scale Object Editing Ability for Instruction-based Image Editing Model
https://arxiv.org/abs/2602.23622

>ShareVerse: Multi-Agent Consistent Video Generation for Shared World Modeling
https://arxiv.org/abs/2603.02697

>Through the Lens of Contrast: Self-Improving Visual Reasoning in VLMs
https://arxiv.org/abs/2603.02556

>AdaIAT: Adaptively Increasing Attention to Generated Text to Alleviate Hallucinations in LVLM
https://arxiv.org/abs/2603.04908

>Can Vision Language Models Assess Graphic Design Aesthetics? A Benchmark, Evaluation, and Dataset Perspective
https://arxiv.org/abs/2603.01083

>Look Carefully: Adaptive Visual Reinforcements in Multimodal Large Language Models for Hallucination Mitigation
https://arxiv.org/abs/2602.24041
>>
File: ComfyUI_temp_pdtgy_00104_.png (2.68 MB, 1040x1440)
2.68 MB
2.68 MB PNG
>>108320614
very good collage, good taste
>>
>>108320614
very cringe rentry fanfiction. OP needs work
>>
We have nano banana pro at home now lel
>>
>>
>>
>>108320622
>>108320627
get out schizo you're not welcome here
https://rentry.org/debo
>>
Blessed thread of frenship
>>
>>108320668
>>108320641
>>
File: 1771771287210764.png (3.59 MB, 1160x1744)
3.59 MB
3.59 MB PNG
>>
>>108320689
go back to your asylum schizo >>108314840
>>
File: 1766285776725037.jpg (768 KB, 1216x1824)
768 KB
768 KB JPG
night out with the ladies
>>
File: 1758978692086098.png (3.65 MB, 1536x1536)
3.65 MB
3.65 MB PNG
>>
File: 1765567545903594.jpg (720 KB, 1216x1824)
720 KB
720 KB JPG
>>
File: 1766151627202099.png (3.42 MB, 1664x1312)
3.42 MB
3.42 MB PNG
>>
>>108320704
>>108320708
>>108320711
nice taste in women anon, I'm sure you'll like this
https://www.tiktok.com/@walkerpaul28/video/7612565462378974484
>>
File: 1768115859881173.png (3.24 MB, 1312x1632)
3.24 MB
3.24 MB PNG
looks like a young mcafee
>>
holy spamming/flooding
>>
File: 1760664156903243.jpg (711 KB, 2016x1120)
711 KB
711 KB JPG
>>
File: deJS_zi_00015_.png (2.58 MB, 1832x1000)
2.58 MB
2.58 MB PNG
>>108320716
cute
>>
File: 1762847960698857.png (3.73 MB, 1536x1536)
3.73 MB
3.73 MB PNG
last one
>>108320719
>having tiktok
>>
>>108320731
you didn't like the video?
>>
>>108320737
i cant even open it man, zoomietok requires an account
>>
>>108320740
oh you're on your phone? all right I give you the video on a catbox then
https://files.catbox.moe/32a1fc.mp4
>>
>>108320752
cute asian, but no im on pc, maybe it requires login from my country?
>>
>>108320756
that's weird because you can watch tiktok videos on pc without having to require an account, yeah maybe it depends on the country you live it, the internet is so gay now, the rules used to be the same for everyone, not anymore
>>
Local Diffusion?
>>
File: ComfyUI_temp_pdtgy_00157_.png (2.2 MB, 1120x1440)
2.2 MB
2.2 MB PNG
>>
>>108320777
what would it sound like if she tooted in that skin-tight suit.
>>
File: ComfyUI_temp_pdtgy_00177_.png (2.29 MB, 1120x1440)
2.29 MB
2.29 MB PNG
>>
Babe babe wake up! AnimaYume updated 0.2!
https://civitai.com/models/2385278/animayume

For version 0.2:
Continuation of AnimaYume v0.1 with improved dataset quality and techniques to prevent oversaturation and low-quality outputs. Testing shows better prompt coherence than v0.1 and stable generation at 1536 resolution.
>>
>>108307618
what if i just buy an RTX 6000 pro?
>>
>>108320833
can't wait to try this and be disappointed again like i was with anima itself.
>>
>>108320839
shit wrong thread i'm retarded sorry
>>
>>108320846
its okay i still love you anon
>>
>>108320833
He wasn't in court with Pony dev?
>>
>>108320848
thank you i love you too
>>
File: deJS_zi_00020_.png (2.77 MB, 1832x1000)
2.77 MB
2.77 MB PNG
>>108320833
i've been kinda tempted to start a new anime arc
>>
>>108320887
With all the genning experience you have, you'd probably be really good at anime if you tried it. Do you know any artists?
>>
File: ComfyUI_temp_pdtgy_00212_.png (2.98 MB, 1088x1856)
2.98 MB
2.98 MB PNG
>>
File: deJS_zi_00022_.png (2.8 MB, 1832x1000)
2.8 MB
2.8 MB PNG
>>108320912
I've done a ton of anime in the past but have been using zimg and chroma for ages, which aren't that great for anime. I wonder if there'll be an anime zimg soon
>Do you know any artists?
no, my subject matter knowledge is almost zero. I usually just wildcard artists until something hits
>>
is there a proper application out for diffusion yet? I don't want to use python because it's too many headaches around deps
>>
>>108320833
Finally, something new to try out again.
>>
>>108320833
Neat, testing it tomorrow. Right now I'm phoneposting from bed.
>>
>>108320976
hi newfriend!
>>
>>108320833
>finetuning Anima
why won't they wait for Anima to be finished first?
>>
>>108320913
>3dpd with anime girl posters instead of some Rihanna or Britney Spears
Nah, that doesn't exist
>>
>>108320833
What's the dataset?
>>
>>108320990
Duonlinvo or whatever it's called is facing court along with the Pony dev and Nochekaiser because of some CivitAI bullshit about a Lora script.
Probably won't be here to finetune Anima release
>>
>>108321008
>Duonlinvo or whatever it's called is facing court along with the Pony dev and Nochekaiser because of some CivitAI bullshit about a Lora script.
smells like bghira faggotry he would report them
>>
>>108320833
did he stop using ai slop in his dataset? guess not. into the bin it goes
>>
>>108321008
>Duonlinvo or whatever it's called is facing court along with the Pony dev and Nochekaiser because of some CivitAI bullshit about a Lora script.
really? source?
>>
>>108320990
>Note: I am still waiting for the final version of Anima and testing some methods to make my training process faster.
>>
File: 1761797156913458.png (2.77 MB, 1160x1744)
2.77 MB
2.77 MB PNG
>>
>>108320990
>why won't they just wait for the API release before training a better model???
>>
File: image-22.png (87 KB, 565x183)
87 KB
87 KB PNG
>>108321017
>>
>>108321047
>random screenshot from who knows where
>no mention of "facing court"
okay
>>
>>108321047
who wrote that? some reddit brownie?
>>
>>108321047
>going to jail for being a mass lora slopper
>>
>>108321016
what you call slop is the magic recipe for your 1girls to be more stable
>>
>>108321080
yum taste shit, but at least it's stable (same exact image every seed)
>>
>>108321047
Good one now where's the real source
>>
>>108321047
i remember this from somewhere, it's just some schizo lora training guide
>>
File: 1772946904153543.mp4 (2.77 MB, 832x1504)
2.77 MB
2.77 MB MP4
>>108320833
>>
>>108321114
the animation is not bad at all, what video model you used to make it?
>>
>>108320833
Thanks but I prefer Noob as refiner. Less slopped than Duongvie or Bluvoll, which are anime diffusion's worst enemies and sadly infiltrated the Chenkin Noob circle with their slop datasets.
>>
>>108321127
Grok
>>
File: 1755553086448879.png (3.27 MB, 1744x1160)
3.27 MB
3.27 MB PNG
>>
>>108321155
Trvke.
>>
File: 1753578288939546.png (3.61 MB, 1656x1232)
3.61 MB
3.61 MB PNG
>>
https://civitai.com/models/897413?modelVersionId=2663313
>looks like an interesting NSFW klein finetune
>we can't download it
I'm starting to hate civitai more and more...
>>
>>108321185
>the model is a "finetune" on a few thousand images
>might actually be just a merge of other peoples' loras
>"Please note: The Klein versions of Big Love are pay-only and special license conditions apply. See description!"
>it's API-only
>unlock usage for $5 on tensorart
>additional license terms ban merges, finetunes, and redistribution
>still inherits the Flux dev license, so BFL is allowed to nuke it whenever they want
what the fuck is this next level grifter shit
>>
>>108320704
catbox?
>>
>>108321277
>>still inherits the Flux dev license, so BFL is allowed to nuke it whenever they want
where's the snitching anon? if you read this, please report that shit. I want this grifter to eat some karma kek
>>
Glad I got an rtx4080 back when I did.. Sheesh. All the demand tho might help push tooling for upcoming tech out faster.
>>
imagine if it's true, Seedance 2.0 at home baby! >>108321322
>>
>>108321395
that would imply local has any bakers capable of making a seedance-level model in the first place
>>
File: 1755664375193225.png (97 KB, 248x204)
97 KB
97 KB PNG
>>108321421
I think the point here is that with this new architecture you could get the results of a transformer architecture that is 30x bigger, if Alibaba makes a 14b model based on this, it'll get the level of a 420b transformers model, surely this would reach Seedance 2.0's level
>>
>>108321395
>>108321442
you might be dangerously low iq to think that's valid
>>
>>108321462
I don't think it's valid at all, just passing time making fun of some jeets making engagement posts in reddit in all place, at least in twitter you're paid to ragebait people, he does this shit for free here!
>>
File: bbs-zit-2026-03-08_00089_.png (3.54 MB, 1792x1024)
3.54 MB
3.54 MB PNG



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.