/g/ - Technology






Discussion of Free and Open Source Diffusion Models

Previous: >>108440192

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Blessed thread of frenship
>>
new model?

https://github.com/huggingface/diffusers/pull/13317
>>
Blessed are those who bless /ldg/
Cursed are those who curse /ldg/
>>
>>108444715
yeah
https://xcancel.com/Assu_Bhadhu/status/2036383424488022412#m
>>
File: _AnimaPreview_01703_.jpg (274 KB, 896x1152)
>>108444721
Doomers get the rope of hope!
>>
>>108444703
>>108444721
what is this cringe roleplay?
>>
>>108444738
make a troonjak
>>
>>108444767
newfag, dude's been blessing the threats since the beginning of time basically. big bang > matter cools down, atoms > he starts blessing the threats > etc
>>
>>108444842
so the fact that he has been doing it for years is supposed to make it less cringe? lol
>>
>>108444852
you just wrote cringe, that's the ultimate insult
>>
i miss my chinese researcher gf
>>
>>
File: 00123-3177030577.png (3.94 MB, 1440x2560)
>>108444715
>>108444736
new disappointment model let's GOOOOOOO
>>
>>108444685
Thank you for baking this thread, anon
>>108444703
Thank you for blessing this thread, anon
>>108444721
Thank you for blessing the blessers of this thread, anon
>>
>>108444892
is this the zetachroma model?
>>
>seething at thread blesser anon
>>
File: _AnimaPreview_01865_.jpg (235 KB, 1280x768)
>>108444825

>>108444892
looks great
>>
I'm a Grok exiled, do I still need 16GB VRAM to make 5 seconds video using ComfyUI?
>>
>>108444907
lol no
>>
>>108444907
course it is, he's trying to flex. dress makes no sense, dress pattern looks absolute dogshit, black blob on ceiling, plastic face, black hole eyes. at least make her do a split or w/e. you know, something sdxl can't do.
>>
>>108444900
Thanks anon~
*pats you and kisses on the cheek. grabs your bulge slightly, smirks seductively*
>>
>>108444946
I can make videos on 12 but 24 is optimal
>>
>mfw Resource news

03/24/2026

>daVinci-MagiHuman: Single-Stream Architecture for Fast Audio-Video Generative Foundation Model
https://huggingface.co/GAIR/daVinci-MagiHuman

>SparkVSR: Interactive Video Super-Resolution via Sparse Keyframe Propagation
https://sparkvsr.github.io

>Manifold-Aware Exploration for Reinforcement Learning in Video Generation
https://dungeonmassster.github.io/SAGE-GRPO-Page

>PROBE: Diagnosing Residual Concept Capacity in Erased Text-to-Video Diffusion Models
https://github.com/YiweiXie/PRObingBasedEvaluation

>LPNSR: Prior-Enhanced Diffusion Image Super-Resolution via LR-Guided Noise Prediction
https://github.com/Faze-Hsw/LPNSR

>Text-Image Conditioned 3D Generation
https://jumpat.github.io/tigon-page

>Improving Diffusion Generalization with Weak-to-Strong Segmented Guidance
https://github.com/851695e35/SGG

>The Golden Subspace: Where Efficiency Meets Generalization in Continual Test-Time Adaptation
https://github.com/AIGNLAI/GOLD

>Style Organizer v6.0: Style Grid for Forge
https://github.com/KazeKaze93/sd-webui-style-organizer

03/23/2026

>Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models
https://franklinz233.github.io/projects/astrolabe

>LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation
https://jiazheng-xing.github.io/lumosx-home

>LagerNVS: Latent Geometry for Fully Neural Real-time Novel View Synthesis
https://arxiv.org/abs/2603.20176

>Making Video Models Adhere to User Intent with Minor Adjustments
https://ubc-vision.github.io/MinorAdjustVideo/docs/webpage/index.html

>IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment
https://github.com/simomagi/IsoCLIP

>Learning Like Humans: Analogical Concept Learning for Generalized Category Discovery
https://github.com/zhou-9527/AnaLogical-GCD

>UNI-1: Multimodal reasoning model that can generate pixels
https://lumalabs.ai/uni-1
>>
>mfw Research news

03/24/2026

>Climate Prompting: Generating the Madden-Julian Oscillation using Video Diffusion and Low-Dimensional Conditioning
https://arxiv.org/abs/2603.21856

>EruDiff: Refactoring Knowledge in Diffusion Models for Advanced T2I Synthesis
https://github.com/xiefan-guo/erudiff

>Not All Layers Are Created Equal: Adaptive LoRA Ranks for Personalized Image Generation
https://donaldssh.github.io/NotAllLayersAreCreatedEqual

>CTCal: Rethinking T2I Diffusion Models via Cross-Timestep Self-Calibration
https://arxiv.org/abs/2603.20741

>Relax Forcing: Relaxed KV-Memory for Consistent Long Video Generation
https://zengqunzhao.github.io/Relax-Forcing

>Premier: Personalized Preference Modulation with Learnable User Embedding in T2I Generation
https://arxiv.org/abs/2603.20725

>Memory-Efficient Fine-Tuning Diffusion Transformers via Dynamic Patch Sampling and Block Skipping
https://arxiv.org/abs/2603.20755

>Adaptive Video Distillation: Mitigating Oversaturation and Temporal Collapse in Few-Step Generation
https://arxiv.org/abs/2603.21864

>ADaFuSE: Adaptive Diffusion-generated Image and Text Fusion for Interactive T2I Retrieval
https://arxiv.org/abs/2603.21886

>GIDE: Unlocking Diffusion LLMs for Precise Training-Free Image Editing
https://arxiv.org/abs/2603.21176

>Taming Sampling Perturbations with Variance Expansion Loss for LDMs
https://arxiv.org/abs/2603.21085

>MS-CustomNet: Controllable Multi-Subject Customization with Hierarchical Relational Semantics
https://arxiv.org/abs/2603.21136

>JANUS: Lightweight Framework for Jailbreaking T2I Models via Distribution Optimization
https://arxiv.org/abs/2603.21208

>AdaEdit: Adaptive Temporal and Channel Modulation for Flow-Based Image Editing
https://arxiv.org/abs/2603.21615

>ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework
https://arxiv.org/abs/2603.20644

>From Scale to Speed: Adaptive Test-Time Scaling for Image Editing
https://arxiv.org/abs/2603.00141
>>
>>108444970
>>108444976
fuck off
>>
>>108444970
>>108444976
Thanks bro.
>>108444980
Go back to /adt/, troll.
>>
>>108444976
>>Climate Prompting: Generating the Madden-Julian Oscillation using Video Diffusion and Low-Dimensional Conditioning
>>the Madden-Julian Oscillation
>>Julian
>>
>>108444954
Well the flex is working for this simple coomer. Ah am just a simple coom enjoyer, no need for anything fancy.
>>
>nigbo
>>
>>108444994
Literally obsessed. Are you seeing "Julien" under your bed already, retard?
>>
LDG is the best ai image generation thread on the entire site
>>
imagine
for a second hear me out
IMAGINE using adetailers or inpainting in 2026
only SDXLfag or chromaCVCKS would spew garbage such as this. Me? I 1-shot all my ZIT gens in 30 secs~, I dont want to wait 30 mins for 31251 detailers to run.
Thanks.
>>
>>108444999
>the raped retard has been instructed to bark at its own name like the brainless mutt it is
lmao
>>
>>
>>108445013
well actually i-
>ZiT
oh youre trolling nvrmind
>>
>>108444907
lol, I know you’re trolling but it is chroma, Idk why the author stopped training the model, he should've have keep training it
>>
>>108445033
how many hundreds of detailers did you run?
>>
imagine
for a second hear me out
IMAGINE
a thread without ranfaggot.
Thanks
>>
>>108445033
i was not trolling im unironically waiting for that model to get good
>>
>>108445040
you'd still be a raped worthless retard and we'd still hate you
>>
>>108445040
makes me almost sad
god how much better would it be
we probably wouldn't even have the split in the first place because sdg would still be flourishing without ran's stupid personality drama
mentally ill unemployed coomer
>>
Holy meltie
>>
>>108445052
>mentally ill unemployed coomer
we don't sign our posts here
and we split because of subhuman avatartroon tourists like cumfart or julien the raped retard (you)
>>
File: Flux2-Klein_00283_.jpg (577 KB, 1344x2240)
>>108445033
>Idk why the author stopped training the model, he should've have keep training it
It's so close to greatness but what can you do
>>
>>108445079
*vomits*
>>
>>108445037
Just a 2nd pass and facedetailer, but I don't like the facedetailer that much. The thing about chroma is that it needs some monkey patching to get it to run properly, but once you get it, you can pretty much generate anything you want. That's why I think it filters a lot of users: you have to mess with it a little bit. Lately my workflow has been simpler, I've been using the fp8mixed-final version and the chroma-unlocked-flash-v47-heun-8steps-cfg1_r04-fp32 lora with 18 steps.

What I like about chroma is that it is basically flux (schnell), so you can use Redux and Controlnet
>>
>>108444892
>>108445024
>>108445033
>>108445079
>>108445117
Too old
>>
>>108445131
based
>>
>>108445079
See? this is why I don't like Klein, it messes up the faces so bad, I don't know what they added to the model but its really hard to train a character lora with that model, it somehow always distorts the face, also they added some gay ass filter or poison that messes up the quality when you generate videos using klein images, same when you upscale images with SeedVR2, it adds some ugly details that you don't get when you upscale chroma, zmodel or SDXL photos.

fucking BFL they poisoned the model really bad, it also gets detected as AI photos when you upload them to social media
>>
>edit model
>still requires a lora for likeness
Klein was never good it's just the smallest edit model we have desu
>>
We really should migrate back to /sdg/
>>
>>108445004
this
>>
>>108445179
this
>>
>>108445179
your mother's cunt
>>
File: 1768837987064054.jpg (583 KB, 1536x1536)
>>
>>108445179
makes sense
>>
File: 1766516657867270.jpg (551 KB, 1328x1640)
whats in the box????
>>
They will continue to seethe at this blessed thread
>>
>>108445210
>>108445215
Do you prompt specifically for those eyelids?
>whats in the box????
Her supply of extra fingers.
>>
https://github.com/gazingstars123/Anima-Standalone-Trainer
anyone tried this? (only 43 stars)
>>
File: Flux2-Klein_00302_.jpg (591 KB, 2128x1456)
>>108445155
>See? this is why I don't like Klein, it messes up the faces so bad
Yeah, sadly it's one of those "once you see it, you can't unsee it" type of things. This is the best I've managed to make with it and it required a huge regularization dataset. Not worth the time and effort, easier to inpaint with Chroma as well.
>>
File: HEG16t1bAAAuu6a.jpg (1.39 MB, 1584x2816)
>>
>>108445241
just use tdrrusell (anima author duh) diffusion-pipe
>>
>>108445272
no thanks, turdrussel is a hack and i dont trust him
>>
replying to himself
>>
What's recommended to access your UI remotely, say on a phone at the office? It's easy enough on my local network but I only know commercial software for external remote access
>>
File: ComfyUI_temp_ijfzp_00088_.png (3.89 MB, 1296x1840)
>>108445259
I think the best workflow to enhance chroma photos is a 2nd pass thru Klein

pic.rel
>>
>>108445296
Do you use a lora for it? I've used it to clean datasets, but too often it alters things it should not
>>
>>108445272
tdrrusell has zero clue what he's doing
>>
>>108445333
https://huggingface.co/dx8152/Flux2-Klein-9B-Consistency

Yes, use this lora, it's pretty fucking great, that guy has the best loras for edit models
>>
>>108445296
quite nice
>>
Why do we have such dedicated Comfy and Anima defenders? Anyone else feel like it's sus
>>
>>108445241
>nodejs
>>
>>108445360
the hate against tdrussell is unfounded, he used to post here (probably he still does), his repo was the first to be able to train hunyuan 1.5 videos, also chroma too, he works quietly and recently released a coom model, what else do you want?

only a failed dev could get so jealous
>>
>>108445347
Thanks, I'll give it a try.

>>108445336
>tdrrusell has zero clue what he's doing
Disagree. He should ditch the score tags tho.
>>
Hey anons
How fucked am i if i want to gen coom clips with WAN (i guess?) on an amd 6950 on linux with 64 gb ram?
Any recommended apps or should i try it with comfy? Asking before i waste hours lol
>>
>>108445399
>amd
lmao
>>
>>108445360
When you are a samefag trying to slander them all the time, it gets hard to have an honest conversation.
>>
>>108445446
why does it matter so much to you? are you personally invested in comfyorg?
>>
>>108445441
I know but i need to coom lol
>>
>>108445040
He hasn't posted in months. You sound obsessed.
>>
Weird how the comfy and anima shitting always flares up right when julien decides to "work" on his crashing wrapper
>>
>>108445462
personally tired about the constant fudding
>>
>>108445462
can easily ask you the same thing
>>
File: file.png (80 KB, 1024x439)
>>108445079
>Idk why the author stopped training the model

well, *something* is happening in picrel
>>
>>108445360
Well, what else should we be using?
Go on now, share with all of us.
>>
>>108445515
It's literally stale.
>>
>>108445360
I also find it very sus how everyone defends drinking water and breathing air, almost as if they were paid to do it
>>
>>108445360
holy shit, case in point i guess
>>
>>108445462
Look where letting liars get away with lying has gotten us.
>>
>>108445537
Yep. Feels like there's essentially no honest posters here lately. Dead internet theory
>>
File: HEIVa2daIAEz_Xf.jpg (382 KB, 2816x1584)
>>
>>108445270
>>108445577
>>>/g/dalle
>>
>>108445259
This is great for a klein image
>>
best model to generate korean feet?
>>
This seems to be NucleusMoE's test repo:
https://huggingface.co/sippycoder/nucleus-moe-test/tree/main
Not sure if it is worth anything without inference code.
Couldn't find new info outside of the diffusers commit either.
>>
File: HEG0w1MbAAAzIX0.jpg (474 KB, 2816x1584)
>>
Do you guys remember when, I remember
>>
>>108445376
>only a failed dev could get so jealous
you have your answer anon
>>
>>
File: 00037-926603542.jpg (252 KB, 1344x1728)
>>
>>
File: 1774367262418663.png (38 KB, 475x235)
???
>>
File: 1743142751398947.jpg (104 KB, 828x627)
>davinci model
looks like china stole and repaired ltx kek
>>
>35 stars
>>
>>108446516
is this what you think about all fucking day ran? did you completely abandon any hope of employment? guess jerking off to slop beats working as a janitor
>>
>>108446543
>ran
lmao
>>
how's the twitter anti-anima campaign going? you fucking specimen lmao
>>
sad act.
speciment.
35 stars.
lmao.
>>
just updated comfy and now my workflows don't work and it OOM
>>
>tdrussell fudding
>anima fudding
>comfy fudding
pay attention to meeeeee
>>
Comfy's GF is spending Ani's rightful millions.
>>
Wtf are those melties, catjak is doing even worse than I thought.
>>
>>108446583
that one was funny ngl, insane behaviour
>>
File: 1759079074806528.png (3.94 MB, 1232x1840)
power bros... WE WONNED!!!!!!!!!
>>
>>108446620
>not found: nai fudding
It makes you think.
>>
for me its daVinci-MagiHuman
>>
File: 1773345297381416.jpg (34 KB, 640x480)
>end of free grok
>end of sora
>davinci local looks good
we become emperor again, local sisters
>>
>>108445940
>>108445890
post the lora
>>
>>108446947
Always were
>>
>>108446838
>daVinci-MagiHuman
gguf when?
does it do nsfw?
>>
Klein and chroma are good until you need to see fingers and toes.
>>
Is it good tho? I don't see the porn bros being happy about it yet
>>
>>108447139
Chroma as NSFW refiner is magic. Anima with chroma low denoise for instance, is amazing...
>>
beautiful scenery nature glass bottle landscape, , purple galaxy bottle,
>>
are we eating good?
>>
>>108447173
so good that we arent posting the new models gens
>>
File: name the band.png (1.59 MB, 1152x896)
>>108446947
Another wonderful day to be a Local Chad
>>108447139
Dunno about toes but Klein fingers are fine
>>
I think local video gens will be the end of me, I'll goon to death
>>
>>108447139
klein is better than chroma for this, but both are sad for feet lovers
>>
File: 35.png (1.82 MB, 960x1280)
>>108447173
we eating the sloppiest slop ever slopped by slopkind
>>
>>108447160
Anima? With Chroma?
I don't get it, one is anime the other is realistic.
Can you catbox an example of what you mean?
>>
>>108447194
You use "we" as if you participate substantially in this thread but the overall nature of your reply suggests you do not. Sad and pathetic but mostly pathetic.
>>
>>
>>
>>108444935
gem
>>
>>108447288
missed opportunity to show cum
>>
Discovered the “refiner” and chained “KSampler Advanced” rabbit hole, feel guilty for being a slopper and never correlating steps with denoise and only hiresfix.
Never using regular KSampler or 0.0–1.0 denoise values again, that’s not the right way to think.
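The steps/denoise relationship from the post above can be sketched in a few lines. This assumes ComfyUI's regular KSampler treats denoise < 1 by building a longer schedule of int(steps / denoise) steps and running only the last `steps` of them; that is a reading of its behavior, not something stated in this thread, so check it against your install. The function names are hypothetical helpers, not ComfyUI API.

```python
def advanced_equivalent(steps, denoise):
    """Map regular KSampler (steps, denoise) to KSampler Advanced
    (total_steps, start_at_step), under the assumption described above."""
    if denoise <= 0:
        raise ValueError("denoise must be greater than 0")
    if denoise > 0.9999:
        # Full denoise: the whole schedule runs from step 0.
        return steps, 0
    # Assumed behavior: stretch the schedule, then run only its tail.
    total_steps = int(steps / denoise)
    return total_steps, total_steps - steps


def split_schedule(total_steps, handoff_step):
    """Step ranges for a two-sampler chain: the first sampler runs
    [0, handoff_step), the second (refiner) runs [handoff_step, total_steps)."""
    if not 0 <= handoff_step <= total_steps:
        raise ValueError("handoff_step must lie within the schedule")
    return (0, handoff_step), (handoff_step, total_steps)


if __name__ == "__main__":
    print(advanced_equivalent(20, 0.5))
    print(split_schedule(30, 24))
```

Under that assumption, 20 steps at 0.5 denoise corresponds to a 40-step schedule started at step 20, which is what you would enter as steps and start_at_step in KSampler Advanced. Thinking in step ranges also makes it explicit where the first sampler hands off to the refiner.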
>>
File: 0594333753.png (1.45 MB, 960x1400)
>>
>>108447317
Sigmas split is the way
>>
>>108446760
what base model is that anon
you dont have to share all your secrets if you refine though
>>
>>108447357
Kys fag
>>
>>108447317
Questionmark. Explain u're self
>>
is mmaudio still the only video to audio model?
>>
>>108446947
It only shows there is a lower expectancy of return in t2v and t2i. It's grim, not good.
>>
>>108447406
The first to unlock true NSFW will be rich
>>
>>108447380
zib
>>
>>108446947
sora being out isn't such good news, one of the reasons everyone competes on video is because of it
>>
>>108447380
It's basically ZiT since he does a second pass with it and kills the life out of sovlful ZiB.
>>
>>108447525
openai was a big local enemy. now i can finally buy better settings, without selling my car kek
>>
File: 1763273820366862.png (2.78 MB, 2648x616)
so this is the power of pixel space...
https://pastes.dev/Dng0vyyQ8x (script made by gemini)
>>
File: 1759182591792939.jpg (3.32 MB, 5208x1128)
https://pastes.dev/Jkj5tRfXEe
https://ibb co/mQbCHc8 (uncompressed)
>>
kissing ur homies gn y/n?
>>
>>108447716
>>108447723
idk what this means
>>
Is there a way to make LoRA (or equivalent) that work on Z-Image Turbo? I remember being able to make Lora for Flux but needing a different trainer.
>>
>>108447839
Most if not all trainers support both turbo and base, ESL fren.
>>
>>108444987
how about you go back to /sdg/ schizo
>>
>>108445796
>Not sure if it is worth anything without inference code.
technically you can try it out by using this PR
https://github.com/huggingface/diffusers/pull/13317
>>
>>108447525
nah, the new goal is Seedance 2.0 now
>>
What LORAs do you guys like to use to get genitals in zimageturbo? Hopping back into this after eons gone. It seems like a good model, but damn are the fucked up genitals annoying.
>>
>>108445079
>It's so close to greatness
your image was made by what lodestone model?
>>
>>108447853
Thank you. I had been using SDXL based models for a long time and it was only with the need for text on screen that made me look into alternatives.
>>
File: 1763628178445219.png (569 KB, 567x466)
https://xcancel.com/soraofficialapp/status/2036546752535470382#m
HOLY SHIT SORA 2 IS DEAD LMAOOO
>>
>decade old meme poster is also the last one to find out
pottery
>>
kek
https://www.reddit.com/r/StableDiffusion/comments/1s2vxsj/this_model_really_wants_to_talkdavincimagihuman/
>>
File: 1754572676971632.png (530 KB, 2321x507)
>tfw the only countries that don't have a meltie about IP and let them use freedom of expression are macao countries
I apologize to the footcheball countries, you guys are based
>>
File: please say yes.png (114 KB, 640x640)
>>108447937
What are they going to do with the model now that it's completly useless to them? Will they open source it?
>>
>>108448003
https://www.reddit.com/r/StableDiffusion/comments/1s2qq84/davinci_magihuman_potential_ltx2_killer/
So all this meme model can do is talking? lmao this is lame
>>
>>108447877
I know but I can't be arsed.
>>
>>108448172
me neither, I would if anime china man showed some pictures and they were good, but for the moment I'll just wait, it's a MoE model so it's probably a meme
>>
File: 1771468395270724.jpg (43 KB, 512x512)
>>108448108
americans rarely share. look dall-e 3. openAI will simply delete both models...
>>
>>108448108
>Will they open source it?
poor thing, lmao? didn't you knew, when companies don't need stuff anymore they either destroy it themselves or let nature corrode it.
>>
>>108448254
I know engineers at big companies are soulless cash mercenaries, but desu I would be sad to see all my work on a model go down the drain just like that
>>
>>108448108
>Will they open source it?
What if they just sell it to the biggest bidder? I'm sure Elon wouldn't mind spending some billions to get that model
>>
>>108448265
>I would be sad to get all my work on a model go down the drain just like that
Most people seem to not care, a lot of code for a lot of stuff never sees the light of day. Just look at arxiv: I was searching for stuff that converted images to ASCII art and found lots of very good and cool stuff, but no code. A whole team of people writes and researches the thing, and if you want it you have to re-implement it yourself. Such is life.
>>
>>108448223
>"OpenAI"

They really should stop calling themselves that.
>>
>>108447937
vague motherfuckers. Tell people what you're doing with the model after the app closes.
>>
>>108448285
>They really should stop calling themselves that.
didn't Elon try to sue them because of that? keek
>>
>>108446992
the team said it's trained on the entire EvilAngel catalog
>>
What's a good image editor like grok?

Where can I just say something and it changes it perfectly?
>>
File: lul.png (229 KB, 623x397)
>>108448315
>the team said it's trained on the entire EvilAngel catalog
>>
>>108448315
when are we getting a model like that frfr?
>>
>>108448319
Qwen-edit is the closest open source alt
>>
>>108448319
>good image editor
Flux Klein 9B distill
>like grok
Don't necessarily expect it to be equivalent to a large API model in every aspect
>perfectly
No such thing
>>
>>108448319
Now that grok isn't free anymore and Sora 2 is dead, prepare for a wave of newfags in the coming days
>>
>>108448368
I propose regularly posting anti-poojeet gens to scare the worst among them away at least.
>>
>>108446992
available this week. it's only a 15b model. and i doubt about any nsfw. get ready to gen doll bodies, in waiting nsfw loras, kek
>>
>>108448350
>>108448360
I'll try both

>>108448368
FUCK U BLOODY BLOODY FUCK
>>
>>108448368
relax, local models are like 6 tiers behind, they won't stick around
>>
>>108448457
to be fair, a lot of jeets like grok imagine and this shit is LTX 2.3 tier (aka, complete ass) so who knows
https://xcancel.com/EvanLuthra/status/2036547819058942007#m
>>
>>108448319
>Where can I just say something and it changes it perfectly?

Pray that Qwen Image 2 gets open-sourced soon.
>>
>>108448477
Qwen Image is a slopped mess, I don't think it'll be much better than Klein, I still pray for Z-image edit
>>
File: chromalora.png (3 KB, 433x59)
Training a chroma lora in 10 minutes is crazy lol
>>
>>108444685
Haven't gened in months, can someone spoon feed me what the use case for models I haven't heard of is?

Wan 2.2 is still best local video? Chroma is for non slopped asian girls with bad anatomy, did it improve? Illustrious is for anime/hentai.

Why should I pick up Z, Anima, Qwen, flux klein? Are they upgrades or sidegrades?
>>
>>108448529
Anima is a huge upgrade if you want to do more than 1girl, standing. Still worth checking out even if you aren't making complex images yet
>>
>>108447937
wtf? didn't they just pay Disney 1 billion dollars a month ago to put some mickey slop in there? lmaoo
>>
>>108448554
It's a downgrade since it uses pony slop tags and prompt bleeds. what a waste of money
>>
>>108448606
surely you're not saying that it bleeds more than a CLIP based model
>>
>>108447937
Sora was abused by pajeets and 3rd world country shitters that just generated tons and tons of fake rage/bait videos of animals and anything to spam into facebook. farm engagement and get paid by meta, glad that is gone, so those fuckers can't monetize anymore, even my old folks saw some of those videos and couldn't tell if they were true or not, plus they used 3rd party apps to delete the sora watermark
>>
File: pajeets.png (182 KB, 845x943)
>>108448687
https://www.reddit.com/r/passive_income/comments/1qvt9y5/5k_in_5_days_posting_ai_videos/
>>
>>108448617
he has no actual criticisms of the model so he just says that some aesthetic classifier tags trained at 50% dropout are ruining the whole thing
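For anyone unfamiliar with what "trained at 50% dropout" means for tags, here is a minimal sketch: during training, the quality tags are stripped from a caption about half the time, so the model does not come to depend on them being present. This assumes the whole group of quality tags is dropped together; the tag names and the function are hypothetical, and the actual Anima training code may work differently.

```python
import random


def apply_tag_dropout(tags, droppable, p=0.5, rng=None):
    """Return a copy of `tags` with every tag in `droppable` removed,
    with probability `p`; otherwise return the tags unchanged."""
    rng = rng or random
    if rng.random() < p:
        return [t for t in tags if t not in droppable]
    return list(tags)


if __name__ == "__main__":
    # Hypothetical caption and quality tags, for illustration only.
    caption = ["score_9", "1girl", "outdoors", "masterpiece"]
    quality = {"score_9", "masterpiece"}
    rng = random.Random(0)
    for _ in range(4):
        print(apply_tag_dropout(caption, quality, p=0.5, rng=rng))
```

Because the content tags are always kept, the model still learns the subject from every sample, and prompts should work with or without the quality tags at inference time.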
>>
>>108448716
we really live in hell
>>
>>108448139
https://youtu.be/gPPxfPThq20?t=4
>>
>>108448687
Why didn't they just IP-block India from using it.
>>
>>108448723
we truly do, dude covers watermark with emoji
>>
>>108448784
is that even legal?
>>
>>108448716
>>108448785
I need to find a way to make money grifting with AI.
>>
>>108448794
Netflix does it with versions of netflix they don't want you to use.
>>
>>108448716
i wont blame these pajeets. companies made the mistake of creating the API slop. it's just deserved karma
>>
>>108447393
Read the SDXL template in Comfy, it explains the K Sampler Advanced widgets.
Basically K Sampler Normal is a marketing sampler to attract tourists and casuals. It synthesizes a shitload of mathematical processes, which obviously results in a shitload of slop.
If you're using normal K Sampler, you're at the same level as a CivitAI or TensorArt slopper or a Forge tard.
You can't call yourself Local if you're using the Goy Sampler, you deserve to be in /de3/
>>
>>108448825
you're a fucking schizo
>>
has anyone tested the closed source luma uni-1 model for image generation? holy shit, the model is slightly less censored than nanobanana pro but the photorealism, skin texture and details are at nanobanana pro level. Far less cucked than grok and gpt image 1.5 at the moment lmfao.
https://files.catbox.moe/6f40ua.png
https://files.catbox.moe/g8eke3.png
https://files.catbox.moe/ws04iu.png
https://files.catbox.moe/ws04iu.png
>>
>>108448832
Denooise status?
Remember, 25 steps like the WAI manual on Civit says :^)
>>
File: such is life.png (176 KB, 460x310)
>>108448839
yeah anon I know, the API fags are eating good, and we're eating plastic slop
>>
>>108448797
porn
>>
>>108448797
Step 1: Stop cooming to it
>>
any local model to use as refiner meme as this schizo >>108448855 suggests to get this quality? >>108448839
>>
File: budgetpixel-image-678049.png (1.15 MB, 1280x720)
>>108448871
i just want qwen image 2.0 to go open source. It's the closest potential bridge to nanobanana pro we could get for local. That model is far better than klein and far less janky than z image base and turbo.
>>
>>108448934
>That model is far better than klein and far less janky than z image base and turbo.
really? did you try it? and if yes, show some examples please, I'm curious to know how good it is
>>
NovelAI pivoted to video games (flopped hard), they'll never release their models open source >>108446307
Sora also flopped, won't release open source either >>108447937
Was about to spend serious cash upgrading to 64GB RAM but the way things are going I might as well wait.
>>
>>108448953
What was the NovelAI game.
>>
>>108448797
first step is to accept that local is simply inferior in quality compared to saas. if you want engaging content you need to be using models like sora 2, nano banana, seedance, midjourney, and the new uni-1.
>>
File: it's over.png (288 KB, 640x480)
>>108448977
>you need to be using models like sora 2
>>
>>108448953
Anima is dying too, nobody's making loras or training checkpoints anymore. Only NetaYume is still at it and released an AnimaYume update today, but Noobcord says it's just a WAIification of it. Plastic colors, washed out, completely defeats the point of Anima, It's a nothingburger. Everyone else has pretty much abandoned ship on that model.
>>
>>108449002
it's not out. retard
>>
>>108448953
>>108448976
Speaking of which, I haven't tried NovelAI's text adventure. is it good?
>>
File: fickle.jpg (11 KB, 320x180)
>>108448977
that's the mindset of an ai-jeet, using SaaS tools doesn't make you stand out from the rest. open-source models, your own workflows and your own trained loras are where it's at.
>>
>>108449002
>defeats the point of Anima
Did Anima ever even have a point??? Since when has tdrusel had any goal with his model other than Comfy bucks
>>
File: you admited it.png (140 KB, 1150x312)
>>108449002
>>108449053
we know who's fudding anima
>>
how quick he turned on neta since they're tuning anima instead of contributing to that apache2 anima shit lol
>>
File: 1759933095841349.png (389 KB, 1168x1792)
>>
>>108449064
can you take anon's advice and neck yourself already you obsessed tranny?
>>
>>108449206
how about you take a rope and hang yourself, everyone will celebrate that
>>
just a bit of banter
>>
>>108449206
>anifart melty
you love to see it
>>
>>108449206
I honestly hope he does. He is going to include his slop in the next collage too. What a faggot
>>
>>108449206
anifart
https://rentry.org/animanon
>>108449260
debofart
https://rentry.org/debo
>>
File: Untitled_SA9B2QJF.png (3.71 MB, 1584x2816)
not to demoralize you guys but this uni-1 is amazing despite the cuck censorship to blocks the image from view after a spilt 4 second after generation if it detects nipples and full detailed nudity. I really hope we get something close to this level of quality in the future for local that can at least fit on 24gb of vram.
https://files.catbox.moe/npg23c.png
https://files.catbox.moe/3d3tor.png
https://files.catbox.moe/biowv3.png
https://files.catbox.moe/48rn3y.png
https://files.catbox.moe/xx9tmz.png
https://files.catbox.moe/snpnvk.png
https://files.catbox.moe/427k7m.png
https://files.catbox.moe/dhlnl6.png
https://files.catbox.moe/t3mbj0.png
https://files.catbox.moe/6f40ua.png
>>
>>108449288
>realism image
Z-image turbo is pretty close to API models, I'm more demoralized on the video model side
>>
File: ComfyUI_06128_.png (1.01 MB, 1024x1024)
>>
>>108449288
this image >>108445940 is better than the slop you posted
>imagine shilling a cucked saas closed source model
>>
>>108448871
dude, the problem is you. stop only using sdxl models.
>>
I think I am retarded.
Whenever I try to download z-image from HF using the
hf download Tongyi-MAI/Z-Image
command linked on the HF page I end up getting a "no module named 'venv'" error. I am pretty sure that I have venv as I am running python 3, which installs with it, and I am also able to create a virtual environment with -m venv <environment name> as well as activate it. What am I doing wrong?
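
One way to narrow down the error above is to check which interpreter is actually running and whether it can see `venv`. This is only a minimal sketch, assuming the `hf` command might be launched by a different Python install than the one used for `-m venv`; `module_available` is a hypothetical helper name, not part of any library.

```python
import importlib.util
import sys


def module_available(name: str) -> bool:
    """Return True if the current interpreter can locate the top-level module `name`."""
    return importlib.util.find_spec(name) is not None


if __name__ == "__main__":
    # The interpreter path shows which Python install is really in use.
    print("interpreter:", sys.executable)
    # venv and ensurepip ship with CPython; huggingface_hub is a third-party package.
    for name in ("venv", "ensurepip", "huggingface_hub"):
        status = "found" if module_available(name) else "missing"
        print(f"{name}: {status}")
```

Running it with the same Python that the `hf` command uses, and again with the one used for `-m venv`, should show whether the two differ.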
>>
>>108449318
present your examples, sir
>>
>>108449331
desu i just use wget
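
For the wget route, the direct file links on the Hub follow the `resolve` URL pattern. A minimal sketch of building one, assuming a public repo; `hf_resolve_url` is a hypothetical helper and `model_index.json` is only a placeholder filename, so check the repo's file list for the real names.

```python
from urllib.parse import quote


def hf_resolve_url(repo_id: str, filename: str, revision: str = "main") -> str:
    """Build a direct download URL for one file in a Hugging Face repo."""
    # Slashes in the filename are kept so files in subfolders still resolve.
    return (
        f"https://huggingface.co/{repo_id}/resolve/"
        f"{quote(revision, safe='')}/{quote(filename)}"
    )


if __name__ == "__main__":
    # Placeholder filename (assumption); pass the printed URL to wget.
    print(hf_resolve_url("Tongyi-MAI/Z-Image", "model_index.json"))
```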
>>
>>108449303
>colonoscopy
kek
>>
File: 00002-3575874416.png (3 MB, 1632x1632)
>>108449309
chroma sparks preview is nowhere near the quality of uni-1. It's great that sparks is uncensored but quality wise it's not close. I've used that model in the past. It's a nice model but it's janky and has anatomy issues with the extremities. Hopefully the v1 update is an improvement over preview.
>>
>>108449400
the Spark guy just finished training the 512 resolution for the new version and is moving onto 1024, at which point, it will be done. I'm looking forward to it given that lodestone's zeta chroma won't likely be usable for many months to come
>>
>>108449028
SaaS are just better. Local is shit. Simple as.
Local is for porn.

>>108449331
Do you have python 3 installed?
>>
what's the fastest and easiest way to caption an image for training? like "a photo of 25 year old american woman in..."
>>
ltx2.3 is fun

https://files.catbox.moe/tefkkm.mp4
>>
>>108449639
ACK
>>
>>108449028
>using SaaS tools doesn't make you stand out from the rest, open-source models, your own workflows and your own trained loras is where it's at
fucking delusional. literally nobody (0 people) outside of a small freetard circlejerk here cares how something was made, they care about the results. and the results made by local models are strictly inferior unless you're making porn.
>>
>>108449655
>literally nobody (0 people) outside of a small freetard circlejerk here cares how something was made, they care about the results
this, still rooting for local though, look at what happened to Sora 2, those API companies can remove your toy any time they want, when your model is local it's on your PC for ever
>>
File: sallet.png (2.53 MB, 1824x1248)
>>
*yawn*
>>
File: 00023-1494673525.png (2.94 MB, 1248x1824)
>>108449655
r/stablediffusion and r/LocalLLaMA are very autistic about adhering to the rule of local=superior. Personally I'm just a pragmatic person who uses whatever I find convenient for my needs and doesn't have annoying hurdles. currently local is not doing it for me when it comes to photorealism but it's solid for 2d and 3dcg hyper real.
>>
>>108449689
>reddit
fuck off back there then, fucking pornmixslopper retard? your gens are shit, your takes are shit and you frequent places full of retards.
literally end your life
>>
https://files.catbox.moe/2irbx6.mp4
>>
>>108445296
Gah this picture is making me so horny!
>>
File: 1745924584170297.png (447 KB, 650x749)
>>108449718
the cock stared face, usually that's a face women do but I forgot to include faggots as well kek
>>
>>108449666
sure. local is nice to have as a backup plan.
>>108449689
oh yeah true, more smalltime circlejerks that literally have sd/local in their name.
>>
File: 1747250990764255.jpg (530 KB, 1328x1640)
>>
>>108449806
sexo
>>
File: ComfyUI_17328.png (3.84 MB, 1500x2000)
>>108449632
Flexibility? Caption absolutely everything in the image. Want it to maintain the person's look at the cost of flexibility? Drop details about the subject being captioned (eye/hair color, facial/body details, etc). Creating a style? No captions at all.

Oh, and use Qwen 3.5 (the biggest quant you can run) for captioning and give it strict rules about what it's going to be doing. It tends to think itself in circles without clear instruction... you can also just turn thinking off, but I wouldn't.

>>108449689
The only genuine creativity to be had with saas is in the prompting to get around all their filters.
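
The captioning rules above (flexibility, likeness, style) can be sketched as a small helper that builds the instruction text handed to whatever captioning model is used. This is only an illustration: `Goal`, `SUBJECT_TRAITS`, and `caption_instructions` are hypothetical names, and the exact rule wording is an assumption rather than anything taken from the post.

```python
from enum import Enum
from typing import Optional


class Goal(Enum):
    FLEXIBILITY = "flexibility"  # caption absolutely everything
    LIKENESS = "likeness"        # drop subject details so the lora absorbs them
    STYLE = "style"              # no captions at all


# Traits the post says to leave out when preserving a person's look.
SUBJECT_TRAITS = ("eye color", "hair color", "facial details", "body details")


def caption_instructions(goal: Goal) -> Optional[str]:
    """Return strict captioning rules for the goal, or None when no caption is wanted."""
    if goal is Goal.STYLE:
        return None
    rules = [
        "Describe the image in one plain paragraph.",
        "Do not add commentary, headings, or guesses about things not visible.",
    ]
    if goal is Goal.FLEXIBILITY:
        rules.append(
            "Caption everything visible: subject, clothing, pose, background, lighting."
        )
    else:
        rules.append(
            "Do not mention the subject's "
            + ", ".join(SUBJECT_TRAITS)
            + "; describe everything else."
        )
    return "\n".join(rules)


if __name__ == "__main__":
    print(caption_instructions(Goal.LIKENESS))
```

The returned string would go in as the system prompt (or the first instruction) for the captioner; a `None` result means the image gets an empty caption file.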
>>
>>108448525
>Training a chroma lora in 10 minutes is crazy lol
How? base + settings?
>>
File: 1_00015_.jpg (3.17 MB, 3046x3046)
>>
File: 1720870131756872.png (380 KB, 680x680)
KEK
>>
I see redditors using ltx 2.3 to inpaint faces, but they are all retarded and don't explain what the fuck they're doing, vibe coding shit.

Can ltx take an input image, mask an input video, then replace the face entirely?
>>
>>108449922
nobody wants to waste thousands to make some goonslop. comfy was right to betray local
>>
>>108449922
>>108449939
Is this the modern ComfyUI audience? Kinda grim.
>>
>>108449939
35 stars status?
>>
File: 00102-907486533.png (995 KB, 896x1152)
can somebody make a proper hires photography lora for anima thanks
>>
>>108449939
>betray local
how is he betraying local, anifart? he's still implementing the newest local models while you don't with your 35 stars wrapper
>>
>>108449939
closed niggas be like
>grok is dead
>sora dead
>seedance 2.0 is dead
>yoooo we can't stop winning!!!
seriously what the fuck is left, jeets making "AI influencers" on nano banana?
>>
>>108449939
says the guy whose wrapper still does not support anima, even though the project it depends on already does lomao
>>
>>108450018
ani said he's busy with apache anima afaik. he just didn't have the time for anistudio, not like he'd want to implement proprietary models though
>>
>>108450018
>says the guy whose wrapper still does not support anima
obviously since he hates anima >>108449064
(he wanted the 1 million dollar fund but it turns out that just making a 35 stars wrapper and shitting on comfy all day on /ldg/ won't really help you win comfy's favor)
>>
File: 1757729772597535.jpg (442 KB, 1432x1840)
>>108449966
>not doing 2nd pass with zit or klein
why r u even here broski?
>>
File: greenknight.png (2.53 MB, 1824x1248)
>>
>>108450042
so i show the potential if someone were to do what i requested (aka i'm fucking lazy)
>>
>>108449806
got a catbox?
>>
>>108450149
I need proof of your admixture first
>>
Can anon share what's so good about paid/closed?

Been gooning on local everyday and never felt like I was missing anything?
>>
>>108450266
first of all you need to post an example of what you've been gooning to
>>
>>108450266
The first thing you were missing was quality, second prompt adherence, third flexibility, fourth speed. You literally have no taste if you haven't already figured out the enormous gap for yourself. Although at a certain point being a midwit local slopper, gooning to Chroma or SDXL slop and ERPing with Nemo, has its money advantages.
>>
>>108450299
can you show some examples of god tier closed source gens? they all look painfully generic and fall apart next to a well trained lora on an open source model.
all i ever see are lazy memes and jeet slop.
>>
File: comfy__444.jpg (828 KB, 1014x1014)
>>
>>108450325
He can't. It's a comfycloud shill. Their main goal is to demoralize visitors and tourists and make comfy cloud look alluring compared to prompting locally. Faggot wants your sub money.
>>
>>108450028
>ani said he's busy with apache anima afaik
with what funding? lmao
you are such a worthless raped retard
>>108450415
yep, Trani's still a worthless raped retard with no one to talk to
>>
Still waiting to see real NSFW from closed model... We're not talking bikini jiggling here.

We're talking of a spreaded pussy lips near a butt hole covered in cum with makeup running all over the face.
>>
>>108450482
>muh porn
lol
>>
>>108450482
you haven't seen what people were generating with grok?
>>
@grok undress her bobs
>>
File: 00000.jpg (799 KB, 2816x1536)
>>108450325
local could never
>>
>>108450673
never what? obviously this is meticulously inpainted, you can do that on local too.
>>
>>108449288
Some of the AI bitches you posted are mad ugly but the overall level of realism is nice.
>>108449331
source venv/bin/activate beforehand?
>>
File: How do we tell him.jpg (127 KB, 800x450)
>>108450681
>obviously this is meticulously inpainted,
>>
haven't been here for a while, what's the latest on lodestones, any epic new failures or has he made something that actually works
>>
>>108450838
>>108450838
>>108450838
move along people
>>
>>108449655
What results nigga? an ugly 1girl nano banana face? what results are supposedly better than local? only poor fags who cannot run local would suck that much SaaS cock

>>108450665
Yeah, I've seen it, 480p vertical video screenshots, pajeets are so pathetic
>>
>>108450841
>any epic new failures
Zeta and Kaleidoscope are his newest failures
>has he made something that actually works
Lol
>>
>>
>>108451001
>>108450983
>>
>>108451026
I like that one >>108451001 more, but she wasn't enough Miku to post on lmg
>>
>>108450843
I can guess your admixture really easy
>>
>>108451049
Fair enough. Both are nice.
>>
How do I keep faces consistent and not changing with img2img on Flux2-Klein?
>>
>>108449288
nothing interesting goes on in these supposedly demoralizing gens.
it's all just 1girl, foreground, detailslop
of course it can do anatomy if every picture is the same boring layout




All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.