[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>108220388

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://thetacursed.github.io/Anima-Style-Explorer/

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Blessed thread of frenship
>>
>>108223381
thanks for the bake anon
>>
/dedgen/
>>
>>>/r/20344542
We are treated as gods on other boards.
>>
File: 00031-2797087328.png (1.54 MB, 1024x1024)
1.54 MB
1.54 MB PNG
>>
/benin/
>>
File: Klein_Edit_00206.png (1.11 MB, 1280x720)
1.11 MB
1.11 MB PNG
>>108223526
So... anything cool happening in the image gen space right now?
>>
>>108223583
Most models got native NAG support in ComfyUI
>>
>>108223381
Been out for about a month, what ultimately won out, Z or Klein? Any actual finetunes out for either of these?
>>
>>108223606
Klein for editing and Z for regular text to image. Different usecases.
>>
I like Qwen 2511 for edits
>>
>>108223606
>what ultimately won out, Z or Klein?
neither, seems like they're hard to train, and klein 4b looks like a meme but no one want to deal with the gay licence of klein 9b, at this point our only cope is Qwen Image 2.0 or Z-image edit
>>
File: 1581285285757.png (1.82 MB, 1001x1478)
1.82 MB
1.82 MB PNG
SubtleShader is now a traitor. he's selling his new loras now, wtf. he didn't even become popular by starting from scratch, but by merging other models...
>>
>>108223606
Klein9B is an edit model
Z is a general model.
Can't really compare who won yet. I personally think Z has higher quality realism, so Z image edit should be better, but we'll see.
>>
>>108223595
apparently the native NAG doesn't work with klein yet
>>
ultra cozy
>>
>>108223680
well, time to pay up.
>>
File: 00010-4047272580.png (2.16 MB, 1792x1792)
2.16 MB
2.16 MB PNG
been out of it for a bit lads
whats the metagame nowadays? i'm still using reforge and wainsfwillustrious_v140. is comfy really going to give me better results?
heard some of the latest illu models have issues with using controlnet, is this true?
>>
>>108223583
I'm still riding the high of Anima, Z, and Klein Edit
>>
>>108223884
>whats the metagame nowadays?
waiting for anima and hoping no major fuckups happen
>>
>>108223896
*apache2 anima*
the tdrussell one is kind of shit
>>
>>108223884
wai nsfw was updated to v16. most people are still using that for local anime gens.

>is comfy really going to give me better results?
it's just a ui and thus cannot give 'better' results. it does however offer more flexibility, automation and general tools.

>heard some of the latest illu models have issues with using controlnet, is this true?
not that i've heard of.
>>
>>108223921
>most people are still using that for local anime gens.
jesus imagine if that were true. how horrible. and i mean most people here not on jeetivai or whatever
>>
File: ComfyUI_temp_kefuj_00086_.png (1.68 MB, 1024x1024)
1.68 MB
1.68 MB PNG
>>
>>108223680
lol he's selling a checkpoint 'subscription'. $12.50 a month
>>
File: 1754521601235523.png (2.97 MB, 1232x1656)
2.97 MB
2.97 MB PNG
>>108223884
For anime? Anima. https://huggingface.co/circlestone-labs/Anima
>>
>>108223935
Anons here use whatever is new because most are bleeding edge users. Go to any other general and you'll see 99.9% are not using Anima. Most don't even know what that is.
>>
>>108223964
>Go to any other general
that was referenced in the "whatever" part of "jeetivai or whatever". anon should pay no mind to what the masses of lagging sloppa sheep use desu
>>
>>108223920
>sloptuner: I only train on permissively licensed models so that I can lock down my finetune with a restrictive license and monetize it
>anima dev: let me train this permissively licensed model and release it with a noncommercial license so I can monetize it
>sloptuner: NOOOOO YOU CAN'T DO THAT IT HAS TO BE APACHE 2 SO I CAN MAKE MONEY WITH MY OWN FINETUNES
this is pure hypocrisy if nothing else
>>
>>108223963
I just started using this. How are you genning at a higher resolution? I get kind of weird gens when I try bumping it that high.
>>
File: 00049-332417223.png (2.65 MB, 1728x1728)
2.65 MB
2.65 MB PNG
>>108223921
>>108223896
>>108223963

excellent thanks bros
>>
>>
>>108224021
https://comfyanonymous.github.io/ComfyUI_examples/2_pass_txt2img/
>>
>>108223964
I rarely even see anima gens here. If they do exist, I can't tell the difference between them and XL.

You'd figure anons would be posting gens that show how 'superior' anima is such as multiple 1girls, ease of regional prompting and so on, but no, it's the same generic 1girl stupid shit.
>>
>>108224010
so who is going to finetroon commercial anima? Isn't tdrussell just as grifty as the grift finetrooners? I don't get it
>>
At least half of the collage is Anima
>>
>>108224105
ranfaggot made the collage so it's a moot point
>>
retard here, been using gemini to generate images using several other reference images.
i like nano banana bc i can input 2-5 images of a subject and it can generate a very accurate model of it, but can't do any nsfw shit.
is there a comfy alternative that can do the same? or do i have to train a LoRA?
>>
go work on your wrapper lilbro
>>
File: MJ Kino.jpg (200 KB, 828x1104)
200 KB
200 KB JPG
>>108224093
as long as a local model hasn't been close to Midjourney's sovl I won't consider anime local to be solved
https://www.tiktok.com/@tcgbrugialdi/photo/7415973084705918240
>>
>>108224122
You've done him
>>
>>108224122
put the fries in the bag unc
>>
>>108224104
>Isn't tdrussell just as grifty as the grift finetrooners?
I mean I guess, if you squint a bit. But there is a difference between someone trying to monetize a huge finetune vs sloppers who want to sell their loras.

What I don't understand is all the people saying shit like "anima is doa, nobody will finetune it if they can't make money off it". But they don't extend that same logic to the person actually training anima, who is apparently expected to just give everything away for free with no compensation.

And the thing I REALLY don't fucking understand is why nobody complains that Noob also has a noncommercial license.
>>
>>108224149
It's just the fag trying to drum up hype for his "anima but apache2" model that isn't even off the ground yet. You can ignore those posts.
>>
File: 1771750667078314.png (20 KB, 575x156)
20 KB
20 KB PNG
>>108224149
the answer is actually quite simple
>>
File: 527246642.png (1.53 MB, 936x1072)
1.53 MB
1.53 MB PNG
me and who?
>>
>>108224161
Imagine fudding a local model.
>>
File: 2girls.png (1.09 MB, 1024x768)
1.09 MB
1.09 MB PNG
tried to do 2girl boxing on base anima
>>
>>
>>108224192
how well does anima handle 2 character loras in the same image interacting?
>>
>>108224192
ZiB as base model, Anima as hiresfix at 0.4~5. You can use the same prompt if you want and add the artist and copyrighted characters.
Anima in img2img understands any style of character, whether realistic or anime itself, and translates them well into anime. The background will be murdered but at least the characters will have better composition.
>>
>>
File: 00011-49031391.png (586 KB, 832x1216)
586 KB
586 KB PNG
One thing i would really like to see for Noob models is to have is some kind of style controlnet style IP adapter. You can think of it like Vibe Transfer NAI has. It’s used for style adjustments, not affecting the composition. Similar thing was created for Pony based models https://github.com/gustproof/sd-webui-forge-style-components
>>
>>
>>
>>108223920
Your vaporware doesn't exist and never will.

>>108224192
Funny, I'm doing the same thing right now except het. Results aren't great, but it might be a skill issue.
>>
>>108224192
>same hair style
>same hair length
>same eye type
>same bangs
>same breast size
hmm
>>
>>108224305
>the girl on the left has short black hair and small breasts, the girl on the right has long blonde hair and large breasts
problem solved
>>
>>
>>108224332
i know that. im saying anon was too lazy to make them distinct.
>>
File: 2girls2.png (963 KB, 1024x768)
963 KB
963 KB PNG
>>
>>108224229
>ZiB as base model, Anima as hiresfix
Unless you have a chad card or use shitquants Comfy's memleaks won't let you do this.
>>
File: 1girl1boyboxing.png (1.21 MB, 1024x768)
1.21 MB
1.21 MB PNG
girl vs boy boxing in base anima
>>
>>108224369
?
gen first image with ZIB
unload model
i2i image with anima.

actually, anima is so small that you can probably keep both of them loaded.
>>
>>108224369
I have only 12gb vramlet ZiB at fp8 or gguf and comfy does that very fast and this >>108224390 i have both of them loaded.
>>
File: animapreview_00000_.png (3.1 MB, 1664x2560)
3.1 MB
3.1 MB PNG
>>
>>108224390
>loading and unloading models every time you wanna gen
id rather kill myself but im happy it works for you
>>
what about using anima to build the base composition, then XL to refine? you only need anima for the prompt adherence. XL can fill in the gaps

>>108224432
i don't need to do that because i have 32gb vram and 128gb ram.
>>
>>108224439
The details would get destroyed by XLs poopoo VAE but I'm sure some anons do it already and don't care
>>
>>108224454
then just do a third pass with anima as a final refiner to fill in the lost details lol
>>
File: 2differentgirls.png (1012 KB, 1024x768)
1012 KB
1012 KB PNG
base anima boxing with different characters
>>
I still like Noob for complex lighting effects, stuff like depth of field, and really dark pictures. I have no doubt Anima will get better for "high aesthetics" type works with more training. Shit, just having the Qwen text encoder instead of clip, which every shitmix and shitmerge use, should make it superior in the long run.
>>
>>108224475
Does this work with multiple character loras.
Please demonstrate.
>>
File: 2girlslora.png (1.09 MB, 1024x768)
1.09 MB
1.09 MB PNG
>>108224482
It doesn't work properly since the characters become mixed
>>
>>108224490
So Anima did not solve the same issue SDXL has. What a pity.
>>
>>108224439
>Anima2SDXL
Haven't used pure SDXL in a while, it's hit or miss, maybe 70% success. Watch the denoise amount though, SDXL rounds out, and simplifies facial expressions and poses, makes characters look like cake toppers. I do 0.4~0.5 denoise also and usually 3 gens at 0.4, 0.45, and 0.5. Sometimes gotta add tags becausd Anima added cool thongs that SDXL doesn't recognize.

Actually found ZiB2Anima trying to use Anima as a translator of middleman for SDXL since ZiB2SDXL was too destructive.
>>
File: 1751617733052397.mp4 (2.82 MB, 2048x1176)
2.82 MB
2.82 MB MP4
https://unicornanrocinu.github.io/LUVE_web/
Babe wake up, an open source video will be released soonTM
>>
>>108224514
>Haven't used pure SDXL in a while
When I say SDXL I mean waiNSFW or NoobAI shitmix. No one uses raw SDXL anymore.
>>
>>108224506
That's not an Anima issue it's a lora issue. The lora is only trained on a single character, it basically forces everyone to be that character.

Proper use of regularization images or multi-character loras would probably work. Certainly it would work better on Anima than any SDXL model.
>>
I wonder how many were there to witness the birth of the concept of a "refiner" and how its original implementation, by anon, was completely incorrect i.e. it's not supposed to be a regular i2i pass. And also how it was just a cope afterthought by SAI to "fix" SDXL. I imagine not many remember this.
>>
>>108224536
They never used it correctly to begin with. Refiner is basically one of the best ways to merge models outside of actually mixing them but so many anons failed at it and couldn't understand it.
>>
File: kek.png (314 KB, 1286x909)
314 KB
314 KB PNG
>>108224515
>comparing to wan 2.1 in the year of the lord 2026
kek
>>
>>108224110
yes
>>
>>108224524
Me too, for that type of task WAI is ironically better at img2img gens from ZiB or Anima in my experience speaking of the automated production of txf2img->hiresfix->detailer. Not manual inpainting or thing like that. With dedicated inpainting and all that maybe Noob would've worked better
>>
>>108224543
Well, eventually anon figured out the correct way to do it but then realized the original XL refiner was mostly pointless and largely slopped the output. I suspect that the reason why no one knows how to do it today is because that original refiner fell out of fashion quickly.
>>
>>
>>
Lol, Hasaku dev is training Klein 4b for anime. Does it have a better license than Anima? At least Klein with 4b, VAE and text encoder are better than Anima. I don't mind shitmergers starting to use Klein 4b as the next SDXL.
>>
File: 1763603524495627.png (3.36 MB, 2415x1241)
3.36 MB
3.36 MB PNG
Have you guys tried BitDance?
https://github.com/shallowdream204/BitDance
>>
>>108224642
>Lol, Hasaku dev is training Klein 4b for anime. Does it have a better license than Anima?
Klein 4b has the goated Apache 2.0 licence
>>
>>108224647
>14B
kek
>>
>>108224642
Why Klein and not ZIB?
>>
>>108224660
I mean, Qwen Image had its moments and it's a 20b model
>>
File: jhcn5hhvlftd1.gif (2.49 MB, 323x323)
2.49 MB
2.49 MB GIF
>spent the entire day removing clothes off loli with klein9b
>>
>>108224506
Anima is cool but it's slower and seems to have limited learning capacity. It's a showcase for tdrusel's dataset overall, but it's technologically inferior to K4B even though Anima took way longer and cost more to train.
>>
>>108224711
What the fuck are you even saying?
You're just talking out your ass because you're lonely today?
Why would a more advance model with more parts be the same as XL?
Compared to other major models it's still light
>>
>>108224711
the dataset is anemic, the caption strat is retarded and the encoder is laughably dumb. The entire model needs to be redone
>>
>>108224727
jesus christ why are you so defensive? calm down tdrussell.
>>
>>108224663
ZiB is unstable and the character consistency is garbage. Even our pedigree /ldg/ lora makers haven't managed to pull a decent character lora from it yet.
>>
>>108224743
You seem upset
>>
>>108224750
>Even our pedigree /ldg/ lora makers haven't managed to pull a decent character lora from it yet.
I have, but it's a person lora, not anime. I guess Z models are just more tuned for realism.
>>
>>108224751
I think tdrussell is if he's crashing out at anons for valid criticism
>>
>>108224727
Sorry Comfy, you wasted 1million in a literally who
>>
File: screenshot.1771901021.jpg (65 KB, 419x269)
65 KB
65 KB JPG
oooiii i have less than 100gb on all my nvme's. space is running out and a new 4tb nvme is like $600

fuuuuuuuuck
>>
>>108224763
>>108224766
The only faggot seething over Anima is a jilted dev that got left behind. You really do spend all your free time crying like a zesty lil bitch.
>>
>>108224769
I don't see anyone 'seething' over Anima. Just anons making basic comments about the model.

"WAT THE FUCK ARE YOU SAYING ABOUT ANIMA??"
Now that's screeching.
>>
all those anons last thread talking about ldg being dead or whatever when this is the only place devs both based and fag alike can shit post at each other with anonymity
>>
>>108224785
You say the same copes and seethes on other platforms. Nobody is going to give you money loser faggot
>>
>>
>>108224766
The Comfy grant is still up for grabs.
>>
File: 585i3838.jpg (74 KB, 531x397)
74 KB
74 KB JPG
>>108224663
https://www.reddit.com/r/StableDiffusion/comments/1r8oed1/why_are_people_complaining_about_zimage_base/?share_id=3O2BgXnz1jnrfBOtR0oLB&utm_name=androidcss
>>
>>108224766
Illustrious 3.5 only needs 350k in funding. Why didn't he give them money?
>>
Zeta Flop!
>>
File: loracope.png (26 KB, 788x218)
26 KB
26 KB PNG
>>108224506
loras are outdated copium. local research has stagnated. zero developments in this field since sd1.5, just bloated underperforming models that lack half the features of sdxl (prompt weighting).
>>
>Schizo dooming again because his life is hell
>Seething because comfy won't give him money to fail at another project
Was comfy giving you a job that you squandered not enough?
>>
is qwen image edit the best image edit model out there right now?
>>
>>108224766
This. Also tdrusell isn't part of any shitmerger cabal or NoobAI friend group, so obviously shitmergers are gonna ignore his model...
>>
>>108224836
Loras will never be outdated. A big general model will never be able to capture every character people want, and using 1-2 reference images is not enough for any model to fully grasp them. It won't understand their unique facial expressions, posture, and so on.
>>
y-you want to be a part of the cabal you call shit?
>>
>Dooming when the model is half way done
>But please don't treat my shit tier UI that's been in development longer than this model has been training because I deserve the grace I won't give others
lol faggot
>>
>>108224857
>using 1-2 reference images is not enough for any model to fully grasp them.
Nano Banana Pro is close desu
>>
>>108224872
Nah.
>>
>>108224832
i wish it had a longer name like zeta the chinks never told us what z stood for



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.