[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>107727269

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Blessed thread of frenship
>>
File: 1755549618857669.jpg (2.09 MB, 2048x3072)
2.09 MB
2.09 MB JPG
https://files.catbox.moe/g5a4ig.png
>stereotypical X man/woman with a white background holding a sign saying 'ZIT X man/woman'
Why doesn't zit like the word transgender?
>>
how do I sell my hentaislop
>>
>>107730798
honestly i think there's a lot of random words it doesn't like
>>
>>107730799
Whar?
Who are you referring to?
>>
File: Untitledsdfsdf-1.mp4 (3.65 MB, 1000x674)
3.65 MB
3.65 MB MP4
How come wan 2.2 ended up being forced to use light loras?
I've been trying with just high step and cfg counts and the motion is just so broken in comparison to light loras. Ignore frame interpolation broken frames.
>>
>>107730683
>https://rentry.org/debo
>https://rentry.org/animanon
wait... glanced through those rentries and i don't understand how is this related to local diffusion topic
>>
>>107730798
Why is the Jewish man the only one who isn’t smiling?
>>
>>107730823
It's not related. It's part of this general's drama. Anons have been trying to force this shit out of the op for months, but the schizo never sleeps
>>
Don't engage with the spammer, let him reply to himself, post about imagegen.
>>
>>107730833
who?
>>
>>107730840
why though? this is a troll thread. posting about imagegen is off-topic. when we have a bread without off-topic spam we can have a thread about local diffusion
>>
nu year sam drama
>>
>>107730852
what the fuck are you talking about schizo
this is not local diffusion. stop derailing the thread retard. let us have a bread without your petty grudges
>>
>>107730798
why is the jewish man the only one with a cool hat?
>>
Move your troll war to reddit or discord or tiktok or whatev.
>>
File: Sad.png (228 KB, 1080x1928)
228 KB
228 KB PNG
>>107730867
This was to cope with that very discord calling him a retard minutes later
>>107730873
This is local diffusion, a bitter dev from a failed project decided to disparage a popular project that we all use only to get called a retard in another dev's space. The mentally ill dev is also actively advertising his project here while showing clear signs of mental illness.
He also starred his own project so in reality his almost 2 year old project has 32 stars despite months of endless shilling.
>>
>>107730899
I got a vacation for talking about z-img base, absolutely not comfy
fuck that
>>
Let it go bro
No one here is ever gonna use tranistudio
If you really still have hopes of using it as a source of income, your best option is fucking off and shilling it as far away from here as possible
Otherwise, the only thing waiting for you is a slow, self-influcted descent into misery
>>
>>107730905
yeah and other shit that never happened schizo
you were obsessing over ani as usual and got what you deserved
>>
>>107730911
i use tranistudio
>>
>>107730911
ani said that anistudio is quite successful though, he got an offer from google. they think its pretty competent. the popularity will come. comfy didn't get famous overnight
>>
year of no base
>>
So successful he is schizomaxxing for 2 weeks over christmas for 20 hours a day
>>
>>107730830
he wasnt paid to be there
>>
oh wow! the first day of 2026 there is the first thread of the new year!
> rentries are back
>single digit on topic posts
>complaints about the schizo
>schizo being a low effort troll the entire time
fucking hell
>>
Is there a good local image to prompt model? I do not want to sign in with google nor get molested in huggingspace or whatever it is
>>
>>107730962
https://github.com/1038lab/ComfyUI-QwenVL
>>
>>107730962
z-image has some official system prompt for qwen
>>
>>107730962
you can try qwen 2.5-vl llama 3.2 vision but cloud is much better,
>>
>>107730962
Stable Diffusion 1.5
>>
>>107730962
depends on what you are doing. qwen like anon suggested, joycaption for booru
>>
>>107730962
For booru tags, I still use wd-eva02-large-tagger-v3 in workflows because it is cheap memory wise and gets the job done.
>>
>>107730926
Why does he spend his time seething in here instead of working for google, then?
>>
File: Lol.png (7 KB, 324x95)
7 KB
7 KB PNG
>>107730926
He didn't get hired, if he had a job he would be flaunting it and not posting here all day. He destroyed his career prospects by being a lolcow.
>>107730987
Hard to get a job when you schizo stalk and attack devs in the ai space and make garbage software like anistudio.
>>
>>107730997
>posting anonymously is impossible
>>
Is there some magical workflow with loras that makes qwen image 2511 or whatever potent? I want to do comparisons to zit.
>>
>>107730996
>>107730997
You could be working at google if you weren't so retarded
>>
>>107730997
Why are you lying?
>>107730852
Do you think anons can't remember past a day?
>>
>33 stars is crying
>33 starsi is alone
>>
does llama through joycap have the problem of "I can't see the face of the subject with the huge tits, long hair and wearing a dress, so they will be henceforth referred to as 'individual'"?
>>
have zit loras gotten any better? Last time I checked, they destroyed coherence and detail
>>
I'll take that as a no >>107731048
farewell janus, I tried but you suck
>>
File: 1767273577972926.png (1.95 MB, 1536x1024)
1.95 MB
1.95 MB PNG
>>
File: 1644968678885.gif (1.44 MB, 500x375)
1.44 MB
1.44 MB GIF
where the hell is that new year's present?
this time, they were the ones talking about presents
>>
>>107731061
>uses the same easy to verify cope
>>107728696
This is why we all want you gone and why you will never amount to anything with the only highlight of your life being a janny to comfy that got you a short lived pity job.
Tik tok, the garden is calling you ani. Time to go back to mowing lawns
>>107731077
He's so lonely, he rather get negative attention instead of working on his project
>>107731089
Don't false flag us we know what's trani and what isn't
>>
sdxl won
>>
>>107731095
wasn't that qwen?
>>
>>107731107
yes and it's even shittier than the previous version
>>
>>107730911
>shilling it as far away from here as possible
he already tried that on the tongy discord but because he can't samefag on discord he got clowned on and humiliated out of the room. that's why he always comes back to 4chan
>>
File: 1767240993793637.png (1.42 MB, 1845x912)
1.42 MB
1.42 MB PNG
>>107731095
>>107731107

>Small spacecraft designed in a classic 90s JRPG anime style, inspired by Chrono Trigger and early Final Fantasy — compact and heroic silhouette, rounded yet mechanical forms, hand-crafted look, subtle sci-fi details mixed with fantasy elements, exposed engines and glowing accents, colorful but limited palette, slightly whimsical design, anime proportions, clean linework, retro science-fantasy aesthetic, looks like it belongs in a 16-bit era RPG world, painterly highlights, soft shading, sense of adventure, bronze final fantasy vii aesthetics
source:
>>107727290
>>107727315
>>
File: 1750349035144356.png (1.25 MB, 1152x896)
1.25 MB
1.25 MB PNG
the green slime girl holds a bag of chips. on the bag is the text "SIPS". she holds up a potato chip

cool, it works. flux 2 with 8 step turbo lora
>>
>>107731059
I think they are great.
>>
File: 1760894255929414.mp4 (603 KB, 640x832)
603 KB
603 KB MP4
>>
>>107731162
He gave up on doing animation because everyone surpassed him in the space. He never had the skill to begin with but pretended to be an authority.
>>
>>107731180
trying to be an authority by name on an anonymous imageboard is fucking cringe to begin with, the only people who do that are attention whores
>>
>>107731187
I agree, look at him now
So desperate for attention he rather be hated by anon. 33 stars for almost 2 years work. He still can't understand why nobody wants to contribute to his for profit license. He honestly thought anons would work for him while he does fuck all with his project.
>>
can we just talk about diffusion please? I don't care about schizobabble
>>
lmao look how defeated he is
>>
So it's a wrapper around sd.cpp
As a developer, why would I not just use sd.cpp myself? What's the value add? What am I getting for the commercial license that I can't just do myself?
>>
>>107731149
Any guide you can link, or tips? I'm trying to train and experience that as well
>destroyed coherence and detail
>>
>check thread to find if there has been anything new in the local image gen scene
>its full of dramatarding
ok, back to lmg
>>
>>107731240
This anon is on the money I think. >>107703601
>>107703628

Also personally I use ostris' v1 adapter, I don't use v2 or the dedistilled base model. I haven't properly A/B tested but I feel like this is how I get better results.

For strong styles I also use "High Noise" Timestep Bias.
>>
>>107731230
Yes, He expects anons to port basic features that he lacks the skill to add himself. Once someone reads his license they immediately close the tab.
Also it's still slower than comfy.
>>
File: 1758930566507996.png (1.39 MB, 1136x912)
1.39 MB
1.39 MB PNG
transform into realistic photography

neat. you can use flux 2 like an edit model
>>
post what you've been working on
>>
y'all would rather shitpost than post gens?
why are y'all here?
baka (shaking my head)
>>
Anyone had success with the long vid loras and workflows? Here's a good link that seems to be regular updates on workflows, techniques, etc https://wanx-troopers.github.io/svi.html


Also for the love of christ, ignore the drama tards
>>
File: 1763702798415494.png (1.44 MB, 960x1088)
1.44 MB
1.44 MB PNG
>>
>>107731268
50 pics with 4000 steps, alright. Still the default 0.0001 learning rate? I feel like for me the images fall apart due to overtraining, tried to lower the steps to 2k or the lr, it became better but this also reduced the lora effect. And at strength 1.0 the loras become fried, that probably also means overtraining. Using v2 though. Will try 4000, thanks.
>>
>>107731384
>man who doxxed himself calls for others to be doxxed
You can't make this shit up
>>
>>107731450
Do save epochs, and try various step counts. It's been really hard to for me figure out which epoch is the best unfortunately. I haven't messed with learning rate. I want to try prodigy when I get around to it. Also just try strength 0.7 or 0.8 it seems like that's the sweet spot for a lot of zit loras. Even if it means you had to overtrain, who the heck knows anyway.
>>
anyone made a "must have" list z-image good loras from civitai?
>>
>>107731536
>>107731384
*yawn*
Yet another example of why AniStudio is unsafe to run.
>>
>ani is crying
>>
>>107703601
any tips on captioning? So far Ive done 2 loras. theyve come out well. just used taggui for it. I want to get better. and I think captions is my weak point.
>>
>>107731481
anything from that xixxix anon
i mostly use my own loras thoever
>>
lol comfy finally updated their chroma weights from the broken ones
https://huggingface.co/Comfy-Org/Chroma1-HD_repackaged/blob/main/split_files/diffusion_models/Chroma1-HD-fp8mixed.safetensors
https://huggingface.co/silveroxides/Chroma1-HD-fp8-scaled/blob/main/Chroma1-HD-fp8mixed.safetensors
>>
>>107731481
ZIT sucks with lora. Try to avoid them.
>>
>>107731626
I wonder what the last straw was kek. That plebbit post has been haunting them for months and months.
>>
>>107731626
people should just stop using the org repo. too incompetent
>>
File: 1748878420428820.png (1.72 MB, 1024x1024)
1.72 MB
1.72 MB PNG
transform into realistic photography

with picture of frieren as input
>>
>>107730813
we're not getting another version of local wan so all we can do is train more cope loras
>>
>>107731646
Are you using an LLM to simply write you a prompt from a reference image and then run through flux?
>>
>>107731632
Nah
>>
File: xyz_grid-0002-414940822.jpg (767 KB, 5376x1308)
767 KB
767 KB JPG
ZIT at 20 steps completely changes the character.
While 5 steps is actually enough if you use upscaling pass after initial generation.
>>
>>107731641
comfydev defended for months their incorrectly converted checkpoints
>>
>if
>>
>>107731671
ALL the steps pictured completely changes the character.
>>
>>107731646
That looks rather young for a cock-sleeve, doesn't it?
>>
File: 1761141765456196.mp4 (3.69 MB, 1264x720)
3.69 MB
3.69 MB MP4
>>
File: 1759664318093035.png (1.67 MB, 1024x1024)
1.67 MB
1.67 MB PNG
transform into realistic photography. she is a gorgeous woman

>>107731711
better?
>>
>>107731706
2-5, 10-15, 20
However 20 is significantly different from all the other iterations.
>>
>>107731724
Too Arabic.
>>
>>107731671
why do your zit gens look like ponyslop ive never seen others look like that
>>
>>
>>107731737
Try adding score_9, score_8, masterpiece, best quality to your prompt
>>
>>107731724
Gorgeous for good looks sar
>>
>>107731737
Prompt basically consists of 1girl and a pose. That's what it produces.
>>
>>107731725
You use "completely" and "significantly" as if it's changing her race kek
>>
File: 1766922909590034.png (2.88 MB, 1472x1024)
2.88 MB
2.88 MB PNG
man i cant believe we're holding out against pure schizoautism, I thought it was all over. thanks for making me believe in christmas again bros!!!!
>>
>>107731769
She got significantly older and has a completely different face. So
>>
>>107731095
A full year of bullying a failed dev that even comfy personally makes fun of
>>
>>107731788
Missing the forest for the trees. What happened to the "it causes jpeg artifacts" cope?
>>
>>107731620
>xixxix anon
OK thanks
>>
>>107731751
>tfw this has been such a meme modern llms should be aware of it, and Qwen might be not only inferring the meaning but actually knowing
>>
>>107731810
How would Qwen react to art by Greg Rutkowski?
>>
>>107731807
What cope?
>>
where can i go to post gens without autists chronically lurking and shitting up the thread
>>
>>107731857
>>>/b/degen
>>
>>107731857
https://e6ai.net/
>>
File: 1742952521884562.png (3.32 MB, 1336x2008)
3.32 MB
3.32 MB PNG
>>107731632
Your dataset is probably just bad.

>>107731671
The more complex an image, the more a higher step count will help; in addition to a higher res of course. And foregoing a second pass in favor of generating at the final res outright is more favorable IMO.
>>
File: ComfyUI_temp_sozkl_00001_.png (2.37 MB, 1152x1344)
2.37 MB
2.37 MB PNG
Are there models specifically for blender or is it all under copding? Also are there any local models that actually know code or is it all copium and should just cuck out for claude? 12G vram and 48 GB ram.
>>
>>107731880
>or is it all under copding?
what?
>>
>>107731871
>>107731872
if that's the best you can do then i sincerely believe there's an effort to destroy any kind of constructive ai space
>>
>>107731894
under coding*
>>
>>107731879
Who said that I was training the loras?
>>
File: Sansitre.jpg (255 KB, 2129x1269)
255 KB
255 KB JPG
>>107730683
I can't manage to install ComfyUI fram interpolation.
I tried switching to "local" and "dev" repos as well.
I use Wan portable.
>>
>>107731880
wrong thread anon
>>
>>107731909
There are like 10 discords. 4chan legit sucks for image posting with its absurd captcha
>>
File: 1742944251737192.png (3.98 MB, 2000x1336)
3.98 MB
3.98 MB PNG
>>
>>107731942

anon... did you just admit to getting filtered by the new caption system? DIOS MIOS!!!!
>>
>>107731947
Interesting style. Moar
>>
>>107731958
you mean annoyed? yes. I cba to do it 10 times to post 10 images
>>
>cba to do it 10 times to post 10 images
>but does it 10 times to post 10 times without an image
>>
>>107731972
This lmao
>>
>>107731857
>2ch org/ai - (just use a translate browser extension, this place actually discusses, helps and posts gens despite being a little slow)
>civitai - (unironically)
>>
File: 00026-1455380457.jpg (428 KB, 1344x1728)
428 KB
428 KB JPG
Honestly what's with this shit captcha anyways.
>>
>>107732001
garbage gen, shit quality, shit subject
>>
>>107731972
I wasted 30 seconds doing the retarded captcha again just to say I can bother out of spite, but cba to just post 1 image on what is supposed to be a image board
>>
>>
>>107732027
Looks like a penis haver
>>
File: 1000012651.png (522 KB, 896x1152)
522 KB
522 KB PNG
Is noobai vpred still the best for anime? Have we evolved past sdxl? It's years old at this point.
>>
>>107732027

Genuinely can't tell if this is a real photo of cinna or not lol, bravo! Might also need that catbox to get that real looking quality if it's z-image.
>>
>>107732061
vaultmeat lora needed
>>
File: 1742413120700607.jpg (2.06 MB, 4000x2672)
2.06 MB
2.06 MB JPG
>>107731961
Thanks. Here are some random seeds.
>>
File: 00053-1618771901.jpg (361 KB, 1344x1728)
361 KB
361 KB JPG
>>107732013
>t. nogen
>>
retard here, just trying to train my first ZiT lora and at least from looking at the console it seems to learn quite fast, is that correct? Seems to be as fast or faster than XL, despite being on 12GB.
>>
>>107732084
yeah but it can (and should) be baked for ~4k steps unlike XL
>>
>>
>>107732101

I was getting a little bit of overfitting on smaller 10-15 image datasets around the 2500-2700 step mark using the lokr output and standard settings with AI-Toolkit if anyone wonders about smaller dataset params.
>>
>>107731947
>>107732080
catbox?
>>
>ran a few test prompts through gemini to compare to z-image turbo, any merge/tune/base whatever
>in every single example, z-img turbo was better in every way but especially in prompt adherence
ah. that explains why we're royally fucked for pc parts possibly forever now. we actually HAVE received SOTA, its just most are too retarded to use it correctly. But even if people want to get into the scene now if more people find out, it's too late for them.
>>
>>107732122
desu i wouldnt expect a 15 img set to look any good regardless of settings



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.