[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Beijing Time Tracker Edition

Discussion of Free and Open Source Diffusion Models

Prev: >>107855134

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Thanks for the bake, alt collage
>>
Can any kind anon with a dual 24gb GPU setup (two 3090s or two 4090s) share a good LTX2 workflow for that setup?
>>
>>107858102
>>107858106
Cool collages
>>
Blessed thread of frenship
>>
>>
>>107858102
https://dagroup-pku.github.io/MHLA/
that's interesting
>>
I didn't expect the glm fags to dethrone Z-image turbo but I expected something decent, but their model is really bad, holy shit, looks like only Alibaba can make good LLMs and image models at the same time
>>
>>
>>107858132
>good LLMs and image models at the same time
Lol, lmao even. Qwen only recently got good with their latest model, the edit version is still somewhat slopped though.
>>
>>107858157
I said Alibaba, and Z-image turbo + Qwen was made by Alibaba
>>
>>107858121
>>107858133
cute gens and music i forgot how nice illust can look
>>
Chinese quiz culture.

You’ve created a model. It’s really popular and have said you will like to release the base model too as part of your open source ecosystem. However, due to factors beyond your control your parent company has decided to forbid the release of the model for various reasons do you:

A) release the model anyway

B) act as if there is no issue and provide non substantive updates to various scripts in the hope people get bored and forget about it

C) explain the situation. Letting people know of your parent companies decision and the rationale behind it.

D) take the fall for the parent company’s decision.

This constitutes 10% of your final grade for the unit on Chinese culture. Answer wisely.
>>
>>107858161
Right, let me know when they stop releasing slopped video models. Even Tencent are better than them.
>>
WHERE IS Z-IMAGE BASE? Dumbledore, said, calmly
https://www.youtube.com/watch?v=IdoD2147Fik
>>
File: 1738059644318037.mp4 (455 KB, 1064x720)
455 KB
455 KB MP4
>>107858170
>Even Tencent are better than them.
Time to bring that kino
>>
>>107858165
Shame no Chinaman would ever leek the goods
>>
Why doesn't any native tiled vae decoder work for wan 2.2?
>>
>>107858171
>Worse than Z Turbo according to their chart
>Still not released
They're not giving us the "base" model. They're busy giving us a model that they call base, but in reality they're keeping the base model to themselves for cloud offerings.
>>
File: 1741002930541980.png (2.02 MB, 1216x1216)
2.02 MB
2.02 MB PNG
>>
File: 1746371995545893.png (1.85 MB, 1216x1248)
1.85 MB
1.85 MB PNG
>>
>>107858175
>but an alien with scrawny arms WOULDNT be able to do a pushup so hunyuan is more realistic
>but how strong is the artificial grav on his ship?
>well the lighting in hunyuan....
>
>>
File: 1754661661508755.png (2.3 MB, 960x1568)
2.3 MB
2.3 MB PNG
>>
File: 1742564109029206.png (1.96 MB, 1152x1312)
1.96 MB
1.96 MB PNG
>>
File: 1742623442083933.png (62 KB, 640x360)
62 KB
62 KB PNG
>>107858165
>A) release the model anyway
if they do that they'll probably end up in jail for the rest of their life, this is China we're talking about
>>
File: 1743745831418482.png (2.36 MB, 1280x1184)
2.36 MB
2.36 MB PNG
>>
>>107858165
E) merge the training/diffusion code in various project, promise to release the weights in 2 more weeks, renew the promise every week
>>
>>107858191
>an alien with scrawny arms WOULDNT be able to do a pushup
he's in space he can do pushups he has no gravity to make it hard
>>
Chinese technology molded by the Western Man is such a beautiful sight to behold. Models are a unique vessel for this sort of cultural interplay.
>>
>>107858202
at some point they'll be running out of projects to merge lool
>>
>>107858165
maybe they're waiting for a worthy rival to appear so that they can crush them like they did with Flux 2
>>
>>107858202
That's just B
>>
man I love 1girl slop
>>
>>
>>107858192
Catbox anon
>>
WHY IS VAE DECODE TILED RUINING THE GEN REEEEEEEEE
>>
>>107858265
you need bigger temporal size
>>
>>107858265
always have a temporal size value superior to your number of frames, you can't cheap this one out
>>
File: Video_00001 (3).mp4 (379 KB, 720x704)
379 KB
379 KB MP4
>>107858266
>>107858269
Man, that's gay.

I spent like 20min genning a 1080p video and it was broken. They really need to fix preview of video genning, just let us view a single frame.
>>
File: 1738037387370336.png (396 KB, 1497x1248)
396 KB
396 KB PNG
https://xcancel.com/bdsqlsz/status/2011068298532946274#m
RANDOM ANIME CHINESE MAN SAID "THIS MONTH", so the worse case scenario would be in 2 weeks(TM)
>>
Is there any good way to generate a spritesheet for an animated sprites yet or are they still ask terrible?
>>
>>107858291
>no way but maybe
I can see through your lies chang
>>
File: ComfyUI_01577_.png (1.05 MB, 720x1280)
1.05 MB
1.05 MB PNG
>>107858247
me too anon
>>
File: wtf is that.png (2.77 MB, 1280x1280)
2.77 MB
2.77 MB PNG
GLM image is fucking terrible, AR models will never be the future of imagegen
>>
File: 00117-1528102715.jpg (294 KB, 1728x1344)
294 KB
294 KB JPG
>>
>>107858391
Nano Banana Pro is Autoregressive and it's the best image generator by far.
>>
>>107858396
>Nano Banana Pro is Autoregressive
proof?
>>
>>107858395
is this zit or chroma? If it's zit, what lora did you use?
>>
File: ComfyUI_01581_.png (949 KB, 720x1280)
949 KB
949 KB PNG
>>
>>107858411
It's zit.
There is no lora. Loras in zit are shit, I only use them when I absolutely need to, because they always destroy quality.
It is a finetune Lexivision. With Euler a / Beta. However I think original model would look similar.
>>
File: ComfyUI_01583_.png (3.09 MB, 1440x2560)
3.09 MB
3.09 MB PNG
need more milkers
>>
damn, GLM image failed my "a 2d cat with a 3d tail" test. time to wait for the next toy
>>
File: imageGallery.jpg (744 KB, 1439x1436)
744 KB
744 KB JPG
>>107858482
>>
>>107858436
Just how dumb are you, holy shit
>>
>>107858515
Having a seizure or something?
>>
File: output_t2i.png (1.59 MB, 1152x1024)
1.59 MB
1.59 MB PNG
>>107858391
glm-image isnt that smudged for me but it still looks like shit yeah
>>
>>107858291
Do people here ever feel stupid for being strung along like this?
>>
>>107858436
shift?
>>
>>107858482
zai bros.... we got too cocky!!!!
>>
File: 1761752272635195.png (2.67 MB, 1056x1440)
2.67 MB
2.67 MB PNG
>>
File: 1749825358877163.png (2.34 MB, 992x1504)
2.34 MB
2.34 MB PNG
>>
>>107858641
>Do people here ever feel stupid for being strung along like this?
I know they're making fun of us but what else do we have? they're literally the only company on earth that can make decent image models
>>
File: 1746547936049894.png (2.16 MB, 1536x1080)
2.16 MB
2.16 MB PNG
GLM-image kinda sucks as an edit model too
>>
File: 1768344275732300.png (418 KB, 735x722)
418 KB
418 KB PNG
>GLM fucking sucks
>Z-Base in 2 more weeks
I can't take it anymore it's never been more over
>>
>>107858291
>the worst case scenario would be in 2 weeks(TM)
she knew it
https://files.catbox.moe/9z9m3h.mp4
>>
File: fml.png (89 KB, 279x180)
89 KB
89 KB PNG
>>107858749
if we reach Chinese's new year (Feb 17, 2026) without a Z-image base release then it's fucking over
>>
>>107858747
hmm, I think something might be wrong on your side, shit's overcooked. I saw some examples where shit didnt come out as fried as this.
>>
File: 1758298804072665.png (1.79 MB, 1280x720)
1.79 MB
1.79 MB PNG
>>107858749
>>107858756
>>107858772
Remember, Gwello, if it's good, it won't be local. Thank you for participating in Chinese culture.
>>
>>107858690
8
>>
File: 1762759299376372.jpg (47 KB, 828x798)
47 KB
47 KB JPG
Retard here, what does shift do? I don't remember using it with XL.
>>
>Someone took my 2 weeks total recall video from here and is using it to advertise their own workflow on reddit

Like whatever but... that didn't come from your workflow.
>>
>>107858978
>going on reddit
1st mistake
but yeah you expect indians to give a fuck about your shit? lol
>>
>>107858978
>2 weeks total recall video
wtf is that
>>
>>107859011
it's this kino >>>/wsg/6071819
>>
>>107859011
Since catbox seems to be down.
>>>/wsg/6072500
>>
>>107858827
It reshapes the curve of the sampler's steps for denoising. The greater the shift the more steps are spend on early denoising while with shift of 1.0 the steps are evenly distributed.
>>
>>107859018
verhoeven made that with ltx2?
>>
File: 1753629527656642.png (2.51 MB, 1344x1152)
2.51 MB
2.51 MB PNG
>>
>>107858978
and people wonder why some anons scrub metadata or don't catbox
>>
that's why no one here deserves my premium 1girl prompts
>>
>>107859035
kek



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.