/g/ - Technology


Armchair Synthographer Edition

Discussion of Free and Open Source Diffusion Models

Prev: >>107724582

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Blessed thread of normalcy
>>
>>107718988 (Cross-thread)
>Small spacecraft designed in a classic 90s JRPG anime style, inspired by Chrono Trigger and early Final Fantasy — compact and heroic silhouette, rounded yet mechanical forms, hand-crafted look, subtle sci-fi details mixed with fantasy elements, exposed engines and glowing accents, colorful but limited palette, slightly whimsical design, anime proportions, clean linework, retro science-fantasy aesthetic, looks like it belongs in a 16-bit era RPG world, painterly highlights, soft shading, sense of adventure, bronze final fantasy vii aesthetics
>>
>>107727289
this, feelsgood to be back
>>
>>107727269
>>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
I thought the schizo that keeps posting these died? wtf
>>
>>107727290
you forgot the image anon lol
>>
File: 1767240926308.png (25 KB, 736x199)
First for sisyphean schizos
>>
>>107727290
z turbo bigly win always
>>
File: file.png (1.79 MB, 1845x912)
>>107727290
>>107727304
reeeeeeeeeeeeeeeeeeeeeeeeeeeee
>>
File: Lol.png (7 KB, 324x95)
>>107727296
>>
>>107727315
>I mean Z-image looks closer to the prompt to me? It looks FAR MORE 90's jrpg to me. Like advance wars a little bit
>fuck qwen is slopped. Generic ship that is not 16 bit looking at all with a unasked for background. How did they make zimage so good?
>>
File: Sad.png (228 KB, 1080x1928)
>>107727317
Was for
>>107727302
>>
File: file.png (1.68 MB, 1845x912)
>>107727315
>>107727290
>>107727324
>Z-image looks closer to the prompt to me
>FAR MORE 90's jrpg to me
>qwen is slopped. Generic ship that is not 16 bit looking at all with a unasked for background

pic related is a new seed and I changed the 4 step to 8 steps. Stock qwen wants to give me something out of Deponia. qwen with lightning is giving me 90s PC game cover art
>>
File: nonsense.png (1.73 MB, 2626x1482)
>>107727366
the qwen fags think that "more details = less slop" without thinking that it has to follow the prompt first, but I guess for the regular normie that's enough to impress them, they're just gaming the mememarks at this point
>>
File: 00087-1298800317.png (1.49 MB, 1288x976)
>>
>>107727379
the people demand sloppa
>>
>>107727441
Idk man, Z-image base is the most anticipated local model ever and it's because Z-image turbo's unsloppiness
>>
a lot of these models I feel are going in reverse because there's a huge influx of tasteless techbros who want to make the latest AI sloppa. A lot of the style/fidelity issues were already solved in Flux (with loras), yet the latest stuff is still struggling with looking too AI. making every hair look sharper is not really conceptually helping. ZIT also fudges up stuff that should not really be an issue anymore.
>>
>>107727463
ZIT is the best base model ever released period, and the main improvement was not using T5 and clearly an extremely good post-training dataset
>>
File: 1742831859752409.png (2.71 MB, 1445x1771)
>>107727463
>just 2 more loras bro, and this model will be saved!
when will this meme fucking die? Z-image turbo doesn't need this cope to look good
>>
>>107727482
post at least 25 images to back up your statement.
>>
you're getting tricked into defending ZiT (which is not needed) for the thousandth time
>>
>>107727506
you've probably seen thousands of z-image turbo images posted here this last month, the fuck?
>>
>>107727506
>>107727496
two comparisons here and at least 100 posted in the last few threads
>>
>>107727447
yea, if you're looking at 4chan all day
>>
>>107727521
reddit and discord is the same, people were upset when the "surprise" was a shitty qwen update, where else is there to look?
>>
>>107727463
>new thing hits
>people try it out
>they get bored
If you were a real artist the tool wouldn't be important. If not, you are sort of frequenting the local glory hole.
I mean you with I.
I need to reiterate: fuck you hiroshima and your amazon turk captchas
>>
File: 1752911588185839.png (763 KB, 1805x1509)
>>107727521
on tongyi's official discord, the only reaction people had when they realized the "new year's present" was qwen image 2512 was sadness lmao
>>
>>107727523
xitter
>>
https://huggingface.co/lodestones/Zeta-Chroma/discussions/1#6955385ac7b9e3139ae49293
The more I look into this repo, the more I think lodestone has a point, maybe it'll be better to undistill turbo than to take base and do this miraculous RLHF by ourselves, they didn't provide the exact process (not surprising since it's their secret sauce)
>>
>>107727550
>a thousand bot accounts said it was the best!
>>
>>107727537
qwen is hunyuan-tier to me at this point. 3 updates and it's still completely slopped. bloated unsalvageable crap just like hidream.
>>
>>107727558
basically this, the architecture is too old (it's still using Qwen 2.5 vl) and their base model is probably too mid to be saved with "just one more finetune bro(TM)!!"
>>
seriously i dont give a fuck about that ponyfag anymore
>>
They are withholding Z base release until they get something better than Z out as API only.
Qwen 2512 is a failed experiment. Z base will never be release until they strike another gold like Z did.
>>
>>107727569
I never cared about his regular chroma model, but when he went into radiance that caught my attention, one day we won't need the VAE anymore and we'll be able to do edit models that won't compress images anymore, that's my biggest dream right now
>>
>>107727521
It's easily the most adopted in a short time, just the https://huggingface.co/Comfy-Org/z_image_turbo repo has over 3.6 million downloads, and it has only been out for a month.

And it makes perfect sense, best overall quality despite being much smaller and faster. Qwen is the only potential competitor, Flux is dead.
>>
>>107727581
that is not pony / astra, that is lodestone
>>
>>107727579
>until they get something better than Z out as API only.
all they have to do is to scale up Z-image to a 20b model and they can definitely rival Nano Banana Pro
>>107727584
the Qwen Image series is still relevant because of its edit model, but I don't think it'll be hard for Z-image edit to beat, so when that's released it'll definitely be over for them
>>
File: 00100-1343296806.png (989 KB, 1288x976)
>>
>>107727581
>radiance
even bigger failbake than regular chroma
good idea in theory tho
>>
>>107727585
And im not too confident about astra since he not only made dumb mistakes in his past models but chose what had to be the worst possible base for his model, one known to be auto-captioned, ideogram-scraped garbage with the fucking censor cat in its results
https://www.reddit.com/r/StableDiffusion/comments/1e1ktdh/auraflow_sure_does_like_making_the_ideogram/

lodestone has actually made good models in the past though and learned from his mistakes with chroma which to be fair no one had done openly at that scale before
>>
>>107727585
oh, I feel like no one has cared about him for more than a year at this point, his last relevant shit was almost 2 years ago with pony v6 lol
>>
File: ComfyUI_00010_.mp4 (402 KB, 832x480)
Happy New Year!
Goodnight, anons.
>>
>>107727614
kek, nice
>>
>>107727602
Lodestone will be making a 'Z-Image Chroma', he has been openly salivating about it, will be interesting to follow the training since he releases all the checkpoints as it goes along.

Great NSFW understanding is likely the only thing missing for Z-Image to be the one model to rule all at this point, and that requires a finetune, not a lora.
>>
File: deBO_zi_00021_.png (2.46 MB, 2176x1152)
>>
File: 1753045289484911.jpg (606 KB, 1336x2008)
>>
>>107727614
Same, sleep well
>>
>>107727624
why even bother when in two months a better model is going to come out
>>
>>107727734
I think it's gonna take some time before the Z-image turbo's miracle can be reverse engineered by the other companies
>>
>>
>>107727758
Expect safety-rushed western models in the autumn of 2026.
>>
>>107727590
if you could go back in time and paint something like this, you'd be a millionaire
>>
File: 1737716012334529.png (3.32 MB, 1336x2008)
>>
>>107727645
>so lonely he would rather post here
Buckbroken
>>
>>107727771
This guy is the quokka poster of ldg. He was posting this same image in sdg but changed a year or so ago.
Same prompt same image just seed variance. Is it a bot? Hard to say.
>>
>>107727624
this is exactly why nobody ever finetuned chroma despite chromakeks calling it a 'perfect base for finetuning'. even lodestone knew it was shit, which is why he is so eager to switch off of it. chroma is melted crap, a $200k fuckup that finetuners should stay far away from. he'll fall into the exact same pitfalls with his new z-crap as he's impatient and is going to do a shitty de-distillation of turbo resulting in 40 epochs of 'locking in'
>>
>>107727800
truth nuke
>>
>>107727800
chroma is great for what it is, an uncensored flux schnell. The issue is that its size + distillation + other inherent issues of flux schnell being flux schnell make it absurdly expensive to train, to properly finish it would probably cost another 100K+ worth of compute. It's better to start again with a much better and smaller model without most of those issues
>>
>>107727836
I also blame T5 for being a pitfall so many base models fell into, they only became 'good' by throwing absurd amounts of compute to force it to learn with such a shitty encoder. Qwen 4B should be night and day better
>>
>>107727866
also qwen 4B as an encoder opens up the possibility of a local reasoning image model like banana pro
>>
>>107727886
it's using qwen 4b base though, it can't reason since it's not the instruct or the reasoning version
>>
File: 1753558266870940.jpg (804 KB, 1336x2008)
>>
>>107727895
you can use chain of thought with qwen base fine, most base models period really, give it the format and it will autocomplete it. Also their edit model might already do that differently.
>>
File: z-image.png (939 KB, 1319x487)
>>107727895
https://github.com/Tongyi-MAI/Z-Image
>>
Fug, there's a new bread already
>>107725280
>>107725411
If the smaller model has such good motion, then why don't people use 5b model for the first few steps, then switch to the regular high-low 14b setup? Besides, people already use 3 ksampler workflows
>>
>>107727919
wan 5B is utter garbage, no one uses it for a reason
>>
>>107727910
>prompt enhancer empowers the model with reasoning capabilities
it's just saying you use a llm to boomer prompt and give it to the model
>>
File: file.png (1.97 MB, 1120x1312)
sloppa bros... we're home!
>>
>>107727896
prompt? that's a pretty cool concept for local
>>
File: 1749411412326611.png (3.55 MB, 1336x2008)
>>
>>107727269
>he picked the one where the thumb is on the wrong side
(or maybe her hand is turned outward?)
>>
File: 1744016536750797.png (3.96 MB, 1336x2008)
>>107728021
>digital drawing of a blue-skinned girl lifting her skin to reveal a complex, green, and yellow abstract pattern inside her abdomen. the girl's left hand is gripping the skin, which is pulled up to expose the pattern. the girl has a muscular build with defined pectoral muscles and a slight hint of abs. the blue skin has darker blue spots and shading, adding depth and texture. the abstract pattern inside the abdomen is intricate, with overlapping lines and shapes in green and yellow, resembling a biological or organic structure. the background is a gradient of green at the bottom and blue at the top. the drawing uses a combination of thick and thin lines, with the blue outlines providing a strong contrast to the green and yellow internal pattern. the style is reminiscent of modern digital art, with a focus on bold colors and abstract forms. the girl's face is not visible, and the image is cropped just above the waist, focusing on the torso and the internal pattern. the texture of the skin is smooth, with subtle shading to indicate muscle definition. the overall composition is dynamic, with the girl's action and the intricate pattern creating a sense of curiosity and wonder.
>>
>>107728042
thanks, illustrious btfo
>>
>>107727896
nightmare fuel
>>
>>107727734
What makes you think that ? All other new models have been going in the reverse, huge increase in size and lowered performance for very little gain.

This is what made Z-Image such a phenomenon, overall same if not better quality and MUCH faster and smaller.
>>
File: file.png (2.08 MB, 1216x1248)
>>
>>107728064
y do they look mannish about the face
>>
>>107728077
>mannish
dude relent on the sissy trap porn pls
>>
File: file.png (2.33 MB, 1440x1056)
>transcoding new anime releases (computer is literally a jet engine right now)
>generating slop with the free gpu time
>watch anime on the other monitor
>shitpost with my bros in 4chud
living the life
>>
Is Qwen 2512 really much better at prompt adherence than ZIT?
>>
File: 1767250908.png (1.39 MB, 1024x1024)
>>
>>107728059
things are speeding up in the field, a lot of venture capital is getting in and trying to grift. the number and frequency of model releases is increasing and it will continue to do so
>>
File: 1754620588910833.png (2.39 MB, 1168x1752)
>>
File: ComfyUI_09743_.png (2.73 MB, 1280x2048)
when Z image base? wtf are they waiting for?
>>
>>107728110
no, the opposite
>>107727366
>>107727290

https://www.reddit.com/r/StableDiffusion/comments/1q0dsnj/zimageturbo_vs_qwen_image_2512/
>>
>>107728117
>the number and frequency of model releases is increasing and it will continue to do so
based, let's hope it's the case
>>
>>107728120
Senpai genned what I imagine the most: I'm a girl using computer in the 1990s.
>>
>>107728117
we need someone to throw away tens of millions on making a z-video
>>
>>107728117
You need to understand that the venture capital grift is towards benchmaxxing, not giving us good models.
Look at Qwen Image 2512. No way is it better than Z-Image Turbo but according to the benchmarks it is. Numbers, charts, and market leaders are where vcs look, not real world performance.
>>
As I am a 8gb vramlet I have only been using Wan 2.2 5B turbo Q8 at like 704x704 for 5 second clips every 2.5 minutes. It spends most time at VAE decoding, this is normal for video generation right? I get OOM when using the normal vae decode node so I switched to the tiled one. I'm only doing I2V but I feel like my prompts are wrong? Or maybe I shouldn't expect too much of this model? Are there any rules to follow? It's hard to test using non turbo or even 14B as they take way too long. I wouldn't mind switching to them if it meant actual good I2V though.
>>
>>107728160
5B is utter garbage. But to run 2.2 you will need at least 96GB of RAM (system RAM, not VRAM), or else the constant swapping to disk is very slow. maybe try kandinsky 2B, far better than wan 5B
>>
File: 1767251507.png (1.23 MB, 1024x1024)
>>
Trani shat himself after blacking out from drinking cheap alcohol.
>>
>>107727800
>why nobody ever finetuned chroma
It's been out for ~6 months which is when the first SDXL finetunes started coming out, and the actually GOOD finetunes came years later.

Chroma is at least twice the size of SDXL so it will take more time to finetune, and with Z-Base coming there's little reason to use Chroma for finetuning since you have a smaller model with great quality (much better than Flux Schnell which Chroma was based on) to use.

>and is going to do a shitty de-distillation of turbo
Why would he de-distill Turbo instead of training on Z-Image Base models which aren't distilled ?
>>
>>107728149
And you need to understand that the frequency of new model releases is increasing. Competition breeds innovation. Serious money is entering the system now which speeds things up.
>>
>>107728160
You can use 14b if you have at least 32gb of system ram, but you'll need to use multigpu nodes for correct cpu offloading, and also put unload nodes between ksamplers, otherwise it'll thrash your ssd by offloading everything to pagefile
>OOM when using the normal vae decode
That's weird, mine worked fine on 8gb rtx 2080. Are you sure you're not using anything else that hoards your vram?
>>
>>107728117
>the number and frequency of model releases is increasing
All of that increase comes from one company, Alibaba
>>
>>107728180
Forgot to add: don't even bother trying anything larger than Q4 quants for 14b
>>
>>107728170
Damn, everyday one is pushed closer to buying an rtx 6000 pro
>>107728180
Gonna have to read a bit on how that would work, but thanks for pushing me into a direction.
Also the OOM error kinda fixes itself anyway, comfyui automatically switches to tiled vae decoding if it didn't work normally. The problem with that is that you can't choose the settings. I did add the tiled vae decoding node when I was still experimenting with higher resolutions so it could be that it is not needed anymore
>>
File: 1746644324936876.png (2.69 MB, 1168x1752)
>>
>>107728124
>wtf are they waiting for?
western man to bark like starving dog
>>
>>107728263
I barked long enough c'mon, my voice is broken :(
>>
>>107728124
Make it (or rather them) as good as possible within their 6b parameter size, if they get these models right they will dominate the finetuning / lora space for a LONG time to come.
>>
>>107728124
training it still with some light aesthetic tuning so idiots wont cry about it looking nothing like the preference trained one
>>
>>107728293
They still will though. There will be reddit posts about plastic skin and extra fingers and "what went wrongs"
>>
>>107728278
not enough
>>
>>107728309
>caring about what leddit has to say
>>
>>107728284
desu it's not like they only have one chance, if they mess up they can do like the qwen faggot, give us another version next month lool
>>
>>107728329
nope. Only one real chance with public perception which will then determine their valuation. tens if not hundreds of millions ride on them getting it right the first time
>>
>>107728329
Not when we're talking about models made specifically for large finetunes, you don't want to cancel your expensive finetuning after a month because version 2.0 of the base model came out.

It's worth the extra time to make this as good as they can.
>>
File: R E A L I S M.png (2.36 MB, 1822x1643)
>>107728309
reddit thinks Qwen Image 2512 is realistic, so the tongyi fags shouldn't worry about them lool
https://www.reddit.com/r/StableDiffusion/comments/1q0xfvn/realism_with_qwen_image_2512_fp8_turbolora/
>>
>>107728350
this photograph almost looks AI-generated
>>
>>107728350
all the recent reddit posts have been shitting on the new qwen in favor of z-image
https://www.reddit.com/r/StableDiffusion/comments/1q0jxa1/zimage_still_undefeated/
https://www.reddit.com/r/StableDiffusion/comments/1q0dsnj/zimageturbo_vs_qwen_image_2512/
https://www.reddit.com/r/StableDiffusion/comments/1q0f8gc/quick_amateur_comparison_zit_vs_qwen_image_2512/
>>
>>107728357
I hope you're joking with the "almost" cause you can literally see a grid pattern in it
>>
>>107728364
no fucking way this is ai
>>
File: file.png (1.95 MB, 1344x1152)
my gay anime girls cant be this cute!!!
>>
File: 1757928460841797.png (309 KB, 653x565)
>>107728350
that looks terrible holy shit... thank god for Z-image turbo
>>
File: hard to top that.png (537 KB, 680x453)
>>107728370
I couldn't finish the anime, I love yuri tsunderes ( ftw!!) but that shit was cringe as fuck, I felt zero chemistry between the main characters
>>
>>107728370
three-armed eldritch horror
>>
>>107728377
nah bro, faganare is fucking kino, 5 new episodes just dropped too.
>>
>>107728350
to be fair they're using the turbo lora and that shit slops models, makes you wonder how much more realistic Z-image turbo would've been if it wasn't distilled
>>
https://civitai.com/models/2269377
>>
>>107727496
You have no idea about the technology.
Read the paper, z turbo is basically a hardcore fine-tune for realism, while qwen is a universal model.
If you would prompt qwen properly now, you would also get decent quality.

You'll understand it once base is released, maybe.
>>
File: ComfyUI_00020_ (1).png (2.41 MB, 1440x1440)
there really is no need for a base model
https://civitai.com/models/2268614/catus-rexs-claymation-style-zit?modelVersionId=2553559
>>
>>107728433
>If you would prompt qwen properly now, you would also get decent quality.
prove it
>>
File: ComfyUI_00032_ (1).png (825 KB, 512x1024)
>>107728433
nah, it's not limited to photography. If anything qwen is more style locked to photography than it is.
>>
>>107728444
>bghira
>>
File: 1749285580063827.jpg (568 KB, 1920x1080)
>>107728433
>z turbo is basically a hardcore fine-tune for realism, while qwen is a universal model.
what a bunch of bullshit, Z-image turbo is insane everywhere
>>
>>107728433
then why is qwen far more "realism" biased than z-image?
>>
File: why are you lying?.png (3.42 MB, 2560x819)
>>107728433
>qwen is a universal m-ACK
>>
>>107728452
as you can see Qwen is far far more style locked than zimage is. It turned the clay character into a shitty mix of a photo of a man
>>
>>107726230
>ani having time of his life while schizo keeps pooping himself and stealing threads to insert his personal grudge into the op again, despite anons agreeing that it's off-topic
love to see this
happy for you ani you magnificent bastard
>>
>>107728452
>>107728456
those retards finetuned Qwen on "realism" so much it's starting to lose non realism concepts kek
>>
>>107728438
You can train Turbo surprisingly well despite it being distilled, and it's not as if the De-Turbo solution currently used when training it is as far as you can push the de-distillation either.

That said, beyond lora training you really want an undistilled base model with less aesthetic tuning/bias in order to easily add new concepts (like NSFW, wide range of distinct artstyles etc).
>>
>>107728433
6.5/10 bait
>>
>>107728442
Am I your nanny?
Add “casual amateur street photography taken with a Canon EOS 5D” or whatever to the prompt. And leave out all the imagery, just describe the picture instead of a fever dream from an esoteric yoga picture book.

The reason your z image doesn't produce crap with this prompt is because it has nothing but this realism in its weights
>>
>>107728457
Wait, the links were re-added? How the fuck didn't I notice this. I've been posting to a schizo thread the whole time...
>>
>>107728466
>makes a claim
>has the burden of proof
>doesn't want to get rid of that burden
Concession Accepted. Thanks for playing.
>>
File: this.png (1.71 MB, 1280x720)
>>107728438
>there really is no need for a base model
yes there is, don't start to gaslight us chang!
>>
>>107728466
you have it backwards buddy:
>>107728452
qwen is the model locked so hard onto photography it can't do anything else
>>
>>107728468
yeah the mods are probably busy with irl and aren't monitoring the threads so the schizo can run amok again
>>
>>107728284
i guarantee they're not delaying it to 'make it really good for all of us to enjoy :)"
they went completely radio silent regarding the base model. the same base model one of the team members predicted would be released the weekend after turbo. something wrong is going on internally. if they were just making it better they would've communicated that like they did with Qwen. now it's a month since turbo and there hasn't been a single update on why it isn't released yet. i smell saas
>>
>>107728479
>>107728478
>>107728471
anyone baking real bread?
>>
>>107728433
>z turbo is basically a hardcore fine-tune for realism
yet it can do pretty good anime images
>>
>>107728480
>>107728457
shut the fuck up ani
>>
>>107728483
>now it's a month since turbo and there hasn't been a single update on why it isn't released yet.
they merged the inference/training code on diffusers though, that's something
https://github.com/huggingface/diffusers/pull/12857
>>
>>107728495
what it actually is is a hardcore fine-tune on human preference across many styles
>>
>>107728480
>mods
>busy with irl
>>
>>107728483
>i guarantee
Which is worth about as much as a lukewarm cup of piss

>something wrong is going on internally
Sure Jan

>they would've communicated that like they did with Qwen
More like it's imminent, but they are finetuning small, also since this is a model(s) explicitly made for further training, testing is more time consuming since you need to train on the trained models to see how well they work.
>>
>>107728457
I'm new to this general and really it's pretty bizarre. No other thread allows adding links that are effectively flamewar instigation and looks like nothing more than a personal vendetta. I've seen generals get banned for less than this.
>>
>>107728514
I'm curious to know if they'll redo a Z-image turbo 2 or some shit, if it's taking them so much time to finish it it means that base was really undercooked and yet they used it as a teacher to make ZiT
>>
New to this space, just downloaded ComfyUI and got a pop-up asking me to deposit credits to use the world's most powerful image generation models. I put in a meager $50 to get started, now what do I prompt with first?
>>
File: nooooo.gif (3.95 MB, 640x640)
>>107728501
>https://github.com/huggingface/diffusers/pull/12857
>2 weeks ago
The delay between Qwen Image Edit 2511 and its release was 2 weeks...
>>
>>107728510
they are people like most of us. unlike schizo ready to snipe every bake to insert his drama
>>
>>107728524
>it means that base was really undercooked and yet they used it as a teacher to make ZiT
The only thing the initial base model needed to do was to output aesthetically pleasing images on which Turbo was trained.

The new base model(s) need to be great to train on, very different priority.
>>
>>107728549
that is an entirely different form of distillation, it's just cfg / step distilled
>>
>>107728535
You are emphatic. I can actually sense your soft power because I am similar.
>>
>>107728549
>The only thing the initial base model needed to do was to output aesthetically pleasing images on which Turbo was trained.
that's not how it worked at all, base went through SFT, then the distillation, then the RLHF to get turbo
>>
https://huggingface.co/Phr00t/WAN2.2-14B-Rapid-AllInOne/tree/main/Mega-v12
Is this shit any good? I'm a vramlet, and switching between high and low models takes forever
>>
File: 33_stars_schizo.png (1.4 MB, 1080x6598)
>>107728535
>>107728517
>>107728480
>>107728468
>33 stars
>>
>>107728517
report it to the mods in irc if you want
>>
>>107728564
Yes, but it still only needed to output aesthetically pleasing images, which is why it went through those steps, it being good for finetuning was never a priority at all.

Again, the first base model had one objective, create as good looking images as possible within a 6b parameter budget, with a focus on photography and human realism.
>>
>>107728577
Don't engage with trolls, just ignore and report, the filth will be cleaned at some point
>>
File: file.png (27 KB, 676x211)
>>107728575
It is if coom gens is all you care about
>>
>>107728517
>I'm new to this general
lel

>allows adding links that are effectively flamewar instigation
It's only one person trying to create a flamewar, and that's you and the mods obviously know it's only you
>>
>>107728549
>output aesthetically pleasing images on which Turbo was trained.
Not how that works at all...
>>
Is a 5070ti enough for video slopping?
>>
File: ComfyUI_00160_.jpg (131 KB, 560x896)
just tried qwen 2512, zit cant do this
>>
>>107728599
it's for sure the best bang for the buck if you can't manage a 5080 / 5090
>>
>>107728599
but be aware, for wan2.2 offloading you will need at minimum 92GB of RAM on Windows. Linux could probably get away with like 76GB or something
>>
>>107728517
it is off-topic and flamewar instigation. mods already made their will known and were deleting threads and posts containing those links. issue is that they are likely gonna be busy for a few days and won't be able to clean up this mess so we need to make it known that we don't support this shit in the op.
schizo is very dedicated but we can defeat him, as evidenced by a couple threads without this garbage in op post
>>
>>107728607
>zit cant do this
can't do what? it doesn't seem hard to make
>>
>>107728589
Ani is talking to himself again, he's pretending the mods are not deleting all mentions of him or his project due to confirmed samefagging and shilling. The links have been in 95% of all /ldg/ threads this has been confirmed.
>>107728595
I feel insulted that trani thinks so little of the average /ldg/ poster to think the same tired routine works.
>>
>>107728616
yeah those threads were best in a long while, very productive and cozy.
>>
>>107728616
The links are only there until mods return to clean up your shilling Julien.
>>107727317
>>
>>107728628
>The links have been in 95% of all /ldg/ threads this has been confirmed.
that's appeal to tradition and it's stupid. we never had a proper conversation about this. after we did, it became obvious that the links simply do not belong. slavery was normal for most of human history, doesn't make it right. mods were removing those links for the last few days but you knew they'd be too busy with irl obligations to deal with your bullshit.
>>
The reason why we don't have Z-Image Base yet is that they have to follow Alibaba's internal review process.
That's why you see "model coming soon" so much from different teams cause internal reviewers slow down the entire process.
It's the sad and boring truth but that's how it is.
>>
>>107728645
Julien, have you ever tried posting in a different style to not make it so obvious?
>>
>>107728652
We don't have Z-Image Base but we got the latest Z-Image Cope version
>>
>>107728652
you will see if the base model is no longer capable of nudity
>>
>>107728653
you can keep coping that anyone who defends ani is him but that obviously is not the case. you started calling everyone ani because you have no arguments schizo
>>
>>107728645
You're free to go back to /sdg/ if you disagree with the basis on which /ldg/ was founded.
>>
>>107728665
and what basis is that? i thought it's local diffusion discussion, not your personal grudges tran
>>
File: 1766604556374.png (130 KB, 711x699)
>>107728662
You're not fooling anyone trani
>>
>>107728675
so just your edited screenshots, no arguments again
>>
>>107728673
>everyone that disagrees with me is ran
Hi Julien
>>
>>107728683
who's the julien guy? your bull?
>>
>>107728681
>denies easy to verify things
Classic Julien
https://desuarchive.org/g/thread/107657385#107658056
>>
>>107728688
Anon is in his cuck shed imagining some French guy is having sex with his teenage wife.
>>
>>107728688
He made tranistudio to cope with the loss of his previous bvll
>>
>>107728596
Please enlighten us
>>
File: 1746470534492184.png (64 KB, 834x516)
comfyuibros... when are we getting this natively?

https://github.com/SparknightLLC/ComfyUI-GraphConstantFolder
https://github.com/SparknightLLC/ComfyUI-GraphConstantFolder
https://github.com/SparknightLLC/ComfyUI-GraphConstantFolder
>>
>>107728696
>>107728675
>>107728681
Ani is looking pretty schizo ngl
>>
>>107728350
>clarity slider: MAX
total pixel vomit
>>
>>107728658
Z-Image base is going to be censored because it was trained on a censored dataset from the start.
They even gloat about it in the paper how they used classifiers to filter their dataset away from synthslop and erotic content.
>>
>>107728715
comfy doesn't care. he basically wants the community to create everything new for his project but when the custom nodes break with every update he pretends like it's normal. it's so fucking painful to see. we really need something fresh and fast to dethrone cumfart
>>
>>107728615
Holy moly. Any way I can cut that down?
>>
>>107728715
>summary is written with ai
If you are a techlet you should not pay attention to any of these snake oils.
>>
>>107728732
crazy how comfy is both bloated and underdeveloped at the same time. new updates don't bring new features, they just break stuff. he's clearly in it just for the corpo bucks
>>
>>107728732
Techlet detected.
There's only one way tokens get to a model. Everything that happens afterwards is up for debate.
>>
>>107728756
>responding to troll/llm
lol
>>
>>107728784
99% of posters in the most popular boards might as well be bots. I reply because I get to be a bit creative and brush up my English (I'm originally from Bangladesh/Burkina Faso, dual citizenship, now almost a US citizen)
Real threads are somewhere else.
>>
>>107728725
When are the porn industry big shots going to get into this and create coomer models? I really can't imagine them witnessing all this and not thinking they can get in on the action.
Conversely, why is everyone making these models so prude? Of course people are going to create porn with them and "jailbreak" them however they can. So why not just enable that from the start.
It's such a weird thing to enforce no-nudity on local models to begin with.
If they want to cover their ass they can simply have the clip part be no-nudity in the official release and then release a nudity one on the side for the coomers, so it's not baked into the model proper.
>>
>>107728815
Real truth is that even if you don't sense it, porn works also with energy. If you, a retard, are not sensing it in the video, it is not good enough for your current mood.
For chronic masturbators like yourself maybe ai is suitable. still hard to imagine.
>>
>>107728833
So you think progress with AI models will stop today and the stuff we have right now will never further improve, right? Am I hearing that correctly? Just want to clarify. You're posting on /g/ and your position is that nothing about this will further improve?
>>
>>107729142
for wan chads playing with new toy.
>>
Anyone know a good model for low res pixel art, like 32x32 stuff?
>>
>>107729174
also, what i'm finding is that motion frames might be bullshit: if you keep using the same prompt, they can make your 1girl repeat the same limb motion over many clips, like someone under the effects of nerve agent...
>>
>>107728869
>hearing
That would be reading unless you are delusional.
>>
so it's probably not necessary, i will test this. the anchor frame is what's good, so that 1girl stays the same. but we don't want the motion to continue or it will keep doing it...
>>
>>107728736
anon is trolling as always since 2.2 release
32 GB of RAM is enough
>>
>>107728736
>>107729234
You would need over 90gb if you want both high and low models loaded simultaneously for quick swaps, with 32gb it would be quite slow as each time a model would have to be unloaded first. Not to mention that if you're on Windows, its godawful memory management would absolutely affect your pagefile, and thus hurt your ssd
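
A rough way to sanity-check numbers like these is plain arithmetic: parameter count times bytes per parameter. The sketch below is only that, under assumed sizes (two 14B-parameter models at 2 bytes per parameter, chosen for illustration). It leaves out the text encoder, VAE, latents, the UI and the OS, which all come on top, so treat the result as a floor rather than a requirement.

```python
def weights_gb(params_billion: float, bytes_per_param: float) -> float:
    """Approximate size of one model's weights in GB (1 GB = 1e9 bytes)."""
    # 1e9 parameters * bytes per parameter / 1e9 bytes per GB
    return params_billion * bytes_per_param


def resident_gb(models: list[tuple[float, float]], overhead_gb: float = 0.0) -> float:
    """Memory needed to keep every listed model loaded at the same time."""
    return sum(weights_gb(p, b) for p, b in models) + overhead_gb


# Assumed, illustrative sizes: (billions of parameters, bytes per parameter).
HIGH = (14, 2)
LOW = (14, 2)

both_loaded = resident_gb([HIGH, LOW])   # keep both for quick swaps
one_at_a_time = resident_gb([HIGH])      # unload one before loading the other

print(f"both loaded: {both_loaded} GB, one at a time: {one_at_a_time} GB")
```

Lower-precision weights (for example 1 byte per parameter) halve these figures, which is one way to cut the requirement down.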
>>
>>107728736
you use a swapfile/partition on linux or the pagefile on windows, but if that is on a standard hdd it will be unbearably slow. So go to a second hand store and buy a cheap SSD and have it as your swap and not worry about it getting trashed from all the writes.
>>
>>107729256
also on linux (you should be using linux...) these are the fstab fields that go after the device name for the swap drive

none swap discard 0 0

then it will not fuck itself and oom. the discard option means it will free blocks on the fly; not doing this could mean the drive runs out of free blocks and needs a trim command and remounting...
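
A minimal sketch for checking whether your swap entries actually carry the `discard` option. It parses fstab text by whitespace-separated fields (device, mount point, type, options, dump, pass). The device names in the sample are hypothetical, and the function name is made up for this example.

```python
def swap_lines_without_discard(fstab_text: str) -> list[str]:
    """Return fstab swap entries whose options field lacks 'discard'."""
    missing = []
    for raw in fstab_text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blanks and comments
        fields = line.split()
        # fstab fields: device, mount point, type, options, dump, pass
        if len(fields) < 4 or fields[2] != "swap":
            continue
        options = fields[3].split(",")
        # accept plain 'discard' as well as 'discard=once' / 'discard=pages'
        if not any(o == "discard" or o.startswith("discard=") for o in options):
            missing.append(line)
    return missing


# Hypothetical device names, for illustration only.
sample = """\
# <device>  <mount>  <type>  <options>  <dump>  <pass>
/dev/sdb1   none     swap    discard    0       0
/dev/sdc1   none     swap    sw         0       0
/dev/sda2   /        ext4    defaults   0       1
"""

flagged = swap_lines_without_discard(sample)
print(flagged)
```

On a real system you could pass in `open("/etc/fstab").read()` instead of the sample string.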
>>
and believe me, depending on your system, wan could use 50 GB of swap space if you only have 16/32 GB of physical ram, and that swap drive will use all its blocks in no time before needing to free them and make them available again.
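
To see how far RAM plus swap falls short of a target, you can read `MemTotal` and `SwapTotal` from `/proc/meminfo` on Linux. The sketch below parses that format from a string so it runs anywhere. The sample values and the 64 GiB target are assumptions for illustration, not measured Wan requirements.

```python
def parse_meminfo_gib(text: str) -> dict[str, float]:
    """Extract MemTotal and SwapTotal from /proc/meminfo-style text, in GiB."""
    wanted = {"MemTotal", "SwapTotal"}
    out: dict[str, float] = {}
    for line in text.splitlines():
        key, _, rest = line.partition(":")
        if key in wanted:
            kib = int(rest.split()[0])  # values are reported in kB (KiB)
            out[key] = kib / (1024 * 1024)
    return out


def shortfall_gib(meminfo_text: str, needed_gib: float) -> float:
    """How many GiB are missing once RAM and swap are added together."""
    info = parse_meminfo_gib(meminfo_text)
    available = info.get("MemTotal", 0.0) + info.get("SwapTotal", 0.0)
    return max(0.0, needed_gib - available)


# Assumed sample: 32 GiB RAM and 16 GiB swap.
sample = "MemTotal:       33554432 kB\nSwapTotal:      16777216 kB\n"

missing = shortfall_gib(sample, needed_gib=64)
print(f"short by {missing} GiB")
```

On Linux, replace `sample` with `open("/proc/meminfo").read()` to check the machine you are on.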
>>
comfy will offload models into swap, which you want, because reloading them from your multi-TB hdd is much slower than loading them from an SSD or ram. it's all about the read speed (MB/s), anons :)

You want the swap of data to happen as fast as your machine can do it.
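
The read-speed point comes down to size divided by throughput. A back-of-the-envelope sketch, where the sequential read speeds and the 28 GB model size are assumed ballpark figures rather than benchmarks of any particular drive:

```python
def load_seconds(size_gb: float, read_mb_per_s: float) -> float:
    """Time to read size_gb from a device sustaining read_mb_per_s (1 GB = 1000 MB)."""
    return size_gb * 1000 / read_mb_per_s


# Assumed ballpark sequential read speeds in MB/s.
ASSUMED_READ_MB_S = {"hdd": 150, "sata_ssd": 500, "nvme_ssd": 3000}
MODEL_GB = 28  # assumed model size, for illustration

estimates = {
    name: load_seconds(MODEL_GB, speed)
    for name, speed in ASSUMED_READ_MB_S.items()
}

for name, secs in estimates.items():
    print(f"{name}: {secs:.0f} s")
```

Swap access is often random rather than sequential, so real numbers on an hdd can be much worse than this estimate.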
>>
A cheap ass mobo can hold like 128 GB physical ram but have you seen the ram prices? You knew that was gonna happen once people understood that offloading is a thing due to 24GB vram not even being enough. Next will be the SSD drives, they will come for those also.
>>
>>107729174
Pic related, i was correct: merged clips are now 99.9% seamless. Your brain does not notice unless you've done a frame-by-frame study first; otherwise the video has no jarring jumps. Now i'm testing dropping the motion frames to 0, as i suspect they're not what we want; we only ever needed the reference frame zero enforcing the character's appearance between clips, not the motion.

Same prompt and continued motion frames will cause characters to animate badly, like constantly lifting their leg over and over, etc.

I was too drunk last night to figure this shit out. Today is a good day.
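
The post doesn't spell out the workflow, so here is one generic way a seam can be kept clean when merging. It assumes each clip is generated starting from the last frame of the previous one, which means that frame shows up twice and the duplicate has to be dropped. Frames are plain string labels here instead of image arrays, and `stitch_clips` is a made-up name.

```python
def stitch_clips(clips: list[list[str]]) -> list[str]:
    """Concatenate clips, dropping the repeated anchor frame at each seam."""
    merged: list[str] = []
    for clip in clips:
        # If this clip starts on the frame the previous one ended on,
        # skip that first frame so it is not shown twice.
        if merged and clip and clip[0] == merged[-1]:
            clip = clip[1:]
        merged.extend(clip)
    return merged


# Each clip begins with the final frame of the clip before it (assumed setup).
clip_a = ["a0", "a1", "a2"]
clip_b = ["a2", "b1", "b2"]
clip_c = ["b2", "c1"]

merged = stitch_clips([clip_a, clip_b, clip_c])
print(merged)
```

With real frames you would compare arrays instead of labels, or simply always drop the first frame of every clip after the first.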
>>
File: 1745403819846963.png (1.34 MB, 1024x1024)
a somali man standing outside a daycare in Minnesota in the snow. The building has a sign that says "Quality LEARING center" above a worn down door.

will z image turbo ever be beat? base when?
>>
>>107729520
>Now i'm testing dropping the motion frames to 0 as i suspect its not what we want,
I am wrong here: dropping the motion frames just starts the video from the reference frame. Erm, I have to figure out how to stop that repetitive motion; perhaps it's purely because I'm using the same seed and same prompt. But yeah, i will have the perfect workflow done today.
>>
File: 1740171358271966.png (1.87 MB, 1080x1544)
the anime girl in image1 is sitting in a car. keep their appearance the same. she is wearing a business suit.

qwen edit 2511, img source: project diva miku

classy!
>>
>>107729616
Now remove cl
>>
>>107729538
The hairline is too low to be believably somali
>>
>>107728433
begone qwencoper
>>
File: 1743869890377897.mp4 (745 KB, 848x480)
we're so fucking back
>>
>>107727584
>Flux is dead.
Flux is the SAFEST, please.
>>
>>107729663
if I was more detailed with the prompt I could replicate their gigantism head. zit is very good, along with edit. in conjunction you can make great wan 2.2 stuff too.
>>
File: 1761623969611915.jpg (1.96 MB, 1248x1824)
>>
File: 1748727431826962.jpg (1.48 MB, 1248x1824)
>>
File: 1761386735860378.jpg (1.34 MB, 1248x1824)
>>
File: 1737105060362746.png (1.29 MB, 1080x1544)
>>107729616




All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.