[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


https://files.catbox.moe/9nxonh.safetensors Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107426097

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>gm nigbos
>>
WHERE IS THE FOOTJOB ZIMAGE FINETUNE
>>
File: 1735795733244371.webm (1.92 MB, 624x816)
1.92 MB
1.92 MB WEBM
>>107426175
>
>>
File: Flux2Img_00037_.png (2.66 MB, 1152x1440)
2.66 MB
2.66 MB PNG
>>
File: Wanimate_00081.mp4 (1.26 MB, 960x544)
1.26 MB
1.26 MB MP4
>>
>>107430223
strong girlboss energy
>>
did flux1 really only come out late last year? seems a lot longer ago
>>
File: Wanimate_00082.mp4 (1.06 MB, 960x544)
1.06 MB
1.06 MB MP4
>>
>>107430129
>Get a shitty gen
>Inpainting would require the model to reason over what it's inpainting and know what goes there and what doesn't
>But other aspects of image perfect, need quick fix
>Tell NBP to fix the details, not even additional context
>It just does it perfectly

Well it's over bros isn't it? No way we'll ever get anything that comes close to that.
>>
How do you guys save zy plot grid images?
I made a plot but it saved all the images individually. and I can't get the combination that was used out of the metadata..
>>
File: 1764715069061436.png (1.29 MB, 1120x1008)
1.29 MB
1.29 MB PNG
We love or we hate rice bunnies, but we are never indiferent to them.
>>
File: file.png (524 KB, 508x647)
524 KB
524 KB PNG
I can generate stuff with WAN in Wan2GP, but 16fps is too choppy for me. I'm seeing a lot of smoother stuff in Civitai.
How do I do that?
>>
>>107430424
if there's not enough frames then just make more
>>
Has anyone come to a consensus on how to caption z-image lora datasets? Should you go full prose or are tagged keywords still good? I really don't want to recaption my hundreds of images.
>>
File: 1528897148347.jpg (16 KB, 400x400)
16 KB
16 KB JPG
Anyone else has their zit lora output just noise? Is something wrong with the v2 adapter for ostris?
>>
>>107430424
I use the optical flow frame interpolator on Adobe Premiere Pro (it's easy to pirate). Works great for subtle animations but will fuck up if there are quick or erratic movements.
>>
>>107430429
you should caption as original dataset model trained was
>>
>>107430424
Yeah you should use the VFI Film Interpolation nodes to interpolate to 32fps it just werks
>>
>go through the fucking effort of getting shitty fucking custom nodes to make an zy plot
>get it all installed and workflow set up
>the cunt loader cant load zit

FUCK YOU
>>
File: Tummy.webm (3.92 MB, 1440x1440)
3.92 MB
3.92 MB WEBM
>>107430424
I use Topaz Video AI (an old version I paid for, not the new subscription model) and output to 60fps.

>>107430429
I used Gemma 3 27B and had it describe everything in the image, my first (Jenny Nicholson) LoRA attempt has been pretty flexible so far.
>>
>>107430505
there's a vlm adapter for gemma 3?
>>
File: image.png (178 KB, 300x300)
178 KB
178 KB PNG
>>107430486
>>
>>107430442
settings issue
>>
File: 3056223002.jpg (3.41 MB, 2432x1664)
3.41 MB
3.41 MB JPG
>>
https://github.com/SaTaNoob/ComfyUI-Z-Image-Turbo-Resolutions?tab=readme-ov-file#1536-resolution
>A ComfyUI custom node that provides quick access to all image resolutions for the Z Image Turbo model, sourced from its official Hugging Face Space.
>1536x1536 (1:1)
damn it can offcially go that high? impressive
>>
File: flux2_00005_.jpg (769 KB, 3328x1792)
769 KB
769 KB JPG
>>107430424
If you want it 'for free' after the set-up, TensorRT interpolation can interpolate to 32 frames in under 2 seconds.
>>
>>107430551
I can gen 2048x2048 without issue.
>>
I've been really enthusiastic about ai for a long while but I feel like the openai ram shortage bullshit is flipping my attitude and it somehow affects my attitude towards local ai as well
>>
>>107430621
local ai is freedom though, it's the antithesis of the major tech companies gobbling everything up and having to access ai through them
>>
>>107430621
>feminine blogpost
>feminine brained failed normgroid who lets other npcs affect his emotions and actions
thank you for contributing such a valuable contribution that is the eternal reminder of the mental retardation of the average person of this world to everyone who can see
>>
File: 1755680967437979.jpg (1.21 MB, 1248x1824)
1.21 MB
1.21 MB JPG
>>
File: 1735571944978282.png (437 KB, 847x974)
437 KB
437 KB PNG
DUDE WTF ARE THOSE PRICES??
>>
>>107430641
You can't get avoid paying the ridiculous goy tax for owning the local hardware now too
>>
>>107430659
Lmao the actual end of personal computing. It's unironically over.
>>
>>107430662
Saar?
>>
>>107430659
>My system appreciates in value.
I didn't expect that to feel bad.
>>
File: ZiMG_0677.jpg (166 KB, 1344x1728)
166 KB
166 KB JPG
>>107430642
Fuck OpenAI
>>
File: flux2_00006_.jpg (903 KB, 3328x1792)
903 KB
903 KB JPG
Daily dose.
>>107430644
Is he saviorfagging, which would be very based, or did they just have sexual intercourse?
>>
>>107430671
obviously was writing something else first and didn't delete "get"
>>
>>107430683
spoils of war
>>
>>107430675
>tattoos
dropped
>>
>>107430659
Reminder that the majority of capital no longer exists in the hands of consumers and there is no incentive to sell to them. This will eventually end with food being exclusively sold to companies to feed their remaining employees and those not within those companies basically being starved out.
>>
File: ComfyUI_00704_.png (3.09 MB, 1608x1288)
3.09 MB
3.09 MB PNG
>>
If anyone else is looking for a basic captioning workflow, I found this on leddit
https://github.com/Wonderflex/WonderflexComfyWorkflows/blob/main/Workflows/Florence%20Captioning.png
>>
Is it worth it to pay 1400€ more for 5090 than 5070 Ti if I just want to animate hentai pics?
>>
File: ComfyUI_00700_.png (2.92 MB, 1608x1288)
2.92 MB
2.92 MB PNG
>>
>>107430659
this won't last but if you didn't upgrade already might as well wait 2 more years till this shit show ends
>>
>>107430734
>till this shit show ends
I've given up waiting for this to end. Things just keep getting worse and worse.
>>
>>107430734
Give up explaining basic supply/demand anon, they will doom all day about this being the end of pc and all ram sold now being the price of a car forever.
They will forget about it when supply ramps up in a year anyway, or more likely they would have found something else to doom about.
>>
>when supply ramps up
>>
>oh no supply will never ever adapt again like it always did before we are doomed to buy ram at 1 gazillion $ a GB forever
>>
>>107430800
This is clearly because they don't want you to have RAM. This isn't a mistake.
>>
>>107430808
Who are "they"?
>>
>>107430766
>supply ramps up
>in a year
>>
The lack of material for ram will only force them to come up with something new and better, trust in the plan.
>>
>>107430800
like how the supply of ssds ramped up and adapted during covid with artificial global level production suppresion to keep the prices the same? oh
>>
>>107430827
they will build factories and hire people dude
it's just sand
>>
>>107430129
why is there a wan category in OP and the only thing in it is a shitty cumfart template?
>>
>>107430837
>it's just sand
gpu is also still sand yet only one company managed to make good shit out of it
>>
>>107430842
>schizo came back
>>
>>107430696
fag
>>
File: this is time.png (1.63 MB, 1024x1024)
1.63 MB
1.63 MB PNG
>>107430659
I think it's time to bring back communism anons!
>>
>>107430852
yeah because there is a huge issue called DRIVERS
you can bruteforce your way in the hardware space but you can't easily do that when every fucking game needs optimization
>>
>>107430833
Yes in fact ssds are now the price of small houses.
>>
File: flux2_00007_.jpg (591 KB, 3328x1792)
591 KB
591 KB JPG
>>107430832
A hardware-replacement for RAM? I would doubt it, and even if, it would be entirely out of scope for any consumer.
But software-wise it might be interesting to see what optimizations can be squeezed out. Maybe if will force BIG TECH to pivot to more optimized and smaller architectures at a faster pace.
>>
>>107430837
WTF you mean you can't just create new billion dollar factories in a week?
>>
File: ComfyUI_00708_.png (2.26 MB, 1608x1288)
2.26 MB
2.26 MB PNG
>>
https://files.catbox.moe/iwabi0.jpg NSFW

It's been a week already, right? I'm still getting impressed by zit.
>>
>>107430784
AI itself is no bubble, it's the insane amounts of money pumped into it that makes it bubble since there's no way AI (which at this and the foreseeable future is just really fast wide scale pattern matching) can recuperate these crazy investments.

90% of AI SAAS services are losing money, you think Microsoft make money out of people generating Pixar versions of Taylor Swift ? They don't, despite putting ads front and center on their services.

Meanwhile AI is quickly automating away a lot of repetitive digital only workloads, administration will soon be all AI, a lot of grunt work in creative fields will be all AI and in certain industries it will replace most of the creatives as well.

RAM will become cheap again because there will be a ton of datacenters put up for sale from failed companies.

Thank you for coming to my TED Talk
>>
>>107430833
Covid era was mostly a logistics issue, not a supply/production one.
In my company, we had the products (automotive stuff), we had the clients, but we couldn't ship them, resulting in the locally available products being more expensive.
>>
>>107430862
Yes, every citizen (with the exception of the great leaders) gets 2 seconds 11 milliseconds a day of computation, thank you communism!
>>
File: this.png (108 KB, 500x200)
108 KB
108 KB PNG
>>107430921
>I'm still getting impressed by zit.
same, I have some serious lack of sleep I want to keep gening
>>
>>107430924
> RAM will become cheap again because there will be a ton of datacenters put up for sale from failed companies.
just like we get cheap v100?
>>
>>107430927
>logistics issue
the NAND manufacturers literally lowered production to keep up the prices during covid, retard
>>
>>107430921
What's your setup for that? I'm a total scrub at natural language prompting models. Can't seem to get them crisp enough.
>>
>>107430949
GPUs will become cheaper when the bubble bursts, that's also a given. NVidia is making gazillions on the AI craze, they're like the stores selling tools during the gold rush, THEY made bank, 99% of those out looking for gold went broke.
>>
>>107430446
but they used moon runes.
>>
It's kind of scary to think that one of the largest RAM manufacturers crushed the numbers and said "Yep, fuck everyone but a hand full of companies the poors don't have enough money." It's actually really fucking scary.
>>
>>107430942
Same, my legs hurt from sitting too much.

>>107430962
I use that massive list of quality tags, then just feed images to LLM.
>>
File: flux2_00008_.jpg (1.92 MB, 3328x1792)
1.92 MB
1.92 MB JPG
Me when I don't gen 1girls for a day.
>>
>>107430924
>and then everyone clapped
>>
>>107431085
so flux2 also has this weird canvas texture appearing out of nowhere? or this is deliberate here?
>>
>>107431094
That was deliberate, see >>107430892 or >>107430683
But it does get this weird 'dithering' effect on hair sometimes.
>>
File: Flux2Img_00073_.png (2.29 MB, 1152x1440)
2.29 MB
2.29 MB PNG
>>107431085
classier than I
>>
>>107430825
You know nothing is a mere coincidence.
In the end they will resell the hardware back to you as a service. You will own nothing. Screenshot this post and check back in 5 years.
>>
File: A friendly reminder.png (39 KB, 1066x259)
39 KB
39 KB PNG
>>
ok but what about removing the useless wanx category and the comfy template?
>>
>>107430505
That's kept the likeness of the Rebecca LORA really well. One off? Or have you tried others? I don't think I've seen a video so convincing over multiple frames
>>
Steadydance fp16, 720, 6 steps.
>>
>>107431235
the movements are good but the face consistency isn't perfect and it's pretty slopped imo
>>
chat, is this real?

Also look at that nutsack.

>>107431253
your face is slopped.
>>
File: 1756122100388628.png (439 KB, 1484x1524)
439 KB
439 KB PNG
https://xcancel.com/elrobles/status/1991228372425257415#m
that's why you shouldn't give a fuck about AI haters, you can make your model as cucked as possible all they want is your company to go bankrupt, go all the way or go home
>>
File: that's right.jpg (459 KB, 1250x1566)
459 KB
459 KB JPG
>>107431284
slopped af, I'm not accepting this shit anymore since Z-image edit showed you can get kino by not being a lazy fuck and train your model with only real data
>>
>>107431235
>>107431284
Oh wow, tiktok dance videos, exactly what the internet lacked.
>>
>>107431285
that guy was the original head of the audio team then he crashed out over copyright. the dumb retard didn't know about what is needed in a dataset to make an actual good model
>>
File: Wanimate_00088.mp4 (1.93 MB, 544x960)
1.93 MB
1.93 MB MP4
>>107431235
>>
I didn't need the janny to know that was samefagging, but that's a rare skill nowadays.
>>
>It's Thursday in China
they said they'll release the base model before the week end...
>>
>>107431326
>they said they'll release the base model before the week end...
No they didn't.
They never even outright stated they would release the base model. Simply that they would try.
>>
File: 1748242841020976.png (236 KB, 1348x2034)
236 KB
236 KB PNG
>>107431328
>They never even outright stated they would release the base model.
your gaslighting won't work on us Chang, you better release that shit at some point in time
>>
>>107431308
I guess I'm not banned and they just cleaned my replies. To keep it on topic, is anyone genning at >1MP resolutions with ZiT? I've given it a try, but I feel like prompt adherence is worse at 1536x1536 and other aspect ratio. And I also feel like am not getting good upscales.
>>
>>107431333
If you were to directly translate this to Chinese. It would like they were checking the feasibility of releasing the base model as open source.
>>
File: 1739339235395792.png (101 KB, 1549x583)
101 KB
101 KB PNG
>>107431348
What about this?
https://github.com/Tongyi-MAI/Z-Image?tab=readme-ov-file#-z-image
> Z-Image-Base – The non-distilled foundation model. By releasing this checkpoint, we aim to unlock the full potential for community-driven fine-tuning and custom development.
>By releasing this checkpoint
>>
>>107431359
To be released could be mean anything. It could mean it will be released as an API service. Vague language.
>By releasing this checkpoint
Pending approval of course.

Trust me. This is a proven pattern in Chinese companies.
>>
>>107431364
>It could mean it will be released as an API service.
>checkpoint
do you know what checkpoint means? it means not API, it means local
>>
>>107431375
They might mean release the checkpoint on their own servers to be delivered via API.

I don't know why you're trusting these people to do the right thing.
>>
>>107431387
>release the checkpoint on their own servers to be delivered via API.
>>107431359
>By releasing this checkpoint, we aim to unlock the full potential for community-driven fine-tuning and custom development.
oh yeah, when they said they'll release a checkpoint so that we can finetune it, it definitely knows API, last (You) for you
>>
File: attempt at WoW.png (1.87 MB, 1024x1024)
1.87 MB
1.87 MB PNG
not bad at all
>>
>>107431399
You need to be burned a few more times before you learn how Chinese companies communicate.
>>
File: 3907098418.jpg (480 KB, 1664x2432)
480 KB
480 KB JPG
>>
>>107431364
"Release" is standard English term used in work. For example, "have you RELEASED the cache?".
Idk what sort of retard are you. Perhaps an unemployed jeet or something?
>>
>>107431408
that's why I enjoy models having their character directly known, you can't put 4 characters with just loras
>>
>>107430837
>>107430766
>is just sand
>sand that before had a profit margin of less than 20% and now can go over 200%
>2 companies on the entire world making it
only the ccp can save us, this is actually over now.
>>
>>107431427
Keep arguing. You'll make me even more smug come the day of no release.
>>
>>107431427
>Perhaps an unemployed jeet or something?
it's just debo doing a bit of a trolling, the usual
>>
File: 00006-850972410.png (2.77 MB, 1248x1824)
2.77 MB
2.77 MB PNG
>>107430675
>>
>>107431439
>only the ccp can save us,
the US will ban chinese GPUs if they manage to compete with Nvdia and be like 3x cheaper kek, "free" market, am I right?
>>
>>107431456
in the interest of national security of course
>>
>>107431343
considering the base gen still contains artifacts I just upscale and hires fix with sdxl when I do anime but I just don't't see a point to going that high if it's going to fuck up somewhere
>>
>>107431343
>I feel like prompt adherence is worse at 1536x1536 and other aspect ratio.
it is, it works best at 1MP, I guess that's what the model has seen the most
>>
>>107431343
Just do a second pass at 0.60-.75 denoise.
>>
File: 00013-3415558384.png (2.7 MB, 1248x1824)
2.7 MB
2.7 MB PNG
>>107430675
prompt and model? i love that tummy and tight midriff convenient censorship.
>>
>>107431343
Also I am pretty sure the prompt adherence seems bad because this model seems more than usual reliant on aspect ratio so squares gonna give you shit. Learn to use 4:3 and 16:9 like a real human
>>
>>107431481
>Just do a second pass at 0.60-.75 denoise.
That gives me an idea actually, what if we do like 8 steps, the first 4 steps is at 1024x1024 so that it gets the prompt adherence working, and then we upscale the still noised image at 2024x2024 and the last 4 steps go in this dimension
>>
File: flux2_00012_.jpg (959 KB, 3328x1792)
959 KB
959 KB JPG
Base will release, I am never giving up hope. Never!
>>107431451
Do you ever gen full frontal, or do you always do convenient censorship?
>>107431491
Z Turbo, says so in his filename.
>>
>>107431491
Z


pastebin. com/g21ULxKz
>>
>>107430578
it tends to zoom the subject out and sometimes there's weird shit going on at the edges. I think it's better to do a couple steps at a lower res then latent upscale to 2048
>>
File: 1750327299990497.png (40 KB, 691x501)
40 KB
40 KB PNG
>>107431522
you can use ropeScale to fix that (decrease scale_x and scale_y)
>>
File: 1764150234567537.png (62 KB, 671x344)
62 KB
62 KB PNG
>>107430659
HAHAHAHAHAHAHAHAHAHAHAHAHAHA
>>
what the fuck is going on with the base model? i don't get it, turbo required a base to be made but somehow that same base isn't ready? yet it was ready enough for the baking and release of a turbo distillation? either they're censoring it or it's not being released at all
>>
>>107431546
this
>>
>>107430721
absolutely not
>>
>>107430921
holy pepperoni
>>
>>107431408
zimage can't do wow for shit, and it annoys me. I tried sylvanas, arthas, lich king, thrall, etc. and the most I got back was a stylized eredar. I don't get why it refuses to do WoW, blizzard isn't beefing with netease anymore.
>>
>>107431546
maybe they're waiting for flux 3 so they can destroy them again (probably not, we're probably never getting base)
>>
>>107431546
>yet it was ready enough for the baking and release of a turbo distillation?
they probably rushed the release of turbo so that it got released the same time as Flux 2, genius move imo, they got the hype not only by having a great product but also by that 32b vs 6b comparison meme
>>
File: GwenBride Snack.webm (3.92 MB, 1536x1536)
3.92 MB
3.92 MB WEBM
>>107431225
That was just I2V from a real image, no LoRA. My handcrafted, artisan Chinese negs were designed to maintain the likeness when animated.
>>
>>107431546
they probably want some lead room for zit2 cause they know some american companies will build off it and slap their own brand and claim its their
>>
How do we solve the color darken issue? WAN first 2-3 frames darkens randomly
>>
>>107431557
>I don't get why it refuses to do WoW,
it's simple, the model hasn't seen any WoW image during the training :(
>>
>>107431439
this is an isra-I mean, american problems, I can't wait for china flood my country with cheap gpus, when they do it they will probably pick all the south america market, russia and some of the north africa(europe) too.
I just wonder why if they can't do 5nn yet they just don't stack a bunch of cores and vram and make a chunkier gpu.
>>
>>107431577
use a better model
oh wait, local doesn't have any!
>>
>>107431579
And that's weird because WoW is fucking massive in china, so is overwatch. And it can't do OW either. It tries to if you go and micro manage the description of the characters, but they're always ever so slightly off.
>>
sad truth, they arent releasing base model because they are censoring the easy 3d cunny
Its over
>>
>>107431577
>>107431594
have you tried hunyuanVideo 1.5?
>>
File: 00021-932707137.png (2.24 MB, 1248x1824)
2.24 MB
2.24 MB PNG
>>107431516
i prefer implied nudity or convenient censorship over hardcore and full frontal nudity. illustrious is the best and smartest ai image model that handles that concept very well. Z turbo image is bad at convenient censorship and keeps exposing nipples and vagina.
>>107431521
thanks anon :)
>>
File: ComfyUI_00382_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
>>107431595
Example of what I mean
>>
File: 1758107619187287.png (143 KB, 1914x560)
143 KB
143 KB PNG
>>107431633
>>
File: 3.jpg (12 KB, 314x263)
12 KB
12 KB JPG
Does ZiT use qwen or lumina2 clip type? The template comfy WF has lumina2 but isn't he just pulling shit out of his ass again?
>>
File: 1739086283465427.png (69 KB, 572x902)
69 KB
69 KB PNG
clicking the red X to cancel a queued job doesn't even fucking work. WHY DID HE DO THIS????
>>
>>107431574
Nta. So you're not using light loras, or does nag work for you?
I can't get 40steps cfg 3-4 to look good.
>>
>>107430129
https://gofile.io/d/NIFrxQ

Psycheswings Lora - Z-image Turbo
>>
>>107431658
I had the node set to stable diffusion by mistake and it didn't make a difference.
>>
File: ZiMG_00657_.png (3.76 MB, 1344x1728)
3.76 MB
3.76 MB PNG
>>
>>107431705
whats Psycheswings
>>
File: 1734685170498673.jpg (3.52 MB, 3380x2669)
3.52 MB
3.52 MB JPG
Here's a list of styles Z-image turbo can do
https://files.catbox.moe/1zmowv.txt
>>
>>107431681
nta but I was making kino massage videos with lightx2v. last night I queued some videos doing 50 steps without lightx2v and what I got was unusable garbage. complete waste of compute
>>
>>107431679
>WHY DID HE DO THIS????
Ikr, that's the most retarded move ever, this shit is really important why do they feel the need to hide it
https://github.com/Comfy-Org/ComfyUI_frontend/issues/7108
>>
File: 1734685170498673.jpg (33 KB, 267x362)
33 KB
33 KB JPG
>>107431734
>>
>>107431705
>>107431721
https://www.instagram.com/psycheswings/?hl=en

https://www.tiktok.com/@psycheswings?lang=en

https://x.com/psycheswings?lang=en

Search her name in the /g/ and /b/ archives for the lore
>>
>>107431705
I heard that Erika Kirk is next on your list. release that!!
>>
https://youtube.com/shorts/QdwCMPuLMGc?si=yE9tg_ih64HA9WTU

Tick tock AI bros. Looks like your precious AI is eating itself
>>
>>107431753
that's dystopian all right!
>>
File: flux2_00013_.jpg (1.64 MB, 3328x1792)
1.64 MB
1.64 MB JPG
>>107431651
I get that. I have been genning A LOT of convenient censorship images on Z, well, tried to. But it's incredibly hard since it doesn't really get the concept.
>>
>>107431749
something something dragged and shot
>>
>>107431759
it's someone I don't recognise or care for, every model can generate that
>>
>>107431765
>train AI only on data made before 2022
well, that was easy
>>
>>107431763
I'm probably not the anon you think I am but I'll look into that. Thanks for the suggestion!
>>
>>107430721
Yes, gpu price per vram will skyrocket for the next gen so might as well pull the trigger now for the highest one and not being fucked over for the next 2 or 3 gen
>>
>>107431789
niceee do try and make it!
>>
>>107431734
I like how Cyberpunk77 is so prominent in the cyberpunk scene that the game's color scheme and design elements bleed into random cyberpunk mentions in the tokens
>>
>>107431715
the default model quality is so good, too bad it turns into this >>107431789
when you attempt to train a lora with it
>>
>>107431734
>syd mead
>loish
So that's how the megacorpo slop is called
>>
File: 1740222572507614.png (1.77 MB, 1024x1024)
1.77 MB
1.77 MB PNG
>>
nano banana is cheating somehow i just can't prove it
>>
>>107431888
'Corporate Memphis' and 'Alegria Art' also.
>>
File: 1742087186721592.png (1.07 MB, 1024x1024)
1.07 MB
1.07 MB PNG
>>107431888
>So that's how the megacorpo slop is called
kek this shit has some nice shitposting potential
>>
>>107431655
>>107431681
Yeah, both of those were with lightning LoRAs. The best thing you can actually do for quality (aside from using high resolutions) is switch over to the TripleKSampler. Things are handled a bit more intelligently with that and you get some really coherent outputs.
>>
>>107431929
I KNOW they swapped those pictures, I just couldn't prove it
>>
File: ComfyUI_00121_.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
>>
File: 1742829023528832.jpg (156 KB, 880x585)
156 KB
156 KB JPG
>>
File: 1748885287689351.jpg (64 KB, 735x472)
64 KB
64 KB JPG
>>
File: 1753228337372100.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
>>107431933
I had to go for [Hatsune Miku:she:0.35] on the PC: Schedule prompt or else Miku destroys the style
https://github.com/asagi4/comfyui-prompt-control/blob/master/doc/schedules.md
>>
File: 1748005663134457.jpg (114 KB, 700x367)
114 KB
114 KB JPG
>>
File: 00037-2747663314.png (2.48 MB, 1824x1248)
2.48 MB
2.48 MB PNG
>>107431765
the zoomers and gen alphas in the comment section are just very stupid and are going along with what ever their favorite jewtuber told them what to parrot out.
>>
>>107431765
The kinda of faggot to make kirk videos is not the kinda of faggot that uses enough ai to possibly "poison it"
>>
File: flux2_00014_.jpg (558 KB, 3328x1792)
558 KB
558 KB JPG
I am afraid to say, I like some of the stuff flux2 puts out. Photorealism is kinda ass, but everything else is really fun. So, pretty much flux 1 all over again.

>>107431765
Quite possibly the dumbest idiot I have listened to in the past 3 weeks, and I've been binging police body cam footage.
Geez, that was insufferable. The editing, good lord.
>>
>>107432014
you love this 3d style so much ehh
Z can make really good looking real women brother
>>
the age of SDXL will never end
>>
>>107431765
>>107432043
those people are so retarded they don't know filters exist, good companies know how to filter out AI slop from the dataset
>>
I've come to realize that Z cannot make Muscle Mommies
>>
>>107432066
Indeed it can
>>
>>107432110
pic unrelated?
>>
>>107432121
KEKW
>>
>>107432110
why is it so ugly
>>
What are the general differences one would see when training a zimage style lora on 16 rank and also 64 instead of the default 32?
>>
>>107432200
it's all zit slop loras to me so I don't really care. base when?
>>
>>107432207
>base when?
you still think base will be released? how cute
>>
>>107432127
better?

>>107432200
lower ranks leads to smaller file size at the cost of the lora's ability to "learn" whatever you're training. 16 is generally OK for singular or simple concepts but complex shit typically needs a higher rank. More affect on the model and more "comprehension" but at the cost of higher storage usage and vram usage during training, which also means slower training. Pic rel's lora used a 128 ranks which lead to a 600+ MB file but pretty accurate likeness.
>>
>>107432207
>base when?
never
>>
why is everyone on twitter upset about the base model being cancelled? what did i miss?
>>
>>107432258
>cost of higher storage usage and vram usage during training

why isnt there an option to merge the lora on the fly into the model instead of it occupying any space in vram? @comfyuiniggers
>>
File: Flux2Img_00082_.png (1.9 MB, 1280x1280)
1.9 MB
1.9 MB PNG
flux2 struggling to give me clear pants
>>
>>107431705
>600mb for this ugly jew
bro...
>>
File: Seedream4.5.png (96 KB, 896x717)
96 KB
96 KB PNG
>>107341188
CALLED IT
Seedream 4.5 out to MOG local once again. Enjoy your fried plastic distilled turboslop and begging for base models. Seedream marches on
>>
>>107432291
fuck off racist scumbag
>>
>>107432283
>why isnt there an option to merge the lora on the fly into the model
its also funny how when lzma2 compressing most models you save 30% of the storage and yet we dont compress the weights for storage at least a little but enough to decompress on the fly
>>
>>107432314
you know how confident they are in their model when they have to tell you how an image generation model is good instead of showing you lol
>>
File: 1735309748845289.jpg (29 KB, 394x474)
29 KB
29 KB JPG
>>107431765
Not gonna lie AI is crashing out this is WILD uncs not be getting their slop no more true artists are cooking this is peak
>>
>>107432314
Why use the emm dash?
>>
>>107432314
>Multi-image
something Z-image edit can't do :(
>>
https://huggingface.co/Tongyi-MAI/Z-Image-Base
oh my god
>>
File: G7OgRkCWMAAq6Qt.jpg (1.24 MB, 3840x2560)
1.24 MB
1.24 MB JPG
Where's the API node update Comfy? We're bored of Z-sloppa already
>>
>>107432351
>>107432351
>>107432351
>>107432351
>>107432351
ITS OUT
>>
>>107432351
holy shit finally
>>
>>107432351
Wouldn't they just call it Z-Image? as it's the original not a distinct flavour
>>
File: 1737965455419520.png (398 KB, 891x727)
398 KB
398 KB PNG
most unique tranimesloppers style:
>>
>>107432351
>8b
I TOLD YOU FAGGOTS
>>
File: FUCK YOU.png (94 KB, 224x224)
94 KB
94 KB PNG
>>107432351
MOTHERFUCKER
>>
File: AnimateDiff_00001.webm (606 KB, 720x720)
606 KB
606 KB WEBM
When I do a load video>tiled encode>ksampler 2.2 low>tiled decode with 8steps cfg1 with speed lora, .2 denoise the result comes out like this.

Shouldn't it be the same as running it as another pass?
>>
>>107432314
Another API model that mogs local once again???? In my Comfy UI diffusion general???? This couldn't get any better!
Are the API nodes already available?
>>
File: 1759073510239829.gif (131 KB, 220x134)
131 KB
131 KB GIF
>>107432351
hope your fucking balls fall off
>>
File: flux2_00015_.jpg (498 KB, 3328x1792)
498 KB
498 KB JPG
>>107432351
>>107432362
>>107432365
>>107432376
I hate you, Anon.
>>
>>107432376
still better than all the other bloatmodels so far
>>
>>107432351
come on bro
>>
>>107432351
BASED
>>
>>107432351
PLEASE FLOOD WITH ANIME
>>
>>107432351
>AI TOOLKIT WORKS
AI TOOLKIT WORKS
>AI TOOLKIT WORKS
AI TOOLKIT WORKS
>>
File: 1751721516707764.png (536 KB, 700x392)
536 KB
536 KB PNG
>>107432351
Come on man don't do this to me...
>>
>>107432372
At least it's not incase style
>>
>>107432372
Interesting post
>>
>>107432372
Thanks for sharing anon
>>
>localkeks really thought they would escape sdxl
>>
Z oomers

Make a muscle mommy gen
>>
>>107432372
Oh no that's shit, thanks for sharing anon. What model is it so I don't use it? Thanks bro
>>
File: 1742521813801448.png (2.39 MB, 960x1408)
2.39 MB
2.39 MB PNG
turned out well
>>
>>107432451
>Elaine
based, but I hope Z-image edit will make celebrities loras obsolete
>>
>more jews
>>
>>107432462
>based, but I hope Z-image edit will make celebrities loras obsolete
it wont, and even if the faces work, it's not gonna pick up on the body shape
>>
Not finding jews beautiful is literal nazism by the way
>>
>>107432472
>it's not gonna pick up on the body shape
what if you go for an image input a full body view of the celebrity?
>>
>>107432476
and that's fine
>>
>Out this weekends
>Two Weekends later
>Nothing
What causes this?
>>
>>107432451
can you make her head more rectangular?
>>
>>107432482
chinese deviousness
>>
File: THE QUEEN.png (1.67 MB, 1600x1067)
1.67 MB
1.67 MB PNG
>>107432476
I found Elaine hot and then I found she was a jew, I mean, even Scarlett Johansson is a jew too, and you bet I wanna fuck her
>>
>>107432476
>>107432481
>>107432466
>>107432291
What's with the influx of racists here lately?
>>
>>107432488
scarlet has hit the wall dumbo
>>
File: heckin problematic.png (149 KB, 708x800)
149 KB
149 KB PNG
>>107432498
>racism in my 4chan?? OMG
>>
>>107432372
Oh how bad, what is your favourite artstyle anon? I like genning anime and am a regular here.
>>
>>107432498
Hey buddy, this is 4chan
>>
>>107432505
nice wojak argument poltard
>>
>>107432503
even post wall she's still hotter than 99% of women
>>
>>107432513
>4chan is when you are comically racist
go back to stormfront
>>
>>107432511
I dont see the problem in the question, hes asking for a style question. I love retro anime too.

>>107432518
>99%
bruh
>>
File: ComfyUI_temp_bseru_00002_.png (2.66 MB, 1280x1920)
2.66 MB
2.66 MB PNG
What's the point of using some "recommended" dogshit prompt enhancer llm, when I can give the prompt to a larp model?
>>
>>107432515
>tard
that's a slur towards mentally handicaped people, don't be such a right winger anon
>>
>>107432515
>pretends he's against racism and is a good person
>uses the "retard" slur anyway
really makes you think
>>
>>107432530
This is good, love the red theme
>>
>>107432519
Nothing comical about it these days, it's a deeply held personal belief. Now fuck off.
>>
File: 1742492513460746.png (1.78 MB, 960x1408)
1.78 MB
1.78 MB PNG
>>107432486
>A photograph of Elaine with a neutral expression. her head is shaped like perfect square, with straight lines and sharp corners. The background is completely black.
no, z-image is garbage
>>
>>107432531
>>107432545
i'm not some lefty caricature you imagined in your head
i just don't want off-topic racist shit here
>>
>>107432557
>i just don't want off-topic racist shit here
but you want to say the "retard" slur? weird
>>
File: ZiMG_0688.png (3.68 MB, 1728x1344)
3.68 MB
3.68 MB PNG
>>
>>107432566
it's not a slur. nobody actually gets offended by it. but i bet there are plenty of jews here who now feel bad because of you bringing up race out of fucking nowhere.
>>
>>107432351
i fell for the bait
>>
>>107432569
>it's not a slur.
it is a slur, I bet there are plenty of mentally handicaped people who now feel bad because of you bringing up the r-slur out of fucking nowhere.
>>
Can I train ZiT loras just with the files required for genning, or do I have to clone the repo and do some git humiliation ritual?
>>
File: Nano Banana Pro.jpg (474 KB, 2816x1536)
474 KB
474 KB JPG
>>107432486
>>107432555
>A photograph of Elaine with a neutral expression. her head is shaped like perfect square, with straight lines and sharp corners. The background is completely black.
>>
>>107432590
LMAO
>>
File: Z-image turbo.png (1.06 MB, 1024x1024)
1.06 MB
1.06 MB PNG
>>107432590
it works with some boomer prompting
>A high-resolution studio photograph of a woman facing the camera with a completely neutral, emotionless expression. Her head has an impossible geometric form: a perfectly square shape with four straight, rigid edges and sharply defined 90-degree corners, as if sculpted with mathematical precision. Her facial features—eyes, nose, and mouth—are realistically integrated into this square structure. Lighting is even and controlled, illuminating her face without casting dramatic shadows. The background is entirely black, seamless, and featureless, creating stark contrast and emphasizing the geometric shape of her head.
>>
>>107432590
fucking kek
>>
Anons what do you like more, retro anime o modern anime? If so which are your favourite Loras for each one?
>>
>>107432581
the purpose of that word was not to insult developmentally disabled people, only pol users. i'll concur - yes, i shouldn't have used that word, but it's still different from direct racist insults
>>
>>107432628
>i'll concur - yes, i shouldn't have used that word
good goyim
>>
>antisemitic shit again
sigh
>>
>>107432530
Do you have a before vs after prompt enchancer? Very interesting
>>
>>107432498
weak bait jamal
>>
>>107432628
>yes, i shouldn't have used that word
thanks for conceeding, retard.
>>
>>107432589
It works with the 'z_image_turbo_bf16.safetensors' file BUT you also need the 'zimage_turbo_training_adapter' so you still need to git clone
>>
>>107432372
Jessie my love.
>>
>>107432666
he doesnt, literally everything is autodownloaded
>>
>>107432659
that doesn't make me wrong. i can admit to my mistakes and work on myself unlike racist shits like you
>>
>>107432314
is it me does seedream 4.5 look like a slight down grade from 4.0. i'm seeing more compressed jpeg artifacts on images i generate with seedream 4.5 than 4.0.
>>
>>107432668
>>
>>107432668
what'd you use to recreate?
>>
>>107432672
>that doesn't make me wrong.
it shows you're an hypocrite so yes it makes you wrong, don't moralfag when you can't even show the example next time jamal
>>
File: ZiMG_0700.png (3.77 MB, 1344x1728)
3.77 MB
3.77 MB PNG
>>
>>107432672
>shits like you
im so tired of casual racism against indians, fuck off chud
>>
>>107432695
zit's blushes look like shit
>>
>>107432682
>>
File: z-image_00329_.png (3.27 MB, 1168x2048)
3.27 MB
3.27 MB PNG
cool my ban is lifted
>>
>>107432685
no it doesn't. the slur i used wasn't nearly as hurtful and dangerous as the shit you were saying about Jews
>>
>>107432683
>what'd you use to recreate?
Not my creations. Read the file name.
>>
>>107432737
>Jews are more important than mentally disabled people
oof that's problematic xister, there's a lot to unpack here
>>
File: 1740943163254699.png (1021 KB, 1280x720)
1021 KB
1021 KB PNG
>>107432695
husbant... you waited too late to buy ram
>>107432711
Yep. I'll take anyone's non-crap blush prompt.
>>
>>107432735
Now prompt a black dude groping her
>>
>>107432766
nah, ask your trooncord sisters instead troony
>>
>>107432765
damn it worked well for obama, even at far away distance
>>
File: ZiMG_0702.png (3.58 MB, 1728x1344)
3.58 MB
3.58 MB PNG
>>107432711
yeah agree on that

>>107432745
ahh the SAAS "creators"
>>
i miss the dutch hag drinking beer posting
>>
File: Jessie.png (1.16 MB, 1012x902)
1.16 MB
1.16 MB PNG
>>107432683
>what'd you use to recreate?
Here's some info.

https://www.instagram.com/moescapeai/
>>
>>107432780
i cant believe the hammock is able to hold all that weight. z-image is truly amazing.
>>
>>107432745
>>
File: ZiMG_0703.jpg (399 KB, 1728x1344)
399 KB
399 KB JPG
>>107432793
Yeah… I can remake that, or at least super close to it locally

>>107432816
ikr!
>>
>>107432780
>obese asian whore lying on back
so this is the power of Local creativity
>>
>>107432817
>>
>>107432832
big talk for a nogen
>>
>>107432834
>>
>>107432838
>big talk for a nogen
says the nogen
>>
>>107432848
>>
reminder to report shills and spammers, especially shill spammers
>>
File: 00064-863001100.png (2.7 MB, 1248x1824)
2.7 MB
2.7 MB PNG
>>
>>107432864
this includes cumfart shills
>>
>>107432869
based
>>
File: 1758291054433636.png (1.88 MB, 1024x1024)
1.88 MB
1.88 MB PNG
>>
zib (z-image-base) is too dangerous to society
unlimited high-quality personalized coom will destroy any brains
>>
>pedo
>>
File: 00070-3877952116.png (2.37 MB, 1824x1248)
2.37 MB
2.37 MB PNG
>>107432902
nah. sdxl still remains king of open source image models. z-image at its current stage is only good at doing photorealism.
>>
>>107432902
edit is even more "dangerous" kek
>>
File: z-image_00337_.png (3.17 MB, 1168x2024)
3.17 MB
3.17 MB PNG
>>
>>107432902
What kind of coom could be achieve with it that you can't already find online or create with other models?
>>
Does z image produce convincing images of nude girls between the ages of 8 and 12?
>>
Has anyone tried STARFlow or did it just completely go under the radar because of z-image?
>>
>>107432992
You're going to have to try harder than that, Sir.
>>
>>107432992
I could never know because why would I AI generate something illegal
>>
>>107432992
yeah its the best text to image for cunny anatomy ever so far but vaginas are censored but whatever just put them in lewd outfits

but if you need more than topless 8 to 12 year olds for the 2 months until LTXVideo2 comes out do you even like little girls
>>
>>107433004
I am a pedophile. This is the only thing I care about. Someone needs to make a model with child pornography in its training data. I don't want that person to have to be me. I don't know anything about AI and I am broke.
>>
>>107433088
>Someone needs to make a model with child pornography in its training data.
nah you just need children + pornography, Sora 2 could probably do it if it weren't censored by OpenAI, and a local version of that will exist by next summer
>>
new:
>>107433107
>>107433107
>>107433107
>>
can't remember the last time it was this bad
>>
Non-trollbake:
>>107433131
>>107433131
>>107433131
>>
>>107430664
I've been saying this for a while. They're intentionally killing it. They don't want us to have the freedom that it provides.
>>
>>107433137
what? why?
>>
>>107433186
ani, the developer of an irrelevant ui used by no one but himself, baked with an ad for his UI inserted into the OP.
>>
>>107433223
nobody uses sd.next either yet it's in the op. ani is making his ui for free and he's a part of the community, what's wrong with making anons know about it?
>>
>thread suddenly has low iq brown retard posts
>coincidentally the trani trollbakes return at the same time
oooooooooooooooooooooooooooooo im nooooooticiiiiing
>>
>>107433245
what a sad troon lol
>>
>>107433088
kill yourself
>>
>>107433088
chroma is 120 days of sodom in comparison to any current model. ZiT won't matter for porn until a proper tune drops
>>
>>107430129
Hmm... :) what could this be?
>>
>>107433544
yo e? what's that?
>>
>>107431285
>ethically trained models.
Into the trash it goes.
>>
>>107430129
I was chaining, but it looks dumb, and crashes sometimes.

So I thought I might use this:
https://github.com/blepping/comfyui_overly_complicated_sampling

Yep, it's complicated alright...
>>
>>107430707
>>107430722
NOmygod
>>
>>107430862
bout fucking time



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.