[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107586718

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
https://github.com/huggingface/diffusers/pull/12857
>>
Blessed thread of frenship
>>
>>107588926
APOLOGIZE TO CHINESE CULTURE
>>
>>107588805
No you use that on the de-distilled model. Also you can raise the lora strength to 2
>>
File: 1737299261659884.png (1.33 MB, 1216x1024)
1.33 MB
1.33 MB PNG
>>107588926
is it happening?
>>
>>107588936
i'll try it out. the lora training just finished
>>
Drag and shot.
>>
>>107588926
screenshot this
>>
File: image.png (51 KB, 557x461)
51 KB
51 KB PNG
>>107588942
>>
File: ComfyUI_03141_.png (1.08 MB, 1024x1024)
1.08 MB
1.08 MB PNG
>>
My cock is hard.
>>
>>107589005
we're gonna need to see some proof
>>
>>107588984
>Z-Image-Omni-Base mode

heheheheh

-omni

hehehehehehehehehheh

whatever that is, it ain't base
>>
File: 1762466981682376.png (312 KB, 732x770)
312 KB
312 KB PNG
Why are they so good at AI?
>>
>>107589027
retardç‹—
> This PR adds support for the Z-Image-Omni-Base model. [s]Z-Image-Omni-Base is an intermediate model developed during the creation of Z-Image-Turbo[/s]. It isa foundation model designed for easy fine-tuning, unifying core capabilities in both image generation and editing to empower the community to explore custom development and innovative applications.

omni means multi modal accepting text, images or something else
>>
>>107589043
>>107589043
>Z-Image-Omni-Base is an intermediate model


all asians are liars. all women are liars.
>>
>>107589066
...what is the problem with what you quoted anon? This is not new information at all.
>>
>>107589071
It's not base, they're calling it base, but itt's not, because of course not.

You're so gullible.
>>
>>107589077
k
>>
how do I get non slopped gen from z + controlnet? when I apply controlnet to z it becomes very slopped
>>
ahahahahahahah

we really need usernames. I'm going to have to roast you.

all asians are liars, and no hot woman is ever going to say my name.
>>
>>107589043
Are we assuming that this model is more censored and/or less capable than the base model or what am I supposed to get upset about?
>>
File: file.png (29 KB, 633x123)
29 KB
29 KB PNG
>>107589027
You're wrong.
>>
>>107589135
>what am I supposed to get upset about?
Chinese culture, clearly
>>
>>107589164
But have you considered Chinese culture?? also women
>>
>>107589088
Dunno, jump off controlnet early and finish genning without it, perhaps.
>>
>>107589135
assume its more censored and that lodestone will save us
>>
>>107589164
That's still too expensive for a hobbyist like Lodestone to ever consider.
>>
>>107589190
Why should it be more censored? It is already censored at qwen level, there's no reason to make any changes in censorship now.
>>
>>107589180
What's the best way to do that since ZiT uses the QwenImageDiffSynthControlnet node that doesn't have an end step setting? Two-pass setup?
>>
>>107589198
you just dont understand their culture man
>>
My ComfyUI video previews stopped working after an update. I made sure I allowed Display Video Previews in the settings and chose TAESD (slow) in Comfy manager but still no previews. Any thoughts?
>>
>>107589171
exactly, this guy, he is trusting a woman or an asian - possibly a HOMOSEXUAL - trusting a sodomite.

I know where *I* hang *MY* hat.
>>
>>
>>107589193
do you think lodestone would get upset at being characterized as a hobbyist kek
>>
>>107589213
the normal preview is latent2rgb. change it in manager
>>
>>107589214
Shut up lumifag.
>>
>>107589239
>>107589213
it seems the issue is that we previously had to use the manager setting but now it's been made an official setting so you have to change it in the main comfy settings page (search "preview" in the settings search bar), if that doesn't work then also add --preview-method taesd to your comfyui terminal command.
>>
>>107589268
taesd is a waste of system resources
>>
>>107588906
tanks 4 bake
>>
>>107589279
my bad i don't know the difference, then use latent2rgb
>>
>>107589088
>>107589206 (me)
>>107589180
Update: For now the best approach I found is simply to make a second pass (optionally upscaling your image) at ~.5 denoise, without the ControlNet applied. ControlNet strength (for the first pass) around ~.5 too. This seems to make great results.
>>
>>107588906
>>107589287
>>Maintain Thread Quality
>https://rentry.org/ranfaggot

this should be in the OP next time you bake. it's really suspicious you keep forgetting
>>
nigga has a fucking alarm clock lmao
>>
>107589401
It's really suspicious you are still posting after you were banned
>>
just added an update
>>
>>107589463
who was banned?
>>
>>107589463
Just let him reply to himself.
>>
>>107589543
???
>>
>>107589463
>ranfaggot
>>
>>107589552
he's ban evading? an anon was saying he was a janitor which is probably worse
>>
So anyway Z image
>>
>seething at people only existing in your head
>>
>>107589579
omg a PR without weights! so cool!
>>
>niggerjak doing everything unacceptable that he says ani does
>>
comfy seething he didn't get implementation first
>>
why do i need sveral gigs of some thing to compile a program?
>>
>>107581919
That's a nice round ass. How to craft that shape? I find that it's difficult to command these models to make an ass that's just right for my taste.
>>
File: DumbBitchJuice.png (463 KB, 512x768)
463 KB
463 KB PNG
>>107589579
It isn't bad at text. 512x768, just picked the best from a run of 8
>>
>>107589824
Should work even better if you use a 1MP resolution like 832x1248.
>>
Day 0 support!!!
>>
>>107587584
>>107587767
>>107588219

Dear Disco Elysium Anon:

When you mention in >>107588411 that the LoRA is there, do you mean CivitAI?

>>107588469 anon seems to have figured it out, but I was under the impression that workflows from pictures only worked if the picture is a PNG file, and not a JPEG.

Thank you very much!
>>
My Wan2.2 works fine but whenever I start up comfyui I see a message in the cmd that says it failed to find nvdisasm.exe and cuobjdump.exe. Do I actually need these? I just followed the instructions from the wan2.2 rentry guide.
>>
>>107590056
https://civitai.com/models/1433982?modelVersionId=2513908
>>
>>107587584
>>107587767
>>107588219

Dear Disco Elysium Anon:

When you mention in >>107588411 that the LoRA is there, do you mean CivitAI?

>>107588469 anon seems to have figured it out, but I was under the impression that workflows from pictures only worked if the picture is a PNG file, and not a JPEG.

Thank you very much!
>>
>>107590082
https://civitai.com/models/1433982?modelVersionId=2513908
>>
>>107590058
isn't wan2.5 available now?

i did a new (manual) install of comfy on windows and wan2.2 wasn't working for me anymore
>>
>>107590059
Thank you!

Ignore the other posts that will inevitably appear: I had no idea if I was getting past the captcha or not due to server delay.
>>
>>107590095
you're welcome redditGOD
>>
Babe, babe, wake up, WAI-illustrious-SDXL checkpoint released an update!
https://civitai.com/models/827184/wai-illustrious-sdxl
>>
>>107590362
Anyone try this already? I mostly used v14, I didn't like v15 as much, how is this one?
>>
>>107590362
>doesn't say what the merge components are
indian detected
>>
>>107590381
"Adjusted the model’s overall default style, enhancing visual cleanliness and improving character accuracy."
Extracted from Civit
>>
File: 1755889311932254.png (2.2 MB, 949x4053)
2.2 MB
2.2 MB PNG
>here's a bunch of incomprehensible schizobabble and screenshots of me getting banned for being a samefagging nuisance
does he genuinely think he'll achieve anything with this?
>>
>>107590408
whocars he's having a nap post bifusion
>>
>>107590387
It's a Chinese model, mainly a recent update of characters from this year.
>>
>>107590423
i know but he's still pathetic for gatekeeping the merge components
>>
>>107590362
Nice!
>>
btw someone from z image team made this >>107588926 post
so try to be polite
>>
>>
>>107588926
nice to meet you sir, how is your day going?
>>
>>107590362
V-Pred?
>>
>>107590448
"Recommended settings:
Steps: 15-30
CFG scale: 5-7
Sampler: Euler a
The VAE is already integrated."

The info on the page does not mention anything about that.
>>
>>
File: 1740474012967101.png (313 KB, 1873x1555)
313 KB
313 KB PNG
>>107588926
>https://github.com/huggingface/diffusers/pull/12857
OMG ITS HAPPENING!!
>>
>>107589193
>That's still too expensive for a hobbyist like Lodestone to ever consider.
all lodestone has to do is the "post training" and as you can see it cost them 48k dollars, it's easily on his budget since he paid 200k for chroma
>>
File: file.png (7 KB, 239x77)
7 KB
7 KB PNG
>>107590716
bghira mad af that he got banned from their discord
>>
File: ahahahah.png (1.82 MB, 1280x720)
1.82 MB
1.82 MB PNG
>>107588926
I'm sure the fuckers who couldn't stop talking about that "muhh chinese culture" will apologize and admit that they were wrong right?
>>
>>107590751
lmaooo
>>
>>107590751
>he got banned from their discord
?? why? lul
>>
File: 1765373701730618.png (431 KB, 800x582)
431 KB
431 KB PNG
>>107590751
>they'll release the base model
>they banned this mentally ill fuck
how based can china be?
>>
>>107590766
his first message was literally "you know porn is illegal in china right? wouldn't it be crazy if those alibaba devs would be sent to prison if they trained their model with nude people?"

this guy is like your random 4chan schizo but he doesn't hide and post his mentally ill shit to the pubilc lmao
>>
So are we all just pretending like we don't know what a base model is as some sort of copeing exercise
>>
File: file.png (30 KB, 671x111)
30 KB
30 KB PNG
>>107590766
his post(s) were wiped, this is the only one i could find in the archive
see the full thread for full context >>107365872
>>
>>107590362
Cool will post some gens later
>>
File: feelsgoodman.png (71 KB, 220x203)
71 KB
71 KB PNG
>>107588926
mfw looking at /ldg/ just after waking up and looking on my phone
>>
https://huggingface.co/Tongyi-MAI/Z-Image-Omni-Base

It's fucking here. 6B, 50 steps, 1 minute for inference on a 3090.
>>
File: not ready.png (232 KB, 500x358)
232 KB
232 KB PNG
>>107588926
December 25 is gonna be such a busy day, they'll probably release the new version of QiE and ZiB at the same time
>>
>>107590848
Shit I clicked again... :(
>>
>>107590362
Based, the only checkpoint maker that makes the best local Anime checkpoint model and also doesnt put it behind grift Early Access.
>>
>>107588926
https://www.reddit.com/r/StableDiffusion/comments/1ppjzdo/comment/nunjfwj/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button
>There are two new features: SigLIP, which is what makes it Omni, and the noise mask. Both are optional. The __init__ blocks of each module in the diffusion model are the same as in the Turbo model. Thus, the building blocks are based on what we have seen in Turbo; only new elements have been added. If the configuration files are the same, nothing has changed.
>The training loop itself doesn't have to change for t2i task.
>The lora training on the Turbo worked fine for a few thousand steps - only the day one trainer scripts were rough - and the base model will be more reliable.
noice
>>
>>107590800
>>107590751
who?
>>
>>107590922
https://www.reddit.com/r/StableDiffusion/comments/1lsfobb/full_breakdown_the_bghirasimpletuner_situation/
>>
>>107590922
author of "simpletuner", moralfag who likes to report model trainers for every reason possible so they get taken down or get legal issues.
>>
>>107590932
Autism and its consequences have been a disaster for the human race
>>
i told you retards that the model is gonna come, its simple, they cant compete with nano banana pro in closed source space, so there is no benefit to not going open weights
the only thing to look out for is if they nerfed the model, but they had to release it
>>
>>107590932
what the fuck
>>
File: 1744836137862745.png (283 KB, 2319x1107)
283 KB
283 KB PNG
>>107590893
>SigLIP
I see this as a good sign since only Flux Kontext dev (and not Qwen Image Edit) manages to do editing without zooming in the image
>>
File: 1737246755717112.jpg (678 KB, 2760x2534)
678 KB
678 KB JPG
>>107590716
the sad thing is that the doomers will just find another thing to doom about, their brain is eternally moving goalposts
>>
>>107589032
>not obese
>not 50yo aunty
wtf that's not a normal jeet
>>
>>107591054
she had that indian fat belly
>>
File: DA GOAT.png (79 KB, 240x240)
79 KB
79 KB PNG
>>107591048
>some faggot anime poster on twitter claimed they were releasing it
and he was right, and he's always right lmao
>>
>>107591048
can't wait to see their apology video
>>
>>107591048
all of those are me
>>
>>107591048
can confirm all of those are the anon above me
>>
>>107587767
are you generating at this resolution, or is this an upscale?
>>
File: APOLOGIZE.gif (1.53 MB, 640x358)
1.53 MB
1.53 MB GIF
>>107591095
>all of those are me
APOLOGIZE
>>
File: this.png (426 KB, 640x653)
426 KB
426 KB PNG
>>107588926
>the rugpull didn't happen
remember folks, nothing ever happens
>>
File: 1760232435561786.png (340 KB, 678x460)
340 KB
340 KB PNG
>>107588926
>https://github.com/huggingface/diffusers/pull/12857
>W-well, maybe they'll release base, but thie chinese culture will prevent them to release Edit, trust the pla-ACK
>>
>>107588926
I'm not familiar with chinese culture, but it could be just a trolling: releasing inference and training code is not the same as releasing weights.
>>
>>107591048
>their brain is eternally moving goalposts
I can already imagine it, instead of leaving this place and being ashamed of themselves they'll do the same doomer shit for edit, bunch of fucking losers
>>
>>107591164
>it could be just a trolling: releasing inference and training code is not the same as releasing weights.
that would be the biggest troll move in the history of AI, and nothing will top that until the sun explodes loool
>>
File: file.jpg (138 KB, 1708x703)
138 KB
138 KB JPG
>>107588926
What did they mean by that?
>>
>>107591135
>APOLOGIZE
i've seen no base model yet
>>
File: 1757500051924577.mp4 (864 KB, 1040x480)
864 KB
864 KB MP4
https://xcancel.com/ChetiArt/status/2001291373182382526#m
Imagine SCAIL but with Wan 2.2, holy shit
>>
>>107591245
you know it'll be released, the sooner you accept this the better
>>
File: based.png (511 KB, 1474x1102)
511 KB
511 KB PNG
>>107588926
https://xcancel.com/Ali_TongyiLab/status/2001559091345506651#m
based
>>
File: ComfyUI_temp_vckck_00040_.png (2.65 MB, 1088x1856)
2.65 MB
2.65 MB PNG
>>107591237
wtf is up with this people, they have been harassing this team for weeks now, the model is free and open-source, why do they feel so entitled?
>>
File: I mean.png (951 KB, 1280x720)
951 KB
951 KB PNG
>>107591292
>please stay tuned for further updates
the "updates" in question:
https://www.youtube.com/watch?v=qvHuQ8goiBw
>>
>>107588926
>Yeah the preview images of Z-image turbo look good, but it's too powerful they won't release it
>Yeah they released it but I'm sure it won't look as good as their cherry picked examples
>Yeah it actually looks really good on ComfyUi, but it's a distilled model, and they won't release base
>Yeah they'll release base... but...
You are here
>>
Something is wrong with ZIT offloading in comfy. The KSampler steps randomly slow down by about a factor of 10.
It doesn't happen with the FP8 version that fits into my 12GB VRAM, and it also doesn't happen with other models that are too big like Qwen or Wan.
>>
>>107588926
Z base will release???
CREATIVE CHADS RISE UP!
>>
>>107591420
>ZIT
>offloading
>>
>>107591292
>please stay tuned for further updates
meh. this sounds like it's not yet close to releasing. maybe next year
>>
File: file.png (45 KB, 503x297)
45 KB
45 KB PNG
>>107591320
where is fucking model bloody bastard bitch
>>
>>107591452
It's larger than 12GB. What confuses you?
>>
>>107591237
>>107591320
>>107591459
it's weird that /ldg/ is more reasonable than the public space this time lmao
>>
>>107591461
Q8 is 7GB, vramlet
>>
>>107591476
shut the fuck up and fix it comfy
>>
>>107591482
>crashes out when proven retarded
concession accepted, retard
>>
>>107591491
I already said it's working with fp8 you retard. Why are you talking to me about q8? I have a 40 card. Fix the offloading comfy.
>>
>>107590848
a curse on your mom's left foot
>>
>>107591456
>maybe next year
it's gonna be released on Christmas day, mark my words
>>
>>107591438
...
>>
>>107591292
if it took them this long to finish base it meant that the version that was used to distill turbo was really uncooked, can't believe they rushed turbo and the model ended up being amazing, now I really wonder what turbo would've looked like with a finished base model as a teacher
>>
>>107591473

Better to be a patient gooner than a harasser. M y Christmas wish is for it to be out in the next few days on my vacation
>>
>>107591645
Definitely wanted to dunk on Flux 2. Now they can make Z Turbo 1.1 or whatever. Someone else will redistill the base if they don't do it.
>>
>>107591645
they won't give us an undistilled finetune though, it's like they dare us to reach turbo's level with our own means, can you do it lodestone? are you up to the task
>>
>>107591702
lodestone will take this model to levels nobody ever thought possible
>>
no hope after v7 but apparently pony was already playing with it https://huggingface.co/purplesmartai/zony-v8-256px-exp-de-distilled
>>
>>107591736
>no hope after v7
to be fair he used a really bad base model to do his trainings, but fuck him he removes the artist tags anyway
>>
>>107591713
If someone can manage to make Z unlearn how to do hands, it's xer.
>>
>>107591697
>Now they can make Z Turbo 1.1 or whatever.
if they try another finetune out of it, I'd like it to not be distilled desu
>>
File: 1605485289228.gif (3.96 MB, 441x480)
3.96 MB
3.96 MB GIF
they will never release z-base or wan 2.5, since they are working on wan 2.6 api, lol
>>
>>107591759
different teams
>>
what we really need is a goon model that hasnt been trained on a bunch of degenerate cartoons
>>
>fp8
>>
File: file.png (1.22 MB, 1530x2246)
1.22 MB
1.22 MB PNG
LMAO
>>
>>107591817
>digital rape
yeah lets just make up a word and make whatever that is illegal
>>
>troon with fake tits
>>
>>107591817
>oh noo they made parodies of me they should be arrested for that!
says the blue haired xir who has no issue laughing at memes of charlie kirk lul
>>
File: z_mod_00019_.jpg (731 KB, 1824x1248)
731 KB
731 KB JPG
>>107591120
latent upscale
>>
File: lmao.gif (1.03 MB, 320x288)
1.03 MB
1.03 MB GIF
>>107591817
>digital rape
>>
>>107591817
>whore yourself out on the internet
>get treated like a whore
I see nothing wrong with this. I'd make a video of me fucking her but I don't want digital STDs.
>>
>>107591817
>>107591871
she's trying to moralfag while whoring her giant breasts at the same time, as if it's the right moment to do that when you're supposed to be taken seriously lol
>>
>>107591817
almost 10 years after the "digital blackface" we got "digital rape" lmao
https://www.youtube.com/watch?v=Ox9fQ5LSaD4
>>
>>107591713
>>107591702
he will waste $$$ and make shit like he did with chroma and radiance
>>
>>107591759

2.6 is laughably bad compared to 2.5, I was surprised.
>>
File: main.png (35 KB, 359x372)
35 KB
35 KB PNG
Why do they always act like everyone knows what any of this shit means?
>>
>ZIT
>in the prompt replace "Photograph of" with "GTA San Andreas screenshot of"
>doesn't look anything like a game screenshot, just makes creates chaos
pic rel was supposed to be naked in the jungle at night lmao
>>
File: file.gif (1.02 MB, 640x640)
1.02 MB
1.02 MB GIF
https://simpcity.cr/threads/lyra-crow-lyracr0w0.238666/
>>
>>107591964
llm issue
>>
>>107591963
Ngop Nrfi the popular Star Wars character
>>
>>107588926
MY KNEES!
I KNEEL XI!!!! I KNEEEEEEEEEL
>>
>>107591963
this looks like a davidau schizo tune, avoid like the plague
>>
>>107591964
use a llm to rewrite your prompt, boomer prompting is always working with those models
>>
>>107591292
just tell us how big it is so i can call that one anon a retard. thats all i want
>>
File: 1742088752957601.png (189 KB, 2002x498)
189 KB
189 KB PNG
>>107592224
they will all be 6b anon
https://www.arxiv.org/abs/2511.22699
>>
>>107592260
yea we all want it to be 6b but im going to keep saying it's not. reverse psychology or something
>>
Is there a brainlet's guide to setting this shit up on linux?
>>
File: 1737866619453574.jpg (358 KB, 985x768)
358 KB
358 KB JPG
>>107591817
didn't know her. thanks for the info.one more for my list of... experiences lol
>>
>>107591420
https://github.com/comfyanonymous/ComfyUI/pull/11254
>>
>>107591930
literally came up with one of the most cost-effective ways so far to train a model for flux

even before he did that his training was cheap.
>>
>>107592358
ok lodestone, no one cares about your furshit models
>>
>>107592358
cringe
>>107592365
this
>>
>>107591964
Turns out it goes away when increasing the aura flow shift
>>
File: zimg_0015.png (1.31 MB, 960x1280)
1.31 MB
1.31 MB PNG
>>
>MUH FAMOUS PEOPLE
honestly fucking CRINGE
>>
uhh why are my noodles gone
>>
Saar, help.

I want to switch between width and height for the given integer value while the unselected one is set to 0.
>>
>>107591817
>schizophrenictwink
We should send our very mysterious seether to these redditors
He would blend in
>>
>>107592485
DO NOT PULL
>>
>>107592503
Probably just use node
https://github.com/Azornes/Comfyui-Resolution-Master
>>
https://blog.comfy.org/p/meet-the-new-comfyui-manager
wait what?
>>
File: ComfyUI_temp_bfdor_00016_.png (2.19 MB, 1088x1856)
2.19 MB
2.19 MB PNG
>>
>>107592603
about time I guess
>>
>>107592603
wait do I remove it from custom nodes then?
>>
>>107592603
vibecode to human ratio?
>>
>community fixes your shit
>steal it and claim it as your own
>>
>>107592678
ltrdata is literally on payroll by comfy tho?
>>
Predictions for 2026?

>a local gimped version of wan2.5/3.0
>chroma z image merge
>360 video
>more speed boosts
>no wanchaku
>gpu price increase
>chinese gpus
>z base api only
>C U L T U R E
>>
>>107592715
>Predictions for 2026?
vr 3d video
>>
>>107592715
>>z base api only
not possible, they literally released the inference code on diffusers >>107588926
>>
This may be of interest to someone, a Qwen3-VL dataset tagging UI:
https://github.com/ArchAngelAries/TagScribeR
>>
File: G4Iax5MXUAAMj32.png (103 KB, 1117x1200)
103 KB
103 KB PNG
What UI should you be using if you are running 5060ti 16 vram and 32 ddr5 memory ram?

Can this do 1080 or 720p images/videos?
>>
File: 2025-12-18-003.png (1.7 MB, 1024x1024)
1.7 MB
1.7 MB PNG
We're gonna have a good Christmas bros, I beleeb.
>>
>>107592814
they will release the next version of Qwen Image Edit on christmas as well, best christmas ever!
>>
b-bigma status?
>>
>>107592806
probably comfyui

i'd recommend 720p or 480p but sure, it can with some time and offloading

maybe try the gguf q8 or q5 quants on top of offloading
>>
>>107592568
I tried that one earlier, too much bloat, otherwise nice.

>>107592591
Oh shit you're right. Man, can I extract this right click action? I'm building a workflow that gathers all of the essential values you need to change into one small area so there's less of a mess for beginners.
>>
>>107592806
UI really doesn't matter for this as long as the UI supports the actual software you are trying to run. People who get erections from cable management use comfyUI. Normies use forge.
>>
>>107592851
* 720p or 480p videos, it'll take long enough to compute them on wan (or hyvid)

the images work ok fullhd particularly with z-image or sdxl it'll be fast but you can run anything (also chroma, neta-yume, qwen image, ...)
>>
>>107592806
>What UI should you be using
I'm a long time Comfy hater but in all honesty it's currently the best option.
>>
>>107592806
comfyui
>>
I'll install SD.Next now and goon to slop for 6 hours, then I'll delete it all and repeat after 6 months when I feel bored again.
>>
File: 1432498179182.png (296 KB, 722x768)
296 KB
296 KB PNG
Is there a text filter node that truncates the text after [word]? I've only found replacers and just filters. No cutoffs.
>>
>>107592918
make your own custom node anon, it's easy to do if you ask claude or gemini
>>
??
https://huggingface.co/tencent/HY-WorldPlay
>>
Man how do I get latent previews working again?
so far i've tried setting both in the manager and in the live preview thing, both with the 'old' manager and the new PIP package one.
Note that when I restart, the TAESD selector from the old manager gets reset to NONE. So I think the issue might lie somewhere in there.
fugggg
>>
>>107592976
are you asking what this is? it's a model that "streams" interactive 3D world gen

https://www.youtube.com/watch?v=mmBPLKKTnr4
>>
>>107592997
the only way to fix this right now is to add this flag
> --preview-method taesd
>>
>halve the resolution of my wan 2.2 render
>it still takes just as long to render

How? Why?
>>
>>107593001
So we solved the 5 second limit by having a infinite iterative gen loop?
>>
>>107592928
>claude

Be careful, I almost spent all of my openai credits trying to vibecode a custom wan node

>switched to local qwen3 coder
>it introduced even more errors

Kek
>>
File: ainormies.png (17 KB, 797x147)
17 KB
17 KB PNG
>>107591817
The comments on the thread are pretty funny too, like Nano Banana Pro would be able to generate sex images all so sudden
>>
>>107592928
>>107593024
I only had success with vibecoding the code itself but when it needed to wrap it into the node, it never worked. And I can't code at all to do the final step.
>>
>>107593024
that's why I only use claude and gemini, they're the only non meme coding models
>>
>>107593025
Nice try normies, but my most vile and disgusting fetishes only exist in hentai!
>>
>>107591817
Lmfao, isn't that the literal roastie who has an onlyfans but refuses to do direct cuntshots 90% of the time because she's got literal arby's tier thin roast beef labia?
>>
>>107593020
this is 3d interactive and you could say this cuts corners in terms of speed vs quality. i doubt you'll find it as pretty as wan/hunyuanvideo or comparable overall

it does essentially gen iteratively tho, yes
>>
>>107593025
what kind of sick fuck would ever use ai in such a way?
only absolute, perverted losers digitally rape women
>>
File: zimage_00167_.png (1.42 MB, 1024x1024)
1.42 MB
1.42 MB PNG
>>107593004
oh man that did it.
It feels like they only tested on the desktop/electron distribution, not the portable one. the new manager doesnt even work for me (getting CSP errors, which I know how to fix but then again, HOW did this get through testing???)
Even the jigsaw puzzle icon is missing from portable (while it's there on desktop).
All in all, a very very bad release. now back to gooning
>>
>>107593066
>3d interactive
Like actual 3D files that you can pull and export? Or just pre-render?
>>
>>107593091
>All in all, a very very bad release.
not only that but they're pretending there's no bugs by closing this issue lol
https://github.com/comfyanonymous/ComfyUI/issues/11370
>>
>>107593101
the retard who opened it flagged it as fixed, as the select actually worked for him. I have the feeling that it's only the portable installations that are fucked right now.
>>
File: zimage dall-e.png (2.28 MB, 1536x1024)
2.28 MB
2.28 MB PNG
>>
File: crystal ball.png (1.39 MB, 832x1216)
1.39 MB
1.39 MB PNG
>>107592715
>a local gimped version of wan2.5/3.0
>360 video
>more speed boosts
Possible
>no wanchaku
Likely but there is still non-zero amount of work being done on it recently (check nunchaku pull requests on github), so I have extremely faint hope
>chinese gpus
Not in a state average Joe can use at least
>chroma z image merge
>z base api only
>C U L T U R E
Schizo babble
I will also cope very hard and say that a decent local music model will arrive
>>
>>107593126
man wanchaku will be a fucking game changer. BUT it needs to come with loras support OOTB, not the qwen shitshow we have right now.
>>
File: zit_nfl_teams.jpg (931 KB, 4096x2364)
931 KB
931 KB JPG
interesting
>>
>>107593045
Yeah claude is pretty damn good, sonnet 4.5 in particular. Might switch to haiku next time.
>>
File: ComfyUI_temp_spuon_00052_.png (3.23 MB, 2658x1080)
3.23 MB
3.23 MB PNG
>>107593101
Can't wait until a new repo comes along to stop using this shitty experience called comfyui, 2026 is the year where it dies
>>
>>107593096
i think it's more like a video stream controlled by interactive inputs for now

yes the difference is probably pretty small but I think the model isn't trained to give you 3d models for blender

the actual 3d generation models may currently do this better (they're not perfect yet but getting there)
>>
>>107593126
>I will also cope very hard and say that a decent local music model will arrive
oh god please let this happen, I want udio at home
>>
>>107593133
Qwenchaku has a working unoffical node for loras, no?
I agree that it shouldn't be half-assed regardless.
I don't think they would though, since the low step distill loras are almost essential to running it.
>>
>>107593126
And before I mislead someone I should clarify that by it I meant nunchaku in general, not wan nunchaku (Someone is working on ZIT nunchaku now).
>>
>>107590362
>loginwalled
FUCK

Has everyone just created an account by now for downloads?
>>
>>107593126
>>107593133
Well, we still have woct0rdho for radial attn instead.

>>107593136
>read new york jets as "new york jeets"

I've been on this site for too long
>>
>>107593167
yeah, but I think these 'insert racial chinese slang' will just do the baked lightning models and call it a day (same way they did for qwen).
>>
cozy bread
>>
File: ComfyUI_temp_bfdor_00028_.png (2.42 MB, 1088x1856)
2.42 MB
2.42 MB PNG
>>
>>107593191
Damn I hope not but I fear you are guessing right.
That is if we ever get a wan release anyway.
>>
Saar, I am become desi.
>>
>>107593195
taytay will NEVER have an ass that fat
>>
>>107593242
who watches this
>>
>>107592851
>>107592896
>>107592898
What is Wan (2.1? 2.2?) and why do people recommend it? Can I run it?
>>
>>107593193
>cozy bread
that's what happens when we finally got the certainty base will be released >>107588926
our mind is at peace now
>>
>>107591817
Don't be a whore online and no one will bother with you.
>>
>>107593242
SAAR, THIS FEELS ILLEGAL SAAR.
The quantized versions of Wan Animate can't do a full face swap and they keep the original person face structure so in the end you get a tranny version of yourself

>>107593257
other jeets
>>
>>107593292
2.2 14b
Best local video model, both text to video and image to video. Also does text to image to
>Can I run it?
Yes.
You are not going to run it with good speeds unless you have a beefy rig though.
>>
>>107591817
meanwhile all of womens existance online is to advertise themselves for a rich man and or drain simps by giving them gf larp experience that is actually a pajeet company chatting with the man on onlyfans/social media etc

women are really barely human, they only have enough humanity inside them in order to birth men, thankfully now we all have an option that wont wont be hypergamous monkeybranching solipsistic delusional liars who cant engage with real arguments beyond social signaling their emotions to the group for validation.
>>
>>107593322
Oh it's not a UI but a model you plug into Comfy? I guess someone with a 5060ti should settle for Wan 1.1?
>>
>>107593346
I wouldn't bother with videos on a 5060ti but check the rentries if you care.
>>
>>107590056
jpg does not include the metadata so yeah, it wouldn't work
>>
>>107593183
yeah like a year ago
>>
https://github.com/comfyanonymous/ComfyUI/commit/dbd330454ada04609c69fda2ae7c002d7ea05f67
>>
>>107593346
if you have ram you won't have to worry since you can offload
>>
>>107593392
this is just vibecode slop
>>
>>107593364
>I wouldn't bother with videos on a 5060ti
You mean it can't do videos?

>>107593394
32g ddr5 system ram enough?
>>
>>107593450
if you are running things in the background you will probably suffer and you are going to have to reload high/low noise models every run. best to stick with 2.1 if you just need a stream of gens without reloading
>>
File: weirdcomposition.png (1.8 MB, 832x1216)
1.8 MB
1.8 MB PNG
Why does ZIT make weird compositions like this sometimes?
>Character is taking unusually small part of the image.
>Lanky anatomy
Not the first time I got an image like this.
>>
>>107593481
what is your prompt anon?
>>
>>107593492
Random lyrics out of boredom:
A misty memory
Is she a lost embrace?
Am I in love with just a theme?
But I got results like this on saner prompts too.
>>
I can't do it. I can't pull cumfart because I know it's going to be shit
>>
Has anyone trained a 64 rank style lora on zimage? or even 128 rank? Any info is there really a benefit going that high? I think there should be
>>
>>107593101
>>
File: ComfyUI_02883.jpg (3.15 MB, 1536x2160)
3.15 MB
3.15 MB JPG
>>107591817
>still have to be a Chad to gen you're waifu
There's just no pleasing them, huh?
>>
File: ComfyUI_temp_gisdv_00004_.png (2.68 MB, 1216x1664)
2.68 MB
2.68 MB PNG
>>
>>107593501
>random nonsensical prompt
>terribly composed image
you are actually retarded.
>>
>>107593522
>Has anyone trained a 64 rank style lora on zimage? or even 128 rank?
No but I fail to see how it would differ from training a high rank lora for any other model. Would only benefit and work well with complex style and concept loras with large datasets.
>>
>>107593554
>But I got results like this on saner prompts too.
But I got results like this on saner prompts too.
>But I got results like this on saner prompts too.
But I got results like this on saner prompts too.
I could refute you but I delete all ugly gens.
When it happens again with a normal prompt I will tag (You) from the archives.
>>
>>107592976
So when will we be able to run this locally ?
>>
File: cade.png (954 KB, 667x1000)
954 KB
954 KB PNG
cade
>>
File: toilet.png (1.08 MB, 832x1216)
1.08 MB
1.08 MB PNG
>>
File: troonto.jpg (36 KB, 512x512)
36 KB
36 KB JPG
>>107593187
kek
>>
>>107593501
>>107593590
>But I got results like this on saner prompts too.
You have to prompt like you are a computer. You have to be specific. You have to elaborate. Unironically gitgood.
>>
>>107593564
>>107593522
Whats the highest rank someone trained that came out pretty good, did anyone like that have any advice on their findings?

I trained a dalle style lora for z image on 100 1024p images with the same settings at rank 8 and 32, and the 32 looks a nice amount better. I think I'll train on 256 rank with the same settings next to see the difference
>>
https://poal.me/i09dsq
what say (You)?
>>
>>107593101
>>107593524
bro>>107593392
Almost certainly the culprit judging by the massive latent preview rewrite
>>
>>107593601
It's 32 gigs full size, but there are multiple versions. Als idk what text encoder and whatever 3d enablers it needs
>>
>>107593601
when we get ggufs
>>
taesd vs latent2rgb verdict?
>>
Does qwen edit still only use 8step light lora?
>>
>>107590362
Why would you still use this when there are many other better animu checkpoints?
>>
Why isn't "LoKr" default in trainers instead of LoRA?
>>
>>107593774
cumfart doesn't have compatibility I think or I might be thinking of dora
>>
>>107593694
I did not update comfy today but i updated my system and my terminal in i3wm was broken and also the ksampler previews broken. Something else much have fucked it at least on my case, i had to change picom config to use xrender instead of glx backend to fix things.
>>
>>107593782
It supports it
>>
>>107593678
You'll never catch "schizo anon"
>>
For zit loras do you still need to start every caption with the trigger word or can you still write it into normal sentences like "Photo of someone using x"?
>>
>>107593831
https://rentry.org/ranfaggot

>>107593823
then I have no idea why people don't use it more often
>>
>>107593656
>Whats the highest rank someone trained that came out pretty good, did anyone like that have any advice on their findings?
I have seen a few very large rank loras. Some of them were low step distills. Another was trying to beat better text into SDXL (And failing because it wasn't addressing the root cause)
The use case for them are very niche.
> the same settings at rank 8 and 32, and the 32 looks a nice amount better.
8 is too low for style loras typically, makes sense.
>I'll train on 256 rank with the same settings next to see the difference
It will just start to learn random noise from the training data.
But feel free to experiment.
Also you need more VRAM for larger rank lora training.
>>
>>107593719
there's a 4 step lora.
>>
>>107593837
You can use natural language descriptions normally.
>>
>>107593656
64 but only with a dataset in the thousands. just can't justify higher dins without a lot of data
>>
>>107593874
>>107593884
It's for z image so it's not a problem either way but I'll train 128 rank. I think 64 would be enough and 256 would take 1.3 GB so it's not viable to publish it after training for most to use so I'll experiment with 128.
>>
>>107593930
ZiT reacts to loras like crazy. Even 32 on 1.0 strength burns. 128 would look like deep fried 2015 meme
>>
>>107593950
I think there was some problem in comfyui or whatever that was "fixed" day 1 about zit loras that made the strength 1 basically become strength 0.75 or something around that. So try using those loras that are "burnt" on 1 at 0.7 instead.
I get the best results when training at those high ranks but using in comfyui with strength 0.6-0.8, although that might just be my lora style niche.
>>
>>107593980
of course it's a cumfart issue. fucking hell
>>
Just bought a 5060ti.

Validate my purchase, please.
>>
>>107594035
you have to use the worst media application in existence lol
>>
File: ComfyUI_02812.png (3.38 MB, 2160x1280)
3.38 MB
3.38 MB PNG
>>107594035
A bit much to spend on a doorstop, but it's not my money.
>>
>>107594035
For how much?
Not the strongest card out there, but arguable a cheap entry to 16gb cuda.
>>
>>107594064
$440 pre-tax.
>>
>>107594051
Connecting alien hair braids with Jenny
>>
>>107594109
>>107594109
>>107594109
new bread
>>
Is there an obvious reason why trying to generate a single 832x1248 image with the base ZIT model would burn through 16gb of VRAM, start offloading to RAM, burn through 16gb of RAM and then crash a bunch of my programs before the process itself crashes?
>>
waiting for real bake
>>
>>107594149
he won't bake a real one
https://rentry.org/ranfaggot
>>
>>107594113
Didn't you get banned?
>>
>>107594173
why would anon be banned for baking a thread? are you playing tranitor again?
>>
>>107594149
Dis
>>
>>107594186
>>107594380
Not worth splitting over. Just bake next time.
>>
>>107594145
bf16 gguf by chance?
>>
>>107594391
Agreed. Funny how reasonable we are compared to the last two or three days of the other guy screeching.
>>
>>107594113
>>107594156
>>107594186
>>107594391
fuck off faggot
>>
>>107594455
It's quite sad that he's delusional enough to think if he samefags more he'll salvage the reputation he destroyed by his own hand.
>>
Does anyone have the fork of llama with the image model quanting patch? I can't build it.
>>
>it's over
grim



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.