/g/ - Technology



Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107588906

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg
>>
File: 1740090585913678.png (4 KB, 152x51)
Any trainer that allows full finetune of ZIT?
>>
>>107594115
for what purpose? base is here. no point in using a distill to train
>>
poll from last thread
https://poal.me/i09dsq
>>
Reposting because I did not refresh before the new bread was baked: Is there an obvious reason why trying to generate a single 832x1248 image with the base ZIT model would burn through 16gb of VRAM, start offloading to RAM, burn through (a large majority of) 16gb of RAM and then crash a bunch of my programs before the process itself crashes?
>>
>>107594158
pinned memory issue probably
>>
>>107594115
Imagine botting huggingface download stats
>>
>>107594168
imagine botting a stolen valor repo
>>
>>107594177
it would be stolen valor if zit was already published in safetensor single file format officially, which it wasn't
>>
>>107594177
I seriously don't get why they don't just repackage themselves. investors are probably confused as to why cumfartorg has more downloads than the researchers themselves
>>
I saw it yesterday at 3.5m
You can't expect me to believe 250 thousand people have downloaded comfy's modified version of z-image-turbo in a day
There aren't that many local gen ai users in the entire world
>>
>>107594200
Why does it need to be a single file? Multiple files can download faster
>>107594201
Cumfart changes key names. Z-Image team like diffusers
>>
>>107594202
cumfart bots GitHub stars too. china has phone farms that make GitHub accounts
>>
>>107594165
Is there an easy way to diagnose that further or fix it outright if that's the case?
>>
>>107594158
Post workflow.
Cumfart sporadically gets random memory management bugs. Try reverting to an earlier release or pulling later if you can't find a solution. Upgrading/downgrading torch/cuda versions might also help.
>>
hows the wrapper coming along?
>>
>>107594227
https://rentry.org/ranfaggot
>>
>>107594141
very curious to hear the opinions of the no voters. why are you so quiet when people are clearly having issues?
>>
>>107594216
>Why does it need to be a single file?
because its simpler to move around, store, find, load etc
>Multiple files can download faster
no they dont, unless you are downloading things with your dogshit vibecoded garbage downloader script instead of any software that automatically maxes out your download connection no matter the speed
and even in those cases you can chunk the download of a single file and download multiple chunks at the same time automatically in software
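The chunking idea above can be sketched with HTTP Range requests. This is only a minimal illustration, assuming the server honors `Range` headers and that the total size is already known (for example from a `Content-Length` header); the function names are made up for this example, and it holds the whole file in memory, which you would not do for a multi-gigabyte checkpoint.

```python
from concurrent.futures import ThreadPoolExecutor
from urllib.request import Request, urlopen


def split_ranges(total_size, chunks):
    """Split [0, total_size) into contiguous inclusive byte ranges."""
    if total_size <= 0 or chunks <= 0:
        raise ValueError("total_size and chunks must be positive")
    chunks = min(chunks, total_size)  # never create empty ranges
    base, extra = divmod(total_size, chunks)
    ranges = []
    start = 0
    for i in range(chunks):
        # The first `extra` chunks each take one leftover byte.
        length = base + (1 if i < extra else 0)
        ranges.append((start, start + length - 1))
        start += length
    return ranges


def fetch_range(url, start, end):
    """Fetch one inclusive byte range of a single file."""
    req = Request(url, headers={"Range": f"bytes={start}-{end}"})
    with urlopen(req) as resp:
        return resp.read()


def download_chunked(url, total_size, chunks=4):
    """Download all ranges concurrently and join them in order."""
    ranges = split_ranges(total_size, chunks)
    with ThreadPoolExecutor(max_workers=len(ranges)) as pool:
        # pool.map yields results in input order, so the parts line up.
        parts = pool.map(lambda r: fetch_range(url, *r), ranges)
        return b"".join(parts)
```

For instance, `split_ranges(10, 3)` gives three ranges covering bytes 0-3, 4-6 and 7-9. A real downloader would stream each part to disk and retry failed ranges.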
>>
>trani wrongbaking again
Pathetic
>>
someone bake a real thread this is a troon one
>>
>>107594221
>16gb RAM
Comfy's memory management is trash because by default it always loads checkpoints into RAM, then from there loads them into VRAM. Try various startup arguments like --disable-pinned-memory and try it out
maybe --disable-smart-memory too
--highvram and --cache-none if things become desperate but beware that --cache-none means everything will be automatically offloaded and you'd be loading the model, clip and vae on every run
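A rough sketch of trying those startup arguments one step at a time. The flag names come from the post above; grouping them into "levels", the `main.py` entry point and the use of `sys.executable` are assumptions for illustration (the portable build ships its own embedded Python, so adjust the interpreter path there).

```python
import sys

# Flags named in the post, ordered from least to most drastic.
# The grouping into levels is hypothetical, not a ComfyUI convention.
ESCALATION = [
    ["--disable-pinned-memory"],
    ["--disable-pinned-memory", "--disable-smart-memory"],
    ["--highvram", "--cache-none"],
]


def comfy_command(level, main_script="main.py"):
    """Return the argv list for launching ComfyUI at an escalation level."""
    if not 0 <= level < len(ESCALATION):
        raise ValueError(f"level must be 0..{len(ESCALATION) - 1}")
    return [sys.executable, main_script, *ESCALATION[level]]
```

Something like `subprocess.run(comfy_command(0), cwd="path/to/ComfyUI")` would then launch it; move up a level only if the previous one still runs out of memory.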
>>
>>107594109
Why is AniStudio not in OP?
>>
>>107594232
This is so poorly written compared to the other two almost as if on purpose
>>
>>107594271
it's almost as if the author doesn't revolve his life around documenting the misadventures of a troon. are you going to post on topic or not?
>>
reply again but this time try not to sound upset
>>
File: prostitute.png (1.36 MB, 832x1216)
>>
>>107594270
if he has updates and comes to the thread regularly then it will be added. until then I wouldn't bother sending people to compile c++ binaries
>>
>>107594263
Thanks for the advice, I'll give it a shot. Actually on a fresh installation of Forge Neo atm if that helps diagnose anything. IIRC back when I was using A1111, I had to add some lines into either a config somewhere or the .bat but I had kind of hoped that this would simply run without crashing after however many months of development.
>>
>>107594298
It will never be added because it is unsafe to run.
>>
>>107594321
and cumfart isn't? should we remove that according to your standards?
>>
>>107594301
Sorry anon, last time I used Forge was when Flux got released and it was the og version, so I have no idea how to go about memory optimizations there
>>
File: z-image_nag_00186_.png (1.92 MB, 1024x1536)
>>
>install Wan2GP
>start prompting
>takes my machine 20 minutes to create a 5 second 480p video
>I can't even fap to the result
Bros I'm not sure this is worth it
>>
>>107594346
are you even using the distill lora? once the tensors are loaded it won't take as long
>>
File: prostitute2.png (1.32 MB, 832x1216)
>>107594330
Cumfart has next to 100k stars. There are multiple eyes on every single commit, base Cumfart is safe to run. You should still run under docker or VM because custom nodes aren't trustworthy.
Do you honestly think it is comparable to ani? Do you trust ani not to add anything funny?
>>107594344
Her face is deformed but very cool composition.
>>
Is julien here? I have a question
>>
>>107594344
Forgot to ask for prompt
>>
>>107594250
>simpler to move around, store, find, load
Folders exist lmao
>no they dont
What about when the server with the file you want is at capacity and you get slower speed than your maximum? Multiple files will be distributed to multiple servers in any CDN and therefore download faster
>>
>>107594363
>base Cumfart is safe to run
it's crashing people's computer and calls home. I wouldn't call that safe. I do agree it should be run in a docker instance at all times
>>
>>107594359
No I just installed it and hit generate.
What the fuck is a distill lora?
>>
>>107594388
At least it wasn't made by a spammer.
>a-ani is not the spammer!
Yeah right, the guy that spends 20 hours a day samefagging to defend Ani's honor and got 150(!) of his posts nuked by IP isn't Ani. Lmao.
>>
AniStudio is actually quite great
>>
File: 9QImkTqJX70.png (804 KB, 706x704)
I know this is like 3 years late, but they recently deleted the Huggingface Stable Diffusion 2.1 demo. I'm trying to recreate the workflow locally.
I installed ComfyUI (running on CPU, because don't have $$$ for GPU, each 768x768 image takes 15 min :-( ). I grabbed the ckpt files from https://huggingface.co/Comfy-Org/stable_diffusion_2.1_repackaged/tree/main (should I use ema or noema?) and the v2-inference-v.yaml (renamed to match the ckpt filename) from github before they recently deleted it, and put them in the models folder.
I remember from memory default cfg is 7.5 and steps 25. Yet the results are wildly different from what I used to get online.
What I can't tell is which sampler and scheduler are used. Does anyone know them off-hand, or from testing? Are there other settings worth knowing about? I'm not trying to make anything realistic (or hyperrealistic) or TikTok females. I'm more into vintage art stuff, even if it's not perfect or kind of stylized (pic related, borrowed from someone's YouTube)
>>
>>107594395
https://huggingface.co/lightx2v/Wan2.2-Lightning
>>
>>107594409
And the denoise value. Does anyone know what denoise value should go in the box?
>>
>>107594407
True
It should be added to OP
No idea why anons would be against that
>>
>>107594382
>What about when the server with the file you want is at capacity and you get slower speed than your maximum
which again doesnt happen with any non-toy blog website and never with huggingface which is what we are talking about.
go download your 30 different files instead of just 1 before using them fucking nocoder retard
>>
>>107594411
is 2.2 actually fixed? I still use the 2.1 version because they botched the first 2.2 attempt
>>
>>107594428
sheesh, I don't know why anon, maybe because he's a spammer? https://rentry.org/animanon
>>
>>107594382
Holy dunning kurger low IQ brown.
>>
>>107594440
stop instigating flame wars tRan
https://rentry.org/ranfaggot
>>
>>107594447
technically sharts can be loaded in parallel
>>
>>107594471
Warning others of unsafe software is perfectly on topic.
>>
>>107594484
are you a rusttranny or something?
>>
>>107594494
Ew, I'm not the one that avatarfags with a girl that has a dick.
>>
File: prostitute3.png (1.21 MB, 832x1216)
>>107594395
Unless you have a server grade enterprise Nvidia GPU you run wan 2.2 with the distill lora.
>>
>>107594434
>>107594447
Cope and seethe
>>
>>107594436
2.2 was fixed ages ago.
There is more than one fixed 2.2 distill option you can use.
>>
>>107594409
I wish you well on your journey unfortunately I am not able to help
>>
>>107594520
neat. ty for the heads up
>>
Ani won.
>>
>>107594550
Who's Ani Won?
>>
File: 1735137076335823.jpg (579 KB, 1567x786)
use case?
>>
File: file.png (459 KB, 660x683)
interesting results. for the people that voted yes, why is it getting worse? for the no posters, why is it getting better?

>>107594563
that's not even an aardvark wtf?
>>
>>107594550
What's the best way to support him?
>>
>>107594570
>why is it getting worse
because every recent ui change has been an absolute disaster
memory management got improved but is still bad
basic features like saving the queue are still missing
and the future doesn't look good long term, given they don't want to work with the custom nodes that 95+% of users rely on and that are the main reason people use their ui
>>
>>107594563
I will pretend that the guy was experimenting with lora training rather than the likely reason of degenerate fetishes.
>>
File deleted.
>>107594563
i started making loras here because i thought it was cool to get new concepts, then i shared them on civit and you realize there's a whole gamified ecosystem. they want you to churn out as many as you can, as fast as you can and it doesn't really matter if it's quality, if there's a hundred that do the same thing, nothing. so likely someone just farming buzz.
>>
>>107594583
If you can code he would be very happy for some helping hands i think
Otherwise you can test AniStudio and report issues
Other ways include stuff like telling people about it, bigger userbase = more velocity
>>
>>107594570
>low IQ, afraid of nodes, unable to navigate and fix things on their own, doesn't read the commits on github
he voted yes
>high IQ, pulls updates multiple times a day, workflow always works without issue, reads the commits on github
he voted no

its that shrimple really
>>
>>107594563
I would honestly say making dumb brainrot videos over fetish content. The aardvark is kind of suspicious, though.
>>
>>107594570
Didn't vote but:
Dogshit UI performance
Autistic UI redesigns that make it more cancer to use
Significant development effort being re-directed to API jewry
>>
>>107594595
You cant add velocity to something you (trAni) already nuked into the ground with your autistic behaviour that everyone knows about by now.
https://rentry.org/animanon
>>
>>107594570
>why is it getting worse
- Vue nodes being forced by default
- horrible security
- obsession with bedazzling the UI while tanking the fps
- pinned memory cancer
- killing off custom nodes
- removing basic shit everyone uses
>>
>>107594595
Cool so i can actually contribute
He could consider giving updates here or maybe some mild QA sessions so people know what he's working on
>>
>>107594604
what is your obsession with starting a flamewar for a dev that is working to give us options instead of being forced to use cumfart?
>>
>>107594604
Why do you always try to stir drama? Why wouldn't you want a UI dev to post updates here?
>>
sooo it's wan 2.2 forever for videos?
>>
>>107594612
a dead toy wrapper for another project written by a tranny that spams this thread daily with his meltdowns isnt an option, sorry sis
>>
File: zimg_0019.png (1.61 MB, 848x1280)
>>
File: 1744658135152249.png (864 KB, 1200x751)
>>107594623
https://github.com/FizzleDorf/AniStudio
seems really cute and lightweight for genning. says he plans on doing nodes too.
>>
>>107594622
Well they released 2.6 to API so maybe they will hand us the older toy soon?
Doesn't really feel like it though.
>>
>>107594667
at this point you are just sad
>>
>>107594595
hello ani sir I push my contribution for code please be the reviewing. I change many comments using AI please add credit.
>>
20:38:21
20:39:24
>>
>>107594599
if its getting better why would you need to update multiple times a day, read commits and know how to fix problems on your own?
>>
https://github.com/huggingface/diffusers/pull/12857
>omni-base
is it why base got delayed? because they wanted to turn it into something that could do edit as well?
>>
To not pull is to win at pulling

- Sun Tensocu
>>
File: 1741801319284459.png (2.77 MB, 1024x1536)
>>
>>107594726
>why would you need to update multiple times a day
because it keeps improving with every update
>read commits
so you know what to expect
>know how to fix problems on your own
because your PC/OS isn't a static walled garden single-configuration system so you should be mindful of that fact and be ready to finesse.

90% of the time when someone has a problem with a product in life, it's a personal skill issue.
>>
File: prostitute4.png (1.17 MB, 896x1152)
>>107594747
Since turbo can't do edit likely yes.
They finetuned edit capability after the distill.
>>
>>107594781
>fixing problems that shouldn't exist is good
good goy
>>
>>107594667
It's really great and ani actually cares about local unlike cumfart
>>
>because it keeps improving with every update
bait?
>>
>>107594667
Jesus
Eye cancer: The UI
>>
>>107594793
this. ani is a local first kind of guy unlike comfy's API obsessed nature
>>
>>107594570
>why is it getting worse?
try an older version of the frontend (like 1.17.5) and notice how much smoother it is compared to their modern frontend, it was way less laggy before it's not even close
>>
>>107594809
We should remove cumfart UI from OP and add AniStudio
Cumfart is not really local anymore anyway
>>
>>107594807
he added options to change it however you like

>>107594814
this isn't really an argument for it improving but more proof it's going down the shitter
>>
ani really should give some updates about his ui here
i'm actually hyped for it, it's great so far
>>
>>107594793
Cummed here to say this
>>
I'm rooting for Anistudio~!
>>
>>107594815
At this point "comfy"UI breaks with every single update, had malware and has tons of telemetry
It really should be removed
>>
>>107594890
Holy based!
>>
>
>>
File: 1754949520158699.png (2.03 MB, 2434x793)
https://arxiv.org/pdf/2512.15603
that might be better than using an edit model lol
>>
File: 1765375499440280.png (208 KB, 1562x1566)
rusttrannies infiltrated cumfart btw
>>
>>107595060
This is to have closed source components that definitely won't spy on you
>>
rust is the natural progression of growing up and realizing python is a bit shit if you care about performance
>>
>>107595089
it's clearly rust + python
>>
>>107595059
good to see they are finally making iteration, and therefore real art, possible
>>
>>107595093
replacing parts of python scripts in which rust would be optimal is sensible coding
>>
>>107595059
https://github.com/huggingface/diffusers/pull/12853
>>
File: 1756799096363358.png (77 KB, 1129x387)
>>107594747
>is it why base got delayed? because they wanted to turn it into something that could do edit as well?
it was already like that before they made turbo
>>
>>107595104
why are they bothering with nodes 2.0 then?
>>
>>107595104
Except it's a replacement for parts of the UI
>>
>>107594202
Anon, there is a lot of Indians in the world. A whole lot.
>>
anyone managed to download the latest wan 2.2 blink lora from civitai before today's mass purge? I saw the censorship coming from a mile away but I didn't manage to archive the last one in time.
>>
>>107594202
3.5 million out of 8 billion that have a powerful enough gpu to run a 6b model sounds kinda low to me idk
>>
>>107595112
>no RLHF for Z-image edit
why?
>>
File: z_mod_00089_.jpg (702 KB, 1344x1728)
>>
>>107586802
>Without those custom nodes his software is completly irrelevant
I don't use custom nodes.
>>
>>107595186
you don't do anything interesting that warrants a nodegraph
>>
>>107595089
Everything performance critical in Cumfart's python is API calls to torch and CUDA under the hood. I am yet to see any backend written in other languages giving an actual performance improvement for inference.
Python sucks due to venv bloat, the occasional garbage collection bug shitting up the memory and conflicting custom nodes with conflicting dependencies breaking each other but it doesn't hurt the performance.
>>
>>107594202
nobody but trainers uses transformer format
>>
File: ZIMG_00077_.png (1.55 MB, 1024x1536)
>>
>>107595202
ggml performs better on CPU but sdcpp seems to be about the same as comfyui and is much smaller
>>
>>107595212
>This means Z-image will not be released
that meme is over
https://github.com/huggingface/diffusers/pull/12857
>>
>>107595222
>much smaller
Because it's a fucking naked binary. Also the bulk of comfy is caused by the fuckhuge pytorch and cuda dlls.
>>
>"bulk of comfy"
>entire thing takes up 1/100th the space of any one model weights
yeah I am thinking I don't give a fuck
>>
>>107594667
Do you think any mentally stable person without sexual deviancy would look at this and say "this is good"?
>>
File: 1746005699238978.png (603 KB, 653x515)
>>107595108
>https://github.com/huggingface/diffusers/pull/12853
So we'll get QiE 25/11, Qwen Image Layered and Z-image base on christmas? goddam
>>
>>107595250
bruh my venv is 32 gb of garbage. I don't use models that big
>>
>>107595250
The torch whl is 25% of ZiT
>>
By the way, real thread:
>>107588906
>>107588906
>>107588906

Reminder that the real thread always have the rentry:

>https://rentry.org/animanon
>>
>>107595250
My venv is 11 gigs. what are these 1 TB models and how do you run them anon?
>>
>>107595267
>bruh my venv is 32 gb of garbage.
7.16 gb for me, pretty small compared to the total of models I have lol
>>
>>107595267
>bruh my venv is 32 gb of garbage.
Ngl that's on you my guy
>>
CAN I GET THE HITLER LORA
>>
>>107595275
>Reminder that the real thread always have the rentry:
Is this official?
>>
>>107595281
you can easily reach 1 TB with enough models though
>>
>>107595275
sorry, wrong link:
*https://rentry.org/ranfaggot
>>
>>107595302
You said any one model thoughever >>107595250
>>
I honestly think people should stop encouraging software bloat in the memory drought that is going to get much worse over time and yes that means swear off python trash
>>
File: 1742754120646202.png (157 KB, 1933x1018)
https://github.com/comfyanonymous/ComfyUI/issues/11400#issuecomment-3672000434
Welcome to C O M F Y C U L T U R E
>>
>>107595275
Thanks, nobody deserves debo, trani and cumfart trolling
>>
>>107595326
I knew the hyperfocus on default preview method was gonna cause something lol.
>>
>>107595108
>https://github.com/huggingface/diffusers/pull/12853
>we’re not just enabling smarter image editing—we’re igniting a whole new playground for creators

sloppiest sentence I have ever read

good shit though
>>
File: z_mod_00095_.jpg (689 KB, 1344x1728)
>>
File: ever.png (92 KB, 320x180)
>>107595313
>thoughever
>>
>>107595108
>>107595344
if it only does basic edit it won't be that interesting, the main appeal of an edit model is that it can reproduce your character doing completely unrelated stuff relative to the image input, if that layered thing can do that then yeah you have my attention
>>
>>107595344
>igniting a playground
Kinda weird ngl.
>>
>>107595344
Do you think he asked an LLM to "give him a banger line" when announcing his model to the world?
>>
gotta love MTL AI sloppa commits
>>
can ranfag just fuck off already? tired of the meltdowns while the adults are speaking. you lost. everyone knows you are a schizo loser so shut the fuck up about it
>>
>>107595365
he did, and he used qwen 3 (the driest llm of them all) to do it since it's their model kek
>>
>while the adults are speaking
Do we really need to lie like that?
>>
File: cyj1etphq08g1.png (1.94 MB, 2434x793)
>>107595361
>>
>>107595382
Millions in compute to recreate photoshop magic wand and transform tools
>>
>>107595382
that's what I said, that's basic, what I want is to use this character, putting him on a bike with a different angle or whatever, and putting in on another place with another lighting and see how it fares, that's what makes those edit models so fun, they can act as accurate character loras
>>
File: z_mod_00104_.jpg (747 KB, 1344x1728)
>>
>unsloped qwen
real shit?
>>
>>107595265
>QiE 25/11
Alibaba
>Qwen Image Layered
Alibaba
>Z-image base
Alibaba

The local ecosystem would be so dead without that company, god bless the chinks
>>
noooo get angry at the chinese they are only releasing cool stuff for free to damage our wonderful SAAS lords like openai. Think of how much MONEY the big US companies are losing by you not prompting with them
>>
>>107595419
Reddit ahh post.
>>
>>107595060
Debo said this in /sdg/ some days ago, Comfy dug its own grave by incorporating Rust
>>
>>107595344
I forgive them since it's probably translated.
But man, once your brain pattern see them, it's over, this shit is everywhere.
>>
>>107595361
>if it only does basic edit it won't be that interesting
it's based on Qwen Image, not Qwen Image Edit, so it won't have that character transfer capability unfortunately
>>
File: z_mod_00117_.jpg (578 KB, 1344x1728)
>>
File: Z-image turbo.png (2.63 MB, 1536x864)
>Taylor Swift standing on a floor covered with countless black-and-white fashion posters, each poster featuring Taylor Swift in different fashion poses. Captured from a high-angle 3/4 perspective, focusing on Taylor Swift wearing a red sparkling crystal off-shoulder mini dress with long boots. LOMO Ic-a style, HD photography, emphasizing surreal repetition effect, vivid textures and cinematic composition.
>>
comfy replacement when
>>
File: Z-image turbo.png (2.17 MB, 1536x864)
>>107595548
kek
>>
File: 799.jpg (12 KB, 264x87)
>>107595642
>>
I updated my packages with the manager and now ComfyUI is frozen. I can't interact with anything, even with a fresh install. Has anyone else had this issue? It's the portable version.
>>
The funniest shit is that this prolly actually happens a lot.
>>
>>107595692
considering 3/4 cumfart users are technically inept yeah
>>
File: ah.jpg (14 KB, 400x400)
I'm also imagining a computer virus that forceupdates your every python module to latest.
>>
File: img_00076_.jpg (840 KB, 1264x1728)
>>
>>107595708
made a custom node repo that does this and call it ComfyUI remote security
>>
>>107595131
No I missed it too. People need to stop using that site. The rules seem to be "If your loras are too good they will get removed"
>>
>>107595692
>>107595707
Yes I am quite computer illiterate. Considering you guys are referring to it as cumfart I should just use something else instead of trying to fix this.
>>
>>107595734
unfortunately cumfart has the monopoly. you can try neoforge since that's the latest auto fork with life support
>>
>>107595734
If you are fine with older anime models, you could technically cope with the forge forks or something. If you want/need new models, then noodles it is. And the default workflows for the models are relatively easy on the nodes.

Also this isn't vidya. Forcing to stay bleeding edge updated is suicide.
>>
Babe wake up, they improved on T5
https://huggingface.co/google/t5gemma-2-4b-4b
>>
>>107595814
Does it fit in chroma?
>>
>/ldg/
>182 replies, 30 images
>/sdg/
>111 replies, 92 images
this place is like the lisp threads, everyone is just there to argue with each other, no one seriously programs in lisp.

you ni/g//g/ers lost. you just tell yourself you're smarter while you have your twitteresque cloutfights and do absolutely nothing else. lol.
>>
>>107595819
it's another architecture so no lol
>>
>>107595730
that's the insane part, they're not breaking any rules whatsoever and they have no way to prove they're being used on real people, they're just mass deleting everything for no reason.
>>
https://huggingface.co/inclusionAI/TwinFlow
>we are also working on Z-Image-Turbo to make it more faster!
lul, come on you can't say z-image turbo + 8 steps is too slow
>>
File: 1753568284236064.mp4 (703 KB, 720x960)
>>
Can you at least courteously change part of the background before you use my prompt?
>>
>>107595814
>stingy googleniggers dont want to release even gemma 3 70b let alone gemma 4 after all this time
fuck them
>>
>>107595823
I will allow you to continue seething
>>
>>107595856
I forgot I really liked that almost realistic style, I hope someone trains z image on it.
>>
File: ComfyUI_00109_.png (1.06 MB, 896x1152)
>>107595850
I mean maybe it will help someone running a potato PC.
But wouldn't it produce better quality to wait for the base model and work on that?
>>
>>107595873
no
>>
File: 1756219157753683.png (976 KB, 1360x1134)
>>107595823
>>
File: img_00082_.jpg (625 KB, 1264x1728)
>>
>>107595899
There are lots of people who like that style. So, there definitely will be 2.5D finetunes once we get base and anime finetunes.
>>
File: file.png (2.27 MB, 1024x1536)
testa netayume v4
>>
>>107595935
inpainted 4x upscaled half-fried color sd 1.4 tier ahh gen frfr unc
>>
>>107595935
that is ass, figuratively and literally
>>
>>107595823
/sdg/ is just a pic dump with no discussion. Might as well browse deviantArt
>>
File: ComfyUI_temp_chebo_00002_.png (3.13 MB, 1024x1344)
>>107595935
Both Yume and Newbie need to go back into the oven for a long fucking time. The models are fucking megabad.
>>
>>107595756
>>107595802
I should probably use all this free time to actually learn how to use a computer. Or just go back to being a wagie and cope with NAI kek.
>>
File: zimg_0054.png (1.51 MB, 848x1280)
>>
>>107595965
Newbie needs more time
Yume has been like this for months.
>>
>>107595666
If you have "Save image as Type" chrome extension installed, disable it.
>>
>>107595970
If ani makes substantial progress there will be a braindead exe eventually
>>
File: 💀.jpg (104 KB, 768x766)
>>107595666
bro doesnt know how to open dev tools
>>
File: 00027-303032542.png (2.45 MB, 1248x1824)
>>
>>107595965
Newbie is a severely undertrained alpha release.
I don't have high hopes but maybe it can converge into something decent.
Yume is getting worse and worse from previous releases.
Anyone who doesn't think it's another failbake is coping.
I don't know what the guy training it fucked up, but he screwed up the training royally.
Maybe it was just a grift like some claim.
>>
>there will be a braindead exe eventually
It's called sd-cli
>>
File: zimg_0045.png (1.66 MB, 848x1280)
>>107596007
you go ahead and download some random exe and run it on your computer. i'll watch.
>>
File: file.png (3.16 MB, 1024x1536)
>>107595965
>>107596025
I think v4 does anatomy better. I think that v3 had the best results tho... been a while since I tried desu
>>
>>107596025
>Maybe it was just a grift like some claim.
Nah. It's completely decomposed. The model has three eyes each looking the other way.
>>
>>107596036
it's open source, you can see all the code and judge from there
>>
>>107596040
v4 has SD 1.5 tier hands and concept bleeding
It got worse than v3
>>107596041
What caused it though?
On paper it has the ingredients of a good model.
>>
Could you use sdcpp to make a tiny app with a hardbaked ZiT with some pre-set optimal settings and a slot for a loras? And gui to hide all the tech shit?
>>
Actually nevermind. I realized that would lower the barrier of entry. Ignore.
>>
>>107596093
it's the exe release from the sdcpp repo. you can probably just write a script that passes the arguments
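A wrapper script along those lines might look like this. It is only a sketch: the flags used (`--diffusion-model`, `--vae`, `-p`, `-o`, `-W`, `-H`, `--steps`, `--cfg-scale`) are the ones I recall from the stable-diffusion.cpp README, so check them against `--help` on your build. The file names, the 8-step / cfg 1.0 defaults and the function names are placeholders. The text-encoder flag is left to `extra_args` because its name has varied between releases.

```python
import subprocess


def build_sd_command(binary, diffusion_model, vae, prompt,
                     output="output.png", width=832, height=1216,
                     steps=8, cfg_scale=1.0, extra_args=()):
    """Build the argv list for one stable-diffusion.cpp CLI run."""
    cmd = [
        binary,
        "--diffusion-model", diffusion_model,
        "--vae", vae,
        "-p", prompt,
        "-o", output,
        "-W", str(width),
        "-H", str(height),
        "--steps", str(steps),
        "--cfg-scale", str(cfg_scale),
    ]
    # Anything not covered above (text encoder, LoRA dir, offload options).
    cmd.extend(extra_args)
    return cmd


def generate(**kwargs):
    """Run the CLI and raise if it exits with a non-zero status."""
    subprocess.run(build_sd_command(**kwargs), check=True)
```

A GUI would then only need to collect the prompt and call `generate(binary="./sd-cli", diffusion_model=..., vae=..., prompt=...)`, keeping the presets hidden from the user.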
>>
File: file.png (2.8 MB, 1024x1536)
>>107596041
holy garbage lmao
curious what you tried to prompt
>>
File: file.png (593 KB, 989x554)
Y'all ain't said nothin, believe that. Once Z-Image Base comes in and that anime finetune cut these failbakes heads clean off UUH, I'm outta here. You ain't never seen no nigga be gone like I'mma be gone
>>
>>107596093
>Could you use sdcpp to make a tiny app
Mobile app? I don't know which backend you would use (Vulkan?) but it supports quite a few so possibly you can find one that works. No idea if it would compile problem free for arm though.
>a hardbaked ZiT
It supports ZiT, more limited, no fancy stuff like partial offloading but it runs it
>pre-set optimal settings
Not as many to tinker with as comfy but yes you can do that
>a slot for a loras
The lora control is more limited as far as I understand but possibly (I didn't bother testing that)
>And gui to hide all the tech shit?
Yes you can just call the executable under the hood.
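A minimal sketch of the "script that passes the arguments" idea, assuming the sdcpp release ships an executable named sd-cli. Every flag here except --offload-to-cpu (which is named elsewhere in this thread) is a placeholder assumption, and so are the file names; check the executable's --help for the real spelling before relying on any of it. The function only builds the argument list, it does not run anything.

```python
import shlex
import subprocess

# Hypothetical baked-in presets. Flag names and values are assumptions,
# not taken from the sdcpp documentation.
PRESET = {
    "--steps": "8",
    "--cfg-scale": "1.0",
}

def build_command(exe, model, prompt, out_path, lora_dir=None, offload=False):
    """Assemble the argument list a tiny GUI would hand to the executable."""
    cmd = [exe, "--diffusion-model", model, "-p", prompt, "-o", out_path]
    for flag, value in PRESET.items():
        cmd += [flag, value]
    if lora_dir is not None:
        # The "slot for loras": a directory the user can drop files into.
        cmd += ["--lora-model-dir", lora_dir]
    if offload:
        cmd.append("--offload-to-cpu")
    return cmd

def run(cmd):
    """Run the assembled command and raise if the executable fails."""
    return subprocess.run(cmd, check=True)

# Build (but do not execute) a command with hypothetical file names.
cmd = build_command("sd-cli", "zit-q4.gguf", "a cat", "out.png", offload=True)
print(shlex.join(cmd))
```

A GUI on top of this would only need to collect the prompt and the lora directory, then call run(cmd).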
>>
>>107596128
>It supports ZiT, more limited, no fancy stuff like partial offloading but it runs it
it has offloading and easycache
>>
>>107596128
>no fancy stuff like partial offloading but it runs it
A q4 should be around 3-4 gigs, which is more bearable as a component.
>>
>>107596132
>it has offloading
It doesn't have the offloading Comfy has for ZiT or anything comparable to it.
https://github.com/leejet/stable-diffusion.cpp/issues/1013
The "offloading" in sdcpp is offloading different model parts like the text encoder to cpu while the unet is running. It doesn't have a per-layer offloading implementation.
>easycache
Worthless forced meme
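
To make the component-level vs per-layer distinction above concrete, here is a toy calculation. It is not how sdcpp or Comfy actually schedule memory, and all sizes (in GB) are made-up numbers. The assumption is simply that with component-level offloading the largest whole component must fit in VRAM at once, while with per-layer offloading only a few layers need to be resident at a time.

```python
def peak_component_offload(components):
    """Whole components are swapped in and out: the biggest one sets the peak."""
    return max(sum(layers) for layers in components.values())

def peak_layer_offload(components, resident_layers=1):
    """Per-layer offload: only the `resident_layers` largest layers count (crude upper bound)."""
    all_layers = [size for layers in components.values() for size in layers]
    return sum(sorted(all_layers, reverse=True)[:resident_layers])

# Hypothetical model: per-layer sizes in GB, not measurements of any real model.
components = {
    "text_encoder": [0.5] * 8,
    "diffusion_model": [0.25] * 24,
    "vae": [0.2],
}

component_peak = peak_component_offload(components)
layer_peak = peak_layer_offload(components, resident_layers=4)
print(component_peak, layer_peak)
```

In this toy setup the component-level peak is the full diffusion model, while the per-layer peak is only the four largest layers, which is why the two kinds of "offloading" are not comparable for low-VRAM cards.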
>>
File: ComfyUI_00755_.png (1.59 MB, 1200x1600)
ZIT can't generate open photocopiers. The lid is always down. This is a problem for obvious reasons.
>>
>>107596234
Can it generate it open if a girl is sitting on the glass under it?
>>
File: z-image_nag_00187_.png (2.04 MB, 1024x1536)
>>
>>107596234
Also what if you combined it with that clear plastic chair from under lora?
>>
File: z-image_nag_00189_.png (2.15 MB, 1024x1536)
>>
>>107596234
"the top lid cover of the photocopier is at a 90 degree angle and open and resting against a wall"
>>
File: 1765632927020690.jpg (493 KB, 2560x956)
Loras work on DFloat11 quant (Z-image turbo) now
https://github.com/mingyi456/ComfyUI-DFloat11-Extended/pull/8
https://huggingface.co/mingyi456/Z-Image-Turbo-DF11-ComfyUI
>>
>>107596276
Why doesn't he quant actual big models?
>>
>>107595990
I'm not sure if this is bait or not.
>>107596008
We all have to start somewhere pal, I doubt you came out of your mother's womb knowing how to use a computer.
>>
File: ComfyUI_00756_.png (1.65 MB, 1200x1600)
>>107596240
I am unable to make it do that, such is the problem.

>>107596260
Can't find it.

>>107596272
No luck.
>>
>>107596276
kinda crazy how this was out for months and months and nothing happened

same thing with tensorrt, big speedup with any model after it's trained in a couple hours and yet nobody gives a shit to make it work with modern models

same with patching loras on the fly into the model to lower vram usage but nobody cares

same with compressing all model weights to save a huge ~30% of file size and decompressing on the fly as they are loaded into vram but nobody fucking cares
>>
>>107596252
>>107596263
the glitched patches of skin are a lil nauseating
>>
>>107596305
>same thing with tensorrt
Wasn't it locked to 50XX and you needed to convert everything else to that format too?
>>
>>107594371
Against a star-studded cosmic backdrop featuring nebulous clouds and bright stellar points, a figure draped in flowing emerald-green fabric arches backward with arms extended upward, as if surrendering to or being drawn by an intense beam of rainbow-colored light descending from above; this radiant prism—composed of distinct vertical bands of magenta, pink, orange, yellow, green, and gray—casts vivid illumination upon her form while contrasting sharply with the deep blackness surrounding her, evoking a surreal yet serene atmosphere of awe-inspiring transcendence within the infinite universe.
>>
>>107596291
>We all have to start somewhere pal, I doubt you came out of your mother's womb knowing how to use a computer.
the first thing i did after coming out was ctrl shift i the electron app chromium wrapper heartbeat monitor software to make sure it's not connecting to the google analytics server, to delay their telemetry collection on me
>>107596314
no, you just have to train the model once for a couple of hours for each model/lora/lora combo you want to use, but if you plan on using a model or lora a lot it's more than worth it
>>
>>107596305
dfloat11 is like 2x slower and z-image already fits completely in vram, it's pointless
>>
>>107596291
i didn't start somewhere, i've always been here
>>
File: z-image_nag_00196_.png (1.68 MB, 1024x1536)
>>
from PIL import Image
import numpy as np
import comfy.utils
import time

class SaveImageWebsocket:
    @classmethod
    def INPUT_TYPES(s):
        return {"required":
                    {"images": ("IMAGE", ),}
                }

    RETURN_TYPES = ()
    FUNCTION = "save_images"

    OUTPUT_NODE = True

    CATEGORY = "api/image"

    def save_images(self, images):
        pbar = comfy.utils.ProgressBar(images.shape[0])
        step = 0
        for image in images:
            # Convert the float tensor (0..1) to an 8-bit PIL image.
            i = 255. * image.cpu().numpy()
            img = Image.fromarray(np.clip(i, 0, 255).astype(np.uint8))
            # The image is passed along with the progress update as a (format, image, max_size) tuple.
            pbar.update_absolute(step, images.shape[0], ("PNG", img, None))
            step += 1

        return {}

    @classmethod
    def IS_CHANGED(s, images):
        # Always report a change so the node re-runs every time.
        return time.time()

NODE_CLASS_MAPPINGS = {
    "SaveImageWebsocket": SaveImageWebsocket,
}

If I want to hardcode my default image file to be jpg at 95% qual instead of png bloat, do I set it up in the ("PNG", img, None))? Can jpg preserve the comfy metadata even?
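
On the metadata half of the question, a minimal Pillow-only sketch, separate from the ComfyUI node above: JPEG has no PNG-style text chunks, but it can carry EXIF, so one option is to serialize the metadata as JSON into the standard ImageDescription tag. The helper names and the metadata dict are hypothetical. Whether ComfyUI will load a workflow back from such a file is not something this sketch establishes, and a JPEG EXIF segment is capped at roughly 64 KB, so a very large workflow may not fit.

```python
import io
import json
from PIL import Image

IMAGE_DESCRIPTION = 0x010E  # standard EXIF/TIFF ImageDescription tag

def encode_jpeg_with_metadata(img, metadata, quality=95):
    """Encode a PIL image as JPEG bytes with JSON metadata stored in EXIF."""
    exif = Image.Exif()
    exif[IMAGE_DESCRIPTION] = json.dumps(metadata)
    buf = io.BytesIO()
    # JPEG has no alpha channel, so force RGB before saving.
    img.convert("RGB").save(buf, format="JPEG", quality=quality, exif=exif.tobytes())
    return buf.getvalue()

def read_metadata(jpeg_bytes):
    """Read the JSON metadata back out of the EXIF block."""
    with Image.open(io.BytesIO(jpeg_bytes)) as im:
        return json.loads(im.getexif()[IMAGE_DESCRIPTION])

# Round trip with a placeholder image and a made-up metadata dict.
demo = Image.new("RGB", (64, 64), "gray")
data = encode_jpeg_with_metadata(demo, {"prompt": {"hypothetical": "workflow"}})
print(read_metadata(data))
```

The pixels are still lossy at quality 95; only the metadata survives exactly.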
>>
>>107596342
>dfloat11 is like 2x slower and z-image already fits completely in vram, it's pointless

"At batch size = 1, inference is approximately 2× slower than the original BF16 model, but the performance gap narrows significantly with larger batches."
i'd like to see the numbers for 8-16 images in a batch but the point here is the tech itself, and there is also no reason not to use this as the standard format for model storage at the very least, 30% storage saved for free, I do that with 7zip locally already
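
For a rough feel of where the ~30% comes from, a back-of-the-envelope sketch: BF16 stores 16 bits per weight, and DFloat11 is taken here to average about 11 bits per weight, as the name suggests. The 6B parameter count is a hypothetical input, not a measured figure for any specific checkpoint, and real files carry extra overhead.

```python
def bf16_size_gb(params_billion):
    """BF16 uses 2 bytes per weight, so 1e9 weights is 2 GB (decimal)."""
    return params_billion * 2

def df11_size_gb(params_billion, bits_per_weight=11):
    """Assumed average of ~11 bits per weight after lossless compression."""
    return params_billion * bits_per_weight / 8

params = 6.0  # hypothetical parameter count, in billions
original = bf16_size_gb(params)
compressed = df11_size_gb(params)
saving = 1 - compressed / original
print(f"{original:.2f} GB -> {compressed:.2f} GB ({saving:.1%} smaller)")
```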
>>
>>107596234
for me it's the lack of crossbows.
>>
File: ComfyUI_00759_.png (1.43 MB, 1024x1600)
>>107596234
Qwen does it no problem. Huge L for Tongyi tbqh.
>>
>>107596314
>>107596337
It takes a few minutes to create a single tensorrt model, not "a couple of hours"
>>107596326
Thanks
>>
File: 1748190505664482.png (3.43 MB, 3840x1369)
>>107596342
>dfloat11 is like 2x slower and z-image already fits completely in vram
on z-image the speed is the same and it's always nice to get rid of some of its size (you have more room to put an llm rewrite on top of it)
>>
>>107596380
the solution is to buy more storage and gen higher quality things per image :)
aside from just saving the metadata in a txt file of course, or also saving it as lossless jpg to jxl
>>107596391
>It takes a few minutes to create a single tensorrt model, not "a couple of hours"
from what i remember, early in development some of the things i was looking at took a couple hours on my 3090, maybe it was that particular model or an early version of the software
>>
just save it as a TIFF
>>
File: ComfyUI_00760_.png (1.13 MB, 1024x1600)
>>107596388
You can even turn it on. My God.
>>
>>107596170
--offload-to-cpu
>>
>>107596405
>lossless jpg to jxl
by lossless here i mean the jpg to jxl transcode is lossless, after you already lost info saving to jpg
>>
>>107596392
In my tests with 12gb VRAM it always runs slower.
Very slightly slower at high res and noticeably slower at 1024p.
I trust my own tests more, I recommend testing it on your own device instead of grabbing random images
>>
>>107596388
qwen is retarded in so many other ways though
>>
File: z-image_nag_00202_.png (1.71 MB, 1024x1536)
>>
File: zimg_0108.png (2.05 MB, 960x1536)
>>
File: 00064-2709436679.png (2.28 MB, 1824x1248)


