[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: zimage_00125_.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107476773

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
Cookies!
>>
Don't you people ever get sick of spamming the same generic twintails anime girl every day
>>
thx 4 bake
>>
idiots
>>
File: zimage_00112_.png (2.21 MB, 1184x1680)
2.21 MB
2.21 MB PNG
>>107481508
no
>>
File: Z-image turbo.png (1.27 MB, 1280x720)
1.27 MB
1.27 MB PNG
>>107481484
thanks anon for the bake
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo/discussions/81#6936a24bd6c891ad17323df6
>Then honestly promising a model dedicated to fine-tuning, purely dedicated to the open-source community, only to ultimately say, “No, fuck off, pay for our API after all,” would be the worst move.
> treating people in the community like idiots is shooting themselves in the foot.
>>
Blessed thread of frenship
>>
File: ComfyUI_00010_.jpg (2.98 MB, 2048x2048)
2.98 MB
2.98 MB JPG
Any tips for making Z output less sloppy?
>>
>>107481517
>only to ultimately say, “No, fuck off, pay for our API after all,” would be the worst move.
>> treating people in the community like idiots is shooting themselves in the foot.
It's like it's his first time being swindled by the Chinese
>>
>>107481533
first it was the wan rugpull, another one would definitely make me spam winnie the pooh memes even harder
>>
>>107481529
why does it looks so slopped though? that's weird, show your workflow
>>
miku spammer has ruined this board. Jesus Christ you should be ashamed of yourself. Fucking autist.
>>
>>107481533
>>107481549
>the wan rugpull
Alibaba never promised wan 2.5 so it's not their fault you assumed imaginary shit
>>
>>107481564
the leaker Q promised it as an open model at first thoughever beit?????????
>>
>>107481559
say this again without crying this time
>>
File: ranking.png (105 KB, 1104x904)
105 KB
105 KB PNG
there's no way they release after this. base will be API only
>>
File: it's over.png (762 KB, 1329x1219)
762 KB
762 KB PNG
>>107481569
>the leaker Q promised it as an open model at first thoughever
oh no... history repeats itself!
>>
baker whered you go off?! we had sdg schizos and adg pedos making fun of us
>>
SDXL forever
>>
>>107481578
>there's no way they release after this. base will be API only
this, they're probably even angry they released Z-image turbo at all, Alibaba really underestimated the power of this model, they thought this tiny team wouldn't do shit
>>
>>107481578
That doesn't make a lot of sense from a business perspective, turbo is a lot more efficient to run with comparable results and they released it with a commercial license for free.
>>
>>107481564
Calm down ch*ng ch*ng, they purposely didn't answer any questions about open source at first making it ambiguous so they farmed the open source community for numbers on their social media and people using their api on release to test the model that will supposedly end up releasing later on (it didn't), al motivated by their xinccc shills on xitter and shitcord who claimed it will indeed release in due time.
So go fuck yourself, winnie the pooh.
>>
File: zimage_00128_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
>>
>>107481613
>they purposely didn't answer any questions about open source at first making it ambiguous
when it's ambiguous, the first reflex as a 3 digit IQ individual is to not make up fan fiction, but since your IQ is 80 you didn't listen to that basic advise, many such cases and many such retards
>>
>>107481627
cao ni ma
>>
>>107481575
I'm laughing at you. You are spamming the same image for 24/7 every single day. Must be sad to be you. So very sad.
>>
Some one mentioned limb editor or something, we had that for months already, see https://github.com/toyxyz/ComfyUI-ultimate-openpose-editor
>>
>>107481608
>That doesn't make a lot of sense from a business perspective
it does, if they release base to the wild, many small API companies will just finetune it and put it on their list and the API ecosystem will be ultra competitive, they're shooting themselves in the foot if they do that
>>
File: 1746116156669178.jpg (76 KB, 720x720)
76 KB
76 KB JPG
Is there some brainlet guide for AI toolkit, I tried cloning Z-Image repo, downloading the model with hf download, or simply linking to the required files in a folder and it simply says no checkpoint found no matter what I do
>>
>>107481627
Yeah, if you are a proto jew ni hao ma with your 3cm small xincc ding dong, if you are a normal person you always assume the other person has also the best intentions for you, specially if there's shills from your team telling people "it will release later on" in xitter and shitcord as I already said, now cope more.
>>
>>107481559
>ruined this board
>>107481645
>I'm laughing
based?
>>
>not getting a base model to finally be free from sdxl
>no other models to even come close being announced
>cumfart in ruins because of the chink grifter and jeet devs
>zero progress to freeing ourselves from cumfart
>saasniggers won
it's never been more over than this
>>
>>107481659
>if you are a normal person you always assume
again you still don't get it, you shouldn't assume anything, they have the right to put some of their models API only, how do you think they're gonna make money if they release everything to the nature you fucking retard?
>>
>still using SDXL
>>
File: zimage_00130_.png (2.56 MB, 1280x1600)
2.56 MB
2.56 MB PNG
>>107481645
>>
>>107481678
>implying dit bloatslop is good
>>
File: migu dab.png (228 KB, 640x640)
228 KB
228 KB PNG
>>107481645
I'm having my fun while making you seethe eternally, can't be a better scenario desu, feelsgoodman
>>
>>107481678
If you want to goon XL is the only real option, and for SFW API stuff is better
>>
>>107481671
can someone translate from chinesoid to a real language? I dont understand this babble
>>
File: 1200_1200.jpg (287 KB, 1200x1200)
287 KB
287 KB JPG
>5070 speed + big vram
>built for easily finetune nsfw loras
>can experiment other stuff while genning 1girl in background
>power efficient
I might pull the trigger
convince me not to get one
>>
>>107481671
You absolute retarded mongol, the amount of people able to run a model like wan 2.5 is like 0.01% of the total users they'll ever have, it won't have any financial impact at all for them, your retardation has no limits.
Those people in fact will elevate the model via loras and improvements/performance increases.
>>
>pretending anons here still use XL
>>
seriously, who is forking cumfart so we don't have grift chink incompetence every update?
>>
>>107481650
nothing stops a project like chroma from happening. and at least this time it would be on a good base, even with the loss of quality from undistilling the model. i guess it all boils down to whether or not we will get good hands on the finetune
>>
>>107481707
>the amount of people able to run a model like wan 2.5 is like 0.01% of the total users they'll ever have
then why are you shitting and pissing yourself over the fact this giant unrunnable model isn't local?
>>
>>107481716
shut the fuck up trani
>>
>>107481706
At this point the way ram prices are you should just pull the trigger if you can get this at MSRP
>>
>>107481706
don't get one. it's basically the overpriced digits garbage and unified memory is a meme if the bandwidth is bad
>>
>>107481726
you must be 18 to post here
>>
Was it finally confirmed that base wasnt getting released?
>>
>>107481723
hello cumfart
>>
>>107481734
nope, they only said it's gonna be released, but they can't stay silent forever, we'll know the end of the story once the model is finished, we'll see if it's gonna be API only or not
>>
>>107481722
>Unrunnable, nobody said that you strawman xincc, just like wan 2.2, only a really, really small number of people actually can run the model, by the hardware needed itself or by technical knowledge, again, is a literal no difference when it comes to api consumers.
>>
>ask a team for their dataset
>suddenly the base is delayed and "not finished training"
gonna have to call scoobs and the gang to figure this mystery out
>>
File: 1740305306379725.png (2.63 MB, 1216x1728)
2.63 MB
2.63 MB PNG
>Here anon, take this
>>
>>107481751
>nobody said that
>>107481707
>the amount of people able to run a model like wan 2.5 is like 0.01% of the total users they'll ever have
>>107481751
>only a really, really small number of people actually can run the model
yo is this guy retarded?
>>
>>107481727
yes, that's a big part too
>>
>>107481761
The box also contains Anthrax
>>
lol even reddit is hating on cumfart
>>
>>107481756
the base model won't have the noob finetuning, it's gonna be a separate finetuned model specialized on anime
>>
Is aitoolkit good for training loras for other models than zit? I can't find the usual settings you have to set.
>>
>>107481767
Holy mongol retard, xincc, that 0.01% is already the people who can run wan 2.2 compared to anyone using api, just demonstrating the xincc logic and honesty yet again, guess is a trait you can't get rid of.
>>
>>107481782
desu he deserves it after removing the stop button, what a retarded move, in a more competitive space that kind of move would the end of your career
>>
what's the website where i can upload a image comparison with a slider between 2 images?
>>
>>107481797
>I'm shitting and pissing myself about the fact wan 2.5 won't be local even though only 0.01% of local users would be able to run it
you are genuinely a subhuman retard anon, there's nothing I can do for you
>>
>>107481806
juse use ure eyeys lol
>>
>>107481559
>miku spammer
We all spam mikus newfag
>>
>>107481798
I think cumfart 's time in the sun is over. it's time for someone else to step up
>>
>>107481806
yes
https://imgsli.com/
>>
>>107481810
Redirected to >>107481707
>>
TETO CHADS RISE UP
TETO REVOLUTION WHEN!?!?!
FUCK MIKU SIMPS BEHEAD MIKU
>>
At this point I think nigger rigged XL models are the future. At least bigasp2.5 with rectified flow learning gives interesting outputs and feels somewhat different
>>
>>107481729
this reddit parroting isn't very convincing
>>
>>107481828
>please bro, let them release Wan 2.5 locally with the apache 2.0 licence so that every API company can use it and put it on their own site, making Alibaba's API option weaker, especially if one API company decides to run their version of Wan 2.5 cheaper
yo is your brain shrinking more and more throught the discussion or something?
>>
>>107481852
>Moving the goalpost for third time
Nobody is stopping them from releasing it under a more restrictive license for big companies.
Keep coping xiao nig.
>>
>>107481816
>we
>>
>>107481806
rgthree image compare node in comfyui :^)
>>
>>107481865
>a more restrictive license for big companies.
who's gonna enforce that though? you think Alibaba will be allowed to verify the weights of API companies and sees if they're not using a finetuned Wan 2.5 version to hide it? are you fucking retarded? genuine question
>>
>>107481880
>Moves the goal for a fourth time
LMAOOO, trying to hold to the smallest branch at this point.
Again keep coping, not gonna change the facts.
https://en.wikipedia.org/wiki/China%27s_final_warning
>>
>>107481904
you're the one who's coping, I gave you a lot of reasons why it's a suicide to release your best model locally, but you don't want to listen, your chink hate is clouding your vision, did your girlfriend cheat on you with a chink or something? what's wrong with you?
>>
You don't get it. I AM Chinese.
>>
bac
>>
i was hyperventilating the entire time
>>
>>107481912
I will never stop hating after panda express gave me teriyaki chicken when I asked for sweet and sour.
>>
I was shidding n pissing the entire time
>>
shouldve used that time to reflect on why you are such a huge faggot
>>
>>107481914
cao ni ma
>>
File: ComfyUI_00082_.png (2.51 MB, 1280x1280)
2.51 MB
2.51 MB PNG
>>107481556
Prompt too simple + shitty sloppy Brap Pitt lora + absolutely cancerous SeedVR upscale
>>
>>107481788
It sucks, I'd go with onetrainer, idk maybe they already implemented zit by now
>>
>>107481648
Nothing better out? This looks abandoned.
>>
File: 177.jpg (910 KB, 857x579)
910 KB
910 KB JPG
doomposters are always right
>>
How much extra ram/vram usage does it take to use a huge workflow, just basic nodes? I can't tell a difference.
>>
>>107481706
>LPDDR5x unified memory
this is just DGX Spark shit.
>>
>>107481717
>nothing stops a project like chroma from happening
and nobody would care because chroma is broken garbage. only base model would matter.
>>
>>107481529
you can tell someone used the lenovo lora because it has a tendency to add 'lenovo' in text somewhere in the gen
>>
File: file.png (3.03 MB, 1248x1872)
3.03 MB
3.03 MB PNG
bunda
>>
>>107481651
but it just downloads the model for you
>>
>>107482065
it doesn't, seems to be some kind of recurring bug for some from what I've seen on the discussions page, and looks like I've got it
>>
>>107481706
Can this thing be linked to a normal weak computer, so you run your software on the weak computer and have this thing compute and send the result to the weak computer, just like if this thing was a gpu inside the computer?
>>
>>107481958
>shitty sloppy Brap Pitt lora
but it can do Brad Pitt without the lora lol
>>
https://www.reddit.com/r/comfyui/comments/1phf24v/comfy_cloud_is_cooked_with_new_pricing_kickingin/
comfyui exerting Chinese culture and even some Jewish culture
>>
File: 1739116073866024.png (282 KB, 2546x994)
282 KB
282 KB PNG
>The launch should be soon, they're updating a lot of things.
https://github.com/Tongyi-MAI/Z-Image/commits/main/
they're just patting themselves on the back on the fact that ZiT got ranked 8 on artificial analysis lol
>>
File: 1735527520609490.png (2.46 MB, 1216x1728)
2.46 MB
2.46 MB PNG
Here's the base model you asked for
>>
>>107482154
thanks gigachad
>>
posting Miku is so fucking basic and boring, it's like being avengers fan irl or something
>>
>>107482154
gigabased
>>
>>107482154
does ZiT know gigachad?
>>
>>107481798
>in a more competitive space that kind of move would the end of your career
Unless you work for Microsoft, or Google, or Apple, or Youtube, or...
>>
>>107482176
It's also a good benchmark. If it can't get Miku right, it's shit.
>>
what custom node are you using to run the abliterated prompt enhancer on comfyui?
>>
>>107482199
sorry, im not sending my prompts to the cloud
>>
File: 1756548703376962.png (3.23 MB, 1440x1920)
3.23 MB
3.23 MB PNG
>>
>>107482176
cringe
>>107482197
this anon gets it
>>
File: file.png (2.95 MB, 1248x1872)
2.95 MB
2.95 MB PNG
robot gf when

>>107482180
its a lora
https://civitai.com/models/2199607/z-image-turbo-gigachad?modelVersionId=2476616
>>
>>107482207
I want to run the prompt enhancer locally
>inb4 muhh custom prompt = it gets sent to the cloud
if you find the specific code that does what you halucinate maybe we'll take you seriously
>>
>>107482210
Ms. President, when will you tariff china for not releasing Z-image base?
>>
File: 1755582418753679.png (2.32 MB, 1216x1728)
2.32 MB
2.32 MB PNG
>>107482218
>its a lora
>https://civitai.com/models/2199607/z-image-turbo-gigachad?modelVersionId=2476616
damn, why did i train my own then? mines probably better anyways
>>
>>107482199
run it using ollama or llama.cpp or koboldcpp, then call it locally in api using a node
also use derestricted not abliterated

>>107482207
retard
>>
>>107482176
absolutely
>>
>>107481836
>bigasp2.5
>I take a look what it is.
>"only works in ComfyUI"
>I give up.
>>
>>107482270
use swarmUI, you'll never have to deal with spaghetti ever again
>>
>>107482261
kobold can just use zit now so comfy isn't required anymore
>>
>>107482063
Write blacked on her panties
>>
File: file.png (3.65 MB, 1344x1536)
3.65 MB
3.65 MB PNG
>>107482130
Yeah I dunno man, same prompt, same seed, with vs without
>>
>>107482063
Write panties on her panties
>>
>>107482063
Write write on her panties
>>
>>107482261
>also use derestricted not abliterated
I wanted to use Qwen 3 VL and it doesn't have such "derestricted" thing
>>
>>107482349
it looks like ass, show a screen of your workflow you probably fucked up something
>>
>>107482405
check for GLM-4.5-Air or 4.6 with very low quants
>>
File: file.png (3.7 MB, 1248x1872)
3.7 MB
3.7 MB PNG
>me when I API

>>107482253
wait so you also trained one?
Can you share it?
Yours definitely looks better
>>
So McDonalds Netherlands is currently getting severe backlash for this slopped Ad.

https://youtu.be/SROxy7KxFpw?si=QsG8S6T4Cb95AkkV
>Every shot travelled through a rigorously engineered toolchain: real Google Earth plates, advanced style-transfer, pixel-level photo repair, custom LoRAs, control nets, bespoke ComfyUI graphs, and thousands upon thousands of tightly steered iterations."
>bespoke ComfyUI graphs

Which if you fucks is responsible for this abomination?
>>
impact nodes phones home to YouTube (Google)
>>
File: 1745092216411505.png (10 KB, 538x248)
10 KB
10 KB PNG
>>107482527
lmao those luddites need to touch grass
>>
cumfart just isn't professional software. it's a trash heap for ameteurs to bloat
>>
File: 1753986734710104.jpg (612 KB, 2432x1716)
612 KB
612 KB JPG
>>107482498
>an artistic monochrome portrait photograph of Gigachad, a huge muscular man with a strong jaw line, combed back hair and a thong. He's looking into the camera with a confident smirk on his face. he's handing a laptop with the text "API" on its screen to the viewer.
i did a comparison and they seem pretty similar.
>>
>>107482598
both are ass lol, the first one looks like charlie kirk kek
>>
File: lets goooo.png (59 KB, 673x312)
59 KB
59 KB PNG
Omg it's here!
https://github.com/Tongyi-MAI/Z-Image
>>
>>107482640
HOLY SHIT
HOLY SHIT
>>
>>107482640
you could have at least changed the name of the links to make it believable
>>
>>107482640
dumb giganigger retard
>>
>>107482640
i was going to make a fake image like this but then i realized the checkpoint/online demo buttons are images and was too lazy to make my own edit just for 3x (You)'s.
>>
I don't even check it anymore
do you thin I'm stupid?
>>
>>107482527
Is this the power of gorrilions of dollars?
>>
>>107482527
why are they trying to sell Christmas as a dystopian nightmare for everyone in the first place?
>>
https://xcancel.com/ExtremesTwo/status/1997749416116343123#m
(Translated with DeepL)
>How can we dominate this ecosystem without releasing anything? After release, wouldn't it be wonderful to build tools around this base model? We'd gain a wave of goodwill and put those foreigners in their place—after all, many harbor resentment toward this system due to ideological issues.
lmao he's talking about you >>107481925
>>
selling my 4090 in 2 hours for $2k which is more than i paid for it a few years back

no release of ZiB has convinced me i don't need to waste my time on genning anymore
>>
>>107482815
sit on it and sell for $4000
>>
>>107482815
>no release of ZiB has convinced me i don't need to waste my time on genning anymore
it will be so funny if they'll release ZIB tommorow
>>
>>107482832
if they do at least that anon can use the money on ComfyAPI credits to gen with
>>
>>107482867
he wouldn't have enough memory for nodes 2.0
>>
I like how it takes a minute to load the noodles visually lmao.
>>107481514
Sex. Box?
>>
>>107482815
still waiting on finding out if anon's RTX Pro 6000 for $2k was legit or not.
>>
>>107482640
Imagine you're a company which released a really succesful model like Z-image turbo, and you are supposedly releasing the upcoming Z-image base, wouldn't you capitalize on that hype by talking about it on the internet? like Idk, tease people with the release with some preview images or some shit? they are dead silent as if they're ashamed of it, it's sus as fuck
>>
>>107481651
git clone the HF repo (it's like twice the size of the model itself so beware)
go to toolkit and type the path to the folder with all the shit.
You can also delete .git folder from the model folder later since it's not needed and just bloatr
>>
>>107482900
there is no fucking way, I wish anon the best but it literally makes no sense. There has to be a catch.
>>107482918
That is a very western way of thinking. In China it is handled differently
>>
>>107482940
>In China it is handled differently
Yep, and it's called the Chinese Culture™
>>
>>107482918
you ordah 2 ton of steel?
fuk u. u get 2 ton pig iron
>>
>>107482940
>In China it is handled differently
yes like not releasing the base at all
>>
>>107482963
how cultured in the Chinese ways you are
>>
You can't even post the chink pasta to get rid of them because they all spies and shills live in canada.
>>
>>107482935
NTA but why can I not simply point it to the unet and encoder? So gay I have to download the entire repo.
>>
>>107483011
you can do that in other lora trainers but so far AI toolkit is the only one that can do ZiT so you have to go through the humiliation ritual
>>
File: firefox_QRRBNrkekm.mp4 (715 KB, 940x842)
715 KB
715 KB MP4
working on a flux/z-image VAE finetune for pixel art (decoder only with frozen encoder)
surprising how good the results are after yoloing some random hyperparams on a shitty dataset for 45 minutes
>>
File: aazz.jpg (381 KB, 1536x2048)
381 KB
381 KB JPG
>>
>>107482176
Teto
>>
File: file.png (2.53 MB, 1248x1872)
2.53 MB
2.53 MB PNG
>>107482176
I agree...unless...
>>
File: Zurbo_00011_.jpg (434 KB, 2048x2944)
434 KB
434 KB JPG
>>107482640
I fell for it a third time.
>>
Is there anything as good as Nano Banana Pro or am I stuck waiting like 5 years for the Chinese to finally sell me the unbound service I'm burning for.
>>
cliudcucks BTFOd
>>
File: 1746788366666037.jpg (386 KB, 1216x1728)
386 KB
386 KB JPG
https://civitai.com/models/483381/lying-down-breast-morph-backtits-flux-pony
anyone interested in a z-image version of this?
>>
>update comfyui
>workflow no longer works
>look up the error
>no one has this error
>tried fixing it myself but ended up bricking it
>reinstall comfyui
im so heckin comfy right now
>>
Anyone trained a person/face lora for z-image? I was thinking of trying to bring one of my wAIfus over but if it just fucks up the quality it's not worth it.
>>
File: ComfyUI_temp_rtzxc_00001_.png (3.51 MB, 1120x1920)
3.51 MB
3.51 MB PNG
>>
>>107483383
What error is it? My recent update broke all the workflows and I had to go into a file to fix it.
>>
>>107483321
why wouldn't they be?
>>
File: 0_00285_.png (1.14 MB, 1024x1024)
1.14 MB
1.14 MB PNG
is it just me or does zimage produce inherently boring images? they look like bland stock images. sdxl has more character. maybe there is some receipe of basic prompts it needs like "dynamic, interesting" king of like how other models have to have quality prompts like "masterpiece"?
>>
>>107483433
>is it just me or does zimage produce inherently boring images? they look like bland stock images.
the distillation makes it boring, only base can fix that...
>>
File: Zurbo_00045_.jpg (877 KB, 3328x1792)
877 KB
877 KB JPG
>>107483433
What was your prompt and what were you expecting?
>>
>>107483404
ale bym ją klepnął
>>
>>107483443
1girl, asian, reading book

>what were you expecting
idk lol just less boring ig
>>
>>107483463
gotta expand that prompt bub
>>
File: girl sitting reading.jpg (403 KB, 1698x839)
403 KB
403 KB JPG
>>107483443
it was a fairly simple prompt about a girl sitting and reading. but you can imagine in your head all they different ways you could and hold a book and what position your feet are in and which angle the image is. how is it possible every single seed is practically identical. im just not use to that, its good for playing dress up. i can start describing the angle and stuff but its a losing battle. i could say from front or from side or 3/4 perspective but thats only three variations that are static and set. unless there is a way to use degrees like "She is rotated clockwise 62 degrees and the camera is looking down at 12 degrees and there is 100 lumins of light and her skin is approximately this illuminated and reflected and the hue of the light is x/y/z in RBG". other models provide a natural tasteful randomness and you can pick a seed you like
>>
>>107482900
narrator: it wasn't
>>
>all the bakers (me included) have abandoned the thread
grim
>>
>>107483269
Local still doesn't have a DALLE-3 equivalent so make of that what you will.
>>
>>107483509
Have you tried setting the denoise on an empty latent to 0.7-0.8? Sometimes that helps with variety for some reason.
>>
>>107481706
lpddr should be convincing enough for you, but if it isn't then you should definitely piss your money away on it
>>
what elaborate jewish humiliation rituals do I have to undergo to make fucking AI toolkit download ZiT, it's supposed to start automatically, nothing happens
>>
>>107483535
But you just poasted??
>>
File: z-turbo_00058_.png (3.03 MB, 1152x1536)
3.03 MB
3.03 MB PNG
>>
File: file.png (1.22 MB, 2267x1365)
1.22 MB
1.22 MB PNG
>>107483463
I'm using a LLM model to rewrite my boring prompts personally lul
https://huggingface.co/ZeroWw/Josiefied-Qwen3-8B-abliterated-v1-GGUF
https://huggingface.co/spaces/Tongyi-MAI/Z-Image-Turbo/blob/main/pe.py
https://github.com/FranckyB/ComfyUI-Prompt-Manager
>>
File: Zurbo_00007_.jpg (1.14 MB, 2048x2944)
1.14 MB
1.14 MB JPG
>>107483509
Gotta expand the prompt, because otherwise it'll just do exactly what you say. They have that refiner thing which basically does that for you.
Or just use noise injection (or skip a few diffusion steps with an empty prompt).
>>
>>107483598
NO DONT DO IT
>>
>>107483269
>Is there anything as good as Nano Banana Pro
no, NBP is on a league of its own, nothing is close to that yet
>>
File: sdxl.jpg (218 KB, 1003x888)
218 KB
218 KB JPG
>>107483509
the difference

>>107483550
i see what you mean, it does help a lot. and i bet using an image with random blobs of light and dark would help too for this. the image doesnt look as crisp thoguh even when adding more steps, would have to find a good set up
>>
>>107483615
You can probably try to run it through a second KSampler at low noise to fix the crispness.
>>
>it can't stop the job
>it can't delete the job
>stuck in a loop trying to stop the job
>kill the process and reastart AI toolkit
>it's still stuck in a loop trying to kill the job
what the fuck is this piece of shit
>>
>>107483664
webslop
>>
Dealing with Comfy and python crap is a rite of passage
>>
>>107483598
Mmmm gummy de Milo...
>>
>>107483433
>>107483463
What sort of image are you expecting from a girl reading a book?
>>
>>107483600
even in your image everything is a generic 3/4 perspective, like something someone training to be an artist would draw.
>>
>>107483698
Is there a reason why y'all expect the AI to give crazy-ass angles when it's unprompted?
>>
>>107483664
>try to kill the process
>it just starts again continuing trying to delete the job
>loop continues
>you have to repeatedly mash the mouse to actually close the process
this is hilarious desu
>>
>>107483708
it's really hard to prompt z for angles though
>>
File: 1741049512976468.png (2.74 MB, 1440x2160)
2.74 MB
2.74 MB PNG
>>107483395
I trained 3000 steps rank 16 with ai-toolkit
top = lora 100%
bottom = no loras
any kind of lora does tend to change a lot you don't want changed and not for the better.
>>
>>107483676
in reality it's a humiliation ritual. you are such a cuck for accepting it
>>
>>107483725
What was your data set like?
>>
File: Výstřižek.png (9 KB, 193x123)
9 KB
9 KB PNG
A treshold of what?
>>
>>107483775
it means that for the first 20% of the steps, the noise gets applied, after that it's over
>>
>>107483775
threshold at which timestep to return to original text embedding
btw that node sucks, it will fuck up any text if you put the strength too high because that information is also encoded in the text embedding so you can't just randomly inject too much noise into it like a sloppy butcher
>>
>>107483800
is there a better solution than that then?
>>
>>107483463
Silly complaint, you got what you typed in. I personally can't stand it when a model forces a style on an image even with a basic prompt because it usually means an undercurrent of "aesthetics" will bleed through the rest of the model when I'm not looking for it.
>>
>>107483808
no. the lack of variance is a fundamental flaw with z-turbo. you can't do shit because it always converges to the same thing from random noise
img2img with lower denoise or what that node does are the only hacky workarounds we have
>>
>>107483800
Can't you i2i the fucked up image back with the non-noisy cond?
>>
>>107483820
theorically, a perfect model would render a woman reading a book on a transparent background since you haven't specified she's in a room lul
>>
>>107483841
you are essentially already doing that, so not really.
you can't i2i with high denoise because it will again converge to the same shit, and with lower denoise it's not enough to fix the mistakes of the early timesteps
>>
>>107483708
because thats what real life data looks like. if i trained an ai on billions of face book images, there is no way they would all come out looking like "from front" "3/4" "from side" only. there would be organic variation. like if i prompt for 4 girls standing in a kitchen xit will make a flat angle of them standing in a line with one leg bent everytime. but just imagine 4 friends standing in a kitchen in real life it would be dynamic. i dont even know how to articulate the dynamicness in this post so i wouldnt be able to prompt xit to do it even if it understood

its not even crazy angles, if you cant see the difference between this >>107483509 and this >>107483615 then we cant even communicate with each other
>>
>>107483749
no effort put into the data set. I just quickly found 36 images I liked and put them trough joycaption basic description. I noticed after the training some of the captions are messed up too.
here is the dataset if curious: https://files.catbox.moe/g2hcii.zip
>>
Is AI toolkit the only trainer you can use for ZiT so far or do others support it yet? This humiliation ritual is too much for me
>>
https://youtu.be/3oCTiIbVfls
>best way I can describe using comfyui moving forward is like sitting on a 12" dildo leaving it in and saying "fuck it, I'm gay now" instead of pulling it out with some dignity and saying "what's the next steps"
>>
>>107483906
Stop advertising your slop service. I am not paying to goon.
>>
File: so zazed.png (306 KB, 500x500)
306 KB
306 KB PNG
>>107483906
OMG BASED
>>
>>107483931
grift aside, he is still right
>>
>>107483906
unironically
>>
>>107483906
>40 subs
>100 views
>comfy suix
>loras are dead(what lmao)
>ai generated voice over
>ai generated summary full of emojis
>shilling Patreon
>look at their other videos
>"How to Make an AI Influencer"

Is this a comedy channel?
>>
cOmFyUi AnD lOrAs R SUX
pls subscribe thx ;))
>>
>>107483941
(You):
>zero subs
>zero views
>can't argue comfy not being an anal gape session
>can't explain why loras aren't annoying or a cope
>no voice
>poorfag cope
>neet
>has no other videos
>can't teach anything

are you someone we should listen to at all?
>>
File: 1763478083006051.mp4 (598 KB, 704x480)
598 KB
598 KB MP4
>>107483941
I'll agree with anyone who hates comfy
>>
>>107483906
>>107483931
it is true that loras are fucking retarded. We need lightweight local models good with reference image. Z-Image-Edit is our only hope.
>>
>>107483986
wow you're so cool and edgy
>>
>>107483986
BASED BASED BASED
>>
>>107483860
I was thinking of trying it with a data set that's just portraits with a blank background and the same girl from multiple angles. Maybe that would minimize the quality impact on the model.
>>
File: file.jpg (9 KB, 240x240)
9 KB
9 KB JPG
isn't this the same motherfucker that said wan 2.5 was going to be a local model?
>>
>>107483998
thanks for noticing
>>
>>107484010
maybe? I don't really care what Chinese people say because they are just as scammy as jews
>>
>>107484010
he didn't mean local for you, you just don't understand chinese culture
>>
comfycloud is local btw
>>
This 12" dildo in my ass says otherwise
>>
i just want ramtorch for christmas so i can end this quantcuck nightmare
>>
>>107483906
>everything is outdated lol
>subscribe to my patreon to find out my new secret technique
I mean we all know the guy is a retard, but what's more depressing is the anti-comfy schizo spends all day searching for comfy hate content to post, and this is all he can find, lmao.

what a clown
>>
File: notragebait.png (550 KB, 827x767)
550 KB
550 KB PNG
Can I gen shit in a reasonable speed with a 3060 Ti yet
>>
>>107484042
hello fellow cumfer
>>
>>107484052
there has been multiple threads on reddit about how shitty comfy got. would you like those instead? you sound like one of those abused redditors that loves comfy no matter what while getting beaten
>>
>>107483604
Nice style
>>
>>107484025
>I don't really care what Chinese people say because they are just as scammy as jews
come on bruh, did they jews give you wan, qwen, qwen edit and z-image?
>>
bigASP 2.5 is pretty good with text, huge improvement over stock XL. Not as good as the larger models but you usually get what you want after some rerolls.
>>
>>107484054
SDXL will be okay with that
>>
>>107484076
this. the entire community is sick of the grifts and poor updates. the software is stagnant with the only hype being a new model that neoforge just gets days later. I didn't bother with zit because it just seems like another boring dut model
>>
>>107484085
they were supposed to give me wan 2.5
>>
>>107484076
i don't care what you post. you're a broken record at this point. ComfyUI continues to be the leading UI for AI, securing VC funding and growing their base while you rot in a corner desperately searching for ways to shit on it while accomplishing nothing. You are useless, a cancer of /ldg/. Hopefully a semi-truck will run you over one day so we won't have to hear your incessant bickering daily.
>>
>>107484104
according to who? that random Q anon chink?
>>
I use anistuuuuuuuuuuuuuuuuu
>>
>>107484114
does it support Zit?
>>
>>107484054
get a job you destitute motherfucker
>>
>>107484112
you can never have anything nice, only the scraps.
>>
>>107484118
sdcpp had a thread that it was faster than cumfart so yeah probably but ani hasn't updated in a long time.
>>
>>107484120
no, you are literally funding my neet life with your tax dollars
>>
>>107484088
You could've made a generic blonde 1girl standing with the same quality in SD 1.5. What are you even doing?
>>
>>107484139
trying out how well bigASP2.5 can do text
>>
>>107484122
nta but what are they even doing with wan 2.5? it's not like it's better than any of those commercial api only frontier models
is it just a case of look i made this and it's mine and you can't have it?
>>
>>107484111
one can only hope, this fucking retard cant stop blabbering about how shitty comfy is, NO ONE FUCKING CARES, use the UI you want stop bitching fucking retarded faggot.
>>
>>107484187
it's pretty much dead. they should just open source it or maybe do it when they have something that can compete with veo or sora

>>107484199
comfyui is fundamentally flawed at the foundation. it needs to die so the vaccum creates a wild west for uis to thrive. otherwise everyone just fuds any competition and the problem continues to grow. comfyorg must die
>>
>>107484218
kys trani, youve been doing this for the past 2-3 years at this point, dont you get tired? no one fucking cares retard
>>
File: 1757737122045781.png (210 KB, 507x507)
210 KB
210 KB PNG
was this ever disproved?
>>
>>107484218
>comfyorg must die
as much as I hate to admit it. it's true, it needs to go or bleed the majority of users
>>
if base isn't out by friday it's officially over
>>
>>107484218
>>107484231
why are you replying to yourself?
>>
>>107484224
this is the kind of behavior that keeps anything else from progressing. comfy just breeds toxicity for anything different. I'm not even ani
>>
>>107484199
>this fucking retard
I'm another anon that hates comfyUI so there' at least 2 of us
>>
>>107484249
three, maybe four. I just enjoy trashfires
>>
>>107484135
can you call it a life
>>
>>107484227
lmao, this will never cease to be funny
>>
comfy will die when a better replacement is made, which there are NONE atm
>>
>>107484218
>comfyui is fundamentally flawed at the foundation
what are the foundational issues? i've never had a problem doing anything with it and i'm average iq
maybe you mean relying on litegraph which is a fair point but comfy is trying to remove it and people are somehow shitting on him for that because le any change is bad
>>
> comfart tells me nodes 2.0 is out, try it
> try it
> load workflow
> cannot paste image into load image box
> remove nodes 2.0, bi bi
>>
>>107484265
the node execution is shitty. should be factory like houdini. it's also a terrible techstack for scaling (python/typescript)
>>
>>107484285
it's the perfect tech stack for people genning at home on their 1computer
idk what you want to scale it for, it's not meant to be industrial software
>>
>>107484010
Believe so, I vaguely remember him saying "local video model" but then apologized for misunderstanding. ldg went in a fit, kek

>>107484029
lol
>>
>>107484307
I've ridden on a high-speed train before, so don't tell me I don't understand chinese culture
>>
I like how we don't have enough images to even make a collage, baker can just keep the same image for the next one
Yes I'm aware I'm not helping
>>
>>107484297
tell that to comfyorg because they want to please everyone and end up pleasing nobody. if this was just software for tinker trannies I don't think there would be this UI/UX dicking around but they only make money through API nodes which they are now overcharging and everyone (with a functioning brain) cancelled. in fact everything about the software wants to dig it's claws into you using the API nodes. look at their official YouTube. 80% of it is API node tuts
>>
No more ldg until base releases
>>
>>107484318
It's funny how busy these threads were one week ago but it only lasted for like 3 days or so.
>>
>>107484325
>tell that to comfyorg because they want to please everyone and end up pleasing nobody.
this line of thinking always makes trash software and video games
>>
test
>>
>>107484332
because 4chon is a entitled needy little whiny fuck begging at the feet of master China
>>
Lodestones needs to release another Chroma so that we can have another few months of arguing about whether it's good or not
>>
>>107484325
well they do owe some men in suits $17 million, gotta make it back somehow
on one hand i don't blame him for wanting to get paid but on the other hand this shit isn't so complex that it couldn't just be a community open source project
>>
if the z-image team is reading this we aint paying for shit, release the base model already
>>
>>107482733
(You) know (((why)))
>>
>>107484332
the hype got killed by Alibaba's sneaky behavior, it's been 4 days they haven't said anything about the upcoming release, that silence is really loud, deep down we know they're gonna fuck us over and we'll get a really sour taste in our mouth
>>
>>107484351
should have been a non profit but he wants to be the red hat of diffusion if that clues you in to how psychopathic cumtard actually is.
>>
>>107483725
Just remembered that civit doesn't allow celebs anymore. ZIT would probably have twice as many loras otherwise.

Is there a different site that people use for celeb loras now?
>>
>>107484332
>>107484343
>>107484362
is this your first time ever witnessing a big (heh) model release?
>>
>>107484362
I think comfyui having shitty update after shitty update extend the doomposting. everything is crumbling apart
>>
>>107484365
>Is there a different site that people use for celeb loras now?
ultimately, I hope Z-image edit will be good enough at face consistency (like nano banana pro) so that we won't need celebrities loras anymore
>>
>>107484261
This.
>>
>>107484261
>>107484381
vicious cycle of wishing for something new but fudding anything new because it doesn't come with all the snake oils
>>
File: 1755916903385405.mp4 (2.02 MB, 1280x720)
2.02 MB
2.02 MB MP4
>>107483433
>>
>>107484373
been here from the beginning.. off and on because i sometimes have a life unlike so many others
>>
>>107484135
skill issue

t. neet with 5090
>>
I think that what the agonizing wait for Z-Image Base has taught me is that I am consumed by avarice. I will be limiting myself to using SD1.5 for the next 5 years to atone for this.
>>
Fresh when ready

>>107484399
>>107484399
>>107484399
>>107484399
>>
>>107484402
>didn't fix the z and wan links
kys ran
>>
>>107484391
I didn't realize i2i was snakeoil.
>>
File: Miku.png (1 MB, 1024x1024)
1 MB
1 MB PNG
>>
>>107484392
skull mogged
>>
what i can do with 4gb?
>>
>>107484465
you can get a job
>>
>>107483190
Based
>>
File: ....jpg (52 KB, 600x604)
52 KB
52 KB JPG
>>107483541
>Local still doesn't have a DALLE-3 equivalent so make of that what you will
>>
>>107484227
I can suggest an equation that sadly is a good description of our future:
E = mc2 + indians
This equation combines Einstein's famous wrong equation with the addition of indians, who turn everything they touch into shit.
>>
>>107484475
i cant
>>
>>107486587
you can suffer then
>>
>>107486595
thank you



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.