[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107392912

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
Blessed thread of maybe one day base will release and local will actually be saved
>>
WHY HASN'T Z-IMAGE RELEASED BASE YET?
>>
so zit doesn't know what a vagina is.
>>
nb4 "turbo is enough hurrdurr"
>>
https://xcancel.com/bdsqlsz/status/1995455379128410131#m
the fact he has the base model on his hands but isn't showcasting any image is worrying desu, maybe they just want to keep the suspens until the end, or maybe the model just outputs complete shit :(
>>
>adding noise to an image makes it all noisy, but what if we removed noise from noise to make an image?!
gotta say I wouldn't back him, good on him for persevering.
>>
https://xcancel.com/runwayml/status/1995493445243461846#m
>"We made something quite special"
>The videos are barely better than Wan 2.2 tier
>They say this when Sora 2 exists
lol, lmao even
>>
>>107395594
sus
>>
>>107395575
it knows somewhat. hopefully a finetune or lora can fix it.
>>107395582
he probably has an nda
>>
>>107395594
these are children
>>
>>107395626
>>107395615
they're not
>>
>>107395626
I got a copyright strike for doing umamusume stuff. Japs are fucking weird with copyright, probably because yakuza runs it all.
>>
>>107395633
There's a difference?
>>
File: WanVideo2_2_I2V_00548.webm (2.02 MB, 1248x704)
2.02 MB
2.02 MB WEBM
>>107395582
nda
>>
>>107395642
This. I wouldn't tell people to never use workflows made by other people, that's retarded, but you should know the basics of how nodes and connections work, and what most of the settings on the workhorse nodes do. Otherwise, when the time comes that you need a custom function, or to do anything beyond what someone else's premade workflow does already, you're kind of fucked.
>>
>>107395582
will we get both base and whatever he trains?
>>
File: 1748628355378791.png (1.63 MB, 1122x768)
1.63 MB
1.63 MB PNG
>>107395582
>>
>>107395624
>it knows somewhat.
you're right. it does the same thing as flux does where underwear will appear suddenly, so underwear is clearly being occasionally added to iterations.
>>
>>107395654
dude have you seen the MONSTRUOS workflows posted in civitai with a bajillion custom nodes? are you fucking retarded?
>>
>>107395655
be aware that Grok is visually inspecting everything you post to determine if it should be boosted or not.
>>
>>107395655
Didn't mean to (You).
>>
>>107395660
what exactly is the cutoff point of a lewd? the smallest but of areola
>>
>>107395633
upload dataset to catbox
if you dont, then i was right.
>>
>>107395667
this ain't 2009, dumbo
>>
please stop posting asian women this is becoming unbearable
>>
>>107395674
wait you cant get banned on Twitter for straight up posting porn ?
>>
>>107395679
Twatter for realistic, any booru which accepts AI slop if you do anime instead
>>
>>107395519
ANIME DIFFUSION NEWS ANCHOR!

>Noob Models!
SeeleNoobAI (2048 native resolution): https://civitai.com/models/1445275/seele-noobai-sdxl
Chenkin Noob XL:(NoobAI ESP with new dataset of character)
https://civitai.com/models/2167995/chenkin-noob-xl
WAI Shuffle Noob
https://civitai.com/models/989367/wai-shuffle-noob

>Anime LoRa Making Guide!
https://civitai.com/models/22530/guide-make-your-own-loras-easy-and-free

>Model News!
ZiT Zeta Image Turbo Model: a new 6b model, It's fast, open-source but the main problem is it doesn't understand booru tags.
UIs that supports it: Comfy, Krita AI Diffusion, Neo Forge, Swarm, SD Next

>Anime ZiT LoRas!:
Frieren LoRA
https://civitai.com/models/2176854/frieren-beyond-journeys-end-sousou-no-frieren-z-image-lora
Flat Anime Style:
https://civitai.com/models/2175307/z-image-flatanimestyle
Ra Lilium Style:
https://civitai.com/models/2125529/ra-lilium-style
Nyalia Style:
https://civitai.com/models/2180136/nyalia-style
Anime Flat Style:
https://civitai.com/models/1952560/anime-flat-style

ALSO ANIME CHARACTER LORA REQUESTS GO HERE!
>>
>>107395694
I mean, people should put in a little effort instead of being completely spoonfed, no? Or do you want complete retards coming here asking for tutorials for everything?
>>
>>107395582
>the fact he has the base model on his hands
How is this a 'fact' ?
>>
>>107395694
thanks for the news friend
>>
>>107395698
'clone' is not recognized as an internal or external command,
operable program or batch file.
>>
File: ComfyUI_00729_ copy.jpg (643 KB, 4096x2048)
643 KB
643 KB JPG
Testing out that essay of quality tags fro Zit.
Both using LLM prompt.

Left without quality, right with.
>>
Look at how he crawls back. They always do.
>>
File: blyatman.png (43 KB, 725x425)
43 KB
43 KB PNG
how many ZIBs can one train with such money?
>>
>>107395701
they release models 2 generations older than the flagship, and their first image model was just flux.
>>
>>107395698
you don't train a lora out of thin air retard, he needs the model to do this shit
>>
1
>>
>>107395705
Cant blame you. You'd barely get a like for your trouble on civitai
>>
>>107395710
Maybe increase the CFG of the high?
More noise at high may improve prompt following afaik. Or maybe prompt better, idk.
>>
Where is...
Where is base...
>>
>>107395713
it beat hunyuan to the punch but was worse and also required datacenter gpus to even run. was one of the more genuinely cinematic vidgen models though
>>
looks like repost schizo has turned their spam bot on again.
>>
File: HoesMad.png (1.52 MB, 1664x2560)
1.52 MB
1.52 MB PNG
>>107395694
Based
>>
>>107395719
that's not legal saar please read grok terms and conditions, no bob and vagene
>>
>>107395694
>adt schizo
>>107395702
>bot schizo
damn this thread is starting really well!
>>
>>107395720

Had the same experience. Generation took a little bit less (with cfg 1/1) time but quality drastically decreased. With my previous cfg 2/1.1 was the same time and quality decreased too.
>>
>>107395727
You can just say you're retarded.
>>
>>107395733
That's fucked. Wan 2.1 had the same issue, but it was easy to post process afterwards, 2.2 over exposes it to fuck and I can't really fix it.
>>
File: 1562958920852.jpg (303 KB, 598x714)
303 KB
303 KB JPG
Does Qwen 3 4B have any alternative versions like Flan and Gner for T5?
>>
>>107395694
>ANIME DIFFUSION NEWS
>in the realistic slop thread
>>>/g/adt
>>
>>107395734
*sniff*
>>
>>107395519
Since you're a human. What's up. Post a gen you've made recently. Opinion on Z/ZiT?
>>
>>107395734
You don't. Some starting images are worse than others. It's just how it is
>>
>>107395694
You've managed to, remarkably, only provide bad advice.
>Noob models
r3mix or bust
literally.

>lora making guide
local or death

>model news
what

>zit loras
no point until full base model.

thank you for your attention to this matter
>>
>>107395752
okay
you're retarded
did that do anything?
>>
>>107395746
but they didnt release weights, training methods or papers related to it. You cannot train open models on copyrighted music, this is the issue, they're instead probably doing it behind closed doors.
>>
>>107395642
>nda
he posted images of z-turbo before it got released, I don't see why they don't want him to do the same thing for z-image base desu
>>
>>107395758
because music is a copyright nightmare worse than image/video, and the holders are VERY aggressive usually.
For now we gotta cope with mmaudio and songbloom, both of which are NOT very good.
>>
>>107395719
He says he 'plans' to train a lora, and then hints at getting access early, I very much doubt he has access to a final release model, more likely he is among testers who get to try the model checkpoints from which they are choosing to release.

Of course assuming he isn't just full of shit.
>>
>>107395760
people still recommending this when you could already do that with kijais nodes really shows how retarded some of you niggers are
>>
adt frens missed out on the release of flux so theyre only just now realizing which thread is the real diffusion thread :]
>>
>>107395761
You'll need a LoRA obviously, base Wan can't do that shit
>>
>>107395762
>doesnt this try to load both models at once
Do you even have to ask?
No. That would be retarded as fuck.
>>
>>107395782
fair enough
>>
>>107395775
>because music is a copyright nightmare worse than image/video, and the holders are VERY aggressive usually
Also it's a lot easier to run afoul of music 'copyright' than image copyright due to the relative limitations of the mediums. You can draw heavy inspiration from an existing artwork and you will seldom cross the line into plagiarism, not so for music.
>>
>>107395694
Oops! You got the tabs mixed up. This is /ldg/ thread you were spamming, not /adt/.
>>
>civitai jeet jannies silently deleted all my milf images on their site that had schoolgirl cosplay in the prompt
lmao
>>
>>107395612
wait is that the actual schizo julien replying?
>>
>>107395834
glad my reporting worked
>>
*
>>
>>107395834
deserved you fuckin pedo
>>
>>107395668
https://files.catbox.moe/v5afti.png

I managed this. It's deformed, but shows that it has some knowledge.
>>
>>107395758
no
nsfw genning is fucked
>>
>>107395679
Sorry anon we're all using Z-Image now, I just prompt "woman" it gives me asian women, nothing I can do my hands are tied etc
>>
I hope someone is making a Lilith from Diablo LORA
>>
>>107395896
add ethnicity like german or swedish (but not british)
>>
Concept: a spam bot that replies with some variation of "That's a child." to every single gen that depicts a woman
>>
File: zimg_0119.png (2.04 MB, 1024x1496)
2.04 MB
2.04 MB PNG
china didn't release the best model of the year for us to gen disgusting white women
>>
>>107395923
>implying bugs are any better
>>
>tfw no chroma finetune that runs as fast as z

>>107395612
is it open source that can run on my puter and not api? if not, who cares
>>
>>107395921
>but not british
this, it is Migu approved that british aren't people to appreciate
https://www.youtube.com/watch?v=RBEb2wC8u5E
>>
>>107395922
kek, they do that in some of the vdg /gif/ threads to gens they dont like
>>
File: 1759027314396902.jpg (787 KB, 1792x1600)
787 KB
787 KB JPG
Well, there is at least something that qwen can do better. Integrate unreal objects into real scene without looking like it's a stamp.
>>
>>107395694
i think anime is fucking gay
>>
>>107395612
meh
not impressed
>>
>>107395923
trvth
>>
File: 1742538056820419.png (79 KB, 579x507)
79 KB
79 KB PNG
Reminder

>I added a v2 of the z-image-turbo training adapter. It is 2x as big and has been trained for a significantly longer time. With these adapters, it is a balance of getting the max amount of de-distillation without diverging from the base too much so the LoRAs maintain maximum compatibility. So I want to test this a bit more before setting it as the default. But if you want to test, just chanve the v1 to v2 in the config, and please report back if you find it works better / worse than the v1 adapter.
https://x.com/ostrisai/status/1995504226295009558
>>
File: ComfyUI_00029_.png (2.26 MB, 1504x1024)
2.26 MB
2.26 MB PNG
>>107395939
zit, 3 steps of lcm.
>>
just wait for base
>>
>>107395958
>posts on gay anime website
thanks for the input faggot
>>
>>107395921
This post is Indian (99% certainty)
>>
>>107395923
because you said so?
>>
File: ComfyUI_00752_.jpg (976 KB, 2048x2048)
976 KB
976 KB JPG
Burger.
>>
File: zimg_0125.png (2.28 MB, 1024x1496)
2.28 MB
2.28 MB PNG
>>107395938
white women are over anon
>>
https://civitai.com/models/717617

damn bros we're eating good
>>
>>107396027
so glad ive banned this fool
>>
>>107395923
truth nuke
>>
File: 1750758066055147.png (1.08 MB, 2861x4547)
1.08 MB
1.08 MB PNG
>>107396027
When is civitai adding profile location in account info? lol
>>
>>107395923
>>107396020
this unironically
>>
>>107396064
this is probably the best thing Elon has implemented to the site, seeing all those jeet seething and deleting their grifting accounts was so delicious
>>
>>107396088
The Indian is immunized against all dangers: one may call him a scoundrel, parasite, swindler, it all runs off him like water off a raincoat. But call him a jeet and you will be astonished at how he recoils, how injured he is, how he suddenly shrinks back: “I’ve been found out."
>>
>>107396129
Bitch I found out when I smelled you 9 miles ago.
>>
>>107396129
yeah i saw that quote in a twitter post too
>>
>>107396064
Funny thing about this is that people like BAP had been saying this for years, so when this happened it just confirmed what we already suspected
>>
File: combined_0075.jpg (610 KB, 2040x3524)
610 KB
610 KB JPG
>>
File: ZiMG_00459_.png (2.6 MB, 1344x1728)
2.6 MB
2.6 MB PNG
>>
>>107396139
it helps explain why they dgaf about actual elections despite allegedly being left or right.
>>
I'm going insane. Tiled is supposed to reduce vram usage. It's still maxing out my 32gb no matter what. What is this bullshit?
>>
Best thing about Z-Image is genning extremely topical throwaway images in 20 seconds so you can fill the thread with on-topic worthless slop
>>
>>107396165
reduice tile size
>>
>>107396165
Resolution Master looks like it's probably eating all of your memory alone... lol
>>
>>107396165
wtf, your tile size is too big, you have to decrease the values dude
>>
>>107396177
kek
>>
>>107396020
so are bugs. eastern euro goddesses are the easiest on the eyes. too bad they are psychopaths
>>
>>107396165


shit like res master is useless. just type in values like a chad
>>
>>107396177
Asian 1girl remains undefeated.
>>
>>107396184
Even if I go down to something like 256, it's all of the vram.

>>107396191
I use LESS vram if I gen at that resolution without tile.

>>107396189
>>107396205
No.
>>
>>107396230
how do I generate a 1girl with a massively sticking out chink face. like where that jaw juts wayyy out. a side view.
>>
File: ZiMG_00467_.png (3.52 MB, 1344x1728)
3.52 MB
3.52 MB PNG
>>107396153
>>
>>107396246
>massively sticking out chink face

what that mean?
>>
Anyone noticed that ZIT gens have really poor variability between seeds? Like composition barely changes with most of the gens
Is this expected behavior or I might have fucked something up?
>>
>>107396007
borgur
>>
File: ZiMG_00471_.png (3.35 MB, 1344x1728)
3.35 MB
3.35 MB PNG
>>107396247
Fucking finally ass to ass
>>
>>107396020
im here for it
>>
>>107396281
what prompt does that?
>>
>>107396265
Expected. Z image is really bad with seed variation. To get around this, I suggest using the seed to quickly gen a disposable SD1.5 image (doesn't even need to be the right prompt) and then feeding that as a latent with high denoise, or you can use the "single step of CFG = 0" trick.
>>
>>107396177
The future is chink!
>>
>>107396261
Five seconds googling "wmaf meme" and you'll find many examples of what he's thinking of
>>
>>107396289
had to coarse that shit with so many variations. this worked

Full-height vertical, side profile shot, below torso framing, 2 women, they are standing sideways, their side profile to the camera, they are pressing their buttocks together,
>>
Painter released clean vram https://github.com/princepainter/Comfyui-PainterVRAM

Hoping this will solve my OOM problems, gonna try it out soon. Any other cool nodes out there?
>>
File: 1740416484446816.png (1.5 MB, 2209x1373)
1.5 MB
1.5 MB PNG
this local lora galery is actually really cool, you can see littie images and shit, is there a node similar but for a load image node so that I could see better what's already on my input list?
>>
File: 177.jpg (910 KB, 857x579)
910 KB
910 KB JPG
There will be no base model, doomposters are always right in the end. XL until the end of time.
>>
>>107396318
does this scrape civitai for thumbnail info or do you set it yourself? does it require a second server to work?
>>
does SageAttention stack with torch.compile? Or are they conflicting?
>>
>>107396294
Damn. Alright, thanks.
> I suggest using the seed to quickly gen a disposable SD1.5 image
Loading models is a bitch already as a 12Gb vramlet. I guess I could use my old gens as the noise source lmao
>>
File: combined_0021.jpg (579 KB, 4675x1720)
579 KB
579 KB JPG
>>
>>107396324
>There will be no base model
sinophobic propaganda

chinese century
>>
>>107396324
liar, ace step 1.5 will be released today. TODAY.
>>
File: ComfyUI_00042_.png (1.32 MB, 1504x1024)
1.32 MB
1.32 MB PNG
Is this sfw?
>>
>>107396356
>Loading models is a bitch already as a 12Gb vramlet.
I know how you feel
t. 4 Gb
>>
>>107396353
no you use one or the other
>>
>>107396396
yes
>>
>>107396403
Wait can you even use zit with this?
>>
>>107396396
nazi salutes are generally considered nsfw
>>
>>107396413
I want your job
>>
>>107396165
all temporals to one
>>
>>107396416
GGUF, Q6. A gen takes about 70-80 seconds for me.
>>
>>107396450
>70-80 seconds
Including the model moving part? Either way, impressive that you can run this on 4Gb VRAM.
>>
File: ZiMG_0049.jpg (2.9 MB, 1344x1728)
2.9 MB
2.9 MB JPG
>>107396281
Back to this now
>>
>>107396349
it scrapes civitai for the thumbnail yeah, you click on the icon and it does it for you
>>
I have started a career as professional coomer
>>
>>107396360
wtf lmao
>>
ahahahahah

zit doesn't know what a swastika is.
>>
>>107396498
i used something similar, maybe the same node, that required a local server that you managed in another tab. is this the same or is this standalone?
>>
>>107396318
>>107396318
>>107396349
can we get the prompt/trigger words?
>>
>>107396511
I've seen several gens wit swastikas before
>>
>>107396427
>the job: no job
>>
File: 1741275859763428.png (1.11 MB, 3291x1457)
1.11 MB
1.11 MB PNG
>>107396519
yep, and you can even add more shit over it if you also want to add the style description on top of the trigger word, this shit is really amazing
https://github.com/Firetheft/ComfyUI_Local_Lora_Gallery
>>
File: ZiMG_00499.jpg (2.74 MB, 1344x1728)
2.74 MB
2.74 MB JPG
>>107396496
>>
File: combined_0009.jpg (562 KB, 3916x2040)
562 KB
562 KB JPG
>>
Is there any model that can make thin people with a double chin? When I prompt it either nothing happens or I get fatties.
>>
>>107396578
inpaint
>>
>>107396578
what an odd request. train a lora for it
>>
Julien in ukrainian hohol?
>>
>>107396578
That sounds like an inpaint problem, or a lora you would have to train yourself. You're trying to get something that only shows up in Context A and never in Context B in a gen that is otherwise all Context B and not Context A. Not impossible, but unlikely.
>>
File: pieds.png (895 KB, 1492x1036)
895 KB
895 KB PNG
I'd like some tips for a Wan workflow using 64gb of ram and 24gb of vram.
As well as for lora(s) training.
>>
File: ComfyUI_00052_.png (3.02 MB, 1504x1024)
3.02 MB
3.02 MB PNG
>>
Does Onetrainer support ZiT yet?
>>
>>107396435
Oh shit, that was it. Doesn't go to 1, but now I'm only at half vram. Thanks.
>>
File: 1752813867510799.png (47 KB, 2116x106)
47 KB
47 KB PNG
How to speed up zit on a 12gb vram/64gb ram system? I downloaded fp16 versions of everything and a single 896x1152 gen takes 25s at best with forge neo, kinda meh tbhdesu
>>
>>107396726
use ai toolkit. it takes minutes to set up and start training
>>
>>107396748
>forge neo
that's your problem, go for comfyui + sageattention
>>
Do old datasets captioned with joycaption work for ZiT training or do I need to recaption them with somethign else?
>>
File: the west has fallen.png (41 KB, 220x220)
41 KB
41 KB PNG
https://youtu.be/7prw229V_vM?t=654
>a 600k youtuber is making fun of flux 2
BFL sissies, it's over...
>>
>>107396748
we don't know your GPU, maybe 25 seconds is pretty good already?
>>
>>107396748
25s doesn't sound too shabby for 12GB
>>
>>107396825
he's obnoxious. he sounds like that paul language guy.

anyway, unquantized, is dev any good? I couldn't possibly run a 64gb model.
>>
File: 1759609062024215.png (2.99 MB, 1216x1728)
2.99 MB
2.99 MB PNG
>>
File: 1guy1jarmovie.jpg (390 KB, 2000x2923)
390 KB
390 KB JPG
>>
>>107396905
I'm old enough o remember this.
>>
>>107396837
4070 Ti, even neta lumina was faster than this
>>
File: Zurbo_00086_.jpg (543 KB, 1920x3072)
543 KB
543 KB JPG
This is my everyday outfit, by the way.
>>
>>107396837
>people whine about 25s per image
I have RTX 3060 and anything that took less than 60 seconds automatically registers as garbage for me. I could run Z Turbo with just 8 steps, but that's below my standards.
Also, is there a universal way to speed up existing workflows? I heard about triton, but does it just work independently of node types?
>>
>>107396985
>I have RTX 3060 and anything that took less than 60 seconds automatically registers as garbage for me.
then go for 50 steps + 2024x2024 resolution
>>
>>107396748
>>107396930
i have about 30s for 896x1440 on intel arc b580 and it is not loaded for 100%, your speed definitely should be higher
maybe something with offloading
>>
>>107396985
>universal way to speed up existing workflows
8 images in a batch to get 1 for free
>>
>>107396985
>my standards
>rtx 3060
pipe down please
>>
File: ComfyUI_11814_.png (1020 KB, 1024x1024)
1020 KB
1020 KB PNG
New lora, enjoy:

https://files.catbox.moe/y5290f.safetensors
>>
>>107396985
>I could run Z Turbo with just 8 steps
that's how it was made to run
>>
File: 1757031469410566.png (1.25 MB, 1280x720)
1.25 MB
1.25 MB PNG
>>107396903
>>
>>107397003
kek, Ikr
>>107397004
put this on civitai my nigga
>>
>>107397004
looks undercooked, ill test it out
>>
>>107397004
for z-image?
>>
>>107397003
t. bought 5090 and exclusively generates 5-step 1girl 7fingers
>>
>>107397051
>for z-image?
no it's for Stable Cascade
>>
>>107397066
i'd actually download it if it was, no one's using cascade
>>
>>107396748
Speed greatly depends on whether you changed the prompt or reusing existing. When changed, you need to load text encoder, run it, load unet, and only then run unet. When reusing, you run unet immediately.
5070ti here connected over 4 lanes of pcie3.
When reusing the prompt, it takes 11 seconds for 1600x896.
When changing prompt, it take 19 seconds.
>>
>>107397075
kek, based
>>
>>107397066
doa
>>
>>107397004
>>107397066
>no it's for Stable Cascade
>>107397085

its for zimage

{
"version": "1.0",
"ss_base_model_version": "zimage",
"format": "pt",
"sshs_legacy_hash": "c96ee6d1",
"sshs_model_hash": "7ae202a805b080d84c7fecd887dcee72fc9dd46600ae3086787754fb5d7ba1a0",
"training_info": "{\"step\": 3000, \"epoch\": 25}",
"ss_output_name": "realism_lora2",
"name": "realism_lora2",
"software": "{\"name\": \"ai-toolkit\", \"repo\": \"https://github.com/ostris/ai-toolkit\", \"version\": \"0.7.7\"}"
}
>>
File: ComfyUI_11819_.png (1.18 MB, 1024x1024)
1.18 MB
1.18 MB PNG
>>107397051
Yeah
>>107397024
How so?
>>
>>107396748
>25 seconds is too slow
forever doomed to enjoy slop
>>
chinks know their shit. zurbo is based
>>
>>107397120
here's your (You), you've earned it
>>
>>107396985
>gen an image in 62 seconds
>hey this is pretty cool
>implement a workflow that changes loading slightly, such that the gen outputs the same image at 59 seconds
>AI was a mistake, it's nothing but trash
>>
>>107397125
>>107397120
kek
>>
>>107397120
prompt?
>>
still no base?
>>
File: Migu please.png (1.33 MB, 1280x720)
1.33 MB
1.33 MB PNG
">A group of female cosplayers at a Cosplay convention"
>"Gets Miku anyway
bro...
>>
how to fix the burned skin for some celebs in z? It looks like they've slept in the sun too much, lol.
>>
>>107395713
Z Image Turbo cost like 640k USD.
>>
File: ComfyUI_11827_.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>
>>107397163
example prompt?
>>
>>107396139
bap is a jew
>>
>>107397163
>how to fix the burned skin for some celebs in z?
you can't, they got overtrained, you can try to go for (celebrity:0.1) to tone that down a bit
>>
Has anyone tried this?
https://orange-3dv-team.github.io/MoCha/
Is the quality as good as it looks?
>>
File: mfw.png (1.47 MB, 1822x899)
1.47 MB
1.47 MB PNG
>>107397203
>>
File: neko neko ne ne ne.png (1.3 MB, 1280x720)
1.3 MB
1.3 MB PNG
>>
>>107397117
imo these types of loras are dead, you can just prompt for it in z
>>
blew out my comfy install today because i'm fucking sick and tired of it not being able to load png workflows.. 680 gigs of models gone
>>
>>107397204
bruh.. blue board
>>
comfy should be dragged out on the street and shot
>>
>>107397256
shutup sperg
>>
>>107397252
it's sfw if you're american
>>
>>107397263
just filter out those schizo sentences, like broken clocks they always say the same thing lol
>>
>>107397245
What do you mean by png workflows? Like metadata?
>>
>>107397276
yeah.. it just would not load any metadata from pngs anymore.. only worked with json workflows so i couldn't load my settings from good pics i had

fucking hate comfy
>>
>>107397256
*sucked
>>
File: z-image_00017_.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
british woman with thick hair and cancer, harbor boats and pastel houses behind. Breezy seaside light, warm, cinematic close-up.
>>
File: ComfyUI_01246_.png (1.46 MB, 1536x1152)
1.46 MB
1.46 MB PNG
>>
>>107397315
>and cancer
what the fuck did you expect to see?
>>
>>107397338
What did you expect to see?
>>
>>107397315
she's hairy like a yeti
>>
>>107397235
This. You really need to offer something more stylistically different and consistent that you can't get straight out of the model.

There's certainly potential in say 70s, 80s, 90s movies / genre / style specific loras, but just a 'amateur photo of X' is rather pointless.
>>
File: z-image_00022_.png (1.06 MB, 1024x1024)
1.06 MB
1.06 MB PNG
british man with thick hair and cancer, harbor boats and pastel houses behind. Breezy seaside light, warm, cinematic close-up.
>>
>>107397369
if you want a cancer patient you need to prompt away the eyebrows
>>
cool failgen
>>
>>107397190
looks like :1 gives better result than :0.1 lol
>>
File: z-image_00030_.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
british man with thick hair and cancer, harbor boats and pastel houses behind. Breezy seaside light, warm, cinematic close-up. Even though he's dying on the inside, he smiles. He has no eyebrows.
>>
does anyone have a link to the new ai toolkit zimage patch?
>>
>>107397395
Try adding typhus and maybe gangrene
>>
>>107397397
overvrite v1 to v2 in config file
>>
>>107397235
>>107397361
You vastly overestimate the average prompters ability to prompt
>>
>>107397394
weird, since :1 is the same thing as doing nothing
>>
>>107395830
>>107395760
So botting again, how pathetic, bravo.

Is your general so fragile and pointless that if anyone puts an anime anchor everywhere because anime website your general loses its reason to exist?

And what if I ctrl C and ctrl V the Anime anchor in /sdg/ will you spam it to death also?

More points in favor of how /adt/ is a problematic thread and deserves to be deleted from /g/ for offtopic and spam.

It was never D*bo or /s*g/, it was you, the resentful, terrorist and fake thread and thanksfull more people are starting to notice.
>>
>>107397395
just put chemotherapy
>>
>>107397425
>Who would want to spam /ldg/ and hurt it? It must be... one of their own regular posters, doing it to attack people from other threads somehow
Hmm, I doubt it
>>
File: z-image_00062_.png (1.67 MB, 1024x1024)
1.67 MB
1.67 MB PNG
>>
>>107397456
sure thing ranfaggot
>>
File: z-image_00064_.png (1.14 MB, 1024x1024)
1.14 MB
1.14 MB PNG
british man with thick hair and cancer, harbor boats and pastel houses behind. Breezy seaside light, warm, cinematic close-up. Even though he's dying on the inside, he smiles. He has no eyebrows and typhus. He also has gangrene. Chemotherapy
>>
>>107397481
Doesn't seem to work, you need to make a chemo patient lora
>>
>>107397481
gives nice nipple chin effect
>>
File: ComfyUI_09501_.png (1.75 MB, 1152x1152)
1.75 MB
1.75 MB PNG
>>107395612
Kek anybody who is still paying for them and is an APIcuck on top of it deserves to be scammed at this point.
>>
>expecting a two sentence prompt to work
did you even see the researchers prompts?
>>
Did they change how loras are loaded again? Getting different outputs on the same seeds from a couple days ago
>>
>>107397245
>680 gigs of models gone
you can just move the models folder, what do you mean?
>>
retard here, the ZiT text encoder goes into the clip folder right?
>>
>>107397481
now do cute asian girl with thick hair and cancer, harbor boats and pastel houses behind. Breezy seaside light, warm, cinematic close-up. Even though she's dying on the inside, she smiles. She has no eyebrows and typhus. She also has gangrene. Chemotherapy
>>
File: 1758466123305742.jpg (52 KB, 682x875)
52 KB
52 KB JPG
>>107397393
the only failgens are images that get permanently deleted.
>>
>>107397533
>he doesnt symlink everything to one folder like everyone else
>>
>>
>>107397563
Isn't that awfully messy?
>>
File: f2f-p0w.jpg (1.74 MB, 1600x1600)
1.74 MB
1.74 MB JPG
>>
>>107397533
you can put it both on clip or text_encoder folder it doesn't matter
>>
File: ComfyUI_09507_.png (2.1 MB, 1152x1152)
2.1 MB
2.1 MB PNG
>>
>pooma
>>
>>107397571
no, because for everything you care about you can just search it quickly after throwing all the files in and for the rest you have lora manager to show you what you need
>>
>>107397004
0% like a y2k camera.
>>
>>107397596
Isn't lora manager bloat?
>>
>>107397117
imagine thinking a y2k p&s was going to be that sharp
>>
>>107397565
I don't understand what this means as a commentary on /ldg/ other than the lazy and obvious stereotypes
>>
6s per 1280x1280 zimage with powerlimited 3090 and no fp16 accumulation, big
>>
>>107397393
Nice style
>>
>>107397565
>european kb
weebs are eurocucks confirmed.
>>
>>107397636
3 steps?
>>
>>107397646
8 and
>>107397000
>8 images in a batch to get 1 for free
>>
>>
this doesn't work:

there is a simple cross shape inside of a square shape.

from the top side of the square, the left half is erased but the right half remains.
from the right side of the square, the top half is erased but the bottom half remains.
from the bottom side of the square, the right half is erased but the left half remains.
from the left side of the square, the bottom half is erased but the top half remains.
>>
File: z-image_00068_.png (1.29 MB, 1024x1024)
1.29 MB
1.29 MB PNG
>>107397540
now do cute asian girl with thick hair and cancer, harbor boats and pastel houses behind. Breezy seaside light, warm, cinematic close-up. Even though she's dying on the inside, she smiles. She has no eyebrows and typhus. She also has gangrene. Chemotherapy
>>
Is there any actual reason to use comfy?
Like out of those 10k nodes that make it a nightmare to use, is there any single one that would give me a result I can't get on another ui?
>>
>>107397657
he should be on his chest tho
>>
>>107397671
>Is there any actual reason to use comfy?
masochism
>>
>namefag
>having any clout
>>
>>107397669
heartbreaking
>>
File: ComfyUI_09517_.png (1.47 MB, 1152x1152)
1.47 MB
1.47 MB PNG
>>107397587
Not Chroma, it's ZIT iPhone LoRA.
>>
>>107397691
So far the only lora that managed to remove the blur from time to time is this one
https://civitai.com/models/2178631
>>
I can't get ZIT to give me sushi served on the bottom of a woman's soles.
>>
File: Ostris.jpg (134 KB, 1468x530)
134 KB
134 KB JPG
Why the fuck is Ostris toolkit downloading the model again on load? I already went through this once already.
>>
>>107397707
https://civitai.com/models/638889
this one works about 90% of the time for me
>>
File: file.png (119 KB, 2157x383)
119 KB
119 KB PNG
>>107397738
lmao what kind of mental illness is this?
>>
>>107397751
sounds based
>>
File: 1744215642287582.png (2.64 MB, 1280x1280)
2.64 MB
2.64 MB PNG
>>
>>107397665
Total Z-Image failure! I gave it this straightforward descriptive prompt with more than enough information, and it couldn't do it correctly:
>Eight persons — Alice, Brian, Claire, Daniel, Paula, Quentin, Robert, and Sophie — are sitting in a row in which some of them are facing north while some of them are facing south. Only three persons sit to the left of Robert. Three persons sit between Robert and Quentin. Brian is second to the left of Quentin. More than two people sit between Brian and Sophie. Daniel is second to the left of Sophie.
The neighbours of Robert are facing south. The neighbours of Paula face the same direction. Claire is second to the right of Alice. Alice and Paula face opposite directions, and similarly Brian and Quentin face opposite directions as well.
>>
File: ComfyUI_00075_.png (2.03 MB, 1504x1024)
2.03 MB
2.03 MB PNG
zit can't do amputations, apparently, but it's pretty funny.
>>
>>107397425
Are you the guy who was spamming /adt/ with crosslinks to this thread? Same extreme mental illness energy.
>>
>>107397782
I'm a simple man, I want nazi flags.
>>
ok, the negative prompt is largely, but not totally, ignored.
>>
File: ComfyUI_00023_.png (1.85 MB, 1024x1536)
1.85 MB
1.85 MB PNG
>>
>>107397425
plain wrong nig
>>
Z-Image-Base To be released To be released
Z-Image-Edit To be released To be released

tee hee
>>
File: ComfyUI-ZiT-iPhone_00013_.png (1.22 MB, 1152x1152)
1.22 MB
1.22 MB PNG
>>107397707
Goal was just to increase photorealism, not perfect for this yet though as some of the textures like hair and skin look drawn and it messes with small details (which I assume would need a large scale tune to fix).
>>
>>107397844
total doomposter victory
>>
>>107397858
>not perfect for this yet though as some of the textures like hair and skin look drawn and it messes with small details
to be fair he used the v1 version of the adapter maybe he could improve that >>107395974
>>
https://github.com/ChenDarYen/ComfyUI-NAG/pull/64

negatives near, this is very good news.
>>
File: ComfyUI_00055_.png (2.65 MB, 1504x1024)
2.65 MB
2.65 MB PNG
>>107397867
mainly, we need to assume they won't and it's up to us to make it work. to wit
>>107397875

zit's amazing speed is game changer for me.
>>
File: ComfyUI-ZiT-iPhone_00019_.png (1.65 MB, 1152x1152)
1.65 MB
1.65 MB PNG
>>107397868
Thanks, I'll look into it
>>
>>107397671
Cumrag is good for automation. It's like coding-lite. If you only need to gen 1girls with standard parameters, it won't give you anything.
>>
>>107397952
you're the one who made that lora though? if yes then it's a really based lora dude, congrats
>>
anyone trained character or celeb loras yet? Does ZiT carry over the likeness well?
>>
File: comparison_lora.jpg (380 KB, 2044x1024)
380 KB
380 KB JPG
>>107397235
>>107397361
Not really. I had significantly different results of using lora vs without it
>>
>>107398024
both look realistic, but the one of the left looks more professional, it depends on your taste and I hope the base model will let us have more control over that, I too prefer the amateurish look of photos
>>
>>107398024
right wins because it's not slopgirl
>>
File: ComfyUI_00040_.png (963 KB, 768x1216)
963 KB
963 KB PNG
>>
File: ComfyUI-ZiT-iPhone_00029_.png (1.92 MB, 1152x1152)
1.92 MB
1.92 MB PNG
>>107397982
Yep, thanks.
Dropped catbox a couple of threads back in case anyones wants to try it. The LoRA is also meant to improve the muted colors as I trained on HDR images.

>>107390078
>>
>>107397728
You should copy the files to a local folder and set the path to the local folder.
>>
>>107398058
basically, the reason we hate the asian women 1girls is they all look alike, and are the asian slopgirl.
>>
>>107397875
Shame the compatibility issue is still there with latest comfy
>>
i am apparently retarded. Why can't I find all of the loras and vaes required to use civitai workflows? why isn't there just a comment block with links to all this shit I need? i just want to generate some anime boobies with my 5090.
>>
>>107397671
Can you use more than one sampler in auto? Apply SAG/PAG to a range of steps?
>>
>>107398024
>no trigger discipline
>>
>ZiT model downloaded in 40 minutes
>text encoder takes 3 hours
why is civitai like this
>>
Des switching from PCIe 3.0 NVME to 4.0 make a noticeable difference when it comes to model loading and offloading?
>>
File: 1758942863240901.png (1.53 MB, 1280x720)
1.53 MB
1.53 MB PNG
this model really thrives with long ass prompts, is there a comfyui node that adds some sort of local llm rewriting or something? I'd like to automate that and not ask chatgpt to do this shit for me all the time
>>
>>107398182
Your whole system feels snappier but it's not like a night and day difference.
>>
>>107398140
> trigger discipline
for pussies
>>
>>107398159
use huggingface wtf
>>
>>107398159
Why the fuck are you downloading models from civitai
>>
one sec
>>
File: file.png (1.83 MB, 1024x1536)
1.83 MB
1.83 MB PNG
>>
File: ComfyUI_00084_.png (1.88 MB, 1504x1024)
1.88 MB
1.88 MB PNG
Maybe scheduler matters? Or a fluke...?

>>107398071
Yeah, they gotta get it really working.
>>
>>107398016
Yes, it carries over likeness well, but you still have to prompt it correctly to get the likeness since trigger words don't work.
>>
Can I please have a tutorial on wan 2.2 for linux?
the one in the OP is for windows only.
>>
>>107398266
>>107398266
>>107398266
>>107398266
>>
>>107398182
It's barely noticeable for generations. NVMe affects only the first generation when UI loads the model into RAM. Consequent generations work with that RAM version. Unloading isn't a thing (model weights don't change so it doesn't need to be saved back. data is simply rewritten in memory).
>>
File: ComfyUI_00090_.png (1.08 MB, 1504x1024)
1.08 MB
1.08 MB PNG
intedesting



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.