[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: collage.jpg (3.3 MB, 5183x3428)
3.3 MB
3.3 MB JPG
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107609700

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
NAI 4.5 marches on
>>
File: debox_00029_.png (2.3 MB, 2016x1152)
2.3 MB
2.3 MB PNG
>mfw
>>
>>107615164
>>Maintain Thread Quality
>https://rentry.org/ranfaggot
fix'd
>>
File: file.png (1.59 MB, 1024x1536)
1.59 MB
1.59 MB PNG
>>107615179
begone shill

tried some booruprompting... with the newbie model... lol
>>
File: file.png (2.25 MB, 1024x1536)
2.25 MB
2.25 MB PNG
>>107615187
this one came out better, literally same prompt
>>
Can Z to image to image? I can't really find a workflow that works.
>>
>>107615179
>>107615183
>>107615187
>>107615191
thought in tran?
https://rentry.org/ranfaggot

>>107615197
you have to read the ranfaggot rentry first.
>>
>>107615187
NAI 4.5 is still the strongest anime model, you will never have your local NAI
>>
>>107615207
yeah ok but this is a local thread, so kindly fuck off
>>
>>107609797
>>107609797
>>107609797
finish the thread
>>
>>107615218
you made that thread, finish it by yourself debo
>>
>>107615213
>local thread
kindly look at the maintain thread quality rentries then discuss schizos. if you was a drama free thread go somewhere else
>>
>>107615164
stop telling the anime posters from this drama thread to go to /adt/. these posters fucking suck at anime
>>
How does NAI manage to stay strong all this time? Is still a lighter model than SDXL
>>
File: file.png (2.03 MB, 1024x1536)
2.03 MB
2.03 MB PNG
>>107615197
just use the union cnet v2, it supports inpainting.

I think im already fed up with this model.
undercooked sadly.
>>
>>107615185
>>107615204
>>107615218
>>107615228
>>107615236
alll these posts were made by the same """person""" btw
>>
>>107615266
100% he's also doing the NAI shilling too, just to shit on this thread more.
>>
>>107615266
I wonder who
https://rentry.org/ranfaggot
>>
>>107615197
>Can Z to image to image? I can't really find a workflow that works.
Yeah, it works great. Z + lora pushes tons of detail into image.
>>
is there a working wan2.2 workflow yet
last i asked i was told it was fucked
>>
>>107615277
>https://rentry.org/ranfaggot
this creature is /ldg/'s fault. we can never have a normal thread again because people get gaslit into being useful idiots
>>
>>107615288

The default one works fine in comfyui, no issues.
>>
File: 1765889679246665.png (199 KB, 1126x621)
199 KB
199 KB PNG
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo/discussions/107
YOU BLOODY BASTARD WHERE IS ZIMAGE BASE !!!
>>
>>107615316
>PIGFACE
thought they worshipped cows over there
>>
>>107615314
>>107615316
get new material troll. comfyui is dogshit and asking for base is a dead meme. hold off until christmas
>>
>>107615331
>comfyui is dogshit
get new material troll
>>
>>107615316
biuteful english, merge for good looks
>>
File: z_mod_00003_.jpg (385 KB, 1344x1728)
385 KB
385 KB JPG
>>
File: 1752520056540574.png (2.91 MB, 870x5010)
2.91 MB
2.91 MB PNG
>>107615277
>>107615293
>Here, look at me being a worthless samefagging nuisance, now do what I tell you because I say so
The fact that you think this that anyone who reads your schizophrenic seethewall will come to any conclusion other than you being a mentally unstable retard is nothing short of hilarious
>>
>>107615236
No one will go to your weeb shithole, stop shilling it. Me as an anime poster, I don't care about other anime posters, you have to understand that.
>>
File: z-image_nag_00292_.png (2.61 MB, 1024x1536)
2.61 MB
2.61 MB PNG
bludy bastert, where is base
>>
File: z-image_nag_00277_.png (2.24 MB, 1024x1536)
2.24 MB
2.24 MB PNG
>>
retards, even when base is released it'll still take months to get a NSFW finetune, and it'll take weeks for the good lora creators to migrate over.
>>
>>107615472
looks ultra haggy, make her younger and hotter
>>
File: z_mod_00013_.jpg (344 KB, 1920x1152)
344 KB
344 KB JPG
>>
File: z-image_nag_00300_.png (2.54 MB, 1024x1536)
2.54 MB
2.54 MB PNG
>>
>>107615494
turbo really just wants to make them as ragged and haggered as possible, you really need to tard wrangle it until it says uncle and even then her skin still gets wrinkly
>>
>>107615577
try upping the shift in the 6-7 range
also this one came out nice, id rape her with my dick if you catch my drift
>>
File: z_mod_00024_.jpg (528 KB, 1344x1728)
528 KB
528 KB JPG
>>
>>107615607
in this case i think it was just my ksampler at the time, now i use the advanced stuff. hell i was experimenting with shifts higher than 7 at the time but i really should've just changed the sampler.

which reminds me to go back and redo tons of old gens with that new sampler setup.. eeehhh.
>>
File: z-image_nag_00306_.png (2.22 MB, 1024x1536)
2.22 MB
2.22 MB PNG
>>
File: z-image_nag_00307_.png (1.86 MB, 1024x1536)
1.86 MB
1.86 MB PNG
>>
>>107615391
why include schizo walls of text in the OP?
>>
>>107615391
you need to take your meds tran
>>
File: z-image_nag_00310_.png (3.21 MB, 1024x1536)
3.21 MB
3.21 MB PNG
>>
File: z_mod_00027_.jpg (650 KB, 1344x1728)
650 KB
650 KB JPG
>>107615633
>>107615646
>>107615672
Lots of detail for small images, nice. I guess nag lets you do this or do you run i2i with small size increase?
>>
lmgfag here
Rarely do imagegen, how can I get a prompt influence/cfg param, different sampler? i loaded a couple flux examples and their graph is different and confusing :((
>>
>>107615696
just t2i.. not sure if nag is necessary but its just the workflow i use now
>>
>>107615700
just use z image
>>
>>107615734
I don't want a different model I want to adjust the sampling / CLIP influence on my pipeline
>>
File: z-image_nag_00315_.png (1.9 MB, 1024x1536)
1.9 MB
1.9 MB PNG
>>
File: 1745561904020939.png (3.05 MB, 1088x1920)
3.05 MB
3.05 MB PNG
my Z image variation hack is to just gen everything at four different resolutions/aspect ratios.
>>
File: 1739140131249253.png (2.73 MB, 1920x1088)
2.73 MB
2.73 MB PNG
>>107615809
>>
Blessed thread of frenship
>>
File: z-image_nag_00321_.png (1.85 MB, 1024x1536)
1.85 MB
1.85 MB PNG
>>
File: z_mod_00029_.jpg (448 KB, 1344x1728)
448 KB
448 KB JPG
>>
>>107615672
>>107615633
>>107615646
these look like windows xp wallpapers
>>
had too much carbs, went out with friends for pizza and drank 5 beers, don't feel like genning
>>
I'm genning an image in ca. 75 seconds with Z-Image and default settings (20 steps) and at Q8. Is this a typical speed for a middling 12GB GPU from two or three years ago, or are there settings I could tweak or some kind of Lighting/Turbo/etc. LORA I could use?
>>
File: z_mod_00065_.jpg (525 KB, 1344x1728)
525 KB
525 KB JPG
>>
>>107610020
prompt?
>>
File: z-image_nag_00329_.png (2.32 MB, 1024x1536)
2.32 MB
2.32 MB PNG
>>
>>107615933
20 steps is overkill for Z. idk where you got default from, it's not official. it converges at 9 steps and anything more than 12 or so just introduces more noise into the image
>>
>>107615933
did you rtfm?? use the default wf?? Z should be run at 8-15 steps
>>
>>107615700
Use the CFG guider instead of the basic one. Can't believe no one answered this jfc
>>
File: z-image_nag_00334_.png (2.33 MB, 1024x1536)
2.33 MB
2.33 MB PNG
>>
>>107615966
>>107615974
incorrect ~30 steps give more details and doesnt "add noise"
>>
>>107615998
point and laugh at this nigger for going over 12 steps on a lightning distilled model
>>
>>107615984
you're more charitable than some, he was asking a pretty stupid question to be honest

>>107615998
use whatever makes you happy, but 8 steps is sufficient for many gens and what he should start with if he has such bad gen times
>>
>>107615933

I was using the default z template in comfyui and my 3060 12gb did them in 28-32 seconds. Bought a 5060ti and it does them in 16-18. over a minute is way too long.
>>
Comfy-Manager seems to break previews. Removing it restores them. I guess the native integration is somehow conflicting with VHS.
>>
>>107616022
>but 8 steps is sufficient for many users
true im not many users my gens blow everyone elses out of the fucking water. most probably dont need it anyway
>>
>>107615998
delusional or blind, not sure
>>
>>107616034
>my gens blow everyone elses out of the fucking water
i always love seeing this quote come from people who don't post gens
>>
>>107616036
meant for >>107616007

>>107616041
its more fun when you dont attach an image and no one knows who you are :) be thankful
>>
>>107615998
Can you show an example?
>>
Say hypothetically someone was wanting to use WAN to make videos that make a white liquid spray onto someones face but have it come from off screen and no object visible. What kind of prompts or loras or otherwise necessary things would that person need to do to achieve such a hypothetical result?
>>
>>107616050
Nobody hell this man make deepfake porn!!!
>>
>>107615966
>>107615974
It's the default in stable-diffusion.cpp
Thanks for the tip, now it's down to 32 seconds (which is very impressive compared to Flux Schnell) Prompt adherence and quality seems slightly worse at 8 steps but it's fine for testing prompts and ideas.
>>
>>107616030
Even with the new GUI setting?
>>
File: 1757282363895966.mp4 (798 KB, 480x854)
798 KB
798 KB MP4
>>107616050
instructions unclear, made syrupy liquid spray onto a pancake girl's face
>>
>>107615179
Z-Image booru tune will destroy it.
>>
>>107616071
Or actually, it may be under the Comfy section instead of Server-Config, depending on which Comfy installation method you used.
>>
What if Base never releases what are we going to do
>>
>>107616094
kill myself or alternatively just stick with sdxl.
>>
>>107615207
what even is the tech behind nai? it started with SD, right?
>>
File: file.png (1.55 MB, 903x1170)
1.55 MB
1.55 MB PNG
>>107615998
if you're gonna be that confidently wrong at least post proof of your retardation next time
>>
File: ComfyUI_01241_.png (1.17 MB, 1152x896)
1.17 MB
1.17 MB PNG
>>107616072
I'm the OG food girl poster, I will accept requests for different kinds of food to turn into a girl.

here's one of my first chroma gens, from may 2025
>>
>>107616072
I can't bear to bite those pendulous tatas...
>>
>>107616109
oh my god did you do that pancake girl in the vid i posted? please share that damn prompt, i can't recreate it to save my life. How do you prompt engineer sexy food girls like this?

>>107616113
so don't (hopefully she doesn't mold over time)
>>
>>107616108
Yeah I'm sure you're running a non schizophrenic workflow with a non schizophrenic sampler and scheduler. It's definitely because too many steps and not anything else.
Retard.
>>
File: 886568585899.jpg (494 KB, 2506x974)
494 KB
494 KB JPG
>>107615316
This is slopped beyond salvation. Not as good as NBP nor Flux.2 base. It is good that they are still cooking at least. Why are people so blind to slop?
>>
>>107616109
lol i remember you. god bless.
>>
>>107616108
>after/before airbrushing
>>
>>107616122
basic default euler simple workflow you fucking retard. fuck off and go troll elsewhere you piece of shit.
>>
>>107616071
>>107616088
Even with that option set to auto, it still didn't work for me. I did get it working though.

What I had to do was REMOVE the ComfyUI-Manager custom node, and install it via pip manually.

https://blog.comfy.org/p/meet-the-new-comfyui-manager
.\python_embeded\python.exe -m pip install -r ComfyUI\manager_requirements.txt 

The instructions doesn't tell you that you need to remove the legacy manager custom node.
>>
>>107616072
how do you prompt this pls. or is it a lora?
>>
>>107616109
Rotisserie chicken
>>
>>107616137
Strange how you're the only one who gets that kind of output with additional steps isn't it lol
>>
>>107616142
You also need to put "--enable-manager" in the launch options.
>>
>>107616154
post your comparison then. i'm waiting.
>>
>>107616158
My question is why purposefully mislead? Anon figured this out within hours turbo releasing. You must have missed that
>>
>>107616186
mislead what? you sound like that redditor that told people to increase z-image steps to 50 and his images were all noisy shit
i don't get what your point is, it makes the images look like they've gone through 10 rounds of jpeg compression. if that's the look that looks natural to you then fine, you do you
>>
File: 1764636904944353.png (1.95 MB, 1440x1120)
1.95 MB
1.95 MB PNG
>>107616120
I didn't do that exact pancake girl, no. that one was a different anon, I think inspired by my maple syrup girls.
>>
>>107616202
Ah I see you look to reddit theres your problem. Go back to the first handful of threads around turbos release and figure it out. I will not reply further
>>
File: 90fg78.png (17 KB, 741x444)
17 KB
17 KB PNG
>>107615984
Thanks anon bless you. Where is it in the menu? I went through every submenu in Add Node several times and don't see that. I don't see BasicGuider which is already in the graph either
Maybe my comfy is a bit outdated
>>
>>107616221
delusional, like i said. post your image comparisons or shut the fuck up
>>
File: 1748839216152801.mp4 (878 KB, 720x560)
878 KB
878 KB MP4
this was one anon's i2v of my gen, thank you based anon
>>
>>107616123
>This is slopped beyond salvation
kek no its just not aesthetically tuned like turbo (which is a good thing btw)
and base allows someone to make a massive finetune on booru or real sexo which, if attempted on turbo, wouldnt work
>>
>>107616276
>aesthetically tuned
who decides the aesthetics?
cause i'd like to speak to their manager
>>
Today I saw a Chinaman and I wished to strangle him on behalf of anon.
>>
>>107616292 (me)
any time a model is "aesthetically" tuned it just sounds like they optimize for:
>(bokeh, plastic skin, shallow depth of field, central composition, blurry, jpeg artifacts, masterpiece, 4k, hd:3.0)
i want to know who decided this was A E S T H E T I C
>>
>>107616145
so far everything I get from this is absolute nightmare fuel. what were you thinking?
>>
Is there a simple way I can generate my in z images at 1024x1024 and then upscale it slightly all in the same workflow? Is their a simple upscaled I can just dump into the default?
>>
>>107616292
>>107616326
i mean the whole thing about ZiT is how non generated the outputs can look. it was so good it caused an influx of NBP shills on twitter to post "Look at how real this image looks!" my timeline was flooded with those posts around its release and it wasnt a coincidence
but yes most have little to no conception of what looks good
>>
File: trellis2.png (2.97 MB, 2560x1440)
2.97 MB
2.97 MB PNG
Trellis 2.
It seems that the model has not had any HF training as it will often render empties and other junk you mind find in a viewport.
>>
>>107616345
>>107616145
holy kek. I am going to hell for genning this.

>>107616364
just gen the whole image at your desired res. unless you're a severe vramlet. then use seedvr or something.
>>
>>107616398
lmao
>>
>>107616398

Genning it at just 1294x1294 almost doubles the gen time.
>>
File: 1761091571571138.png (3.42 MB, 1920x1088)
3.42 MB
3.42 MB PNG
>>107616409
what are your PC specs??
>>
>>107616445

5060ti 16gb
>>
>>107616455
lrn2usecomfy m8 seriously i'm on the same card genning at 1920x1080 one shot.2.8s/it.
>>
File: 1762970772551243.jpg (41 KB, 676x676)
41 KB
41 KB JPG
>>107616474

Fren please....how I kneel...tell me
>>
>>107616398
>Schlomo Shekelstein
I like to think you didn't explicitly prompt for it. >>107616345
hunger
>>
is it possible to generate cellulite on z, chroma, and qwen?
>>
>>107616483
Here take my workflow embedded in my shitty attempt to make a maple syrup girl https://files.catbox.moe/r1tqad.png

>>107616398
>>107616345
>>107616207
>>107616109
pls give me your prompt formula i get mixed results in chromasome, flux, and currently z-image.
>>
>>107616490
Cellulite doesn't exist in Chinese culture.
>>
File: Screenshot.jpg (45 KB, 810x382)
45 KB
45 KB JPG
>>107616364
there are a few native nodes (upscale latent, upscale image etc), search for upscale



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.