[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


Janitor application acceptance emails are being sent out. Please remember to check your spam box!


[Advertise on 4chan]


File: robot.mp4 (949 KB, 512x512)
949 KB
949 KB MP4
Previous /sdg/ thread : >>107370979

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://huggingface.co/jayn7/Z-Image-Turbo-GGUF

>Flux.2 Dev
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/city96/FLUX.2-dev-gguf

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://tensor.art
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>
File: 2025-31.jpg (199 KB, 2074x1613)
199 KB
199 KB JPG
Morning
>>
>>
>>
gm
looks like 8 steps is enough with cfg 1.
>>
File: 00007.png (2.61 MB, 1536x1024)
2.61 MB
2.61 MB PNG
>>
File: Waifu.jpg (112 KB, 1024x1024)
112 KB
112 KB JPG
>>107381874
For which model
>>
>>107381680
gm
>>
>>107381884
see file name
>>
>>
>>107381893
Sorry
>>
How much VRAM does zimage need?
>>
>gm
>>
File: 150281748.png (1.14 MB, 896x1152)
1.14 MB
1.14 MB PNG
>>107382226
>>
>mfw Resource news

11/30/2025

>The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation
https://showlab.github.io/Adv-GRPO

>ComfyUI-SAM3DObjects - Single-Image to 3D Object Reconstruction
https://github.com/PozzettiAndrea/ComfyUI-SAM3DObjects

>ComfyUI-DyPE v2.1: Multi-Architecture Support
https://github.com/wildminder/ComfyUI-DyPE/releases/tag/2.1.0

>Qwen3-Next-80B-A3B-Instruct GGUF Models
https://huggingface.co/unsloth/Qwen3-Next-80B-A3B-Instruct-GGUF

>Vidi2: Large Multimodal Models for Video Understanding and Creation
https://bytedance.github.io/vidi-website

11/29/2025

>FlowMatch Euler Discrete Scheduler for ComfyUI
https://github.com/erosDiffusion/ComfyUI-EulerDiscreteScheduler

>AMD Rocm 7.1.1 released: Now with aotriton
https://www.amd.com/en/resources/support-articles/release-notes/RN-AMDGPU-WINDOWS-PYTORCH-7-1-1.html

>ComfyUI-Z-Image-Utilities
https://github.com/Koko-boya/Comfyui-Z-Image-Utilities

>Valve dev Ayi Sanchez counters calls to scrap Steam AI disclosures
https://www.pcgamesn.com/steam/ai-disclousres-debate-valve-dev-response

11/27/2025

>Z-Image-Turbo: Distilled State-of-the-art image generation model with 6B parameters
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Official FLUX.2 Prompting Guide
https://docs.bfl.ai/guides/prompting_guide_flux2

>AnchorOPT: Towards Optimizing Dynamic Anchors for Adaptive Prompt Learning
https://github.com/zhengli97/ATPrompt

>MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices
https://github.com/hustvl/MobileI2V

>Monet: Reasoning in Latent Visual Space Beyond Images and Language
https://github.com/NOVAglow646/Monet

>UltraViCo: Breaking Extrapolation Limits in Video Diffusion Transformers
https://thu-ml.github.io/UltraViCo.github.io

>Deep Parameter Interpolation for Scalar Conditioning
https://github.com/wustl-cig/parameter_interpolation

>STARFlow-V: End-to-End VidGen Modeling with Normalizing Flows
https://github.com/apple/ml-starflow
>>
>mfw Research news

11/30/2025

>TEAR: Temporal-aware Automated Red-teaming for Text-to-Video Models
https://arxiv.org/abs/2511.21145

>Frequency-Aware Token Reduction for Efficient Vision Transformer
https://arxiv.org/abs/2511.21477

>MoGAN: Improving Motion Quality in Video Diffusion via Few-Step Motion Adversarial Post-Training
https://xavihart.github.io/mogan

>LLaVA-UHD v3: Progressive Visual Compression for Efficient Native-Resolution Encoding in MLLMs
https://arxiv.org/abs/2511.21150

>CaliTex: Geometry-Calibrated Attention for View-Coherent 3D Texture Generation
https://arxiv.org/abs/2511.21309

>IntAttention: A Fully Integer Attention Pipeline for Efficient Edge Inference
https://arxiv.org/abs/2511.21513

>MUSE: Manipulating Unified Framework for Synthesizing Emotions in Images via Test-Time Optimization
https://arxiv.org/abs/2511.21051

>ShapeGen: Towards High-Quality 3D Shape Synthesis
https://arxiv.org/abs/2511.20624

>Latent Diffusion Inversion Requires Understanding the Latent Space
https://arxiv.org/abs/2511.20592

>A Reason-then-Describe Instruction Interpreter for Controllable Video Generation
https://sqwu.top/ReaDe

>Revisiting KRISP: A Lightweight Reproduction and Analysis of Knowledge-Enhanced Vision-Language Models
https://arxiv.org/abs/2511.20795

>Concept-Aware Batch Sampling Improves Language-Image Pretraining
https://arxiv.org/abs/2511.20643

>Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward
https://arxiv.org/abs/2511.20561

>Advancing Image Classification with Discrete Diffusion Classification Modeling
https://arxiv.org/abs/2511.20263

>Adam Simplified: Bias Correction Debunked
https://arxiv.org/abs/2511.20516

>FVAR: Visual Autoregressive Modeling via Next Focus Prediction
https://arxiv.org/abs/2511.18838
>>
>vibe killed
>>
>>107381587
At least a bit of a cutie
>>
>>107382814
>A generic image of a stereotypical robot is "cute"
>>
File: tmp-1764521076900.jpg (71 KB, 1080x908)
71 KB
71 KB JPG
Anyone else having trouble on huggingface?
>>
>>
File: 000000_46339_.png (2.74 MB, 963x1706)
2.74 MB
2.74 MB PNG
>>
>>
>>107381972
I'm running it with 12gb vram but you could try with 8gb. Might work.
>>
File: lumi-pdxl_00014_.png (2.09 MB, 1792x1024)
2.09 MB
2.09 MB PNG
>>107382826
expand it... this thread is sus.
>>
File: lumi-zit_00014_.png (1.99 MB, 1792x1024)
1.99 MB
1.99 MB PNG
>>107381972
>>107383321
it works fine, about 60-70s/gen on an 3060ti, still lazily using the 9 steps from the template. Get the fp8 even though technically 3060ti doesn't support it, it still fits in vram and ram swaps is what gets you. if you're on a 1080 or something ymmv

https://huggingface.co/silveroxides/Z-Image-Turbo-SingleFile/tree/main
>>
>>107383339
Oh dang. I dont get it
>>
>>
File: 000000_46411_.png (1.71 MB, 768x1360)
1.71 MB
1.71 MB PNG
>>
>ffaze OP
at least not the usual slop
>>
File: lumi-zit_00015_.png (2.41 MB, 1792x1024)
2.41 MB
2.41 MB PNG
>>107383453
iykyk
>>
>>107383530
What's ffaze
>>
File: 1745705368730544.jpg (259 KB, 1248x1824)
259 KB
259 KB JPG
>>
>>107383567
What
>>
File: 3-corpart.png (1.42 MB, 896x1152)
1.42 MB
1.42 MB PNG
What kind of syntax does z image use?
>>
>>107383568
>>107383659
Shut up schizo
>>
File: lumi-zit_00020_.png (2.47 MB, 1792x1024)
2.47 MB
2.47 MB PNG
>>107384048
it's pretty flexible, i've been hitting it with SDXL style prompts and it's doing fine. 1.5 style prompting works too, but it won't go all acid trip on you like 1.5 would. using anything "breast" related is an open invitation to nsfw. i had one where it stuck bare breasts on an otherwise intact shirt. it's a weird model
https://files.catbox.moe/pyqcjn.png
>>
File: deBL_zi_00043_.png (3.24 MB, 1920x1280)
3.24 MB
3.24 MB PNG
trying to cobble together a prompt enhancement workflow
- qwen node ooms every other gen (vram isn't clearing?)
- I cant figure out how to install sageattention (cant find torch)

I'm already demoralized. gonna revert back
>>
guys guys
i'm literally shitting right now
>>
File: deBL_zi_00081_.png (2.4 MB, 1840x1232)
2.4 MB
2.4 MB PNG
~350s when it does work. yeesh
>>
>>
>>107384288
I think the prompt enhancement workflow might use some wrapper node what does not use comfyUI's memory management?
If you have the memory it's easier to install llama.cpp and use llama-server with some low weight model to bolster up your prompts, then just paste it in to ComfyUI. Or you could write your own node to read its output. What model? Gemma 3 has 12b and 27b for example.
>>
File: deBL_zi_00073_.png (2.48 MB, 1840x1232)
2.48 MB
2.48 MB PNG
>>107384521
I was looking to avoid all that extra work. I already balked at just trying to troubleshoot sageattention for 10min. if I care enough to try again in the future, I'll just use a cloud node and connect to my gpt account. then I don't have to worry about memory at all
>>
>>107384556
Now that I'm on Fedora 43, Cuda Toolkit 13 update 2 doesn't support the new runtime environment and it's only for F42. Need to wait until a new supported version comes out before I can try installing Sage either.
>>
>>107384288
There is so much going on in this image and I love it.
>>
>>107384504
disturbingly good
>>
>>
Afternoon anons
Looks like you can get FLUX Quokkas if you overcomplicate the prompt
>>
File: deBL_zi_00071_.png (2.39 MB, 1840x1232)
2.39 MB
2.39 MB PNG
>>107385177
I prob need to reign in my wildcards xD
>>
Previous pic but with background, apparently Z-image can do that too, maybe by accident.
>>
File: lumi-zit_00022_.png (2.94 MB, 1792x1024)
2.94 MB
2.94 MB PNG
>>
The animated landscape anon is still in the other thread trying to make it happen
>>
>>107385400
This looks like you are using one of those new civitai style loras? What are your sampler/scheduler/step settings?
Are you upscaling like in the old days and generating initial gen at around 1k then doing the final upscale?
I've been doing this and if I use style loras the initial gen often becomes somewhat grainy or strange, but this depends.
Upscaling is strange, I need to use very low denoise to avoid generating new details, like 0.1 or so.
For initial gens I just use flow 8 and 10 steps/euler/simple. Upscaling 5 steps/dpmm/sde uniform with 0.1 or 0.2 denoise.
I haven't really tried generating one big initial image maybe I should.
>>
File: deBL_zi_00067_.png (2.52 MB, 1840x1232)
2.52 MB
2.52 MB PNG
>>107385667
>using one of those new civitai style loras
nope, just base zimage
>sampler/scheduler/step
euler/ddim_uniform/20~30
>Are you upscaling
yeah, this series was experimenting with ultimate sd upscale. idk if its really worth the extra time
>Upscaling 5 steps/dpmm/sde uniform with 0.1 or 0.2 denoise.
I'm at .15 for these. .2 was getting aggressive at adding weird stuff
>I haven't really tried generating one big initial image maybe I should
I haven't seen the upper bound on where zimage starts going off the rails, so you can go pretty high it seems
>>
>>107385774
Makes sense. Thanks.
>>
>>107385651
is he trying to lose your virginity? very difficult task
>>
>>107385878
I'm sorry you're so upset. I hope you can have a good day regardless.
>>
File: 2025-3_0.jpg (983 KB, 3456x2688)
983 KB
983 KB JPG
>>
>>107385906
?
>>
>nta
>>
File: lumi-zit_00026_.png (2.29 MB, 1792x1024)
2.29 MB
2.29 MB PNG
>>107385774
some anon mentioned res <=2048 the other day. takin my own crack at prompt enhancement, still gotta rig it to do side-by-side and to see what the actual final prompt was. plush-for-comfy + some random enhancement prompt i found on x
>>
test
>>
Good morning.
>>
I new (obviously) and I'm trying to figure out inpainting. The problem I have is that I want to image2image on the masked area, rather than blacking it out and starting from scratch. Is there a good way to do this or search terms I should use to learn more? I'm using comfyui.
>>
>>107385203
lel
>>
>>107386512
I've just been cropping out the part I want to regen and running it back through image2image, then pasting it onto the original image and running inpaint to mesh the backgrounds together
>>
>>107386289
Thank you for letting us know.
>>
>>107381587
why is he such a hollow potatoe?
>>
>>
>>
File: deBL_zi_00054_.png (1.75 MB, 1536x1024)
1.75 MB
1.75 MB PNG
>>
>>
>>107387004
fun.
multiple characters interacting is my next challenge.
>>
File: lumi-zit_00028_.png (1.96 MB, 1792x1024)
1.96 MB
1.96 MB PNG
>>
File: lumi-zit_00029_.png (3.15 MB, 1792x1024)
3.15 MB
3.15 MB PNG
>>
File: 00075-1970917416.png (2.34 MB, 1024x1536)
2.34 MB
2.34 MB PNG
>>
>>
>>107387696
have you accepted our new lord and master z-image into your heart yet?
>>
File: deBL_zi_00048_.png (1.89 MB, 1536x1024)
1.89 MB
1.89 MB PNG
>>
File: lumi-zit_00040_.png (2.56 MB, 1792x1024)
2.56 MB
2.56 MB PNG
>>
File: deBL_zi_00044_.png (2.09 MB, 1536x1024)
2.09 MB
2.09 MB PNG
>>
z is a bit difficult to control placement and angle too precisely
it kinda gives up on some prompts too easily
that being said, it looks like my lora is overcooked,although it works and looks pretty good, it really takes over some aspects of the prompt over others
>>
>>107388462
that's at 0.2 strength
this is no lora
>>
>>107388521
and here's 0.85
at full strength she's basically in a photo studio with a plain wall behind her
>>
>>
File: deGE_zi_00001_.png (2.83 MB, 1920x1216)
2.83 MB
2.83 MB PNG
>>107388462
>z is a bit difficult to control placement and angle too precisely
>it kinda gives up on some prompts too easily
yar
>>
File: 00016-4083527920.png (2.3 MB, 1024x1536)
2.3 MB
2.3 MB PNG
>>107387710
Sorry for not getting back to you earlier, I got sucked into a podcast with "Forgotten weapons" Ian as a guest (Title is What Are The Rarest Weapons In The World?) :P
I heard of the news but haven't had a go with it. What I saw from the sample images, looked promising.
>>
>>107388602
pretty much everyone that was doing something non-sdxl jumped on board, even with its limitations, on the dream of "the base model is going to be released soon bro"
but it is really good at what it does, althought it doesnt do it all
>>
>>107388634
forgot to add, the sdxl people are mostly on board too, but some are stuck in their ways or found the things z doesnt do well or at all
and that lack of seed variety that hits it gets a bit tiresome
>>
on the other hand it handles tricks i havent been able to use since sdxl like the [from:to:when] conditioning scheduling (with the right node). it's a little different than forge/reforge's method
>>
File: deGE_zi_00002_.png (2.48 MB, 1920x1216)
2.48 MB
2.48 MB PNG
>>107388705
>with the right node
which node?
>>
>>107388705
which i waited for like a year for someone to implement and it was buggy, and i was already using flux-dedistilled and then chroma so i gave up on it lel
>>107388725
was posted in the other thread
https://github.com/asagi4/comfyui-prompt-control/
>>
i guess z-girl is half asian
>>
File: deGE_zi_00004_.png (2.51 MB, 1920x1216)
2.51 MB
2.51 MB PNG
>>107388741
I have that installed but dunno how to use it
I dont think I'd use the prompt edit syntax anyway
>>
File: 00090-3608802939.png (2.34 MB, 1024x1536)
2.34 MB
2.34 MB PNG
>>107388634
>>107388658
I hope they are having fun at least :)
>>
>>107388806
it's not hard to grasp if you take the time. it's fun to break tho
>>
>>107388888
or to get it to do unintentional things lel
>>
>>
File: 00018-2859400874.png (2.2 MB, 1024x1536)
2.2 MB
2.2 MB PNG
>>
File: deGE_zi_00007_.png (2.55 MB, 1920x1216)
2.55 MB
2.55 MB PNG
I found the AI bubble everyone is talking about
there's an angry dude inside
>>
i cant say it's what i prompted
i cant say it's not
>>
File: deCC_zi_00016_.png (2.15 MB, 1920x1216)
2.15 MB
2.15 MB PNG
>>107388973
I noticed z-image is very eager to give you two-tone hair if you have conflicting hair colors in the prompt
>>
File: 00101-2842643479.png (2.49 MB, 1024x1536)
2.49 MB
2.49 MB PNG
>>107388973
It's a curse and a gift :D
>>
>>107388993
>>107388995
yup
definitely not what i prompted for style or anipals or pose
or anything for that matter lel
>>
File: 00112-3965424691.png (2.25 MB, 1024x1536)
2.25 MB
2.25 MB PNG
>>107389020
The weird bean fellas are goofy af, but god damn they have a charm to them.
>>
>>107389033
i guess using "cute little ____" is a bad idea with z-
>>
File: 00039-1657167723.png (2.21 MB, 1024x1536)
2.21 MB
2.21 MB PNG
>>107389068
:D
>>
>>
>>
File: deGE_zi_00008_.png (2.35 MB, 1920x1216)
2.35 MB
2.35 MB PNG
>>107389144
the minions have accidentally become very cute
>>
>>107389332
TOO cute
>>
>>
File: deGE_zi_00009_.png (2.27 MB, 1920x1216)
2.27 MB
2.27 MB PNG
>>107389390
the sword officially known as rob
>>
:)



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.