/g/ - Technology




File: 1738002368940019.jpg (827 KB, 3584x4260)
Previous /sdg/ thread : >>107452129

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://huggingface.co/jayn7/Z-Image-Turbo-GGUF

>Flux.2 Dev
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/city96/FLUX.2-dev-gguf

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
>>>/r/realistic+parody
>>
File: OP_SEK_SDG_2.jpg (1.35 MB, 4608x3584)
FIRST FOR MADE OP PIC AGAIN WOOOT
>>
>mfw Resource news

12/06/2025

>Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length
https://liveavatar.github.io

>ComfyUI interface: Node 2.0
https://blog.comfy.org/p/comfyui-node-2-0

>Stable Video Infinity 2.0: Infinite-Length Video Generation with Error Recycling
https://huggingface.co/vita-video-gen/svi-model

>ComfyUi-ConditioningNoiseInjection
https://github.com/BigStationW/ComfyUi-ConditioningNoiseInjection

>AutoDescribe-Images: generate text descriptions for images using Ollama vision models
https://github.com/hydropix/AutoDescribe-Images

12/05/2025

>LongCat-Image: Open-source and bilingual (Chinese-English) foundation model for image generation
https://huggingface.co/meituan-longcat/LongCat-Image

>LongCat-Image-Edit
https://huggingface.co/meituan-longcat/LongCat-Image-Edit

>HunyuanVideo-1.5 480p_i2v_step_distilled
https://huggingface.co/tencent/HunyuanVideo-1.5/tree/main/transformer/480p_i2v_step_distilled

>PromptForge: A visual prompt management system
https://github.com/intelligencedev/PromptForge

>Amazing Z-Image Workflow v2
https://github.com/martin-rizzo/AmazingZImageWorkflow

>UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers
https://thu-ml.github.io/ultraimage.github.io

>NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation
https://yuzeng-at-tri.github.io/ppd-page

>Rethinking the Use of Vision Transformers for AI-Generated Image Detection
https://github.com/nahyeonkaty/mold

>Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion
https://yuemingpan.github.io/SFD.github.io

>ComfyUI Realtime LoRA Trainer
https://github.com/shootthesound/comfyUI-Realtime-Lora

>Remote FLUX.2 Text Encoder (HuggingFace) – ComfyUI Custom Node
https://github.com/vimal-v-2006/ComfyUI-Remote-FLUX2-Text-Encoder-HuggingFace
>>
File: 0547091061.jpg (313 KB, 1792x2304)
>>
>mfw Research news

12/06/2025

>ReflexFlow: Rethinking Learning Objective for Exposure Bias Alleviation in Flow Matching
https://arxiv.org/abs/2512.04904

>The Universal Weight Subspace Hypothesis
https://arxiv.org/abs/2512.05117

>SEASON: Mitigating Temporal Hallucination in Video Large Language Models via Self-Diagnostic Contrastive Decoding
https://arxiv.org/abs/2512.04643

>OmniScaleSR: Unleashing Scale-Controlled Diffusion Prior for Faithful and Realistic Arbitrary-Scale Image Super-Resolution
https://arxiv.org/abs/2512.04699

>Text-Only Training for Image Captioning with Retrieval Augmentation and Modality Gap Correction
https://arxiv.org/abs/2512.04309

>Multi-Scale Visual Prompting for Lightweight Small-Image Classification
https://arxiv.org/abs/2512.03663

>AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition
https://arxiv.org/abs/2512.03794

>In-Context Sync-LoRA for Portrait Video Editing
https://sagipolaczek.github.io/Sync-LoRA

>Contextual Image Attack: How Visual Context Exposes Multimodal Safety Vulnerabilities
https://arxiv.org/abs/2512.02973

>MAViD: A Multimodal Framework for Audio-Visual Dialogue Understanding and Generation
https://carlyx.github.io/MAViD

>UnicEdit-10M: A Dataset and Benchmark Breaking the Scale-Quality Barrier via Unified Verification for Reasoning-Enriched Edits
https://arxiv.org/abs/2512.02790

>InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision
https://arxiv.org/abs/2512.01342

>RoleMotion: A Large-Scale Dataset towards Robust Scene-Specific Role-Playing Motion Synthesis with Fine-grained Descriptions
https://arxiv.org/abs/2512.01582

>IRPO: Boosting Image Restoration via Post-training GRPO
https://arxiv.org/abs/2512.00814

>CC-FMO: Camera-Conditioned Zero-Shot Single Image to 3D Scene Generation with Foundation Model Orchestration
https://arxiv.org/abs/2512.00493

>Concept-Guided Backdoor Attack on Vision Language Models
https://arxiv.org/abs/2512.00713
>>
File: OP_SEK_SDG_1.jpg (951 KB, 4608x3584)
>>
>>107465495
Thanks. It's coming along slowly. Poverty life can be quite restrictive but eh, we all do our best with what we can work with. Not complaining, just saying hey, this is how it is. But I am getting closer to getting a nice upgraded rig with win10 on it (won't use win11, and would use winXP over win11 even with the limits that'd bring me). So once I order a newer rig I can then work on upgrades after. At the moment my rig is so weak I'm surprised it does the things it does. I mean, to be honest I can barely run Fallout 4, to give you an idea how weak it is. Runs Fallout 3 like a boss (my preferred Fallout game anyways) but there's the power scale of this old rig. Better fit running Everquest II and Fallout 3 than anything Fallout 4 era and later unless it's indie tier games.

As for the wiki I think at some point I will make one as I have tons of notes on some of the creations and piles of art for them to show off. If anything it'd be an interesting time sink for people to look over and go huh so this is some of the things one does with ai.
>>
File: deCG_zi_00036_.png (3.45 MB, 1728x1152)
>>107465524
hello
the cube burger reminded me of starfield. they had an in-game food brand called "chunks" that sold everything in space-ready squares. fruits, meals, burgers, fine wines, all cubed
>>
File: OP_SEK_SDG_4.jpg (1014 KB, 4608x3584)
>>107465767
Never played it, but it sounds like the standard stereotypical hyper minimalist trope common to cyberpunk genre stuff. This is a cool gen. Is it Z image? Are the random colored lines on the left side artifacts or intended?
>>
File: deCG_zi_00037_.png (3.32 MB, 1728x1152)
>>107465815
>This is a cool gen
thanks, but sadly this prompt was very samesy across gens
>Is it Z image?
yeah
>Are the random colored lines on the left side artifacts or intended?
kind of intended. I was trying to carry on some of perchance anon's tokens cuz they seemed like neat ideas
>CRT artifacting, rendered output as seen within the frame buffer with color bleeding and scanline behavior, interpret as uncompressed VRAM data
>>
File: dePB_zi_00015_.png (2.2 MB, 2016x1152)
>>107465815
>super-nutty
just noticed. brand accurate
>>
File: OP_SEK_SDG_5.jpg (1.21 MB, 4608x3584)
>>107465844
Haha yeah that's what I was going for
>>
File: 0758515845.jpg (514 KB, 1792x2304)
>>
5060 Ti at $550 or 3090 at $670 for the best value slop generator?
>>
>>107466369

With AI generated images, text or sound you should always prefer more VRAM if you can afford it. The downsides of the 3090 are that it eats more power and dumps more heat into your rig, plus warranty might be an issue if that's a second hand GPU. However, you will have more VRAM and it will be faster.
>>
hello virgins, how goes the weekend prompt game?
>>
>>107465887
>It's extraordinarily INDIAN.
I'm impressed you think that doing what I do with forced filters and forced tags and my last year of creating an entire ai universe of characters and locations and history of it all and pages of notes and stories used in ai chatbots to help shape all that is Indian but sure you go with that. Most people dont even understand most of what that prompt is doing but yes we'll run with it's very Indian.
>>
File: 000000_46796_.png (2.16 MB, 960x1360)
>>107465600
>>ComfyUI interface: Node 2.0
>https://blog.comfy.org/p/comfyui-node-2-0
OMG no, horrible, do not use Node 2.0 what the...

G'mornin Anons, ZimageTurbo does Beaver tail. We have a winner.
>>
File: G7iy7vaasAAxYIw.jpg (636 KB, 2048x959)
NEW ANIME MODEL FROM NOOBAI AND ALIBABA TEAM RELEASED

3.5B params (8GB VRAM friendly — RTX 4060? )

Dual text encoders: Gemma-3-4B-it + Jina CLIP v2 deep prompt understanding

XML-structured prompts for per-character control (no more outfit swapping!)

FLUX.1-dev 16-ch VAE buttery skin, fabric, metal

20-step inference, LoRA-friendly, Apache-2.0 + non-commercial license

Trained on 10M+ anime images w/ XML annotations rock-solid multi-character scenes

40% faster than 8B+ models, yet handles 500-char complex prompts with ease.

Model: https://modelscope.cn/models/NewBieAi-lab/NewBie-image-Exp0.1
>>
>>107466369
Seconding this question, trying to decide on a 5070 Ti at $800 versus a 3090 at about $150 less. I fear that the lack of FP8 might be a negative that could offset the lack of VRAM, but I don't remember if those modes produce worse quality.

My primary use case is 1024x and above images using SDXL/Pony/Illustrious with 35-65 steps and 1500+ token prompts. I would really like something faster than my current 1.25it/s and 8GB of VRAM, with more speed/capacity to use adetailers and hires fixes better.
>>
>>107466398
>downsides of the 3090 are that it eats more power and dumps more heat into your rig, plus warranty might be an issue if that's a second hand GPU
Not an issue, I got two 750W PSUs and a HAF case. As the other post says the generational feature gap is a bigger worry.
>>
>>107466902
>I fear that the lack of FP8 might be a negative that could offset the lack of VRAM, but I don't remember if those modes produce worse quality.

To see a close equivalent of the effects of quantisation/lower FP values, run the full version of Z-Image from https://huggingface.co/Tongyi-MAI/Z-Image-Turbo. It's more than 8GB so it won't fit in your VRAM, and you will have to wait a bit longer than usual to generate an image.

Then use the same seed, image res etc. and generate it again using a Q8 or lower quant (one that will fit in your VRAM) of Z-Image from here https://huggingface.co/jayn7/Z-Image-Turbo-GGUF/tree/main and compare the differences first hand.
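
If eyeballing the two outputs isn't enough, a rough way to put a number on the comparison is a per-pixel error metric. This is only a sketch: it assumes both images have already been decoded into flat lists of 0-255 channel values of the same length (the loading step depends on whatever image library you have), and `mean_abs_diff` and `psnr` are helper names made up here.

```python
import math


def mean_abs_diff(a, b):
    """Average absolute difference between two equal-length pixel sequences."""
    if len(a) != len(b):
        raise ValueError("images must have the same number of values")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def psnr(a, b, peak=255):
    """Peak signal-to-noise ratio in dB; higher means the images are closer."""
    if len(a) != len(b):
        raise ValueError("images must have the same number of values")
    mse = sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)
    if mse == 0:
        return math.inf  # identical images
    return 10 * math.log10(peak ** 2 / mse)


# Hypothetical pixel values standing in for the full-precision and Q8 outputs.
full_precision = [0, 128, 255, 64]
quantized = [0, 130, 250, 64]
print(mean_abs_diff(full_precision, quantized), psnr(full_precision, quantized))
```

A single number won't tell you whether the differences are ugly or harmless, so it complements looking at the images rather than replacing it.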
>>
>>107466978
PLEASE don't double space, you're wasting lines/space. It hurts the eyes and is difficult to read.
>>
>>107466978
Pretty sure this is comparing model compression and quantization effects on quality, not FP8 inference effects on quality.
>>
i miss schizo anon
>>
>>107466978
So I guess it'll save me at most 2-3GB in model weights, and that I will pay heavily in quality during quantization. If I'm understanding this right, the 3090 might still be a safer bet compared to the 5070 Ti especially since my final pipeline is something like
>SDXL-likes, 1024×1024 base, long prompts ~1500 tokens, upscaling w/ R-ESRGAN 4× by 2x, 2-4 ADetailer passes, 2–4 LoRAs, layer diffuse, and sometimes ControlNet
So activation memory would still be the bottleneck
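
Rough sketch of the arithmetic behind that 2-3GB guess. The 2.6B parameter count for SDXL's UNet is an assumption (a commonly quoted figure, not something stated in this thread), and this counts weights only: no activations, VAE, text encoders, LoRAs, or ControlNet.

```python
def weights_gib(params_billion, bytes_per_param):
    """Approximate size of model weights in GiB for a given precision."""
    return params_billion * 1e9 * bytes_per_param / 2 ** 30


SDXL_UNET_PARAMS_B = 2.6  # assumption: commonly quoted UNet size, not from the thread

fp16 = weights_gib(SDXL_UNET_PARAMS_B, 2)  # 16-bit weights, 2 bytes each
fp8 = weights_gib(SDXL_UNET_PARAMS_B, 1)   # 8-bit weights, 1 byte each
saved = fp16 - fp8

print(f"fp16: {fp16:.2f} GiB, fp8: {fp8:.2f} GiB, saved: {saved:.2f} GiB")
```

Under that assumption the saving lands inside the 2-3GB range, which is why the rest of the pipeline (upscaling, ADetailer passes, ControlNet) matters more for peak memory than the weight format does.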
>>
>>107467717
>>SDXL-likes, 1024×1024 base, long prompts ~1500 tokens, upscaling w/ R-ESRGAN 4× by 2x, 2-4 ADetailer passes, 2–4 LoRAs, layer diffuse, and sometimes ControlNet

I'd imagine this workflow goes well beyond your current 8GB VRAM. Take the VRAM and RAM usage numbers before you hit generate and see how much higher both rise as the workflow progresses. If the differences added together are still under 16GB and you like to game a bit on the side then you can go with the 5070 Ti. That will allow you to take advantage of more recent frame generation tech for gaming and also have FP8 support. If gaming isn't on your radar then just go with the 3090.
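
The rule of thumb above, written out as a sketch. The function names and the sample readings are hypothetical; the 16GB threshold and the two cards come from the post, and you'd plug in whatever your own monitoring tool reports.

```python
def extra_memory_gb(vram_before, vram_peak, ram_before, ram_peak):
    """Sum of how far VRAM and system RAM rose while the workflow ran."""
    return (vram_peak - vram_before) + (ram_peak - ram_before)


def pick_card(extra_gb, plays_games, small_card_vram_gb=16.0):
    """Post's heuristic: the 16GB card only if the workflow fits and you game."""
    if extra_gb < small_card_vram_gb and plays_games:
        return "5070 Ti"
    return "3090"


# Hypothetical readings in GB, taken before generating and at the peak.
extra = extra_memory_gb(vram_before=1.0, vram_peak=7.8, ram_before=6.0, ram_peak=11.5)
print(f"{extra:.1f} GB extra ->", pick_card(extra, plays_games=True))
```

Note that on an 8GB card the VRAM reading saturates and the overflow shows up in system RAM, which is why both deltas are added together.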
>>
gm
>>
File: 00001-1771981203.jpg (783 KB, 1728x1344)
>>
/ldg/ died...
>>
>>
File: deCM_zi_00031_.png (2.26 MB, 2016x1152)
>>107466829
is there a comfy workflow?
>>
File: autumn night.webm (3.9 MB, 1920x640)
>>
File: deCM_zi_00032_.png (2.25 MB, 2016x1152)
>>107469713
awesome
this autumn mountains series is really striking
>>
File: 1765005915489-video.mp4 (897 KB, 392x848)
>>107469713
Beautiful
>>
It seems like the big labs are putting out better and better image generation models but the open source models are kind of stagnating.
>>
>>107470228
I should have posted this in the local diffusion thread.
>>
File: _2689214728.png (1.67 MB, 896x1152)
I unironically think that if a fast food restaurant came out with a cube-shaped hamburger that would sell really well.
>>
gm
>>
File: deBW_zi_00002_.png (2.15 MB, 1728x1152)
>>107470440
white castle has square 'burgers' but they're pretty ass. square buns would probably be challenging cuz of how bread fluffs when baked. would be a good gimmick though. smash burgers had a phase just cuz the novelty of flat burger

>>107470548
gm. cool gen
I had a huge lego collection as a kid. I made some cool ass shit
>>
>>107470606
lego gens have been mostly a failure so far. it can render bricks well enough but can't into lego logic so
>>
>>
What
A
Shithole
>>
File: deBW_zi_00003_.png (2.13 MB, 1728x1152)
>>107470806
have you ever done lsd
>>
but enough about the other thread
>>
File: deBW_zi_00004_.png (2.48 MB, 1920x1152)
>>107470929
after the whole tower of babel thing, is god mad about ai-powered universal translators? we're kinda dunking on him
>>
>>107470960
I'm curious what you think the Tower of Babel story is about considering the fact that you had a minor in religion.
>>
File: deBW_zi_00005_.png (2.2 MB, 1920x1152)
>>107470976
philosophy and religion, with a focus on philosophy. I'm pretty OK with christian theology but dont claim any kind of deep expertise

my understanding of the story of babel was that humans once all spoke a single language and in their unity were capable of incredible feats. using this prowess, they set to build a tower in babel. a tower so tall that they believed it could reach heaven itself. offended by the arrogance of man, god confused their language. now divided, humanity broke apart, never to work in unity again

yet now, we approach a moment of unified language. not through divine providence or holy attunement, but through the arrogance of man yet again. perhaps god will scramble our brains again
>>
File: z-img_00065_.png (1.84 MB, 1024x1536)
>>
>>107471053
That's not quite it, but I see how you could come to that conclusion. The story has nothing to do with language.
>>
File: deBW_zi_00007_.png (2.42 MB, 1920x1152)
>>107471084
elaborate
>>
Zit full model released yet?
>>
>>107471098

Nope.
>>
File: me in the middle.png (1.24 MB, 1024x1024)
>>107471111
another chinese rugpull...
>>
>>107471089
No thanks
>>
File: z-img_00069_.png (2.81 MB, 1536x1536)
>>
>>107471078
Cute mummy
>>
File: z-img_00051_.png (1.56 MB, 1024x1536)
>>107471171
Thanks :) But that's a weird way to eat soup =.="
You using z-image model for that?
>>
File: 00014-2537186798.jpg (2.5 MB, 2048x2560)
>>
File: deBW_zi_00009_.png (2.17 MB, 1920x1152)
>>107471232
shes not eating it. she's complaining she didnt get egg

>>107471302
gm
farquaad monke
>>
File: z-img_00087_.png (2.67 MB, 1536x1536)
>>107471317
That's a valid concern. Egg is a hefty part of that meal
>>
>>107466703
>"forcing any art style not in the prompt. forcing tags. forcing training data. not accurate. AI art filters. ai training data filters"

This is an insane negative prompt even for something like Nano Banana. What does "forcing tags" mean to the text encoder of a free perchance model?
>>
File: 00015-210183503.jpg (2.94 MB, 2048x2560)
>>107471317
gm
>>
>>107471317
nice robots.
>>
File: deBW_zi_00010_.png (2.38 MB, 1920x1152)
>>107471402
the runes are very cool. how do you prompt for them?

>>107471435
you're only praising the manager bots because you're hoping they'll approve your PTO
>>
File: z-img_00033_.png (2.79 MB, 1536x1536)
>>107471450
runic symbol body writing mostly :) Ty
>>
>>107471232
yeah these are with z-image
>>
File: z-img_00048_.png (1.9 MB, 1024x1536)
>>107471570
Oh, yeah it's in your img name even :D
>>
File: 00016-3554093194.jpg (2.89 MB, 2048x2560)
>>
File: deBW_zi_00011_.png (2.27 MB, 1920x1152)
>>
>>107471642
You seem to get better freckles
>>
>>107471168
lovely face, catbox?
>>
File: z-img_00072_.png (2.72 MB, 1536x1536)
>>107471813
I'm using the model with no upscaling at 1536x1024.
https://files.catbox.moe/xc4n1f.png
>>107471844
https://files.catbox.moe/td16b5.png
>>
>>107471936
thanks, freckles do weird things to me
>>
File: z-img_00094_.png (2.07 MB, 1536x1536)
>>107471994
Freckles have always been a dear to me as well xD
>>
>>107471936
Sharp/10
>>
>>107472062
Lace detail could be better imo. Ty tho <3
>>
>>107471936
>>
>>107472124
Sweet :D way better wings imo
>>
one more cuz they turned out pretty good
>>
>>107472186
what toy/style did you add?
>>
Afternoon anons
>>107465509
I'm pretty sure this is just Starfield food, lmao
>>
>ga
>>
File: z-img_00041_.png (3.21 MB, 1536x1536)
>>107472351
Good afternoon :)
>>
ga
>>107472351
>>107472375
>>107472412
ga
>>
File: z-img_00116_.png (1.83 MB, 1024x1536)
>>107472443
"War, war never changes." While eyeing your laser-shark warfare.
>>
File: 5656756756.jpg (236 KB, 1024x1024)
>>
File: z-img_00120_.png (1.86 MB, 1024x1536)
>>
>>107472481
>>107472541
can't wait for that full nsfw finetune of z-image (hopefully)
>>
File: winrar.png (702 KB, 592x580)
>>107465509
>>
File: z-img_00128_.png (1.69 MB, 1024x1536)
>>107472598
its bound to happen, seeing how willing the model already is.
>>107472606
Zipping oh i mean sipping beer?
>>
>>107472481
war is hell
>>
File: z-img_00133_.png (1.63 MB, 1024x1536)
>>107472666
That it is
>>
>>107472282
https://files.catbox.moe/mi89xa.png
the whole thing is actually a dynamicprompts parameterized prompt (kinda like a function) but the important bit is

 pvc_figure, {anime_figure|scale_figure|collectible_figure|character_figure|garage_kit|anime_statue|exhibition_figure|display_model}
, {1/6|1/7|1/8} scale, shallow_depth_of_field,
__std/xl/camera/all__
masterpiece,best quality,

...

__std/xl/character/all__, attached to a base stand, support holding her up.
whole figurine is visible.

Safe for work, no nudity, no text.

camera and character are just doin shot&posing, the actual prompt replaces "..."
>>
>>
File: 00004.jpg (64 KB, 1152x896)
>>
File: z-img_00140_.png (1.6 MB, 1024x1536)
>>107472729
Haven't seen this style of prompt before. Thank you. Do you have some material to get a better understanding of this style of prompting?
>>
>>107472862
NTA, that's dynamic prompts
{a | b |c} picks one from a b or c
__wildcard__ loads dynamic choices from wildcard.txt
you need a dynamic wildcard processor (either a text node or a clip node)
see
https://github.com/adieyal/sd-dynamic-prompts
it's what i use mostly along with other nodes
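
To make the two constructs concrete, here's a toy resolver for `{a|b|c}` variants and `__wildcard__` lookups. It is NOT the dynamicprompts library or its API, just a minimal sketch of the idea: `resolve` is a made-up name, the wildcard table is an in-memory dict standing in for the .txt files, the `camera` entries are hypothetical, and features like weights or multi-pick are not handled.

```python
import random
import re

VARIANT = re.compile(r"\{([^{}]*)\}")       # innermost {a|b|c} group
WILDCARD = re.compile(r"__([\w/]+?)__")     # __name__ or __path/to/name__


def resolve(template, wildcards, rng):
    """Replace every wildcard and variant in the template with one random pick."""
    def pick_wildcard(match):
        return rng.choice(wildcards[match.group(1)])

    def pick_variant(match):
        options = [opt.strip() for opt in match.group(1).split("|")]
        return rng.choice(options)

    text = WILDCARD.sub(pick_wildcard, template)
    # Repeat so that nested variants resolve from the inside out.
    while VARIANT.search(text):
        text = VARIANT.sub(pick_variant, text)
    return text


# Hypothetical wildcard contents; a real setup would load these from files.
WILDCARDS = {"camera": ["close-up", "wide shot", "from above"]}
TEMPLATE = "pvc_figure, {garage_kit|anime_statue|display_model}, {1/6|1/7|1/8} scale, __camera__"

print(resolve(TEMPLATE, WILDCARDS, random.Random(42)))
```

Passing in a seeded `random.Random` makes a given seed reproduce the same resolved prompt, which is handy when you want to re-run a gen you liked.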
>>
>>107472891
Ahh you are still on forge variants. Interesting. (not a critique just curiosity)
>>
File: file.png (134 KB, 1082x897)
>>107472862
that wildcard processor in the workflow uses https://github.com/adieyal/dynamicprompts under the hood. https://github.com/adieyal/dynamicprompts/blob/main/docs/SYNTAX.md
it's insanely powerful. i've got a stupidly complicated yaml stack, but you can use simple flat files too
>>
File: 00002.jpg (77 KB, 1152x896)
>>
File: 00001.jpg (78 KB, 1122x887)
>>
>>107472906
>>107472919
no i use (against my will) comfyui
here's the comfy version of dynamic prompts
https://github.com/adieyal/comfyui-dynamicprompts
comfy's node is not as random as the one on forge/a1111/etc from my experience. from my understanding, comfy's version processes the wildcards with the seed in a sort of cache, which doesn't get cleared unless you change the prompt itself, at which point it gets forced to re-generate (even when setting the option to 'always' regenerate), because of the way the node was made. forge's version is cleaner and regens every time.
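
A toy model of the behavior described above, to show why a cache keyed on the prompt text would feel "less random". This is purely illustrative and assumes the anon's description is right; it is not the node's actual code, and `CachingPicker` and `fresh_pick` are made-up names.

```python
import random

OPTIONS = ["red", "green", "blue"]


def fresh_pick(options, seed):
    """Re-resolve on every call: the pick follows the seed."""
    return random.Random(seed).choice(options)


class CachingPicker:
    """Resolve once per prompt text and reuse the result, ignoring later seeds."""

    def __init__(self):
        self._cache = {}

    def pick(self, prompt, options, seed):
        if prompt not in self._cache:
            self._cache[prompt] = random.Random(seed).choice(options)
        return self._cache[prompt]


cached = CachingPicker()
print({cached.pick("a {red|green|blue} car", OPTIONS, s) for s in range(20)})  # one value
print({fresh_pick(OPTIONS, s) for s in range(20)})                            # several values
```

In this model, editing the prompt text creates a new cache key, which matches the observation that changing the prompt forces a re-roll.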
>>
File: z-img_00143_.png (1.59 MB, 1024x1536)
>>107472891
ok that's what the second {} was for. Thought it was somehow tied to the first {}. Wildcards have been on my list to use xD for a while, been too lazy.
>>107472919
Thank you :)
>>
>>107472658
>>107472862
that slight ass peek, accidental I guess but happy accident for sure
>>
File: z-img_00124_.png (1.78 MB, 1024x1536)
>>107473024
A good skirt/dress is only as long as to cover all the important bits and nothing more :D
>>
>>
File: z-img_00151_.png (1.89 MB, 1024x1536)
>>107473123
Here to save the day <3
>>
>>107472961
>no i use (against my will) comfyui
a man after my own heart.
i borrowed the impact-pack wildcard processor node and made it use adieyal's base dynamicprompt library, but the impact-pack one is fine if you're just doing flat files or inline stuff (his yaml implementation is broken). i like that it actually shows you what the final resolved prompt is
>>
File: 00003.jpg (54 KB, 1152x896)
>>
>>107472961
Ahh, that's cool.
I think I mixed up dynamic prompts with some older node or something. If it's easy to use I'll take a look.
it's somewhat easy to vibecode shit too but not too interested in wildcards at this point of my 'career'.
>>
There's still some fucking memory issue with cumui despite using unload all models.
It'll oom after 10 gens or so.
>>
>>107473388
the z-image implementation is bugged
>>
>>107473334
there's wildcards {a | b | c} (pick from a,b,c)
scheduling [ from:to:when ] (switch from a to b)
and flipping [ a|b ] (switch every step)
i believe all 3 are now finally implemented in comfy as of this year, tho they dont work as well as they do in a1111/forge/etc
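
A toy illustration of the scheduling and flipping syntax, showing which text is active at a given sampler step. `prompt_at_step` is a hypothetical helper, not a function from a1111, forge, or comfy; the "fraction below 1, otherwise absolute step" reading of `when` follows the a1111 convention as I understand it, and real implementations may differ in off-by-one details and nesting support.

```python
import re

SCHEDULE = re.compile(r"\[([^\[\]:|]*):([^\[\]:|]*):(\d+(?:\.\d+)?)\]")  # [from:to:when]
ALTERNATE = re.compile(r"\[([^\[\]:]*\|[^\[\]:]*)\]")                    # [a|b]


def prompt_at_step(template, step, total_steps):
    """Return the effective prompt text at a 0-based sampler step."""
    def schedule(match):
        before, after, when = match.group(1), match.group(2), float(match.group(3))
        # Values below 1 are treated as a fraction of the run, otherwise as a step number.
        switch_step = when * total_steps if when < 1 else when
        return before if step < switch_step else after

    def alternate(match):
        options = match.group(1).split("|")
        return options[step % len(options)]

    text = SCHEDULE.sub(schedule, template)
    return ALTERNATE.sub(alternate, text)


TEMPLATE = "a [cat:dog:0.5] in [red|blue] light"
for step in (0, 4, 5, 9):
    print(step, prompt_at_step(TEMPLATE, step, total_steps=10))
```

Unlike `{a|b|c}`, which is resolved once before sampling, these two change the conditioning while the image is being denoised, so they need support inside the sampler loop rather than just a text preprocessor.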
>>
File: deTW_zi_00002_.png (2.88 MB, 1536x1664)
>>
>>107473450
Yeah it clearly is.
>>107473467
Cool. I did some research on my own but it seems like creating a node that reads a text file (instead of generating it at run time) was a dead end.
>>
File: deTW_zi_00004_.png (2.72 MB, 1536x1664)
>>
File: deTW_zi_00008_.png (2.92 MB, 1536x1664)
>>
Debo.
>>
>>107473491
nice colours
>>
i'm a nigbophile
>>
File: deTW_zi_00010_.png (3.05 MB, 1536x1664)
>>107475223
ty
I feel like she shouldn't be in the lab without proper PPE. especially considering the situation
>>
>>107473467
I'll need to try that. It was an easy way to get unique gens. With z-image being so consistent even with different seeds, it will be nice.
>>
>>
>>107475332
Hey can you give me a random prompt and I'll post the image back here?
>>
File: deTW_zi_00011_.png (2.87 MB, 1536x1664)
>>107475332
I'm so down for chubby chromagirl
>>
>>107475332
>>107475391
chromagirl aint no fattie
that's her cousin lumigirl
>>
File: deTW_zi_00013_.png (2.79 MB, 1536x1664)
>>107475421
this is a cool gen. reminds me of the end of one punch man. too bad they never made more seasons of that show
>>
https://www.youtube.com/watch?v=0TXQaGt9j8U
>>
>>
>>
File: deTW_zi_00014_.png (2.85 MB, 1536x1664)
>>
>>107475351
collapsing perspective, drifting structures, impossible scale relations,
mild above perspective, shot from the side, detail macro shot,

adult female, chubby figure, flat breasts,
low-tied sidelocks, sanpaku, ;\\), wearing cloak made from organic cotton with glow-in-the-dark accents
,

amidst a forest river,
stark Purple background,
beyond the ridge twinkling stars,
light threads connect her to unseen constellations with mysterious shadows,
chain necklace glints softly adorned with a single tear turns into mercury before falling.
>>
>>107475744
Thanks honey, let me brew it up for you.
I'll adjust it a bit.
>>
>>107475777
checked
>>
File: previeous.jpg (399 KB, 2000x1152)
Previous.
>>
>>107475744
lolwut
>>
>>
>>107475849
It felt bit boring... I'm from Gajerina. ESL.
>>
>>107475849
You are insulting others.
>>
noise
>>
>>
>>
>>
File: deTW_zi_00015_.png (2.86 MB, 1536x1664)
>>107475805
>>107475849
whats going on here

>>107475902
super cool
>>
>>107475988
thx
i have settled on some settings that seem to work ok
chroma to set scene
z at medium denoise to clean things up
chroma at highish denoise+upscale to sharpen and restore scene
24+16+24 steps
>Prompt executed in 165.84 seconds
i'll probably keep fucking with settings tho ,it's what i do.

i have a separate workflow for simple tests, for chroma, and z, and others
and i have a different one for more realistic stuff which i tend not to post from which is z -> chroma-> chroma that works well enough
>>
>>107475988
kek, schizoprompting
>>
>>
>>107476031
What is wrong with you?
>>
>>107476052
what do you mean?
>>
>>107476063
?
>>
This is why you deserve to be in your own thread. It's not a good place to be.
>>
>>
>>107476145
it sure isnt when you're around
>>
File: deTW_zi_00016_.png (2.85 MB, 1536x1664)
>>107476025
>chroma to z to chroma
does it keep both models in memory?

>>107476031
I can't stop staring

>>107476151
hello
>>
>>107476207
i believe it offloads to system ram, but i have 128gb system ram so i dont really notice other than a couple seconds pause between samplers
it's around a minute and half for both chroma samplers and 20 or so seconds for z, plus upscaler and whatever else. the upscaler i use takes around 20-30s
so almost 3mins. cant really get the more involved stuff without letting it simmer and roll for a bit
>>
>>
File: deTW_zi_00017_.png (2.84 MB, 1536x1664)
>>107476278
>128gb system ram
damn thats like $20,000 in 2025 bux
>>
>just writes over the logo
>doesnt even spell it right
based chromagirl doesnt give af

>>107476326
that's probably true, and it's not even ddr5
>>
>>107476326
>>
>>107476361
nicely done and saved
>>
File: deTW_zi_00018_.png (2.8 MB, 1536x1664)
>>107476361
dance for me, filthy clanker
>>
>>
>>
fun fact, i made my workflow modular so i can turn off any of the loaders, samplers, upscalers and what nots
gn all
>>
File: deTW_zi_00020_.png (2.92 MB, 1536x1664)
>>107476574
gn
>>
>>107476052
No bullying my discord kitten, she's trying her best to fit into society
https://suno.com/song/1c4fe1b1-ba03-4f1c-b0fa-829f397241b8
>>
File: dollar_store_schizo.jpg (1.66 MB, 1920x1080)
Dollar Store Schizo
https://suno.com/s/6zhDRdxsqtTozcU7
https://youtu.be/-SXXaoQYKSs
>>
File: deTW_zi_00021_.png (2.67 MB, 1536x1664)
>>
File: deCM_zi_00033_.png (2.57 MB, 2016x1152)
>>107477216
>Dollar Store Schizo
the gens finally make sense
>>
>>107477262
and the scales fall from their eyes...
>>
File: deCM_zi_00034_.png (2.67 MB, 2016x1152)
I hope trump says something really stupid this week cuz I've got a new suno style I wanna run it through
>>
what does she know?
>>
>>
>>
File: deBW_zi_00017_.png (2.15 MB, 1920x1152)
>>
File: deCM_zi_00035_.png (2.53 MB, 2016x1152)
>>107477554
I like the random zoomer in the back
>this sword bih sus fr not even cappin
>>
File: 0006_fps.jpg (116 KB, 1024x1024)
>>
thank god we have threads like this to remind us of just how embarrassing the state of the art really is.
>>
>>107477784
thank you for your contribution! :)
>>
File: 0005_fps.jpg (95 KB, 1152x896)
>>
File: deCM_zi_00037_.png (2.39 MB, 2016x1152)
>>107477759
>>107477800
cool

>>107477789
I'm cancelling AI
>>
>>107477789
you're welcome. try enhancing your prompts to use a multi-stage agentic visionary, design, painter pipeline.

or something equally profound etc
>>
File: 0003_fps.jpg (107 KB, 1152x896)
>>107477822
Thanks
>>
File: LC_SEK_NNS_RED_2.jpg (834 KB, 4096x4096)
Saw the awesome tower of babel Lego gen earlier. Nice. Made this lol
>>
File: deBW_zi_00018_.png (2.35 MB, 1920x1152)
>>107477855
red "stuff" is like when a company has to use dairy "product" because they cant legally call it cheese or ice cream or whatever, lol
>>
File: LC_SEK_NNS_RED_3.jpg (1.26 MB, 4096x4096)
>>107477870
I guess. The ambiguous terminology is used to show that Eisav is a worldly person void of spiritual depth and doesn't even really care what Yaakov is cooking. He just wants to stuff his face and he's willing to give up his birthright for it. I suppose you could say fake cheese lacks the spiritual depth of real cheese haha
>>
>>107477830
i will! appreciate you! :)
>>
File: deBW_zi_00019_.png (2.19 MB, 1920x1152)
>>107477896
ohh, esau and jacob. thats interesting, I was just listening to a lecture about jacob a few days ago. it skimmed over what happened between the brothers, just that they were rivals and esau caused jacob to leave
>>
File: LC_SEK_NNS_RED_1.jpg (920 KB, 4096x4096)
>>107477935
That's really weird that it was so surface level. Was it like a very liberal Christian giving the lecture or something? I feel like even a secular scholar would have delved into the underlying meaning. Not to make generalizations that are negative, but Christians generally have zero idea about any of the text in the Bible.
>>
>productanon is a secret occultist
one of us
>>
File: 0004_fps.jpg (81 KB, 1152x896)
>>107477983
Z image seems to do these figurine textures really well. I would never guess this is AI generated.
>>
File: 1753880990859865.png (6 KB, 529x81)
Next Thread

>>107477878
>>107477878
>>107477878
>>
>any religion that isn't christianity is "occultism"
You must be 18 to post here
>>
File: deBW_zi_00020_.png (2.32 MB, 1920x1152)
>>107477963
no, it just wasn't the focus of the lecture. "jacob fled" was just context given to set up the story of jacob and rachel. the premise of the lecture was describing the way the bible stories were written and how they influenced early culture. it was more sociology than theology
>>
>>107477997
NOOOOOOOO
>>
>>107477994
it's a good model, very earnest. autistic even. fixated on forms to a point where anyone rational would say "whoa hold up a sec"
>>
File: NT_SE_SODA_09.jpg (1.03 MB, 4096x4096)
>>107478000
Ah OK.
>>
File: 0001_fps.jpg (133 KB, 1152x896)
>>107478006
Yeah, I agree. This one looks very JPEG-artifacty, although I'm guessing that's something you prompted for.
>>
File: deBW_zi_00021_.png (2.24 MB, 1920x1152)
>>107478006
>>107478047
>JPEG-artifacty,
if you don't have it yet, setting shift to 5~7 with the ModelSamplingAuraFlow node helps cut down on that artifacting
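
For context on what the shift value does: to my understanding, ComfyUI's flow-model sampling nodes remap each noise level (sigma in [0, 1]) with a time-shift curve like the one below, so a higher shift spends more of the schedule at high noise. Treat the formula as an assumption about the implementation rather than a quote from it; `shift_sigma` is a name made up here.

```python
def shift_sigma(sigma, shift):
    """Remap a noise level in [0, 1]; shift=1 leaves the schedule unchanged."""
    return shift * sigma / (1 + (shift - 1) * sigma)


# Compare how the same base schedule moves under different shift values.
base = [1.0, 0.75, 0.5, 0.25, 0.0]
for shift in (1, 3, 6):
    print(shift, [round(shift_sigma(s, shift), 3) for s in base])
```

The endpoints stay fixed at 0 and 1 for any shift; only the middle of the schedule is pushed toward the noisy end.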
>>
File: 0002_fps.jpg (102 KB, 1152x896)
>>107478065
I dont use comfy
>>
>>107478047
sick neg brah. she has glasses on her butt.
>>
File: deBW_zi_00023_.png (2.47 MB, 1920x1152)
>>107478080
I was mentioning it for >>107478006
what model are you using for your gens though? it does that aesthetic really well
>>
File: debo energy fuel.png (1.46 MB, 622x1388)
>>
File: 1751974410328229.png (35 KB, 128x128)
>>
>>107478103
Nice repost


