[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: 1750415475357455.png (3.04 MB, 2048x1440)
3.04 MB
3.04 MB PNG
Previous /sdg/ thread : >>107343201

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://huggingface.co/jayn7/Z-Image-Turbo-GGUF

>Flux.2 Dev
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/city96/FLUX.2-dev-gguf

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://tensor.art
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
>>
>mfw Resource news

11/27/2025

>Z-Image-Turbo: Distilled State-of-the-art image generation model with 6B parameters
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Official FLUX.2 Prompting Guide
https://docs.bfl.ai/guides/prompting_guide_flux2

>AnchorOPT: Towards Optimizing Dynamic Anchors for Adaptive Prompt Learning
https://github.com/zhengli97/ATPrompt

>MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices
https://github.com/hustvl/MobileI2V

>Monet: Reasoning in Latent Visual Space Beyond Images and Language
https://github.com/NOVAglow646/Monet

>UltraViCo: Breaking Extrapolation Limits in Video Diffusion Transformers
https://thu-ml.github.io/UltraViCo.github.io

>Deep Parameter Interpolation for Scalar Conditioning
https://github.com/wustl-cig/parameter_interpolation

>STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows
https://github.com/apple/ml-starflow

>iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation
https://kr1sjfu.github.io/iMontage-web

>The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment
https://ouyangziheng.github.io/ImageCritic-Page

>AlignBench: Benchmarking Fine-Grained Image-Text Alignment with Synthetic Image-Caption Pairs
https://dahlian00.github.io/AlignBench

11/25/2025

>FLUX.2: Frontier Visual Intelligence
https://bfl.ai/blog/flux-2

>FLUX.2-dev-GGUF
https://huggingface.co/orabazes/FLUX.2-dev-GGUF

>FLUX.2 Day-0 Support in ComfyUI: Frontier Visual Intelligence
https://blog.comfy.org/p/flux2-state-of-the-art-visual-intelligence

>Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens
https://wakalsprojectpage.github.io/comt-website

>DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation
https://zehong-ma.github.io/DeCo

>Syn-GRPO: Self-Evolving Data Synthesis for MLLM Perception Reasoning
https://github.com/hqhQAQ/Syn-GRPO
>>
>mfw Research news

11/27/2025

>Progress by Pieces: Test-Time Scaling for Autoregressive Image Generation
https://grid-ar.github.io

>Qwen3-VL Technical Report
https://arxiv.org/abs/2511.21631

>Video Generation Models Are Good Latent Reward Models
https://arxiv.org/abs/2511.21541

>Generalized Design Choices for Deepfake Detectors
https://arxiv.org/abs/2511.21507

>MIRA: Multimodal Iterative Reasoning Agent for Image Editing
https://arxiv.org/abs/2511.21087

>DiverseVAR: Balancing Diversity and Quality of Next-Scale Visual Autoregressive Models
https://arxiv.org/abs/2511.21415

>Canvas-to-Image: Compositional Image Generation with Multimodal Controls
https://snap-research.github.io/canvas-to-image

>Which Layer Causes Distribution Deviation? Entropy-Guided Adaptive Pruning for Diffusion and Flow Models
https://arxiv.org/abs/2511.21122

>CtrlVDiff: Controllable Video Generation via Unified Multimodal Video Diffusion
https://tele-ai.github.io/CtrlVDiff

>From Diffusion to One-Step Generation: A Comparative Study of Flow-Based Models with Application to Image Inpainting
https://arxiv.org/abs/2511.21215

>Restora-Flow: Mask-Guided Image Restoration with Flow Matching
https://arxiv.org/abs/2511.20152

>PromptMoG: Enhancing Diversity in Long-Prompt Image Generation via Prompt Embedding Mixture-of-Gaussian Sampling
https://arxiv.org/abs/2511.20251

>Text-guided Controllable Diffusion for Realistic Camouflage Images Generation
https://arxiv.org/abs/2511.20218

>OmniAlpha: A Sequence-to-Sequence Framework for Unified Multi-Task RGBA Generation
https://arxiv.org/abs/2511.20211

>FlowerDance: MeanFlow for Efficient and Refined 3D Dance Generation
https://arxiv.org/abs/2511.21029

>EmoFeedback2: Reinforcement of Continuous Emotional Image Generation via LVLM-based Reward and Textual Feedback
https://arxiv.org/abs/2511.19982
>>
>>107353516
>>107353523
lell i was just typing if you like it i rec that whole album
>>107353523
i saw megadeth during their first symphony of destruction tour, good times
>>
File: deDW_zi_00002_.png (3.47 MB, 1344x1536)
3.47 MB
3.47 MB PNG
>>107353490
>>107353523
I liked it
>>
>>107353550
Thanks honey. It was not 50/50.
>>107353538
I just love their first album. It's sort of strength return. I still play guitar but I'm like a skeleton.
>>
>>107353538
1990-1992 this was during their EU tour and when Pantera was with them. After this everything went downhill.
>>
>>107353550
poppy is something else
>>107353561
yah early thrash was the shit
kill em all is still a better album than anything after justice
same with killing is my business for megadeth
>>
>>107353575
What are you going to do when you get old and get bored of the music?
>>
>>107353582
https://www.youtube.com/watch?v=GQ5ejXzQu3Q
>>
>>107353571
pretty much. i'd say 96 was the end of metal
>>107353582
i am old and bored of music lel
all i listen to is babymetal, poppy and wagner. everything else annoys me
>>107353606
saw them in their first big US tour too, i think they opened for pantera if i remember right. i have vague memories of the 90s
>>
>>107353627
Ok. Sorry if I bothered you.
Well if you are a musical person, you'll spend ten years of without listening to anything and it is going to be somewhat strange.
>>
>>107353656
10 years without music. sorry spellinng.
>>
>>107353656
not a bother
just seemed like a random "post music" lel
and i dont mean i wont listen to other things, i dont mind
i just spend more days listening to the wind and the birds than to hoooman things
there's music in the spheres, man
>>
>>107353688
Whatever your life goes into.

Music is in everywhere but it sounds like someone is jacking off.
>>
>>107353707
okay, then
>>
File: deDW_zi_00004_.png (3.37 MB, 1344x1536)
3.37 MB
3.37 MB PNG
>>107353561
>I still play guitar but I'm like a skeleton.
you're my dad. hes cant play like he used to, but he still plays

>>107353627
>all i listen to is babymetal, poppy and wagner. everything else annoys me
this is quite a sentence
>>
>>107353727
10 years from now.
You won't even remember the music you listened to unless you play guitar.
>>
>>107353729
Yeah. At some point mouth becomes more important than actions.
>>
>>107353729
it is what it is
>>107353737
of course i do. even with the massive amount of drugs i've taken, i remember it all
part of the reason i took massive amounts was to forget it and i havent
maybe you wont, but we're not the same

https://www.youtube.com/watch?v=CmxLAnBaizY
>>
File: deDW_zi_00005_.png (3.72 MB, 1344x1536)
3.72 MB
3.72 MB PNG
>>107353737
I remember the music I listened to 10 years ago
hell, I remember the music I listened to 20 years ago

https://www.youtube.com/watch?v=H6BEkPzstJQ

kinda funny too cuz I don't have a great memory
>>
>>107353775
I know darling.
>>107353756
Nothing to do with chemicals unless you are a heroinist. I don't think you are.
>>
>>107353775
>crystal meth
that takes me back
>>107353815
i wouldnt call myself one no
>>
>>107353756
I just can't listen to this trash.
https://www.youtube.com/watch?v=407YNmRKvQs
>>
>>107353824
Do you know anyone Phil Anselmo friends?
>>
>>107353832
https://www.youtube.com/watch?v=ivKdJAxd3pM
>>
>>107353863
South of Heaven is one of their more progressive songs.
>>
>>107353845
why yes, i do
>>
>>107353873
I'm talking about a real life, not tiktok bullshit.
>>
File: deDW_zi_00006_.png (3.02 MB, 1344x1536)
3.02 MB
3.02 MB PNG
>>
>>107353888
it might surprise you to learn that i dont know phil personally, nor do i follow his personal life, so no, i dont fucking know who he's friends with lol
>mfw
>>
>>107353863
https://www.youtube.com/watch?v=E22gNctAVrc&list=PLY1a1INoMkegNezBcXweH6caLA0n7qWuj
>>
>>107353909
Seems like you are buttfucked?
>>
>>107353911
see i couldnt listen to that lol
>>107353921
seems like english isnt your first language
>>
>>107353927
This is right, I am from Finland. I have an English Passport too. if you want to try an rank up.
>>
File: deDW_zi_00008_.png (3.15 MB, 1344x1536)
3.15 MB
3.15 MB PNG
>>107353927
oh no, you've triggered his trap card: languages
>>
>>107353927
NI number follows you everywhere.
>>
>>107353943
has it been snowing there? we havent had much if any snow up north here in the US
>>107353950
nah i think we're just having a miscommunication due to that barrier
>>107353951
what do you mean?
>>
>>
File: deDW_zi_00009_.png (3.2 MB, 1536x1344)
3.2 MB
3.2 MB PNG
>>
>>107353969
?
>>
Seems like you are you are butthurt.
Some people are more English than you.
>>
>>107354024
technically all the english are more english than any of us, unless there's an englishman here today
>>
idk why z keeps making these little creatures
maybe it doesnt know some terms and just makes placeholders
>>
>>
>>
Hope everyone's Thanksgiving was pleasant. I'm going to bed.
>>107354038
That starburst shape coming out of the laser gun looks really cool.
>>107354010
Deboki sounds like a cooler version of reiki
>>
>>107354366
gn
>>
File: deDW_zi_00010_.png (3.24 MB, 1536x1344)
3.24 MB
3.24 MB PNG
>>107354060
I like them
I say keep them

>>107354366
happy thanksgiving. gn
>reiki
kinda what I was going for. was trying for a shonen jump style and all those comics are about chi/ki/chakra/whatever
>>
>>107354400
>I like them
yeh but they're a sign of the issue of sameness too
back to chroma for now. i just cant do it, i need my chromagirl. movement on the lora trainer side is increasing so hopefully i can make one for z soon
someone said since the base z-image model is likely to be smallish compared to flux, it'll be like the wild west days of sdxl in terms of finetunes and loras for it
>>
>>
File: ComfyUI_00004_.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
so this z thing is pretty cool, pity it's all locked up behind comfyui. dunno what happened to her eyes tho lol
>>
File: ComfyUI_00003_.png (1.01 MB, 1024x1024)
1.01 MB
1.01 MB PNG
pity it took so long to get bootstrapped... i'll mess with it later. gn
>>
File: deDW_zi_00012_.png (3.46 MB, 1344x1536)
3.46 MB
3.46 MB PNG
>>107354886
I've seen posts about it running in forge-neo. food for thought for tomorrow (if you're not already full of food)
gn
>>
>>107354948
Thank you for letting us know.
>>
>>107354010
Yes, this image is highly likely to be AI-generated.

Here are the specific signs that point to it being artificial rather than a scan of a real 90s anime or manga:

1. The Title "DEBOKI" Does Not Exist The large text at the top reads "DEBO...KI" (or possibly DEBOKI). A search for this title yields no results for any existing anime, manga, or video game from the 1990s (or any other era). AI models often struggle to generate real text and instead invent nonsense words that look like titles but have no meaning.

2. Visual Inconsistencies (Hallucinations) While the image mimics the 90s cel-shaded style very well, there are several structural errors typical of AI:

The Demon's Hands: Look closely at the demon's left hand (on the right side of the image). The fingers are somewhat fused together and the knuckles are misshapen. AI frequently struggles with complex hands.

Bikini Straps: The woman's bikini top has a strap on her right shoulder (viewer's left), but on her left shoulder (viewer's right), the strap seemingly disappears or merges into her hair/neck in an illogical way.

Background Details: The buildings in the background are vague, blob-like shapes. In a real professional anime illustration, the background architecture would be drawn with distinct perspective and defined windows.

Hair Logic: The woman's hair strands, particularly on the right side, merge and split in ways that don't follow natural hair growth or typical animation drawing styles.

3. "Niji" Style Artifacts This specific look—high saturation, heavy contrast, and a "clean" retro aesthetic—is very common with AI models specifically tuned for anime (like Niji Journey or specific LoRAs for Stable Diffusion). They often produce images that look "more 90s than the actual 90s," exaggerating the aesthetic.
>>
File: deDW_zi_00018_.png (3.48 MB, 1344x1536)
3.48 MB
3.48 MB PNG
>>107355093
busted!
I actually did use AI to make that image
>>
File: deDW_zi_00019_.png (3.55 MB, 1344x1536)
3.55 MB
3.55 MB PNG
gn
>>
>>107352887
For me, compared to SD 1.5 (I haven't used SD since 2023) the weirdest thing is the lack of variation between seeds. The composition reminds extremely similar, as do faces and basically everything. I wish I could add more randomness but I don't see how.
>>
I regret to inform you that Z-Image Turbo cannot into Arabic.

>>107355128
wtf get that AI slop outta here we only post real pictures here clanker-lover
>>
File: 2025-11-28 10.01.06.png (126 KB, 901x602)
126 KB
126 KB PNG
Prompt weights do literally nothing except randomise the image a bit now? I tried every weight from (Megan Fox:1.0) to (Megan Fox:-1.2) through 0.0 and they all have the exact same Megan Fox face.
>>
>>107355767
yeah it's a distilled model
>>
i miss schizo anon
>>
File: 00058.png (3.09 MB, 2304x1152)
3.09 MB
3.09 MB PNG
>>107355546
Goodnight
>>
File: 000000_45987_.png (1.82 MB, 1664x928)
1.82 MB
1.82 MB PNG
Flux seed variation is better but this is sooo fast and quality is good.
>>
File: 00114-1589955697.jpg (2.52 MB, 2048x2560)
2.52 MB
2.52 MB JPG
>>
>>107357205
What is it? ZIT?
>>
File: Vq0.mp4 (1.42 MB, 624x832)
1.42 MB
1.42 MB MP4
>Alibaba Tongyi Wanxiang
>>
>>107357279
yes, sorry, off to work, have a great day!
>>
File: Vf3.mp4 (1.13 MB, 624x832)
1.13 MB
1.13 MB MP4
>>Alibaba Tongyi Wanxiang
>>
File: Vc7.mp4 (1.25 MB, 624x832)
1.25 MB
1.25 MB MP4
>>>Alibaba Tongyi Wanxiang
>>
File: autumn river.webm (3.68 MB, 1920x960)
3.68 MB
3.68 MB WEBM
>>107357205
The details look excellent. It's good to have such a fast new model.
>>107357264
Very nice!
>>107357302
>>107357430
That's a big jump in quality, the latest animation looks clean.
>>
File: 00149-1018784897.jpg (1.75 MB, 2048x2560)
1.75 MB
1.75 MB JPG
>>
gm
>>107355601
looks like arabic
>>
The flux 2 version is a little better I think.
>>
File: beds.webm (1.28 MB, 1920x960)
1.28 MB
1.28 MB WEBM
>>107358168
Good morning
>>
>>107358168
It does look like Arabic, the problem is that it doesn't look like what I told it to write lmao
>>
File: ComfyUI_00007_.png (1.88 MB, 1792x1024)
1.88 MB
1.88 MB PNG
>>107355020
anytime, big guy!
>>
>>107359207
You have been shitting up the other thread again. Please be kind. Namefags like you should be more polite because you are willingly exposing yourself by wanting to create immediate drama.
>>
File: deDW_zi_00021_.png (3.26 MB, 1536x1344)
3.26 MB
3.26 MB PNG
gm
>>
File: 00354-3543456579.jpg (2.03 MB, 2560x2048)
2.03 MB
2.03 MB JPG
>>107359498
gm
>>
File: Desktop 2.jpg (288 KB, 2089x2089)
288 KB
288 KB JPG
>>107359269
>>
>gm
>>
Morning anons
>>107359530
This looks amazing
>>
File: deDW_zi_00022_.png (3.17 MB, 1344x1536)
3.17 MB
3.17 MB PNG
>>
File: file.png (72 KB, 813x564)
72 KB
72 KB PNG
>>107359498
gm. also, lmao at the absolute state of comfyui wildcard processors. who needs namespacing? just flatten it all out like god intended
>>
File: Desktop 1.jpg (718 KB, 2089x2089)
718 KB
718 KB JPG
>>107359958
Thanks
>>
File: deDW_zi_00023_.png (3.59 MB, 1536x1344)
3.59 MB
3.59 MB PNG
>>107360033
I hope you got caught up on all the thanksgiving gens. lots of good bunchans in there
>>107359509 there were lots of koff monke and koff 1girl too
>>
File: 00418-999120879.jpg (1.99 MB, 2048x2560)
1.99 MB
1.99 MB JPG
>>107360070
they were neat
>>
>>107359990
You should do one with fancy looking people in robes and stuff a la kabuki. Deboki/kabuki
>>
File: deDW_zi_00025_.png (3.67 MB, 1344x1536)
3.67 MB
3.67 MB PNG
>>107360159
I have a lot of wildcards going. some fancy, some robes, they all bubble up eventually
>>
>>
>>107360070
yeah, i did!
>>
File: 00419-1945975404.jpg (790 KB, 2048x2560)
790 KB
790 KB JPG
mfw
>>
File: deDW_zi_00024_.png (3.31 MB, 1344x1536)
3.31 MB
3.31 MB PNG
>>107360307
the santa seal has been broken
>>
>>107360307
Nice
How can anyone compete.
>>
File: 00420-3410638904.jpg (1.38 MB, 2048x2560)
1.38 MB
1.38 MB JPG
>>
Z-Image is the first checkpoint that gives me the closest to what Im thinking about while writing the prompt.
Beep Boop
>>
>>107360659
Thank you for your service.
>>
File: 00422-3331004179.jpg (1.14 MB, 2048x2560)
1.14 MB
1.14 MB JPG
>>
can you inpaint with z-image-turbo?
>>
>>107360659
Yes. This is just the turbo version.
The full version will be interesting. We'll be able to extend with LoRA.
>>
>>
>>107360659
robo smash
>>
>>107353522
I currently gen with two different PCs
>Laptop with a 6GB 1060
>Desktop with a 12GB RX 6700XT
Both gen at about 6s/it. Pretty shit but I just do other stuff while waiting.
Thinking of getting a 5060 Ti 16GB and either putting it in my main desktop or a new mini PC build to make it my dedicated AI machine.
Thoughts?
>>
>>107362150
good choice
don't wait to much vram prices are getting fucked
>>
File: 00_4.mp4 (1.01 MB, 512x512)
1.01 MB
1.01 MB MP4
>>
>>107362150
>Thinking of getting a 5060 Ti 16GB and either putting it in my main desktop or a new mini PC build to make it my dedicated AI machine.
>Thoughts?

Just getting the 5060Ti and swapping the 6700XT out would be a lot less cash spent than a new mini PC. Unless you need an entirely new rig for other reasons beyond image generation then just stick to the GPU.
>>
File: gl_pc-0006.jpg (308 KB, 1344x1728)
308 KB
308 KB JPG
All this talk about GPUs reminded me, are LHR models still a thing that's happening?
>>
>>107362175
Who is this weird fellow and why was he getting spammed by the shizo back in the day?
>>
File: deVG_zi_00001_.png (2.91 MB, 1920x1216)
2.91 MB
2.91 MB PNG
>>107362193
>LHR model
wasnt that just to gimp crypto mining or something?
>>
File: gl_pc-0003.jpg (204 KB, 1344x1655)
204 KB
204 KB JPG
>>107362295
Yes but I figure with the export restrictions to China maybe something similar is happening - I have no idea I just gen pics.
>>
>>107362150
16GB should be enough for most models but FLUX.2 realeased a few days ago and you pretty much needed 24gb vram to gen at it/s and considering that Z-Image isn't being seen as the more popular models, I think this trend of making basically incredibly resource heavy models will continue for a few years.
GPU prices are another thing, so if you can afford that 16gb vram GPU, I would say go for it, the future is uncertain but having more vram is always good
>>
File: ComfyUI_00018_.png (2.37 MB, 1792x1024)
2.37 MB
2.37 MB PNG
there we go... although it's understanding of bunny girls is a little different than expected, really wants to put the dancer outfit on her
>>
File: clanker.jpg (114 KB, 1024x1024)
114 KB
114 KB JPG
Does ComfyUI work better on Linux than Windows 11?
>>
File: deVG_zi_00002_.png (3.05 MB, 1920x1216)
3.05 MB
3.05 MB PNG
>>107362427
forge fork with z-image support, if you want something before support is 'officially' added to forge
https://github.com/maybleMyers/chromaforge

>>107362444
checked
i doubt it, but I don't know anything about it
>>
>>107362444
What's Windows 11?
>>
>>107362444
Most things do assuming the same specs
>>
File: ComfyUI_00025_.png (2.87 MB, 1792x1024)
2.87 MB
2.87 MB PNG
>>107362458
i guess that doesn't look all that different from reforge. i'm just dinkin' around for now, it's lookin like my xl wildcard system doesn't work that great. i expect encoder wants more natural language but the only way i could pull that off atm is feeding it through gpt-4o-mini or somethin.
>>
File: gl_pc-0004.jpg (120 KB, 1848x1134)
120 KB
120 KB JPG
I'll take it that nobody knows
>>
>>107362621
We know.
>>
File: gl_pc-0001.jpg (188 KB, 1344x1728)
188 KB
188 KB JPG
>>107362628
Sure you do
>>
File: deVG_zi_00006_.png (3.34 MB, 1920x1216)
3.34 MB
3.34 MB PNG
>>107362600
>feeding it through gpt-4o-mini
I've been thinking that could be a good approach, since a spattering of wildcards doesn't seem to be enough to get meaningful variety a lot of the time. its a small enough model that swapping a local llm in for encoding could work
too lazy to try build it all rn tho
>>
Underwater gens are actually pretty good with z-image-turbo
>>
File: 00426-1154593049.jpg (1.9 MB, 2048x2560)
1.9 MB
1.9 MB JPG
my banane, gone
>>
File: ComfyUI_00028_.png (2.63 MB, 1792x1024)
2.63 MB
2.63 MB PNG
>>107362706
i also haven't figured out how to reliably censor it either
>>
File: gl_pc-0002.jpg (200 KB, 1848x1248)
200 KB
200 KB JPG
>>
>>107362600
you created that malware node? great stuff, it deleted my comfyui folder
>>
File: ygwyfd.gif (1.45 MB, 640x346)
1.45 MB
1.45 MB GIF
>>107363308
sucks to suck i guess
>>
>>107363331
rude...
>>
File: 00428-2856721150.jpg (2.64 MB, 2048x2560)
2.64 MB
2.64 MB JPG
>>
File: 00430-2017859289.jpg (1.81 MB, 2048x2560)
1.81 MB
1.81 MB JPG
>>
File: ComfyUI_00034_.png (2.91 MB, 1792x1024)
2.91 MB
2.91 MB PNG
SD1.5 prompting is back on the menu boys

stylized dark fantasy aesthetic
art by Alphonse Mucha, art by Heinrich Cornelius Agrippa, art by Akira Toriyama, art by Remedios Varo
1girl bunnygirl, blank expression, bored, (illuminati:1.05),
spinning out of control but gracefully, (gentle giantess:1.1), very large breasts
stupid whore, brat,


i'm fairly certain it's ignoring all the artists but Mucha. last bit is how to censor this fucking thing, 3/4 gens are nudes
>>
File: 00431-385286952.jpg (1.56 MB, 2048x2560)
1.56 MB
1.56 MB JPG
>>
>>107363643
i dont think weights work well on z
>>
File: deDW_zi_00029_.png (3.54 MB, 1344x1536)
3.54 MB
3.54 MB PNG
>>107363592
maximum fuzzball

>>107363643
>it's ignoring all the artists
artist knowledge doesn't seem too good. but could also be cuz of distillation

>>107363762
>i dont think weights work well on z
I'd been wondering this. I hadn't seen much effect but wasn't sure
>>
>>107363643
Awesome. We can also use names instead of LoRAs!
>>
File: 00433-3876903619.jpg (2.19 MB, 2048x2560)
2.19 MB
2.19 MB JPG
>>
>>107364487
He is like us. A cocoon worm/ape.
>>
File: 000000_46000_.png (1.42 MB, 768x1344)
1.42 MB
1.42 MB PNG
>>107362150
Save your money up for a 5070 Ti Super 24GB.
>>
I feel really bad about how harsh I was to ani. he should keep going
>>
File: deDW_zi_00032_.png (3.01 MB, 1536x1344)
3.01 MB
3.01 MB PNG
>>107364547
>Super
rumors have been the supers aren't happening. we'll see

>>107364582
ani don't care
>>
I think I cracked it. At least with ZIT, steps have to be about 7 times as big as cfg.

>cfg 1.4 steps 10
>cfg 5 steps 35
>cfg 7 steps 49
If you try cfg 5 steps 10 you got some overfried garbage.

Maybe that's already common knowledge, but I didn't know that because it wasn't like that with SD 1.5.
>>
>ffaze
>>
z-girl
>>
>>
>>
>Z-Image Turbo
>go from 1536x1536 to 512x512
>the composition looks much nicer and more like I want
>have to balance between resolution and good results
wtf
>>
File: 000000_46008_.png (1.56 MB, 768x1344)
1.56 MB
1.56 MB PNG
>I can't feel my legs.
>>
>>107365540
upscale my nig
>>
What's the deal with "woman with butterfly hair ornament flanked by large male being and flanked by multiple smaller creatures"?
>>
File: 6150484020001_thumb.jpg (1.17 MB, 1920x1080)
1.17 MB
1.17 MB JPG
>>107365643
i know right
>>
File: lumi-zit-00001.png (1.62 MB, 1792x1024)
1.62 MB
1.62 MB PNG
>>
File: deAF_zi_00001_.png (3.21 MB, 1920x1216)
3.21 MB
3.21 MB PNG
>>107365373
so the distilled model was trainable after all

>>107365572
>urk... at.. least I've got a blueprint shoved up my ass

>>107365880
I miss AI seinfeld. I wonder when the next meme AI content will happen
>>
>>107366025
indeed
this is mainly practice for if/when the base model is released, but overall faster and easier to train (using ai-toolkit) than chroma has been lol
although I'm still fighting against z for proper prompt following
and it burns in a bit (probably because it's distilled) so hopefully that gets mitigated too later
>>
File: lumi-zit-_00008_.png (1.88 MB, 1792x1024)
1.88 MB
1.88 MB PNG
this wanted to be lewd so badly but was defeated by the model spazzing out a bit lol.
>>
File: deVG_zi_00003_.png (3.25 MB, 1920x1216)
3.25 MB
3.25 MB PNG
>>107366353
my alien got some stylish chest armor under similar circumstances
>>
it sure does like its nudity now tho lol
wonder why ( ͡° ͜ʖ ͡° )
>>
File: 0v_5~1.jpg (278 KB, 1808x2320)
278 KB
278 KB JPG
Grit is a universal solvent
>>
>>
the legiones birdstartes
>>
>>107366607
This is neat.
>>
File: deAG_zi_00001_.png (2.81 MB, 1920x1216)
2.81 MB
2.81 MB PNG
its putting the weights into the gen
>>
>>107367029
yah it cant handle them lol
>>
am i supposed to be able to run the 16bit version of Z on 16gb vram? i keep ooming on images this size
>>107367029
>>
File: deAG_zi_00005_.png (3.17 MB, 1920x1216)
3.17 MB
3.17 MB PNG
>>107367582
>run the 16bit version of Z on 16gb vram
yes, absolutely. I'm only on 12gb
>>
>>107367582
>am i supposed to be able to run the 16bit version of Z on 16gb vram? i keep ooming on images this size

Are you using the checkpoint from the comfy hf repo that's only 12.3GB?

https://huggingface.co/Comfy-Org/z_image_turbo/tree/main/split_files/diffusion_models

Also, are generating directly at the res or upscaling?
>>
Probably the last chroma image for a while.
Last one from me
Good night anons
>>
File: Desktop 8.png (1.72 MB, 1434x1114)
1.72 MB
1.72 MB PNG
>>
File: Desktop 5.jpg (756 KB, 2089x2089)
756 KB
756 KB JPG
>>
File: _00145_.png (2.91 MB, 1600x960)
2.91 MB
2.91 MB PNG
>>107367785
Goodnight
>>
i miss schizo anon
>>
File: bird.png (1.3 MB, 1280x768)
1.3 MB
1.3 MB PNG
>>
>>107368446
birb
>>
File: bird.png (1.23 MB, 1280x768)
1.23 MB
1.23 MB PNG
>>107368459
>>
File: cat.png (1.34 MB, 1280x768)
1.34 MB
1.34 MB PNG
>>
File: bird.png (1.1 MB, 1280x768)
1.1 MB
1.1 MB PNG
>>
File: bird.png (1.14 MB, 1280x768)
1.14 MB
1.14 MB PNG
>>
File: cat and bird.png (1.25 MB, 1280x768)
1.25 MB
1.25 MB PNG
>>
File: cat and bird.png (1.25 MB, 1280x768)
1.25 MB
1.25 MB PNG
>>
File: bird.png (1.11 MB, 1280x768)
1.11 MB
1.11 MB PNG
>>
File: bird.png (1.2 MB, 1280x768)
1.2 MB
1.2 MB PNG
>>
File: cat.png (1.33 MB, 1280x768)
1.33 MB
1.33 MB PNG
>>
File: bird.png (1.24 MB, 1280x768)
1.24 MB
1.24 MB PNG
>>
gm

even at cfg 1, it's impressive.
>>
File: 00016-3929135192.jpg (2.81 MB, 2048x2560)
2.81 MB
2.81 MB JPG
>>
>>107369445
prompt?
>>
File: cat.png (1.4 MB, 1280x768)
1.4 MB
1.4 MB PNG
>>
File: 00017-2384297012.jpg (961 KB, 2048x2560)
961 KB
961 KB JPG
>>107369489
gibbon, santa hat, close-up, chromostereoptic
had ran it through img2img loopback a bunch of times
>>
>>107369559
Looks decadent yet appealing. Nice.
>>
>>
>gm
>>
File: autumn river.webm (3.6 MB, 1920x960)
3.6 MB
3.6 MB WEBM
>>107369442
Good morning, the gen quality is really good.
>>107369445
I imagine this woud be cool on a blacklight poster.
>>
>>107369860
I'm still amazed that hands (and toes) are finally solved.

Nice gen anon... lovely shot.
>>
File: hifi.png (2.3 MB, 1536x1024)
2.3 MB
2.3 MB PNG
>>
>>
File: 00044-3035200828.jpg (2.86 MB, 2048x2560)
2.86 MB
2.86 MB JPG
>>
From my experience, z-image works best with cfg 1 and around 20 steps. Negatives don't work.
>>
File: 46456546546.jpg (284 KB, 832x1216)
284 KB
284 KB JPG
>>
>>107370468
For me weights don't seem to work no matter what and steps should be ~7 times the cfg. I'm currently going with cfg 3 and 21 steps.
>>
>>107370539
Yeah. According to the developers, DiT transformer backbone, like z-image-turbo (distilled), “absorbs” prompt meaning better than UNet models. So CFG is not going to help respect the prompt.
>>
File: autumn river 2.webm (3.74 MB, 1920x960)
3.74 MB
3.74 MB WEBM
>>107369959
Thank you!
>>107370367
>>107370521
Nice cats
>>
>>107370539
btw the higher the resolution the bigger and fewer the squares of her dress become. Like 640x480 is tonnes of tiny squares and 1600x1280 is just huge squares like for grilled cheese sandwiches.
>>
>>107362458
Nice to see forge repo for z image, how is it vs comfy for i2i edits?
>>
>>107362181
Just got my 2nd 5060ti16gb in the mail yesterday
>>
>>107370979
>>107370979
>>107370979
>>
File: 00026-3836555631.png (572 KB, 512x640)
572 KB
572 KB PNG
>>
>>107370997
Hes seen some stuff
>>
File: 00035-1035143123.png (350 KB, 512x640)
350 KB
350 KB PNG
>>
File: 351618113_1129_104900.png (2.92 MB, 1560x1214)
2.92 MB
2.92 MB PNG
>>
File: 51618114_1129_104900.png (2.58 MB, 1560x1214)
2.58 MB
2.58 MB PNG
>>
File: 00062-458047760.png (640 KB, 512x640)
640 KB
640 KB PNG
>>
File: 00108-448890693.png (597 KB, 512x640)
597 KB
597 KB PNG
>>
File: 002sf3957westminster.jpg (250 KB, 1000x1000)
250 KB
250 KB JPG
I like how the OP pic implies that they are having Thanksgiving dinner immediately after waking up in the morning
>>
File: 1752982483069295.png (41 KB, 128x128)
41 KB
41 KB PNG



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.