[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: 1744997870666204.jpg (402 KB, 2130x2130)
402 KB
402 KB JPG
Previous /sdg/ thread : >>107314383

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Early Preview UI
AniStudio: https://github.com/FizzleDorf/AniStudio

>Flux.2 Dev
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/city96/FLUX.2-dev-gguf

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://tensor.art
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
>>
>>107328651
op time to remove ani, no one asked for it, no one uses it, no one cares for it
>>
>mfw Resource news

11/25/2025

>FLUX.2: Frontier Visual Intelligence
https://bfl.ai/blog/flux-2

>FLUX.2-dev-GGUF
https://huggingface.co/orabazes/FLUX.2-dev-GGUF

>FLUX.2 Day-0 Support in ComfyUI: Frontier Visual Intelligence
https://blog.comfy.org/p/flux2-state-of-the-art-visual-intelligence

>Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens
https://wakalsprojectpage.github.io/comt-website

>DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation
https://zehong-ma.github.io/DeCo

>Syn-GRPO: Self-Evolving Data Synthesis for MLLM Perception Reasoning
https://github.com/hqhQAQ/Syn-GRPO

>Learning Plug-and-play Memory for Guiding Video Diffusion Models
https://thrcle421.github.io/DiT-Mem-Web

>DiffSeg30k: A Multi-Turn Diffusion Editing Benchmark for Localized AIGC Detection
https://huggingface.co/datasets/Chaos2629/Diffseg30k

>FlowPortal: Residual-Corrected Flow for Training-Free Video Relighting and Background Replacement
https://gaowenshuo.github.io/FlowPortalProject

>Trump Launches Genesis Mission, Harnessing AI for US Energy, Science and Security Dominance
https://www.capitalaidaily.com/president-trump-launches-genesis-mission-harnessing-ai-for-us-energy-science-and-security-dominance

11/24/2025

>cc12m-1mp_plus-realistic: Filtered CC12M dataset for 1mp+ realism
https://huggingface.co/datasets/opendiffusionai/cc12m-1mp_plus-realistic

>simpletuner v3.1.3 with Kandinsky5, ACE-Step music training, and a webUI
https://github.com/bghira/SimpleTuner/releases/tag/v3.1.3

>Hunyuan 1.5 step distilled loras
https://huggingface.co/Comfy-Org/HunyuanVideo_1.5_repackaged/tree/main

>MMT-ARD: Multimodal Multi-Teacher Adversarial Distillation for Robust Vision-Language Models
https://github.com/itsnotacie/MMT-ARD

>Intervene-All-Paths: Unified Mitigation of LVLM Hallucinations across Alignment Formats
https://github.com/SooLab/AllPath
>>
>mfw Research news

11/25/2025

>ABM-LoRA: Activation Boundary Matching for Fast Convergence in Low-Rank Adaptation
https://arxiv.org/abs/2511.19145

>LumiTex: Towards High-Fidelity PBR Texture Generation with Illumination Context
https://lumitex.vercel.app

>Are Image-to-Video Models Good Zero-Shot Image Editors?
https://arxiv.org/abs/2511.19435

>Breaking the Likelihood-Quality Trade-off in Diffusion Models by Merging Pretrained Experts
https://arxiv.org/abs/2511.19434

>In-Video Instructions: Visual Signals as Generative Control
https://arxiv.org/abs/2511.19401

>Growing with the Generator: Self-paced GRPO for Video Generation
https://arxiv.org/abs/2511.19356

>SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
https://arxiv.org/abs/2511.19320

>Evaluating Dataset Watermarking for Fine-tuning Traceability of Customized Diffusion Models: A Comprehensive Benchmark and Removal Approach
https://arxiv.org/abs/2511.19316

>BideDPO: Conditional Image Generation with Simultaneous Text and Condition Alignment
https://limuloo.github.io/BideDPO

>ConceptGuard: Proactive Safety in Text-and-Image-to-Video Generation through Multimodal Risk Detection
https://arxiv.org/abs/2511.18780

>LAST: LeArning to Think in Space and Time for Generalist Vision-Language Models
https://arxiv.org/abs/2511.19261

>STCDiT: Spatio-Temporally Consistent Diffusion Transformer for High-Quality Video Super-Resolution
https://jychen9811.github.io/STCDiT_page

>FilmSceneDesigner: Chaining Set Design for Procedural Film Scene Generation
https://arxiv.org/abs/2511.19137

>When Semantics Regulate: Rethinking Patch Shuffle and Internal Bias for Generated Image Detection with CLIP
https://arxiv.org/abs/2511.19126

>Beyond Reward Margin: Rethinking and Resolving Likelihood Displacement in Diffusion Models via Video Generation
https://arxiv.org/abs/2511.19049

>VeCoR - Velocity Contrastive Regularization for Flow Matching
https://p458732.github.io/VeCoR_Project_Page
>>
>mfw MORE Research news

>One4D: Unified 4D Generation and Reconstruction via Decoupled LoRA Control
https://mizhenxing.github.io/One4D

>Learning What to Trust: Bayesian Prior-Guided Optimization for Visual Generation
https://arxiv.org/abs/2511.18919

>DiP: Taming Diffusion Models in Pixel Space
https://arxiv.org/abs/2511.18822

>ProxT2I: Efficient Reward-Guided Text-to-Image Generation via Proximal Diffusion
https://arxiv.org/abs/2511.18742

>VLM in a flash: I/O-Efficient Sparsification of Vision-Language Model via Neuron Chunking
https://arxiv.org/abs/2511.18692

>Exploring Weak-to-Strong Generalization for CLIP-based Classification
https://arxiv.org/abs/2511.18396

>Synthetic Curriculum Reinforces Compositional Text-to-Image Generation
https://arxiv.org/abs/2511.18378

>MagicWand: A Universal Agent for Generation and Evaluation Aligned with User Preference
https://arxiv.org/abs/2511.18352

>ConsistCompose: Unified Multimodal Layout Control for Image Composition
https://arxiv.org/abs/2511.18333

>Seeing What Matters: Visual Preference Policy Optimization for Visual Generation
https://arxiv.org/abs/2511.18719

>CoD: A Diffusion Foundation Model for Image Compression
https://arxiv.org/abs/2511.18706

>Now You See It, Now You Don't - Instant Concept Erasure for Safe Text-to-Image and Video Generation
https://arxiv.org/abs/2511.18684

>Edit2Perceive: Image Editing Diffusion Models Are Strong Dense Perceivers
https://arxiv.org/abs/2511.18673

>Robust Posterior Diffusion-based Sampling via Adaptive Guidance Scale
https://arxiv.org/abs/2511.18471

>Point-to-Point: Sparse Motion Guidance for Controllable Video Editing
https://arxiv.org/abs/2511.18277

>Uni-DAD: Unified Distillation and Adaptation of Diffusion Models for Few-step Few-shot Image Generation
https://arxiv.org/abs/2511.18281
>>
File: deCA_cHD_00039_.png (1.14 MB, 1613x922)
1.14 MB
1.14 MB PNG
>>
File: deCA_cHD_00040_.png (1.25 MB, 1613x922)
1.25 MB
1.25 MB PNG
>>107328855
its np, i had the news open all day and just wasnt moving through it at all. just me being extremely lazy and digging up an excuse, lol
>>
>>
>>
File: deCA_cHD_00041_.png (910 KB, 1613x922)
910 KB
910 KB PNG
>>
>>
>>
File: deCA_cHD_00042_.png (1.34 MB, 1613x922)
1.34 MB
1.34 MB PNG
>>
>>
File: deCA_cHD_00043_.png (1.13 MB, 1613x922)
1.13 MB
1.13 MB PNG
>>
File: PW_147105_.png (2.09 MB, 1280x1600)
2.09 MB
2.09 MB PNG
Good evening, anons! I hope everyone is doing well :]
>>
File: deCA_cHD_00046_.png (1.01 MB, 1613x922)
1.01 MB
1.01 MB PNG
>>107330108
hello
all the buzz is about flux2. only the mightiest of GPUs can run it
>>
File: PW_147107_.png (2.57 MB, 1280x1600)
2.57 MB
2.57 MB PNG
>>107330187
Heya, Debo!! Great to see you again :]
Ohhh sounds exciting!! I'll have to check it out!
>>
File: deCA_cHD_00048_.png (924 KB, 1613x922)
924 KB
924 KB PNG
>>107330257
there's a few links in the resource news for the gguf and comfy implementation. apart from that, I can't help, flux2 is beyond my power level
>>
File: PW_147245_.png (2.33 MB, 1280x1600)
2.33 MB
2.33 MB PNG
>>107330321
Downloading it now! I hope it works for me haha
>>
Last one from me
good night anons
>>107330108
Thanks PW, gn :)
>>
File: deCA_cHD_00049_.png (1.16 MB, 1613x922)
1.16 MB
1.16 MB PNG
>>107330377
I'm expecting a full report on my desk or you're losing your sdg pension

>>107330449
early night
gn
>>
File: PW_146655_.png (3 MB, 1280x1800)
3 MB
3 MB PNG
>>107330449
Good night, Quokkanon!! Sleep well :D
>>107330458
LOOL Yes sir! I'll have it on your desk before I go to bed!
>>
>>107326889
https://suno.com/song/5c797d02-7fb8-428c-9981-233fc37f0089
>>
File: hifi.png (2.3 MB, 1536x1024)
2.3 MB
2.3 MB PNG
>>
i miss schizo anon
>>
File: autumn river.webm (3.63 MB, 1920x960)
3.63 MB
3.63 MB WEBM
>>107324001
This is great news!
>>107326889
>https://suno.com/s/1WB2m2b0Vs3UTLxc
Cool instrumental.
>>107330482
I'm looking forward to seeing what you can gen with the new model.
>>107331069
>https://suno.com/song/5c797d02-7fb8-428c-9981-233fc37f0089
Nice remix and lyrics.
>>
File: autumn river 2.webm (3.83 MB, 1920x960)
3.83 MB
3.83 MB WEBM
>>
>>107332095
Thank you for all the nigbobumps.
>>
File: autumn river 3.webm (3.84 MB, 1920x960)
3.84 MB
3.84 MB WEBM
>>
flux.2 dev
>Lower VRAM (~24-32G) - RTX 4090 and 5090
>Lower
This makes me sad.

gm
>>
>gm
>>
Gonna need a H200 to use the flux.2 as designed.
>>
>>107328651
Remove AniStudio from OP
>>
File: 00017-2370645326.jpg (2.24 MB, 2048x2560)
2.24 MB
2.24 MB JPG
>>
>>107334086
i don't know, i kinda like the idea of anistudio, the problem is it's just too early and barely functional
>>
ComfyUI with the right custom nodes can do everything Anistudio promises but better and with more control. Why would anyone switch to an unproven, preview build?
>>
File: 1741935462698949.png (24 KB, 713x217)
24 KB
24 KB PNG
I'm a huge fucking retard and I need a big spoon feeding right into my mouth.
I literally just installed this, how do I load models?
No images I save appear to have any metadata that ComfyUI can load.
>>
>>107334086
>>107334140
Ani herself requested it to be removed from all OPs
>>
>>107334271
>4chan server filename

exif/workflow data is stripped automatically when uploaded so you're not going to find any workflow on images. The anon has to share the image via catbox.moe (or any other file uploading site) for the workflow to be present.
>>
>>107334140
Maybe in a year AniStudio will be worth the OP space, for now, it's garbage.
>>
>>107334354
proof????
>>
>>107334410
I tried it with a catbox image too and it didn't work, but I guess whoever posted it just didn't include the metadata then. So I just drag and drop the image into the UI and it should be able to read it?
>>
File: deCA_cHD_00051_.png (883 KB, 1613x922)
883 KB
883 KB PNG
>>107334086
I repeat:
you don't even post here. why do you care?
>>
>>107334495
download and drag and drop this into comfyui, it has workflow. Not my image, found elsewhere...

https://files.catbox.moe/24i01w.png
>>
>>107334510
Ani himself said to remove his UI from the OP. And this genereal must grow and evolve, not stagnate with the same old non functioning UIs,
>>
File: 1734863177532395.png (116 KB, 1242x828)
116 KB
116 KB PNG
>>107334547
>>107334594
Okay yeah that does work, if I can find some in the style/context I want that'll make it easier to figure out what kind of workflows I need.
>>
File: 00040-391254508.jpg (1.75 MB, 2048x2560)
1.75 MB
1.75 MB JPG
>>
>>107334598
It's labeled as an early preview, which should not cause confusion among new people.
>>
File: deCA_cHD_00052_.png (958 KB, 1613x922)
958 KB
958 KB PNG
>>
>>107334598
>same old non functioning UIs
I dont think so, as this anon >>107334839
said, early preview means it's actively developed and removing it will kills visibility and potential user feedback.
>>
File: 00045-1451384121.jpg (1.86 MB, 2048x2560)
1.86 MB
1.86 MB JPG
>>
File: Flux2_00014_.png (2.64 MB, 1536x960)
2.64 MB
2.64 MB PNG
flux2 with my prompts
apparently ai-toolkit can train a lora for it
will have to see over the 4 day weekend
>>
Morning anons
>>
File: deCA_cHD_00054_.png (815 KB, 1613x922)
815 KB
815 KB PNG
>>107335508
will we have to call her flux2girl?

>>107335544
gm
>>
>gm
>>
>>107335147
The other sd.cpp based uis are way better and even more popular
So by your logic we should add them all or is there a hidden reason why we shouldn't add them?
>>
File: Flux2_00015_.png (2.65 MB, 1344x1008)
2.65 MB
2.65 MB PNG
>>107335569
>will we have to call her flux2girl?
erm
chromluxgirl
or flomagirl
>>
>>107335508
Not bad.
Is that with the 4-bit version or do you rent an H200 or equivalent?
>>
File: Flux2_00016_.png (2.75 MB, 1008x1344)
2.75 MB
2.75 MB PNG
>>107335693
fp8
https://huggingface.co/silveroxides/FLUX.2-dev-fp8_scaled
regular model will run on a 5090 i believe
>>
File: Flux2_00017_.png (2.54 MB, 1344x1008)
2.54 MB
2.54 MB PNG
on the plus side, flux2 handles styles (and knows styles) better than most local models
on the negative, it seems kind of soulless. trying to get a good prompt flow going now
>>
File: Flux2_00018_.png (2.96 MB, 1536x1152)
2.96 MB
2.96 MB PNG
>>
File: Flux2_00019_.jpg (753 KB, 1536x2048)
753 KB
753 KB JPG
on the other other hand, no need to upscale
>>
File: PW_147301_.png (1.4 MB, 1024x1024)
1.4 MB
1.4 MB PNG
Good morning, anons! I hope everyone is doing well :]
>>
>>107335823
gm did you see
>>107327360
>>107327393
>>107327439
>>107327452
>>107327504
>>
File: PW_147304_.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>107335840
Heya Flux girl anon!! It's so good to see you again!
Omg these are amazing hahaha I love those!
>>
lel that chinese model everyone's talking about sure BTFO flux 2 tho, i'll have to play with that later too
>>107335873
flux2 can definitely have fun times
>>
File: deCA_cHD_00055_.png (1.16 MB, 1613x922)
1.16 MB
1.16 MB PNG
>>107335823
gm
pretty early for you
any luck with flux2 yet?
>>
File: PW_147305_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
>>107335883
I just got it working this morning haha
It took an hour and 45 mins for my first test LOL
I figured out what I did wrong tho haha
I like how you can copy the style of other images/gens! That's so cool!
>>107335946
Yeah haha I gotta go do some work stuff pretty soon but I don't think I'm gonna stay all day
These gens are done with Flux 2! :]
I just got it working like 30 mins ago haha
>>
>>107335823
gm PW
Day off?
>>
>>107335706
Well, I only have 12gb of vram... I'll have to skip until I can do an upgrade.
>>
File: deCA_cHD_00057_.png (1.39 MB, 1613x922)
1.39 MB
1.39 MB PNG
>>107335979
>I gotta go do some work stuff
you dont have tuesdays off anymore?
>These gens are done with Flux 2! :]
looks good. I kinda wanna see whats up with that mobile game in the background
>>
wait so is this the tranny hang out hugbox? i dont understand the "how is your day" posting
>>
>>107336028
comfy unloads to regular ram so you could probably fit it in tho
maybe
>>
>>107336043
you have to go back
>>
>>
File: PW_147306_.png (895 KB, 1024x1024)
895 KB
895 KB PNG
>>107336014
Good morning!! :D
Super cool gen! I love that outfit!
Kinda hahaha I gotta go shopping in a couple hours but I think that's all I plan to do today
>>107336034
I didn't for a while but I think I will again soon after the holidays!
Thanks hahaha! I like how those came out!
>>
>>107335632
Ani lurks here we should support him. Unlike other sp.cpp based UI developers who don't hang around here, Ani participates. That's why I think we should help him out.
>>
File: PW_147310_.png (1.82 MB, 1024x1024)
1.82 MB
1.82 MB PNG
>>
>>107336161
I understand, but if Julien himself wants it delisted, we should respect his wishes. No dev wants an unfinished project representing him publicly
>>
>>107336062
ty - have fun

>>107336062
the mistral clip is what's causing the oom, I'll try another workflow.
>>
>>107336287
>the mistral clip is what's causing the oom
yep, maybe there's a way to load that to cpu (if you have the ram that is). i was trying to load my usual workflow which has gemma3-27b as llm (to use as a prompt generator and pass it to flux2) but it was way too much lol
>>
>>
File: PW_147311_.png (1.7 MB, 1024x1024)
1.7 MB
1.7 MB PNG
>>
>>
>>107336263
>>107335147
Support is one thing, spoonfeeding a broken UI to newfags in the main thread list is another. If it's in the OP, people assume it's a viable alternative to Forge or Comfy and it's not there yet, maybe keep it in a rentry or a separate paste until it's functioning at least.
>>
>>107336302
>load that to cpu
yes. that was a good tip. There was a cpu option.
>loaded completely; 95367431640625005117571072.00 MB usable, 1280.59 MB loaded, full load: True
KEK - it thinks I have 9x1030 MB of VRAM. So it crashed.
>>
>>107336263 sorry, it was to him >>107336161
>>
>>107336459
>loaded completely; 95367431640625005117571072.00 MB usable
i get that all the time
>>
>>107336585
I get CUDA kernel errors
>>
File: PW_147317_.png (1.93 MB, 1024x1024)
1.93 MB
1.93 MB PNG
>>
File: deCA_cHD_00058_.png (987 KB, 1613x922)
987 KB
987 KB PNG
>>107336188
>>107336394
homeless pw is cute

>>107336643
>pw spell misfires and she turns herself into a ragdoll
>>
there's too much to keep track of today lol
>>
File: PW_147314_.png (1.86 MB, 1024x1024)
1.86 MB
1.86 MB PNG
>>107336658
Thanks!! :D
LOL! It's crazy how much like the original doll gens!
>>
File: example.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
behold, z-image (running locally)
>>
File: example.png (2.24 MB, 1152x1536)
2.24 MB
2.24 MB PNG
now to wait for a comfy workflow lol
>>
File: deCA_cHD_00059_.png (913 KB, 1613x922)
913 KB
913 KB PNG
>>107336915
z-image is finally out? I've been looking forward to trying it out after flux2 was too fat for me
>>
File: z-image-chromagirl.png (2.16 MB, 1152x1536)
2.16 MB
2.16 MB PNG
>>107336979
just out now lol
https://huggingface.co/Comfy-Org/z_image_turbo/tree/main/split_files
waiting on a workflow but i used the python script they provided for these
>>
>>107337011
>https://huggingface.co/Comfy-Org/z_image_turbo/tree/main/split_files

A Q8 quant should come in around ~7GB which will fit in a lot more GPUs than Flux.2 did.
>>
>>107337094
yeah flux2 despite the glitz may be DOA now lol
>>
>>
File: deCA_cHD_00062_.png (1.1 MB, 1613x922)
1.1 MB
1.1 MB PNG
>>107337316
looks pretty good
>>
>>107337350
it's pretty fantastic and fast as hell
>8/8 [00:05<00:00, 1.40it/s]

that was straight 1536x1152
>>
>>107337350
>>107337374
and apparently it knows porn positions and stuff, not just nsfw lol
this shit's ridiculous
>>
BIGGER
>>
File: ComfyUI_temp_prpef_00001_.jpg (1.18 MB, 3659x4878)
1.18 MB
1.18 MB JPG
i'm easily impressed
>>
File: 00046-1786240189.jpg (538 KB, 1344x1728)
538 KB
538 KB JPG
>>
>>
>>
>>
>>
>>
What's with the slopper in /sdg/? Somehow I feel like if a typical anon spams the same character it wouldn't even be much of a deal.
Here, he runs the same scene over and over again with no cohesion. It's like abstract without the abstract. It's not going to make anyone ask "what's the story?" It's just a bunch of shit mashed together with the same keywords.
>>
>>107338496
no one asked
>>
>>107338509
nor did anyone ask for that junk
>>
>>107338522
no one asked you
>>
>>107338496
enjoy your "story"
>>
>>107338496
You're going to have to be more specific, quality is dead here since nigbo chased off all the good posters
>>
File: deDL_zi_00002_.png (1.5 MB, 1344x768)
1.5 MB
1.5 MB PNG
>>107338496
>>107338595
two new models drop, all the prompters are eating good. you, on the other hand, spend your thanksgiving eve recycling boring, worn-out nogen drama baiting
>>
uh oh
>>
File: deDL_zi_00006_.png (1.24 MB, 1344x768)
1.24 MB
1.24 MB PNG
>>
File: deDL_zi_00012_.png (1.4 MB, 1344x768)
1.4 MB
1.4 MB PNG
>>107338567
have you just been using the default euler/simple or have you tried other combos?
>>
File: output.webm (1.27 MB, 380x377)
1.27 MB
1.27 MB WEBM
dear past me
pls pls pls document at least some things before you deprovision everything
thank you
future me
haha
>>
>>107339146
nah in integrated it into my regular workflow, so 24 steps z-image with deis_2m + double sigma stuff (bong tangent)
then pass it to a tile upscaler with chroma and more bong tangent for another 20 steps lel
it's overkill but it seems to be working
you're using zimage now? how fast is it for you?
>>
>>107339170
lel
>>
File: deDL_zi_00017_.png (1.19 MB, 1344x768)
1.19 MB
1.19 MB PNG
>>107339170
nice to see you
happens to the best of us
happy thanksgiving eve

>>107339178
>you're using zimage now? how fast is it for you?
yeah. very speedy. 40s at 20 steps. cranked up to 30 steps @ ~60s
>>
>>107338704
>spends all 365 days recycling boring, worn-out prompts
>>
File: output.webm (1.17 MB, 380x377)
1.17 MB
1.17 MB WEBM
>>107339248
henlo :)
there is so much wrong with my setup right now and the outputs that i do not know where to start fixing it haha
i just realized that the outputs are not even squares anymore
so much for being able to clone the exact setup to a different environment
i need time, stupid real life haha
>>
File: deDL_zi_00019_.png (1.44 MB, 1344x768)
1.44 MB
1.44 MB PNG
>>107339399
>i need time, stupid real life haha
you don't have extra down time this week?
>>
File: ComfyUI_temp_fhrxb_00030_.png (2.37 MB, 1536x1152)
2.37 MB
2.37 MB PNG
>>107339248
nice, push up the res man lol
>>
File: deDL_zi_00021_.png (1.31 MB, 1344x768)
1.31 MB
1.31 MB PNG
>>107339489
tru
>>
File: output.webm (1.16 MB, 380x377)
1.16 MB
1.16 MB WEBM
ok now stuff crashed, back to the machine room for me, gn frens haha

>>107339453
no not really
and my problem is that when i have some free time i just work more haha
oh well
>>
File: deDL_zi_00033_.png (2.56 MB, 1680x1216)
2.56 MB
2.56 MB PNG
110s

>>107339603
gn
>>
eh nm that upscaling was too much lol
i'll just gen straight at the high resolution
>>
>>
>>
>>
now that everyone can use z-image, the slop will flow
it was the best of times, it was the worst of times
>>
>>107339666
>>107339732
like these
>>107340088
winrar
>>
>>107340111
thx
>>
Also, Black Forest Lab has a playground to try the pro version.
>>
>>107340219
bfl lost this one, man
z-image just rolls over it
except the edit (z-image supposedly has an edit model coming soon(tm)
flux2 is nice, but it got mogged
>>
>>
File: deDL_zi_00034_.png (2.15 MB, 1680x1216)
2.15 MB
2.15 MB PNG
>>107340329
>bfl lost this one, man
its kind of ironic cuz flux's claim to fame was that it popped up after sd3 was too large and too bad. now flux2 gets undercut in a similar fashion
>>
File: 000000_45934_.png (2.53 MB, 1475x1106)
2.53 MB
2.53 MB PNG
Back to slowness... took 7 minutes with Q3.gguf on a 5070Ti 16gb.
>>
File: deAR_zi_00002_.png (2.76 MB, 2048x1216)
2.76 MB
2.76 MB PNG
retrying my animal racing prompt on zindex but it definitely doesn't understand at all. prob needs the negative to work
>>
File: PW_147350_.png (1.62 MB, 1024x1024)
1.62 MB
1.62 MB PNG
Hello again, anons! :]
>>
File: 00048-2224676707.jpg (1.76 MB, 2048x2560)
1.76 MB
1.76 MB JPG
>>107340961
hello
>>
File: PW_147358_.png (1.25 MB, 1024x1024)
1.25 MB
1.25 MB PNG
>>107341149
Heyyy Koff!! It's so great to see you again! I loved your new song!
>>
File: 00050-1461368342.jpg (972 KB, 1344x1728)
972 KB
972 KB JPG
>>107341216
i am glad you liked it.
i heard this song this evening, one of the nicer ones i have heard lately, sadly i doubt a.i. can ever match it:
https://youtu.be/3EBTk5brQVY
>>
File: 00052-3190651851.jpg (1.2 MB, 1344x1728)
1.2 MB
1.2 MB JPG
nite
>>
File: PW_147361_.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>107341359
Ohhh I like this! It's really mellow and unique!
I tried to make something close to the style
https://suno.com/s/5JxcUbqoPpsrBAhB
>>107341548
Good night!! Sleep well :]
>>
File: deDL_zi_00036_.png (2.44 MB, 2048x1216)
2.44 MB
2.44 MB PNG
>>107340961
hello
there's yet another new model out. people like it more than flux2: >>107337011

>>107341548
gn, sorry I missed you
>>
File: PW_147369_.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>107342075
LOOL just when I thought I caught up hahaha! I'll try it out!
>>
File: 1764213982954.jpg (1.54 MB, 1981x3000)
1.54 MB
1.54 MB JPG
Any advice for genning anything that features a couple?
Concepts from one character keep bleeding into the other character and vice-versa... I tried using "BREAK" but it doesn't seem to do much
>>
File: 000000_45936_.png (2.41 MB, 1132x1448)
2.41 MB
2.41 MB PNG
ZIMAGE, 7 seconds..we back.
>>
File: deDL_zi_00037_.png (2.63 MB, 2048x1216)
2.63 MB
2.63 MB PNG
>>107342154
lol, yeah, its kinda crazy to get two new sota models back to back. might be more surprises before the end of the year too

>>107342203
there's not really a silver bullet for prompting multiple characters. newer models can understand prompts better but will still bleed, plus they'll be much slower on average
cloud models are way ahead of local on this front. people have done really impressive composites with nano banana and gpt image
for local, you can try looking into regional conditioning or latent couple. there are some nodes/workflows but its kind of finicky and unreliable

>>107342334
you can gen an image faster than a rocketeer can kill you
>>
File: PW_147376_.png (1.14 MB, 1024x1024)
1.14 MB
1.14 MB PNG
>>107342438
Oh wow! This gens in 4 seconds! Much better than waiting 2 mins for Flux 2 hahaha!
>>
>>107342454
>>107342334
Nice!
Specs? I doubt I'll be able to run it on my 6GB laptop but it's just really impressive
>>
File: deDL_zi_00039_.png (2.64 MB, 2048x1216)
2.64 MB
2.64 MB PNG
>>107342454
you can pump the resolution a lot too. its pretty crazy

>>107342487
z-image is very small. actually smaller than flux1, I think. idk if there are quants out yet tho
>>
File: PW_147411_.png (2.09 MB, 1440x1440)
2.09 MB
2.09 MB PNG
>>107342487
I have 24gb (4090)!
I think you might be able to run it! It's way faster than anything i've used so far I think! Also the stuff you gotta download isn't too big either
>>107342526
That's one of the first things I tried hahaha! Slowly going up! This one only took 9 seconds
>>
File: PW_147417_.png (3.47 MB, 2048x1440)
3.47 MB
3.47 MB PNG
15 seconds!
Plus it got the characters right haha
>>
File: deDL_zi_00041_.png (2.86 MB, 2048x1216)
2.86 MB
2.86 MB PNG
>>107342652
>the witch mafia plans a murder
>>
File: PW_147430_.jpg (801 KB, 2048x2048)
801 KB
801 KB JPG
24 seconds! Gonna have to add a thing to save as jpg haha it got too big
>>107342663
LOL
>>
>>107342663
>All those broccoli perms
Damn, now that's hyper real
>>
File: PW_147437.jpg (2.74 MB, 2800x2048)
2.74 MB
2.74 MB JPG
Wow I don't think i've ever made gens this big haha
35 seconds
>>
>>
File: deDL_zi_00043_.png (2.97 MB, 2048x1216)
2.97 MB
2.97 MB PNG
>>107342764
lol, zoomer heavy dataset it seems
>>
>>107342790
Maybe this model knows more zoomer celebs
>>
this shit is so crazy
at least now people cant bitch about my gens since everyone's going nuts with both z and flux
>>107328651
op get rid of ani and add z-image you crazy bastard
>>
File: deDL_zi_00045_.png (2.55 MB, 2048x1216)
2.55 MB
2.55 MB PNG
>>107342926
only one guy complains about your gens and he only does it because he wants attention, not because he has valid opinions
>>
>>107342926

I'll get zimage added. It's going to look a bit weird without either docs or github.io tutorial linked and also a quant repo for now.
>>
File: PW_147448.jpg (2.15 MB, 2800x2048)
2.15 MB
2.15 MB JPG
3840x2160 takes 55 seconds but things get weird hahaha
It also slows down my pc and goes over 4mb a lot, even as a jpg
>>
>>107342963
are you just testing the limits of your gpu on how high you can gen? because if you just want high res dont go higher than like 1536 and then do an upscale with model or whatever, much easier
even if you use a slow upscaler yo've already saved what, a minute or a minute and a half anyway
>>107342961
lel i know, i was being facetious
still, everyone's mixing all kinds of shit up now, it's funny, i feel like my gens arent pushing the limits anymore
>>
>>
File: deNE_zi_00003_.png (2.1 MB, 2048x1216)
2.1 MB
2.1 MB PNG
I will need to know how z-image performs on quokka benchmarks
>>
File: PW_147412_.png (2.93 MB, 2048x1440)
2.93 MB
2.93 MB PNG
>>107342999
Yeah hahaha I just wanted to see how high I could get it with the basic workflow! It was fun to play with it :]
I'll likely play around with upscaling tomorrow, getting kinda sleepy haha
I really like this model, my only thing is that I noticed that if you gen with the same proompt over and over it looks almost the same as the one before it but i'm sure that could be easily fixed
>>107343122
It does a pretty good quokka hahaha
>>
>>107343145
yeeah people have mentioned different seeds look the same on the same prompt, but i havent seen that
then again i use 4 or 5 different seed generators on different nodes lol
>>
File: same same.jpg (345 KB, 2332x1046)
345 KB
345 KB JPG
>>107343155
>different seeds look the same on the same prompt
I definitely see that, even with fairly wildcarded prompts. that behavior is what ultimately turned me off of base flux1
>>
>>107343122
>>107343145
Gotta say it's pretty accurate to real pics (unlike flux or chroma), nice, can't wait to try it.
>>
>>107343173
there's only so many ways a news blurb can look no?
>>
Next Thread

>>107343201
>>107343201
>>107343201

>>107343194

Have you tried your group shot from the last thread in z-image yet?
>>
File: PW_147381_.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
>>107343155
Good idea haha I might have to try something similar tomorrow!
>>107343181
I hope it works well for you!! :D
I'm sure it will!
>>
File: same same same.jpg (453 KB, 2366x1002)
453 KB
453 KB JPG
>>107343194
the layout, composition and positioning, even the anchors are nearly identical between them all. there's a lot of room for variety that isnt being explored. here's the same thing with the DJ prompt, before I was able to loosen it up some
>>
i was going to skip this gen but then i saw she was making a proper thumbs down
>>107343206
oh i should lol
>>107343245
ah i see it
>>
File: deDL_zi_00046_.png (2.75 MB, 2048x1216)
2.75 MB
2.75 MB PNG
>>
File: PW_147392_.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
>>
File: PW_147462.jpg (1.89 MB, 2048x2048)
1.89 MB
1.89 MB JPG
>>
File: PW_147463.jpg (1.89 MB, 2048x2048)
1.89 MB
1.89 MB JPG
>>
File: PW_147464.jpg (1.87 MB, 2048x2048)
1.87 MB
1.87 MB JPG
>>
File: deDL_zi_00047_.png (2.75 MB, 2048x1216)
2.75 MB
2.75 MB PNG
>>
File: Trek-Picard-740x481.png (588 KB, 740x481)
588 KB
588 KB PNG
>>
File: 1752982483069295.png (36 KB, 128x128)
36 KB
36 KB PNG



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.