/g/ - Technology

Previous /sdg/ thread : >>109155083

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux.2 Dev/Klein
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/black-forest-labs/FLUX.2-klein-4B
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Anima
https://huggingface.co/circlestone-labs/Anima

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/aco/csdg/
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/gif/vdg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vp/napt
>>>/vt/vtai

OP https://rentry.co/twkuk8tz
>>
>shithole general
>>
>>109170403
You solved a captcha to post this here
>>
File: 00045-973614839.jpg (1.54 MB, 2592x2016)
1.54 MB JPG
>>
>>
>mfw Resource news

06/30/2026

>OmniDance: Multimodal Driven Dance Video Generation with Large-scale Internet Data
https://github.com/AMAP-ML/OmniDance

>SAFE-DiT: Semantics-Aware Fast-path Execution for High-Resolution Diffusion Transformers
https://github.com/xuanhuayin/SAFE-DiT

>EcoVideo: Entropy-Orchestrated Video Generation Paradigm in Cloud-Edge Dynamics
https://github.com/IF-LAB-PKU/EcoVideo

>See Only When Needed: Context-Aware Attention Intervention for Mitigating Hallucinations in LVLMs
https://github.com/Iris1946/CAI

>Spanning the Visual Analogy Space with a Weight Basis of LoRAs
https://research.nvidia.com/labs/par/lorweb

>Krea 2 LoRA Trainer
https://github.com/CaptainGrock/Krea2Trainer

>Ideogram JSON Captioner Kit - making ID4 datasets slightly less painful
https://github.com/Adudeguyman/Ideogram-fantastic-upgraded-captioning-kit

06/29/2026

>Krea 2 Base & Turbo — NVFP4 / FP8 / MXFP8 / INT8 / ConvRot INT8
https://huggingface.co/Winnougan/Krea-2-Base-Turbo-NVFP4-FP8-INT8

>Local Dream 2.8.0 with Anima support
https://github.com/xororz/local-dream/releases/tag/v2.8.0

>OSOR: One-Step Diffusion Inpainting for Effect-Aware Object Removal
https://github.com/Zhouqm-Git/osor

>Diffusion Model Attribution via Spectral Coupling of Denoiser Responses
https://github.com/Pragati-Meshram/SGS

>OrthoTryOn: Geometric Orthogonalization for Conflict-Free Unified Fashion Generation
https://github.com/NJU-PCALab/OrthoTryOn

>CSD: Content-aware Speculative Decoding for Efficient Image Generation
https://github.com/aderfebr/CSD

>Dismantling Pathological Shortcuts: A Causal Framework for Faithful LVLM Decoding
https://github.com/Cc2021start/Fox

>Extra CFG++ Samplers
https://github.com/xxiiyu/extra_cfgpp

>VNCCS 3.0 release
https://github.com/AHEKOT/ComfyUI_VNCCS/releases/tag/3.0.0

>forgeModelPatch: Add ZImage and Anima to Forge
https://github.com/croquelois/forgeModelPatch

>Flux2-Klein-9B-True-V3
https://huggingface.co/wikeeyang/Flux2-Klein-9B-True-V3
>>
>mfw Research news

06/30/2026

>Nemotron-Labs-Diffusion-Image: Advancing Masked Discrete Diffusion for High-Resolution Image Synthesis
https://arxiv.org/abs/2606.29814

>Intermediate Text Representation Guided Text-to-Image Generation for Enhancing One-and-Only Alignment
https://basedoun-won.github.io/one-and-only-ir-guidance

>Your Data Manifold is Secretly a Reward Model: Shell-LCC for Text-to-Video Generation
https://arxiv.org/abs/2606.30248

>Mural: Transferring LLM knowledge to image generation via Mixture-of-Transformers
https://arxiv.org/abs/2606.29013

>Concept Removal Guidance: Evidence-Calibrated Negative Guidance for Safe Diffusion Sampling
https://arxiv.org/abs/2606.29801

>Goku: A Million-Scale Universal Dataset and Benchmark for Instruction-Based Video Editing
https://arxiv.org/abs/2606.30599

>Illuminating Unified Multimodal Model for Free-form Interleaved Text-Image Generation
https://arxiv.org/abs/2606.30054

>MuseBench: Benchmarking Intent-Level Audiovisual Arts Understanding in MLLMs
https://musebench.github.io

>Rigel: Self-Distilled Score Adaptation for Image and Video Captioning Evaluation
https://arxiv.org/abs/2606.29997

>MAVIN: Multi-Shot Audio-Visual Generation with Narrative Control
https://arxiv.org/abs/2606.29473

>DreamForge-World 0.1 Preview: A Low-Compute Real-Time Controllable World Model
https://trydreamforge.com

>ScaleErasure: Inference-Time Minimal Intervention for Precise Concept Erasure in Next-Scale Autoregressive Image Generation
https://arxiv.org/abs/2606.29282

>The Human Creativity Benchmark
https://arxiv.org/abs/2606.30561

>What Color is the Sky (for a non-human)?
https://arxiv.org/abs/2606.28912

>W4A4 Quantization for Inference on Wan2.2-I2V-A14B
https://arxiv.org/abs/2606.29337

>Self-Evolving Agentic Image Restoration via Deliberate Planning and Intuitive Execution
https://arxiv.org/abs/2606.28971

>StackingNet: Collective Inference Across Independent AI Foundation Models
https://arxiv.org/abs/2602.13792
>>
File: debo_ccg_fia_00095_.png (1.99 MB, 1792x977)
1.99 MB PNG
>>
File: 00497-2657103393.png (1.53 MB, 1152x896)
1.53 MB PNG
>>109170649
darn, missed the extra finger, another upscale fail
>>
File: debo_vn_fia_00088_.png (2.3 MB, 1792x977)
2.3 MB PNG
>>
File: 00053-4199553907.png (2.69 MB, 1872x2736)
2.69 MB PNG
>>
>>
>>
File: debo_vn_fia_00093_.png (2.19 MB, 1792x977)
2.19 MB PNG
>>109170960
please refrain from pooping on the floor
>>
>>109170975
it's a big cat
>>
File: 00055-3753280650.jpg (1.15 MB, 1872x2736)
1.15 MB JPG
>>
File: debo_vn_fia_00100_.png (2.21 MB, 1792x977)
2.21 MB PNG
>>
>>
File: debo_vn_fia_00103_.png (2.33 MB, 1792x977)
2.33 MB PNG
>>
>>
File: debo_vn_fia_00108_.png (2.37 MB, 1792x977)
2.37 MB PNG
>>
>>
File: output.mp4 (3.79 MB, 1024x1280)
3.79 MB MP4
>>
>>
>>
>>
File: debo_sf_k2_uv_00034.jpg (3.65 MB, 6192x2580)
3.65 MB JPG
>>109172174
another banger
>>
>>
File: debo_sf_k2_uv_00038.jpg (3.13 MB, 6192x2580)
3.13 MB JPG
>>
File: tux_1.jpg (283 KB, 1664x2148)
283 KB JPG
Sorry I didn't get to reply to everyone in the last thread. Thanks for all the happy birthdays. It was a pretty good one.
>>
>>109173591
I love this, more penguin gens please.
>It was a pretty good one.
How come anon?
>>
File: pixel-0000-2219724565.png (790 KB, 2048x2048)
790 KB PNG
>>
i miss schizo anon
>>
File: output.mp4 (3.75 MB, 1024x1024)
3.75 MB MP4
>>
File: 000000_76306_.png (2.84 MB, 946x1715)
2.84 MB PNG
G'mornin Anons,
>>
>gm
>>
File: 00003-3072589208.jpg (1.61 MB, 2016x2592)
1.61 MB JPG
>>109175082
good morning
>>
>>109175082
>>109175123
>>109175147
gm
>>
gm

Happy Canada Day to our Canadian anons
>>
>>
File: 00006-2489859305.png (3.1 MB, 1344x1728)
3.1 MB PNG
>>
>>
File: 000000_76317_.png (2.61 MB, 937x1698)
2.61 MB PNG
>>109175123
>>109175147
>>109175203
Gm,
>>
File: 00009-3648761485.png (2.64 MB, 1344x1728)
2.64 MB PNG
>>
File: 00010-772701660.png (1.69 MB, 1344x1728)
1.69 MB PNG
>>
File: 00015-1679206714.png (1014 KB, 1344x1728)
1014 KB PNG
>>
>>109175774
Thank you for all the nigbobumps.
>>
>>
File: pixel-0002-1904913965.png (392 KB, 2048x2560)
392 KB PNG
>>
First time trying Krea 2 Turbo since ZIT, is this normal?
>prompt adherence sucks, the waifu is wearing dark clothes instead of wearing revealing gold squares linked together by twine and she's not holding the banana between her feet as prompted
is that a censorship thing? Do I need an analgated model?
>prompt variance between seeds is minimal, just like ZIT
>>
>>109176089
Nopely no you need either a lora or a custom node to enhance its prompt adherence
>https://huggingface.co/Beinsezii/Krea-2-Turbo-Projector-Scale-LoRA-Diffusers
>https://github.com/nova452/ComfyUI-Conditioning-Rebalance
They are all a bit different and stuff, haven't tried the new version of this node
>>
>>109176089
>>>/g/ldg
>>
>>
>>109176089
you want the analgaped model
>>
>>
File: debo_sf_k2_uv_00044.jpg (3.72 MB, 6192x2580)
3.72 MB JPG
gm
>>109175207
happy canada day. may all your milks be bagged and police be mounted
>>
>>109176331
gm
>>
File: output.mp4 (3.73 MB, 1536x1024)
3.73 MB MP4
>>
>gm
>>
File: debo_sf_k2_uv_00046.jpg (2.93 MB, 6192x2580)
2.93 MB JPG
>>109176404
do you control the color on these or is that a random effect of the loopback?
>>
File: 00362-1810160783.png (587 KB, 768x512)
587 KB PNG
>>109176461
just a random effect
>>
File: 00020-776188329.png (2.8 MB, 1248x1824)
2.8 MB PNG
>>
>>109176461
what did you go with for upscale?
>>
File: debo_sf_k2_uv_00047.jpg (3.43 MB, 6192x2580)
3.43 MB JPG
>>109176661
just the rtx sr single pass you suggested, mostly because I wasn't getting any bang for the buck with other stuff I experimented with
next thing I was gonna try is seeing if PID does better but I haven't gotten around to testing it
>>
>>109176707
yeh i've been looking into PID but it's so finicky (wants only resolutions smaller than the model label and adds/generates color tints more often than not). the flux1 pid processor or whatever seems best at not making too much noise. for me only rtx leaves the source relatively clean when upscaling compared to most upscaler models/other ways of upscaling. i really dislike the "upscale and re-sample with low denoise" method too. just upscale what i give you damn you.
also i still dont know what "melted" means lel
>>
File: melt.jpg (424 KB, 2452x1284)
424 KB JPG
>>109176754
>also i still dont know what "melted" means lel
it's easiest to notice when zooming in on the metal paneling and shapes. instead of sharp lines, the textures and edges have this warping effect. looks almost painted, in a way.
>>
>>109176823
ah i see
>>
File: 00023-2517181230.png (1.81 MB, 1280x1024)
1.81 MB PNG
>>
File: Sans titre.png (1.97 MB, 928x1120)
1.97 MB PNG
>>109176641
>>
File: 00024-872148162.png (1.95 MB, 1280x1024)
1.95 MB PNG
>>109176880
amazing
>>
>>109176823
Yeah this is clearly tensor core related latent washback.
>>
File: debo_sf_k2_uv_00052.jpg (3.65 MB, 6192x2580)
3.65 MB JPG
>>109176957
impossible. I just brought my tensors into the shop for a tune up
>>
It's kind of tiresome how asking for a painting tends to give very round faces with any model.
>>
File: Real-CUGAN-se_0701_124601.jpg (207 KB, 2467x1472)
207 KB JPG
>>109173796
OK :)
>how come
How come what? How come it was a good birthday? I guess because nothing bad happened, lol
>>
File: pixel-0004-3789299552.png (625 KB, 2560x2048)
625 KB PNG
>>
>>109176988
maybe try specific painter names, although yes, round faces, and very similar painting styles are the norm
>>
File: pixel-0006-3073565282.png (526 KB, 2560x2048)
526 KB PNG
>>
>>
Morning anons
>>
File: pixel-0007-4048624339.png (430 KB, 2560x2048)
430 KB PNG
>>109177118
morning
>>
File: debo_sf_k2_uv_00053.jpg (3.52 MB, 6192x2580)
3.52 MB JPG
>>109177118
gm
big win for mexico
>>
File: 00033-437657129.png (2.32 MB, 1344x1728)
2.32 MB PNG
>>
File: debo_sf_k2_uv_00061.jpg (3.78 MB, 6192x2580)
3.78 MB JPG
>>
File: pixel-0010-2922140655.png (327 KB, 2560x2048)
327 KB PNG
>>
>>
>>
>>
File: 00041-571367536.png (2.65 MB, 1344x1728)
2.65 MB PNG
relatable
>>
>>109178190
mfw
>>
poop'in rn
>>
>>
>>109177301
It was indeed, England next Sunday
>>
File: debo_sf_k2_uv_00064.jpg (3.7 MB, 6192x2580)
3.7 MB JPG
>>
>>
>>
>>
>>
File: debo_sf_k2_uv_00069.jpg (3.77 MB, 6192x2580)
3.77 MB JPG
>>
File: comfyui_00006_.png (161 KB, 384x384)
161 KB PNG
>>
File: comfyui_00007_.png (315 KB, 512x512)
315 KB PNG
>>
File: comfyui_00008_.png (428 KB, 768x512)
428 KB PNG
about as big an image as i can gen with krea2, oh well
>>
>>
File: comfyui_00011_.png (583 KB, 768x512)
583 KB PNG
>>
>>
File: debo_sf_k2_uv_00075.jpg (3.79 MB, 6192x2580)
3.79 MB JPG
>>109179360
how long does it take?
>>
File: comfyui_00017_.png (504 KB, 512x768)
504 KB PNG
>>109179601
a minute and 50 seconds, at this size, 9 steps. not very long by my standards. when i tried 768x768 it froze up my pc at the vae decode step
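
For scale: 1 minute 50 seconds over 9 steps is roughly 12 seconds per step, and 768x768 has 1.5x the pixels of 512x768. A minimal sketch of that arithmetic, with made-up helper names. Treating the pixel ratio as a stand-in for how much more the VAE decode has to hold at once is an assumption, not a measurement.

```python
def seconds_per_step(total_seconds: float, steps: int) -> float:
    """Average wall-clock time per sampling step."""
    if steps <= 0:
        raise ValueError("steps must be positive")
    return total_seconds / steps


def pixel_ratio(size_a: tuple[int, int], size_b: tuple[int, int]) -> float:
    """How many times more pixels size_b has than size_a."""
    return (size_b[0] * size_b[1]) / (size_a[0] * size_a[1])


# Numbers quoted in the post: 1 min 50 s, 9 steps, 512x768 vs 768x768.
total = 1 * 60 + 50
print(round(seconds_per_step(total, 9), 1))  # 12.2
print(pixel_ratio((512, 768), (768, 768)))   # 1.5
```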
>>
File: debo_sf_k2_uv_00076.jpg (3.1 MB, 6192x2580)
3.1 MB JPG
>>109179636
there's a bunch of gguf options if you wanna try to find a better fit
https://huggingface.co/molbal/krea2-gguf
>>
File: comfyui_00018_.png (449 KB, 512x768)
449 KB PNG
>>109179655
guess i'll try them
>>
>>109179667
that's a great spaceship
>>
File: comfyui_00001_.png (469 KB, 512x768)
469 KB PNG
>>109179655
well, that was aggravating. when i tried to use the gguf unet loader, it gave an error, not recognizing the krea2 format, i guess. claude wasn't able to help me fix it, and comfyui seemed broken afterwards, getting oom errors using the prior setup, so had to reinstall and redownload everything. i shall not mess with gguf further.
>>109179689
prompt was 'a small spaceship floating in space, shaped like a wedge, sleek, shiny, chrome. in the background there is a brilliant nebula. high contrast digital photo.'
>>
>>109179985
dont be a gguf
get the fp8 or preferably (if using nvidia later than 30xx) an int8
also fp8 encoders or int8 to minimize ram usage, and something like sage attention too
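
Rough arithmetic behind that: weight memory is about parameter count times bytes per parameter, so fp8 or int8 weights take about half the space of fp16. A sketch only. The 12 billion parameter count is a placeholder, not Krea 2's actual size, and it ignores activations, the text encoder and the VAE.

```python
# Bytes used to store one weight at each precision.
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "bf16": 2.0, "fp8": 1.0, "int8": 1.0}


def weight_gib(num_params: float, dtype: str) -> float:
    """Approximate size of the model weights alone, in GiB."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


# Hypothetical model size, chosen only for illustration.
HYPOTHETICAL_PARAMS = 12e9

for dtype in ("fp16", "fp8", "int8"):
    print(dtype, round(weight_gib(HYPOTHETICAL_PARAMS, dtype), 1), "GiB")
```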
>>
File: comfyui_00002_.png (466 KB, 768x512)
466 KB PNG
>>109179999
sadly, and nice numbers, i have just a poor gtx 1080. i am using the fp8 scaled of the model and the clip. my impoverished hardware is at its limit, unfortunately
>>
>>109180013
damn
i think 1060 is the lowest i've seen so you have a small advantage over the worst
>>
File: comfyui_00004_.png (481 KB, 768x512)
481 KB PNG
>>
File: comfyui_00005_.png (424 KB, 768x512)
424 KB PNG
>>
>>
File: comfyui_00007_.png (437 KB, 512x768)
437 KB PNG
>>
File: comfyui_00009_.png (481 KB, 512x768)
481 KB PNG


