/g/ - Technology


Previous /sdg/ thread : >>108893212

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux.2 Dev/Klein
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/black-forest-labs/FLUX.2-klein-4B
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Anima
https://huggingface.co/circlestone-labs/Anima

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/aco/sdg
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/gif/vdg
>>>/h/hdg
>>>/r/realistic+parody
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vp/napt
>>>/vt/vtai

OP https://rentry.co/twkuk8tz
>>
>mfw Resource news

05/25/2026

>PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion
https://research.nvidia.com/labs/sil/projects/pid

>ComfyUI custom node for NVIDIA PiD
https://github.com/Merserk/ComfyUI-PiD

>One-Forcing: Towards Stable One-Step Autoregressive Video Generation
https://aurora-edu.github.io/one-forcing

>Composing People Together: Iterative Pose-Image Generation for Multi-Person Interaction Scenes
https://cornell-vailab.github.io/PeopleComposer

>DFSAttn: Dynamic Fine-grained Sparse Attention for Efficient Video Generation
https://github.com/jessica-hujie/DFSAttn

>SCOPE: Simulating Cross-game Operations in Playable Environments for FPS World Models
https://z2tong.github.io/SCOPE

>Multimodal Distribution Matching for Vision-Language Dataset Distillation
https://andyj1.github.io/mdm

>FeatherOps: Fast fp8 matmul on RDNA3
https://github.com/woct0rdho/ComfyUI-FeatherOps

>Self-Teaching Autoencoder
https://github.com/the-puzzler/leautoencoder

05/24/2026

>L2P: Unlocking Latent Potential for Pixel Generation
https://huggingface.co/tsolful/Z-Image-L2P-INT8

>MooshieUI: Beginner-friendly interface for ComfyUI
https://github.com/Mooshieblob1/MooshieUI

05/23/2026

>Klein Tiled Upscaler for ComfyUI
https://github.com/Gavr728/ComfyUI_KleinTiledUpscaler

>Anima AI Character & Artist search engine with 49,000 sample images
https://animadex.net

>ComfyUi-Untwisting-RoPE (Training-Free Style Transfer)
https://github.com/BigStationW/ComfyUi-Untwisting-RoPE

>LongCat-Video-Avatar-1.5
https://huggingface.co/meituan-longcat/LongCat-Video-Avatar-1.5

>IMG Dataset Refiner v4.3
https://github.com/NyxAwroo/IMG-Dataset-Refiner/releases/tag/v4.3

>Sulphur-2-base
https://huggingface.co/SulphurAI/Sulphur-2-base

05/22/2026

>[real] Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models
https://github.com/microsoft/Lens

>L2P: Unlocking Latent Potential for Pixel Generation
https://nju-pcalab.github.io/projects/L2P
>>
>mfw Research news

05/25/2026

>DrawVideo: Generating Long Video from Storyboard Keyframe Sketches
https://arxiv.org/abs/2605.23508

>CoMoGen: COntrollable MOtion Dynamics and Interactions with Mask-Guided Video GENeration
https://arxiv.org/abs/2605.22996

>LaMo: Self-Supervised Latent Motion Priors for Physical Realism in Video Generation
https://lamo-ai.github.io

>EM-Vid: Training-Free Entity-Centric Memory for Efficient and Consistent Multi-Shot Video Generation
https://arxiv.org/abs/2605.23610

>EvalVerse: Pipeline-Aware and Expert-Calibrated Benchmarking for Professional Cinematic Video Generation
https://arxiv.org/abs/2605.23271

>Efficient One-Step Diffusion Restoration Model with Compact Token Compression and Linear Attention
https://arxiv.org/abs/2605.23451

>Leveraging Foundation Models for Causal Generative Modeling
https://arxiv.org/abs/2605.23861

>Occlusion-Aware Physics-Semantic Keyframe Selection for Robust Video Editing
https://arxiv.org/abs/2605.23192

>Vision Transformers Need Better Token Interaction
https://arxiv.org/abs/2605.23868

>VDE: Training-Free Accelerating Rectified Flow Model via Velocity Decomposition and Estimation
https://arxiv.org/abs/2605.23381

>VINS-120K: Ultra High-Resolution Image Editing with A Large-Scale Dataset
https://arxiv.org/abs/2605.23518

>Commutator-Induced Uncertainty in VAEs
https://arxiv.org/abs/2605.23449

>Precise: SDE-Consistent Stochastic Sampling for RL Post-Training of Flow-Matching Models
https://arxiv.org/abs/2605.23522

>Transcoders Trace Visual Grounding and Hallucinations in Vision-Language Models
https://arxiv.org/abs/2605.22902

>Coloring the Noise: Adversarial Sobolev Alignment for Faithful Image Super Resolution
https://arxiv.org/abs/2605.23264

>Good Token Hunting: A Hitchhiker's Guide to Token Selection for Visual Geometry Transformers
https://zsh2000.github.io/good-token-hunting.github.io

>Not Too Generative, Not Too Discriminative: Human Alignment Sweet Spot
https://arxiv.org/abs/2605.23819
>>
File: debo_tm-m_anima1_00055_.png (2.17 MB, 1792x977)
2.17 MB PNG
mind: blown
>>
File: 4534654654.gif (2.7 MB, 1280x768)
2.7 MB GIF
>>
File: o_00066_.png (2.35 MB, 1920x1080)
2.35 MB PNG
>>
File: o_00067_.png (2.18 MB, 1920x1080)
2.18 MB PNG
>>
File: debo_tm-m_anima1_00056_.png (2.01 MB, 1792x977)
2.01 MB PNG
>>108904851
I hear a dance beat to this
>>
File: o_00068_.png (2.25 MB, 1920x1080)
2.25 MB PNG
>>
File: o_00069_.png (2.11 MB, 1920x1080)
2.11 MB PNG
>>
File: debo_tm-m_anima1_00058_.png (2.06 MB, 1792x977)
2.06 MB PNG
>>108904851
>>108904979
https://suno.com/s/O8pHgfykhBCBgAqp
>>
monkee
https://suno.com/song/66a995f0-de8c-4d7c-a4a4-3bc6407cca85
>>
File: debo_tm-m_anima1_00059_.png (2.12 MB, 1792x977)
2.12 MB PNG
>>108905176
bmp stands for banger music productions
>>
File: o_00077_.png (2.15 MB, 1920x1080)
2.15 MB PNG
>>
>>108905176
that style can just do anything can't it? protean
>>
File: o_00078_.png (2.3 MB, 1920x1080)
2.3 MB PNG
>>
File: o_00079_.png (2 MB, 1920x1080)
2 MB PNG
>>
File: debo_tm-m_anima1_00061_.png (1.93 MB, 1792x977)
1.93 MB PNG
>>
File: o_00085_.png (1.15 MB, 1152x896)
1.15 MB PNG
>>
File: o_00088_.png (1.12 MB, 1152x896)
1.12 MB PNG
>>
File: debo_tm-m_anima1_00062_.png (1.95 MB, 1792x977)
1.95 MB PNG
>>
File: debo_tm-m_anima1_00065_.png (2.11 MB, 1792x977)
2.11 MB PNG
>>
File: o_00090_.png (2.8 MB, 1920x1080)
2.8 MB PNG
>>
File: o_00091_.png (2.47 MB, 1920x1080)
2.47 MB PNG
>>
File: o_00092_.png (2.94 MB, 1920x1080)
2.94 MB PNG
>>
File: debo_tm-m_anima1_00067_.png (2.28 MB, 1792x977)
2.28 MB PNG
>>
File: o_00093_.png (2.49 MB, 1920x1080)
2.49 MB PNG
>>
>>108904583
I'm not seeing anima being qualitatively better than illustrious. Why are people so excited?
>>
File: 000000_71633_.png (3.34 MB, 1043x1567)
3.34 MB PNG
>>
File: o_00095_.png (2.47 MB, 1920x1080)
2.47 MB PNG
>>
File: o_00097_.png (2.65 MB, 1920x1080)
2.65 MB PNG
>>
File: debo_tm-m_anima1_00069_.png (2.24 MB, 1792x977)
2.24 MB PNG
>>
File: 000000_71720_.png (3.37 MB, 1035x1553)
3.37 MB PNG
>>108904657
Literally!:D
>>108906625
Nice shot
>>108906967
Nice
>>
File: o_00103_.png (2.53 MB, 1920x1080)
2.53 MB PNG
>>
>>108906996
thx
are you using some strange node? that low level noise looks a bit too harsh
>>
>>108906997
*"doctors"
>>
File: heryago.png (641 KB, 1920x941)
641 KB PNG
>>108907037
3 upscales in one workflow. experiment'n. :D
>>
File: 000000_71730_.png (3.19 MB, 1035x1577)
3.19 MB PNG
>lowered steps to 4 and 4, no seedvr2 upscale.
>>
>>108905351
https://suno.com/song/2ca15b19-6cb5-425b-a839-b1d2372b6577
>>
File: debo_tm-m_anima1_00073_.png (1.97 MB, 1792x977)
1.97 MB PNG
>>
>>108907061
>>108907197
you werent using seedvr on the screenshot tho
also remove latent upscale, see what happens
>>
>>108907199
welcome back drama queen
>>
File: nope.png (89 KB, 512x584)
89 KB PNG
>>108907238
Very nice.
Correct, I have seedvr2 as its own workflow.
I'll give that a try.
I also got rid of the scale image down after tiling.
>>
>>108907296
what is the image model you use?
>>
File: klein9b.png (148 KB, 1483x829)
148 KB PNG
>>108907320
Klein works good, has lots of nodes to fool with.
>>
>>108907341
ah that makes sense now, klein does produce that "noisy" texture a lot
try lowering that lora a bit too, to like 90 or 80
but it's more klein at fault lel
>>
File: debo_tm-m_anima1_00081_.png (2.46 MB, 1792x977)
2.46 MB PNG
gettin high in hydroponics
>>
gn all
>>
File: debo_tm-m_anima1_00087_.png (2.13 MB, 1792x977)
2.13 MB PNG
>>108907642
gn
>>
File: debo_gr_anima1_00107_.png (2.39 MB, 1792x977)
2.39 MB PNG
>>
I'm new at this stable diffusion stuff. I'm using ComfyUI.

So far, I've iterated towards a workflow with base gen / inpainting / upscale steps, where I save / load images between steps as needed. Sometimes I load the image from the end of inpainting back into the start of inpainting, to lock in one fix and then do another.

Between each of the steps, I have VAE Decode and Encode. I've been wondering how much of an issue that is. Would I see an improvement if I worked directly with latents, and only converted to a bitmap image at the very end?
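
A rough way to put a number on this, as a sketch only: push an image through several encode/decode round trips and track PSNR against the original. The `toy_encode` / `toy_decode` pair below is a made-up lossy stand-in (pair averaging plus interpolation), not a real VAE, and it is deliberately built so the loss grows each pass. The harness `round_trip_psnr` is the part worth keeping; plug in whatever actually calls your VAE to see how it behaves.

```python
import math

def psnr(a, b, peak=255.0):
    """Peak signal-to-noise ratio between two equal-length pixel lists."""
    mse = sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)
    return math.inf if mse == 0 else 10 * math.log10(peak ** 2 / mse)

# Hypothetical stand-in for a lossy codec. NOT a real VAE.
def toy_encode(pixels):
    # Halve the resolution by averaging neighbouring pairs.
    return [(pixels[i] + pixels[i + 1]) / 2 for i in range(0, len(pixels) - 1, 2)]

def toy_decode(latent):
    # Double the resolution by interpolating towards the next value.
    out = []
    for i, v in enumerate(latent):
        nxt = latent[i + 1] if i + 1 < len(latent) else v
        out += [v, (v + nxt) / 2]
    return out

def round_trip_psnr(pixels, encode, decode, trips):
    """PSNR against the original after each of `trips` encode/decode passes."""
    current, scores = list(pixels), []
    for _ in range(trips):
        current = decode(encode(current))
        scores.append(psnr(pixels, current))
    return scores

image = [i * 4 for i in range(64)]  # simple 1-D gradient as test data
scores = round_trip_psnr(image, toy_encode, toy_decode, trips=3)
print([round(s, 1) for s in scores])  # lower PSNR = further from the original
```

If the numbers from a real VAE barely move over the handful of round trips a typical workflow has, the extra decode/encode steps are probably not worth worrying about.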
>>
Is there a way to prompt so it draws more interesting and creative angles around people or subjects? Almost always, stuff is shown from the front or from the side, sometimes from the back, and at eye level.

Any way to convince SD to draw stuff from an angle, without having to specify the angle itself?
>>
File: o_00104_.png (2.57 MB, 1080x1920)
2.57 MB PNG
>>
File: o_00106_.png (2.22 MB, 1920x1080)
2.22 MB PNG
>>
i miss schizo anon
>>
>>108908471
you can try something like: camera captures medium close-up angled slightly high from side profile capturing full figure
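
Another option, if the goal is variety without picking the angle by hand: let a script choose the angle phrase at random per gen, the same idea as wildcard prompts. Minimal sketch below; the phrase list and the base prompt are made up for illustration, and how strongly any given model reacts to each phrase is not something this checks.

```python
import random

# Hypothetical phrase list (common photography terms); edit to taste.
CAMERA_ANGLES = [
    "low angle shot",
    "high angle shot",
    "dutch angle",
    "bird's-eye view",
    "worm's-eye view",
    "over-the-shoulder shot",
    "three-quarter view from above",
]

def with_random_angle(prompt, rng):
    """Append one randomly chosen camera-angle phrase to the prompt."""
    return f"{prompt}, {rng.choice(CAMERA_ANGLES)}"

BASE_PROMPT = "girl reading in a library"  # placeholder subject
rng = random.Random(1234)  # fixed seed so the batch can be reproduced
batch = [with_random_angle(BASE_PROMPT, rng) for _ in range(4)]
for line in batch:
    print(line)
```

Each line goes out as its own gen, so the angle changes from image to image while the rest of the prompt stays put.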
>>
File: o_00108_.png (2.62 MB, 1920x1080)
2.62 MB PNG
>>
File: o_00110_.png (2.53 MB, 1920x1080)
2.53 MB PNG
>>
File: o_00111_.png (1.22 MB, 1152x896)
1.22 MB PNG
>>
File: o_00112_.png (1.08 MB, 1152x896)
1.08 MB PNG
>>
File: o_00115_.png (1.17 MB, 1152x896)
1.17 MB PNG
>>
>>108910231
>>108910296
lel
>>
File: o_00118_.png (1.2 MB, 1152x896)
1.2 MB PNG
>>
File: o_00119_.png (1.16 MB, 1152x896)
1.16 MB PNG
>>
File: 35425353454356354.gif (2.96 MB, 1152x896)
2.96 MB GIF
>>
File: 453465465444.gif (3.44 MB, 1152x896)
3.44 MB GIF
>>
File: 34564654546.gif (3.41 MB, 1152x896)
3.41 MB GIF
>>
File: o_00141_.png (1.17 MB, 968x1080)
1.17 MB PNG
>>
File: o_00146_.png (1.01 MB, 968x1080)
1.01 MB PNG
>>
File: o_00147_.png (954 KB, 968x1080)
954 KB PNG
prompt is:
foreground: flame of candle.
midground: girl.
background: teeth.
medium: 1970s film photo.
framing: close-up.
perspective: forced, oblique.
lighting: harsh, high-contrast.
mood: surreal, eerie, lurid, phantasmagoric, oneiric.
temperature: very cool.
.....
obviously over-prompted and hardly effective, but amusing results
>>
Morning anons
>>
File: debo_gr_anima1_00001_.png (2.23 MB, 1792x977)
2.23 MB PNG
neighbor's dog spent 4am-5am barking its lungs out. now I'm gonna spend all day tired and pissed off

>>108911421
>perspective: forced, oblique.
I wonder how instrumental this bit is

>>108911423
morning
>>
File: o_00148_.png (1.19 MB, 968x1080)
1.19 MB PNG
>>108911423
morning
>>108911443
probably not at all, i'll try a pair of gens with and without it
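
For a clean pair, it helps to build both prompts from the same field list so the only difference is the one line being tested. Quick sketch, assuming the 'key: value.' layout of the prompt posted earlier; only the fields actually shown there are included (the trailing '.....' is left out), and `build_prompt` is a helper name made up here. Keeping the seed and every other setting identical between the two gens is assumed.

```python
# Fields copied from the posted prompt, in the same order.
FIELDS = {
    "foreground": "flame of candle",
    "midground": "girl",
    "background": "teeth",
    "medium": "1970s film photo",
    "framing": "close-up",
    "perspective": "forced, oblique",
    "lighting": "harsh, high-contrast",
    "mood": "surreal, eerie, lurid, phantasmagoric, oneiric",
    "temperature": "very cool",
}

def build_prompt(fields, skip=()):
    """Join 'key: value.' lines, leaving out any keys listed in `skip`."""
    return "\n".join(f"{k}: {v}." for k, v in fields.items() if k not in skip)

with_perspective = build_prompt(FIELDS)
without_perspective = build_prompt(FIELDS, skip=("perspective",))

print(with_perspective)
print("---")
print(without_perspective)
```

The same `skip` argument works for knocking out any other field, or several at once, to see which lines actually pull weight.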
>>
File: debo_gr_anima1_00003_.png (2.3 MB, 1792x977)
2.3 MB PNG
>>
File: 56476756.gif (943 KB, 968x1080)
943 KB GIF
idk, it's different
>>
File: debo_gr_anima1_00005_.png (2.16 MB, 1792x977)
2.16 MB PNG
>>108911577
interesting
>>
File: o_00152_.png (1.25 MB, 968x1080)
1.25 MB PNG
changed to 'perspective: forced, oblique, vertiginous.' extra straw
>>
File: debo_gr_anima1_00007_.png (2.22 MB, 1792x977)
2.22 MB PNG
>>108911632
>vertiginous
nice word, even if I don't understand the correlation to straws
>>
File: debo_gr_anima1_00008_.png (2.12 MB, 1792x977)
2.12 MB PNG
>>
File: debo_gr_anima1_00011_.png (2.18 MB, 1792x977)
2.18 MB PNG
>>
File: o_00160_.png (1.39 MB, 1280x768)
1.39 MB PNG
>>108912092
blarg, wrong thread. oh well
>>
File: debo_gr_anima1_00012_.png (2.3 MB, 1792x977)
2.3 MB PNG


