[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: 1779059559632320.jpg (1.38 MB, 3072x2048)
1.38 MB JPG
Previous /sdg/ thread : >>108841225

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux.2 Dev/Klein
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/black-forest-labs/FLUX.2-klein-4B
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Anima
https://huggingface.co/circlestone-labs/Anima

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/aco/sdg
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/gif/vdg
>>>/h/hdg
>>>/r/realistic+parody
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vp/napt
>>>/vt/vtai

OP https://rentry.co/twkuk8tz
>>
>>108853501
my back hurts just looking at that
try making the pose yourself
>>
File: SDG_News_00103_.png (1.92 MB, 1448x1086)
1.92 MB PNG
>mfw Resource news

05/18/2026

>Lance: Unified Multimodal Modeling by Multi-Task Synergy
https://lance-project.github.io

>GridLoraTester: Workbench for character LoRA training on FLUX.2: dataset curation
https://github.com/Mandrakia/GridLoraTester

>FLUX MCP server
https://docs.bfl.ai/api_integration/mcp_integration

>Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
https://shredded-pork.github.io/Flash-GRPO.github.io

>LongLive2.0 5B BF16: AR-trained Wan2.2-TI2V-5B generator
https://huggingface.co/Efficient-Large-Model/LongLive-2.0-5B

>DealMaTe: Multi-Dimensional Material Transfer via Diffusion Transformer
https://github.com/haha-lisa/DealMaTe

>Deep Pre-Alignment for VLMs
https://github.com/THUMAI-Lab/Deep-Pre-Alignment

>Sparse Autoencoders enable Robust and Interpretable Fine-tuning of CLIP models
https://github.com/Fabian-Mor/sae-ft

>VAGS: Velocity Adaptive Guidance Scale for Image Editing and Generation
https://github.com/Harvard-AI-and-Robotics-Lab/Velocity_Adaptive_Guidance_Scale

>Neural Companion: Local desktop AI companion shell
https://github.com/Rakile/NeuralCompanion

>PixlStash 1.2: easy sharing, cleaner UI and faster background processing for your image management
https://pixlstash.dev/whatsnew.html

05/17/2026

>Comfy-mesh LTX 2.3 support — separate node + separate server GUI
https://github.com/shootthesound/comfyui-mesh#ltx-23--separate-node--separate-server-gui

>Rebels_HiDream-01_Image_Dev_NODES: Run HiDream-01 Image Dev bf16 and GGUF
https://github.com/RealRebelAI/Rebels_HiDream-01_Image_Dev_NODES

05/16/2026

>ComfyUI-Mesh Icarus & Daedalus: Split a diffusion model across two GPUs
https://github.com/shootthesound/comfyui-mesh

>Pixal3D-ComfyUI
https://github.com/Saganaki22/Pixal3D-ComfyUI

>ArXiv to Ban Researchers for a Year if They Submit AI Slop
https://www.404media.co/new-arxiv-rules-ai-generated-papers-ban
>>
>mfw Research news

05/18/2026

>DreamSR: Towards Ultra-High-Resolution Image Super-Resolution via a Receptive-Field Enhanced Diffusion Transformer
https://arxiv.org/abs/2605.15682

>ElasticDiT: Efficient Diffusion Transformers via Elastic Architecture and Sparse Attention for High-Resolution Image Generation on Mobile Devices
https://arxiv.org/abs/2605.15684

>Self-Prompting Diffusion Transformer for Open-Vocabulary Scene Text Editing via In-Context Learning
https://hongxiii.github.io/mstedit

>Echo-Forcing: A Scene Memory Framework for Interactive Long Video Generation
https://arxiv.org/abs/2605.16003

>One Pass Is Not Enough: Recursive Latent Refinement for Generative Models
https://arxiv.org/abs/2605.15309

>Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
https://arxiv.org/abs/2605.15980

>Evaluating Design Video Generation: Metrics for Compositional Fidelity
https://arxiv.org/abs/2605.16223

>Sound Sparks Motion: Audio and Text Tuning for Video Editing
https://amirhossein-razlighi.github.io/Sound_Sparks_Motion

>Tuning-free Instruction-based Video Editing Via Structural Noise Initialization and Guidance
https://arxiv.org/abs/2605.15533

>Do Less, Achieve More: Do We Need Every-Step Optimization for RL Fine-tuning of Diffusion Models?
https://arxiv.org/abs/2605.15855

>GenShield: Unified Detection and Artifact Correction for AI-Generated Images
https://arxiv.org/abs/2605.16122

>Efficient Image Synthesis with Sphere Latent Encoder
https://arxiv.org/abs/2605.15592

>Neutral-Reference Prompting for Vision-Language Models
https://arxiv.org/abs/2605.15615

>HyperDiT: Hyper-Connected Transformers for High-Fidelity Pixel-Space Diffusion
https://arxiv.org/abs/2605.15741

>Registers Matter for Pixel-Space Diffusion Transformers
https://arxiv.org/abs/2605.16147

>RaPD: Resolution-Agnostic Pixel Diffusion via Semantics-Enriched Implicit Representations
https://arxiv.org/abs/2605.15908
>>
File: debo_cs-f_anima1_00019_.png (2.74 MB, 1792x1140)
2.74 MB PNG
>>108853585
I dont want to have opinions on the posing. I wanna leave it to the AI to see what it does, good or not
>>
yeah
slop
>>
>>108853621
also that leg (on previous image)
what's up wit dat
>>
File: debo_cs-f_anima1_00016_.png (2.75 MB, 1792x1140)
2.75 MB PNG
>>108853639
anatomy isn't anima's greatest strength
>>
>>
>>108853666
lel
>>
File: debo_cs-f_anima1_00020_.png (2.65 MB, 1792x1140)
2.65 MB PNG
>>
>>
>>
>>
>>
File: debo_cs-f_anima1_00021_.png (2.9 MB, 1792x1140)
2.9 MB PNG
>>108854051
>>
>>108854089
you're not selling me anima here
looks like sd15 gen lel
>>
File: debo_cs-f_anima1_00022_.png (2.99 MB, 1792x1140)
2.99 MB PNG
>>108854113
its a cool layout and composition but, yeah, the face/hand details are rough. I might experiment with some different upscaling and genning at a higher native resolution later. I can't really find the sweet spot
>>
>>108854120
i've seen people using 50 steps lel
>>
File: debo_cs-f_anima1_00023_.png (2.49 MB, 1792x1140)
2.49 MB PNG
>>108854161
I am also using 50 steps
>>
>>108854168
ooof and yikes
>>
>>
>>
me on the bottom left
>>
>>
>>
>>108854120
what are the prompts like? u may have shared before... i've been encouraging the prompt enhancer to go full sdxl
>>
>>
>>
>>
death to clankers
>>
>>
>>
do your worst, i have already won
>>
>>
>>
>>
>>
File: debo_cs-f_anima1_00025_.png (2.52 MB, 1792x1140)
2.52 MB PNG
>>108854336
>ooof and yikes
er_sde is the recommended sampler for anima and it needs higher steps

>>108854679
>what are the prompts like?
heres the full workflow: https://files.catbox.moe/mhgxt1.png
>encouraging the prompt enhancer
I haven't been using the enhancer cuz its too slow locally
>>
>>108855031
did you try any other samplers/schedulers?
maybe start with euler + beta at like 35 steps
or res_2m
whoever recommended that appears to be wrong
>>
File: debo_cs-f_anima1_00026_.png (2.7 MB, 1792x1140)
2.7 MB PNG
>>108855041
yeah when I first started playing with anima, I tried a bunch of different combos. researched what other people seemed to be using too. there honestly wasn't a huge difference between most of them, but er_sde still slightly outperformed the match-ups
that was on beta3, so idk if its different for base1. I assumed it wasn't
>>
>>108855053
i just dont think sde samplers are good for any model post sdxl or its derivatives. from my experience with flux/chroma/z its' been either single step euler-types or multistep, not so much exponential ones
also the scheduler makes a big diff. try something like res-2m/deis-2m/abnorset with beta/beta32 if you have extra samplers/schedulers (res4lyf), or even kl-optimal/power/shift,with less steps for the multistep samplers (since they do 2+ steps per regular step)
>>
>>108855053
idk it could be the weighting, 1.4 is pretty high esp on two of them. picrel what i pulled out of the workflow but w/o any weighting. main issue is the character is super small so the detail is scuffed.
https://files.catbox.moe/oydhs9.png is run through my sdxl prompt enhancer (no weighting)
>>
>>
>>
it's just like that time i took acid in school
>>
>>
>>108855183
neat
>>
File: debo_cs-f_anima1_00027_.png (2.84 MB, 1792x1140)
2.84 MB PNG
>>108855095
I can try turning the style weights down but they tend to lose stickiness cuz the prompt is busy otherwise

>>108855101
>>108855119
nice character sheets

>>108855183
really cool composition
>>
>>108855196
chromagirl is overpowered
>>
>>108855193
>>108855196
it does a lot of neat stylish stuff, i think it's the "bold outlines, flat colors, minimal hard-edged shading". i also just noticed i have a fuckup in my rewrite block lol time to see fixing it breaks the spell
>>
File: debo_cd-a_anima1_00006_.png (2.77 MB, 1792x1194)
2.77 MB PNG
heres one with
- higher base gen
- different sampler/scheduler
- different upscaler settings
- lower weighting

doesn't really perform any better on tiny faces

>>108855241
>also just noticed i have a fuckup in my rewrite block lol
doesn't beat when I accidentally had my positive prompt plugged into my negs for a whole month before I noticed
>>
>>108855275
most models (especially small ones) will suck at small faces
see >>108855183
no details
granted that was probably on purpose but it'll have a hard time
have you done any portrait/"medium shot" types?
>>
>>108855282
sometimes it seems like these things have a maximum detail budget. these chibi ones have the advantage of being flat shading so it can deal with small faces being just eyes and a blob
>>
File: debo_cs-f_anima1_00033_.png (2.88 MB, 1792x1140)
2.88 MB PNG
>>108855282
>have you done any portrait/"medium shot" types?
yeah, ofc it can do face details when it has a lot of space to work with
you know me tho, i like the more "lived in" expansive scenes with characters tucked into the environment
>>
>>108855301
yah
>>108855302
well you're gonna have to really push the model, use a lora, or admit defeat
>>
gn all
>>
File: debo_cs-f_anima1_00030_.png (3.1 MB, 1792x1140)
3.1 MB PNG
>>108855310
or I can enjoy the stuff I'm making even tho its not perfect :)

>>108855319
gn
>>
>>108855319
gn
>>
>>
File: debo_cs-f_anima1_00031_.png (2.92 MB, 1792x1140)
2.92 MB PNG
>>108855420
pls stop mogging the thread with every gen
>>
>>108855439
lol i've been pretty choosy
>>
>>
File: debo_cs-f_anima1_00032_.png (2.79 MB, 1792x1140)
2.79 MB PNG
>>
>>
File: debo_cs-f_anima1_00034_.png (2.61 MB, 1792x1140)
2.61 MB PNG
>>
>>
last one. bedtime. gn
>>
File: debo_cd-a_anima1_00075_.png (2.73 MB, 1792x1194)
2.73 MB PNG
wtf

>>108855785
gn
>>
>>108855798
thicc lol
>>
https://www.youtube.com/watch?v=a856jos1bSo
>>
i miss schizo anon
>>
>>
>>
File: 000000_70893_.png (2.48 MB, 1061x1553)
2.48 MB PNG
G'mornin Anons, have a great day!
>I caught flutterby!
>>
>>108857155
leave the butterflies alone!v
>>



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.