[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: 1777426822970211.png (3.89 MB, 1585x1036)
3.89 MB PNG
Previous /sdg/ thread : >>108705694

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux.2 Dev/Klein
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/black-forest-labs/FLUX.2-klein-4B
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Anima
https://huggingface.co/circlestone-labs/Anima

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/aco/sdg
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/gif/vdg
>>>/h/hdg
>>>/r/realistic+parody
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vp/napt
>>>/vt/vtai
>>
>containment general
>>
>>108719627
obviously it's not working to contain anyone
>>
>>
>mfw Resource news

04/29/2026

>Z-Anime | Full Anime Fine-Tune on Z-Image Base
https://huggingface.co/SeeSee21/Z-Anime

>QuantVideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization
https://github.com/svg-project/Quant-VideoGen

>World-R1: Reinforcing 3D Constraints for Text-to-Video Generation
https://github.com/microsoft/World-R1

>Benchmarking Layout-Guided Diffusion Models through Unified Semantic-Spatial Evaluation in Closed and Open Settings
https://github.com/lparolari/cobench

>VibeToken: Scaling 1D Image Tokenizers and Autoregressive Models for Dynamic Resolution Generations
https://github.com/SonyResearch/VibeToken

>OmniVTG: A Large-Scale Dataset and Training Paradigm for Open-World Video Temporal Grounding
https://github.com/oceanflowlab/OmniVTG

>Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models
https://github.com/LeapLabTHU/RvR

>SketchVLM: Vision language models can annotate images to explain thoughts and guide users
https://sketchvlm.github.io

>Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation
https://tuna-ai.org/tuna-2

>Prefill-Time Intervention for Mitigating Hallucination in Large Vision-Language Models
https://github.com/huaiyi66/PTI

04/28/2026

>Illustrious XL & NoobAI-XL Style Explorer
https://github.com/ThetaCursed/Illustrious-NoobAI-Style-Explorer

>LTX Desktop 1.0.5
https://github.com/Lightricks/LTX-Desktop/releases/tag/v1.0.5

>Meta-CoT: Enhancing Granularity and Generalization in Image Editing
https://shiyi-zh0408.github.io/projectpages/Meta-CoT

04/27/2026

>PixlStash 1.1.0 Update
https://pixlstash.dev/whatsnew.html

>AURA AI Studio Vault: One-stop management app for models, images and more
https://github.com/TheGho7t/AURA-AI-Studio-Vault

>UniGeo: Unifying Geometric Guidance for Camera-Controllable Image Editing via Video Models
https://mo230761.github.io/UniGeo.github.io
>>
>mfw Research news

04/29/2026

>Golden RPG: Confidence-Adaptive Region-Aware Noise for Compositional Text-to-Image Generation
https://arxiv.org/abs/2604.25314

>A Systematic Post-Train Framework for Video Generation
https://arxiv.org/abs/2604.25427

>ResetEdit: Precise Text-guided Editing of Generated Image via Resettable Starting Latent
https://arxiv.org/abs/2604.25128

>ViPO: Visual Preference Optimization at Scale
https://liming-ai.github.io/ViPO

>GramSR: Visual Feature Conditioning for Diffusion-Based Super-Resolution
https://github.com/aimagelab/GramSR

>Exploring Time Conditioning in Diffusion Generative Models from Disjoint Noisy Data Manifolds
https://arxiv.org/abs/2604.25289

>The Thinking Pixel: Recursive Sparse Reasoning in Multimodal Diffusion Latents
https://arxiv.org/abs/2604.25299

>DDA-Thinker: Decoupled Dual-Atomic Reinforcement Learning for Reasoning-Driven Image Editing
https://arxiv.org/abs/2604.25477

>Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation
https://mutualforcing.github.io

>Learning Illumination Control in Diffusion Models
https://nishitanand.github.io/relighting-diffusion-website

>Learning from Noisy Preferences: A Semi-Supervised Learning Approach to Direct Preference Optimization
https://arxiv.org/abs/2604.24952

>Improving Diversity in Black-box Few-shot Knowledge Distillation
https://arxiv.org/abs/2604.25795

>QFlash: Bridging Quantization and Memory Efficiency in Vision Transformer Attention
https://arxiv.org/abs/2604.25306

>When the Forger Is the Judge: GPT-Image-2 Cannot Recognize Its Own Faked Documents
https://arxiv.org/abs/2604.25213

>The Forensic Cost of Watermark Removal
https://arxiv.org/abs/2604.25491

>GPT-Image-2 in the Wild: A Twitter Dataset of Self-Reported AI-Generated Images from the First Week of Deployment
https://arxiv.org/abs/2604.25370

>Can We Change the Stroke Size for Easier Diffusion?
https://arxiv.org/abs/2603.26783
>>
evening anon
>>
>>
>>108720100
howdy
>>
File: pixel-0001-2745796111.png (73 KB, 2560x2048)
73 KB PNG
>>
>>
>>
>>
>>
>>
>>
>>
>>108719619
It may be time to add some agents to the mix.
https://x.com/NousResearch/status/2049584595465572752
https://github.com/NousResearch/hermes-agent/tree/main/skills/creative/comfyui
>>
File: deNK_zi_00047_.png (2.24 MB, 1663x1164)
2.24 MB PNG
>>108720995
that zoomout gave me extreme anxiety
I wonder how much value you can really get out of agentic workflow creation tho. maybe helpful for intricate video timecoding or something
>>
When did last thread highlights stop?
>>
File: deNK_zi_00052_.png (2.56 MB, 1663x1164)
2.56 MB PNG
>>108721118
was that ever a thing? we had an anon do it on rare occasion but it wasn't a constant thing
>>
>>108721041
>that zoomout gave me extreme anxiety
Same.

>I wonder how much value...
Not sure, It'll be interesting to see how it setups and novel workflows based on prompt description. I'm expecting bloat and redundancy of course.

Prompting a Gen directly in Hermes is a token whore of course.
>>
quick request, can someone prompt classic winnie the pooh fighting red shirt disney winnie the pooh? red shirt pooh bear is winning
>>
File: deCS_zia_00002_.png (2.53 MB, 1792x977)
2.53 MB PNG
erm, my zimg anime isnt working
>>
>>
File: deCS_zia_00007_.png (3.03 MB, 1792x977)
3.03 MB PNG
well, it looks like something at least
>>
File: deCS_zia_00011_.png (2.93 MB, 1792x977)
2.93 MB PNG
still very melty hm
>>
File: deCS_zia_00015_.png (2.73 MB, 1792x977)
2.73 MB PNG
kinda works. will have to tinker more
gn
>>
i miss schizo anon
>>
gm

>>108722161
So do we
>>
>>108723090
gm
>>
>gm
>>
>>
>>108723300
nice and shiny.
>>
>>
>>
>>
>>
File: deCS_anima_00001_.png (2.33 MB, 1792x977)
2.33 MB PNG
posting some anima stuff while I try to figure out zia more
>>
File: 00003-425627238.png (2.22 MB, 896x1152)
2.22 MB PNG
>>
File: deCS_anima_00002_.png (2.21 MB, 1792x977)
2.21 MB PNG
>>
File: 00000-4205374383.jpg (377 KB, 1728x1344)
377 KB JPG
>>
File: deCS_anima_00007_.png (2.39 MB, 1792x977)
2.39 MB PNG
>>108724971
me in the back
>>
>>
File: 00002-514436778.jpg (927 KB, 2592x2016)
927 KB JPG
>>
File: deCS_anima_00008_.png (2.25 MB, 1792x977)
2.25 MB PNG
>>
File: deCS_anima_00015_.png (2.57 MB, 2048x1117)
2.57 MB PNG
>>
>>
File: deCS_anima_00011_.png (2.47 MB, 1792x977)
2.47 MB PNG
>>
File: 00004-1446782111.jpg (234 KB, 2048x2048)
234 KB JPG
>>
File: 00005-603462497.jpg (203 KB, 2048x2048)
203 KB JPG
>>
File: bbs-zit-2026-04-30_00002_.png (3.65 MB, 1920x1080)
3.65 MB PNG
>>108726430
did you bog him on purpose lol
>>
Afternoon anons
>>
File: 00032-2079924748.png (1.46 MB, 512x1536)
1.46 MB PNG
>>108726534
naw, just the normal sd 1.5 result
>>
>>108726541
howdy
>>
>>
File: 00007-821408021.png (3.36 MB, 1152x1920)
3.36 MB PNG
>>
>>
Deep in the hear of Germany
>>
File: 1713888231559952.png (748 KB, 1341x1927)
748 KB PNG
Good morning, afternoon, or night.

I got tired of asking for edits in /trash/ (the Edit, Lineart, and Coloring Threads are practically unusable due to spam)

Could you please edit this image?

What I need is for you to change the design and color of the body fur of this female furry cheetah to transform her into a Jaguar and also change the design of her ears to resemble those of a real jaguar.
https://en.wikipedia.org/wiki/Jaguar

https://upload.wikimedia.org/wikipedia/commons/0/0a/Standing_jaguar.jpg

https://en.wikifur.com/w/images/1/1f/Miles-df_xian-jaguar.jpg
>>
>>
File: jaghag.png (1.17 MB, 1046x1503)
1.17 MB PNG
>>108727030
>>
>>108727180
Not OR, But nice edit
>>
>>
File: ComfyUI_00063_.png (1.06 MB, 1344x2016)
1.06 MB PNG
>>108727030
Send tokens
>>
File: jaghag2.png (1.07 MB, 1046x1503)
1.07 MB PNG
>>108727180
>>108727202
>>
>>
>>
File: deCS_anima_00053_.png (3.42 MB, 2048x1117)
3.42 MB PNG
>>108726839
(n)ice
is this a real character or an ai construct?
>>
>>108727349
beep
boop
>>
>>
>>
>>
>>108727256
>>108727180
>>108727231
OR of edit request here
Thank you so much.
Blessings to you.

It's nice to see that at least in this thread, you can find some good things.
>>
>>
File: 000000_68850_.png (3.26 MB, 1068x1531)
3.26 MB PNG
>>108727030
>>
File: 000000_68806_.png (3.23 MB, 1392x1171)
3.23 MB PNG
>>
>>
File: deCS_zia_00049_.png (2.62 MB, 1792x977)
2.62 MB PNG
>>108727628
science has gone too far

>>108727641
did you flush z image anime? I was able to get it kinda tuned in but I can't solve 100% of the meltiness
>>
>>108727658
>Think we get a fire this time messing with genes,,
>>
>>108727628
LOL
This image is appreciated nonetheless.

I imagine the female Jaguar as a character from the 2000s (2003 or 2004) and as a Linkin Park fan.
>>
>>108727658
i doubt i'll try it again unless something really draws my attention to it, but i havent deleted it
are you using cfg >3.5 and steps >30?
>>
File: deCS_zia_00045_.png (2.68 MB, 1792x977)
2.68 MB PNG
>>108727782
cfg 6 seemed like the sweet spot and I felt like I had to go above 40 steps for full convergence
its weird cuz the sample workflows use cfg 1.1 for some reason
>>
>>108727805
there's a turbo version isnt there? or distilled at least
i've used cfg 1.1 with zturbo before, currently at 3.5
as someone said, "just because it's turbo doesnt mean you *have* to use it at cfg 1"
>>
File: deCS_zia_00043_.png (2.69 MB, 1792x977)
2.69 MB PNG
>>108727814
ah, it mustve been for the distilled version
>>
>>108727820
cfg 1.1 is a cheat to give turbo models neg conditioning, but it does slow it down. higher cfg on turbo = slower and slower
>>
>>
>>
File: deCS_anima_00017_.png (2.55 MB, 2048x1117)
2.55 MB PNG
>>
>>
>>
>>
File: deCS_anima_00019_.png (2.5 MB, 2048x1117)
2.5 MB PNG
>>108728036
>they bleated ne toog ditied
aint that the truth
>>
>>108728061
some of these are pretty funny. wish zit were just a smidge better at text lol. nb2 would have a field day with this
>>
>>
>>
File: deCS_anima_00020_.png (3.58 MB, 2048x1117)
3.58 MB PNG
>>
cat is yelling at me to go to bed. i obey. gn!
>>
File: deCS_anima_00022_.png (3.35 MB, 2048x1117)
3.35 MB PNG
cereal summon technique

>>108728452
gn
>>
>>
How do you organize your pics when genning with ChatGpt?
Especially annoying with all my alt accounts
>>
>>
>>
i miss schizo anon
>>
gm
>>
File: 00001-89998238.png (1.89 MB, 1280x768)
1.89 MB PNG
>>108730046
gm
>>
File: 00003-2950884486.png (1.83 MB, 768x1280)
1.83 MB PNG
>>
File: 00004-1462917107.png (410 KB, 768x1280)
410 KB PNG
>>
File: 00005-1960539319.png (1.78 MB, 1280x768)
1.78 MB PNG
>>
>>
huehuehue
>>
>>
>>
>>108720659
Hands are reversed.
>>
gm
>>
File: 00009-2080559120.png (922 KB, 768x1280)
922 KB PNG
>>
>>108730935
early start?
>>
>>
File: deCS_anima_00026_.png (3.2 MB, 2048x1117)
3.2 MB PNG
>>108730935
gm
happy friday

>>108731090
weekend eve gets everyone excited
>>
>>108731090
the same cat what made me go to bed woke me up too. little fucker

>>108731196
gm
>>
File: 00020-1598300780.png (1.59 MB, 896x1152)
1.59 MB PNG
>>
File: 00021-640224881.png (1.92 MB, 1152x896)
1.92 MB PNG
>>
Morning anons
https://youtu.be/k4hjX6ZsplU?si=Df3Aw0-i3_GFPG7g
>>
File: 00012-2080559123.png (605 KB, 768x1280)
605 KB PNG
>>108732073
morning
>>
>gm
>>
File: deCS_anima_00027_.png (2.78 MB, 2048x1117)
2.78 MB PNG
>>108730178
>>108730214
I like all these, but esp the elephants
>>
File: 65649-2852628294.png (3.6 MB, 1920x1152)
3.6 MB PNG
>>108732301
been trying to gen using creatures i've seldom used
>>
>>
File: 65650-110503496.png (1.51 MB, 1920x1152)
1.51 MB PNG
guess suno has a 'create your own model' thing now, currently it is 'cooking' one for me
>>
File: 00029-1512835060.png (648 KB, 768x1280)
648 KB PNG
>>108732657
well it was lame, or i didn't do it right
>>
>>
File: deCS_anima_00028_.png (2.84 MB, 2048x1117)
2.84 MB PNG
>>108732657
>suno has a 'create your own model' thing now
ohh really... thats very interesting
im curious how you'd say it failed. it just didn't follow the inputs well?
>>
File: 65653-4006254640.gif (3.03 MB, 2592x2016)
3.03 MB GIF
>>108733054
well, i gave it sounds of various sort of experimental punk styles, and the results i get are the very generic results i would get from before, like the custom model isn't having an effect.
>>
>>108733078
>high resolution, clean image
>fucked up hands
come on anon...
>>
>>
>mfw Security news

https://github.com/huggingface/diffusers/security/advisories/GHSA-98h9-4798-4q5v
>>
File: 65655-4087088899.gif (3.02 MB, 2016x2592)
3.02 MB GIF
>>
>>108733078
>>108733054
as i understand it you're supposed to be uploading your own non-ai songs so it can mimic your style/voice. i guess you could use anything that wouldn't trigger ContentID (or whatever they use to keep the RIAA reptilians off their backs)
https://help.suno.com/en/articles/11362497
>>
>>
File: 65658-2042236788.gif (2.72 MB, 2016x2592)
2.72 MB GIF
>>108733240
ah, i was being bad.. just an experiment. was hoping to have a good model based off of the obscure music i like. oh well.
>>
>>
me on the right
>>
>>108733367
i mean if they let you upload it then whatever, it's on their head lol! i haven't tried it myself. the voice feature is pretty rad tho even if it does drag some style prompt in (via the voice itself, not the optional style prompt section). i'd guess it's a half-baked feature rn anyway (the custom models)
>>
>>
File: 65660-2344468904.jpg (919 KB, 2592x2016)
919 KB JPG
>>
>>
>>
File: deCS_anima_00031_.png (3.09 MB, 2048x1117)
3.09 MB PNG
>>108733456
the rare moment when the AI catches the genner in action

>>108733496
>>108733514
these come together like a pokemon battle
>>
File: 65662-1860395603.jpg (438 KB, 1728x1344)
438 KB JPG
>>
File: 65663-3289109297.jpg (437 KB, 1728x1344)
437 KB JPG
>>
File: deCS_anima_00034_.png (3.45 MB, 2048x1117)
3.45 MB PNG
>>108734058
raichibis
>>
hmmmm
>>
>>
File: 65664-1256832573.png (2.93 MB, 768x2304)
2.93 MB PNG
>>
>>
File: 000000_68970_.png (2.94 MB, 1378x1159)
2.94 MB PNG
G'evenin Anons,
>>
File: 000000_68998_.png (2.25 MB, 1455x1132)
2.25 MB PNG
>>
>>
File: deCS_anima_00039_.png (3.21 MB, 2048x1117)
3.21 MB PNG
>>108734293
heyo
>>
File: 65666-3066110589.jpg (433 KB, 1728x1344)
433 KB JPG
>>108734293
evening
>>
File: 000000_69000_.png (2.28 MB, 1436x1117)
2.28 MB PNG
>>108734325
Nice.
>>108734354
>>108734366
TGIF!
>the crystal ball shows the future within,
>>
>>108734410
thx
>>
File: 000000_69020_.png (2.12 MB, 1442x1121)
2.12 MB PNG
>>
File: deCS_anima_00040_.png (2.84 MB, 2048x1117)
2.84 MB PNG
>>108734410
>TGIF!
thank /g/ its frog
>>
File: 65667-2746191683.jpg (429 KB, 1728x1344)
429 KB JPG
>>
>>
>>
>>
File: 65669-1146532251.jpg (896 KB, 2016x2592)
896 KB JPG
>>
>>108734568
>>108734759
nice ones
>>
File: ComfyUI_00014_.png (2.12 MB, 1024x1488)
2.12 MB PNG
testing settings
>>
baking
>>
>>108735043
>>108735043
>>108735043
>>
>>108723451
holy shit is this a god prompt or are using more than just turbo lora?



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.