/g/ - Technology



File deleted.
Previous /sdg/ thread : >>108377950

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux.2 Dev/Klein
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/black-forest-labs/FLUX.2-klein-4B
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Anima
https://huggingface.co/circlestone-labs/Anima

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/aco/sdg
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/gif/vdg
>>>/h/hdg
>>>/r/realistic+parody
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vp/napt
>>>/vt/vtai

OP https://rentry.co/twkuk8tz
>>
>>
>>
File: SDG_News_00057_.png (2.18 MB, 1344x1728)
>mfw Resource news

03/18/2026

>Faster Inference of Flow-Based Generative Models via Improved Data-Noise Coupling
https://github.com/araachie/loom-cfm

>Flash-Unified: Training-Free and Task-Aware Acceleration Framework for Native Unified Models
https://github.com/Rirayh/FlashU

>ViFeEdit: Video-Free Tuner of Your Video DiT
https://github.com/Lexie-YU/ViFeEdit

>SegviGen: Repurposing 3D Generative Model for Part Segmentation
https://fenghora.github.io/SegviGen-Page

>W2T: LoRA Weights Already Know What They Can Do
https://github.com/xiaolonghan2000/Weight2Token

>V-Co: Closer Look at Visual Representation Alignment via Co-Denoising
https://github.com/HL-hanlin/V-Co

>HeBA: Heterogeneous Bottleneck Adapters for Robust VLMs
https://github.com/Jahid12012021/VLM-HeBA

>Parallel In-context Learning for LVLMs
https://github.com/yshinya6/parallel-icl

>GDPO-SR: Group Direct Preference Optimization for One-Step SR
https://github.com/Joyies/GDPO

>REFORGE: Multi-modal Attacks Reveal Vulnerable Concept Unlearning in Image Generation Models
https://github.com/Imfatnoily/REFORGE

>Mixture of Style Experts for Diverse Image Stylization
https://hh-lg.github.io/StyleExpert-Page

>GlyphPrinter: Region-Grouped DPO for Glyph-Accurate Visual Text Rendering
https://henghuiding.com/GlyphPrinter

>PureCLIP-Depth: Prompt-Free and Decoder-Free Monocular Depth Estimation within CLIP Embedding Space
https://github.com/ryutaroLF/PureCLIP-Depth

>Kimodo: Scaling Controllable Human Motion Generation
https://research.nvidia.com/labs/sil/projects/kimodo

>Learning through Creation: Hash-Free Framework for On-the-Fly Category Discovery
https://github.com/brandinzhang/LTC

>Vlo: Local open-source video editor with ComfyUI-backend
https://github.com/PxTicks/vlo

>ComfyUI-LCS: Training-free color control via Latent Color Subspace
https://github.com/facok/ComfyUI-LCS

>FlashMotion: Few-Step Controllable Video Generation with Trajectory Guidance
https://quanhaol.github.io/flashmotion-site
>>
>mfw Research news

03/18/2026

>Tri-Prompting: Video Diffusion with Unified Control over Scene, Subject, and Motion
https://zhouzhenghong-gt.github.io/Tri-Prompting-Page

>Diffusion Models for Joint Audio-Video Generation
https://arxiv.org/abs/2603.16093

>When Generative Augmentation Hurts: A Benchmark Study of GAN and Diffusion Models for Bias Correction in AI Classification Systems
https://arxiv.org/abs/2603.16134

>VIGOR: VIdeo Geometry-Oriented Reward for Temporal Generative Alignment
https://arxiv.org/abs/2603.16271

>SNCE: Geometry-Aware Supervision for Scalable Discrete Image Generation
https://arxiv.org/abs/2603.15150

>LibraGen: Playing a Balance Game in Subject-Driven Video Generation
https://arxiv.org/abs/2603.13506

>Generative Video Compression with One-Dimensional Latent Representation
https://gvc1d.github.io

>Semantic One-Dimensional Tokenizer for Image Reconstruction and Generation
https://arxiv.org/abs/2603.16373

>Unlearning for One-Step Generative Models via Unbalanced Optimal Transport
https://arxiv.org/abs/2603.16489

>Leveling3D: Leveling Up 3D Reconstruction with Feed-Forward 3D Gaussian Splatting and Geometry-Aware Generation
https://arxiv.org/abs/2603.16211

>Feed-forward Gaussian Registration for Head Avatar Creation and Editing
https://malteprinzler.github.io/projects/match

>Adaptive Moments are Surprisingly Effective for Plug-and-Play Diffusion Sampling
https://arxiv.org/abs/2603.16797

>WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation
https://cvlab-kaist.github.io/WorldCam

>Rethinking UMM Visual Generation: Masked Modeling for Efficient Image-Only Pre-training
https://arxiv.org/abs/2603.16139

>Visual Prompt Discovery via Semantic Exploration
https://arxiv.org/abs/2603.16250

>Grounding World Simulation Models in a Real-World Metropolis
https://seoul-world-model.github.io

>Interact3D: Compositional 3D Generation of Interactive Objects
https://arxiv.org/abs/2603.16085
>>
>mfw MORE Research news

>SparkVSR: Interactive Video Super-Resolution via Sparse Keyframe Propagation
https://arxiv.org/abs/2603.16864

>LICA: Layered Image Composition Annotations for Graphic Design Research
https://arxiv.org/abs/2603.16098

>Mostly Text, Smart Visuals: Asymmetric Text-Visual Pruning for Large Vision-Language Models
https://arxiv.org/abs/2603.16001

>AnyCrowd: Instance-Isolated Identity-Pose Binding for Arbitrary Multi-Character Animation
https://arxiv.org/abs/2603.15415

>Persistent Story World Simulation with Continuous Character Customization
https://arxiv.org/abs/2603.16285

>Next-Frame Decoding for Ultra-Low-Bitrate Image Compression with Video Diffusion Priors
https://arxiv.org/abs/2603.15129

>Locate-then-Sparsify: Attribution Guided Sparse Strategy for Visual Hallucination Mitigation
https://arxiv.org/abs/2603.16284

>WiT: Waypoint Diffusion Transformers via Trajectory Conflict Navigation
https://arxiv.org/abs/2603.15132

>Reevaluating the Intra-Modal Misalignment Hypothesis in CLIP
https://arxiv.org/abs/2603.16100

>AI Application Benchmarking: Power-Aware Performance Analysis for Vision and Language Models
https://arxiv.org/abs/2603.16164

>Real-Time Human Frontal View Synthesis from a Single Image
https://arxiv.org/abs/2603.15433

>HYDRA: Unifying Multi-modal Generation and Understanding via Representation-Harmonized Tokenization
https://arxiv.org/abs/2603.15228

>Anatomy of a Lie: A Multi-Stage Diagnostic Framework for Tracing Hallucinations in Vision-Language Models
https://arxiv.org/abs/2603.15557

>AI-Generated Figures in Academic Publishing: Policies, Tools, and Practical Guidelines
https://arxiv.org/abs/2603.16159

>Directional Embedding Smoothing for Robust Vision Language Models
https://arxiv.org/abs/2603.15259

>What DINO saw: ALiBi positional encoding reduces positional bias in Vision Transformers
https://arxiv.org/abs/2603.16840
>>
File: out2.jpg (1.54 MB, 2662x1997)
left or right?
i prefer left, but right has some subtleties that are nicer (like hand pose)
>>
File: deSA_zi_00039_.png (2.1 MB, 1792x977)
>>108401969
the details on the left are more crisp and have more character, overall more life and vibrance. right is kinda washed out and 'airbrushed', though I do like the tighter color range more (and the hands are nicer too but maybe that's just random)
>>
>>
File: deSA_zi_00040_.png (2.25 MB, 1792x977)
>>108402148
nice
>>
jeet thread
>>
File: deBU_zi_00025_.png (2.04 MB, 1536x922)
>>
a surprisingly accurate globe from zit, except i think antarctica blewed up but oh well, we didn't need it anyway
>>
>>108402053
yah i couldnt fix the airbrushed look on that method, ah well
>>
>>108402776
>accurate
>erased the roman empire
>>
File: deSA_zi_00023_.png (1.77 MB, 1832x1000)
>>108402776
>antarctica blewed up
omg the secret world government activated the alien pyramid
>>
>>
>>108401870
>>Kimodo: Scaling Controllable Human Motion Generation
>https://research.nvidia.com/labs/sil/projects/kimodo
Has anyone tried using this?
>>
File: 000000_62400_.png (3.55 MB, 1276x1276)
>>108402776
Cool.
>>
>>108402994
be the pioneer in that type of model, anon
>>
>>108403038
>Amish Barn Rumble 2026
Damn I missed the event.
>>
>>
>>
>>
in the grim darkness of the far future, there are only 1girls
>>
>>
>>108403156
could be worse
>>
>>
>>
>>
>>
>>
i feel like zimage forces certain composition layouts with certain keywords no matter what (and i'm not referring to the seed variance thing, which is something different). it's hard to explain, it's like a visual balance within the gen. klein does it a bit too, but not as much as zimage seems to. flux and chroma did not. it's possibly something to do with distillation i suppose.
>>
>>108403235
i get the same kind of thing. it also has a small stock of representations for a bunch of stuff, so "mask" always comes out like in pic rel. i see a lot of repetitive stuff that never gets posted. poses and stuff too. like it starts out as wildcards, sure, but there's no way kimi-k2 is that stable with its rewording of stuff so it boils down to whatever weird constrained latent space from distilling imo
>>
>>108403262
yah but with models like chroma or flux dedistilled it was possible to just push the model into doing more or less what you want by messing with the sigmas and whatnot. with these you have to either use excessive +1.5 strength or multipliers to push it or it doesn't work at all, and even then it breaks more often than not. on the one hand these models do well enough out of the box, but it just feels constrained.
>>
>>
>>
LOL
>>
great, now i have to train a qwen-2512 lora. i just heard about this model

>>108403773
nice
>>
>>
>>108403910
i mean you don't *have* to unless...
>>
>>108403948
What's the deal here?
>>
>now i have to train a qwen-2512 lora. i just heard about this model
jesus are /sdg/ anons behind
>>
>>108403987
good question...

"Esoteric librarian’s dream: Manly P. Hall’s sacred geometry, gilded zodiac wheels, emerald tablets, candlelit marble halls, occult frescoes, celestial Atlantean sigils, ivory robes, starlit domes, chiaroscuro mysticism, ultra-crisp gothic detailing, 8K visionary realism."[mode:Voynich|whispered herbal cipher, looping marginalia, star-wheels, asemic glyphs]

Illuminated parchment folio, a full-foliage dame of generous circumference seated in lotus above the page, tresses like ink poured through water curtaining a single revealed eye whose iris swirls with the center of a spiraling sigil hidden within a meadow of whispering reeds, her mien rapt in cogitation; she is mantled in a short cote-hardie of midnight silk, lacings slack to the waist, slashed sleeves baring soft under-glow of skin yet no nakedness, hems edged with tiny bell-fruit seeds that chime without wind; dramatized by chiaroscuro, moon-cinders stripe her curves while lanterned fireflies orbit; behind, distant edges softened by haze around weathered signs—rotted stelae whose runes have migrated onto her hem; ringing vibration marks around hanging bells with reflective liquid surfaces ripple outward like concentric star diagrams, each droplet a microcosm of unknown constellations, safe for work, no nudity.

(sfw: 适合工作场合观看,无裸露画面,无文字。)
>>
>>108404010
sorry i dont spend all day filtering through neverending drama just to get the latest flavor of the month
>>
>>108404034
was more a joke about the "news" here but you do you sis
>>
>>
>op pic deleted
based thank you jannies
>>
>>108404062
>I was only pretending
>>
>>108404076
wait till you hear about other models people use /sdg/ anonie
>>
>chud lee
chun lee but with chud face. why hasnt anyone done this?
>>
>>108404070
what was wrong with it?
>>
>>108404017
Interesting, thank you.
>>
>>108404117
it was the op pic of a literal shithole
>>
>>
lolwut
>>
>>108404162
he just wants to show you his misshapen children bro, relax
>>
>>


