[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: 43436346346436.png (2.7 MB, 1248x1824)
2.7 MB
2.7 MB PNG
Previous /sdg/ thread : >>107785765

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://huggingface.co/jayn7/Z-Image-Turbo-GGUF

>Flux.2 Dev
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/city96/FLUX.2-dev-gguf

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
>>>/r/realistic+parody
>>
File: k_00063_.png (3.68 MB, 1488x1488)
3.68 MB
3.68 MB PNG
>>
File: 535453454334.jpg (1.37 MB, 1664x2432)
1.37 MB
1.37 MB JPG
desuarchive.org/g/thread/103817025
a year ago
>>
>mfw Resource news

01/08/2026

>Edit2Restore:Few-Shot Image Restoration via Parameter-Efficient Adaptation of Pre-trained Editing Models
https://github.com/makinyilmaz/Edit2Restore

>ComfyUI-Persona-Director: AI Agent for maintaining character consistency
https://github.com/18yz153/ComfyUI-Persona-Director

>UniVideo: Unified Understanding, Generation, and Editing for Videos
https://github.com/KlingTeam/UniVideo

>Dataset Metadata Injection Tool: Adds Kohya/A1111-compatible tag frequency metadata
https://github.com/LindezaBlue/Dataset-Metadata-Injection

>TTP Toolset adds LTX 2 first and last frame control
https://github.com/TTPlanetPig/Comfyui_TTP_Toolset/releases/tag/V1.0.3

>Qwen-llm-loader prompt refiner for ComfyUI
https://github.com/capitan01R/Qwen-llm-loader

>WanGP adds LTX 2 support, works on as little as 10GB VRAM
https://github.com/deepbeepmeep/Wan2GP#january-7st-2026-wangp-v1010-spoiled-again

01/07/2026

>Official NVFP4 and mixed NVFP4/BF16 versions of FLUX.2 [dev]
https://huggingface.co/black-forest-labs/FLUX.2-dev-NVFP4

>VINCIE-7B: ByteDance Seed compact 7B model
https://huggingface.co/ByteDance-Seed/VINCIE-7B

01/06/2026

>LTX-2: DiT-based audio-video foundation model
https://github.com/Lightricks/LTX-2

>DreamID-V:Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
https://guoxu1233.github.io/DreamID-V

>Diffusion Toolkit v1.10
https://github.com/RupertAvery/DiffusionToolkit/releases/tag/v1.10

>NVIDIA RTX Accelerates 4K AI Video Generation on PC With LTX-2 and ComfyUI Upgrades
https://blogs.nvidia.com/blog/rtx-ai-garage-ces-2026-open-models-video-generation

>fp8_e4m3fn conversion of Gemma 3 12b it text encoder
https://huggingface.co/GitMylo/LTX-2-comfy_gemma_fp8_e4m3fn

>Diffuse: Windows desktop UI for Huggingface Diffusers
https://github.com/TensorStack-AI/Diffuse

>SwinIFS: Landmark Guided Swin Transformer For Identity Preserving Face Super Resolution
https://github.com/Habiba123-stack/SwinIFS
>>
>mfw Research news

01/08/2026

>Mind the Generative Details: Direct Localized Detail Preference Optimization for Video Diffusion Models
https://arxiv.org/abs/2601.04068

>Thinking with Frames: Generative Video Distortion Evaluation via Frame Reward Model
https://arxiv.org/abs/2601.04033

>PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Based Scalable Typography
https://arxiv.org/abs/2601.03993

>ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation
https://arxiv.org/abs/2601.03955

>I2E: From Image Pixels to Actionable Interactive Environments for Text-Guided Image Editing
https://arxiv.org/abs/2601.03741

>VideoMemory: Toward Consistent Video Generation via Memory Integration
https://hit-perfect.github.io/VideoMemory

>Detecting AI-Generated Images via Distributional Deviations from Real Images
https://arxiv.org/abs/2601.03586

>Diffusion-DRF: Differentiable Reward Flow for Video Diffusion Fine-Tuning
https://arxiv.org/abs/2601.04153

>SDCD: Structure-Disrupted Contrastive Decoding for Mitigating Hallucinations in Large Vision-Language Models
https://arxiv.org/abs/2601.03500

>Understanding Reward Hacking in Text-to-Image Reinforcement Learning
https://arxiv.org/abs/2601.03468

>ThinkRL-Edit: Thinking in Reinforcement Learning for Reasoning-Centric Image Editing
https://arxiv.org/abs/2601.03467

>Latent Geometry of Taste: Scalable Low-Rank Matrix Factorization
https://arxiv.org/abs/2601.03466

>Attention mechanisms in neural networks
https://arxiv.org/abs/2601.03329

>Listen to Rhythm, Choose Movements: Autoregressive Multimodal Dance Generation via Diffusion and Mamba with Decoupled Dance Dataset
https://arxiv.org/abs/2601.03323

>Mass Concept Erasure in Diffusion Models with Concept Hierarchy
https://arxiv.org/abs/2601.03305
>>
>mfw Yesterday's Research news

01/07/2026

>DiT-JSCC: Rethinking Deep JSCC with Diffusion Transformers and Semantic Representations
https://arxiv.org/abs/2601.03112

>Text-Guided Layer Fusion Mitigates Hallucination in Multimodal LLMs
https://arxiv.org/abs/2601.03100

>Towards Faithful Reasoning in Comics for Small MLLMs
https://arxiv.org/abs/2601.02991

>LAMS-Edit: Latent and Attention Mixing with Schedulers for Improved Content Preservation in Diffusion-Based Image and Style Editing
https://arxiv.org/abs/2601.02987

>Muses: Designing, Composing, Generating Nonexistent Fantasy 3D Creatures without Training
https://luhexiao.github.io/Muses.github.io

>ClearAIR: A Human-Visual-Perception-Inspired All-in-One Image Restoration
https://arxiv.org/abs/2601.02763

>InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields
https://arxiv.org/abs/2601.03252

>A Versatile Multimodal Agent for Multimedia Content Generation
https://arxiv.org/abs/2601.03250

>LTX-2: Efficient Joint Audio-Visual Foundation Model
https://arxiv.org/abs/2601.03233

>Decentralized Autoregressive Generation
https://arxiv.org/abs/2601.03184

>UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision
https://arxiv.org/abs/2601.03193

>DiffBench Meets DiffAgent: End-to-End LLM-Driven Diffusion Acceleration Code Generation
https://arxiv.org/abs/2601.03178

>Unified Thinker: A General Reasoning Modular Core for Image Generation
https://arxiv.org/abs/2601.03127

>DreamStyle: A Unified Framework for Video Stylization
https://lemonsky1995.github.io/dreamstyle

>GRRE: Leveraging G-Channel Removed Reconstruction Error for Robust Detection of AI-Generated Images
https://arxiv.org/abs/2601.02709

>DreamLoop: Controllable Cinemagraph Generation from a Single Photograph
https://anime26398.github.io/dreamloop.github.io
>>
File: IMG_2370.png (695 KB, 678x907)
695 KB
695 KB PNG
>>107807627
nice thread title bro
>>
File: k_00072_.png (3.18 MB, 1488x1488)
3.18 MB
3.18 MB PNG
>>
File: deCS_zi_00072_.png (3.58 MB, 2176x1152)
3.58 MB
3.58 MB PNG
>>
File: k_00073_.png (2.6 MB, 1488x1488)
2.6 MB
2.6 MB PNG
>>
File: dePR_zi_00061_.png (2.4 MB, 1920x1152)
2.4 MB
2.4 MB PNG
>>
File: k_00074_.png (2.7 MB, 1488x1488)
2.7 MB
2.7 MB PNG
>>
File: k_00077_.png (1.03 MB, 1088x1488)
1.03 MB
1.03 MB PNG
a second lora has hit the.. whatever. not sure about this one
>>
File: k_00079_.png (675 KB, 1088x1488)
675 KB
675 KB PNG
>>
File: deCS_zi_00073_.png (3.52 MB, 2176x1152)
3.52 MB
3.52 MB PNG
>>107808609
looks pretty good to me
>>
File: k_00080_.png (1.77 MB, 1088x1488)
1.77 MB
1.77 MB PNG
>>107808623
yeah, it is decent, but prompt must include the right words/phrases, the flux loras seem to work even without the right language.
or i just don't understand anything
had chatgpt give some prompts based off of the training data
same prompt with the punk art lora
>>
File: k_00081_.png (537 KB, 1088x1488)
537 KB
537 KB PNG
>>
File: k_00082_.png (1.29 MB, 1088x1488)
1.29 MB
1.29 MB PNG
>>
>>107807978
cutie
>>
File: deCS_zi_00074_.png (3.6 MB, 2176x1152)
3.6 MB
3.6 MB PNG
>>107808802
nice. this would be a cool style for some kind of video game. I haven't decided the genre yet
>>
File: k_00085_.png (2.84 MB, 1088x1488)
2.84 MB
2.84 MB PNG
>>
File: k_00086_.png (1.67 MB, 1088x1488)
1.67 MB
1.67 MB PNG
>>
File: deCS_zi_00075_.png (3.35 MB, 2176x1152)
3.35 MB
3.35 MB PNG
>>
File: k_00088_.png (1.05 MB, 1920x1080)
1.05 MB
1.05 MB PNG
>>
File: k_00089_.png (3.23 MB, 1920x1080)
3.23 MB
3.23 MB PNG
>>
>>
File: k_00090_.png (3.24 MB, 1920x1080)
3.24 MB
3.24 MB PNG
with the two loras at the same time, at strengths of 1, too high, but semi neat
>>
>>
File: k_00091_.png (2.22 MB, 1920x1080)
2.22 MB
2.22 MB PNG
loras at .8 strength
>>
>>
>>
>>
File: k_00096_.png (1.98 MB, 896x1152)
1.98 MB
1.98 MB PNG
>>
>>
File: k_00097_.png (2.06 MB, 896x1152)
2.06 MB
2.06 MB PNG
>>
File: k_00098_.png (1.59 MB, 896x1152)
1.59 MB
1.59 MB PNG
>>
Qwen seems to have some trouble with continuity of legs and arms. It's weird. It's got fingers and feet good.
>>
File: k_00100_.png (820 KB, 896x1152)
820 KB
820 KB PNG
>>
File: k_00101_.png (1.84 MB, 896x1152)
1.84 MB
1.84 MB PNG
>>
File: k_00102_.png (1.51 MB, 896x1152)
1.51 MB
1.51 MB PNG
flaming owl
>>
File: k_00103_.png (1.07 MB, 896x1152)
1.07 MB
1.07 MB PNG
hoot and holler
>>
File: output.png (2.2 MB, 1792x1152)
2.2 MB
2.2 MB PNG
>>
File: output.png (1.94 MB, 1792x1152)
1.94 MB
1.94 MB PNG
>>
File: k_00109_.png (1.03 MB, 896x1152)
1.03 MB
1.03 MB PNG
>>
File: deTW_zi_00001_.png (3 MB, 2048x1280)
3 MB
3 MB PNG
>>107809514
a new koff style is born

>>107810318
mfw
>>
>>107807627
Make proper thread so my filter works, faggot
>>
File: k_00113_.png (955 KB, 896x1152)
955 KB
955 KB PNG
>>107810389
you should get better at filtering, it's not that hard
>>
File: k_00114_.png (472 KB, 896x1152)
472 KB
472 KB PNG
>>
File: k_00115_.png (1.41 MB, 1920x1080)
1.41 MB
1.41 MB PNG
>>
File: k_00117_.png (2.02 MB, 1920x1080)
2.02 MB
2.02 MB PNG
>>
File: k_00118_.png (2.5 MB, 1920x1080)
2.5 MB
2.5 MB PNG
>>
File: deTW_zi_00003_.png (2.91 MB, 2048x1280)
2.91 MB
2.91 MB PNG
>>
File: k_00119_.jpg (1.09 MB, 1920x1080)
1.09 MB
1.09 MB JPG
>>107810786
nice. using some lora?
>>
File: k_00120_.png (3.39 MB, 1920x1080)
3.39 MB
3.39 MB PNG
>>
File: deTW_zi_00007_.png (2.94 MB, 2048x1280)
2.94 MB
2.94 MB PNG
>>107810890
yeah, I've been using the same 'cassette futurism' zimg lora all week. it does very cool stuff, even if very little of it is actually 'cassette futurism'. it has nice opinions on color composition
>>
File: k_00121_.png (1.39 MB, 1488x1488)
1.39 MB
1.39 MB PNG
>>107811039
nice, looking at that user's page, they have some nice loras
>>
File: PH_SE_ANIMALSNACK_09.jpg (2.38 MB, 4608x3524)
2.38 MB
2.38 MB JPG
No wonder my catcher didn't watch the thread. Whoever baked forgot the subject.
>>
File: k_00122_.png (2.93 MB, 1488x1488)
2.93 MB
2.93 MB PNG
>>
File: file.png (55 KB, 476x983)
55 KB
55 KB PNG
is it supposed to look like this? or is my install completely fucked because the desktop updater is fucking retarded?
>>
>>107811130
i ask because i would not put it past the fennec to do something this retarded
>>
File: k_00129_.png (1.97 MB, 1008x1488)
1.97 MB
1.97 MB PNG
>>
File: deTW_zi_00008_.png (2.94 MB, 2048x1280)
2.94 MB
2.94 MB PNG
>>107811063
>>107811120
oh boy, grids are back on the menu
you must be getting good gen times with your zimg setup

>>107811085
personally, i salute the valiant baker who baked when no one else would

>>107811130
I refuse to update so I have no idea what its supposed to look like anymore
>>
File: k_00130_.png (2.27 MB, 1920x1080)
2.27 MB
2.27 MB PNG
>>107811277
not really grids of gens, i prompted single gens as consisting of 4 different subjects, 'top left is ___, top right is ___' etc
the gen times are fine, relatively, i am just happy not having is oom on me, though it does do tiled vae at the end
pic related took about 10 minutes, 12 steps
>>
File: k_00131_.png (3.94 MB, 1920x1080)
3.94 MB
3.94 MB PNG
>>
File: deUC_zi_00019_.png (2.73 MB, 2048x1280)
2.73 MB
2.73 MB PNG
>>107811318
oh, those are single gens? thats even cooler
>10 minutes, 12 steps
less cool, lol. worth tho
>>
File: k_00132_.png (1.81 MB, 1920x1080)
1.81 MB
1.81 MB PNG
gn
>>
>>107811433
later gator
>>
>>107811277
looks like a bug and i just made my life way more difficult for no reason. moving the sidebar to left side works fine, on the right it's fucked. fuck my life
>>
File: deUC_zi_00020_.png (2.98 MB, 2048x1280)
2.98 MB
2.98 MB PNG
>>107811433
gn

>>107811555
>looks like a bug and i just made my life way more difficult
add it to the pile. or their github issues. same diff prob
>>
>>107811574
fuck this i'll deal with it tomorrow. all i'll get for my trouble tonight is seething rage. gn
>>
File: deUC_zi_00021_.png (2.47 MB, 2048x1280)
2.47 MB
2.47 MB PNG
>>107811581
gn
>>
File: PH_SE_ANIMALSNACK_11.jpg (1.13 MB, 4096x4096)
1.13 MB
1.13 MB JPG
>>107811277
Same was just making a comment
>>
File: deUC_zi_00025_.png (2.33 MB, 2048x1280)
2.33 MB
2.33 MB PNG
>>107811641
very cute treats
but, uh... are they for dogs? or can I eat them?
>>
i miss schizo anon
>>
File: PH_SE_ANIMALSNACK_04.jpg (1.73 MB, 4608x3336)
1.73 MB
1.73 MB JPG
>>107811683
Let's put it this way. They're made out of ingredients that either a human or a dog could both consume and find delicious lol
>>
File: PH_SE_ANIMALSNACK_10.jpg (1.39 MB, 4608x3384)
1.39 MB
1.39 MB JPG
>>
File: winter night.webm (3.76 MB, 1920x640)
3.76 MB
3.76 MB WEBM
>>
File: 703981510.png (1.41 MB, 848x1184)
1.41 MB
1.41 MB PNG
>>107807627
>>
File: k_00133_.png (2.65 MB, 1920x1080)
2.65 MB
2.65 MB PNG
>>
>>
>>
File: k_00135_.png (1.35 MB, 1088x1488)
1.35 MB
1.35 MB PNG
>>
>>
File: k_00136_.png (2.35 MB, 1088x1488)
2.35 MB
2.35 MB PNG
honk honk
>>
File: 2201500225.jpg (247 KB, 2691x2097)
247 KB
247 KB JPG
>>
>>
>>
File: k_00140_.png (2.8 MB, 1488x1488)
2.8 MB
2.8 MB PNG
>>
>>
File: k_00142_.png (1.12 MB, 1152x896)
1.12 MB
1.12 MB PNG
>>
File: k_00143_.png (2.26 MB, 1152x896)
2.26 MB
2.26 MB PNG
>>
File: k_00144_.png (1.11 MB, 1152x896)
1.11 MB
1.11 MB PNG
>>
File: deUC_zi_00026_.png (2.04 MB, 2048x1280)
2.04 MB
2.04 MB PNG
>>107813911
>>107813930
changed settings? looks different

>>107814069
lol, friendly guy

>>107814141
awesome

>>107814369
me and my wife
>>
File: k_00145_.png (1.45 MB, 1152x896)
1.45 MB
1.45 MB PNG
>>
File: 3701800378.jpg (368 KB, 2097x2691)
368 KB
368 KB JPG
>>107814717
Thanks
>>
File: k_00147_.jpg (1.12 MB, 1920x1080)
1.12 MB
1.12 MB JPG
>>
File: deUC_zi_00027_.png (2.73 MB, 2048x1280)
2.73 MB
2.73 MB PNG
>>
>>107814717
>looks different
nah just a different style
i'm trying to integrate comfy api with my stuff into sillytavern now for chat+image gen goodness
>>
seems to work (without wildcards)
>>
File: 2601400264.jpg (382 KB, 2097x2691)
382 KB
382 KB JPG
>>
Morning anons
>>
oh it doesnt provide a workflow (embedded or not) via the api
so unable to recreate ever lol
>>
File: deUC_zi_00028_.png (2.2 MB, 2048x1280)
2.2 MB
2.2 MB PNG
>>107815199
gm
happy friday
>>
>>
File: k_00148_.png (2.63 MB, 1920x1080)
2.63 MB
2.63 MB PNG
>>107815199
morning
>>
File: k_00151_.png (2.55 MB, 1920x1080)
2.55 MB
2.55 MB PNG
>>
>>
File: x1.png (2.07 MB, 1626x1220)
2.07 MB
2.07 MB PNG
>>
File: x2.png (2.2 MB, 1626x1220)
2.2 MB
2.2 MB PNG
>>
>>
>>107815687
Reminds me of mulan
>>
>>107815687
i literally "gah"'d when seeing that finger lel
>>
>>107815732
AI is flexible. kek.
>>
i'm a nigbophile
>>
Chroma girl inspired
>>
File: k_00156_.png (2.91 MB, 1080x1920)
2.91 MB
2.91 MB PNG
>>
>>107815776
what upscaler are you using, if any?
>>
File: k_00157_.png (3.98 MB, 1080x1920)
3.98 MB
3.98 MB PNG
>>
>>107815883
no upscalers.
>>
>>107815883
actually, for that one, since it's a kontext gen, it's slightly resized to the original's size using Image Resize V2.
>>
>>107815971
NTA but the one you link to is pretty fucked up if you view it full size. meanwhile the one you just posted is fine
>>
>>107816036
You're right.
Let me fix that.
>>
>>107815938
>>107815971
oh you were on kontext then
how long does it take to switch models and such lol
kontext made it look super noisy (vs z which looks clean)
>>
File: k_00158_.png (2.87 MB, 1080x1920)
2.87 MB
2.87 MB PNG
>>
>>107816065
I just have two workflows. One with kontext, the other with z-image. Yeah, the noise is because of kontext. I just used gimp to put the butterfly on the original gen which keeps the original.
>>
File: k_00159_.png (1.68 MB, 1080x1920)
1.68 MB
1.68 MB PNG
>>
File: k_00160_.png (2.02 MB, 1088x1488)
2.02 MB
2.02 MB PNG
>>
friendship with hip 6.4 release is OVER. TheRock nightlies are my friend now
flash attention working on rocm
INFO <dinoml.backend.rocm.builder_cmake> Executing "C:/Program Files/CMake/bin/cmake.EXE" -D CMAKE_PREFIX_PATH="C:/TheRock/" -D CMAKE_CXX_COMPILER="C:/TheRock/bin/hipcc.exe" -DCMAKE_RC_COMPILER="C:/Program Files (x86)/Windows Kits/10/bin/10.0.22621.0/x64/rc.exe" -D CMAKE_BUILD_TYPE=Release -D GPU_TARGETS="gfx1201"  -B "tmp/flash_attn_sdpa/build" -S "tmp/flash_attn_sdpa" -G "Ninja"
INFO <dinoml.backend.rocm.builder_cmake> Executing cmake --build tmp\flash_attn_sdpa\build --config Release
INFO <dinoml.compiler.compiler> compiled the final .so file elapsed time: 0:00:16.619116
FlashAttention matches Torch SDPA

also first full model build on amd, sd 1.5
my pytorch nightly isn't working for some reason just hangs but i see from sd.next's benchmarks that 9070xt on windows is getting ~17it/s (i'll try 2.9.1+rocm7.11.0a20260103 that it mentions)
dinoml is at 27it/s and there's definitely a lot of performance left on the table
>>
File: deUC_zi_00029_.png (2.44 MB, 2048x1280)
2.44 MB
2.44 MB PNG
>>
File: k_00162_.png (1.3 MB, 1920x1080)
1.3 MB
1.3 MB PNG
>>
>>
File: k_00163_.png (2.9 MB, 1920x1080)
2.9 MB
2.9 MB PNG
>>
Hey, could you merge this with /ldg/? I mean, instead of posting here, post the same thing in /ldg/. We need anons who don't test pointlessly. I would ask /adt/, but they're dead.
>>



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.