/g/ - Technology


Discussion and Development of Local Image and Video Models

Previous: >>108851016

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>mfw Resource news

05/18/2026

>Lance: Unified Multimodal Modeling by Multi-Task Synergy
https://lance-project.github.io

>GridLoraTester: Workbench for character LoRA training on FLUX.2: dataset curation
https://github.com/Mandrakia/GridLoraTester

>FLUX MCP server
https://docs.bfl.ai/api_integration/mcp_integration

>Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
https://shredded-pork.github.io/Flash-GRPO.github.io

>LongLive2.0 5B BF16: AR-trained Wan2.2-TI2V-5B generator
https://huggingface.co/Efficient-Large-Model/LongLive-2.0-5B

>DealMaTe: Multi-Dimensional Material Transfer via Diffusion Transformer
https://github.com/haha-lisa/DealMaTe

>Deep Pre-Alignment for VLMs
https://github.com/THUMAI-Lab/Deep-Pre-Alignment

>Sparse Autoencoders enable Robust and Interpretable Fine-tuning of CLIP models
https://github.com/Fabian-Mor/sae-ft

>VAGS: Velocity Adaptive Guidance Scale for Image Editing and Generation
https://github.com/Harvard-AI-and-Robotics-Lab/Velocity_Adaptive_Guidance_Scale

>Neural Companion: Local desktop AI companion shell
https://github.com/Rakile/NeuralCompanion

>PixlStash 1.2: easy sharing, cleaner UI and faster background processing for your image management
https://pixlstash.dev/whatsnew.html

05/17/2026

>Comfy-mesh LTX 2.3 support — separate node + separate server GUI
https://github.com/shootthesound/comfyui-mesh#ltx-23--separate-node--separate-server-gui

>Rebels_HiDream-01_Image_Dev_NODES: Run HiDream-01 Image Dev bf16 and GGUF
https://github.com/RealRebelAI/Rebels_HiDream-01_Image_Dev_NODES

05/16/2026

>ComfyUI-Mesh Icarus & Daedalus: Split a diffusion model across two GPUs
https://github.com/shootthesound/comfyui-mesh

>Pixal3D-ComfyUI
https://github.com/Saganaki22/Pixal3D-ComfyUI

>ArXiv to Ban Researchers for a Year if They Submit AI Slop
https://www.404media.co/new-arxiv-rules-ai-generated-papers-ban
>>
>mfw Research news

05/18/2026

>DreamSR: Towards Ultra-High-Resolution Image Super-Resolution via a Receptive-Field Enhanced Diffusion Transformer
https://arxiv.org/abs/2605.15682

>ElasticDiT: Efficient Diffusion Transformers via Elastic Architecture and Sparse Attention for High-Resolution Image Generation on Mobile Devices
https://arxiv.org/abs/2605.15684

>Self-Prompting Diffusion Transformer for Open-Vocabulary Scene Text Editing via In-Context Learning
https://hongxiii.github.io/mstedit

>Echo-Forcing: A Scene Memory Framework for Interactive Long Video Generation
https://arxiv.org/abs/2605.16003

>One Pass Is Not Enough: Recursive Latent Refinement for Generative Models
https://arxiv.org/abs/2605.15309

>Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
https://arxiv.org/abs/2605.15980

>Evaluating Design Video Generation: Metrics for Compositional Fidelity
https://arxiv.org/abs/2605.16223

>Sound Sparks Motion: Audio and Text Tuning for Video Editing
https://amirhossein-razlighi.github.io/Sound_Sparks_Motion

>Tuning-free Instruction-based Video Editing Via Structural Noise Initialization and Guidance
https://arxiv.org/abs/2605.15533

>Do Less, Achieve More: Do We Need Every-Step Optimization for RL Fine-tuning of Diffusion Models?
https://arxiv.org/abs/2605.15855

>GenShield: Unified Detection and Artifact Correction for AI-Generated Images
https://arxiv.org/abs/2605.16122

>Efficient Image Synthesis with Sphere Latent Encoder
https://arxiv.org/abs/2605.15592

>Neutral-Reference Prompting for Vision-Language Models
https://arxiv.org/abs/2605.15615

>HyperDiT: Hyper-Connected Transformers for High-Fidelity Pixel-Space Diffusion
https://arxiv.org/abs/2605.15741

>Registers Matter for Pixel-Space Diffusion Transformers
https://arxiv.org/abs/2605.16147

>RaPD: Resolution-Agnostic Pixel Diffusion via Semantics-Enriched Implicit Representations
https://arxiv.org/abs/2605.15908
>>
>>108855284
>>108855287
thanks!
>>
BUSHY
ARMPIT
FARM
WOMEN
>>
what's the best client and model to use on an AMD 6800 XT?
>>
>>108855361
I have a 6950 XT. I'm on Linux and use ComfyUI. For models I use Flux Klein 9B, but you may want the smaller Flux Klein one.

I suspect at this point people are using gguf to save memory.

the thing is, you have to learn to use linux, you have to use venv and install python stuff, so idk, it's like a gaping time hole.

btw, for anime, it's all about anima v1.
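
For the venv part, a minimal sketch using only Python's standard `venv` module. The helper name `make_env` and the directory name are made up for illustration, and this does not cover picking the right PyTorch build for an AMD card.

```python
import venv
from pathlib import Path


def make_env(env_dir: str, with_pip: bool = True) -> Path:
    """Create a virtual environment and return the path to its interpreter.

    `make_env` is a hypothetical helper, not part of ComfyUI.
    """
    venv.create(env_dir, with_pip=with_pip)
    root = Path(env_dir)
    # POSIX layouts put the interpreter in bin/, Windows in Scripts/.
    for candidate in (root / "bin" / "python", root / "Scripts" / "python.exe"):
        if candidate.exists():
            return candidate
    raise FileNotFoundError(f"no interpreter found under {root}")
```

After that, packages get installed with the returned interpreter (`<interpreter> -m pip install ...`), so nothing touches the system Python.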
>>
File: zit_00011_.png (1.66 MB, 1024x1504)
>>
File: zit_00012_.png (1.61 MB, 1024x1504)
>>108855507
>>
>>108855361
oh yeah, I should obviously mention Zit, I use it the most. Z Image Base.

lol
>>
Is there a way to use reference background images for t2i? Depth and canny aren't enough.
>>
>>108855690
no, it's impossible
>>
>>108855690
Inpainting? Just oldschool photobashing?
>>
File: debo_cd-a_anima1_00061_.png (2.91 MB, 1792x1194)
>>
>3 hour old thread
>barely any posts
>>
>>108855894
It's midnight in america chud
>>
>https://xcancel.com/LodestoneRock/status/2056533258746396705
Damn. I could also use new gpu
>>
Claude is having me install llama shit for a node, compiling shit for 5090.

Wish me luck.
>>
>Fucking piece of shit Civitai always breaks at night.
>Tfw Night NEET

REEEEE
>>
Anima diffusers FUCKING when
>>
File: file.png (1.47 MB, 1024x1024)
>>
>>108855953
nice! time to burn them on some shitty useless experiments which will be abandoned midway, instead of finetuning anima on his dataset!!!
>>
Nodes 2.0... whatever happened there?

https://github.com/Comfy-Org/ComfyUI_frontend/discussions/12330

A lot of my nodes are broken in this vue (cancer) rendering system.
>>
>>108856053
Nice, it's actually working, allowing me to use all 81 frames from any 5-second video I've done previously.

Hate how ltx prefers an essay of useless prompts.
>>
>>108855953
>LodestoneRock
>>
>>108855953
@Comfy can you send me one too?
Fax: 9844 1529
>>
>>108855953
WHY CAN'T THIS STUPID NIGGERMONKEYFAGGOTRETARD JUST DO A SIMPLE FINETUNE
>>
bruh i found my old 1.5 pngs and loading them into newer models with all the schizo weights produces some shit ill tell u wat
>>
File: 1775577372194749.jpg (876 KB, 1248x1824)
>>
File: file.png (1.83 MB, 1024x1024)
>>108856609
Mass copying my prompts from midjourney to sdxl based furry porn models does some pretty weird shit too.
>>
File: file.png (1.7 MB, 1024x1024)
>>
File: ComfyUI_00291_.jpg (2.95 MB, 3584x4608)
>>
File: file.png (1.95 MB, 1024x1024)
>>
>>108856600
More fun to try something new and complicated I'd guess
>>
File: ComfyUI_00037_.png (1.31 MB, 832x1216)
>hourglass figure

hmmmm
>>
What is the tech or node I'm looking for to run 2 gens with everything the same except the model? Right now I gen with one, switch the model manually, then gen again, but I'm sure there is a better way.
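
One way to script this outside the graph, as a rough sketch: export the workflow in API format, copy it once per model, and queue each copy through ComfyUI's `/prompt` endpoint. This assumes the model is loaded by a `CheckpointLoaderSimple` node (other loaders use a different class name and input key) and that the server is at the default `127.0.0.1:8188`. The checkpoint filenames and the helper names are placeholders.

```python
import copy
import json
import urllib.request


def with_checkpoint(workflow: dict, ckpt_name: str) -> dict:
    """Return a copy of an API-format workflow with the checkpoint swapped."""
    variant = copy.deepcopy(workflow)
    for node in variant.values():
        # Assumption: the model comes from a CheckpointLoaderSimple node.
        if node.get("class_type") == "CheckpointLoaderSimple":
            node["inputs"]["ckpt_name"] = ckpt_name
    return variant


def queue_prompt(workflow: dict, server: str = "http://127.0.0.1:8188") -> dict:
    """Queue one workflow on a running ComfyUI server."""
    payload = json.dumps({"prompt": workflow}).encode("utf-8")
    request = urllib.request.Request(
        f"{server}/prompt",
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())


# Tiny stand-in for an exported workflow; a real one has many more nodes.
base = {
    "1": {"class_type": "CheckpointLoaderSimple",
          "inputs": {"ckpt_name": "model_a.safetensors"}},
    "2": {"class_type": "KSampler",
          "inputs": {"seed": 42, "model": ["1", 0]}},
}

# Same seed and settings, only the model differs.
variants = [with_checkpoint(base, name)
            for name in ("model_a.safetensors", "model_b.safetensors")]
```

Calling `queue_prompt(v)` for each entry in `variants` would then run both gens back to back with identical settings.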
>>
>>108856775
did you use it like a tag, or as a sentence like "The woman’s body has an hourglass figure."?
>>
>>108856600
>REEE
>>
>>108856782
a tag, copypaste from old prompt
>>
File: 1751299255204229.png (3.22 MB, 1328x1640)
>>
File: 1766421016849093.png (3.19 MB, 1328x1640)
>>
File: 1757950955990739.png (3.38 MB, 1328x1640)
bnuy
>>
File: 1750986445863106.png (3.63 MB, 1536x1536)
>>
>>108855690
of course there is
>>
File: 1771277954849476.png (3.15 MB, 1328x1640)
oink oink
>>
>>108855690
reference? you mean targeting? make a mask with REMBG.
>>
>>108856929
and brotip: you can invert the masks.
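
A rough sketch of what the invert trick buys you, in plain Python on raw 8-bit grayscale bytes. It assumes you already have a subject mask (for example from REMBG) as one byte per pixel, 255 meaning subject. The function names are illustrative and not from any library.

```python
def invert_mask(mask: bytes) -> bytes:
    """Flip an 8-bit mask so the subject mask becomes a background mask."""
    return bytes(255 - value for value in mask)


def composite(fg: bytes, bg: bytes, mask: bytes) -> bytes:
    """Blend two single-channel images; mask=255 keeps fg, mask=0 keeps bg."""
    if not (len(fg) == len(bg) == len(mask)):
        raise ValueError("fg, bg and mask must have the same length")
    # Integer alpha blend with rounding, one byte per pixel.
    return bytes(
        (f * m + b * (255 - m) + 127) // 255
        for f, b, m in zip(fg, bg, mask)
    )


subject_mask = bytes([255, 255, 0, 0])      # left half is the subject
background_mask = invert_mask(subject_mask)  # right half is the background
```

The subject mask tells the inpaint which pixels to regenerate, and the inverted one protects the subject while the background is replaced or pasted in from the reference.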
>>
>>108856929
That makes it an i2i, right?
>>
File: 1752400304222579.jpg (791 KB, 2048x1128)
>>
>>108856991
how do you reference an image without it being i2i?
>>
Anybody use Claude TUI for image gen?
>>
File: ss.jpg (450 KB, 1864x1385)
>>108855690
>>
>>108857039
is the unsloth version of klein 9b any less censored than normal? isn't it pure snake oil for image models?
>>
File: ComfyUI_00073_.png (1.4 MB, 1024x1024)
>>
>>108857059
i run the unsloth gguf because it's a smaller file and works on my 16GB VRAM / 32GB RAM setup. i doubt it's less censored in any way.
>>
File: Anima_00659_.png (422 KB, 896x1152)
^_^
>>
File: Anima_00662_.png (637 KB, 896x1152)
>>
I never see people posting their gens to Anima lora galleries on civit. It's weird.
>>
>>108857227
civit only upvotes the most indian images so I won't put anything I like into that toilet
>>
>>108857227
I post them once or twice per day. It gets drowned immediately. It's unironically better to stick to older models like chroma if you want visibility.
>>
>>108857227
all the best slop gets posted on twatter nowadays
>>
File: anima_baseV10_00228_.jpg (516 KB, 1432x1840)
>>
someone make adetailer for anima please, I am so tired of hard slopped eyes that we have the tech to fix
>>
>>108857197
did u copy
>>
File: anima_baseV10_00232_.jpg (469 KB, 1432x1840)
>>
>>108857278
isn't she cold?
also, anima yume based on 1.0 when?
>>
>>108857291
Use gimp and i2i on low denoise.
>>
>>108857327
i'd fix it by inpainting with illustrious, I just want other lazy retards to have a tool I can tell them to use to stop posting shit like the image above
>>
>>108857241
you can hide any boring members
>>
File: anima_baseV10_00240_.jpg (461 KB, 1432x1840)
>>
File: anima_baseV10_00244_.jpg (598 KB, 1456x1952)
>>
>>108857321
i can't wait for ANY competent anima finetune (base is just not usable by itself, image burning into undetailed flat color blobs even at cfg 4, etc.)
>>
File: ComfyUI_00293_.jpg (1.9 MB, 3584x4608)
Gyaruren
>>
File: ComfyUI_00294_.jpg (2.47 MB, 3584x4608)
>>
File: ComfyUI_00132_.png (1.73 MB, 1504x1000)
>>108857439
how do I stop filthy males from being genned?
>>
stop using piece of shit overfit sdxl garbage
>>
File: ComfyUI_00295_.jpg (1.37 MB, 3584x4608)
>>108857468
>1girl, solo,
>>
>>108857468
that is clearly another woman with breasts, but just write solo, retard.


