[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


Janitor acceptance emails will be sent out over the coming weeks. Make sure to check your spam folder!


[Advertise on 4chan]


Discussion and Development of Local Image, Video, and Music Models

Previous: >>109200742

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://huggingface.co/models
https://civitai.com
https://civitaiarchive.com
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Krea 2
https://huggingface.co/krea/Krea-2-Raw
https://huggingface.co/krea/Krea-2-Turbo

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>mfw Resource news

07/04/2026

>Ambit: Local-first desktop manager for AI image libraries
https://github.com/AsuraAce/ambit

>Orion4D MetaPrompt Custom Nodes for ComfyUI
https://github.com/orion4d/Orion4D_MetaPrompt

>Qwen3.5 INT8 ConvRot Text Encoders for ComfyUI
https://huggingface.co/Winnougan/Qwen-3.5-INT8-Convrot-Comfy

07/03/2026

>Krea-2 Depth ControlNet-LoRA
https://huggingface.co/Patil/Krea-2-depth-controlnet

>Multi-Resolution Flow Matching: Training-Free Diffusion Acceleration via Staged Sampling
https://github.com/Xingyu-Zheng/MrFlow

>DiffRGD: An Inference-Time Diffusion Guidance Through Riemannian Gradient Descent
https://diffrgd.github.io

>Representation Distribution Matching for One-Step Visual Generation
https://alan-lanfeng.github.io/rdm

>SAB-LVLM: Significance-Aware Binarization for Large Vision-Language Models
https://github.com/LyuQi127/SAB_LVLM

>Style-CCL: Content-Preserving Style Transfer via Curriculum Continual Learning
https://github.com/witcherofresearch/Qwen-Image-Style-Transfer
https://github.com/Tele-AI/TeleStyle

>ByteDance-Seed / PAR
https://huggingface.co/ByteDance-Seed/PAR

07/02/2026

>PAPA: Online Personalized Active Preference Alignment
https://github.com/NasikNafi/papa

>Condensing Large-Scale Datasets Directly with Minimal Information Loss
https://github.com/LINs-lab/CIM

>VisReason: A Large-Scale Dataset for Visual Chain-of-Thought Reasoning
https://y-research-sbu.github.io/VisReason

>Asset Generator for 2D & 3D: Blender add-on that generates assets from text prompts
https://github.com/tin2tin/Asset_Generator-2D-3D

>ComfyUI-TrixLoader: All-in-One Image Loader, Editor, and Resizer node for ComfyUI
https://github.com/trx7111/ComfyUI-TrixLoader

07/01/2026

>Elastic Diffusion Transformer: Accelerating SOTA generation models
https://github.com/wangjiangshan0725/Elastic-DiT

>Boogu-Image-0.1-Edit-Turbo
https://huggingface.co/Boogu/Boogu-Image-0.1-Edit-Turbo
>>
>mfw Research news

07/04/2026

>Visual Semantic Entropy: Do Vision Language Models Recognize Visual Ambiguity?
https://arxiv.org/abs/2606.31407

>PhotoQuilt: Training-Free Arbitrary-Resolution Photomosaics via Bootstrapped Tiled Denoising
https://kooroshrh.github.io/photo-quilt

>MindFlow: Harmonizing Cognitive Semantics and Acoustic Dynamics for Facial Animation Generation in Dyadic Conversations
https://arxiv.org/abs/2606.27779

>Gradient Smoothing: Coupling Layer-wise Updates for Improved Optimization
https://arxiv.org/abs/2606.30813

>Rank-Aware Hyperbolic Alignment for Vision-Language Dataset Distillation
https://andyj1.github.io/raha

>On Test-Time Scaling for Vision-Language Models
https://arxiv.org/abs/2606.28864

>Clearer Sight, Fewer Lies: Oriented Pickup Preference Optimization for Multimodal Hallucination Mitigation
https://arxiv.org/abs/2606.29805

>Steal the Patch Size: Adversarially Manipulate Vision-Language Models
https://arxiv.org/abs/2607.00174

>Spatially Localized Image Degradation Embeddings for Image Quality Assessment
https://arxiv.org/abs/2606.29162

>NURBS Splatting: A Unified Differentiable Rendering Framework for Vector Graphics
https://arxiv.org/abs/2606.31764

>$μ$Flow: Leveraging Average Images for Improving Generalisation of Deepfake Faces Detectors
https://opontorno.github.io/MuFlow

>SPECSIA: Stylization Dataset for Novel-View Enhancement in Drawing-based 3D Animation
https://arxiv.org/abs/2607.00525

>Resonant Brane Splatting for Arbitrary-Scale Super-Resolution
https://arxiv.org/abs/2606.29453

>When Sinks Help or Hurt: Unified Framework for Attention Sink in Large Vision-Language Models
https://arxiv.org/abs/2604.03316

>Stateful Token Reduction for Long-Video Hybrid VLMs
https://arxiv.org/abs/2603.00198

>Universal Image Immunization against Diffusion-based Image Editing via Semantic Injection
https://arxiv.org/abs/2602.14679
>>
cum
>>
Why does Krea hate the word "CUM"
>>
another night of zitjeet seething over krea? you bet!
>>
This general has become unbearable, I have 1girl photo realism fatigue, especially for girls with asian faces. There were very few gens I was glad to see here, like gibbon gens.
>>
File: 1771140976287772.png (1.7 MB, 1024x1024)
1.7 MB PNG
pretty neat that krea knows teto natively and a lot of other stuff. also you can add specifics easily like "teto has 0401 on her arm".
>>
>>109202691
I've seen Krea doing a lot of popular characters, but what about lesser known ones?
Has anybody found a character Krea doesn't know?
>>
File: 034401CUI_00001_.png (1.5 MB, 1152x1536)
1.5 MB PNG
>>
>>109202690
/ldg/ lives and dies by the asian 1girl
>>
Blessed thread of frenship
>>
File: debo_is_k2_00011.png (2.5 MB, 1024x853)
2.5 MB PNG
>>
The artifacts work out nicely kek
>>
>>109202690
Be the change you want to see anon
>>
>>
>>109202690
if you want artistic stuff, use midjourney. local artistry is dead now
>>
File: 1767128354163904.png (1.83 MB, 1024x1024)
1.83 MB PNG
>>
>>109202788
a grown man made this gen
>>
>>109202763
>I suffer from skill issue, the post

How much are they paying you shill?
>>
>>109202696
Mortal Kombat characters.
>>
>>109202799
Compared to Krea MJ is basically like an SD1.5 tier model. No soul, outputs too basic, and that is especially true when we can use Krea both as an art tool that trumps anything MJ outputs, but also 2nd pass thru other models like Anima to enhance the artistry of 2D gens.
>>
File: 042736CUI_00001_.png (1.16 MB, 1152x1152)
1.16 MB PNG
>>
>>109202799
I sometimes wonder if you gen using a tablet or something or a really small monitor
>>
https://huggingface.co/RuneXX/LTX-2.3-Workflows/blob/main/Video-2-Video/Extend-Any-Video/LTX-2.3_-_V2V_Extend_Any_Video_Multi-Extend_long_video.json

ltx video extend is still funny to mess with, video source is god of woke by playstation

https://files.catbox.moe/3t1o5a.mp4
>>
>>109202848
>Nogen

I accept your concession
>>
>>109202799
looks like ass, you're legit blind dude
>>
>>109202481
>>109199883
i put
>(illustration:-3), (anime:-4.75), (cartoon:-3.75)
in one of the concatenate text boxes in the default krea2 workflow and all it did was turn it into anime
... guess it has to be fed directly into the clip text encode just like for wildcards
>>
File: ComfyUI_temp_pkyyb_00059_.png (3.7 MB, 1344x1728)
3.7 MB PNG
>>
>>109202688
you think about him every moment of your day anon? I think you fall in love with him kek
>>
>>109202686
>Why does Krea hate the word "CUM"
Censorship Filter, writing no no words is dangerous anon!
>>
File: gato.jpg (397 KB, 1360x768)
397 KB JPG
>>
File: ComfyUI_temp_pkyyb_00062_.png (3.58 MB, 1344x1728)
3.58 MB PNG
thanks to the anon who recommended the Krea-2-Turbo-Projector-Scale-LoRA-Diffusers, I can finally generate the word "CUM" and now my life can go on
>>
>>109202947
Nice SD 1.5 image anon, takes me back
>>
File: 00012-570797482.png (1.23 MB, 1024x1024)
1.23 MB PNG
its here as i promised :)
https://gofile.io/d/ph7bdY
>>
File: ComfyUI_temp_pkyyb_00063_.png (2.37 MB, 1728x1344)
2.37 MB PNG
>>109202954
Glad that you like it
>>
>>109202947
are you using Krea Raw? Because Turbo would never produces such a shit image lol



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.