[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


Janitor acceptance emails will be sent out over the coming weeks. Make sure to check your spam folder!


[Advertise on 4chan]


Discussion and Development of Local Image, Video, and Music Models

Previous: >>109200742

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://huggingface.co/models
https://civitai.com
https://civitaiarchive.com
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Krea 2
https://huggingface.co/krea/Krea-2-Raw
https://huggingface.co/krea/Krea-2-Turbo

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>mfw Resource news

07/04/2026

>Ambit: Local-first desktop manager for AI image libraries
https://github.com/AsuraAce/ambit

>Orion4D MetaPrompt Custom Nodes for ComfyUI
https://github.com/orion4d/Orion4D_MetaPrompt

>Qwen3.5 INT8 ConvRot Text Encoders for ComfyUI
https://huggingface.co/Winnougan/Qwen-3.5-INT8-Convrot-Comfy

07/03/2026

>Krea-2 Depth ControlNet-LoRA
https://huggingface.co/Patil/Krea-2-depth-controlnet

>Multi-Resolution Flow Matching: Training-Free Diffusion Acceleration via Staged Sampling
https://github.com/Xingyu-Zheng/MrFlow

>DiffRGD: An Inference-Time Diffusion Guidance Through Riemannian Gradient Descent
https://diffrgd.github.io

>Representation Distribution Matching for One-Step Visual Generation
https://alan-lanfeng.github.io/rdm

>SAB-LVLM: Significance-Aware Binarization for Large Vision-Language Models
https://github.com/LyuQi127/SAB_LVLM

>Style-CCL: Content-Preserving Style Transfer via Curriculum Continual Learning
https://github.com/witcherofresearch/Qwen-Image-Style-Transfer
https://github.com/Tele-AI/TeleStyle

>ByteDance-Seed / PAR
https://huggingface.co/ByteDance-Seed/PAR

07/02/2026

>PAPA: Online Personalized Active Preference Alignment
https://github.com/NasikNafi/papa

>Condensing Large-Scale Datasets Directly with Minimal Information Loss
https://github.com/LINs-lab/CIM

>VisReason: A Large-Scale Dataset for Visual Chain-of-Thought Reasoning
https://y-research-sbu.github.io/VisReason

>Asset Generator for 2D & 3D: Blender add-on that generates assets from text prompts
https://github.com/tin2tin/Asset_Generator-2D-3D

>ComfyUI-TrixLoader: All-in-One Image Loader, Editor, and Resizer node for ComfyUI
https://github.com/trx7111/ComfyUI-TrixLoader

07/01/2026

>Elastic Diffusion Transformer: Accelerating SOTA generation models
https://github.com/wangjiangshan0725/Elastic-DiT

>Boogu-Image-0.1-Edit-Turbo
https://huggingface.co/Boogu/Boogu-Image-0.1-Edit-Turbo
>>
>mfw Research news

07/04/2026

>Visual Semantic Entropy: Do Vision Language Models Recognize Visual Ambiguity?
https://arxiv.org/abs/2606.31407

>PhotoQuilt: Training-Free Arbitrary-Resolution Photomosaics via Bootstrapped Tiled Denoising
https://kooroshrh.github.io/photo-quilt

>MindFlow: Harmonizing Cognitive Semantics and Acoustic Dynamics for Facial Animation Generation in Dyadic Conversations
https://arxiv.org/abs/2606.27779

>Gradient Smoothing: Coupling Layer-wise Updates for Improved Optimization
https://arxiv.org/abs/2606.30813

>Rank-Aware Hyperbolic Alignment for Vision-Language Dataset Distillation
https://andyj1.github.io/raha

>On Test-Time Scaling for Vision-Language Models
https://arxiv.org/abs/2606.28864

>Clearer Sight, Fewer Lies: Oriented Pickup Preference Optimization for Multimodal Hallucination Mitigation
https://arxiv.org/abs/2606.29805

>Steal the Patch Size: Adversarially Manipulate Vision-Language Models
https://arxiv.org/abs/2607.00174

>Spatially Localized Image Degradation Embeddings for Image Quality Assessment
https://arxiv.org/abs/2606.29162

>NURBS Splatting: A Unified Differentiable Rendering Framework for Vector Graphics
https://arxiv.org/abs/2606.31764

>$μ$Flow: Leveraging Average Images for Improving Generalisation of Deepfake Faces Detectors
https://opontorno.github.io/MuFlow

>SPECSIA: Stylization Dataset for Novel-View Enhancement in Drawing-based 3D Animation
https://arxiv.org/abs/2607.00525

>Resonant Brane Splatting for Arbitrary-Scale Super-Resolution
https://arxiv.org/abs/2606.29453

>When Sinks Help or Hurt: Unified Framework for Attention Sink in Large Vision-Language Models
https://arxiv.org/abs/2604.03316

>Stateful Token Reduction for Long-Video Hybrid VLMs
https://arxiv.org/abs/2603.00198

>Universal Image Immunization against Diffusion-based Image Editing via Semantic Injection
https://arxiv.org/abs/2602.14679
>>
cum
>>
Why does Krea hate the word "CUM"
>>
another night of zitjeet seething over krea? you bet!
>>
This general has become unbearable, I have 1girl photo realism fatigue, especially for girls with asian faces. There were very few gens I was glad to see here, like gibbon gens.
>>
File: 1771140976287772.png (1.7 MB, 1024x1024)
1.7 MB PNG
pretty neat that krea knows teto natively and a lot of other stuff. also you can add specifics easily like "teto has 0401 on her arm".
>>
>>109202691
I've seen Krea doing a lot of popular characters, but what about lesser known ones?
Has anybody found a character Krea doesn't know?
>>
File: 034401CUI_00001_.png (1.5 MB, 1152x1536)
1.5 MB PNG
>>
>>109202690
/ldg/ lives and dies by the asian 1girl
>>
Blessed thread of frenship
>>
File: debo_is_k2_00011.png (2.5 MB, 1024x853)
2.5 MB PNG
>>
The artifacts work out nicely kek
>>
>>109202690
Be the change you want to see anon
>>
>>
>>109202690
if you want artistic stuff, use midjourney. local artistry is dead now
>>
File: 1767128354163904.png (1.83 MB, 1024x1024)
1.83 MB PNG
>>
>>109202788
a grown man made this gen
>>
>>109202763
>I suffer from skill issue, the post

How much are they paying you shill?
>>
>>109202696
Mortal Kombat characters.
>>
>>109202799
Compared to Krea MJ is basically like an SD1.5 tier model. No soul, outputs too basic, and that is especially true when we can use Krea both as an art tool that trumps anything MJ outputs, but also 2nd pass thru other models like Anima to enhance the artistry of 2D gens.
>>
File: 042736CUI_00001_.png (1.16 MB, 1152x1152)
1.16 MB PNG
>>
>>109202799
I sometimes wonder if you gen using a tablet or something or a really small monitor
>>
https://huggingface.co/RuneXX/LTX-2.3-Workflows/blob/main/Video-2-Video/Extend-Any-Video/LTX-2.3_-_V2V_Extend_Any_Video_Multi-Extend_long_video.json

ltx video extend is still funny to mess with, video source is god of woke by playstation

https://files.catbox.moe/3t1o5a.mp4
>>
>>109202848
>Nogen

I accept your concession
>>
>>109202799
looks like ass, you're legit blind dude
>>
>>109202481
>>109199883
i put
>(illustration:-3), (anime:-4.75), (cartoon:-3.75)
in one of the concatenate text boxes in the default krea2 workflow and all it did was turn it into anime
... guess it has to be fed directly into the clip text encode just like for wildcards
>>
File: ComfyUI_temp_pkyyb_00059_.png (3.7 MB, 1344x1728)
3.7 MB PNG
>>
>>109202688
you think about him every moment of your day anon? I think you fall in love with him kek
>>
>>109202686
>Why does Krea hate the word "CUM"
Censorship Filter, writing no no words is dangerous anon!
>>
File: gato.jpg (397 KB, 1360x768)
397 KB JPG
>>
File: ComfyUI_temp_pkyyb_00062_.png (3.58 MB, 1344x1728)
3.58 MB PNG
thanks to the anon who recommended the Krea-2-Turbo-Projector-Scale-LoRA-Diffusers, I can finally generate the word "CUM" and now my life can go on
>>
>>109202947
Nice SD 1.5 image anon, takes me back
>>
File: 00012-570797482.png (1.23 MB, 1024x1024)
1.23 MB PNG
its here as i promised :)
https://gofile.io/d/ph7bdY
>>
File: ComfyUI_temp_pkyyb_00063_.png (2.37 MB, 1728x1344)
2.37 MB PNG
>>109202954
Glad that you like it
>>
>>109202947
are you using Krea Raw? Because Turbo would never produces such a shit image lol
>>
File: ComfyUI_temp_pkyyb_00033_.jpg (1.43 MB, 1728x1302)
1.43 MB JPG
>>109202977
not enough noise for you?
>>
>>109202947
nice
thats the only one ive been using and 0.2 value is pretty good
at 0.25 you sometimes get a black generation and everything past that becomes black
>>
>>109203018
too much plastic and brightness that's for sure, does Krea know what are shadows in the first place?
>>
File: 00027-4260440872.png (1.83 MB, 1024x1536)
1.83 MB PNG
>>
>>109203018
looks like a gen you made at cfg 7 or some shit, I'm not a big fan of Krea but how did you manage to make this so burned? if I was a conspiracy theorist I would believe this is some falseflag to make Krea look worse than it really is
>>
>>109202909
t. Stuck at SDXL and Ponyrealism
>>
File: 1757492368797616.jpg (532 KB, 1448x1448)
532 KB JPG
>>109203043
What strength are you using?
>>
>>109202805
No way. Those are borderline public domain.
>>
>>109203074
what the fuck is that lmao??
>>
>>109203072
ironic since your image is SDXL tier
>>
File: debo_is_k2_00017.png (2.5 MB, 1024x853)
2.5 MB PNG
>>
>>109203074
god tier gen desu
>>
>>109203082
Kek, most you're getting from that trash is mush.
>>
>>109203112
>most you're getting from that trash is mush
exactly, which is why I don't use Krea 2, I know what to expect
>>
>>109203058
Are you my fan or something? you keep replying to all my posts, why you care so much? lol
>>
localkek infighting is so funny. when they fling shit at each-other's gens they always use local models as a baseline.
>sdxl tier
>zitslop
>chroma melt
>fluxchin
but never "nano banana slop". it's like they all know how inferior their local toys are, but tried to pick the least-worst ones to form their cope identity around. better luck next year, localkekkies!
>>
File: this you?.png (13 KB, 220x180)
13 KB PNG
>>109203122
>I don't get it??? I post an image on a public website to get reactions and I got... reactions??? what is happening???
>>
File: 1783224911501071.jpg (357 KB, 800x900)
357 KB JPG
do something with this pic
>>
>>109203128
the reason for that is because api models like nano banana aren't relevant to the discussion.
you already know this of course because you skipped over the most recent SOTA model gpt image 2, in favor of an older model because the big api models get progressively worse each release.
>>
>>109203128
>but never "nano banana slop".
duh? this is a local diffusion thread, why would we talk about API gens in the first place? that's off topic
>>
ironically the cloud image gen thread gens, at least the times i've checked it, have been much worse.
>>
>>109203122
>why you care so much?
because you're polluting this thread with your garbage images, hard to pretend you don't exist when you spam that shit, you're asking to be bullied lol
>>
>>109203116
>>109203128
Ah, so you were a cloudkek all along. Cloud has really fallen apart since Dalle 3, since it was censored your spotlight was over, the threads are dead now (though the gens before it was censored were at least close to being as good as Krea, I'll give you that)
>>
File: Krea 2 turbo is 39th lmao.png (265 KB, 1120x1765)
265 KB PNG
>>109203163
>close to being as good as Krea
what is this schizo talking about? Krea is not even close to the best API models, it's not even the best local model retard
>>
>jeeterboard anon woke up
>>
>>109203181
>Ideogram is 10th
>Krea 2 turbo is 39th
Kreasissies, I don't feel so good...
>>
>>109203181
What good is a model that blocks your request just for mentioning feet? Aside from that, GPT Image is the most slopped and safetymaxxed of all models out there, images will literally never look realistic and will be riddled with slop due to "safety" concerns.
>>
File: nanobananaproface.png (3.62 MB, 2056x1418)
3.62 MB PNG
>>109203128
jeets and sudacas like to pretend that people don't notice NBP slop but only morons can't tell
>>
File: 00042-22452770.png (3.73 MB, 1344x2560)
3.73 MB PNG
>>109203074
if been out of the loop on the current meta for achieving solid clean photorealism with krea 2.
the strength should be 0.7-1. something is wrong with you settings beyond the lora strength anon.
>>
>>109203210
she is absolutely hideous
>>
>>109203207
>What good is a model that blocks your request just for mentioning feet?
Krea 2 can't even write CUM, it's also safetyslopped
>but muhhh 67 jailbreaking custom nodes
they don't work and make the image even more slopped, what now?
>>
File: ComfyUI_00050_.png (967 KB, 1000x1000)
967 KB PNG
>>109203217
>>
>>109203207
>model with no real censorship and does nude out of box if you're not a brainlet who can't bbox prompt is worse than model with actual censorship that doesn't tell you when censoring
Every single one of you think censorship you can't see is better than "censorship" that is not even real. I swear all of you are API shills who are poisoning the well to normalize invisible censorship.
Ideogram is the real local savior who claims to have a "safety filter" to shake off the feminist journalists who will give bad optics.
>>
File: 00049-2807396858.png (3.7 MB, 1920x1280)
3.7 MB PNG
>>109203217
hard disagree anon.
>>
>>109203074
fucking laughed hard at this
>>
this might as well just be called /pag/
>>
>>109203233
>Every single one of you think censorship you can't see is better than "censorship" that is not even real. I swear all of you are API shills who are poisoning the well to normalize invisible censorship.
this, by supporting Krea 2 you're normalizing this shit
>>
/ldg/ - loser degenerate general
>>
>>109203239
Michael Jackson ahh plastic surgery style
>>
>>109203271
this shit was normalized years ago, it doesn't matter, sdxl is proof of that.
>>
>>109203280
>this shit was normalized years ago
absolutely not, SDXL has no censorship filters, it doesn't know some concepts yes, but it doesn't straight up refuses to follow your prompt
>>
File: Krea2_turbo_00003_.png (993 KB, 1024x1024)
993 KB PNG
>>109203262
>>109203273
>>
>>109203280
Have you tried Anima retard?
>>
File: Ideogram__00946_.jpg (974 KB, 1536x2048)
974 KB JPG
>>
>>109203280
I blame BFL, they're the first ones who released a local model with a safety filter (Flux.1 Kontext)
>>
File: Ideogram__00972_.jpg (1.01 MB, 1536x2048)
1.01 MB JPG
>>109203298
>>
I don't understand how Krea get's btfoed by a fucking mugshot of miku holding a cum slut sign without looking fried.
>>
File: 00052-2478846965.jpg (693 KB, 2624x1728)
693 KB JPG
>>
Neither is ideal but I'd take a filter over a poisoned dataset like Flux 1 and Klein
>>
>>109203288
the censorship was baked into the base model by not training it on nsfw concepts.
>>
>>109203314
Bypassable filter*
>>
>>109203315
>>109203314
I prefer a model that doesn't know concepts (we can put that back with further training) than a model trained to refuse prompts, one is just not knowing things, the other is a poisoned/lobotomized cucked shit
>>
File: Krea2_00041_.jpg (1.65 MB, 1680x2160)
1.65 MB JPG
yes, don't use krea2
>>
File: z-image-turbo_00001_.png (1.15 MB, 1024x1024)
1.15 MB PNG
>>109203290
>>
>>109203320
so what do you use, anima and wan?
>>
>>109203348
Z-image turbo and some coom finetunes/loras
>>
>>109203320
>kekstone will fix it!
how is z-image turning out? remember, the new base for finetunes??
what about heckin based china? hidream, boogu image, ernie?? so many uncucked BASED local models getting tons of attention from the community, right?

krea surpassed all your shit in less than a week.
>>
>>109202956
what were your settings for this? the inspector doesnt work for krea loras, or maybe it's aitoolkit
>>
>>109203352
shouldn't you stick to models that aren't poisoned/lobotomized?
whats the difference between a krea 2 nsfw finetune and a zit nsfw finetune? krea 2 finetunes gen porn slop out of the box the same as zit finetunes.
>>
>>109203354
>>109203366
Krea won't do shit nigga, its VAE is too ass to portray realism accurately, it'll always scream AI if you don't have shit covering your eyes

Klein 9b will win in the long term, it has the highest celling, good luck making a serious finetune of a 12b model that uses Qwen vae though, kekekekek
https://huggingface.co/fancyfeast/bigasp-3
>>
File: a.jpg (263 KB, 1024x1184)
263 KB JPG
Hate how you want, I don't think I've seen a model that can handle multiple conflicting character descriptions quite this well before.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.