[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion and Development of Local Image and Video Models

Previous: >>108681463

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: 1759674427098958.png (2.04 MB, 1086x1448)
2.04 MB PNG
no more trolling
>>
this thread is for FRENS ONLY
>>
File: 723864.jpg (1.23 MB, 1254x1254)
1.23 MB JPG
>>
File: 979nqo.png (813 KB, 1024x512)
813 KB PNG
>>
>>108683023
why don't you ever upscale these?
>>
>>108683042
overheating. got an old rig.
>>
>>108683059
>overheating
put a temp limit with MSIAfterburner nigga
>>
>>108683066
I click on that and nothing happens. It be like read only mode or sumpin.
>>
>mfw Resource news

04/24/2026

>MAI-Image-2
https://playground.microsoft.ai/chat

>ComfyUI-NAG-Extended: NAG support for Flux 2 Klein and Anima
https://github.com/BigStationW/ComfyUI-NAG-Extended

>UniGenDet: A Unified Generative-Discriminative Framework for Co-Evolutionary Image Generation and Generated Image Detection
https://github.com/Zhangyr2022/UniGenDet

>VARestorer: One-Step VAR Distillation for Real-World Image Super-Resolution
https://github.com/EternalEvan/VARestorer

>Sapiens2
https://github.com/facebookresearch/sapiens2

>Vista4D: Video Reshooting with 4D Point Clouds
https://eyeline-labs.github.io/Vista4D

>Pre-process for segmentation task with nonlinear diffusion filters
https://github.com/cplatero/NonlinearDiffusion

04/23/2026

>ParetoSlider: Diffusion Models Post-Training for Continuous Reward Control
https://shelley-golan.github.io/ParetoSlider-webpage

>DynamicRad: Content-Adaptive Sparse Attention for Long Video Diffusion
https://github.com/Adamlong3/DynamicRad

>Normalizing Flows with Iterative Denoising
https://github.com/apple/ml-itarflow

>LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model
https://github.com/inclusionAI/LLaDA2.0-Uni

>Illustrious XL & NoobAI-XL Style Explorer
https://github.com/ThetaCursed/Illustrious-NoobAI-Style-Explorer

>AI Model & ‘MAGA’ Influencer Emily Hart Unmasked as Indian Man
https://www.yahoo.com/news/articles/ai-model-maga-influencer-emily-091027504.html

04/22/2026

>Embedding Arithmetic: A Lightweight, Tuning-Free Framework for Post-hoc Bias Mitigation in Text-to-Image Models
https://github.com/cvims/EMBEDDING-ARITHMETIC

>Denoising, Fast and Slow: Difficulty-Aware Adaptive Sampling for Image Generation
https://github.com/CompVis/patch-forcing

>TS-Attn: Temporal-wise Separable Attention for Multi-Event Video Generation
https://github.com/Hong-yu-Zhang/TS-Attn

>AnyRecon: Arbitrary-View 3D Reconstruction with VDM
https://yutian10.github.io/AnyRecon
>>
>mfw Research news

04/24/2026

>AttentionBender: Manipulating Cross-Attention in Video Diffusion Transformers as a Creative Probe
https://arxiv.org/abs/2604.20936

>KD-CVG: A Knowledge-Driven Approach for Creative Video Generation
https://kdcvg.github.io/KDCVG

>Linear Image Generation by Synthesizing Exposure Brackets
https://arxiv.org/abs/2604.21008

>Exploring the Role of Synthetic Data Augmentation in Controllable Human-Centric Video Generation
https://arxiv.org/abs/2604.21291

>AttDiff-GAN: A Hybrid Diffusion-GAN Framework for Facial Attribute Editing
https://arxiv.org/abs/2604.21289

>Projected Gradient Unlearning for Text-to-Image Diffusion Models: Defending Against Concept Revival Attacks
https://arxiv.org/abs/2604.21041

>Sparse Forcing: Native Trainable Sparse Attention for Real-time Autoregressive Diffusion Video Generation
https://arxiv.org/abs/2604.21221

>StyleVAR: Controllable Image Style Transfer via Visual Autoregressive Modeling
https://arxiv.org/abs/2604.21052

>Building a Precise Video Language with Human-AI Oversight
https://linzhiqiu.github.io/papers/chai

>Seeing Isn't Believing: Uncovering Blind Spots in Evaluator Vision-Language Models
https://arxiv.org/abs/2604.21523

>ID-Eraser: Proactive Defense Against Face Swapping via Identity Perturbation
https://arxiv.org/abs/2604.21465

>When Prompts Override Vision: Prompt-Induced Hallucinations in LVLMs
https://pegah-kh.github.io/projects/prompts-override-vision

>Seeing Fast and Slow: Learning the Flow of Time in Videos
https://seeing-fast-and-slow.github.io

>Addressing Image Authenticity When Cameras Use Generative AI
https://arxiv.org/abs/2604.21879

>Multiscale Super Resolution without Image Priors
https://arxiv.org/abs/2604.21810

>Prototype-Based Test-Time Adaptation of Vision-Language Models
https://arxiv.org/abs/2604.21360

>Latent Denoising Improves Visual Alignment in Large Multimodal Models
https://arxiv.org/abs/2604.21343
>>
File: file.png (221 KB, 1120x360)
221 KB PNG
>>108683002
>Why isn't 24GB enough?
you think I'm some sort of poorfag?
>>
File: zImageturbo_00207_.jpg (735 KB, 1520x1824)
735 KB JPG
>>
File: Anima_0015.jpg (1.28 MB, 1344x2496)
1.28 MB JPG
>>
>>108683114
all that gpu power for nothing, it's not like there's a big local model that can be used and is competitive with the best API models
>>
>>108683132
LLMs
>>
> >108683096
> >108683101
Fuck off
>>
File: zImageturbo_00211_.jpg (821 KB, 1520x1824)
821 KB JPG
>>
File: 362468591572682.png (2.01 MB, 1344x1728)
2.01 MB PNG
>>108682974
<- not mine, but she's gorgeous~ I love Raiden and I'll use her as my paper on my phone.
>>
>>108683154
how do you gen these? is it just z image turbo?
>>
File: Anima_0017.jpg (1.56 MB, 1344x2496)
1.56 MB JPG
>>
>>108683157
gpt-image-2
>>
File: zImageturbo_00215_.jpg (532 KB, 1520x1248)
532 KB JPG
>>108683157
Zimage Turbo + lora for photos + lora for likeness. Latent upscale gives realistic detail
>>
File: o4gged.png (958 KB, 1024x512)
958 KB PNG
>>
>>108683158
anima + zit? can you share anima prompt/workflow? i've found that res_2m is really good for realism
>>
File: zImageturbo_00221_.jpg (898 KB, 1520x1824)
898 KB JPG
>>
File: anima_00086_.png (1.16 MB, 1344x2352)
1.16 MB PNG
>>
File: zImageturbo_00229_.jpg (818 KB, 1520x1824)
818 KB JPG
>>
File: 1754898313616880.png (95 KB, 1394x253)
95 KB PNG
>>
>>108683290
fine wine needs time.
>>
>>108683290
aged perfectly, based comfy
>>
>>108683290
Based. He knew API was the future and invested heavily in it. China learned from this and quickly pulled local support for WAN right after. We probably wouldn't have models like GPT-Image-2 without him.
>>
>>108683290
I forget, what model was this about?
>>
File: 384.jpg (387 KB, 1024x1536)
387 KB JPG
>>
>>108683328
it was hunyuanimage 3.0 (80b model lool)
>>
>localpoors threw a fit because hunyuan was too big for their 24gb
aged like wine >>108683002
>>
>>108683339
hunyuan wasn't good though, so it was big and bad
>>
hunyaun was great, but comfykeks wouldn't know because local is verboten
>>
File: zImageturbo_00233_.jpg (807 KB, 1520x1824)
807 KB JPG
>>
File: 1757241711129377.png (1.79 MB, 720x1280)
1.79 MB PNG
>>
File: Anima_0042.jpg (1.06 MB, 1344x2496)
1.06 MB JPG
>>108683352
Nice
>>
the bigger the model, the better it is, it's common sense
>>
File: 1762409100561054.png (65 KB, 512x364)
65 KB PNG
>>108683349
>hunyaun was great
care to show some images?
>>
File: hunyuan3 nbp.png (51 KB, 1136x370)
51 KB PNG
>>108683290
>>108683330
>>108683349
Wait, I looked into this and it's true?? Hunyuan 3 was better than Nano Banana but never received a ComfyUI implementation because it threatened the API nodes ecosystem. Holy shit can we ditch ComfyUI already? It has undoubtedly harmed local thanks to this.
>>
File: zImageturbo_00237_.jpg (811 KB, 1520x1824)
811 KB JPG
>>108683366
ty! nice lora, is it the k-pop girl?
>>
>>108683405
>Hunyuan 3 was better than Nano Banana
source: (((the media))), oy vey
>>
>>108683405
do you enjoy seething since months about comfy being the most relevant local ui and successful?
>>
>>108666242
If you're still here what's the second textbox for that we can't edit?
>>
>>108683405
we should all switch to InvokeAI
>>
>>108683423
it's not local
>>
File: 1764378840943651.png (54 KB, 400x400)
54 KB PNG
>>108683441
>>
>>108683441
how do i run it locally then anon?
>>
So API nodes are local now??? Sweet!
>>
>>108683449
>>108683456
why do you keep feeding him
>>
>>108683458
who said that anon?
>>
>>108683468
it's in the OP
>>
>>108683458
>>108683477
are you really that bored? like you really have nothing else to do with your life? kinda sad when you think about it
>>
File: zImageturbo_00244_.jpg (682 KB, 1520x1824)
682 KB JPG
>>
File: 3228701093.png (131 KB, 918x717)
131 KB PNG
i just discovered ltx2.3 loras
>>
>>108683477
where in op is the statement that "API nodes are local now" anon?
>>
black snape, but ltx 2.3:

https://litter.catbox.moe/gktnj1crp42z1o9g.mp4
>>
>>108683500
I still can't believe they made Snape black. I think it would have been more tasteful to make hermione black if they were shooting for DEI
>>
>>108683513
>I think it would have been more tasteful to make hermione black if they were shooting for DEI
hermione is a female, she's already a DEI
>>
>>108683513
>I think it would have been more tasteful to make hermione black if they were shooting for DEI
ron is already a ginger
>>
*yawn*
>>
I just downloaded the ComfyUI desktop app from the link in the OP. How many credits do I deposit to get started with anima?
>>
File: 00004-2488151787.png (1.76 MB, 896x1152)
1.76 MB PNG
>>
>>108683513
>I still can't believe they made Snape black.
don't underestimate the willingness of the wokies to stir the pot, they're so good at that
>>
>>108683531
making the weasleys black would have been great actually. they already live in a shit hole.
point stands, though snape should never be black. that is the epitome of a white character
>>
File: 9hbwyv.png (1.16 MB, 1024x512)
1.16 MB PNG
>>
n*gbo-esque honestly
>>
snape is an incel, that is white culture >>108683541
>>
>>108682673
from scratch
>>
>>108683541
>though snape should never be black. that is the epitome of a white character
he's literally described as a man with a pale skin on the book lol
>>
>>108683540
>don't underestimate the willingness of the wokies to stir the pot, they're so good at that
They're good at stirring the pot but their shit makes no money. This harry potter remake is going to flop hard because the woke crowd hates jk rowling and the fact that she makes money from everything related to harry potter and it's going to piss off the normal people too so who does that leave to even watch that shit?
>>
>>>/tv/
>>
File: 00005-1951089249.png (1.52 MB, 896x1152)
1.52 MB PNG
>>
>>108683497
careful icarus
>>
They made Snape black? This reminds me of when ComfyUI added API nodes into a UI that was originally meant for local models. It's all about testing the waters until people get too tired to care about it anymore. By that point it's already normalized and the subversive vermin won.
>>
>>108683583
can you answer the question anon? >>108683498
>>
>>108683497
which loras? They seem really hit or miss
>>
>>108683577
fucking slut tease
>>
i NEED another COMPUTER.

(for genning)

I am gaming and need to GEN.

and another one to talk to my virtual people.
>>
>>108683591
lets just say... the kino lora
>>
>>108683540
it's an English series starring English people, they don't give a shit about retarded American Zoomer politics

like you know this guy has a posh London accent and an extensive background in Shakesperean stage productions, right
https://youtu.be/96GAI4ioekM?t=11
>>
File: Chroma_0010.jpg (2.51 MB, 1536x2560)
2.51 MB JPG
>>108683591
this one seems cool
https://civitai.red/models/2557755/retro-90s-anime-style-lora-ltx-23?modelVersionId=2874411
>>
>>108683563
Rowling doesn't give a shit, SHE retconned Dumbledore into being gay herself years after the book series had ended, just as a public anecdote
>>
>>108683605
looks hilariously bad
>>
File: ComfyUI_11227_.png (794 KB, 1024x1024)
794 KB PNG
>>
>>108683603
>it's an English series starring English people
are you implying that woke only happens in the US?? lmao
>>
>>108683610
nobody cares that old nigga dun got kilt
>>
>>108683616
i'm implying American Zoomers are the overwhelming majority of people who give any kind of fuck about the Woke Boogeyman
>>
>>108683615
this is nice, what model?
>>
>>108683629
Anima p3
>>
>>108683583
why is anyone supposed to give a fuck about api nodes in comfyui? it literally doesn't matter.
if they switch to full api support and drop local, doesn't matter because someone will just fork it and local will be completely unaffected.
>but then local won't have all the latest API support!
?
>>
>>108683627
>the Woke Boogeyman
my fucking ass, there's no boogeyman about this, they knew Snape was canonically a white man in the book, they knew that rewriting history and Netflix'ed him into a nigga would stir the pot, they know what they're doing, they're taunting people, and you defend them because you're probably some gay ass liberal who loves this woke slop, right
>>
>>108683627
today I learned that the entire right wing party in america are all zoomers. that's crazy
>>
>tardbo back to shilling groids and api nodes
>>
File: Chroma_0013.jpg (2.32 MB, 1536x2560)
2.32 MB JPG
>>
>>108683647
based zoomers
>>
File: 00008-1139655562.png (1.57 MB, 896x1152)
1.57 MB PNG
toe socks.
>>
I don't want to gen after today's incident. Because I know my gen will be used as bait for investors by Comfy to create speculation that there is activity in local models.
I don't want to be part of this fake engagement system.
>>
File: Chroma_0016.jpg (2.53 MB, 1536x2688)
2.53 MB JPG
>>
>>108683290
I love when daddy comfy decides whats best for me. that way I dont have to think for myself <3



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.