[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion and Development of Local Image and Video Models

Previous: >>108835005

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>24 hours passed since yeterday
>239 NEW LORAS FOR ANIMA in civitAI
nuts for a brand new model
>>
>>108838376
>>108838079
>595 LORAS FOR ILLUSTRIOUS+NOOBAI + PONYV6
Why are you not impartial? Illustrious is still winning
>>
Anon this is the anima thread
>>
>inb4 n*gbo
>>
>>108838481
he's coming to give you a compiled list of news you'll never read and you WILL enjoy it while you hide it.
>>
it it over for local or is local saved
>>
>>108838484
it's over for you. hang it up and just drop already.
>>
>mfw Resource news

05/16/2026

>ComfyUI-Mesh Icarus & Daedalus: Split a diffusion model across two GPUs
https://github.com/shootthesound/comfyui-mesh

>Pixal3D-ComfyUI
https://github.com/Saganaki22/Pixal3D-ComfyUI

>ArXiv to Ban Researchers for a Year if They Submit AI Slop
https://www.404media.co/new-arxiv-rules-ai-generated-papers-ban

05/15/2026

>Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation
https://github.com/thu-ml/Causal-Forcing

>EntityBench: Towards Entity-Consistent Long-Range Multi-Shot Video Generation
https://catherine-r-he.github.io/EntityBench

>ClickRemoval: An Interactive Open-Source Tool for Object Removal in Diffusion Models
https://github.com/zld-make/ClickRemoval

>Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video
https://yyfz.github.io/warp-as-history

>RAVEN: Real-time Autoregressive Video Extrapolation with Consistency-model GRPO
https://yanzuo.lu/raven

>Image Restoration via Diffusion Models with Dynamic Resolution
https://github.com/StarNextDay/SubDAPS.git

>Does Synthetic Layered Design Data Benefit Layered Design Decomposition?
https://github.com/YangHaolin0526/SynLayers

>InsightTok: Improving Text and Face Fidelity in Discrete Tokenization for Autoregressive Image Generation
https://github.com/LeapLabTHU/InsightTok

>ComfyUI-AsymFlow
https://github.com/CanFromEarth/ComfyUI-Klein9B-AsymFlow

>Microsoft Lens: 3.8B parameter T2I model. Available in RL-tuned and fast 4-step Lens-Turbo
https://huggingface.co/microsoft/Lens
https://huggingface.co/microsoft/Lens-Turbo

>snwy/SD1.5-DALLE-2
https://huggingface.co/snwy/SD1.5-DALLE-2/tree/main

>stable-diffusion-webui-codex v0.3.0-beta
https://github.com/sangoi-exe/stable-diffusion-webui-codex

05/14/2026

>Anima base v1.0 released
https://huggingface.co/circlestone-labs/Anima

>TrackCraft3R: Repurposing Video DiT for Dense 3D Tracking
https://cvlab-kaist.github.io/TrackCraft3r
>>
>mfw Research news

05/16/2026

>LatentHDR: Decoupling Exposure from Diffusion via Conditional Latent-to-Latent Mapping for Text/Image-to-Panoramic HDR
https://arxiv.org/abs/2605.11115

>Probing into Camera Control of Video Models
https://arxiv.org/abs/2605.14815

>ZeroIDIR: Zero-Reference Illumination Degradation Image Restoration with Perturbed Consistency Diffusion Models
https://arxiv.org/abs/2605.11435

>ImageAttributionBench: How Far Are We from Generalizable Attribution?
https://arxiv.org/abs/2605.12967

>Improving Diffusion Posterior Samplers with Lagged Temporal Corrections for Image Restoration
https://arxiv.org/abs/2605.12573

>OTT-Vid: Optimal Transport Temporal Token Compression for Video Large Language Models
https://arxiv.org/abs/2605.11803

>Fast Image Super-Resolution via Consistency Rectified Flow
https://arxiv.org/abs/2605.12377

>Learning to Align Generative Appearance Priors for Fine-grained Image Retrieval
https://arxiv.org/abs/2605.09859

>Venus-DeFakerOne: Unified Fake Image Detection & Localization
https://arxiv.org/abs/2605.14091

>Leveraging Multimodal Large Language Models for All-in-One Image Restoration via a Mixture of Frequency Experts
https://arxiv.org/abs/2605.11444

>TMPO: Trajectory Matching Policy Optimization for Diverse and Efficient Diffusion Alignment
https://arxiv.org/abs/2605.10983

>The Truth Lies Somewhere in the Middle (of the Generated Tokens)
https://arxiv.org/abs/2605.09969

>Unlocking Compositional Generalization in Continual Few-Shot Learning
https://arxiv.org/abs/2605.11710

>Rigel3D: Rig-aware Latents for Animation-Ready 3D Asset Generation
https://arxiv.org/abs/2605.13129

>EchoPrune: Interpreting Redundancy as Temporal Echoes for Efficient VideoLLMs
https://arxiv.org/abs/2605.10050

>GRIP-VLM: Group-Relative Importance Pruning for Efficient Vision-Language Models
https://arxiv.org/abs/2605.13375

>LoREnc: Low-Rank Encryption for Securing Foundation Models and LoRA Adapters
https://arxiv.org/abs/2605.13163
>>
Why is Ani shitting his diapes so much recently?
>>
Kill ani in real life, intercept him on his daily route and stab him multiple times with murderous intent
>>
Convince me to care about loss curves and regularization datasets.
>>
>>108838540
You really should care about loss curves and regularization datasets.
>>
Need a Kazuma Kaneko lora for Anima, did anyone make one yet? You can get highres scans of his art here:
>https://archive.org/details/VeskScans2024
>>
>>108838550
Tag it first
>>
>>108838550
>>https://archive.org/details/VeskScans2024
That's quite the archive, anon.
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
File: ComfyUI_02469_.jpg (655 KB, 2048x864)
655 KB JPG
>>
File: 1770216682970819.png (2.28 MB, 1920x1080)
2.28 MB PNG
>>
File: ComfyUI_02448_.jpg (712 KB, 2048x864)
712 KB JPG
>>
https://huggingface.co/TencentARC/Pixal3D
Is 3D gen in doable in comfy?
>>
File: ComfyUI_temp_cyqor_00004_.png (1.5 MB, 1024x1504)
1.5 MB PNG
1girl
>>
what model(s) would be ideal to create kind of surreal, uncanny, dreamlike backroom images similar to these in vibe/appearance? (just images, not videos). I really like the aesthetic/dream quality, but I haven't been keeping up with image gen and don't really want to try a bunch experimenting
>>
>>108838894
search for liminal loras and use whatever model it is for
>>
>>108838915
thanks senpai, I'll give that a shot
>>
>>108838743
yes
>>
any way to force the fan to spin hard from a WSL script?
>>
>>108838950
yes
>>
>>108838992
something like : "spinfan 10" . Spins fan for ten seconds.
>>
who is the most obscure artist you've created a lora for, anon?
>>
File: Velma-SD-00001.jpg (99 KB, 631x885)
99 KB JPG
Qwem image edit keeps getting stuck in ksampler. No freeze, no crash, no gpu stress, no error message. What could it be?
>>
>>108839073
engagement status? lol, where do you think you are, loser? this is a shilling shithole. don't you remember the state of this general days before Anima base dropped? it'll die again after this weekend.
>>
>>108839142
what workflow? default or some civit monstrosity?
>>
>>108839174
Default and a couple of others I found on the internet. All of them have the same problem.
>>
>>108839184
Does terminal say something? Try to make a separate comfy install and run it from there.
>>
>>108839073
schizo twitter artists with less than a couple hundred followers
>>
>>108839190
terminal gets stuck at 'Requested to load QwenImage' and nothing else
>Try to make a separate comfy install and run it from there
I'll give it a shot. Thanks.
>>
File: woods.jpg (1.69 MB, 2304x1792)
1.69 MB JPG
>>108838550
Pick out a set of like 100 meaningful images in that set and reupload it somewhere and I'll caption it and make a lora.
>>
>>108839312
Nta but that is too kind
>>
>>108839312
do you have a guide for captioning stuff?
>>
>>108839387
I just use an LLM to do it, give it the image and ask it to describe the image. I use a local instance of koboldcpp with gemma 4 and a tool I vibe coded that loops over all image in a folder and sends them to koboldcpp with a prompt to describe the image. Takes like 15 seconds per image.
>>
>>108839438
That's pretty cool, did you up the tool somewhere?
I'm too stupid to fix the vibe coding I've gotten out of it.
>>
>>108839447
Nah I haven't uploaded it, maybe https://github.com/jhc13/taggui will work? Haven't looked at that in a while.
>>
>>108839148
oh hello sharty, home you're smell as bad as always
>>
>>108839438
Oh, but my LoRA is for SDXL. Will you still help me? Or are you a paid Anima shill promoting the model by helping anons make LoRAs? Because that is very shady behavior!
>>
anime website
>>
>>108839312
Why don't you help anons make loras for MUGEN or Chenkin, Mr.Kind Soul?
>>
>>108839485
it's actually a loli website
>>
>>108838502
>>108838508
thanks!
>>
>>108839312
I want to make a lora for neta lumina, would you help me, mr anon? ^^
>>
Forget it.
>>
Why are there so many trolls here?
>>
>deblessed
>>
This is a new way of shilling a model.
How low can /ldg/ get when paid shills show up offering free compute for training loras for Anima? This is unacceptable. Jannies need to do something, that doesn't happen even on Discord.
>>
>>108838550
>>https://archive.org/details/VeskScans2024
Any more like this but for other artists?
>>
i am not impressed by anima.
>>
>>108839550
No free compute 4U
>>
>>108839348
Nah, it's probably just tdrusell connected to a ComfyCloud GPU through an API handing out free compute remnant from his training. Like perfume sellers giving out free samples, except here they’re handing out loras. Kinda pathetic desu.
>>
>>108839519
It's likely just one mentally ill schizo.
>>
>>108839622
I don't even know who is who anymore
>>
File: Untitled.png (1.54 MB, 1280x768)
1.54 MB PNG
>>
File: Milk.png (2.65 MB, 1248x1824)
2.65 MB PNG
>>
File: 00218-1068808240.jpg (106 KB, 512x512)
106 KB JPG
is AI still capable of making creepy horrors or has it been completely animeifyd?
>>
So I've been building a list of artists for anima, but using the peak danbooru score for an image with their artist tag to rank them. Anybody do anything similarly autistic for building tier lists?
>>
>>
>>108839729
>is AI still capable of making creepy horrors or has it been completely animeifyd?
yes
>>
>>
>>
>>108839729
>is AI still capable of making creepy horrors or has it been completely animeifyd?
no
>>
>>
>>108839788
>>108839794
>>108839797
your images are so insanely noisy and hard to look at please fucking stop spamming
>>
>>
>>
>>
>>108839815
>>108839808
>>108839806
>>108839797
>>108839794
>>108839788
>>108839780
These look like shit. Hiding mistakes with extreme amounts of retarded detail in all the wrong places. Kill yourself d*bo.
>>
>>
>>108839622
one or two sharteen, they enjoy doing this shit 24/7
>>
>>
>>108839805
>>108839821
post gen
>>
>>108839837
stop fucking spamming you retard we don't want to see your entire fucking gen folder
>>
>>
>>108839851
>>108839821
>>108839805
If it was Anima you would be clapping like a seal. But since it’s not a ComfyOrg model, suddenly it’s “spam” and “noise.” Meanwhile this board’s been drowning in Eva, DBZ, and 2B Anima spam for 2 straight days and none of you say shit.
>>
>>108839913
99% of the gens in this thread right now is one dudes (d*bo)
point towards the anime spam I don't see it right now.
>>
>>108839917
Past threads since Anima released
>>
>>108839922
sure, a bunch of DIFFERENT people posting their uninspired gens is the same thing as one dude spamming the same prompt with a randomizer on it over and over is totally the same thing. fuck right off with that shit.
>>
>>108839938
first gen was like 4 hours ago, >>108838672 are you going to compare tiny 4 hours from the same anon after almost 60 hours of pure anima spam, and noy only gens but comments??
>>
>anti-anima out of nowhere
>>
This is how you can tell anima is "good enough"
>>
File: 1771124790934203.jpg (1.37 MB, 1248x1824)
1.37 MB JPG
>>108839729
>>
File: midsummer.jpg (1.47 MB, 2304x1792)
1.47 MB JPG
>>108839913
>>108839922

Anima base just came out like 2 days ago so of course there's some hype around it retard. This thread will be back to it's dead pace in no time and then you can go back to being the overwhelming majority of posts with your insane trolling.
>>
>>108840049
Curious, from my point of view you are the shillers and trolls who won't leave /ldg/ at peace.
>>
File: 1776199240798954.png (2.29 MB, 2112x1408)
2.29 MB PNG
>>
>>108840049
NTA but hype about what? When Z Image or Klein or any of the 50 Chromas released I didn't spam /ldg/ for three days with gens of Seinfeld, Game of Thrones or offer anons free compute to train their lora of Doctor Who.
>>
File: file.png (1.79 MB, 1394x1360)
1.79 MB PNG
>>108840098
>>
stop crying lilbro
>>
>>108840129
Leave him alone, he can't a detailer or inpaint in anima
>>
>>108840098
The female Truman Show?
>>
File: 1756385469515450.png (2.59 MB, 1536x1984)
2.59 MB PNG
>>108840137
inpainting works with https://github.com/lquesada/ComfyUI-Inpaint-CropAndStitch im just lazy right now
>>
>>108840129
spoopy
>>
File: debo_cs_anima1_00092_.png (1.64 MB, 1408x1126)
1.64 MB PNG
>>
Can Anima do hands properly now?
>>
File: ComfyUI_00175_.png (2.66 MB, 1120x1688)
2.66 MB PNG
>>
>>108838550
There's 400 kazuma kaneko images up on danbooru and the @kazuma kaneko artist prompt changes the image, is the built in anima style bad or did you not try it?
>>
>>108840297
It can be pretty bad sometimes.
>>
>>108840335
Melted eyes
>>108840152
Melted eyes and face

Inpainting and details don't work in Anima. In SDXL I could zoom into any gen and it had depth and sharpness. If I generated a brick wall and did hires fix in SDXL, the result was a brick wall with more molecular details like an 8k photo. Anima doesn't have that. In Anima, I give it a brick wall and apply hires fix, and what changes is the arrangement, size, and shape of the bricks, not the quality and details of the bricks themselves.
>>
>>108840297
it's MUCH better than sdxl but it's still far from perfect
>>
>>108840335
Try fixing the eyes and lips and adding depth with Anima without Anima changing the entire eyes and lips section
>>
File: Anima Face.jpg (268 KB, 720x1013)
268 KB JPG
>>108840152
The same for you, this is the kind of quality that Anima brings you. If you want that face but more polished, you can't. You have to use SDXL for that.
>>
its wild to see people so mindbroken from genning AI images that every minor issue with eyes and hands is seen as a catastrophic failure, meanwhile many celebrated artists often leave eyes and hands only partially resolved on purpose
>>
>>108840375
>>108840365
Now that I look closely, Anima is a shit model if it can't refine like SDXL. Thanks for sharing anon, something seemed off to me when I saw Anima gens, and now it's clear that it's the lack of being able to refine the gen.
>>
File: debo_cs_anima1_00093_.png (1.9 MB, 1408x1126)
1.9 MB PNG
behold, the perfect anima face
please, hold your applause
>>
>>108840351
>>108840365
>>108840375
>>108840389
cry more lol
>>
File: 5357.png (1.29 MB, 864x992)
1.29 MB PNG
damn anima can make ultra kino
>>
File: ComfyUI_00832_Ayakon.jpg (3.76 MB, 4096x2680)
3.76 MB JPG
>>
Since Anima took about 4 months from preview to release I expect just as long that anon will be seething about it.
>>
>>108840389
SDXL couldn't get those details at that resolution to save its life kek. Post your own gen I know you won't.
>>
>>108840496
oh this is SDXL (illustrious) btw, i've not considered really moving to anima yet
>>
didnt ask
dont car
>>
File: 79003065.png (1.71 MB, 1184x992)
1.71 MB PNG
>>
>>108838550
were you really unable to gather just 100 of your favorite images from that archive so anon could train a lora for you?
>>
File: 1760778943233344.png (1.17 MB, 768x1088)
1.17 MB PNG
>>108840523
yeah, we could tell by the generic composition and "1girl, bikini"
>>
File: 758242.png (1.93 MB, 1344x1008)
1.93 MB PNG
>>
>>108840496
>>108840523
And obviously no, you are a professional genner Ayakon, you never felt limited by what these casuals who open ComfyUI once a week say.
>>108840517
I would love to, but I don't post anime in non anime generals!
>>
File: debo_cs_anima1_00094_.png (1.95 MB, 1408x1126)
1.95 MB PNG
>>
>>108840385
Haha anon, the strokes left to free will by illustrators are totally different from this >>108840365
And from this >>108840375
I highly doubt that if I were an artist I would leave that lip as a mess on purpose along with that eye which is just the red pupil, or the elf's face with that senseless luminosity on the nose and the whole facial area poorly done
>>
>>108840385
AI artists are more artistic than anti ai artists confirmed.
>>
>>108840038
kino
>>
File: works on my machine.png (2.6 MB, 1120x1688)
2.6 MB PNG
>>108840365
>>
>>108840669
also it's literally a 2 min fix so i went with whatever seemed closer without bothering to refine it

also also i don't know what artist you're using and i don't care. it's impossible to reproduce the exact face without that information. either way you're just being retarded
>>
File: ComfyUI_00177_.png (2.93 MB, 1120x1688)
2.93 MB PNG
lol, umad?
>>
Why is preview 3 so much better? is this another z-image grift situation.
>>
https://huggingface.co/circlestone-labs/Anima/discussions/151
>Also, many users are interested in better furry support.
>
>>
File: 1778773096874732.png (136 KB, 728x514)
136 KB PNG
I false flagged as anti-AI on /v/ using snailcat image in OP. Thought you might find the thread interesting, over 400 replies: >>>/v/738998648

A lot on unhinged seething, but most anons seemed fine with the idea of using AI to help make games.
>>
>>108840669
>2 minutes
SDXL takes 1 minute to generate at 1.5k and upscale to 2k with adetailer for this same composition
>without bothering to refine it
You can't refine with Anima, that's the whole point
>>108840700
I don't know but from the navel to the pelvis of your character is extremely long, I don't know if it's an upscaler problem but it looks like it has a double pelvis, and from the navel to the sternum also extremly long. Upscaler issje
>>
>>108840717
he's not wrong
>>
>>108840719
i think anons like ai when its one guy chasing a dream but hate ai when its anything else
>>
>>108840492
Based
>>
File: 1773081487153598.jpg (1.31 MB, 2128x1456)
1.31 MB JPG
I like anima because it feels like the actual successor to sdxl.

I dont want my gens to have absolutely perfect details, it should be a little imperfect, like something you see on pixiv, where the fingers are a little smudgy, and some parts you can tell the artist really didn't feel like fucking around with anymore because he's been at the desk for 8 hours and wants to move on.

SDXL tried to be "perfect" which is why alot of hands look manicured and the lineart looks perfectly straight, and the imperfections look obviously "AI"

Anima chads won.
>>
File: ComfyUI_02641_.png (1.09 MB, 1024x1024)
1.09 MB PNG
>>
>>108840767
Majority of this community has half a brain and doesn't understand what a base model is.
Don't believe me? Look at how many of them call Z-Image "Z-Image Base".
Community is broken beyond repair.
>>
>>108840775
me on the left
>>
>>108840767
>actual successor to sdxl.
I meant sd1.5

sd1.5 is more SOVL than illustrious/pdxl slop
>>
File: ComfyUI_00213_.jpg (3.45 MB, 3584x4608)
3.45 MB JPG
>>
>>108840775
Is this Digimon?
>>
File: ComfyUI_00215_.jpg (2.25 MB, 3584x4608)
2.25 MB JPG
>>
File: debo_cs_anima1_00097_.png (1.85 MB, 1408x1126)
1.85 MB PNG
>>
File: ComfyUI_00216_.jpg (1016 KB, 3584x4608)
1016 KB JPG
>>
File: ComfyUI_00305_.png (1.94 MB, 1024x1536)
1.94 MB PNG
>so, you gen in batch size of one



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.