[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion and Development of Local Image and Video Models

Previous: >>108914462

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: zit.png (2.26 MB, 1536x1280)
2.26 MB PNG
>>
>>108921206
>>Illustrious
>https://rentry.org/comfyui_guide_1girl
Safe to remove now I don't think anon will have multi day sperg outs about illust no longer being in OP
>>
>>108921227
I will flood the thread with shotas if you do so
>>
>>108921227
No, they should add another entry for Pony.
>>
>>108921227
I still use Illustrious...
>>
>mfw Resource news

05/27/2026

>InvokeAI 6.13.0
https://github.com/invoke-ai/InvokeAI/releases/tag/v6.13.0

>NVIDIA CUDA 13.3 Enhances GPU Development
https://developer.nvidia.com/blog/nvidia-cuda-13-3-enhances-gpu-development-with-tile-programming-in-c-compiler-autotuning-and-python-updates

>Scheduled Style Injection: Expanding the Style-Content Pareto Frontier in Training-Free Diffusion-based Style Transfer
https://github.com/ameyskulkarni/scheduled_style_injection

>Feedforward 3D Editing Learns from Semantic-Part Transformation
https://dennis-jwweng.github.io/pxform

>Reinforcing Few-step Generators via Reward-Tilted Distribution Matching
https://github.com/Harahan/RTDMD

>Paper Doll Studio: Local-first tool for creating paper-doll wardrobe assets
https://github.com/Khurramali1997/paper-doll-studio

>‘Lobotomized’: Character.AI Is Showing What AI Enshittification Looks Like
https://www.404media.co/lobotomized-character-ai-is-showing-what-ai-enshittification-looks-like

>Tech CEOs are apparently suffering from AI psychosis
https://techcrunch.com/2026/05/27/tech-ceos-are-apparently-suffering-from-ai-psychosis

>Soap2Soap: Long Cinematic Video Remaking via Multi-Agent Collaboration
https://github.com/showlab/Soap2Soap

05/26/2026

>Bonsai Image · Ternary 4B — Unpacked FP16 Safetensors
https://huggingface.co/prism-ml/bonsai-image-ternary-4B-unpacked

>Anima Turbo LoRA [Official]
https://civitai.com/models/2560840/anima-turbo-lora

>Comfyui-Anima-Regional-Conditioning
https://github.com/Sen-sou/Comfyui-Anima-Regional-Conditioning

>HoloFair: Unified T2I Fairness Evaluation and Fair-GRPO Debiasing
https://github.com/1059684669/HoloFair

>Where Detectors Fail: Probing Generative Space for AI-Generated Image Detection
https://github.com/Amamiya-C/PROBE-AIGI-Detection

>4KLSDB: A Large-Scale Dataset for 4K Image Restoration and Generation
https://4klsdb.github.io

>Trajectory-Consistent Calibration for Cache-Acceleration
https://github.com/NJUDeepEngine/TCC
>>
>mfw Research news

05/27/2026

>Helix4D: Complex 4D Mesh Generation
https://snap-research.github.io/helix4d

>LongCat-Video-Avatar 1.5 Technical Report
https://meigen-ai.github.io/LongCat-Video-Avatar-1.5-Page

>Paris 2.0: A Decentralized Diffusion Model for Video Generation
https://arxiv.org/abs/2605.26064

>MRT: Masked Region Transformer for Layered Image Generation and Editing at Scale
https://arxiv.org/abs/2605.27235

>Beyond Pairwise Preferences: Listwise Reward-Aware Alignment for Diffusion Models
https://arxiv.org/abs/2605.26491

>Personalized Generative Models for Contextual Debiasing
https://arxiv.org/abs/2605.26353

>Quantized Keys Steal Attention: Bias Correction for KV-Cache Compression in Video Diffusion
https://arxiv.org/abs/2605.26266

>ReCA: Multi-Shot Long Video Extrapolation via Recursive Context Allocation
https://reca.vmv.re

>Towards Controllable Image Generation through Representation-Conditioned Diffusion Models
https://arxiv.org/abs/2605.27343

>Erased but Exploitable: Black-box Embedding-Aware Prompting Against Unlearned Text-to-Image Diffusion Models
https://arxiv.org/abs/2605.26332

>PARE: Pruning and Adaptive Routing for Efficient Video Generation
https://arxiv.org/abs/2605.27336

>SoftCap: Soft-Budget Control for Diffusion Transformer Acceleration
https://arxiv.org/abs/2605.27075

>Everything at Every Scale: Scale-Invariant Diffusion with Continuous Super-Resolution
https://arxiv.org/abs/2605.26032

>Diff-Instruct with Diffused Reward: Towards Principled One-step Generator RL
https://arxiv.org/abs/2605.24001

>Squeezing Capacity from Multimodal Large Language Models for Subject-driven Generation
https://zsh2000.github.io/squeeze-mllm-subject-gen

>Gemini Embedding 2: A Native Multimodal Embedding Model from Gemini
https://arxiv.org/abs/2605.27295

>JLT: Clean-Latent Prediction in Latent Diffusion Transformers
https://arxiv.org/abs/2605.27102

>Recursive Flow Matching
https://jhhuangchloe.github.io/RecFM
>>
File: 1760580034947706.png (1.42 MB, 582x1130)
1.42 MB PNG
i stopped caring about all ai "news" since 99% of it goes nowhere, if the new tech is actually good, everyone will start using it, advertising it through word of mouth everywhere, post examples that are noticably better, and the tech will automatically end up in all frontend/backend projects. everything else is academycuck citationbait.
>>
File: 1773991829058259.png (97 KB, 1054x1021)
97 KB PNG
Just how exactly does this config file work? What am I supposed to uncomment exactly?
>>
>>108921273
if you dont understand that, while at the same time being too retarded to even ask ai, then quit this hobby and wait 5 years until ai can also automatically set everything up for you and change your diapers too.
>>
>>108921291
>if you dont understand that
I understand it but I'm also tired right now after work
>>
baka baka baka
>>
>>108921190
kek
>>
My dream woman is a farm girl that spits randomly.
>>
>>108921346
hot
>>
>>108921346
https://youtu.be/3x1qPT3Pjlo?si=Ap1Lyufe11cZ_cyI
>>
>>108921227
This
>>
>>108921272
yeah the "link guy" is pretty annoying
>>
roundhouse kick newsfag down a hole and piss on him
>>
File: 212614CUI_00001_.png (1.75 MB, 1344x1728)
1.75 MB PNG
>>
File: q_2piwuh.png (1.05 MB, 1536x1024)
1.05 MB PNG
>>
>>108921253
>https://developer.nvidia.com/blog/nvidia-cuda-13-3-enhances-gpu-development-with-tile-programming-in-c-compiler-autotuning-and-python-updates
>CUDA Tile C++
Surely this will be noticeably more performant than python, no?
>>
>>108921442
probably yeah
>>
>>108921442
>>108921454
0/10
>>
>>108921461
hello saar
>>
File: Untitled.png (22 KB, 983x312)
22 KB PNG
>>108921461
What are you trying to say? Paranoid ass nigga.
>>
>>108921442
>https://developer.nvidia.com/blog/nvidia-cuda-13-3-enhances-gpu-development-with-tile-programming-in-c-compiler-autotuning-and-python-updates

Translation: "come deeper into our walled garden".
>>
>>108921485
oh i'll come deep alright
>>
File: Anima_00047_.png (1.82 MB, 1024x1536)
1.82 MB PNG
for 16GB VRAM in cumfart
15GB GGUF vs 22GB MXFP8
who is the fastest?
>>
File: q_1du3hx.png (1.08 MB, 1024x1536)
1.08 MB PNG
>>
I just wanted to brag that I figured out a thing.
>>
I figured out a thing.
THING THING THING THING THING THING THING THING THING THING
>>
https://ledger.somantix.ai/posts/comfy-ui-won-t-train-on-your-art-just-on-how-you-make-it/
Uh oh
>>
Plank scale chip manufacturing with quantum computing and 90 trillion open model weights for real time image gen? When?
>>
File: mom.webm (1.96 MB, 704x1280)
1.96 MB
1.96 MB WEBM
wan seems to mistake the peach fuzz from hyper real image gens for weird spots or lint or something
>>
>>108921658
Realtime image gen based on a realtime llm would be wild.
>>
>>108921646
>Comfy Products means Comfy Cloud, Comfy API, and Comfy Enterprise — and these are what the new Terms govern. If you run ComfyUI on your own machine, you’re outside this contract. If you launch Comfy Cloud, you’re inside it. The same underlying engine, in two distinct legal regimes, depending on where the compute happens.
based comfy wants to destroy big business users while protecting local chads
>>
>gives a million dollars to train local SOTA anime
>fucks over APIfags with legalese while leaving localchads out of all the bullshit
How does based Comfy do it?
>>
>>108921678
if they did it arbitrarily to paying customers they will come for the local users too. give it time
>>
In this moment I am euphoric.
>>
>>108921700
Dumbass
>>
>>108921710
what is dumb about predicting corpo enshitification?
>>
>>108921646
> A VFX freelancer has been using ComfyUI on her workstation
>This week she gets a brief that requires the work to run on hosted infrastructure with a SOC-style security profile. She clicks “Launch Cloud.”
If she's required to work there, it's not her issue but an issue of the one who pays her since her "work" already belongs to them.
>>
>>108921714
>>108921700
absolute retard. lmfao.
>>
>invoke project becomes entirely community-led and non-commercial
>api nodes added
fascinating
>>
>>108921728
Imagine if instead they ported the front end to work with comfy back end and thus all the existing community nodes
>>
>>108921718
this is the funniest part. she created her workflow on her workstation, so it was never her property to begin with. unless she has some explicit carveout with her employer, PIIA assigned all ownership to the company for assets made with company resources
>>
>>108921739
I saw a list the other day of 40+ different front ends for ComfyUI, if you can't find something that works for you either make your own or gen with something else.
>>
>>108921646
>https://ledger.somantix.ai/posts/comfy-ui-won-t-train-on-your-art-just-on-how-you-make-it/

So wait? What? Comfy submarined users?
>>
>>108921756
Sure, NOW it's trivial to vibe your own but back when invoke first went tits up that wasn't really a thing.
>>
File: q_29f1d8.png (3.7 MB, 2048x1536)
3.7 MB PNG
>>
>>108921718
>her
>she
>her
>>
>>108921670
>latinamilfgod is back
based
>>
>>108921778
cool, I want to play this game.
>>
>>108921227
>>108921248
>be furry
>stuck with illustrious until the end of time
Feel like a cuck but at least I don't need to reinvent all my workflows every 3 months
>>
>>108921778
Nice gen
>>
i accumulated a lot of new 1girls i need a better videogen model to gen us kissing i cant keep using fucking wan 2.2 reeeeeeeeeeeeeeeee
LTXniggers hurry up give me local seedance 2.0 and i wont be antisemitic online anymore
>>
FAT 1girl wrinkly in her 40's.
>>
File: obese women.png (324 KB, 604x652)
324 KB PNG
>>108920970
see picrel
>>
NO
>>
All your tokens are belong to us!
>>
>>108921895
he talks like a fag but he is right
>>
File: 1769874624637510.png (54 KB, 179x152)
54 KB PNG
(elephants foot:1.1)
>>
>>108921905
They exist. Without getting my permission, my cousin exercised her cute fat off. It's a catastrophe.
>>
>>108921931
I don't believe you
>>
>>108921833
You're just not trying hard enough.
>>
>>108921810
85% of new hires for AI focused work are, or identify as, female
>>
>>108921967
bitchy middle managers?
>>
>>108921819
>>108921832
thx. running some old xl gens through qwen i2i.
>>
>>108921947
I really don't care what india thinks.
>>
>>108921996
have some national pride
>>
If you like lxtgirl you ain't white.
>>
File: q_vv9ry8.png (2.39 MB, 1536x1536)
2.39 MB PNG
>>
>>108922058
so many... holes to choose from
>>
>>
My ace step 1.5 xl sft chops are improving.

not my "keeper", but enough to show.
https://files.catbox.moe/caew5f.mp3
>>
File: file.png (436 KB, 764x985)
436 KB PNG
https://www.reddit.com/r/StableDiffusion/comments/1tplsmr/using_depth_maps_and_weight_noising_to_get_better/

big if true
>>
>>108922265
whoa

is that legit?
>>
>>108922265
I'm not a nerd, explain what he's trying to say
>>
Started working on making my own LoRA just to prove to the naysayers that I can, in fact, do it.

Autocaptioning is a fucking mess. Just not even close, most of the time. At minimum I'll be doing manual cleanup on all the captions.

Sourcing high quality images is also tricky. At the risk of poisoning the well I've been removing watermarks with Klein. I can get high res images with trivial ease if I download the watermarked ones and 'fix' them. Just have to hope I'm not destroying too much good data in the process.

Mostly though I just can't believe anyone thinks this is a good workflow for genning images for fun. I understand making LoRAs to get praise from strangers on the internet, and I have some use cases in future where LoRAs make excellent sense and I plan to use them. But for 1girl? fuck this lol this sucks
>>
>>108922278
loras suddenly work. if true.
>>
>>108922265
Officially merged into AI Toolkit when?
>>
>>108922265
the face looks better on the standard training, the new method already looks overfit (same makeup/hairstyle from dataset)
>>
>>108922265
isn't this repackaging old method as new? am I losing my mind
>>
File: 1775758904683754.jpg (2.12 MB, 1456x2128)
2.12 MB JPG
>>
>>
>>
File: 1767160911809545.jpg (1.79 MB, 1456x2128)
1.79 MB JPG
>>
cooking up some kinos
>>
>>108922322
Cool look
>>108922413
Ah the traditional American cuisine, healthy and delicious!
>>
>>
File: file.png (24 KB, 721x229)
24 KB PNG
>>108922317
perhaps
>>
>>108922368
>>108922413
obsessed
>>
kawaii seems to make ace step think you want vocaloid-style overly high vocals. Once I understood it, I realized it's actually doing a good job. You have to be aware of this fact that vocaloids are in the dataest, and I don't think there's a way to keep them from creeping in.
>>
>>108922303
dude, she isn't at all recognizable in the standard training image
>>
>>108922517
fatty
>>
>>108922531
It's reaaaaally hard to lose weight if you eat out every day.
>>
>>108922534
its only hard if you are a goy
>>
>>108922536

I don't like calling it "goyslop". jews eat goyslop too, it's just kosher goyslop. picrel.
>>
>>108922517
>>
Local Diffusion?
>>
The theory: human hunger is invariant to social conditions.

The reality: every modernizing country has an obesity epidemic

two situations:
>eating sitting across from the girl that you are almost dating. she likes you

>eating anonymously around people you never met and will never know
>>
File: toriga_comfyui_00001_.png (1.85 MB, 896x1152)
1.85 MB PNG
>>
>>108922534
fat retard
>>
File: comfyui_00001_.png (647 KB, 896x1152)
647 KB PNG
>>
>>108922575
I like it, but can you add a burqa?
>>
File: comfyui_00002_.png (746 KB, 896x1152)
746 KB PNG
>>108922575
>>
File: comfyui_00003_.png (692 KB, 896x1152)
692 KB PNG
>>108922579
>>
>>108922593
kek
>>
I snipped this out, kinda like how it lags.

https://files.catbox.moe/doz0wd.mp3

ace step 1.5 from song I'm genning, but the song part didn't work out.
>>
>>108922527
Yeah, I hope the new lora thing works. I want to be able to tank up on loras.
>>
File: Untitled.png (742 KB, 1344x626)
742 KB PNG
Pixal 3D
>>
>>108922581
>>108922593
lol'd
>>
Is Kijai going to make quantized versions of Sulphur 2?
>>
anothe ace step gen, it skipped some lyrics, partially probably because of my complex prompt and lack of 5hz lm, which I never really use anymore, because I like it without it.

https://files.catbox.moe/edtjl2.mp3
>>
8^)

*fingers sparkle*

The holy prompter of the light. We read, "Even a man who has nothing can still offer his life."
>>
File: ComfyUI_00018_.png (1.08 MB, 1416x736)
1.08 MB PNG
https://files.catbox.moe/s3vms8.mp3
>>
>>108922610
impressive
>>
>>108922710
kek
>>
File: q_wwse75.png (2.38 MB, 1536x1536)
2.38 MB PNG
>>
https://x.com/SenseTime_AI/status/2059288626773799110

Actual open source.
>>
>>108922817
Next time I go to the public pool, I'm going to take a giant shit in it and announce on twitter how I'm open sourcing my shit to the world.
>>
>>108922827
Apache , MIT or GPL?
>>
File: q_6gj4te.png (3.07 MB, 1536x1536)
3.07 MB PNG
>>
>>108922610
is it better than the trellis 2 which supposedly had some "fix" a few months ago or something like that so it should be better than on launch?
>>
>interleaved generation
wat is
>>
>>108922610
Thanks for the demonstration, Lodestone.
>>
>>108922610
what is this?
I prompt big boobs in 3d and it'll make a blender ready model for it?
>>
File: 021830CUI_00001_.png (1.7 MB, 1344x1728)
1.7 MB PNG
>>
File: q_0w7yev.png (2.61 MB, 1536x1536)
2.61 MB PNG
>>
>>108922843
Apache poo.0
>>
>>108922887
But corpos may steal your shit!
>>
My farts just keep smelling better and better.
>>
>releases shit under GPL
>corpos are now forced to shit in this pool from now on
>>
File: ComfyUI_00567_.png (586 KB, 896x1152)
586 KB PNG
>>
>>108922368
>>108922413
can u poast the others you did i didnt download them in time :(
>>
>>
>>108923014
I like it, but I want her to have a round chin...
>>
>>108922993
I don't post here usually, you might be thinking of someone else. Last thing I posted here was a couple weeks ago and it got me banned
>>
is it me or does anima have that kind of a look where the image doesnt P O P but unironically? everywhere you look in the images it just kinda feels the same, for example noobai is obviously unstable, opinionanted and limited, but theres an obvious idea to its generations, especially with the colors, the outlines, the framing, the backgrounds and how they contrast to the subject etc.

i guess anima is then good for a stable base, where it doesnt try anything too crazy for the composition nor colors leading to good potential lora creation, but i think its probably a good idea to have an actual official tune that will push it towards a more artistic creative compositions, naturally highlighting the subject through different means and other stuff shitjourney does well for example, especially creative but good composition.

are these bigger finetunes planned or am i missing something?

i dont even have an oled monitor but for me for example noobai was the first model with its colors and linework that seems the most "genuine" to me and i cant detect any ai feel to it through those two things which are noticable on other models that "average out" the colors of the image and make the linework somewhat unnatural, unpropotional, somewhat jagged etc.
>>
File: kim kiss.mp4 (371 KB, 720x720)
371 KB
371 KB MP4
>>
>>108923030
derete
>>
>is it me
yea
>>
>>108923030
Fake. women never smile at me.
>>
File: durga yes sir.mp4 (437 KB, 720x720)
437 KB
437 KB MP4
>>
File: ComfyUI_00008_.png (919 KB, 1344x776)
919 KB PNG
>>
>>108923021
>it got me banned
TJD
>>
>>108923030
>tiny meme resolution
>no interpolation
>slow motion fuckup
>barely any movement
>5s
bro woke up from a coma he slipped into on day 1 of wan 2.1 release and used the same wf
>>
>>108923052
he posts content. you don't. he wins.
>>
>The pixels she generates are protected by the no-training pledge.
>The recipe that produced them is not.
>>
File: ComfyUI_temp_sifqz_00003_.png (1.82 MB, 1024x1016)
1.82 MB PNG
>>108923052
if you buy me a new computer i'll do better
>>
>>108923079
u dont need a better pc, just to wait a couple minutes more
>>
>>108922952
too old
but the goat is ok though
>>
I was 3 weeks in prison, what did I miss?
>>
>>108922610
looks pretty decent

was the source material already textured like that or does it bring the texture resolution down?
>>
>>
>>108923104
was anima v1 already out? i don't remember.

else quite a few local model releases, none of which stand at the top of anime/1girl but could be interesting regardless.

oh and a pretty nice ltx2.3 "timeline editor" type of node https://github.com/WhatDreamsCost/WhatDreamsCost-ComfyUI started becoming somewhat popular as far as I can tell, it's not technically just 3 weeks old tho
>>
>>108923116
3D textures always look shit. You can do some projection and controlnet copes, but at the end of the day anything you want to put into a game or video production needs to either be retextured or so inconsequential that nobody even notices how shit it looks.
>>
>>108923134
So not yet? AI obviously has the potential to fix this. WAN or the image edit models can change the perspective and simulate the formerly invisible parts credibly at the same level of detail or actually even imagine details at a higher resolution.
>>
>>108922878
nice
>>
File: ComfyUI_26932.jpg (3.18 MB, 1500x1920)
3.18 MB JPG
>>108921646
Will Comfy finally learn how to prompt after stealing from every one of his SaaS rubes?

>>108922265
Could be cool if he decides to support Z-Image in the future (but I won't be holding my breath).
>>
>>108923240
nice racket
>>
>>108921206
AI can ape Homare's art style now? That seems like a pretty deep cut. I thought he was an obscure artist
>>
File: ComfyUI_temp_nhosi_00010_.png (3.8 MB, 1344x1728)
3.8 MB PNG
>>
>>108923253
>Homare
Yeah, he has ~1.4k images on Danbooru. That's a lot. Any model trained on that site should be able to do it.
>>
File: ComfyUI_temp_nhosi_00011_.png (3.49 MB, 1344x1728)
3.49 MB PNG
>>
>>
ace step 1.5 xl sft... CAN scream...
>>
>>108923291
z-base?
>>
>>108921253
>>108921257
thanks!
>>
>>108923296
kissing and plapping sounds status?
>>
>>108923133
>oh and a pretty nice ltx2.3 "timeline editor"
thats cool, so you can basically prompt a full movie through this?
>>
>>108923253
>>
>>108923294
:) round chin
>>
>>108923320
Women who look like that never go into nature :^)
>>
File: 464755.png (717 KB, 1720x758)
717 KB PNG
mayday mayday
>>
File: 1.jpg (393 KB, 824x1204)
393 KB JPG
>>
>>108923431
wrong thread?
>>
>>108923441
you're right, we need a general for ltx2.3 kinnoisseurs
>>
>>108923314
it is a sensible way to work with ltx

i don't think you'll make full movies in cinematic length, but you do you
>>
Does anyone have this ltx2.3 lora by any chance?

https://civarchive.com/models/2454555?modelVersionId=2782555
>>
File: 1764075568049607.png (3.58 MB, 1408x2112)
3.58 MB PNG
>>
File: merian (93).png (253 KB, 1822x2050)
253 KB PNG
>>108921206
I want to use AI to draw me some generic backgrounds and props for comic.
I also want to make a LoRA to better match the artstyle I'm aiming for.
What is the most intelligent model for this use case?
What is the most suitable model to train from?
>>
File deleted.
why porn still looks better on wan2.2, even with Eros/Sulphur massive training, physics bases are totally fucked up >.>

T_T
>>
>>108923451
>i don't think you'll make full movies in cinematic length
why not?
just need to write a good screenplay.
>>
>>108923342
homare \(fool's art\), elegant, 1 girl, {princess zelda, the legend of zelda, pointy ears, blue eyes, green eyes, blonde hair, short hair, crown braid, parted bangs, medium breasts, petite, thong bikini bottom, halterneck, bangle, jewelry, blush, blue gem, earring, white thong, blue bandeau, armlet, alternate costume, triforce print|laura kinney, green eyes, black hair, long hair, petite, small breasts, (toned:0.6), sports thong, criss-cross halter bikini top, choker, cherry blossom print|azula, topknot, black hair, yellow eyes, single hair bun, hair ornament, small breasts, petite, bandeau, dragon print, thong bikini bottom, armlet|korra, dark-skinned female, blue eyes, ponytail, brown hair, hair tubes, medium breasts, aqua bikini, water print, side-tie thong, (toned:0.7)|tifa lockhart, final fantasy, 1girl, red eyes, black hair, long hair, low-tied long hair, slender, large breasts, earrings, black thong bikini bottom, white o-ring bandeau, cleavage, armlet},
, , struggling, flustered, reluctant, aroused, 4 fingers, candid , bikini bottom pull, pussy, sparse pubic hair, pussy juice, nipples, bikini top lift, night, blurry foreground, cave, dark background, sidelighting, cave interior, overgrown, nsfw , rating explicit, (stone ruins:0.6) BREAK ,1 boy, orc, moblin, interspecies, imminent rape, grabbing another's arm, restrained, large male, erection, veiny penis, nude male, bara male, groping, size difference, imminent gangbang, {groping, molestation|licking another's face|forced kiss},
>>
>>108923597
>interspecies
>>
>>108923600
getting Orced
>>
>>108923588
or prompt it from chatgpt.
>>
File: 1773505579166547.png (3.93 MB, 1408x2112)
3.93 MB PNG
>>
>>108923546
Check Klein9b
>>
File: 426745.png (782 KB, 1800x1033)
782 KB PNG
bro is dead :skull:
>>
>>108923665
Where's the 1girl???? ????
>>
>>108923710
shredded by a hail of bullets shortly prior
>>
this is a blue board...
>>
>>108923741
lol, I thought this was /b/
>>
>>108923546
lora train relatively well with most of the popular models but it is still a matter of some experimentation and not all subjects train equally well on all models.

maybe qwen/qwen image edit or flux klein can be used, or anima, or z-image[base/turbo] or just illustrious... but if a chroma or even the otherwise not that popular ernie or hunyuanimage or some other model works best for your type of backgrounds I'm also not going to be entirely surprised.

i actually think it'll be a matter of trying a bunch of them, won't necessarily be quick.
>>
uuh... how do you enable torch.compile on https://github.com/67372a/LoRA_Easy_Training_Scripts ?
>>
>>108922858
If I'm being perfectly honest. Pixal 3D is only good from the angle you give it. Like it nails the shape. But it falls apart quick from any other angle. I think it works best for simple objects.

I still prefer trellis 2 in general.
>>
>>
>>
>>108923863
very nice fren
>>
>>108923869
but the plant is in the way, you can't even sit with her. cursed image.
>>
>>108923869
Thanks
>>108923882
This better?
>>
>>108923885
pretty hot
>>
Anyone want to help with corporeal similitudes?
>>
File: file.png (1 MB, 1024x1024)
1 MB PNG
>>
>>108923894
Yeah, except it needs a little card on the table "reserved for anon"
>>
File: ComfyUI_00568_.png (826 KB, 896x1152)
826 KB PNG
>>
>>108921206
Modern web is pure incompatible shit. I blame web developers for this. Do you have any idea how much work I need to do to deal with your incompatible shit and yoursecurity layer bullshit and your hacker packet interceptor crap? I hate webdevs. You all suck.
>>
>>108923919
k
>>
>>108923919
This isn't the webdev thread.
>>
>secretly tore up an anti-AI art student's painting today
What did you guys accomplish
>>
and then everyone clapped
>>
The anima trainer defaults to 15 epochs. Is that really enough?
>>
>>108923994
do you really need more?
>>
>>108923716
>>108923904
and you people complain about me. yikes.
>>
oh boy autismo
>>
just go to sleep...
sssshhhh, it'll all be ok
https://suno.com/s/s65rrDABZEFe5Noe
https://www.youtube.com/watch?v=nE9wjxqtq0g
>>
>>108922282
> to the naysayers that I can
So they were right.
>>
>>108924054
no, im busy generating kinos
>>
>>108924092
Exactly why I hardly game now...
>>
>>108924092
which you didn't post. convenient.
>>
>>108924098
it isn't done yet
>>
>>108923973
>>secretly tore up an anti-AI art student's painting today
The only thing that could make this gayer is if you only imagined doing it instead of actually doing it which we all know is what happened.
>>
>>108924118
whatever you say lol
>>
>>108924135
i know what you're trying to do. you won't bait me
>>
LORAs are shit.
How long till I can completely substitute them with character sheets.
>>
>>108924257
klein can do it
>>
>anima is trained on lolis but not on obscure waifus
hmm
>>
>>108924335
>whoring your waifu out in a dataset
shiggy diggy
>>
anyone tried using Anima edit / reference images? i'm not sure how to apply multiple reference images
>>
File: 36357566.png (661 KB, 1767x730)
661 KB PNG
bro died for isarel. RIP to a real nigga
>>
File: 215186CUI_00001_.png (896 KB, 1152x896)
896 KB PNG
Mornin'
>>
File: ComfyUI_00574_.png (603 KB, 896x1152)
603 KB PNG
>>
is there any website which hosts irl loras now?

Also, what is the meta fo making instagram thots 1girls now? Bee out of the game for a while
>>
File: krea2.png (1.62 MB, 853x1531)
1.62 MB PNG
Krea has the opportunity to save local.
Any bets on how they screw it open weighting it?
Aggressive SFT? Safety tune it? Strangle license?
Don't care if it's Krea 2 small or w/e. It would still be better than the slop models we've been getting lately.
>krea 2 picrel
>>
File: 7653678.png (705 KB, 1804x746)
705 KB PNG
>hey i see some API cucks down there
>>
i need to stop generating and look for a job
>>
>>108924684
you can put 'prompt engineer' on your resume now
>>
Can someone redpill me on NovaAnimeXL? How is it different from WAI? The description doesn't make it clear.
>>
>>108924695
meme
>>
File: Rosa__00020_.png (1.12 MB, 1024x1024)
1.12 MB PNG
>>
File: radiance.jpg (127 KB, 736x1280)
127 KB JPG
>>108924695
if i recall correctly, most nova* usually defaults to pretty large breasts among other things, and it's massive/gigantic breasts are usually really big

finetunes/merges are often some stylistic changes and different biases for, idk, 1girl body shapes and clothing items, it can be hard to tell what EXACTLY has changed tho.
>>
File: 785383616307917.png (3.22 MB, 1152x1600)
3.22 MB PNG
>>
File: 46375.png (616 KB, 1236x704)
616 KB PNG
look at this crazy sun of a gun
>>
File: 1778082265684480.png (383 KB, 1128x1437)
383 KB PNG
>>108924688
my genius PE skills are only for sdxl
>>
File: ComfyUI_00579_.png (890 KB, 896x1152)
890 KB PNG
>>
I updated my comfy and now regular image2image is very slow. Anyone else?
>>
File: apply cosmos reference.jpg (135 KB, 1767x477)
135 KB JPG
>>108924391
This worked. Just chain apply cosmos reference together and latent reference. Both subjects needs to be exactly the same resolution though.
>>
>>108924823
>he updooted
>>
File: q_25rbdf.png (2.69 MB, 1536x1536)
2.69 MB PNG
>>
am i the only one who thinks anima's kissing knowledge is weak? it feels unnatural
>>
>>108925074
Try a French model.
>>
>>108924684
There's probably unironic job listings looking for people who know how to set up stuff like ComfyUI for their business. This general forgets that even just installing ComfyUI goes over 99% of every normie's head.
>>
File: ComfyUI_00581_.png (643 KB, 896x1152)
643 KB PNG
>>
>>108925166
i am the senior git puller
>>
>>108925078
no but really, a bust up profile picture shouldn't look this bad
https://litter.catbox.moe/xk37mv.png
>>
File: 45578.png (542 KB, 1232x700)
542 KB PNG
>mfw
>>
File: ComfyUI_00584_.png (1.11 MB, 896x1152)
1.11 MB PNG
how to make MJ standing straight?
>>
File: radiance.jpg (89 KB, 736x1280)
89 KB JPG
>>
File: 376471291511928.png (1.32 MB, 1216x832)
1.32 MB PNG
>>
File: kino alert.gif (44 KB, 220x220)
44 KB GIF
ok i think i am done watching the kinovision for today. here were my favorites
https://files.catbox.moe/51qpkn.mp4
https://files.catbox.moe/av091l.mp4
https://files.catbox.moe/eyf2r0.mp4
https://files.catbox.moe/hzuzo3.mp4
https://files.catbox.moe/ihiqdh.mp4
https://files.catbox.moe/2xhesm.mp4
>>
File: dasg.jpg (153 KB, 1402x853)
153 KB JPG
im done with gen 1girl boobs. now im developing useless autistic novel AI architectures and watch them. if that gets boring too, i will search a hobby irl
>>
File: 124708CUI_00001_.png (1.86 MB, 1344x1728)
1.86 MB PNG
>>
>>108925411
I recommend picking up an instrument and studying music in general. That is a bottomless pit.
>>
File: 130137CUI_00001_.png (1.75 MB, 1344x1728)
1.75 MB PNG
>>
>>108924969
yeah that's what i tried, significantly slower inference than with 0/1 reference and it's hard to indicate which image should be the "base"
>>
>>108925453
electronic music maybe, other genres are kinda ded
>>
>>108925546
rock and roll will never die
>>
File: 131151CUI_00001_.png (2.1 MB, 1248x1824)
2.1 MB PNG
>>
File: 1750148816031921.png (994 KB, 1475x705)
994 KB PNG
Starting with ComfyUI, I was originally going to use WAI-Illustrious-SDXL v17 since that seems to be the go-to recommendation, but I only have 4GB of VRAM, so I’m wondering if SDXL is even the right choice for my setup.

Edists like pic rel and simple white background concept-art of chracters is basically what I’m aiming for, nothing too crazy:

>flat colors
>soft shading
>low texture complexity
>minimal background detail
>no heavy lighting effects or particle spam

Would a good SD1.5 anime model be enough for this instead of SDXL? If so, does anyone have recommendations for a reliable model from a trusted source?
>>
File: 32be5e94e2f69177.gif (939 KB, 320x327)
939 KB GIF
>>108925588
>but I only have 4GB of VRAM
>>
>>108925588
Anima should work
>>
>>108925588
>I only have 4GB of VRAM
try sd1.5
>>
File: Fv8CNCTWwAItbuf.jpg (245 KB, 1280x960)
245 KB JPG
>>108925588
>SD1.5
anyone who thinks that's still relevant is ngmi
>>
>>108925573
it's ded jim
>>
>>108925628
I don't think any music genre can die, people still listen to classical music, but sure you're not going to get massive following with it.
>>
>>108925693
join the electronic music train, even symphonic orchestras are doing it now

https://www.youtube.com/watch?v=p-EX2AEDhNs
>>
File: 1771316918082583.png (665 KB, 832x1273)
665 KB PNG
>>108925612
Kek.
I'll probably upgrade if I end up enjoying it.
>>108925617
Thanks.
>>108925622
As long as I can make accurate edits like the post before or like like pic rel, I'm happy.
Made pic rel with online models and a bit of photoshop, figured it's time to start local and see if I can get better rez and less constraints.
>>
>>108925588
>Edists like pic rel and simple white background concept-art of chracters is basically what I’m aiming for, nothing too crazy:
Anon, the model's hardware requirements isn't determined by the style or the perceived simplicity of the image. Most base models are style omnipotent. Even 1.5 can produce detailed images under the right conditions.
>>
suggest me comfyui custom nodes/extensions that I can build to boost my github repo rating lel
>>
>>108925735
make an uninstaller
>>
>>108924523
It'll be a DoA flux finetune like krea1, calm down lol.
>>
>>108925741
what if I code a new local diffusion software in C++?
>>
File: 105921CUI_00001_.png (1.31 MB, 832x1216)
1.31 MB PNG
>>
>>108925735
>boost my github repo rating
what purpose does this serve exactly? increase your izzat?
>>
>>108925758
mayb I'll finally get employed
>>
>>108925715
If you want anime focused stuff then go for Illustrious based finetunes like WAI and such since they are still the best so far, Anima is good I guess but with less features compared to IL since it's still too new, also not to mention that you lack the VRAM necessary to get everything working smoothly as you want.
>>
>>108925749
expect a nuclear level meltdown if it hits more than 35 stars
>>
>>108925779
it's too early for you to meltdown over your crush
>>
>>108925777
Well but in this case it's better to start with Anima since my VRAM is low, it sounds like you're telling me to go with WAI anyways?
>>
>>108925807
anima uses more vram than sdxl. you have it backwards
>>
File: 5647587.png (2.66 MB, 1920x1088)
2.66 MB PNG
>>
>>108925588
>4gb of vram
at that point i'd just buy an 8gb vram card for $50 or less, don't even bother man. at least then you'll be right as rain for sdxl.
>>
>>108925816
Anon, you're confusing me to no end.
Look I just installed Comfy UI I was going to use a model based on SD1.5 since it's lighter.
>Anything V5
>Counterfeit
>MeinaMix
>AbyssOrangeMix
Some anon suggested Anima and I was going to go with that so what am I getting wrong here? I don't plan to use SDXL but SD1.5 based models.
>>108925853
I see.
Might as well.
>>
>>108925853
>50 or less
Lmao
>>
Running ComfyUI through Open webUI on GrapheneOS via LAN. I can prompt an image and it will generate, but instead of displaying the image in the Open webUI chat, it just saves it to the PCs hardrive. Logs show a web socket error.
What am I missing?
>>
>>108925873
>so what am I getting wrong here
anima is the most memory intensive of your three options. anima sucks most on low end hardware
>>
>>108925873
>Anything V5
>Counterfeit
>MeinaMix
>AbyssOrangeMix
Now that's a museum gallery I haven't seen in a long time.
Look anon, no one uses SD1.5 based models anymore, they were never good to begin with, they were just a starting step for what came afterward.
If you want to get things done and be happy at the end then just use WAI or any other Illustrious model
>>
File: 1777139007041328.jpg (102 KB, 1920x1080)
102 KB JPG
>>108923240
basedbasedbased
MORE tennis skirt jenny
>>
>>108924065
Is this how you people are? "Oh, you don't have your LoRA ready by the end of the thread, I guess you can't do it"?
>>
>>108925903
have you set ENABLE_WEBSOCKET_SUPPORT to true in openwebui?
>>
>>108925903
>Running ComfyUI through Open webUI
y tho, bloat on bloat.
>>
>>108925922
Even with 4g vram?
Look I can spend $200 on a 8G (Yes, it's that expensive over here) but I just want to see if I'll enjoy doing this.
I'll go with WAI as recommended on the starting guide them, hope I can at least run it properly.
>>
>>108925941
That I have not. I will do that now.
>>
>>108925954
>Even with 4g vram?
Dunno, just get a used 12GB 3060, it should be around the same price range, it will get the job done with room to spare for this kind of workload.
And make sure you don't fall the AMD trap.
>>
>>108925954
Should be alright. It will take a little longer than 1.5 but at least you have a smarter model. 1.5 is smart phone level nowadays and really all it's good for
>>
>use flux klein without issues
>update comfy
>the 1megapixel resize node now bricks the workflow completely
>works when bypassed

?????
>>
>>108925981
All right, thanks frens.
>>
>>108925989
what error does it give you?
>>
>>108925992
I've got 4gb vram as well but I'd suggest Anima. The model is smart enough to understand composition and multiple characters. You'd need to use controlnet and other tools with illustrious but that's going to be rough due to the vram. Better to just get the model that doesn't need the tools.
>>
>>108926041
can you stop? There are anons here that find it intolerable at 8gb and the quality of the turbo Lora makes the outputs slop.
>>
>>108926034
Massive one that klein is just getting confused with.

I just need to find another similar node I think.
>>
>>108926059
he's not wrong, sdxl is marginally faster than anima and it requires a bunch of extra shit like controlnets and inpainting to unfuck your gens.
i would rather a 50 second anima gen that does what i want over a 35 second sdxl gen that needs 3 inpainting passes to fix all the errors that add another minute+ to the total gen time.
>>
>>108926059
can you? you've been fudding anima ever since preview1. its time to let go
>>
>>108926082
nobody needs any of that shit for 1girl
>>
>>108925979
>And make sure you don't fall the AMD trap
I never understood this obvious nVidia shill nonsense. My 7900 works perfectly fine. No issues at all.
>>
tdrussell won
>>
>>108926059
Arguing with these shills is a waste of time, if someone decides to trust them then so be it, it should work as a good lesson and experience for them.
>>
man anima really is shit
no idea why it's shilled so hard here
>>
File: 13577.jpg (57 KB, 1000x800)
57 KB JPG
>retards throwing poop around
>klein chads standing by and standing proud
>>
File: ComfyUI_00586_.png (304 KB, 896x1152)
304 KB PNG
>>
>>108926108
Base models are always shit, they are meant to be finetuned into something better
>>
>installing training wheels on a bicycle is making it better
I guess it is better for a certain group kek
>>
>>108926108
it's worth shilling just for the prompt adherence, everything else is icing on the cake.
>>
>>108926118
nice cope
>>
>>108926118
nobody wants to fine-tune a model where the base licence assumes control over their model
>>
>>108926169
>base licence assumes control over their model
who cares
fine tune it and release it on some piracy forum
that makes me think
are there any torrent sites for checkpoints/loras yet?
>>
>>108926181
>who cares?
The people that spend money to fine-tune? Are you illiterate?
>>
i noticed klein makes much more realistic images if you lower the resolution. afterwards, i can upscale it by feeding it as a reference image at a higher resolution
>>
>>108926196
it's not about the money, it's about sending a message
>>
>>108926203
for cumfyorg and tdruss, it is
>>
ace step 1.5 xl sft song :^)

It can do indie grunge :^))))

https://files.catbox.moe/c15vfp.mp3
>>
>>108926196
>what is crypto
>>
>>108926196
doesn't seem to be an issue for the people currently making anima finetunes
>>
>>108926211
Now for a corporeal similitude, and then someone can make the WAN gen, I can't.

or idk maybe I can dunno
>>
>>108926218
>yeah but uhhh those fine-tunes all have catastrophic forgetting or whatever.
that guy is such a fucking retard
>>
>>108926211
lmao
>>
>catastrophic forgetting
>>
>>108926233
it's a very real thing!
>>
>>108926244
what is?
>>
>>108926249
his meds
>>
>>108926249
i forget what we were talking about
>>
>>108926270
catastrophic
>>
File: ANIMA_bface_bad_00001_.png (905 KB, 832x1216)
905 KB PNG
>>108926211
>>108926220
>>
>>108926299
SLAVE BLOCK WHO WILL BID?
>>
File: comfyui.jpg (582 KB, 817x1464)
582 KB JPG
>>
Fresh

>>108926382
>>108926382
>>108926382

Fresh
>>
>>108926218
to be fair they are retarded 1000 ish images ones and they all suck
>>
>>108925378
Artist plox?
>>
>>108926397
luis royo + range murata
>>
>>108926108
>>108926168
>>108926169
>>108926196
Thanks for letting us know you still are be a raped retard
>>
>be russell
>comically evil license
>no platform can operate the model or finetunes efficiently, price per gen just quadrupleted over night due to fees per generated images that have to be forked over to the big guy
>"well, it was always meant to please the local community hehe"
>trollface.png



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.