/g/ - Technology


Thread archived.
You cannot reply anymore.




Discussion and Development of Local Image and Video Models

Previous: >>108921206

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: WAN22_00058.mp4 (408 KB, 480x640)
>>
>inb4 n*gbo
>>
Bonsai won
>>
File: 215189CUI_00001_.png (2.44 MB, 1248x1824)
>>
>>108926462
Did it?
>>
>comfy worth billions
>invoke team sleeping on a bed on Adobe money
must sting
>>
File: 160613CUI_00001_.png (2.12 MB, 1248x1824)
Trying to recreate my waifu Lady Argent since she almost doesn't have any art to make loras with.
>>
Is it true that you can't stack LoKr loras together? or just aijeet myth?
>>
>>108926613
Stop being mean and I'll tell you
>>
>>
who is catjak schizoing out on now?
>>
>>108926613
First time hearing this honestly
>>
>>108926613
there isn't anything preventing it but they could stack worse than regular loras, only way to know is to test
>>
>mfw Resource news

05/28/2026

>MAVEN A Multi-Agent Framework for Multicultural Text-to-Video Generation
https://github.com/AIM-SCU/CRAFT

>Bias Leaves a Gradient Trail: Label-Free Bias Identification via Gradient Probes on Concept Decompositions
https://github.com/vitryt/label-free-bias-identification

>VidPrism: Heterogeneous Mixture of Experts for Image-to-Video Transfer
https://github.com/Lrrrr549/VidPrism.git

>Wan2.2-NVFP4-Sparse (NVFP4)
https://huggingface.co/lightx2v/Wan2.2-NVFP4-Sparse

>Microsoft data suggests using AI is more expensive than hiring people
https://finance.yahoo.com/sectors/technology/articles/microsoft-data-suggests-using-ai-225900743.html

>Pixal3d Studio
https://huggingface.co/spaces/victor/pixal3d-studio/tree/main

05/27/2026

>InvokeAI 6.13.0
https://github.com/invoke-ai/InvokeAI/releases/tag/v6.13.0

>NVIDIA CUDA 13.3 Enhances GPU Development
https://developer.nvidia.com/blog/nvidia-cuda-13-3-enhances-gpu-development-with-tile-programming-in-c-compiler-autotuning-and-python-updates

>Scheduled Style Injection: Expanding the Style-Content Pareto Frontier in Training-Free Diffusion-based Style Transfer
https://github.com/ameyskulkarni/scheduled_style_injection

>Feedforward 3D Editing Learns from Semantic-Part Transformation
https://dennis-jwweng.github.io/pxform

>Reinforcing Few-step Generators via Reward-Tilted Distribution Matching
https://github.com/Harahan/RTDMD

>Paper Doll Studio: Local-first tool for creating paper-doll wardrobe assets
https://github.com/Khurramali1997/paper-doll-studio

>‘Lobotomized’: Character.AI Is Showing What AI Enshittification Looks Like
https://www.404media.co/lobotomized-character-ai-is-showing-what-ai-enshittification-looks-like

>Tech CEOs are apparently suffering from AI psychosis
https://techcrunch.com/2026/05/27/tech-ceos-are-apparently-suffering-from-ai-psychosis

>Soap2Soap: Long Cinematic Video Remaking via Multi-Agent Collaboration
https://github.com/showlab/Soap2Soap
>>
>mfw Research news

05/28/2026

>SmartDirector: Keyframe-Conditioned Cinematic Video Generation with Narrative Pacing Control
https://arxiv.org/abs/2605.27891

>OSP-Next: Efficient High-Quality Video Generation with Sparse Sequence Parallelism, HiF8 Quantization, and Reinforcement Learning
https://arxiv.org/abs/2605.28691

>Compositional Text-to-Image Generation Via Region-aware Bimodal Direct Preference Optimization
https://arxiv.org/abs/2605.28615

>Stay Fair! Ensuring Group Fairness in Diffusion Models Across Guidance Scales
https://arxiv.org/abs/2605.28036

>Qwen-Image-Bench: From Generation to Creation in Text-to-Image Evaluation
https://arxiv.org/abs/2605.28091

>DebFilter: Eradicating Biases Stashed in Value
https://arxiv.org/abs/2605.28167

>BiasEdit: A Training-Free Bias-Detect-and-Edit Framework for Learning Fair Visual Classifiers
https://arxiv.org/abs/2605.28450

>Representation-Conditioned Diffusion Models for Guided Training Data Generation
https://arxiv.org/abs/2605.27495

>SIGMA: Semantic-Difference Instruction-Grounding Mask Annotator for Text-Driven Image Manipulation Localization
https://arxiv.org/abs/2605.27924

>Residualized Temporal Sparse Autoencoders for Interpreting Diffusion Models
https://arxiv.org/abs/2605.27813

>Rethinking Video-Language Model from the Language Input Perspective
https://arxiv.org/abs/2605.27920

>EchoAvatar: Real-time Generative Avatar Animation from Audio Streams
https://robinwitch.github.io/EchoAvatar-Page

>Explicit Critic Guidance for Aligning Diffusion Models
https://arxiv.org/abs/2605.27736

>Inpainting-Style Conditional Diffusion for Multivariable Time Series Forecasting
https://arxiv.org/abs/2605.28324

>No Safe Dose: How Training Data Drives Unsafe Image Generation
https://arxiv.org/abs/2605.28137

>MangaFlow: An End-to-End Agentic Framework for Controllable Story to Manga Generation
https://arxiv.org/abs/2605.28173
>>
>>108926800
>>108926811
thanks!
>>
File: .png (24 KB, 659x184)
>>108926811
lol
>>
File: 1779377801324907.webm (748 KB, 370x370)
Are we ever going to have a zit moment again?
>>
File: Flux2-Klein_00033_.jpg (388 KB, 1008x1296)
>>
File: 215190CUI_00001_.png (2.68 MB, 1248x1824)
>>
>>108926873
>Laws around AI tightening
>More and more labs abandon open weight/source
>Almost all model releases now are shitty slop trained research previews
I won't say no but I am not feeling optimistic
>>
File: Flux2-Klein_00038_.jpg (399 KB, 1008x1296)
Trendy social democrat, circa 2050
>>
>>108926956
Que?
>>
trying to do tentacle rape gens in wan 2.2, motion's really slow and boring, is there specific tags/sentences i should add to my prompt that fix this? using that dasiwan lightning nsfw model at Q8.
>>
>>108926800
>>108926811
Fuck off debo
>>
>[WARNING] Dynamic vram disabled with argument. If you have any issues with dynamic vram enabled please give us a detailed reports as this argument will be removed soon.
Yeah, it's dogshit slow and doesn't work properly. Tell Claude to make it faster. End of report.
>>
>>108926971
>motion's really slow and boring
More high sigma steps. Have like 3 steps without any distill lora if desperate. Or just run like 7-8 steps without any high distill loras.
>dasiwan
So, a shitmix? Yeah don't use those
>>
>>108927033
each pass gets 4 steps, i use dasiwan because its the only one that seemed to do everything i wanted, including penetration.
got a better recommendation? trying to actually use lightning loras on their own with base wan was always a pain in the ass.
>>
File: Flux2-Klein_00069_.jpg (424 KB, 1312x976)
>>
here for the sfw vageen
>>
>>108927058
>got a better recommendation
I already told you: base model + specific distill and NSFW loras you need for the task. You can't use shitmixes and expect great results.
>trying to actually use lightning loras on their own with base wan was always a pain in the ass.
That is true unfortunately. My recommendation is to just pick something like 1022, and roll with it. Don't do schizo stuff with distill loras like mixing them with esoteric weights like some people do. Crank step count up when inadequate. Look at the sigma curve for the shift value + step count + scheduler you are using and switch from the high model to the low model around the appropriate sigma value (0.9). (Or use one of those custom nodes that do that automatically)
Again if you are really desperate for motion run a few steps on the high model without any distill loras. You get cfg on the first model this way so you can use slow, still, etc. in the negatives.
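For anyone who wants to eyeball the sigma curve without opening a sampler, here is a minimal sketch. It assumes a linearly spaced base schedule from 1 to 0 and the usual flow-matching shift formula `shift * s / (1 + (shift - 1) * s)`; your actual scheduler may space things differently. The function names and the `shift=8.0` value are made up for illustration, only the 0.9 boundary comes from the post above.

```python
def shifted_sigmas(steps: int, shift: float) -> list[float]:
    """Sigmas for `steps` sampling steps (steps + 1 values, from 1.0 down to 0.0).

    Assumption: a linearly spaced base schedule warped by the flow-matching
    shift formula. Real schedulers (beta, bong_tangent, etc.) differ.
    """
    sigmas = []
    for i in range(steps + 1):
        s = 1.0 - i / steps                       # linear base sigma
        sigmas.append(shift * s / (1.0 + (shift - 1.0) * s))
    return sigmas


def switch_step(sigmas: list[float], boundary: float = 0.9) -> int:
    """Index of the first step that starts below `boundary`.

    Steps before this index would run on the high model, the rest on the
    low model (hypothetical helper, not a ComfyUI node).
    """
    for i, sigma in enumerate(sigmas[:-1]):
        if sigma < boundary:
            return i
    return len(sigmas) - 1


if __name__ == "__main__":
    sigmas = shifted_sigmas(steps=8, shift=8.0)   # illustrative values only
    for i, sigma in enumerate(sigmas):
        print(f"step {i}: sigma={sigma:.4f}")
    print("switch to low model at step", switch_step(sigmas))
```

Raising the shift keeps sigma high for more steps, so the switch point moves later; that is why the same "4 + 4" split can be wrong for one shift value and fine for another.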
>>
>>108927130
hhhhhhhrrrrrrrrmmmmmmmmmm
alright i'll cave and ditch the shitmix and give that a shot. one thing, does the fp8 quanted wan really cripple output quality as bad as people say? big reason i use the q8 mix is the lack of speed loss from stackmaxxing loras. fp8 native speed would help a lot.
>>
goonery
>>
File: ComfyUI_00591_.png (1.29 MB, 896x1152)
g'night
>>
File: Flux2-Klein_00086_.jpg (314 KB, 1136x1136)
>>
>>108927219
box?
>>
>>108927170
There shouldn't be a massive quality hit for any 8bit quant, especially for a larger model like this. Get a scaled one over naive fp8 at least. There are other fp8 variants with various speedup and quality tradeoffs. I know mxfp8 has the highest quality but I have no idea about the speed diff compared to standard fp8 so I can't recommend it unless you want to do some testing.
I am on Ampere so I am an int8 guy anyway.
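A rough illustration of why a scaled quant tends to beat a naive one. This sketch uses int8 with a per-tensor absmax scale as a stand-in, since it can be shown in plain Python; real fp8 formats are floating point and behave differently in the details, so treat this as the general idea only. The weights and function names are invented for the example.

```python
def quantize_int8(values: list[float], scale: float) -> list[int]:
    """Round each value to the nearest multiple of `scale`, clamped to int8 range."""
    return [max(-127, min(127, round(v / scale))) for v in values]


def dequantize(quantized: list[int], scale: float) -> list[float]:
    """Map int8 codes back to floats."""
    return [q * scale for q in quantized]


def max_error(values: list[float], scale: float) -> float:
    """Worst-case absolute round-trip error for a given scale."""
    restored = dequantize(quantize_int8(values, scale), scale)
    return max(abs(a - b) for a, b in zip(values, restored))


# Made-up weights: small magnitudes, as is typical for a trained layer.
weights = [0.013, -0.021, 0.0005, 0.034]

# "Naive": assume the tensor spans [-1, 1] regardless of its contents.
naive_scale = 1.0 / 127
# "Scaled": fit the int8 range to the largest weight actually present.
absmax_scale = max(abs(w) for w in weights) / 127

naive_err = max_error(weights, naive_scale)
scaled_err = max_error(weights, absmax_scale)

if __name__ == "__main__":
    print(f"naive  max error: {naive_err:.6f}")
    print(f"scaled max error: {scaled_err:.6f}")
```

The scaled version spends all 8 bits on the range the weights actually occupy instead of wasting most of the codes on values that never appear.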
>>
>>108927251
coool thanks, im on blackwell and people swear fp8 wan isn't as good but i couldn't really tell the difference. sticking with fp8mixed in that case.
mxfp4 in all models i've tried was fucked for sure though.
>>
File: 175433CUI_00001_.png (801 KB, 1248x1824)
>>
File: ComfyUI_26946.jpg (3.01 MB, 1500x1920)
>[INFO] Prompt executed in 78.34 seconds
>[INFO] Prompt executed in 44.74 seconds
>[INFO] Prompt executed in 50.21 seconds
Wow, the latest Comfy is 25-30sec slower without Dynamic VRAM and the initial load/inference went from ~45sec to 1m18s.

>[INFO] Prompt executed in 82.69 seconds
>[INFO] Prompt executed in 339.17 seconds
>[INFO] Prompt executed in 45.61 seconds
With Dynamic VRAM on it's a fucking shitshow, this must be straight off of my NVME because my system RAM stayed at 16GB of usage (I have 64GB) the entire time. It ran my GPU a lot harder too. I can smoothly watch TV in a media player via my capture card (1080p60) without Dynamic VRAM, and with it enabled I was getting maybe one frame every couple of seconds (the audio wasn't stuttering).

I guess those monstrous regressions in performance are worth millions though, so what do I know...
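If anyone wants to compare runs the same way, a small sketch that pulls the timings out of pasted console output and summarizes them. The log lines are the ones quoted in the post above; the regex only assumes the "Prompt executed in N seconds" wording shown there, and the helper names are made up.

```python
import re
from statistics import mean

# Matches the timing line as it appears in the quoted console output.
EXECUTED = re.compile(r"Prompt executed in ([\d.]+) seconds")


def executed_seconds(log: str) -> list[float]:
    """Return every 'Prompt executed in N seconds' value found in the text."""
    return [float(value) for value in EXECUTED.findall(log)]


def summarize(label: str, log: str) -> str:
    """One-line summary: first (cold) run, mean and worst run."""
    runs = executed_seconds(log)
    return (f"{label}: first={runs[0]:.2f}s "
            f"mean={mean(runs):.2f}s worst={max(runs):.2f}s")


# Copied from the post above.
WITHOUT_DYNAMIC_VRAM = """
[INFO] Prompt executed in 78.34 seconds
[INFO] Prompt executed in 44.74 seconds
[INFO] Prompt executed in 50.21 seconds
"""

WITH_DYNAMIC_VRAM = """
[INFO] Prompt executed in 82.69 seconds
[INFO] Prompt executed in 339.17 seconds
[INFO] Prompt executed in 45.61 seconds
"""

if __name__ == "__main__":
    print(summarize("dynamic vram off", WITHOUT_DYNAMIC_VRAM))
    print(summarize("dynamic vram on ", WITH_DYNAMIC_VRAM))
```

Three runs per setting is a tiny sample, so the mean is dominated by the one 339-second outlier; more runs would be needed before blaming any specific commit.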
>>
>>108927330
Yeah I'm also experiencing a fuckton of performance loss. With or without dynamic vram, 5090, 196gb.
>>
>>108927282
I meant mxfp8 not mxfp4.
mxfp4 is better than standard fp4, but it's still just 4 bits (And nvfp4 is better than both if you are on Blackwell and want to run a 4 bit quant)
Anyway I think fp8mixed should be fine but as I said I don't know all the details of fp8.
>>
>>108927330
Alright I am not pulling for a while thanks for the heads up.
>>
>>108927330
probably all the time it has to take to send your workflow and prompt to comfyui's servers for cataloging
>>
File: ComfyUI_21122.png (2.1 MB, 1600x1200)
>>108927340
Dynamic VRAM used to rape my System RAM to point of pushing everything else into paged memory and green-screening my PC with memory errors and now it's avoiding it like the plague. Makes me wonder what they test on because my old-ass PC isn't anything fancy (5950x/4090/64GB RAM).

>>108927368
I hope they're ready for the influx of error reports when they do remove "--disable-dynamic-vram".
>>
Fugg I posted in the wrong thread
Anyway I'm 9 months sober of this shit, you can do it too anon
>>
>>108927397
>1girl, standing, white background,
>1girl, standing, white background,
>1girl, standing, white background,
>1girl, standing, white background,
>1girl, standing, white background,
aiiieeee, comfy STOP STEALING MY MASTERPIECES, BEST QUALITY!!!
>>
>>108927330
>>108927402
i honestly have no idea why i even poked fun at you several threads ago, your gens are actually pretty damn high quality
you're alright, shovel-faced-roastie-poster.
>>
File: 181611CUI_00001_.png (1.9 MB, 1248x1824)
>>108927330
>>108927340
When did you guys updoot? Guess I won't be gitpulling anytime soon.
>>
>masterpiece, absurdres
>>
>>108927425
There was a significant rework of aimdo backend this week which I am guessing is where the regressions originate from, but well it's their job to figure out precisely what.
>>
>score_9
>>
>greg rutkowski
>>
i still don't know what danbroo tags are
>>
File: Flux2-Klein_00100_.jpg (443 KB, 1136x1136)
>>
>>108927541
dan broo is a guy who made a lot of tags
>>
>literally all it took to fix my problem was switch to wan 2.2 base and use the lightning lora full strength
f-fuck.. i had the formula from the start and just wasnt using it right.. i kneel. total slopmerger death.
>>
>1girl, I don't know why...
>>
>>108926382
WHERE ARE THE ACE STEP GENS IN THE COLLAGE
>>
We need to bring back slavery so I can have a girlfriend.
>>
>click on lora
>guy is using multiple loras in his samples AND artist tags
fuck you if you do this
>>
>>108927620
Yeah, plus custom nodes etc.
>>
File: Flux2-Klein_00122_.jpg (483 KB, 1136x1136)
>>
https://huggingface.co/Qwen/Qwen-Image-Bench
kek we should use this to judge gens
>>
>>108927624
that too, but usually i already have a workflow that works. its just retarded and malicious to upload a style lora and obfuscate it with other loras and style/artist tags

makes no sense, the person should already be aware that it defeats the purpose of showcasing the style
>>
File: 185337CUI_00001_.png (1.77 MB, 1248x1824)
>>108927518
I see. I'm on AMD on top of that so I'll wait a whole month at least.
>>
re-posting my ace step 1.5 xl sft gen, because nobody else is genning muzaks today:

https://files.catbox.moe/c15vfp.mp3

Indie grunge style.

I'm unironically ecstatic to find out some things I can do.
>>
>>108927700
The anima realism lora needed another idk lora plus a custom node to manipulate the lora.

lmao cba
>>
What is the best model for cute pp's?
>>
File: before after 1.jpg (680 KB, 1496x1990)
>>
>>108927744
describe your ideal pp first, bitch
>>
>>108927768
too bad models can't get the letters right yet lol. looks nead.
>>
>>108927775
>nead
you cant get the letters right yet either hehe
>>
>>108927769
Well, I can help, the ideal one is exactly 4", cut, and 2.3" in diameter. The ballsack is lopsided.
>>
>>108927781
7.7
>>
How can you guys go from 1girl spam to discussing penis sizes?
>>
can wan do front twerking, like breakdancing?
>>
>>108927804
oh wow did you assume everyone's GENDER? *plenty* of women know how to use Python.
>>
my wife knows how to use Python
>>
>>108927820
everyone here is a 1boy, small penis
>>
Beating up women is funny.
>>
>>108927804
picture, if you will, a navy blue 24-spoked wheel over a horizontal trisect of saffron, white, and green.
>>
File: before after 2.jpg (1.05 MB, 1368x2736)
>>108927775
yeah it can rustle my jimmies
>>
>>108927831
i'm actually solo, big_penis, huge_balls, masterpiece,
>>
>>108927896
>24
disgusting.

The Chinese know there are 30 spokes to a wheel.
>>
>>108927785
>cut
>>
>>108927899
that's some H.R Giger lora or something?
>>
>>108927899
looks good
>>
stop gay larping and post 1girl
>>
File: 00031-3380537822.png (3.04 MB, 1280x1920)
>>108927950
1girl posted
>>
File: before after 3.jpg (819 KB, 1392x1600)
>>108927922
yeah, upload tomorrow

>>108927928
Works surprisingly well
>>
>>108927921
He asked for the ideal form.
>>
>>108927964
i need to see peach taking a massive shit on a plate, male pov of course. as a joke haha
>>
>>108927992
They'll never goonmaxx
I pity them
>>
>>108927985
put a comfyui screenshot through it
>>
File: K-BR-00001.png (115 KB, 498x498)
sdxl inpainting sucks ass
>>
>>108928058
I'd use flux-fill. Don't know if that was ever beaten.
>>
File: before after 4.jpg (1.8 MB, 1336x2672)
>>108928019
post one! I'll put it to model preview
>>
How to gen black women animooo
>>
>be tdrussell
>comically evil license
>no platform can operate the model or finetunes efficiently, price per gen just quadrupled overnight due to fees per generated image that have to be forked over to the big guy
>"well, it was always meant to please the local community hehe"
>trollface.png
>>
who is he talking to
>>
tdrussell derangement syndrome
>>
Sorry, "women" - I didn't mean to imply they aren't animals.
>>
>>108928189
>price per gen just quadrupled overnight due to fees per generated image that have to be forked over to the big guy
as a member of the local community, i 100% approve of this.
>>
>>108928100
Kek.
Looks like a lora that will alternate between generating nightmare fuel and kino
>>
>>108928244
anyone else notice that huggingface free basically has nothing in terms of token allowance? Like I couldn't even get off 2 prompts on one of the spaces yesterday. crazy.
>>
>>108928189
Without sounding mad or resorting to insults, explain why I, a local genner, should care about that
>>
catastrophic funding
>>
File: before after 6.jpg (1.36 MB, 1808x1952)
>>108928247
>>
>>108928341
lmao, kino
>>
>>108928274
yes you got the joke
>>
File: anima real test6.png (1007 KB, 768x1024)
>>108928537
>>
Feels sketchy that all that three month long FUD vanished the moment tdrussell dropped the base model. Pretty sure he made that whole thing up himself just to build hype.
>>
>>108928567
the "FUD"s never stopped, raped retard
>>
i have been posting my klein realism pictures around 4chan and no one has realized it yet
>>
File: 1765381287071459.jpg (1.25 MB, 1248x1824)
>>
File: 1762919533726614.jpg (1.35 MB, 1248x1824)
Jesus FUCK anima's prompt adherence is insane, try prompting something like this on an SDXL model without using regional prompting.
>>
File: anima real test12.png (1.69 MB, 1024x1519)
>>108928657
>>
>>108926998
Who and why?
>>
>>108928675
to be fair all it takes is an input image, half orange, half blue. Anima confirmed for making you less creative with the tech
>>
>>108928675
I have a problem with this. It still looks like slop
>>
>just pick up a pencil
whipped localslave mindset
>>
>>108928703
is your ass tattooed with "property of circlestone labs"?
>>
>>108928675
yeah it's a pretty great model.
>>
>>108928701
The biggest character flaw of an anima shill is the inability to see slop for what it is
>>
>>108928695
>all it takes is just doing it entirely yourself manually therefore missing his point entirely
>>
what do you do while you wait for the batch of generations to finish?
>>
>>108928723
ask grok to give you an image like that. why do it manually?
>>
>>108928725
ogle my other generations of course
>>
File: 105921CUI_00001_.png (1.77 MB, 832x1216)
>>108928725
I like listening to music.
>>
>>108928725
browse the web.
>>
>>108928748
your generated music, correct?
>>
>>108928763
No. I'm not very familiar with music generation. I've seen some absolute AI bangers on youtube though.
>>
>>108928721
how can you be so assblasted over a model?
>>
>>108928725
Started learning to code for fun. Got through a basic C# course in the last month, now reading about SQL and making simple http console apps.
No idea why i'm doing it honestly, but it's fun as fuck.
>>
>>108928567
Any good anime finetuned model could rival poor old SDXL, he didn’t need this whole campaign, it would have happened naturally anyway
>>
>>108928800
not over a model. I am pissed off jeets slop faggots spam every corner of the internet with shiny skin fried slop
>>
>90gb of generations
ok i think i should wipe all this out
>>
>>108928817
so you are on a crusade against ai slop, but you are just crying about anima in the local diffusion general?
the guys post wasn't even about the quality of his gen, it was about anima's great prompt adherence.
>>
>>108928858
if you fags stop spamming fried shit all the time and got gud we wouldn't have these issues. this is with every fucking model, not just anima. I know for sure there is better outputs from people that aren't fucking blind
>>
I need me a good horror/gore lora
>>
>>108928877
can you stop menstruating for 1 thread anon
>>
>>108928877
why don't you post some of your kino SDXL gens? it's a model that lets you really show off your creativity, unlike jeetima.
>>
a friend wants to know if anima can do futa on male cowgirl where the guy is riding on top? sdxl can't.
>>
>>108928895
your friend needs to stop being a gargantuan faggot
>>
File: a22b.png (903 KB, 768x1024)
>>108928877
This thread is neither an art gallery nor a competitive prompting tournament
>>
>>108928895
your friend sounds based. i have a based friend and he said anima can gen that easily.
>>
>>108928901
i'm afraid my friend's homosexuality is permanent
>>108928914
thanks, i'll let my friend know he can live out his fantasy with anima
>>
i've decided that the local models are bad
all of them
not a single good one
need to do anything interesting? off to the lora training farms
we shouldn't have to live like this
>>
>>108928910
True! It's a padded room for Catjak to screech in
>>
>>108928946
what a shame.
>>
>>108928877
>if you fags stop spamming fried shit all the time and got gud we wouldn't have these issues
huh? are you an arbiter of autism or something? why are you so bootyblasted over other people's gens.

it makes me want to gen 1girl, standing, white background, expressionless just to piss you off
>>
The many. The proud. The jeetsloppers
>>
>>108928950
I think that's the biggest complaint about anima. You need a Lora to do anything
>>
File: Anima_00553_.png (940 KB, 768x1024)
am I kawaii
>>
>>108928997
i thought anima was the best thing ever though?
>>
>>108929011
that was paid advertising
>>
So lora manager only seems to pull the info of half the loras i have. Did a test and downloaded like 5 popular loras I dont have and added them, they show up on the manager fine but only 2 of the 5 have preview images or any info really.

Is this a lora author issue or is it something I can fix on my end, I don't mind doing it manually.
>>
File: asuka4.png (970 KB, 768x1024)
>>
>>108928880
You don't need one. Just look up tags similar to guro and horror (theme) and use those
>>
>>108928895
Yes
>>
File: asuka6.png (1011 KB, 768x1024)
>>
>>
What was the browser plugin that allowed you to read the metadata?
>>
>ask question
>get ignored
>some guy asks if he can get fucked in the ass by his waifu
>bunch of replies
i'm a tiny bit upset.
>>
what can I not do with 8GB vram
>>
>>108929116
You didn't post a 1girl with your question.
>>
>>108929119
generate a 1 hour 4k video in 10 minutes
>>
>>108929125
You're right...
>>
>>
>>108929196
Looks easy to rape.
>>
>>108928880
anima cant do good horror
>>
>>108929116
Maybe his question was more interesting
>>
>>108929203
You'll get fined for that.
>>
File: zit.png (1.79 MB, 1024x1536)
>>
>>108929218
>You'll get fined for that.
They'll send more of them? Oh no...
>>
>>108929224
make her an orc
>>
Oh is the fudding fag back? It's been a minute.
>>
sowwy
>>
>>108929116
this general is mostly dead and completely filled with retards, making it now the same as all the rest of ai generals
>>
>he's STILL seething
>>
>>108929242
Literally kek. I've never seen someone that mind broken.
>>
>>108926873
Before the "ZiT moment" there was the "Flux.dev moment" and before that the "SDXL" moment. It's never gunna stop happening.
>>
paste any and all prompts you like so i can gen and post some results
>>
>>108926873
there will be milestone moments when some things are gonna be possible with the release of some model, but i actually doubt we will get a model to model jump as big as pre to post-zit
>>
>>108929265
gen the spookiest yamamura sadako with your favorite model. TRULY scary and spooky.
>>
>make single post praising the hot new anime model
>anon spends the next hour melting down over it
heh
>>
>>108929268
>but i actually doubt we will get a model to model jump as big as pre to post-zit
you must be pretty new, anon said the same thing after flux dropped
>>
this ones got that bisexual lighting
>>
>>108929276
flux wasnt that huge of a jump, flux was overall better than previous models but when zit came out it solved multiple problems and was the best in multiple big categories, most realistic out of the box (and still is), 2kx2k native gen, tiny, fast, great anatomy, trains well, ok knowledge

i was here since the beginning and was in the threads on both model releases, zit was the fastest this general ever was and ever will be.
>>
>>108929294
>i was here since the beginning and was in the threads on both model releases
i mean clearly you werent kek are you just trying to dissuade newfrens?
>zit was the fastest this general ever was and ever will be.
kek x2 just go back to posting about how you hate this general and the anons who post here or whatever
>>
Is zit runable on 12gb cards?
>>
>>108929294
That or the NAI leak. NAI leak felt like the rush would last forever
>>
>zit trains well
LOL
>>
>>108929310
>no arguments seethe
anyone can look at post history and see how fast the threads were going on zit release, sorry you got proven wrong after trying to larp as an oldfag
>>
>>108929265

3girls 1boy, carnival setting, they are fighting over him
>>
>>108929312
yes
>>
>>108929325
so just to be clear your plan is to continue to doom post and cry about ldg? what a sad life you live
>>
>>108929312
Very easily.
>>
>>108929341
>>108929348
huh, on the civitai page they mention 16gb cards. I never even bothered with zit.
>>
File: 1776248021148812.png (5 KB, 240x132)
>>108929321
it does, which you would know if you actually trained loras
>>
>>108929360
>only 24gb
lmao
have fun making a NSFW finetune off turbo. hint: thats why the non distilled base version exists.
>>
>>108929325
nigga want the old shit gen the old model
>>
>>108929358
You offload like 2 gigs running ZIT on 12gb but it still runs very fast. Because it's distilled.
It's Z-Base that gets slow without quanting on 12gb.
>>
>>108929374
>no pic of his output folder
yup, concession accepted
>>
Are some anons still unaware that turbo is double distill locked and can only be trained so hard before it falls apart? We figured that out like day one.
>>
if i wanna make a lora of a character, is it best to just have it naked, like im mainly going for general body type and facial features
i imagine it takes less time taggin? i have never tried making my own lora
>>
>>108929399
>Are some anons still unaware
yes some anons are retarded this is true
>>
turbo? the movie?
>>
>>108929399
ostris making that adapter was a mistake
>>
>>108929399
it had a dedistill training adapter basically right away on release which was already working, nobody was training on just basic zit since the beginning
>>
>>108929390
no but really did you think 20 gigs of lora data is supposed to be impressive kek
>>
>>108929321
>>108929374
>>108929399
since we have these training experts in the thread surely you will be able to just tell this guy how to actually train a real lora, right sisters? >>108929403
>>
>>108929421
Agreed. At the time it made sense however since we had no idea if/when base would release. But yeah now ZiT basically serves as a demo for the more flexible base model.
>>
>>108929426
its ok bro, you got caught having nothing to back up your claim (twice now) and got btfo, no need to cry any more despite having cognitive dissonance, next time think before speaking so you dont get humiliated again.
>>
>>108929403
naked and clothed are good don't be afraid to mix them, anything you want it to train on is useful. don't over think it too much just tag your dataset.
>>
>>108929432
why would i, a supposed expert, waste my time answering basic day 1 lora questions?
but nah just get any images you can find it doesnt matter
>>108929442
no need to get upset
>>
release the pp's
>>
>>108929450
>why would i, a supposed expert, waste my time answering basic day 1 lora questions?
We know you can't. No need to cope.
>>
>>108929453
PowerPoints?!?
>>
>>108929442
Anon needs proof that turbo is double distill locked? Just read the paper kek.
>>
>>108929422
>basically right away
LOL now we know for sure you're just trolling
>>
>>108929493
the paper talks about the basic turbo model, the training adapter is not made by the official team and was specifically made to break the distillation so that the model is trainable, are you sub 80 IQ perhaps or?
>>
>>108929501
Oh, you're still coping with the adapter instead of using ZiB? Why? Do you hate yourself?
>>
>>108929499
is he wrong? i thought it was like day 1 or 2.
>>
>>108929499
>ZIT - 25 Nov
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo/blob/main/transformer/diffusion_pytorch_model-00001-of-00003.safetensors
>training adapter - 28 Nov
https://huggingface.co/ostris/zimage_turbo_training_adapter/blob/main/zimage_turbo_training_adapter_v1.safetensors

oof.
>>
>>108929509
the discussion topic was around ZIT being a distilled model that cant be trained. learn to read.
>>
>>108929520
those three days felt like a fucking month theres no way
>>
>>108929518
NTA. You have to understand you are in the thread full of 2D-only anime pedofiles that will literally lie about easily verifiable things if they see something that they autistically perceive as an attack against them, and someone praising a realism model is basically like attacking their dog.
>>
>>108929540
ANIMA IS THE BEST AT REALISM BIGOT
>>
>>108929526
>ZIT being a distilled model that cant be trained. learn to read.
Calm down. Anon claimed it could be trained WELL. The other anon was disputing the "well" part. Nowhere was it claimed that it can't be trained at all lol. Seriously you have to be trolling or you're just incredibly dumb. But alas I feel dumber for even replying to you.
>>
>>108929540
>you are in the thread full of 2D-only anime pedofiles
but for the past four months anon has been saying this is the realism no anime general???? which is it????
>>
>>108929550
>Anon needs proof that turbo is double distill locked? Just read the paper kek.
the training adapter directly disproved this claim that the model is just "double distill locked" for training, because the purpose of the adapter is to specifically remove that so that its trainable.
so "just read the paper" is nonsensical since the paper cant talk about something that didnt exist yet.

its like saying Anima is a shit model to train anime loras for "just read the cosmo paper", like are you mentally retarded?
>>
File: ComfyUI_14079_.png (691 KB, 1024x1024)
>>108929567
this here is ANIMA COUNTRY
>>
>>108929567
I know this always comes as a surprise to autistic low IQ individuals online, but the commenters you reply to online aren't all the same person.
>>
>>108929590
I actually miss ani
>>
>anon says ZiT trains well
>another anon points out that turbo is distilled and the base model is the better candidate for tuning
>first anon misunderstands the second and spergs out for a full hour
am i missing something?
>>
>"model is double distill locked!"
>it basically had a dedistill adapter on launch
>"read the paper!"
>>
So where is the NSFW finetune of turbo? It should've already happened with the adapter, right?
>>
>>108929610
nobody really cares until data centers die and sell off their compute
>>
>>108929610
>am i missing something?
Considering the reply directly under yours is more seething, no I don't think you are missing anything.
>>
>>108929627
go check civitai, all the major goon model trainers have a zit finetune.
>>
>>108929661
>zit finetune
*shitmix
>>
>>108929658
Fuck off already zit nazi
Anime website.
>>
>>108929661
>no link
Alright
>>
File: 24535375.jpg (846 KB, 1476x3540)
impressive. very nice. now let's see paul allen's kino
>>
>>108929590
hope he did the flip
>>
>>108929673
https://civitai.red/models/2196857/big-love-z
https://civitai.red/models/2270401/pornmaster-z-image
https://civitai.red/models/620406/moody-pro-mix
>>
>>108929692
>200k cumulative downloads only
ZIT is trash.
>>
File: 1760248525428457.png (223 KB, 646x680)
haven't checked in for a while, what's the best current image model? is z-image still the general go-to? is qwen edit still the best edit model?
>>
>>108929715
Z base for realism and Anima for anime pretty much
>>
>>108929715
klein 9b
>>
File: 1664117573569.jpg (123 KB, 816x294)
I was browsing the archives and found some ancient 1.5 gens. The threads were extra comfy.

https://desuarchive.org/g/thread/88812749/#88813395
>>
>>108929715
all still unsurpassed by sd 1.5 in sovl
>>
>>108929692
lol
>>
>>108929674
get this cloud slop outta here
>>
>>108929721
zbase always behaved weirdly for me iirc
turbo had realistic gens but less diversity and lacked knowledge of general IPs
whereas base had more diversity and knowledge of IPs but the gens would always come out with some weird abomination or obvious artefacts
qwen was better in that regard and could gen IP stuff without so many artefacts but at the same time had a distinct "ai slop" look to it
is there a good general lora for zbase or is it more a case-by-case basis?

>>108929724
meh i heard mixed opinions about it but worth a try i suppose
>>
>>108929587
If the model wasn't distill locked then it wouldn't need an adapter in the first place. Do you happen to have brown skin by chance?
>>
File: 524475.png (630 KB, 994x1593)
>>108929744
told you klein was the best
>>
>>108929753
>would always come out with some weird abomination or obvious artefacts
you just have to run it for 50ish steps with a good sampler scheduler combo
it also helps to have a language model generate a prompt
>>
>>108929768
it was mainly the light pee filter that made me reply desu
>>
>>108929779
i think you're seeing things
>>
File: 09.png (37 KB, 495x313)
>>108929692
>https://civitai.red/models/2196857/big-love-z
>>
>>108929804
>jeet doesn't know how to use a model
somethings never change
>>
>>108929593
>>108929804
This mf scammer keeps censoring anyone saying the truth about his merges

-> they're shit and free model does a lot better
>>
>>108929759
the argument was about whether the model can be trained well, so insinuating that it can't because it's double distill locked was false.
>>
>>108929830
well really it depends on your definition of "well"
>>
File: 1774220400930395.jpg (642 KB, 832x1216)
>>
>>
>>108929793
i wish i could see you
>>
>>108929820
how is he censoring shit if that comment is 4 months old AND the first comment there
>>
Anon really got b8ed into rehashing the old turbo discourse baka my baka
>>
>>108930036
Go check yourself, am not your mom
>>
>>108930060
for me it's anima into flux
>>
>>108927227
>absurdres, masterpiece, best quality, very aesthetic, nsfw
>2girls, lying, side-by-side, from above, one-piece swimsuit, looking at viewer, holding hands, garden, flower
>>
>>108930060
this site is a few posters reposting the same things over and over again and has been for years.
>>
File: comfyui.jpg (726 KB, 824x1204)
>>108930118
the whole thing could be replaced with bots and no one would notice the difference.
>>
>>108930148
yes
>>
>>108930148
I would notice because I am smart
>>
>>108929000
what the fuck? prompt?
>>
File: noraml girl.png (1.16 MB, 768x1344)
>>
>>108930172
>average zit hater's zit gens
>>
>>108930163
in some sense, probably. what matters is what you do though.
>>
File: ComfyUI_00592_.png (868 KB, 896x1152)
>>108930171
maybe something like this
>>108930172
choco curry?
>>
File: rage thumbs up.jpg (61 KB, 704x653)
>ComfyUI breaks the front end subgraph, again.
>All the fucking asshole insists on releasing workflows as subcharts.
>Random shit stops flowing through.
>>
>>108929715
>haven't checked in for a while
I don't believe you.
>>
>>108930202
>All the fucking asshole insists on releasing workflows as subcharts.
its mostly the official team that pushes that shit, but anyway i always just right click and unpack it no matter what
>>
>>108926873
we've had a few since then that've been close
>>
>>108929715
sdxl or illustrious is best to generate the scene, then img2img with z-image turbo to make it photoreal
>>
File: 6424367.jpg (3.87 MB, 2176x3840)
new ltx2.3 kino coming, titled KINOSOVL NIGGA
>>
File: 1758171584090888.png (3.34 MB, 1536x1792)
>>
File: 1764310939720628.png (3.08 MB, 1792x1536)
>>
>>108929734
Imagegen went from unreliable 1% kino, 99% trash to consistent 1% trash, 99% mediocre
>>
whats a prompt for that slutty hentai nunn look
>>
File: 1765362310014745.png (1.23 MB, 1902x1033)
I make a lot of VNs now with AI, and the funny part about picking an artstyle is that you feel like you're gonna change it because it looks awful, then it kinda grows on you over time as you write it.
>>
Genning images locally, for an all white America.
>>
>>108930534
I'd think the more important thing is the fidelity rather than the style itself.
>>
>>108930505
sounds like a loralet issue
>>
>>108930534
edit models next year will allow you to gen what you want and give it the old style to restyle it or to just gen in that exact style of a single image
>>
>>108930534
>>108930571
The not-art parts are important too. I've played porn games that are kinda bullshit art-wise but somehow tolerable just because of the game.
>>
>>108930505
That's less to do with models and more to do with the proliferation and accessibility growth of the medium. The proportion of those who are "bad at art" and those who "are good at art" hasn't changed, only how much we now see.
>>
uh oh, my power might go out
>>
>>108930674
kinosovl levels too strong
>>
I dont have porn addiction, what use is AI art to me?
>>
i'm making realistic amateur 1girl cosplay sluts with anima, what's the best way to face swap?
>>
>>108930706
you can generate trump as jesus or whatever gets you off
>>
File: grid_2_20260528_192401.jpg (1.14 MB, 2176x2432)
>>
>>108930719
what's your workflow look like atm? are you using anima, zit or flux as an upscaler/detailer? you could faceswap with flux or qwen, if you have an old sdxl lora of your waifu you could inpaint it with that, or you could use an ipadapter. choice is yours
>>
File: 756353675.jpg (2.72 MB, 3840x1664)
>>108930700
yes, take this before i go dark
>>
>>108930735
nice, looks like Mike Klubnika's "Fused 240" x Machinarium
catbox?
>>
File: grid_2_20260528_194449.jpg (1.15 MB, 2432x2176)
>>108930803
anon gets credit for mentioning murata range in the previous thread https://files.catbox.moe/gpkcpx.png
>>
>>108930817
that wasnt me, but thanks for the catbox
>>
ultimate upscale pre detailer, ultimate upscale post detailer, hiresfix pre detailer, hiresfix post detailer

before i run a bunch of tests to see the differences with different combos of these
am i wasting my time
is there one combo that generally works best, or should i just go with 1 of the 4 only? im not trying to do 10k res images over here, 4-6k max so i can downscale later
>>
File: debo_mr_anima1_00011_.png (2.59 MB, 1792x977)
>>
>>108930848
>am i wasting my time
Personal experimentation is never a waste of time unless you do not enjoy doing it. Back when I used dedicated detailers, I'd run them both before and after the upscale pass(es). But that's just one anon's opinion.
>>
> Arch Linux vanilla / RTX 5090 / 64GB RAM
Total newbie here. Want to generate local NSFW video.
What is the best clean FOSS tool and guide to follow right now?
Just want a proper, isolated setup that won't bloat my system. Thanks.
>>
>>108930861
so when doing that, did you reduce the amount you were upscaling each time?
>>
>>108930870
You're basically saying you want to gen cp, so nobody is going to help you.
>>
File: 3467447.jpg (2.09 MB, 1664x3840)
>>108930870
use ltx2.3
>>
>>108930870
>ComfyUI: https://github.com/comfyanonymous/ComfyUI
or
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>>108930912
What do you mean exactly?
>>
File: grid_4_20260528_202011.jpg (2.21 MB, 3328x3328)
>>
>>108930751
>>108930927
model names at the bottom like watermarks is making me kek
>>
>>108931008
the horrible thumb in a cherry picked example sells it.
>>
>>108930914
>>108930927
>>108930935
You guys spend way too much time on 4chan. You think you're hilarious with your terminology, but you've built a tiny underground bubble where only a handful of basement dwellers understand each other. I literally had to ask an AI what "cp" even meant, and I still don't get the rest of your jargon. This place isn't the information hub it was 10 or 15 years ago. I'm out
>>
>>108931120
You're indian.
>>
>indian demands stuff
>crashes out
>says he's higher caste anyway
lfmao
>>
>>108930706
>I dont have porn addiction,
but you do have a kino addiction
>>
I can now do vocaloids in ace step lmao
>>
>>108930935
well if i put them both at 50% on their nodes, it's gonna result in an image double the size of what i'd get if i just put 50% and only had one node active, wouldn't it?
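Rough sketch of the arithmetic behind that question. It assumes "50%" means each node upscales whatever it receives by 1.5x, which the post doesn't actually spell out, and the 1024x1024 starting size is made up. Under that assumption chained stages multiply, so two nodes give 2.25x per side rather than 2x:

```python
def chained_size(width, height, factors):
    """Apply each upscale factor in order; every stage scales the previous stage's output."""
    for factor in factors:
        width, height = round(width * factor), round(height * factor)
    return width, height


base = (1024, 1024)  # hypothetical starting resolution, not from the thread

one_stage = chained_size(*base, [1.5])        # only one upscale node active
two_stages = chained_size(*base, [1.5, 1.5])  # both nodes active, same setting

print("one node: ", one_stage)
print("two nodes:", two_stages)
```

So if the goal is the same final size with both nodes on, each node's factor has to be lowered (for example two stages at about 1.22x land near a single 1.5x).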
>>
File: grid_4_20260528_210544.jpg (1.82 MB, 4352x2432)
>>
ace step genning right now: vocaloid-ish thingy.

the anime thread didn't seem into music.
>>
>>108931150
>vocaloids
desu generated vocals have sounded robotic like vocaloids for a while now
>>
>>108931285
You're not an artist, so I won't discuss anything with you relating to art.
>>
big reminder, almost always batch=2, if you have the vram. over the long run, you save time.
>>
File: file.png (353 KB, 2056x1105)
>pip install comfyui-frontend-package -U
fuck you
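One hedged way to make that upgrade less painful: note which frontend version is installed before running `-U`, so there is something to pin back to if the new one breaks subgraphs again. This is only a sketch using the standard library's `importlib.metadata`; the helper name `installed_version` is made up for illustration.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string of a pip package, or None if it isn't installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Run this in the same Python environment ComfyUI uses, before upgrading.
print("comfyui-frontend-package:", installed_version("comfyui-frontend-package"))
```

If the update misbehaves, `pip install comfyui-frontend-package==<the version printed above>` should put the old frontend back.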
>>
File: 4263563.jpg (2.93 MB, 1664x3840)
>>108931008
you have to let people know how to make sovl
>>
File: ComfyUI_27037.png (3.71 MB, 1500x1920)
Anyone get 2048px out of ZIT to work with PiD? I tried several different ways and got nothing but noisy outputs. 1024px out of ZIT works (mostly) fine though (I have to slop things up a bit).

This is scaled down from 6400x8192 and it still looks worse than ZIT working at 2048. Currently going from 1600x2048 (ZIB) to 800x1024 (ZIT) to 3200x4096 (PiD) to 6400x8192 (RTX VSR).

65-90MB output images is kinda unsustainable though.
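For scale, a quick sketch of the pixel counts and uncompressed sizes along that chain. It assumes 8-bit RGB with no alpha channel, which is an assumption about the output format rather than something stated above:

```python
def raw_size_mib(width, height, channels=3, bytes_per_channel=1):
    """Uncompressed size in MiB; defaults assume 8-bit RGB without alpha."""
    return width * height * channels * bytes_per_channel / (1024 ** 2)


# Stage names and resolutions as listed in the post.
stages = [
    ("ZIB", 1600, 2048),
    ("ZIT", 800, 1024),
    ("PiD", 3200, 4096),
    ("RTX VSR", 6400, 8192),
]

for name, w, h in stages:
    print(f"{name}: {w}x{h} = {w * h / 1e6:.1f} MP, {raw_size_mib(w, h):.1f} MiB raw")
```

Under those assumptions the final 6400x8192 frame is 150 MiB raw, so a 65-90MB file means lossless compression is only roughly halving it.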

>>108931242
Bottom left looks cool.
>>
NEXT THREAD THEME: ACCESSORIES
>>
>>108931499
soon
>>
>>108931310
i mean always batch 8 if you have the vram, that's the highest savings. even if you don't quite have the vram you should still try it, since it might be faster despite the slight offload, depending on what gets offloaded
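Whether a bigger batch really wins is easy to check by comparing seconds per image instead of seconds per batch. Minimal sketch below; the timings are invented placeholders (the batch 8 figure stands in for a run slowed by offloading), so swap in numbers measured on your own card:

```python
def seconds_per_image(batch_size, seconds_per_batch):
    """Average wall-clock cost of one image when generated in a batch."""
    return seconds_per_batch / batch_size


def best_batch(timings):
    """timings maps batch size -> measured seconds per batch; returns the cheapest batch size per image."""
    return min(timings, key=lambda b: seconds_per_image(b, timings[b]))


# Made-up example measurements, not benchmarks from the thread.
timings = {1: 10.0, 2: 17.0, 4: 31.0, 8: 80.0}

for batch, total in timings.items():
    print(f"batch {batch}: {seconds_per_image(batch, total):.2f} s/image")
print("best batch size:", best_batch(timings))
```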
>>
>>
Title: love love machine
https://vocaroo.com/1eyLca9uWmKe
because catbox is down again.

>>108931569
>batch 8
I can't get close lol.
>>
>>108931453
>Anyone get 2048px out of ZIT to work with PiD?
haven't tried PiD yet but it works with zeta

>65-90MB output images is kinda unsustainable though.
JXL will likely be much smaller, even lossless (and obviously lossy too).
>>
File: 1754903747255477.png (3.24 MB, 1536x1792)
>>
I honestly think ai music is just music creation, because you have to write the rhymes yourself anyway, and it's obvious that click-and-gen ai art people lack both the necessary interest and the ability.

This is interesting, because you see the same thing like if you hand a woman a tool from the tool aisle. she doesn't care what it is, and never will be capable, either, of understanding what it is, or even why it is.
>>
>>108931757
and this is why music is the highest of muses, to command men, yet to be in its parts invisible to most, making men the women to music.
>>
>>108931569
>>108931647
gradient accumulation is a thing
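For context, gradient accumulation is a training trick (LoRA or finetune runs), not something that speeds up sampling: gradients from several small micro-batches are combined before the optimizer step, so the update matches one large batch without needing the VRAM for it. Toy sketch in plain Python with a one-parameter model and made-up data; all names here are illustrative:

```python
def grad_mse(w, xs, ys):
    """Gradient of mean squared error for the model y = w * x, averaged over the samples."""
    return sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)


def accumulated_grad(w, xs, ys, micro_batch):
    """Same gradient, computed micro-batch by micro-batch and combined at the end."""
    total, count = 0.0, 0
    for i in range(0, len(xs), micro_batch):
        mx, my = xs[i:i + micro_batch], ys[i:i + micro_batch]
        total += grad_mse(w, mx, my) * len(mx)  # weight each micro-batch by its size
        count += len(mx)
    return total / count


# Toy data following y = 2x (illustrative only).
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
ys = [2.0 * x for x in xs]

full = grad_mse(0.5, xs, ys)                          # one batch of 8
accum = accumulated_grad(0.5, xs, ys, micro_batch=2)  # four micro-batches of 2
print(full, accum)
```

Only one micro-batch has to fit in memory at a time, at the cost of more forward and backward passes per optimizer step.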
>>
>>108931788
what about batch cum accumulation in the balls?
>>
>>108931757
you aren't creating anything until you can plug reference melodies into the generator to actually guide it to what you are imagining
>>
>>108931818
>not an artist
ok opinion discarded
>>
>>108931827
proof?
>>
(indian noises)
>>
Fresh

>>108931834
>>108931834
>>108931834

Fresh


