[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


Janitor applications are now open. Apply here!


[Advertise on 4chan]


Discussion and Development of Local Image and Video Models

Previous: >>108948244

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
is it over or are we back
>>
>inb4 n*gbo
>>
>>108951943
both
>>
>>108951943
it's over
>>
>>108951943
>Tdruss is a drama queen that lied about affording anima2
we are in over status currently
>>
Is there a repository for paid LoRA crap people hide behind Patreon?
>>
>>108951984
yeah
>>
>mfw Resource news

05/31/2026

>FLUX Identity Adjuster (V2)
https://github.com/Magirad/Flux_ID_Adjuster_V2

>ComfyUI AnimaFastTrain
https://github.com/quinteroac/ComfyUI-AnimaFastTrain

>MONET: Open-source dataset
https://huggingface.co/datasets/jasperai/monet

05/30/2026

>Pixal3D — Apple Silicon (MPS / Metal) Port
https://github.com/pawel-mazurkiewicz/Pixal3D-mac

>Comfy-Org/PixelDiT (diffusion models & upscalers)
https://huggingface.co/Comfy-Org/PixelDiT/tree/main/diffusion_models

>Orion4D Generative Paint: ComfyUI advanced painting interface
https://github.com/orion4d/Orion4D_generative_paint

>ComfyUI Anima IP-Adapter
https://github.com/Wenaka2004/comfyui-anima-ipadapter

05/29/2026

>Colored Noise Diffusion Sampling
https://hadardavidson.github.io/CNS

>VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion
https://videomla.github.io

>minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models
https://github.com/shengshu-ai/minWM](https://github.com/shengshu-ai/minWM

>GPIC: A Giant Permissive Image Corpus for Visual Generation
https://gpic.stanford.edu

>SGMD: Score Gradient Matching Distillation for Few-Step Video Diffusion Distillation
https://github.com/ModelTC/LightX2V

>Native Audio-Visual Alignment for Generation
https://ernie-research.github.io/NAVA

>GASS: Geometry-Aware Spherical Sampling for Disentangled Diversity Enhancement in Text-to-Image Generation
https://github.com/L-YeZhu/GASS_T2I

>SAVAA: Mitigating Hallucinations in LVLMs via Step-wise Adaptive Visual Attention Amplification
https://github.com/JiachengZ01/SAVVA

>Nexus BTA: Local AI image and video studio built around an embedded ComfyUI runtime.
https://github.com/JpAndreBTA/Nexus-BTA

05/28/2026

>MAVEN A Multi-Agent Framework for Multicultural Text-to-Video Generation
https://github.com/AIM-SCU/CRAFT

>Bias Leaves a Gradient Trail
https://github.com/vitryt/label-free-bias-identification
>>
>mfw Research news

05/31/2026

>Channel-wise Vector Quantization
https://arxiv.org/abs/2605.26089

>How Accurate are Video Quality Models for Diffusion-Based Video Super-Resolution?
https://arxiv.org/abs/2605.25940

>AdvantageFlow: Advantage-Weighted Least Squares for RL in Flow Models
https://arxiv.org/abs/2605.26013

>Mitigating Object Hallucinations in Vision-Language Models through Region-Aware Attention Recalibration
https://arxiv.org/abs/2605.24957

>SpongeBob: Sync-Aware Harmonious Audio-Visual Generative Editing
https://hy-spongebob.github.io

>Self-supervised Dynamic Heterogeneous Degradation Modeling for Unified Zero-Shot Image Restoration
https://arxiv.org/abs/2605.24593

>On-Policy Adversarial Flow Distillation for Autoregressive Video Generation
https://arxiv.org/abs/2605.26105

>SLAD : Shared LoRA Adapters for Task Specific Distillation
https://arxiv.org/abs/2605.29726

>SuperVoxelGPT: Adaptive and Ordered 3D Tokenization for Autoregressive Shape Generation
https://arxiv.org/abs/2605.29655

>When Eyes Betray AI: Social Gaze Consistency as a Semantic Cue for AI-Generated Image Detection
https://arxiv.org/abs/2605.27348

>Which Pretraining Paradigm Better Serves Spatial Intelligence? An Empirical Comparison of Vision-Language and Video Generation Models
https://arxiv.org/abs/2605.28132

>CIVIC: End-to-End Sequence Compactness for Efficient Vision-Language Models
https://arxiv.org/abs/2605.28115

>Janus-LoRA: A Balanced Low-Rank Adaptation for Continual Learning
https://arxiv.org/abs/2605.28495

>Resolving Ambiguity in Composed Image Retrieval via Calibrated Interaction
https://arxiv.org/abs/2605.24634

>ProSR: Process-Shaped Spatial Reasoning for Reliable Chain-of-Thought in VLMs
https://arxiv.org/abs/2605.25524

>Structure-Guided Visual Perturbation Neutralization for LVLMs
https://arxiv.org/abs/2605.27927

>Black-box Membership Inference Attacks on the Pre-training Data of Image-generation Models
https://arxiv.org/abs/2605.27020
>>
>>108951997
where?
>>
File: y.png (1.92 MB, 1400x704)
1.92 MB PNG
coprosarus free.
>>
File: ComfyUI_00445_.png (1.33 MB, 864x1536)
1.33 MB PNG
>>
Someone baked this thread and thought: yeah this is fine.
>>
Ok how do you mix loras then? If I have a style and character Lora, do I generate the character Lora first and then use the style Lora, do I change the weights, what do you guys do?
>>
>>108952049
Use your eyes to measure the resultant image and adjust the weights accordingly.
>>
AI niggas when they actually have to experiment and problem solve
>>
>>108952055
Thanks anon. I was asking because some dude in the other thread said that mixing loras with anima fucks up the generations.
>>
File: 1776075526805287.jpg (104 KB, 768x766)
104 KB JPG
Why isnt it advertised that anima easily works with 1536 x 1536 resolution?

Also im so fucking tired of vramletniggers and laziness or ignorance of devs who make shit and dont give multiple different workflows or ranges of how to get the best result out of something.

Nigger just give me "MAX QUALITY TOP PRO WHITE MAN VERSION OF THE WORKFLOW WITH ALL SETTINGS MAXXED OUT FOR MAX QUALITY FOR THOSE WHO ARENT ADHD AND CAN WAIT OR HAVE THE HARDWARE TO RUN THIS NIGGER SHIT AT FULL SIZE.json"

LTX is the biggest offender against this. All fucking workflows use the fucking speedshit and there's no documentation that even tells you the best settings for actual maximum quality. But because theres so many newniggers and retards who didnt see the full quality of wan 2.2, WHATEVER blurry shit ltx generates is like magic to them.
>>
>>108952067
the day any software gives good defaults will be the day world hunger and cancer are solved (never happening btw)
>>
>>108952067
Nigga, the civitaibpage says it. That's one of the first things I looked up. Yeah it works with 1536 and +-64 variations, it's good.
>>
File: 222910CUI_00001_.png (1.33 MB, 1536x1152)
1.33 MB PNG
>mfw bread
>>
File: 824362660.png (108 KB, 808x668)
108 KB PNG
>>108952062
?
>>
hallucination.
>>
>>108952075
I must have seen it elsewhere then, as an artifact of old 1024 resolution which was for preview only, you are right.
>>
>>108952067
its literally on the HF. its just the same reason that people prompt this with "a woman cosplaying as suzumiya haruhi, she is wearing her clothes, ultra real DSLR 8k photograph"
on a more serious note, ive seen some people say it can be unstable at times at 1536x1536 though ive never seen this myself, if anything its the opposite. one thing td russia should point out is that higher step numbers help at higher resolutions to fix artifacts
>>
trvke: image agi will be achieved when "1girl, huge breasts, masterpiece" will be enough to forever give an infinite range of uniquely creative and good gens.
>>
>>108952110
>uniquely creative
>good
These are contradictory.
>>
File: 230542CUI_00002_.png (1.55 MB, 1536x1152)
1.55 MB PNG
>>108951138
Wlop the goat
>>
>>108952100
Yeah that's about my experience. Higher native res needs more steps to stabilize but I've never had problems getting it to stabilize past that.
>>
>>108952130
it would only be contradictory if there was only 1 single png in existence that is good and others are worthless.
>>
>>108952135
I don't agree with you.
>>
Reminder, generate your ZIT kino prompts for ZIT LoRAs at 2048x2048 which gives a slightly more unique look than when genning at any other smaller res. I think because most LoRAs dont have 2048x2048 images in the dataset, and so when genning at that size you go out of distribution somewhat and have a different mix of original model knowledge and the trained LoRA which now underfits in some aspects.
>>
File: 230854CUI_00001_.png (1.46 MB, 1536x1152)
1.46 MB PNG
>>108952134
from 30 steps to 60
>>
ugh my Comfyui just shat the bed again
what's that memory thing you have to put in the command line?
>>
> >108952001
> >108952004
Fuck off
>>
>>108952158
That's fine, it's okay to be wrong, anon.
>>
what the fuck is a lycoris?
>>
>>108952216
"Lora beYond Conventional methods". Blanket term for alternative lora training methods.
>>
>>108952216
Lycoris is a genus of about 20 species of flowering plants in the family Amaryllidaceae, subfamily Amaryllidoideae.
>>
>>108952225
I fucked up the acronym:
"Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion"
>>
File: Flux2-Klein_00529_.jpg (536 KB, 1024x1536)
536 KB JPG
>>
>>108952216
llm that generates liquorice
>>
AI generated food mhm
>>
where's that guy who generates the pizza, I'm hungry
>>
>using anima
>realize im just doing the same gens i was doing with illustrious but now they take 55 seconds instead of 15
am i fucking retarded?
>>
>>108952239
cool
>>
>108952186
>108952134
>108952077
so obvious that you are Catjack, you don't sleep, you do the same stupid tests he does, and you use the same frivolous artists tag as him, you spam as him and you samefag as him, It's so painfully obvious.
>>
>>108952239
That's a proto-garloid. Pretty rare these days.
>>
>>108952286
rare but a-peeling.
>>
File: cd0g58.png (1.02 MB, 1280x768)
1.02 MB PNG
>>
>>108952299
you should bananned for that
>>
Why do I always get better images when I treat Flux2 the same way I treat SDXL?
This whole natural language just doesn't fucking work.
>>
Why do I always state something with no images or proof to accompany my claims?!?!
>>
File: met her at church.png (1022 KB, 1024x1024)
1022 KB PNG
>>108952279
we like what we like
>>
>>108952279
Me, but 8-9 minutes instead of 2-3. I should try some more-elaborate scenes now that I'm getting the hang of my 1girl.
>>
>>108952279
I tried it but went back to noob, it's WAY too fucking slow for no reason and you still have to fix hands and faces, waiting for anima 2 with the e621 database
>>
File: 235120CUI_00001_.png (1.26 MB, 1536x1152)
1.26 MB PNG
>>
>>108952361
>it's WAY too fucking slow
Turbo 0.2 is unironically pretty good once you dial in the settings and prompt. use negpip
>for no reason
the reason is it's a transformer model that uses pure attention instead of SDXL's UNET which is mostly convolutions
>>
>>108952067
I'm in the same boat.
Queue bunch of slop before sleep
Queue again before going to work
Enjoy the harvest at home in the evening.
Repeat.
>>
>>108952361
JANIMA fixes all those except the speed
https://civitai.red/models/2642932/janima?modelVersionId=2967640
https://gofile.io/d/jo1lcf

To improve the quality of everything a lot I also gen at 50 steps 5 cfg 1536 1536, just set and forget after you find a prompt you like
>>
What's the best Anima Lora trainer that actually trains the TE? Because Easy Training Scripts does not (or I am doing something wrong).
>>
>>108952193
--lowvram
you poor jeet
>>
>>108952420
isnt the offical recommendation to not train it
>>
>>108952427
Nobody tells me what I should or shouldn't do.
I want to experiment.
>>
>>108952433
I want 2 experiment with u bby
>>
>>108952420
>>108952433
Do you mean the LLM adapter or the Qwen 0.6b TE?
Anyway ETS (whichever fork you are using, in theory since I don't use it) and underlying sd-scripts backend should be able to train both.
It goes without saying that training an LLM TE is a fucking terrible idea.
>>
File: 001439CUI_00001_.png (2.81 MB, 1536x1152)
2.81 MB PNG
>>
File: 003603CUI_00001_.png (1.43 MB, 1536x1152)
1.43 MB PNG
>>
Become unmonetizable.
>>
>>108952543
>D...dad?
>*Harry turns around, beer and a plate with a sandwich in hand*
>Dont scare me like that! I had the craziest day at the mall today, let me tell you all about it.
>>
>>108952422
no i was looking for
>--disable-dynamic-vram
but I did that and then KDE shat the bed because NVIDIA drivers
man fuck Nvidia
and fuck you you nigger
>>
>>108952563
SH3 if it were good
>>
just kidding no hard feelings
>>
>>108952420
>Because Easy Training Scripts does not
why don't you accept that you are retarded instead of spreading misinformation
anyways, if you mean the qwen te, select train on both in the network section. if you mean the llm adapter, add train_llm_adapter=True to network args and add network_reg_lrs=.*llm_adapter.*=5e-5 to network args to control the llm adapter lr
>>
>>108952583
I understand you.
>>
I just hit 1000 images that i rated 10/10
Took around 200k gens or so
>>
>>108952576
>but I did that and then KDE shat the bed because NVIDIA drivers
werks 4 me
>>
>>108952700
post the 1000th
>>
>>108952576
Disable ram memory fallback if its enabled
And i think theres a command to reserve vram for other tasks no matter what, so increase that limit up from 500mb i think
>>
Any idea why ZIT 2048x2048 images often have random color glitch type of artifacts at the bottom right corner of images of that resolution? Or is that just with my LoRA?
>>
>>108952740
dalle watermark
>>
>>108952576
Just use Windows. Troonix sucks at VRAM management.
>>
>>108952879
Spoken like a true jeet.
>>
>>108952740
Lora trained with too low resolution?
>>
>>108952885
>Spoken lika a-ACK out of memory
>>
>>108952965
it doesn't crash and lose my data and session :)
I can just turn off explorer.exe and get fat 1GB if I want to.
>>
>>108952972
show me
>>
>>108952965
With 50 tabs and many other things open for me it uses 0.2 0.3 vram, comfy default 500mb reservation is more than enough
>>
>>108952965
Dats rite, young man. Ain't no one got no time for that stupid bullshit.
>>
>>108953007
not possible since your browser alone uses at least 500mb. add the rest of the bloat on top of that
>>
File: file.png (47 KB, 1390x865)
47 KB PNG
>>108952992
https://github.com/d8rt8v/win_gpu_util/blob/main/gpu_ulil.ps1
>>
>>108953027
When comfyui or some other program needs more VRAM it takes it. When I use llama.cpp for example which I think is the program that by default reserves 500MB, and then I quit it after some time so the VRAM gets cleared, the VRAM drops to 0.2GB despite me using Firefox the entire time. Windows 10, 24GB VRAM, 128GB RAM, dynamic pagefile.
>>
>>108953051
why can't you do this with linux? i run it without a desktop environment as well. that is what you are doing when you disable explorer.exe
>>
>>108953027
I have a powershell script which launch a small instance of WebView2, it takes like 100MB in RAM instead the full regular Edge.
https://gist.github.com/admiralnelson/4435be8d5f23713608bb6d9bbfe92457
You want to copy
Microsoft.Web.WebView2.WinForms.dll
Microsoft.Web.WebView2.Core.dll
from Microjeet Teams or download them from nuget and paste them next to powershell file.
>>
>>108953098
lool if you gotta save 100mb ram to run anything its fucking over
and if you're talking about vram, then just make a new browser profile and disable hw acceleration for it so it doesnt use any vram
>>
>>108953118
>disable hw acceleration for it so it doesnt use any vram
Okay I'm fucking stupid for not knowing that.
>>
>>108953151
disabling hw accel in browser for any os is also important to stop the sometimes large decrease in speed that comes from your gpu switching from inference to rendering the page all the time, very visible in some badly made browser pages like comfyui, when i have comfyui tab visible on any monitor the gen time increases by like 10-30%
>>
>>108953195
though for most sites its fine, but its worth having a separate browser profile just for comfy so you can edit the wf while genning at max speed

not sure is there a way to disabel hw accel just for comfyui
>>
why is it so hard to get natural lookin breasts, like most chicks have saggy medium sized boobs
>>
Which general is fast and doesn't sperg at 1girls but also talks about various model shit.
>>
>>108953226
this one
>>
>>108953207
And don't smile.
>>
>>108952279
>am i fucking retarded?
if you cant tell the difference between the models then no you are not retarded you are just blind and possibly not very creative
>>
>>108953204
I'm trying to see if it's possible to disable HW accels in my ps script, it's so convenient to double click run.bat and everything is set up with UI ready.
>>
>>108953244
make new ff profile, disable hw in it, then in run.bat:
@echo off
start "" "C:\Program Files\Mozilla Firefox\firefox.exe" -no-remote -P "PROFILENAME"
exit

make shortcut of bat, pin shortcut to taskbar
>>
File: pipeline.png (1.76 MB, 2946x890)
1.76 MB PNG
>https://yfyang007.github.io/ControlLight/
Disregarding the classic shift issue, this is kinda impressive.
>>
I haven't run difffusion models since 2023.
What is the goto way to take clothes off of real photos of women in 2026? Is it still inpainting?
If yes, is with comfyui? And what model for realism?
Is there a specific workflow?
pls respond (I have autism)
>>
>>108953263
kewl
>>
>>108953226
probably /slop/
>>
>>108953267
>What is the goto way to take clothes off of real photos of women in 2026? Is it still inpainting?
Still sucks as of now
I mostly undress my cartoon women in NAI, and then do inpainting again in krita+NoobAI inpainting model
>>
>>108953267
>I haven't run difffusion models since 2023.
I don't believe you
>>
>>108953303
i believe him. i only got back into it a couple months ago after trying the original stable diffusion on release
>>
>>108953267
>What is the goto way to take clothes off of real photos of women in 2026
klein edit
>what model for realism
https://civitai.com/models/2168935/z-image-turbo
>>
>>108953275
I don't want to dualboot all the time to play games with friends.
>>
File: cat.png (2.14 MB, 1024x1536)
2.14 MB PNG
>>
I've been out of the game for a minute, I see Chroma is the new thing to burn local
what's the general consensus for the 'default' sampler setting for photorealism, gens take too long to experiment and my shit looks ass
>>
>>108953409
If you have to ask just use https://civitai.com/models/2168935/z-image-turbo
>>
>>108953275
You are trans, of course always contemplating to neck yourself.
>>
File: 1758409956107634.png (464 KB, 3238x1689)
464 KB PNG
>>
File: 035502CUI_00001_.png (2.41 MB, 1152x1536)
2.41 MB PNG
I think I've learned about the existence of a hundred obscure fetishes just by browsing Civitai.
>>
best anima sampler scheduler combo aside from the default one?
res_2s 5s 6s bong beta57 and all their combinations work well and make unique images, any other ones?
>>
Is there any theoretical benefit that we can get if we can load the same model in VRAM twice?
>>
>>108953461
amputation is undeserved.

There never has been a fully capable amputation lora.
>>
>>108953390
normgroid cattle
>>
>>108953478
https://github.com/KeithZ117/Comfyui-anima-sampler
>>
>>108952134
Based and TY
>>
https://huggingface.co/collections/nvidia/cosmos3
>16b and 65b omnimodal diffusion models
>image, video, and audio
>fully permissive license
what da fuck
ltx is kill?
>>
File: ComfyUI_00684_.png (567 KB, 896x1152)
567 KB PNG
Pride month :)
>>
>>
>>108953784
already?
>>
>>108953770
>https://huggingface.co/collections/nvidia/cosmos3
Russ has already started training Anima 2 on this
>>
File: AnimaBase1+Turbo_00014_.png (548 KB, 1024x1024)
548 KB PNG
>>108953263
>make shortcut of bat, pin shortcut to taskbar
This caught my eye, but I couldn't get it to work. Searching online found another workaround:
>rename .bat to .exe
>right-click -> more options (if on WIn11) -> pin to taskbar
>rename .exe back to .bat
>shift+right-click on pinned shortcut, edit target from .exe to .bat also
>can also change icon to get logo from firefox.exe or whatever. May need to sign out of Windows and sign back in to take effect.
Holy shit, this is amazing, you've given me Quick Launch back on Win11. Thank you so much!
>>
>>108953770
>ltx is kill?
only if it can generate longer videos than 8 seconds
>>
>>108953770
>https://research.nvidia.com/labs/cosmos-lab/cosmos3/
Damn Nvidia was really trolling us with all the SANA pet projects.
>>
File: cosmos 3.png (886 KB, 1024x1024)
886 KB PNG
>>
File: slopvidia.png (34 KB, 637x385)
34 KB PNG
>>108953770
oof
>>
>>108953770
>16 and 64B params
>barely any examples
>still slopped
>fucked speed through time
>smeary motion
DOA
>>
>>
>>
>>108953902
hold on a cotton pickin' minute, wasn't qwen image trained on synthetic data?

So we're doing the AI version of the human centipede now? AI models trained on AI models that were trained on AI data?
>>
>>108954119
that's what happens when niggas are too lazy to webscrape
>>
File: 45687544586.webm (1.82 MB, 256x448)
1.82 MB
1.82 MB WEBM
kino alert
https://files.catbox.moe/qt16b3.mp4
>>
>>108954137
Finally a real AniStudio generated image!
>>
>>108954001
cozy
>>
>>108953770
nice bloated piece of SHIT
>>
File: debo_lr_anima1_00025_.png (2.31 MB, 1792x1075)
2.31 MB PNG
>>
Is it time to bring back GenJam?
>>
>>108954176
people barely post anything at all anymore since everyone is genning only kino with better models now which they dont want to leak. the days of genjam are over.
>>
>>108954183
what models?
>>
>>108954216
zit for insta 1girlniggers
tranima for tranimeniggers
klein for 5 people that want to edit images and cant wait pixelspace edit models
>>
>>108954225
idgi. what is there to leak if those are public models?
>>
>>108954176
Yes.
>>
>>108954233
leaking the sovlkino generated images and saving your waifus only for your own eyes
>>
>>108954253
oh, right. i think most people don't have a good imagination, so they don't know what else to generate besides coomer pictures
>>
File: zit.png (458 KB, 340x857)
458 KB PNG
wristlets btfo
>>
how do you get zit to do panties? All the loras available absolutely kill detail. Without loras, they always look really weird.

Is the only solution to switch to klein and edit?
>>
>>108954119
It’s local, what did you expect?
>>
>>108954320
>kill detail
dual sampler wf
https://pastebin.com/raw/U2CiEqvC
>>
>>108953207
negative prompt the shit out of fake boobs

implants, bolt-ons, breast enhancement, fake tits, etc
>>
File: ComfyUI_00685_.png (504 KB, 896x1152)
504 KB PNG
>>108953835
indeed
>>
File: united states.png (80 KB, 900x900)
80 KB PNG
>>108954383
bolt ons? what the heck?
>>
>>108950889
Wait so the drama is that lora trainers using stolen art and spending virtually no money aren't allowed to patreon their shit?
Are you fucking serious? Why the fuck are you destroying relationships with every single successful local baker? This has to be some manufactured astroturfing by Sammy right?
>>
>>108954413
somehow I doubt russell comes after lora makers patreon, gating model finetune is probably another topic
>>
>>108950889
>>108954413
I thought open source was free!
>>
How good is klein at creating VN-style sprites from reference images? What about NSFW? Any model recommendations for this?
>>
File: 5378463674.webm (883 KB, 256x448)
883 KB
883 KB WEBM
>>
>>108954413
I mean spending 50k$ out of pocket + probably double from the money grant or whatever along other effort needed to make this finetune is the only reason why anima exists in the first place, it wasnt made by some big corpo, so at least asking for commercial licenses for lora makers doesnt seem that crazy.
It would be good if everything in the world was FOSS, but going out of your way to make something for a lot of your own money that wouldn't have been made otherwise and then asking for people who will directly profit out of mostly your work to enter into a basic commercial contract with you doesn't seem unfair.

Especially given that the end users can still sell all images generated by the model or any lora from it regardless.

People in this space forget that yes, a lot of it is built on the backs of other FOSS projects and work given for free, but a lot of it is also built by people who give their own work but also need to eat and pay the bills.

It's easy for a multibillion dollar corpo to spend some small amount of money short-term on releasing an open weight model that looks like its "just free" for the end consumer, but in reality the corpo is paying for good pr, user training data, thousands of hours of free work, optimizations, tools and improvements from the FOSS community, and especially towards the long-term goal of the destruction of the proprietary market of other companies who will indirectly lose billions of dollars and you gain more long-term if there is an open version of their model that is as 90% as good but is free on the market.

That's why most of the time when you see the moment a company finally "makes it" with a model that can actually rival all others, they don't open source it again.
>>
>>108954374
holy shit!
/ldg/ actually provided something useful for once.
thanks, anon.
>>
>>108954609
its from https://civitai.com/models/2093591/dall-e-3-like-girls but yes
>>
>>108954374
>>108954632
so the idea is that last steps without lora smoothes the image?
>>
tranny in here seethed and reported me
>>
>>108952361
>it's WAY too fucking slow
can jeets stop self-reporting? thanks
>>
should I train Illustrious LoRA instead of Anima LoRA then if (((they))) are going to claim my Anima LoRA as their own?
>>
can't claim what you don't upload
>>
File: file.png (455 KB, 644x747)
455 KB PNG
>>108954901
they will take it from you
>>
>>108954925
hdd chads cant stop winning
>>
File: ComfyUI_00686_.png (735 KB, 896x1152)
735 KB PNG
>>
>>108951970
Anima 2 on Cosmos 3 any second now?
>>
>>108954717
yes, since base zit does realism details well
>>
File: 1756915016324809.png (2 KB, 200x46)
2 KB PNG
>>108954950
more like hdd chads cant stop spinning
>>
why can't >we take a thread to such lengths
>>>/h/8881901
>>
File: ViewFromTheDMZ.mp4 (1.7 MB, 1792x1080)
1.7 MB
1.7 MB MP4
When I'm done gooning my post-nut clarity compels me to gen total gooner death
>>
>>108955030
no more cooming, only kinoing
>>
File: TrueKleinV2_00612_.png (2.12 MB, 1440x960)
2.12 MB PNG
bit bored
>>
>>108951930
>is anima really that good
it must be considering I still haven't figured out how to install it and get it working
>>
File: iajpxsp4cs9c.png (214 KB, 512x512)
214 KB PNG
>>108955012
Can't coompost as hard on /g/ because it's a blue board. Here's a good boy from the nai leak back in 2022 though.
>>
>>108955137
>NAI2
it's so ass. Hoping for NAI3
>>
File: 1754593084641934.mp4 (686 KB, 848x480)
686 KB
686 KB MP4
>new videogen model
https://bernini-ai.github.io/
https://huggingface.co/ByteDance/Bernini

>up to five reference images as inputs
>Insert Image/Video into the Video
>video reference-guided Video Editing
>Prompt-driven video editing cases

>Bernini-R uses two sets of weights:
>Wan2.2 base — Wan-AI/Wan2.2-T2V-A14B-Diffusers on Hugging Face. Supplies the VAE, UMT5 text encoder, tokenizer, and the transformer architecture/base weights. It is downloaded automatically on first run (configured by wan22_base in configs/bernini_renderer_wan22/config.json).
>Bernini-R checkpoint — the trained high-noise / low-noise transformer weights (safetensors) from Hugging Face, passed with --high_noise_ckpt / --low_noise_ckpt. Both a local directory and a Hugging Face repo id are accepted.

Looks kinda OK.
>>
>>108955150
Shame that it's not a fully new model, which is what we actually need.
>>
File: Ernie-Turbo-PID_00068_.jpg (3.43 MB, 4096x4096)
3.43 MB JPG
>>
File: 112708CUI_00001_.png (1.49 MB, 1152x1536)
1.49 MB PNG
>>
>>108954925
This is a actually true, I can identify what is running just by listening. Every time I launch Cum UI inference my nvme emits certain sound, which feels as if the python shit is torturing the device by reading random segments in random order...
Doing something else like loading in a llm model with llama server is pretty quiet and doesn't sound like the device is being tortured.
Then again, I keep my librewolf profile in ram and have tested this: I'm getting zero disk writes or reads when browsing websites so the original issue in this clickbait article is easily avoided.
>>
File: eh_wttyMhVvBs.jpg (186 KB, 1100x580)
186 KB JPG
give me few good local image generation models.
>>
ok tranimefags, i kneel, JANIMA is kino

also, is it possible to train a lora slider where the slider actually allows you to tune the art style of a particular artist starting from his initial art style through time as it changes into the newest one? maybe its a bi too convoluted and not something most would need after they tune it to what they like, and it wont matter later on with edit models allowing good 1 image loras
>>
>wow this checkpoint is totally le good!
>look inside
>IL outputs
>>
>>108955402
it adds better composition with more detail compared to base anima that is dry
>>
>>108955404
>better composition
>1girl standing
>>
>>108955411
the average tranime ai sample image nowadays looks like it could have been made with basically any tranime model anyway, try it with your own prompts or dont nigger
>>
>>108953902
Trillion dollar company but they'll do anything to avoid paying people to augment their training set with real data. Slop fed into slop fed into slop into slop.
>>
File: annoyed.jpg (28 KB, 499x481)
28 KB JPG
>ComfyUI Manager faggot keeps updating the custom-node-list.json every two days
>My PR starts showing "This branch has conflicts that must be resolved"
>resolve conflicts button doesn't work because "These conflicts are too complex to resolve in the web editor."
>I have to close my PR, sync my fork and then repeat the steps of editing custom-node-list.json again, and opening a new PR again

fuck this shit, I thought github did everything automatically.
Why do I have to do everything again and again just because comfy faggot updated his json file?
>>
>>108955411
>>108955402
based, fuck these fat retards sniffing their own farts only to make 1girl standing no matter the architecture.
>>
>>108955411
I've never seen a good 2girls generation
>>
>>108955520
I still don't understand why you use comfy when forge exists. I don't understand what you need in comfy that forge is unable to do. Everything I end up seeing posted is unedited AI slop so it's not like people are putting some serious effort, so what the hell.
>>
>>108955534
1girl, big boob, masterpiece is ALL you need.
>>
>>108955555
exactly, i just think its funny that retards are now waiting MUCH longer to gen the same shit because they're fat and retarded

nice quints, bitch
>>
>>108955550
You don't even post any gens.
>>
cumfart just cannot stop taking Ls
>>
>>108954573
>it wasnt made by some big corpo
cumfartorg is the sign it's big corpo interest. Not once did Russ try to start a fundraiser. Greed was the primary motivator
>>
i'd have taken millions too
>>
>>108955757
with revenue share? :)
>>
File: ComfyUI_28018.png (2.81 MB, 1440x1440)
2.81 MB PNG
>let Z-Image take the wheel
>gives me a weird pose with wet fingers
...

>>108953770
>65B parameters
Cool! My Vera Rubin rack has just been collecting dust.
>>
File: Flux2-Klein_00489_.jpg (443 KB, 1072x1440)
443 KB JPG
>>108955782
>wet fingers
lora dataset leaking
>>
i wasted way too much time trying to wrangle the prompt for a certain seed, and couldn't make it work
>>
Any good proompting guides for anima? Official HF guide is very unspecific and seems not to capture the nuances of it all very well. Especially of natural language prompting.
>>
>>108955972
It's just boorutags and llm slop captions. @ before artist names. why do you need anything else other than an artist list?
>>
>muh artists
>>
File: 3575545.webm (2.77 MB, 256x448)
2.77 MB
2.77 MB WEBM
cliffhanger
https://files.catbox.moe/wixmrb.mp4
>>
File: 436455.webm (2.01 MB, 256x448)
2.01 MB
2.01 MB WEBM
https://files.catbox.moe/m6qhpm.mp4
>>
File: 134727CUI_00002_.png (1.9 MB, 1152x1536)
1.9 MB PNG
>>
File: ComfyUI_00688_.png (967 KB, 896x1152)
967 KB PNG
>>
>>108956083
still waiting for the biblically accurate angel to appear
>>
>>108956143
patience is a biblical virtue
>>
>>108956162
( ◜ω◝ )< I have no patience so I head over to /hdg/ to gen my BBCs
>>
File: 140256CUI_00001_.png (2.45 MB, 1536x1152)
2.45 MB PNG
>>
>>108952001
>>108952004
thanks!
>>
what's the best zimageturbo checkpoint for creating uncircumcised penises?
I don't want shitty cut cocks
>>
>>108953987
how does this change local diffusion?
>>
>>108950676
>>108950693
>>108950953
>>108950964
>>108950982
Why are you still acting like a retarded mandrill?
You forgot we know who Nik is?
>>
>>108956517
training lora would be super easy if that's all you want to do
>>
>>108956573
Ani doesn't come here anymore because you are a screeching faggot and anons chose the grift tineline
>>
>>108956586
you are so fucking retarded dude
it only takes one glance at your post to tell you're terrified
>>
Crazy how ldg still has niggas mindbroken LOL
>>
Janima is kinda shit
it just smears slop onto anima gens
>>
every post you make here is a concession btw
>>
>reply with the truth
>immediate cope
have you ever extrapolated why you hate ani so much?
>>
>>108956617
still better than whatever they did with their illustrious merges kek
>>
>>108956642
to be fair, it seems to have slightly better composition but I feel like it is animefying all my loras
>>
>>108956586
Good riddance to that avatarfag. He never contributed anything of note.
>>
>>108956640
Have you extrapolated how you manage to eat your own shit nonstop with your cock lodged in your throat?
>>
>>108956569
well?
well?
well?
>>
>Ani wants freedom for local models
>Ani puts in the work to get real change done
>Ani already contributed a lot to the open source ecosystem
yeah that makes a lot of sense to hate on him
>>
lol cry more buddy
>>
> Ani
who?
>>
>>108956737
I thought you were crying. I just want to make fun of your mental illness more
>>
>already contributed a lot to the open source ecosystem
How so?
>>
>>108956751
by being a lolcow everyone laughs at desu
>>
>>108956751
by doing nothing worth noting and coming here to cry, duh
>>
>>108956640
Nta but because of things he has done irl.
He will never get rid of me.
Pray that he finds "investors" so i can fully crush him with the AniFiles.
>>
>>108956791
>Pray that he finds "investors"
Now THAT's wishful thinking
>>
The catjak samefagging is pretty egregious rn
>>
>>108955430
>PAY. CAPTIONERS. WHAT. THEY'RE. WORTH.
seethe lmao
>>
File: 154417CUI_00001_.png (1.67 MB, 1536x1152)
1.67 MB PNG
>five fingers
ruined
>>
>julien
>>
>>108956876
cute
>>
>>108956586
>the grift tineline
? Why should I care about cloud providers ?
>>
>>108956876
thats her thumb
>>
>>108956908
when comfyui starts releasing paid local addons
>>
>>108956930
you forgot commercial local models with revenue share
>>
>>108956930
>>108956942
So a nothing burger, then?
>>
File: 155625CUI_00001_.png (1.59 MB, 1536x1152)
1.59 MB PNG
>>108956924
I know. Pitou only has 4 fingers.
>>
>>108956956
I am sure you know more than everyone here catjak. At least one of your split identities
>>
>>108956956
yeah anon thinks that if he fuds what he sees as the competition people will start using his stuff
>>
>>108956930
>>108956942
>>108956966
ok when are you killing yourself though?
>>
>>108956974
I am not catjak, retard
>>
File: 00132-225729096.png (1.85 MB, 1024x1536)
1.85 MB PNG
>>108952067
It IS advertised but i know your frustration.
so most quality tags you absolutely don't need, but i do recommend experimenting with highres/absurdres/style-adjacent tags based on what you're working with
reses as long as they're 64 multiples and under 4MP you're fine, in fact you don't NEED hiresfix on base anima and slopmerges that aren't complete shit, you can oneshot 1536x1536/etc.
>this image is ackshully illustrious 2.0 based but demonstrates my point minus the need for adetailer on the face and eyes
>>
>>108956989
no, but you are a vile subhuman avatartroon nonetheless
so do the world, us, and your family a favor a favor and kill yourself
>>
>>108956991
Everyone I see using anima is throwing it through a detailer and res tags are just as bad as quality tags.
>>
>>108956999
>>108956989
>>
>>108957009
>>108956999
>>
>>108957015
nice vike subhuman avatartroon behavior you are displaying catjak
>>
Will someone else bake for once? I want to make sure catjak feels like she doesn't belong
>>
uh oh melty
>>
>>108957045
to be fair I have been hoping catjak kills herself every day so we can be free from her bullshit
>>
Why is lilbro having a hissyfit rn?
>>
File: 160327CUI_00001_.png (2.37 MB, 1536x1152)
2.37 MB PNG
You niggas are crazy, bro
>>
>>108957038
>>108957045
>>108957058
Julien is a raped retard lmao
>>
>>108957066
the baker lost all credibility years ago so she troons out on ani hoping it deflects the blame for ruining the thread
>>
>>108957069
It's unironically one guy
>>
>>108957072
catjak is why we left /sdg/ in the first place so I don't know why she migrated to a thread where everyone hates her
>>
lilbro is absolutely losing his mind right now lmao
>>
>>108957082
maybe we just need to split again
>>
>>108957082
>>108957090
Is your amputated cock haunting you again? It's not gonna grow back buddy
>>
>>108957110
>>108956989
>>
>>108957072
I was talking about you though you're the one having a fit right now
>>
I don't think there has been a thread in a very long time where the troon hasn't sperged, fudded and seethed over ani
>>
If anyone wonders I am still working on building the apps that I will use for preparing my data to LoRA train
>>
>>108957152
ok
>>
>>108957152
I could use app that doesn't shit itself when there are folders with 1k+ images
>>
I can't proompt a style I'm satisfied with... 90% of the time I see slop I like it was either made in NAI and the tags don't transfer to anima, or there's no metadata.
>>
>>108957072
>>108957082
>>108957090
>>108957116
>>108957141
OHNONONONONONONONO
PFTAAHAHHAHAHHAHHAHAHHAHHAHHABHHHAHHAHAHHASHAHHAHAHHHAHAHHAHHAHAAAAAHAHHAHAHHAHAH
>>
>>108957194
Just train a lora
>>
File: 163530CUI_00002_.png (1.6 MB, 1536x1152)
1.6 MB PNG
Demon Deals got the fingers right.
>>
Julien the lolcow is a homosexual pedophile and loser who will never get his revenge on comfy
>>
????????????????

rgthree nodes keep fucking disabling themselves mid genning, I have to swap versions fucking constantly. What the fuck is wrong with it?
>>
>nodes
>>
File: 1760289160451064.png (1.6 MB, 1680x960)
1.6 MB PNG
>tfw making a horror VN
>get kino moments in between like this
feels good
>>
File: ComfyUI_00689_.png (1.37 MB, 896x1152)
1.37 MB PNG
behold my new opus
>>
File: ComfyUI_00690_.png (1.41 MB, 896x1152)
1.41 MB PNG
>>
File: ComfyUI_00691_.png (1.59 MB, 896x1152)
1.59 MB PNG
>>
File: ComfyUI_00692_.png (1.53 MB, 896x1152)
1.53 MB PNG
>>
>>108957415
>>108957491
>>108957503
>>108957512
exact same prompt, different seeds
>>
cozy
>>
File: 1775149781080916.jpg (1.28 MB, 1488x1824)
1.28 MB JPG
>>108955539
what about me
>>
>She is holding a stick. She pokes dog poop on the ground with it. Outdoors.
goated prompt adherence
>>
File: 171531CUI_00001_.png (1.32 MB, 1536x1152)
1.32 MB PNG
>>108957770
kek forgot the image
>>
>>108955539
2girl looks great if you inpaint, trying to do it in 1 gen is always garbage



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.