[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion and Development of Local Image and Video Models

Previous: >>108931834

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>108938075
what is 35? answer me you schizo
>>
>mfw Resource news

05/29/2026

>Colored Noise Diffusion Sampling
https://hadardavidson.github.io/CNS

>VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion
https://videomla.github.io

>minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models
https://github.com/shengshu-ai/minWM](https://github.com/shengshu-ai/minWM

>GPIC: A Giant Permissive Image Corpus for Visual Generation
https://gpic.stanford.edu

>SGMD: Score Gradient Matching Distillation for Few-Step Video Diffusion Distillation
https://github.com/ModelTC/LightX2V

>Native Audio-Visual Alignment for Generation
https://ernie-research.github.io/NAVA

>GASS: Geometry-Aware Spherical Sampling for Disentangled Diversity Enhancement in Text-to-Image Generation
https://github.com/L-YeZhu/GASS_T2I

>SAVAA: Mitigating Hallucinations in LVLMs via Step-wise Adaptive Visual Attention Amplification
https://github.com/JiachengZ01/SAVVA

>Nexus BTA: Local AI image and video studio built around an embedded ComfyUI runtime.
https://github.com/JpAndreBTA/Nexus-BTA

05/28/2026

>MAVEN A Multi-Agent Framework for Multicultural Text-to-Video Generation
https://github.com/AIM-SCU/CRAFT

>Bias Leaves a Gradient Trail: Label-Free Bias Identification via Gradient Probes on Concept Decompositions
https://github.com/vitryt/label-free-bias-identification

>VidPrism: Heterogeneous Mixture of Experts for Image-to-Video Transfer
https://github.com/Lrrrr549/VidPrism.git

>Wan2.2-NVFP4-Sparse (NVFP4)
https://huggingface.co/lightx2v/Wan2.2-NVFP4-Sparse

>Microsoft data suggests using AI is more expensive than hiring people
https://finance.yahoo.com/sectors/technology/articles/microsoft-data-suggests-using-ai-225900743.html

>Pixal3d Studio
https://huggingface.co/spaces/victor/pixal3d-studio/tree/main

05/27/2026

>InvokeAI 6.13.0
https://github.com/invoke-ai/InvokeAI/releases/tag/v6.13.0
>>
Blessed thread of frenship
>>
I'm just here for slopkino and the news
>>
>mfw Research news

05/29/2026

>KGEdit: Ambiguity-Aware Knowledge Graphs for Training-Free Precise Video Generation and Editing
https://arxiv.org/abs/2605.29509

>LiveSVG: Zero-Shot SVG Animation via Video Generation
https://levymsn.github.io/LiveSVG

>VPG: Visual Prefix Guidance for Autoregressive Image and Video Generation
https://arxiv.org/abs/2605.30317

>Orthogonal Negative Guidance in Attention Feature Space for Text-to-Image Generation
https://arxiv.org/abs/2605.29390

>Veda: Scalable Video Diffusion via Distilled Sparse Attention
https://arxiv.org/abs/2605.30325

>Cert-LAS: Toward Certified Model Ownership Verification for Text-to-Image Diffusion Models via Layer-Adaptive Smoothing
https://arxiv.org/abs/2605.29809

>Future Forcing: Future-aware Training-free KV Cache Policy for Autoregressive Video Generation
https://arxiv.org/abs/2605.30083

>AdaState: Self-Evolving Anchors for Streaming Video Generation
https://adastate.github.io

>YoCausal: How Far is Video Generation from World Model? A Causality Perspective
https://www.youzhexie.me/papers/YoCausal/index.html

>Alignment-Guided Score Matching for Text-to-Image Alignment in Diffusion Models
https://jaayeon.github.io/AGSM

>Benchmarking Single-Factor Physical Video-to-Audio Generation
https://research.nvidia.com/labs/cosmos-lab/flatsounds

>IP-Adapter Is All You Need: Towards Fine-Tuning-Free Diffusion-Based Talking Face Generation
https://arxiv.org/abs/2605.30230

>Stable-Layers: Fine-Tuning Image Layer Decomposition Models with VLM-Scored Reinforcement Learning
https://arxiv.org/abs/2605.30257

>GenClaw: Code-Driven Agentic Image Generation
https://arxiv.org/abs/2605.30248

>Guidance Contrastive Token Credit Assignment for Discrete Policy Optimization
https://arxiv.org/abs/2605.29198

>OccamToken: Efficient VLM Inference with Training-Free and Budget-Adaptive Token Pruning
https://arxiv.org/abs/2605.29657

>Diffusion Models, Denoiser Architecture and Creativity
https://arxiv.org/abs/2605.16415
>>
>>108938084
https://rentry.org/ranfaggot
this is what he means by 35
>>
>>108938092
this link should be stickied lol
>>
Can't gen 1girl anymore because I'm love with a girl irl and genning just makes me melancholy
>>
>>108938104
gen her. make a lora
>>
Welcome back I'm glad to see you're still crying
>>
File: Flux2-Klein_00033_.jpg (235 KB, 1024x1536)
235 KB JPG
holy moly, look at how detailed the skin on the face is
>>
>>108938110
>raped retard catjak signing in again
>>
>>108938143
i dont understand what im looking at
>>
>>108938104
Gen deepfakes of her performing the most degrading sexual acts possible on the fattest, baldest, oldest, ugliest Indian men until you stop being lovesick.
>>
>>108938104
skill issue. you are too bad at genning otherwise you would have genned multiple girls that are better than any can ever be irl.
>>
If I generated risque gens of all my highschool crushes before the Take It Down Act went into affect do I get grandfathered in and am allowed to keep the pics?
>>
File: HunyuanVideo_00063.mp4 (640 KB, 960x544)
640 KB
640 KB MP4
>>108938092
>>108938103
>>108938147
Samefag
>>
>>108938169
just let the two trannies tucker themselves out
>>
>>108938163
I did but I'm not in love with them, I'm in love with the girl irl
>>
File: Flux2-Klein_01103_.png (1.87 MB, 832x1248)
1.87 MB PNG
>>
where is everyone?
>>
File: kuro.jpg (213 KB, 1152x896)
213 KB JPG
>>
File: kuro2.jpg (236 KB, 1152x896)
236 KB JPG
>>
>>108938143
Which part is the face
>>
>>108933597
Ok, let's see if you're right. I can imagine 12 gates to a city (so you have to fly around it), it's a cube in shape, and it's multi-layered and glowing, like a rainbow (in other words, layers of "stone" that are glowing colors). All around it are flashes of lightning. We zoom into the center of the city, which had golden streets, to see a lamb with a wound which glows (and is like a nuclear sun).
>>
>>108933527
>maybe i can do a full scale invasion on a city
alien invasion kino pls
>>
File: 215146CUI_00001_.png (2 MB, 1192x1536)
2 MB PNG
>>
>>108938422
Neat.
>>
File: FK9B__00003_.png (1.51 MB, 832x1216)
1.51 MB PNG
>>108938422
>>
>>108938422
telltale style lora?
>>
File: 1777139007041328.jpg (102 KB, 1920x1080)
102 KB JPG
>>108935702
>>108932565
>>108932501
>>108932065
you are on a roll today
>>
File: 023744CUI_00001_.png (1.77 MB, 1192x1536)
1.77 MB PNG
>>108938469
Yeah. It's amazing how well it mixes with different character loras. The more I mess with anima, the more I realize how good it really is.

https://civitai.red/models/2519109/telltale-games-style-anima-lora
>>
File: 024624CUI_00001_.png (1.97 MB, 1192x1536)
1.97 MB PNG
The borderlands one is also neat.
>>
File: FK9B__00004_.png (1.38 MB, 832x1216)
1.38 MB PNG
>>108938455
>>
>>108938536
i love psycho baltic women
>>
>use Linux
>accidentally loads huge checkpoint
>the whole desktop UI locks up and hangs
why is it so bad on here?
>>
>>108938561
tranny code vs microslop jeet code, on windows i have to kill and restart explorer.exe every few days since it starts taking 5 seconds to load folders if its left open for multiple days
>>
>>108938579
I don't understand why Microsoft needs to break everything every few years. Windows 10 just works now. Why are they retiring it in favor of starting the cycle of pain again?
>>
>>108938561
>>the whole desktop UI locks up and hangs
What do you think vram is?
>>
>>108938561
That's not exclusive to linux.
>>
>>108938592
oh no i was talking about windows 10. but yes, 11 is even worse of course
>>
>>108938592
it's not about product quality. you need to make money
>>
>>108938592
money, duh
>>
>>108938594
>>108938597
I never had desktop locks up when I'm out of VRAM on Windows
>>
>>108938594
what do you think proper process prio is
>>
thoughts on the anima turbo lora v0.2? i only tested it briefly. it seems pretty different from the first one. i don't know yet which one i like more, it's gonna take ma few weeks of testing
>>
>>108938594
its cpu ram. most distros dont have aggressive oom killers enabled by default
>>
>>108938610
i need neither the purported speed gains nor le stabilization
>>
>>108938603
idk, myself, I don't use Windows.

Personally, I'd rather fly close to the sun than clutch pearls, but whatevs
>>
>>108938615
>cpu ram
Aha. I have 64gb, but yeah, at 16gb I couldn't gen ANYTHING. I think I could have if stablediffusioncpp existed though.
>>
>>108938610
Free lunches are NEVER free.
>>
I'm spoiled with what I can gen with modern tech but I really want that good video generation. And when it hits real-time local Seedance 2.0 tier generation? It's over.
>>
>>108938392
i'm composing a prompt for some mega kino right now
>>
>>108938643
The more kino images I gen the worse it is because I get reminded that I can't just add them to my 3D digital world yet.
>>
>>108938536

nice buckling mode modelling anon
stem anons have envy
>>
>>108938656
idk what that means.
>>
>>108938604
>proper
To quote anna sarkesian, yeah, we're better than you.
>>
File: 37486635.png (649 KB, 1742x709)
649 KB PNG
kino alert
>>
>>108938701
>>108938383
>>
>>108938711
i personally won't violate the spiritual laws by visualizing new jerusalum. you have to do it yourself and face the consequences
>>
>>108938728
:'-D So you're wrong.

>you can see anything that you can imagine ;^)
>>
Is there a new Anima LoRa training meta? Some small file size LoRas showing up.
>>
>>108938731
wrong about what? you can see it if you want, but you will be smited afterwards
>>
File: wat.jpg (226 KB, 1500x1500)
226 KB JPG
I think it's appropriate to post this here, though idk if it's "local": it's a listing on Amazon, a fake product.
>>
>>108938738
nope, can't see it...
>>
Maybe if ... someone... could gen it I could see it.
>>
File: 936862137.png (157 KB, 1500x1500)
157 KB PNG
>nope, can't see it...
>>
File: 1761837182204753.jpg (1.06 MB, 1216x2176)
1.06 MB JPG
>>
>>108938656
>>108938673
You mean because I used Tan2?
>>
File: ANIMA_bface_bad_00010_.png (1.33 MB, 832x1216)
1.33 MB PNG
>>108938785
>>
File: ANIMA_bface_bad_00012_.png (380 KB, 832x1216)
380 KB PNG
>>108938907
>>
>numpy
>xformers
>flash-attn
The holy trinity of annoying dependency errors
>>
>>108938989
just use a venv
>>
>>108938610
Just noticed it came out, too. Suffice to say, at high-res there's no competition between Anima Preview 3 + Turbo 0.1 and Anima Base 1 + Turbo 0.2.

>4chan wants a new cookie, wait 60s
>4-step captcha
>file too big after upload
>another 4-step captcha to retry with a lower-quality jpg
>connection error
>try again
>fake rangeban
>have to try another known working machine
The things I put up with to keep posting here...
>>
File: 24367357367.webm (1.78 MB, 448x256)
1.78 MB
1.78 MB WEBM
nothing ever ha-
>>
>>108939035
1280x1600, same seed.
>>
>>108939035
>>another 4-step captcha to retry with a lower-quality jpg
4chan xt alleviates at least this part of it. Sorry fren.
>>
>>108939038
wtf is this real
>>
File: zit.png (753 KB, 660x607)
753 KB PNG
>>
>>108939047
Last one. Is this achievable natty?
>>
>>108939061
>>108939047
base without lora should be best generally, yes?
>>
File: ANIMA_bface_bad_00013_.png (366 KB, 832x1216)
366 KB PNG
>>108938985
>>
>>108939038
please be real in tel aviv
>>
File: 1280x1600_compare.jpg (2.79 MB, 2560x3300)
2.79 MB JPG
>>108939065
Here's without loras.
>>
File: 34724663.webm (471 KB, 448x256)
471 KB
471 KB WEBM
>>108939092
it will be your house soon
>>
File: 24758357.webm (2.5 MB, 448x256)
2.5 MB
2.5 MB WEBM
TAKE ME TO HEAVEN
>>
>>108939139
This happened to me in a dream once
>>
I have went through 30k images generated with a single lora, by now I feel like I have seen all variations of all 1girls it can generate and I can now run inference in my mind.
>>
>>108936376
some faggot that hated anime started concern trolling and made a separate thread to segregate anime gens. it did fuck all in that regard, but atleast you can post cloud shit in /adt/ i guess.
>>
File: grid_4_20260529_211149.jpg (3.6 MB, 3968x2688)
3.6 MB JPG
>>
File: krk3.png (902 KB, 768x1024)
902 KB PNG
>>
>>108939198
What manga is this trained on? Reminds me of Dorohedoro a bit.
>>
File: 1751428370468416.jpg (1.13 MB, 1984x1344)
1.13 MB JPG
>>108939224
anima base no lora https://files.catbox.moe/9jftlq.png
>>
>>
>>108939304
do this but frutiger aero
>>
File: Flux2-Klein_00050_.png (3.81 MB, 1440x1922)
3.81 MB PNG
anon, PornMaster-色情大師 Flux2 Klein is actually good for editingany gooning material, I thought it was for chinese 1girl all this time....
>>
File: grid_4_20260529_214825.jpg (3.55 MB, 3968x2688)
3.55 MB JPG
>>
File: 1759783754317761.jpg (958 KB, 1344x1984)
958 KB JPG
>>
File: 6422445.webm (1.55 MB, 448x256)
1.55 MB
1.55 MB WEBM
this is some kind of biblical event
>>
>>
pantsu-ripper, elegant, 1 girl, nude, sparse pubic hair, azula, amber eyes, black hair, topknot, single hair bun, sidelocks, petite, small breasts, (toned:0.6), forest, outdoors, grass, flowers, holding apple, garden of eden, , serpent wrapping body, serpent coiled around, serpent entangled, serpent embrace, large serpent, mystical serpent, contrapposto, looking at another, nature,, framed, ornate border, seductive smile
>>
>>108939529
cool. did you train lora on combat footage or what
?
>>
File: 062253CUI_00002_.png (1.71 MB, 1192x1536)
1.71 MB PNG
>>
>>108939584
generic reply. welcome email. tldr. hi
>>
>>108939576
no it's stock ltx2.3. the trick is to generate at low resolutions cause i think the high resolution training data has no combat footage
>>
>>108939616
>x2.3 you just hit the /g/ word of the day!
>>
>>108939092
they don't have those kind of trees do they
>>
>>108939574
https://files.catbox.moe/mjeusy.png
>>
File: 357656.webm (3.35 MB, 448x256)
3.35 MB
3.35 MB WEBM
the trvthnvke just dropped
>>
>>108939676
die screaming
>>
>>
>>
File: 1771636877734945.png (3.13 MB, 2176x1216)
3.13 MB PNG
>>
>>108939529
based
>>
>>108939529
This is nice. I am imagining attacking Israel and this kind of Yahweh thingy happening, which fuels my energy to go further because Yahweh is trash
>>
File: ComfyUI_00598_.png (708 KB, 896x1152)
708 KB PNG
>>
File: ComfyUI_00599_.png (697 KB, 896x1152)
697 KB PNG
>>
should i continue this as an action movie or porn
https://files.catbox.moe/i11suc.mp4
>>
>>108940157
do a lot of crazy anime explosions
>>
File: 00168-1463761361re.png (1.47 MB, 1088x1408)
1.47 MB PNG
>made it in
lfg!
>>
File: 1757055889610308.png (3.3 MB, 1856x1472)
3.3 MB PNG
>>
>>108940194
sounds genius
>>
File: ComfyUI_00600_.png (791 KB, 896x1152)
791 KB PNG
>>
File: ComfyUI_00601_.png (1.35 MB, 896x1152)
1.35 MB PNG
>>
File: ComfyUI_00602_.png (925 KB, 896x1152)
925 KB PNG
>>
File: ComfyUI_00603_.png (1.17 MB, 896x1152)
1.17 MB PNG
>>
File: ComfyUI_00605_.png (954 KB, 896x1152)
954 KB PNG
>>
File: 34747.webm (1.13 MB, 448x256)
1.13 MB
1.13 MB WEBM
>>
I have more than 25,000 high resolution coom pictures and wallpapers I have saved since more than a decade.
How can i use them to train an AI model and create a checkpoint?
How long will it take, how much RAM and processor do I need to have to make a 6-12 GB training model?
>>
>>108940344
>forty muddy seconds of nothing happening in 256p
back to the drawing board
>>
>>108940367
nothing happens in war until the very last second of your life
>>
So does the anima licence make it so my loras are owned by circlestonelabs? Am I getting that right?
>>
>>108940426
no
your loras are owned by you
let (((them))) try and take your loras away from you
>>
>>108940426
anima owns the commercial sales of your Lora and no you will not be compensated
>>
https://files.catbox.moe/9z56ym.mp4
the nuke sounds so kino in this one
>>
mother of kino
https://files.catbox.moe/dfvkcm.mp4
>>
File: ComfyUI_00607_.png (611 KB, 896x1152)
611 KB PNG
>>
>>108940426
It doesn't matter because the model itself has been trained with stolen materials.
Unless they are able to prove where every single image from their database came from you have 0 issues to worry for. Models like anima come and go and in ten years cum ui has been bankrupted anyway.
>>
File: ComfyUI_00608_.png (861 KB, 896x1152)
861 KB PNG
>>
>>108940539
deviantart, danbooru and whatever is in laion pop art. I am still mad how little it was trained on. We could have gotten a much better model with rule34 added
>>
>rule34
sure buddy
>>
>>108940602
you would also get infinity ai slop that r34 failed to tag
>>
>>108940602
just train that maggie simpson lora yourself, king
>>
I come here asking a question without shame… what would be the easiest way to gen a specific man and woman having sex together? I’ve tried IMG2IMG without much luck, I’ve tried Klein 9B for identity swapping on real images and it kinda works, the only other thing I’ve tried with success is uncensored SaaS sites like Venice but I’m tired of not being able to use local.


Any advice would be appreciated!
>>
>>108940649
just pay a hooker
>>
>>108940649
Train Lora on man. Train Lora on woman.
>>
>>108940679
>he has to take pictures of his peepee for the dataset
oh no no no no no
>>
>>108940649
You should be able to use ...
>>
File: 3573655.webm (3.15 MB, 448x256)
3.15 MB
3.15 MB WEBM
>>
>>108940649
just face it it ain't happening for you even in AI.
>>
I got a 5070 Ti and 80GB total RAM and wanna get back into genning, video as well if possible. Haven't been around since SD 4 or somewhere along the line, what the fuck is going on? There's like 12 front ends and I can't make heads or tails of where I should even try beginning
>>
File: ComfyUI_00609_.png (1.16 MB, 896x1152)
1.16 MB PNG
>>
>>108940716
a1111 is dead, forge neo is the successor. swarmui is a c# wrapper around other back ends. comfyui is spyware/malware and comfyanon is full groftercore now. I guess there is wan2gp as well I guess
>>
>>108940679

Yeah but then what? Do you have to rely on something like regional prompting? I’m asking because I can’t stand the vanilla looking males in the various checkpoints.
>>
>>108940716
don't be a weirdo. just get a portable comfy like everyone else.
>>
>>108940716
wan2gp is the best for video
>>
>>108940351
donate that shit to the sulfteam or playtime ai. both of them have disords.
https://huggingface.co/Playtime-AI
https://huggingface.co/SulphurAI/Sulphur-2-base
>>
File: 673214.gif (673 KB, 498x392)
673 KB GIF
sfw vageen please
>>
File: 00002-2502644298re.png (2.01 MB, 1888x768)
2.01 MB PNG
Where is the object permanence?
>>
File: 120000CUI_00001_.png (1.86 MB, 1192x1536)
1.86 MB PNG
Anima does 3D so well. It's crazy. Can't wait for the inevitable LiS lora. I'll be genning Victorias all day.
>>
>>108940716
Best option is to make Claude to work for you for free and make your own inference script.
They enshittified cum ui just in one year, I can't even imagine how bad it'll be next year at this time.
>>
File: 537483457.webm (3.14 MB, 448x256)
3.14 MB
3.14 MB WEBM
>>
>>108940910
>to make Claude to work for you for free
benchod
>>
File: 537246.webm (1.07 MB, 448x256)
1.07 MB
1.07 MB WEBM
>>
File: 121410CUI_00002_.png (1.45 MB, 1192x1536)
1.45 MB PNG
>>
>>
>>108940981
zombie
>>
>>108940963
vampire
>>
>>108940998
mimic
>>
>>108940935
I admit, I went too far. May Vishnu help me.
>>
File: q_0srgv8.png (2.21 MB, 1536x1536)
2.21 MB PNG
>>
>>108939061

in which anime serie saber was one of main characters
>>
File: 537647.webm (3.16 MB, 640x384)
3.16 MB
3.16 MB WEBM
bumping up the resolution
>>
>>108939529

nothing seem to bounce to skies
>>
Here is an idea: pixel space video diffusion.
Clearly what the local needs as the next step after Wan 2.2
>>
>>108941131
i'm sure a dog fucker with a gaming pc will get right on that
>>
>>108940351
Best use case for such a large dataset is probably to bring NSFW to a SFW model like Z-Base, Klein 9B etc.
The training will be long so you want something powerful like 5090. RAM doesn't matter that much, just use disk to cache latents if you can't hold them all in memory. CPU power is completely irrelevant.
>make a 6-12 GB training model
Not sure what you mean by that, do you want to finetune a model in that size range? Z-Base and Klein both fit the bill. You are not training anything from scratch with just 25k images.
Anyway you can try getting in contact with this guy as well, he is trying to bring NSFW to Klein.
https://huggingface.co/SG161222
>>
someones making IP adapter for anima.
>>
>>108941213
don't care until it's out but it hasn't worked on DiT models very well so I am not expecting much
>>
File: 131028CUI_00001_.png (2.41 MB, 1192x1536)
2.41 MB PNG
>>108941213
That's nuts.
>>
firing up the kino reactor. hope i get a good seed
>>
File: 00010-1238901691re.png (1.08 MB, 944x1920)
1.08 MB PNG
>>
File: 1754546927972583.png (2.32 MB, 1104x1408)
2.32 MB PNG
>>108940910
claude sucks donkey dick

Use a mix of Kimi K 2.6 and GLM

All western AIs are so censored and dogshit I can't believe people fall for the grift
>>
>>108941457
safety is important because it ensures that large language models produce ethical, reliable, and non-harmful outputs. we don't want people to feel sad using it.
>>
^
I'm sorry I cant respond to that post as it goes against my guidelines.
>>
>>108940752
>>108941196
thanks for the help and info
>>
File: check it out.jpg (641 KB, 2048x1536)
641 KB JPG
what is this hand symbol called, pals
>>
File: 365552.webm (3.01 MB, 168x420)
3.01 MB
3.01 MB WEBM
dang, the kinos look much better when i generate them in vertical aspect ratios
>>
File: ComfyUI_00613_.png (1.13 MB, 896x1152)
1.13 MB PNG
>>
File: ComfyUI_00616_.png (876 KB, 896x1152)
876 KB PNG
>>
File: ComfyUI_00615_.png (1.01 MB, 896x1152)
1.01 MB PNG
>>
File: 365552.webm (3.44 MB, 256x448)
3.44 MB
3.44 MB WEBM
>>
please just kys yourself catjak. you can chase after ani in the afterlife
>>
>>108940890
Oof
>>
File: 574664.webm (3.86 MB, 256x448)
3.86 MB
3.86 MB WEBM
>>
>>108941688
>dang, the kinos look much better when i generate them in vertical aspect ratios
trained on iphone video.
>>
File: ComfyUI_00617_.png (1.15 MB, 896x1152)
1.15 MB PNG
>>
File: 00125-1966139116re.png (2.43 MB, 1920x944)
2.43 MB PNG
Probably shouldn't have prompted huge breasts for this one...
>>
File: ComfyUI_00620_.png (1.06 MB, 896x1152)
1.06 MB PNG
>>
>>108941916
yeah maybe
>>
File: ComfyUI_00621_.png (1.2 MB, 896x1152)
1.2 MB PNG
>>
>>108941887
Can I have 6BP Anima model?
>>
File: 637655677.webm (2.52 MB, 256x448)
2.52 MB
2.52 MB WEBM
the audio quality is excellent for this style of video
https://files.catbox.moe/46ad4a.mp4
>>
File: q_8ccwbb.png (2.82 MB, 1420x1826)
2.82 MB PNG
>>
>>108941958
did you even watch any of these before you uploaded them?
>>
>>108941983
yes i leap out of my chair and scream "KINO!!!!!!!!!" 3 times in a row while beating my chest each time a new one finishes generating
>>
File: ComfyUI_00622_.png (819 KB, 896x1152)
819 KB PNG
>>108941942
>>
File: 357643275.webm (3.5 MB, 256x448)
3.5 MB
3.5 MB WEBM
>>
>>108942003
I really enjoy these. I think what I personally would like to see is maybe the squad being attacked by a big warewolf or something like that. Maybe you're aiming for more realistic stuff, but that is what pops into my imagination.
>>
>>108942003
is this a liveleak lora? very kino dude
>>
File: 473754.webm (3.53 MB, 256x448)
3.53 MB
3.53 MB WEBM
>>108942023
thanks, i'm learning the new prompt relaying system that wan2gp added. it's so much nicer now that i can one-shot the whole video generation without needing to do clip extensions to get the right prompt timings. i'll try your suggestion when i get bored of nuclear kino
>>
>>108942047
tThank you for thanking.
>>
children don't even need action figurines anymore. praise AI.
>>
>>108942043
it's stock ltx2.3
>>
File: q_9bdll4.png (1.71 MB, 896x1152)
1.71 MB PNG
>>
>>108938610
i've tested it a bit and i don't like it. whatever tdrussel did with 0.1 was much better than 0.2
>>
File: 154536CUI_00001_.png (1.03 MB, 1192x1536)
1.03 MB PNG
>>
>>108942047
I tried the prompt relay thing in ComfyUI but I think I messed it up in some way. I'm going to try wan2gp later today, seems like a neat feature.
>>
File: ComfyUI_00623_.png (1.06 MB, 896x1152)
1.06 MB PNG
>>
>>108942104
HOLY MAMA
>>
File: ComfyUI_27430_SMALL.jpg (2.81 MB, 1500x1920)
2.81 MB JPG
>>108938479
I'm still trying to trying to make PiD work for me, but it's doing it's best not to cooperate! Here's a full-size image so you can get a sense of how it's going. It's mainly lacking detail at that resolution because ZIT has to work at 1024px to get anything usable into it, which isn't great. I haven't found a way to get a 2048px latent to PiD that doesn't get garbled along the way.

Full-size:
https://files.catbox.moe/vpgg3p.png
>>
File: 155352CUI_00001_.png (1.21 MB, 1192x1536)
1.21 MB PNG
>>
>>108941958
kek this is how I dream
>>
>>108942047
>>108942003
Do you have more?
>>
>>108942125
>no workflow
pointless
>>
>>108942177
i posted a bunch in this thread
>>
File: 160222CUI_00001_.png (2.26 MB, 1192x1536)
2.26 MB PNG
>>
File: 34676.webm (2.75 MB, 256x448)
2.75 MB
2.75 MB WEBM
ok i'm going to sleep. last kino for now. audio version here: https://files.catbox.moe/f45xak.mp4
>>
File: 160750CUI_00001_.png (1.31 MB, 1192x1536)
1.31 MB PNG
>>
>>108942244
really cool anon, thanks for sharing
>>
>>108938092
>>108938169
>>108938147
>>108941908
any new projects that will make you a millionaire? or have you finally accepted the fact that you're a worthless raped retard that will die alone and destitute?
>>
Sd.cpp has an ui bundled in it since april
Why don't we add it to OP then? Finally sd.cpp has a usable UI that is maintained and with a proper license :)
>>
>>108942386
are you ok?
>>
>>108942386
>usable UI
have you actually used it
>>
>>108942386
>Why don't we add it to OP then?
sdcpp is great. it is a good cli for hoomans and agentic ai. the bundled gui is a joke. better to script it. claude code could probably make a better gui in a couple of minutes.
>>
>>108942412
we should support it more than cumfart
>>
>>108942401
>>108942408
>>108942412
uh oh, the raped retard is having a melty
>>
>>108942408
Yes, does it's job well and doesn't crash all the time
Also prper license is a big plus
>>
>>108942465
it's super shit compared to something like anistudio stop lying
>>
>anistudio
>>
>sd.cpp has it's own ui now
How long will his melty last this time? I say 2 weeks min
>>
>>108942408
https://github.com/FizzleDorf/imgui-node-editors
>>
>fizzling
>>
>imgui
>>
>>108942476
>>108942499
You are a worthless, raped retard and no one here will ever use any of the digital excrements you shit up, Julien
>>
File: image.png (75 KB, 1098x264)
75 KB PNG
>>
>>108942526
Don't they have a revenue share?
>>
>>108942526
It's great seing comfy being this successful
Very based
>>
>>108942521
but you are a nocoder
>>
File: ComfyUI_00624_.png (870 KB, 896x1152)
870 KB PNG
>>
File: ComfyUI_00625_.png (788 KB, 896x1152)
788 KB PNG
is there a huggingface space that'll convert images to text without being a censored faggot?
I want it to be detailed and explicit iykwim
>>
File: ComfyUI_00626_.png (999 KB, 896x1152)
999 KB PNG
>>
>>108942617
Be grateful your family remains oblivious to the depths of your subhumanity
>>
>swimsuits are not popular on civitai
>>
File: 1775831571890750.png (1.72 MB, 1344x768)
1.72 MB PNG
Honestly if you're using anything other than Swarm, you might be a bot
>>
>>108942125
>
the difference may only be visible to you with the layer mode "difference" in gimp
>>
>>108940890
Catbox? Is this a lora? I couldn't get 3d to look a way I liked but this is nice.
>>
>>108942125
Will the jenny lora be released?
>>
File: ComfyUI_00627_.png (845 KB, 896x1152)
845 KB PNG
behold
>>
>>108942244
Neat
>>
File: ComfyUI_00628_.png (828 KB, 896x1152)
828 KB PNG
>>
File: 1753371624335673.png (990 KB, 768x1344)
990 KB PNG
>tfw you find a character anima doesnt know

feels bad
>>
Are there any local video generators today that can use character reference sheets (an image where you have multiple panels of the same character from the front, back and side angles to give the generator a 360° consistency) like Seedance 2.0, Gemini Omni and Kling? It's one of the things I really wish LTX-2.3 had (maybe they'll introduce it in LTX 3.0).
>>
File: ComfyUI_00629_.png (636 KB, 896x1152)
636 KB PNG
>>
>>108942657
Why the hate? You are an even worse faggot, getting involved in a fight between two millionaires, you are a raped retard instead.
>>
File: ComfyUI_00630_.png (688 KB, 896x1152)
688 KB PNG
>>
>>108942747
How is Julien unemployed but constantly traveling and shitposting?
>>
>>108942747
Julien is a diddler.
>>
probably a sex tourist
>>
bet ani's got loads of Cambodia stamps in his passport
>>
>>108942771
Julien lives in the first world, and he probably comes from a wealthy family, so I would not rule out that he is supported by them in his pursuit of conquering his dream of being a reputable dev.
>>
>>108942813
>he probably comes from a wealthy family
textbook pedophile sex tourist
>>
>>108942747
having a million charges for child porn possession doesn't make you a millionaire, Julien
>>108942813
imagine what'd happen to him if any of his family members got wind of what he has made out of his life
>>
File: ComfyUI_00633_.png (1.34 MB, 896x1152)
1.34 MB PNG
>>
>>108942822
Okay, but now let us point the finger at you. Since you are neither Comfy nor Julien, why do you get so involved in their beef?
>>
>>108942835
Nta but julien is literal human garbage
>>
>>108942835
One of them (you) keeps trying to shit up our general
Doesn't get more complicated than that
>>
This Julien character is a paedophile?
>>
>>108942835
Probably someone involved with them, possibly with Comfy, probably Russell.
>>
>>108942851
yeah, see the rentry
>>
>>108942851
lolicon != pedo
>>
>>108942863
Sounds like a cope you fucking paedophile.
>>
>>108942863
Thats not what the government said.
>>
I upgraded my mobo and switched to Linux but kept the same graphics card. New CPU. But ComfyUI runs like 10x faster than it did before. Is that a Linux thing or a "new CPU" thing?
>>
>>108942863
I can't find "lolicon" in the English dictonary? 'Pedophile' and 'CSAM' are in there though
>>
>>108942860
this one?
https://rentry.org/ranfaggot
>>
>>108942859
Probably. SDXL-> Chroma and Z Image felt like way bigger upgrades than SDXL->Anima. Funny how the Anima spam always seems to show up whenever tdrusell is active on Hugging Face.
>>
>Julien is still posting the schizobabble wall
LMAAAAAAAOOOOOOOOOOOOOOOOOOOOOOOOOOO
>>
File: 'g.png (61 KB, 256x234)
61 KB PNG
why is every single general on 4chan exactly like this?
>>
>>108942898
any criticism is drowned out by the spam. Anima is an alright model for the size but it doesn't bring enough to the table
>>
>>108942903
4chan is dying, less users, they want (you) to engage in sort of any form
>>
>>108942903
The AI grift is dying so all thats left is LARP, and seethe
>>
File: ComfyUI_27481.jpg (3.29 MB, 1500x1920)
3.29 MB JPG
>>108942180
It's too embarrassing!

>>108942690
That's illegal!

>>108942903
Thread police!
>>
>>108942923
this, it's turning into a dead forum, the one thing we were escaping from ironically
>>
File: 1770904983262343.png (444 KB, 640x640)
444 KB PNG
^
>>
File: 5xsabk-2500814082.png (15 KB, 1027x731)
15 KB PNG
>>108942125
>52mp detailed jennies I can die happy now
>>
jenny anon is the only one saving /ldg/
>>
File: Wan2.2_i2v_00001_.mp4 (977 KB, 480x720)
977 KB
977 KB MP4
Is it possible to use PID with Forge Neo? I mean can I just download the thing and use it as a VAE or...?
>>
>>108942909
The thing is, zimage can do the same stuff as anima and more without being fully trained and shilled because it’s a better model. pic rel was made with whatever caption model and whatever zimage adel_ai illustration lora on civitai, nothing fancy. For my taste, it handles composition and the intended meaning of the image better, the only downside is that I had to inpaint the face.
>>
does anyone have any good loras for oversized animals? I want to make Korra's bear dog
>>
>>108943156
Anima can do 5 fingers
>>
>>108943156
>>108942830
who was the jannetty?
>>
File: 1755645984095776.png (1.13 MB, 1024x1024)
1.13 MB PNG
almost forgot how good z image turbo is:

can also use reactor swap node if you want to do celeb memes. or klein edit.
>>
File: 1774656450500656.png (1.02 MB, 1024x1024)
1.02 MB PNG
>>108943226
>>
>>108943239
orange you glad?
>>
File: ComfyUI_00634_.png (1.05 MB, 896x1152)
1.05 MB PNG
>>
>>108943285
>that pussy
>>
>>108943226
>z image turbo
It's so god damn good. Training porn loras works surprisingly well
>>
>>108943037
literally the only reason i come here
>>
>model that lets you imitate a sound with your voice, then uses that vocal imitation together with text as input to generate the sound you actually want.

https://github.com/thxxx/VTS
https://www.reddit.com/r/LocalLLaMA/comments/1trve9e/open_source_turning_vocal_imitations_into_sound/

Is there another project like this? Surely there must be, this would be even better with a bigger audio gen model.
>>
File: 185007CUI_00001_.png (1.91 MB, 1192x1536)
1.91 MB PNG
>>
>>108943355
>after a rough facefuck
h...hot
>>
File: zImageturbo_00002_.jpg (831 KB, 1496x1928)
831 KB JPG
>>
>>108943406
I prefer smoother pits
>>
File: 190630CUI_00001_.png (2.03 MB, 1192x1536)
2.03 MB PNG
>>
>>108943352
Sound effects are here!
this looks pretty cool
>>
File: 191246CUI_00002_.png (1.8 MB, 1192x1536)
1.8 MB PNG
>>
cozy
>>
>>108943406
Beautifull.
>>
File: zImageturbo_00032_.jpg (625 KB, 1624x1328)
625 KB JPG
>>
File: q_nefim4.png (1.1 MB, 1344x768)
1.1 MB PNG
>>
>>108943690
That man is getting to the bottom of things.
>>
File: zImageturbo_00038_.jpg (595 KB, 1624x1328)
595 KB JPG
>>108943704
damn right, made myself laugh
>>
>>108943690
Kek
>>
File: q_2p44cw.png (1.1 MB, 1344x768)
1.1 MB PNG
>>
>>108943724
>>
File: zImageturbo_00045_.jpg (519 KB, 1624x1328)
519 KB JPG
>>
Fresh

>>108943765
>>108943765
>>108943765
>>108943765

Fresh
>>
>>108942732
you can kind of do it with ltx if you inject it as the first frame and then inject the first real video frame afterwards. just include in the prompt that the video starts with a front back side perspective of the person and then cuts to whatever scene you are doing



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.