Discussion and Development of Local Image, Video, and Music ModelsPrevious: >>109099286https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUISDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineageWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, & Upscalershttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.info>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/tdrussell/diffusion-pipehttps://github.com/kohya-ss/sd-scriptshttps://github.com/kohya-ss/musubi-tuner>Zhttps://huggingface.co/Tongyi-MAI/Z-Image>Animahttps://huggingface.co/circlestone-labs/Animahttps://tagexplorer.github.io/https://animadex.net>Qwenhttps://huggingface.co/collections/Qwen/qwen-image>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>Wanhttps://github.com/Wan-Video/Wan2.2>LTX-2.3https://huggingface.co/collections/Lightricks/ltx-23>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkCollage: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
>>109101107>>Maintain Thread Qualityhttps://rentry.org/LDG_vital_info
Gaming time
>inb4 n*gbo
What is the best oriental female lora for ZiT?
>mfw Resource news06/20/2026>One Node · FLUX.2 [klein]https://github.com/yanokusnir-ai/one-node-flux-2-klein06/19/2026>FreeStyle: Free Control of Style-Content Dual-Reference Generation from Community LoRA Mininghttps://github.com/Blue2Giant/FreeStyle>JanusMesh: Fast and Zero-Shot 3D Visual Illusion Generation via Cross-Space Denoisinghttps://siang1105.github.io/JanusMesh.github.io>Linear Recurrent Unit with Semantic Modulation for Image Super-Resolutionhttps://github.com/MingyuChoi-run/LSM>LEAP: Layer-skipping Efficiency via Adaptive Progression for Vision Transformer Distillationhttps://github.com/KevinZ0217/LEAP>StylisticBias: A Few Human Visual Cues Drive Most Social Biases in MLLMshttps://hf.co/datasets/shaghayegh/stylistic-bias-dataset>musubi-tuner adds support for ideogram 4 lora traininghttps://github.com/kohya-ss/musubi-tuner/blob/dev/docs/ideogram4.md>KupkaProd Music Video Pipelinehttps://github.com/Matticusnicholas/KupkaProd-Music-Video-Pipeline>Midjourney goes from generating cat images to full-body ultrasound scanshttps://www.theverge.com/ai-artificial-intelligence/952011/midjourney-medical-ai-ultrasound-scan>TeleStyle V2: Beyond Content-Preserving Style Transfer with Self-Distillation and Distribution-Matching-Distillationhttps://github.com/Tele-AI/TeleStyleV206/18/2026>UniTemp: Unlocking Video Generation in Any Temporal Order via Bidirectional Distillationhttps://lzhangbj.github.io/projects/unitemp>Reasoning as Intersection: Consensus-Frame Alignment for Visual Focus in Video-MLLMshttps://github.com/1Pansy/VideoCFR>Moebius: 0.2B Lightweight Image Inpainting Framework with 10B-Level Performancehttps://hustvl.github.io/Moebius>From Bounding Boxes to Visual Reasoning: An On-Policy Data Annotation Tool for Vision-Language Modelshttps://github.com/WnQinm/Annotator>Boogu-Image-0.1-Edit GGUF https://huggingface.co/realrebelai/Boogu-Image-Edit_GGUFs
>>109101121Interesting image. Gummyworms
>>109101107Desu collages are better when they are like 10 images max.
>>109101121Help! I was reincarnated as a slime!
>mfw Research news06/20/2026>Is AI ruining our skills? Early results are in — and they’re not goodhttps://www.nature.com/articles/d41586-026-01947-1>Revealing Artifacts via Noise Amplification: A Novel Perspective for AI-Generated Video Detectionhttps://arxiv.org/abs/2606.16742>TriFlow: Generating Artist-Like 3D Mesh Topology via Nearest-Vertex Vector Fieldshttps://arxiv.org/abs/2606.20131>Addressing Detail Bottlenecks in Latent Diffusion for RGB-to-SWIR Image Translationhttps://arxiv.org/abs/2606.19961>Timestep Rescheduling in Diffusion Inversionhttps://arxiv.org/abs/2606.15389>Human-in-the-Loop Atlas-Based 3D Asset Segmentation for Interactive Content Workflowshttps://arxiv.org/abs/2606.17824>SpatialAvatar-0: High-Quality 4D Head Avatar with Multi-Stage Reconstructionhttps://spatialwalk.github.io/SpatialAvatar-0>ProductConsistency: Improving Product Identity Preservation in Instruction-Based Image Editing via SFT and RLhttps://arxiv.org/abs/2606.19103>HiRo: A Compact Four-Directional Hierarchical Reservoir Token-Mixer for Efficient Image Classificationhttps://arxiv.org/abs/2606.15151
>>109101137>>109101147ALERT! Bot spamming possibly infected links! Take care, anon!
>>109101137>>109101147Fuck off debo
>>109101113>please don't look at the things I've done over the years>I'm so lonelylol suffer
Big Asp 3 apparently finished main training.Only RL left.The outputs are a bit rough honestly.But given this guy's track record I am going to trust the plan and give him the benefit of the doubt.
>>109101147>>109101137Can you please stop spamming this every thread?You don't even add new content you spam the same shit every thread
>>109101137>>109101147thanks!
>>109101170Can you please stop spamming this every thread?You don't even add new schizo content you spam the same schizo shit every thread
>>109101137>>109101147
>It's a homebrew agentic program that uses URL/search/image context and kling/heun/google/grok apis and stitches the result. You can do the same in a google workspace or with claude's MCP functionality.>I should clarify; Heun is an implicit sampler method (unlike euler which is explicit) meaning it be be used to generate partial image results without knowing the subjects are and then be merged with other methods. So it's very useful for generating virtual 3d spaces and then populating them with objects/characters.>Most flux models use it. I just have the agent scrub git repos and huggingface for public flux apis with that methodan anon in another board said this the other day, what the hell is he talking about
>>109101183Looks a lot like Wardour Street in London.
>>109101189Why are you posting this in every thread?
>>109101196so you can answer
>>109101202No this is another fucking ritual post if nobody is responding actually wait to post it, why are you being annoying for the sake of being annoying?
>>109101343Women should look like this
>>109101107>two of my gens made it into the general collage :D Also I forgot how good Anima's prompt adherence truly is, even with 2 loras attached:>Wappah \(Artist\), @Wappah01, correct proportions, correct anatomy, 1girl, futanari, solo, light-brown skin, curly hair, pink gradient background, standing, dark-green turtleneck, bottomless, cum drip, viewed from below, half-erect penis, speech bubble saying "ugh why do i feel so hot?" and "and why are you still here?" https://civitai.red/models/447945/wappah-style-and-characters-anima-or-ill-or-pony
>>109101170imagine this guys family dieimagine his sorrowmh..... i get rock hard just imagining it
https://www.reddit.com/r/StableDiffusion/comments/1ub4jpk/ltx_director_20_update_a_free_open_source/neat node for ltx 2.3
>>109101383big if true.
>>109101383that looks interesting but every time i try video gen i leave disappointed
>>109101362>AI_Art_FactoryCan you please buy an ad?
>>109101362>Also I forgot how good Anima's prompt adherence truly is, even with 2 loras attached:You're friends with Rusell or something? You always show up to do damage control. Just so you know, prompt control and LoRAs depend more on the people making them than on the model itself. And statistically, Anima tends to lose prompt adherence once you stack more than one LoRA. Stop acting like that's some special strength of the model.
>>109101427rent free
>>109101387Little kid, debo gen alright
>>109101382
>>109101454that apartment smells like Norwegian fish farm
>>109101427>You always show up to do damage control.for what/who? Please learn to be happy and just post gens.No one cares about this dumb drama (you) keep fanning the flames of. see >>109101113
Has Anima been finetuned properly yet?Did they implement the GRPO lora training thing?
>>109101536finetuned for what?
I have been sitting with this feeling for a while and I think it's worth saying out loud: I don't feel fulfilled by Anima the way I used to with Illustrious, I genuinely feel alienated from my own genning tools
>>109101454>>109101435>>109101343Even the earlier Chroma 1 epochs looked better than that
>>109101544Non-anime art
>>109101545Babbys first Marx theorist arrived
>>109101536>Has Anima been finetuned properly yet?turns out the catastrophic forgetting is real and every attempt turned to dogshit
>>109101536>GRPO lora training thing?what?>>109101561there's a realism tune on civitai iirc
>>109101545Anima it's all cold, the gens don't feel like mine anymore. Nothing does, my prompt gets bloated by an LLM, I lose any control over composition, and I just wait for an output. It doesn't feel like creating anymore I'm basically an API endpoint for myself, I submit a request and wait for a response.I hate Anima, I hate Anima so much there is something deeper about what got lost in this shift.
>>109101560proof?
Ideogram4 can't even generate a loaf of bread.
Before Anima I would spend hours on a single gen, and when it was done, I actually felt like I put a piece of myself into that AI generated image. Anima is sterile, type a prompt, some LLM bloats it into soup, the machine spits out a result, repeat. Am I supposed to feel happy about this? No, not at all, I hate this, I hate it with all my might.
>>109101623Kys, it's now or never
>>109101561>>109101506>>109101454>>109101435>>109101382>>109101343all animahttps://civitai.red/models/2409949/sam-anima-realistic?modelVersionId=3017757
>>109101586https://github.com/yifan123/flow_grpo>there's a realism tune on civitai iircI meant Western art.
>>109101454Catbox anon
>>109101545>>109101595are you using underscores? don't.are you using turbo lora? don't, you'll have less control. but it's good for quick gens.are you using enough steps? you'll get gibberish otherwise.don't use latent upscale, use higher base res with higher steps.use the right sampler & scheduler.
>>109101638https://files.catbox.moe/nugd6r.pngsorry for the mess.
>>109101629why link 2.0 instead of 2.1
Reflect please, instead of counting what we gained with this new model, think about what we stopped doing, or what was quietly taken from us under the guise of progress.
>>109101644He's just trolling it's safe to ignore.
>>109101651Because you will notice 2.1 is only turbo.
Oh shit! A skeever
>>109101663
>Was it really so important that the machine understood prose?You lost CLIP.>Was it really so important that the machine understood relationships between objects?You lost regional prompter.>Was it really so important that the machine got it right on the first pass?You lost hiresfix.>Was it really so important that the machine gave you five fingers?You lost inpainting.>Was it really so important that the machine stopped melting eyes?You lost adetailer.You traded your tools for cosmetic fixes you could have just done yourself and you thanked tdrusell for it.
>>109101699You didn't lose any of these things you fucking incompetent.
>>109101699imagine watching your family die a gruesome violent graphic deathi get rock hard just thinking of how it would traumatize you
i trained a lora with anima trainflow and it works well but i think it would need some more time in the oven. this is the first time making a lora so i'm not super experienced, is there any way to take an existing checkpoint and use that as a base to continue training with more steps? I have the dataset and everything I used to train it with
>>109101818>anima trainflow>is there any way to take an existing checkpoint and use that as a base to continue training with more stepsyes. stop using this dogshit.https://github.com/67372a/LoRA_Easy_Training_Scripts
>>109101728img2img kinda sucks with anima tbdesu
>>109101818>>109101857resume in sd-scripts will attempt to load optimizer states. If you actually finished the previous run that's useless. You want --network_weights option if you want to do more training than previously planned.No idea how that GUI wrapper handles that.
>>109101908>If you actually finished the previous run that's uselesswhy? it loads optimizer states, not lr scheduler states
from docs/train_network_advanced.md:>* `--network_weights=\"<weight file>\"`: Starts training by loading pre-trained LoRA weights. Used for fine-tuning or resuming training. The difference from `--resume` is that this option only loads LoRA module weights, while `--resume` also restores Optimizer state, step count, etc.
>>109101946--network_weights is for when you don't have optimizer states from previous run, and resuming training without optimizer states is worse, it doesn't mean you should only use --network_weights when you want to do more training
Are non-explicit furry gens accepted around here? I don't like the /trash/ thread very much desu
>>109101964no
>>109101974Shame.
The creator of anima should've created a model with more parameters.
>>109101964Officially? No. Unofficially... depends....
>>109098840> ACEStep XL with a LoRA goes from being Suno tier to being better than Udio.Lol, you again.>Junmin Gong praised the release of SA3 on his XYou even know their chink names.
with fire
>>109102024>Junmin Gongnta, but he is very important, because he released Ace Step 1.5 XL SFT, which I use every day.
>>109101964Of course you are welcome to post anything.
>>109101168oh that's that one hot chick that did the snow white movie last year right?
https://files.catbox.moe/ueazag.mp3I'm changing style, but I like this output, so sharing. ace step 1.5 xl sft.
Just trained an SA3 DoRA using default SAI recommended settings.Damn... It's just like the target music, holy shit. I honestly can't believe my ears, a few secs on my GPU yields some of the greatest music in the world. Wtf. It learned the style and composition perfectly with short captions. I had to check my dataset like 200 times to make sure it wasn't overbaked or anything like that. I'm mindblown by this thing.This is the first model that is "truly dangerous" to the music industry in the sense that it puts real musicians and artists at risk. This thing is too good, I'm surprised SAI released it. AI art is just like a toy compared to this. A single HQ song is much more valuable than an obviously slopped image, only analogy is like if we got something capable of making lossless HQ videos indistinguishable from the real deal from start to finish. You no longer need to learn music theory, sound design, and software navigation as a beginner. And you certainly no longer need 3 to 6 years of consistent practice to produce radio-ready EDM tracks as resources online say. The entire process from ideation, arrangement, composition, mixing, mastering can be thrown out the window. We're in interesting times.>>109102024I'm the one who shared samples from previous threads, which btw I found on AceStep discord which is the only place discussing it. Guess how I found out about SA3? Through Junmin Gong. We both have him to thank.
>>109102024anyway, has anyone released a teen pop lora?
>>109102108Also, I may have been wrong about needing to be extra precise. This DoRA I trained was with lazy captions, doesn't get better than that.
Your idea about spamming these threads non-stop is delusional.
>>109102108does sa3 use a 5hz lm that can be turned off? I don't like "computer grid" music.
I'm an unc.
>>109102108I don't think AI changes anything. If you are not a musician you don't understand what I mean and therefore...
>>109102118>5hz lmThe model doesn't need an LM like ACEStep, though there's an optional prompt enhancer model (which I haven't and had no need to touch). ComfyUI I think bakes this prompt enhancer in there if that's what you mean.
butt
>>109102140It's too square. ofc computer music is always that way, it aligns to the digital grid.
>>109102138t. failed musicianGet with the times grandpahttps://github.com/gantasmo/theDAWI no longer need to spend thousands of dollars on plugins.
>>109102154You sure are insecure. I don't think a hobbyist needs to spend thousands of dollars on plugins.
how do I learn how to make ComfyUI workflows?I'm shoot brainlet
>>109102163I have done this too, but if the wf is decent your real task is git gud at prompting.