Beg More Fag Edition Discussion and Development of Local Image and Video ModelsPrevious: >>108711911https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/tdrussell/diffusion-pipe>Zhttps://huggingface.co/Tongyi-MAI/Z-Image>Animahttps://huggingface.co/circlestone-labs/Animahttps://tagexplorer.github.io/>Qwenhttps://huggingface.co/collections/Qwen/qwen-image>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>LTX-2https://huggingface.co/Lightricks/LTX-2>Wanhttps://github.com/Wan-Video/Wan2.2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>Illustrioushttps://rentry.org/comfyui_guide_1girl>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkCollage: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
Damn jeets stoled my job
>>108718153No. And yes Gemini 3 Flash uncritically can perfectly caption hardcore NSFW including bestiality.>>108717558It likes very very long prompts more than short ones, can do most styles. Most people sort of underuse Klein T2I IMO.
>>108718245woops *unironically
>>108718114He could have at least done 512x512 like the original Chroma instead of fucking 256x256, which had literally no chance whatsoever of producing worthwhile results from day one
webp really is incredibleit's insane that 4chan still doesn't support it
i discovered hard cuts for wan i feel like spielberg
>>108718304based kinosmith
>>108718295It has good compression efficiency but most CDNs serve webp images at absolutely fucking dogshit quality for some reason which makes me hate it.
>mfw Resource news04/29/2026>Z-Anime | Full Anime Fine-Tune on Z-Image Base https://huggingface.co/SeeSee21/Z-Anime>QuantVideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantizationhttps://github.com/svg-project/Quant-VideoGen>World-R1: Reinforcing 3D Constraints for Text-to-Video Generationhttps://github.com/microsoft/World-R1>Benchmarking Layout-Guided Diffusion Models through Unified Semantic-Spatial Evaluation in Closed and Open Settingshttps://github.com/lparolari/cobench>VibeToken: Scaling 1D Image Tokenizers and Autoregressive Models for Dynamic Resolution Generationshttps://github.com/SonyResearch/VibeToken>OmniVTG: A Large-Scale Dataset and Training Paradigm for Open-World Video Temporal Groundinghttps://github.com/oceanflowlab/OmniVTG>Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Modelshttps://github.com/LeapLabTHU/RvR>SketchVLM: Vision language models can annotate images to explain thoughts and guide usershttps://sketchvlm.github.io>Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generationhttps://tuna-ai.org/tuna-2>Prefill-Time Intervention for Mitigating Hallucination in Large Vision-Language Modelshttps://github.com/huaiyi66/PTI04/28/2026>Illustrious XL & NoobAI-XL Style Explorer https://github.com/ThetaCursed/Illustrious-NoobAI-Style-Explorer>LTX Desktop 1.0.5https://github.com/Lightricks/LTX-Desktop/releases/tag/v1.0.5>Meta-CoT: Enhancing Granularity and Generalization in Image Editinghttps://shiyi-zh0408.github.io/projectpages/Meta-CoT04/27/2026>PixlStash 1.1.0 Updatehttps://pixlstash.dev/whatsnew.html>AURA AI Studio Vault: One-stop management app for models, images and morehttps://github.com/TheGho7t/AURA-AI-Studio-Vault>UniGeo: Unifying Geometric Guidance for Camera-Controllable Image Editing via Video Models https://mo230761.github.io/UniGeo.github.io
>mfw Research news04/29/2026>Golden RPG: Confidence-Adaptive Region-Aware Noise for Compositional Text-to-Image Generationhttps://arxiv.org/abs/2604.25314>A Systematic Post-Train Framework for Video Generationhttps://arxiv.org/abs/2604.25427>ResetEdit: Precise Text-guided Editing of Generated Image via Resettable Starting Latenthttps://arxiv.org/abs/2604.25128>ViPO: Visual Preference Optimization at Scalehttps://liming-ai.github.io/ViPO>GramSR: Visual Feature Conditioning for Diffusion-Based Super-Resolutionhttps://github.com/aimagelab/GramSR>Exploring Time Conditioning in Diffusion Generative Models from Disjoint Noisy Data Manifoldshttps://arxiv.org/abs/2604.25289>The Thinking Pixel: Recursive Sparse Reasoning in Multimodal Diffusion Latentshttps://arxiv.org/abs/2604.25299>DDA-Thinker: Decoupled Dual-Atomic Reinforcement Learning for Reasoning-Driven Image Editinghttps://arxiv.org/abs/2604.25477>Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generationhttps://mutualforcing.github.io>Learning Illumination Control in Diffusion Modelshttps://nishitanand.github.io/relighting-diffusion-website>Learning from Noisy Preferences: A Semi-Supervised Learning Approach to Direct Preference Optimizationhttps://arxiv.org/abs/2604.24952>Improving Diversity in Black-box Few-shot Knowledge Distillationhttps://arxiv.org/abs/2604.25795>QFlash: Bridging Quantization and Memory Efficiency in Vision Transformer Attentionhttps://arxiv.org/abs/2604.25306>When the Forger Is the Judge: GPT-Image-2 Cannot Recognize Its Own Faked Documentshttps://arxiv.org/abs/2604.25213>The Forensic Cost of Watermark Removalhttps://arxiv.org/abs/2604.25491>GPT-Image-2 in the Wild: A Twitter Dataset of Self-Reported AI-Generated Images from the First Week of Deploymenthttps://arxiv.org/abs/2604.25370>Can We Change the Stroke Size for Easier Diffusion?https://arxiv.org/abs/2603.26783
So this thing came out:https://civitai.red/models/2585622/ultrareal-fine-tune-anima?modelVersionId=2904690Havin' a hard time genning good images with it, though.
>>108719007>Anima1I assume he means preview1. Odd decision.
>>108719088Pretty sure he fucked up. Worked with my anima preview 3 loras just fine.
>>108719007we are regressing, this is pure slop
i been generating kinos for 18 hours straight
>>108719209post some
>>108719088Yeah it is odd. I asked in the discussions and he deleted my message pretty quickly so I assume it is Anima Preview 1 if he's taken offense to me asking a simple question.
>>108719007I think the lenovo lora works better. >>108719273Fucking kek, what a retard.
>>108718410>>108718420Thanks
> >108718410> >108718420Fuck off
i'll get banned
No discussions just slops sirs,
ltx is pretty good at fixing limbs. i make a 1 second clip with the broken reference image, prompt what the intended pose was, and then take the last frame
>>108719684Show a in/out ?
>>108719684why are you telling everyone my secret? can you stfu please?
https://github.com/zty0304/Anime-layersBabe, babe wake up! New model just released that can take apart any anime illustration like a Photoshop PSD file, line art, flat colors, shadows, all separate layers!
>>108719917based now i can troll artists with it
I NEED GPUI NED GPU I NEED GPU INEED GPU
>>108719917im running it right now
>>108720045catbox?
>>108720045helo beauty baby i am give kissing on youre bobs
>>108719007this is just regular Anima Preview 3 with the Turbo lora, same prompt as my Klein 9B one a bit earlier lol
>>108720135It's Preview 1. Apparently it was just civitai fucking up but he did reply to my message.
>>108720135yeah i was just pointing out that Anima normally with Turbo lora already looks like my picrel if you use DPM++ 2S Ancestral Simple
>>108719684Thank you for letting us know.
>>108719917wow this is cool. i've always thought this would be the ideal strategy for an anime model. is it open weights?
>>108719917 Do the "layers" have transparency? If not this is worthless.
>>108719917Nice
Need MayLi making out with Lois Griffin in Stonetoss style
Sarah Peterson desu...
>>108720436This is some next level quality, did you achieve that with Anima alone?
i tuned ltx2.3 on music videos, time to start a 30 hour 4k generation and see what happens
Ive made a little dream of mine come true. - a tablet mounted on the sofa that serves solely as a display- a room microphone- on my first 3090, a own custom finetune of flux2klein 9b SNOFS, Cohere STT with FireRedVAD- on my second 3090: Qwen3.6 35b Heretic, fine-tuned Qwen3-TTS- a custom-developed pipelinenow I can lie on the couch, scratch my balls while letting the system generate images. this really makes the hobby so much more fun.
>>108720698Post pic of setup
>>108720721butcept for the scatching.
>>108720698I'm too uptight, I must be at the computer. I don't even like using my phone.
>>108720574Anima and just a basic hires fix.https://files.catbox.moe/jfpkoa.png
Is it a skill issue that I can't make Flux generations better than Pony ones? Given, I really only use them to make portraits for chatbot character cards. I just thought with models that are twice as big as pony models, they'd look a little better at least.
So why isn't stability matrix shilled in the OP? Is it some kind of Chinese malware? I tried it the other day and it was actually pretty damn nice
>>108720789It's fine I just use comfy directly so I don't bother. Stability Matrix is good for noobs.
>>108720755Flux.1 Dev? That needs a lot of setup and a more "pruned" style of NL prompting to get the best out of it. Compared to newer models, it is fairly limited.
>>108720698>a room microphoneso you yell out "computer, generate bobs and vagan"?
>browsing loras on civitai>half of them are tranny pornis this really the ultimate desire for men when they have no restrictions?
>>108720838The eyes remind me of pic related.
>>108720879>>half of them are tranny pornThe internet is a mirror
south park style for anima when
>>108720879Must be your recommendations.
>>108720504forced unfunny meme
>>108720879It's like three niggas making 90% of them
>>108720890Poorfag can't train?
>>108720879>is this really the ultimate desire for men when they have no restrictions?>no restrictions8^)There are... *no*... restrictions?
>>108720894>recommendationsno i'm typing my model name into the search bar
>obtained images>no longer wish to clean, caption, and trainits not even coom pics :(
>>108720829jebby a CUTE!
>>108721055have your ai do it
>>108721105the best bakers go through manually to ensure each caption is perfect and each image is pristine after the computer does its job
>>108721116>ai cant compare to the heart and soul of a humanyou sound like a drawfag
>>108721131ok
>>108721055>cleanNot really needed unless you have trash full of watermarks. Maybe bucketing but it's not a dealbreaker>caption>what are captioner apps>trainGo to sleep and wake up to lora retard>>108721131>believing ai and leaving his captions without proofreadingretard
>>108721143>captioner appsplease, i have my own script
>>108721144>can do arm warmers>can't do realistic amputees
>:^(I'm installing ace step cpp.lish me ruck
hol up, dcw isn't in the cpp app? lol
>>108721293found it.
fyi ace-step.cpp with Vulkan (with my AMD rdna2 card) is way faster than comfy, for some reason, idk why.
>>108721305What backend does Comfy use with Rdna2?
>>108721305It's also way faster on Nvidia, and it's magnitudes faster than the official Gradio or any other pytorch implementation as well.
Is joycaption still considered the best captioning model?
>>108721393rocm. mine is gfx1030, and it's really hard to look this up, and I have long since forgotten which part of amd's driver stuff it is, but one of them was dropped, but cdna2 is still built (can't be used with mine).
>>108721435no its either qwen or gemma
Anima still can't do toes but the arm and leg spaghetti here is kind of blowing my mind. Pretty good.
>>108721435It's a retarded meme.Just grab abliterated Qwen 3.6 MoE or Gemma 4 MoE and write a decent system prompt.Can go with dense variants too if you have 24gb vram.
https://files.catbox.moe/d3jwcw.mp3It turned out alright imo.>>108721416What settings do you use?in comfy I was using exp_heun_2_x0_sde, and tan 2 for the scheduler. and often 500 steps, actually.
>>108720168not bad, but not as squishy as a real kid's drawing.real kids use dead reckoning to draw. They start on a part. Then then hook it together with another one. And another. then uh oops, not quite coming together, various compromises are implemented. ai could do this, but nobody wants to, they want "pro" art, which begins by squinting, and working in layers (very conceptually similar to frequency, actually).
>>108721507>>108721475Nice, set up a script to work with ollama/gemma. Thanks
cozy breas
>>108721779What do those numbers mean
>>108719917>demo coming soon>files updated one month ago
>>108721829https://danbooru.donmai.us/wiki_pages/twitter_cutting_game
local will never reach this level of capability
>>108721874thank god
>>108721874Ok I am impressed that this one didn't get filtered at least.
>>108721874informative. up-it-two-thumbs
>>108721874see if it can summarize https://www.bbc.com/news/world-us-canada-12994248if it does, be sure to notify any and all news outlets.
>>108721813A small treat (wild berries) may help.
>>108719007can somebody extract a lora of this from preview1? so i can use this shit on preview3?
what's the best way to train a lora for ace step?
Hello saars I code the ollama image tagger app. Yes good?
>>108722117I made this. I also made video version which doesn't have auto-captioning yet but you can crop, cut videos and caption manually with it.
>>108722148Nice, mines just a single python script
>>108722117>>108722148nice
>>108722154Mine started from couple of batch files then bloated into whole app.
>>108721874lmao
ANIMA PREVIEW 4 WHEN???
I need an izzat llm that monitors everything I do and tells me my izzat.
>>108722191-500 izzat
I still don't understand the diff between /sdg/ and /ldg/.
>>108722218Containment general for a discordfag who used to cause drama there.It kinda works but this general has developed its own dramas after the split.A few other anons schizopost there too sometimes.
>>108722221brown
original prompt do not steal
>>108722221Catbox or LoRA... Please explain anon
>>108722985piss filter and melted fingers, it's chatgpt.
>>108722767uwhoah is this real
>https://github.com/muooon/EmoSensAnyone tried this shit?
I still got a few kinos in me after I thought I'd run out
>>108722985More than one named character doing something besides standing, it's API
>>108723124
>>108723124Is janny brown pokemon fan ximself or something?Lol @ removing that but leaving barely censored pedo porn.
why did my post get removed? especially considering it was objectively true>>108723389I wish you the best of luck with your suicide, faggot
>>108723606>I wish you the best of luck with your suicide, faggot
>>108723606Those fucking Japanese ruining the internet!
ernie base has some good potential. just needs a fine-tuning of photorealistic non synthetic images.
>>108723606What's up with Peru of all places?
>>108723651Cool, liked your earlier gens
>>108723608lol no wonder you're a friendless retard that looks for validation in fucking 4chan>>108723629Nips look at you and feel revulsion
>>108723651I wrote it off for now but I am interested in taking a crack at it if I see evidence that it responds well to training.One anon here posted a meme lora with very poor facial likeness, which was not very encouraging to say at least.
>>108723665>lol no wonder you're a friendless retard that looks for validation in fucking 4chan
>>108723665>nips>feeluh okay good one
>>108722221Anima 3... For sure
>>108721527>500 stepsWhich model? 500 steps is too much for Turbo, anyways on Turbo XL not too many fancy settings are needed. Just Scragnog custom VAE, DCW Double 0.05 for both, 8 steps. I master my gens with Matchering 2 to improve the sound quality even further. The XL SFT model isn't as creative and doesn't use as many instruments as Turbo XL which means it's also significantly worse at prompt following so I don't use it or its merges anymore.
>wanschizo's a petrol-sniffing abboI mean, that explains him having the brain damage required to be obsessed with a TV show for 5-year-olds
>>108723712>I mean, that explains him having the brain damage required to be obsessed with a TV show for 5-year-olds
>>108723723nice selfie of you when you see an unattended jerrycan
>Only one person uses this very popular reaction image/meme/jakI wish these wannabe detective schizos could understand how ludicrous they look.
>>108723747>nice selfie of you when you see an unattended jerrycanWhy haven't you taken your meds yet little man?
>>108723757>>108723764>subhuman retard doesn't understand how 4chan worksyeah bro the fact that all those pics have the exact "randomized" filename is nothing but a coincidencewhat a fucking mongrel
>>108723785But seriously, why haven't you taken your meds?
FYI, this is the kind of "contribution" the lobotomite abbo makes to this sitehttps://desuarchive.org/a/thread/280916078
why are we fighting again
>>108723983>weIt's just an archive schizo having a meltdown.
>1boy, male focus>get a man with a vaginaebin
>>108724000shoulda put cuntboy in the negatives
>>108724000which model? lmao
>>108724000chroma has a habit of giving women thick penises even when you don't ask for it.
>>108723048What tf is that readme
when can local do this?https://files.catbox.moe/mr1or8.mp4
>>108724075What having millions of furry futa porn images in the dataset does to a motherfucker's model.Serious talk though, it's probably the captioning LLM being unable to tell it when the "woman" has a penis and describing how her vagina gets penetrated.
>>108724094lel I recognized him before he turned around.
>>108724102Yeah, I usually have to describe the woman's vagina or mention the clitoris or it usually gives everybody a penis if you mention it once.
>>108724094it can already do absolutely boring SFW videosbut yes this is longer and with better audio than averageBUT is this using lower end hardware? If you had a high end nvdia gpu farm server thing you could already have done more with hunyuanvideo like a year ago
>>108723989I mean, yeah, there is only one person involved in that discussion, since the other party is a subhuman monkey
>>108724094>he is not flashing his dick overflowing with cum>the girls are not showing boucing titiesexcuse-me but whats the usecase for local being able to make boring videos? indian scamming?
is insectfucker anon in?
>>108724146Anon, take your meds. You're clearly mentally unwell :(
>>108724241another certified wanschizo classicall that petrol has left your brain so shriveled you can't even come up with new retorts
>>108724276Take your meds.
let's ALL take our meds
>>108724276You definitely need your meds bro, like urgently.
>>108724276Nta but if everyone else is "a schizo" for you you might want to ask yourself if you aren't the schizo in the roomFaggot
ok i think i have done everything AI can do and i am now bored.
>>108724435Everything? Show me your BEST cute fart gen anon
>>108724435come back when insectfucker is here, he'll show you some shit
>fartschizo is also backof course
Everytime I do batch captioning, the later captions get lower quality than the earlier - missing punctuation, general wall-of-textiness, no capitalisation. Do I have to set it up to flush context after every image or something like that?