Catbox Host Seething Edition
Discussion and Development of Local Image, Video, and Music Models

Previous: >>109015348

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
gm saars
>inb4 n*gbo
>>109020001
https://rentry.org/LDG_vital_info
I made a fun challenge for you /ldg/ idiogram is amazing
where is ideogram anima???
i would be using ideogram right now if it didnt have a piss filter...
>>109020027
ideogram looks awful to me.
maintain thread quality
https://rentry.org/LDG_vital_info
https://rentry.org/LDG_vital_info
https://rentry.org/LDG_vital_info
>>109020033
based openai poisoning their own models with a piss filter in order to sabotage all of local
>>109020027
But you say this exact same thing about Ernie, Kontext, Klein, Z Image, Qwen, WAN, and every single one of the 40 Chroma versions and Anima these last 6 months. Hard to take you seriously at this point anon :(
>>109020033
I would try if I had enough vram
Can you recognize which character this is supposed to be?
>>109020033
No model at all does what I want. I want such an extensive amount of face descriptors that you can create at least a very near likeness, and with enough spamming of the gen button, a likeness.
That's not the case right now. No model has such a capacity.
The funniest thing right now is how models can't even produce a complete range of chins. Noses are also just impossible, you can't get an appropriately large nose, not too large, but not model small either. You can't control these things.
>9.3B
yeah *burp* she could lose some weight mhm
found him right in the middle
I met my wife on ldg
>>109020049
you do have enough ram, anon showed us in previous >>109019809
Almost did my first Ideogram gen until I realized I'd have to update Comfy.
>>109020071
Idiogram on the first couple of gens is the most disappointing thing I can imagine until you learn how it works
>>109020063
He either re-named his safetensors, changed them after he started or actually magically found a way to fit 17GB in 16GB.
It's funny considering I can easily run WAN2.2 and LTX here but not ideogram4
how do (You) solve the severe kino drought problem?
Didn't know my wife was inside my GPU
>>109020001
Other than the Touhou one every single one of these images is an indefensible inclusion. Yes I was snubbed of course. None of these had any right to take a place that could have belonged to my gens (very high quality).
>>109020077
>the first couple of gens is the most disappointing thing I can imagine until you learn how it works
this describes literally every single model
>>109020077
Where's the good gen? I'm waiting.
>mfw Resource news

06/09/2026

>SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning
https://teal024.github.io/SCAIL-2

>BLM-SGAN: Bidirectional Language Modeling for Semantic-Spatial Text-to-Image Generation
https://github.com/haidy-maher/BLM-SGAN-Text-to-Image-Generation

>SwiftVR: Real-Time One-Step Generative Video Restoration
https://h-oliday.github.io/SwiftVR

>Property-Informed Diffusion-Based Text-to-Microstructure Generation
https://github.com/hongsong-wang/PropDiff-TMG

>OmniTryOn: Video Try-On Anything at Once!
https://github.com/xcltql666/OminTryOn

>IEA: Amateur-Friendly Conversational Image Editing Agent via Three Stages of Multitask Alignment
https://github.com/OpenDFM/Image_Edit_Agent

>CapRL++: Unified Reinforcement Learning with Verifiable Rewards for Dense Image and Video Captioning
https://github.com/InternLM/CapRL

>CHROMA: Detecting AI-Generated Images through Inter-Channel Color-Space Correlations
https://github.com/JPSoteloSilva/CHROMA

>VideoWeaver: Evaluating and Evolving Skills for Agentic Long Video Generation
https://github.com/JianhuiWei7/VideoWeaver

>Built to benefit everyone: our plan
https://openai.com/index/built-to-benefit-everyone-our-plan

>China Preps $295 Billion Plan to Fund Nationwide AI Buildout
https://www.bloomberg.com/news/articles/2026-06-09/china-prepares-295-billion-plan-to-fund-nationwide-ai-buildout

>Z-Image-Engineer V6 (4B)
https://huggingface.co/BennyDaBall/Z-Image-Engineer-V6

06/08/2026

>Unified Safe In-context Image Generation in Multimodal Diffusion Transformers via Restricting Unsafe Information Flows
https://github.com/deng12yx/UVR

>GuideCAD: A Lightweight Multimodal Framework for 3D CAD Model Generation via Prefix Embedding
https://github.com/mskimS2/GuideCAD

>Consistency-Preserving Diverse Video Generation
https://github.com/XinshuangL/Diverse-Video

>Ideogrammar — Ideogram 4 Prompt Editor
https://github.com/rlemson7/ideogrammar
>>109020090the OP is a mentally ill tranny dude
>>109020062
No, she's my Chinese researcher wife, not yours
>mfw Research news

06/09/2026

>Ultra Flash: Scaling Real-Time Streaming Video Generation to High Resolutions
https://arxiv.org/abs/2606.09150

>Seeing is Believing: Aligning Prompt Rewriting with Visual Anchors for Text-to-Image Generation
https://arxiv.org/abs/2606.08492

>MilliVid: Hierarchical Latents for Long-Range Consistency in Video Generation
https://davidcharatan.com/millivid

>OmniGen-AR: AutoRegressive Any-to-Image Generation
https://arxiv.org/abs/2606.09156

>TIDE: Task-Isolated Diffusion for Unified Video Editing and Generation
https://LittleWork123.github.io/tide

>LiteVSR: Lightweight Adaptation of Frozen Diffusion Transformers for Video Super-Resolution
https://arxiv.org/abs/2606.09250

>CineDance: Towards Next-Generation Multi-Shot Long-Form Cinematic Audio-Video Generation
https://aliothchen.github.io/projects/CineDance

>CoVEBench: Can Video Editing Models Handle Complex Instructions?
https://arxiv.org/abs/2606.08415

>ZIPP: Zero-shot Image Personalization from Personas
https://arxiv.org/abs/2606.08841

>TUDSR: Twice Upsampling-Diffusion for Higher Super-Resolution
https://arxiv.org/abs/2606.09608

>Beyond Raw Signals: Undecoded Generative Latents as Privileged Synthetic Data
https://arxiv.org/abs/2606.08336

>Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions
https://arxiv.org/abs/2606.09076

>HACK++: Towards More Effective Head-Aware Key-Value Compression for Efficient Visual Autoregressive Modeling
https://arxiv.org/abs/2606.08302

>Understanding Quantization-Aware Training: Gradients at Quantized Weights Bias to the Low-Loss Basin
https://arxiv.org/abs/2606.09012

>Diffusion Image Generation with Explicit Modeling of Data Manifold Geometry
https://arxiv.org/abs/2606.00094

>Real-Time AttentionBender: Granular Interactive Network Bending of Video Diffusion Transformers
https://arxiv.org/abs/2606.06497

>Optimizing Few-Step Generation with Adaptive Matching Distillation
https://arxiv.org/abs/2602.07345
>>109020092
Ideogram felt way worse to me what with its retarded prompting schema and the filter that actually isn’t really a filter if you prompt it right.
>>109020092
wrong, FLUX 9B Distill is perfect right away
>>109020104
fact
STOPPPPP trying to convince me to use ideogram i dont wanna right now
Is nucomfy compromised? Do I do the needful and upgrade to try the new FOTM?
(repost)
>>109020005
>>109020000
How do you know if a 1girl is a 1girl tho?
>>109020128
my model would never lie to me
>>109020140
As long as she identifies as a 1girl it's definitely not gay.
>>109020080
Does this look like a qwen workflow? You could just ask for help instead of accusing everyone of lying.
Workflow here, obviously remove the lora loaders: https://files.catbox.moe/fvmcf3.png
If that doesn't work try changing the system memory fallback setting in the NVIDIA Control Panel and add --reserve-vram 2 to your .bat
no lie, I've never really had any women flirt with me, so one day a tranny did, and honestly he was passing, but not, because like what are the odds lmao. A "tomgirl" style tranny.
>>109020144
I quite literally dropped my entire graphical interface and opened the comfyui web interface on another machine just to be sure it had as much ram as possible and it still doesn't work
I'm not going to waste hours of my life just because you want to troll people into running a model it's impossible for them to run
>>109020168
I have yet to see anything nice come out of ideogram.
>>109020168
holy shit i was going to call this bullshit but then i noticed that >>109020144 isn't using the Ideogram 4 Scheduler
>>109020190
>>109020190
Pretty sure that scheduler is important.
>>109020215
>Is nucomfy compromised
for like a year now yeah
>>109020215
Where in the code?
>>109020144
Is flash attention worth the hassle?
>>109020220
check it out y'all. c:/users/desktop/comfyui/network.py
>>109020223
>Is flash attention worth the hassle
I don't know how you can have comfy on your PC for more than a week and not have inadvertently installed flash attention like 20 times.
>>109020238
>ido
idk man. trained on plasters.
>>109020238
Flash attention *whore*
>>109020233
I'm sorry I'm not a vramlet
no lie, I've never really had any interest in API models, but one day I tried grok, and honestly it was uncensored, because like what are the odds lmao. A "uncensored" style API.
do you guys accept refugees?
not your workflow not your waifu
>>109020256
only if you are the kino kind of refugee
>>109020249
>I'm not a vramlet
>having flash attention 2 means you're a vramlet
That's a new one
>>109020190
>>109020201
I know I'm getting baited now but here's the same workflow with ideogram4 scheduler. https://files.catbox.moe/w0s0cg.png
>>109020223
5.43s/it with
6.09s/it without
Not massive gains but it takes 30 seconds to install so might as well
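For anyone who wants those two s/it numbers as a percentage, a quick sketch. Only the 5.43 and 6.09 figures come from the post; the 30-step gen is an assumption for illustration, and the function name is made up.

```python
# Seconds per iteration quoted in the post (with / without flash attention).
with_flash = 5.43
without_flash = 6.09


def relative_gain(fast: float, slow: float) -> tuple[float, float]:
    """Return (fraction of time saved per step, speedup factor)."""
    time_saved = (slow - fast) / slow
    speedup = slow / fast
    return time_saved, speedup


time_saved, speedup = relative_gain(with_flash, without_flash)
print(f"time saved per step: {time_saved:.1%}")
print(f"speedup: {speedup:.2f}x")

# Hypothetical 30-step gen; the step count is an assumption, not from the post.
steps = 30
print(f"seconds saved over {steps} steps: {(without_flash - with_flash) * steps:.1f}")
```

That works out to roughly 11% less time per step, which fits the "not massive gains" verdict.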
>>109020256
Depends where from.
AI is for chumps.
>>109020258
>>109020275
/adt/ :'(
>>109020250
I have Grok and Nano Banana, through their like um real basic plans. Like I think the lowest is a decoy product, idk.
Anyway, they're both pretty decent 1girl generators overall.
I do something I think is amusing, instead of photographing assorted hos, I attempt to memorize their appearance and then type it in and gen it. It's harder than it sounds, very amusing imo. So far, I go through basic colors like as layers top down, then styles, then key extras. So like
straw - hair
white - shirt
white - shorts
black - shoes
with extra layers if they are there, like a belt etc.
then a types layer
double tied pony tail
hourglass-ish 80's style shorts pantsuit *I'll look this up*
black roller skates (neon green accent)
sometimes there is extra info that's like maybe add, maybe don't, because it depends on the perspective, so if a freckled face, then you can't really have the pony tail.
And, if we consider the face, there are shapes to the face that might be observed.
I will also observe if something is dirty, and if she's sweaty, but these really are just basically optional according to what you think.
If you save it to the gallery in your phone, you may do a double take, because it may look like you took their photo, sort of. It's highly funny.
>>109020285
Yes, this is an anime general, you know tdrussell? Well, he posts here.
I'm sitting naked after I mowed the lawn, so my balls smell like a dead rat.
in case any ldg hot babes are into that.
>>109020285
I was worried you were gonna say /sdg/ or something. That's like hearing about someone who survived living next to the Elephant's Foot and decided to move in after all this time.
>>109020050
IT'S PIKACHU
>>109020285
We like anime too, what's your favorite anime?
I'll make the perfect ux for myself!
>>109020319
>>109020279
I like these, catbox?
>>109020325
NTA but i like Evangelion and Frieren.
>>109020337
cringe and cringe
Drag and drop is not feasible on comfy anymore, what happened?
>>109020285
Welcome ^^
>>109020285
Are you a gacha fag? here is a Furina, enjoy. Welcome to the thread anon!
>>109020325
That's kind of a loaded question, but to name a few: Katanagatari, shoujo shuumatsu ryokou, 3-gatsu no lion, made in abyss, kaguya-sama wa kokurasetai
>>109020380
I played a lot of genshin but got too bored in natlan and stopped.
>>109020300
Wait, tdrussell? The actual dev of Anima lurks this general? Based. Good info anon.
>>109020331
https://files.catbox.moe/eo3s33.png
>>109020425
damn, ideogram is pretty good
>>109020459
>damn, ideogram is pretty good
samefag
this shit is easy, bring it.
what causes schizsaar to seethe so hard about ideogram?
>>109020504
An iffy license and a rocky launch polarized opinion on the model. People who never moved on from their initial impression are baffled that people enjoy it now.
>>109020517
we just need tdrussell to finetune it to make everyone happy
>>109020476
well these;
>>109020319
>>109020279
>>109020172
(prompt plz...)
look cool for wallpapers but it could be a fluke since all i hear is that it's a tarded model trained on closed source outputs, is it?
oh anonie...
>>109020560
>it is tarded model trained on closed source outputs
it is. but 60% of nano banana pro is still 200% better than shit like z image or hidream. training on sloppa api outputs won't get you nearly as good as the models you're copying, but it's still better than everything else. local is just that pozzed right now.
[[[[[[[[[[[[cleft chin]]]]]]]]]]]]]]
>>109020053
Why didn't they just copy the Chinese? Bloated mess
>>109019569
Is it anima Samv2
6b is ideal. more than sdxl and can fit on most cards with decent quants. anima doesn't have enough parameters.
>>109020614
Easier to fix than moot chin tho
>>109020617
>Why didn't they just copy the Chinese?
desu because they are retarded and lazy
>Why didn't they just copy the Chinese?
you mean why didn't they sell out to API? thank god they didn't.
calm down anon no need to start getting upset
>>109020635
Yes.
>>109020560
>>109020441
>negative prompt: ugly
do you really need more?
>he doesn't even notice mootchin
The very best 1girl ever thought of is coming right up.
>>109020782
stop spying on my generations
>>109020782
Sorry to keep you waiting
>>109020800
>>109020800
>no feet
uh. that can't be it.
HERE IT IS!!!
*square toenails are normal.
boobs too small now wtf anon >>109020807
>>109020810
oops, here it is. lmao
>>109020812
I got scared that the giant cone nipples would be too much for a blueboard so I made the bboxes for the boobs smaller.
>>109020822
>want to make nipples smaller
>instruct model to make entire booba smaller
??????????
>>109020829
The bigger the boxes got the more ridiculous and bovine the nipples got.
>>109020822
herro im hiroshima nagasaki, i give you gaijin permission to postu
>>109020837
so what happens now?
>>109020941
show me something
ideogram sux for nsfw
>>109019368
Catbox?
>>109021051
Depends what your poison is.
>>109021053
monster-girls, futas, massive nips, normal shit
>>109021051
it's more uncensored than zit
How do I make my gens better? They're not up to par, imo.
>>109021203
put satan into the negative prompt