Discussion and Development of Local Image and Video ModelsPrevious: >>108777750https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, & Upscalershttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.info>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/tdrussell/diffusion-pipehttps://github.com/kohya-ss/sd-scriptshttps://github.com/kohya-ss/musubi-tuner>Zhttps://huggingface.co/Tongyi-MAI/Z-Image>Animahttps://huggingface.co/circlestone-labs/Animahttps://tagexplorer.github.io/>Qwenhttps://huggingface.co/collections/Qwen/qwen-image>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>LTX-2https://huggingface.co/Lightricks/LTX-2>Wanhttps://github.com/Wan-Video/Wan2.2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>Illustrioushttps://rentry.org/comfyui_guide_1girl>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkCollage: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
>>108789589does n*gbo really want beef again
Blessed thread of frenship
>mfw Resource news05/09/2026>SenseNova-U1-8B-MoT-Merger-: GGUF quantized checkpoints and layer-offload VRAM modeshttps://github.com/OpenSenseNova/SenseNova-U1#-updated-news>HiDream-O1-Image: Pixel-level Unified Transformer (UiT) without external VAEshttps://huggingface.co/HiDream-ai/HiDream-O1-Image>ComfyUI-RefineNode: local image refinement preprocessing, reference image processing and paste-backhttps://github.com/1Kynx/ComfyUI-RefineNode>Flowception: Temporally Expansive Flow Matching for Video Generationhttps://github.com/facebookresearch/flowception05/08/2026>LTX-2.3 PolarQuant Q5: 88% size reduction, near lossless quality (Cosine Similarity: 0.9986)https://huggingface.co/caiovicentino1/LTX-2.3-22B-HLWQ-Q5>Anima Scribble+Canny Control with adjustable strengthhttps://huggingface.co/CabalResearch/Anima-Canny-Scribble-Adjustable-Control-LoRA>Sparkle: Realizing Lively Instruction-Guided Video Background Replacement via Decoupled Guidancehttps://showlab.github.io/Sparkle>Continuous-Time Distribution Matching for Few-Step Diffusion Distillationhttps://github.com/byliutao/cdm>MSD-Score: Multi-Scale Distributional Scoring for Reference-Free Image Caption Evaluationhttps://steinsgatesg.github.io/MSDScore>SoftSAE: Dynamic Top-K Selection for Adaptive Sparse Autoencodershttps://anonymous.4open.science/r/SoftSAE-8F71>IMG Dataset Refiner (v4.0 Pro)https://github.com/NyxAwroo/IMG-Dataset-Refiner>FLUX, Open Research, and the Future of Visual AI — Stephen Batifol, Black Forest Labshttps://www.youtube.com/watch?v=x8Yb4RidLgM05/07/2026>Stream-R1 Reliability-Perplexity Aware Reward Distillationhttps://stream-r1.github.io>banodoco/hivemind: Drop-in skill that searchs the Banodoco Discord message feedhttps://github.com/banodoco/hivemind05/06/2026>Exploring Data-Free LoRA Transferability for Videohttps://github.com/Noahwangyuchen/CASA>Ortho-Hydra: Orthogonalized Experts for DiT LoRAhttps://github.com/sorryhyun/anima_lora
>mfw Research news05/08/2026>DynT2I-Eval: A Dynamic Evaluation Framework for Text-to-Image Modelshttps://arxiv.org/abs/2605.06170>FreeSpec: Training-Free Long Video Generation via Singular-Spectrum Reconstructionhttps://fdchen24.github.io/FreeSpec-Website>RealCam: Real-Time Novel-View Video Generation with Interactive Camera Controlhttps://xyc-fly.github.io/RealCam>SwiftI2V: Efficient High-Resolution Image-to-Video Generation via Conditional Segment-wise Generationhttps://arxiv.org/abs/2605.06356>ActCam: Zero-Shot Joint Camera and 3D Motion Control for Video Generationhttps://elkhomar.github.io/actcam>Arena as Offline Reward: Efficient Fine-Grained Preference Optimization for Diffusion Modelshttps://arxiv.org/abs/2605.06070>Secure Seed-Based Multi-bit Watermarking for Diffusion Models from First Principleshttps://arxiv.org/abs/2605.06153>FREPix: Frequency-Heterogeneous Flow Matching for Pixel-Space Image Generationhttps://arxiv.org/abs/2605.06421>Continuous Latent Diffusion Language Modelhttps://arxiv.org/abs/2605.06548>Autoregressive Visual Generation Needs a Prologuehttps://arxiv.org/abs/2605.06137>Continuous Expert Assembly: Instance-Conditioned Low-Rank Residuals for All-in-One Image Restorationhttps://arxiv.org/abs/2605.06127>Plug-and-play Class-aware Knowledge Injection for Prompt Learning with Visual-Language Modelhttps://arxiv.org/abs/2605.05910>DCR: Counterfactual Attractor Guidance for Rare Compositional Generationhttps://arxiv.org/abs/2605.06512>Taming the Entropy Cliff: Variable Codebook Size Quantization for Autoregressive Visual Generationhttps://arxiv.org/abs/2605.06207>MARBLE: Multi-Aspect Reward Balance for Diffusion RLhttps://aim-uofa.github.io/MARBLE>Eulerian Motion Guidance: Robust Image Animation via Bidirectional Geometric Consistencyhttps://arxiv.org/abs/2605.06280>AI-Generated Images: What Humans and Machines See When They Look at the Same Imagehttps://arxiv.org/abs/2605.06143
ok already deblessed
How's Hidream for text/hotties combo?
>>108789475Yeah, that happened back in the early 90s.
>>108789778Everything died around 2010.
>>1087897852016 was the turning point
>>108789778what model is that
>>108789798No. That was just when it was too late to turn back. Maybe 2012 really was the end of the world... just as we know it. Nostradamus was right.
I will be posting kino btw
>>108789822ZIM+ZIT.
Anon?
Ernie truly is almost there. All the model needs is a single tune to unslop itself.
>finetune will fix everything!
It did for SD1.5 and SDXL thobeit
but sd 1.5 and sdxl were good models
I've spent last three and half hours trying to debug a fix for the grid patterns in HiDream-O1-Image (full), but alas I failed.If I must say one nice thing about hidream o1, it's that the per iteration speed is reasonably fast for 8B.I am running it on a 3060, and a 1024p image takes 2 minutes for 50 steps, with partial system memory offloading. Triple that for 2048p. I get its 2048p speeds with 1024p under ZIM, Ernie etc.The quality is fucking ass regardless of grid though. Maybe there is something wrong with the script no one figured out yet, or maybe all the images we see are from the 200B API model and they straight up lied about the benches of the local variant. (I don't think you can even benchmeme this garbage)Anyway I go to bed a disappointed, sorry man.It runs with fine speed flash attention disabled btw, the repo is lying to you.
>>108789866im waiting
>>108790246My one remaining cope is that they would have had to brazenly cheat for the 8gb version to have done so well in the benchmarks. Maybe there really is some stupid mistake somewhere. Or maybe they really are that brazen.
>>108790227I don't see why not
>>108790246Agree.I was excited to see 2048 as the default resolution, but there is no real detail.It is relatively fast though (25 seconds for 50 steps, 2048x2048). I will try with the prompt enhancement next as some models improve clarity with more descriptive prompting.
This model is extremely good with infographics/illustrations. It's also probably next level art.
can illu/noob not do a wig partially off someone else's head that shows the real hair color? can anima do it?asking for a friend
>>108790410anima probably could
>>108790293I tested using the prompt enhancement script from the repo (following the exact recommendation with gemma-4-31B-it at full precision). It roughly doubled the length of the prompt, but the image output is not very different.
>>108790433i guess ill have to try it. cant even find a lora for what im looking for, seems to be super niche.that marceline is super weird looking, reminds me of homestuck
>>108790433https://danbooru.donmai.us/wiki_pages/hair_visible_through_wigseems to work some NLP would get you a better hit rate
>>108790472that tag is fighting me tooth and nail. if i do a blonde wig, it shows up as multicolored hair even with it in negative, if i make it a weird color like pink, the wig shows up as an object.
>>108790176can you drop catbox? I fucked something up and didn't manage to make it work
Just learned how to use IP adapters.chuds BTFO.
>>108790531Surehttps://files.catbox.moe/5etar5.pngHad to censor the booba for this post kek. I'm just using the standard template workflow with a touch of the turbo LoRA extract to speed up base gens https://civitai.com/models/2551262/ernie-turbo-lora-extracted.
>>108790619>check the ernie model>almost every gen is STILL 1girl, standingkek
>>108790722What did you generate with it?
>>108790619based feeta
>>108790736nothing, ernie looks like shit and im not downloading dogshit. currently messing with anima
>>108790722Come again? Civit users are uninspired jeets.
>>1087907755girls...SITTING?!?! AHHHHHHHHHHHH ERNIEMAN SAVE ME!!!!!!!!!!!!!!!!
>>108790520
1girl standing general
>>108790252Its current aesthetic lends itself well to sharp graphics. Not so much anything else tbdesu.
>>108790775Not bad, how are the generation times?
>>108790775give me the QRD on Ernie. Can it do realistic nsfw stuff? Is it easy to train?
this fagollage smells of curry and shit
why is everything shit and gay
maybe it's you
>>108789589I haven't updated ComfyUI in a while; does putting more than one character still suck compared to A1111?I just want to prompt normally, no conditioning nods or any of that crap.
>>108791397the UI has no bearing on this
https://xcancel.com/ostrisai/status/2053256188142428341>I am running my first test on training a HiDream-O1 LoRA on AI Toolkit. I don't want to get too excited too early. But this is the coolest model I have EVER seen. Super efficient pixel space. No VAE. No Text Encoder. Trains super fast. This is an industry changing innovation!
>>108791433Why would he post that image with that text? Should we not believe our lying eyes? The results are shit, nothing like a tarot card
>>108791433no thanks
>>108790686>>108790780How do you make these? They look so cool.
>>108790775it's nice, but I guess no full nsfw right?
>>108791429How do I do what I want?
>https://huggingface.co/HiDream-ai/HiDream-O1-ImageAnyone has been able to replicate the output of HiDream-O1-Image?I feel like something is weird, are they shitting us and presenting the images from their non released 200B in the page of the open sourced 8B?
>>108791569type the prompt in the text boxthe result will be the same, rng notwithstanding
>>108791572I couldn't get any good results, so subterfuge seems likely.
>>108791070Been out of the loop, what model makes gifs like this?
>>108789622>>108789623someone kick nigbos cage?>commences to post this spamfor a year
>>108790775holds up mirror
>>108791580Thanks anon, I'm looking into their technical report too, they don't seem to make a difference on what model they're exactly using.It's a bit annoying because it's clearly better for text too.Example from their pdf:(left is qwen 2512)>Input prompt: An advertisement for XP Boost, a gaming hydrationdrink, featuring a young male gamer holding a can of the product in ahigh-tech gaming setup. In the foreground, a young male with curlybrown hair, appearing to be in his late teens or early twenties, wears ablack and green gaming headset and a black t-shirt with green accents.He holds a black and green can labeled "XP BOOST HYPER LIME" in hishand, extending it toward the viewer. The can features a lightning boltdesign and text indicating "ZERO CRASH FORMULA" and "16 FL OZ (473mL)". The background shows a dimly lit gaming environment with acomputer monitor displaying a blue circular graphic, a keyboard, and agaming chair with the XP Boost logo. The left side of the imagecontains large text reading "QUEUE UP. POWER ON." in white andgreen, with smaller text below stating "GAMING HYDRATION DRINKZERO CRASH FORMULA" and "Caffeine + electrolytes + B12, crafted forranked nights and tournament weekends." Below this, a green buttonlike graphic reads "LEVEL UP YOUR FOCUS." At the bottom, four iconsrepresent the product's benefits: "CLEAN ENERGY," "ELECTROLYTESHYDRATION," "B12 FOCUS," and "ZERO CRASH." The overall scene isframed with neon green lightning effects at the bottom.
Babe babe wake up!, Laxhar Lab has reverse engineered and open-sourced the trainer for SenseNova-U1, a leaked unreleased image generation model.>Key highlights:no VAE or separate Vision Encoder, making it a true end-to-end text + image model8B parameters, making it fast and efficient compared to rivals like GLM-Image (16B)Strong at understanding complex prompts and generating infographicsTrainer is live on Hugging Face: Link: >>108791657
>>108791675How good is the model at heavy text?Can you give it the prompt from >>108791624?
>>108791675>>108791692Here is the output (full version)
>>108791675How good is it at 1girl and nsfw.
>>108791723>that handhorrifyingtext is pretty good though.
>>108791576You are too cryptic. What do you imply:is it that breaking prompt into chunks doesn't workorComfy breaks into chunks automatically when line break occurs and it's not effective?
>billion-dollar labs releasing non-stop garbage>big russ out here releasing gemstone after gemstone at 1/10 the parametershow does he do it?? or better yet, why can't they??
>>108791723Thanks anon.The text is almost fine with spacing issues, but the logos are bad, and obviously anatomy is fucked up.
>>108791765its gemstone because its the only model animepoors can run
>>108792092where is your 2 bazillion parameters gem?
>>108791433>from slop to slopgod damn it man why does nobody have eyes
>>108792287idk but it looks like a fast learner, by the 3rd sample it understood the style.
>>108792313If the style is "slop flat art" then sure. Looks nothing like a tarot card though
>>108792374it's ostris, the style is probably "slop flat art".
>>108792386Grifters gonna grift
Why didn't you guys tell me anima is so much better than illustrious, I'm training styles and holy fuck, it really catches on.
i missed this one, new api nodes dropped??https://blog.comfy.org/p/luma-uni-1-is-now-available-via-partner
>>108792467it's not better than pony
>>108792468this one actually looks good unlike the localslop we've been getting. no wonder comfy stopped supporting local models, the shit we get is barely worth spending 5 minutes on. local fell off majorly
sunday, fuddy sunday
>>108791748NTA but chunk shit does fuck all to help multiple characters.You were already told what you needed to do for that.This is my last (You)
How do I convert a novelai prompt to work with anima?
how's trellis 2 doing?
>>108792523What does your novelai prompt look like?
>>108792621give huge bob to girl and make sexy indian man
>>108792621
>>108792689>>108792621https://files.catbox.moe/xkdiyj.png
>>108792716Not my gen btw. It's a shittedfag from /trash/ but the artstyle makes me diamonds and I wanna gen vanilla with it.
yep that's not possible on local
>>108792689Set steps to 40year2024 becomes year 2024Undesired content is negativesMove the ones with -2 in prompt to negativesRemove {{}} and :: s from promptRemove underscores from promptRemove artist:, artist mixing is weak on anima but feel free to keep them.
>>108792732No luck. Everything's coming out way too shiny and clean.
>>108792885I forgot to addYou need @ before artist tags
>>108792904Yeah I forgot them too. Still coming out nothing like the original style unfortunately.
>>108792915And this is with no weights on the artist tags
Jokeal Confusion General
>gpu crashed
sex with JKs
zit sameface slop
flux2 klein9b still the best for doing prompt+image(+image) instruction inputs? (with that qwen model roughly competitive with it). i see some talk of ernie and a very new hidream model itt?
>>108793032i love my asian zitslop girls
>>108793036Where DreamShaperZIT?
>>108793042its not 2024 anymore broski
how do we stop anime slopstyle for good? anima keeps forgetting artists and is going full pony slop so it's not the answer. tired of all these failbakers shitting things up further
>>10879305435 stars status?
>>108793056and this helps because????
>>108792468local is so fucking DEAD>>108792287>god damn it man why does nobody have eyesturns out there are multiple kinds of ai psychosisone develops after you've generated so much slop, that you cant distinguish slop from non slop anymore
>>108793054We need to reverse engineer nai
>>108793054make your own model
https://github.com/Comfy-Org/ComfyUI/pull/13817cant wait to try this 'vaeless' model
>>108793065That "anon" is an insane tranny. Don't bother replying
>>108793071will you front the money? I promise I won't have pony score faggotry
>>108793054>anima keeps forgetting artistsDo you have any proof of this yet
>>108793085anima uses pony scoring?
>>108793085If the scores are bad, why don't you just not use it?
>>108793036wheres her dick?
>>108793133buried deep in your ass
>>108793117yes, the style priors are overwritten for score tag dropout training. It's why the art styles are degrading every preview before it goes 100% api
>>108791675>Laxhar LabThey should get their heads out of their asses and train anima on their noob dataset>>108793158>still no proof Next time you post please have some proof ready thank you anon
>>108793158>art styles are degradingwelcome to /ldg.
>>108793170sorry but saying there isn't any proof isn't viable if you have no proof yourself. make a side by side comparison with the same artist tags and see for yourself
kill ani in real life
>>108793196You seem to not understand the burden of proof
the only proof i need is that no matter how advanced models get, you knuckledraggers will make the same 1girl, standing slop you've been making since sd1.5
>>108793170cosmos has a gay corpo licence they don't want to touch. anima's shortcomings comes from njudea and Russ's own Jewish greed
>>108793198I accept your concession. anima has failed and is just a new pony model until an illustrious like model is released again without Jewish greed involved
>>108793206How is the license substantially different than Noobs There's already other finetunes of Anima
>>108793220njudea can arbitrarily change it's licence and cuck everyone using it retard. It's a stipulation if the licence
>>108793218>still no proof kekd
>>108793229cucked out of profiting off open source? fine by me
bicker bicker bicker
>>108793170would rather they started pretraining for one of those vaeless meme models, doubt anima would get that much better
apache2 anima status? oh wait thats right it failed
>>108793205mona lisa is a 500 year old 1girl, standing gen people still talk about
>>108793282wood you, though?
someone gen the mona lisa with cum on her massive forehead
>>108793020plap plap plap
>>108793286does comfy have working dynamic offloading yet? sdcpp does it way better
>steal millions of anime images from real artists to train your model>release it under a license that forbids using your anima outputs to train competing modelswhen are localkeks finally gonna develop a sense of shame?just leave real artists out of it and steal from us apichads instead. we don't mind, go ahead and distill our outputs all you want but if you're gonna cry about licensing then keep your grubby hands off actual human work, you fucks
8 hours ago lodestone updated training visualization and made the remaining part much shorter.Is he finally giving up on it, lel
>>108793410why is the loss going up https://upload.wikimedia.org/wikipedia/commons/thumb/1/18/Noto_Emoji_v2.034_1f480.svg/1280px-Noto_Emoji_v2.034_1f480.svg.png
>>108793409faggots eat anything up without thinking about the consequences like the pony model for example
>>108793410It used to be 2.25 (million I think) steps, now he seems to be planning to call it quits at 1.75>>108793422He changed the way it is calculated, for some reason.
>>108793410why can't this gigaretard just do a simple finetune
>>108793444we ask this all the time about tdrustled, ponydev and noob team
>>108793197meds
>108793449One of these is not like the others.
>>108793229>njudea can arbitrarily change it's licenceit literally can't thoughbeit, have you even read the nvidia cosmos license? there's basically no restrictions. it's intended to be commercially usable after all
Cyberdyne Systems just released a new model!
no fate but what we make
ani was right about turdrussel
>>108793410lmao, its completely fucked leave the serious work to the professionals, such as sarah peterson and playtime_ai
Anyway, I wonder what kind of surgery he will perform on Ernie.That's what I think he is moving on to.
Not a single one of these models are any good btw. (expect tuna which is MIA since announcement)Maybe it was always a ruse to distract local copers while API models secretly perfected the vae?
>>108793471the one whose model has x50 less downloads on civitai?
>>108793612the biggest reason the vae exists is because of gpulets. pixel space genning is too slow or intense for consumers which is why I don't have high hopes for people adopting this locally. Most API models don't use a vae
>>108793649Yeah I wasn't serious about the second sentence.About speed I don't know.Hidream is reasonably fast, but shit quality. GLM and LLaDA were slow and shit.I don't know maybe fast and good pixel space on consumer hardware is possible. I am very skeptical that we are getting it though.
>>108793687the model would have to be pretty small and possibly distilled if you want the speeds maybe slightly better than DiT latent diffusion but I think DiT has run it's course for a while now. AR should be improved instead of benchmark chasing on the same years old tech
>>108792479People still use pony? I thought everyone was using illustrious and noob
>>108793737And still others are heterosexuals over the age of 17.
>>108792468took them a whileanon was shilling this two months ago >>108452832
just put the fries in the bag already big russ. how much more training could a 2b model need?
What is it about competent people being rewarded for their effort that makes Pembroke's biggest rape victim seethe so hard?
Skill takes a long time. Life is short.
Big Russ just flew over my house and dropped a note which confirms Anima Full will be API only.
>>108793970Should have worn slippers.
>>108794030Thanks I hate it
>https://civitai.red/images/127883693holy shiet that prompt
>>108794102>join my discord saarfuck off jeet, no one gives a fuck
>>108794102Kek, all of that for "white hair, white eyelashes, himecut, pale skin, face close-up"
>>108794102Just 600 words? I have seen people go far more schizo.
>>108794102What's wrong, you don't like boomer prompting?
>>108794102Is this jeet just taking real life photos and saying it's his gens?That's nastya zhidkova.
>>108794151The eyes aren't even the same color, blind anon.
>>108794151He is a grifter but that's a ZIT gen and not a photo of zhidkova
>>108794131he's selling a workflow so obviously he's bloating the prompt as much as possible to make it sound convoluted and specialreminds me of that short lived JSON prompt phase every jeet started doing to larp as a hacker or something
>>108793073waow, what model make this
>>108794239It's a FACT that if you parse your prompt through JSON the AI model appreciates you more. It doesn't do any better it just appreciates you more.
>>108794030jfc, why is this so awful?
>>108794253You don't like uncanny angelina jolie?
>>108794114uh oh melty
>>108794273yea, and?
>>108794281random 1/4 hair just stuck to face for no reason
>>108794295there has to be a reason
>>108794374and the reason is (Yooou)
jeets are really out here buying workflows? lmao
>>108794473now do one of her without any makeup
do artists really make posts about their work being stolen while they're downloading cracked photoshop and pirated art courses in the background?
>>108794576Yes, hypocrisy is an integral feature of the leftoid artoid brain.
>>108794576Great artists steal.
>>108793409I wish cloudcucks were a bit smarter. What a retarded post jej.
>>108793409> 'real artists'lmfao
>>108791469Here is the positive prompt for one of them, using Anima:traditional media, binding discoloration, bleed through, crease, scan dust, painting \(medium\), canvas \(medium\), scan artifacts, magazine scan, scan, artbook, doujinshi, textless version, poster \(medium\), production art, novel illustration, non-web source, original, commission, @minuspal, @nirak, @wamudraws, oekaki, jaggy lines, huge breasts, hanging breasts, ass, wide hips, thighs, eyelashes, lips, lipstick, breasts, mature female, alternate breast size \(larger\), cleavage, curvy, toned, 1girl, dark magician girl, wizard hat, green eyes, blonde hair, bare shoulders, long hair, skirt from side, determined, armpit focus, walking , alley, urban, night, outdoors,>>108791595Any model can either via seed variation or multiple light i2i passes on a gen (with each pass serving as a frame). Those used the latter.The former has been an A1111 extension for quite some time.
>>108793970my sides
>>108791595https://github.com/FizzleDorf/Loopback-Wave-for-A1111-Webui
>>108793035>flux2 klein9b still the best for doing prompt+image(+image) instruction inputs?It's at least well supported and looks reasonably good, but it has the same issues as other models of their sizes for small text making sense for example.>i see some talk of ernieI have no idea if it's good or not, I'm also curious about it.>very new hidream modelThis model makes no sense to me : the previews look very good on their paper and hf, but what people post trying to replicate them look worse than sd1.5.So either they're cheating and showing images of their unreleased 200B model on the page of the 8B, or the support right now is very bad so whatever we're doing isn't working with it.
I really miss forge's adetailer for faces. They look really messy without it. I just looked up Comfy's equivalent and it seems to be such a drag to set up. Wish me luck.
i give up on my dream lora. its not working out. i dont have photo real data for the concept so it keeps coming out with that clay sloppa look.
How come ernie won't take my anima latents like z image does? it just throws errors>>108795897What are you trying to make my friend?
>>108796096Ernie has flux 2 vae. It has higher channel count than anima/zit. You are trying to fit a square into a circle shaped hole.Anima and zit vae are different btw despite having the same shape, so you are passing mostly gibberish latents to ZITYou should be passing anima latents to a model with qwen vae, like qwen image.
>>108796112That sucks. I did use the anima with z refiner that got posted here a while back and it has been my favorite work flow in a good bit and was hoping to just put Ernie in as it is better at understanding things than z image
a redditor just vibecoded a site that lets anyone create free AI videos using his gpus with no safety filteris this a good idea?https://www.reddit.com/r/StableDiffusion/comments/1t9juoy/i_built_a_site_to_create_free_ai_videos_using_ltx/
>>108796122could you share that wf? checked the archive but the link is dead
>>108796096>What are you trying to make my friend?trying to make photo real fantasy races. but i cant seem to force it to understand the concept while still having realistic lighting and texture. if there were images of things that had really good make up like the orcs from lord of the rings i could use that, but what i am trying to make doesnt exist
Women can smell my musk. They probably know I have an rdna2 card. If I were a woman I'd be dizzy.
Wow did anyone see this?The world 1girl supply at record lows, 10 days remainingBBCMumbai, India
>>108796346oh no! Not the super censored talking head model that falls apart the moment you add any movement to the prompt?! It's so exploitable!
>>108796893wan 2.1 and ltx 2.3 are super censored?
>>108796898I've only used wan2.2 which isn't that censored. But without loras, ltx is
i cant stop making images i make them nearly every dayfor hours and so on its almost all i do
>>108796949Are you >>108796928 ?
>>108795880How many steps did you bake this? I imagine Base would be less resistant to her unique hair coloring DESU.
>>108796904Why do you guys keep saying this? LTX with NSFW loras is unironically better than WAN these days especially because it can do audio as well.
>>108797047Can't get it to work.>Here's your 16 GB Workflow bro>Doesn't work
>>108797047not really. It breaks cohesion pretty easily. You can quickly test this. Faces change, skin changes, anatomy warps with any kind of movement. The nsfw loras sort of work but not really. I'm not saying wan22 is much better but what I find that works is making a wan video in 5 second sections, stitching it together, and then adding audio to it with ltx, using 16 frames or so as keys to maintain coherence. It's pretty laborious and you have to get lucky with the wan SVI workflow but I think it's the best you can doAlso, ltx text to video is complete shit. I don't think anyone uses it as a t2v model. Wan on the other hand can deliver decent t2v
>>108797074i feel like the whole is greater than the sum of the parts for ltx. like in a vacuum it does most things worse than wan, but as a total package it wins out. at least for me.
so anima preview 4 tomorrow, right?
>>108797074you can use klein9b edit model at 0.5 denoise on last frame with only improve the image keep composition and style the same and that works quite well. however it sucks at complex actions like repositioning actors. wan 2.2 is still better by miles yeah. the sound is missing but most ltx lora's generate shitty sound's anyway.
>>108797074>Wan on the other hand can deliver decent t2vthis and wan i2v model also does t2v if you feed it just a blank white image using node in pic related and then a decent prompt and character model. For prompt i just use llama.cpp and Qwen3-4B-abliterated_dark.Q8_0.gguf with -ngl 0 so that none of the layers are loaded on the vram and then i can safely have that running in a terminal with out getting oom. hours of fun, hours, i just worry my dick will fall off.
>>108797389>imageahahahah you can't use that with ace step.
>>108797389I use https://github.com/FranckyB/ComfyUI-Prompt-Managerit uses llamacpp to load the qwen gguf as a node and spits out the prompt in the workflow
>>108797444yeah i tried that but I get problems with it crashing after a while, using my own llama on cli i get better control to how it loads the llm.llama-cli -c 32000 --flash-attn on --no-mmap -rea off -ngl 0 --threads 4 -m /path/to/model32000 context is probably over kill mind, /clear clears the context and lets you start a fresh prompt, just need to get it to first engineer a prompt template for specifically generation of good wan prompts. Then can just leave the thing running in the terminal and copy paste what you need without it causing an OOM for as long as you have enough ram/swap then it will run albeit a little slow compared with using the GPU. i'm only getting 2.6 t/s but it don't matter much :)llama.cpp compile commandcmake -B build -DGGML_CUDA=ON -DGGML_BLAS=ON -DGGML_BLAS_VENDOR=OpenBLASComfyUI-Prompt-Manager would be better if it was able to just connect to the users already running llama-server process.
>>108794374I recognize this slutwhere's the disgusted face looking at viewer tho?
>>108797596Well that's because he's wanking. I guess.
ComfyUI 0.21 is out, with AMD support for --enable-dynamic-vram. I tried it out, but it still can't handle OOM gracefully on an iGPU laptop. Attempting untiled VAE decode with too big an image still freezes Windows for several minutes, and the max safe resolution is lower than with --disable-smart-memory, i.e. about 1600x1024 instead of 1600x1280 with SDXL.If/when it snaps out of the freeze, it does fall back to tiled decode, but the right behavior should be to see the oncoming memory ceiling BEFORE smashing headfirst into it. It's overestimating the available memory somehow.
>>108797616(Freezing also happens in the default mode with 1600x1280, just to be clear. It didn't used to. I was just hoping that dynamic vram would fix it.)
>>108797628igpus don't have dedicated vram. your best bet would be increasing the page file and hoping for the best.
https://files.catbox.moe/209htl.wav
>>108797616>wanting to do ai stuff with iGPU>amd at thatLMAO bro, maybe you would've had some argument if you were talking about the strix, but the vram requirements would barely use half of its unified ram anyway
>>108797640Use mp3 next time, you're wasting catbox's bandwidth unnecessarily.
>>108797616just use tiled vae decode nodes instead of waiting for the fallback.
>>108797638AFAICT it IS hammering the pagefile when it freezes, which is how it sometimes recovers after a while. I think it's measuring the available physical RAM wrong, maybe double-counting the shared portion or something.>>108797652It's not the fastest, but I can run some stuff decently. I just wish it'd detect OOM more gracefully so my options aren't jamming the power button or letting it rape the SSD for 30 minutes while hoping it unfreezes.>>108797696Or I could do that, yeah. But it's the principle of the matter, the airbag don't werk.
>>108797727>Dynamic vram is the new ComfyUI memory optimization that should massively reduce ram usage and generally speed things up on Nvidia hardware on Windows and Linux.>on Nvidia hardwareNot for you (or me)
>>108797652> wanting to do ai stuff with iGPUiGPUs is the future of local >>108795908
anyone had luck setting up comfy on newest fedora 44?
>>108797750They just added AMD support with this release, though it's not enabled by default.
>>108797761make a pyenvmake a venvinstall
>>108797778> pyenv> venv> 2026> what is uv
>>108797757>IntelI'd go "bruh", but Comfy recently added portable builds for Intel, so it's looking readier-to-use lately.
>amdshits and intelshits are getting more supportI hate this, how do I justify buying a 5090 now???????
>>108797816You can gen in seconds instead of minutes, and train in hours instead of... days? A week? I haven't actually dared try yet.
>>108797816cuda is still king for the foreseeable future. AMD is controlled opposition and intel is intel
anima preview 4 isn't coming out because russ finally realized that preview 3 was an actual fried downgrade and it only got worse from there
>>108798051>>108793056
>>108798153you think you can downplay every anima criticism by mentioning a single faggot schizo?
man imagine running sdxl shit in 2026 lmao, stop being poor
>>108797789>uvlol, lmao even
Can I make is so my workflow does 4 runs of model 1 before running them all in model 2 in a simple way?
>>108798336you can do anything you imagine with comfy, anon. Think of yourself as Johnny Depp in one of his magical movies.
love me some comfy
i will never stop using sdxl, it's the best
just deleted all my 600gb of sdxl models and loras
>>108798461based
where is anima v4
there will be no further updates to anima
I've been gooning non stop since I got a nice LTX2.3 eros workflow going for my setup
>>108798518and you course you will provide no proof because you're a pussy. why even bother shitting up this board then
can anyone give me some prompts for sex in ltx 2.3? The ones you gave me for wan2.2 were amazing back in the day. just give me something to throw into wan2gp so i can copy the settings. thank!
>>108798518so you make us read your gay masturbation diary AND not catbox the "nice" workflow? fuck you, buddy
>>108798640lmao stay mad
>>108796886My github stars are stuck in Hormuz.
>>108798666*cums on you*
>>108790392>My BRAPs would kill you traveler
>>108791802legendary status
/r/ing' sfw VAGEEN
>>108798982Man toes, man foot silhouette shape. Only men have a big toe that prominent and such bony feet. I’m even showing you photos of real women because your gen is rotoscoped slop.
>>108799315Big if true. No model can be considered good unless it's trained with "male feet" and "female feet". A real artist can draw a man with girly feet or vice versa, and generated images must allow that as well.
>>108798982"Feet are the second face of a character" or "Feet are the face of the lower body," say good anime illustrators. Your gen's feet just followed the logic of the tights and calves without getting the consideration they deserve.This wouldn't have happened if you'd consulted >>>/g/adt/ first. We emphasize this stuff because of the avalanche of "politically correct feet" in new anime models.
>>108799430meds
Has Flux2 Klein been surpass yet for edits?
>>108799430thiswrong feet is not okay, and most importantly, not ldgsome people enjoy trolling by posting illogical feet in order to damage our reputation. don't give them the satisfaction of leading you astray
why is inpaintign on comfy still so shitty compared to Auto1111 after all those years
>>108799587Nah.
>>108799603comfy only cares about API bucks anonhe betrayed us
>>108799587Yah.
>>108799603why inpiainting when you can mask it and API node the fuck out uf it?
>>108796970trained on z-image running on turbo at 200%around 1500 steps or something this lora is old
seedvr is magic. does any ai upscaler even compare?
>>108799587No, and it probably won't be considering we're never getting Z-Image-Edit.
Fresh>>108799954>>108799954>>108799954Fresh