80b EditionDiscussion of Free and Open Source Text-to-Image/Video Models and UIPrev: >>106706484https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scripts/tree/sd3https://github.com/derrian-distro/LoRA_Easy_Training_Scriptshttps://github.com/tdrussell/diffusion-pipe>WanXhttps://comfyanonymous.github.io/ComfyUI_examples/wan22/https://github.com/Wan-Video>Chromahttps://huggingface.co/lodestones/Chroma1-BaseTraining: https://rentry.org/mvu52t46>Neta Luminahttps://huggingface.co/neta-art/Neta-Luminahttps://civitai.com/models/1790792?modelVersionId=2203741https://neta-lumina-style.tz03.xyz/>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girlTag Explorer: https://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbours>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
TELL ME NUNCHAKUWHERE IS THAT WAN QUANTWHERE IS THAT QWEN LORA SUPPORT
>>106708345>WHERE IS THAT QWEN LORA SUPPORThere?https://huggingface.co/nunchaku-tech/nunchaku-qwen-image-edit-2509
Blessed thread of frenship
>>106708106Alright here's some (retarded) napkin math.HDM is 340m params for $650.Lumina2 and sd35m are both ~2.5b params. They both show strong capabilities at their size while suffering from training data and architecture problems, so an HDM-like model with its superior architecture and better training data could easily become local SOTA at 2.5b.$650*2.5b/340m = $4779HDM is undertrained according to the author, so let's double that budget. I suspect we hit severe diminishing returns with a budget >$10k. With current tech, we can get crowd (or richfag) funded local SOTA with <$10k. Keep in mind there are other optimizations that aren't used in HDM, so if one or two other 10x optimizations can be incorporated, we are in very good shape.>>106708232SDXL anime models like noob are the most fun by far.I've mostly stopped non-anime SDXL models unless I need a controlnet or something. I still use SDXL for really big upscales since it has the tile controlnet and can fit the whole gen in memory while not taking too long. There are some impressive non-anime SDXL checkpoints out there, but the prompt comprehension issues and lack of art style knowledge hurt it significantly compared to Chroma. >>106708267what case/mobo do you use for this? I've been wondering if it's possible to fit a dual gpu setup in a mid tower these days
>>106706788It got that soft/grubby SRPO look... they really need to show it doing something a little more impressive than stock photography. Maybe a complex interaction or something, I don't know.
https://xcancel.com/SD_Tutorial/status/1970518843048272293#mwhat's happening here? why SRPO is completly fucked on fp8?
>>106708383Sex with Jenny
>>106708384Maybe like 3-4 steps?
>>106708376Where's the dataset coming from?
ÆÜGH Icum>Error: Maximum file size allowed is 4MBhttps://files.catbox.moe/f7x5jz.mp4
>>106708429ask chatgpt to make a script that compress your video to 4mb
>>106708415literally just use the danbooru api if you just want a booru model
>>106708376I have a full size case with a full sized motherboard with two GPU slots, I prefer extra space for easy moving than seeing how tight I can fit everything.
>>106708415catbox please?for the anime side of things, a booru dataset augmented with NL captions (but not replacing the tags).for the rest, desu, I don't see why something like LAION wouldn't work. it was fine for sd1.5. then augment it with some better captions, aesthetic selection.. use srpo or whatever too>>106708486damn... maybe I should give up on my dual gpu idea. don't really want a behemoth on my desk.
>>106708413All computer metrics are bunk in the scheme of generative models. The only real way to measure a model is promptability, level of censorship, breadth of conceptual and stylistic capacity, and ultimately distilled as "usability". Right now all the metrics are biased metrics essentially designed around making stock images but really if you want to see how shit aesthetics filtering is, just take 20k images from any booru and see which ones standard aesthetics metrics consider low quality.
>>106708528honestly i don't blame you for assuming gpu sizes either way, you really don't have a full idea of how huge or small gpus are until you actually get one in your hands.then you totally forget once its been in your system for a year+.dual gpu'ing is not for the faint of heart. or wallet for that matter.
>>106708528Why would you want a space heater on your desk? Two 4090s running full tilt even throttled gets quite toasty.
trying to find a method to stop style swing but it's really bad in some cases especially at random seeds that tries to be irl during the first pass. I'm giving this model one last chance before dumpstering it
>>106708358Lora support anon, that is still being worked on. You can use model fine, though I think they messed up the lightning merge in that version
>>106708429why did that video need to be 11 seconds if its the same motion, REEETARD
>>106708528I use a basic old raidmax smilodon case from like 15-20 years ago and it fits modern GPUs fine
having a danbooru data set with the tags grouped by subjects, background and interactions would make even SDXL based models exponentially betterwe just need a powerful VLM like Gemini but uncensored
>>106708703nai seems to do something like thathttps://docs.novelai.net/en/image/multiplecharactershowever, nai's implementation kind of looks like it's just calling on a regional prompting addon at least some of the time.reminder that the forge regional prompting addon is able to generate region masks FROM THE PROMPT ITSELF, and we still don't have a comfyui equivalent even though this kicks ass:https://github.com/hako-mikan/sd-webui-regional-prompter?tab=readme-ov-file#region-specification-by-prompt-experimental
>>106708799how could you gen this absolute filth?reported, filtered, snitched on, sent the batsignal
>>106708827I prompted black monolith lol
Is there a straightforward way to get SD working on linux with an AMD card? I'm following the wiki installation and I kept running into issues
>>106708799sovl and kino
>>106708799That's the most disgusting thing i've ever seen on 4chan, and I'm an oldfag. You should be ashamed.
>>106708415>>106708528NTA and also asking for catbox, thanks
>>106708799The best image posted in a long while
>>106708328I'm in the OP
>gm
I've set up Qwen Image Edit but it's maxing out my VRAM. I've tried launching Comfy with and without vram saving parameters. Is the Q8 model too big for a 3090 (24gb vram)
>>106708883
>>106708883..no? i have 16gb vram and can use q8 fine
>>106708844>make venv>follow these instructions https://github.com/comfyanonymous/ComfyUI?tab=readme-ov-file#amd-gpus-linux-onlyif that doesn't work, you will have to tell us more. card, distro, UI you're attempting to use. these steps worked on my old rx 6800 and my 7900 xtx.>>106708883try the fp8 scaled version instead. ggufs have a broken implementation on some cards, I have a similar issue on my 7900 xtx. fp8 scaled is faster anyway.
>>106708872>>106708892You're a real piece of shit debo, it's also obvious that you like to bring up old irrelevant drama from other threads to force some conflict.
>>106708931Can't get it to work on arch with a 9070
damn nigbo wthelly
>>106708931>try the fp8 scaled version insteadIt's not faster on a 3000; on 4000/5000 it is.
>>106708944https://en.wikipedia.org/wiki/Nigbo_language
>>106708931>>106708883Do I need to run that pytorch step if I followed the Auto Installation? I can try it but idk if it will do anything
>2025>finetuned sdxl is still the best local model for realism and anime imageswhen are we gonna get an unslopped 4b-5b model with a permissive license?chroma is slow dogshit that looks badseedream is the only new model that looks good but it's NOT local
>>106708964Can we not do this ritual post?
>>106708650based retard doesn't understand what the pingpong effect is (or that it's a setting in those nodes)anyway, GAAAHHH THE OOM IS EVERY GEN NOW CUMFARTUI YOU'RE PISSING ON MY LEG AND TELLING ME IT'S RAINING!https://files.catbox.moe/dyugav.mp4
>>106708959reading comprehension anon. that pytorch setup is for the AMD linux user, not you. you should try the fp8 model instead of q8.
>>106708941>>106708931Nevermind, got it to work with the manual installDon't know why I bothered with the pip comfy-cliThanks
>>106708959>>106708883oh yeah and if you're ever OOMing during VAE operations, replace VAE encode/decode with TILED VAE encode/decode>>106708999nice
>>106708844I think I saw a new beta version of rocm pytorch released today, in theory getting that should make things really straightforward
>>106708772>and we still don't have a comfyui equivalent>https://github.com/asagi4/comfyui-prompt-control/blob/master/doc/regional_prompts.md
>>106708772sounds like DAAM -> Latent Couple
>>106708883no? I use q8 on a 4080 with 16gb. it should be fine.
>>106708772Dude this is better https://github.com/Haoming02/sd-forge-couple
>>106709346what's with the obsession with that random dude
>>106709357It's hard work to get to lolcow status
>>106709357It's obviously cause he worked at blizzard duh. Real answer the dude went on a weird tirade against getting game developers to create offline versions of games when they EOS them.
https://huggingface.co/nunchaku-tech/nunchaku-qwen-image-edit-2509
https://www.reddit.com/r/StableDiffusion/comments/1nravcc/nano_banana_vs_qwen_image_edit_2509/damn the lightning lora really sucks
>>106709355NTA but thanks for reminding me of forge couplewould you pls catbox that image?
>>106709409Ahh..man. Coping they will make a newer lightning LoRA.
>>106709452I don't have it on this computer, this was during my laptop era
>>106709465i mean just any forge couple'd gen will do, but understandable
>>1067094608step one works fine in general for qwen edit v2.
>>106709472I'm not doing anything special and I don't give catboxes. It's a long story that becomes evident whenever you see the schizo screech the name ran.
>>106709481ran is the schizo
>>106709475
>106709486>time wasting postMore wheelchairs for you then
>>106709492thanks schizo (niggerjak)
>>106709487
>>106709481oohhhhh careful everyone. this mans hides national secrets in his prompts/ eat shit fuckface
>>106709481alright whatev fair enough>>106709492>>106709504THAT was not me by the way.
>He's triggered againEvery day you seethe here is a win for me and the other anons that separated from your mental illness and won.
>phone posting to not look schizo
>>106709509Well there's your proof, this guy has been seething at me for 3 years all because I told him to stop spamming the thread with slop worse than you see on /sdg/ today while trolling and messing with people. You can see examples of his poor handiwork in OP.
>>106709510>>106709521>>>/g/sdg
>>106709521slopmeister
>>106709510>Every day you seethe here is a win for meso that's your goal in life, to own the anonymous libs on 4chan?
>>106709539no it's to blogpost and share shitty diffusion gens
>>106709517He pulled this exact thing when a anon was making fun of him in his containment thread yesterday and does it so often it has no effect anymore. He's so autistic and ritualistic he does the same thing every day non stop>>106709539>seethes about me near daily even when I'm gone for monthsI dunno what to tell you
gtfo ranfagggg
harumpfff!!! I am the most important poster here and everyone should listen to what I have to say AT ALL TIMES! totally NOT a schizo so stop bullying me! t. ran
With comfyui, how can I start genning an image with one checkpoint, then switch to a different checkpoint?
how do I voice sex with my computer?Like is vibevoice censored?
>>106709567are u a homo?
Since he's having a melty I'm going to do something he's too autistic to do without getting exposed and post a gen. Progress on taming Chroma seeing some improvement
>>106709568basically just make a workflow that does it? you can pass the unfinished latent to the next sampler
>>106709575The reference audio has to have sexy sounds in it, then increase CFG 1.7+ and lower steps between 10-15
>>106709552>about mewho the fuck are you? smells like some insane main character syndrome
>>106709568link different model to next k-sampler?
>recycling the same memes and projection>why can everyone point me out>why is the thread I dedicate 15 hours of my worthless life necro bumping not getting any postGets the almonds activated, he's in such a diminished state and he just comes back for more punishment. Last post for you just keep seething until your caretaker tells you to shut up
>>106709600got example ?
>>106709485
>>106709350>>106708909How is it using 24gb VRAM if other people are using the same models on 16gb? >>106708959I'd rather fix that than download the FP8 scaled one (which I can't find, I can only see the Q ones)
Where's the qwen edit anon. Post workflow for drawing to realism pls, I fiddled with it a bunch yesterday but it's not even outputting semi-realistic anymore, just reprinting the same drawing. Someone save me from this hell, img2img with illustrious models are better at realism but dogshit in keeping the character consistent with the reference and always fuck eyes hands feet teeth and such
>>106709592>>106709610It gives an error about multiplying dimensions
>>106708799
Radiance is pretty cool but it makes my 4090 caps whine in a rhythm like an incoming text in the 2000s.
>>106709635
>>106709568Chain samplers?First one with the first model, second one with the second model, pass the latents (and maybe upscale in between)Note that the denoising of the second sampler needs to be below 1 to get meaningful results with this.
>>106708376>With current tech, we can get crowd (or richfag) funded local SOTA with <$10k.agghh!!! you fool. don't do that. don't give me hope
>>106709700
>>106709783balenciaga?
>>106709756>>106709624nice gens. now go back to your general.
>>106709674isn't illustrastious sdxl? You are using sd1.5 workflow
>>106709631>>106709575Just picked some random porn vid, would be better if I cleaned the audio but too lazy kekhttps://files.catbox.moe/5mimvk.mp3
>>106709783>>106709792hehe no
>>106709795thanks
>>1067096741. You are mixing ancient model with newer one (Though this shouldn't necessarily cause multiplying dimensions error?)2. 10 steps will just give shit results3. Lower denoising otherwise what you are doing is 100% pointless4. You are mixing a 1024p and 512p modelWith all due respect you probably need to learn more before trying quirky stuff like this.
>>106709783 >>106709700nice effect. the things people will do when they get the ability to do whole scenes of 30s-2min or so
>>106709709I think thats what I'm doing, I have one sampler with the first model, then I link it to a second sampler with the second model>>106709818Sorry, I don't know what that means
the girl in image1 is wearing the outfit of the girl in image2. make the image realistic.not bad
>>106709834increasing cfg works. Does vibevoice have prompt guide?
>>106709635Neat.
>>106708931Looks like you were basically right, it's not loading it as quantized (sorry if phrasing is wrong, I just wanna make funny image)I found a link to the FP8 model in the workflow itself so I'll download that, but I wonder if Wan2.2 is having the same issue. Any ideas on how to make it use the Q8 model properly?
>>106709925haven't seen one if there is.
kek, used ivy from soulcalibur as the swap source:
>>106708931>fp8 scaled is faster anyway.Not for hardware that lacks fp8 acceleration, which includes RDNA3.Nice AYYYYYMD cope though.
>>106709987if the model was lewder, would she keep the whip?
>>106710013the image source didnt have much of the whip but with a lora you can do anything desu
>>106709909miss this be local diffusion thread
>>106710038
In wan2.2, I'm getting a "ModuleNotFoundError" for decord despite having it installed in the venv. I'm not well-versed in Python so I'm hoping one of you might know what's happening.
>>106710021can be quite hard to train. at least the more creative whip swings. pretty cool regardless
Does your guys' GPU have a coil whine? My GPU whines when I generate AI images, AI chats, and when I Cycles GPU render in Blender. It's silent when I'm playing video games.
>>106710080ain't it python3 you need to call?
>>106710080Run pip check.Also>wan/bin/pipI don't know what's up with that so your venv might be broken.Try creating a venv (with uv or whatever), source venv/bin/activate and pip install -r requirements.txt. Then run python generate.py
>>106710097https://vocaroo.com/168rSoBGUnsk
>>106710097V-sync?Try a game that runs on a few hundred something fps.
>>106709795he's tryna back up his only friend nigbo whilst being unemployed xd
>>106710097I definitely had some audio interference when I got my 3090, and either my hearing has got worse or it stopped
>>106710103this has to be 11/10 nu-/g/ bait this has to be this just has to be>>106710080use miniconda. fuck venvs use miniconda. oh nooo my console bloat fuck you python is all bloat its all shit everything you do on a computer should be disposable USE MINICONDA FOR CUDA SHIT FORGE WILL SAVE YOUR ASS
>>106710113That was it, didn't activate the venv. Got a different error now, but at least I can work with this. Thanks.
what is the moderest wan2.2 workflow? Rentry talks about "old" wan2.2also, where are goofs for wan2.2???
>>106710150>what is the moderest wan2.2 workflow? Rentry talks about "old" wan2.2https://vb.lk/wan-2.2-comfyui-example-workflow-kijai-github>also, where are goofs for wan2.2???https://vb.lk/wan-2.2-gguf-huggingfaceretarded zoomer faggot
the man in image1 is wearing the outfit of the man in image2 and is wearing a black fedora, and holding a katana. keep his expression the same.literally edgemaster
>>106710173not clicking dat sheit
>>106710173MODS
>>106708328Not sure if that's the right thread to ask, but does anyone know how these AI covers get generated?I'm pretty sure the common sites disallow commercial music remixes, and frankly nothing I've tried sounds as "good" as these.https://www.youtube.com/watch?v=HIjdlLSOtvEhttps://www.youtube.com/watch?v=PeBkXhfchb0
>>106710173>vb.lkprobably an ad but that's still pretty cool stuff>>106710213retard. dumbass. cumslurper. twobit piece of shit.>>106710215it's fine, i clicked it. it basically yolo guesses the end of the link to something that makes sense
lara croft if netflix didnt make it:
Anyone tried this Forge fork? https://github.com/DenOfEquity/ersatzForgeI couldn't get it to run (python dependency), but the dev updates it daily since it's his personal build.Apparently, this is the original repo that Panchovix based ReForge2 on, and this one is still actively developed.
No I stick with comfyui
>>106709329I don't see where this generates the masks from the prompt
>>106710325Please. Please no more forks.
WHY DID NOBODY WARN ME THAT I HAD TO PUT A FUCKING "SAVE IMAGE" NODE OR ELSE THE IMAGE IN COMFY GETS DELETED!?!?
>he can't right click save on the preview node>he can't see his temp folderNGMI
>>106710009>>106708945good to know, it does seem like rdna3 lacks fp8 acceleration (at least I can't find anything saying it does), though it handles fp8 just fine of course.>>106709963I would just use the fp8 scaled model for wan as well. I have never had a good experience with gguf, probably won't until I upgrade to UDNA in the future.>>106710097yeah lmao, it's not so bad on the xtx, but back when I was using the 6800 it straight up sounded like it was screaming in pain
man I love AI.
>>106710423it was slop bro, dont cry about it. you'll gen better
>>106710433HELP ME!
>>106710423they're still on your temp folder though?ComfyUI\temp
>>106710423we honestly hate you
>>106710423You need another node for protest, nonie :3
>>106710467>the folder is empity
>>106710463RIGHT CLICK ON THE PIC IN THE PREVIEW NODE RETARD CLICK SAVE AS
>>106710467Wrong. He needs a node to enable the temp folder.
the anime girl is sitting at a computer typing. on her white crt monitor is the text "LDG" with a chibi version of Miku Hatsune below it.
>>106709930i'm cooli'm hipi'm with it
>>106710441>though it handles fp8 just fine of course.fp8 has less quality than int8 for diffusion, scaled or not.On Blackwell it is arguably worth is because it gets a big speed up with dedicated acceleration.Without that you are just degrading quality without any speed boost.I would try to get Q8 working if I were you.
>>106708376Who would in their right mind waste $10K just to entertain autists on /lmg/? It better makes some serious money.t. got the funds
>>106710510
VAE decode(IMAGE) ------->(IMAGE) Save ImageWHY THE HELL ISN'T MY IMAGE SAVING IN \ComfyUI\output!?!?!?!
>>106710556I bought a RTX 600 pro and use it mostly for gens that get posted in here and elsewhere on 4chan (I'm trying to use it for professional reasons too but failing at that).
>>106710589Please stop shitting up the thread with woahjak spam, thanks.
>>106710605HELP ME HELP ME WHY DON'T YOU HELP ME I SWITCHED TO YOUR DAMN COMFYUI SIDE, DON'T LEAVE ME ALONE IN THIS!
>>106710604>RTX 6000 proWhat's the use case of this for diffusion?I thought it excelled at MOE LLMs (still overpriced though), and was very overpriced and mediocre for anything else.
>HuMo & Chroma1-Radiance Native Support in ComfyUIUh where the comfy haters at?
>>106710605catjak faggot stfu
>>106710604Reminds me of this.
>>106710658original cutscenes & art when
>>106710658Prompt?
wan2.5 is api only right
NOW IT'S SAVING IMAGES FOR ME BUT I WANT TO CHANGE THE DIRECTORY, GET IT OUT OF COMFYHOW THE HELL DO I REDIRECT THIS!?!?!?!
>>106710739symbolic link or --output-directory "folder/path" in startup arguments
>>106710689Neat
Reminder that with the upcoming Hunyuan Image 80b, if you do not have a 96gb vram card you are not allowed to discuss the model. >>106703161If you cannot afford to upgrade to a 96gb+ card, you are a poorfaggot turdskinned larper who should stick with SaaS