80b EditionDiscussion of Free and Open Source Text-to-Image/Video Models and UIPrev: >>106706484https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scripts/tree/sd3https://github.com/derrian-distro/LoRA_Easy_Training_Scriptshttps://github.com/tdrussell/diffusion-pipe>WanXhttps://comfyanonymous.github.io/ComfyUI_examples/wan22/https://github.com/Wan-Video>Chromahttps://huggingface.co/lodestones/Chroma1-BaseTraining: https://rentry.org/mvu52t46>Neta Luminahttps://huggingface.co/neta-art/Neta-Luminahttps://civitai.com/models/1790792?modelVersionId=2203741https://neta-lumina-style.tz03.xyz/>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girlTag Explorer: https://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbours>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
TELL ME NUNCHAKUWHERE IS THAT WAN QUANTWHERE IS THAT QWEN LORA SUPPORT
>>106708345>WHERE IS THAT QWEN LORA SUPPORThere?https://huggingface.co/nunchaku-tech/nunchaku-qwen-image-edit-2509
Blessed thread of frenship
>>106708106Alright here's some (retarded) napkin math.HDM is 340m params for $650.Lumina2 and sd35m are both ~2.5b params. They both show strong capabilities at their size while suffering from training data and architecture problems, so an HDM-like model with its superior architecture and better training data could easily become local SOTA at 2.5b.$650*2.5b/340m = $4779HDM is undertrained according to the author, so let's double that budget. I suspect we hit severe diminishing returns with a budget >$10k. With current tech, we can get crowd (or richfag) funded local SOTA with <$10k. Keep in mind there are other optimizations that aren't used in HDM, so if one or two other 10x optimizations can be incorporated, we are in very good shape.>>106708232SDXL anime models like noob are the most fun by far.I've mostly stopped non-anime SDXL models unless I need a controlnet or something. I still use SDXL for really big upscales since it has the tile controlnet and can fit the whole gen in memory while not taking too long. There are some impressive non-anime SDXL checkpoints out there, but the prompt comprehension issues and lack of art style knowledge hurt it significantly compared to Chroma. >>106708267what case/mobo do you use for this? I've been wondering if it's possible to fit a dual gpu setup in a mid tower these days
>>106706788It got that soft/grubby SRPO look... they really need to show it doing something a little more impressive than stock photography. Maybe a complex interaction or something, I don't know.
https://xcancel.com/SD_Tutorial/status/1970518843048272293#mwhat's happening here? why SRPO is completly fucked on fp8?
>>106708383Sex with Jenny
>>106708384Maybe like 3-4 steps?
>>106708376Where's the dataset coming from?
ÆÜGH Icum>Error: Maximum file size allowed is 4MBhttps://files.catbox.moe/f7x5jz.mp4
>>106708429ask chatgpt to make a script that compress your video to 4mb
>>106708415literally just use the danbooru api if you just want a booru model
>>106708376I have a full size case with a full sized motherboard with two GPU slots, I prefer extra space for easy moving than seeing how tight I can fit everything.
>>106708415catbox please?for the anime side of things, a booru dataset augmented with NL captions (but not replacing the tags).for the rest, desu, I don't see why something like LAION wouldn't work. it was fine for sd1.5. then augment it with some better captions, aesthetic selection.. use srpo or whatever too>>106708486damn... maybe I should give up on my dual gpu idea. don't really want a behemoth on my desk.
>>106708413All computer metrics are bunk in the scheme of generative models. The only real way to measure a model is promptability, level of censorship, breadth of conceptual and stylistic capacity, and ultimately distilled as "usability". Right now all the metrics are biased metrics essentially designed around making stock images but really if you want to see how shit aesthetics filtering is, just take 20k images from any booru and see which ones standard aesthetics metrics consider low quality.
>>106708528honestly i don't blame you for assuming gpu sizes either way, you really don't have a full idea of how huge or small gpus are until you actually get one in your hands.then you totally forget once its been in your system for a year+.dual gpu'ing is not for the faint of heart. or wallet for that matter.
>>106708528Why would you want a space heater on your desk? Two 4090s running full tilt even throttled gets quite toasty.
trying to find a method to stop style swing but it's really bad in some cases especially at random seeds that tries to be irl during the first pass. I'm giving this model one last chance before dumpstering it
>>106708358Lora support anon, that is still being worked on. You can use model fine, though I think they messed up the lightning merge in that version
>>106708429why did that video need to be 11 seconds if its the same motion, REEETARD
>>106708528I use a basic old raidmax smilodon case from like 15-20 years ago and it fits modern GPUs fine
having a danbooru data set with the tags grouped by subjects, background and interactions would make even SDXL based models exponentially betterwe just need a powerful VLM like Gemini but uncensored
>>106708703nai seems to do something like thathttps://docs.novelai.net/en/image/multiplecharactershowever, nai's implementation kind of looks like it's just calling on a regional prompting addon at least some of the time.reminder that the forge regional prompting addon is able to generate region masks FROM THE PROMPT ITSELF, and we still don't have a comfyui equivalent even though this kicks ass:https://github.com/hako-mikan/sd-webui-regional-prompter?tab=readme-ov-file#region-specification-by-prompt-experimental
>>106708799how could you gen this absolute filth?reported, filtered, snitched on, sent the batsignal
>>106708827I prompted black monolith lol
Is there a straightforward way to get SD working on linux with an AMD card? I'm following the wiki installation and I kept running into issues
>>106708799sovl and kino
>>106708799That's the most disgusting thing i've ever seen on 4chan, and I'm an oldfag. You should be ashamed.
>>106708415>>106708528NTA and also asking for catbox, thanks
>>106708799The best image posted in a long while
>>106708328I'm in the OP
>gm
I've set up Qwen Image Edit but it's maxing out my VRAM. I've tried launching Comfy with and without vram saving parameters. Is the Q8 model too big for a 3090 (24gb vram)
>>106708883
>>106708883..no? i have 16gb vram and can use q8 fine
>>106708844>make venv>follow these instructions https://github.com/comfyanonymous/ComfyUI?tab=readme-ov-file#amd-gpus-linux-onlyif that doesn't work, you will have to tell us more. card, distro, UI you're attempting to use. these steps worked on my old rx 6800 and my 7900 xtx.>>106708883try the fp8 scaled version instead. ggufs have a broken implementation on some cards, I have a similar issue on my 7900 xtx. fp8 scaled is faster anyway.
>>106708872>>106708892You're a real piece of shit debo, it's also obvious that you like to bring up old irrelevant drama from other threads to force some conflict.
>>106708931Can't get it to work on arch with a 9070
damn nigbo wthelly
>>106708931>try the fp8 scaled version insteadIt's not faster on a 3000; on 4000/5000 it is.
>>106708944https://en.wikipedia.org/wiki/Nigbo_language
>>106708931>>106708883Do I need to run that pytorch step if I followed the Auto Installation? I can try it but idk if it will do anything
>2025>finetuned sdxl is still the best local model for realism and anime imageswhen are we gonna get an unslopped 4b-5b model with a permissive license?chroma is slow dogshit that looks badseedream is the only new model that looks good but it's NOT local
>>106708964Can we not do this ritual post?
>>106708650based retard doesn't understand what the pingpong effect is (or that it's a setting in those nodes)anyway, GAAAHHH THE OOM IS EVERY GEN NOW CUMFARTUI YOU'RE PISSING ON MY LEG AND TELLING ME IT'S RAINING!https://files.catbox.moe/dyugav.mp4
>>106708959reading comprehension anon. that pytorch setup is for the AMD linux user, not you. you should try the fp8 model instead of q8.
>>106708941>>106708931Nevermind, got it to work with the manual installDon't know why I bothered with the pip comfy-cliThanks
>>106708959>>106708883oh yeah and if you're ever OOMing during VAE operations, replace VAE encode/decode with TILED VAE encode/decode>>106708999nice
>>106708844I think I saw a new beta version of rocm pytorch released today, in theory getting that should make things really straightforward
>>106708772>and we still don't have a comfyui equivalent>https://github.com/asagi4/comfyui-prompt-control/blob/master/doc/regional_prompts.md
>>106708772sounds like DAAM -> Latent Couple
>>106708883no? I use q8 on a 4080 with 16gb. it should be fine.
>>106708772Dude this is better https://github.com/Haoming02/sd-forge-couple
>>106709346what's with the obsession with that random dude
>>106709357It's hard work to get to lolcow status
>>106709357It's obviously cause he worked at blizzard duh. Real answer the dude went on a weird tirade against getting game developers to create offline versions of games when they EOS them.
https://huggingface.co/nunchaku-tech/nunchaku-qwen-image-edit-2509
https://www.reddit.com/r/StableDiffusion/comments/1nravcc/nano_banana_vs_qwen_image_edit_2509/damn the lightning lora really sucks
>>106709355NTA but thanks for reminding me of forge couplewould you pls catbox that image?
>>106709409Ahh..man. Coping they will make a newer lightning LoRA.
>>106709452I don't have it on this computer, this was during my laptop era
>>106709465i mean just any forge couple'd gen will do, but understandable
>>1067094608step one works fine in general for qwen edit v2.
>>106709472I'm not doing anything special and I don't give catboxes. It's a long story that becomes evident whenever you see the schizo screech the name ran.
>>106709481ran is the schizo
>>106709475
>106709486>time wasting postMore wheelchairs for you then
>>106709492thanks schizo (niggerjak)