Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106625151

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
AniStudio: https://github.com/FizzleDorf/AniStudio

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Seedream thread
hello, im a newbabretardfuckingidiotmongoloid who just started messing with onetrainer, is this the base model for sdxl i should download?https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/tree/main
>>106628597i wish you the best of luck on your quest, newbabretardfuckingidiotmongoloid !
what nag node do i use
>>106628619but you didnt answer the question AAAAAIIIIIIIIIIIIIEEEEEEEEEEEEEEEEEi'll figure it out ;)
Total ComfyCloud API Node Victory
>>106628594this scares the chroma foot faggot kek
>>106628640How does a 14B model need that much memory to run?
>>106628642computer, add a sonichu medalion without changing the rest of the image
>>106628594from the thumbnail, i thought it was a giant ribbed dildo.
For the lulz here's all 68 images from previous in a single collage.
>>106628650VRAM requirements increase to widen the SaaS moat. Do not let those filthy localhoards cross!
>>106628640
>64 × 180 GB = 11,520 GB
is this a joke or something?
>>106628640
>5B is 20 gigs
>14B needs 11TB??
>>106628640
I guess he means to serve every request? Because otherwise this makes zero sense.
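For scale, the arithmetic in the quotes above can be sketched as a rough weights-only estimate. The bytes-per-parameter values are assumptions about common precisions, not figures from the quoted post, and activations, text encoders and the VAE are ignored. Only the 64 × 180 GB product comes from the quote.

```python
# Weights-only memory estimate. Ignores activations, text encoder and VAE.
# The precisions below are assumed for illustration, not taken from the post.
BYTES_PER_PARAM = {"fp16/bf16": 2.0, "fp8": 1.0, "4-bit": 0.5}


def weights_gb(params_billion, bytes_per_param):
    """1e9 params * N bytes = N GB (decimal), so the 1e9 factors cancel."""
    return params_billion * bytes_per_param


def cluster_total_gb(num_gpus, gb_per_gpu):
    """Total memory across a group of identical GPUs."""
    return num_gpus * gb_per_gpu


if __name__ == "__main__":
    for name, bpp in BYTES_PER_PARAM.items():
        print(f"14B @ {name}: {weights_gb(14, bpp):.0f} GB of weights")
    print(f"64 x 180 GB = {cluster_total_gb(64, 180):,} GB")
```

Under these assumptions one copy of 14B weights at 16-bit is about 28 GB, which would fit the guess that the 11,520 GB figure is a cluster total rather than what a single copy of the model needs.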
>>106628669>>106628671Just stop thinking about it, you cant run it regardless so lets all just calm down and subscribe to ComfyUI API.
>>106628597
If you pick the SDXL preset, the field will be filled in automatically, and the first time you start training it will download the model automatically
>>106628691I mean poland and serbia are really white, but no one want to go there lol
>>106628622none theyre all snake oil
>>106628694
i already downloaded every file here
https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/tree/main
so i'll find out soon enough
>>106628642Is this the banana
>>106628704i installed it and it's not, it works as advertised. what i'm not understanding now is why you would use it instead of using one of the speed loras that let you use cfg>1, since nag basically doubles gen times. maybe i'm missing something though
>>106628744nope qwen + this https://civitai.com/models/1934100/anime-to-realism?modelVersionId=2189067
https://huggingface.co/fredconex/SongBloom-Safetensors
https://github.com/fredconex/ComfyUI-SongBloom
we got Suno at home?
https://files.catbox.moe/96i90x.flac
https://files.catbox.moe/olajtj.flac
>>106628662Lovely
Can you train the same lora with same settings and datasets and get different results or does retraining do nothing?
>>106628827
Not bad, from these samples it's better than Ace-Step 1.0 (will 1.5 ever be released?)
Wonder what the range of music is, and most importantly if it can be effectively finetuned with other music
>>106628865>Wonder what the range of music isyou can put a real music in there and it'll remix it, I find it fun to play with
>>106628873Does it require music input ?
>>106628895
it's not mandatory
hunyuan image looks better now in comfy https://github.com/comfyanonymous/ComfyUI/pull/9882
>>106628863
If you are talking about the same model, a training run with the same dataset and settings will make almost the same lora at the same epoch.
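A toy sketch of that claim, assuming nothing about any real trainer: a tiny linear fit with plain SGD, where the seed controls both the initial weights and the shuffle order. The function name `train`, the data and the hyperparameters are all made up for illustration. Real LoRA training is non-convex, so differences between seeds can be larger than what this toy shows.

```python
import random


def train(seed, epochs=200, lr=0.05):
    """Fit y = 3x + 1 with per-sample SGD; the seed sets init and shuffle order."""
    rng = random.Random(seed)
    xs = [-1 + 2 * i / 19 for i in range(20)]  # fixed toy "dataset"
    data = [(x, 3 * x + 1) for x in xs]
    w, b = rng.uniform(-1, 1), rng.uniform(-1, 1)  # seed-dependent init
    for _ in range(epochs):
        rng.shuffle(data)  # seed-dependent sample order
        for x, y in data:
            err = (w * x + b) - y
            w -= lr * 2 * err * x
            b -= lr * 2 * err
    return w, b


if __name__ == "__main__":
    print("seed 0:", train(0))
    print("seed 0 again:", train(0))  # same seed, same settings: identical
    print("seed 1:", train(1))        # different seed: very close, not guaranteed identical
```

With the same seed the two runs match exactly. With a different seed the toy lands on nearly the same weights, which is roughly the "almost the same lora" behaviour described above.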
does nag not work with chroma flash?
>>106628938Yeah same model. So it's just about resolution and scheduler?
>>106628733thank godsomeone finally trained a lora that i actually WANT (sure feels like forever ;D)>>106628642>he makes posts like this but gets mad at migusama ;3>>106628662n e a t !>>106628691ew!
>>106628931
>hey look Comfy I fixed your implementation of that model!
I bet 20 dollars he won't merge it, again
https://github.com/comfyanonymous/ComfyUI/pull/7965
>>106628931impressive, that mf saved HunyuanImage
more and more people realizing comfyui is no longer about local models
The man in the red tie raises his arms, and the various trays of fast food on the table in front of him float in the air.behold my power!
https://xcancel.com/LodestoneE621/status/1968687032605065528#m
>Peak GPU mem: 16,139 1,736 MB (on dummy mlp forward pass)
>Speed ratio: 0.99× (compute & comms perfectly interleaved)
GET OUT!
>>106628999I don't know how he managed to stay healthy after spending 80 years of his fatass life eating only McDonald's.
>>106628999that looks like something i shat out with sd1.5 circa 2023 what are you DOING nigger?
>>106628926Nice Huke style.
>>106629000So what’s the obvious drawback he is choosing to ignore? Because we know from chroma that there were many
>>106628982
No, it's only you, since all local models are supported, often within hours, and it even supports local models that aren't even close to being finished training.
There is a LOT to complain about when it comes to Comfy, but it's not local model support, which is stellar.
Go lie somewhere else
>>106629023Sorry meant this for rocketpajeet
>>106629041truth nuke, and I say this as someone who don't really like this autistic bitch
>>106629023the source image isn't very good/high quality.
>>106629046didn't notice him, pretend i quoted him too.>>106629053wan can take images from the 1900s and turn them into (masterpiece:1.5), its just (You)
after the botched implementation of hunyuan image and chroma, more and more people are switching to better UIs where output quality comes before model quantity
>>106629000Has the furry solved the 'too little vram' problem ?Big if true
>>106629053McDonald Trump
>>106629000as much as I question the furry's decisions, he honestly seems like a good researcher
>>106629000How is this gonna work when even DDR5 is glacially slow compared to gpu vram?
>>106629053the amount of salt this originally caused will always be so funny to me
>>106629053So this is it. This is the true power of Americans. My god..
>>106629088He's not a fucking researcher he just half bakes he saw in a paper and some guy on discord forwarded to him shit and never explains his reasoning before autistically moving on to the next snakeoil. Why am I the only person who sees this?
>>106629131>Why am I the only person who sees this?you aren't, haven't you noticed the amount of seething everytime he made a retarded move on chroma's training? kek
>>106629000isnt this what flash attention does? you can only move things from ram->vram so fast, i dont see how this will work
>>106629131You aren't
>>106629095It's funny because I don't know anyone in the entire history of ever who didn't love going to McDonalds after playing sports as a kid. Hell, even as an adult. Nobody wants to be sucking down on weird french shit after a big game.
https://github.com/comfyanonymous/ComfyUI/pull/9898
>Reduce Peak WAN inference VRAM usage
>The first git commit alone improves performance some and the second further increases it. I standardized on 1024x1024 for the image size and varied the frames. Before the changes the maximum number of frames it can handle is 49 and this increases it to 65 for my setup.
based
>>106629142
>WAN2.2 I2V 14B Q4_K_S GGUF + lightx2v 4steps LoRA (based on video_wan2_2_14B_i2v template) 1024x1024x61frames video generation
>wan q4
>61 frames
grim, also is 1024x1024 one of the "officially" listed resolutions? i dont think so
>>106629163>grim, also is 1024x1024 one of the "officially" listed resolutions? i dont think soit's not, I guess he just wanted to do a test
>>106629131>he just half bakes he saw in a paper and some guy on discord forwarded to him shit and never explains his reasoning before autistically moving on to the next snakeoilthat's what most professional researchers do
>>106629139fav timeline: finding out the supersize me guy gained weight and was destroying himself from being a wastoid\boozer not from eggmcmuffs >free unlimited mcdonalds when they first launched the app>i ate so much mcdond i should be dead>i lost around 3lbs kek eggmcmuff has real egg >;3good, simple, pure, fun times
what does that have to do with image generation? nb4 spergout
>>106629180They also document their shit.
>>106629193the images generated were about\of mcdond and the time trump was being cheekyits too bad about the face detail from far awayi would imagine things will finetune\tighten in the next few quarters\months
>>106629186>supersize me guyThe guy was an absolute fraud. Tbh, I think McDonald's gets way to bad a reputation for no real reason. Their chicken McNuggies are as close as you get gen to bare basic "Human food" and I don't mean that in a bad way.
>>106629135
No, Flash attention is all about keeping things in the GPU as much as possible.
This is about being as efficient as possible when you have to offload parts of a model to ram. This is not a new optimization concept, it's been around in most trainers and inference tools for quite a while; the difference is that this claims to have near zero overhead.
If the claims hold up, this would be enormous
>>106629186You cannot pay me enough to try an egg mcmuffin
>>106629220>If the claims hold up, this would be enormouslike, he's using a paper to make this node or something?
>>106629222you can make one at home with a cookie cutter and egg and 2 slices of cheddar...surely you eat breakfast sandwiches anon ;3
>>106629211>vegan girlfriend guy was a drunk fraudWOW hahahah
>>106629220
>No, Flash attention is all about keeping things in the GPU as much as possible
>This is about being as efficient as possible when you have to offload parts of a model to ram
These are not different. The highest efficiency possible IS to keep everything in vram as much as possible while offloading when you need to. Given that there is a 10x speed difference between gpu and ram, and that moving data from one to the other has a big cost too, this can't really be anything new. i doubt its even a better FA
>>106628594Miyazaki chill
>>106629142just tested it, went from 22.3gb of usage to 21.7gb, it's not much but I'll take it
>>106629252speed diff?
>>106629239bigot
>>106629261it's the same, but now I can make it slightly faster by offloading less to the ram I guess
>>106629243
>These are not different
Yes they are. Flash Attention optimizations are ALL about optimizing WITHIN the GPU. It has the benefit of fitting more into vram, but it has no strategy whatsoever for when it doesn't fit into vram.
Offloading optimizations are specifically for when it doesn't fit into vram.
So no, they are very different
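The offloading argument can be made concrete with a back-of-the-envelope timing model. This is only a sketch under stated assumptions, not how the linked node or any real trainer works: layers are uniform, the next layer's weights are prefetched from ram while the current layer computes, and all millisecond values are hypothetical.

```python
def step_time_ms(n_layers, compute_ms, transfer_ms, overlap):
    """Time for one pass over n uniform layers whose weights live in system ram.

    overlap=False: load a layer, then compute it, one after the other.
    overlap=True:  prefetch layer i+1 while layer i computes (double buffering),
                   so only the slower of the two is paid per layer.
    """
    if not overlap:
        return n_layers * (compute_ms + transfer_ms)
    # The first transfer and the last compute cannot be hidden behind anything.
    return transfer_ms + (n_layers - 1) * max(compute_ms, transfer_ms) + compute_ms


if __name__ == "__main__":
    # Hypothetical numbers: 40 layers, 20 ms to copy one layer over the bus.
    for compute_ms in (40.0, 5.0):
        resident = 40 * compute_ms  # everything already in vram
        serial = step_time_ms(40, compute_ms, 20.0, overlap=False)
        hidden = step_time_ms(40, compute_ms, 20.0, overlap=True)
        print(f"compute {compute_ms} ms/layer: resident {resident}, "
              f"serial offload {serial}, overlapped offload {hidden}")
```

In this model overlapped offloading is nearly free only when each layer computes for longer than its weights take to transfer. When the transfer is the slower side, the ram-to-vram copy sets the pace, which is the concern raised earlier about slow system memory.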
>>106629131Anyone who has ever trained the exact same NSFW concept dataset in a lora at both 512x512 and 1024x1024 on Flux for the same number of epochs is aware that the rate of anatomy errors will always be drastically higher with the 512x512 one unless you actually inference at 512x512. Chroma simply didn't get anywhere remotely close to enough training at 1024x1024.
>>106629272
You're right, I must have confused FA with some other tech I read about a long time ago
>ACK!
>>106629327BASED
>>106629285
More HD training would have been exponentially more expensive, but I'm not sure it's that. The HD version is fucky and slops prompts, but the anatomy is noticeably better than Base. Base is still better because you can fix a fucked hand with more steps and better prompting.
Is there an ideal resolution ratio I should set for WAN videos to not come out fuzzy as shit or otherwise lose their shit in Comfy gens?
>>106629233I'll lose the cheese and double-side fry the eggalso streaky baconcheese and egg, especially cheddar, do not mix imho
>slow mo video
>>106629327fk yeah
>>106629303Flux Krea full or fp8?
>>106629351just dont use anything below q8 and it shouldnt be a big problem, but anyway, 720x1280 or 1280x720
wan is slowly making the troon more female
i take back what i said many threads ago, i in fact love chroma again and just had a workflow skill issue. not only that but chroma is really good at inpainting.
i would post an example but my gens are too strong for you, proompter.
>>106629327
>>106629373kek, it's truebased Wan
>>106628754>since nag basically doubles gen timesno it doesn't??
>>106628754NAG is a buffed neg prompt, not a cfg1 hack.
>>106629457>not a cfg1 hack.it works for models with cfg 1 though
>>106629465working with=/=enabling
>SRPO is 1st on the trending page
>HunyuanImage isn't even on the list
I hope Tencent is gonna learn from that and will give us kino next time
>>106629559SRPO is relevant only because of the unfucking method. It's literally just flux without it.
>>106629620they used to be SO mean to me for showing panties\chonies ;3>>106629404>>106629373>>106629327the image saddens me every time i gaze upon it.,..>>106629352im more of a softboiled\poached kinda guy personally hehe
>>106629639can you leave
>>106629641Can you ? Someone who offers absolutely nothing to these threads
>>106629559Yes I will take your leaderboard b8
>>106629641sunset complete friendi'll miss ya <333>love one another>read the gospel dailyTHE END D R A W S N E A R . U S. A L L .
>>106629689no :(
>>106628594Based SaaS API node EnjoyerAs our master and Sensei Comfy, we adapt and know how to enjoy our local SaaS technology.Welcome to the future, welcome to: /sldg/ - SaaS Local Diffusion General
>>106629620really nice style
>>106629212badass
>>106629131You are a NAZIYou are a PEDOYou are a SCHIZOYou have to KYSYou are the cancer of /ldg/
>>106628975To be clear, right is only less slop. It is disingenuous to call it trvesovl.
be the change you want in the world, post milfs
>>106629854Are you sure?NowRight now?Is it time?
>>106628594Giga basedTwo days ago I moved to /sdg/ much better thread quality, there are schizos but at least it's more diffusion oriented, this thread is like a church or sect. Example threads >>106627772 >>106627168 >>106624229They are already various anons from here dual posting,
>>106629894
>random colorful nonsense pictures yay :D
the absolute state
>>106629917brilliant
>>106629894Yes, you're absolutely right... /sdg/ is much healthier and more fun/sdg/ discuss local and cloud without discriminating and with a more open mind
the man holds up a white sign saying "BUY SKYRIM, OR ELSE!"
>>106629929What happens is that they actually love AI diffusion and technology in general. They're not seething faggots who censor people, call them schizos, or shill trash models like Chroma just because it's local.
>>106629903you should try some of these with sneedream
>>106629854
>>106629932Kino. Prompt?
>>106629949oof... those seams and the green line
>>106629949whut model? lumina?
>>106629941Yes I'm /sdg/ now an anon is genning a workflow that mixes API nodes and Qwen image editor. It's great what you can achieve mixing local and cloud technology. And most importantly without ideological prejudices.
>>106629950
by Ansel Adams, , orbitdawn on Titan
Steps: 28, Sampler: Euler, Schedule type: Simple, CFG scale: 1.1, Distilled CFG Scale: 3.5, Seed: 3797686900, Size: 1472x712, Model hash: 4610115bb0, Model: flux1-dev, Version: f2.0.1v1.10.1-previous-669-gdfdcbab6, Module 1: ae, Module 2: clip_l, Module 3: t5xxl_fp16
>>106629959
tfw you find a new artist to train on
>>106629966ty
>get new gpu>ecsatic at it just works >>106629940 in reforge>jump to comfyui after needing to wait a little while longer with the hype building up>errors out the fucking anal cunt over how i tried to go about installing sageattention/triton>finally get around them(?)>now errors out the wazoonus about ??? in every wan2.2 workflowi now understand why anons are driven to schizophrenia over the mere thought of a new UI this is unusable dogshit
>>106629983You forget the most important thing, people gate keep their workflows and don't help
>>106629959Chroma testing my lora epochs
>>106629997Have you tried asking for help in /sdg/?
>>106629955
>>106629949
yeah, protip. if you have the VRAM, just don't use ultimate upscale. use tile or blur controlnet and upscale directly in the intended resolution.
I'm feeling generous so here's the workflow for this, the workflow works for any art style if you use the right model and tweak accordingly:
https://files.catbox.moe/s9qkzc.png
>>106629983like HEH how does it get this busted?!>>106629997there's plenty of already working workflows out there, its niggerware comfyui or trannyware python that decides to waste a bit more of your limited lifespan
>>106630006>blur controlnetthats a new one for me thanks anon
>>106630004No, never, in fact I've never gone, they told me they're more oriented towards cloud and SaaS fagging.
>>106630022Oh no! They told you wrong anon! Actually we're super versatile here! We have anons generating with Chroma >>106629622 also anons generating anime with WAI then with See Dream >>106629729 and some animate their videos with WAN! >>106624803We have really fun here!Guess who else is here? >>106629141
>>106629626How many images do you usually like to have for your LoRAs?
just to be clear thats not a wan gen kek its anidiff
>>106630006>error in imageCatbox dead again?
>>106630049Anon you like anime? Look the same person is here!! >>106620604 maybe you can ask him the same question there!
>>106630070
>>106630006
shit, here https://litter.catbox.moe/55sskg5uohf7vjok.png
>>106630075outstanding vistas
>>106630130He is also in /sdg/!
Wan-Animate page up.>https://humanaigc.github.io/wan-animate/
>>106630149animatorbros.............................................................its over
>>106630149...
>>106630083I don't use comfy
>>106630177lol, have fun with broken gradio trash then
>>106630193Meds onigai
So prompting in French sort of works for Chroma, but you need to set your cfg ~40 to 60% higher than normal and the results will not be as good for the most part, plus it still needs a little bit of English guidance in there to not turn out like a messy SD1.5 gen. This is obviously not what Chroma was trained to do and is not really an optimal prompting strategy, but it was fun to try. Apologies to anyone who actually speaks French if they should happen to read my very translator-plus-chatbot-assisted prompt, I ofc do not speak French>Photo floue du plus beau décolleté de ma cousine Hélène, 20 ans, à la maison de campagne du Lac Léman, 2001, debout au bord du lac. Elle est canon avec des seins incroyables en maillot de bain deux-pièces! [hot panting emoji] cute college girl up close
>>106630149
>https://huggingface.co/Wan-AI/Wan2.2-Animate-14B
Now it's out.
>>106630149imagine what anons will do with this. taking videos of themselves doing disgusting things
>>106630166wasnt trained on unique ghibli style, thats def the worst vid in the otherwise very impressive examples
the man on the right shoots a black pistol at the man on the left, who falls to the floor. the blue text at the bottom is unchanged.
>>106630204my french gf
Holyshit I dont believe it, is that radial attention getting real updates?
>>106630149Huh, this looks kind of good? No "fun" in the name either.
t minus 24 hours until someone recreates goatse but with an anime character
Need to update the denoising on high res fix to preserve the style
>>106630140No, I'm not.
>>106630166looks good
>>106630149Animators on suicide watch
xibros... are we tired from winning?