Discussion of Free and Open Source Text-to-Image/Video Models and UIPrev: >>106625151https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GPAniStudio: https://github.com/FizzleDorf/AniStudio>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scripts/tree/sd3https://github.com/derrian-distro/LoRA_Easy_Training_Scriptshttps://github.com/tdrussell/diffusion-pipe>WanXhttps://comfyanonymous.github.io/ComfyUI_examples/wan22/https://github.com/Wan-Video>Chromahttps://huggingface.co/lodestones/Chroma1-BaseTraining: https://rentry.org/mvu52t46>Neta Luminahttps://huggingface.co/neta-art/Neta-Luminahttps://civitai.com/models/1790792?modelVersionId=2122326https://neta-lumina-style.tz03.xyz/>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girlTag Explorer: https://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbours>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
Seedream thread
hello, im a newbabretardfuckingidiotmongoloid who just started messing with onetrainer, is this the base model for sdxl i should download?https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/tree/main
>>106628597i wish you the best of luck on your quest, newbabretardfuckingidiotmongoloid !
what nag node do i use
>>106628619but you didnt answer the question AAAAAIIIIIIIIIIIIIEEEEEEEEEEEEEEEEEi'll figure it out ;)
Total ComfyCloud API Node Victory
>>106628594this scares the chroma foot faggot kek
>>106628640How does a 14B model need that much memory to run?
>>106628642computer, add a sonichu medalion without changing the rest of the image
>>106628594from the thumbnail, i thought it was a giant ribbed dildo.
For the lulz here's all 68 images from previous in a single collage.
>>106628650VRAM requirements increase to widen the SaaS moat. Do not let those filthy localhoards cross!
>>106628640>64 × 180 GB = 11,520 GBis this a joke or something?
>>106628640>5B is 20 gigs>14B needs 11TB??
>>106628640I guess he means to server every request? Because otherwise this makes zero sense.
>>106628669>>106628671Just stop thinking about it, you cant run it regardless so lets all just calm down and subscribe to ComfyUI API.
>>106628597If you pick the SDXL preset the field will be automatically filled and when you start training the first it will automatically download the model
>>106628691I mean poland and serbia are really white, but no one want to go there lol
>>106628622none theyre all snake oil
>>106628694i already downloaded every file herehttps://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/tree/mainso i'll find out soon enough
>>106628642Is this the banana
>>106628704i installed it and it's not, it works as advertised. what i'm not understanding now is why you would use it instead of using one of the speed loras that let you use cfg>1, since nag basically doubles gen times. maybe i'm missing something though
>>106628744nope qwen + this https://civitai.com/models/1934100/anime-to-realism?modelVersionId=2189067
https://huggingface.co/fredconex/SongBloom-Safetensorshttps://github.com/fredconex/ComfyUI-SongBloomwe got Suno at home?https://files.catbox.moe/96i90x.flachttps://files.catbox.moe/olajtj.flac
>>106628662Lovely
Can you train the same lora with same settings and datasets and get different results or does retraining do nothing?
>>106628827Not bad, from these samples it's better than Ace-Step 1.0 (will 1.5 ever be released?)Wonder what the range of music is, and most importantly if it can be effectively finetuned with other musc
>>106628865>Wonder what the range of music isyou can put a real music in there and it'll remix it, I find it fun to play with
>>106628873Does it require music input ?
>>106628895it's not mendatory
hunyuan image looks better now in comfy https://github.com/comfyanonymous/ComfyUI/pull/9882
>>106628863If you are talking about the same model, a training run with the same dataset will make pretty much almost the same lora at the same epoch.
does nag not work with chroma flash?
>>106628938Yeah same model. So it's just about resolution and scheduler?
>>106628733thank godsomeone finally trained a lora that i actually WANT (sure feels like forever ;D)>>106628642>he makes posts like this but gets mad at migusama ;3>>106628662n e a t !>>106628691ew!
>>106628931>hey look Comfy I fixed your implementation of that model!I bet 20 dollars he won't merge it, againhttps://github.com/comfyanonymous/ComfyUI/pull/7965
>>106628931impressive, that mf saved HunyuanImage
more and more people realizing comfyui is no longer about local models
The man in the red tie raises his arms, and the various trays of fast food on the table in front of him float in the air.behold my power!
https://xcancel.com/LodestoneE621/status/1968687032605065528#m>Peak GPU mem: 16,139 1,736 MB (on dummy mlp forward pass)>Speed ratio: 0.99× (compute & comms perfectly interleaved)GET OUT!
>>106628999I don't know how he managed to stay healthy after spending 80 years of his fatass life eating only McDonald's.
>>106628999that looks like something i shat out with sd1.5 circa 2023 what are you DOING nigger?
>>106628926Nice Huke style.
>>106629000So what’s the obvious drawback he is choosing to ignore? Because we know from chroma that there were many
>>106628982No, it's only you, since all local models are supported, often within hours, and it even supports local models that aren't even close to being finished trainingThere is a LOT to complain about when it comes to Comfy, but it's not local model support, which is stellarGo lie somewhere else
>>106629023Sorry meant this for rocketpajeet
>>106629041truth nuke, and I say this as someone who don't really like this autistic bitch
>>106629023the source image isn't very good/high quality.
>>106629046didn't notice him, pretend i quoted him too.>>106629053wan can take images from the 1900s and turn them into (masterpiece:1.5), its just (You)
after the botched implementation of hunyuan image and chroma, more and more people are switching to better UIs where output quality comes before model quantity
>>106629000Has the furry solved the 'too little vram' problem ?Big if true
>>106629053McDonald Trump
>>106629000as much as I question the furry's decisions, he honestly seems like a good researcher
>>106629000How is this gonna work when even DDR5 is glacially slow compared to gpu vram?
>>106629053the amount of salt this originally caused will always be so funny to me
>>106629053So this is it. This is the true power of Americans. My god..
>>106629088He's not a fucking researcher he just half bakes he saw in a paper and some guy on discord forwarded to him shit and never explains his reasoning before autistically moving on to the next snakeoil. Why am I the only person who sees this?
>>106629131>Why am I the only person who sees this?you aren't, haven't you noticed the amount of seething everytime he made a retarded move on chroma's training? kek
>>106629000isnt this what flash attention does? you can only move things from ram->vram so fast, i dont see how this will work
>>106629131You aren't
>>106629095It's funny because I don't know anyone in the entire history of ever who didn't love going to McDonalds after playing sports as a kid. Hell, even as an adult. Nobody wants to be sucking down on weird french shit after a big game.
https://github.com/comfyanonymous/ComfyUI/pull/9898>Reduce Peak WAN inference VRAM usage>The first git commit alone improves performance some and the second further increases it. I standardized on 1024x1024 for the image size and varied the frames. Before the changes the maximum number of frames it can handle is 49 and this increases it to 65 for my setup.based
>>106629142>WAN2.2 I2V 14B Q4_K_S GGUF + lightx2v 4steps LoRA (based on video_wan2_2_14B_i2v template) 1024x1024x61frames video generation>wan q4>61 framesgrim, also is 1024x1024 one of the "officially" listed resolutions? i dont think so
>>106629163>grim, also is 1024x1024 one of the "officially" listed resolutions? i dont think soit's not, I guess he just wanted to do a test
>>106629131>he just half bakes he saw in a paper and some guy on discord forwarded to him shit and never explains his reasoning before autistically moving on to the next snakeoilthat's what most professional researchers do
>>106629139fav timeline: finding out the supersize me guy gained weight and was destroying himself from being a wastoid\boozer not from eggmcmuffs >free unlimited mcdonalds when they first launched the app>i ate so much mcdond i should be dead>i lost around 3lbs kek eggmcmuff has real egg >;3good, simple, pure, fun times
what does that have to do with image generation? nb4 spergout
>>106629180They also document their shit.
>>106629193the images generated were about\of mcdond and the time trump was being cheekyits too bad about the face detail from far awayi would imagine things will finetune\tighten in the next few quarters\months
>>106629186>supersize me guyThe guy was an absolute fraud. Tbh, I think McDonald's gets way to bad a reputation for no real reason. Their chicken McNuggies are as close as you get gen to bare basic "Human food" and I don't mean that in a bad way.
>>106629135No, Flash attention is all about keeping things in the GPU as much as possibleThis is about being as efficient as possible when you have to offload parts of a model to ram, this is not a new concept optimization, it's been around in most trainers and inference tools for quite a while, the difference is that this claims to have near zero overheadIf the claims hold up, this would be enormous
>>106629186You cannot pay me enough to try an egg mcmuffin
>>106629220>If the claims hold up, this would be enormouslike, he's using a paper to make this node or something?
>>106629222you can make one at home with a cookie cutter and egg and 2 slices of cheddar...surely you eat breakfast sandwiches anon ;3
>>106629211>vegan girlfriend guy was a drunk fraudWOW hahahah
>>106629220>No, Flash attention is all about keeping things in the GPU as much as possible>This is about being as efficient as possible when you have to offload parts of a model to ramThese are not different, the highest efficiency possible IS to keep everything in vram as much as possible while offloading when you need to, given the speed of the gpu vs ram being x10 difference while the time to move from one to another has a big cost too, meaning this can't really be anything new, i doubt its even a better FA
>>106628594Miyazaki chill
>>106629142just tested it, went from 22.3gb of usage to 21.7gb, it's not much but I'll take it
>>106629252speed diff?
>>106629239bigot
>>106629261it's the same, but now I can make it slightly faster by offloading less to the ram I guess
>>106629243>These are not differentYes they are, Flash Attention optimizations are ALL about optimizing WITHIN the GPU, it have the benefit of fitting more into vram but it have no strategy whatsoever for when it doesn't fit into vramOffloading optimizations are specifically for when it doesn't fit into vramSo no, they are very different
>>106629131Anyone who has ever trained the exact same NSFW concept dataset in a lora at both 512x512 and 1024x1024 on Flux for the same number of epochs is aware that the rate of anatomy errors will always be drastically higher with the 512x512 one unless you actually inference at 512x512. Chroma simply didn't get anywhere remotely close to enough training at 1024x1024.
>>106629272You're right, I've must have confused FA with some other tech I read about a long time ago
>ACK!
>>106629327BASED
>>106629285More HD training would have been exponentially more expensive, but I'm not sure it's that. The HD version is fucky and slops prompts, but the anatomy is noticably better than Base. Base is still better because you can fix a fucked hand with more steps and better prompting.
Is there an ideal resolution ratio I should set for WAN videos to not come out fuzzy as shit or otherwise lose their shit in Comfy gens?
>>106629233I'll lose the cheese and double-side fry the eggalso streaky baconcheese and egg, especially cheddar, do not mix imho
>slow mo video
>>106629327fk yeah
>>106629303Flux Krea full or fp8?
>>106629351just dont use anything below q8 and it shouldnt be a big problem, but anyway, 720x1280 or 1280x720
wan is slowly making the troon more female
i take it back what i said many therads ago, i in fact love chroma again and just had aworkflow skill issue. not only that but chroma is really good at inpainting.i would post an example but my gens are too strong for you, proompter.
>>106629327
>>106629373kek, it's truebased Wan
>>106628754>since nag basically doubles gen timesno it doesn't??
>>106628754NAG is a buffed neg prompt, not a cfg1 hack.
>>106629457>not a cfg1 hack.it works for models with cfg 1 though
>>106629465working with=/=enabling
>SPRO is 1st on the trending page>HunyuanImage isn't even on the listI hope Tencent is gonna learn from that and will give us kino next time