>What is this?A local open weights music generator, like Suno and Udio.>Original repo (includes lora training)https://github.com/ace-step/ACE-Step-1.5>Comfyui guidehttps://docs.comfy.org/tutorials/audio/ace-step/ace-step-v1-5>Suno-like UIhttps://github.com/fspecii/ace-step-uiShare your gens.Keywords: music gen, local model, song gen, suno, udio, acestep, ace step
Is audio gen lighter than image/text gen or will it still rekt gpulet users?
>>108095115theres like a huge range of models for image gen. Its a bit heavier than SDXL models but lighter than like Flux. I am running the comfyui workflow with 4GB vram and the generation time is about 8x longer than the song duration.
how far are we from giving an ai a full album with corresponding lyrics and letting it generate more songs in the same style?
>>108095075>https://github.com/fspecii/ace-step-uilooks unironically like vibecoded trash.do we know if comfy is planning to implement more nodes?
>>108095139It works if you train a Lora. People have already trained loras with Michael Jackson, Linkin Park etc with success.
>>108095139you can do that now with the lora training feature. also supposedly the audio sounds a lot better too when you use a lora
>>108095115You need a 24gb GPU to run the biggest LLM text encoder together with the main DiT model comfortably, but if you disable the LLM, it works even on CPU-only for VRAMlets, but the output quality will be shittier.
>>108095174>>108095168do you give them snippets or full length mp3s?
>>108095285You can train Loras with full length tracks as long as your GPU doesn't OOM in the process.
>>108095306I'll bite, how many minutes of audio for a decent lora?
>>108095379Any full album (~11 songs) works
>>108095168
https://xcancel.com/bdsqlsz/status/2020432198210613708Based if true
>>108095075Can I use lora with ComfyUI yet?why is it broken?
>>108095560You have to convert the lora first. I had Claude to write a python script for me that converts it to a format Comfy accepts and it worked perfectly.You have to convert the keys like:new_key = k.replace("base_model.model.base_model.model.", "diffusion_model.decoder.")
https://voca.ro/1o8PRqN0Gbae
https://voca.ro/15rN76Zadfqu
>>108095536is LoKr another Lora replacement like Locon Dora etc?
>>108095768sounds like low quality mp3 but overall good
>>108096107Yes