/g/ - suno at home: ace step 1.5 - Technology


Anonymous
suno at home: ace step 1.5 02/03/26(Tue)13:11:05 No.108051632

File: Screenshot from 2026-02-0(...).png (245 KB, 1007x1076)

Comfyui has Ace Step 1.5

Currently broken on AMD, apparently, but if you have nvidia, you can gen songs locally, and with lots of good features.

https://www.reddit.com/r/comfyui/comments/1quzawn/acestep_15_is_now_available_in_comfyui/

Anonymous
02/03/26(Tue)13:12:26 No.108051644

File: Screenshot from 2026-02-0(...).png (24 KB, 610x432)

>>108051632
the error I'm getting, with amd.

Anonymous
02/03/26(Tue)13:15:55 No.108051680

Yeah so the issue is that the audio vae of comfyui is borked on AMD.

as usual, comfyui sucks, it sucks in a very Windows-like way.

using --cpu-vae

super slow. I thought this problem was fixed, but nooo.

Anonymous
02/03/26(Tue)13:20:10 No.108051713

>>108051680
so far, 6 minutes of vae/clip running on cpu...

Anonymous
02/03/26(Tue)13:35:18 No.108051818

>>108051713
cpu vae takes a huge amount of time with this. too bad the comfyui guy refuses to buy an amd card, so his code is constantly broken on amd.

Anonymous
02/03/26(Tue)13:36:48 No.108051832

fwiw, the real problem is hip is incomplete when it comes to audio.

but this is a solved problem for vae anyway, so it shouldn't be happening, don't make me vibe code a comfyui fork.

Anonymous
02/03/26(Tue)14:08:06 No.108052051

>>108051632
MY FIRST GEN

https://files.catbox.moe/hqkvul.mp3

btw, has anyone ever gotten the gradio crap to work with amd?

Anonymous
02/03/26(Tue)14:13:34 No.108052106

I nuked the gradio stupid shit. what a pantload

Anonymous
02/03/26(Tue)14:24:22 No.108052197

File: Screenshot from 2026-02-0(...).png (70 KB, 967x609)

God damn github, what a shit website, I searched it like 10 times and it pukes like it's being ddos'd. massive trash.

Anonymous
02/03/26(Tue)14:26:30 No.108052218

>>108052051
gradio didn't work on an older nvidia card with 8GB either, and I was using the lightest model, 0.6B

Anonymous
02/03/26(Tue)14:34:02 No.108052275

>>108052218
Yeah, the gradio / uv thing is trash.

Anonymous
02/03/26(Tue)14:35:14 No.108052287

>>108052275
Their implementation of it is. Imma try "comfyui" now, I hate how this dog shit ui became the standard

Anonymous
02/03/26(Tue)14:50:58 No.108052405

no luck solving

probability tensor contains either `inf`, `nan` or element < 0

(ie I still have to use cpu vae - that or buy an s tier nvidia gpu to just do stuff that comfyui is incapable of)
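For anyone wondering what that message is actually complaining about: as far as I know it is the check PyTorch's sampling does on a probability tensor before drawing from it. Here is a minimal plain-Python sketch of the same three conditions, just to show what "bad" means. `find_bad_probs` is a made-up helper name, not anything from ComfyUI or PyTorch, and it says nothing about why the values go bad on AMD.

```python
import math


def find_bad_probs(probs):
    """Return (index, value) pairs a sampler would reject: NaN, inf, or negative."""
    bad = []
    for i, p in enumerate(probs):
        if math.isnan(p) or math.isinf(p) or p < 0:
            bad.append((i, p))
    return bad


if __name__ == "__main__":
    # Hypothetical values, only for illustration.
    sample = [0.2, float("nan"), 0.5, -0.1, float("inf")]
    for i, p in find_bad_probs(sample):
        print(f"entry {i} is invalid: {p}")
```

If even one entry fails the check, the whole sampling step errors out, which matches the all-or-nothing behavior described above.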

try to keep the thread alive, I must do life crap

Anonymous
02/03/26(Tue)16:37:55 No.108053243

Never used comfy before. Trying both manual and portable installations. 16GB VRAM.
VAEDecodeAudio keeps crashing:
>RuntimeError: GET was unable to find an engine to execute this computation

Anonymous
02/03/26(Tue)20:26:42 No.108054980

>>108053243
For me, I have to use --cpu-vae

MIOPEN_FIND_MODE=FAST python main.py --cpu-vae

the miopen_find_mode thing helps with my amd card, on Linux.

anyway, --cpu-vae is necessary on amd, because comfyui's audio vae is immature. really, audio everything is immature on comfy.

anyway, back to slow genning.

possible areas of solution, which I may try to investigate as time allows:
>zluda comfyui
>(actually I won't bother with this one) some other audio gen thingies don't have the cpu-vae issue. heartmula didn't, not sure why. I had to restart the server every few gens, again, not sure exactly how that happened.

Anonymous
02/03/26(Tue)20:28:24 No.108054987

>>108051632
why the fuck is this a thread and why the fuck are you advertising malware instead of the gradio app from the lab that has all the full implementation? go discuss in >>>/g/ldg

Anonymous
02/03/26(Tue)20:48:29 No.108055108

HOW DO YOU KNOW IF YOU ARE USING CPU VAE????
Your cpu usage will go way up. I find all cores utilized, but not all out constantly. 1%, 20%, 80%, 1% again, on the same core say CPU7.
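If you would rather not eyeball a system monitor, here is a rough Linux-only sketch that samples /proc/stat twice and prints per-core busy percentages. It assumes the standard /proc/stat field order (user, nice, system, idle, iowait, irq, softirq, steal); the function names are my own, nothing here comes from ComfyUI, and high CPU use alone does not prove the VAE is what's running there.

```python
import time


def parse_proc_stat(text):
    """Map 'cpuN' -> (busy_ticks, total_ticks) from the text of /proc/stat."""
    cores = {}
    for line in text.splitlines():
        parts = line.split()
        # Keep per-core lines ("cpu0", "cpu7", ...) and skip the aggregate "cpu" line.
        if not parts or not parts[0].startswith("cpu") or parts[0] == "cpu":
            continue
        ticks = [int(x) for x in parts[1:9]]  # guest time is already counted in user
        idle = ticks[3] + (ticks[4] if len(ticks) > 4 else 0)  # idle + iowait
        total = sum(ticks)
        cores[parts[0]] = (total - idle, total)
    return cores


def core_usage(before, after):
    """Percent busy per core between two /proc/stat snapshots."""
    first, second = parse_proc_stat(before), parse_proc_stat(after)
    usage = {}
    for name, (busy1, total1) in second.items():
        busy0, total0 = first.get(name, (0, 0))
        dt = total1 - total0
        usage[name] = 100.0 * (busy1 - busy0) / dt if dt > 0 else 0.0
    return usage


def read_proc_stat():
    with open("/proc/stat") as f:
        return f.read()


if __name__ == "__main__":
    snap1 = read_proc_stat()
    time.sleep(0.5)
    snap2 = read_proc_stat()
    for name, pct in sorted(core_usage(snap1, snap2).items(), key=lambda kv: int(kv[0][3:])):
        print(f"{name}: {pct:5.1f}%")
```

Run it a few times during the decode step; the bouncing per-core numbers described above should show up as values jumping around between runs.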

>>108054987
comfyui isn't considered malware. audio gen is very different from /ldg/ and /sdg/ interests. 99% of those guys will never be interested in even listening to any music at all. music is kind of a Gen X thing. Once you get past Gen X, music just isn't that important to people. That's why "hit songs" for 2025 per Billboard were on average 3 minutes or so. vs 5 minutes as the traditional song length, with a few super long songs making it to the top some years.

>>108054980
correction, this is what is working for me (AMD XTX rx 6950xt):

PYTORCH_ALLOC_CONF=expandable_segments:True TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1 MIOPEN_FIND_MODE=FAST python main.py --use-pytorch-cross-attention --novram --cpu-vae

I don't know if I need all options lol. so many combinations to try, but hopefully I'll take the time to look into it.
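Since it isn't clear which of those options matter, here is a small sketch that just prints every combination of the env vars and optional flags from the command above, so they can be tried one at a time. It only builds command strings and does not launch anything. Treating `--cpu-vae` as always-on is my assumption based on the posts above, and the helper name is made up.

```python
import itertools
import shlex

# Taken from the command above; which of these are actually required is unknown.
ENV_VARS = {
    "PYTORCH_ALLOC_CONF": "expandable_segments:True",
    "TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL": "1",
    "MIOPEN_FIND_MODE": "FAST",
}
OPTIONAL_FLAGS = ["--use-pytorch-cross-attention", "--novram"]
REQUIRED_FLAGS = ["--cpu-vae"]  # assumption: kept on in every run, as reported for AMD


def candidate_commands():
    """Yield one shell command line per subset of the optional env vars and flags."""
    toggles = list(ENV_VARS) + OPTIONAL_FLAGS
    for mask in itertools.product([False, True], repeat=len(toggles)):
        chosen = [t for t, on in zip(toggles, mask) if on]
        env = [f"{k}={shlex.quote(ENV_VARS[k])}" for k in chosen if k in ENV_VARS]
        flags = [f for f in chosen if f in OPTIONAL_FLAGS] + REQUIRED_FLAGS
        yield " ".join(env + ["python", "main.py"] + flags)


if __name__ == "__main__":
    for cmd in candidate_commands():
        print(cmd)
```

That is 32 command lines, from the bare `python main.py --cpu-vae` up to the full command posted above. With gens this slow, nobody is going to test all of them, but it makes it easy to drop one option at a time.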

This is btw what I used with SongBloom

Yes I am genning. It will take maybe 30 minutes for a 120 second song, for me, I guess. We'll see. This is only my second acestep 1.5 gen.

Anonymous
02/03/26(Tue)21:00:31 No.108055193

>>108055108
>>108053243
yeah, so I'm getting a vae decode error too, with audio decode, despite being under cpu decode. it's not a ram issue.

I had one successful gen, so I know it *can* work...

Anonymous
02/03/26(Tue)21:36:14 No.108055405

I can gen music just fine but not much luck with sound effects, farting noises, etc.

Anonymous
02/03/26(Tue)21:41:18 No.108055438

>>108055405
share!

https://catbox.moe/

so far, my gens are taking forever, but I think this one will succeed (I had a node that overrides others, that is incompatible with ace step; it might be an old node that I should avoid anyway)

Anonymous
02/03/26(Tue)21:52:58 No.108055492

>>108055438
https://files.catbox.moe/4gpmrd.mp3
https://files.catbox.moe/2ye4xa.mp3
will try more later

Anonymous
02/03/26(Tue)21:57:16 No.108055519

>>108055108
it's a diffusion model you retard. also this model sucks for the most part. doesn't deserve its own thread

Anonymous
02/03/26(Tue)22:02:08 No.108055545

https://files.catbox.moe/rho1z9.mp3

Anonymous
02/03/26(Tue)22:08:48 No.108055576

>>108055492
Thanks!

be sure to include what your style prompts were!

First one is doing what I knew the base model could do. I would sometimes get like 1 second that punched through the low bitrate and laid down bass.

second
isn't it strange rich people can't get a girlfriend?

>>108055519
Yeah but it's the first song model that has promise. heartmula doesn't follow the prompt. ace step 1.35 was low bitrate. songbloom's method is experimental and not ready for prime time.

>>108055545
style prompt: teen pop

Anonymous
02/03/26(Tue)22:15:11 No.108055602

>>108055545
>>108055576
also, it took me <38 minutes, and needed <50GB of system memory (for the cpu vae). idk why it needs that much lol

Anonymous
02/03/26(Tue)22:39:39 No.108055700

File: 3221311515.png (150 KB, 1121x751)

Official Gradio UI has a lot of features, including LoRA training etc. Plus I feel like the stuff is broken for Comfy: on a 3090 it reloads the prompt each time.

Anonymous
02/03/26(Tue)22:45:58 No.108055741

https://files.catbox.moe/2imtt6.mp3
