[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Comfyui has Ace Step 1.5

Currently broken on AMD, apparently, but if you have nvidia, you can gen songs locally, and with lots of good features.

https://www.reddit.com/r/comfyui/comments/1quzawn/acestep_15_is_now_available_in_comfyui/
>>
>>108051632
the error I'm getting, with amd.
>>
Yeah so the issue is that the audio vae of comfyui is borked on AMD.

as usual, comfyui sucks, it sucks in a very Windows-like way.

using --cpu-vae

super slow. I thought this problem was fixed, but nooo.
>>
>>108051680
so far, 6 minutes of vae/clip running on cpu...
>>
>>108051713
cpu vae takes a huge amount of time with this. too bad the comfyui guy refuses to buy an amd card, so his code is constantly broken on amd.
>>
fwiw, the real problem is hip is incomplete when it comes to audio.

but this is a solved problem for vae anyway, so it shouldn't be happening, don't make me vibe code a comfyui fork.
>>
>>108051632
MY FIRST GEN

https://files.catbox.moe/hqkvul.mp3

btw, has anyone ever gotten the gradio crap to work with amd?
>>
I nuked the gradio stupid shit. what a pantload
>>
God damn github, what a shit website, I searched it like 10 timed and it pukes like it's being ddos'd. massive trash.
>>
>>108052051
gradio didn't work on older nvidia card with 8GB either and I was using the lightest model 0.6B
>>
>>108052218
Yeah, the gradio / uv thing is trash.
>>
>>108052275
Their implementation of it is. Imma tr "comfyui" now, I hate how this dog shit ui became the standard
>>
no luck solving

probability tensor contains either `inf`, `nan` or element < 0

(ie I still have to use cpu vae - that or buy an s tier nvidia gpu to just do stuff that comfyui is incapable of)

try to keep the thread alive, I must do life crap
>>
Neevr used comfy before. Trying both manual and portable installations. 16gb vram.
VAEDecodeAudio keeps crashing:
>RuntimeError: GET was unable to find an engine to execute this computation
>>
>>108053243
For me, I have to use --cpu-vae

MIOPEN_FIND_MODE=FAST python main.py --cpu-vae

the miopen_find_mode thing helps with my amd card, on Linux.

anyway, --cpu-vae is necessary on amd, because comfyui's audio vae is immature. really, audio everything is immature on comfy.

anyway, back slow genning.

possible areas of solution, which I may try to investigate as time allows:
>zluda comfyui
>(actually I won't bother with this one) some other audio gen thingies don't have the cpu-vae issue. heartmula didn't, not sure why. I had to restart the server every few gens, again, not sure exactly how that happened.
>>
>>108051632
why the fuck is this a thread and why the fuck are you advertising malware instead of the gradio app from the lab that has all the full implementation? go discuss in >>>/g/ldg
>>
HOW DO YOU KNOW IF YOU ARE USING CPU VAE????
Your cpu usage will go way up. I find all cores utilized, but not all out constantly. 1%, 20%, 80%, 1% again, on the same core say CPU7.

>>108054987
comfyui isn't considered malware. audio gen is very different from /ldg/ and /sdg/ interests. 99% of those guys will never be interested in even listening to any music at all. music is kind of a Gen X thing. Once you get past Gen X, music just isn't that important to people. That's why "hit songs" for 2025 per Billboard were on average 3 minutes or so. vs 5 minutes as the traditional song length, with a few super long songs making it to the top some years.

>>108054980
correction, this is what is working for me (AMD XTX rx 6950xt):

PYTORCH_ALLOC_CONF=expandable_segments:True TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1 MIOPEN_FIND_MODE=FAST python main.py --use-pytorch-cross-attention --novram --cpu-vae

I don't know if I need all options lol. so many combinations to try, but hopefully I'll take the time to look into it.

This is btw what I used with SongBloom

Yes I am genning. It will take maybe 30 minutes for a 120 minute song, for me, I guess. We'll see. This is only my second acestep 1.5
>>
>>108055108
>>108053243
yeah, so I'm getting a vae decode error too, with audio decode, despite being under cpu decode. it's not a ram issue.

I had one successful gen, so I know it *can* work...
>>
I can gen music just fine but not much luck with sound effects, farting noises, etc.
>>
>>108055405
share!

https://catbox.moe/

so far, my gens are taking forever, but I think this one will succeed (I had a node that overrides others, that is incompatible with ace step; it might be an old node that I should avoid anyway)
>>
>>108055438
https://files.catbox.moe/4gpmrd.mp3
https://files.catbox.moe/2ye4xa.mp3
will try more later
>>
>>108055108
it's a diffusion model you retard. also this model sucks for the most part. doesn't deserve it's own thread
>>
https://files.catbox.moe/rho1z9.mp3
>>
>>108055492
Thanks!

be sure to include what your style prompts were!

First one is doing what I knew the base model could do. I would sometimes get like 1 second that punched through the low bitrate and laid down bass.

second
isn't it strange rich people can't get a girlfriend?

>>108055519
Yeah but it's the first song model that has promise. heartmula doesn't follow the prompt. ace step 1.35 was low bitrate. songbloom's method is experimental and not ready for prime time.

>>108055545
style prompt: teen pop
>>
>>108055545
>>108055576
also, it took me <38 minutes, and needed <50gb system memory (to cpu vae). idk why it needs that much lol
>>
File: 3221311515.png (150 KB, 1121x751)
150 KB
150 KB PNG
Official Gradio UI has a lot of features, including LoRA training etc... Plus I feel like the stuff is broken for Comfy, on 3090 it reloads prompt each time
>>
https://files.catbox.moe/2imtt6.mp3



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.