/g/ - Technology


Thread archived.
You cannot reply anymore.




File: 1744670862091211.jpg (145 KB, 675x499)
145 KB
145 KB JPG
Discussion of Free and Open Source Diffusion Models

Prev: >>108045531

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg
>>
blessed thread of frenship
>>
File: ldg_spammer.jpg (1.73 MB, 979x2558)
1.73 MB
1.73 MB JPG
>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>108048611
What's wrong with ComfyUI?
>>
>>108048624
OP used to be the biggest Comfy advocate, but then he was fired from the org for being a schizo, has never found a job since and thought he would get rich by making a competing UI that never took off on account of it being garbage.
>>
File: z_imageBASEd_00227_.jpg (426 KB, 1264x1760)
426 KB
426 KB JPG
>>
File: 1712428465706393.png (83 KB, 500x500)
83 KB
83 KB PNG
How do I hack into comfy a pre-subgraph, pre-node2 old UI?
>>
File: ComfyUI_00510_.jpg (2.53 MB, 4096x2730)
2.53 MB
2.53 MB JPG
Anon...

Qwen image 2512 kills any other model at realism and anatomy. No other local model can output these 2 girls with fingers and toes without monstrosities
>>
>>108048708
Oh really? Must be the year of Chinese culture then.
>>
>gen stuff at work without boss noticing
>work quality decreased
this is such a distracting hobby
>>
>>108048708
all nice but the PROBLEM is they are both the exact same fucked up blend of mystery meat mixed race AI slop, also ultra SEPIA 9000.
>>
why are you doin this?
>>
Proper thread
>>108048751
>>108048751
>>108048751
>>
File: z_imageBASEd_00233_.jpg (429 KB, 1264x1760)
429 KB
429 KB JPG
>>
when will LLMs catch up? I can gen very high quality 1girls doing whatever the fuck I want but LLMs are still stuck at "shivers running down my spine" unless you have a gorillion gigs of RAM
>>
>>108048767
hard to say. I think language is simply a much more complex issue than pixels on a screen.
>>
Ace step 1.5 in 5 hours brehs. Not just the model but a bunch of the tools that come with it too.
>>
>>108048805
thank god comfy is a good man, he won't forget us old lads and implement fp16, right?
>>
I have a dual 3090 rig all planned out but video is the make it or break it for me
Am I gonna get good video mileage from my 2x 3090 and 128GB of RAM?
>>
>>108048896
Your 2nd 3090 will be doing shit all. Dual GPU stuff for image and video gen is basically dead in the water.
>>
>>108048896
one 5090 would be better if you could buy one (you can't). still a good setup though
>>
File: 1762456326060491.mp4 (3.75 MB, 1216x704)
3.75 MB
3.75 MB MP4
>>
File: 1763474734338596.mp4 (3.73 MB, 1216x704)
3.73 MB
3.73 MB MP4
>>
File: 1750856485214797.mp4 (3.72 MB, 1216x704)
3.72 MB
3.72 MB MP4
>>
File: 1756076253659700.mp4 (3.75 MB, 960x896)
3.75 MB
3.75 MB MP4
>>
File: 1738620248024991.mp4 (3.77 MB, 1216x704)
3.77 MB
3.77 MB MP4
>>
File: 1751485183800615.mp4 (3.65 MB, 768x1088)
3.65 MB
3.65 MB MP4
>>
>>108049185
I remember that video being a little different
>>
File: prompt injection.jpg (22 KB, 400x400)
22 KB
22 KB JPG
>>108048611
how do I poison my websites for AI?
>>
>>108049269
The honest answer? You can't. It's just like piracy. By trying to stop it, you just end up hurting regular users.
>>
>>108049300
that's fine with me.
>>
>>108049321
The hurting users or the not being able to stop it?
>>
>>108049300
oh and by the way I already know it is possible, and being done. including in resumes.
>>
>>108049328
I dunno, add schizo text somewhere where it's not visible to a regular user maybe
>>
File: 1766805074085706.mp4 (3.75 MB, 1216x704)
3.75 MB
3.75 MB MP4
>>108049199
>>
File: 1754351551665520.mp4 (3.74 MB, 1216x704)
3.74 MB
3.74 MB MP4
>>
File: 1747756044169004.mp4 (3.75 MB, 1216x704)
3.75 MB
3.75 MB MP4
>>
>>108049390
>>108049394
>>108049398
LTX-slop.
>>
>>108049328
I think you're either misunderstanding or underestimating the scope of the issue.
>>
>>108049425
no im not
>>
>comfyui sucks
I dunno, I tried wangp. that gradio shit is confusing as fuck
>>
>>108049441
i made my own fork of comfyui
>>
>>108049432
ok
>>
File: 1750979199849684.png (548 KB, 926x1236)
548 KB
548 KB PNG
>>108049452
meanwhile
>>
>>108049452
tourist retard loser redditor, that's called a concern troll, but you would know that already if you weren't a newfaggot
>>
>>108049469
>dude: I consent
>expo: I consent
>is there someone you forgot to ask
>>
>>108049490
nazi
>>
File: Capture.png (30 KB, 392x462)
30 KB
30 KB PNG
bye losers
>>
File: 1725563362424353.png (1.03 MB, 1030x1541)
1.03 MB
1.03 MB PNG
what anime video model can I use with Google Colab?
>>
File: 1753401524239795.mp4 (1.29 MB, 704x672)
1.29 MB
1.29 MB MP4
>>
File: 1751963012183599.png (1.22 MB, 2266x907)
1.22 MB
1.22 MB PNG
>>108049452
what was elon thinking?
>>
>>108049590
not local kill yourself
>>
>>108049637
you big dum dum
https://files.catbox.moe/11ivw1.mp4
>>
>>108049680
>hey i catboxed the video
okay???
>>
>>108049680
tourist
>>
anyone got an upscale workflow for comfy? trying out anima
>>
>>108049694
must suck having crippling retardation
>>
>>108049723
i do
>>
>>108049729
may i see it
>>
>>108049742
no
>>
>>108049747
i implore you to reconsider
>>
>>108049762
post anima
>>
>>108049847
i only make porn
>>
File: 1767686505854159.mp4 (3.53 MB, 1056x704)
3.53 MB
3.53 MB MP4
>>108048708
>>
Apache2 anima when?
>>
File: 1746627971647865.mp4 (3.74 MB, 1216x704)
3.74 MB
3.74 MB MP4
>>
File: 1742600378574970.mp4 (3.72 MB, 1216x704)
3.72 MB
3.72 MB MP4
>>
>>108049937
>>108049940
hilarious that you are now stealing gens from /v/ to keep your dogshit thread alive, Julien
another day of tranusstudio being the worst UI wrapper in existence, and for what? this?
>>
File: 1768238632698852.webm (1.61 MB, 1216x704)
1.61 MB
1.61 MB WEBM
>>
I don't like that the other thread is the same person self replying for most of it. this must be the real thread
>>
>>108049992
yeah mods are going to delete it eventually. this one was made first
>>
>>108049992
that thread was made by lord niggerjak so that his peons may prostrate before him. I'd avoid it
>>
>>108050039
who are the peons? the voices in his head?
>>
who the fuck is this apache2 anima retard, what does that mean, what's his problem, can he fuck off?
>>
>>108050116
its the usual suspect(s) sneething at anything related to comfy fudding yet again
>>
>>108048611
Why is AniStudio not in OP?
>>
>>108050116
calm down tdrustled
>>
File: 1739937185760580.mp4 (2.04 MB, 704x1056)
2.04 MB
2.04 MB MP4
>>108049553
>jewgle
>>
>>108050153
? You still can't answer why AniStudio shouldn't be in OP
People use it to generate images and it runs locally, so why should we not add it to OP?
>>
>>108050153
>>108050207
Get lost drama baiter i'm just asking a question
If you can't give an answer you're free to go somewhere else or just don't reply
>>
>>108050227
you are the drama baiter, rannigger
>>
>>108050236
I don't understand why you would bully a dev that put so much time and effort to finally kill all the pyshit and build a proper desktop app for us
He wants to destroy all the pyshit once and for all which would improve the whole ecosystem
What kind of mental illness is the reason you're against improvements?
>>
>>108050270
That's it Julien, suck on that dildo!
>>
>vandalized op
>shitting on comfy yet again
>"pyshit pyshit pyshit!"
Nta but you really need help julien, you're spiraling and you're on this since christmas
Get a job and stop being terminally online here you schizo, no one will use your crashing garbage "ui" anyway
>>
>>108050201
No one uses it because it doesn't compile.
>>
>>108050444
stop lying or post a github issue "anon"
>>
>>108050270
this is your MO, rannigger. when your precious rentry links are not in the OP you will come into the thread and pose as ani to be annoying and stir resentment against him and therefore justify having the rentry links

rannigger is really proud of his rentry links being in the OP because evidently he has nothing going on in his life
>>
>>108050512
>>108050547
>cumfart's cumrag thinks its opinion matters
Lol
>>
>>108050547
Put the fries in the bag
No one cares about your wrapper and you
>>
>>108050512
Why would I make a microsoft account to submit an issue when dev has said here he's abandoned it?
>>
>>108050563
>>108050568
>>108050572
stop samefagging lolcow
>>
>>108050582
Tranussyudio status?
>>
File: file.png (61 KB, 1165x805)
61 KB
61 KB PNG
>>108050621
>>
>>108050646
So basically not being developed anymore? Can you fuck off for good then finally?
>>
>>108050646
So it can't Z image, Klein, Anima or LTX?
Lmao
Can't believe you call yourself a dev, you worthless subhuman cumrag
>>
>>108050756
His UI bundles an old version of sd.cpp, the project that actually does the local diffusion, he didn't develop it but it does have support for klein and z image now and has a few functional front ends that do compile.
>>
>>108046159
nice
>>
anima is cancer
>>
File: F2Kb__00008_.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
AHHHHHH 1.5 was released 35 minutes agooooo

https://blog.comfy.org/p/ace-step-15-is-now-available-in-comfyui
>>
File: 1758272184852117.png (12 KB, 763x200)
12 KB
12 KB PNG
>>108051392
benchod
>>
where do people post Ace gens?
>>
FATHER FORGIVE ME FOR I AM UPDOOTING
>>
>>108051357
apache2 version when?
>>
>>108051401
>https://blog.comfy.org/p/ace-step-15-is-now-available-in-comfyui
works on my machine...
>>
File: 1750861933257946.mp4 (3.74 MB, 896x896)
3.74 MB
3.74 MB MP4
>>108051401
why you lie
>>
>>108051427
https://catbox.moe/
>>
>>108051438
i made this video
>>
>>108051438
idk why it's not working for you.
https://www.reddit.com/r/comfyui/comments/1quwx84/acestep_15_template_for_comfyui_v012_is_ready/
>>
>>108051467
>>108051438
>>
File: 00112-2603958653.jpg (1.63 MB, 2688x2688)
1.63 MB
1.63 MB JPG
>>108048611
>>108048611
This OP is wrong
>>
>>108051496
280+ posts in your thread. better start baking
>>
>>108051496
so the guy sitting is ran, the guy collecting cotton is ani and the girl is debo?
>>
File: 01805-3002059339.png (1.02 MB, 896x1152)
1.02 MB
1.02 MB PNG
>>108051451
AI music thread on /wsg/ is dead. wtf? i loved that thread.
https://www.youtube.com/watch?v=OYjZK_6i37M&list=RDOYjZK_6i37M&start_radio=1
>>
>>108051467
ace step 1.5 not working for me in comfy...
>>
I have a noob question, can I grab any lora from civit.ai and combine it with any model I want to add penises? I have a good realistic image gen model, it does boobs and ass fine, but when it comes to generating dicks, it creates horrors beyond comprehension.
>>
>>108051573
why are you lying
>>
>>108051591
yeah
>>
>>108051428
i'm not so sure that is so anymore; i feel that the new technological advances made a 'tectonic fault' there too and we have now the old group and a new 'trans-humanist'/techno one and their goals no longer align.
>>
best ui for gender non-confirm fxlks?
>>
File: 883982277690625.png (2.07 MB, 800x1296)
2.07 MB
2.07 MB PNG
>>
>>108051679
can you make her peg a male with her tale and impregnate him with her eggs
>>
>yet another qwen text encoder
>this time it is 1.7b
Just stop it already for fuck sake. I already have 0.6b, 4b, 8b...
>>
don't prompt for sonic, masterpiece, safe in anima
>>
File: file.jpg (976 KB, 1149x1528)
976 KB
976 KB JPG
>>
>>108051718
Total qwen victory.
>>
File: vae.png (22 KB, 555x428)
22 KB
22 KB PNG
Does the x0_x32 Radiance stray away from normal radiance? I keep getting this on the pixel space notVae.
>>
File: 00121-1380337966.jpg (1.18 MB, 2688x2048)
1.18 MB
1.18 MB JPG
>>108051767
Wouldn't bother with that model desu
>>
>>108048624
Tourist? Have ever saw how troonfy used to behave in this board before he started to just lurk and throw his furry dumbo now and then to shill his crapware?
>>
File: ComfyUI_00690_.png (639 KB, 1024x1024)
639 KB
639 KB PNG
>>
>>108051436
this is crap
>>
>>108051784
no, please tell me what actually happened
>>
File: klein9b_temp_00043_.jpg (375 KB, 1248x1248)
375 KB
375 KB JPG
>>108051784
Try speaking English, you retarded faggot.
>>
File: file.png (206 KB, 1070x1663)
206 KB
206 KB PNG
Rate my song

https://vocaroo.com/1kzCDVTxuLOk
>>
>>108051837
meds, dyslexic fuck
>>108051833
desu archive
>>
>>108051863
i don't know what to search
>>
>>108051855
not clicking that shit
>>
>>108051855
Nice, workflow? I'll try to do some lolicon music
>>
>>108051874
Not sharing with a pdf
>>
>>108051874
nta the default workflow from comfy's blog post works for me
>>
File: file.png (176 KB, 456x769)
176 KB
176 KB PNG
>>108051874
its this one from comfyui templates
>>
>>108051886
Play button doesn't work?
>>
File: file.jpg (839 KB, 1148x1529)
839 KB
839 KB JPG
>>
>>108051897
they don't look very comfy
>>
>>108051903
raxist
>>
File: 00006-1523558169.jpg (2.07 MB, 2560x2048)
2.07 MB
2.07 MB JPG
>>
>>108051865
you are replying to tranjak who is just here to stir drama and is the cancer of this thread
>>
>>108051916
ai slop
>>
File: image.png (3 KB, 291x103)
3 KB
3 KB PNG
>>108051392
>>108051436
Try it, sis!
>>
>>108051886
Thanks
>>108051878
Heh, get fucked!! I'll gen some childish music about lolis and their uncle's now
>>
>>108051573
I have the solution >>108051936
>>
>>108051855
Commercial-Grade Quality!
>>
>>108051920
No I'm not lmao I hate all avatar fags equally
>>108051865
Well search for that furry dumbo term then, its from back then, if you're really curious.
>>
Anything new for API nodes?
>>
>>108051936
comfy cloud is pretty intuitive, I dont have a good gpu so i use it
>>
>>108049490
help me I'm being oppressed by the graphics card
>>
>>108051957
you don't have a gpu
>>
>>108051957
Lots of good stuff on API nodes recently
>>
>>108051679
love it
>>
File: ComfyUI_00003_.png (1005 KB, 896x1152)
1005 KB
1005 KB PNG
tried to use comfyui and used grok to tell me how to set it up, it works but the results are kinda shit.
is it a prompt/model skill issue or there's something wrong in the workflow itself?

https://pastebin.com/QzXX64A9
>>
The new queue is so stupid that it calculates the time a job has been sitting in the queue...
>>
>>108051997
4 tetas
>>
File deleted.
Why do my Lora results look like shit? Its nothing like the OP image, this is supposed to be a "dick".

https://civitai.com/models/2360918/erect-penis-model-klein-9b
>>
File: 01887-2174442414.png (1.13 MB, 896x1152)
1.13 MB
1.13 MB PNG
>>108051855
I like it. Is it Ace?
thanks for sharing
>>
can I make dreamcore music in ace 1.5 or no
>>
>>108051784
>>108051920
>>108051936
Tranussystudio status?
>>
>>108051998
vibecodebros...
>>
>>108051436
how the fuck do I get lyrics? I put in lyrics and nothing gets sung. It's over
>>
>>108051718
https://vocaroo.com/16WmAAbFQLU5
>>
>try the zbase distill lora
>lora key not loaded
>results are shit
>>
>>108052024
Dead I hope, tran, trani and troonfy love each other
>>
>>108052055
loser
>>
>>108052055
0wn3d
>>
guys im having a good time
>>
>>108052100
proof?
>>
>>108052104
of?
>>
>>108052100
gullible idiot
>>
acestep has me impressed, now ill prepare a dataset for when lora training becomes accessible.
>>
>>108052142
I hope the entire TLMC is enough. granted I'm a few years out of date
>>
File: 00142-228056292.jpg (1.5 MB, 2688x2048)
1.5 MB
1.5 MB JPG
>>
File: 01914-1318586301.png (1.01 MB, 896x1152)
1.01 MB
1.01 MB PNG
>>108052142
It looks like it has an interesting feature set. Being able to change words in songs and remix songs.
I'm curious how it compares to Suno in terms of quality.
I think I'll deep dive into it this weekend. I've had some nice times with Suno. I wanna make something grand.
>>
>>108052055
yup, same for me.
>>
>>108052018
its incredibly inconsistent, copypasting his prompts works but trying my own stuff and adding the trigger makes the same garbage as you got
>>
>>108052018
seems fine to me, looks like what I see when I look down
>>
His self insert is white and he portrays his enemies in the fields picking cotton. What fascinating psychology.
>>
>>108051745
uh oh. deviantart datasets sonic fetishes are too strong
>>
File: 00145-443076970.jpg (1.32 MB, 2688x2048)
1.32 MB
1.32 MB JPG
>it's upset that I make a schizo that screeches the n word pick cotton
>>
>>108052195
>>108052251
This is impressively good, what did you use to get that?
>>
>>108052251
Maybe I shouldn't use ZiB base, just testing. I'm not sure why text is getting fucked up when I do a latent upscale. But this is ComfyUI. Maybe it's denoise related.
>>
>>108052265
hrt and meth
>>
File: 00148-497029784.jpg (1.3 MB, 2688x2048)
1.3 MB
1.3 MB JPG
>>108052265
z base bf32, while rare results like this make me wish I could find a good eye detailer model
>>108052269
>the projection
Found the failed dev
>>
When I was a little bitty baby
My mama done rock me in the cradle
In them old cotton fields back home
It was back in Louisiana
Just about a mile from Texarkana
In them old cotton fields back home

Let me tell you now well got me in a fix
I caught a nail in my tire doing lickitey splits
I had to walk a long long way to town
Came along a nice old man well he had a hat on
Wait a minute mister can you give me some directions
I gonna want to be right off for home
When I was a little bitty baby
My mama done rock me in the cradle
In them old cotton fields back home

It was back in Louisiana
Just about a mile from Texarkana
In them old cotton fields back home
Don't care if them cotton balls get rotten
When I got you baby, who needs cotton
In them old cotton fields back home

Brother only one thing more that's gonna warm you
A summer's day out in California
It's gonna be those cotton fields back home
It was back in Louisiana
Just about a mile from Texarkana
Give me them cotton fields
(it was back in Louisiana)

Let me hear it for the cotton fields
(just about a mile from Texarkana)
You know that there's just no place like home
Well boy it sure feels good to breathe the air back home
You shoulda seen their faces when they seen how I grown
In them old cotton fields back home
>>
>>108051855
Heh very catchy
>>
File: image-3.png (3.93 MB, 1328x1328)
3.93 MB
3.93 MB PNG
Qwen Image 2512 fp8_e4m3fn test on Radeon AI PRO 9700

Default comfy workflow, same prompt as the one I used for z image. 4 seconds/step and 4 minutes for the whole image. It might be faster if you have more than 32gb of ram to spare since it was maxing everything out. 27gb of vram used during inference

The raccoon looks a lot more copyrighted compared to z image. There's also basically no seed variation. Honestly really impressed with the undistilled z image tongyi put out. It's nice to know I'm not missing out by not being able to run larger models on my 16gb vram card at home
>>
>>108052364
>The raccoon looks a lot more copyrighted compared to z image
yeah it's how qwen stuff looks, klein is much better in this regard too, its very obvious how 3d/cartoon stuff in qwen sucks (i.e. looks like ai sloppa turbo 9000) when you compare it to zi/klein
>>
File: 00152-497029784.jpg (1.3 MB, 2688x2048)
1.3 MB
1.3 MB JPG
>>
i hate how klein refuses to do anything semi nsfw. i dont even mean nudity, but even anything mildly suggestive
>>
>>108052364
And should be supporting sdcpp so you don't have to deal with python cancer anymore
>>
>>108051855
Wtf that's actually good, guess I need to try it in comfart for myself.
>>
>>108052399
Oof, getting dangerously coomy in here, I'm not complaining
>>
File: 00155-3740450160.jpg (1.31 MB, 2688x2048)
1.31 MB
1.31 MB JPG
>>108052418
nobody is going to do free labor for you boy
>>
File: 01816-1038582822.png (1.26 MB, 896x1152)
1.26 MB
1.26 MB PNG
https://www.youtube.com/watch?v=MRFkHZcVP90
>>
File: z_image_bf16_00077_.png (3.24 MB, 1264x2048)
3.24 MB
3.24 MB PNG
>>
im horny
>>
File: 00160-925725323.jpg (1.26 MB, 2688x2048)
1.26 MB
1.26 MB JPG
>>
File: 1761729561504421.jpg (468 KB, 832x1216)
468 KB
468 KB JPG
>>
>>108052542
oh bwoy
>>
File: o_00454_.png (2.11 MB, 1280x768)
2.11 MB
2.11 MB PNG
>>
Comfy's implementation of AceStep is garbage.
>No toggle for the "thinking" option like the official implementation (that uses LLM to guide the song's inference)
>No option to select instrumental only
>Why the fuck do I need to set duration, bpm and keyscale? In the official implementation, you don't need that
>>
tran is still having a meltdown after 3 days? what mental gymnastics is it this time?
>>
>>108052587
Posting images from a new model is a meltdown?
How odd
>>
I downloaded a distilled LTX2 workflow for my vram-poor situation. It works, but I don't want to use distilled LoRAs. Using it with the provided workflow, without the LoRAs, produces (not amazing) audio, but the visuals are extremely noisy, even at very high cfg values like 20+

What do I probably need to change to make it work correctly? Other than cfg and steps
>>
File: 01814-3179937906.png (1.06 MB, 896x1152)
1.06 MB
1.06 MB PNG
https://www.youtube.com/watch?v=sWGh92O9kxU&list=RDsWGh92O9kxU&start_radio=1
>>
File: MaryJane.png (946 KB, 760x1013)
946 KB
946 KB PNG
>>
File: 00166-3325721174.jpg (1.3 MB, 2688x2048)
1.3 MB
1.3 MB JPG
>>
It's really Suno v4.5-v5 tier bros.

I can't believe we have this quality locally (these are my first 2 tries; lyrics, bpm, timescales etc. generated with help of Gemini)

https://files.catbox.moe/ovsmhs.mp3
https://files.catbox.moe/8dxyze.mp3

Also I notice we are missing the repaint feature on ComfyUI? That would be clutch for some portions, but it gets the songs just about 90-95% right.
>>
>>108052607
post your workflow dingus
>>
>>108052668
Also from what I recall there's a fast sorting feature if you generate in batch (by lyric adherence) (I guess it's on gradio and not on Comfy yet).
>>
>>108052663
why wont you do the birth scene i asked? come on
>>
>>108052681
>Also from what I recall there's a fast sorting feature if you generate in batch
qrd
>>
>>108052668
Are you deaf? Fucking shill retard.
>>
>>108052668
I'm still downloading the models. Outside of machine generated japanese pop (which is not that musical at all) I'm curious to see what happens. At least the sound quality is clear enough.
>>
>>108052663
sheeeeit, take it to /b/degen and be dirty and explicit, I'm loving it too much
>>
template in comfy uses acestep_v1.5_turbo.safetensors

is this fine? how is it so far?
>>
>>108052714
i don't have a gpu
>>
File: 161019551659798.png (2.71 MB, 1248x1824)
2.71 MB
2.71 MB PNG
>>
>>108052673
ok here it is presently https://litter.catbox.moe/5fcyhwfgyi2zrd28.json
>>
>>108052668
Lmao. Cmon now.
>>
File: o_00457_.jpg (1.87 MB, 2560x1536)
1.87 MB
1.87 MB JPG
>>
So it's kind of a diffusion based model, but has anybody ever gotten ACE Step music gen working on comfy UI with an AMD system by chance?
>>
>>108052704
>Outside of machine generated japanese pop (which is not that musical at all) I'm curious to see what happens

These are my first gens, not my best, this is why I say a key feature is missing from Comfy (repaint). The lyrics/prompt adherence out of these gens is very good though.
>>
File: 630241955803757.png (1.57 MB, 880x1168)
1.57 MB
1.57 MB PNG
>>108051997
That looks like Pony, try a newer model like Anima or something.
>>
>>108052745
deformed face, 4 years, its anima alright
>>
>>108052770
deformed legs
>>
>>108052769
No it was not a critique of (you) just a general opinion
*hugs* SUGOII~!
>>
File: GrassLadies (2).jpg (3.53 MB, 6144x1024)
3.53 MB
3.53 MB JPG
Grass Ladies retrospective
OG SD 3.0 really was that bad, comparatively, lol
>>
>>108051855
surprisingly clear and catchy

I will make an Epstein song about Bill Gates and post it to his twitter
>>
File: file2.jpg (727 KB, 1117x1489)
727 KB
727 KB JPG
>>
>>108052746
>q2
buddy
>>
>>108052404
Patently false lol
>>
>>108052790
ok ok I'll see if I can run a larger one
>>
>>108052794
you better not try
>>
File: file3.jpg (759 KB, 1094x1459)
759 KB
759 KB JPG
>>
>>108052793
okay? then tell me how to generate visible nipple outlines under a womans top. it straight up ignores all of my prompts.
>>
>>108052810
It's like SDXL but worse but better. I don't know if it is actually worth to use ZiB on my laptop grade hardware. At least it seems to know some artist styles but that might be just general knowledge from le Van Gogh etc
>>
>>108052781
Thanks. Now with Z Images and Kleins the era of SD is truly over.
>>
File: o_00460_.png (1.91 MB, 1280x768)
1.91 MB
1.91 MB PNG
>>
>>108052745
Artist?
>>
File: file4.jpg (731 KB, 1117x1489)
731 KB
731 KB JPG
>>108052831
I think with a clever workflow. Mix and match and then upscale with something else.
>>
>>108052843
me
>>
whats the best way to stack loras with flux2 klein? i notice the quality clearly degrades when i use more than one
>>
>>108051436
> example workflow uses all in one checkpoint
> download weights link has no aio, but two textual encoders instead
Fuck off, I will not use cloud.
>>
>>108052794
wait. I don't think it's that. I think it has something to do with this split sigma setup but I'm not troubleshooting it for you. just get another workflow or use comfy template
>>
>>108052866
reduce the strength so they all add up to 1 or 0.7
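A minimal sketch of that suggestion, assuming it means rescaling every LoRA strength by the same factor so the total lands on a chosen budget. The function name and the LoRA names are hypothetical and this is not a ComfyUI API; the resulting numbers would still be typed into the loader nodes by hand.

```python
def scale_strengths(strengths, target_sum=1.0):
    """Rescale LoRA strengths so they add up to target_sum, keeping their ratios."""
    total = sum(strengths.values())
    if total <= 0:
        raise ValueError("strengths must add up to a positive number")
    factor = target_sum / total
    return {name: value * factor for name, value in strengths.items()}


if __name__ == "__main__":
    # Hypothetical stack of two LoRAs, both loaded at full strength.
    stack = {"style_lora": 1.0, "detail_lora": 1.0}
    print(scale_strengths(stack, target_sum=1.0))
    print(scale_strengths(stack, target_sum=0.7))
```

Because every strength is scaled by the same factor, the relative balance between the LoRAs is unchanged; whether a given LoRA still has a visible effect at the reduced value has to be checked per model.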
>>
>>108052668
This is nowhere suno v5 level. Not even close
>>
>>108052693
>>108052748
>>108052875
samefag
>>
>>108052873
I'm trying it without the two-part upscaling process now, since I'm not obsessed with HD anyway. But it takes me many minutes to test these things. I'm not asking anyone to take over troubleshooting, I was just checking whether anyone spotted something immediately.
>>
>>108052828
That's a pretty specific thing, I dunno how you'd word it even unless it has some widely recognized Booru tag associated with it
>>
File: 00173-1500669391.jpg (1.28 MB, 2688x2048)
1.28 MB
1.28 MB JPG
>>
>>108052866
Base or Distilled? If Distilled it doesn't and never will support proper Lora stacking, just like every other Distilled model.
>>
File: 01918-3733975422.png (1.05 MB, 896x1152)
1.05 MB
1.05 MB PNG
>>108052668
Quite impressive. I think you're right. It's definitely on par with Suno.
Looking forward to check it out.
>>
>>108052874
hmm yeah that seems to solve the quality issue, but then one of them is always too weak

>>108052901
>Base or Distilled?
distilled
>If Distilled it doesn't and never will support proper Lora stacking
oh damn, I didn't realize. guess i will try it with base then
>>
>>108052895
fuck you bot do my preggo thing
>>
>>108052895
holy fuck, so fucking hot
>>
first prompt box: you just describe the genre? there arent really instructions/prompt guides.
>>
>>108052927
Ahhh I'm COOMIINGGG
>>
File: z_image_bf16_00074_.jpg (2.8 MB, 1264x2048)
2.8 MB
2.8 MB JPG
>>
people here complaining about comfy is starting to show up in the first page Google searches. keep it up lads
>>
File: output.webm (764 KB, 640x352)
764 KB
764 KB WEBM
>>108052889
success?
>>
>>108052919
It's always a matter that some (or most if they are from civitai) are trash.
You need to be clear what to combine and it's always a compromise.
It's a balancing thing. 1 or 2 perhaps.
>>
>>108052894
i can easily generate it with an nsfw lora, but bare klein just wont
>>
>>108052955
comfy should be dragged out to the street and inseminated
>>
>>108052955
Show us a screenshot. I don't use google.
>>
>>108052949
indeed
>>
AceStep workflow for not all in one model please? Templates are loading with 20kb/s and 40-60mb.
>>
>>108052960
prompt?
>>
>>108052971
>>108052955
>he uses google
>in 2026
>while posting on /g/
>in AI thread
retard nogen
>>
>>108052981
I bought ComfyCloud lootbox node and my downloads are fast.
>>
>>108052988
I don't use google.
>>
reminder
boards:4:g;type:filename;/\.mp4$/i;stub:no;op:no;
filter the video faggot
his videos are designed to drain your soul energy
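For anyone unsure what the regex part of that filter matches, here is a small Python sketch. It only checks the pattern /\.mp4$/i against filenames; the surrounding boards/type/stub/op options belong to the filter extension and are not modelled here, and the helper name is made up for illustration.

```python
import re

# Same pattern as in the filter line: a literal ".mp4" at the very end, case-insensitive.
MP4_FILENAME = re.compile(r"\.mp4$", re.IGNORECASE)


def is_mp4(filename):
    """Return True when the filename ends in .mp4 (any letter case)."""
    return MP4_FILENAME.search(filename) is not None


if __name__ == "__main__":
    for name in ["1762456326060491.mp4", "clip.MP4", "output.webm", "mp4_notes.png"]:
        print(name, is_mp4(name))
```

The `$` anchor is what keeps files like output.webm or a PNG with "mp4" in its name from being caught.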
>>
>>108052751
>So it's kind of a diffusion based model, but has anybody ever gotten ACE Step music gen working on comfy UI with an AMD system by chance?
Works just fine on pytorch 2.9.1+rocm 20260116 on my AI PRO 9700. The actual inference time takes like 8 seconds compared to the encoding which takes a lot longer
>>
File: 1755466671821187.mp4 (1.45 MB, 720x1040)
1.45 MB
1.45 MB MP4
>>108052547
>>
>>108053001
proof?
>>
File: 01924-4076749057.png (1.09 MB, 896x1152)
1.09 MB
1.09 MB PNG
https://www.youtube.com/watch?v=ln7Vn_WKkWU&list=RDsWGh92O9kxU&index=8

Well, I don't know why I came here tonight
I got the feeling that something ain't right
I'm so scared in case I fall off my chair
And I'm wondering how I'll get down the stairs

Clowns to the left of me
Jokers to the right
Here I am, stuck in the middle with you

Yes, I'm stuck in the middle with you
And I'm wondering what it is I should do
It's so hard to keep this smile from my face
Losing control, yeah, I'm all over the place

Clowns to the left of me
Jokers to the right
Here I am, stuck in the middle with you

When you started off with nothing
And you're proud that you're a self-made man
And your friends, they all come crawling
Slap you on the back and say
"Please, please"

Trying to make some sense of it all
But I can see, it makes no sense at all
Is it cool to go to sleep on the floor?
'Cause I don't think that I can take anymore

Clowns to the left of me
Jokers to the right
Here I am, stuck in the middle with you

When you started off with nothing
And you're proud that you're a self-made man
And your friends, they all come crawling
Slap you on the back and say
"Please, please"

Well, I don't know why I came here tonight
I've got the feeling that something ain't right
I'm so scared in case I fall off my chair
And I'm wondering how I'll get down the stairs

Clowns to the left of me
Jokers to the right
Here I am, stuck in the middle with you

Yes, I'm stuck in the middle with you
Stuck in the middle with you
Here I am, stuck in the middle with you
>>
>>108053008
I fucked your mother last night and she called me daddy.
>>
>>108053026
big claims need big proof
>>
Field slave ani is running it's course any suggestions?
>>
>>108053026
dad why are you posting on 4chan? i thought you was at work
>>
>>108053007
kek
>>
>>108053034
If this nigga does his field work one day hes gonna be promayted to a house nigga.
>>
>>108053055
That's true
House slave ani next
>>
>>108053034
dragging stone blocks up a ramp on the side of a pyramid
>>
>>108052843
dima ivanov
>>
File: file5.jpg (944 KB, 962x1443)
944 KB
944 KB JPG
Hmm applying style in Klein makes it look like a GPT gen oh goy
>>
>>108052981
Seems like it's dual clip.
>>
File: o_00465_.png (1.52 MB, 1280x896)
1.52 MB
1.52 MB PNG
>>
File: Screenshot_2847.png (2 KB, 343x27)
2 KB
2 KB PNG
>excited to try AceStep since it works on 4gb VRAM
>>
>>108053064
I like this better
>>
>>108053067
65 images on danbooru wtf.
>>
>>108053101
20 seconds isnt too bad
>>
>>108053109
>seconds
anon...
>>
why cant we quantize to .safetensors?
>>
>>108053125
>what is fp8
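A rough sketch of why fp8 matters here, assuming weight memory is simply parameter count times bytes per parameter. It ignores activations, text encoders, the VAE and any framework overhead, and the 10-billion parameter count below is a made-up example rather than the size of any model in this thread.

```python
# Bytes needed to store one parameter in each format.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "bf16": 2, "fp8_e4m3fn": 1}


def weight_gib(num_params, dtype):
    """Estimate the size of the weights alone, in GiB, for a given storage dtype."""
    return num_params * BYTES_PER_PARAM[dtype] / 1024**3


if __name__ == "__main__":
    params = 10_000_000_000  # hypothetical model size
    for dtype in ("bf16", "fp8_e4m3fn"):
        print(f"{dtype}: {weight_gib(params, dtype):.1f} GiB")
```

Going from bf16 to fp8 halves the weight footprint under this estimate; real VRAM use during inference will be higher than the weights alone.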
>>
>>108053101
OOMs with 12gb during decode.
>>
>>108053147
tiled vae
>>
>>108053067
>dima ivanov
That's Russian for Dimebag Darrell btw.
t. I'm from St. Petersburg
>>
>>108053156
It just gives you static with tiled. At least when it automatically shifts when it runs out of memory. Longest I can go on 3060 is 3:20 sec
>>
>>108053101
How long was the song?
>>
>>108053156
> Runs on Consumer Hardware
> Less than 4GB of VRAM required.
Seems like cumrag has fucked up again, also
> te takes 8 times longer than ksampler
>>
>>108053163
why aren't you in the front lines getting blown up by a drone like a good goy
>>
>>108053180
2min. i just ran the default template to test the gen time
>>
new
>>108053187
>>108053187
>>108053187
>>108053187
>>
>>108053156
for audio?
>>
0 good music gens since release and only three links were dropped. what the fuck bros is it a stinker?
>>
>>108053185
i'm Julien
>>
>>108053194
ye thats what mine defaulted to since mine ran out of memory as well
>>
>>108053205
i can't find that node
>>
>>108053224
might be because im using the --lowvram argument. console says it switched to tiled automatically since it said the vae decoding ran out of memory
>>
>>108053177
Never mind, now apparently I can't anymore even though it worked a second ago. Comfy and memory issues, name a better duo
>>
>>108053232
Does tiled mean the quality is worse?
>>
>>108054077
i honestly have no idea since i cant run the full decode to test it
>>
>>108054077
not necessarily but there might be seams hypothetically
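To illustrate where the seams could come from, here is a 1D toy sketch of tiled processing with overlap blending. It is an assumption-laden illustration, not how ComfyUI's tiled VAE decode is actually implemented: the function names are hypothetical and the "decoder" is just any function that maps a list of numbers to a list of the same length.

```python
def tile_ranges(length, tile, overlap):
    """Return (start, end) index pairs covering range(length) with overlapping tiles."""
    if not 0 <= overlap < tile:
        raise ValueError("overlap must be >= 0 and smaller than tile")
    step = tile - overlap
    ranges = []
    start = 0
    while True:
        end = min(start + tile, length)
        ranges.append((start, end))
        if end == length:
            break
        start += step
    return ranges


def tiled_apply(values, fn, tile, overlap):
    """Run fn on each tile separately and blend the overlapping parts."""
    out = [0.0] * len(values)
    weight = [0.0] * len(values)
    for start, end in tile_ranges(len(values), tile, overlap):
        piece = fn(values[start:end])
        n = end - start
        for i, v in enumerate(piece):
            # Linear ramp: samples near a tile edge count less in the blend.
            w = min(i + 1, n - i)
            out[start + i] += w * v
            weight[start + i] += w
    return [o / w for o, w in zip(out, weight)]


def subtract_mean(xs):
    """Toy 'decoder' whose output depends on everything it is shown at once."""
    mean = sum(xs) / len(xs)
    return [x - mean for x in xs]


if __name__ == "__main__":
    data = [float(x) for x in range(1, 11)]
    print("full :", subtract_mean(data))
    print("tiled:", tiled_apply(data, subtract_mean, tile=4, overlap=2))
```

When the per-tile function only looks at each sample on its own, the tiled result matches the full one. When it depends on the rest of the tile, as subtract_mean does, neighbouring tiles disagree and the blend can only soften the mismatch, which is the kind of effect that would show up as seams.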


