Jannies Gonna Freak EditionDiscussion and Development of Local Image and Video ModelsPrevious: >>108748625https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/tdrussell/diffusion-pipe>Zhttps://huggingface.co/Tongyi-MAI/Z-Image>Animahttps://huggingface.co/circlestone-labs/Animahttps://tagexplorer.github.io/>Qwenhttps://huggingface.co/collections/Qwen/qwen-image>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>LTX-2https://huggingface.co/Lightricks/LTX-2>Wanhttps://github.com/Wan-Video/Wan2.2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>Illustrioushttps://rentry.org/comfyui_guide_1girl>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkCollage: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
Full Size
>>108756548Why did you not put large_breasts in the prompt?
>>108756548naizuri material
>>108756589Looks like this gen took it all
https://blog.comfy.org/p/april-wrappedComfyUI's April Roundup is here, lets see what they delivered last month!>API nodes API nodes API nodesdamn, looks like local really died out!
brehs anything new dropped since ernie?
>>108756893video generations are good now
>>10875689235 stars status?
>>108756892did anyone even ask for these api nodes? does anyone even use them?
>>108756916I was checking comfy's commitlog... I only see cogvideox or you mean the ltx speedup (it was already fast anyway)?
>>108756925comfy asked alibaba to switch to api so they could make more money. comfyui is partly responsible for china abandoning local because comfyanonymous gleefully spreads his cheeks for api models. why release a model locally when comfyorg will advertise your api for free?
>>108756950>>108756924
the kino factory is reaching max capacity
china before api nodes:>hmmm we could go api but it's a hassle, nobody will use it, we would have to work with western payment processors and handle a bunch of account datacomfyanonymous:>woah, don't worry! api is fun and profitable, just watch! here, before you release that next local model why not try putting it on our API? we'll handle the complicated stuff, plus we can grow your audience to many western users who don't have an alibaba account.china after api nodes:>that's smart, thank you comfyorg for providing a centralized api-first platform for the entire world to use. we even noticed some former local users switching to api thanks to your hard efforts!
>>108756967seated at the kinoplex ready
>>108756977i only have 31 seconds of the song sounding good. i think i will try to get it to 1 minute and then post the result
>>108756375I would be too after wasting so much time and effort.
peanut looking rough ngl
pee my nut
>>108756838BUILD
>>108754424>>108754679This has easily some of the worst details since stable cascade and sana.
>>108757327gross
>>108757327nice
dead general
>>108757341its been very slow for awhile.
>>108757354We're all just waiting for K3, there's not much to talk about.
>>108757310This reads like they benchmemed it to oblivion by aggressively overfitting details of whichever images they are were testing for SSI, PSNR, etc.Also once again, Artificial Analysis confirmed non-credible joke even by the low standards for AI leaderboards.
>>108757341The only thing that kept it up was the anime model but the Anima discussion moved to real anime generals so there is nothing left here.
>>108757369K3?
>>108757369what's k3? anima or a new model? >>108757395kinda glad it moved over the. the astroturfing of that model was getting annoying.
>>108757251did they try to solve the plastic skin problem by giving everyone acne scars?
>>108757432>the astroturfing of that model was getting annoying.Was there any? I don't think people testing the model counts
I need more stars ...
>>108757462Discussing [new thing] makes you a shill.
>>108757251man-made horrors beyond comprehension
>>108757475i need more ram
>>108757623>>108757432Missed your gens 3dcgi bro>>108757395I think that now is tine for us to invest in some GPU and move to Qwen Image if we want to see improvments, they already invested on us a lot
>>108757674:) thanks, i usually get dunked hard in these threads as if I'm some lolcow. Qwen image is just too slow and the results aren't superb enough to justify the long wait times.
is that 3d cgi nova sdxl?
So I am done curating the dataset of my anima realism lora in the making and I've moved on to the captioning stage.I intend to use slight variations of this for different parts of my dataset:jpst DOT it / 4-2Ls (without space, replace DOT)My plan is this, I intend to teach AI a new style called @real photo (will be prepended to captions) which would be realism. The reason I am going for this is that first it can also be easily used with tag style prompts, second I am not going to need to overpower anima's existing but unreliable and inadequate understanding of photographic images during training. I am aware that this means I also don't get to benefit from said existing knowledge but I feel like it will work out. Well if it doesn't I will just prepend something like "A Photo." instead and try again.I am going for a clean professional photo look but I added "However, if the image is very noisy, has significant jpeg artifacts, is greyscale, vintage, or has a 'candid' iPhone photo look, you should mention that in a single brief sentence before describing the content. " part because there are a handful of greyscale, vintage, etc. images as a sort of regularization in the dataset.Can't say I am 100% satisfied with the test results but I think the outputs are workable.Probably gonna caption in 1 or 2 hours and the thread is kinda dead atm but if anyone has any input I am open to suggestions.And yeah the last paragraph is cope but feels nice to include.
So, where is Z-Image Edit ?
where anima preview 4 is
post some cute girls
post braces lora for anima
>>108757904i can't help but i wish you good luck
>>108757981I appreciate the good sentiment regardless anon.
Do you have a link to download "Qwen2-VL-2B-Instruct" as one file, please?The only link I found, it's split in two files and Forge can't use it.
I tried Klein 9b bf16 after fp8 and the results when upscaling are much better, and it's not even much slower despite just 12GB
Having surgery today bros, praying the next anima model is up after.
>>108758150Top or bottom? Get well soon sister.
>>108758098>qwen2>forgelol
Seems like the acestep xl variant SFT improves guitar solos.https://files.catbox.moe/liyv6s.mp3settings picrel. As you can see, this one is without "thinking". I will try one with thinking now.
>>108758150What do you expect to improve? Tell me, I'm all ears. It wont be NAI, it wont be realism.
>>108758231pic, oops
>>108758235local is already better than midjourney, because it doesn't look retarded.
>>108758098https://huggingface.co/turingevo/Qwen2-VL-2B-Instruct-gguf
Seedance 2.0 API was released but almost no one is using it. why?
>>108758150Good luck on your surgery anon.
:oMaybe on the 2nd week on nofap you get at least a hb 5/10. Surely 5 at level 2.
>>108758314Interesting
how much slower is anima with upscaling compared to sdxl with upscaling?
>>108757904I think anima needs finetune with realistic dataset first, lora isnt enough
>>108758274It sucks
>>108758345What kind of style?
>>108758378style affects gen times in anima? i thought it was only sampling methods that affected it.
>>108758373That might be the case. But I will still give it a go.
>>108758397No, my message is clear. What style is it? If it's anime, this isn't the place to answer that, wrong general. If it's realism or other style, everything's ok.
idk, once again I just prefer no 5hz lm for ace step 1.5...leaving it genning an "metal" instrumental.
adt, edg, hdg, hgg are dead tooseems like it's over for local
what ever shall we do!?!? the 1girls are gone!
>>108758475>my message is clearNot really, I asked what the gen time difference was and you implied style had something to do with it. So no, you only elaborated because your initial reply was vague due to malicious intent.
I like penis
A preview 4 just flew over my house
I don't think the Spark Chroma guy should be so depressed and sincerely apologizing for the model. It improved stability and works fine with loras.
ltx2.3 is so plastic and so i2v. that's why they quickly released this version...jews are not the chosen one of ai kek
>>108758897He was so humble which is the exact opposite of the average trainer it gave me whiplash
>>108758881big russ... please...
I don't even want preview 4. I just want final.
>>108758940Yeah main reason being a lot of people (myself included) have been holding off on baking loras/etc since they'll just be made redunant. But of course model baking takes time, so can't really complain.
>>108758940>>108758975I suspect we're going to get up to preview 5 or even 6 before we get the final. Just a gut feeling.
>>108759001Whatever happens, I trust Big Russ. Way better than Astralite or other retards kek
>>108759001It takes only like 3 hours to train a lora kek
>>108756970Comfy must die... But we need something as good before
>>108759061Meant for >>108758975
>>108759061Not that anon, but the problem with making a bunch of loras for the preview builds is it will end up creating compatibility issues in the future. Especially with the retards not labeling their Anima Loras properly.
>>108759105Who cares when it doesn't take that long to train tho just have fun anon
>>108759111I don't think you understand the problem. lmao typical indian coded thinking.
https://huggingface.co/SeeSee21/Z-AnimeAnyone tested this shit? The showcase images look like pure slop so idk if I should even bother
>>108759122Oh you're poor I wish you said that earlier so I could disregard your opinion sooner
>>108759141>he still thinks this is about training or gpu'sSAD!
>>108759130It's literally trained on slop, so no surprise there. And yes, it's shit.
>>108759147Concession accepted
>>108759163>look guys im retardedYeah, we see that.
>>108759172Meant for >>108758975
>>108759105>Especially with the retards not labeling their Anima Loras properly.why would this matter unless... unless of course you cant train yourself...
>localkeks arguing about loras like its 2023 comfy was right to abandon local
There he is good morning sar
>>108759218>>108756924
IM REDEEEEEEMING
>>108759218the "loras are actually bad" thing has to be the most retarded api cope i've ever seen.
>>108758940I just want the realism lora
>@grok generate an image of epstein vs diddy in a fighting game>got it, that sounds based and funny. here you go!>flux, generate an image of epstein vs diddy in a fighting game>ERROR, cannot compute request. for your own safety we removed all proper nouns from our dataset. here have a frog wearing sunglasses instead!>heh no matter, i'll train a lora and bypass your safety crap, local always wins! now.. to activate the diddy and epstein loras... wait why is epstein black now?? why do all the spectators have his face?? why are the features melting together???
so he just comes in here to troll really poorly. got it.
>>108759321he's still coping about getting kicked out in favor of yoland and about how much of a failure he is irl :(
>>108759312i find it kind of depressing that api models are just turning into online meme generators for normies.
Is comfy still the best API to hook up to my openclaw?
>>108759473I use the AniStudio API for openclaw, much better than cumfartpooi.
>>108759321what a great baker we have that totally stays on topic and doesn't fixate on some gay vendetta
head canon: the post
refresh.. refresh... refresh.. refresh... AGHHHH
>>108759525>keeping the tgread faggot-free>vendettadon't you have some rapists to service, faggot?
ITS UP ITS UP ITS UP
>>108759700my dick is up right now, thanks for noticing
>>108759700false, anima preview 4 is NOT out.
I WANT TO BELIEVE!!!
>>108759870Very comfy. Did you use an upscaler?
>>108759312I mean I be fair, all Grok.com imagine prompts are clearly enhanced by the actual LLM first. E.g. Grok gives this for yours, no way the literal original text alone produces this.
Did you guys know that bong_tangent is literally called that because ClownsharkBatwing is British? Like it means "a tangent by a Bong" lol
>>108759898The anon you replied to thinks all cloud models are a single giant magical file with no tool calling or additional models
>>108759923This is a direct quote of me a few days ago. Are the dumb bots back? Hopefully not
We love our shitposting bots. Really makes this place feel lively and full
>>108759896Thanks. Yeah it was 4 steps gen -> 2x upscale -> 4 steps denoise at half strength.
>>108759898wait, so api models can actually think?
>>108759960yeah they're thinking about downloading all your data and uploading it directly into the alphabet soups databases.
see you all next week, nothing for us today from big russ..
>>108759312are you using local models to gen anything other than 1girl standing? wtf are you thinking?
>>108758274>Seedance 2.0 API was released but almost no one is using itseedance 2 has been completely dominating twitter for the past couple weeks. of course you wouldnt know that because you spend every waking hour on 4chanhttps://x.com/kirawontmiss/status/2051353197533438299
>>108760203kek
>>108760203nta but twitter is a cess pit of grifting jeets, pakistanis, nigerians, and low class morons rage baiting. that site is nearly unusable. bluesky is the same but its pedophiles instead
>>108760203saas has gotten so good that most people don't even recognize it as AI when they see it. localkeks have spent so long coping with chroma and sdxl that they completely missed the cutting-edge innovations in API models. you see it all the time when people ask "how was this made??" and some localkek goes "well it's probably a blender render combined with a custom lora with controlnet and ipadapter with clever use of aftereffects" when it was really just a generic veo 3 slopgen
imagine typing all that out lole
>>108760287>saas is so good people mistake it for blenderi wouldn't brag about that
>saasussy is itchy again
>>108757327so good. whats model/prompt used? tried to drag into comfy and no luck
>>108760696>whats model/prompt used?NTA>tried to drag into comfy and no luck4chan removes the workflow json alongside other metadata after upload.
>>108760717aw shucks. thx anon
>>108760696>tried to drag into comfykeknta either, i think the checkpoint is novaanimal with a character lora
>high angle view, from above>gens an extremely low anglei love it when prompts are ignored completely or the inverse happens
>>108760772Other shit in your prompt is in conflict with that. Either increase the weight of it enough to override the conflicts or fix the conflicts.
>>108760780it was the "showing crotch" tag, and i had increased the weight of the angle tags already. some tags are just too strong.
>>108760840Showing crotch isn't a proper booru tag, not using proper booru tags can also result in weird shit.
After all this time, still no pokies slider for Chroma. I guess I should just kill myself.
>>108760772aerial helps
Are there decent turbo loras for Z Image? The ones I tried had super slopped crappy results.
Been saving my cum for preview 4
i've been waiting for his cum
tdrusted abandoned local for api....
an anon cum just flew over my house
>>108761116tomorrow.. i believe..
>>108761204
What are the best model for experimenting with general digital art stuff? I guess its ZIT? The models listed in the OP seems awfully 1girl centric.
>>108761275man you really need a better hobby julien
with how petty comfy is, I wouldn't be surprised if he's purposefully destroying the local ecosystem just to make sure ldg loses
>>108761334obviously there is not a single best model and SFW "digital art" makes more of them applicable, but sure, try ZI[T], Flux2 Klein or Qwen-Image
>>108761334Midjourney
>>108761334chatgpt
Is ltx better than wan for comer clipsIs the workflow on comfy pretty easy to get into? I have only used webiu > wan before and I find it kinda clunky
I don't get the Anima hype at all. Trialscumming NAI is way more convenient. In the time I'm sitting here waiting for a single Anima gen to finish, I can literally spin up 3 burner emails and pump out 90 images on NAI.
I wish I were 1girl
I wish I was 1boy with 2girls
>>108761586i pity you poorfag, i can't even imagine worrying about the price of a proper gpu
i wish i was 1boy with 4skin
>>108761029Slider are easy to train in ostris. He made a yt video about it
>>108761586>site shuts down for whatever reason>can no longer genHappens all the time. It's local or nothing.
>>108761586Lies/10
>>108761613Nah, I can run Anima Klein and Z image, but I rediscovered the concept of "just works". Irediscovered pressing a button, waiting 2 seconds, and getting KINO. I rediscovered not depending on Comfy, not thinking about Python, nodes, vibe coding nodes to fix things.I rediscovered not thinking about CivitAI, Loras, inpaint, or new shitmerges. I cleared so many local memes from my brain that I'm not going back unless they block my script.
is there a way to make lodestones less retarded
>>108761657ok poorfag, enjoy "waiting on a single anima gen" kek
>>108761657cute wall of cope.post some of your 2 second KINO
>>108761670But Anima is inferior to NAI
>>108761657In germany they would call someone like you a "Zahlschwein"
>>108761673>>108761673I don't post anime in non anime generals, and yeah I know you are thirsty as for good gens and especially for anime gens, but I'm not gonna give you the satisfaction.
>>108761692how many layers of cope do you have for not owning a proper gpu? lmao
>anima sucks >no i dont post in "non anime" threads He posts this every day now
>>108761334Hey cool I have one of those.
>>108761692but you post about non-local models in the local diffusion general?oh wait i already know the answer. >cumfart make api. api future of local and image 2 infographic sota sam altman!
>>108761701I dunno, but at least I feel way more free mentally and mentally healthy ever since I forgot about Python, nodes and civitai and all that local drama off low quality models and fake hype bullshit
>>108761722I saw that earlier kek. What could it be? nb4 hurrdurr its a cloud version of animah!!
>ever since I forgot about Python, nodes and civitai and all that local drama off low quality models and fake hype bullshityou didnt tho cause you post the same b8 here every day
>>108761723in this case you are not only a poorfag but also a techlet, so why do you even post on /g/ in /ldg/ Zahlschwein-anon?
instrumental. ace step 1.5 sft xl. non-turbo.https://files.catbox.moe/k0a351.mp3
>>108761798How do you get decent instrumental shit? It's always way worse than the instrumentals from gens with vocals for me
>>108761736neat!My recommendation is to include the prompt, if you can, or a style part of the prompt, because we all are inundated with 1girl stuff, but prompting pop and fine art is something we don't see that much.
>>108761742I'm here because I feel like it's my duty as a former local user who got bored to share my experience of leaving this cult and tell you how good and free I feel now.
geometric penis
>>108761831so you seek validation for purchasing a subscription Zahlschwein-anon?
>>108761738No, first time. Do this mental exercise: imagine a day without thinking about run_comfy_gpu.bat or update.bat. Imagine not being defensive while genning, not watching for broken or diluted outputs. Imagine not knowing about loras or Civit memes, not knowing that "feels off" sensation. Man, feels good getting local off my back. I appreciate what I learned, the programming and vibe coding, but I'm done with local.No local, no problems.
>>108761827thx.I have put convert $ofile -set comment "$P" -filter Lanczos -resize 50% x_$ofile # P=promptin to my script.
>imagine a day without thinking about run_comfy_gpu.bat or update.batof course Zahlschwein-anon is a windows user besides being a poorfagand this guy wants to lecture others about local tech lmao
>.bat holy brainlet
>>108761845>purchasing a subscriptionBuying a subscription to NAI is currently very difficult because they changed their payment processor and PayPal banned NAI. Discord and Reddit don't know what to do about this problem since NAI's customer base dropped and that's probably why v5 got delayed.
>>108761890But muh anima is so slow on windows with the heck'in integrated gpu :(
> 204 / 27 / Heh, maybe the GPUlet is a projection
https://files.catbox.moe/g1lxvk.mp3instrumentalcompletely pleasant imo. I'm not like the weird youtube people screaming that something is a mad hit, except ironically 8^)>>108761813careful prompting (acestep was trained on gemini tagged open music), "dcw" in acestep.cpp/gradio, max steps (capped in the cpp one at 100, would prefer 500 personally), idk, that sounds like about it. I didn't use "thinking" in that one, ie no audio codes.
>>108761924Half of the gens are cloud
>>108761813>>108761928oh yeah and the sft xl model - I haven't tried the turbo sft xl one...
>>108761924busy training
>>108761979Nice, the power of 2026 local models
anonette is really upset today
>>108762000At least thank him for deciding to breathe some life into this dead ass place
>t.
>>108762020scared of a little quiet newfren?
>>108762020"he" should do the same to the SaaS threads that go on for multiple days
>>108762046no one actually likes saas, they use it because they have no other option.
>>108762107>likes saas,saas like you. give it your credit card.
>>108761628With 12gb vram?
Komrades, we need moar gens.
>>108762178Good things come to those who wait.
>Maybe if I chain Z image to some uncensored anime model I'll get good NSFW realism! (Then 2 problems appear: fried and uncanny valley) but wait, a solution: make a realism LoRA on Anima to make Z image's job easier. And the local hype train rolls on: maybe when Anima controlnet drops I can crank the denoise without breaking the base concept :^)I get it, local clowns get their dopamine hits from patching broken shit while half baked updates keep dropping, but this is pathetic. How much handholding does the model need to do what you want? And you know well the next "great model" won't fix shit and you'll keep riding this local karma carousel forever.
>still seething
35 stars lmao
>>108762178what prompt would you like to see?
>windows poorfag still shitting himselflmao
>>108762196Local has their whole reward system totally warped. Making good gens doesn't even give them pleasure anymore. They get way more of a dopamine hit imagining they're winning some argument against Anon, vibecoding nodes, making loras (half fixes) to achieve that style or concept their failed model couldn't pull off, fixing Python and updating Comfy. They completely lost sight of why they even started down this path in the first place.
>>108762196It's not "local vs" thread anon.And using API chatbot is not /g related, my grandma does it daily too.
>replying to himself to seethe further My sides. Please anon no more I'm laughing too hard.
>>108762196>noooo one model must do it all, YOU CAN'T CHAIN MODELS, YOU CAN'T EXPERIMENT YOUR OWN WORKFLOWS AND SOLUTIONS,what a chud lmao
>still no gens
>>108762244Imagine talking about public buses in a sports car tunning thread...
so anyway...
...no gens
>>108762000He's REALLY upset. I wonder if it was something IRL that's got him like this.
So fun to play with all those free models and WF
miku
>>108762259>YOU CAN'T CHAIN MODELS, You're just chaining patches on patches. >YOU CAN'T EXPERIMENTExperimenting means you're clueless. >YOUR OWN WORKFLOWS These aren't workflows————— they're fixflows.>AND SOLUTIONS,The only thing remotely valid here are your "solutions" but even those are half baked garbage.
lilbro having quite the melty today upset over local diffusion
>>108762344>t. Zahlschwein scared of .bat files
>>108762353His mom refuses to buy him a GPU, so he's cooming on Samfaggot's catalogs
>>108762340>>108762334Nice repost
>>108762344the irony is that all saas cucks model hop trying to get their "AI influencer" to look consistent.
>>108762367implying there's rules about it. seethe more spergy mcvirgin.
>>108762344lol, you're so lazy that you're just copy-pasting the responses that your llm gives you
>>108759463I (heart emoji) the free market
>>108762334Yeah, keep having fun. Maybe you can make a workflow to generate better feminine feet.
>>108759463they can't do porn and image gen has no other use case
>>108762400I prefer high quality sfw than low quality slopped smut >>108762397
>>108762408good for you, go to the api thread, this one is for people running local models
>>108762384>doesn't put out>demands money before she will do anything for your>refuses simple requests>will ghost you out of nowhere and go offline foreverbetabux lmao
>>108762412>go to the api threadhes so mind broken that hes been here seething for months hell never leave i almost feel sorry its like his own personal hell
FACT: localcucks cant gen sexy girls so they are perpetually fuming
>windows poorfag is proud to be a 1girler
>>108761577Hello
>>108762452holy moly, a blurry 1girl. localkeks won't be recovering from this anytime soon.
>>108762452Every saas 1girl looks like a very thin coat of paint over the exact same image
Please spoon-feed me because I'm retarded. I want a free local model that can edit photos of cute girls to make them naked. I'm not going to post these edited images online or anything, they're just for me to fap to. Something like grok but uncensored. Please help me, I don't know what to do or where to start.Here is a picture of Kirby as payment for your kindness.
>>108762418nope, apifu is chad only>>108762538>holy moly, a blurry 1girl. localkeks won't be recovering from this anytime soonfirstly, its not 1girl look at the guy in the back. second, its blurry due to camera movement which is common in amateur photogaphy.maybe you should take a look at some real photos from time to time instead of letting your mind get poisoned by plastic slop like this >>108762340
im getting tired of this coping can the cloudfag do something else now
>>108762452apicuck generating stuff we generated months ago, once again you lose
>>108758231>>108758240Very nice settings. SFT was broken from my end, it seems changing shift fixed it. But hear me out:Those settings on Base-Turbo XL Merge (or perhaps even SFT). Insane results.https://huggingface.co/scragnog/ace-step-1.5-gguf-merge-models/tree/mainHere are oneshot DiT only results on Base Turbo XL merge from a LoRA I've been struggling with on Turbo (because I stopped training preemptively before I saw results so thought it was underbaked).Fate Gearhttps://vocaroo.com/1n3t24Kllhkz(Mastered, but also sounds very good without it)ACEStep XL just keeps getting better bros. Not only did that perfectly capture what I was going for, but we're reaching Udio levels of coherence in terms of quality.My LoRAs are instantly fixed. Musicality is massively improved. I thought I had "failures" before, now I don't have a single failure. If you trained a LoRA on base before, seriously, try this. That Fate Gear result was from I thought was a "bad" LoRA.Zutomayo LoRA I just trained on base with Turbo-Base merge.https://vocaroo.com/1mexIG2rYRXBNo, that's not a real ZUTOMAYO song... I can't believe it.Again, these are just first tries. The base model gets good composition results, but there's usually extremely bad audio quality. The Turbo model gets mediocre results with a voice that's similar but off. SFT model alone is better with voice, but composition still off. The composition of the Turbo XL model alone is nowhere near this quality for both results, even with thinking turned off. Not comparing to broken Turbo outputs this time (they truly were bad in comparison to this) I am hyped AF bros. The local music endgame is here.Now try doing DAT with Suno or Udio, I'll be waiting APIcucks.
Tried this with Klein and gas started coming out of my PC.
>>108762638Now, both LoRAs still need a slight bit of work which I attribute to low LRs while training, but local is eating so fucking good rn. Honestly, the quality this is squeezing out of ACEStep XL might even directly conflict with what its own devs intended (so good the model devs might hush hush or it might shadow the release of v2 if they had planned to commercialize it kek).
>>108762607>some real photosdo we have to pretend that everyone is walking around with 1mp phones from 2008 and not 48mp iphones with built-in stabilization?
>>108756892>ComfyUI's April Roundup is here, lets see what they delivered last month!
>>108762626>not holding beer jug properly (impossible physics)>robotic pose>plastic skin>flux chin>floating earing>jibberish textlocalkeks are not sending their best, thats for damn sure
>>108762638Retard questions but how many VRAM do you need and does it work easily on comfy
>>108762638Neato, that Zutomayo one is pretty good for something out of local.
>>108762656Glad I don't have to trick my PC into working for me... that'd be a pain in the ass!
>>108762701>how many VRAM do you need It should run on anything from low VRAM to high because it's been implemented as cpp version.>does it work easily on comfyYou don't want to use Comfy. There's still no DCW support, and its memory usage is much worse than ACEStep cpp. Use https://github.com/ServeurpersoCom/acestep.cppinstead.>>108762715Yep, not saying cloud can't do that quality because Udio (and only Udio) can, but it can't mimic the artists precisely without triggering safety kek.
>still no anima preview 4>still no realism anima lora that doesn't suckI sleep.
https://github.com/ace-step/ACE-Step-1.5/discussions/235imagine if all models had nice little guides like this
>>108762821tomorrow sir
>>108762821There are some nice ones like >>108762738Id imagine
>>108762821Artist? That's a nice style, anon.
>>108762867It's @justsomenoob, may be affected by the Mayli lora a bit as well.
>>108762738for the love of all that is holy more sexo jennies
>>108762384for me, it's turning api into local and local into api
>>108762901how do you turn local into api? cancel your gen at 50% and call yourself a pervert?
>>108762821in good time anon in good time
>>108762384API Throws more tantrums
cmon SaaSies we don't have to take this!
>>108762821AAAAAH
>>108762876Thanks. For only 130 some images it works very well.
>>108761979My own personal ick is fake nails lol
>>108762842>https://github.com/ace-step/ACE-Step-1.5/discussions/235The official guide is also pretty neat anon-kunhttps://github.com/ace-step/ACE-Step-1.5/blob/main/docs/en/Tutorial.md
need jenny and mayli scissoring please god anon
>>108763083I can't get track extraction to work
requesting mayli lora for anima
>generating image>wooops, looks like this promo we processed violates our guardrails after all, cant show you that!>thanks for the tokens though, goy!
>>108763083What a beautifully written guide. The introduction alone is supreme kino.
Is Tencent about to mog Anima potentially?>https://arxiv.org/pdf/2605.03652>"We present AniMatrix, a video generation model that targets artistic rather than physical correctness.">"We will publicly release the AniMatrix model weights and inference code."
>>108763350>"We will publicly release the AniMatrix model weights and inference code."y'all about to get Chinese culture'd
>>108763350>vidgen lmao kek
>>108763350Will it generate boobs and vagene?
>>108763356
>>108763350I'll believe it when I see it work.
>>108763350>video modelkek>tencentlmaooooooooooooooooooooooooooooooooooooooooooo
im like super negative about anything I don't use or do so like back off and be warned! :DDD
>>108763350>>108763360surely i'll be able to gen my mesugaki x ugly bastard foursome hentai with this, right?
>>108763360Do you really think the list of products and services on their site is the same as the abilities of the hypothetical video model? Just kek
>>108763356wan is pretty good for t2i in terms of qualityt2v and extracting frames is even better
>>108763350>https://arxiv.org/pdf/2605.03652>video modeltrash
Dumb question, can you train ltx 2.3 loras with a still image only ? Also does training lora with mid range 50 series gpu possible ?
>>108763443you can, it often works. i think it's easier on wan though.usually it's a matter of time and offloading to vram if your gpu's vram isn't so large
>can you train a motion and audio lora with a still image?
>>108763464How long ? 1 hour ? 3 ? 6 ? 12 ??
>>108763469i can. can you?
>>108763469>>108763443you could for wan, not sure about ltx 2.3
>download eros & sulphur>neither of them can do vaginas or penises that don't look like mangled atrocitieswhat the fuck man
>>108763474depends on your hardware and the training settings (the choice of optimizing algorithm, amount of sample images, resolution and many other things matter)guessing it'll be >1h on your hardware tho except if you can use lower resolutions and fast learning
>>108762626oh man, if only she were mid.
>>108762638It's just incredible, my LoRAs that are baked well sound very good, while those that are undertrained still sound much better. Not only is the composition better, but the lyrics following is almost entirely flawless now.Remember that Miku LoRA from a few threads ago? Here's that with a much more generic prompt and lyrics (not the same, so it may be more generic)-https://vocaroo.com/1cwMS4lkPWJeThe soundscape, synths which are part of the prompt sound so much better now. Here's the result I had shared with Turbo, now I can barely even hear the Miku voice in that output kek (voice is too drowned out)- https://vocaroo.com/1gonlSXjg3b3One flaw with ACEStep is LoRAs are not easily accessible. The community is too afraid of the music mafia kek. However, it may be useful for the community to share LoRAs that teach the model useful concepts. I made a rentry for those who are getting started with LoRA training https://rentry.co/s8fg8ber>>108763205>I can't get track extraction to workYou mean covers? The way I remember getting that to work is through the cover (no FSQ) ACEStep cpp setting on Turbo, 0.5 setting. I tried it briefly, there may be better methods with SFT/Base (in particular, it was hinted that lower values work on SFT, so the with the merge models that would be the case). Here was the result of World Is Mine- https://vocaroo.com/11E5J1UtRHkVThis is with no DCW. An even better result could be achieved by isolating vocals and placing them over original track since it does change that a bit. SFT or merge models probably yield even better results with that particular feature though (lower cover setting), so it may not be required, I'll test around and see.>>108763307Indeed, it's is superior philosophy that allows it to succeed.
>>108763490Name one base model that can.
>>108763492>>108763483I just want 360 view of the character face i want without turning into someone else
>>108763517i was assuming it'd be able to since they were trained on porn.
>>108762638>Fate GearI like the drumming. I don't know Japanese lol>ZutomayoThis is what I was trying to explain about the old 1.2, the vocals are heartfelt. anyway, what's the theme of this one?
Fresh>>108763550>>108763550>>108763550
>>108763511>Mikusounds neat, I just don't understand Japanese so I don't know what's going on.loras sound uh. hard. lol.