/g/ - /ldg/ - Local Diffusion General - Technology


File: highlights_g_108563476_17(...).jpg (2.36 MB, 3708x4152)

/ldg/ - Local Diffusion General Anonymous 04/09/26(Thu)19:15:28 No.108569503

Discussion and Development of Local Image and Video Models

Previous: >>108563476

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon

Anonymous
04/09/26(Thu)19:18:45 No.108569528

File: dwarfman.jpg (205 KB, 1000x700)

Anonymous
04/09/26(Thu)19:20:00 No.108569537

Blessed thread of frenship

Anonymous
04/09/26(Thu)19:21:20 No.108569547

File: file.png (838 KB, 698x1021)

Anonymous
04/09/26(Thu)19:26:36 No.108569578

>>108569547
>>108566619
>>108569190
>>108568213
Don't you feel like a piece of shit posting anime in a general where everybody pretends and nobody cares about it? Don’t you have a bit of remorse for being part of this big farce?

Anonymous
04/09/26(Thu)19:27:26 No.108569583

>>108569578
>nobody
don't talk on behalf of everyone, freak

Anonymous
04/09/26(Thu)19:28:26 No.108569589

>mfw Resource news

04/09/2026

>MAR-GRPO: Stabilized GRPO for AR-diffusion Hybrid Image Generation
https://github.com/AMAP-ML/mar-grpo

>HybridScorer: Score, sort, and cut large sets down fast with GPU-accelerated AI review
https://github.com/vangel76/HybridScorer

04/08/2026

>OrthoFuse: Training-free Riemannian Fusion of Orthogonal Style-Concept Adapters for Diffusion Models
https://github.com/ControlGenAI/OrthoFuse

>MIRAGE: Benchmarking and Aligning Multi-Instance Image Editing
https://github.com/ZiqianLiu666/MIRAGE

>Few-Shot Semantic Segmentation Meets SAM3
https://github.com/WongKinYiu/FSS-SAM3

>PoM: A Linear-Time Replacement for Attention with the Polynomial Mixer
https://github.com/davidpicard/pom

>RS Nodes for ComfyUI: Comprehensive custom node pack focused on LTXV audio-video generation, LoRA training and post-processing
https://github.com/richservo/rs-nodes

>FLUX.2 Small Decoder: Distilled VAE decoder for faster decoding and lower VRAM usage
https://huggingface.co/black-forest-labs/FLUX.2-small-decoder

>Nvidia snaps up AI chip packaging capacity as TSMC expands in U.S.
https://www.cnbc.com/2026/04/08/tsmc-nvidia-advanced-packaging-intel.html

04/07/2026

>Anima preview3 released
https://huggingface.co/circlestone-labs/Anima#preview3

>FrameFusion Image Interpolation: Compact image interpolation model for generating in-between frames
https://github.com/BurguerJohn/FrameFusion-Model

>An Inside Look at OpenAI and Anthropic’s Finances Ahead of Their IPOs
https://www.wsj.com/tech/ai/openai-anthropic-ipo-finances-04b3cfb9

>PrismML debuts energy-sipping 1-bit LLM in bid to free AI from the cloud
https://www.theregister.com/2026/04/04/prismml_1bit_llm

>ComfyUI Hires Fix Ultra - All in One
https://github.com/ThetaCursed/ComfyUI-HiresFix-Ultra-AllInOne

>ATSS: Detecting AI-Generated Videos via Anomalous Temporal Self-Similarity
https://github.com/hwang-cs-ime/ATSS

Anonymous
04/09/26(Thu)19:29:26 No.108569593

>mfw Research news

04/08/2026

>GenLCA: 3D Diffusion for Full-Body Avatars from In-the-Wild Videos
https://onethousandwu.com/GenLCA-Page

>Grounded Forcing: Bridging Time-Independent Semantics and Proximal Dynamics in Autoregressive Video Synthesis
https://arxiv.org/abs/2604.06939

>Evolution of Video Generative Foundations
https://arxiv.org/abs/2604.06339

>VersaVogue: Visual Expert Orchestration and Preference Alignment for Unified Fashion Synthesis
https://arxiv.org/abs/2604.07210

>Controllable Generative Video Compression
https://arxiv.org/abs/2604.06655

>Not all tokens contribute equally to diffusion learning
https://arxiv.org/abs/2604.07026

>FlowInOne: Unifying Multimodal Generation as Image-in, Image-out Flow Matching
https://arxiv.org/abs/2604.06757

>Holistic Optimal Label Selection for Robust Prompt Learning under Partial Labels
https://arxiv.org/abs/2604.06614

>Towards Robust Content Watermarking Against Removal and Forgery Attacks
https://arxiv.org/abs/2604.06662

>PhyEdit: Towards Real-World Object Manipulation via Physically-Grounded Image Editing
https://arxiv.org/abs/2604.07230

>Noise Constrained Diffusion (NC-Diffusion) Framework for High Fidelity Image Compression
https://arxiv.org/abs/2604.06568

>RefineAnything: Multimodal Region-Specific Refinement for Perfect Local Details
https://limuloo.github.io/RefineAnything

>Visual prompting reimagined: The power of the Activation Prompts
https://arxiv.org/abs/2604.06440

>MoRight: Motion Control Done Right
https://research.nvidia.com/labs/sil/projects/moright

>Fast-dVLM: Efficient Block-Diffusion VLM via Direct Conversion from Autoregressive VLM
https://arxiv.org/abs/2604.06832

>DesigNet: Learning to Draw Vector Graphics as Designers Do
https://arxiv.org/abs/2604.06494

>FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling
https://arxiv.org/abs/2604.06916

>When to Call an Apple Red: Humans Follow Introspective Rules, VLMs Don't
https://arxiv.org/abs/2604.06422

Anonymous
04/09/26(Thu)19:30:05 No.108569597

MYTH: api models are censored
FACT: api models are less censored than local models and are in fact trained on NSFW imagery

MYTH: api models are too expensive
FACT: it's actually quite cheap to use API through ComfyUI API Nodes. the price for api has gone down in comparison to the price of hardware

MYTH: api nodes collect your data and are unsafe to use
FACT: api is safer than local because nothing is stored on your hard drive. with local models, you need to download hundreds of loras and custom nodes, any of which could be infected

MYTH: an api can pull the plug at any time, why use something like that?
FACT: everything you generate can be saved to your desktop so nothing is lost

MYTH: it's impossible to train a custom style or character with api, loras make local way better
FACT: api can learn any style or character with a single image reference, which is much faster and smarter than loras

MYTH: if i buy api credits and don't like the model, that's money wasted
FACT: comfyUI's API nodes credit system allows you to prompt hundreds of cutting-edge api models. the credits share between models so you aren't locked in to any one ecosystem

MYTH: api users are poor and from third world countries
FACT: the top hollywood productions and anime studios all use api models. api is the weapon of choice for everyone world-wide

MYTH: discussion of api models is off-topic
FACT: api models are part of the comfyui experience and are relevant to this thread. combining api models with local workflows is still local

Anonymous
04/09/26(Thu)19:30:56 No.108569606

File: 76538754745724.jpg (1.81 MB, 1664x2432)

Anonymous
04/09/26(Thu)19:33:13 No.108569622

>>108569589
>>108569593
>>108569597
fuck off faggot

Anonymous
04/09/26(Thu)19:38:39 No.108569649

File: z_image_turbo-Q8_0-103342(...).jpg (661 KB, 1664x2432)

>>108569597
MYTH: you are not a cunt
FACT:

Anonymous
04/09/26(Thu)19:44:18 No.108569680

>>108569597
i know it's just a shitpost but
>api models are less censored than local models
always gives a good chuckle

Anonymous
04/09/26(Thu)19:48:23 No.108569709

>>108569680
you can do AI porn with grok, and the quality is miles ahead what local can do
https://www.reddit.com/r/Grok_Porn/

Anonymous
04/09/26(Thu)19:49:37 No.108569715

>>108569709
maybe for low-tier gooners but i have pristine taste anon

Anonymous
04/09/26(Thu)19:51:26 No.108569723

File: LET HIM COOK.png (433 KB, 3188x1044)

>>108569715
fair enough, but I don't like where this is going, it's obvious that civitai is trying to separate themselves from NSFW, at some point they'll completely remove the porn loras, the writing is on the wall
https://civitai.com/articles/28369

Anonymous
04/09/26(Thu)19:51:47 No.108569724

>>108569715
he says, while using wan 2.2

Anonymous
04/09/26(Thu)19:53:21 No.108569734

>>108569709
>you can do AI porn with grok
you could make ai porn with grok until jeets ruined it.

Anonymous
04/09/26(Thu)19:54:06 No.108569740

>>108569734
>until jeets ruined it.
many such cases...

Anonymous
04/09/26(Thu)19:57:30 No.108569752

>>108569724
api-non-frens are so far behind they don't even realize it
acting like someone playing minecraft with 2k*2k textures and bragging about graphics

Anonymous
04/09/26(Thu)19:59:14 No.108569762

https://youtu.be/i_S615aKLfI

Anonymous
04/09/26(Thu)20:01:43 No.108569773

>>108569709
>slow motion
>slopped to hell and back

Lol, that is just Wan tier slop. If they had anything close to Seedream 2 but uncensored there's no way they would allow NSFW with it.

Anonymous
04/09/26(Thu)20:02:02 No.108569775

File: 1757985478383624.png (95 KB, 1424x124)

>>108569762
really nigga?

Anonymous
04/09/26(Thu)20:02:47 No.108569778

>>108569709
you can tell it's better than wan because it doesn't turn into airbrushed vaseline plastic after the first frame kek

Anonymous
04/09/26(Thu)20:04:09 No.108569784

File: please let that happen.png (76 KB, 220x220)

>>108569773
>If they had anything close to Seedream 2 but uncensored there's no way they would allow NSFW with it.
dude, I don't think the world is ready for the day we'll get a local model as good as Seedance 2.0... it's gonna be great

Anonymous
04/09/26(Thu)20:05:12 No.108569788

>>108569778
It starts out as plastic though.

Anonymous
04/09/26(Thu)20:07:31 No.108569798

>>108569784
by the time that happens (2050), the rest of us will be living in a full neurolinked metaverse world with API nodes, but you can keep jerking it to outdated videos

Anonymous
04/09/26(Thu)20:07:34 No.108569799

File: Best we can share is Wan 2.2.png (194 KB, 800x534)

>>108569784
>it's gonna be great
Do you seriously think we're gonna give you something this good gweilo?

Anonymous
04/09/26(Thu)20:07:51 No.108569801

>>108569778
And I prompted for the vaseline.

Anonymous
04/09/26(Thu)20:13:00 No.108569825

>>108569798
it's 2026, have you not noticed the api pattern yet?
>hey here is our new model, look at how great it is!
and then 3 weeks later they cripple it and hope most people won't notice(they won't because most of their user base is brown) and then sit back and count money while people burn credits trying to gen the same slop they genned on day one.

Anonymous
04/09/26(Thu)20:14:43 No.108569832

>>108569825
>and then 3 weeks later they cripple it
Seedance didn't wait 3 weeks before crippling it, they crippled it before they deployed their API to the rest of the world lmao, at least Sora had the decency to be cool to play around with at the very beginning, I know the bar is low as fuck but it is what it is

Anonymous
04/09/26(Thu)20:17:49 No.108569845

wait, all these api outputs are crippled? yet they're still better than local?? oh nononono api has been holding back this whole time, just imagine how far ahead they REALLY are. it's so over

Anonymous
04/09/26(Thu)20:19:30 No.108569854

give it a rest lilbro

Anonymous
04/09/26(Thu)20:20:16 No.108569855

>>108569845
I don't get this meme. No one is saying local is ahead of cloud right now

Anonymous
04/09/26(Thu)20:27:15 No.108569881

>>108569845
>all these api outputs are crippled
>they're still better than local
grim, even Mike Tyson with one leg could destroy me, so yeah, a crippled API service is still better than what local shit is producing (and I hope local will step up its game one day, and no, finetuning SDXL for the 14th billionth time won't do it)

Anonymous
04/09/26(Thu)20:31:06 No.108569897

>still crying about SDXL 3 years later

Anonymous
04/09/26(Thu)20:34:28 No.108569914

>>108569723
Why are civittards such entitled little shits? If Visa is cutting off credit card payments, your business is done. You go under, you cease to exist. WTF is Civit supposed to do against that? I'm not defending any of the other bullshit about the platform but for this war with payment processors it seems like they found the least bad option.

Anonymous
04/09/26(Thu)20:34:58 No.108569916

Local falling behind means there is no reason to waste money on the current overpriced hardware shortage. if Nvidia releases the 6000 series you'll have no reason to buy it because even if the compute power per dollar was insane, there are no good models to fully take advantage of it anyway.

API cucking local and withholding even outdated video models like wan 2.5 is saving you money. do you know how much money you'd be wasting on this hobby if local had all the good models to choose from? do you know how much debt you'd go into if you could run SORA locally? these companies are saving you from yourself by not making these models open source, and doing the right thing by destroying them instead. You're welcome.

Anonymous
04/09/26(Thu)20:37:26 No.108569931

>>108569914
we're not angry at civitai because they got cucked by visa, we know they can't do anything against (((them))), what we don't like is the gaslighting, they're not honest at all about what's really goin on, people just don't like being lied to, shocker I know

Anonymous
04/09/26(Thu)20:44:09 No.108569965

>>108569578
Based, until tdrussell stops ignoring our anime threads, we will continue to protest! >:^(

Anonymous
04/09/26(Thu)20:45:11 No.108569970

>>108569723
that poster's a whiny bitch

Anonymous
04/09/26(Thu)20:46:29 No.108569974

File: hunyuan comfy.png (37 KB, 1394x276)

>>108569916
unironically this. models like wan 2.5, seedance 2, seedream etc don't fit on local hardware, and quantcoping is just sad. anima is 2b parameters yet it's slower than sdxl which is bigger. and these api models are easily 16b+ minimum, with video ones easily reaching 100b.
cumfart cried and threw a tantrum over hunyuan releasing a model too big for localpoors to run, so now all of china realized that local doesn't want these models anyway because they're too big. comfyorg unironically saved local from having to buy H200s
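
For scale, the parameter counts mentioned above translate into weight sizes roughly as follows. This is a minimal sketch, assuming weights-only memory (no activations, text encoder, or VAE) and taking the 16B and 100B figures as the rough guesses given above, not confirmed model sizes. `weights_gb` is a made-up helper name.

```python
def weights_gb(params_billions: float, bits_per_param: float) -> float:
    """Approximate size of the weights alone, in GB (1 GB = 1e9 bytes)."""
    total_bytes = params_billions * 1e9 * bits_per_param / 8
    return total_bytes / 1e9


# Compare a 2B model with the guessed 16B and 100B sizes at common precisions.
for params in (2, 16, 100):
    sizes = {f"{bits}-bit": round(weights_gb(params, bits), 1) for bits in (16, 8, 4)}
    print(f"{params}B params -> {sizes}")
```

Even at 4-bit, a 100B model would need about 50 GB for the weights alone under these assumptions, which is the point being made about consumer cards.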

Anonymous
04/09/26(Thu)20:49:07 No.108569980

is the fud posting going as planned

Anonymous
04/09/26(Thu)20:50:38 No.108569988

>>108569974
BASED! API nodes saved local from debt. Plus we can still use these models in ComfyUI anyway through the Partner Nodes program

Anonymous
04/09/26(Thu)20:50:43 No.108569989

>>108569974
>cumfart cried and threw a tantrum over hunyuan releasing a model too big for localpoors to run, so now all of china realized that local doesn't want these models anyway because they're too big.
based comfy, no one will care if they can't even run it in the first place

Anonymous
04/09/26(Thu)20:55:42 No.108570012

>>108569989
This. SDXL hasn't even been fully explored, we're still discovering new ways to use 'old' tools. Keep that bloated useless WAN crap on the API.

Anonymous
04/09/26(Thu)20:56:15 No.108570015

>>108569974
>seedance 2
i forgot the global release was today.
any API chads gen some kino?

Anonymous
04/09/26(Thu)20:56:23 No.108570017

anima won. Local won. The gay cuck by the name of ani lost

Anonymous
04/09/26(Thu)20:57:57 No.108570023

File: ComfyUI_20438.png (2.08 MB, 1200x1600)

So... are we gonna get some images with these API-glazing shitposts, or is this guy a fucking poor-ass promptlet?

Thrill me with your 10kW gens, you faggot.

Anonymous
04/09/26(Thu)20:58:11 No.108570025

>>108570015
Tried to gen some cute anime 1girl farts and got filtered. It's pretty useless.

Anonymous
04/09/26(Thu)21:00:25 No.108570035

>>108570015
>any API chads gen some kino?
they're gonna kill the golden goose by censoring it like that, what's the point of making such an incredible model if you don't allow people to make fun things with it? I will never understand this

Anonymous
04/09/26(Thu)21:01:31 No.108570040

Why tdrussell ignores:
/hdg/
/udg/
/edg/
/adt/
/vtai/
The pokemon one
/hgg/
The /d/ generals where they explore extreme fetishes and tags

Why he invest his time here?

Anonymous
04/09/26(Thu)21:02:00 No.108570045

https://civitai.com/models/2383017/anima-cat-tower
>massive changes to Anima's default style (albeit, slopped to high hell)
>improvements to anatomy
>improvements to consistency
>same or better character knowledge
I thought Anima was untrainable and forgot all its base knowledge if you so much as sneezed on the weights

Anonymous
04/09/26(Thu)21:03:26 No.108570050

>greatest SOTA API model of all time releases to the deafening sound of crickets

Anonymous
04/09/26(Thu)21:06:57 No.108570063

>>108570040
SAAAAR WHY HE IGNORE IT!@!!!!!!!!!!!

Anonymous
04/09/26(Thu)21:08:26 No.108570066

>>108570040
jealous that we got his attention but you don't?

Anonymous
04/09/26(Thu)21:09:36 No.108570073

>>108570045
Anon is going to reply to this calling cattower slop, which it is, but it still proves that training more than simple LoRAs works well.
It was always a farce pushed by retards using SDXL hyperparams.

Anonymous
04/09/26(Thu)21:12:19 No.108570086

>>108570040
Comfy will never love you

Anonymous
04/09/26(Thu)21:14:07 No.108570093

>>108570040
And >>>/jp/2huai?

Anonymous
04/09/26(Thu)21:27:11 No.108570147

>>108570015
>any API chads gen some kino?
so that anon only reposts stuff from twitter and reddit unfortunately

Anonymous
04/09/26(Thu)21:31:53 No.108570185

>>108570050
Because not only is it censored but it's fucking expensive. Every API service locks it behind one of their highest sub tiers so no one but YouTube grifters is using it.

Anonymous
04/09/26(Thu)21:33:22 No.108570193

>>108570050
too censored to be useful, people tasted Sora 2 at its best, hard to go back to something more cucked

Anonymous
04/09/26(Thu)21:34:59 No.108570201

File: 1761391215412973.jpg (1.46 MB, 3328x4864)

>>108569606

Anonymous
04/09/26(Thu)21:45:32 No.108570232

File: 1769226583411058.png (3.6 MB, 1128x2048)

Anonymous
04/09/26(Thu)21:55:34 No.108570280

File: ComfyUI_Anima_00040_.png (1.2 MB, 1024x1024)

Acestep.cpp is insane. It does not consume all my VRAM all at once, only fills it up when I run it, so I can run comfyUI in conjunction with it. Plus, it's ultra fast. Unlike every other iteration of ACEStep UIs, it also allows seamless switching between XL Turbo and XL SFT.

XL Turbo 80s Jap groove gen
https://vocaroo.com/1hpzg5IVZxPe

The prompt is everything, it makes a huge difference in output quality, so like image gen it makes sense to try different ways and styles to prompt same thing, and remove tokens if something sounds off.
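
One way to do the "remove tokens if something sounds off" part systematically is to generate variants that each drop one piece of the caption, then gen each and listen. A minimal sketch, assuming the caption can be split into comma-separated parts; `drop_one_variants` and the example parts are made up for illustration and have nothing to do with acestep.cpp's own interface.

```python
def drop_one_variants(caption_parts: list[str]) -> list[str]:
    """Return one caption per part, each with exactly that part removed."""
    return [
        ", ".join(part for j, part in enumerate(caption_parts) if j != i)
        for i in range(len(caption_parts))
    ]


# Hypothetical caption parts, only to show the shape of the output.
parts = ["80s groove", "slap bass", "gated reverb snare", "female vocals"]
for variant in drop_one_variants(parts):
    print(variant)
```

If one variant suddenly sounds better, the part that was dropped from it is the likely culprit.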

Anonymous
04/09/26(Thu)21:57:11 No.108570289

>>108570280
but can i run it with 4gigs vrams?

Anonymous
04/09/26(Thu)22:03:09 No.108570316

>>108570280
what does a music prompt even look like?
>80's style jap groove, 120bpm, "lyrics"
i've never even looked at music genning, is it all prompt or is it more like a DAW with a prompt?

Anonymous
04/09/26(Thu)22:10:36 No.108570356

>>108570316
>what does a music prompt even look like?
Depends completely on the model and what kind of language it was trained on, just like image models. Udio worked extremely well with rateyourmusic tags because that's what it was trained on (until it started giving you fucking moderation errors every fucking time if you copy pasted the tags from an album you like).
https://ace-step.github.io/ace-step-v1.5.github.io/#XLDemos
Judging by their example prompts it sounds like it was trained on natural language, but I intend to test RYM tags just in case.
>>108570280
>Acestep.cpp is insane.
Works better than ComfyUI?

Anonymous
04/09/26(Thu)22:17:30 No.108570395

>>108570023
just imagine 10kW jennies

Anonymous
04/09/26(Thu)22:20:58 No.108570414

>>108570289
>but can i run it with 4gigs vrams?
You should be able to, Q4 is below 4GB in size (and as long as the total GB is less than your VRAM it should all fit).

https://www.serveurperso.com/temp/acestep.cpp-win64/models/
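
The rule of thumb above (total model size below your VRAM) can be checked before downloading anything big. A minimal sketch, assuming you know your card's VRAM in GB; the `headroom_gb` margin for buffers is my own assumption, not something from acestep.cpp, and both function names are made up.

```python
import os


def file_size_gb(path: str) -> float:
    """Size of a file on disk in GB (1 GB = 1e9 bytes)."""
    return os.path.getsize(path) / 1e9


def fits_in_vram(model_gb: float, vram_gb: float, headroom_gb: float = 0.5) -> bool:
    """True if the model files plus an assumed safety margin fit in VRAM."""
    return model_gb + headroom_gb <= vram_gb


# Example: a 3.5 GB quant on a 4 GB card, with the assumed 0.5 GB margin.
print(fits_in_vram(3.5, 4.0))
```

For several model files, sum the `file_size_gb` results and pass the total as `model_gb`.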

>>108570316
There's two separate prompts, a caption and a lyric portion. The LM turns them into codes that the model understands, and the model then outputs the codes for the song, which get translated to either mp3/FLAC/WAV. In this case for the caption I use
>A groovy 80s synth-pop track featuring sultry female vocals, blending English and Japanese lyrics with flirtatious call-and-response delivery. The timbre pulses with a funky slapped bassline, shimmering arpeggiated synths, gated reverb snare drums, and electric piano stabs. The emotion is playful liberation, infectious joy, and cheeky rebellion. Human sounds include syncopated finger snaps, ecstatic "Ha!" shouts from both vocalists, and layered harmonies during the chorus.

I use LLMs to enhance the caption (could be done right thru acestep cpp itself, and it can also be done with Grok/Gemini).
Like with API, you can technically just lazy prompt it straight thru the UI with the built in prompt enhancer, though I like flexibility outside of that.

Lyrics were https://files.catbox.moe/8xof5r.txt
They can be provided in a variety of ways, but I always adhere to ACEStep's instructions for them. Like image gen, there's things that can be modified like BPM, duration, keyscale, which adjust speed and style of the song, as well as CFG which adjusts prompt adherence and creativity between gens.
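
To keep track of which settings produced which gen, the knobs listed above can be bundled into one record. A minimal sketch: the field names, defaults, and checks below are all hypothetical and are not the actual acestep.cpp options, so check the project's own docs for the real parameter names and ranges.

```python
from dataclasses import dataclass


@dataclass
class SongRequest:
    """Hypothetical bundle of the settings mentioned above (not the real API)."""
    caption: str               # style/instrument description
    lyrics: str                # lyric text, may be empty for instrumentals
    bpm: int = 120             # tempo (assumed default)
    duration_s: float = 120.0  # song length in seconds (assumed default)
    keyscale: str = "C major"  # key and scale (assumed default)
    cfg: float = 5.0           # prompt adherence vs. creativity (assumed default)

    def validate(self) -> list[str]:
        """Return a list of problems; an empty list means the request looks sane."""
        problems = []
        if not self.caption.strip():
            problems.append("caption is empty")
        if self.bpm <= 0:
            problems.append("bpm must be positive")
        if self.duration_s <= 0:
            problems.append("duration must be positive")
        if self.cfg < 0:
            problems.append("cfg must not be negative")
        return problems


req = SongRequest(caption="A groovy 80s synth-pop track", lyrics="", bpm=118)
print(req.validate())
```

Saving one such record next to each output file makes it easy to go back and change a single setting at a time.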

Anonymous
04/09/26(Thu)22:25:42 No.108570445

>>108570356
>Works better than ComfyUI?
I don't think the Comfy ACEStep implementation has ever been without issues, dev on this seems to have completely halted. It has more features which I will test soon, two separate cover modes with cover-nosfq apparently being highest quality, and back when I used the first ACEStep 1.5 on Comfy, it was quite slow when a generation had some kind of change to the caption, so I think this is even better.

Anonymous
04/09/26(Thu)22:43:48 No.108570527

ltx 2.3 distilled is pretty fun (and fast)

https://files.catbox.moe/i2t7bx.mp4

Anonymous
04/09/26(Thu)22:44:00 No.108570529

File: deWA_zi_00025_.png (2.45 MB, 1792x977)

Anonymous
04/09/26(Thu)22:47:05 No.108570547

>>108570201
can you make her look real tho

Anonymous
04/09/26(Thu)22:59:35 No.108570597

>>108570547
with gpt image 2 launching soon, yes

Anonymous
04/09/26(Thu)23:13:48 No.108570672

File: 1758263119977770.png (2.36 MB, 1536x1536)

https://civitai.com/models/1277670/janku-trained-chenkin-and-noobai-rouwei-illustrious-xl?modelVersionId=2786084

I still think illustrious is best for animu more or less. this one has the regular illustrious style but also the deeper colors of base noobAI.

Anonymous
04/09/26(Thu)23:14:54 No.108570678

File: 1775138852648467.png (2.37 MB, 1536x1536)

>>108570672

Anonymous
04/09/26(Thu)23:24:26 No.108570733

>>108570445
>I don't think the Comfy ACEStep implementation has ever been without issues,
I'm trying it right now. You weren't kidding, this shit is jank. For some reason the "thinking" step is using my CPU instead of GPU so it's slow as FUCK. Thankfully, it seems that you can skip that step. But in order to do so I have to use a different set of nodes. This shit is weird and jank and confusing and now I'm considering trying the .cpp setup like you said.

Anonymous
04/09/26(Thu)23:35:08 No.108570797

>>108569916
Local is still thriving, just not on video yet. This is obviously because video is the most prohibitively expensive thing to train the way ClosedAI and Bytedance have, but one can hope some AI lab makes a breakthrough with so many (including BFL) working on the problem.

Anonymous
04/09/26(Thu)23:37:24 No.108570817

>>108570672
>>108570678
As someone who uses only base Noob models and their derivatives, I can assure you that many use the Noob name for marketing without understanding what the model does when merging. Also, most 4chan gens aren't from base Noob but from the WAI/Janku branch. Few people know how to prompt these models properly, and the results you're imagining likely aren't from Noob.

Anonymous
04/09/26(Thu)23:43:19 No.108570847

File: ComfyUI_Anima_00039_.png (867 KB, 1024x1024)

Wow, gothic metal music now sounds absolutely insane out of the box
https://vocaroo.com/1Q4Llaeb3gi2

>>108570733
Yep, it absolutely is for some reason. The .cpp is counter-intuitively (due to using ggml) the fastest, most lightweight and cleanest version of ACEStep, because no good python UI exists for ACEStep. Plus .cpp is compatible with every feature plus more, and a good approach would be to just port the .cpp straight into ComfyUI as a custom node.

Anonymous
04/09/26(Thu)23:45:04 No.108570857

File: 1751534000393376.png (2.55 MB, 1536x1536)

>>108570817
I had a previous version saved, downloaded the latest version, seems decent.

but I also have base noob 1.0 cause it's good to have something without any merges or whatever.

Anonymous
04/09/26(Thu)23:49:10 No.108570871

>>108570847
And this is not surprising, seems like every python UI that's not Comfy for ACEStep is vibecoded, including the actual official UI, and actual devs like lllyasviel or Auto1111 are not available to work on proper UIs. As for Comfy, I'm guessing it's just not too compatible with the current architecture.

Anonymous
04/09/26(Thu)23:49:58 No.108570875

>>108570672
>>108570678
>>108570857
I judge a model based on how well it can do toilet sitting+undies down. you'd be surprised at how hard it is for many models to get right.

Anonymous
04/09/26(Thu)23:51:17 No.108570886

File: 1752647576987499.png (2.89 MB, 1536x1536)

>>108570857
>>108570875
yeah, but in general illustrious/noob based models do great, we've come a long way since pony which needed tons of tinkering to get okay anatomy.

Anonymous
04/10/26(Fri)00:04:59 No.108570918

>still no happysamefacehorse
it's over

Anonymous
04/10/26(Fri)00:05:43 No.108570920

File: deWA_zi_00030_.png (2.35 MB, 1792x977)

forbidden technique

Anonymous
04/10/26(Fri)00:13:09 No.108570940

>>108570847
>https://vocaroo.com/1Q4Llaeb3gi2
Sounds like it keeps changing its mind whether a woman or a man is singing at the part where it gets loud, kek. Also, the quiet parts remind me of Let Us Cling Together by Queen.

Anonymous
04/10/26(Fri)00:15:45 No.108570949

File: happyhorse08.png (452 KB, 869x970)

>>108570918
trust the plan saaaar
is 48gb seedance level local tomorrow in comfyui will be optimized FAST

Anonymous
04/10/26(Fri)00:20:39 No.108570973

>>108569773

Crazy part is that seedance 2.0 in the testing phase had full on nudity with no issues
