[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion and Development of Local Image and Video Models

Previous: >>108679668

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: 384.jpg (387 KB, 1024x1536)
387 KB JPG
>>
File: 1530371707197-0.gif (15 KB, 633x758)
15 KB GIF
I just want to make anatomically correct porn on demand
>>
File: zImageturbo_00069_.jpg (650 KB, 1264x1672)
650 KB JPG
>>
>>108681486
I just want github stars and a million dollars
>>
I don't want being part of /ldg/
>>
>>108681486
>I just want to make anatomically correct porn on demand
grok
>>
>>108681494
you what bloody benchod
>>
>>108681498
>grok
lol they can't even create bikinis.
>>
>>108681494
Kys or go to /agd/
>>
Blessed thread of frenship
>>
So what do big corpos expect in return for their millions invested into comfy (hard mode: no “huuur duurn API nodes)
>>
Comfy gave me trust issues
>>
>>108681517
well cooked spaghetti
>>
kill ani IRL
>>
>Vrowsing the interwebs for celebs and insta whores
>creating tons of deepfake nuds of them not caring about the dumb UI drama.
>>
>>108681529
based and i also do this
>>
>>108681529
Just look at all these catbox'd samples!
>>
File: zImageturbo_00073_.jpg (647 KB, 1264x1672)
647 KB JPG
>>
>mfw Resource news

04/24/2026

>MAI-Image-2
https://playground.microsoft.ai/chat

>ComfyUI-NAG-Extended: NAG support for Flux 2 Klein and Anima
https://github.com/BigStationW/ComfyUI-NAG-Extended

>UniGenDet: A Unified Generative-Discriminative Framework for Co-Evolutionary Image Generation and Generated Image Detection
https://github.com/Zhangyr2022/UniGenDet

>VARestorer: One-Step VAR Distillation for Real-World Image Super-Resolution
https://github.com/EternalEvan/VARestorer

>Sapiens2
https://github.com/facebookresearch/sapiens2

>Vista4D: Video Reshooting with 4D Point Clouds
https://eyeline-labs.github.io/Vista4D

>Pre-process for segmentation task with nonlinear diffusion filters
https://github.com/cplatero/NonlinearDiffusion

04/23/2026

>ParetoSlider: Diffusion Models Post-Training for Continuous Reward Control
https://shelley-golan.github.io/ParetoSlider-webpage

>DynamicRad: Content-Adaptive Sparse Attention for Long Video Diffusion
https://github.com/Adamlong3/DynamicRad

>Normalizing Flows with Iterative Denoising
https://github.com/apple/ml-itarflow

>LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model
https://github.com/inclusionAI/LLaDA2.0-Uni

>Illustrious XL & NoobAI-XL Style Explorer
https://github.com/ThetaCursed/Illustrious-NoobAI-Style-Explorer

>AI Model & ‘MAGA’ Influencer Emily Hart Unmasked as Indian Man
https://www.yahoo.com/news/articles/ai-model-maga-influencer-emily-091027504.html

04/22/2026

>Embedding Arithmetic: A Lightweight, Tuning-Free Framework for Post-hoc Bias Mitigation in Text-to-Image Models
https://github.com/cvims/EMBEDDING-ARITHMETIC

>Denoising, Fast and Slow: Difficulty-Aware Adaptive Sampling for Image Generation
https://github.com/CompVis/patch-forcing

>TS-Attn: Temporal-wise Separable Attention for Multi-Event Video Generation
https://github.com/Hong-yu-Zhang/TS-Attn

>AnyRecon: Arbitrary-View 3D Reconstruction with VDM
https://yutian10.github.io/AnyRecon
>>
I don't feel like genning, I don't feel like opening Comfy, I don't feel like updating it, I don't feel like anything after the pathetic move they pulled on all of us today.
>>
okay
>>
File: 1767426079441396.png (1.12 MB, 1024x1024)
1.12 MB PNG
How is WAI v17? v14 was good, v15 was ass and v16 was good again, don't know what to expect
>>
>>108681535
nice try glowie
>>
>>108681547
Fight back. Throw molotovs in their offices.
>>
>mfw Research news

04/24/2026

>AttentionBender: Manipulating Cross-Attention in Video Diffusion Transformers as a Creative Probe
https://arxiv.org/abs/2604.20936

>KD-CVG: A Knowledge-Driven Approach for Creative Video Generation
https://kdcvg.github.io/KDCVG

>Linear Image Generation by Synthesizing Exposure Brackets
https://arxiv.org/abs/2604.21008

>Exploring the Role of Synthetic Data Augmentation in Controllable Human-Centric Video Generation
https://arxiv.org/abs/2604.21291

>AttDiff-GAN: A Hybrid Diffusion-GAN Framework for Facial Attribute Editing
https://arxiv.org/abs/2604.21289

>Projected Gradient Unlearning for Text-to-Image Diffusion Models: Defending Against Concept Revival Attacks
https://arxiv.org/abs/2604.21041

>Sparse Forcing: Native Trainable Sparse Attention for Real-time Autoregressive Diffusion Video Generation
https://arxiv.org/abs/2604.21221

>StyleVAR: Controllable Image Style Transfer via Visual Autoregressive Modeling
https://arxiv.org/abs/2604.21052

>Building a Precise Video Language with Human-AI Oversight
https://linzhiqiu.github.io/papers/chai

>Seeing Isn't Believing: Uncovering Blind Spots in Evaluator Vision-Language Models
https://arxiv.org/abs/2604.21523

>ID-Eraser: Proactive Defense Against Face Swapping via Identity Perturbation
https://arxiv.org/abs/2604.21465

>When Prompts Override Vision: Prompt-Induced Hallucinations in LVLMs
https://pegah-kh.github.io/projects/prompts-override-vision

>Seeing Fast and Slow: Learning the Flow of Time in Videos
https://seeing-fast-and-slow.github.io

>Addressing Image Authenticity When Cameras Use Generative AI
https://arxiv.org/abs/2604.21879

>Multiscale Super Resolution without Image Priors
https://arxiv.org/abs/2604.21810

>Prototype-Based Test-Time Adaptation of Vision-Language Models
https://arxiv.org/abs/2604.21360

>Latent Denoising Improves Visual Alignment in Large Multimodal Models
https://arxiv.org/abs/2604.21343
>>
>>108681517
what happened to blender?
the list of corpos that have invested in bender is a who's who of industry titans.
>>
>>108681551
Have you ever tried doing your own model mix instead? It's not difficult at all.
>>
Where are the blender API meshes?
>>
>>108681558
maybe because Blender is actually useful in the industry?
>>
> >108681546
> >108681557
Fuck off
>>
>nobody reads the big news spam posts
why are these made
>>
>>108681567
and comfyui isn't, so free money for some chink and a nice local tool for gooners.
whats the problem?
>>
imagine doing a big greasy poo into ani's mouth as it's dying
>>
File: ComfyUI_159042_.jpg (317 KB, 1080x1920)
317 KB JPG
>>
>>108681558
>Epic invests in Blender
>They "coincidentally" discontinue their game engine
Nothing is free
>>
File: 1772233853964855.png (1.01 MB, 1024x1024)
1.01 MB PNG
>>108681563
I was only making loras for autismmix and illustrious back in the day because I found civitai exploit for infinite buzz and used it for training, then they cucked out and you had to pay for buzz so I don't have computing power to train a model
>>
>>108681576
Idk
as long as I can continue deepfaking the shit of everyone I find appealing enough, I couldn't care less.
>>
>>108681584
i like the eyes
>>108681586
>train a model
mixing is not training, its just mixing models together. which is what wai is. a mix.
>>
File: zImageturbo_00077_.jpg (622 KB, 1264x1672)
622 KB JPG
>>
>ComfyUI gets chink ceo
>ComfyUI adds API nodes
>Suddenly all of china pulls out of local models
hmmmm
>>
Claude, generate me a node and graph interface for sdcpp and preload with pirated leaked models. Throttle if it gets too hot.
>>
>>108681547
I don't even feel like using Anima, knowing it comes from those dirty hands of Comfy and his buddy tdrussell. At least Mugen is clean and honest money from clean and honest people.
>>
File: 16982.jpg (1006 KB, 1214x1296)
1006 KB JPG
>>
>>108681585
the game engine was a meme, and that was 7 years ago. blender is much better now than it was 7 years ago, and it's still completely free.
>>
File: zImageturbo_00083_.jpg (568 KB, 1672x1264)
568 KB JPG
>>
>>108681631
KEK
>>
>>108681631
Why are girls in glue so hot?
>>
>>108681613
>comfyui gets ching chong
>ching chong man give his friend money to make local anime model that mogs all other anime models
hmmmmmm
>>
>>108681622
First few lines of coke are free.
>>
>>108681624
Also no one used the game engine, that's why they discontinued it. Blender is used to create assets for games, not for their engine.
>>
File: 1751766371818813.png (2.03 MB, 1086x1448)
2.03 MB PNG
>>108681517
they are hoping for a fat exit by adobe, nvidia or some other big tech company once they become the established ai/creative tool
thats why they are desperate for useful idiots to promote them so they can eventually do a rugpull
>>
File: anima_00017_.png (1.1 MB, 1472x2200)
1.1 MB PNG
>>
>>108681653
>>108653190
>>
>>108681577
with local you don't need to imagine
>>
>>108681645
Then Ching Chong needs more money because his friend ate all the money and he takes over the entire local ecosystem and turns it into a ponzi scheme.

And thats why I don't want to use PonzAnima
>>
File: c7r8jy.png (924 KB, 1024x512)
924 KB PNG
>>
File: 55112121254.png (1.93 MB, 2463x721)
1.93 MB PNG
ACEStep cpp just got a massive upgrade to sound quality bros. This is HUGE for local music. The turbo model, which is the most creative out of the box and arguably the best version (though its quality sounds so poor we usually have to go to SFT for more fidelity), now just got an upgrade.

From the ACEStep cpp dev himself

>DCW mode = Double
>DCW scaler = 0.05
>DCW high scaler = 0.05
>It brings out all those mid and upper mid frequencies that we didn't have in Turbo. It's amazing!
>It offers the fidelity of SFT with the purity of Turbo.

It also got a recommendation directly from Junmin Gong himself, so that's how you know this method is good.

For the guy who kept complaining about "synthetic" sounds before, here you go. Note I can now notice a massive different in sound quality with a pair of IEMs.

Hear the difference:
Before:
https://vocaroo.com/1omPU2Mdpna9

After:
https://vocaroo.com/1a4VSBjqkuSX

Before:
https://vocaroo.com/19L6rwBSBNtK

After:
https://vocaroo.com/14hih0IkzofF

In the after I also changed the VAE to scragnog's VAE for maximum sound quality. This is insane. Hopefully if you wrote it off before, you think twice about it now. Retesting my LoRAs (which are way better and more accurate on Turbo than SFT when trained on base, because the instrumental range of Turbo is very good).

Also, unlike the previous ACEStep version, I am convinced this one just needs good prompt engineering to get just about any kind of song.

Here's an example gen from my first attempt at cinematic type storytelling song, the type you only seen on no-no datasets from possibly commercial models. Ignore the lack of lyrics sync between male/female vocals, I did not include it in the lyrics, but alignment is much better in different seeds, but this gen shows its capabilities.

https://vocaroo.com/1k678M6RdWPI

Notice it mispronounces "wild". A shortcoming of Turbo (just like on earlier Suno versions), but that is fixed when spelled phonetically "wyld".
>>
So about dataset curation for Anima realism lora, does anyone know a good place to get high quality NSFW images?
As you know most NSFW images in the internet are compressed jpegs, some are OK and usable in terms of quality, but most suffer from the "low quality look", going as far as easily visible artifacting in some.
Unfortunately many CDNs using webp BS for delivering images only made this problem worse. (I know webp supports higher quality and lossless modes, but literally every website I've seen delivers webp images in garbage quality. And stripping webp from accepted formats in http headers either get ignored or CDN converts that webp again to jpeg, delivering even lower quality.)
I did not have too much luck with torrents neither. There are torrents for "high res" porn images, but these are typically saved roughly same jpeg quality as their low resolution counterparts. I am not sure if downscaling a low quality image to training resolution vs having low quality image at the training resolution already meaningfully makes up for it. From my brief experiments resampling these images, it doesn't seem to help.
And OF leak type of content might be the most unusable in terms of quality.
So in case there is confusion I don't need lossless pristine pngs shot with 10k cameras. But I am dissatisfied with the general quality of NSFW images. The qwen vae, while much better older than vaes like SDXL, doesn't preserve as much quality as Flux2 vae, so in effect these images will get significantly compressed again. I would rather work from a high quality baseline for the best results.
This question also applies to SFW images to some degree, but it is a lot easier to get your hands on high quality SFW images than NSFW.
Any ideas anons? I skimmed through my stash and grabbed images with least worst quality among them, but I am not particularly satisfied about the average quality.
>>
File: zImageturbo_00091_.jpg (685 KB, 1672x1264)
685 KB JPG
>>
/ldg/ it's dead for good with this.
>>
File: 16234.jpg (1.02 MB, 1149x1369)
1.02 MB JPG
>>
>>108681691
Yep, not clicking on run_comfy_nvidia.bat never again
>>
>>108681683
>>It brings out all those mid and upper mid frequencies
at the cost of it sounding even more synthetic ngl and im not trying to troll
>>
>>108681697
>.bat
Only retards use those.
>>
I don't want to gen
>>
>>108681686
I want to see an example of a lora trained only on uncompressed images vs one trained on jpeg versions
>>
>>108681695
ironic that you post this using an api model, go kill yourself
>>
what happened
why is ani doomposting comfy yet again. did they get bored of shitting on anima
>>
>>108681733
Today Comfy practically called us idiots to our faces. We were taken for stupid.
>>
>>108681724
he's right tho
>>
>>108681745
can you link to said source of being called idiots? thanks.
>>
>>108681724
Using API models like GPT Pro is fine. API is clear that it's API. ComfyUI larps as 'open' and 'local' and does more damage to the local ecosystem by shoving API ads everywhere. Comfy should've said no to WAN selling out to API and refused to integrate their API slop into the UI.
>>
File: gl0uha.png (968 KB, 1024x512)
968 KB PNG
>>
I'm sick of seeing these chink women posted.
Do better.
>>
File: comfy.png (7 KB, 400x207)
7 KB PNG
>>108681751
>>
>>108681686
There's no single place that's filled with good quality nsfw images. You just have to gather it little by little.
>>
File: 1_00035_.jpg (3.36 MB, 3524x2632)
3.36 MB JPG
>>
Does anyone else feel a bad sensation in their stomach when they open Comfy? It's impossible to look at a node without feeling stupid or like they're playing with me or like I'm being treated as a product.
Fuck Comfy
>>
File: zImageturbo_00102_.jpg (675 KB, 1672x1264)
675 KB JPG
>>
>>108681788
Most of us already moved on from that shit, it's just the paid shills like OP who refuse to remove it.
>>
>>108681761
>type first few letters of a node i want
>click random one while i was distracted by shit on my second monitor
>notice it was yellow and realized it was an api node
>deleted it and added the correct node
to date that has been my experience with the comfy api ecosystem. if anything comfy does more harm to api.
you have this great ui that lets you do whatever the fuck you want, why you would waste your time fighting with google or openai?
>>
File: 1768252309696164.png (2.22 MB, 1122x1402)
2.22 MB PNG
>>108681770
what type of women do you want to see?
>>108681813
lol
>>
>>108681721
Assuming noticeably low quality jpegs (<80 quality) it probably doesn't matter at all for anything older than Flux unless extremely low quality, and matters noticeably for Flux 2. Anima probably sits somewhere in between.
Maybe it doesn't change much at low rank loras, and/or loras where texture detail doesn't matter as much but it should matter for higher rank loras. GIGO after all.
>>108681778
I expected this to be answer but figured it wouldn't hurt to ask..
>>
File: 1rs3r8.png (909 KB, 1024x512)
909 KB PNG
>>
>>108681820
actually we need more ABG feet gens
>>
>>108681798
Doesn't look like yogurt.
>>108681820
Where did his lower legs vanish to?
>>
>type first few letters of a node i want
>click random one while i was distracted by shit on my second monitor
>notice it wasn't yellow and realized it was a localslop node
>deleted it and added the correct node
to date this has been my experience with the comfy api ecosystem. can comfy just remove these bloat local nodes already? who gives a shit about latentx32 or vapeencode or whatever this babble is? it's not needed anymore and gets in the way of google and openai
>>
>>108681551
>it's real
Woohoo, new version to try out!
>Regarding model updates: I create models purely as a hobby, not as my main job. So I work on them in my spare time, and the update frequency depends on my availability. Recently, my computer suffered hardware damage—both the GPU and hard drive were affected. As a result, updates will likely be slower for quite some time. Thank you for your understanding.
Oof
>>
>>108681813
theres no arguing with that mentally ill faggot, he will keep shitting up the thread no matter what
>>
>>108681802
No we haven't. Moved on to what? Literally everyone is using ComfyUI locally. Fucking schizo.
>>
File: 1754773137569822.png (1.66 MB, 1086x1448)
1.66 MB PNG
>>108681830
facts
>>108681832
some people are paraplegic, dont be racist now
>>
>Literally everyone is using ComfyUI locally
>Comfy Cloud has also grown quickly, with annualized bookings crossing $10M in 8 months.
what's with all the paid shills?
>>
>>108681697
>not uv run main.py
ngmi
>>
kek
https://www.reddit.com/r/StableDiffusion/comments/1suu8p2/something_big_is_coming/
>>
>>108681820
>>108681851
Why are you posting cloud gens here
>>
>>108681820
pocahontas
>>
>>108681870
i'm all against cloud shit, but if they were made with comfyui api it's fine. comfyui is local-first, so posting api helps them raise money for local.
>>
>>108681820
Blonde hair blue/green eyed white women
>>
>>108681851
what model?
>>
>>108681876
no, api is not local you mongoloid retard. this isn't a comfy thread it's a LOCAL thread.
>>
>>108681886
comfyui IS local, they said it in their latest announcement. we're just helping raise awareness like they asked. chill out dude, it's all for the benefit of local in the end.
>>
>>108681699
This. What the fuck. Once again, >>108681683 ACE-shill anon get your damn ears checked.
>>
>>108681891
kill yourself in real life
>>
File: ComfyUI_07172_.png (3.53 MB, 2304x1792)
3.53 MB PNG
>>
File: zImageturbo_00109_.jpg (607 KB, 1672x1264)
607 KB JPG
>>
>>108681899
SPIDER JIZZ NOOOOOOOO
>>
>>108681899
the fucking state of that model is embarrassing.
>>
>Reddit celebrating Cumfart VC investment
Do they really
>>
>>108681897
kek
>>
File: zImageturbo_00115_.jpg (910 KB, 1720x2064)
910 KB JPG
>>108681903
Spider sama was bitten by radioactive salaryman
>>
File: 93qe02.png (1.04 MB, 1024x512)
1.04 MB PNG
>>
File: 1773418335383668.png (2.02 MB, 1122x1402)
2.02 MB PNG
>>108681882
yes
>>108681884
im using a magic workflow that doesnt require a model
>>108681899
lmao
>>
File: local is dead.jpg (1.32 MB, 1024x1536)
1.32 MB JPG
>>
api niggers post here because their shit gens aren't good enough for the api thread, lmao
>>
>>108681962
show us your gen, big man
>>
>>108681982
i'm training, you wouldn't know what that is, though.
>>
Why the fuck is "playtime_ai" account banned from everywhere?
He makes good video loras, and they're not even that spicy.
Anyone knows if he saves his loras somewhere? he's banned from civitai, hf and reddit apparently.
>>
>>108681991
Why did he post a rape lora or something?
>>
>>108681985
just show us one single gen you've made
>>
>>108681997
No idea, I only remembered him from a titty drop loras posted in an earlier thread, and when I go to his user on hf or reddit they're all banned.
>>
File: zImageturbo_00124_.jpg (611 KB, 1720x1304)
611 KB JPG
>>
File: pepe laughing.png (133 KB, 360x346)
133 KB PNG
>actually using cumfytroonUI, ever
>>
>>108681991
the answer every time is libertarianism
>>
https://youtu.be/FE1r98G5IG8
zoz
>>
File: 1775505309186860.png (1.15 MB, 1536x1024)
1.15 MB PNG
>>108681997
>>
can anon pls reupload the 'toss lora to a regular site and not limewire pls
>>
>>108682025
To be fair to these sites they really don't want to be sued. We need a good and proper chink site or something to upload to that doesn't give a shit.
>>
>>108682041
>chink site
>nudes
You do know that porn is illegal in Chink land, yes?
>>
>>108682057
>porn is illegal in Chink land
and twitter is illegal but that doesn't prevent Alibaba and Tencent to have an account with it, the CCP doesn't care about the law when it comes to big chinese companies, as long as they're dunking on the western dogs!
>>
>>108682057
>You do know that porn is illegal in Chink land, yes?
Honestly no, I don't know what those commies are up to however my point stands just replace china with some other country that doesn't give a shit.
>>
>>108682041
You would be hard pressed to find pressed to find anywhere to do so. The usual western law refuges like Russia and China are quite anti-porn themselves.
>>
>>108682072
And? They don't do porn.
>>
local is actually dead. i didn't think it could get worse than happyhorse, but here we are. a whole entire year with absolutely nothing of note.
>>
File: 32168435132185521.jpg (1.02 MB, 1024x2048)
1.02 MB JPG
>>108682001
nta but most people who gen are probably training anima or testing anima workflows. it's a fun little model.
>>
>>108682086
>just replace china with some other country that doesn't give a shit.
10 years ago that would have been Russia but they disabled themself.
>>
>>108682090
>And?
are you retarded? the point is that big Chinese companies are exempt of chinese laws if it benefits the CPP
>>
>>108682099
Well I guess we're fucked in the mean time.
>>
>forge user
>high quality gens shared in thread because its simple
>cumfy user
>nonsense gens because they spent over 9000 hours dicking around with spaghetti instead of learning how to prompt
>>
>>108682112
where are the forge gens?
>>
>>108682106
How's porn benefiting the CPP? Their porn law is zero tolerance and there's nothing to gain strategically allowing nudes of Gordon Ramsay.
>>
>>108682124
in his mind
>>
I can't believe we're missing out on all the good tiny cocked chink porn. That's probably why it's banned.
>>
>>108682124
https://files.catbox.moe/t7nmdz.jpg
>>
File: and they're so hot.png (243 KB, 609x609)
243 KB PNG
>>108682134
>I can't believe we're missing out on all the good tiny cocked chink porn.
there's a lot of "chinese lezdom" porn in the internet though, just saying
>>
>>108681529
there's no actual drama, there are just a few autistic retards who can't stop, it's easy to ignore them anyway, if a post is about ui or the personality of model dev or whatever, it's shit
>>
Most of us moved on to GPT-Image-2
>>
>>108682163
you didn't move on you're still lurking on a local thread, if you truly moved on you would be posting images here instead >>108653190
>>
>>108682028
https://mega.nz/file/4MV3SBhB#n1rGqISBOMr3R-2uv4dtQ26SZdACKXPGYDZEWMFS20s
>>
>>108682168
nobody uses that, we use comfyui api locally
>>
>>108681683
Anon there is still that metallic sound behind, I have a headphone, I can hear it.
It's better than anything else we have on local, but I won't pretend it's old udio level at all.
I wonder if it's just an issue of training with low bitrate, because low quality mp3 can sound like that.
>>
>>108682183
>we
who's we? you and your schizo voices?
>>
File: ComfyUI_Anima_00048_.png (1.34 MB, 1024x1024)
1.34 MB PNG
>>108681699
Tbh these results are pretty good for no LoRA whatsoever. Now the model just needs to be polished a bit more to be more clean with composition/structure and it'll be perfect. Maybe not commercial ready yet, but so damn close that perhaps something like https://github.com/entrepeneur4lyf/Web-Audio-Mastering
with right settings would be a enough to catch up.

https://vocaroo.com/1iyV4A0yXbnL

In this case the prompt was
>A high-energy J-Rock comedy track with a "bratty" punk aesthetic. The instrumentation features a fuzzy, distorted electric guitar playing fast power chords and a driving "four-on-the-floor" punk drum beat. The emotion is feisty, annoyed, and humorous. The timbre is sharp and bright, featuring a "pouty" female vocal delivery with a heavy Katakana-English accent.

Maybe that "distorted" part makes it sound a bit more fake.
>>
File: zImageturbo_00127_.jpg (739 KB, 1520x1824)
739 KB JPG
>>
>>108682194
the top creatives in the thread all use gpt-image,
>>
>>108682183
>nobody uses that
you just proved that local is way more popular than APIkeks, local threads thrive while API threads die
>>
>>108682072
>western dogs
Western Bulldogs Rule !
>>
>>108682206
if they all want to be banned for off topic that's their problem
>>
>>108682198
she looks like somebody just told her that women are weaker than men.
>>
>>108681820
feet in pantyhose is godly, nice view
>>
File: 1758533849910398.png (620 KB, 832x1280)
620 KB PNG
>>108682172
thank you anon i love you
>>
he should have NOT uploaded that Lora.
>>
fun ltx 2.3 lora: https://civitai.red/models/2553102/editanything?modelVersionId=2869279

workflow: https://huggingface.co/RuneXX/LTX-2.3-Workflows/blob/main/Video-2-Video/LTX-2.3_-_V2V_Video-Edit_remove_add_replace_restyle_EditAnything-Lora.json

prompt: remove the flying object on the right.

https://litter.catbox.moe/8wizzofy6zzhk38h.mp4
>>
File: zImageturbo_00131_.jpg (597 KB, 1520x1824)
597 KB JPG
>>108682217
she seem sassy alright
>>
>>108682197
The instrumentals and song structure are fine, the awful part is the voice.
>>
>>108682216
circle the off-topic posts? we are all just enjoying gens here except the shills trying to promote their corporate slop ui
>>
>>108682197
it sound so fake, jesus, what a meme model
>>
>>108682237
that is impressive
>>
>>108682248
>circle the off-topic posts?
why are you asking that?
>>
>>108681870
because he can't get enough attention for his slop in the cloud threads
>>
File: zImageturbo_00137_.jpg (692 KB, 1520x1824)
692 KB JPG
>>
>>108682246
No part of that is fine. It reminds me of listening to people's mp3 ringtones on old flipphones back in 2005.
>>
>>108682197
This sounds like it's clipping and heavily compressed at the same time. This is ear rape.
>>
>>108682265
Well they're good enough for what I test listened to. But the voice metallic sound is so obvious it destroys whatever it tries to do.
>>
File: 1757721247904453.jpg (1.35 MB, 4178x1670)
1.35 MB JPG
that comparison is quite brutal when you think about it, I knew Z-image turbo looked flat but I didn't know why exactly, seems like it's unable to create real depth and angle, it looks so boring and fake compared to the left image
>>
>>
>>108682275
Why aren't you using the gen from anon that proved you just suck at prompting >>108681337
>>
>>108682284
can you do this in the style of info wars
>>
>>108682290
Because they're disingenuous shills. I hope they're getting paid because otherwise it's really sad and pathetic.
>>
File: laugh and drink.gif (1.46 MB, 217x217)
1.46 MB GIF
>>108682217
>she looks like somebody just told her that women are weaker than men.
i snortled
>>
File: 1754679843483412.jpg (1.73 MB, 4040x1667)
1.73 MB JPG
>>108682290
>>108682304
bruh it still gets mogged what are you talking about?
>>
>>108682188
Voices sound natural, not robotic at all anymore.
Sound quality is just a touch below commercial (needs mastering).
This is like calling images shit because they need to be upscaled. It will only get better from here. The community will find workflows to make it all better. In fact, if you truly cared, you would try the model out yourself, and attempt to improve its sound quality to sound "natural" with a DAW as you claim.

I think it's just a small percentage of selective users who mind such purported differences, because it's extremely hard to notice unless your focus is not enjoying the song itself, but to nitpick. Idk about you anon, but I'm excited about it actually being able to match Udio's output on good seeds/prompts in terms of composition ability, with the voices sound just as natural (though still not entirely as good as Udio out of the box, but that's because Udio is an obviously bigger model). I think with all these improvements for V1.5, it's obvious V2 ACEStep will match or surpass Udio.
>>
>>108682284
Woah… this makes a lot of sense actually. Thanks for clearing things up!
>>
>turbo
It was a mistake for Tongyi to release this. They should've released base and turbo at the same time, at least. Turbo is just a shitty demo for the much better base model.
>>
>>108682314
>wall of text
why are you like that? if you can't accept criticism you'll never be able to improve, to improve you need first to accept the flaws in your model
>>
Oh god I've just seen some seedance 2 nsfw gens and it's actually over isn't it
>>
>>108682290
the idea is to criticize anything ldg uses anon, any local model is bad, anyone working on models or loras are bad, any local ui is bad, even the api bullshit stuff is here to play both sides to rill people up
if you don't see the trolling, it's on you at this point
>>
>>108682284
>>108682322
>this makes a lot of sense actually
do you think the jannies will agree with you?
>>
Has anon really been in here all day trolling with cloud gens?
>>
>>108682312
>z image small boob
>gpt image 2 big boob
>>
File: zImageturbo_00149_.jpg (600 KB, 1520x1824)
600 KB JPG
>>
>>108682335
you are in a cult, if you can't accept the fact that we're still far away from API then you need to get your eyes and head checked >>108682312
>>
>>108682339
does it matter when they arent here?
>>
There is no trolling, we’re all here to promote ComfyUI
>>
>>108682351
where have i seen this image or an image like it before its eerily familiar
>>
>>108682327
>not posting link
>>
>>108682350
and that's a bad thing, as the localkeks can't stop saying, "local is good because it can do coom", I'm not seeing the coom potential in the right image at all
>>108682355
how do you know they're not here?
>>
File: 1771268610289679.png (516 KB, 832x1280)
516 KB PNG
>>
>>108682362
https://turbo.cr/v/YwMqzae6fC6Hk
https://turbo.cr/v/yL-9spubUkfYz
https://turbo.cr/v/wg7mVYyY-mjuB
>>
if API is so good and local so bad, then why local threads are highly active and API threads are slow as molasses?
>>
>>108682326
Because
>it sounds like shit due to (insert extremely niche schizo artifact here), therefore its unusable

Is invalid criticism. Listen to ACEStep V1 again (how lame and robotic that sounds), listen to V1.5 before XL, then listen to XL before the improvements. The improvements are insane, and you're pretending Udio/Suno or whatever model still has moat. Anon, I wouldn't be using this model nor care about it if Udio or Suno were local. They're not, so we have to appreciate the only options we have. And it's improved enough to make meme songs that sound like they were made by a human and not a robot, which was Udio's biggest perk.
>>
>>108682314
>Voices sound natural, not robotic at all anymore.
You can't be serious.
The rest is you answering claims I didn't make. Of course I'd be happy if v2 sounds natural and good, but I won't pretend current release sounds anywhere as good as you claim it to be. I have ears, and I'm not even that hard to please, I'm not some elitist audiophile or anything.
I don't think you can improve on the CURRENT base model because it's flawed, and until they release a new one, we are just using artifices to hide its flaws.
Doesn't even mean it's unusable, I'd probably use it to make purely instrumental tracks, where it's less glaring to hear for me.
>>
File: zImageturbo_00153_.jpg (601 KB, 1520x1824)
601 KB JPG
>>108682361
prompt is from some real photo, blasting trough old list
>>
File: 723427.jpg (1.71 MB, 1024x1536)
1.71 MB JPG
>>
>>108682370
Way sexier than it has any right to be.
>>
>>108682380
>another wall of text
you'll never make it, or else you fix your shit or else you'll end up in the forgotten land, there's other people in the line that'll gladly take your place, nobody care about losers, so be a winner or be a nobody
https://www.youtube.com/shorts/b1YBSvXKeIE
>>
File: 1767140290886337.png (743 KB, 1152x960)
743 KB PNG
>>
>>108682376
first of all, thanks
second, yes, it is over
third, this is the worst AI will ever be
fourth, the next decade is gonna be awesome
>>
File: 8jttye.png (1003 KB, 1024x512)
1003 KB PNG
>>
>>108682376
api is the future. thank you comfy for giving bytedance a chance
>>
>>108682379
because apichads dont needlessly gossip like a bunch of fairies
>>
>>108682376
HOLY FUCKING SHIT
Tell me how to get these gens.
I know this is ldg and I swear I am not a troll but I have to gen some deepfakes before they nerf this.
>>
File: 1772994428852022.png (407 KB, 745x570)
407 KB PNG
>>108682376
holy shit it can do her?? they're in big trouble lmao
>>
calm down patel
>>
>>108682426
>apichads dont needlessly gossip
says the apikek gossiping about local being down, the delicious irony...
>>
>>108682427
https://simpcity dot cr/threads/higgsfield-prompts.887642/page-18
Prompts in there

Looks like safety filter is basically broken on higgsfield and minor celebs go through as well
>>
>>108682376
on the third video the sound is too bad to be seedance, that's a grok gen isn't it?
>>
>>108682376
with a good enough workflow ltx 2.3 can do that
>>
File: zImageturbo_00167_.jpg (558 KB, 1520x1824)
558 KB JPG
>>
>>108682453
*plastic skin and ultra blurry movements enter the scene*
>>
>>108682381
Right, so you're just a troll. What you're making is not even a reasonable comparison. You're comparing a local model to the best commercial model. Not even Suno because you know that sounds like shit.

>Hurr durr the best cloudshit has to offer is better than the best local has to offer

The argument is disingenuous.
>>
File: 1771804481803370.png (785 KB, 832x1280)
785 KB PNG
>>
>>108682468
>The argument is disingenuous.
it's not, if you can't make something as good why would people run your toy then? they'll just go back making music with udio and studio, simple as that
>>
to be fair the whole countdown thing was funny
>>
>>108682480
Yeah only newfrens and trolls got their panties in a twist about it funny to watch
>>
ComfyUI epicly trolled local with that one lol, we all knew it was more funding for API which is based because it helps Seedance and Grok push the uncensored realism tech forward
>>
File: 1773350822148688.png (713 KB, 832x1216)
713 KB PNG
>>
>>108682468
>lalalala I'm not hearing anything I won't listen to your advice!
then we won't listen to your "songs"
>>
>>108681876
Stop deepthroating Sam, he's not a fag like you
>>
>>108682468
You are answering a point I didn't even make in my last post, I didn't write about another commercial model.
>>
/ldg/ - local gossiping general
>>
File: 1755800394646079.png (732 KB, 832x1280)
732 KB PNG
combining chrischan and basedjak loras with this would go so hard
>>
>>108682478
>Not even hiding the troll anymore

If I wanted to use a non-local model, why would I be here or even trying ACEStep at all? Sure, I'll go give my data to Udio.
>>
>>108681551
YA YA YA YA YA YA YA YA
>>
>>108681575
fuk u r dum
>>
File: 1761919351657584.png (961 KB, 1280x959)
961 KB PNG
>>108682528
>combining loras
>>
>>108681504
use DPM++ 2S Ancestral with Simple or Flux.2 Scheduler
>>
>>108682531
>I'll go give my data
if by "my data" means "my words" then you're already giving your data in 4chan at this exact moment, your sentences are being scrapped by LLM companies as we speak and they'll be trained on it, do you see how retarded you sound?
>>
>>108681575
>why are these made
now you understand why debo has a rentry, this guy is ubnoxious as fuck
https://rentry.org/debo
>>
>>108682446
Can't view without account, they apparently nuked my old account for no good reason, and sign ups are closed.
I know no one likes a beggar, but would copy pasting relevant content to a pastebin somewhere be too much trouble anon?
>>
>>108682532
dunno if ur just shitposting but choose a different seed, highres fix and adetailer and that proib looks pretty good
>>
>>108682480
i just woke up. what was it?
>>
>>108682560
https://www.reddit.com/r/StableDiffusion/comments/1suu8p2/something_big_is_coming/
>>
plebbitanon always here to post links !
>>
>>108682560
comfyui shilling saas like usual, localkeks eat it up and continue to promote it for free
>>
>>108682550
I'm phone posting from bed so it's a bit long tings t bh. Anyone wanna help anon out? If not I'll see what I can do in like 10 mins
>>
>>108682486
>Yeah only newfrens and trolls got their panties in a twist about it funny to watch
This
>>
File: he's not wrong.png (129 KB, 1978x554)
129 KB PNG
>>108682572
they have way more interesting conversations than here desu
>>
File: api.jpg (1.15 MB, 1367x1150)
1.15 MB JPG
>>
>>108682576
lol
>>
>>108682594
aww that's so cute
>>
>>108682589
That same point has already been discussed here ad nauseam. Plebbit has always been downstream of 4chan.
>>
>>108682594
top cute
>>
>>108682598
to be fair it's both, sometimes it's 4chan that has the news first, sometiimes it's leddit
>>
>>108682598
>Plebbit has always been downstream of 4chan
other way around now
>>
>>108682589
almost every good "feature" added to comfy is third party.
>>
>>108681935
>magic workflow that doesnt require a model
Oh, so API. Is it a local API?
>>
>>108682605
then why are you still here?
>>
File: mi3gse.png (1.07 MB, 1024x512)
1.07 MB PNG
>>
>>108682370
>>
>>108682611
>why are you still here?
API shills and plebbit enjoyers can't seem to leave this local 4chan thread for some reason
>>
>>108682617
>API shills and plebbit enjoyers can't seem to leave this local 4chan thread for some reason
This thread lives rent free in their minds indeed.
>>
>>108682370
>dlss 5 OFF
>>108682615
>dlss 5 ON
>>
File: 1765701454648510.png (767 KB, 832x1280)
767 KB PNG
>>108682615
peach vanished
>>
>>108682615
damn that sabrina carpenter lora came out great.
>>
>>108682611
i have always been here
>>
>>108682284
I guess I agree because I want to be the cool frog, not the crying gay guy
>>
File: truth nuke.png (586 KB, 601x1019)
586 KB PNG
>>108682284
>>
why is ltx so weird?
https://genur.art/posts/128522325
>>
File: 1752142483819145.png (594 KB, 896x1152)
594 KB PNG
>>
File: 1764807037103566.png (1.12 MB, 1080x1080)
1.12 MB PNG
>>108682641
>the cool frog
>the crying gay guy
but frogs are gay anon
>>
>>108682376
I'd pay for a subscription of a good nsfw video model at this point
>>
File: OY VEY.png (67 KB, 757x661)
67 KB PNG
>>108682655
>why is ltx so weird?
because of the jews (unironically)
>>
File: l.jpg (1.26 MB, 1448x1086)
1.26 MB JPG
>>
Cloud kek are so boring, they're not even able to fill their own thread... Until it makes nsfw cloud is dead.
>>
>>108682673
>blurry analog realism
bruh even the APIkeks are guilty of this
>>108681653
>>108681820
>>
>>108682550
Here you go anon
https://rentry.co/rhk24ycs

Very annoying to do. Couple of new videos in there and prompts for the other 3

Another poster was complaining about losing all his money trying to get past the filter tho so I'm not vouching for this shit or anything. Higgsfield as quite sketchy from what Ive seen on xitter
>>
>>108682673
>1girl
>blurry analog realism
the irony
>>
File: 1746476384246991.png (720 KB, 768x1344)
720 KB PNG
>>
The emergency is to replace CIVITAI... they will stop everything sooner or later
>>
>>108682673
ngl that was funny, I really want a local models to be able to make memes of this quality
>>
File: f2.jpg (1.25 MB, 1402x1122)
1.25 MB JPG
>>
File: 1747145026318325.png (2.05 MB, 1086x1448)
2.05 MB PNG
>>108682673
LOOOOOOOOOOOL
>>
Cloudfag kinks: catalog, menu, grandma afternoon tv series sex scene
>>
>>108682617
>>108682620
why do you think we live rent free
>>
>>108682729
>HappyHorse Local
>Mogao soon
kek, good times
>>
File: 1776194117526483.png (752 KB, 832x1280)
752 KB PNG
>>
>>108682741
probably the seething about local diffusion 12+ hours a day on cooldown.
>>
>>108682741
>why do you think we live rent free
they hate our freedom
>>
>>108682589
The plebbitor is right. That and there's actual paid API shills here. They can't stand local progress, they are absolutely seething about it. When local does make progress, they must flood with FUD and spam to make it seem like their insignificant models are simply the best. Notice the flood of API shills here.
>>
>runs out of tokens
>time to talk shit in the local thread!
>>
imagine if localkeks spent as much time training as they did seething. they might finally reach dall-e3's level!
>>
ltx2.3 chads where we at?
>>
>>108682776
>no u
>>
>>108682691
Thanks a lot anon.
I don't have a Higgsfield account so I will see if any of these work on FAL.
>>
>>108682770
>They can't stand local progress
what progress bro? nothing happened in 2026 so far
>>
File: file.png (2.33 MB, 1248x1824)
2.33 MB PNG
>>
>>108682798
>subliminal block cock psyop in the background
you're not even trying
>>
>>108682789
did you miss the huge drop earlier today?
>>
>>108682810
oy vey shut it down
>>
>>108682789
fr fr no cap on god bro. this thread cringe n shit, "lowcal model", only for neerds
>>
>>108682699
Why? What specifically is only there and not duplicated on other sites?
>>
>>108682789
ace step (somewhat works but you must adjust stuff) and textgeneration node (github issues has posts it is not that good) and attempt ot implement multigpu but seems they are failing at it
>>
why are people so obsessed with disrupting this general? If it's not one thing, it's another. It never seems to end. As soon as one schizo gets tired, another gets started
>>
File: 1751686334068379.png (2.11 MB, 1086x1448)
2.11 MB PNG
>>108682798
butt
>>108682789
>what progress bro?
they released an api node for gpt image 2, so now you can gen api images locally
https://blog.comfy.org/p/gpt-image-2-is-now-here-via-partner
>>
File: 1752805237873779.png (1.62 MB, 1024x1024)
1.62 MB PNG
How many images do you usually gen per batch?
>>
>>108682874
Just hide the posts and move on. No point in wasting any energy on them. If everybody stops responding they'll get bored.
>>
>>108682554
Yeah, I just found that one amusing. It took a surprising number of additional rolls to get one without distracting issues. Actually, her left thigh is still fat/misaligned here, now that I look.
>>
>>108682874
>why are people so obsessed with disrupting this general?
I have no idea dude, never seen a general as targeted as /ldg/
>>
comfyui shills target this general for free advertising which is why it should be removed from the OP
>>
>>108682885
at most 4
>>
>>108682873
>attempt ot implement multigpu but seems they are failing at it
what? multigpu already exists (and it's broken because of the dynamic vram shit though)
https://github.com/pollockjj/ComfyUI-MultiGPU
>>
>>108682874
cause you keep giving him attention
>>
>>108682874
it happens in lots of generals but this one does seem worst than most. I'd also wager it is the same 2-3 people based on the non-evolving bait.
>>
>>108682890
what is the prompt i'll give it a go as well
>>
>>108682923
if jannies were doing their fucking job we wouldn't have to deal with trolls in the first place
>>
>>108682929
I wouldn't be shocked if the jannies were the ones doing it, desu.
>>
>>108682874
yeah julien really is mentally ill
>>
File: WaiAnima1+Turbo_00001_.png (1.78 MB, 1600x1280)
1.78 MB PNG
>>108682928
1girl, saber alter, fate \(series\), as109, flat chest, choker, bikini top, nightclub, bar, boots, sitting, leaning back, feet up, feet on table, crossed ankles, looking at viewer, serious, (coca-cola:0.5), drinking, holding bottle

I threw it at WaiAnima with Turbo lora too to see what would happen. Decent, but stylemogged by Wai-Illu.
>>
Fresh

>>108682974
>>108682974
>>108682974
>>
>>108682237
remove the man on the left in the black jacket.

https://litter.catbox.moe/6gn3085ox6ex7kxz.mp4
>>
>>108682969
(And yes, I did use @ with Anima.)
>>
>>108682284
Baaed
>>
>>108682900
in comfyui itself as native look at their githubs.
they are failing at it.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.