/g/ - /ldg/ - Local Diffusion General - Technology

[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]

Board

▼ Settings Mobile Home

/g/ - Technology

Return Catalog Bottom Refresh

Thread archived.
You cannot reply anymore.

[Advertise on 4chan]

[Return] [Catalog] [Bottom]

Anonymous

/ldg/ - Local Diffusion Genera(...) 06/12/26(Fri)21:13:02 No.109041690

File: highlights_g_109034986_17(...).jpg (612 KB, 1917x1073)

612 KB JPG

/ldg/ - Local Diffusion General Anonymous 06/12/26(Fri)21:13:02 No.109041690 Archived

Discussion and Development of Local Image, Video, and Music Models

Previous: >>109034986

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon

Anonymous
06/12/26(Fri)21:15:16 No.109041701

Anonymous 06/12/26(Fri)21:15:16 No.109041701

>inb4 the useless unfappable deepfake spam

Anonymous
06/12/26(Fri)21:15:40 No.109041707

Anonymous 06/12/26(Fri)21:15:40 No.109041707

https://rentry.org/LDG_vital_info

Anonymous
06/12/26(Fri)21:17:50 No.109041720

Anonymous 06/12/26(Fri)21:17:50 No.109041720

Blessed thread of frenship

Anonymous
06/12/26(Fri)21:19:05 No.109041732

Anonymous 06/12/26(Fri)21:19:05 No.109041732

https://www.anthropic.com/news/fable-mythos-access
cloudkeks are on the ropes

Anonymous
06/12/26(Fri)21:20:42 No.109041746

Anonymous 06/12/26(Fri)21:20:42 No.109041746

>SaaS is so powerful that the government has to step in
meanwhile local is so kekked they censor themselves

Anonymous
06/12/26(Fri)21:20:47 No.109041747

Anonymous 06/12/26(Fri)21:20:47 No.109041747

>>109041732
Anthropic genuinely might be an all time top 3 "snake oil" business.

Anonymous
06/12/26(Fri)21:24:14 No.109041785

Anonymous 06/12/26(Fri)21:24:14 No.109041785

File: Wan21_SCAIL2_00016.mp4 (1.71 MB, 912x688)

1.71 MB MP4

>>109041701
>his grapes are sour

Anonymous
06/12/26(Fri)21:24:26 No.109041789

Anonymous 06/12/26(Fri)21:24:26 No.109041789

>>109041746
you're so gay that the government has to step in and prevent you from trying to suck all the cocks on the planet.

Anonymous
06/12/26(Fri)21:25:14 No.109041794

Anonymous 06/12/26(Fri)21:25:14 No.109041794

>>109041732
>be local
>win

Local can't stop winning.

Anonymous
06/12/26(Fri)21:26:02 No.109041802

Anonymous 06/12/26(Fri)21:26:02 No.109041802

>>109041746
>SaaS is so powerful that the government has to step in
Well retard, If you actually read the statement you'd see that Anthropic do not agree with you, they think the US govt is being vindictive.

>[...] we believe the government should have the ability to block unsafe deployments, as part of a statutory process that is transparent, fair, clear, and grounded in technical facts.
>This action does not adhere to those principles.

Anonymous
06/12/26(Fri)21:29:10 No.109041831

Anonymous 06/12/26(Fri)21:29:10 No.109041831

>Local models are free, uncensored, can be trained to do anything
>Yet somehow the government feels bigger threat from 'censored' API
local models are so dumb they're effectively harmess, nobody takes them seriously. API models are so advanced in thinking and capability, they're not even in the same league.

Anonymous
06/12/26(Fri)21:36:13 No.109041888

Anonymous 06/12/26(Fri)21:36:13 No.109041888

File: 1777729074642151.png (87 KB, 686x386)

87 KB PNG

>API models are so advanced in thinking and capability,

Anonymous
06/12/26(Fri)21:39:52 No.109041913

Anonymous 06/12/26(Fri)21:39:52 No.109041913

>>109041888
He's right. Car washes don't reduce your wanted level. If you want to wash your car, you need to go to a body shop.

Anonymous
06/12/26(Fri)21:40:06 No.109041916

Anonymous 06/12/26(Fri)21:40:06 No.109041916

>>109041831
>API models are so advanced in thinking and capability, they're not even in the same league.

Are you sure about that cloudkek? You can't finetune cloudshit models on custom data. As it stands, a local model with a LoRA is more dangerous than censored ("more capable) cloudshit model with similar param count. You might say (apples to oranges) but remember glm 5.1 was already open sourced.

Anonymous
06/12/26(Fri)21:44:58 No.109041963

Anonymous 06/12/26(Fri)21:44:58 No.109041963

>>109041831
They already seethed about celeb deepfakes and AI political propaganda. There just wasn't a major incident yet.

Anonymous
06/12/26(Fri)21:52:21 No.109042033

Anonymous 06/12/26(Fri)21:52:21 No.109042033

>>109041916
>You can't finetune cloudshit models on custom data
Because you don't need to, this isn't the own you think it is. Local is so far behind you still have to train loras for outfits while API can do it in a single-shot and even search the internet.
You could try training a local model to make a bomb or whatever but they're so dumb that it wouldn't even work properly. Once GPT/Claude/Gemini gained internet access, local fell off the face of the earth. I can just ask GPT to "put her in the kansas city royals jersey" and it will work, no need to use separate edit models (outdated localcope) or custom loras.

Anonymous
06/12/26(Fri)21:55:31 No.109042056

Anonymous 06/12/26(Fri)21:55:31 No.109042056

Pretty sure their concern is more with people finding a way around the safeguards and making 1000+ agents hack into national security shit than a few degenerates generating CP

Anonymous
06/12/26(Fri)21:57:36 No.109042072

Anonymous 06/12/26(Fri)21:57:36 No.109042072

File: 1754472605835755.png (3.99 MB, 1536x1024)

3.99 MB PNG

>>109042033
>I can just ask GPT to "put her in the kansas city royals jersey"

Yeah but then it'll have that GPT image grunge, so its useless

Anonymous
06/12/26(Fri)21:59:05 No.109042085

Anonymous 06/12/26(Fri)21:59:05 No.109042085

File: q_zm8bal.png (707 KB, 960x960)

707 KB PNG

Anonymous
06/12/26(Fri)22:01:45 No.109042110

Anonymous 06/12/26(Fri)22:01:45 No.109042110

File: vifdd.png (89 KB, 859x742)

89 KB PNG

Why does this video loader only have an image input, how am I supposed to get a video upscaled with this?

Anonymous
06/12/26(Fri)22:05:03 No.109042140

Anonymous 06/12/26(Fri)22:05:03 No.109042140

>>109042056
Nah, their main concern is preventing a foreign adversary from finding and patching their backdoors. Read the reasoning behind the immediate ban- >>109041673
>To date, the government has only given us verbal evidence of a potential narrow, non-universal jailbreak, which essentially consists of asking the model to read a specific codebase and fix any software flaws. Our understanding is that one potential jailbreak was shared with the government. We have reviewed the report and validated that the level of capability displayed there is widely available from other models (including OpenAI’s GPT-5.5), and is used every day by the defenders who keep systems safe. We will share more details over the next 24 hours.

From that alone, it's obvious.

Anonymous
06/12/26(Fri)22:05:39 No.109042150

Anonymous 06/12/26(Fri)22:05:39 No.109042150

>>109042110
>image
s

Anonymous
06/12/26(Fri)22:05:41 No.109042151

Anonymous 06/12/26(Fri)22:05:41 No.109042151

File: ComfyUI_00314.jpg (3.16 MB, 2656x4096)

3.16 MB JPG

Anonymous
06/12/26(Fri)22:08:45 No.109042179

Anonymous 06/12/26(Fri)22:08:45 No.109042179

File: 1662341561354.png (1.28 MB, 1184x896)

1.28 MB PNG

>>109041707

Anonymous
06/12/26(Fri)22:08:55 No.109042180

Anonymous 06/12/26(Fri)22:08:55 No.109042180

>mfw Resource news

06/12/2026

>ComfyUI-Flux2Klein-Enhancer: Conditioning enhancement and reference latent control
https://github.com/capitan01R/ComfyUI-Flux2Klein-Enhancer

>InterleaveThinker: Reinforcing Agentic Interleaved Generation
https://zhengdian1.github.io/InterleaveThinker-proj

>Experimental Anima LLLite Regional Controlnet
https://huggingface.co/Sen-sou/Anima-LLLite-Regional-Controlnet

>World Tracing: Generative Pixel-Aligned Geometry Beyond the Visible
https://haoz19.github.io/world-tracing-page

>VietFashion: Benchmarking Sketch-Text Composed Image Retrieval for Cultural Outfits
https://hng0303.github.io/VietFashion

>Modality Forcing for Scalable Spatial Generation
https://modality-forcing.github.io

>VideoMDM: Towards 3D Human Motion Generation From 2D Supervision
https://videomdm.github.io

>EvTexture++: Event-Driven Texture Enhancement for Video Super-Resolution
https://github.com/DachunKai/EvTexture

>Budget-Constrained Step-Level Diffusion Caching
https://github.com/Westlake-AGI-Lab/BudCache

>ECA: Efficient Continual Alignment for Open-Ended Image-to-Text Generation
https://github.com/Snowball0823/ECA

>InterleaveThinker: Reinforcing Agentic Interleaved Generation
https://zhengdian1.github.io/InterleaveThinker-proj

>i1-3B: A Simple and Fully Open Recipe for Strong Text-to-Image Models
https://huggingface.co/zlab-princeton/i1-3B

06/11/2026

>i1: A Simple and Fully Open Recipe for Strong Text-to-Image Models
https://zlab-princeton.github.io/i1

>AnchorEdit: Maintaining Temporal Consistency in Multi-turn Image Editing via Causal Memory
https://github.com/xuhang07/AnchorEdit

>Reroute, Don't Remove: Recoverable Visual Token Routing for Vision-Language Models
https://github.com/elmma/mllm-reroute

>ComfyUI-BerniniStudio
https://github.com/CCpt5/ComfyUI-BerniniStudio

>Ideoprompt: plain English to Ideogram 4 structured JSON prompt
https://github.com/cocktailpeanut/ideoprompt

>Orion4D FXMax for ComfyUI
https://github.com/orion4d/Orion4D_FXMax

Anonymous
06/12/26(Fri)22:09:55 No.109042188

Anonymous 06/12/26(Fri)22:09:55 No.109042188

>mfw Research news

06/12/2026

>TetherCache: Stabilizing Autoregressive Long-Form Video Generation with Gated Recall and Trusted Alignment
https://arxiv.org/abs/2606.13035

>DuET: Dual Expert Trajectories for Diffusion Image Editing
https://arxiv.org/abs/2606.13303

>Efficient, Robust, and Anti-Collusion Fingerprinting of Image Diffusion Models
https://arxiv.org/abs/2606.12977

>ReFree: Towards Realistic Co-Speech Video Generation via Reward-Free RL and Multilevel Speech Guidance
https://arxiv.org/abs/2606.13304

>SeamEdit: A Black-Box VLM-Agnostic Pipeline for Large-Image Semantic Editing
https://arxiv.org/abs/2606.13041

>Towards More General Control of Diffusion Models Using Jeffrey Guidance
https://arxiv.org/abs/2606.13240

>AudioX-Turbo: A Unified Framework for Efficient Anything-to-Audio Generation
https://zeyuet.github.io/AudioX-Turbo

>SmartFont: Dynamic Condition Allocation for Few-Shot Font Generation
https://arxiv.org/abs/2606.13382

>High-Fidelity Two-Step Image Generation via Teacher-Aligned End-to-End Distillation
https://arxiv.org/abs/2606.12575

>Bridging Modal Isolation in Interleaved Thinking: Supervising Modality Transitions via Stepwise Reinforcement
https://arxiv.org/abs/2606.12886

>Selecting Samples on Graphs: A Unified Dataset Pruning Framework for Lossless Training Acceleration
https://arxiv.org/abs/2606.12913

>Edit the Bits, Diff the Codes: Bitwise Residual Editing for Visual Autoregressive Models
https://arxiv.org/abs/2606.13558

>HYDRA-X: Native Unified Multimodal Models with Holistic Visual Tokenizers
https://arxiv.org/abs/2606.13289

>Emotional regulation improves deep learning-based image classification
https://arxiv.org/abs/2606.13081

>Weekend Time
https://suno.com/s/NDgulWDocrYOA7US

Anonymous
06/12/26(Fri)22:18:21 No.109042265

Anonymous 06/12/26(Fri)22:18:21 No.109042265

>>109042180
>>109042188
None of this data is useful when you blindly post it without any vetting. Didn't you learn your lesson already?
Why are you spamming this in this thread when you have a tard cage all to yourself?

Anonymous
06/12/26(Fri)22:23:53 No.109042312

Anonymous 06/12/26(Fri)22:23:53 No.109042312

>>109041406
Haven't figured out how to nest LoRAs yet, I guess one could merge one of them into base instead, and then go from there. In the meantime, the final epoch perhaps wasn't so bad after all even with a bit of overbaking in there.

https://vocaroo.com/12tVNq7SnhO1
https://vocaroo.com/1iXXFRxfvMQy

Anonymous
06/12/26(Fri)22:23:58 No.109042316

Anonymous 06/12/26(Fri)22:23:58 No.109042316

File: debo_cm_anima_00047_.jpg (45 KB, 390x470)

45 KB JPG

>>109042265
your 'feedback' is just a vehicle for you to attack me as a poster; therefor, it will not be considered

Anonymous
06/12/26(Fri)22:26:53 No.109042334

Anonymous 06/12/26(Fri)22:26:53 No.109042334

>>109042312
>https://vocaroo.com/12tVNq7SnhO1
Leslie Parrish's voice really shining thru there kek. Lowering LoRA weight should always mitigate diversity issues.

Anonymous
06/12/26(Fri)22:27:03 No.109042337

Anonymous 06/12/26(Fri)22:27:03 No.109042337

File: bimbos.mp4 (2.71 MB, 704x1280)

2.71 MB MP4

Anonymous
06/12/26(Fri)22:33:01 No.109042378

Anonymous 06/12/26(Fri)22:33:01 No.109042378

>>109042265
Don't listen to this guy >>109042180
He's just mad cause he can't read.
Thank you for blessing us with daily research.

Anonymous
06/12/26(Fri)22:36:38 No.109042411

Anonymous 06/12/26(Fri)22:36:38 No.109042411

>>109042312
Something interesting is that I didn't tag it, just used generic descriptions, but for some reason Claude is able to control exactly which singer and style from a 20+ song dataset gets triggered, even without using the same exact captions.

Here's Dave Rogers style song
https://vocaroo.com/1lXhL94xMsvi

Manuel
https://vocaroo.com/14wvmcvt94lB

Anonymous
06/12/26(Fri)22:39:25 No.109042442

Anonymous 06/12/26(Fri)22:39:25 No.109042442

>>109042411
And there's a total of 2 songs from both of them. It's a diverse dataset.

Anonymous
06/12/26(Fri)22:42:16 No.109042471

Anonymous 06/12/26(Fri)22:42:16 No.109042471

File: debo_s_fia_00064_.png (1.79 MB, 1792x977)

1.79 MB PNG

>>109042378
:)

Anonymous
06/12/26(Fri)22:47:31 No.109042516

Anonymous 06/12/26(Fri)22:47:31 No.109042516

Is anyone here using Ideogram 4? What do you think?

Anonymous
06/12/26(Fri)22:55:56 No.109042569

Anonymous 06/12/26(Fri)22:55:56 No.109042569

>>109042516
i saw people experimenting and it seems to be the best t2i model if you know what you want. i am still waiting for reference image support

Anonymous
06/12/26(Fri)22:57:39 No.109042575

Anonymous 06/12/26(Fri)22:57:39 No.109042575

File: Ideogram_0017.jpg (738 KB, 1408x1872)

738 KB JPG

Where is the workflow the scail2?

It doesn't appear as a default workflow so I assume Kijai posted somewhere in one of his many many repos and just assumed I'd know where.

>>109042516
I like it. Like as far as the control over the image composition it gives you, it rivals saas. I've been meaning to train some hentai comic LoRAs to see how it handles art styles because it's already fairly competent at making comics

Anonymous
06/12/26(Fri)22:59:29 No.109042585

Anonymous 06/12/26(Fri)22:59:29 No.109042585

>>109042033
>Local is so far behind you still have to train loras for outfits while API can do it in a single-shot and even search the internet.

Local LLMs are not as far behind as you think. Image models etc... are one thing that only hobbyist companies do, but the big open source companies have focused on LLMs, and as such there's already a local LLM on par with the best cloud LLMs (of course, just not as good as the latest one, but if we can take a local LLM and say it's as good as Gemini 3 Pro, that's close enough...)

Anonymous
06/12/26(Fri)23:03:24 No.109042598

Anonymous 06/12/26(Fri)23:03:24 No.109042598

>>109042585
Now, you may not believe in benchmarks, but this is the case based on private tests, benchmarks, lmsys, etc... GLM 5.1 is better than Gemini 3.1 Pro and Sonnet 4.6

Anonymous
06/12/26(Fri)23:03:30 No.109042599

Anonymous 06/12/26(Fri)23:03:30 No.109042599

>>109042516
yes, it's the most powerful prompting

Anonymous
06/12/26(Fri)23:05:23 No.109042606

Anonymous 06/12/26(Fri)23:05:23 No.109042606

>>109042585
The main thing is that dataset determines the diffusion destiny.

And that can often mean that big models are in fact more restricted in what they can include.

Here's a question to ask, has nanobanana ever trained on real actual medium format scans?

Anonymous
06/12/26(Fri)23:05:51 No.109042610

Anonymous 06/12/26(Fri)23:05:51 No.109042610

>>109042575
https://github.com/Brobert-in-aus/scail-auto-extend
Use the wf with this node

Anonymous
06/12/26(Fri)23:06:23 No.109042612

Anonymous 06/12/26(Fri)23:06:23 No.109042612

>109042599
use ideogram to generate a cartoon explaining how to make curry like your mom makes it.

Anonymous
06/12/26(Fri)23:14:43 No.109042670

Anonymous 06/12/26(Fri)23:14:43 No.109042670

>>109042610
ty

Anonymous
06/12/26(Fri)23:32:52 No.109042794

Anonymous 06/12/26(Fri)23:32:52 No.109042794

any differences on generating hentai from december to now? I think I'm on stable difussion through forge webui

Anonymous
06/12/26(Fri)23:35:22 No.109042806

Anonymous 06/12/26(Fri)23:35:22 No.109042806

>>109042794
Nah, industry is pretty stagnant

Anonymous
06/12/26(Fri)23:37:26 No.109042814

Anonymous 06/12/26(Fri)23:37:26 No.109042814

File: Wan21_SCAIL2_00030.png (682 KB, 1280x528)

682 KB PNG

>>109042794
Don't listen to APIcucks. They're trolling. Wan and LTX is better than ever. Check out last thread for multi character swap potential of SCAIL-2

Anonymous
06/12/26(Fri)23:38:00 No.109042816

Anonymous 06/12/26(Fri)23:38:00 No.109042816

>>109042806
*local is pretty stagnant
industry got tons of sota image models and claude fable is so insanely good the government had to ban it

Anonymous
06/12/26(Fri)23:40:08 No.109042830

Anonymous 06/12/26(Fri)23:40:08 No.109042830

File: Wan21_SCAIL2_00030.mp4 (404 KB, 1280x528)

404 KB MP4

Figured out how Wan SCAIL-2 multicharacter supposed to work. Just combine 2 characters in the segmentation input and prompt for 2 subjects. It will swap them out. Work best with same aspect ratio input references.

Anonymous
06/12/26(Fri)23:42:43 No.109042847

Anonymous 06/12/26(Fri)23:42:43 No.109042847

>>109042830
that has lots of potential

Anonymous
06/12/26(Fri)23:44:02 No.109042855

Anonymous 06/12/26(Fri)23:44:02 No.109042855

>>109042816
>claude fable
https://www.youtube.com/watch?v=wVJ7LYrl83E

Anonymous
06/12/26(Fri)23:44:07 No.109042856

Anonymous 06/12/26(Fri)23:44:07 No.109042856

File: 1773474314012368.png (332 KB, 976x496)

332 KB PNG

What if I dont want to just copy animations from existing anime shit?

I don't wanna be Alan Bowe, I wanna be Miyazaki.

Anonymous
06/12/26(Fri)23:44:15 No.109042857

Anonymous 06/12/26(Fri)23:44:15 No.109042857

>>109042830
I’ve found it to be hit and miss with prompting multiple subjects, as far as I can tell it goes off the first frame of the video, so if there’s only one character clearly visible to begin with it won’t segment the image into multiple characters, instead both of them will be blue.

Anonymous
06/12/26(Fri)23:44:57 No.109042864

Anonymous 06/12/26(Fri)23:44:57 No.109042864

>>109042816
>claude fable
Kind of annoyed about this. I had some projects I wanted it to look over and clean up then they banned it.

Anonymous
06/12/26(Fri)23:46:45 No.109042870

Anonymous 06/12/26(Fri)23:46:45 No.109042870

>>109042864
Dont worry about it.

It was opus 4.6 but with a bunch of grifter ass underlying bullshit like how it aggressively nerfed outputs for no reason.

So, technically, we still have Fable 5. Just go use Opus 4.6

Anonymous
06/12/26(Fri)23:51:00 No.109042903

Anonymous 06/12/26(Fri)23:51:00 No.109042903

>>109042830
my pc is too lowend to take on WAN. what's scail-2 do?

Anonymous
06/12/26(Fri)23:54:46 No.109042929

Anonymous 06/12/26(Fri)23:54:46 No.109042929

File: 1756312018172647.png (3.9 MB, 1037x1850)

3.9 MB PNG

>>109042516
I thought it was total shit at first but since it can make comic pages like this, >>109042575
its actually got *some* potential, however, just making the comic panels one by one and using a program like comic life 3 is still better.

Anonymous
06/12/26(Fri)23:54:51 No.109042931

Anonymous 06/12/26(Fri)23:54:51 No.109042931

>>109042903

Better and easier to use reference replacements and tracking. Just look at previous threads for more examples.

Anonymous
06/12/26(Fri)23:57:00 No.109042947

Anonymous 06/12/26(Fri)23:57:00 No.109042947

File: Ideogram_0008.jpg (564 KB, 1936x1088)

564 KB JPG

Anonymous
06/12/26(Fri)23:57:47 No.109042953

Anonymous 06/12/26(Fri)23:57:47 No.109042953

if ideogram doesn't get some kind of finetune it will be the biggest waste of potential yet. 'muh license' is such cope. what are the chinks even doing, are they too ashamed to train on a western model? what ever happened to noobai?

Anonymous
06/12/26(Fri)23:57:59 No.109042956

Anonymous 06/12/26(Fri)23:57:59 No.109042956

File: scail2.mp4 (1.77 MB, 1408x1280)

1.77 MB MP4

>>109042903
better video referenced videos is the main current usage I think

Anonymous
06/12/26(Fri)23:59:43 No.109042961

Anonymous 06/12/26(Fri)23:59:43 No.109042961

>>109042956
me when i blink out of existence

Anonymous
06/13/26(Sat)00:00:10 No.109042964

Anonymous 06/13/26(Sat)00:00:10 No.109042964

>>109042953
>'muh license' is such cope.
I actually hate these people who pop up to remind you about the license whenever you discuss the model. It's genuinely the most useless "Uhm achually" sentiment you can make.

Anonymous
06/13/26(Sat)00:01:35 No.109042975

Anonymous 06/13/26(Sat)00:01:35 No.109042975

>>109042953
>what ever happened to noobai?
noob2 is saas only and controlled by comfyorg or some shit

Anonymous
06/13/26(Sat)00:04:02 No.109042994

Anonymous 06/13/26(Sat)00:04:02 No.109042994

>>109042953
>'muh license' is such cope
it's the most relevant feature that determines if the most relevant parties involved can even actually use/redistribute the model

Anonymous
06/13/26(Sat)00:04:06 No.109042996

Anonymous 06/13/26(Sat)00:04:06 No.109042996

I have been using ideogram for about a day, and not once have I triggered that gray blocked content blob thing.

Then again, I haven't attempt to really generate any nudity since I'm not a coom brain

Anonymous
06/13/26(Sat)00:04:58 No.109043000

Anonymous 06/13/26(Sat)00:04:58 No.109043000

>>109042994
who gives a shit

Anonymous
06/13/26(Sat)00:09:51 No.109043025

Anonymous 06/13/26(Sat)00:09:51 No.109043025

>>109042956
Cute.

Anonymous
06/13/26(Sat)00:10:16 No.109043026

Anonymous 06/13/26(Sat)00:10:16 No.109043026

File: file.png (403 KB, 545x564)

403 KB PNG

how do i get klein 9b to make a nice vag? they like to add a bulge or just do some really weird shit and make them really long and weirdly shaped when i edit, pic related.

Anonymous
06/13/26(Sat)00:10:21 No.109043028

Anonymous 06/13/26(Sat)00:10:21 No.109043028

>>109042996
and?

Anonymous
06/13/26(Sat)00:12:04 No.109043041

Anonymous 06/13/26(Sat)00:12:04 No.109043041

>>109043000
do you just lack basic foresight or understanding?

people don't want to be rugpulled or sued after putting in effort as happened *countless* times in many areas surrounding copyright.

Anonymous
06/13/26(Sat)00:13:28 No.109043053

Anonymous 06/13/26(Sat)00:13:28 No.109043053

>>109043026
>how do i get klein 9b to make a nice vag?
You can't without LoRAs. It does not know what a vagina looks like.

Anonymous
06/13/26(Sat)00:15:59 No.109043067

Anonymous 06/13/26(Sat)00:15:59 No.109043067

File: cute.png (1.37 MB, 832x1248)

1.37 MB PNG

>>109043025
the video ended up cute and the original gen by another anon breads ago also was already very cute

Anonymous
06/13/26(Sat)00:17:20 No.109043079

Anonymous 06/13/26(Sat)00:17:20 No.109043079

File: 1756907251844432.png (186 KB, 330x399)

186 KB PNG

>>109043028
And I leveled up on fortnite this weekend

Anonymous
06/13/26(Sat)00:17:42 No.109043085

Anonymous 06/13/26(Sat)00:17:42 No.109043085

>>109043041
yawn

Anonymous
06/13/26(Sat)00:19:27 No.109043092

Anonymous 06/13/26(Sat)00:19:27 No.109043092

>>109043053
which loras. there is like a gorillon of them. i probably have them all and tried them all but i'm not sure what works. I was hoping for a definitive one.

Anonymous
06/13/26(Sat)00:20:09 No.109043096

Anonymous 06/13/26(Sat)00:20:09 No.109043096

>>109042516
Trash model that does more of the same.
Looks like the astroturfing worked.

Anonymous
06/13/26(Sat)00:21:00 No.109043101

Anonymous 06/13/26(Sat)00:21:00 No.109043101

>>109042316
The OP shows you linking anons to malware, why do you post this constantly with that being a known issue, did you apologize?

Anonymous
06/13/26(Sat)00:26:04 No.109043130

Anonymous 06/13/26(Sat)00:26:04 No.109043130

>>109042953
>are they too ashamed to train on a western model
Lol. Ernie is superior and the Chinks know it.

Anonymous
06/13/26(Sat)00:31:07 No.109043155

Anonymous 06/13/26(Sat)00:31:07 No.109043155

File: image_00034_.png (3 MB, 2048x2048)

3 MB PNG

>>109039192
To the cloud faggot, yes you can gen that shit in local this is Z image

Anonymous
06/13/26(Sat)00:33:08 No.109043167

Anonymous 06/13/26(Sat)00:33:08 No.109043167

>>109042996
I triggered it with a cat pictures. I guess it's a skill.

Anonymous
06/13/26(Sat)00:34:03 No.109043176

Anonymous 06/13/26(Sat)00:34:03 No.109043176

>ernie
forgot that shit existed. what's with china and pumping out garbage like hidream, ernie, and glm?

Anonymous
06/13/26(Sat)00:46:04 No.109043244

Anonymous 06/13/26(Sat)00:46:04 No.109043244

>>109043176
GLM is actually the best cloud AI on the market right now if you're not a shithead code grifter.

Anonymous
06/13/26(Sat)00:47:23 No.109043251

Anonymous 06/13/26(Sat)00:47:23 No.109043251

>>109043176
commercial/research entities regularly pump out models with no good questionable/nsfw tuning even outside china

yes few of them seem to succeed, but I guess they prefer to keep imaginary or perhaps real trouble away? idk about their circumstances in detail.

Anonymous
06/13/26(Sat)00:48:02 No.109043255

Anonymous 06/13/26(Sat)00:48:02 No.109043255

File: image_00036_.png (3.15 MB, 1667x1667)

3.15 MB PNG

>>109043155
An he uncucked version

Anonymous
06/13/26(Sat)00:49:54 No.109043265

Anonymous 06/13/26(Sat)00:49:54 No.109043265

>>109043244
the fact that you immediately thought of GLM’s llm says it all. the image model is downright terrible

Anonymous
06/13/26(Sat)00:51:59 No.109043277

Anonymous 06/13/26(Sat)00:51:59 No.109043277

it's friday? i thought it was tuesday

Anonymous
06/13/26(Sat)00:53:21 No.109043281

Anonymous 06/13/26(Sat)00:53:21 No.109043281

File: 367845.gif (3.82 MB, 320x222)

3.82 MB GIF

>set up a long queue and went to go work
>came back to find it was all using the same seed

Anonymous
06/13/26(Sat)00:55:04 No.109043288

Anonymous 06/13/26(Sat)00:55:04 No.109043288

File: image_00037_.png (2.22 MB, 1228x1228)

2.22 MB PNG

>>109043155
>>109043255
OK I think this is the best, uncucked, young but not too young, and with really 80s atmosphere

Anonymous
06/13/26(Sat)00:55:40 No.109043293

Anonymous 06/13/26(Sat)00:55:40 No.109043293

>>109043265
lmao, glm has an image model?

Forgive my ignorance

Anonymous
06/13/26(Sat)00:55:48 No.109043295

Anonymous 06/13/26(Sat)00:55:48 No.109043295

I was wondering why Bernini was so good.
They finetuned Wan on 20 million video pairs.
The model being from ByteDance too helps obv.

Anonymous
06/13/26(Sat)00:58:24 No.109043303

Anonymous 06/13/26(Sat)00:58:24 No.109043303

File: 2.jpg (98 KB, 1024x1024)

98 KB JPG

>>109041170
>First, you haven't run a full dynamic range of settings for Tan2.
no idea, i'm not smart but stubbornly refuse to take anon's word at face value sometimes.
>Also, you should always graph your steps.
wat? that grid is one of 4, flux 2 klein, 6 steps to rule out some of the ones looking unfinished because it just needed more than 4 steps that are usually fine.
the specific thing that drove me to doing this again was the other day seeing armpit skin gens anon posted and thinking that'd be something interesting to test across different models but got hung up on flux 2 klein because it kept giving me stubble but not like armpit stubble i see IRL it's like only in the skin wrinkles. pic related. so i tried prompting it away using shaved, hairless, etc, didn't work, wondered if a different sampler would be better, haven't had time/motivation to push thru going thru all the results yet so will probably stick with euler + beta or flux2 for klein gens.

>>109041320
>what's "Automatic"?
one of the scheduler settings in forge neo but i've never looked into it. in x/y tests i did with samplers/schedulers for zit (that lead me to preferring DPM++ 2s a RF with bong tangent) the automatic scheduler column didn't match any others exactly, assume it will be the same with klein and others but don't know for sure.

Anonymous
06/13/26(Sat)01:00:50 No.109043323

Anonymous 06/13/26(Sat)01:00:50 No.109043323

>>109043281
countless 1girls, gone like tears in rain
take measures to prevent this

Anonymous
06/13/26(Sat)01:01:11 No.109043324

Anonymous 06/13/26(Sat)01:01:11 No.109043324

>>109042516
pushing the local scene forward without being some stupid size like a 400b model. hopefully we get a model from some lab that pushes the convenient side of things as well, im lazy as hell with bboxing and just wanna prompt

Anonymous
06/13/26(Sat)01:14:42 No.109043408

Anonymous 06/13/26(Sat)01:14:42 No.109043408

>>109043323
i'm too lazy to patch it, i still haven't jumped onto the vibecode bandwagon so i would have to go search for the code that loads the seed from metadata

Anonymous
06/13/26(Sat)01:14:50 No.109043409

Anonymous 06/13/26(Sat)01:14:50 No.109043409

>>109042931
so you gen with the reference visible, then crop it?

Anonymous
06/13/26(Sat)01:23:40 No.109043458

Anonymous 06/13/26(Sat)01:23:40 No.109043458

>>109043409
it's like the image editing models or controlnets where you supply references, surely you used some of those by now?

Anonymous
06/13/26(Sat)01:24:19 No.109043462

Anonymous 06/13/26(Sat)01:24:19 No.109043462

File: Wan21_SCAIL2_00018.mp4 (3.64 MB, 2016x672)

3.64 MB MP4

>>109043409
Reference Image + Video = kino, what not to get?

Anonymous
06/13/26(Sat)01:29:56 No.109043490

Anonymous 06/13/26(Sat)01:29:56 No.109043490

>>109042312
Might have to retrain my ZUTOMAYO LoRA on the entirety of this album so I can have a cool nice live version of it, this time using the higher rank settings so it can pick up all the nuisances in her voice etc...
https://music.apple.com/us/album/midnight-forever-expo-meik%C5%8D-wa-gunaruga-gotoshi-live/1840129493

Anonymous
06/13/26(Sat)01:35:03 No.109043511

Anonymous 06/13/26(Sat)01:35:03 No.109043511

>>109043462
>kino
neked lady is not kino

Anonymous
06/13/26(Sat)01:47:08 No.109043562

Anonymous 06/13/26(Sat)01:47:08 No.109043562

>>109042180
>>109042188
thanks!
seems like you've missed https://nvlabs.github.io/motionbricks/

Anonymous
06/13/26(Sat)01:47:28 No.109043565

Anonymous 06/13/26(Sat)01:47:28 No.109043565

>>109041690
Tele Bgftg33

Turn my Asian gf's pics into a lora, send me a sample of other loras you've made

Anonymous
06/13/26(Sat)01:50:31 No.109043577

Anonymous 06/13/26(Sat)01:50:31 No.109043577

>>109043155
>>109043255
>>109043288
look awful

Anonymous
06/13/26(Sat)01:51:40 No.109043581

Anonymous 06/13/26(Sat)01:51:40 No.109043581

>>109043462
why is she so small

Anonymous
06/13/26(Sat)01:53:13 No.109043590

Anonymous 06/13/26(Sat)01:53:13 No.109043590

>>109043462
also have you tried to mask part of reference video to see how the model will fill it

Anonymous
06/13/26(Sat)01:53:43 No.109043591

Anonymous 06/13/26(Sat)01:53:43 No.109043591

>>109043462
I want Marika to rape me.

Anonymous
06/13/26(Sat)01:59:10 No.109043613

Anonymous 06/13/26(Sat)01:59:10 No.109043613

File: debo_s_fia_00077_.png (2.17 MB, 1792x977)

2.17 MB PNG

>>109043562
thanks, will add this

Anonymous
06/13/26(Sat)01:59:54 No.109043616

Anonymous 06/13/26(Sat)01:59:54 No.109043616

>>109043565
>>>/r/

Anonymous
06/13/26(Sat)02:01:07 No.109043622

Anonymous 06/13/26(Sat)02:01:07 No.109043622

>>109043616
/r/ is dead bro

Anonymous
06/13/26(Sat)02:03:08 No.109043632

Anonymous 06/13/26(Sat)02:03:08 No.109043632

>>109043562
Isn't this just motion matching with a bigger library of motions?

Anonymous
06/13/26(Sat)02:07:53 No.109043654

Anonymous 06/13/26(Sat)02:07:53 No.109043654

>>109043565
You misunderstand, anon. You must convince me that she's good enough to warrant my GPU time.

Anonymous
06/13/26(Sat)02:08:58 No.109043663

Anonymous 06/13/26(Sat)02:08:58 No.109043663

>>109043562
So if someone trains sex animations on this you can make a game world where you can fuck all of the NPCs?

Anonymous
06/13/26(Sat)02:09:26 No.109043666

Anonymous 06/13/26(Sat)02:09:26 No.109043666

File: 24646.webm (3.99 MB, 420x291)

3.99 MB WEBM

Anonymous
06/13/26(Sat)02:09:50 No.109043667

Anonymous 06/13/26(Sat)02:09:50 No.109043667

>>109043622
That is so stupid kek

Anonymous
06/13/26(Sat)02:12:27 No.109043676

Anonymous 06/13/26(Sat)02:12:27 No.109043676

>>109043667
Oh, so this is why it's gone
https://archive.is/2026.05.21-102104/https://www.wired.com/story/4chans-misogynist-wizards-are-nudifying-women-by-request/

But that is still retarded.

Anonymous
06/13/26(Sat)02:14:35 No.109043682

Anonymous 06/13/26(Sat)02:14:35 No.109043682

>>109043676
tldr. why did gook moot cave in now? feminists have been complaining for over a decade now

Anonymous
06/13/26(Sat)02:15:26 No.109043686

Anonymous 06/13/26(Sat)02:15:26 No.109043686

>>109043682
US Law - Take It Down act

Anonymous
06/13/26(Sat)02:17:36 No.109043694

Anonymous 06/13/26(Sat)02:17:36 No.109043694

>>109043686
oh ok. did they ban deep fakes on the whole site then?

Anonymous
06/13/26(Sat)02:18:13 No.109043702

Anonymous 06/13/26(Sat)02:18:13 No.109043702

File: 1779640686328456.png (1.26 MB, 2528x1173)

1.26 MB PNG

>>109043682
Picrel takes a certain level of degeneracy. Like Plebbitor/b normalfag invasion levels of degeneracy. Worse than other boards, of perhaps trolls spamming. Either way, I guess he just didn't feel like moderating it, because that's a good way to get bad PR.

Anonymous
06/13/26(Sat)02:19:20 No.109043712

Anonymous 06/13/26(Sat)02:19:20 No.109043712

>>109043694
it's only banned if it gets reported
>>109043702
>BBC
Why are white people like this?

Anonymous
06/13/26(Sat)02:22:15 No.109043725

Anonymous 06/13/26(Sat)02:22:15 No.109043725

File: 5644878.gif (3.62 MB, 320x245)

3.62 MB GIF

>>109043712
kikes are not white

Anonymous
06/13/26(Sat)02:26:38 No.109043750

Anonymous 06/13/26(Sat)02:26:38 No.109043750

File: Wan21_SCAIL2_00104.mp4 (1.04 MB, 1216x1024)

1.04 MB MP4

It doesn't wanna match the lighting, but otherwise worked well.
Seems like the best thing you can do (aside from replacing the background of your input image with white) is to rescale the input image to match the aspect ratio of the video by padding with white.

Anonymous
06/13/26(Sat)02:31:09 No.109043767

Anonymous 06/13/26(Sat)02:31:09 No.109043767

Can LoRA being butchered cause bad hands or is it all up to the checkpoint?

Anonymous
06/13/26(Sat)02:31:45 No.109043769

Anonymous 06/13/26(Sat)02:31:45 No.109043769

>>109043750
i use klein to refit an image into a new aspect ratio without stretching it

Anonymous
06/13/26(Sat)02:38:53 No.109043799

Anonymous 06/13/26(Sat)02:38:53 No.109043799

>>109043769
>i use klein to refit an image into a new aspect ratio without stretching it

nta but this is sounding like a lot of work for 5 seconds of footage that might a single (You) and excluded from the collage for being video.

Anonymous
06/13/26(Sat)02:41:55 No.109043806

Anonymous 06/13/26(Sat)02:41:55 No.109043806

>>109041297
>>109041690
This is uncomfortable to look at but not in a bad way.

Anonymous
06/13/26(Sat)02:42:06 No.109043808

Anonymous 06/13/26(Sat)02:42:06 No.109043808

>>109043750
Or gen new background mode to get the matching lighting and shadows.

Anonymous
06/13/26(Sat)02:43:13 No.109043813

Anonymous 06/13/26(Sat)02:43:13 No.109043813

how do I remove all the info from an image before uploading it to civitai?
Don't wanna people judging me

Anonymous
06/13/26(Sat)02:44:49 No.109043816

Anonymous 06/13/26(Sat)02:44:49 No.109043816

File: 475375.gif (3.24 MB, 320x222)

3.24 MB GIF

>>109043799
great things happen when you stop caring about superficial things

Anonymous
06/13/26(Sat)02:49:37 No.109043829

Anonymous 06/13/26(Sat)02:49:37 No.109043829

>>109043813
You could just remove the info manually after uploading. If you're still paranoid just save the image again with an image editor

Anonymous
06/13/26(Sat)02:52:46 No.109043839

Anonymous 06/13/26(Sat)02:52:46 No.109043839

I hate how LTX looks more coherent at higher fps but animations are stiffer. Probably because it was trained on 60fps videos of mostly video games and vtuber slop.

Anonymous
06/13/26(Sat)02:54:41 No.109043845

Anonymous 06/13/26(Sat)02:54:41 No.109043845

>>109043462
how smooth is it working with porn

Anonymous
06/13/26(Sat)02:57:27 No.109043857

Anonymous 06/13/26(Sat)02:57:27 No.109043857

>>109043813
literally just screenshot your img

Anonymous
06/13/26(Sat)02:57:42 No.109043859

Anonymous 06/13/26(Sat)02:57:42 No.109043859

>>109043839
the motion is far more dynamic at lower resolutions. you may be able to upscale it afterwards or use it as a control video for your higher resolution generations

Anonymous
06/13/26(Sat)02:58:21 No.109043864

Anonymous 06/13/26(Sat)02:58:21 No.109043864

>>109043816
why do you keep genning this child

Anonymous
06/13/26(Sat)03:05:33 No.109043893

Anonymous 06/13/26(Sat)03:05:33 No.109043893

File: forge.jpg (100 KB, 929x639)

100 KB JPG

Was using Anima in Forge... How I do this in Comfyui????

Anonymous
06/13/26(Sat)03:10:07 No.109043906

Anonymous 06/13/26(Sat)03:10:07 No.109043906

File: 4376.png (276 KB, 752x415)

276 KB PNG

>>109043864

Anonymous
06/13/26(Sat)03:20:07 No.109043933

Anonymous 06/13/26(Sat)03:20:07 No.109043933

File: debo_ccg_fia_00069_.png (1.15 MB, 1792x977)

1.15 MB PNG

Anonymous
06/13/26(Sat)03:20:53 No.109043936

Anonymous 06/13/26(Sat)03:20:53 No.109043936

>>109043864
4channers still advocate for behaviors adjacent to the pedophile socialite class, and that will not change, ever.

Anonymous
06/13/26(Sat)03:22:26 No.109043941

Anonymous 06/13/26(Sat)03:22:26 No.109043941

File: 1770725747734517.gif (190 KB, 384x256)

190 KB GIF

Anonymous
06/13/26(Sat)03:31:46 No.109043965

Anonymous 06/13/26(Sat)03:31:46 No.109043965

File: 1766680069807205.gif (247 KB, 384x256)

247 KB GIF

Anonymous
06/13/26(Sat)03:33:22 No.109043974

Anonymous 06/13/26(Sat)03:33:22 No.109043974

>>109043893
Search the manager or use SDUltimateUpscale.

Anonymous
06/13/26(Sat)03:33:28 No.109043975

Anonymous 06/13/26(Sat)03:33:28 No.109043975

>post a bikini pic on civitai
>immediately moved to red
lol

Anonymous
06/13/26(Sat)03:37:12 No.109043983

Anonymous 06/13/26(Sat)03:37:12 No.109043983

File: Wan21_SCAIL2_00143.mp4 (2.56 MB, 1056x1184)

2.56 MB MP4

The best part of this is that it made the background characters asian too

Anonymous
06/13/26(Sat)03:40:17 No.109043998

Anonymous 06/13/26(Sat)03:40:17 No.109043998

>>109043975
why even post there

Anonymous
06/13/26(Sat)03:44:47 No.109044023

Anonymous 06/13/26(Sat)03:44:47 No.109044023

>>109043983
asian psycho

Anonymous
06/13/26(Sat)03:45:45 No.109044032

Anonymous 06/13/26(Sat)03:45:45 No.109044032

>>109044023
>asian psycho
that'd be honestly kino
based on bubble era japan
t. gook

Anonymous
06/13/26(Sat)03:48:42 No.109044042

Anonymous 06/13/26(Sat)03:48:42 No.109044042

>>109041888
im currently testing my latest optimal training setup lora for acestep.
it uses qwen llm to generate audio codes.
if you got to their repo and check list of genres trained into it = hilarious.

as im testing this optimized lora im getting ai slop.
i throw in almost two paragraphs of insults ijnto the prompt. and some explanations this style of music != what insectoid llm slop thiks it is.

songs generated currently ar coming out as intended, in the style of lora i trained it in.

throw insults at it.

Anonymous
06/13/26(Sat)03:51:13 No.109044052

Anonymous 06/13/26(Sat)03:51:13 No.109044052

>>109042516
you do not need control net and region conditioning.
that is good.

Anonymous
06/13/26(Sat)03:51:20 No.109044054

Anonymous 06/13/26(Sat)03:51:20 No.109044054

Been playing with scail! Any of you have issues with audio and lipsyncing? any way to fix it??

Anonymous
06/13/26(Sat)03:59:14 No.109044086

Anonymous 06/13/26(Sat)03:59:14 No.109044086

File: Wan21_SCAIL2_00136.mp4 (865 KB, 1700x618)

865 KB MP4

Anonymous
06/13/26(Sat)04:01:58 No.109044102

Anonymous 06/13/26(Sat)04:01:58 No.109044102

File: Wan21_SCAIL2_00087_.mp4 (1.48 MB, 640x640)

1.48 MB MP4

>>109044086
nice consistency

Anonymous
06/13/26(Sat)04:02:51 No.109044104

Anonymous 06/13/26(Sat)04:02:51 No.109044104

>>109044086
why doesn't the cigarette appear? did you include it in the prompt?

Anonymous
06/13/26(Sat)04:02:58 No.109044105

Anonymous 06/13/26(Sat)04:02:58 No.109044105

are dynamic prompts always chosen randomly or is there a way to do them in order so i could have quite a long list of {big boob|small boob|medium boob} and gen and it will cycle them in order rather than random

Anonymous
06/13/26(Sat)04:09:14 No.109044129

Anonymous 06/13/26(Sat)04:09:14 No.109044129

>>109044105
Wildcards are chosen randomly.
You should be able to slop your custom node to have them cycled in order.

Anonymous
06/13/26(Sat)04:17:18 No.109044153

Anonymous 06/13/26(Sat)04:17:18 No.109044153

File: Wan21_SCAIL2_00065.mp4 (3.54 MB, 1440x400)

3.54 MB MP4

Triple replacement test

Anonymous
06/13/26(Sat)04:18:56 No.109044161

Anonymous 06/13/26(Sat)04:18:56 No.109044161

File: new_counters_WanScail2Test5.png (3.44 MB, 1280x704)

3.44 MB PNG

>>109044153

Anonymous
06/13/26(Sat)04:22:15 No.109044173

Anonymous 06/13/26(Sat)04:22:15 No.109044173

>>109042947
nice

Anonymous
06/13/26(Sat)04:22:32 No.109044174

Anonymous 06/13/26(Sat)04:22:32 No.109044174

>>109044153
> replacement

Anonymous
06/13/26(Sat)04:38:46 No.109044220

Anonymous 06/13/26(Sat)04:38:46 No.109044220

File: new local (shit) toss.png (1.07 MB, 1024x1024)

1.07 MB PNG

New pixel-space 'toss:
https://huggingface.co/spaces/Photoroom/PRX-Pixel
(It's shit. Like even the some of the example images in the demo have broken anatomy.)
Tested with like three images. It sucks at text. (Maybe more steps help I dunno just ran default 28)
At CFG 1 the images look ZIT-like, but with much worse prompt adherence. Higher CFG gives better prompt adherence but it looks slopped. Doesn't seem to have much character knowledge, didn't test styles or celebrities, but I wouldn't hope for much.
One positive thing is that unlike many other local pixel space models it doesn't suffer from patch artifacts. Speaks about state of things with these slopped research preview garbage when the bare fucking minimum feels noteworthy to mention.

Anonymous
06/13/26(Sat)04:42:30 No.109044233

Anonymous 06/13/26(Sat)04:42:30 No.109044233

File: summerin.png (2.16 MB, 1024x1536)

2.16 MB PNG

It's summer and I'm just sitting at home prompting.

Anonymous
06/13/26(Sat)04:47:29 No.109044239

Anonymous 06/13/26(Sat)04:47:29 No.109044239

>>109044220
>7B
tubby girl

Anonymous
06/13/26(Sat)04:55:04 No.109044262

Anonymous 06/13/26(Sat)04:55:04 No.109044262

File: Wan21_SCAIL2NoAudio_00002.webm (3.86 MB, 802x960)

3.86 MB WEBM

Am I supposed to be daisy chaining extend nodes? Because I just set 81 on the initial node and +129 on the singular extend node and it seems to just werk

Anonymous
06/13/26(Sat)05:04:56 No.109044288

Anonymous 06/13/26(Sat)05:04:56 No.109044288

File: hftyy.png (82 KB, 804x798)

82 KB PNG

>>109044262
lost

Anonymous
06/13/26(Sat)05:12:58 No.109044324

Anonymous 06/13/26(Sat)05:12:58 No.109044324

File: Wan21_SCAIL2_00168.mp4 (1.39 MB, 640x768)

1.39 MB MP4

Anonymous
06/13/26(Sat)05:13:27 No.109044327

Anonymous 06/13/26(Sat)05:13:27 No.109044327

File: clip_Single_00011.mp4 (3.26 MB, 1056x592)

3.26 MB MP4

>>109044153
only three? Those are rookie numbers
>>109044262
https://github.com/Brobert-in-aus/scail-auto-extend
use this node + wf

Anonymous
06/13/26(Sat)05:14:28 No.109044332

Anonymous 06/13/26(Sat)05:14:28 No.109044332

File: file.png (124 KB, 277x216)

124 KB PNG

>>109044327

Anonymous
06/13/26(Sat)05:18:09 No.109044345

Anonymous 06/13/26(Sat)05:18:09 No.109044345

>tfw 2 years ago i was waiting for 5mins for an 480x720 gen to be upscaled now im waiting 5mins for a 20second coherent full on video of my waifu getting plowed by a green orc cock

life really moves fast

Anonymous
06/13/26(Sat)05:19:27 No.109044349

Anonymous 06/13/26(Sat)05:19:27 No.109044349

File: igram7.jpg (252 KB, 1184x848)

252 KB JPG

>>109044233
well yes, the more convenient campaign seasons for murder, rape and plunder are spring/fall. siege in summer or winter means "bold" mis-planning or something you plebeian should not complain about if you want to live.

being at home in summer AND not getting sieged yourself is good, fren

Anonymous
06/13/26(Sat)05:23:19 No.109044359

Anonymous 06/13/26(Sat)05:23:19 No.109044359

File: comfyui_00035_.png (1.1 MB, 896x1152)

1.1 MB PNG

lol
esoteric art style LoRA just downloaded from CivitAI

Anonymous
06/13/26(Sat)05:27:01 No.109044372

Anonymous 06/13/26(Sat)05:27:01 No.109044372

File: Ideogram_4.0_00040_.png (1.67 MB, 1024x1024)

1.67 MB PNG

>>109042516
There is a lot of detail in the model. Also kind of model puts in a frenulum piercing unprompted? It seems to have lots of unwanted sampling variety.

Anonymous
06/13/26(Sat)05:27:55 No.109044379

Anonymous 06/13/26(Sat)05:27:55 No.109044379

File: comfyui_00036_.png (1.56 MB, 896x1152)

1.56 MB PNG

Anonymous
06/13/26(Sat)05:30:45 No.109044398

Anonymous 06/13/26(Sat)05:30:45 No.109044398

File: 1779843893264204.jpg (64 KB, 964x912)

64 KB JPG

Can I link civitai.red LoRAs I trained on my resume?

Anonymous
06/13/26(Sat)05:35:05 No.109044418

Anonymous 06/13/26(Sat)05:35:05 No.109044418

>>109044398
There are adult board links in OP. Maybe you can say you put some stuff in those.

Anonymous
06/13/26(Sat)05:35:21 No.109044419

Anonymous 06/13/26(Sat)05:35:21 No.109044419

>>109044372
ideogram hallucinates random details because the gptslop it was trained on also hallucinated random details

Anonymous
06/13/26(Sat)05:39:44 No.109044437

Anonymous 06/13/26(Sat)05:39:44 No.109044437

>>109044398
sure.

Anonymous
06/13/26(Sat)05:46:02 No.109044459

Anonymous 06/13/26(Sat)05:46:02 No.109044459

File: Wan21_SCAIL2_00022.mp4 (3.65 MB, 592x1056)

3.65 MB MP4

>>109044327
Oh cool, thank you

Anonymous
06/13/26(Sat)05:49:46 No.109044470

Anonymous 06/13/26(Sat)05:49:46 No.109044470

>>109044153
Would you care to share your workflow?

Anonymous
06/13/26(Sat)06:03:39 No.109044555

Anonymous 06/13/26(Sat)06:03:39 No.109044555

File: Wan21_SCAIL2_00176.mp4 (1.2 MB, 1088x608)

1.2 MB MP4

Anonymous
06/13/26(Sat)06:16:10 No.109044620

Anonymous 06/13/26(Sat)06:16:10 No.109044620

File: Screenshot_20260613_131146.png (395 KB, 592x790)

395 KB PNG

>>109044054
nsfw
https://files.catbox.moe/021w4k.mp4

so what's the problem with the audio? I exported the video as 24fps. Wf has 24fps set in the first node.

Anonymous
06/13/26(Sat)06:22:16 No.109044647

Anonymous 06/13/26(Sat)06:22:16 No.109044647

>>109044620
KEK

Anonymous
06/13/26(Sat)06:29:00 No.109044681

Anonymous 06/13/26(Sat)06:29:00 No.109044681

>>109044620
Lord Farquaad with tits.

Anonymous
06/13/26(Sat)06:32:00 No.109044694

Anonymous 06/13/26(Sat)06:32:00 No.109044694

File: clip_Double_00017.webm (3.93 MB, 912x1344)

3.93 MB WEBM

Anonymous
06/13/26(Sat)06:37:12 No.109044708

Anonymous 06/13/26(Sat)06:37:12 No.109044708

>>109044694
I fucking love Scrubs

Anonymous
06/13/26(Sat)07:15:57 No.109044871

Anonymous 06/13/26(Sat)07:15:57 No.109044871

File: clip_Single_00020.mp4 (333 KB, 976x640)

333 KB MP4

>>109044620

Anonymous
06/13/26(Sat)07:21:02 No.109044895

Anonymous 06/13/26(Sat)07:21:02 No.109044895

>>109044694
My wife is telling me this woman is called Fukuda Aimi. Is this right?

Anonymous
06/13/26(Sat)07:26:11 No.109044917

Anonymous 06/13/26(Sat)07:26:11 No.109044917

File: clip_Single_00022.webm (3.95 MB, 784x622)

3.95 MB WEBM

>>109044895

Anonymous
06/13/26(Sat)07:26:22 No.109044919

Anonymous 06/13/26(Sat)07:26:22 No.109044919

File: Wan21_SCAIL2_00038-2.webm (3.86 MB, 1920x720)

3.86 MB WEBM

Anonymous
06/13/26(Sat)07:26:38 No.109044922

Anonymous 06/13/26(Sat)07:26:38 No.109044922

File: Ideogram_0014.jpg (344 KB, 2224x960)

344 KB JPG

Anonymous
06/13/26(Sat)07:27:36 No.109044931

Anonymous 06/13/26(Sat)07:27:36 No.109044931

can scail do ahegao? wan animate struggled with eye movement and tongue, asking for a friend

Anonymous
06/13/26(Sat)07:30:03 No.109044945

Anonymous 06/13/26(Sat)07:30:03 No.109044945

File: Wanimate_00006-noaudio.mp4 (1.69 MB, 1162x544)

1.69 MB MP4

>>109044919
Old wanimate comparison. I think wanimate did better with the likeness, initially anyway. Degraded heavily over the course of the video

Anonymous
06/13/26(Sat)07:30:23 No.109044948

Anonymous 06/13/26(Sat)07:30:23 No.109044948

Someone post the fucking scail workflow with background removal. It's a pain in the ass to setup.

Anonymous
06/13/26(Sat)07:32:55 No.109044961

Anonymous 06/13/26(Sat)07:32:55 No.109044961

Zbase learns so fast, nsfw test: https://files.catbox.moe/l7aak0.jpg

Anonymous
06/13/26(Sat)07:35:26 No.109044971

Anonymous 06/13/26(Sat)07:35:26 No.109044971

>>109044961
what are your training settings?

Anonymous
06/13/26(Sat)07:42:37 No.109045002

Anonymous 06/13/26(Sat)07:42:37 No.109045002

>>109044945
for you

Anonymous
06/13/26(Sat)07:44:01 No.109045012

Anonymous 06/13/26(Sat)07:44:01 No.109045012

File: clip_Single_00023.mp4 (593 KB, 976x640)

593 KB MP4

>>109044948
There's probably a more elegant way to do it than this, but here you go
https://files.catbox.moe/xntg6t.png

Anonymous
06/13/26(Sat)07:44:45 No.109045016

Anonymous 06/13/26(Sat)07:44:45 No.109045016

File: Wan21_SCAIL2_00016.mp4 (3.47 MB, 1786x2048)

3.47 MB MP4

>>109044931
sometimes. it's not 100%

Anonymous
06/13/26(Sat)07:45:35 No.109045024

Anonymous 06/13/26(Sat)07:45:35 No.109045024

>>109045012
Thank you.

Anonymous
06/13/26(Sat)07:47:33 No.109045031

Anonymous 06/13/26(Sat)07:47:33 No.109045031

>>109044931
it has a pretty decent success rate but like the other anon said I too don't consider it "reliable" yet

it's not quite solid with regards to EITHER all humanoid reference images or the reference video, same as other facial expressions really

and i mean only those where i'd expect (or can test) most segmentation models yolo whatever to identify the facial features otherwise

Anonymous
06/13/26(Sat)07:48:21 No.109045037

Anonymous 06/13/26(Sat)07:48:21 No.109045037

File: Untitled.png (33 KB, 450x387)

33 KB PNG

>>109045012
Have you tried this node instead of flux klein?

Anonymous
06/13/26(Sat)07:48:30 No.109045038

Anonymous 06/13/26(Sat)07:48:30 No.109045038

File: lo1l.webm (2.87 MB, 1056x960)

2.87 MB WEBM

Anonymous
06/13/26(Sat)07:50:05 No.109045047

Anonymous 06/13/26(Sat)07:50:05 No.109045047

>>109045037
that's... probably way quicker, had no idea it existed.

Anonymous
06/13/26(Sat)07:51:54 No.109045057

Anonymous 06/13/26(Sat)07:51:54 No.109045057

>>109045047
it exists in dozens of similar sounding names for a while now, you probably want the most popular "rmbg"

Anonymous
06/13/26(Sat)07:52:24 No.109045059

Anonymous 06/13/26(Sat)07:52:24 No.109045059

>>109045038
impressive

Anonymous
06/13/26(Sat)08:03:40 No.109045116

Anonymous 06/13/26(Sat)08:03:40 No.109045116

File: 36764534.webm (3.66 MB, 420x291)

3.66 MB WEBM

Anonymous
06/13/26(Sat)08:09:24 No.109045133

Anonymous 06/13/26(Sat)08:09:24 No.109045133

File: Wan21_SCAIL2_00181.mp4 (3.6 MB, 1258x1500)

3.6 MB MP4

Anonymous
06/13/26(Sat)08:10:48 No.109045142

Anonymous 06/13/26(Sat)08:10:48 No.109045142

File: Wan21_SCAIL2_00040.webm (2.01 MB, 828x960)

2.01 MB WEBM

>>109044931
>>109045016
>>109045031
Tongue works but I never see the eyes crossed correctly. Maybe a wan2.1 ahegao lora would help?

Anonymous
06/13/26(Sat)08:17:50 No.109045187

Anonymous 06/13/26(Sat)08:17:50 No.109045187

>>109044398
Coomers seem to know more about AI technology than the average normie so employers are retarded if they're turning away applicants for NSFW loras

Anonymous
06/13/26(Sat)08:22:59 No.109045207

Anonymous 06/13/26(Sat)08:22:59 No.109045207

When did comfy go from a scratch disk raping monster when generating videos to a smooth memory managing king?

Anonymous
06/13/26(Sat)08:31:57 No.109045253

Anonymous 06/13/26(Sat)08:31:57 No.109045253

>>109042956
>HITGS on a /g/ thread

Nice.

Anonymous
06/13/26(Sat)08:47:58 No.109045349

Anonymous 06/13/26(Sat)08:47:58 No.109045349

File: Wan21_SCAIL2_00195.mp4 (3.88 MB, 1786x2048)

3.88 MB MP4

>>109045142
not perfect but looks like it has some problem with 3D model doing crossed eye

Anonymous
06/13/26(Sat)08:48:47 No.109045356

Anonymous 06/13/26(Sat)08:48:47 No.109045356

>>109045349
It's an improvement. The original woman is very punchable.

Anonymous
06/13/26(Sat)08:49:36 No.109045362

Anonymous 06/13/26(Sat)08:49:36 No.109045362

>>109045349
have you tried it with this lora?
https://civarchive.com/models/1390545?modelVersionId=1571626

Anonymous
06/13/26(Sat)08:52:02 No.109045375

Anonymous 06/13/26(Sat)08:52:02 No.109045375

File: dance_miku2.webm (2.95 MB, 960x832)

2.95 MB WEBM

Anonymous
06/13/26(Sat)08:53:45 No.109045387

Anonymous 06/13/26(Sat)08:53:45 No.109045387

Is there a reason SCAIL was built on Wan2.1 instead of 2.2?

Anonymous
06/13/26(Sat)08:54:32 No.109045389

Anonymous 06/13/26(Sat)08:54:32 No.109045389

>>109042610
nice, thanks for sharing this.

works for me except i had to disable the last comparison image concatenate, somehow out of all things that is what OOMs here - sticking bitmap images together.

Anonymous
06/13/26(Sat)08:54:43 No.109045390

Anonymous 06/13/26(Sat)08:54:43 No.109045390

>>109045362
why dont u try it

Anonymous
06/13/26(Sat)08:57:34 No.109045398

Anonymous 06/13/26(Sat)08:57:34 No.109045398

>>109045390
im currently training a lora and dont have the gpu capacity to try it right now

Anonymous
06/13/26(Sat)08:57:38 No.109045399

Anonymous 06/13/26(Sat)08:57:38 No.109045399

>>109045375

i get seven keyframes with 2005 computer
you wanna to reshoot at ntsc anon?

Anonymous
06/13/26(Sat)08:58:57 No.109045405

Anonymous 06/13/26(Sat)08:58:57 No.109045405

>>109045349
What resolution are your inputs? I’m getting pretty bad facial likeness consistency with a 9:16 ~1200px ref (don’t have it in front of me atm)

Anonymous
06/13/26(Sat)08:59:58 No.109045407

Anonymous 06/13/26(Sat)08:59:58 No.109045407

If you have to ask how much VRAM costs, you can't afford it.

Anonymous
06/13/26(Sat)09:02:48 No.109045424

Anonymous 06/13/26(Sat)09:02:48 No.109045424

>>109045405
reference image 1869x2300, rendering video at 576x1056

Anonymous
06/13/26(Sat)09:05:30 No.109045436

Anonymous 06/13/26(Sat)09:05:30 No.109045436

>>109045387
since it only need to copy the motion of subject, it doesn't need the improved motion from wan2.2 ?

Anonymous
06/13/26(Sat)09:06:37 No.109045445

Anonymous 06/13/26(Sat)09:06:37 No.109045445

>>109045387
probably started before wan2.2 was out, maybe you get the opportunity to ask them directly on social media or w/e. i think they do write english.

Anonymous
06/13/26(Sat)09:08:46 No.109045461

Anonymous 06/13/26(Sat)09:08:46 No.109045461

File: scail2.mp4 (2.12 MB, 512x896)

2.12 MB MP4

Anonymous
06/13/26(Sat)09:09:40 No.109045466

Anonymous 06/13/26(Sat)09:09:40 No.109045466

>>109045387
chatgpt said this:
>Wan 2.1 14B is one dense transformer. SCAIL-2 modifies its conditioning sequence, masking channels, and RoPE behavior. Applying those changes to one dense model is relatively straightforward.
>Wan 2.2 A14B is a two-expert MoE model: approximately 27B total parameters, with separate high-noise and low-noise 14B experts. SCAIL training would need to modify and train both experts consistently, greatly increasing storage, training memory, complexity, and cost.
>Driving video already supplies motion. Wan 2.2’s improved prompt-generated motion and aesthetics provide less benefit when SCAIL directly transfers motion from another video.
>Wan 2.1 has a mature modification ecosystem. Its dense architecture is easier to fine-tune, convert, quantize, integrate into ComfyUI, and extend with LoRAs.
>SCAIL-2 actually uses Wan2.2 Animate as one of its data-generation teachers, so the authors were aware of it. They seemingly chose to distill its useful behavior into the simpler Wan 2.1 backbone.

Anonymous
06/13/26(Sat)09:10:57 No.109045471

Anonymous 06/13/26(Sat)09:10:57 No.109045471

File: dance_miku3.webm (826 KB, 960x832)

826 KB WEBM

>the effect at 0:03

Anonymous
06/13/26(Sat)09:16:12 No.109045502

Anonymous 06/13/26(Sat)09:16:12 No.109045502

>>109045461
kino alert

Anonymous
06/13/26(Sat)09:16:23 No.109045503

Anonymous 06/13/26(Sat)09:16:23 No.109045503

>>109045424
Thanks, are you using basically the same settings from the kj PR workflow (other than frames etc)?

Anonymous
06/13/26(Sat)09:16:49 No.109045510

Anonymous 06/13/26(Sat)09:16:49 No.109045510

File: ltx2.3_flf2vNoAudo_00005_.mp4 (1.7 MB, 960x544)

1.7 MB MP4

Why is LTX so... shit?

Anonymous
06/13/26(Sat)09:18:06 No.109045523

Anonymous 06/13/26(Sat)09:18:06 No.109045523

>>109045502
Nihon-Viet Cong propaganda.

Anonymous
06/13/26(Sat)09:19:29 No.109045535

Anonymous 06/13/26(Sat)09:19:29 No.109045535

>>109045461
honestly impressive

Anonymous
06/13/26(Sat)09:19:54 No.109045537

Anonymous 06/13/26(Sat)09:19:54 No.109045537

>>109045510
It's good, it's just cloud video models have gone nuts the last few months and LTX looks garbage in comparison

Anonymous
06/13/26(Sat)09:21:16 No.109045549

Anonymous 06/13/26(Sat)09:21:16 No.109045549

>>109045510
because it's local

Anonymous
06/13/26(Sat)09:22:20 No.109045558

Anonymous 06/13/26(Sat)09:22:20 No.109045558

File: 5325677.gif (3.69 MB, 320x222)

3.69 MB GIF

>>109045537
did the copers wake up?

Anonymous
06/13/26(Sat)09:23:04 No.109045568

Anonymous 06/13/26(Sat)09:23:04 No.109045568

File: scail2_2.mp4 (2.01 MB, 592x1056)

2.01 MB MP4

>>109045535
yea scail really is good at what it does.

and wan was certainly already an impressive model anyhow. even with scail, if you have very long hair or chains with jewelry or whatever they may do some physics stuff where the reference had none of it

Anonymous
06/13/26(Sat)09:24:25 No.109045574

Anonymous 06/13/26(Sat)09:24:25 No.109045574

File: dance_girl.webm (832 KB, 960x832)

832 KB WEBM

is there a way to fix this initial discoloration?

Anonymous
06/13/26(Sat)09:25:01 No.109045580

Anonymous 06/13/26(Sat)09:25:01 No.109045580

>>109045510
it made compromises but it's a good improvement over predecessors and some capabilities are quite good.

i DID use wan more myself with the better prompt adherence and more capabilities what you can prompt (spatially, temporally)

Anonymous
06/13/26(Sat)09:26:03 No.109045585

Anonymous 06/13/26(Sat)09:26:03 No.109045585

File: scail2.mp4 (2.88 MB, 592x1056)

2.88 MB MP4

Anonymous
06/13/26(Sat)09:28:52 No.109045595

Anonymous 06/13/26(Sat)09:28:52 No.109045595

>>109045510
it's a talking head model: 1 person talking.
sulphur and eros (i2v version of sulphur), the nsfw models, can do basic nsfw but you'll re-rolling a shit tonne

I was having an argument with someone on here the other day, he insisted ltx was better than wan22. He actually convinced me for a while, I went on a multi-day tear, tweaking settings and trying out workflows, doing hundreds and hundreds of gens. My personal conclusion is that wan22 is just better but it has no sound and the clips are frustratingly short.

Anonymous
06/13/26(Sat)09:29:00 No.109045596

Anonymous 06/13/26(Sat)09:29:00 No.109045596

>>109045503
https://files.catbox.moe/wp75sw.mp4
same KJ workflow, just some minor change

Anonymous
06/13/26(Sat)09:29:52 No.109045601

Anonymous 06/13/26(Sat)09:29:52 No.109045601

>>109045349
>>109045142
>>109045133
>>109045574
>>109045596
give it to me straight
can I run this shit with 6GB of VRAM?

Anonymous
06/13/26(Sat)09:31:46 No.109045609

Anonymous 06/13/26(Sat)09:31:46 No.109045609

>>109045601
yes just limit the highest dimension to 480

Anonymous
06/13/26(Sat)09:32:03 No.109045611

Anonymous 06/13/26(Sat)09:32:03 No.109045611

>>109045568
honestly, as a filty human artist, this feels like the only place worth visiting about ai art stuff

Anonymous
06/13/26(Sat)09:36:16 No.109045636

Anonymous 06/13/26(Sat)09:36:16 No.109045636

>>109045595
kino is a skill that takes time to develop

Anonymous
06/13/26(Sat)09:38:57 No.109045653

Anonymous 06/13/26(Sat)09:38:57 No.109045653

>>109045636
kino issue

Anonymous
06/13/26(Sat)09:40:07 No.109045658

Anonymous 06/13/26(Sat)09:40:07 No.109045658

>>109045596
how do I get the workflow from this?
It only opens a loader
Do I have to update Comfy to the latest version?

Anonymous
06/13/26(Sat)09:40:35 No.109045663

Anonymous 06/13/26(Sat)09:40:35 No.109045663

Why is no one testing outo Bermini? It seems like it's better than LTX and even lets you use image references finally

Anonymous
06/13/26(Sat)09:40:56 No.109045665

Anonymous 06/13/26(Sat)09:40:56 No.109045665

>>109045595
LTX 2.3 is a seriously impressive model for it's size and speed. T2V, I2V, sound, upto 50fps, 30 sec gens.
I'll agree Wan is better aesthetically and for NSFW but I can't go back to no sound 5 second slow mo clips

Anonymous
06/13/26(Sat)09:41:04 No.109045667

Anonymous 06/13/26(Sat)09:41:04 No.109045667

>>109045636
sure, but you see, everyone who talks like this never posts their workflows. The guys I argued with the other day refused to post his workflow. Meanwhile, I don't need to post workflows, the default wan22 is enough to get you better results than ltx.

Anonymous
06/13/26(Sat)09:42:19 No.109045678

Anonymous 06/13/26(Sat)09:42:19 No.109045678

File: Wan21_SCAIL2_00203.mp4 (316 KB, 1500x648)

316 KB MP4

>>109045601
6GB VRAM

Anonymous
06/13/26(Sat)09:43:16 No.109045686

Anonymous 06/13/26(Sat)09:43:16 No.109045686

>>109045663
everyone is playing with scail making blurry as fuck vids. there was some anons playing with it a few days ago

Anonymous
06/13/26(Sat)09:43:58 No.109045691

Anonymous 06/13/26(Sat)09:43:58 No.109045691

>>109045665
that's why I'm looking for a good SVI workflow. Every one I've tried degrades the quality of wan22. I think you can use wan to get a good 20 second clip with no sound, extract frames (like lots of them), and then use them as guides for ltx so it doesn't go batshit with the horrible, deformed anatomy and strange motion.

Anonymous
06/13/26(Sat)09:45:04 No.109045697

Anonymous 06/13/26(Sat)09:45:04 No.109045697

>>109045658
drag and drop doesn't work? file open doesn't work?

Anonymous
06/13/26(Sat)09:45:45 No.109045700

Anonymous 06/13/26(Sat)09:45:45 No.109045700

>>109045697
no
I'm on an old version of Comfy

Anonymous
06/13/26(Sat)09:46:02 No.109045702

Anonymous 06/13/26(Sat)09:46:02 No.109045702

>>109045665
Can LTX do nsfw at all?

Anonymous
06/13/26(Sat)09:47:06 No.109045705

Anonymous 06/13/26(Sat)09:47:06 No.109045705

>>109045702
Out of the box? No. You'll need loras or a finetune

Anonymous
06/13/26(Sat)09:47:14 No.109045708

Anonymous 06/13/26(Sat)09:47:14 No.109045708

>>109045700
oh u need nightly comfyui to even try SCAIL2

Anonymous
06/13/26(Sat)09:47:46 No.109045712

Anonymous 06/13/26(Sat)09:47:46 No.109045712

File: 1758213507114228.jpg (142 KB, 820x627)

142 KB JPG

>>109045708
ok

Anonymous
06/13/26(Sat)09:49:11 No.109045721

Anonymous 06/13/26(Sat)09:49:11 No.109045721

File: 3215247.webm (3.69 MB, 420x291)

3.69 MB WEBM

>>109045667
didn't i give you my prompt and seed?

Anonymous
06/13/26(Sat)09:55:39 No.109045759

Anonymous 06/13/26(Sat)09:55:39 No.109045759

>>109045721
I wasn't arguing with you. I do my best to avoid avatar fags. I'd filter you outright if I had a way.
Are you happy with the way her face completely warps in the first 2 seconds of your video and in other parts as well?
is that why you only post gifs and tiny resolutions, to hide all of that?
Are you happy with your vid looking like a ponyxl gen come to life?

Anonymous
06/13/26(Sat)09:57:05 No.109045769

Anonymous 06/13/26(Sat)09:57:05 No.109045769

File: Wan21_SCAIL2_00056.webm (2.78 MB, 822x960)

2.78 MB WEBM

10 steps seems to give a nice improvement over the default 6

Anonymous
06/13/26(Sat)09:58:24 No.109045775

Anonymous 06/13/26(Sat)09:58:24 No.109045775

File: Wan21_SCAIL2_00054.webm (2.95 MB, 822x960)

2.95 MB WEBM

>>109045769
And here's 6 steps

Anonymous
06/13/26(Sat)09:59:31 No.109045783

Anonymous 06/13/26(Sat)09:59:31 No.109045783

>>109045769
>>109045775
Just noticed the missing sparkle effect on 10 steps but the face and jacket look a lot cleaner to me

Anonymous
06/13/26(Sat)10:00:46 No.109045790

Anonymous 06/13/26(Sat)10:00:46 No.109045790

>>109045759
i gave you everything you needed to make some proper kinos, but you aren't satisfied with it for some reason

Anonymous
06/13/26(Sat)10:01:50 No.109045801

Anonymous 06/13/26(Sat)10:01:50 No.109045801

File: ltx2.3_flf2vNoAudo_00008_.mp4 (1.63 MB, 960x544)

1.63 MB MP4

https://files.catbox.moe/z1ype2.mp4

Something about the way the talking heads move. So off-putting. I've found LTX way better at generic stock footage desu

Anonymous
06/13/26(Sat)10:05:31 No.109045825

Anonymous 06/13/26(Sat)10:05:31 No.109045825

why no more Anima talk? :(

Anonymous
06/13/26(Sat)10:07:31 No.109045842

Anonymous 06/13/26(Sat)10:07:31 No.109045842

Anima desu
anime website desu

Anonymous
06/13/26(Sat)10:08:28 No.109045851

Anonymous 06/13/26(Sat)10:08:28 No.109045851

>>109045825
no new one was announced and 80% of the anima discussion was shills anyways

Anonymous
06/13/26(Sat)10:08:56 No.109045854

Anonymous 06/13/26(Sat)10:08:56 No.109045854

File: 1762613605744456.png (661 KB, 896x1152)

661 KB PNG

>>109045825
>>109045842
>>109045851

Anonymous
06/13/26(Sat)10:09:43 No.109045857

Anonymous 06/13/26(Sat)10:09:43 No.109045857

>>109045825

i got you covered bro in 1934 radio corporation america had this thing called rca maybe commies got it first with phono but pretty not much worth mentioning ever since

Anonymous
06/13/26(Sat)10:10:20 No.109045859

Anonymous 06/13/26(Sat)10:10:20 No.109045859

File: LTX-2_00250-NoAudio.mp4 (1.45 MB, 1440x1056)

1.45 MB MP4

>>109045801
https://files.catbox.moe/cj3jpb.mp4
Post reminded me I completely forgot I trained this

Anonymous
06/13/26(Sat)10:11:13 No.109045866

Anonymous 06/13/26(Sat)10:11:13 No.109045866

>>109045859
lol.

Why do all AI monsters come out like that though? The white can with the green M

Anonymous
06/13/26(Sat)10:12:44 No.109045876

Anonymous 06/13/26(Sat)10:12:44 No.109045876

>>109045825
Not much to talk about really. It has it's place as very decent concept creator, but it's not good enough for creating final image. If I could decide I would cull half of the booru creators and replace them with traditional artists.

Anonymous
06/13/26(Sat)10:17:17 No.109045909

Anonymous 06/13/26(Sat)10:17:17 No.109045909

File: ltx2.3_flf2vNoAudo_00010_.mp4 (1.86 MB, 864x672)

1.86 MB MP4

https://files.catbox.moe/8csp6h.mp4

Audio for /g/ when?

Anonymous
06/13/26(Sat)10:18:27 No.109045916

Anonymous 06/13/26(Sat)10:18:27 No.109045916

>>109045859
>>109045909
wansisters in shambles

Anonymous
06/13/26(Sat)10:19:32 No.109045927

Anonymous 06/13/26(Sat)10:19:32 No.109045927

>>109045909
There was a brief general on /wsg/ when LTX was first released but people got bored.

Anonymous
06/13/26(Sat)10:21:13 No.109045940

Anonymous 06/13/26(Sat)10:21:13 No.109045940

what difference does 64gb of VRAM get me vs 128gb?

Anonymous
06/13/26(Sat)10:25:24 No.109045966

Anonymous 06/13/26(Sat)10:25:24 No.109045966

>>109045940
double the difference

Anonymous
06/13/26(Sat)10:28:45 No.109045993

Anonymous 06/13/26(Sat)10:28:45 No.109045993

File: blogfactory.jpg (805 KB, 2160x1216)

805 KB JPG

ideo4

Anonymous
06/13/26(Sat)10:38:26 No.109046050

Anonymous 06/13/26(Sat)10:38:26 No.109046050

>>109045993
Bounding boxes are the future
Natural language and tags in the trash

Anonymous
06/13/26(Sat)10:46:51 No.109046101

Anonymous 06/13/26(Sat)10:46:51 No.109046101

how to merge lora with anima?

Anonymous
06/13/26(Sat)10:47:59 No.109046108

Anonymous 06/13/26(Sat)10:47:59 No.109046108

>>109046101
Pen and paper.

Anonymous
06/13/26(Sat)10:49:20 No.109046117

Anonymous 06/13/26(Sat)10:49:20 No.109046117

>>109045909
>[Common sense feature that almost every website has] for [board in unmaintained shithole website that nobody in charge gives a fuck about] when?
Never.

Anonymous
06/13/26(Sat)10:49:59 No.109046123

Anonymous 06/13/26(Sat)10:49:59 No.109046123

>>109045993
>>109046050
can I run it with 5 GB of VRAM?

Anonymous
06/13/26(Sat)10:52:22 No.109046143

Anonymous 06/13/26(Sat)10:52:22 No.109046143

the more bboxes I use, the better the image quality becomes, even at turbo settings

i use between 10 and 25 bboxes

Anonymous
06/13/26(Sat)10:52:35 No.109046144

Anonymous 06/13/26(Sat)10:52:35 No.109046144

Can it run on GeForce 6200 AGP (128MB version)?

Anonymous
06/13/26(Sat)10:54:45 No.109046155

Anonymous 06/13/26(Sat)10:54:45 No.109046155

>>109046144
yes saar

Anonymous
06/13/26(Sat)10:56:44 No.109046177

Anonymous 06/13/26(Sat)10:56:44 No.109046177

Do you guys not use llm assisted tools for image gen?

Anonymous
06/13/26(Sat)10:57:52 No.109046188

Anonymous 06/13/26(Sat)10:57:52 No.109046188

>>109046177
Does asking my girl for depictions of herself count?

Anonymous
06/13/26(Sat)11:12:49 No.109046275

Anonymous 06/13/26(Sat)11:12:49 No.109046275

File: debo_ccg_fia_00002_.png (2.4 MB, 1792x977)

2.4 MB PNG

Anonymous
06/13/26(Sat)11:14:36 No.109046286

Anonymous 06/13/26(Sat)11:14:36 No.109046286

File: ComfyUI_19379_overlay.png (3.41 MB, 1500x2000)

3.41 MB PNG

>>109046177
Yup.

Anonymous
06/13/26(Sat)11:15:29 No.109046292

Anonymous 06/13/26(Sat)11:15:29 No.109046292

>>109046286
>that chink at the bottom
kek

Anonymous
06/13/26(Sat)11:17:10 No.109046301

Anonymous 06/13/26(Sat)11:17:10 No.109046301

>>109045927
>but people got bored
why?

Anonymous
06/13/26(Sat)11:19:22 No.109046316

Anonymous 06/13/26(Sat)11:19:22 No.109046316

>>109046301
>but people got bored
Simply weren't enough people in the thread. The only reason to go to the wsg /ldg/ was to hear audio on videos. There was no generality to the general so once the model stopped being the new thing people forgot the thread ever existed.

Anonymous
06/13/26(Sat)11:22:03 No.109046331

Anonymous 06/13/26(Sat)11:22:03 No.109046331

>>109046177
Local language models are fantastic for image captioning and subsequent iteration of similar concepts
I have a gigantic library of images accumulated over the years i can experiment with.

Anonymous
06/13/26(Sat)11:23:19 No.109046339

Anonymous 06/13/26(Sat)11:23:19 No.109046339

>>109046286
>needing llm for 1girl slop
Please kys.

Anonymous
06/13/26(Sat)11:24:15 No.109046342

Anonymous 06/13/26(Sat)11:24:15 No.109046342

>>109046286
catbox?

Anonymous
06/13/26(Sat)11:24:17 No.109046343

Anonymous 06/13/26(Sat)11:24:17 No.109046343

>year almost over
>best local goon model still illustrious
grim

Anonymous
06/13/26(Sat)11:27:35 No.109046361

Anonymous 06/13/26(Sat)11:27:35 No.109046361

File: 32467424.webm (3.87 MB, 420x291)

3.87 MB WEBM

>>109046316
maybe i should generate some more war kinos make a thread on there since those have excellent audio

Anonymous
06/13/26(Sat)11:31:51 No.109046385

Anonymous 06/13/26(Sat)11:31:51 No.109046385

File: 423634644.webm (3.92 MB, 420x291)

3.92 MB WEBM

goodbye

Anonymous
06/13/26(Sat)11:40:09 No.109046431

Anonymous 06/13/26(Sat)11:40:09 No.109046431

File: ComfyUI_01141.jpg (3.66 MB, 1500x2000)

3.66 MB JPG

>>109046292
That's by design.

>>109046339
You know where that really comes in handy? Describing women's clothes. It's an encyclopedia of clothing and I'm not.

Anonymous
06/13/26(Sat)11:41:11 No.109046435

Anonymous 06/13/26(Sat)11:41:11 No.109046435

File: scail2.mp4 (2.52 MB, 1056x592)

2.52 MB MP4

>>109045611
glad to hear it. we're probably standing on the shoulders(?) of giant anime girls here. or how ever that goes.

ai probably combines well with whatever you already do.

oh and if other places are worse it's probably mostly 'cause censored SaaS fucking sucks for art with new censorship every other week or something

Anonymous
06/13/26(Sat)11:42:10 No.109046443

Anonymous 06/13/26(Sat)11:42:10 No.109046443

>>109046431
Maybe use your own eyes? Oh wait, you can't because you are artistically frigid.

Anonymous
06/13/26(Sat)11:43:33 No.109046456

Anonymous 06/13/26(Sat)11:43:33 No.109046456

File: 1775734524989491.mp4 (334 KB, 1024x1024)

334 KB MP4

Anonymous
06/13/26(Sat)11:50:21 No.109046491

Anonymous 06/13/26(Sat)11:50:21 No.109046491

what's the current recommended model for clothes change, or eye color change, or similar minor edits in an image?
I think I used to run one of the qwens for it or something. but last year the results still weren't that great

Anonymous
06/13/26(Sat)11:52:20 No.109046505

Anonymous 06/13/26(Sat)11:52:20 No.109046505

>>109046491
flux 2 klein 9b

Anonymous
06/13/26(Sat)11:53:00 No.109046511

Anonymous 06/13/26(Sat)11:53:00 No.109046511

>>109046505
whats the current recommended model for sex?

Anonymous
06/13/26(Sat)11:53:40 No.109046517

Anonymous 06/13/26(Sat)11:53:40 No.109046517

>>109046491
Klein edits are better at this kind of thing desu. Qwen is too slopped

Anonymous
06/13/26(Sat)11:54:47 No.109046525

Anonymous 06/13/26(Sat)11:54:47 No.109046525

>>109046361
catbox?

Anonymous
06/13/26(Sat)11:58:53 No.109046565

Anonymous 06/13/26(Sat)11:58:53 No.109046565

>>109045993
this is nonsense thobeit

Anonymous
06/13/26(Sat)12:12:03 No.109046695

Anonymous 06/13/26(Sat)12:12:03 No.109046695

>>109046188
I guess
>>109046286
Very nice
>>109046331
You're not thinking far enough
>>109046339
Also not thinking far enough

Anonymous
06/13/26(Sat)12:12:45 No.109046703

Anonymous 06/13/26(Sat)12:12:45 No.109046703

>>109046443
Ah, so there's some arbitrary limit on using AI when using AI, huh? Inderdasting...

Anonymous
06/13/26(Sat)12:17:14 No.109046737

Anonymous 06/13/26(Sat)12:17:14 No.109046737

>>109046343
>best local goon model still illustrious
anima

Anonymous
06/13/26(Sat)12:17:51 No.109046745

Anonymous 06/13/26(Sat)12:17:51 No.109046745

>>109046431
horseface

Anonymous
06/13/26(Sat)12:20:18 No.109046778

Anonymous 06/13/26(Sat)12:20:18 No.109046778

>>109046745
Why are you seething at another anon expanding his skillset?
Vramlet?

Anonymous
06/13/26(Sat)12:22:08 No.109046799

Anonymous 06/13/26(Sat)12:22:08 No.109046799

>>109046435
i mean some are sloppy and i personally dont use genai to create visuals but sometimes just the sheer variety of things people do here due to this vastly lowered execution cost is a good inspiration
and on top of it, generally loose vibe, doing genuinely random shit instead of something you see from so called 'pro-ai communities' feels decent to me
tl;dr i do like lurking here

Anonymous
06/13/26(Sat)12:27:31 No.109046857

Anonymous 06/13/26(Sat)12:27:31 No.109046857

>>109046799
I think the biggest problem with the space right now is software. web apps aren't really accessible for artists

Anonymous
06/13/26(Sat)12:29:16 No.109046871

Anonymous 06/13/26(Sat)12:29:16 No.109046871

>>109046737
anima is still in "meet potential model" phase, illustrious just works

Anonymous
06/13/26(Sat)12:30:41 No.109046887

Anonymous 06/13/26(Sat)12:30:41 No.109046887

Anima thoroughly beats the shit out of any Illustrious finetune

Anonymous
06/13/26(Sat)12:32:12 No.109046900

Anonymous 06/13/26(Sat)12:32:12 No.109046900

>>109046857
i always say this but
because of the medium itself(text) coding got the most natural integration
but with drawing and artistic matter, human-computer interaction workflow is one of the most important part i think and since what they are currently aiming for is the end product, it just doesnt integrate well into the existing artistic workflow
i think photoshop's rotate tool is a decent example of 'making something that is compatible with existing method'

Anonymous
06/13/26(Sat)12:33:07 No.109046906

Anonymous 06/13/26(Sat)12:33:07 No.109046906

File: anima1_00006_.jpg (442 KB, 1152x1648)

442 KB JPG

Anonymous
06/13/26(Sat)12:34:50 No.109046922

Anonymous 06/13/26(Sat)12:34:50 No.109046922

>>109046900
once they capture enough hours of people using computers, not just the final product, we'll see some good integration

Anonymous
06/13/26(Sat)12:46:41 No.109047008

Anonymous 06/13/26(Sat)12:46:41 No.109047008

File: dance_japl.webm (2.85 MB, 480x832)

2.85 MB WEBM

Anonymous
06/13/26(Sat)12:48:23 No.109047018

Anonymous 06/13/26(Sat)12:48:23 No.109047018

File: Screenshot_20260613_123655.png (1.96 MB, 2560x1354)

1.96 MB PNG

My rebirth is imminent

Anonymous
06/13/26(Sat)12:48:46 No.109047025

Anonymous 06/13/26(Sat)12:48:46 No.109047025

If people are switching to Ideogram, it's because Anima flopped, right?

Anonymous
06/13/26(Sat)12:50:12 No.109047038

Anonymous 06/13/26(Sat)12:50:12 No.109047038

>>109047018
>>109046887
yes this wonderful gen is better than any of the detailed finetunes lmao

Anonymous
06/13/26(Sat)12:52:08 No.109047051

Anonymous 06/13/26(Sat)12:52:08 No.109047051

>>109047038
Still being retarded and miserable?
You look at a purely LLM guided output and that's the only thing you see?

Anonymous
06/13/26(Sat)12:53:54 No.109047065

Anonymous 06/13/26(Sat)12:53:54 No.109047065

File: anima1_00018_.jpg (520 KB, 1152x1648)

520 KB JPG

Anonymous
06/13/26(Sat)13:04:20 No.109047148

Anonymous 06/13/26(Sat)13:04:20 No.109047148

File: anima1_00022_.jpg (441 KB, 1152x1648)

441 KB JPG

Anonymous
06/13/26(Sat)13:08:39 No.109047186

Anonymous 06/13/26(Sat)13:08:39 No.109047186

>>109047018
Basado

Anonymous
06/13/26(Sat)13:08:51 No.109047189

Anonymous 06/13/26(Sat)13:08:51 No.109047189

File: debo_ccg_fia_00004_.png (2.36 MB, 1792x977)

2.36 MB PNG

Anonymous
06/13/26(Sat)13:12:04 No.109047213

Anonymous 06/13/26(Sat)13:12:04 No.109047213

File: dance_boob.webm (2.85 MB, 480x832)

2.85 MB WEBM

Anonymous
06/13/26(Sat)13:21:26 No.109047277

Anonymous 06/13/26(Sat)13:21:26 No.109047277

>>109047189
I always use you as a litmus test for how much models have improved due to how you have failed at basic genning for I think going on 4 years now?
Really warms my heart image gen has gone a long way

Anonymous
06/13/26(Sat)13:22:38 No.109047288

Anonymous 06/13/26(Sat)13:22:38 No.109047288

>>109046517
>>109046505
thanks boys I'll try that on soon, appreciate it

Anonymous
06/13/26(Sat)13:27:00 No.109047317

Anonymous 06/13/26(Sat)13:27:00 No.109047317

>>109047313
>>109047313
>>109047313

Anonymous
06/13/26(Sat)13:40:48 No.109047436

Anonymous 06/13/26(Sat)13:40:48 No.109047436

>>109047018
>All that excessive stuttering
Anon, tell you ai-waifu to chill the fuck out. Not even animu girls talk this retarded. Moderation is key to believably.

[Return] [Catalog] [Top]

Post a Reply

Return Catalog Top Refresh

[Advertise on 4chan]

Delete Post: [File Only] Style:

[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.