/g/ - Technology






You're Not Alone Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106995676

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://civitai.com/models/1790792?modelVersionId=2298660
https://gumgum10.github.io/gumgum.github.io/
https://huggingface.co/neta-art/Neta-Lumina

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: 00039-2972526174.png (1.93 MB, 1248x1824)
>>
File: 1735142585245396.mp4 (3.76 MB, 564x576)
PARTY!!!
>>
>>107001464
hot
>>
>>107001552
*heavy trap bass beat starts playing*
>>
File: 00047-612365413.png (2.02 MB, 1248x1824)
>>107001555
chisato is literally 10/10 wife material and mogs the rest of the girls.
>>
Local LLMs for generating creative, non-repeating prompts? Is it doable at all? /lmg/ ignores me.
>>
File: 00056-3989700339.png (2.42 MB, 1536x1536)
>>
posting this last trick or treat lain, she came out nicely, now back to anime backlog watching
>>
>>107001586
i'm not sure what you mean by "not repeating prompts" but you can probably set up ollama and find some way to call it
>>
>>107001586
it is entirely doable.
I don't like doing that because I actually want full control over what I gen, and I'm not bored enough to want a 'surprise me' button for random gens.
I do currently use one to automatically sort my prompt tags, but it's really not needed at all, I just use it to tidy up

You have two choices really:
- llama-cpp-python (will start/close/be used within your comfyui workflow)
- an external OpenAI-compatible LLM server (llama.cpp, ollama, lmstudio, etc...)

Now the real problem is that LLMs are WAY MORE EXPENSIVE to run compared to diffusion models. Models in the ~7b dense range require 8GB vram to run at non-retarded quants (q8). With LLMs you could theoretically go down to Q4, some imprecision shouldn't matter much here. If using MoE models, you can get away with running bigger models by offloading the expert layers to CPU while keeping the cache and the rest of the layers on GPU. Since these are one-off gens, you can keep the context to a minimum (1000 tokens should be more than enough).

if you want something really MINIMAL (500M params):
z-tipo (what I currently use to sort); it requires you to manually install the cuda version of llama-cpp-python and it lives within comfy.
there are multiple nodes in comfy, just search for llama-cpp-python, ollama or OpenAI/OAI-compatible
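
For the external route, the call is just the standard OpenAI chat-completions shape. A minimal sketch, assuming a llama-server/ollama/lmstudio instance is already listening on localhost:8080 (the system prompt and tag list are made up):

import json
import urllib.request

def augment_prompt(tags: str, url: str = "http://localhost:8080/v1/chat/completions") -> str:
    # ask the local model to rewrite a tag list into a varied prompt
    payload = {
        "model": "local",  # most local servers ignore or loosely match this field
        "messages": [
            {"role": "system", "content": "You expand danbooru tag lists into varied image prompts. Output tags only."},
            {"role": "user", "content": tags},
        ],
        "temperature": 1.0,  # higher = more variety, lower = more obedient
        "max_tokens": 200,   # one-off gens, keep the context tiny
    }
    req = urllib.request.Request(url, data=json.dumps(payload).encode(),
                                 headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["choices"][0]["message"]["content"]

print(augment_prompt("1girl, witch hat, halloween, night, suburb"))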
>>
>>107001586
You can cover 90% of prompts with smart use of wildcards.
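
All a wildcard engine really boils down to is the following. A toy sketch, assuming plain text files with one option per line (file names and the example prompt are made up):

import random
import re
from pathlib import Path

def expand(prompt: str, wildcard_dir: str = "wildcards", seed: int | None = None) -> str:
    # replace every __name__ token with a random line from wildcards/name.txt
    rng = random.Random(seed)
    def pick(match: re.Match) -> str:
        options = Path(wildcard_dir, match.group(1) + ".txt").read_text().splitlines()
        return rng.choice([o for o in options if o.strip()])
    return re.sub(r"__(\w+)__", pick, prompt)

# e.g. wildcards/outfit.txt holds "top with v cut", "sundress", "hoodie", ...
print(expand("1girl, __outfit__, __artist__, outdoors"))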
>>
File: file.png (359 KB, 1962x1012)
>>107001699
yeah or just use impact wildcards, this is my current setup
you can see how the normal prompt comes out and the augmented prompt.
SADLY tipo creates a trash augmented prompt. I just randomize artists really
>>
File: 00064-2779316287.png (2.36 MB, 1824x1248)
>>
>>107001652
It means that it does not output "top with v cut" every time when I ask for a girl's outfit.

>>107001687
I have a setup, the question is more about models themselves and techniques to get a good prompt. I have tried abliterated 4b and 8b of qwen and they did not follow my prompts enough or were very uncreative.

>>107001699
Yes, but you have to get wildcards first. It takes too much effort for me.
>>
>>107001734
small models are garbage sadly.
You could try nemo instruct, or a recent gemma abliterated.
If you're asking about prompting techniques, then you'll have to play around with samplers: the more randomness you want, the higher the temperature. There are some samplers that help keep the model coherent at high temp (but I forgot the name, I usually use llms for work at low temp), I'd suggest you ask chatgpt or lmg for this.
For prompting itself, it usually works better if you give the chatbot a list to choose from (but at that point it would be the same as using wildcard substitution), and the prompting technique GREATLY varies between models, so there's no general way to do it
>>
Oops didn't see the new thread
>>107000663
>just try it out yourself
I have and it's pretty shit sadly. 1girl already works just fine on sdxl. Boomer prompts start giving shit anatomy and body horrors pretty quickly. Prompting for text doesn't work beyond 1-2 words it seems. I think every time someone shills a model they should be required to present a complex gen that cannot be done with illust at a fraction of time and VRAM with metadata attached. I'm gonna assume from now on that "uuuh skill issue just gen yourself" people are all LLMs or paid indians.
>>
>>107001716
That's slick. Mine isn't as elaborate, I might have to change my setup.
>>
>>107001586
Find cool image you like, i2prompt it and then use that
https://github.com/1038lab/ComfyUI-QwenVL
>>
File: 1745319983509499.jpg (891 KB, 1336x2008)
>>107001759
show me what this prompt looks like with base ilu at this res on the first pass
https://files.catbox.moe/atrr5z.png
>at a fraction of time and VRAM
when was this claim made?
>>
File: 1753393023324645.jpg (851 KB, 1336x2008)
>>
File: 00065-1589373952.png (2.26 MB, 1248x1824)
>>
Are there any AI that can edit video game and anime characters into being naked? Gemini doesn't allow that
>>
>>107001586
>/lmg/ ignores me.
There must be a reason
>>
>>107001804
>abstract, ghost, fireplace
Wow you're really going out of your way to demonstrate complex composition, character interactivity and anatomy with your gen
I retract my previous statement, shills seem to be just retarded
>>
File: 00078-225239598.png (1.79 MB, 1792x1024)
https://youtu.be/Dh56pv7gESM
>>
File: 1748446280861521.jpg (735 KB, 1336x2008)
>>
>>107002059
Where's the 1girl?
>>
File: input.jpg (181 KB, 1349x2185)
>>107001841
Qwen Image Edit 2509 with clothing remover lora
lora: https://limewire.com/d/AvpLO#Gd7AyXiz1r
result (nsfw): https://files.catbox.moe/na96bw.png
>>
>>107001841
It does, you just have to write it in the most verbose way that makes it think you're doing something artsy
>>
>>107001842
They have miku in op, I'm not surprised.
>>
>>107002093
i want to cum inside 2b
>>
>>107001819
bowsette a shit
>>
>>
>>107002180
sylvanas a shit
>>
File: chfp8_a_00008_.png (1.38 MB, 1024x1024)
tell me about neta lumina. I see it being pushed hard now. it's only a 2b parameter model, isn't it? why use it over wan or qwen?
>>
>>107002199
it knows artists, unlike wan or qwen, which only know a very limited set of styles
>>
File: 251025-165641-wan5s_00001.mp4 (2.57 MB, 1088x1600)
>>107001819
>>
>>107002226
ahegao lora a shit
>>
>>107002093
now try on realistic
>>
>>107002211
so does sd1.5, but because it's so small it can't compete now. is neta yume lumina's quality still good compared to the big ones?
>>
CeFurkan is back shilling
>>
Ran took everything from me.
>>
>>107002255
I've been posting some in this thread and the last. Compared to sdxl (illu/noob) it doesn't need upscalers or detailers. Granted, gens take way longer, but I'm generating at the resolution you see, and I don't mind waiting since it manages to oneshot most of it. You can additionally use NL, which helps a lot in posing the girl in the composition you want.
>>
anons what are your guys gen times on qwen image with and without 4/8 step lora?
>>
File: 1758656043186527.png (1.36 MB, 832x1248)
>>107002082
hiding in the cabin
>>107002199
qwen is large and wan is a middling image model
>>107002255
it uses a 16ch vae if thats what you mean
>>
>>107002293
and last one. wish one of these stupid anime thots would come trick or treating me IRL. SAD.
>>
>>107002300
>qwen is large
as in qwen is not preferred because it's too big?
>>
>>107002324
maybe he meant to say he's poor. but qwen's problem is not its size (it can still fit in 16gb with some offloading at Q8, or completely at 24gb). The results are almost always GOOD, meaning you don't need to re-roll your gens as much, but even fully fitting in a GPU, genning is slower (due to genning at a high 1.3MP size), and it's slopped and has bad style knowledge/no artists
>>
>>107002199
>wan or qwen
Both need LoRAs to do anything even resembling kino.
>>
how do I speed up wan 2.2 i2v
>>
File: n9t2asb11ywf1.png (174 KB, 640x640)
absolute legend?

https://files.catbox.moe/2dyn9a.mp4
>>
>>107002295
A lot or not much.
>>
>>107002452
bruh that isn't helpful. some numbers would help
>>
>>107002448
lost
>>
>>107002456
How many cuda cores you have?
>>
>>107002472
10,752
>>
File: 1741658881115885.jpg (738 KB, 1336x2008)
>>
>>107002434
Use 2.5 instead
>>
>>107002474
Should be pretty quick then.
Think of it as a ballpark: if an action takes under 10 minutes it's still usable.
In the past (and still today) renders can take 8 hours per frame.
With AI slop that gets condensed.
>>
>>107002487
I don't think anyone waits 10 minutes for an image
>>
>>107002496
I don't think you have ever been employed or done graphics for a client.
>>
>>107002496
Maybe English is a problem for you. Is it?
>>
>>107002503
you're talking to a motion designer. literally no one waits 10 mins for imagen. you're joking
>>
>>107002509
English is a problem.
>>
>>107002508
yes I'm German, English isn't my first language.
>>
>>107002448
based turk working hard
>>
>>107002448
Not bad anon...not bad...but BEHOLD! MY GOONJITSU!
https://files.catbox.moe/vqq4u0.mp4
>>
>>107002509
If you are such a professional you should already know...
>>
How to make Chroma good?
>>
>>107002541
it's funnier seeing him suck dicks desu, this is a bit too much and well, it's literally a woman with his head.
>>
>>107002549
delete chroma, download qwen + the analogcore lora and some insta thot loras for 1girls and you're done, way better realism than whatever chroma shits out
>>
>>107002554
>this is what qwenfags believe
Advise him of that again when your model actually becomes non-shit.
>>
File: 1739217727555737.png (512 KB, 875x355)
the cartoon character in the red shirt is very fat and holds up a sign at the beach saying "tomorrow i'll gen 1girls", while Sonic the Hedgehog looks at him
>>
>>107002579
Why not take a full pic of robotnik?
>>
>>107002612
just to test if it still works, seems fine even with a cropped image.
>>
>>107002627
Yeah it did a good job that's true.
>>
File: 1734447632356783.png (814 KB, 792x1320)
the pink hair anime girl is sitting at a table in a walmart staff room, smoking a cigarette while sitting at a white table. the walmart logo is on the wall.
>>
how do i make a comfyui tagger workflow with multiple images to txt for training a lora?
>>
File: 1740344971354841.png (796 KB, 792x1320)
>>107002643
>>
> Some nodes require a newer version of ComfyUI (current: 0.3.66). Please update to use all nodes.
> Requires ComfyUI 0.3.63:
> c46c74c1-cfc4-41eb-81a8-9c6701737ef6
qwen edit, wtf
>>
>>107002656
Cum ui has gone from being a nice little javascript/python shit for images to literal malware.
A year ago it was still okay.
>>
File: WAN_00009_.png (1.9 MB, 1080x1352)
wan >>>>>>>>>>>>>>>>>> qwen
>>
File: 1731419830992770.png (652 KB, 944x1104)
the videogame girl is sitting at a computer and typing in a cave near a fire, on the back of the white CRT monitor is the text "LDG". keep her in the same polygon style.

why does a cave have power? it's a videogame cave.
>>
File: 1742610876525001.png (657 KB, 944x1104)
>>107002697
>>
>>107002697
solar powered PC
>>
>>107002093
Prompt nodes don't have image inputs links, correct?
>>
The only way to train a wan 2.2 lora is with cloud, isn't it? Aren't you locked out of your computer for like a week with a 5090?
>>
>>107002758
>locked out of your computer for like a week with a 5090

>xhe spent multiple thousand $ on a single pc component but doesn't have an old gpu or money to buy a 70$ 1070
>>
>>107002758
There's a guy who trained a wan 2.2 lora with ~250p clips on high and ~400p clips plus ~700p images on low, 3-second clips, with pretty good results. Shouldn't take that long.
>>
ran is not satisfied with his discord
users need blogposting
i will post images and make ran seethe
>>
>Tsukuyomi
>>
>>107002780
I'm sure the results are fine, but it takes so long to train, doesn't it?
It's my work pc.
>>
>wan2.2_i2v_A14b_high_noise_lora_rank64_lightx2v_4step_1022
got really fast movement with 3.0 strength
>>
>>107002830
link for lora
>>
>>107002842
bruh literally just type wan2.2_i2v_A14b_high_noise_lora_rank64_lightx2v_4step_1022 in google
>>
>>107002830
>>107002842

Man what the fuck are you retards doing with that insane low quality flashing with your light lora setups

New HIGH:
https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22_Lightx2v/Wan_2_2_I2V_A14B_HIGH_lightx2v_MoE_distill_lora_rank_64_bf16.safetensors

Old LOW:
https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22-Lightning/old/Wan2.2-Lightning_I2V-A14B-4steps-lora_LOW_fp16.safetensors

4 steps, cfg 1, unipc
>>
>>107002853
what about t2v?
>>
File: 1743770310436836.png (951 KB, 960x1088)
the white character is wearing a white tshirt and blue shorts sitting at a computer desk holding a green lightsaber in his messy bedroom. A large STAR WARS sign is in the background and various STAR WARS merchandise. the image is black and white. keep their facial expression the same.
>>
File: lightx2v_1022_1.25str.mp4 (745 KB, 448x600)
>>
File: lightx2v_1022_2.0str.mp4 (849 KB, 448x600)
>>
File: lightx2v_MoE_1.25str.mp4 (736 KB, 448x600)
>>
File: lightx2v_MoE_2.0str.mp4 (814 KB, 448x600)
ultimately it's just RNG
>>
one more please?
>>
File: 1753483774945303.png (1.3 MB, 1024x1024)
the man is sitting at a poker table in a casino, keep his expression the same.

was just a cropped headshot of kaiji. zawa zawa...
>>
File: 1747537332271143.jpg (53 KB, 500x479)
>>107002245
https://litter.catbox.moe/k2q4xggehhc770ms.png
>>
>>107002968
>>107002958
>>107002951
>>107002949
These are great! Would be cool to see more.
>>
reasons not to use lightx2v?
>>
It's out.

https://civitai.com/models/1901521/v7-base
>>
>>107003075
It's noticeably shit when comparing to not using it
>>
>>107003101
you had my hopes up with that image.
>>
>>107003101
kek, dalle-mini is so sovlful even after all those years
>>
File: 1732994965175552.png (464 KB, 609x679)
>ctrl-f Pony
>0 results
What do we think? I couldn't care less about weeb slop, or steven universe and furry faggotry. How's the realism? I doubt it can ever stand up to Chroma
>>
File: WAN_00046_.png (2.35 MB, 1080x1352)
>>
>>107003146
just two more finetunes
>>
File: 1759517492477396.png (1.2 MB, 1128x920)
>>107002295
qwen image edit, 8 steps, 1 megapixel images, rtx 3090
first gen: 156 secs
second gen, same image and prompt: 49 secs
change image: 91 secs
change prompt: 62 secs
disable 8 step lora, 20 steps: 95 secs
>>
>>107002643
now this is autism
>>
can you train qwen loras with 16gb? does it take 12 hours per lora? do the loras come out even remotely well with what i assume is 40 block swaps?
>>
File: chroma multi concept lora.jpg (1.67 MB, 3936x1264)
Looks like multiple concepts in Chroma loras work after all. You just need to crank up the early-training lr without frying the whole thing. I think Chroma/flux are super sensitive to gradients that pull in opposite directions, so you need to let each concept reserve its own space during early training. Otherwise it will just overwrite everything and you'll get a generalized mess with combined concepts.
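Not my actual config, but "high lr early, then back off" can be expressed as a custom schedule along these lines (all numbers purely illustrative):

import math

def lr_at(step: int, total: int, base_lr: float = 1e-4,
          boost: float = 4.0, boost_frac: float = 0.1) -> float:
    # boosted early phase so each concept can claim its own space
    boost_steps = int(total * boost_frac)
    if step < boost_steps:
        return base_lr * boost
    # then cosine decay from base_lr over the remaining steps
    t = (step - boost_steps) / max(1, total - boost_steps)
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * t))

for s in (0, 100, 500, 999):
    print(s, lr_at(s, total=1000))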
>>
>>107003203
>can you train qwen loras with 16gb?
yes

>does it take 12 hours per lora?
depends on how you train it

>do the loras come out even remotely well with what i assume is 40 block swaps?
I don't know if lora quality has anything to do with block swap
>>
>>107003044
could've taken an even lower quality picture
>>
would loras for flux work for chroma or would i need to retrain them?
>>
>>107003333
Some of them work.
>>
When will /ldg/ wake up and see Qwen is bad?
>>
>>107003418
when it's proven...?
>>
> wan t2v
> 3d animation of...
works

> 3d blender animation of ...
50/50

> ... 3d animation
does not work

why the fuck
>>
>>107002805
https://huggingface.co/quarterturn/wan2.2-14b-i2v-city-the-animation
5 1/2 days on a 4090D 48GB for 101 15-second 640x360 clips, which was the biggest I could use without OOMing.
>>
>>107003427
blender isn't and "of" is the trigger
>>
>>107003418
things that qwen does great: text editing, prompt adherence, structural correctness

things that qwen does bad: image editing, speed, style variation, realism
>>
Is Wan really the only good video model we got?
>>
>>107003473
yes
>>
File: 1736785332101770.jpg (252 KB, 1500x1302)
https://huggingface.co/purplesmartai/pony-v7-base/tree/main
THIS IS IT, LOCAL IS SAVED
>>
>>107003485
I hate to say it but unironically it's the best model we've gotten for realism and for combining out-of-left-field concepts.
>>
>>107003496
>it's the best model we've gotten for realism
prove it, show some examples kek
>>
>>107003473
hunyuan is good, but not as good as wan
if you were here at the time people went nuts for it, it raised the bar so much plus did NSFW
>>
>>107003501
>if you were here at the time people went nuts for it, it raised the bar
true, the bar was raised higher between Mochi and HunyuanVideo compared to HunyuanVideo and Wan 2.1
>>
>>107001750
Thanks, will try.
>>
>>107003461
>image editing
literally the best
>speed
yeah
>style variation, realism
loras
>>
>>107003521
if you need loras to make your model good then your model was never good to start with.
>>
Does anyone here know about super-resolution models?

I want to train a model with my own dataset, because my dataset shares the same colours, patterns and style, but it has low resolution images, so I want to upscale them as faithfully as possible.

Please somebody help me
>>
>>107003532
I'm completely fine with that
>>
>>107003521
>tells it to edit out a part in the photo
>whole image changes
yeah truly the best
>>
File: geeeeg.png (72 KB, 1369x473)
>>107003485
>
>>
>>107003532
>if you need loras to make your model good then your model was never good to start.
this
>>
File: Pony V7 base.jpg (305 KB, 1024x1536)
>>107003496
>I hate to say it but unironically it's the best model we've gotten for realism and combining out of left field concepts into
totally
>>
>>107003586
local is saved!
>>
>>107003586
put this in the next OP
>>
On the frame interpolator what does 'clear cache after n frames' do? How high or low do I want to try to set this?
>>
These V7 pics look like we went back 2 years in time.
>>
>>107003586
I want to fuck a grass girl so bad bros...
>>
>>107003622
pony v6 is miles better, and he did this shit 2 years ago
>>
>>107003586
I haven't tried it yet but this seems to occur for very short prompts because the model was trained with long and detailed ones
>>
File: 1737630597983915.png (43 KB, 753x375)
i dont think i can ever go back to prompting like this bros
>>
>>107003253
Cool. I don't get it
>>
File: WanVid_00008.webm (939 KB, 720x960)
dude on the left is striking out, feel bad for him
>>
>>107003851
dude on the right wishes he was home watching youtube
>>
File: image_00016_.jpg (398 KB, 984x1264)
>>107003851
great lora
>>
>>107003496
no its synth slopped, but somehow in a far more retarded way than flux/qwen
>>
>if you have an unhooked image loader with a different image from the hooked-up one, the unhooked loader still counts and fucks up the gen

Thanks, open source.
>>
how come v7 was open sourced?
thoughts on krea video?
thoughts on new ltx lora?
>>
>>107003970
light x2v lora*
>>
>>107003787
oh yeah I'll do style_cluster_1610, my favourite!
>>
hey there faggots, sick of transparent promotion campaigns for shit-tier models? that's because you're browsing 4chan instead of trying THIS shit-tier model!
you can tell it's bad because I made it and even I won't post any gens from it, but I'll be unironically fucked if I can't manufacture some hype and at least get some downloads!
>>
File: 1656210369404.jpg (10 KB, 250x250)
pony 7 is even worst than sdxl 3
>>
>>107004033
>worst than
Please enable flags on all boards, chink-moot.
>>
>>107004024
im actually downloading because at the end of the day, it doesnt hurt to try really
>>
File: 1636758330040.jpg (6 KB, 200x202)
>>107004079
>spitting on chinks, when you use their tools every day
lmaoooooooooooooooooooooooooooo
>>
>>107003970
>>107003984
>krea video
no GGUFs
>light x2v lora
suffers from ghosting and lip flapping
>>
>>107004137
sdxl isn't chink, illustrious isn't chink but you might be
>>
>>107004148
redeem
https://huggingface.co/6chan/krea-realtime-video-fp8/tree/main
>>
File: 1747808295679103.png (146 KB, 500x338)
>masturbate to horses
>pour tens of thousands of dollars into horse porn generator
>the horse porn is subpar
>>
>>107003851
Is that the same Sabrina lora from weeks ago?
>>
File: 251025-222606-wan5s_00001.mp4 (3.59 MB, 1168x1488)
>>107003914
what model?
>>
>>107003044
I wish qwen edit didn't have that sameboob syndrome where it always estimates the same shape, size and look.
Same for bodies, genitals and so on.
>>
>>107004033
The worst thing is that v7 could have saved SD3.5 the same way v6 saved SDXL, if he had trained on Medium. Small size, faster training, but all the benefits of 3.5M, namely the 16ch VAE, T5XXL, and native 1.4MP out of the box. We could have had v7 as early as the first quarter of 2025, and today we'd be swimming in loras and merges.
>>
>>107003787
>style_cluster
I don't even blame the model for being shit, I blame the dev for thinking this was somehow necessary
what a disgrace
>>
>>107003333
they're convertible at least
https://github.com/EnragedAntelope/Flux-ChromaLoraConversion
>>
>>107004179
>fp8
>goof
pick one
>>
>>107004179
Workflow?
>>
File: ComfyUI_00002.webm (3.69 MB, 960x960)
>>
>>107004353
kek
>>
>>107004353
I cammed
>>
>>107004353
More
>>
>>107004214
chroma 2k
>>
thought picrel was AI at first and /ldg/ had breached containment into social media apps
>>
>>107004296
>I don't even blame the model for being shit, I blame the dev for thinking this was somehow necessary
no one asked, so I can only conclude he thinks hiding artist names is somehow "safer"
which is sad and retarded
>>
What's the next big hope after the huge successes of Chroma, PonyV7 and Neta Lumina?
>>
>>107004489
chroma and leto are good doe
>>
>>107003586
At this point why not just partner up and go all in on Chroma? They "sponsored" Chroma, but a full blown partnership would be better. Pony v7.1 is Chroma, then a tune of that is Pony v7.5
>>
>>107004353
My gens with this image aren't as creative or as safe for work
>>
>>107003253
This is big. Never seen anyone combine concepts with a LoRA (which is one of the main perks of API models, but that means local just caught up). Could you write a rentry with your findings?
>>
>>107004196
A new I2V dropped yesterday on Civitai
>>
>>107003473
This just came out https://meituan-longcat.github.io/LongCat-Video/
Very promising; too bad comfyui is just a shill-and-scammer framework now. If it does nsfw it could probably kill wan 2.2
Gotta wait till someone implements it somewhere.
>>
>>107004603
im going to cum inside your ass
>>
>>107004426
omg its migu
>>
File: naiandnow.jpg (3.02 MB, 3106x2177)
went back and tried 1.5 again to compare vs illustrious based.
1.5 mixes still do decently actually
>>
>>107002549
Know how to use plain English to describe what you want.
>>
>>107004603
So to implement this in comfyui we would need a non-jew autist to create nodes for it?
>>
>>107004629
"girl with only 2 arms and 2 legs"
>>
>>107004707
why not implement it in neoforge or sdcpp instead? why does it always need to be cumfart?
>>
File: ugfzopvmbov21.gif (2.97 MB, 480x360)
>>107004714
>>
>>107004738
I mean I personally don't care. As long as it's usable for VRAMlets like myself.
>>
>>107004748
then aggressively fud comfyui so devs make other options vramlet friendly
>>
>>107003253
>>107004585
I second this anon. There's a huge lack of chroma training resources. I've been mostly flying by the seat of my pants trying to experiment with training.
>>
>>107004714
Git gud at prompting.
>>
>>107004768
I'd use sdcpp but it doesn't do ram offloading in proper fashion. This is very annoying.
Somewhat strange that llama.cpp is apparently its main influence.
>>
>>107004804
ask devs to contribute. hell, ask the nunchaku devs to make a sdcpp implementation. nobody does shit unless they know it's what people want
>>
are there any realistic models that can do really realistic star trek-style aliens or orcs and goblins? i find photoreal shits itself when trying to do anything fantasy. the most fantastical thing i can do is a giant human
>>
>new wan ditto model works nicely for style transfer
>Still super inconsistent across individual videos
Wish there were a way to sort of lock in character consistency. But maybe I can just turn down the model noise.
>>
Is there a node in ComfyUI that can duplicate another node with the exact same settings? For example, I’d like to have two KSamplers, and whenever I change the settings on the main one, the other automatically updates to match.
>>
>>107004842
make all the widgets inputs and run the variable-node spaghetti to both.
>>
>>107004707
Basically, we are slaves to the whims of Kijai, who is now on comfy's payroll as well.
Comfy themselves haven't implemented shit for several months now.
It's a shame because the model seems amazing at motion, prompt following and actual action/prompt sequences. Better than wan 2.2, going by the examples.
Not to mention it has both long generations and even multi minute generations.
Just look at this
https://meituan-longcat.github.io/LongCat-Video/assets/videos/interaction/2-1.mp4
https://meituan-longcat.github.io/LongCat-Video/assets/videos/interaction/2-2.mp4
And a fucking 2 min video with amazing coherence https://meituan-longcat.github.io/LongCat-Video/assets/videos/long/3-4.mp4
>>
>>107004842
You can use a get/set node and just change the value from that node instead of on the ksampler
>>
>>107004897
But what does Kijai actually do?
Looking at my nodes I don't think I use any of his, but maybe I use branches?
GGUF and MultiGPU are the main ones I use
>>
>>107004714
tsar truthnuke
>>
>tfw you made an excellent image and the wan is extremely coherent with the pose

Feelsgoodman.
>>
>>107004818
fuck off trani kys
>>
just how do I tell wan to not move the camera?
>>
>>107004989
fixed camera
>>
>>107004999
doesn't work
>>
>>107004989
luck of the draw. Try a different prompt, add stuff that goes out of frame to the description so it still shows. I don't think it's wan's fault, it's the light lora's fault, in my case at least
>>
>>107005005
add camera movement to the negatives
>>
>>107004585
>>107004776
I can write down stuff later for sure. Hll anon used LION to create a huge multiple-concept lora and only trained the Text Encoder too, so I think there's lots of undocumented stuff that works really well.
>>
>>107005031
doesn't work
>>
>>107004818
No one cares about your wrapper trani
>>
>>107004989
>>107005044
At this point you deserve to never get what you want.
You fucking retard can't even bother to learn the very basics of prompting with wan.
I know what it is but I hope nobody else spoon feeds your jeet ass.
>>
>>107005046
I think he's drunk again >>107004925
>>
>>107004618
Now make them do literally anything other than standing.
>>
>>107004989
I have trouble telling wan to do anything with the camera at all other than zoom or close up
>>
>>107005069
yeah, make them crouch, point at viewer and laugh
>>
>>107005067
>t. no gen
>>
>>107005067
either A
>someone told you and you're just a faggot that gatekeeps
or B
>you're full of shit

There's no third option
>>
>>107004975
he wraps the main implementations that live in diffusers, but comfy has a vendetta about making diffusers as abrasive to use as possible so people use his slower implementations
>>
>>107005122
Ok I believe you but I'm gonna be honest I don't know half of those words
I just wanna take clothes off women in funny ways.
Is that too much to ask?
>>
>>107003945
what
>>
>>107004707
It came out 10 hours ago, come on dude, this is ridiculous.
>>
if I gen a 10s (161 frames) video on wan, is there a way to prompt it to do one thing then another without the second taking over immediately?
"she types on a computer for 3 seconds, then she gets up and walks away"
>>
Also, pov looks fucking great https://nitter.net/Meituan_LongCat/status/1982083998852763838
>>
File: 1759106424259375.jpg (479 KB, 1536x1536)
Behold the power of Pony 7...
>>
>>107004897
This Is So Funny To Say About Open Source hahaha, Nigga It's On Github ROFL It's Literally Got A Readme With Instructions hahaha This Nigga Cannot Read And Is On /g/ lmao
>>
you forgot to capitalize g and lmao
>>
>>107004986
Ran is annoyed again. Many such cases.
>>
>>107005147
Use a first frame last frame workflow.
Use different images in the two image loader nodes. Only use one frame with the unused one unhooked.
>>
>>107005239
sovl... sd1.6...
>>
>>107005293
Nigger I'm not gonna run their inference code myself, I'm too lazy for that, what would be the use case of ComfyJew-I if everyone just did that?
>>
>>107005339
Boo Hoo Nigga, I Wanna Gen But I Don't Know How hahaha Nigga Boo Hoo
>>
dunno why schizo is so anti anistudio. I've been asking for an exe since 2022 and finally someone is working on it. fuck python
>>
ani hours are the best
>>
i believe in ani
>>
>>107005355
You Niggas Need A Pampers hahaha
>>
>>107005369
>>107005384
Brap
>>
>julien
>>
File: 1754997267162139.mp4 (1.22 MB, 720x896)
>>107002688
>>
>>107005421
FUCK OFF RANFAGGOT
>>
>>107004603
I see it's a dense model. Realistically, how long would it take to gen 2 min videos on a 3090?
>>
>>107005408
Can I have one too?
>>
>>107004603
uncanny af
>>
>>107005427
they are sisters what are you doing
>>
No surprise comfy claimed that trani "has a ton of issues"
He's spiraling
>>
File: svi.jpg (43 KB, 1437x204)
Wansisters, we're about to eat good once more

https://github.com/vita-epfl/Stable-Video-Infinity/commit/34e4c505a0d77d59a738a08c161fe7d11dff8fc5
>>
>Ran took my shota collection.
>>
Who the f is Ran
Who the f is trani
>>
>>107005610
t ran i
they're the same person as far as i know
>>
>>107004842
double click on the input and connect it to both ksamplers
messy but it's cumfart ui, get used to it
>>
>>107005122
> diffusers
> his slower implementations
>>
Bros.. I just gooned to a 480p test gen because it was so good..
>>
File: 1759399874666.jpg (1.52 MB, 2024x2424)
>>
>>107005067
> bother to learn the very basics of prompting with wan
there is no good guide
>>
>>107005653
post it
>>
>>107005680
No, I will post the finished part later.
Now I will go sleep like a baby.
>>
>>107005702
i will rape you like a niggerbaby
>>
>>107005610
Oh just more names to add to the filter
>>
>>107005653
> gooned
retard
>>
I'm having some trouble. Recently I tested the freeware version of a very expensive piece of local software that upscales videos from low quality up to 1080p and 4k.

The freeware gave about 3 files' worth of use; I was surprised it could restore some old episodes of 90's sitcoms.

I looked at workflows for ComfyUI and tried to adapt one for my old episodes, doing the same as this bullshit 300USD licensed software.

The only trouble is my workflow fucks up and runs out of RAM when I'm around 13%-17%.

So far I'm using nodes that take the whole video file and run it through the workflow.

My question is, should I...

>split each video into ten pieces to make them go through my workflow,

or

>split the original video frame by frame and get another node to cycle through a massive batch of frames?

And that's assuming I've built the workflow right and I'm not fucking up on my end. Since it's something I've been able to do on my computer with private paid software, there must be a way to figure out how to do it my way with ComfyUI. That program literally used upscalers from the internet made by other people, so I reckon it's a codemonkey packaging other people's stuff into his own app; I hope I can recreate it on my own.
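
Splitting into chunks is the usual fix, since most video-loader nodes decode every frame into RAM at once. A sketch of the chunk/upscale/rejoin route with ffmpeg (paths and the 60s segment length are arbitrary examples):

import subprocess

src = "episode.mp4"

# 1) cut into ~60s segments on keyframes (stream copy, fast, no re-encode)
subprocess.run(["ffmpeg", "-i", src, "-c", "copy", "-f", "segment",
                "-segment_time", "60", "-reset_timestamps", "1",
                "chunk_%03d.mp4"], check=True)

# 2) ...run each chunk_XXX.mp4 through the upscale workflow separately...

# 3) rejoin the upscaled chunks with ffmpeg's concat demuxer
with open("list.txt", "w") as f:
    for i in range(10):  # however many chunks step 1 produced
        f.write(f"file 'upscaled_chunk_{i:03d}.mp4'\n")
subprocess.run(["ffmpeg", "-f", "concat", "-safe", "0", "-i", "list.txt",
                "-c", "copy", "joined.mp4"], check=True)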
>>
Kill AI bros.
Behead AI bros.
Roundhouse kick an AI bro's head off his shoulders.
Slam dunk an iPad baby into the trashcan.
Crucify manipulative AI scammers and grifters.
Hammer a stake into an AI gooner's heart while they are sleeping.
>>
>>107005840
i will rape your twink ass
>>
>>107005847
It's funny that you say that because I plan on installing Linux mint today.
>>
What if nu pony is actually really good but we can't see it because we all suck at prompting
>>
>>107005866
what?
>>
>>107005866
Linux Mint is probably the most approachable distro in terms of matching Windows' usability but even then it's a clusterfuck of issues.
It's a-okay but goddamn do I hate linux already. Endless stream of dependencies etc.
>>
File: 20251025_123651.jpg (402 KB, 1527x1113)
I have been experimenting with Chroma1-HD-Flash as part of a larger workflow. I have this issue where if "elf" or especially "pointy ears" is in the prompt, it /always/ sticks these crappy earrings in. Always the same style of earring. Even if I leave jewelry, earrings, everything out of the prompt, they still appear. If I img2img an existing image that has no earrings, it will insert them. I tried to partially mitigate it by adding "stud earrings", hoping they would at least not hang, so they'd be easier to remove manually. But all this did, mostly, was add stud earrings *and* hanging ones. Adding earrings to the negative and raising the cfg helps somewhat, but they still appear about 1/3 of the time, and of course that massively slows down chroma flash, defeating the purpose of it.
This must be bad tagging, right? The training images had earrings that were not mentioned in the caption, so they slip in undesired.
>>
File: 108469 - SoyBooru.jpg (248 KB, 816x1024)
>>
>>107005795
you can try to find out what models that app uses and google or ask chatgpt how to run them
>>
File: file.png (1.78 MB, 1280x1536)
>style_cluster_1610, score_9, rating_safe, cowboy shot of iwakura lain wearing a sexy halloween witch dress with a witch hat, holding a hallowen basket in one hand and putting her other hand behind her head. She has a mischevious evil grin looking at the viewer. She's standing in front of a door, behind her a faintly lit road in a suburb. The point of view is from inside the house facing the door and the girl. The atmospherie is eerie and supernatural
>default settings from the official workflow
BROS this is FUCKING GARBAGE, fucking ponyV7 I CANT FUCKING BELIEVE I DOWNLOADED THIS GARBAGE
>>
File: file.png (1.87 MB, 1280x1536)
>>107006018
2nd try with another seed. might be irredeemable, unless I'm prompting wrong
>>
File: file.png (2.75 MB, 1280x1536)
>>107006036
3rd attempt.
Also, a correction for the 1st attempt: I used 'full body shot' instead of 'cowboy shot'
anyway, garbage all around.
>>
>>107006018
>style_cluster_1610, score_9, rating_safe
I thought this was the stuff everyone hated about Pony... he kept it anyway!?
>>
>>107006076
>>107006036
>>107006018
Skill issue.
>>
>>107006018
>>107006036
>>107006076
sovl
>>
>>107005656
cute
>>
>>107005795
the most annoyingly formatted post on this site fuck off
>>
File: file.png (23 KB, 376x232)
>>107006138
>>107006139
Amazing if organic
>>
I think the problem might be with the style cluster? the default one was for pony fuckers I guess but on the model card in HF I see no mention at all of where these fucking styles are.
but the first mistake I made was this:

>When referring to characters use pattern: <species> <gender> <name> from <source>
>For example "Anthro bunny female Lola Bunny from Space Jam".
something that no other model has required before lol, I'll try changing some of the prompt around too.
>>
Seedream is cool but it is so completely constrained by your prompt that it quickly becomes boring. There was far more variation between same-prompt gens in Dall-E 3 than there is in Seedream 4.

Local is still king, I think.
>>
File: file.png (112 KB, 877x753)
>>107006181
gem
>>
File: file.png (1.94 MB, 1280x1536)
>style_cluster_1610, score_9, rating_safe, human girl Iwakura Lain from Serial Experiments Lain. She is wearing a sexy halloween witch dress with a witch hat, holding a pumpkin hallowen basket in one hand and putting her other hand behind her head. She has a mischevious evil grin looking at the viewer. She's standing in front of the viewer's house's door, behind her a faintly lit road in a suburb. Cowboy shot. The atmosphere is eerie and supernatural
nailed the character this time, and adjusted some of the prompt to make it simpler to understand where she is. Also I added the word pumpkin for the next gen. Tbh it looks a bit undercooked, I'll try adding more steps, maybe that'll fix it
>>
File: file.png (1.95 MB, 1280x1536)
>>107006278
2nd gen, same default steps, now specifying the pumpkin. I'll try the first seed's image without the pumpkin and with double the steps
>>
>>107005962
Elf styles are so drearily conventional that it's really hard to fight against the model, and that's not just with Chroma. But yeah I'm trying right now with Chroma1-HD-Flash to see if I can do it, and I can't lol. Gonna keep trying though
>>
>>107006278
Did he overtrain the model, choose the wrong parameters, or is Auraflow just that shit no matter what you do?
>>
File: file.png (2.39 MB, 1280x1536)
>>107006278
40 steps instead of 20 of this. Better, but it ultimately still looks like fucking garbage in the details (eyes/hands). Maybe this needs even more steps? Trying 60 now
>>
>>107006018
>fucking ponyV7 I CANT FUCKING BELIEVE I DOWNLOADED THIS GARBAGE
I thought that was only for generating horses
>>
File: file.png (2.54 MB, 1280x1536)
>>107006340
60 steps, not much difference.
Might test 30 steps, but for now I'm gonna test CFG changes.

40 steps 4.5 cfg next (default cfg was 3.5)
>>
is it possible to make funny videos in wan, or is that out of the model's purview?
>>
File: file.png (2.35 MB, 1280x1536)
>>107006340
this is 40 steps at 4.5 CFG,
hands are decisively better, so are the eyes
>>
File: file.png (1.68 MB, 1280x1536)
>>107006278
and this is 20 steps 3.5 CFG lmao bros what the fuck
>>
nodes are kinda shit when it comes to videos. where is a UI that has sequencers and timelines? is that too much for techbros to handle? all this node kikery is a waste of my fucking time
>>
>>107006392
Ideally we would have an interface like blender, where we have nodes and timeline/sequencers. would be fucking kino actually.
>>
what base of chroma is everybody using to train on?
>>
>>107006384
*4.5 CFG
anyway I'm done testing for now. It isn't half bad desu, I'm sure I'm fucking up the prompting in some way, but for now I can't be bothered to look at civitai's example gens to see how people are getting the good ones.
I actually just checked the official examples, and they're all 40 steps 3.48cfg.
I don't understand why the comfy workflow comes with 20 steps, gens are fucking undercooked.
4.5 CFG looked better to me than 3.5, would require a bit more testing.
I still don't see a way to look up the style clusters, so if anyone could point me in the right direction I would be grateful
>>
>>107006384
I have found that the model is extremely sensitive to literally everything.

Try CFG as high as 6. Try samplers like dpmpp_2m_sde_gpu, or euler_cfg_pp (with low CFG). You can get dramatically different styles and vibes.

Another thing I noticed: natural language prompt gives strong western / digital art style. Danbooru tag prompt gives a decent default anime style. Clearly the training data wasn't uniformly captioned in both styles.
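
If anyone wants to script the usual CFG x sampler matrix instead of clicking through it: ComfyUI's /prompt HTTP endpoint takes a workflow saved with "Save (API Format)". A sketch, assuming the default server address and that the KSampler's node id in your export is "3" (check your own JSON, yours may differ):

import json
import urllib.request

with open("workflow_api.json") as f:
    workflow = json.load(f)

for sampler in ["euler", "euler_cfg_pp", "dpmpp_2m_sde_gpu"]:
    for cfg in [3.5, 4.5, 6.0]:
        # patch the KSampler inputs, then queue the gen
        workflow["3"]["inputs"]["sampler_name"] = sampler
        workflow["3"]["inputs"]["cfg"] = cfg
        req = urllib.request.Request(
            "http://127.0.0.1:8188/prompt",
            data=json.dumps({"prompt": workflow}).encode(),
            headers={"Content-Type": "application/json"},
        )
        urllib.request.urlopen(req)  # outputs land in ComfyUI's output dir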
>>
>>107006392
AniStudio will have that soon according to the dev
>>
>>107006468
>>107006468
>>107006468
>>107006468
>>107006468
>>
>>107006460
I'll maybe wait for another kind anon to do the usual MATRIX of CFG x SAMPLERS.
I thought pony only worked with NL, that's what the official images use. I'll try a round with booru prompting, but later.
>>
>>107005067
retard
>>
File: 1742592388306549.jpg (117 KB, 850x668)
>>107005962
try adding 'frieren' to negatives
>>
trani is a demented faggot that comes here to shill his toy project UI that no one uses and spreads FUD about comfy. ran is a faggot that posts obese women here occasionally and is trani's boogeyman
>>
>>107005962

how can the model always give elf ears earrings when the ears aren't even pierced?
>>
>>107002180
i wanna play this skyrim mod
>>
File: QwenEdit_00183_.png (1.01 MB, 1104x944)
>>107002099
Do you mean Miku Hatsune?
*ducks*



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.