Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106464276

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
Chromaforge: https://github.com/maybleMyers/chromaforge
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://rentry.org/wan22ldgguide
https://github.com/Wan-Video
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Blessed thread of frenship
They tell me this thread is blessed
neta is the future for anime, start learning now: https://neta-lumina-style.tz03.xyz/
>>106469516
>>106469514
all of God's children are blessed by his grace
>>106469536
>.xyz
Oh yes, anon, this link is definitely trustworthy.
>>106469565
Seems fine
>normalfags calling other normalfags out as being too eager to use the word "clanker" as a slur
ai bros stay winning
>>106469612
I didn't realize styxhexenhammer was this big
is there any hope for local t2v? except for making schizo videos, local t2v is obsolete. only local i2v is excellent for serious things
>>106469625
It's pretty good for porn with porn LoRAs, but yeah, most videos end up like a fever dream.
>>106469625
I mean, we're basically in the Ford Model A era of this stuff and here you are writing off the future of cars. Bit silly, don't you think?
>>106469621
i dont know who that is or how he relates
>>106469625
It's good for producing extremely plausible realistic single frames from the video. Like, if candid realism is your goal I'd just extract a frame from i2v.
Anti AI fags have gone fucking nuts the last few months.
what's the best tool for automated video captioning? I want to try training a t2v lora but I don't wanna caption my entire dataset by hand. I'm trying chatgpt but it can't caption videos for shit.
>>106469652
>the last few months
That's an understatement
>>106469625
remember how anon hyped Wan T2I? shame the diaperfag decided to tune qwen instead. maybe the bigasp guy will do it.
>>106469658
Gemini. It shits all over the other models for OCR stuff. For example: >>106469558
Based on the video provided, here is a description of what happens:
The video features two characters from the TV series Star Trek: The Next Generation: the android Lieutenant Commander Data (on the left) and Lieutenant Commander Geordi La Forge (on the right). They are seated side-by-side in the cockpit of what appears to be a shuttlecraft.
The sequence of events is as follows:
Data holds up a blue and white package resembling a pack of cigarettes.
He removes a single cigarette and places it in his mouth.
The tip of the cigarette spontaneously lights up on its own.
Data takes a drag from the cigarette and then exhales smoke.
Throughout this entire action, Geordi La Forge looks at Data with a surprised and bewildered expression.
>>106469536
>Boris Vallejo
Looked up his style and... that's quite sad. Of course, nano banana can pull off his style just fine.
love me ambiguous language
According to Gemini:
In this video: The scene opens with Captain Picard looking thoughtful in his ready room. In the background, the Vocaloid star Hatsune Miku, now impressively dressed in a red Starfleet command uniform, walks into the room holding a Starfleet PADD (Personal Access Display Device).
She stops and stands behind the captain. Then, in a move that would surprise even the Borg, Picard picks up a 20th-century-style handgun from his desk and calmly examines it.
It's a fantastic edit, but a few small clues give it away:
The Starfleet Guest Star: As you know, Hatsune Miku, while a cultural icon here in Japan and worldwide, never actually made it onto the Enterprise-D's crew roster.
The Firearm: The biggest giveaway is the prop. Starfleet exclusively uses energy-based weapons like phasers. A projectile-based handgun like that is a massive anachronism and would be completely out of place in Picard's ready room, unless he was in the middle of a Dixon Hill holodeck program.
This is another great example of combining real footage with a digitally inserted character to create a completely new, surreal, and humorous narrative.
>>106469755
Isn't Gemini paid? Do you use it with some local client like Kobold or SillyTavern? I'm trying to find a good uncensored LLM for images.
>>106469755
those are not good captions
>>106469705
>Two elf female characters with pointed ears. The character in the foreground has long, wavy blonde hair, light skin, and wears a blue, off-shoulder dress with white fur trim. She has a worried expression and blue markings on her forehead. The character behind her has long, orange hair, light brown skin, and wears a sleeveless purple top. She has green markings on her face and is embracing the blonde-haired character from behind, with one arm around her shoulder. The background is a textured, dark green and blue gradient, resembling a forest or cave
>>106469887
basterd bitch delete this
What's the most viable captioning method for deviantart-tier freak fetish stuff?
should i get my lazy ass out of bed and finish installing wan? how long do gens take with a 5090
>>106469998
they'll take no time at all you fuckin ass
>>106469705
>>106469887
>>106469998
sure / depends on settings, but on the order of a few minutes for most. you can do 1 minute gens at a not-too-terrible resolution if you take the fast options with 4 steps or so.
>>106470011
my bad man, i genuinely got no clue on this shit
>>106470024
thanks anon
>>106469998
it takes me roughly 4-5 minutes for a 720p 8 second video
>>106470020
Just like my futa doujins!
>>106469998
2.2 is so good it convinced me to try training video loras.
>>106469701
thanks bruv. got everything captioned, surprised it let me do them all for free
Damn, AI inventing new instruments.
>>106470133
very mongolian
>>106470141
I fed Gemini a Batzorig video screenshot lol
>>106469883
Yeah, but I didn't prompt it how to caption. I was just more interested to see if it could identify what was happening in the video at all.
>>106470085
>surprised it let me do them all for free
np. I assume they do it as a means to capture audience share. Their free stuff is very generous. I honestly just pay for Gemini as my GPUs are usually blasting away at training most of the time anyway. It's the best model for captioning in my opinion, and the fact google made veo 3 should indicate as much.
>>106469887
>>106470020
>Reference image
Literally just prompt for the guy:
>A caveman carrying a wounded woman while pointing a spear at a large flying bird over them while sitting atop a rocky hill by Boris Vallejo
https://files.catbox.moe/zcos9q.jpeg
Local would never.
did comfy fix the shitty qwen edit text encode node?
>>106470186
every fix breaks two more things. python was a mistake
>>106470176
Do you have some special version lol? I get this on nano which isn't even close.
>>106470176
Didn't think it would recognize it. Flux can't really handle a more complex composition like that.
>>106470235
Not bad. Unlike the original, nano banana can't show me booba, so a Chroma LoRA would win anyway. As for the results you're getting with nano banana, no idea what you're using. I can get his likeness right away, even across other seeds.
>A caveman with a shield standing atop a rocky hill while goblins are incoming. A woman kneels beside him by Boris Vallejo
https://files.catbox.moe/ak75k4.jpeg
ultra cozy
is there anything as good as veo3?
>>106470779
Yes, veo3 is as good as veo3.
>>106470211
if not for the toes I would say it's not a gen
>>106470885
I was not aware she had a cat.
2.2 for vace soon?
>>106470940
Expected miku to walk out of that... This thread is getting to me
>>106470885
Who are those two from? I recognize them from something...
not what I wanted at all but okay.
bros... i beg... do loras work with qwen nunchaku yet... bros...
>>106470993
Himawari and the flat-chested one from Yuru Yuri
>>106469701
>He removes a single cigarette and places it in his mouth.
obviously wrong, he was holding the cigarette already
>>106469755
>Then, in a move that would surprise even the Borg
wtf, implying they have feels
>>106471031
There was that one episode where the borg had feels.
>>106470885
Animate this.
>New furk post
See any issues here?
>>106471219
>water is wet
hey bros anyone got a spare 5090 to donate :) I promise ill train some qwen ToT loras with it
>>106471248
I also just found out he blocked me. But look at his loss. He's basically trained a broken LoRA and is bragging about it.
>>106471219
Is that 5600 steps??
>>106471219
nans for days
I'm not terribly impressed by how Qwen handles traditional media.
>>106471307
Ask furk to train a NaN lora for you.
>>106471219
well yeah, they aren't giving consumer cards 96gb vram because it would destroy their enterprise market overnight. that's why i'm hoping for a deepseek-level breakthrough from china but in the hardware space. they already have modded cards. they are also making 96gb custom cards but they're kind of shit because of low bandwidth, no cuda, and shit-tier drivers.
>>106471307
The paintings themselves in the back are honestly really well done, just the girl is slopped.
>>106469998
5 hours of genning god damn. having fun with the I2V
>>106471172
just imagine AI generating FPS walkthrough jump-scare game movies in perpetuity
I shouldn't have updated my OS. DRAM/VRAM management is kinda fucky now. God damn.
>>106471357
It's not his post that's cringe. It's that he's bragging about his hardware while being unaware he is basically showing the world that his LoRA is stillborn.
>>106471711
nta, but I can't help but question how the man is such a prolific (shit)poster seemingly everywhere but somehow missed that his training run was cooked from the get-go.
Anyone ever used the captioning tool in Onetrainer? Usable or megacopium only good for boorutags?
>>106471833
I'm convinced his low intelligence robbed him of his ability to second-guess and check himself, and by radiating enough confidence in a field most people knew little about, he was able to accidentally grift his way to notoriety by just being a fucking idiot.
>>106471933
usable, but you still have to check manually for any flops afterward; if anything it saves time by doing the heavy captioning for you.
>>106472188
Can it do nsfw?
I know you can upscale and interpolate wan video but is there anything to fix any fuckups in the video like when something gets blurred out or things like that?
with wan 2.2 you can save the latent from the high noise sampler and reroll with the low noise sampler to hopefully get a better result
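The mechanics are easy to sketch outside ComfyUI — the toy function below stands in for a real high/low-noise sampler pass, and the cache path is just illustrative:

```python
import os
import pickle
import random
import tempfile

def run_stage(latent, seed, strength=0.1):
    # Stand-in for one sampler pass: deterministic given (latent, seed).
    rng = random.Random(seed)
    return [x + strength * rng.gauss(0, 1) for x in latent]

# High-noise stage: run once and cache the intermediate latent.
init = [0.0] * 8
high_latent = run_stage(init, seed=42)
path = os.path.join(tempfile.gettempdir(), "wan_high_latent.pkl")
with open(path, "wb") as f:
    pickle.dump(high_latent, f)

# Low-noise stage: reroll with different seeds without redoing stage one.
with open(path, "rb") as f:
    cached = pickle.load(f)
reroll_a = run_stage(cached, seed=1)
reroll_b = run_stage(cached, seed=2)
```

In ComfyUI terms this is roughly a Save Latent node after the high-noise KSampler and a Load Latent node feeding the low-noise one with a fresh seed, so you only pay for the second stage on each reroll.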
Can I run Wan on my M4 Max? How's the speed?
>>106470133
>Playing 'viking boat'
>>106471711
Are you telling me that ohwx-man training of himself is not the true way?
Seriously, this guy has been doing this for such a long time yet he has learned nothing: still seems to think there's some magic token combination, still hasn't understood that repeats are only for balancing training data when doing multiple concepts at the same time, doesn't even understand the principle of A/B testing and instead changes lots of parameters between experimental runs. Snakeoil salesman if there ever was one.
>>106472298
You probably can, I saw someone posting that they got it running, but the speed was something horrendous.
I'm going insane trying to find a good noob-based model with decent coherence that allows some flexibility beyond basic tags. For example, this one https://civitai.com/models/1201815?modelVersionId=1491533 - you can actually add variations like 'blue glowing tattoo' instead of just 'tattoo' without it breaking. Problem is, these models all have shit mixes and can't follow artist styles closely like vanilla noob does. But I'm too much of a shitter to get kino results with plain noob. Is there a good middle ground model/remix that actually respects artist styles while being more forgiving?
>>106472485
anything that uses (only) CLIP will never give you the control you seek
>>106472485
Your best bet is to find a model that has an LLM text encoder slapped onto it. Idk if noob has a variant like that tho.
>>106472485
illustrious has limited natural language support
beeg birb
>>106471573
The 1girl machine keeps churning, but memory management sucks.
>>106472485
https://huggingface.co/Minthy/RouWei-Gemma
someone's been trying to stitch better encoders to sdxl but i don't see much difference so far
>>106471219
>NaN
That explains those terrible LoRAs of himself.
>>106472717
lul
>random character sheet out of nowhere
Thanks, I guess
>>106472904
What did you prompt for?
>>106472916
https://genshin-impact.fandom.com/wiki/Jahoda
The appearance paragraph xd. I just specified anime artstyle. I guess AI-generated articles are good prompts lmao
>>106472904
not a footfag but that little red foot is cute
>>106471391
At least it draws really good legs
>>106472639
That's not a Luger. Man.
beeg guy
>multiple seeds, samplers and schedulers
>it keeps giving me ref sheets
There is no way a paragraph can have such strong specific style "vibes" that it fucks with the model. Is this a thing?
>>106469887
Seb McKinnon Lora for Flux? That's not available on civitai, I want to test it.
>an entire separate general of waisloppers
mortifying
>>106473136
wrong post
>>106470020
>>106472717
It feels like progress has been pretty stagnant for a few months. too bad the software still sucks ass and just got worse. any new models on the horizon to look forward to at least?
>>106473269
qwen hinted at some updates but that's about it.
>>106473269
Goddamn man, we just had qwen not even 2 weeks ago. Is this the zoomer brain I heard about?
Chroma loras are so easy to bake, all you need is:
10-15 512x512 images
natural gemini captions
adamw optimizer
constant scheduler
batch size 1
set it for about 2K steps (about 150ish epochs)
>512x512
we're regressing, not progressing
>>106473301
I don't know if I would call that good quality, anon. This seems to have the same quality problem flux had when trained at such a low resolution.
>>106473282
probably some standard controlnet IP adapter thing for their existing stuff.
>>106473289
qwen isn't really as impressive as it should be at that size. synthetic slopped datasets are a step backwards and the two stage models for wan is just annoying for a 10% higher quality video than 2.1
>>106473301
>Chroma loras are so easy to bake,
Yes, it's shockingly easy to train Chroma loras effectively.
>natural gemini captions
Don't need this, JoyCaption is good enough, and if all you train is a single concept like a person (or even an art style, assuming it's consistent) you can train with just a simple 'foobar' nonsense tag and it will have no problem training it.
You didn't mention learning rate: for people I would suggest 0.0001 (1e-4); for art styles you probably want to go a bit higher since it's more abstract as a concept.
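The recipe above, written out as a hypothetical trainer config — the key names are illustrative only, not any particular trainer's actual schema:

```toml
# hypothetical LoRA training config; key names are illustrative only
[dataset]
num_images = 12            # 10-15 images
resolution = 512           # 512x512
captions   = "natural"     # natural-language captions

[training]
optimizer    = "adamw"
lr           = 1e-4        # ~1e-4 for people; a bit higher for art styles
lr_scheduler = "constant"
batch_size   = 1
max_steps    = 2000        # ~150ish epochs on a dataset this size
```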
>>106473365
>qwen isn't really as impressive as it should be at that size
you are right but a lot of people don't want to believe it. people will dismiss the arena rankings as nonsense, but qwen ranks around the same place as hidream and it honestly looks it. i went to train a qwen lora and it was like 60gb worth of slop. it's incredibly bloated for a model that does not feel anywhere near the top 10. if it was good it would've made way more strides like flux dev did compared to SD3/SDXL (though at the time we didn't realize how impossible it would be to tune).
>>106473365
You know I was gonna take you seriously but then
>10% higher quality video than 2.1
Ahh another vramlet seethe. Trust me, if you can't run this you'll probably just dip from the scene; it's gonna get worse from here.
recommend me some cool Illustrious base model hidden gems
>>106473301
people should just ignore chroma and work on qwen. chroma is like bigasp: big potential, crappy results. maybe another model merged with chroma will save the day
>>106473301
when will girls stop having their feet buried in the ground?
>>106473446
When you type 'full body shot' or prompt something regarding footwear
Do flux loras still work with chroma
>>106473431
>chroma is like bigasp
come on now, let's not be disingenuous... at least bigasp was trained at 1024x!
>>106473264
>>106473365
>>106473394
>qwen isn't really as impressive as it should be
I'd agree, but the alternatives in terms of prompt adherence and non-mangled hands/poses are kinda slim. But the shocking amount of sameface and general lack of variance between seeds hurts the model a lot. Picrel: 3 seeds, same prompt.
because qwen is DPO'd to shit, it has been said multiple times. you need to inject noise if you want an actual different image per seed
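A minimal sketch of that noise-injection idea — plain Python lists stand in for latents, and the scale value is something you'd tune per model:

```python
import random

def init_latent(seed, size=8):
    # The starting latent a sampler would normally derive from the seed.
    rng = random.Random(seed)
    return [rng.gauss(0, 1) for _ in range(size)]

def inject_noise(latent, seed, scale=0.3):
    # Perturb the starting point so a model that collapses nearby latents
    # onto near-identical images gets pushed toward genuinely different outputs.
    rng = random.Random(seed)
    return [x + scale * rng.gauss(0, 1) for x in latent]

base = init_latent(seed=0)
variant_a = inject_noise(base, seed=101)
variant_b = inject_noise(base, seed=202)
```

Same reroll workflow as usual, just with an extra perturbation node/step between the empty-latent seed and the sampler.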
>>106473497
Lack of seed variation is actually good for i2v gen purposes. You can change the pose or other details, and the face/character tends to stay the same. I agree it does take some tard wrangling to avoid unwanted generalizations.
>>106473473
Some do, some don't. You have to try them one by one.
>>106473497
I personally like it, it makes editing an image much easier without losing the entire damn composition.
>>106473360
It's not the resolution, most likely prompted to look like a phone camera shot. This is from a Chroma lora I recently trained at 512 resolution.
>>106473431
Qwen's comprehension of traditional media styles, as well as its creativity, is piss poor. I don't think it will be remedied by finetuning. It's a great model, but more experienced users will get more out of chroma.
>>106473431
One of the really popular SDXL model makers is making a finetune right now; I've used a prototype lora of it and it is VERY promising.
>>106473446
This nigga likes feet!
>>106473553
Yeah, they're prompted/oversharpened to look like phone shots after they're uploaded to IG.
I'm new to I2V and I'm following the guide. When I try to generate with the first workflow (https://rentry.org/wan22ldgguide) I get:
> ValueError("type fp8e4nv not supported in this architecture. The supported fp8 dtypes are ('fp8e4b15', 'fp8e5')")
And ChatGPT insists that fp8e4nv doesn't work on a 3090. Is it wrong?
>>106473571
>I don't think it will be remedied by finetuning
I think you can, but it will be very expensive since the model is large and massively overtrained; not sure anyone with enough money would think it's worth it.
>>106473595
I had that same issue when I tried a new comfy install, even though I used the e4m3fn models with my old install no problem. I dunno. Just try the e5m2 models.
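For what it's worth, ChatGPT is right here: that error comes from Triton, whose fp8e4nv (e4m3) kernels require compute capability 8.9+ (Ada/Hopper), while the 3090 is Ampere (8.6) — hence the e5m2 workaround. A quick sketch of the check, with capability values hardcoded for illustration (on a real machine you'd feed in `torch.cuda.get_device_capability(0)`):

```python
def supports_fp8e4nv(capability):
    # Triton's fp8e4nv (e4m3) path needs compute capability 8.9 or newer;
    # older cards only get the fp8e5 (e5m2) fallback.
    return tuple(capability) >= (8, 9)

# Illustrative capability values for a few cards.
cards = {"RTX 3090": (8, 6), "RTX 4090": (8, 9), "H100": (9, 0)}
for name, cap in cards.items():
    choice = "fp8 e4m3fn OK" if supports_fp8e4nv(cap) else "use e5m2 models"
    print(f"{name}: {choice}")
```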
>>106473571
>remedied by finetuning
The fuck? If the distilled horseshit that was flux could be unfucked by finetuning, a non-distilled model should be 100-fold easier.
>>106473656
qwen would take forever. it needs a bit more elbow grease and cash than chroma
https://chromaawards.com/
I think 11labs is paying off civit to not add a chroma category because of this shit
>>106473571
>Qwen's comprehension of traditional media styles, as well as creativity, is piss poor
correct
>I don't think it will be remedied by finetuning
it CAN, but nobody will because the model is way too bloated.
>but more experienced users will get more out of chroma.
completely false
>>106473703
People keep saying that, but every big finetune needs a lot; chroma needed 105,000 H100 hours. Are you saying qwen would need more? SDXL needed a cluster; a finetune will need big hardware for any model. It's such a non-argument.
>>106473703
chroma is 8.9b. qwen is 20b
>>106473703
>are you saying qwen would need more?
it entirely depends on the size of the model. the other anon is right, it's too bloated, but it's also overtuned, which is why there isn't much seed variation
>>106473431
qwen is overfit and bloated to shit. just do wan
>>106473723
wan is slightly overfit as well, but at least it's in the realm of doable
>>106473301
hello beautiful babe
>>106473703
yes, qwen would absolutely need more because it's massive in comparison to chroma. chroma already had to cope by removing parameters and training at 1/4 the resolution of fucking sdxl. and even with all that, he still wound up spending $150k on it. acting like the compute costs for these models are the same as SDXL's is simply retarded
>>106473703
Use your brain
>>106473577
>prototype lora
Where?
>>106473732
Love it lmao
>>106473766
I've said too much.
>>106473797
Oh, you meant a chroma tune, that's good; I was wondering how you managed to make a qwen tune look so shitty lol
>>106469536
Neta is not perfect, but it's the only anime model that can handle multiple subjects on screen without mangling them.
>>106473431
Most people outside of here are ignoring chroma.
>>106473732
>>106473301
tranny hands
>>106473797
>prompt for indian man
>get a cholo
>>106473977
>t. has futa images saved
>>106473720
>white mans kriptonite.png
>>106473720
>>106473687
Could be, damn
Are flux dev and schnell loras interchangeable?
>>106474328
Hello, I'm trying to switch from Forge to ComfyUI. I prefer ComfyUI's interface because my entire txt2img + hires fix workflow fits on my screen without scrolling.
The problem is that I can't get it to work correctly. I've posted more details in the attached thread. Any help would be appreciated. Thanks!
json: https://files.catbox.moe/nv2b7k.json
>>106474388
I think sdxl wants -2 clip layer
>>106474103
tranny eyes
>>106473637
Yeah, that worked.
>>106474388
clip needs to be -2. Not sure what you're trying to do with the tiled VAE encode/decode nodes. Also, 'BREAK' commands don't work in the default CLIP Text Encode nodes; there are custom nodes that use the A1111 parser if you want to keep them, but in Comfy you should break each one out into separate text encode nodes and concat them.
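For context, "clip -2" just means taking the penultimate hidden layer of the text encoder instead of the final one, which is what SDXL-family models were trained against. Roughly, with toy strings standing in for real hidden states:

```python
def clip_set_last_layer(hidden_states, last_layer=-1):
    # last_layer=-1 -> final encoder layer; last_layer=-2 -> penultimate
    # layer, matching ComfyUI's CLIP Set Last Layer node semantics.
    return hidden_states[last_layer]

layers = ["embeddings", "layer_1", "layer_penultimate", "layer_final"]
chosen = clip_set_last_layer(layers, last_layer=-2)
```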
>>106469492
how do i generate abstract happy merchant memes? i am super retarded when it comes to prompting ai
>>106474602
literally just img2img and play with the denoise
>>106474536
Thanks, I am looking here and in /adt/ for answers. All those options were loaded by default when I dragged the gen made in Forge into Comfy. I will keep in mind what you told me.
>>106474631
you can also just disable the CLIP Set Last Layer node, ComfyUI will automatically use -2 with SDXL models.
>>106474631
>>106474388
I think it would be much faster to just build a workflow from the ground up
>>106470041
Fuck me, I don't know how you guys put up with that; 5 seconds for my 1girls feels insufferably, inexcusably long as it is. I'm sure someone will chime in with "it's worth it" or whatever, but that's in "too long for me to bother" territory. And that's with the fastest consumer card.
>>106471050
Well, it's a good thing the oral insertion lora bandit got bored, otherwise... ToT
Say I'm training a lora and I want 3000 steps total, with 20 images.
Is it better to do 3 epochs with 1000 steps each, or 6 epochs with 500 steps each?
Is there a noticeable difference between the two at the same step count (i.e., the first at epoch 2 / 2000 steps vs the second at epoch 4 / 2000 steps)?
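Worth noting the two numbers aren't independent: with a fixed dataset and batch size, the total step count pins down the epoch count. At batch size 1, one epoch over 20 images is 20 steps, so 3000 steps is 150 epochs no matter how you label it; "1000 steps per epoch" only happens if the trainer adds repeats. A quick sanity check (batch size 1 assumed):

```python
def epochs_for(total_steps, num_images, batch_size=1, repeats=1):
    # One epoch = one full pass over every (repeated) image.
    steps_per_epoch = (num_images * repeats) // batch_size
    return total_steps / steps_per_epoch

print(epochs_for(3000, 20))              # 150.0 epochs
# "1000 steps per epoch" would imply 50x repeats of the 20 images:
print(epochs_for(3000, 20, repeats=50))  # 3.0 epochs
```

So at the same cumulative step count (and seed) the two schedules are essentially the same training run; the epoch split mostly just changes when checkpoints and sample images get saved.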