/g/ - Technology

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106457557

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
Chromaforge: https://github.com/maybleMyers/chromaforge
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://rentry.org/wan22ldgguide
https://github.com/Wan-Video
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
Cake.
>>
>>106464276
you forgot to read anistudio
>>
File: AnimateDiff_00265.mp4 (3.65 MB, 720x1040)
>>
Cursed thread of hatred and animosity
>>
Blessed thread of frenship
>>
File: ComfyUI_temp_knjof_00001_.png (2.67 MB, 1024x1024)
>50 steps euler/simple
Yeah radiance needs to stay in the oven
>>
>>106464308
TO VALHALLA
>>
>>106464345
I am curious about the final result, but to me the entire concept seems backwards. A big reason we use a VAE is to cut computational cost; the very first models didn't use one and were a pain to train and gen with.
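
For scale, a rough sketch of the savings (assuming an SD-style VAE with 8x spatial downsampling and 4 latent channels; channel counts vary by model):

pixels = 1024 * 1024 * 3                 # RGB image the user sees
latent = (1024 // 8) * (1024 // 8) * 4   # what the denoiser actually processes
print(pixels / latent)                   # -> 48.0, ~48x fewer values per step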
>>
so chroma.... what happened there???
>>
File: 1745435907365688.png (860 KB, 900x716)
Can someone share a good workflow to upscale images while keeping them sharp?
What I tried so far looked like shit.
>>
Is it hard to set up a local model? I've tried to watch a few guides and it seems quite overwhelming.
And would a 9070xt be good for generating videos/images?
>>
File: IMG_2925.jpg (3.28 MB, 5712x4284)
>>106464424
at least yours is INSIDE your case lmao. This is what i get for trying to upgrade a prebuilt. Card don't fit. psu don't work. bought a second replacement psu but it doesn't have enough PCIE plugs so i have to keep using the original psu as well.
but it works. so
>>
>>106464510
>AMD
it's 10x harder than the plug n' play Nvidia. If you thought it was hard after watching videos it's not going to be a good time
>>
File: 1740252647672142.png (1.63 MB, 991x1562)
>>
>>106464404
too much the diaper
>>
File: AnimateDiff_00259.mp4 (2.15 MB, 720x1280)
milk
>>
File: ComfyUI_temp_nkjid_00006_.png (3.77 MB, 1536x1152)
>>
>>106464643
Are you pretending that you can't prompt with 12gb vram? You must be some /saasdg/ fag who has never used local.
>>
File: AnimateDiff_00268.mp4 (3.14 MB, 720x720)
>>106464404

A bit of a disappointment. I had high hopes for a model with a large character database like Noobai, with the advantage of natural language and the variety of styles.

Unfortunately, it only recognizes a few more characters than Flux or Qwen, but the advantages end there.

Since Qwen arrived, I can easily create a scene, add a background, and add characters described just with a prompt. Then I edit the characters with Noobai InPaint.
(pic related)
If they don't do something, Chroma will soon be forgotten.
>>
File: 1743836343824727.jpg (1.55 MB, 2016x1152)
>>106464736
It's interesting that qwen has almost no traction on civit compared to chroma.
>>
>>106464510
If you can follow instructions on a github repo, installation shouldn't be a problem. Coming from an AMD user: you should avoid AMD if possible. It's gonna be more of a headache to set up than an Nvidia card. If you already have the card though, and are on Windows, I've had the best luck with this repo: https://github.com/patientx/ComfyUI-Zluda
>>
>>106464749
Probably because qwen is like 20gb and vramlets can't use it.
>>
>>106464345
skill issue.
you're using either the wrong settings or you are a promptlet. try euler simple 24 steps
>>
File: 1740294979677564.jpg (1.04 MB, 1824x1248)
>>106464797
I use Q4 with a 12GB card. The loss of quality is absolutely negligible and the speed is comparable to Chroma.
>>
>>106464811
retard
>>
>>106464711
vramlet cope
>>
>>106464812
>the speed is comparable to Chroma.
I haven't used chroma but if that's true then it's slow as all fuck. takes several minutes for one image on qwen q4 on my 3060 12gb.
>>
>>106464819
damn you really got me there didn't you.
pathetic. hate on a model because it's bad, not because you are incapable of using it.
>>
>>106464835
It's the same prompt I use on normal chroma and it is perfectly fine there. And fewer steps won't make it better.
>>
qwen has nunchaku so it's vramlet friendly. chroma doesn't.
>>
>>106464859
Still doesn't have lora support.
>>
im not fond of the model but for more nuanced reasons than you
>>
>>106464749
i can't figure out how to train it. the aitoolkit method is like 2bit or some nonsense and kohya's musubi seems way too complicated. i have 24gb (bare minimum btw).

but the reason qwen, hidream, etc get no traction on civit is because nobody can run/train it. sdxl is cheap and easy to use so it has tons of content. even flux, which was hailed as the savior of local and better than dalle3, barely received any resources in comparison to XL.
>>
>>106464842
post the prompt. all my usual chroma prompts work fine on 20~30 steps. not even normal chroma uses 50 steps so you are going full retard here.
>>
I am trying to get an anime character to take off their wristwatch in wan2.2, but it is outright not playing ball.

Did the chinese developers just not include that in the model's training? I would have thought it would be fairly simple since it can do clothes stripping well enough
>>
>>106464749
what are you talking about? qwen has far more lora than chroma on civitai.
>>
>>106464918
i don't get why people don't realize you can rent a 40GB A40 for $0.40/hour and train a lora for like $1
>>
sars
>>
>>106464812
>The loss of quality is absolutely negligible
lol
>>
>>106464301
that level of titty squish up needs to be mandatory dress for all hot women.
>>
>>106464964
It is tho. The only loss is with the text capabilities. The image is barely touched.
>>
>>106464964
Oh, go on, prove me wrong.
>>
>>106464941
it's not a 1-click setup. The majority of lora trainers just use civitai to train loras, or the free-tier google colab (emphasis on the free part). Even $1 is too high a barrier to entry.
>>
>>106464977
>>106464984
you people clearly have low standards and/or are blind so no point showing examples
>>
>>106464999
So all bark no bite, eh? Fuck outta here.
>>
quality loss only kicks in at the level below whatever quant i can run
>>
>>106465006
Q4 looks just like Q8 looks just like FP16 looks just like FP32 I swear guys I swear!
>>
>comparing Qs to FPs
retard alert
>>
>generates plastic fluxslop at Q2
saaaarssss where cellphone camera lora for fix output
>>
>>106465015
>got blown the fuck out so badly she just started pissing and shitting herself all over the thread
>>
>>106465030
>thinks quantization is magic
>>
>>106465033
indians have a tendency to fall for cheap magic tricks
>>
>>106465033
Post several comparisons across different quants. Assuming you can even run any of them, attention whore.
>>
>>106465049
you must be a newfag if you haven't seen the endless quant comparisons already
>>
>>106464276
Nano banana is insane for genning subject LoRAs without a lot of pics. There's nothing that comes close. When will local catch up?
>>
>>106465020
>implying fp8 looks better than Q quants > Q4
>>
why did they call it nano banana though
>>
>>106465059
qwen edit is the only contender really
>>
>>106465057
threads recently have been overrun with saars trying to hop on the video bandwagon. they don't like to accept the fact that trying to fit a 20b param model into 12gb requires corners to be cut.
>>
>>106465069
most engineers at Google have nano penis. Asian genetics aren't very kind
>>
>>106465015
>Q8 looks just like FP16
To be fair, it's really, really close.
>>
>>106465057
Still see no comparisons from you
>>
>>106465069
because it is under 1B parameters
>>
File: Chroma_00010_.png (1.76 MB, 920x1536)
>>
File: test.jpg (813 KB, 2784x2496)
Testing out my fashion wildcards on Qwen and a few other models right now.
Which one of these do you guys like more?
Mostly in terms of face.
Prompt adherence is pretty damn good on Qwen, though, gotta admit.
Here's the Yume Kawaii prompt I used:
>A highly detailed photograph of a young japanese woman. She sports an oversized white t-shirt dress with a magical girl transformation sequence print in pastel gradients, worn over a lavender tulle petticoat that peeks out below the hem. Her feet are wrapped in platform boots covered in pearl white holographic material with chunky 4-inch soles and lace-up fronts adorned with star-shaped charms. A sleeping mask accessory rests on her head like a headband - pale pink satin with gold embroidery reading "Sweet Dreams" and dangling pearl chains. Her hair is a long pastel lavender wig with bangs, styled in loose waves and decorated with tiny LED star clips that twinkle softly. Her face showcases pale blue circle lenses, white eyeliner drawn in star patterns at the outer corners, cotton candy pink blush applied generously under her eyes, and glitter tears made from iridescent gems.
>>
Can we fast forward to when NetaLumina or one of its derivatives is good enough to replace Noob
>>
>>106465072
Close, but no dice. Photorealism just isn't there yet.
>>
how do we fast forward to a timeline that doesn't exist?
>>
>>106465125
right looks better. eyes on the left look digitally fake
>>
>>106465059
>When will local catch up?
Comparing local against a huge new model that couldn't run on consumer hardware even if it weren't proprietary: are you retarded or just a saas shill?
>>
>>106465125
neither are showing pantsu so both are bad
>>
>>106465132
Quite honestly i'm already using it a lot. Very fun to play with because of the insane prompt compliance.
>>
File: AnimateDiff_00104.mp4 (2.99 MB, 1280x720)
>>
>>106465184
Not enough artist knowledge desu, and I'd rather not do a second pass with Noob. Not saying you can't get pleasing results, just seems like it needs a bit more time in the oven.
>>
>>106465173
Most of their "huge" parameter count is only being put to use for prompt following. For simple ID copy/style we do not need so many parameters; Qwen Edit and Kontext Pro/Max should tell you that (Kontext Pro/Max is API only, but Dev is the same size). Local is behind, but not that far behind. We just need better models.
>>
>>106464749
Ehh? I barely see chroma LoRAs. Qwen is getting a lot more. Still, flux and illustrious seem to be the most common though. Did you just look up "Chroma" and see there was more? lol
>>
>>106465223
nta, but prompt adherence is the most important thing to me personally. I am very much bored of noob/illustrious compositions, so I'm completely on the Neta train.
>>
>>106464736
You have no idea what you're talking about. Chroma (the model base) is still more powerful than any other model we've seen so far. An additional finetune would help it learn styles/characters, but the caveat is that it requires compute that even small companies do not have. Neta Lumina on the other hand is quite good with anime/styles and it should be a breeze for anime tunes. Chroma is still SOTA for local photorealism NSFW.
>>
>>106465287(me)
Also Qwen for more interesting prompts Neta can't do. I was hoping to use chroma, but man is it bad at anime. No idea why: photoreal is fine, but as soon as the word "anime" is anywhere in the prompt, everything about the model takes a giant nosedive.
>>
>>106465237
Shit you just made up

Qwen Edit, which is far behind Nano, is still a pain to run locally due to its resource demands. There's no magic that will make the gap disappear between SOTA saas running on extremely expensive hardware and local models meant for consumer hardware

But local is still better overall, because you can finetune those models to be much better than SAAS models for specific concepts, and even concepts that SAAS models will never allow due to NSFW censorship

But for pure technical prowess, SOTA saas would have to be totally incompetent for local consumer models to compete
>>
>>106465335
>Qwen Edit still a pain to run on local due to its resource demands
Works easily and fast on a $500 3090, with Q8 at basically fp16 quality
>>
>>106465335
Perceived nano superiority is due to slightly better adherence in any given scenario. Again, local can catch up given a decent unfiltered/non-distilled model.
>>
>>106465335
Nano is shit. I can't say I got even one good result out of it. Colorize? It misses half the image. "Show the character from behind"? The damn thing gives some weird combo of the front of the character's clothes that makes no sense. "Complete the rest of the image"? I get either a dwarf, since you can't change the aspect ratio of the image, or just some nonsensical design. I genuinely believe this is some sort of weird gaslighting from google. There have been multiple comparisons on reddit too, and almost all of them show how much nano fucks up the image. The censorship is just the cherry on top.
>>
File: Chroma_00022_.png (1.94 MB, 1152x1304)
>>
>>106465287
For sure, and to be clear it can do SOME artists but I'm tired of models that are technically good with the caveat of needing a lora for styles that other models can do OOTB.
>>
File: edit leaderboard.png (131 KB, 1846x1265)
>>106465059
sure qwen is behind nano banana, but it's not that far behind 4o, which was considered to be an impossibly massive monolithic 300b gigasaas model. i wouldn't discredit local so soon, especially since it's roundhouse kicking BFL's faggot API license shit right in the face. plus qwen have hinted that they're already working on a v2
>>
File: QwenChromfaceWan_00005_.jpg (322 KB, 1392x2496)
>>106465170
Alright, thought so as well. Eye colors are really weird on base Qwen. Tends to lean hard into extreme colors.
>>106465176
Now that's just lewd.
>>
>>106465388
This Wicked City, Ninja Scroll anime style still holds up
>>
>>106465343
Define “fast”. Everything seems to be getting progressively slower than sd1.5, which is “fast” in my mind
>>
File: Chroma_00025_.png (2.02 MB, 1152x1304)
>>106465445
yeah it's timeless
>>
>>106465395
>or styles that other models can do OOTB
Those models are finetunes specifically trained on art styles though, not really comparable to a base model which needs to know some of everything
>>
>>106465471
I was thinking Cyber City Oedo 808 has this style as well but I'm not sure.
>>
>>106465472
Which is why it needs more time in the oven
>>
>>106465457
less than a minute for an edit that would have taken much longer with inpainting, photoshop, regenning or anything similar
>Everything seems to be getting progressively slower than sd1.5
hardware didn't improve much while the models did; no shit toy models from 3 years ago will run faster than modern models that are an order of magnitude better
>>
>>106465457
because hardware hasn't kept up. 4090->5090 isn't as much of an improvement as 3090->4090. the vram gain is worthless because they didn't want to cut into their (already dated) a100s which are still selling for 5 figures. pray for chinese asic or something idk
>>
>>106465343
where are you finding 3090's for $500
I just searched ebay and the lowest price is $750~$800
>>
Is there anywhere I can download a decent dataset of a few thousand (at minimum) random photos? I want to train a realism lora, a good one, so I need thousands of images.

I've tried searching Huggingface. It's a disaster. Anything sourced from Pexels is unusable: the images are mostly "artsy" type stuff, bokeh out the ass, just a weird slopped look. There is a dataset that claims it is 120M images from Flickr, but it is actually 3000 unique images duplicated millions of times each (yes, really; 500 downloads per month btw). There are various Flickr30k, Flickr8k, etc. datasets, but they are resized to low resolution with no original URLs.

I'd really like to avoid resorting to scraping some website myself, but it looks like that might be the only option.
>>
>>106465502
there are quite a few, like private eye goku, vampire hunter d, etc. they had pretty high budgets back then
>>
>>106465543
why would anyone collect a great dataset for no reason? the dataset collection is the bottleneck, not the lora training, fire up that yt-dlp and gallery-dl
>>
>>106465395
If you are curious, here is a list someone did of the artists Neta can do. I don't think they went through the danbooru ones though, but yeah, overall it still needs more cooking time.
https://neta-lumina-style.tz03.xyz/
>>
>>106465567
>why would anyone collect a great dataset for no reason?
To share with the community so everyone can improve their models? That's the whole point of HF. There are plenty of image datasets there; they are just complete ass for various reasons. Either Pexels slop, or downsampled Flickr, or "whoops I fucked up and duplicated each image tens of thousands of times teehee sorry"
>>
>>106465621
>To share with the community so everyone can improve their models?
then you wouldn't need to train a lora on those images
>>
>>106465184
What does a neta prompt look like? Last time I looked through their user guide, their prompts were the most convoluted shit I have ever seen since Pony and the score_7, score_8, score_9 days.
Did you train any loras for it?
>>
>>106465610
based should be in OP and replace the khroma section
>>
>>106465610
some of these are good but a lot are scarily bad, especially how it quickly replaces basic animals with humans
>>
>>106465471
>>106465388
>>106465113
>>106464736
These belong in the Anime Diffusion Thread.
>>
>>106465723
Cope and seethe
>>
File: AnimateDiff_00279.mp4 (2.41 MB, 832x832)
>>106465132
>>106465184

oh shi- forgot about Neta lumina. Too busy with wan 2.2 and qwen, but i'll try it again.
>>
>>106465738
>tranimetard is retarded
basically every time
>>
>threadshitting faggot is a tourist
basically every time
>>
File: ComfyUI_temp_cxpmt_00001_.png (1.65 MB, 1152x1152)
If I have a dataset with hundreds of pics, can I get away with training fewer steps/epochs?
>>
>>106465757
>tranimetroony so mad and castrated by his hrt hes to scared to even reply
>but still has to reply in the thread to lash out for attention like a child he is
uh oh, not again tranimesisters, maybe... maybe if we say "tranime website!!!" on cue now we can recover?? lmao
>>
>>106465765
no. the more images you have, the more steps you should train for with a lower learning rate
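
As a toy illustration of that rule of thumb (every constant below is an assumption for illustration, not a recommendation):

import math

def rough_plan(num_images, batch_size=4, repeats=10, base_lr=1e-4, ref_size=100):
    # more images -> more total steps
    steps = math.ceil(num_images * repeats / batch_size)
    # assumed inverse-sqrt scaling of the learning rate with dataset size
    lr = base_lr * math.sqrt(ref_size / num_images)
    return steps, lr

print(rough_plan(50))   # small set: fewer steps, higher LR
print(rough_plan(800))  # hundreds of pics: more steps, lower LR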
>>
what caused the absolute seething meltdown demonstrated above?
>>
>if i call the guy btfoing me mad i dont have to engage with the argument
>no, i am NOT a child btw
tranimetards really are embarrassing...
>>
File: Chroma_00039_.png (2.15 MB, 1456x992)
>>
BIG stinky
>>
File: GeYnhsKX0AAT7BK.jpg (361 KB, 2048x1024)
>solves the fluxplastic problem
>vanishes
what was their endgame?
>>
>>106465539
I don't know why but people keep repeating this bullshit. It's just straight up false. Maybe you can find one on craigslist for $500 and go that route if you're a giga retard.
>>
File: file.png (402 KB, 823x1045)
>>106465539
>>106465849
huh?
>>
>>106465543
Flickr URLs are a standard format with no secret up to 1024
https://www.flickr.com/services/api/misc.urls.html
If they have the original ID you can grab them easily
Other than that scraping flickr is easy, you don't need to register for their api, just grab the key from the site and use the site's endpoint instead. Go for the Explore page on a few dates, should be enough for your needs and inherently high quality because they're featured
t. scraped 5b from flickr (no you can't have it)
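
If you go that route, a minimal sketch of rebuilding image URLs from API fields (the live.staticflickr.com pattern and the "b" 1024px size suffix are from the API docs linked above; the helper itself is illustrative):

import requests

def flickr_url(server, photo_id, secret, size="b"):
    # "b" = up to 1024px on the longest side; original size needs a separate secret
    return f"https://live.staticflickr.com/{server}/{photo_id}_{secret}_{size}.jpg"

def download(server, photo_id, secret, path):
    r = requests.get(flickr_url(server, photo_id, secret), timeout=30)
    r.raise_for_status()
    with open(path, "wb") as f:
        f.write(r.content)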
>>
>>106465871
>pounds
RETARD
E
T
A
R
D
>>
>>106465871
anon said $, as in USD
£500 is $669.38
>>
>>106465889
fair enough, i didn't realize how weak the dollar is recently so yeah you're right they're about 700 usd
>>
>>106465871
oi you got a loicense to post here?
>>
>>106465849
browse other sites, more local sites
>>
>>106465909
>oi you got a loicense to post here?
yes but probably not for long. enjoying my final days before the govt bans 4chan
>>
>>106465960
aren't kiwifarms and 4chan suing them? lmao
>>
>>106465879
Ok thank you anon. I was also finally able to find the LAION highres subset on HF, which appears to be decent. I'm currently scraping a subset of those URLs.
>>
There was someone in the previous thread who asked why corporations aren't doing what radiance is doing. The answer is that radiance is based on a paper that is about a month old.
https://arxiv.org/abs/2507.23268
>>
>>106466134
https://huggingface.co/datasets/madebyollin/megalith-10m
You could try this but it's cc0 trash so probably low quality aesthetically
>>
>>106466144
Yes.
But if the furryfag really wants to go VAE-less, then he should also go the route of sub-quadratic-complexity attention
>>
>>106466144
I don't know the details of the approach radiance is taking, but it has been explored before a while back.
https://github.com/ShoufaChen/PixelFlow
>>
My return is on the horizon
Just wait for me to get back up to speed and master chroma
>>
>>106466430
big if true
>>
>>106466226
>sub-quadratic complexity attention
Is this a speedup?
>>
>>106466439
It will be. I apologize in advance: the know-nothing hamsters that shit up the general and can't be happy with containment will go nuclear
>>
File: ComfyUI_temp_rpeaj_00042_.png (2.15 MB, 1152x1152)
>>
File: Jubileebakuhatsu1.jpg (1.83 MB, 3000x1537)
Any way to gen a girl causing an energy explosion out of her body?
>>
>>106466545
try with a regular prompt, but in my exp it will probably need to be rewritten to be more detailed by some AI; then try that
>>
File: fixing images 1.png (1.65 MB, 2048x2048)
How does inpainting on comfyui work? The official comfyui workflow doesn't work as well as the A1111/reforge method.
>>
File: ComfyUI_temp_rpeaj_00005_.png (3.66 MB, 1664x1152)
>>
>>106466582
>part 2 - 1
do you have the other parts?
>>
File: fixing images 2.png (3.31 MB, 2048x2048)
>>106466631
>>
File: fixing images 3.png (3.18 MB, 2048x2048)
>>106466631
>>106466642
>>
>>106466582
>How does inpainting on comfyui work?
It doesn't :^)
>>
imagine getting filtered by more knobs and buttons
>>
Can I assume that 95% here generate nsfw but are over the post-only phase and are now larping as photographers prettifying their photos of the year?
>>
>>106466679
You have never made anything
>>
>>106466679
Inpainting in comfy sucks tho
>>
File: AnimateDiff_00270.mp4 (3.23 MB, 720x960)
>>
>>106466710
only for subhuman mouth breathers that are probably using windows or apple anyway not like their opinion matters
>>
>106466729
stop damage controlling for comfy lil bro. Anyone who has ever used any graphics program can tell it sucks. Shitty mask editor with a clunky UI, no filters or other tools. Nothing.
>>
>>106466578
I can do Jubilee in a hoodie, easy
Bald Jubilee after the explosion and the little outfit she is given
It's the explosion itself that's the hard gen, glowing skin, floating, clothes disintegrating, center of energy or energy ball in center, flowing hair, glowing eyes, glitter, fireworks, abstract background, bright background,
>>
>>106466755
>too pussy to actually reply
concession humbly accepted on behalf of comfychad and his asian QT gf
>>
File: 1731843127812647.png (106 KB, 771x526)
>>106466710
I agree it was pretty bad before, but get this node in impact pack; it gets you pretty much 90% of a1111 functionality. The only thing missing is using an upscaler model for the upres before inpainting.
>>
>>106466784
He'd rather seethe than learn something new.
>>
>people now optimize everything around light loras, so using regular 20 step settings, normal cfg etc. actually produces worse results with a lot of loras now
It is what it is, light is worth it, but it's grim that for a lot of loras, movement quality is basically better in 2.1 until better loras are published
>>
>>106466784
Or I can just use Krita and have a normal human gui.
>>
File: In paint 2024 04.jpg (1.42 MB, 2500x2900)
>>106466784

This node worked wonderfully. Thanks for the tip.
>>
>>106466835
I only use light loras for posting memes/garbage for other anons. I'd never use it for my own content. It's actually not that bad for anime since anime doesn't tend to need high motion fidelity anyway.
>>
>>106460743
If you're still around can you upload a PNG to catbox? Jpeg doesn't preserve metadata
>>
>https://www.illustrious-xl.ai/sponsor
These guys are sitting on trained models and just let the hype die. Big brain.
>>
>>106467247
Are they at least trying to graft an llm onto those models or is it all clip?
>>
nice collage
>>
>>106465502
>>106465445
>>106465557
My niggas have taste

>>106465723
Nah, that thread is for straight up pedos, above can stay

>>106465871
Buy 2 for training
>>
>>106467247
By the time those release we'll have a model more widely used than SDXL, making it completely pointless.
>>
>>106467247
I am thankful we got what we got. Praying for a leak. Expecting nothing.
>>
File: 1728921925257796.jpg (901 KB, 1344x768)
>>
>>106467347
i kneel
>>
comfy should be dragged out on the street and shot
>>
so, anything new in the world of image/video genning?
>>
>>106467347
Composition on this is great, really deserves an inpaint pass on the faces and hands to add that extra detail
>>
does qwen nunchaku work with loras?
>>
>>106467275
No clue. Newer versions have way better compatibility with natural language.

>>106467328
>By the time those release we'll have a model more widely used than SDXL, making it completely pointless.
Idk man I think we might be stuck with current models for a while, which would suck

>>106467338
>I am thankful we got what we got
Yeah me too, I even put in $50 for their stardust, but I guess it doesn't matter
>>
File: 1731123809069982.jpg (1.8 MB, 2016x1152)
>>106467394
This is inspired by Henry Fuseli and quite honestly his own faces and hands/legs are quite messy too.
>>
>>106467433
>Yeah me too, I even put $50 for their stardust, but I guess it doesn't matter
$50 out of the $300,000 they need.
>>
How much does Qwen edit suffer from quanting?
>>
>>106467459
I don't think quanting hurts it as much as the speed LoRAs do. Up to Q4 should be alright
>>
>>106467459
>>106467501
Q8 or bust for image edit models if you want to keep quality over multiple passes
>>
>>106467516
Assuming you properly mask your edits it shouldn't matter. Multiple vae passes probably hurt the image way more than the quants would.
>>
>>106467516
Oh, the Q8 is only ~20GB. I thought it was gonna be far worse. Is the template workflow in comfy fine?
>>
>>106467437
Neat. Did you use a cnet or i2i as well?
>>
File: 1731036954930137.jpg (1.75 MB, 1248x1824)
>>106467555
Nope. Chroma is just a good model for these styles and all sorts of weird compositions.
>>
>ComfyUI: v0.3.50
is it safe to update to 0.3.54? any noticeable regressions?
>>
>>106467437
This is garbage. I get it. The renaissance era was full of people finally figuring out how to paint, but larping as a classical fag as if you have this deep connection to the art world, especially when using AI, makes you look like a twat.
>>
>>106467584
Memory is still fucked; other than that, nothing really special one way or another.
>>
File: screenshot.1756852177.jpg (125 KB, 674x493)
>>106467584
0.3.54? we're on 0.3.56 now. to answer your question, it's perfectly fine.
>>
>>106467582
how did you upscale it? looks nice
>>
>>106467692
I just natively prompted it at this resolution. Chroma can easily stretch to nearly 2k without losing coherency.
>>
>>106467692
Chroma can rawdog 1080p
>>
File: elf hugger_00431_.png (1.48 MB, 824x1600)
I like krea.
>>
>>106467437
Yeah, I looked at his art and although he paints great details in many pieces, he also draws many with undefined features much like in this image
>>
>>106467433
>man I think we might be stuck with current models for a while, which would suck
If you're just doing big titty 1girls, how would it suck? The newer, bigger models are just slower and don't particularly make anime, even in different styles, better than noob/illus models. I can't imagine how a "better" model would actually improve the output much.
>>
File: 1729280580258672.jpg (379 KB, 1024x1024)
>>
>>106467753
> I can’t imagine how a “better” model would actually improve the output much.
prompt adherence + better handling of multiple subjects. If all you care about is 1girl standing in a basic pose, then yeah noob/illus will be fine for ages.
>>
>>106467542
https://www.reddit.com/r/StableDiffusion/comments/1myr9al/use_a_multiple_of_112_to_get_rid_of_the_zoom/
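
The trick from that link is snapping the output resolution to multiples of 112 so Qwen Image Edit doesn't zoom/shift the result; a quick helper (the rounding logic is illustrative, the 112 figure comes from the linked post):

def snap_to_112(width, height):
    snap = lambda x: max(112, round(x / 112) * 112)
    return snap(width), snap(height)

print(snap_to_112(1024, 1024))  # -> (1008, 1008)
print(snap_to_112(1248, 720))   # -> (1232, 672)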
>>
>>106467768
learn to regional prompt and inpaint
>>
>>106467845
Already know them. This isn't about me.
>>
>>106467845
learn that that's no replacement for a model that can do multiple subjects without bleeding
>>
>>106467902
and current-day AI isn't even safe from VAE loss, let alone autoregressive models, let alone a brain-computer interface reading your thoughts and putting them into pixels, and yet you can create 99.99% of everything you want all the same
>>
>>106467407
my niggas...
>>
>>106467635
I updated a day or two ago. It was fine.
>>
File: WanVideo2_2_I2V_00297.webm (220 KB, 1248x720)
GM
>>
>>106467845
>adding a turbo to VW golf will make it a porsche !!1!
>>
>>106468175
post an example change that's basically impossible with current models plus inpainting, regional prompting and controlnet tools
>>
File: blondyyy.jpg (289 KB, 1017x1240)
>>106468131
he seems like he smokes a LOT of cigs lately ;c
>>106465871
>founders
that fat brick won't even fit in my pc case (hotbox)
>>
the joke is that modern models are better at it without needing to mindbreak them with a dozen external tools
>>
Since there are some nice chroma users in this thread and I want to try chroma on my vramlet setup:
How well does Chroma run on 8gb vram + 16gb system ram?
Which version and quant should I use?
should I just go for the smallest possible quant, regardless of the newest version?

I'm guessing this is the right place to find it: https://huggingface.co/silveroxides/Chroma-GGUF/tree/main

There are so many versions. Are there big differences between them? I can see they are frequently updated.
>>
>>106468195
>this job needs to be done with an all in one tool instead of using whatever is best for the particular task because... it just does, ok!?
>>
>>106468228
Step 1: buy more ram
>>
>>106468228
Order more ram while you wait for https://github.com/nunchaku-tech/nunchaku/issues/431
>>
>>106468131
>Hercules going through his teen angst emo phase
Probably posted nudes of himself on /b/
>>
>>106468228
those are old versions. here are the "final" versions:
https://huggingface.co/QuantStack/Chroma1-Base-GGUF/tree/main
https://huggingface.co/QuantStack/Chroma1-HD-GGUF/tree/main

>8gb vram
oof
>>
>>106468228
No idea, but you should be using https://huggingface.co/rocca/chroma-nunchaku-test
At least until the official nunchaku Chroma drops

Alternatively you can try 4bit GGUF of Chroma Flash HD (you'd use the recommended 8 heun steps for that model)
>>
>>106468235
Yes? I mean by that logic might as well go back to the good old days of 1.5 and spend 20 hours inpainting one shitty pic lol
>>
>>106468285
>ad absurdum
npcs are incapable of logical thought
>>
File: hero_.jpg (1.07 MB, 2616x2000)
>>
>>106468285
>why can't i just attach a saw blade to my drill instead of HAVING to use a circular saw??? THIS TECH IS TRASH
>>
>>106468193
catbox?
>>
>>106468298
my boy Spyro grew up
>>
>>106468314
you think he puts his benis into her???
>>
>>106468296
luddites are incapable of looking ahead, so I guess we are at a stalemate lol. Hey moron, inpainting and regional prompting aren't magically going away because these models are getting better at prompt adherence; you can still jerk yourself using them, it's just that the base image will be made faster and need fewer edits.
>>
>>106468228
You could *probably* train Chroma in OneTrainer using nfloat_4 for both model and text, but I don't know if the quality drop would be worth it; that said, with 8gb vram you will have to accept a quality drop.

If you use OneTrainer, you don't download any special quantized model; you select quantization (in your case nfloat_4) in the trainer's model section and it will quantize the full model on the fly.

OneTrainer has a 8gb vram Chroma preset, but I doubt it can run with just 16gb system ram.
>>
>>106468334
i never said the models shouldn't get better, nor that they won't, even to the point where one day most things will be automated and it might as well all be "1 thing" or even mostly just 1 ai model
i argued against the cope that, with all the modern tools we have even today, like regional prompting and good inpainting uis, you can't do things like multiple subjects without bleeding

and given that the retard cope continued without being able to actually post a single example where this is impossible >>106468185
i accept your concession. don't bother coping further without posting proof, luddite retard
>>
>>106468264
>>106468265

What's the difference between the base, the hd and the nunchaku versions?
I want an all-round model that's as versatile as possible with the most context awareness possible.
>>
Alright, why does Comfy load the face detailer model as soon as the workflow starts, even though it's only used after the first inference is done?
Why doesn't it just load it sequentially?
>>
>>106468399
>Nunchaku is a high-performance inference engine optimized for 4-bit neural networks, as introduced in our paper SVDQuant.
best vramlet cope tier for fags like you with no vram, wait for this to support chroma, then
>between the base, the hd
use hd
>>
>>106468399
The Nunchaku one is an outdated hack and not really for general use. Since you're vram limited I would suggest trying lower resolutions (768 and below) with Base. HD loves blurring outputs but you can try it too if you want.
>>
>>106468399
base is 48 epochs trained at 512 resolution

hd is those 48 epochs + a lot of epochs trained at 640, 768, 1024, 1152 resolutions as well

nunchaku is a quantization method that yields the overall best results compared to other methods, and it's fast since it's q4
>>
>>106468422
>>106468458
>>106468462

ok. thanks.
And then I need to use one of the text encoders, and this vae, right?
They're linked from the official lodestones repo: https://huggingface.co/comfyanonymous/flux_text_encoders/
and the vae from here, also linked from the official lodestones repo: https://huggingface.co/lodestones/Chroma/tree/main
>>
>>106468458
Yes, good point, the 'nunchaku' version here is NOT a real nunchaku quantization

That will happen once they're done with the Wan nunchaku
>>
>>106468470
Yes, you need the Flux vae and the t5xxl llm; typically they are named:

ae.safetensors
t5xxl_fp16.safetensors

There's a t5xxl_fp8_e4m3fn.safetensors you can use instead if you don't have enough vram
>>
>>106468470
The T5 encoder is fine. You might have more luck with an fp8 version since it's smaller.
>>
>>106468495
>>106468507

perfect. thanks a lot guys. that's exactly what I needed to hear.
>>
>>106468511
Np, let us know if something fucks up.
>>
>>106468333
In her butte
>>
File: WanVideo2_2_I2V_00299.webm (1.18 MB, 1248x720)
>>
>>106468507
>>106468470
Do NOT use scaled text encoders.
>>
>>106468550
What happens if you do?
What's the difference?
>>
>>106468554
It makes the model more retarded. Offload it to CPU in the loader node if you must.
>>
>>106468565
how much difference are we talking?
>>
>>106468577
yes
>>
>>106468577
Also I can recommend the GNER fp16 version.
https://huggingface.co/wikeeyang/GNER-T5-xxl-encoder-only/tree/main
Avoid FLAN since there's some schizo myth circulating around that you should use it, but it's so bad.
>>
>>106468577
there is no significant difference for qwen's TE at least
>>
>>106466715
Oh no!
>>
File: 1725766727174114.png (290 KB, 1215x590)
>>106467162
>Jpeg doesn't preserve metadata
yes it does
>>
>>106464276
newbie here, so I managed to get the wan22 i2v workflow working, but how do you modify it to include a form of reference image?
>>
>>106468663
https://github.com/kijai/ComfyUI-WanVideoWrapper/tree/main/example_workflows

https://github.com/kijai/ComfyUI-WanVideoWrapper/blob/main/example_workflows/wanvideo2_2_I2V_A14B_example_WIP.json

Just copy from the examples.
>>
File: kroahmah.jpg (108 KB, 1106x890)
Is this where the hidden chroma gems truly are? Anyone tried any of these? If I can make even wonkier stuff, I'm in
>>
>>106468577
It's minor, but it's there. Some people are anal about it though. Your ram is low, which is why I suggested the fp8 version. If you want to try the fp16, go ahead though. There will be a node in comfyUI that says "load CLIP". Make sure it's set to CPU in the device section, otherwise you might run out of memory.

>>106468599
Also true. Gner is worth experimenting with, but flan is just t5 but worse.
>>
>>106468704
Looks a bit like Adam Hughes' style?
>>
>>106468735
>but flan is just t5 but worse
I thought flan was supposed to do better on text?
>>
>>106468734
no. those are all models meant for testing. look at the dates. they are all old as shit
>>
>>106468734
These are mostly abortions.
>>106468763
Who tf uses chroma for text?
>>
>>106468763
It's possible. I don't ever really generate text though so I wouldn't know.
>>
>>106468599
>>106468645
>>106468735
thanks
>>
>>106468779
>Who tf uses chroma for text?
Not me, but I'm sure there's some weirdo out there who does
>>
>>106468399
Nunchaku is basically almost q8 quality (or q8 if you have a 5090), at the size of q4 but 4x faster than a regular q4. If you don't know what these quants mean: q8 is a gguf quant that is almost lossless at half the size of the fp16 model (though not necessarily faster if you can fit fp16 into memory), and q4 is half of q8 again, which means a larger quality loss but is about as far as you can shrink before you start losing too much quality; it is also not faster than fp16.
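
Back-of-the-envelope math for those sizes (the bits-per-weight figures are rough assumptions; real GGUF files carry extra overhead):

def approx_gb(params_billions, bits_per_weight):
    # params * bits / 8 bits-per-byte; the factors of 1e9 cancel out
    return params_billions * bits_per_weight / 8

for name, bpw in [("fp16", 16.0), ("Q8_0", 8.5), ("Q4", 4.5)]:
    print(f"20B model at {name}: ~{approx_gb(20, bpw):.0f} GB")
# fp16 ~40 GB, Q8 ~21 GB, Q4 ~11 GB: Q8 about half of fp16, Q4 about half of Q8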
>>
>>106468754
It's a Jubilee lora but all I have prompted is Joel Jurion and Traditional medium
>>
is this what people do at mardi gras?
>>
File: AnimateDiff_00110_.mp4 (2.94 MB, 1280x720)
someone at the wan dataset team REALLY likes blue archive
to the point it pollutes all high res anime style gens
>>
File: korma.jpg (14 KB, 859x80)
>>106468770
>>106468779
I will set sail for the seven epochs
>>
>>106468845
I see, nice style
>>
File: Radiance.png (1.62 MB, 1072x1072)
>>106468898
That's a special model experiment that uses pixel space instead of a VAE; you need a special hacked version of Comfy to try it since it's still training

I would wait until it's done unless you really want to try bleeding edge stuff, that said it seems to be progressing quite nicely
>>
File: WanVideo2_2_I2V_00300.webm (752 KB, 1248x720)
>>
>>106468930
The stupid furfag should leave bleeding-edge research stuff to actual data scientists and just train the inevitable qwen model already. I'm tired of people doing the mental gymnastics to convince themselves this is anything more than a waste of time.
>>
File: 388416.mp4 (3.52 MB, 832x1056)
>>
>>106468930
>you need a special hacked version of Comfy
Seriously? Well that's no fun...
>>
>>106468948
not bad this one
>>
>>106468954
>people doing the mental gymnastics into convincing themselves this is anything more than a waste of time.
It is, but it is a waste of his time, and it's very likely he is an actual researcher. Why do you want him to finetune qwen, anyway? The person with the relevant nsfw photographic dataset is a bigasp guy, not lodestone.
>>
>>106468957
I've got the nude version of this BTW.

https://files.catbox.moe/xtbq2u.jpeg
>>
>>106468856
looks accurate to me, only it's missing people dressed as crows
>>
File: 1733542626615002.gif (569 KB, 400x266)
>mfw all the jumpscare ai videos coming in october
>>
>>106468957
The unaltered Tifa image is a benchmark for genning 3d pinups, I want to get a good look like that in Illustrious
>>
>>106468954
>inevitable qwen model
Qwen is too slow and seemingly trains too poorly to be of any real interest; you'd be better off finetuning t2i Wan, but even that will be slow and EXPENSIVE

I doubt it will happen, but we'll see
>>
>>106468948
kek, based
>>
>>106468987
>>106469081

Quit replying to bait, you knuckleheads
>>
what is this weird miku cult
>>
>>106469131
But I... yes, you are likely correct
>>
>>106469132
Why are you new?
>>
>>106469149
i dont care about miku. i dont get the appeal
>>
>>106469081
>trains too poorly
?
>>
>>106468930
what's pixel space
>>
>>106469132
It's not a cult.
>>106469155
>i dont care about miku. i dont get the appeal
I'm so fucking tired of this internalized unconscious transphobia from newfags.
>>
>>106469182
People threw their bs XL settings onto the model and decided it didn't work and ignored multiple people who have made great LoRAs for it already. So same as early flux days lol
>>
File: 1750315798740291.png (591 KB, 1125x900)
>upload girl to nano banana
>"change her shirt into a different shirt"
>it massively downsizes her tits from a C cup to an A cup
why
>>
>>106469218
Do you feel safe? The girl you were editing was clearly very unsafe for your health. Good guy google.
>>
>>106469218
it takes your gen's breast milk as payment. they have a full data center for AI breast milk
>>
File: 1747324221633417.png (602 KB, 1024x1024)
>>106469218
(and if you're curious why this is in the local diffusion general, it's because it's a reminder that corpo models, even when smart, are still retarded, because the industry leaders will make them retarded before you get to use them)

>>106469228
lol. god forbid she walks outside with that unsafe amount of fully clothed breasts in regular daywear
>>
>>106469182
Having done a few Qwen lora training tests: it's hard to get good results when you try new concepts that are largely unfamiliar to the base model, including just different art styles. I've been hearing the same from other people who also train; it's most likely due to the model being very overtrained.

Also it is very slow to train, meaning few will experiment to find the potential 'best settings', but perhaps they will emerge.
>>
File: WanVideo2_2_I2V_00301.webm (830 KB, 1248x720)
>>
>>106466582
>>106466642
Hero of the thread
>>
>>106469218
>Using Google AI
>Why is it censoring everything?
...
>>
>>106469287
still less censored than chyatgpt
>>
>>106469287
Sure, but that's not an outright rejection. Nor is there obscenity in the prompt or the image. It's just wild to me the lengths these people will go to. And for what?

I don't know how else to put it. It's wild that a sfw image is censored into a sfw image, completely randomly.
>>
>>106469258
It's kind of insane that you can generate this stuff on local

Yes, it loses likeness as the video progresses, but still, it's impressive; also, continued likeness could be improved by training a lora of the show
>>
>>106469294
lowest bar ever
>>
File: migu.mp4 (1.9 MB, 1280x720)
>>106469155
y-you don't like miku, anon?
>>
>>106469132
Asch effect
- Normative social influence
(fear of rejection in a social group)
- Informational social influence
(if everyone does the ritual, it must be right)

think of it as an endless cycle of insecure newfags. they see it here and adopt it. Oldfags leave, new ones come, and so the continuity is created.
>>
>>106469303
yeah thats what im saying
>>
File: AnimateDiff_00278.mp4 (2.77 MB, 720x960)
>>106468654
>>
File: Screenshot_.jpg (81 KB, 470x457)
>>106469258
"holy shit it's sonic's wife"
>>
>>106465845
>what was their endgame?
I am guessing they want to switch over because Lumina 2.0 came out and blew what Lumina-Next did out of the water, and to solve the remaining issues they probably encountered in the background. It's not like they disappeared, but they have gone dark since late last year. Their public domain dataset is still up, so even if they are gone, we still have a good dataset from them for training a foundational diffusion model that doesn't produce slop.
https://huggingface.co/datasets/Spawning/PD12M
>>
>>106469378
>first she has to suck a penis
>then she drops BB-8
>then BB-8 runs away from her
Man, it's been a rough first day of school for Jenny today.
>>
>>106469378
That didn't look like an accident...
>>
>>106469469
That's based on a real chick?
>>
new
>>106469492
>>106469492
>>106469492
>>106469492
>>
>>106469477
?
she's just surprised it came to life
>>106469487
yeah
>>
>>106469503
LMAO hope she doesn't see the stuff that menace with the oral insertion Lora was going around with. Is she an actress or something?
>>
>>106469551
no, she's a jewtuber. Jenny Nicholson
>>
>>106469571
>jewtuber
Ok I’m interested
>Jenny Nichols is a video essayist…
Boner killed. Gay.
>>
>>106469630
i just checked her page and she uploaded once in the past three years. this general is actually the premier source of Jenny content now lol
>>
>>106469571
>Jenny Nicholson
The likeness of your LoRA is very impressive, anon. I'm kinda into her now.... I might have to... I dunno... gen her without her clothes on.... maybe use the facialinsertion LoRA.... I dunno we'll see...
>>
>>106469253
I also agree with this. I haven't had much luck training styles, although I'm noticing now that higher dim values than expected are producing better results.
>>
File: ComfyUI_16567.png (3.08 MB, 1200x1600)
>>106469551
>Is she an actress or something?
She's the cutest girl in the whole wide world!

>>106469666
>this general is actually the premier source of Jenny content now lol
Jenny's super-burned out atm (you can hear her almost wanting to cry at getting nothing done) and struggling to produce a "short" video about one of her favorite bad movies. No timeline for that; my guess would be around Halloween, but she completely gave up on the Barbie movie ranking video she was working on up until July. So even her current project isn't a sure thing.
>>
>>106464677
nice
>>
>>106466582
nubcake here, is inpainting in comf actually worse?


