Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106703056

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: 1755413118895148.png (75 KB, 745x368)
>>106706459
>>style change is harder in new qie+
>that's what happens when you want to save a model with finetunes; at some point you're trying too hard and the model starts to lose some of its concepts. that's why pretraining is always the most important part: if the base model is too weak, it's already over
chat is it true?
>>
>>106706502
2nd for MORE MOTOKO!
>>
File: 1757735398482959.png (2.51 MB, 1328x1552)
>>106706512
you get a free lain instead
>>106706509
yes, i'm back to shitposting with old QIE desu
>>
>>106706509
yes. go a few threads back in the archive and you'll see anons confirming it with gens.
tl;dr: it understands some concepts/objects now but lost art understanding.
>>
>>106706509
it's all right, Tencent will save us with their own edit model that'll be released in a month
https://youtu.be/DJiMZM5kXFc?t=18
>>
Reminder to use the v2 of the lightning lora for qwen edit. It retains the original image's quality and style better overall.
>>
File: 1727468334517808.mp4 (3.99 MB, 1920x1080)
https://xcancel.com/Alibaba_Wan/status/1971485743194484880#m
lmao, Wan 2.5 can edit images, and I'm sure that one is way better than the gigaslopped QIE shit
>>
>>106706583
it fucks up the edit capabilities thoughever bait
>>106706608
100% api only, sad
>>
again
anon, do you know these models and can you share a comfy workflow?
https://huggingface.co/ShinoharaHare/Waifu-Inpaint-XL
https://huggingface.co/ShinoharaHare/Waifu-Colorize-XL
>>
Do I have to run sage attention nodes for wan 2.2 workflows or can I just use a command-line flag?
>>
>>106706625
it's illustrious-based so just check the 1girl guide in the op. you're welcome retard
>>
>thoughever
put your trip back on

>>106706608
why call it a video model at that point.
>>
>>106706633
for wan 2.2 just the command-line flag is enough
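e.g. (assuming a recent ComfyUI build, the flag names may differ on older versions):

[code]
# exact attention (lossless), needs flash-attn installed:
python main.py --use-flash-attention

# global approximate INT8 attention, a bit faster:
python main.py --use-sage-attention
[/code]

iirc the KJNodes "Patch Sage Attention" node does the same thing per-workflow if you'd rather switch on the fly like the other anon.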
>>
>>106706633
I find nodes to be better, so I can switch on the fly
>>
>>106706633
keep sage off. it's not worth the 5% speed increase.

>>106706635
they aren't simple models like that, retard.
>>
>>106706623
>100% api only
only for preview
>>
>>106706644
Great.

>>106706646
>>106706654
Why switch? Isn't it purely performance?
>>
>>106706608
look at the 39 sec mark, there's zero zoom-in issue, the image stays the same and only the cat changes, kek they're really fucking with us and they're only releasing their failed scraps locally
>>
>>106706658
no, flash attention is lossless, sage attention is NOT lossless (it quantizes parts of the attention computation to INT8, so outputs shift slightly)
>>
>>106706656
>being that high on copium
>>
>>106706658
anecdotal and i don't care if you believe me but turning it off had a positive impact on video gens.

barely matters for sdxl but qwen also requires it to be off so i just keep it off all the time now.
>>
>he fell for the "it's going to be cloud only" bot brigade
>>
File: 1755170109268766.jpg (395 KB, 1428x1922)
https://xcancel.com/bdsqlsz/status/1971448657011728480#m
>4x qwen image
it means an 80b model, and it's consistent with what that chink is saying in that video (80b)
https://youtu.be/DJiMZM5kXFc?t=204
it's over...
>>
File: 1740381669519705.png (209 KB, 640x639)
>>106706693
so you're telling me they need an 80b to output this slop? lmao, China has lost the plot, instead of going for a higher-quality dataset untainted by synthetic slop, they went the API route, MOAR LAYERS
>>
>>106706635
ty it was helpful
but call it maximized laziness :>
>>
>gm
>>
>>106706662
>>106706676
Now I have to try it.

I just tried running q8 in high noise and fp16 in low noise. Didn't see much of a difference. Strange.
>>
>>106706668
https://youtu.be/IhH7gDDPC4w?t=50m58s
>>
>>106706763
at no point did he say that they'll release the complete model
>then we will complete next version, wan 2.5 without the preview
he said "complete" not "release"
>>
File: lmao.png (609 KB, 640x640)
>>106706693
>80b for this
https://www.reddit.com/r/StableDiffusion/comments/1nqm5l0/images_from_the_huge_apple_model_allegedly/
>>
so anon, what kind of stuff have you been making with the edit models?
surely only wholesome memes that are family friendly, right?

> makes everyone pregnant
ANON STOP WHAT ARE YOU DOING
>>
>>106706608
>Alibaba have their own good edit model and won't release it
>>106706693
>Tencent went for LayersMaxxing and their shit still looks like pure slop
lol it's so over dude
>>
>>106706801
Hello, /adt/ repost
Impressed by the new NetaYume v3, in my opinion it's on par with current SaaS models in the anime field. Same prompt, but for copyright reasons I used character traits instead of names as shown in pic related.

I would like to do this with a Chroma anime checkpoint. Does anyone have a ready workflow to import and test?

My Workflow: https://files.catbox.moe/84cdwx.png
>>
>>106706853
Making aliexpress-tier pics for online stores
>>
>>106706484
>CumshillUI still in the OP
>>
>>106706891
that sounds tedious. i hope you're getting that bag anon.
>>
File: 10000 dollars!.png (856 KB, 1977x1442)
>>106706693
So you need an RTX PRO 6000 (96 GB) to run this shit on Q8? kek, what's the point of releasing it at all?
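napkin math, weights only (GGUF-ish effective bits per weight, rough; text encoder and activations add several GB on top):

[code]
params = 80e9
for name, bits in [("bf16", 16), ("Q8", 8.5), ("Q4", 4.5), ("Q2", 2.6)]:
    print(f"{name}: ~{params * bits / 8 / 1024**3:.0f} GiB")
# bf16: ~149 GiB | Q8: ~79 GiB | Q4: ~42 GiB | Q2: ~24 GiB
[/code]

so yes, Q8 weights alone just about fill the 96 GB card, and even Q2 barely squeezes onto a 24 GB one.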
>>
>>106706880
sooooooo not local?
>>
>>106706693
Will this be the biggest local image/video model ever? If I remember correctly, the biggest one before that was step video (30b).
>>
Nvidia is capable of making their GPUs' VRAM plug'n'play with upgradeable and affordable VRAM. But they won't do it because they have a monopoly and wouldn't profit as much from it.
>>
File: 1752767022929126.png (2.18 MB, 1728x1344)
>>
>>106706788
Well, if this can be effectively quantized without losing too much quality and it trains well, it could still see good adoption, but those are really big IFs

Even with good quality quantization now being available, Qwen adoption has clearly been hampered by its size and slow generation
>>
File: file.png (1.76 MB, 896x1152)
>120s to generate an image
how do fluxlets do it? it seems a terribly inefficient way to iterate.
>>
>>106706919
calling this a gaming gpu is insane work.
>>
>>106706943
>Well, if this can be effectively quantized without losing too much quality
if you want to run this on a 24gb vram card, you'll have to do a Q2 quant, and Q2 is unusable
>>
>>106706920
Neta Lumina is local; it's good news for local anime models.
>>
>>106706944
this qwen?
>>
>>106706938
>Nvidia is capable of making their GPUs' VRAM plug'n'play with upgradeable and affordable VRAM.
the speed is important too, even if your 3090 had 100gb of vram and could run this, it would still be slow as fuck since it has to compute all 80b parameters
>>
>>106706938
Well, despite being three years into the AI boom, their competitors are still sitting with their thumbs up their asses = no competition

...
>>
>>106706944
it's so noisy, like the denoising process hasn't been finished, what model is this?
>>
>>106706880
those SaaS models are slopped at anime. please do a comparison between neta, noob, and novelAI, which is the only good API anime model
>>
Bros.. the 4090 won't fit at all. Even with the different mounting types.

Would running it via one of those external boxes be worth it?
>>
>>106706978
>their competitors are still sitting with their thumbs up their asses = no competition
you don't betray your family anon
>>
>>106706989
would you. same thing
>>
>>106707003
I wouldn't yeah
>>
>>106706987
Did you buy the card without checking if your mobo/case has enough clearance?
You could use a riser and place the card on top of your PC, other than that I don't know.
>>
can someone make an Erika Kirk lora?
>>
>>106706987
just get a cheap case which fits
>>
Post gens fags.

And I can't right now I'm at work.
>>
>>106707047
Make me bitch!
>>
>>106707047
/sdg/ is that way nogen
>>
>>106706526
Cyber-lain
Fishnets go with everything
>>
>>106706693
I would legit buy an expensive card if this giant model was at the level of Seedream, but it's not the case at all, it's still the same slopped shit you see on your regular model, the fuck are they doing?
>>
>>106707018
No, I upgraded to a 5090 and didn't plan on using the 4090 as well. But then the topic was brought up and I got interested.

>>106707024
I have one of the largest cases, the evo xl. It might fit if I stop using push/pull on the AIO. But that'd be fully diy.
>>
>>106707096
>blurrydream
Just put a grain filter on top of all your images, boom, you got yourself blurrydream at home.
>>
>>106707096
those guys have insane compute and they're wasting it on moar layers and moar synthetic slop, it's so sad when you think about it
>>
>>106706982
What you say is valid, but the thing is, Noob and Illustrious are both based on tags. How can I fairly compare a prose prompt on Noob and Illustrious?
>>
>>106706908
>>106706891
This, why not just take pictures of the actual product?
>>
Moar layers has yet to be debunked tho
>>
>>106707128
it's 4x the size of Qwen Image, and do you seriously believe the image looks 4x better? >>106706788
>>
>>106707143
You do understand image models are judged on things other than aesthetics, right?
>>
>>106707143
>it's 4x the size of Qwen Image
and 6.66 times the size of Flux dev, the devil is with us dude
>>
>>106706987
just get a riser retard
>>
>>106707161
go on anon, show us how those images are objectively better than what we can do on Qwen Image? >>106706788
>>
>>106707161
who fucking cares about anything besides aesthetics? its whole purpose is to make pictures, if the pictures it makes look shit what's the point?
>>
>>106706772
???
>get asked "hey why is this model closed when everything's been open from you guys"
>response "we've done big changes to the model so in the meantime we'll give you guys a preview model for input/feedback that we can use to iron shit out"

if it wasn't going to be open he would've just said some bullshit like "model too big" instead of explaining in engrish the purpose of the preview
>>
>>106707179
You can fix aesthetics, you can't fix dogshit prompt following or anatomy. Seriously, do we have some sort of influx of retards in the image gen sphere recently?
>>
>>106706788
>81.3 times bigger than SD1.5
>22.8 times bigger than SDXL
>6.66 times bigger than Flux
>4.7 times bigger than HunyuanImage 2.1
>4 times bigger than Qwen Image
>>
>>106707175
I would if it was released. All I'm saying is judging STRICTLY on aesthetics is idiotic. And I'm an aesthetics fag, trust.
>>
>>106707121
noob does have some NL capability. Or, just do a comparison that's mostly tags, or leave noob out
>>
>>106707179
B-but the green ball sits next to the blue box on top of the yellow rectangle, also the text is correct!
>>
>>106707195
>I would if it was released.
there's plenty of images already publicly available, just look at them and explain to the class how much superior they are compared to the smaller models
https://www.reddit.com/r/StableDiffusion/comments/1nqm5l0/images_from_the_huge_apple_model_allegedly/
>>
File: file.png (1.52 MB, 896x1152)
>>106706969
>>106706979
Flux,
https://civitai.com/models/1961797/srpo-refine-quantized-fp16-forge-compatible?modelVersionId=2220553

and a stack of loras, but these being the most prominent

https://huggingface.co/Alissonerdx/flux.1-dev-SRPO-LoRas/blob/main/srpo_128_base_R%26Q_model_fp16.safetensors

https://civitai.com/models/1253380/phone-quality-style?modelVersionId=1413027

I was soliciting hints about better workflows available out there, I'm still in the honeymoon phase of trying things out
>>
>>106707193
>you can't fix dogshit prompt following or anatomy
yes you can, it's called inpainting and manual work. are you incompetent? how do you fix a dogshit looking gen?
>>
>>106707210
>it's called inpainting
if you want to inpaint, just use an SD1.5 model bro, you don't need an 80b model to get anatomical errors
>>
>>106707206
It doesn't matter until anon can run his usual autistic tests. Every base model newer than XL is slopped but you don't see anon posting SD1.5 do you?
>>
>>106707210
That's literally what a LoRA is for like dude wtf?
>>
>>106706025
>>106706046
Sadly I don't think it's possible with the nodes we have, but I don't see why it wouldn't be feasible.
>>
>>106707223
>you don't see anon posting SD1.5 do you?
/sdg/ exists for that no?
>>
>>106707223
*newer than SD1.5
>>
>>106707223
again, what's the point of an 80b model if it doesn't offer something better than Qwen Image? those images look exactly the same as regular Qwen Image output, what's the point?
>>
>>106707193
>Seriously, do we have some sort of influx of retards in the image gen sphere recently?
They are either new, retarded, or being purposefully obtuse. I can't tell which desu.
>>
>>106707227
making a lora for an 80b model is going to be quite pricey
>>
What's up, /ldg/!

Last week was an absolute whirlwind. Thanks to a happy little accident, we did our first-ever YouTube livestream!

That means the raw, unfiltered VOD was up instantly. But for those who want the polished version, we just dropped a brand-new edited cut today.

Get ready to level up, because today we're diving deep into the art of compositing. We'll be breaking down killer techniques and workflows for SDXL, Flux, and even bleeding-edge models like Nano B.

>Now, I need your help deciding the future:

Want more raw, unfiltered livestreams? Reply with <3

Prefer the tight, info packed edited videos? Reply with :^)

Can't wait to see you there!

https://youtu.be/jmIbIIA9Qmc
>>
>>106707193
>You can fix aesthetics
did anyone fix Flux aesthetics? it's been more than a year and we're still waiting lol
>>
>>106707174
I don't think you understand the sizes at play.
But yes, the riser method is needed if they are to fit at all. I'd have to do it for both and loop it all around and diy a mount for both of them.
>>
>>106707248
Is this your first time seeing something new pop up on the jeeterboard? You're sperging out like the GAE is taking away your GPU. Just chill until it's out.
>>
>>106707227
no one is gonna run an 80b model, if you want to fit that on a 3090, the best you'll be able to do is Q2, do you know what Q2 looks like?
>>
File: 1755972308492242.png (476 KB, 890x594)
>>106707263
>>
>>106707265
you're the one sperging out about how "stacking more layers makes shit automatically better bro, still waiting for the debunk bro", you are so fucking retarded >>106707128
>>
>>106707273
moebros, ramtorchbros what did this ramlet mean by this?
>>
>>106707287
>he wants to calculate 80b parameters with ram offloading
it's gonna take an hour to make a single image with our current gpus lmao
>>
>>106707284
Sure, I'm the one sperging kek. Keep telling yourself that
>>
>>106707127
Product in a cool setting or looking shinier sells better than an actual picture of the product.
Sad but true.
>>
>>106707293
>blud doesnt know what MoE models are
how new are you?
>>
>>106707300
it's not a MoE model though
>>
>>106707300
where's the moe model?
>>
>>106707276
Yes nigger, I know what a riser is. We're talking 2x 4slot gpus, not one.
>>
>>106707300
who said it's a MoE model anon?
>>
I got a natural riser with all the bouncing boobs gens I made last days
>>
I wonder how long 1 step with a 80b image model would take
>>
>>106706788
I don't understand them, they created SRPO and they're not using it on that giant model? fucking why?
>>
yall mind if i up and wildly speculate thoughever
>>
File: dmmg_0072.png (1.4 MB, 832x1216)
>>106707209
this is 60s on a 3090 with the fp8 model. drop your prompt and i'll make you a workflow.

your setup sucks man, mine is pure spaghetti right now (controlnets), but even the default workflow can do some good stuff. Why are you using SRPO?
>>
Someone still believes that bigger params = better model? Damn.
>>
>>106707317
>>106707307
>>106707306
my mistake, i didn't follow the conversation 7 replies back being about a specific model and just replied to the general statement i saw:
>no one is gonna run an 80b model
>>
File: 1754524140432055.png (2.71 MB, 1728x1344)
>>
File: 1741020717067493.png (448 KB, 972x1653)
>>106706693
you know what? now is the time to pray that what this furry fuck said about "seemless" offloading is true lol
https://xcancel.com/LodestoneRock/status/1968976389807161515#m
>>
Thread of poorfags with minuscule compute
>>
when will temu release a model
>>
>>106707256
buy an ad faggot
>>
>>106707273
>do you know what Q2 looks like?
Bigger models quantize better, even Q1 might be fine. We'll just have to wait and see (though the full precision results don't exactly inspire interest).
>>
>>106707256
fuck OFF
>>
>>106707407
>We'll just have to wait and see
No... That's too rational. We MUST sperg out right here right now.
>>
SDXL = clay
Flux and higher = metals
the average consumer = stuck before the copper age :(

pls Nvidia
>>
>>106707423
>No... That's too rational.
Oh yes, the rationality that consists of looking at images from an 80b model, noticing that they are not much better than those from a 20b model (and even seem more slopped), and continuing to be enthusiastic about it, Chang, please.
>>
>>106707256
This is why you need to aggressively tell faggots like ani to fuck off.
>>
File: file.png (252 KB, 391x815)
oh yeah it's qwentime
>>
>>106707447
anthro her
>>
File: WAN2.2_00064.mp4 (2.24 MB, 960x544)
>>
>>106707443
Oh shit you have an advanced copy of the model? Leak it anon!
>>
>>106707464
>Don't look at the images bro, they don't mean anything bro, they're just the outputs of the image model after all, and you should never draw conclusions about the quality of an image model by looking at images.
(You)
>>
So after trying native wan context I think it's just busted for I2V. Their example workflow for sliding context shows the frame count on the context node being 81 and the total frame count at some number in the hundreds, but trying to do that with the wan image to video node gives tensor size errors. Has anyone gotten wan sliding context to work with i2v WITHOUT setting the frame count on the context nodes equal to the total number of frames?
>>
>continues sperging
>>
File: 00001-2225156179.jpg (795 KB, 2048x2480)
>>
File: it's so over.png (91 KB, 277x182)
>>106706693
You can press generate now, and by the time your first image is finished, WAN2.5 will have become open-source.
>>
>>106707351
appreciate the honesty, I'm honestly just throwing stuff at it to see if something works.

here's your slag prompt, probably using the wrong syntax
https://pastebin.com/raw/chw9aZPv
>>
File: comfyui____0009.png (1.75 MB, 896x1216)
>>106707525
>https://pastebin.com/raw/chw9aZPv
i gotchu senpai, gimme ten minutes
>>
File: ahahah.png (1.78 MB, 1396x1603)
>>106706693
can you feel the power of an 80b model anon? those are next gen images that's for sure 1!!1!1!
>>
File: file.png (862 KB, 1808x692)
>>106707455
didn't expect it to work
>>
I still have no idea how to use Qwen Edit
>>
>>106707564
they turned flux into an 80b model
>>
that anon is totally not sperging out guys, can't you tell?
>>
>>106707584
you don't have to, it's so slopped and nano banana destroys everything on the edit space
>>
>>106707589
FUCK YOU!!
>>
If nobody is aware of it the faggot is often arguing with himself typically when anon posting so ignore all of it and don't take the bait
>>
size is all that matters.
>>
>>106707265
>sperging
>>106707297
>sperging
>>106707499
>sperging
>>106707589
>sperging
that's a bot right?
>>
>>106707584
so sorry for your loss.

>>106707564
oh my god it even has flux chin.
flux is a curse that keeps on giving.
>>
>>106707605
I'm forced to distill my shit because it won't fit in hers. Sadge :(
>>
>>106707564
still looks weird.
>>
there aren't enough newfags here to fall for your antics KEK
>>
File: comfyui____0012.png (1.55 MB, 896x1152)
>>106707525
an output is attached. i'll check back in an hour or two if you have questions about anything in there. you'll have to attach your own lora nodes since this is just straight flux

workflow: https://pastebin.com/JgZEs7QQ
>>
>>106707623
I am not that guy but I take all bait, it's more fun that way
>>
>>106707128
It's an objective fact that most of the Flux layers do nothing, which means the model is not fully saturated. What happens when you start stacking layers is that the model can learn to skip them.
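you can probe this yourself btw, minimal sketch (the model/blocks handles are hypothetical, any DiT that exposes a block list works the same way):

[code]
import torch

@torch.no_grad()
def probe_layers(model, blocks, x, t, ctx):
    # skip one block at a time and measure how far the output drifts;
    # near-zero drift means the block is close to an identity mapping
    base = model(x, t, ctx)
    for i, blk in enumerate(blocks):
        orig = blk.forward
        blk.forward = lambda h, *a, **kw: h  # identity passthrough
        drift = (model(x, t, ctx) - base).abs().mean().item()
        blk.forward = orig
        print(f"block {i}: mean abs drift {drift:.4g}")
[/code]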
>>
>muh aesthetics
why is this an argument lmao
if an SD 1.5 model popped up and had prompt and concept understanding on par with gpt/nano we'd all be on it and just lora and upscale
>>
>>106707638
Thanks for making a good point instead of malding like the other guy
>>
>>106707645
>I love slop
Says no one, you're a Tencent employee doing damage control.
>>
File: file.png (1.1 MB, 1340x758)
>>106707575
qwen is horny
>>
>>106707645
good piggy
>>
>>106707648
>malding
you're not saying "sperging" anymore? I wonder why lmao >>106707607
>>
>>106707631
Man, that chin.. just whyy
>>
>>106707645
Morons with a "the grass is always greener on the other side" syndrome
>>
Remember his thread is dead, he can only necrobump it, and he craves interaction with this thread. He can't outwardly show himself because he knows he will be kicked out, so now he needs to anon post all day doing the same antics he did for years.
There is no need for brand or model wars, he's just mad that he's priced out and can't afford a new card because he never worked a day in his life. Just stop replying and you'll notice it will be just him replying to himself.
>>
>>106707660
it's so slopped it looks like a low res painting
>>
>>106707564
that bottom left image looks like qwen anime style
hilarious if they spent so much money on training an 80b model just to train on slop
>>
>>106707645
>>muh aesthetics
>why is this an argument lmao
can't tell if this is bait or retardation
>>
>>106707679
because the couch was a shit lowres image. i can try n tidy it up a bit. and slopped is the wrong term retard. but i get what you're saying.
>>
File: 00106-3827431113.png (584 KB, 512x640)
>>
>>106707699
>slopped is the wrong term
it is the right term you low IQ degenerate, look at the face of the girl, completely plastic and smooth
>>
Why are we arguing over this stupid shit, hone your fucking craft
>>
> anons insulting InvokeAI.
Sincere question: why are you doing this to Invoke and not to Comfy if they share the same business model?
>>
I'm tired of the pointless speculation over models no one ITT has access to is all. It happens every time.
>>
>>106707645
You can train an SD 1.5 tier model (~600m transformer model) with a 5090. Probably could do it with less than $1000 renting an H100.
>>
>>106707712
wow it's nearly like it's a blurry lowres mess.
post your gens anon, we're waiting to see how it's done properly.
you must kill yourself right now to death.
>>
>>106707723
Oh yeah, Tencent has always delivered good sovl model shit after all, why should we be wary of them now??
>>
>>106707723
and EVERY time the doom posters get proven right
>>
>>106707723
One retard rustles the cage and the rest of them jump in. There is zero reason to argue over shit you can't touch; it's like arguing over how good the pussy would feel over a Nun
>>
>>106707631
cheers, I'll give it a spin. Already looks quite a bit more elaborate than my adapted chicken scratch.
>>
>>106707731
Like when they said Qwen would end up like hidream? lmao
>>
>>106707729
>kill yourself right now to death
pleonasm
>>
>>106707731
And despite all this free time they have yet to create a diverse, well-captioned dataset.
>>
>>106707735
>like arguing over how good the pussy would feel over a Nun
fucking kekd
>>
File: that's right.png (89 KB, 618x640)
>>106707729
>post your gens anon, we're waiting to see how it's done properly.
I won't, unlike you I recognize the models we are currently using are slop machines, I'm not releasing anything until we get something good enough
>>
>>106707741
>Like when they said Qwen would end up like hidream?
who said that? Alibaba has been a highly trusted company since they released the Wan series
>>
>>106707741
qwen may be usable for what you want to use it for, but that does not mean it's not gigaslopped
>>
>>106707750
ah, skill issue. got it.
stop posting here lil bro, you're wasting space.
>>
>>106707731
>and EVERY time the doom posters get proven right
he's out of line but he's right
>>106707764
>skill issue
that's an actual skill issue -> >>106707660
>>
REMINDER:

most anons in this thread run sub-10gb cards
>>
the shittier the gen the angrier they get
>>
This thread reminds me of the time i morphed kate bush into a tiger on my pc in the mid '90s, it had MFM drives that needed a kick in the morning to spin up.
>>
File: 00012-1862212030.png (1.13 MB, 1024x1240)
>>106707750
Why are you attacking him when it's clear he's testing stuff?
I post shit I'm testing all the time, this is part of the journey. if there's context I'm missing please show me
>>
>>106707774
once HunyuanImage 3.0 is released, every guy that doesn't have a 96gb vram card will officially be called a vramlet
>>
>>106707774
i'm going to jerk off knowing i have a medium 32gb dick.

goodbye you fucking losers. stay mad.
>>
>>106707759
Here and rweddit during release. They called it flavor of the month and said we would return to the mighty flix/krea lmao. I did return those models to my recycle bin, that's for sure.
>>
>>106707789
>They called it flavor of the month and said we would return to the mighty flix/krea lmao.
desu, only Qwen Image Edit is worth a damn, and I stopped using it after the novelty wore off, it's just too slopped
>>
>>106701867
>>106707784
>96gb vram card
Lower your voice when you speak to me, you're brand new to the local meta
>>
>>106707741
trvth nvke
>>
>>106707735
You do know the images we're currently seeing are supposed to be high quality cherry picked images? This is Tencent telling you "look at what our model can do best!", they probably made 20 tries and chose the best one for each, does that scream "it's gonna be good" to you?
>>
File: 00107-2593381260.png (987 KB, 888x1008)
>>
>>106707741
>doom posters 18484141 - cope posters 1
doom posters sissies, how are we gonna cope with our only loss?
>>
>>106707784
Their example images don't justify the requirements. At that size I'd expect a model that produces extremely complex perfect scenes. Like a full Peanuts comic strip page.
>>
dis nigga never seen researcher gens before
>>
File: ComfyUI_00157_.png (3.28 MB, 1280x1920)
>>
>>106707833
In all fairness, 90% of model makers make shit tier gens when showing off model ability; still, people have a right to explore and see if they can get anything useful. You don't have to use the model anon
>>
>>106707461
nice
>>
>>106707837
I would have liked to hear that music.
>>
Qwen Image is good enough. I'm done getting hyped for new model releases, we should just focus on Qwen finetunes / controlnets / loras etc
>>
>>106707774
no i run an exactly 10gb card
>>
>>106707887
Small models are woefully underexplored. What the community should waste time on is a proper pretrained small model ready for finetuning on any mid-sized dataset.
>>
>>106707887
nah, I want something smaller, and without the VAE shit so that the edit doesn't introduce pixel compression
>>
>he doesn't seedmaxx
>>
>>106707887
false, qwen image is bloated and slopped.we should focus on building our own non-bloated, non-slopped model at 1/4th the size or less.
>>
slopped.we
>>
>>106707887
bruh, Qwen Image is barely better than Flux, and flux is almost twice as small
>>
>>106707918
Lol this is the biggest load of BS ever, Flux doesn't even compare
>>
>>106707887
i'm eagerly awaiting your finetunes
>>
>>106707900
Like lumina?
>>
>>106707913
we will never make our own model
>>
File: 00108-2628670652.png (1.03 MB, 888x1008)
>>
Based Koff.
>>
>>106706693
There's probably hundreds of ways to improve the model by adding novel training techniques or new architectures (or going for a serious unslopped dataset) but nahh, those mfs went for the "just stack more layers bro" meme, seriously...
>>
>>106707935
anthro her
>>
>>
>>106707929
lumina could possibly work but we need to incorporate more optimizations like EQ VAE and TREAD going forward. and lumina has pretty lame base styles

>>106707934
yes we will, look at this
https://huggingface.co/KBlueLeaf/HDM-xut-340M-anime
>>
>>106707955
>340M
no thanks, we already have SD1.5
>>
>>106707929
Lumina is shit because it uses an opinionated, likely censored text encoder. But yes, a ~2B model, maybe with something like the Qwen 0.6 text encoder, though T5 XXL is still king for being verifiably uncensored.
>>
anyone created voices? I am trying alltalk, it uses short voice samples to clone voices. i tried one sample with a latina accent, but alltalk makes her speak british. do different accents need different models?
>>
I'm starting to think Tencent is just incompetent, and HunyuanVideo was accidentally a kinda good model.

HunyuanVideo i2v was terrible, and dramatically changed the first frame. They hastily changed the implementation and released an updated model and just said "lol jk change your implementations and use this one instead" but it also had problems.

HunyuanImage 2.1 uses a VAE with too high of a compression ratio, which also caused problems with LTX and Wan 2.2 5b. They slapped on a refiner after the fact to cope, and also say you need to use their special snowflake guidance method. Refiner failed with SDXL, nobody will run it, it will fail here too. The model also has a bad license, is slopped as hell, and just worse across the board compared to Qwen.

Now HunyuanImage 3 is fucking 80b parameters, literally DoA, nobody can run it, not even an RTX 6000 Pro, and it's even more slopped and just looks like ass for how large it purportedly is.

If you've ever tried to read their training or inference code, it's a fucking mess. They never released the text encoder HunyuanVideo was actually trained with, same with HunyuanImage 2.1. Technically, we've all been using wrong text embeddings the whole time.

They have no idea what they're doing.
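For scale on the VAE point, here's how many pixels each latent position has to encode per frame (the 16x/32x factors are as reported, from memory):

[code]
for name, f in [("SD/SDXL VAE", 8), ("Wan 2.2 5b VAE", 16), ("HunyuanImage 2.1 VAE", 32)]:
    print(f"{name} ({f}x): {f * f} pixels per latent position")
# 8x -> 64 | 16x -> 256 | 32x -> 1024
[/code]

Every doubling of the spatial factor quadruples what each latent value has to reconstruct, which is why fine detail turns to mush and they bolt a refiner on top to cope.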
>>
>>106707971
vibevoice has been all the rage recently as far as open models go
>>
File: disappointed.gif (485 KB, 220x220)
>>106707955
>cheapest
nigga
>>
>>106707980
yeah for the stunt MS pulled
>>
>>106707969
Neta Lumina can do porn fine
>>
>>106707976
>They have no idea what they're doing.
yep, Alibaba is the one chinese company that could save us (if they one day learn that synthetic data is poison)
>>
>used to have the problem of not getting enough motion in my i2v
>now have the problem of too much motion

AAAAAA
>>
>>106707900
>>106707907
>>106707913
>>106707918
>>106707925
all vramlets btw
>>
what do we need saving from doe?
>>
>>106707961
read the paper, it's a proof of concept. if he can do that with ~$600 of compute, the community can EASILY train their own real base models.

Instead of the furry blowing >$150,000 on finetuning fucking FLUX SCHNELL, we could have had a SOTA community-funded fast model by now if we went the route of HDM. ultimately it's inevitable though.
>>
>>106708001
It really just seems vramlets are getting mindbroken day after day. Why doesn't the LLM field have this kinda bitching?
>>
>>106708009
>read the paper, it's a proof of concept. if he can do that with ~$600 of compute, the community can EASILY train their own real base models.
I wish Tencent had read that paper instead of going for gazillions of parameters lol
>>
>>106707969
>likely censored text encoder
never had an issue with this, it does what i tell it to just fine
>>
>>106708014
>Why doesn't the LLM field have this kinda bitching?
are you joking or something? when deepseek got released, the shitstorm was so intense they had to create a new general just for this specific model and appease the "giant models can't be considered local" group
>>
>>106708001
Yes anon, I like models that can be fully finetuned on consumer hardware.
>>
>>106708001
I use Qwen image all the time (fp8 scaled), it's still fucking bloated and slopped. LoRAs help deslop it though
>>
Bigma status?
>>
>>106708014
>Why doesn't the LLM field have this kinda bitching?
lol
>>
>>106708025
Do we have a model like that? Most XL models were tuned on a small to big cluster
>>
>>106707677
Ranfaggot is getting desperate for attention
>>
>>106707998
I'd rather have the second problem actually, I can slow down the video, while speeding it up reduces the total time..
>>
>>106708043
No because the people with compute are retards. For example, Chroma should've been trained from scratch as a 4B model.
>>
>>106708043
nta but you can do a full finetune of sdxl with a 24gb card iirc
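napkin math on why it's borderline (assuming the ~2.6B parameter SDXL UNet, bf16 weights/grads and fp32 Adam moments; activations come on top):

[code]
params = 2.6e9
weights, grads = params * 2, params * 2   # bf16
adam = params * 8                         # two fp32 moments per weight
print(f"~{(weights + grads + adam) / 1024**3:.0f} GiB")  # ~29 GiB before activations
[/code]

hence the usual tricks: 8-bit adam (moments drop to ~2 bytes/param), gradient checkpointing, paged/fused optimizers. with those it does squeeze onto 24gb.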
>>
>>106707976
>I'm starting to think Tencent is just incompetent, and HunyuanVideo was accidentally a kinda good model.
yeah, they seem to have learned nothing, they went down the right path and instead of keeping those solid foundations they went for something completely new and broken, that's not how you improve in this field at all
>>
>106707887
Autistic compulsion forces him to say the line again
>reiterates insult thrown at him
More wheelchairs it is then
>>
>>106708014
>Why doesn't the LLM field have this kinda bitching?
https://www.youtube.com/watch?v=H47ow4_Cmk0
>>
>>106708059
>>106708061
You can tune, sure, but will it be worthwhile? Is there any evidence that doing something like this has shown results? Only the big tunes are usable as far as I can see.
>>
>>106707976
>HunyuanImage 2.1 uses a VAE with too high of a compression ratio, which also caused problems with LTX and Wan 2.2 5b.
wan 2.2 5b VAE is worse than the 14b one right?
>>
I only want models I can finetune with a TNT2
>>
>>106708024
I mean to be fair that one is 670 billion parameters lol
>>
>>106708076
Are you stupid or something? I said he should've made a 4B model from scratch, which would've had sufficient expressive capacity as a base model while also being easy to train for other community members. And yes, finetuning is easier than making a base model as you can have a much more constrained dataset.
>>
File: 00109-3001379579.png (585 KB, 1008x888)
>>
>>106708076
>Is there any evidence that doing something like this has shown results
only for smallscale stuff like lora extracts, people use clusters because it's waaay faster. training on a few million images with consumer hardware is too slow
>>
>>106708009
what would a $150,000 base model get us
like comparable to what model
>>
>>106708095
>Will it be worthwhile?
>Evidence, shown results
Here just added some keywords to help you understand, I get hard to english in India.
>>
>>106708106
>what would a $150,000 base model get us
a small useless shit, unfortunately we'll always be dependent on giant companies like Alibaba and Tencent
>>
>>106708107
you get hard to english in india?
>>
>>106708109
grim.
>>
>>106708097
>>106707945
>>106707935
>>106707837
>>106707709
go back to sdg with your slop.
>>
>>106708107
Okay, you are retarded. The premise is small models are woefully underexplored. We, however, can use our brains when considering say the 340m HDM model, the 600m Pixart Sigma model, and extrapolate to the overly bloated Flux model. And we can ask a simple question: is Flux being 12B parameters 20 times better than Sigma? The answer is obviously no. Which means we can make the hypothesis that a model bigger than 600m and smaller than 12B could be quite good, especially considering SDXL which is decent despite using a shitty text encoder, shitty VAE, and shitty architecture. So we can make an educated guess that a properly trained 2B model would be better than SDXL. We can also make an educated guess that a 4B model would be much better than the 2B model.
>>
>>106707988
stunt aside, i have been using it with great success. i like making asmr voices (slower, more whispery) and while alltalk is very good at what it does vibevoice does it better.
>>
>>106708149
>Hey everyone spend your money on this shit I "think" will work over stuff we "know" works
Lol good luck with the tune bro
>>
File: kek.png (1.62 MB, 1459x1492)
>>106707564
>wait anon you don't have a 96gb VRAM card to render us? AHAHAHAHAH
>>
>>106708174
imagine standing outside that photobooth trying to get a passport picture quickly taken on the way to an important meeting and all you hear from it is autistic and retarded onions voices doing onions laughs
>>
>>106708165
>"know what works"
>spent $150k on a distilled model that underwent cope brain surgery
It's actually funny how ignorant you are about everything. Chroma wasn't "what works". And we do know what does work because people have done it multiple times. Any DiT model with text conditioning trained starting at 256px and progressing to 1K and 2K. HDM is literally a dick around project and proved without a doubt that the process is fucking simple.
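The progressive-resolution part in sketch form (schedule numbers are illustrative, not from any particular paper):

[code]
# most steps at low res where batches are cheap, high res only at the end
schedule = [(256, 0.70), (512, 0.20), (1024, 0.08), (2048, 0.02)]
total_steps = 500_000
for res, frac in schedule:
    print(f"{int(total_steps * frac)} steps at {res}px")
    # inner loop: sample batches bucketed at `res`, take optimizer steps
[/code]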
>>
>>106708188
why is s. oy censored. what.
>>
>>106708190
As I said good luck on that amazing small model that blows everyone else's out of the water. Looking forward to it champ
>>
File: 00110-2556412822.png (959 KB, 1008x888)
>>106708144
>>
File: file.png (102 KB, 230x222)
>>106708032
>Bigma
I went on a 500m mlp test phase. It'll keep training forever though.
>>
>>106708188
what's a onions voice?
>>
>>106708202
he wanted to say "s.oy" but 4chan censors that word and replaces it with "onions"
>>
>>106708199
What is it with zoomers just lying about what other people said? What is wrong with you, seriously.
>>
quick question, is the social media hate towards AI currently big enough to hamper this field?
>>
>>106708214
no
>>
File: 6747473.jpg (217 KB, 784x611)
Ugh, anons, can anyone help me?
>>
>>106708214
people will take whatever they can, especially now when legislation is hazy around copyright for learning material
If anything was gonna happen I'd expect it to be around that
>>
>generates qwen images and edits, makes them into videos
heh nice.
>goes back to generating with sdxl

anyone else do this? there's just so many more tools and shit available for sdxl. ip-adapter is just pure bliss.
>>
>>106708218
ok good
>>
>>106708228
it says it failed to extract an archive, delete it and see if redownloading fixes the error

your disk isn't full and the folder isn't write protected or missing permissions for the user this process runs as, right?
>>
>>106708214
Why? AI is about efficiency (saving money), which means the people with money will invest in it because it has obvious utility. All social media does is make people better at hiding AI use. I already personally use AI for my everyday work: LLMs as my code slave and dev duck, and image models for things like product hero images.
>>
>>106708232
>goes back to generating with sdxl
This whole gen is compromised. It's just NVIDIA and Comfy glowies telling everyone to buy more hardware and I bet none of the real anons here have more than 12GB of VRAM themselves.
>>
>>106708214
No. If the powers that be wanted to hurt AI you would be hearing "think of the children" type arguments, instead it's just kvetching artists
>>
>>106708241
i feel like if it was 100% accepted by everyone we'd have more tools idk
>>
>cumfart ooms after every gen again
>can't line up 4 wan gens anymore. again.

why is this software so cursed?
>>
>>106708232
I deleted all my XL models except one for when I just want a quick inpaint.
>>
File: file.png (9 KB, 316x186)
>>106708250
speak for yourself
>>
>>106708250
i'm that anon and i have 32gb vram but i still just like going back to sdxl for ease of use.
making wildcards for illust/noob/sdxl is just so much easier.
i'm tired of writing entire chapters just to get a decent gen with these new models.
>>
>>106708228
looks to me like you have unstable internet
>>
>>106708259
I don't get your logic. What tools? People don't work for free. Who are these people who should be making tools for you to use for free?
>>
>>106708273
>i'm tired of writing entire chapters just to get a decent gen with these new models.
and then there's doing upscaling/2pass with flux, which either doesn't exist, doesn't work right, or whatever does exist is a huge clusterfuck of spaghetti nodes. vs i get better gens just niggering with sdxl a little bit.
someone needs to make a proper realism model for illustrious, seeing as we can't rely on the chinese because they only like sameface gray alien girls and the americans like BOGGED negroid physiognomy. who's left?
..the french?
>>
>>106708267
>barely enough to run HunyuanImage 3.0 on Q4
vramlet
>>
File: 00111-1226643539.png (776 KB, 1008x888)
>>
>>106708291
i tried merging lustify and biglust with some illust models which works somewhat decently but in general yeah, realism models are all biased towards shit unless you use loras.
>>
>>106708297
You can get the Hunyuan Image 3.0 experience by quadrupling the layers on Flux with Identity pass through.
>>
>>106708303
wow he's literally me

>>106708305
even a few pony loras somehow manage to add realistic lighting, the future may be in loras. who knows, might try it myself since i have the hardware.

>>106708267
lmao the 4090 user got called a vramlet get owned >>106708297
>>
When ready

>>106708328
>>106708328
>>106708328
>>
>>106707161
>things other than aesthetics
It's literally all synthetic checkboxes in benchmarks. Qwen also had funny charts shown at release with BIG NUMBAHS but in reality it's a plastic model.
>>
>>106706484
Sauce on CWC vid? Looks hilarious



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.