/g/ - Technology




File: collage_1769050101_1.png (3.77 MB, 1707x1683)
Discussion of Free and Open Source Diffusion Models

Prev: >>107934819

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>ZiT
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Blessed thread of frenship
>>
>>107936090
For image gen? Yes. for LLMs? No.
>>
>>107936072
>>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
uh oh! melty!

>>107936090
no, they are different tools entirely
>>
Can we at least agree that teto is literal shit as a waifu?
She's even below those "horse 1girls"
>>
>>107936072
did you...
you know there's a script that helps you make collages right?
>>
>>107936111
1girls being broken by horse dick is kinda based, ngl
>>
The sample images generated during Klein lora training are very inconsistent.
>>
I wish Klein had better bob variety.
>>
File: 1743726505445912.png (704 KB, 840x736)
the japanese woman in image1 is wearing the same shirt and skirt of the anime girl in image2.

kawakami from persona but in real life:
>>
>>107936111
>"horse 1girls"
Who?
>>
does comfyorg hire shills to damage control here? how do I get hired for that?
>>
File: 1755039609919569.jpg (999 KB, 5120x1600)
>>107936159
and with rise:
>>
File: i.jpg (87 KB, 720x1280)
>>107936136
the lora maybe just isn't trained well at that point?
>>
umas are for breeding
>>
File: 1747008448442211.jpg (998 KB, 5120x1600)
>>107936200
it works well but 2 image can be picky depending on the source image, generally it works fine
>>
>>107936146
It's hard to tell when it generates SD1.5 levels of body horror.
>>
File: i.jpg (104 KB, 720x1280)
>>107936200
it's easy to get anatomy issues with a lot of poses, yes

there is a fairly decent chance that more finetuning will at least reduce the issues
>>
>>107936205
>>107936219
Okay maybe teto isn't that bad all things considered
>>
Honestly I think Klein is always meant to be used with reference images.
>>
Newbie here.
Is this a good way of prompting? I heard word order matters, so is this not good?
Using Z-Image Turbo if it matters.

SUBJECT:
A young adult woman with fair skin and a slender, elegant build, long dark brown hair worn loose with a natural wave, soft freckles across her cheeks and nose, and refined facial features. She appears confident and composed, embodying a modern, intimate portrait aesthetic.

OUTFIT:
A deep red satin slip dress with thin spaghetti straps and a fitted silhouette, softly reflective fabric that hugs the body and gathers subtly at the waist and hips, creating gentle highlights and shadows across the material.

POSE & BODY LANGUAGE:
Standing slightly angled toward the camera with her torso turned, shoulders relaxed, one arm resting naturally by her side. Her posture is upright yet casual, conveying quiet confidence and intimacy.

EXPRESSION:
happy, smiling

ENVIRONMENT:
kitchen

LIGHTING:
bright lighting

CAMERA:
photo taken by another person,
eye-level or slightly lower angle,
real smartphone or small camera look,
handheld feel,
slight low-light noise and grain,
minor focus imperfections,
natural HDR,
looks like an unedited private photo

STYLE:
photorealistic, raw, unretouched, indistinguishable from real photograph
>>
>>107936254
Try and see. Experiment.
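
If you want to test the word-order thing properly instead of guessing, one option is to build the same prompt with the sections shuffled and gen each variant on a fixed seed. A minimal sketch, assuming the sections live in a plain dict; `prompt_variants` and the shortened section texts are made up for illustration and are not part of any UI:

```python
from itertools import permutations


def prompt_variants(sections, keys=None, limit=None):
    """Yield (order, prompt) pairs, one per ordering of the chosen sections.

    sections: dict mapping a section name to its text (hypothetical layout).
    keys:     which sections to shuffle; defaults to all of them.
    limit:    optional cap, since n sections give n! orderings.
    """
    keys = list(keys or sections)
    for i, order in enumerate(permutations(keys)):
        if limit is not None and i >= limit:
            break
        # Join the section bodies only; the ALL-CAPS labels are left out here.
        yield order, " ".join(sections[k].strip() for k in order)


sections = {
    "SUBJECT": "A young adult woman with long dark brown hair.",
    "OUTFIT": "A deep red satin slip dress.",
    "ENVIRONMENT": "kitchen, bright lighting.",
}

variants = list(prompt_variants(sections))
for order, prompt in variants:
    print(order, "->", prompt)
```

Run every variant with the same seed, steps, sampler and cfg. If the images barely change, order is not doing much for that model.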
>>
>>107936254
>I heard word order matters,
arguably less so with z and more so with older models like xl and 1.5
>>
File: image.png (1.75 MB, 1024x1024)
Since I'm currently moving and my PC is packed away, I decided to try out Flux Klein 9b in Hugging Face Space.
My first image, on the first prompt, on the first try.

I would have expected more. I hope the Chinese can do better.
>>
File: GssafETbsAAqqPT.jpg (140 KB, 800x1200)
>>107936072
This "node based" interface is the biggest piece of shit I've come across in my fucking life. It can't even be classified as a workflow manager. ComfyUI is a fucking diagram made with Python by the antichrist himself, whose sole purpose is to consume 2 TB of VRAM by loading a checkpoint, how could you screw up so badly as to ruin image generation, to ruin stable diffusion workflows, how the hell did the devs manage to add a subscription model to API nodes, you motherfucking pieces of shit? How can a rational human being defend this mutant technological abortion, "30% faster than Automatic1111" my ass, not even an AI agent would be capable of fuck up so badly as to create this Blender node editor ripoff.This is the result of a bunch of bad decisions made by people whose brains were unable to develop, whose balls got stuck in their abdomen during birth and who don't shit themselves by some miracle of God. I bet my organs that these morons aren't aware of how shitty this node system is because everyone in the dev team uses Automatic1111. I thought workflow managers were programmed by programmers, not the fucking Discord moderators. To those subnormals, I propose a
marketing campaign: rename this mistake to ComfyUI 9/11, this fucking shitty node system forces me to use Automatic1111 (with extensions, because I'm not a masochist), at least with that I don't have to spend 3 hours connecting nodes for every fucking workflow, how the hell do you manage to break dependencies with every fucking update? What's being updated, your chromosomes? One day I'm going to lose it, and when that day comes, I'll fork the repo and I swear to God that every time I take a shit, I'll commit a picture of it and push it to the main branch just to fill the repository with high-quality screenshots of my feces. This software made me an atheist, I refuse to believe that hell exists, I refuse to believe that there is anything worse than having to debug Python dependencies in ComfyUI.
>>
File: Flux2_01489_.png (3.25 MB, 1824x1248)
>>
File: i.jpg (101 KB, 720x1280)
>>107936254
will probably more or less work, but simpler is likely better:
https://gist.github.com/illuminatianon/c42f8e57f1e3ebf037dd58043da9de32#2-core-positiveprompt-structure-for-zimage-turbo

>>107936230
someone else will probably do teto
>>
File: i.jpg (86 KB, 720x1280)
>>
>>107936286
mmmmm mid pasta gimmie something fresher next time
>>
>>107936298
anon getting knotted by the fennec tranny for the first time
>>
File: 1748252219450335.jpg (1.03 MB, 2482x1767)
change the clothes of the girl in image1 to the clothes of the girl in image2.
>>
File: 1754530260012660.png (1.66 MB, 848x1216)
>>107936316
keep the face of the girl in image1 the same.

also fixed the hand
>>
>>107936321
kekd
>>
>>107936321
oh so NOW you want more floyd instead of different things, bipolar retard
>>
File: image(1).png (895 KB, 1024x1024)
>>107936291
Same text with z image turbo, same settings, also huggingface space.

Not satisfied with either. Klein followed the text better, but with less quality. Both are terrible.
>>
File: 1751564299696844.png (1.46 MB, 1200x864)
>>107936331
well too bad, I post what I want, eat a bag of dicks.
>>
>>
>>107936111
she also looks too young which is very sus
>>
>>107936349
yeah
>>
>>107936357
*poof*
>>
File: 1746814926718899.png (1.41 MB, 1120x928)
>>
What is a waifu? What is a horse girl?
>>
File: x_q1p93f.png (1.89 MB, 1536x1024)
>>
>>107936254
Modern text encoders can easily take this so don't be afraid to dump entire markdowns. Tag-based models and T5 less so.
>>
File: 1742152274098378.png (1.42 MB, 1248x832)
>>
>>107936368
>>107936371
>>
>>107936406
Is that the new jap PM?
>>
>>107936410
unfortunately not, but she's based as a nationalist nonetheless
>>
File: 1750003763726454.png (1.43 MB, 832x1248)
>>
Does "hyper fine intricate details" still work
>>
Wonder what this is:

https://github.com/Comfy-Org/ComfyUI/commit/abe2ec26a61ff670b9c0e71e4821c873368c8728

New model called "Anima" that seems to use Qwen 3 0.6B as a text encoder.
>>
File: 1764788880559742.png (1.43 MB, 1248x832)
>>
>>107936267
I've tried it and it doesn't seem to matter at all.
I've also tried (tube top:1.5) which supposedly makes it more emphasized but I still can't get it to show a tube top.
It's weird, people say order matters or put (xxx:1.5) but they don't work.
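
For reference, the (xxx:1.5) thing is the emphasis syntax from the A1111-style UIs: the text inside the parentheses gets a weight, everything else defaults to 1.0. A rough sketch of how such a prompt splits into weighted chunks, assuming only the flat `(text:number)` form with no nesting; `parse_emphasis` is a made-up helper, not any UI's real parser:

```python
import re

# Matches "(some text:1.5)" only; nested parentheses are not handled (assumption).
_WEIGHTED = re.compile(r"\(([^():]+):([0-9]*\.?[0-9]+)\)")


def parse_emphasis(prompt, default=1.0):
    """Split a prompt into (text, weight) chunks, in order of appearance."""
    parts = []
    pos = 0
    for m in _WEIGHTED.finditer(prompt):
        # Unweighted text between the previous match and this one.
        plain = prompt[pos:m.start()].strip(" ,")
        if plain:
            parts.append((plain, default))
        parts.append((m.group(1).strip(), float(m.group(2))))
        pos = m.end()
    # Whatever is left after the last weighted chunk.
    tail = prompt[pos:].strip(" ,")
    if tail:
        parts.append((tail, default))
    return parts


chunks = parse_emphasis("(tube top:1.5), woman, kitchen")
print(chunks)
```

This only shows what the syntax is supposed to mean. Whether the weights reach the model depends on the UI and on the text encoder behind it, so a parsed weight is no guarantee that it does anything.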
>>
File: 1763123901422524.png (1.37 MB, 864x1200)
>>
>>107936447
Kinda looks like my Ex ngl.
>>
>>107936457
Show results. And configuration (model, cfg, steps, sampler).
>>
>>
>>107936449
wat
>>
>>107936291
not true
>>
>>107936282
is that Base Klein or Distilled though? The Distilled ones have more aesthetic tuning.
>>
>>107936291
>you better use Z-image turbo instead
no. Zit is fucking slow and has no variety.
>>
>>107936486
I'm using Z-Image Turbo. If I have the "large breasts" in my prompt, it will always completely ignore "tube top" even if "tube top" is the first word or even if I have (tube top:1.5). The character is always nude if I have "large breasts" basically. I even try adding "clothed" and "wearing clothes" but it doesn't work.
I can't post it here since it's NSFW but I'm sure you get the idea.
>>
>>107936512
skill issue
>>
>>107936286
Sorry but node based infinite canvas uis won. They get the executive level buy-in.
What else can you do? Maybe one could make ui that looks like msexcel or something.
>>
File: 1744835497833421.png (1.41 MB, 864x1200)
>>107936467
>>
>>107936433
Z-anime
>>
cozy breas
>>
>>107936513
Try the definition for tube top from wikipedia and words like "buxom" or "side boob" and "female torso".
>>
this girl edits her eyes. they always come out messed up
>>
File: x_6wo59k.png (1.24 MB, 1536x1024)
>>
>>107936346
damn that looks real
>>
>>107936346
Prompt for this?
>>
>>107936513
Try using bra size like F-cup breasts.
>>
>>107936516
>>
File: nah.png (19 KB, 139x91)
>>107936642
>>
File: view.jpg (148 KB, 1280x704)
>>107936698
>>
LTX has infinite meme potential
>>
>warned for calling a comfy employee a jeet

>>107936752
Ltx has infinite low quality potential
>>
>>107935526
>>107935487
>>107935249
Nice. Can you share your lora? I love this girl.
>>
>>107936776
A long time ago I was b& for referring to leaderboards and "jeeterboards"
>>
>>107936254
It's unusual, at least. The only model I've seen recommend something like that is NewBie's XML-style prompts, and people here hated that idea.
>>
File: f2k9b_00013.png (1.8 MB, 960x1440)
>>
I'm lazy and was hoping Furry Guy would manage to get us a nice NSFW model.
I've been lurking on his Discord and realize what a disappointment it is.
Even if he had the knowledge, the data, and the computing power, he would screw everything up because he's an idiot and lives in a bubble where everyone applauds him enthusiastically for every little piece of shit he does.
So he continues doing what he does, little things full of shit.
>>
File: f2k9b_00030.png (1.54 MB, 960x1440)
>>107936829
grifters gotta grift
>>
File: x_j5id4y.png (1.59 MB, 1536x1024)
>>
File: x_ln70m3.png (1.4 MB, 1536x1024)
>>
>>107936799
When I tried NewBie it worked fine with the same normal English boomer prompts I use with NetaYume. I didn't even change the Gemma boilerplate stuff.
>>
Sell your suno stock now!! Ace step 1.5 is coming soon.
>>
>>107936829
The only problem with Chroma is that it was aggressively shilled for realism by people who simply wanted it to be "for realism" despite the fact it was never advertised as having a primarily realistic focus. It's not surprising that the guy previously known for Fluffyrock made a model similar to Fluffyrock.
>>
>>107936742
She's from Texas.
>>
>>107936684
It's working better now. Works about half the time. The other half, the tube top is weirdly transparent.

>>107936559
>"shoulderless, sleeveless top that wraps around the upper torso"
It no longer becomes a tube top, but I'll keep trying.
>>
File: klein vs ZiT.jpg (1.54 MB, 5925x3072)
Klein vs ZiT testing: Klein anatomy and proportions are inferior to ZiT. ZiT are mostly zero shot gens, while I had to keep rolling Klein to not get freaky anatomy. Klein also changes the source style ever so slightly. Klein does small details slightly better, but the anatomy issues made it not worth it.

tl;dr ZiT is better
>>
>>107936957
Except you can make 10x the gens with klein in the time it takes to make one zit gen. Klein also probably doesn't even need a lora. just feed it a couple references and it just werks.
>>
>>107936967
>just feed it a couple references and it just werks.
It's surprisingly lackluster at style reference. IPAdapters for older models work better in this regard.
>>
>>107936967
>Except you can make 10x the gens with klein in the time it takes to make one zit gen
What? It takes like 10 sec per gen on zit lol
>>
when is bing dall-e gonna leak
i don't want arguments
i don't want "hurr use x it's better"
i just want the answer to the question
>>
>>107936957
The problem with ZIT is that it really is too overbaked. There is zero seed variation and the camera angles are so baked in, you actively have to fight with the model to produce any camera angles beyond level with the subject.

It's pretty but lacks any substance.
>>
>>107936995
>when is bing dall-e gonna leak
Never
>>
File: 1744411222833875.jpg (907 KB, 1496x1496)
>>
>>107936957
Flux guys generally hate anything anime and don't really bother with making it look good. Probably shouldn't use klein for anything more than editing anime stuff (even then hands are fucked most of the time)
>>
>>107937012
where can i hire a mercenary group to make it happen
you'd think some of those infinite jeets that m$ hired would leak it on after having their h1bs cut and forced to go home
>>
>>107937024
>where can i hire a mercenary group to make it happen
I doubt you can hire a group more effective than the M$ has.
>>
>>107936072
>comfyui
>rentries
>non faggot image
THANKS, I really mean it.
>>
>>107937032
i bet the coca cola deathsquads could take them out
>>
>>107936938
I've literally never seen chroma gens that come anywhere close to illust when it comes to style
If you show me how to do that I'll revere you as a god
>>
File: 1764566731641050.png (2.28 MB, 1632x928)
>>
>>107937051
>I've literally never seen chroma gens that come anywhere close to illust when it comes to style
duh, it cant hold onto a style to save its life
but anon was talking about gens that look like real photographs not faux realism illustrations
>>
>>107936957
Are you running them at the same step count with the same sampler and scheduler? If not no one cares.
>>
>>107936974
I dunno how you could have concluded this
>>
>>107937051
It's better at western art, as you'd expect.
>>
>>107937023
skill issue
>>
>>107936957
the Klein gens look way cleaner, even from far away, though.
>>
>>107936995
if it did there's no chance you could run it
>>
>>107937092
I welcome you to demonstrate that, I mostly need western art anyways and I've never been able to tard wrangle it into that
>>
>>107937103
what if i closed all other windows and went really slow
>>
>>107937089
By comparing the two.
>>
What causes this kind of mental illness, that makes you upload such trash?

It makes me hate humanity.
>>
>>107937023
Klein has the same anatomy issues as SD1.5 though not as extreme.
>>
>>107937145
SD1.5 was in many ways a better model than ZIT.
>>
>>107936957
Either the training code is wrong or Klein has some adversarial stuff going on in the model weights.
>>
>>107937060
>the thread is dead
compared to what?
>>
File: Flux2-Klein-9b8fp_00017_.png (2.68 MB, 1664x1248)
>>
>>107937174
It's probably censorship given how much they bragged about it.
>>
>>107937139
https://civitai.com/images/118189385
literally me browsing civitai
>>
File: 1754937112367601.png (1.21 MB, 752x1360)
no not like that...
>>
>>107937174
this is also my suspicion right now. there isnt a single good character lora for klein. the results always look retarded. these mofos poisoned it
>>
>devs make a model with amazing quality
>have more fun with it by ruining the quality

>>107937222
Do pic related.
>>
File: Flux2-Klein-9b8fp_00020_.png (2.55 MB, 1664x1248)
It's like I am playing with dolls
>>
Sure is a lot of Chinese culture in this thread.
>>
File: h1.png (124 KB, 1074x2390)
>>107934572
Pass users get a free, hum , pass over range ban.
means it's just another bullshit ploy to make Hiro richer.
That guy always has been a piece of shit.
Pic related is from 2015.
He took his time to boil the frog but he's none the less doing it.
>>
>>107937229
some of the nsfw klein loras really knock it out of the park though. The pyros guy said that klein is really easy to train and he's going to try making a full nsfw checkpoint.

I've been playing with base+turbo lora, You can use cfg > 1 and negatives. I think the results are better and loras work better with that config
>>
Is the local model meta old at this point?
>>
>>107937258
there is no meta model. all of them are shit in their own way
>>
>>107937258
>local model meta
Most people here just post Saas stuff and say it's local.

Sad really.
>>
>>107937258
still useful information desu but yeah its a bit old
>>
File: 1751681939517621.png (1.99 MB, 1072x1456)
Is anyone training 9b loras on 16gb yet?
>>
File: Flux2-Klein-9b8fp_00022_.png (2.54 MB, 1664x1248)
>>
>>107936286
>I thought workflow managers were programmed by programmers, not the fucking Discord moderators
the nodes make retards feel smarter after they just grab a dogshit workflow off of civitai or copy someone else
>>
>>107937258
It still recommends Wan so yes
>>
>>107937268
>>107937277
I assume ZIT is the go to for general stuff and Qwen Image Edit for image to image but I'm a textgen fag so I don't really know.
>>
>>107937282
Zit is and always was shit. It just dazzled people for a few weeks because the limited stuff it does produce looks kind of nice.
>>
>>107937282
Qwen Edit or Klein but more or less yeah
>>
>>107937284
story of every single model
>>
>>107937284
Is it really that bad? The gens that I saw looked good but again image generation isn't my strong suit.
>>107937286
What's the coomer meta these days? Noob?
>>
What is Zit?
>>
>>107937305
>Is it really that bad?
It's not bad. It's just deep fried to the point of not being flexible.
>>
>>107937305
Wai-Illustrious
>>
>>107937305
>Is it really that bad?
It is shallow but it does take to styles well. It's a tiny model. But it'll never become anything unless someone rips out the distillation and even then there might be better options.
>>107937305
>What's the coomer meta these days? Noob?
If I wanted to generate hardcore anime right now I would use Noob, but since NetaYume I've only felt the desire to gen semi lewd. I just cannot go back to a four channel VAE.
>>
>>107937271
yes i did with onetrainer and aitoolkit with lots of different settings. results were all plain bad. wasted days on this. awful character likeness where zit (even without base lol) performed extremely well for the same dataset

>>107937255
okay but can you show me a good celeb lora for example? the nsfw stuff i have seen wasnt really impressive either.
>>
File: 1742185383773353.png (89 KB, 961x604)
>>107937350
>awful character likeness where zit (even without base lol) performed extremely well for the same dataset
this was my reaction with 4b as well. did you try using a normal scheduler node that you feed the model into? i dont know what i'm doing but i dont like how the flux 2 scheduler seems to ignore the model. thought maybe that was affecting lora performance.
>>
>>107937271
If you want to train 9b klein loras with ai-toolkit, you need to fix the training code manually:

https://github.com/ostris/ai-toolkit/issues/653
>>
>>107937373
thank you, i noticed it crashed due to vram no matter what setting i was using
>>
File: AniStudio_00224_.png (1.21 MB, 688x1488)
>>107937277
> still
Is there something better than Wan yet?
>>
>>107937360
i dont think i have tried that yet. more or less used the default workflow. but the samples during training looked like shit anyway so its not like i expected some magic in the workflow
>>
File: Flux2-Klein-9b8fp_00026_.png (2.55 MB, 1664x1248)
>>
Anyone know what the largest modern models are? I've got a lot of VRAM.
>>
>>107937384
Well wan would have to be good in the first place
>>
>>107937406
HunyuanImage-3.0
>>
>>107937445
I've got 120gb VRAM, so I guess I can run it quanted.
>>
>>107937445
>>107937435
is it worth it
>>
>>107937316
fuck you
>>
File: 1764966658338079.webm (3.9 MB, 1154x1367)
>>107937454
>I've got 120gb VRAM
>>
>>107937457
not for most. usually people prefer qwen image or flux2dev if it's not the smaller models
>>
>>107937470
Anything autoregressive is basically 100% shit and not to be taken seriously.
>>
so yeah klein is a really fun editing model and thats it. back to waiting for base. chinks won.
>>
File: AniStudio_00219_.png (1.5 MB, 848x1216)
>>107937487
Tongyi please.
>>
>>107937464
it would never have won for 1girl, but i think if most used a DGX and a H200 was the low end people would have used it for landscapes and such.

it's just overall not worth the tradeoff on our common actual hardware
>>
>>107937499
based poothon hater
>>
>>107937499
shit gen kys
>>
>>107937512
Cumfart shill meltie incoming
>>
>>107937514
I was already using qwen bf16 to edit (non distilled too) which is superior to this klein cope, but I guess you only used fp8 4 steps of qwen sooo lol :D
>>
>>107937518
I don't care what you use to gen, 3dcgi is garbage, you're no better than the onsen with garbage lighting pornmix sloppa spammer
>>
File: ng8.png (1.2 MB, 832x1248)
>so yeah klein is a really fun editing model and thats it. back to waiting for base. chinks won.
>>
>>107937514
>take a celeb that is not already in the training data
>do an edit
>only about 50% resemblance
>trained lora even worse
nah sorry. but of course it will work for trump lol
>>
File: jt6.png (1.17 MB, 1168x880)
>we're getting there folks
>>
File: ComfyUI_temp_qkfyi_00004_.jpg (507 KB, 1152x1472)
>>107937487
>back to waiting for base. chinks won.
just 2 more weeks bro
>>
File: Flux2-Klein_00144_.png (2.4 MB, 1920x1072)
>>
>>107937595
>An in-game screenshot of the 2003 MMORPG "World of Warcraft." The hud is visible.
>>
File: Flux2-Klein-9b8fp_00036_.png (2.63 MB, 1664x1248)
"miku gets a job" by anonymous
>>
File: Flux2-Klein_00147_.png (1.24 MB, 1280x720)
>>
>>107937609
Flux kind of did too.
>>
>>107937609
>Klein knows wow
>it's nu wow
nothing was gained
>>
>>
>>
Chat... these generals truly have fallen
>>
>>107937657
All is lost!!! All is los-....
>>
>>107937139
>>
>>107937645
>mangled medallion
Soulless.
Try it on 9b.
>>
>>107937676
>Try it on 9b.
Don't believe the filename. That was 9b.
>>
File: 1761380227058038.png (2.54 MB, 1024x1536)
maid SEX
>>
>>107937689
@klein reveal her beautiful kempt unibrow
>>
>>107937689
Remember that time Arnold Schwarzenegger had sex with his maid and produced a son that was more chad than his other fatter legitimate son?
>>
>>
>>107936938
This.
Chroma is a furfag model, made by and for furfags. And furfags dgafs about realism, fingers, toes or general anatomy..

they want to fap on wolfs fucking dragons...
>>
>>107937684
Well that sucks. I wonder if you added something like "Pay attention to the medallion necklace he is wearing. Do not modify it." it would help.
>>
>>107937691
>not noticing the eyebags
its zit slop, not klein slop
>>
Anon, what's the hype since Ltx is shit at nsfw I2V? It's fast, but whatever..
>>
>>107937714
We are not getting a proper NSFW video diffusion anytime soon (besides lora shitmix slop).
Most models are too big to finetune without costing a fortune, the small ones are too shit in quality, and most importantly there is no easy way to mass generate decent captions for the NSFW videos, say unlike Gemini API for images, etc.
Don't expect video equivalent of Noob, BigASP, chroma etc.
>>
>>107937737
>chroma
lol
>>
File: 1762119982348657.png (679 KB, 749x866)
Pro-tip for Klein edit enjoyers.
>1. find high-res high-quality photographs with the type of nipples you like (hegre, met-art. etc)
>2. go to https://www.presize.io/ and create nine 512x512 close up images of the individual nipples
>3. save them in a folder and run this command to turn them into a collage:
montage *.png -tile 3x3 -geometry 512x512+0+0 collage.png

>4. put the collage into klein as a reference photo and let it do the rest
>>
>>107937689
>>
>>107937747
As in a major model that knows non-shitmix NSFW.
I don't pretend it isn't a trainwreck.
>>
what's the best pose estimator atm?
>>
File: ZIT_00022_.jpg (654 KB, 1800x2200)
I love zit so much I'm afraid of what I will become when base drops.
>>
>>107937827
Nah, it's too late for you.
>>
File: md3.png (2.47 MB, 1024x1536)
>she looks so happy!
>>
>>107936099
hard to tell from the code. Can't find anything on HF
>>
>>107936995
Chroma is better.
>>
>>107937854
dry tongue blowjobs are much better
>>
>>107936957
Well duh, the model that's seen naked women is unironically better. Now wait till a Klein Chroma tune because Chroma still destroys both of those models with basic anatomy out of the box.
>>
>>107937827
>I'm afraid of what I will become
I have some relieving news for you then.
>>
>>107937384
>>107937499
>>107937518
shoo, shoo
>>
>>107937885
>Model that has quite literally solved image gen and knows everything out of the box

Fun fact: ClosedAI, Claude are throttling their garbage 400K context models to hell and back, but Google allows almost unlimited access to their SOTA 1M context multimodal model, how do they do it?
>>
Tongyi my anus
>>
>>107937914
Google has positive cash flow and can capture the market by simply being able to bleed out longer than their competition.
>>
>>107937914
>how do they do it?
The inference in their dedicated TPU infrastructure is a lot cheaper, they are burning far less money for the same request.
And unlike Anthropic or ClosedAI, Alphabet has a lot more going on besides AI, they are unlikely to go bankrupt anytime soon (unfortunately) so they have little need for belt-tightening.
>>
>>107937914
Also I should probably drop the caveat that while 1M context is nice and better than straight up forgetting the beginning of the conversation, such long lengths are somewhat of a meme and the model's intelligence starts to drop noticeably a quarter of the way there (a.k.a. context rot)
>>
>>107937933
>>107937934
This is good, now they just need to bleed out the competition by providing very cheap SOTA video (hopefully with light guardrails), because fuck Sora 2.
>>
>>107937695

>>>/wsg/6077296
>>
>>107937961
You mean API? They might manage that eventually. That Youtube data is going to be really useful.
But as in /ldg/? I do not think they are releasing any local image or video model anytime soon, if ever. We are lucky they are still doing gemma.
>>
>>107937748
What would the prompt be?
>>
File: file.png (674 KB, 900x900)
>be faceswap model
>example images are asian women
>>
>>107937988
anything topless. i dont even refer to it.
>>
>>107937994
nta but thanks for the tip,
>>
File: 1759484558942507.png (1.9 MB, 1669x1316)
>>107937993
lmao i remember someone made a post on reddit saying that z-image knows so many asian actresses and they all look the same
>>
the people have turned on alibaba, goddamn

https://xcancel.com/Alibaba_Wan/status/2013808554113663415
>>
>>107937994
Do pussies work the same way?
>>
>>107938012
I turned on them first and redditorbs called me ungrateful and gave me lots of downvotes. This was during wan 2.2 when I smelled something fishy with their API pipeline models vs the ones they had released. They had in house more effective turbo models behind paywall API.
>>
File: locally generated.png (1.15 MB, 2846x1559)
>>
>>107938008
It must be like how wales can tell each other apart by the bumps on their fins.
>>
>>107938008
I live in East Asia and I can barely tell these women apart.
>>
File: ZIT_00052_.jpg (795 KB, 1800x2200)
>>107937843
No it's not.

>>107937853
Believe.

>>107937897
>>
>>107938017
i just tried with some extreme close ups and it didnt work as well. might work better if you zoom out with some more context but i cant be assed to track down a bunch of images
>>
>>107938012
I'm convinced Twitter is 90% bots though, and the accounts seem all fake... Nobody cares about this enough to complain at that scale, so what is their real goal?
>>
>>107938114
>>107938114
>>107938114
fresh when ready
>>
>>107938118
>baked at 283
nice spamming/flooding
>>
>>107938144
you can always report rule-breaking posts/threads
>>
>>107938144
Literally every thread has been baked around 280-290, retard.
>>
>>107938118
>shitbake underbake
benchod
>>
>>107938023
>reddit
kek sounds about right.

but honestly, i wouldnt care if they had api versions that were slightly better, just would be nice if they kept releasing open source versions. the real dick move is going 100% api

>>107938066
>>107938113
comments bots or not, hopefully this will push them to go open source again
>>
>>107938200
>>107938200
>>107938200
Proper bake with links
>>
>>107938118
>>107938160
>>107938207
absolute faggotry
>>
>>107938207
>proper bake
>another underbake
and the collage? i genned very long for top spot in front page cover of /ldg/?????? benchod bloody motherfuckers DO bake proberly
>>
He's going to keep making shitbakes so may as well
>>107938215
>>107938215
>>107938215
Remember to report the tranibakes, and fill this thread first
>>
>>107938114
>>107938114
>>107938114
Migrate.
>>
>>107938260
that's a shame
>>
>>107936801
wow that's a pretty cool pic
>>
proper thread is so back
>>
>>107938024
What is this?
>>
>>107937983
one day there will be a vidya finetune for a model like this, imagine the memes.
>>
b
>>
I sincerly hope the schizo you guys suffer whoever he is successfully chokes the life out of this lunatic syndicate and causes an organic emmigration to your next scouring grounds
no need for an /ai/ board, lowlifes will control each other's population through mutually assured destruction
>>
>>107943135
k julien
>>
Why do you faggots bake so many threads?
Stop polluting /g/
>>
>>107943873
An anon who doesn't like two links in OP has been trying for months to remove them. Recently he's been baking multiple threads way earlier than when they're usually created. We don't want him here as much as you.
>>
>>107943873
Literally a single schizo melty.
>>
>check in a day later, still the same thread
>>
This actually works. Generate massively longer LTX2 videos on consumer GPUs.

https://github.com/RandomInternetPreson/ComfyUI_LTX-2_VRAM_Memory_Management


