/g/ - Technology

Discussion of Free and Open Source Diffusion Models

Prev: >>107983392

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>ZiT
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Blessed thread of frenship
>>
File: 1739727599569738.png (2.65 MB, 1440x1088)
FML so
SAGE ATTENTION DOESNT WORK WITH BASE, IT PRODUCES PICREL
>>
File: z-image_00005_.png (1.39 MB, 832x1216)
50 steps

>>107984259
30 steps

i think 50 steps is overkill and fries the image a bit
>>
I had a sneaking suspicion z image base would flop but god damn I thought it would be salvageable
>>
File: 0.png (2.42 MB, 1216x1088)
>notices
>>
Z-image fags on suicide watch.
>>
>>
File: 1763766690912658.png (2.16 MB, 1440x1088)
>>107984345
the fuck you on about? it's good, thinking that maybe doing first steps here and then continue on zit for refinement might be the best course
>>
>>107984274
>330s on a 5090.
It's fucking ogre
>>
UGH I CANT GET AI TOOLKIT TO WORK I NEED TO MAKE MY CUNNY LORA
>>
>>107984343
you can remove the debo one. he's dead now
>>
lora training at 512x512 is 1.23it/s on 5090 for z image at bf16
the outputs from the model are disappointing, gonna try to at least see if it's better for training turbo loras than the de-distilled model
>>
Shouldn't we split up?
I would like to share a thread with the BFL faction and spin off the Z brothers into their own Z thread.
I think that would be a kind of natural selection and would be good for everyone.
>>
Only thing that could salvage this is if it can do NSFW one-shot.
>>
File: 1764511010881685.png (247 KB, 460x460)
are you gonna use his lora?
>>
>>107984362
>here comes the cope
>just generate then fix it with another model!
nah, I'll just use a good model
>>
File: 8884162.png (1.62 MB, 1232x1072)
>>
>>107984368
>he's dead now
based, how did he die? AIDS?
>>
>also does amputees
This feels like a stolen klein lmao.
>>
File: o_00018_.png (1.3 MB, 1152x896)
>>
File: z-image-base_00016_.png (2.44 MB, 1024x1536)
>>
>res_multistep
what scheduler does the default workflow use? normal made it look like ass
>>
you guys wont be singing the same tune once the first good base finetune comes out
>>
File: 1752212135340018.png (2.48 MB, 1440x1088)
>>107984387
keep fudding bro, you'll get there eventually
>>
>>107984397
post prompt so I can feed it in klein.
>>
>>107984397
lie, post WF
>>
>>107984401
inb4 pony v8
>>
>>107984366
we're waiting for the new Image2Lora technology that instantly produces loras
>>
File: ComfyUI_temp_ppqem_00057_.png (3.15 MB, 1216x1664)
>>107984407
klein can't generate feet lol, germans are allergic to feet
>>
/adt/ troons need to fuck of to their dead general and gen with SDXL .
GET OUT GET OUT GET OUT
>>
Do loras trained on ZIT work with the base?
>>
>>107984406
whatever you say chang
>>
File: ComfyUI_ZIT_00043_.png (3.44 MB, 1536x2048)
>>107984295
same seed prompt with full turbo cfg 1 steps 12
>>
I feel like Z vs Klein doesn't really matter even given Chroma is already literally happening on both, with training checkpoints for both already out
>>
File: z-image_00006_.png (1.44 MB, 1024x1024)
this is what I get for a woman sitting down playing poker so there is no way it does that >>107984397 show WF
>>
>>107984425
delicious 4 toes
>>
>>107984400
it uses simple
>>
>>107984430
this anon has a point to be honest
>>
>>107984432
If the model parameters count is the same then yes
>>
File: ZIT_00002_.png (1.53 MB, 1024x1024)
It accidentally made my prompt into "anime"..

Bros..
>>
Hmm, it's definitely worse than Turbo, but it's not overfit on the same camera angle and can do shit on its own which is a plus.
>>
File: o_00019_.png (1.55 MB, 1152x896)
>>
File: 1761210688086503.png (165 KB, 842x260)
what exactly happened in this step to produce turbo?
>>
the chinese culture is flowing
>>
File: ComfyUI_00004_.png (3.38 MB, 1434x1434)
>>107984461
no, they won't because zit turbo loras are inherently hacked in. the other way around will likely work. i'm going to walk away from this thread because most people in it have no idea what they are saying or doing and it is triggering my autism.
>>
>>107984446
>>
File: 1745917857405613.png (1.87 MB, 1360x768)
flux klein 9b edit is more fun than zimage
>>
File: ComfyUI_07987.jpg (2.26 MB, 1440x2160)
>>107984174
Yeah, 30 steps is the sweet spot. 40 and 50 started to lose adherence for me. 191s for this resolution, I miss that speed already.

Oh, and Turbo LoRAs do actually work across a couple of layers on Base, which was surprising. You get mashed potatoes if the strength is too high though.
>>
>>107984443
yes and the way it's going even loras are made for both
>>
File: 1738443590072827.png (2.04 MB, 1024x1024)
>A manga store in downtown Akihabara, Japan.
this seems worse than turbo. why is it blurry/bad text/characters?
>>
File: 027.png (2.75 MB, 1152x1136)
>>
File: 1745746309114121.png (1.11 MB, 1168x880)
>it's gonna be worth the wait
>>
>>
>>107984476
>what is rlhf
open a book retard
>>
>>107984442
Not much compositional variance even then for Turbo vs Base
>>107984400
Res Multistep isn't very good for these kinds of models, if you ask me, it's noisy with worse anatomy almost always
>>
>>107984442
oh no no no no....
>>
File: z-image-base_00017_.png (2.36 MB, 1024x1536)
>>107984407
>>107984409
>>107984446
https://files.catbox.moe/6wrni9.png
>>
File: 1768527432239093.jpg (1.15 MB, 4580x1242)
>>107984476
they trained it to the point of overfitting on aesthetics for photography. Reminder that turbo can only do realism
>>
>>107984488
>this seems worse than turbo
kek base is for training dumbass, 99% don't need to download it
>>
File: ZIT_00004_ (2).png (1.57 MB, 1024x1024)
LLM diversified my images.

But bros.. This is euler simple, cfg 2, 30steps.
>>
>>107984504
>Reminder that turbo can only do realism
good
>>
>>107984510
>is for training dumbass
but why would anyone bother with it having a much worse vae than the same sized yet better (and edit ability) klein 4B?
>>
Bfl won this round
>>
>>107984510
this needs to be stickied. no one cares about your base gens. i want to see how well base loras work with turbo
>>
>>107984400
use res_2s if you have it.
>>
>china used flux 1 vae instead of making their own
reminder than china can only steal, never innovate
>>
>>107984504
You are not actually saying flux does better anime than ZIT are you lmao?
>>
>>107984511
>ZIT_00004_ (2)
hmm
>>
buying a NAI subscription rn
>>
localkeks are already coping massively and it hasn't even been 2 hours yet.
>>
the finetunes that come off of z-image are going to be insane.
i already know 2 people working on them. one is anime/hentai, the other is unfortunately furry, but if Pony was any indicator, furries know what they're doing.
>>
>>107984528
flux 2 used qwen te, faggot
>>
>>107984488
Why were you expecting it not to be worse than Turbo?
>>
>>107984529
100X yes. Try any non realistic 2D style like a comic / manga. ZIT is shit
>>
>>107984335
so sage works in ZIT but not in base (where it's really needed)
fuck this gay earth
>>
>W-WHY DID YOU EXPECT IT TO ACTUALLY BE GOOD
so true, this is local after all!
>>
which NAI subscription tier will you buy bros?
>>
File: ZIT_00008_ (1).png (1.88 MB, 1024x1024)
res2s beta57 has been best so far for me.

>>107984530
I'm feeling lazy and just save the image into downloads folder.
>>
going back to nano banana pro, fuck this gay earth
>>
>>107984529
I posted a comparison a few days ago recreating a WAI pic via boomerprompting that showed how much ZIT only wants to do realism in comparison to Klein
>>
I am once again asking for IDs and flags on every board.
>>
>>107984558
did you try res2s + bong tangent?
>>
>>107984554
which can i afford when i sell my 2080, saar?
>>
Also you can try to use ZiT loras on 0.1(!) strength.
>>
>>107984563
are you at least paying for it through API nodes to support comfyorg?
>>
>>107984335
>SAGE ATTENTION DOESNT WORK WITH BASE
wtf how is that even possible
>>
Which artist tags and quality tags are best for NAI?
>>
File: aa.jpg (1.02 MB, 2719x1920)
>>107984529
with loras yes. There was that one lora posted here where you could do a style transfer with a reference image using klein. You can also upscale and prompt at higher resolutions
>>
>>107984571
outstanding quality
>>
So base is as uncensored/censored as turbo?
>>
File: 1745465659314082.png (2.04 MB, 1024x1024)
>>107984488
same prompt using t2i klein 4b and generated in 3 seconds/8 steps:

maybe z base is shit on purpose cause its ONLY a training model. flux klein 9b distilled (q8) is the best/fastest model too.
>>
Is base able to be used like klein edit?
>>
File: 1763916289060120.png (1.9 MB, 1328x1328)
>>107984540
You are out of your mind/blind or have been seeing way too much western crap lol.
>>107984577
You guys are blind...holy shit if you think that looks good
>>
please stop calling it base you fucking retards
>>
New Z model? Is this Z base?
>>
>>107984503
nice ass
>>
File: 1756541209155922.png (3.1 MB, 960x1632)
>>107984575
slightly different arch, tried with multiple sched + sampler combo but nah, produces those ugly artifacts
>>
okay i screenshotted it now what
>>
I don't get why everyone is assblasted, we have 4b base and Z base now, good things are going to happen one way or another
>>
File: 1760621144658707.png (4 KB, 59x55)
i'm waiting for his verdict
>>
>>107984591
stick with turbo, junior
>>
>>107984595
but I had no issue with turbo, makes no sense lol
>>
Thank god we at least got klein out of the z image turbo scare. Too bad about base z image.
>>
File: 1769534249.png (24 KB, 721x148)
>>107984596
kek
>>107984601
it's just fud
>>
>>107984590
so what is it
>>
real knowers already knew it was never about base, it was about z-image-edit... soon.
>>
the anime example makes me think the z-image team actually trained the model a bit on the noob dataset they asked for a while back. this might mean that the new noob model will be on z-image instead of klein.
https://old.reddit.com/r/StableDiffusion/comments/1qojw11/zimage_base_vs_zimage_turbo/
>>
>>107984596
the thing with these retards is that they are already 3 doomer states ahead, they don't care about their old failed predictions
>>
I am unsure whether to choose the Opus or Tablet tier. How many gens does my NAI friend generate each month?
>>
was turbo even made from this model? will loras trained on this model even work on turbo? i highly doubt it
>>
>>107984401
>>107984535
> todd, release starfield, modders will fix it!
>>
>>107984595
These aren't artifacts, this is the secret sauce for the best finetunes ever. It must be this way
>>
File: 1756988534138879.png (3.05 MB, 960x1632)
>>107984610
yeah it works with turbo, but turbo doesnt really benefit that much since it's only 8 steps.
>>
so, no ace today?
>>
>>107984605
yes we all care very much about your favorite discord drama, thank you very much for posting
>>
>can't edit
>looks bad
>loras don't work with turbo
it's over. please save us lodestones
>>
>>107984623
except base was specifically made for "modding"
>>
>>107984534
>localkeks
This thread is the local diffusion general. We're ALL localkeks here. Except you.
You need to GET THE FUCK OUT YOU CORPO SLURPING SHILL, STOP SHITTING UP THE THE THREAD WITH YOUR SPAM
>>
>the sd3 finetunes are going to be insane
>>
>>107984571
>0.1(!) strength.
if you need to set a lora to this its completely raped
>>
Well that was about the fastest I've deleted a model after downloading it
>>
You don't need to download the Z-image base model.

Yes, you.
>>
25 a month? It seems quite affordable.
>>
>>107984634
>loras don't work with turbo
???
>>
File: ZIT_00010_ (1).jpg (1.44 MB, 2048x2048)
>native 2048p

Expected.

>>107984568
Way worse.
>>
>>107984601
klein has the advantage of being better than z-image base while also being an edit model. this is most likely due to the flux 2 vae and bfl panicking and overestimating z-image base because of turbo.
>>
>>107984651
same, turbo is just better/faster

also klein edit 9b and LTX2 video are more fun
>>
why does apparently everyone want to kill this general
>>
>>107984658
compare klein base to z-image base or you're retarded
>>
someone try negative prompt please
>>
File: 1760899614704953.jpg (1.87 MB, 1248x1824)
The quality difference between base and turbo seems to be very small, prompt comprehension seems to be worse, but generation time difference is massive.
Although all of this was expected i guess.
>>
I heard that if I subscribe to Opus, I can gen txt2img without limits, is that true?
>>
>>107984665
I agree that while Klein 9b still fucks up anatomy with certain poses. using references is incredibly fun
>>
I waited specifically for z image for my NSFW fine-tuning because I truly want the best for my fine-tuning and to make an objective decision.
I will fine-tune Klein 9b and keep it private; the monkeys should wait here for their z base fine-tuning.
>>
>>107984666
It's a popular thread, its a nice place for trolls
>>
>>107984666
It can't be more dead than sdg

I miss it desu
>>
>>107984623
you're either incredibly retarded or a noob.
stop posting.
>>
Seedream 5 is our only hope. Please save comfyorg so we can retvrn to diffusing locally through api nodes
>>
has anyone been able to train klein 4b with onetrainer? it always fails to load. 9b works fine
>>
>>107984674
>but generation time difference is massive.
is it worse than chroma
>>
>>107984666
This thread was made by a schizo that wanted to kill /sdg/.
>>
If I buy, let's say, 2000 Anlas, how long are they valid for? Or do I have to use them within a certain time period?
>>
Chinamen have proved their incompetence yet again.
>>
>>107984692
Around the same as chroma actually. May be a bit slower.
>>
>>107984666
>everyone
you just need 3-4 different mentally ill anons wasting their lives here
heck we even attracted the retard about his anti nai pro nai nai shill whatever bullshit gig
>>
File: 00003.png (1.38 MB, 832x1216)
Hey that's pretty good!
Euler, normal
28 steps
3.0 CFG
>>
>>107984693
Bingo. This general was born out of pure spite.
>>
>>107984658
The BFL team invented diffusion models at the Technical University of Munich. They read the ZIT paper and felt offended that they were pulling such crap with ZIT.
So they casually dropped Klein to make it clear who was boss.
This was aimed more at investors than at the open source community.
>>
File: z_imageBASEd_00018_.jpg (409 KB, 1112x1336)
>>
File: ZIT_00014_ (2).png (1.71 MB, 1024x1024)
Bros..
>>
res_3s/bong/cfg3.5/12 steps
>>
File: 876532.png (1.58 MB, 832x1216)
Does pretty well with painterly styles too.
>>
>>107984705
yeah but why is this thread like a bat sign for schizos, I haven't seen any other general where it was this bad
>>
>>107984535
>the finetunes that come off of z-image are going to be insane.
yep, and with proper licensing we can finally move ahead and leave sdxl behind
I hope every group will make their own version
>>
For the LTX-2 enjoyers. this node seems to improve coherence at the cost of slight slowdown.

use it in your first pass with distilled lora at 0.2, CFG 2, 15 steps of your sampler of choice.
>>
but can it do bare tits and vagoo?!
>>
>>107984294
>>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
why are these in the op? totally not ani btw
>>
so far what I've seen isn't so bad for a base model, why the hysteria
>>
>>107984727
that I have no idea, but it's been worse for a few months now
I feel like I hide every other post
>>
>>107984727
>I haven't seen any other general where it was this bad
Every single active AI general is like this.
>>
File: z-image_00012_.png (1.72 MB, 832x1216)
>it can do paags
zim wins
>>
>>107984727
oh shut up, 1/2 generals have schizos that will dedicate their life to ruining those threads for years. /g/ threads have the added flavor that people want to keep the "sirrriousss busisness only" attitude, trying to make "rules" around schizos
>>
>>107984750
wrong
>>
>>107984666
>>107984727
1 llm and proxies, mods rarely do anything about it. the thread is only usable a few times a day like now and even in these threads when the schizo is not sleeping, he will keep gaslighting
>>
>Everyone: zib won't be good. It's for finetunes and loras not for direct image
>Retards: WHAAAAA Z BASE BAAD
>>
>>107984712
it was to get away from the Jewish and Chinese saas shills. look how well that turned out. we are going to need another general to get rid of the drama and the grifters
>>
obiWAN2.6, you're our only hope
>>
>>107984756
Surely you can name an exception.
>>
>>107984744

>>107984760
>>
File: o_00024_.png (1.2 MB, 1152x896)
>>
>>107984760
this
>>
Z-Omni when
>>
File: 1747849585570911.png (1.48 MB, 704x1456)
>>107984678
multi image stuff or single edits are pretty fun, it works better than QIE even

replace the asian girl with the anime girl in image2 in the same pose.
>>
>z base can still do big booty loli
base > klein
>>
So Tablet and Scroll tier subscriptions are Pay As You Go, while Opus has unlimited gens...grrr this is difficult to choose, any suggestions?
>>
good unrestricted model when?
>>
>>107984758
>>107984793
>AI as a service
>Pay as you go
Niggers, be serious
>>107984797
Never ever, we are fucked
>>
>>107984791
proof?
>>
File: 4685027.png (1 MB, 1600x512)
>>107984760
great, now we have it for fine-tuning, who has some spare tens of thousands of dollars to do it?
>>
>>107984793
The good thing is that Tablet and Scroll give you 20% discount when purchasing Anlas
>>
>>107984537
i mean you're talking to the floyd spammer so what do you expect
>>
File: z-image_00013_.png (1.76 MB, 832x1216)
>>
>>107984797
Be the change that you want to see.
>>
>>107984767
No it wasn't. It was born out of spite towards "avatarfags" and "ties to stability". It was named image diffusion general at first, and renamed to local diffusion only after Ani suggested that (it's not a joke, you'd know if you were there. he hates saas). It's incredibly ironic that he's the public enemy number one here nowadays
>>
> /ldg/ before zib release: z base when chinese culture?
> /ldg/ after zib release: why did you wait for it? it's for training!
>>
>>107984791
zbase can't do pretty non asians
>>
>>107984790
do you dress up as Miku sometimes when you are alone at home?
>>
>>107984797
from official labs? never again. Some of them are fine with having unrestricted llms, but image models? nuh uh.
>>
File: i2iUpscale_00002_ (1).jpg (1.11 MB, 2048x2048)
>>107984722
Well i2i upscaling with zit works at least. This is probably how it will be used.
>>
Does lodestone even have the money to finetune base to a zchroma?
>>
>>107984813
You don't?
>>
>>107984810
>It was born out of spite towards "avatarfags"
then why do you let a retarded avitroon bake all the time?
>>
thread is cozy when theres not a schizo complaining about the inclusion of two links at the bottom of OP
>>
>>107984822
no, but that's never stopped him before
>>
File: 16465432.png (1.63 MB, 832x1216)
>>
>>107984827
Nobody "lets" him, he's forcing himself into OP.
>>
>>107984831
a klarna model??
>>
>>107984824
why bother generating images? you are a cute and valid Miku already :)
>>
>>107984822
he could get something working with base, but he'd probably get distracted by trying some"experiment" that goes nowhere and then its allll sacked
>>
>>107984840
that sounds like you are too lazy to bake and let somebody you despise bake the general without any resistance, not even a report
>>
So after gathering a bit of info, Tablet gives you 1,000 per month, Scroll gives you 1,000, and Opus gives you 10,000. That’s a significantly higher amount, especially considering you’re only paying about 50% more.
>>
>>107984790
Ohh, my clitty is leaking in anticipation of some George Floyd edits...
>>
>>107984822
yes, he is a furfag so he has infinite money for some reason
>>
File: 71863421.png (1.68 MB, 832x1216)
It actually just works, finetunes are gonna go so hard.
>>
what that other guy said about using base as a start and then finishing the last steps with turbo works pretty well btw. you can basically use the two models combined via the advanced ksampler like wan 2.2. might be great once we have lots of base compatible loras
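
A rough sketch of that base-then-turbo handoff, assuming the split is expressed as step ranges the way ComfyUI's KSampler (Advanced) node takes them (`start_at_step`, `end_at_step`, `add_noise`, `return_with_leftover_noise`). The helper name `split_steps`, the 30-step total and the 0.6 handoff fraction are placeholders for illustration, not values anyone in the thread confirmed. CFG and sampler would still be set per stage and are left out here.

```python
def split_steps(total_steps: int, handoff: float = 0.6) -> tuple[dict, dict]:
    """Split one sampling schedule between a first-stage and a second-stage model.

    Hypothetical helper: the dict keys mirror the step-range inputs of
    ComfyUI's KSampler (Advanced) node, the handoff fraction is an assumption.
    """
    if total_steps < 2:
        raise ValueError("need at least 2 steps to split")
    if not 0.0 < handoff < 1.0:
        raise ValueError("handoff must be strictly between 0 and 1")

    # Clamp so each stage runs at least one step.
    switch = min(max(round(total_steps * handoff), 1), total_steps - 1)

    # First stage (e.g. base): adds the initial noise, stops early,
    # and hands over the still-noisy latent.
    first = {
        "steps": total_steps,
        "start_at_step": 0,
        "end_at_step": switch,
        "add_noise": "enable",
        "return_with_leftover_noise": "enable",
    }
    # Second stage (e.g. turbo): no fresh noise, finishes the schedule.
    second = {
        "steps": total_steps,
        "start_at_step": switch,
        "end_at_step": total_steps,
        "add_noise": "disable",
        "return_with_leftover_noise": "disable",
    }
    return first, second


base_stage, turbo_stage = split_steps(30, 0.6)
print(base_stage)
print(turbo_stage)
```

Moving the handoff later gives the first model more say over composition; moving it earlier leaves more of the detail work to the second one.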
>>
>>107984844
Miku is always valid also dubs of truth. :)
>>
>>107984827
>retarded avitroon
ani didnt bake this thread tho
>>
>>107984863
get out saas nigger
>>
Cope is off the charts right now.
>>
>>107984875
this, it makes images less "fried"
>>
>>107984878
Mr catjak is far worse. you should see his schizo scrawlings. it's very concerning
>>
File: z_imageBASEd_00027_.jpg (362 KB, 1112x1336)
>>107984873
yeah, not bad not great. thats about it
>>
File: 1738997463956367.png (104 KB, 763x976)
has anyone ever tried to run these minimal flux2 klein diffusers examples? none of them work for me even when using the latest diffusers / transformers / whatever from git
the zit one works just fine
>>
File: 38975321.png (1.88 MB, 832x1216)
>>107984900
Yeah, pretty much perfect.
>>
File: ZIT_00028_ (1).png (1.77 MB, 1024x1024)
Holy shit I forgot I was using i2i with a new prompt..
SD1 soul.
>>
>>107984750
I remember during the AI Dungeon days, we had our very first schizo who was immediately kicked out of the still-forming NAI team (not shilling NAI btw, this actually happened). He was some furry troon who was trying to be a community manager or something. After he got kicked he started shitting up the thread 24/7
>>
>>107984875
>using base as a start and then finishing the last steps with turbo works pretty well btw.
Are you aware that you’re copying at an almost incalculable level?
>>
>>107984914
i ran them and the included lora training code
>>
File: o_00025_.jpg (870 KB, 2304x1792)
>>
not bad
>>
>>107984926
no, I genuinely had no idea
where are those copies now?
>>
File: 428351941.png (1.34 MB, 832x1216)
>>
File: z-image_00016_.png (1.91 MB, 832x1216)
sovl
>>
File: ComfyUI_temp_eknrp_00030_.png (2.8 MB, 1024x1344)
>the magic is aggressive samplers with 12-15 steps
Eulercucks finally btfo.
>>
>>107984928
how did you install all the dependencies? pip install -U diffusers doesnt even install a diffusers version that has all the flux2 klein stuff
>>
>>107984948
proof?
>>
>>107984873
>>107984918
I much prefer rawer shit like this than the RLHF slop from Turbo. I doubt we'll ever get something with the versatility of say DALL-E 2, but when you hire third worlders for pennies per hour to do all your RLHF and pick the "best" image (or text, or whatever) you invariably get slop.
>>
Is base actually censored or just incredibly incompetent at nsfw?
>>
>>107984810
>It was named image diffusion general at first, and renamed to local diffusion only after Ani suggested that (it's not a joke, you'd know if you were there. he hates saas).
Is this true?
>>
>>107984898
yeah he should work on his wrapper instead of sperging out about two links at the bottom of op i hope ani gets the clinical help he needs soon :[
>>
File: ZIT_00031_.png (1.89 MB, 1024x1024)
>>107984921
Bros, we are so back..
>>
>>107984966
no
>>
File: 345124681.png (1.61 MB, 832x1216)
The fact that "Euler" is pronounced "oy · luh" is really fucked up.
>>
>>107984950
yeah you have to use `pip install git+https://github.com` syntax to pull the code right from the repo instead of from PyPI
>>
>>107984967
literally catjak bipolar disorder. she needs the drama
>>
>>107984977
>luh instead of lur
Nice britbong self report.
>>
>>107984940
if they trained on the noob dataset for a bit, does it mean they also trained on the furry part of the dataset they added later?
>>
>>107984900
which model? Juggernaut? or CounterfeitXL?
>>
Aren't you glad you waited months for this?
>>
>>107984914
clone the diffusers repo, the latest stable doesn't include the flux2klein pipelines
>>107984948
this and then using klein to upscale/denoise seems pretty good
>>
>>107984994
in 2 weeks we get the finetunes
>>
>>107984994
nothing came out of the scapegoat ani drama. I'm not sure there was anything to look forward to for that
>>
>>107984940
kek'd
>>
>>107984984
right its weird that he used a gen mentioning ldg in his github project but he continues to sperg out here and drive posters away. all those rumors about ani being on meth during christmas time were probably true, how else was he able to stay up and troll the whole time?
>>
>>107984830
the only real cozy AI thread was sdg and now it's gone
>>107984944
I want to burry face in that
>>107984921
I remember joining a bit late, 1.4 was "brand new" and people were speculating whether 1.5 would be "woke" or not
We used to be happy
>>107984994
What kind of retard hypes up AI models? Most people just use the newest one or their favorite until a new one "shadowdrops" or gets leaked.
>>
>>107984977
I like it. If anyone says "yuler" you know they have zero formal education in math / ML and so you can ignore any technical advice they're trying to give. It serves as an excellent shibboleth.
>>
>>107984994
yeah, because now I’m even more convinced to subscribe to NovelAI and finally stop believing in chinese fairy tales.
>>
>>107984982
>>107984997
right, thats what i figured out as well. but then i get a different error when i try to use my gpu:
>NotImplementedError: Cannot copy out of meta tensor; no data! Please use torch.nn.Module.to_empty() instead of torch.nn.Module.to() when moving module from meta to a different device.
do i need a specific python / torch /cuda version?
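
For version questions like this it helps to post what is actually installed. A minimal sketch using only the standard library; the package list is an assumption about what matters here, and it only reports versions, it does not diagnose the meta tensor error itself.

```python
import platform
from importlib import metadata


def installed_versions(packages):
    """Return {package: version or None} without importing the packages."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None  # not installed in this environment
    return versions


# Assumed list of relevant packages; adjust to whatever the example imports.
report = installed_versions(["torch", "diffusers", "transformers", "accelerate"])
print("python", platform.python_version())
for name, version in report.items():
    print(name, version or "not installed")
```

A source install from git typically shows up with a dev version string, which is a quick way to confirm the repo build is the one being picked up. The CUDA build torch was compiled against is available separately as `torch.version.cuda` once torch imports.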
>>
>>107984966
The part about "/idg/" is verifiably true.
>>100605250
The part about ani proposing "local" might be true, I think somebody mentioned that in the sdg trooncord. Might be bullshit too, who knows
>>
>>107985027
that one i don't know. i ran with 121 because it's stable and common
>>
File: 1766943773942581.png (83 KB, 2434x1440)
>>107984727
Threads about local models aren't allowed to be healthy because it's bad for NAI's business. 4chan is all they have.
>>
>>107985016
sdg was never cozy, we wanted to discuss the models, you wanted your cult of personality, anons got to make a choice and now you are pissed they didn't choose your thread.
>>
>>107985020
I like your style, embrace ignorance, wear it as a badge of honor
>>
File: ZIT_00037_.png (1.83 MB, 1024x1024)
I am starting to like this model now. I2i into turbo is totally feasible.

>>107985016
Are you not happy now, anon?
>>
>>107985016
>What kind of retard hypes up AI models? Most people just use the newest one or their favorite until a new one "shadowdrops" or gets leaked.
yeah same I don't get these model x vs model y shitfights
fucking retards
>>
File: 512389471.png (1.52 MB, 832x1216)
>>107984962
I prefer it too, turbo is nice for what it is, but this is looking promising for custom aesthetic tunes.
>>
>>107985047
What cult of personality? We were just a bunch of retards spamming images and shit, it was fun.
>>
>>107984966
The post suggesting it did not attach an image so we have to take him at his word which if you know him means nothing kek
>>
File: z-image_00017_.png (2.49 MB, 984x1440)
>try an unorthodox res
>pic related
hm.
>>
>>107985054
it's just jaded gaymers doing the console wars script
>>
>>107984752
>>107984944
saved, more?
>>
>>107984619
Yes, Opus is unlimited for txt2img at standard resolutions. Huge images or heavy img2img burn Anlas, but the sub already covers what most anons here use.
>>
>>107985058
>We were just a bunch of retards spamming images and shit, it was fun.
You turned it into a massive avatarfag circlejerk. You can always go back, just saying.
>>
File: zippo.png (1.64 MB, 1536x768)
Z-Image nonturbo test.
>it went with a drawn style on its own rather than photorealistic
>adherence to prompt details is better
>variety is better
>some details are off, but I only used 25 steps
>takes 3-5 mins for 768, up from <1 min with turbo
It's slow on my setup, but there is potential. Might still run better than Klein 4B nondistilled for me, due to the latter taking way longer to text-encode.
>>
File: o_00026_.jpg (1.13 MB, 2304x1792)
>>
File: i2iUpscale_00005_.jpg (1.49 MB, 2048x2048)
>>107985051
And the simple zit upscale.
>>
>>107984948
can you kindly provide a guide to setup these samplers sar
>>
I am delighted for the community that it has finally received what it deserves.
A real model for finetuning
>>
>>107985058
>batch spams another 40 out of 150 images
>gm, gn with images attached eating from the image limit because you have to sign your posts
>pointless drama because, as in any thread with avatars, they start posting without gens to insult other avatarfags they don't like, creating the same general feeling of a /vg/ thread that has gone past its first week
yeah for sure man, just posting images. to be this blind you must be from the actual thread, which would not surprise me at all if you are partially responsible for what is happening here.
>>
>>107984963
What are you getting?
>Mediocre nipples and nonsensical skin blobs in nether regions
Just not trained on it. Not extremely difficult to fix.
>Barbie doll body
Censored
>>
File: ComfyUI_temp_eknrp_00034_.png (2.41 MB, 1024x1344)
heun/simple/15s/cfg4
>>
>>107984726
>>107984873
>>107985056
prompt sirs?
>>
>>107985074
The Anlas system is actually fair if you think about it, because you're not paying for compute you don't use, and training a lora locally costs more in electricity.
>>
>>107985099
that israeli guy has 6 fingers
>>
File: 387496342.png (1.29 MB, 1216x832)
>>107984988
In Europe it's common to learn British English instead of American English. Although American media is so ubiquitous that most people still end up with more of an American pronunciation.
>>
File: 1.png (1.42 MB, 1504x1206)
This is a base model. Can you guess which hole the base model goes into?

The training hole?

Or the image generation hole?
>>
>>107985076
I'd say sdg was pretty fun in early 2023. It had some sense of community and didn't turn into a circlejerk yet, it was indeed cozy. Ldg is better than what sdg turned into for sure, but is a bit soulless.
>>
>>107985110
>A real model for finetuning
grim that here there aren't finetuners
>>
File: head spin.gif (259 KB, 250x250)
so... how many days until first illustrious-tier anime finetune of z base drops?
>>
>>107985115
now do non anime slop with those settings
>>
if anything now i noticed some posts from known users of that thread when the spamming was at its worst, as if gloating even.
>>
>>107985127
Did anyone even try genning with Klein 9B base? It looks like shit as well
>>
>>107984737
NAI does, and does it well, without the weird censorship or moralizing.
>>
>>107985136
2-3 months at least, probably more.
>>
for real no ace?
it's the 27th right?
>>
>>107984977
depends on the language
>>
Fuck, I need to hit the sack.

A final empty prompt gen.
>>
File: 1754524708244227.png (16 KB, 443x272)
who's training?
>>
File: 900789543.png (1.48 MB, 1216x832)
>>107985116
>"An impressionist oil painting in the style of Mary Cassatt, depicting a beautiful young mother bathing her toddler. The scene is one of pure tenderness and domestic light. Loose, visible brushstrokes in soft pastels and creams capture the play of afternoon light filtering through a bathroom window, blurring the figures into a harmonious, luminous composition. Focus is on the emotion and the quality of light, not photographic detail."
>"A masterful classical realist painting, reminiscent of Johannes Vermeer's interior scenes. A beautiful woman in a simple but elegant linen dress is meticulously writing a letter at a wooden desk by a window. The Rembrandt lighting dramatically illuminates her face and hands from the left, leaving the rest of the room in deep, rich shadows. Extreme attention to texture: the grain of the wood, the weave of the fabric, the feather of the quill."
>"An impressionistic rendering of a beautiful woman picking tulips in a vibrant spring meadow, dappled with soft light filtering through clouds. The scene is painted in loose, expressive brushstrokes, capturing the fleeting nature of the moment and the subtle interplay of colors—pinks, greens, and blues swirling around her flowing sundress. Her face is partially turned, her features softened and idealized, evoking the timeless charm of a daydream."
>>
are booru tags any good on z image? i don't like writing a novel every time i prompt
>>
>>107985127
Goes in the recycle bin hole desu
>>
How do I reduce the influence of the original image with wan 2.2 i2v?
Is there a node or something?
>>
>>107985136
Let's be real, we have a competent, uncensored, artist friendly service that delivers, clinging to local is retarded
>>
>>107984366
had to reinstall ai-toolkit. AI-Toolkit-Easy-Install.bat somehow borks your installation. now we wait for the delicious cake to bake.
>>
its pronounced jif
its pronounced guhnome
its pronounced guhnu
nerd elitism is insufferable
>>
File: z_imageBASEd_00039_.jpg (446 KB, 1112x1336)
>>
>>107985173
not me.
thx for beta testing btw
>>
>>107985181
what does that even mean?
>>
How can I get that "NAI look" consistently in my local gens? It's something even the best local models struggle with.
>>
File: 1745907666643885.png (2.54 MB, 1287x1257)
>>107985185
>AI-Toolkit-Easy-Install.bat
>>
>>107985113
>pointless drama because as any thread with avatars they start posting without gens to insult other avatarfags they don't like
That was you. And you still do it in this general.
>>
>>107985144
yeah, because I thought it was better. I immediately went back to 9B distill.
>>
>>107984504
>turbo can only do realism
Me when I lie.
>>
>>107985187
what about nginx
>>
>>107985192
I think it was pretty clear.
Basically a way for wan to not stick too much to the original pic you plug in as the first latent for i2v.
>>
why would you need a modern state of the art model for anime? couldnt sd 1.5 do flawless anime already?
>>
>>107985200
stop wasting your time claudefag, come back when the thread is less active so people would actually pay attention to your false flags
>>
File: file.png (54 KB, 256x144)
>>107985214
>>
migrate
>>107985210
>>107985210
>>107985210
>>
Fast thread, only one hour. Let's keep it until page 10
>>
>>107985206
yeah? why not choose the easiest solution for installation
>>
>>107985187
if someone said "jif" in my vicinity I would be morally obligated to stab them in the throat, unless they were talking about peanut butter
>>
>>107985207
i don't have an avatar, i never posted an avatar and now i am pretty sure all of this is your little group on the sdg discord. because on the archive here i am finding the same pattern of posting an image without saying anything as an "i am here" while writing vile nonsense as a "nogen". some of you did not even bother to change the file name pattern or prompt pattern either
>>
File: 4756235114.png (1.53 MB, 832x1216)
>>107985173
Damn, he's already got it up and running? Guess I gotta go make a dataset.
>>
>>107985232
kek
>>
>>107985226
shit resolution
shit vae
shit anatomy
shit multicharacter
>>
>>107985173
>Flex
what model is this
>>
>>107985136
Nobody wants to hear this, but I'm betting on "never".

Illustrious / Noob were $100k+ total training costs, and that's on SDXL which is even lighter than a 2B DiT.
Chroma was 150k and terrible at anime even though it was intended to be able to do it. "muh realism model" is a cope that everyone shifted to.
NetaYume was at least 50k, not even accounting for the OG Neta training, and is still horribly undertrained.
Newbie was 70k and the model is deep fried even if it technically knows a bit more than NetaYume.

You would need AT MINIMUM $200k, and also manage to not fuck it up. It's never happening.
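
To put the $200k figure next to the numbers quoted above, here is a quick sketch. All figures are the poster's own estimates and are unverified; "$100k+" and "at least 50k" are treated as lower bounds, and the dict and function names are hypothetical.

```python
# Training costs in USD exactly as quoted in the post (unverified estimates).
QUOTED_COSTS = {
    "Illustrious / Noob": 100_000,  # quoted as "$100k+", so a lower bound
    "Chroma": 150_000,
    "NetaYume": 50_000,             # quoted as "at least 50k"
    "Newbie": 70_000,
}
CLAIMED_MINIMUM = 200_000  # the poster's "AT MINIMUM $200k"


def budget_multiples(costs: dict[str, int], target: int) -> dict[str, float]:
    """How many times each quoted budget fits into the target budget."""
    return {name: target / cost for name, cost in costs.items()}


multiples = budget_multiples(QUOTED_COSTS, CLAIMED_MINIMUM)
for name, factor in multiples.items():
    print(f"{name}: ${QUOTED_COSTS[name]:,} -> {factor:.2f}x to reach ${CLAIMED_MINIMUM:,}")
```

Every quoted run sits below the claimed floor, which is the post's point: the ask is bigger than anything on the list.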
>>
>>107985036
doesnt work with 12.6, 12.8 or 13
why wouldnt they mention the versions somewhere? what a fucking mess
>>
>>107985263
some version of flux
https://huggingface.co/ostris/Flex.1-alpha
>>
>>107985251
>i don't have an avatar
You do.
https://desuarchive.org/g/search/subject/lmmg/order/asc/
>>
>>107985277
>>107985277
Fennecposter said Tongyi was planning to do their own anime finetune
>>
>>107985325
"Hey boss I wanna spend 200k USD and train our model on the most degenerate porn imaginable which is literally illegal in China btw".

It's never happening. Or if it does it will be censored trash.
>>
>>107985355
Yes he did say it won't be fully uncensored, however I use IL a lot for SFW gens and perhaps NSFW gens will be easy to make with loras, we'll see.
>>
>>107985355
Noobai was trained by a Chinese student.
>>
with all the incessant whining I expected worse, this is the same prompt as the pic above with Euler and shit, its okay for a base model
>>
>>107985401
how does this prompt look on klein 4b base?
>>
Why do you even need more anime models? Can't the existing models already do anime well enough? It can't be as hard to achieve as photorealism.
>>
>>107985277
i'm trying to make a klein anime model with nsfw, so far i've got styles/artists, posing, natural language, danbooru tags (although some rare ones dont work), nudify, and limited sex acts, im still trying to make that work better, granted I only have a varied dataset of 4k images for various tasks
>>
>>107985435
They're fine style-wise and for relatively simple prompts and coom. Unironically SDXL is good enough to be on par with 95% of danbooru slop, but sometimes you want something more out of the model and do some more complicated scenes.
>>
>>107985414
idk don't have klein base, just the distill
beta looks much better than normal
>>
>>107985435
Why do you even need more photoreal models? Can't the existing models already do photoreal well enough? It can't be as hard to achieve as anime.
>>
>>107985458
beta looks more slopped and less movie-like
>>
>>107985435
They are often very generalistic and only know the top 5% popular shit. If you want specific franchises or less known stuff you are out of luck.
>>
>>107985472
idk but it's sharper and less melty than normal. This is ays (still all euler slop tho). Base already knows more than ZIT, base actually has an idea what an M1917 Stahlhelm looks like
>>
still doesn't know what a uboat, especially a type VII, looks like. I am disappoint
>>
>>107985036
>>107985286
with 12.1 and python 3.11 it just gets stuck after loading the pipeline
what a fucking mess this is
>>
1girls aren't too bad either and it has better anatomy than chroma
>>
Am I doing something wrong or is ZIB supposed to take 30+ seconds on a 5090 @ 1024x1472, 30 steps, Euler-A?
>>
>>107983099
sorry I was gone. i run the upstream WF:

https://huggingface.co/lodestones/Chroma1-Radiance/blob/main/Chroma1-Radiance-x0-x32-proto-workflow.json

tensor errors might be because of an older comfyui version, it only got the small code change to do x32 after an update not too long ago.


