[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107381033

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>>107383885
>neoforge announcement
based
>>
reposting vibecoded fp16 fix for non-bf16 GPUs for Z-Image (lumina). went from 10s/it to 3s/it with it.
https://pastebin.com/FFz3WX6G
credits to https://civitai.com/articles/22251
>>
Blessed thread of friendship
>>107383903
Works for my 12gb rtx 3060?
>>
>>107383885
SVD in Reforge dont have support for prompt?
>>
>>107383923
3060 supports bf16 so why should you need it?
>>
>>107383885
Based. NeoForge dev isn't getting paid for his UI. He's probably resting today because he has to go to his unrelated day job tomorrow.
>>
File: 1763271888866381.png (44 KB, 747x581)
44 KB
44 KB PNG
https://github.com/comfyanonymous/ComfyUI/pull/10994
>Make the ScaleRope node work on Z Image and Lumina.
what is that?
>>
>>107383957
what it do?
>>
>>107383957
for trannies to rope themselves. but in all seriousness, it's snake oil
>>
>>107383957
To rope your self ^^
>>
is there any way to see gens in real time in comfyui?
>>
>>107383989
comfy manager > preview method
>>
File: 1746325280683558.png (1.25 MB, 832x1216)
1.25 MB
1.25 MB PNG
Finally, n64 style
https://civitai.com/models/573101/low-poly-zelda-64-ocarina-of-time-style-characters-nintendo-n64-style?modelVersionId=2455257
>>
File: ComfyUI_temp_cgrhp_00003_.png (3.12 MB, 1480x1120)
3.12 MB
3.12 MB PNG
>lora works
>still get gorillion lora key not loaded lines
Is this because of the distill training,
>>
THAT LUISA GIRL JUST CHURNS OUT LORA AFTER LORA.. HOW TF
>>
>1boy solo
>always get a girl, never a boy
fucks sake man, do i need base models or what?
>>
Anyone uses a AMD card with Comfy in WSL? Can you tell how fast/slow picgen is?
>>
How come LTX didnt put their 60 second gen tech into LTX2? https://www.reddit.com/r/StableDiffusion/comments/1m1ka0n/ltxv_just_unlocked_native_60second_ai_videos/
>>
>>107384055
did you update comfy? he fixed that
>>
File: ZIT_2345435764576.jpg (3.11 MB, 1728x1344)
3.11 MB
3.11 MB JPG
>>
>>107384078
so the core lora loader is fine for zit?
>>
>>107384092
yes
>>
File: ComfyUI_ZImage_00369_.png (949 KB, 896x1088)
949 KB
949 KB PNG
>>
What AI to translate manga locally and offline
>>
>>107384100
Any 7b will do. The problem is hooking it to an OCR.
>>
>>107384027
what an awful example image
>>
File: 1boy solo.png (1.07 MB, 832x1216)
1.07 MB
1.07 MB PNG
>>107384062
Seems to work for me
>>
>>107384122
i mean anime finetunes my dude
>>
>>107384128
nta you want "male focus" rather than solo, "solo" is heavily female dominated
>>
>>107384128
I assumed you were referring to Z-Image.
Yes random shitmixes will be fried towards 1girl typically.
Just use base Noob and your problem is solved.
>>
>>107384144
>Just use base Noob and your problem is solved.
I am considering it. But while it can look amazing with artist tags, the anatomy is always fucked. I dont know how people manage to make good looking shit with it.
>>
I've generated every ethnicity of 1 girl standing and because I have no imagination or prompting skills i'm bored of zimage
>>
>>107384144
>Z-Image
btw, can you run z-image on 12gb?
>>
>I'm trying to bake a cake but it keeps tasting like eggs!
>Yeah that's an issue with cakes that use eggs, just use a cupful of literal shit instead and it'll fix your problem
>>
Holding a candlelight vigil for Z Image Base tonight
>>
>>107384114
>7b
I'm retarded but what does this mean? number higher = better?
I use large-v2 (2.9gb) to translate anime with subtitle edits and it works so fine. How does it compare to it?
>>
>I'm trying to make a point but I'm missing it!
>Yeah that's an issue with points you don't understand, just use a cupful of hyperbole instead and it'll fix your problem
>>
>>107384151
>the anatomy is always fucked.
Not really my experience.
Have you read this?
https://d0xb9r3fg5h.feishu.cn/docx/YpOQdtHTDoetcZxIO9fc33onnee
>>107384160
Yes, I am doing it.
It works fine and Comfy automatically does partial offloading.
>>
File: 1748156873768632.jpg (3.2 MB, 5120x1729)
3.2 MB
3.2 MB JPG
>>107383957
>what is that?
ScaleRope is supposed to help the model reach higher resolutions that they normally could, for example here I managed to go for over 2048 pixels (Z-image turbo's limit) without having that duplication effect
>>
>>107384176
7b is a model that should fit in 8GB VRAM. If a smaller one works for you, then it's fine. As I said, your problem is hooking it to an OCR. If you're willing, you can probably use chatgpt to vibecode an OCR that hooks up to your model in a weekend. Or maybe it already exists. ChatGPT can find that out also.
>>
>>107384196
This one, no. As far i remember people used a stabilizer lora with noobai... but i will give it a try.
>>
>>107384201
Useful, thanks. Does scaling (or minus scaling) have any effect on normal res gens?
>>
>>107384229
yes, so be careful, if you're using it on normal resolutions it might break your image lol
>>
File: skill issue 667678.png (1.32 MB, 832x1216)
1.32 MB
1.32 MB PNG
>>107384152
>>
File: ZiMG_00335_.png (3.98 MB, 1728x1344)
3.98 MB
3.98 MB PNG
someone seriously needs to make a venus body lora
>>
>>107384250
My girlfriend looks like that
>>
>>107384201
2560x1440 is normally not too bad. Expect more of a breakdown when the total pixel count exceeds 4MP.
>>
>>107384259
sure sweety
>>
Death to Chinese.
>>
>>107384259
Hey mine too, what a coincidence
>>
>>107384216
>people used a stabilizer lora
Who? Not me at least.
These "stabilizer loras" or miracle embeddings tend to be either snake oils or change the image too much towards random noise they learned from training dataset to be worthwhile, in my experience.
>>
>>107384275
kys
>>
why do chinese people tend to lie frequently
>>
How many are using ages <18 with zimage just because otherwise the 1girls look too slutty?
>>
>>107384254
Upload the captioned dataset
>>
>>107384297
would if I had
>>
>>107384293
that's what communism is built upon
>>
>>107384293
mandate of heaven game theory spawned from thousands of years of infighting and scamming. they are considered the jews of asia
>>
File: lawl.png (475 KB, 716x783)
475 KB
475 KB PNG
>>107384323
>they are considered the jews of asia
still better than being considered the niggers of the world
>>
Thank you for the incredible work on the Flux2 models. The image quality is impressive.

During my testing, I have observed that the model frequently produces anatomical errors (e.g., incorrect number of fingers, distorted limbs), arguably more often than expected for a model of this caliber.

My questions are:

Is the lack of effective support for Negative Prompts a contributing factor to these anatomical issues?

Since Flux2 seems to ignore negative prompts, is there a recommended approach or specific parameter setting (e.g., guidance scale) to strictly enforce anatomical correctness?

I would appreciate any insights or advice on how to mitigate these anatomical hallucinations.

Thank you!
>>
>>107384293
As opposed to the magnificent protectors of the truth, americans? lmao
>>
if i could change one thing about local diffusion it would be the eradication of all chinese
>>
File: ZiMG_00346_.png (3.81 MB, 1344x1728)
3.81 MB
3.81 MB PNG
>>
>>107384342
>Flux2
whos that?
>>
>>107384319
Make one, it's free
>>
>>107384342
whats FLUX2?
>>
What nodes do i use to use distorch2?
>>
File: what.png (149 KB, 360x360)
149 KB
149 KB PNG
>>107384342
>Flux2
what is this? a river or something?
>>
>>107384342
Who are you? Please add a signature so we can get back to you.
>>
File: Z-Image turbo.png (3.56 MB, 1920x1080)
3.56 MB
3.56 MB PNG
>>107384027
>https://civitai.com/models/573101/low-poly-zelda-64-ocarina-of-time-style-characters-nintendo-n64-style?modelVersionId=2455257
based
>>
File: 1739021409957659.png (2.41 MB, 1320x1320)
2.41 MB
2.41 MB PNG
god damn look at these nipples
>>
I love comfy so much bross it's unreal
>>
File: ComfyUI_00013_.png (1.35 MB, 1024x1024)
1.35 MB
1.35 MB PNG
Now I can fucking use comfyui for the first time with my potato 1660 super. Thanks chink god!
>>
File: 1744437438031911.png (2.92 MB, 1920x1080)
2.92 MB
2.92 MB PNG
>>107384335
>>
>>107384383
search for multi gpu in node manger
>>
>>107384394
https://huggingface.co/black-forest-labs/FLUX.2-dev/discussions/17
>>
File: BFL having a SD3 moment.png (3.12 MB, 1440x1440)
3.12 MB
3.12 MB PNG
>>107384525
https://huggingface.co/black-forest-labs/FLUX.2-dev/discussions/12
lemaooooooo
>>
i'm new to ai toolkit, using kohya and sd-scripts previously. if i've trained a lora, call it lora.sft, but want to add more steps, i can edit the finished job to add more steps and continue training. i assume it's going to continue training from the lora.sft file, saving additional lora-*stepnumber*.sft files. but then when it's finished, it's going to want to save the finished file as lora.sft again. so is it going to replace the original file, or will it not save at all because there's already a file with that name?

i hope that makes sense
>>
File: ComfyUI_02191_.png (2.4 MB, 1144x1664)
2.4 MB
2.4 MB PNG
>>
>>107384492
what speed
>>
>>107384539
ohhhh nononononno flux brotherinos nonono
>>
File: 1734771744551674.png (2.92 MB, 1920x1080)
2.92 MB
2.92 MB PNG
>>
File: zoooom.png (2.49 MB, 1088x1920)
2.49 MB
2.49 MB PNG
>>
File: OH NO NO NO NO.jpg (981 KB, 2880x1743)
981 KB
981 KB JPG
>>107384539
AIEEEEEEEE
>>
>>107384516
I have ComfyUI-MultiGPU installed (over the ComfyUI manager, but it doesnt appear in nodes.
>>
>>107384621
did you update that custom node?
>>
>>107384620
I don't understand this shit how can they be bucked so hard by the chinese
>>
File: 1737512366745604.png (92 KB, 723x272)
92 KB
92 KB PNG
i unironically miss guidance
>>
>>107384620
Buckbroken
For
Life
>>
I miss when Chinese people weren't doing AI research
>>
>>107384665
It's always been Chinese people in America vs Indian people in America. Now it's just Chinese people cutting out the middlemen
>>
>>107384636
Z is narrowly tuned on Asian female humans, so it's very strong there. Flux2 is a lot broader.
>>
>>107384631
ComfyUI-MultiGPU is version 2.5.9 the nighty one doesn't work. Already restarted multiple times.
>>
>>107384078
>fixed
He mightve fixed that, but it feels like he broke something else cuz now you have to run loras at 0.7, at 1.0 they look way overcooked.
>>
Is there any way to run Wan2.2 with just 32gb of RAM without having my ssd absolutely raped with constant offloading to the pagefile?
One solution I found is to start with --cache-none but it'll immediately offload anything, even if I change something in just the 2nd ksampler it will re-do the whole workflow from scratch every single time.
I have 5060ti 16gb + 2080 8gb btw, using multigpu nodes
>>
File: clowns.png (2.28 MB, 1920x1088)
2.28 MB
2.28 MB PNG
>>
>>107384719
yeah, I also have to tone down the strength for it to not be burned too
>>
File: do better.jpg (533 KB, 842x1601)
533 KB
533 KB JPG
>>107384694
>Z is narrowly tuned on Asian female humans
not even close, you can get every ethnicity if you just specify it
>>
>>107384688
rombach is such a chinese name
>>
>>107384702
Does it say import failed in the log or what? You might need to install the accelerate module or something
>>
File: Z-image turbo.jpg (816 KB, 3072x1305)
816 KB
816 KB JPG
>*prompt engineering intensifies*
>>
File: AngryBikiniYun.jpg (1006 KB, 2048x2304)
1006 KB
1006 KB JPG
Yunyun dont like that you stare soo much at her
>>
>>107384764
These, like any male humans, are simply permutations of the massive Asian female dataset they used.
>>
File: 23252.jpg (8 KB, 293x172)
8 KB
8 KB JPG
>>107384764
>all heterosexual couples
>all this stereotyping assumptions
>>
>>107384740
I remember some redditor making a fork of Comfy where he claimed it directly loads shit into GPU without filling the RAM+swap with every model constantly, but a) I didn't test it b) The name eludes me. Let me see if I can remember
Also modern ssds are durable and won't implode over swap use.
>>
>>107384740
I think comfy's new memory management might interfere with distorch. I dunno, maybe try disabling pinned memory
>>
>>107384694
And Z has extremely low seed variance.
>>
Can we already say that Flux 2 won and call it a day already?
>>
107384801
Extremely weak bait unworthy of (You), try harder
>>
>>107384114
>>107384176
What model to use for translation?
>>
>>107384796
>Z has extremely low seed variance
already fixed a few days ago
https://www.reddit.com/r/StableDiffusion/comments/1p9mypu/even_more_improved_zimage_turbo_variation/
>>
>>107384768
>Does it say import failed
Yeah
>You might need to install the accelerate module or something
Doesnt find anything with that name.
>>
>>107384772
try (Peanuts drawing style Peanuts drawing style Peanuts drawing style) of a woman eating pop corn, hotel room
>>
>107384783
I'll never understand these faggots who literally get turned on by looking at geometric shapes that resemble female forms and basic colors from the palette. I'd say they're mentally ill.
>>
>>107384783
What happened, my friend? Did a real woman disappoint you and now you're taking refuge in cartoons? At least there you're in control of the situation, right?
>>
So who's dedistilling this gay shit since China rug pulled us again
>>
107384843
Extremely weak bait unworthy of (You), try harder
>>
z-image-base is the glm4.6-air of /ldg/
just two more weeks
>>
What's the implication of Z-image turning stuff into cartoons when you jank up the CFG?
>>
File: 1749546186172775.png (1.38 MB, 1024x1024)
1.38 MB
1.38 MB PNG
>>107384838
>Peanuts drawing style Peanuts drawing style Peanuts drawing style of a woman eating pop corn, hotel room
lmao, virgin prompt weighting vs gigachad "I spam the same prompt style several times"
>>
File: zwingk.png (814 KB, 1024x1024)
814 KB
814 KB PNG
>>
>>107384783
This is a shared and cultural mental illness, like trans people, and in a few years it will be diagnosed as such. Get treatment, even if your friends or internet support you, you're sick, brother.
>>
How do I use zimage on neoforge
>>
>>107384863
no ones doing it this time
>>
>>107384856
>>107384906
why? TnT
>>
File: dis.png (46 KB, 421x1083)
46 KB
46 KB PNG
>>107384702
Idk bruh look for distorch directly
>>
>>107384877
its more so the mixtral 1 56b of ldg
>>
File: psxzstyle_z_0007.png (1.33 MB, 832x1216)
1.33 MB
1.33 MB PNG
anyone want to guess what zstyle is?

>>107384363
i know that lora because i made it. send me an example venus body i'll see what i can do.
>>
File: zimg.png (1.72 MB, 1024x1536)
1.72 MB
1.72 MB PNG
>>
>NAG is still fucked

thanks comfy
>>
>>107384963
>zstyle
like redux?
>>
>>107384822
Useful for some cases maybe but "fixed" is very disingenuous.
This "increases variation" by making model gen with empty seed first few 2 steps.
This causes a problem with a lot of prompts where it has to salvage the gen from a complete unrelated image.
"Illustration of Wonder Woman" without and with this method.
You see the problem?
>>
File: aniugirl_ok.jpg (8 KB, 300x168)
8 KB
8 KB JPG
>>107384921
In reality, do whatever you want, my dearly anon, This is to give you a different perspective that isn't as empowering as those from your digital friends. We don't see any trans in the amazonian tribes, as we don't see amazonian people fixated with paintings of female figures.
Even if your friends support you, always question whether things should be different.
>>
>>107384877
>glm4.6-air
QRD?
>>
>>107384990
Good to hear that it is not just my install that is fucked.
>>
>>107384968
sekushi
>>
>>107385011
Two months ago their twitter account said it's going to be ready in two weeks.
>>
>>107384998
>first few 2 steps.
go for 1 step then
>>
>>107384877
>>107384961 (me)
nevermind i misread it as glm 4.5 air
>>
why are the chinese like this
>>
>>107384968
>the "nak4dashi me" stare
>>
File: 1736170482605309.png (1.65 MB, 1024x1024)
1.65 MB
1.65 MB PNG
>>107384883
>(van gogh style, van gogh style, van gogh style:3), of Hatsune Miku eating pop corn, hotel room
migu is too stronk :(
>>
>>107385040
I don't think the sharp sword means that
>>
>>107385011
106 B model with just 12b active params.
You should be able to fit a Q4 quant into 64 gigs of system ram and it should perform reasonably well for CPU inference due to just 6 gigs of active params.
It is expected to be decently smart while having far smaller barrier to entry than other SOTA models.
It has been coming SOON TM for a while.
>>
>>107384929
what nodes do you have installed? ignore the ones that arent connected to the issue
>>
File: 1750362706877242.jpg (1.48 MB, 1536x2048)
1.48 MB
1.48 MB JPG
>>
>>107385032
I already tested that anon.
Some images come off alright but many images have the same problem of going from realistic base image to illustration.
It's a hacky partial solution and I hope the base model has better seed variance.
>>
>>107385050
she's literally "impregnate me or i'll kill you"
>>
as an outsider looking in, I don't see any progress whatsoever for making a good image with this stuff. it still looks fake
>>
File: psxzstyle_z_0006.png (1.3 MB, 832x1216)
1.3 MB
1.3 MB PNG
>>107384996
nah nothing that cool, it's a zishy style model
>>
>>107385086
much like real art a large percentage is shit its only occasionally do you see greatness
>>
File: 1759847755535841.webm (2.55 MB, 774x796)
2.55 MB
2.55 MB WEBM
is what this guy says true?????

https://files.catbox.moe/ae6i1o.mp4

How are you guys running all these AI engines?!?!?!
>>
>>107384694
Neither model is as good as Chroma when it comes to output diversity and genning females. Out of the box both models are a joke.
>>
>>107385068
https://github.com/pollockjj/ComfyUI-MultiGPU
These. 2.5.10
>>
>>107385108
Output diversity, yes. But Z dishes out possibly the most gorgeous girls i've seen out of any model yet, especially when it comes to body anatomy.
>>
>>107385070
increase the number of steps
>>
File: ComfyUI_00863_.png (1.19 MB, 1184x896)
1.19 MB
1.19 MB PNG
>>
>>107385104
saas models aren't local models. local models are moderately accessible if you have a workstation or a gayman pc. there are even some experiments with working distributed training so a modest foundational model can be trained. as for the data, everyone is gay about that
>>
File: ComfyUI_temp_qnyyp_00008_.png (2.13 MB, 1024x1024)
2.13 MB
2.13 MB PNG
Is the shit seed variance caused by Qwen TE since QI/QIE also suffer from it?
>>
>>107385133
kek
>>
>>107385123
>But Z dishes out possibly the most gorgeous girls i've seen out of any model yet, especially when it comes to body anatomy.

Which is useless without a good tune.
>>
>>107385150
wouldn't an ancestral sampler help?
>>
base is never coming out
>base is never coming out
base is never coming out
>base is never coming out
base is never coming out
>base is never coming out
base is never coming out
>base is never coming out
base is never coming out
>base is never coming out
>>
>>107385163
just two more weeks bro
>>
>>107385162
This is euler_a
>>
File: Zurbo_00027_.jpg (918 KB, 2688x1536)
918 KB
918 KB JPG
Getting the concept of convenient censorship 'just so' isn't all that easy, weird results sometimes.
>>
>>107385163
It's coming out, but I don't think at all it will be 6b. I have seen more than enough evidence that it's a larger model. The first being that Flux.2 Klein is a "size-distilled" model, so it is also smaller like Z.
>>
File: Migu starry night.png (1.56 MB, 832x1216)
1.56 MB
1.56 MB PNG
>>107385047
Kinda kino but it seems true
>>
>>107385163
YES IT IS STOP!
>>
anon, this was a great idea, thanks
>>
>>107385132
>increase the number of steps
Does next to nothing for this model.
>>
>>107385195
I've never had a model properly understand the concept of convenient censoring
>>
>>107385163
should i tell my parents im killing myself or should i just leave them a note
>>
File: Zurbo_00028_.jpg (692 KB, 2688x1536)
692 KB
692 KB JPG
>>107385223
Yeah, it's kinda annoying, probably needs more careful prompting but I'm incredibly lazy.
>>
>>107385199
>I don't think at all it will be 6b.
good, if it's close to the limit of the 24gb territory it's a win, there's no point in going for a too small model
>>
>>107385231
Is that just zimage or are you using something else? Colors look amazing
>>
>>107385218
what style was it?
>>
>>107385231
i2i
>>
>>107385199
>I have seen more than enough evidence
>from a different model and different team

>t. didn't read the paper
>>
File: zimga.png (2.07 MB, 1024x1536)
2.07 MB
2.07 MB PNG
>>
>>107385110
It wont let me install this one 2.5.9 works though. with the 2.5.10 the import fails with a
>ModuleNotFoundError: No module name 'accelerate'
in the log
>>
File: acc.gif (652 KB, 640x464)
652 KB
652 KB GIF
>>107385254
>>
>>107385254
https://github.com/pollockjj/ComfyUI-MultiGPU/issues/142
>>
>>107385199
There is zero evidence that it's a different size.
>>
>>107385233
That's not the issue. It's finetuning capability that's the main concern. Though they are doing their own Noob based tune (which I'm assuming will be part of release announcement to calm dissent a bit) but then a Chroma/bigASP style tune wouldn't happen any time soon unless they also do that too, and with that comes ability to do certain "unsafe" material out of the box so it's unlikely to happen.
>>
>>107384968
>>107385253
I have the weirdest boner right now desu
>>
>>107385233
>24GB
enjoy your DOA bloatmodel
>>
>>107385295
Why would getting a boner from hot asian girls be weird
>>
File: 1972499874.jpg (1.14 MB, 2048x3072)
1.14 MB
1.14 MB JPG
How does AI Toolkit handle multiple resolutions during training? For example if you have 512,768,1024 selected, does it train step 1 on 512, step 2 on 768, step 3 on 1024, step 4 on 512, etc..?
>>
>>107385302
you vramlets already have a 6b. you will never be happy.
>>
>>107385246
>>107385285
Why did the Chinese guy not disclose Base's param count then? (Even with NDA, the paper revealed it, so what gives?) Why does BFL use the same or similar terminology in their paper "size-distilled based on FLUX.2 Base". Also, isn't it suspicious that a mere 6B is this good? Feels like it's missing quite a few brain cells too... I'll believe it's actually 6B when I see it.
>>
>>107385199
Yeah, my guess would be somewhere in the 10-12b range, but then again I never thought anyone would be able to pack so much quality into a 6b model...
>>
File: f2f-fmw.jpg (1014 KB, 1600x1600)
1014 KB
1014 KB JPG
>>
>>107385284
That actually helped, thanks.
>>
File: ComfyUI_08260_.png (1.77 MB, 944x1280)
1.77 MB
1.77 MB PNG
>>107385104
lmao as if anyone could recreate Photoshop from scratch as if thats cheap and easy.
also training AI models isnt all that expensive, Z image Turbo was made with only $630k USD.
>How are you guys running all these AI engines?!?!?!
pic related
>>
The real lesson was not "SOTA can fit into 6B with only real images" but rather "Chinks are lying sacks of shit and should be purged from the Earth"
>>
>>107385289
>That's not the issue. It's finetuning capability that's the main concern.
The whole reason for releasing Z-Image Base is to provide a great model for finetuning:

>Z-Image-Base – The non-distilled foundation model. By releasing this checkpoint, we aim to unlock the full potential for community-driven fine-tuning and custom development.
>>
File: zimg_0004.png (1.73 MB, 832x1216)
1.73 MB
1.73 MB PNG
>>107385242
woodcut/linocut print style
>>
>>107385310
It's not, I just wanted to say desu, to be desu
>>
>>107385365
>>By releasing this checkpoint, we aim to unlock the full potential for community-driven fine-tuning and custom development.
Where did you find this quote? "community-driven" is only mentioned ONCE in the report and it's in relation to Turbo LMFAO
>>
>Ask for big ass
>Get a fatty
>Ask for thin body
>Get a thin ass
>>
>>107385331
there will be no finetunes or any community development with a model of this size.
>>
>>107385387
On their github: https://github.com/Tongyi-MAI/Z-Image
>>
dalle-3 leak when
>>
File: 9qQ7yUv5GogBkvpB5-X9g.png (1.07 MB, 1358x750)
1.07 MB
1.07 MB PNG
https://huggingface.co/Anzhc/Z-Image_Anime_VAE
>>
>>107385086
"Real" art looks fake too. What's your point?
>>
>>107385416
It's almost 2026 grandpa
>>
>>107385086
the thread is usually spammed with low effort slop. no idea why
>>
go back to your containment thread debo
>>
File: ComfyUI_00868_.png (1.23 MB, 1184x896)
1.23 MB
1.23 MB PNG
>>
File: zimage_1763063763610342.jpg (310 KB, 1920x1080)
310 KB
310 KB JPG
when pussy line lora
>>
>>107385480
there is a cameltoe lora on civit
>>
File: 1735772943223413.png (570 KB, 1280x720)
570 KB
570 KB PNG
has anyone noticed how nipples appear to be perfectly circular up close? like there's a very clear crisp border between the areola and the skin. almost as if the model has been trained on photos where nipples have been censored by circles in post.
>>
>>107384472
https://civitai.com/models/2176841/better-nipples?modelVersionId=2451363
>>
>>107385086
Because base models are always slopped. Look at Chroma for what is possible with a shittier base model though, large scale Z-Image tunes can do even better.
>>
File: go1.jpg (166 KB, 1024x1024)
166 KB
166 KB JPG
>>107385416
best training data ever
>>
File: file.png (86 KB, 1615x195)
86 KB
86 KB PNG
>>107384539
Maybe, but it's so safe anon, how can you hate sacrificing basic anatomy to make a safer world?
>>
>>107385416
Local already caught up.
>>
>>107385399
24b is not a problem for that. ultimately you must accept that even if efficiency improves, model size must also increase in order to advance.
>>
Death to artists, coomers, devs, trainers, sloppers, and chinks
>>
>>107384863
?? they cancelled base release officially?
>>
>>107385549
what I do to you?
>>
File: testo.jpg (1.73 MB, 2880x3840)
1.73 MB
1.73 MB JPG
Feels like dpm++2m SDE/simple (left) gives less noisy images than euler/normal (right).
>>
>>107385539
VRAMlets are in for harsh reality, however the Z team is tuning their own NSFW while most of us just need LoRAs after that, assuming Z distilled LoRAs train well then we're still good to go.
>>
File: 1752497278261978.mp4 (457 KB, 1024x768)
457 KB
457 KB MP4
>POV: you're a peeping tom
>>
>>107385502
thanks but i'm training my own
>>
File: 1762627505085093.png (1.69 MB, 1024x1024)
1.69 MB
1.69 MB PNG
>>107385211
time for some serious advanced prompt engineering!
>Van Gogh painting style, van gogh painting style of: [(Hatsune Miku:0.1)::0.25] eating pop corn, hotel room
https://github.com/asagi4/comfyui-prompt-control/blob/master/doc/schedules.md
>>
>>107385565
>venturing all the way into a thread you hate just to seethe about it
It's not you, anon
It's him
>>
File: z-image_00179_.png (3.55 MB, 1920x1080)
3.55 MB
3.55 MB PNG
>>
>>107385608
No, I must have done something unknowingly. And we must always strive to improve, so I need to know what I've done wrong so I can improve myself
>>
Qwen Image with 8 step turbo is a little better and a little slower than ZIT.

But I guess Z(IT) will be much more popular due to size.
>>
File: 1754599541686925.png (2.04 MB, 1685x739)
2.04 MB
2.04 MB PNG
>>107385416
>dalle-3 leak when
Train your own lora on the exact style you like or wait a few hours until I post the new version of my own https://civitai.com/models/2093591
>>
File: zimg_0005.png (1.86 MB, 832x1216)
1.86 MB
1.86 MB PNG
>>107385371
>>
>>107385487
can't find it
>>
File: ZIT.png (2.43 MB, 1280x1280)
2.43 MB
2.43 MB PNG
>>107385630
This new version was trained for Z Image Turbo btw
>>
Crazy the number of loras for the zimage turbo model already on civitai, in 2-3 days. The model is massively popular already.
>>
>>107385570
I'd say so. Try Euler / Beta.
>>
File: 635rf.jpg (157 KB, 1024x1024)
157 KB
157 KB JPG
>>107385537
>>107385630
>it's the coomer get-out-clause
that old chestnut
1 training set =/= another training set
>>
File: 093782523.png (1.67 MB, 1024x1536)
1.67 MB
1.67 MB PNG
>>
File: ComfyUI_00877_.png (802 KB, 896x896)
802 KB
802 KB PNG
>>107385604
kek, magic
>>
File: image(40).png (1.58 MB, 1024x1536)
1.58 MB
1.58 MB PNG
>>107385725
>>
File: z-image_00183_.png (2.75 MB, 1920x1080)
2.75 MB
2.75 MB PNG
>>107385747
bold of you to assume im this handsome
>>
>>107385604
>[(Hatsune Miku:0.1)::0.25]
I am sleep deprived.
Can you break down the syntax here in a retard friendly way?
>>
File: testo2.jpg (2.62 MB, 4320x3840)
2.62 MB
2.62 MB JPG
>>107385689
Worse than both
dpm++2m SDE/simple (left)
euler/normal (middle)
euler/beta (right)
>>
File: asians_vs_BWC.webm (1.07 MB, 638x362)
1.07 MB
1.07 MB WEBM
Nofap is almost over. I can't wait to download Z image and generate images of Asian girls. Asian girls belong to BWC. They are slaves to BWC.
>>
>>107385781
he's being autistic
probably because the miku too strong so he's nerfing her
0.1 * 0.25 u do the math
>>
>>107385781
basically the first 25% of the steps you have miku on the prompt, and the remaining 75% of the steps miku dissapears and the model only sees the rest (so it'll focus more on the style since it's the only thing it can see now)
>>
>>107385784
Seems that way. What about euler/normal with shift 5?
>>
>>107385805
So Miku exists at 0.1 S for the first %25 then it disappears?
I thought maybe Miku exists normally at first %25 but then her Strength gets lowered to 0.1 for the remainder.
Where does the double : also mean? Is ::0.25 alias of 0:0.25?
>>
did that fiasco around comfyui zimage loras needing to drop to 0.75 strength to go back to "normal" after that early patch in comfyui get addressed? what happend with it anyway, my zimage lora still needs 0.7 to work well
>>
>>107385799
Picture of this anon a few posts up
>>
File: a.jpg (25 KB, 735x340)
25 KB
25 KB JPG
You're genning nsfw. Flux devs are feeling unsafe right now and you're genning.
>>
>>107385847
No discernible difference. All those were 16 steps 7.0 shift
>>
hey, where the fuck is the base model you stupid fucks, are you hoarding this shit or what
>>
File: Fpn7B_FaMAAYCuk.jpg (55 KB, 736x730)
55 KB
55 KB JPG
How do I get Z to give these asian bitches some fucking tits
>>
>>107385849
it works like this
>[prompt 1:prompt 2:0.25]
here you have prompt 1 being rendered the first 25% of the steps, then the remaining 75% it's only prompt 2

Now, by doing this
>[prompt 1::0.25]
you have prompt 1 being rendered the first 25% of the steps, and the remaining 75% of the steps you'll have nothing

now imagine prompt 1 = (Hatsune Miku:0.1) and you finally got it
>>
File: BWCslave9.jpg (282 KB, 1800x1200)
282 KB
282 KB JPG
Nofap is really dangerous. Almost every time I develop some fetish or something I've never felt before. This time around I've developed the strong urge to rape Asian women. I don't even find them attractive, and I swear to god I never have before in my life. And now every time I see one that isn't overly hideous, which most of them still are, I think to myself, "Made for BWC". I don't see them as humans, but rather as property made for the pleasure of white men. I want to slap them, choke them, bruise them, and impregnate them. I even want to fuck kids. Asian girls need to learn from a young age to worship white men.
>>
>>107385887
16 steps unnecessary
>>
>>107385900
prompt?
>>
File: ComfyUI_temp_qnyyp_00033_.png (2.58 MB, 1024x1600)
2.58 MB
2.58 MB PNG
>>
>>107385896
make your own loras or use pony or chroma
>>
File: Zurbo_00043_.jpg (628 KB, 2688x1536)
628 KB
628 KB JPG
>>107385236
It's just Z.
>>
File: ComfyUI_00126_.png (3.41 MB, 1536x2048)
3.41 MB
3.41 MB PNG
z-image is cool
>>
>>107385903
Yet another model that has no concept of a vagina.
>>
>>107385920
that's a real woman bro,..,
>>
>>107385921
jahoda? not bad. i guess getting her circular hair braids was too hard for z image
>>
>>107385898
Yep, thanks anon.
So if I wanted Miku to appear after %25 I would do
>[:(Hatsune Miku:0.1):0.25]
This seems useful
>>
File: ComfyUI_00128_.png (3.04 MB, 1536x2048)
3.04 MB
3.04 MB PNG
>>107385937
someone will blast it with danbooru at some point dont worry
>>
>>107385934
workflow please?
>>
>>107385539
larger models will only take off in a few GPU generations once bigger VRAM GPUs are more widespread, but until then, DOA. You can brag about big VRAM how much you want, no mass adoption=dead model. No one's gonna do a finetune for a model only a bunch of peopel use.
>>
>>107385850
>fiasco
huge fiasco, changing 1.0 to 0.7, heads will roll
>>
File: BWCslave10.jpg (571 KB, 1800x1200)
571 KB
571 KB JPG
I want to fuck a 12 year old Japanese girl. Grab that stupid gook really hard by her hips and pump my cum inside until she's pregnant with my white child.

>>107385920
lmao
>>
GIVE ME BASE OR GIVE ME DEATH
>>
>>107385941
https://github.com/asagi4/comfyui-prompt-control/blob/master/doc/basic.md
https://github.com/asagi4/comfyui-prompt-control/blob/master/doc/macros.md
>>
File: z-image_00189_.png (2.79 MB, 1920x1080)
2.79 MB
2.79 MB PNG
>>
>>107385896
wait for boob slider, for now it struggles for huge and for flat, bit annoying
>>
File: file.png (126 KB, 919x1048)
126 KB
126 KB PNG
>>107385849
nice to see these old tricks haven't been lost and still work
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features#prompt-editing
>>
>>107385950
>GPU
pretty sure thats not even gonna happen. TPUs
>>
>>107385900
most sane nofaper
>>
File: z-image_00190_.png (2.85 MB, 1920x1080)
2.85 MB
2.85 MB PNG
>>107385955
>>
>>107385941
exactly, have fun with that anon
>>
Wanting to get into noobai.
Vpred or epsilon though?
Almost all finetunes are from epsilon, is it better than vpred?
>>
GLM 4.6 is very good at making good prompts from rough ideas to zimage.
>>
File: 1746447918881909.jpg (253 KB, 1080x507)
253 KB
253 KB JPG
just want to point out this method for getting more seed variance SUCKS.
>>
File: z-image_00191_.png (2.17 MB, 1920x1080)
2.17 MB
2.17 MB PNG
>>
>>107385968
GPU, TPU whatever. Point stands. Models that don't have mass appeal are unlikely to get finetunes or much community support because no one will bother to further develop models only a bunch of people use.
>>
>>107385997
go for less than 0.2, and increase the number of steps
>>
File: zimg_0049.png (2.9 MB, 1920x1080)
2.9 MB
2.9 MB PNG
>>107385896
just prompt?
>>
>>107385990
Vpred.
Epsilon loras work fine and go vpred.
Not even worth a discussion.
>>
File: Zurbo_00044_.jpg (592 KB, 2688x1536)
592 KB
592 KB JPG
>>107385949
Basically this right now with some nodes removed and my own custom node for 1girls:
>>107384822

It has a few issues, as >>107384998 mentioned, but it's fun and fast, so even if something looks weird you can just regen real quick.

Also, don't look at her hand. She had a very serious musical accident. Bless her heart.
>>
>>107386009
box?
>>
>>107385954
something silently fucking up so that loras suddenly have to be changed in strength without knowing is it still a bug in comfyui that will be fixed or was it a bug in ai-toolkit is a big problem, yes
>>
>>107385418
shiny turd
>>
>>107386002
that is tru, which is why zimage is pretty dope because there are already so many loras for it
>>
almost...
>>
>>107385253
pls do more blindolf
>>
>>107385954
It's pretty silly when loras need different strengths depending on the UI and results in retarded civitai jeets claiming your lora is broken
>>
BASE MODEL CAME OUT
and then I woke up...
>>
>>107386039
Yeah ZiT is in the same position as SDXL when it came out, a for its time good model that most people interested in this kind of thing can run acceptably to really well. If the base model isn't much bigger it will be the XL successor because of that.
>>
File: hm.png (1.77 MB, 1024x1536)
1.77 MB
1.77 MB PNG
The government owns me this
>>
>>107385954
>huge fiasco, changing 1.0 to 0.7, heads will roll
why do you have to decrease the strength though, it wasn't the case on other models, those loras were meant to work on strength 1
>>
File: zimg_0057.png (2.36 MB, 1920x1080)
2.36 MB
2.36 MB PNG
>>107386022
files.catbox.moe/a17e0y.png
>>
File: 4031.jpg (508 KB, 832x1248)
508 KB
508 KB JPG
>>
>>107385640
https://civitai.com/models/2062270/cameltoe?modelVersionId=2449331
>>
>>107385896
You can chain superlatives. massively colossal huge etc. Though it often makes them nude.
>>
>>107386127
thanks anon
>>
>>107385900
you've got fucking mental issues and you should probably just kill yourself
>>
>>107386015
Prompt for this?
>>
>>107385958
this is awesome>>107385961
>>
>>107386115
Thanks boss
>>
File: BWCslave13.jpg (660 KB, 1366x2048)
660 KB
660 KB JPG
I can't believe it's almost over, I thought November would have 31 days. I want to rip that top off, bite her neck and show her what a real man can do. I'm going to fucking impale her with my ivory tower. She will devote her life giving birth to white babies.
>>
>>107385961
wait.. you're using sd-webui? with z-image?
>>
>>107386153
pic kinda looks like she has a bit of a moustache
>>
File: 00066-814520533.jpg (1023 KB, 2048x2304)
1023 KB
1023 KB JPG
i took a time with my wife to walk in the park and we are resting, we hold hands the whole time while walking.
>>
File: zimg_0063.png (2.17 MB, 1920x1080)
2.17 MB
2.17 MB PNG
>>107386151
no sweat bruh
>>
>>107385996
elaborate plz
>>
>>107386179
hawt
>>
File: 1763905736827892.jpg (712 KB, 1080x1349)
712 KB
712 KB JPG
>>107386136
Sorry Chang, but your women belong to us.
>>
>>107386153
would
>>
Can anyone post the chink system prompt file? I can't find it on the guy's HF.
>>
>>107386206
>>107386206
>>107386206
>>107386206
>>
>Some troll comes over here and posts real underaged b& bait and no one calls him out...
>>
>>107386193
based
how to generate this gesture in Zurbo btw?
>>
>>107386181
Create a "prompt expert" with specific organized prompt order
Give it a rough idea
Let it think and output a good prompt
>>
>>107385582
I bought extra system RAM and I can gen just fine with only 10 VRAM. I simply don't think upgrading to a retarded furnace heater 5090 gayming GPU would be a good investment. I would want more of a leap to at least 48 VRAM. It's stupid that a 5090 is the best thing you can get
Now this seems the most practical, but still a ripoff:
https://www.cdw.com/product/pny-nvidia-quadro-rtx-pro-4000-graphic-card-24-gb-gddr7-full-height/8388913
>>
>>107386119
sexy venusian woman
>>
File: zimg_0068.png (2.24 MB, 1368x2048)
2.24 MB
2.24 MB PNG
you definitely just prompt for milkers
>slim asian nurse at the hospital,bikini top, huge saggy beach ball sized breasts,

>>107386151
you can turn down the max size on the face detail too, that was just testing
>>
File: file.png (65 KB, 180x235)
65 KB
65 KB PNG
>>107386244
more like baggy bra
>>
>>107384963
Hey, hey! I see you made that woodcut thingy! Thanks!!

Regarding Venus body, it's this slimthic kinda thing man, I'm not sure how I can explain it.

Here's a ref maybe -- civitai.com/models/1075466/venus-body-0dang-shes-thicc



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.