[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


Janitor application acceptance emails are being sent out. Please remember to check your spam box!


[Advertise on 4chan]


336 of My Gens Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107400410

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z Image Edit
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
is base out already?
>>
killing myself
>>
is there a special way to summon loras per region in krita? if i do it with the menu method theres bleed, if i do it with <lora:blahblahblah> i get "Server error: 'Conv2d' object has no attribute 'temp'"
>>
File: 1745154397632711.jpg (193 KB, 640x640)
193 KB
193 KB JPG
>>
you're delusional if you think the base is coming.
>>
>>107402434
looks nice. a little smol
>>
>>107402435
I think they sincerely intended to until they realized what they had on their hands.
I called it day one.
>>
File: ComfyUI_00045_.mp4 (755 KB, 640x640)
755 KB
755 KB MP4
tongyi my butthole
>>
File: file.png (1.19 MB, 1024x1536)
1.19 MB
1.19 MB PNG
>>
>>107402435
i've been waiting all week to call anons retarded for claiming base is 6B. I guess I'll never get that chance. oh well
>>
Dedistill status?
>>
>When the anons who called you a schizo for saying there won't be a base model don't get their base model.

It's called pattern recognition and this patter was the classic Frodo refusing to throw the ring into the fire maneuver.
>>
File: ZImg_00010_.png (2.64 MB, 1440x1152)
2.64 MB
2.64 MB PNG
>>
File: ComfyUI_00211_.png (1.36 MB, 1504x1024)
1.36 MB
1.36 MB PNG
>>
>here is the base (distillied) for local cucks
>also introducing saas BASE PRO
It's going to happen isn't it
>>
File: ComfyUI_0064.png (1.52 MB, 832x1216)
1.52 MB
1.52 MB PNG
>grab prompt off of civitai
>make the asian 1girl even more obvious
>make her tits way bigger
ideal zimage workflow
>>
>>107402505
zit took everyone off guard, it's not a stretch to think it took tongey off guard too. The days only seem longer because you're anticipating, like a... kid at christmas.
>>
File: ComfyUI_00213_.png (1.94 MB, 1504x1024)
1.94 MB
1.94 MB PNG
>>107402476
>>
File: Ostris.jpg (79 KB, 2266x433)
79 KB
79 KB JPG
Any recommendation for ZIT Ostris settings? Fine details like clothing pattern is lost with the default settings.
>>
>>107402464
i thought we were gunna undistill it first and then dedistill. er-how did it go again?
>>
>>107402476
While I have no doubt that's what they've decided, ella-style, it's an incredibly dumb decision. Illu 3 levels of dumb. The model got hyped because it was faster and smaller. Releasing base would help it entrench, sitting on it will simply force people to keep using zurbo, fin.
>>
>>107402410
Anyone with NAG on Z got a workflow? I can't get anything coherent out of it
>>
The gay ring problem:
_______________

There is a very big and gay ring.

If you put it on, you gain super powers.

But, people will know you wear the huge gay ring.

do you wear the ring?
>>
>>107402482
Funny how she's 20 at most posing as a grandma.
>>
>>107402552
mail order bride, the listing said 52 but they sent a 25
>>
File: ComfyUI_00047_.mp4 (836 KB, 832x640)
836 KB
836 KB MP4
>>
File: ComfyUI_00214_.png (2.31 MB, 1504x1024)
2.31 MB
2.31 MB PNG
>>
>>107402537
It seems like shooting yourself in the foot is a rite of passage for all imagegen companies. Remember when Emad fought desperately to keep SD1.5 out of the public's hands?
>>
Any cool models or nodes like ipadapter where I can combine two images together? be it style, faces, etc
>>
>>107402561
What was the prompt? I can't tell if the shadows imply something
>>
>>107402561
neat, it has 3d generated imagery style shadow errors.
>>
>>107402563
>rite of passage
This wouldn't be the first time. Wan 2.5 was borderline though. While they never outright said they would open source it. They never corrected the people who interacted with it and said it would be open source.
>>
File: ComfyUI_00048_.mp4 (1002 KB, 640x832)
1002 KB
1002 KB MP4
>>107402568
her boobs bounce
>>
>>107402564
Res4lyf has a slew of them with examples.
>>
>>107402581
excellent output
>>
File: 1761119817889337.png (213 KB, 1563x889)
213 KB
213 KB PNG
>>107402546
what is so hard about this
>>
is there a difference between increment/decrement and randomize
>>
>>107402604
are you for real right now?
>>
>>107402522
>CRUMBS ON HIS JACKETSES
>>
>>107402527
just keep it default with cache text embeds and train for longer, like 8000 steps
also switch the adapter version to the new one, change v1 to v2 in input field
>>
>>107402604
the seed (number thing) determines the "gen" you are creating, if you keep everything else the same. that's because it is the seed for the noise, in which the model fantasizes and conjures up the magical abominations that God said you shouldn't be messing with u fool!

MATH! IS! DEMONS!
>>
>>107402618 (me)
also set quanting to - NONE - if you got 24gb
if you got 16, set it to none and set low vram true, if that doesnt fit properly then use that tensor offloading until it does
>>
>>107402604
on a fundamental level no, randomise functions as both increment and decrement, but for those of us with a limited lifespan it's handy to have thsoe options anyway
>>
>>107402628
I know what a seed is. So if I get an output that's close to what I want, I should increment or decrement rather than randomize
>>
>>107402604
>>107402628
also, sequential seeds aren't similar to each other, despite being the next number.

the reason to keep the seed the same is if you are changing other things to see what effect they have. But you should be careful about such comparisons, because sometimes it's a fluke.
>>
>>107402505
A way they can play this without fully destroying their image is to purposely modify the real base into a shittier "base", but hold on to the real base to later call it it Z-Image-Pro on a web service/API
>>
>>107402604
Increment increases quality while decrement decreases quality while randomize randomizes quality of the gen. it's true, trust me, no need to fact check.
>>
File: ComfyUI_00049_.mp4 (1.51 MB, 720x1280)
1.51 MB
1.51 MB MP4
>>107402587
was same prompt, different resolution

this one is 'her boobs jiggle'
>>
>>107402643
Doesn't matter, most people use random. a different seed is a totally different random noise.

but

sometimes you want to keep the seed the same and concentrate on fixing your gen. however, this can be a trap.
>>
>>107402645
actually I am testing this out with Flux 2 right now, and incrementing generates very similar outputs, so its working well. while randomize might generate dogshit again. idk. too much randomness involved
>>
>>107402651
oh no I've been doing it wrong
>>
File: -098235234.png (1.91 MB, 1024x1536)
1.91 MB
1.91 MB PNG
>>
any type of zit lora I try just changes the look of everything. The loras don't mix at all if you're trying to maintain a consistent character. They change the look of all characters the model already knows too.
>>
Is there any correlation between nearby seeds and similar output? There wasn't in the 1.5 days but I haven't actually checked for any other model.
>>
>>107402664
Sorry to disappoint, anon. adjacent seeds aren't related at all in terms of output.
>>
File: combined_0086.jpg (1.22 MB, 2040x3838)
1.22 MB
1.22 MB JPG
>>
>>107402679
nope. not at all.

666 is a dangerous seed, don't use it.
>>
>>107402579
This time it hurts more not because of a potential language barrier confusing the messaging but how many times they mention or allude to "open-source". Their github quite literally states, under Base "By releasing this checkpoint, we aim to unlock the full potential for community-driven fine-tuning and custom development."
>>
>>107402456
Base is for sure 6b, exact same architecture as Turbo. The researchers made a big deal about how cheap the model was trained. Why then, would they train a larger base model, then train a smaller model distilled from the big one (but still trained from randomly initialized weights)? Versus just taking the base model as-is and distilling it against itself as a teacher model (exact same thing Flux Schnell did). The latter is way more efficient and less training resources.
>>
>>107402604
No difference. Main benefit of incrementing/decrementing seed is you can recover a gen if there's some mishap.
>>
>>107402679
Yes your favorite number will give the best gens, rigorously proven with years of experimentation
>>
>>107402694
Maybe flux is better unquantized.
>>
>>107402618
>>107402632

Thanks... But that will have to cook overnight to validate if it works. I am feeling a little chilly anyway, no need for a heater.
>>
>>107402724
oh cool be sure to publish your findings
>>
>>107402527
Set steps to 1000.
>>
>>107402643
Use the Inspire Ksampler with variation seed.
>>
>>107402701
The blackest black pill is that there was no language barrier and this is just Chinese culture.
>>
File: ComfyUI_00218_.png (1.9 MB, 1504x1024)
1.9 MB
1.9 MB PNG
>wikipedia copy paste slop prompting
>it looks better than any game I've played
>>
>>107402679
No. Think of seeds as unique hashes. Be aware some samplers like Euler A add noise which makes using seeds pointless.
>>
>>107402745
True.

But we win, they gave away too much for free. We are overcoming :^)
>>
do you think alibaba can pull one more grift or will it be a different chang that swindles anon next time
>>
File: combined_0106.jpg (1.09 MB, 2040x3839)
1.09 MB
1.09 MB JPG
>>107402714
All unquantized (including the VLM)
>>
>>107402759
why can't you control ancestral noise?
>>
So can I use zit to make porn yet?
>>
I don't want to alarm you but there might be a serb or serbs in this thread.
>>
>>107402771
what's the captioning node? I like it.
>>
File: WanVideo2_2_I2V_00580.webm (2.6 MB, 608x1056)
2.6 MB
2.6 MB WEBM
>>107402670
>>
File: combined_0018.jpg (1.08 MB, 3845x2040)
1.08 MB
1.08 MB JPG
>>107402786
The original images were bulk captioned with a python script, and the result was fed into the "Prompt Cycler" node in ComfyUI.
>>
>>107402694
Is that a real place at the top or just a concept? Looks neat. Also zimage did a good job, damn.
>>
File: ComfyUI_276870_.png (1.01 MB, 1024x1024)
1.01 MB
1.01 MB PNG
>>
supports whatnow?
>>
>>107402810
>ldg taking on the 1124lb world record deadlift
>>
>>107402819
>Qwen3-VL-32B-Instruct
>66.7 GB
I only have 64gb of system ram. I have 16gb of vram, so I guess theoretically I can run it?
>>
>>107402842
where the fuck is Kandinsky faggot?
>>
File: WanVideo2_2_I2V_00581.webm (2.37 MB, 608x1056)
2.37 MB
2.37 MB WEBM
>>
>>107402590
Not working well with custom seed variation workflow
>>
>>107402894
he grabs her hair and drags her back to mordor for some bodymods :)
>>
File: ComfyUI_000012_.webm (815 KB, 960x720)
815 KB
815 KB WEBM
>>107401984
someone's cranky
>>
>>107402837
It was concept art for the "International Chengdu Global Center" which is like a Las Vegas-style arcology.
>>
>>107402909
Can you make it where the animals on the wall are trying to talk, but they totally ignore them, then one of them shoots them?
>>
>>107402658
>most people use random
Lifehack: using increment willl let you rewind without reloading wf because the seed you've missed but decided to revisit is just a click away.
>>
When do you think reddit is going to come to the same conclusion we all have? The base model is not coming.
>>
>>107402897
What is 'custom seed variation wf' even is?
>>
>>107402985
NTA but why not reload the workflow from the queue?
>>
>>107402915
This is so busy, noisy and jpegified it looks like this was genned with zurbo, not the other one.
>>
>>107402779
no
>>
>>107402987
You care about what they think?
>>
>>107403013
I mean kind of. Like I enjoy seeing them seethe and cope. I think that counts as caring in a way.
>>
Is there some new tech like ipadapter available?
I just remembered this ancient tech from the 1.5 days.
>>
Why are they just shifting meaningless stuff around on their github and huggingface
>>
so glad I bought 64 gigs of RAM for $200 several weeks ago. greatest purchase i ever made
>>
File: WanVideo2_2_I2V_00582.webm (2.13 MB, 1056x608)
2.13 MB
2.13 MB WEBM
>>
File: 322222222.png (779 KB, 1136x577)
779 KB
779 KB PNG
Will the training go well?
>>
>>107402999
I do both, but most of the time rewinding seed is faster. And sort of allows for internal pipeline separation when working on complicated workflows: rewinding seeds most of the time and reloading from queue when you're lost and need to get back to things that worked.
>>
File: ram.png (60 KB, 740x668)
60 KB
60 KB PNG
>>107403028
>>
File: 1752736896048504.png (1.01 MB, 1024x1024)
1.01 MB
1.01 MB PNG
>>107402842
>Ovis
>>
File: 1748388135514688.jpg (221 KB, 599x681)
221 KB
221 KB JPG
ComfyUI officially declares ZIT a "game changer", which indirectly implies it is superior to Flux.2
>>
>>107402985
life hack: if you use a random seed, then you can't be known to have generated adjacent seeds.

This means that even if an adjacent seed violates Iranian law, you won't be punished for something they can't prove you genned.

example.

"pretty lady"
in chroma hd

you gen seed 10001
you post it
you gen seed 10002
you post it
you gen seed 10003
it shows a booby. you don't want to go to jail in iran, so you don't post it.
you gen seed 10004
you post it

you will now be visited by the Iranian secret police.

if only you'd used RANDOM SEEDS
>>
>>107403036
smoke?
>>
>>107403050
all the competition except maybe wan is so bad its unreal
>>
>>107402999
because comfy is a fucking piece of shit and randomly decides you can no longer do that
>>
File: missile.jpg (56 KB, 565x462)
56 KB
56 KB JPG
>>107403067
>>
File: ComfyUI_00221_.png (2.28 MB, 1504x1024)
2.28 MB
2.28 MB PNG
>>
>>107402992
https://www.reddit.com/r/StableDiffusion/comments/1p94z1y/get_more_variation_across_seeds_with_z_image_turbo/
>>
>>107403045
>>107402842
I love the reference.
>>
File: coreldraw 5.25.png (1.98 MB, 1199x1599)
1.98 MB
1.98 MB PNG
>>107403101
>>
>>107403109
kek
>>
why is everyone on civitai training their z loras on pony outputs with tag prompts?
>>
File: ComfyUI_00051_.mp4 (1.9 MB, 1280x720)
1.9 MB
1.9 MB MP4
>>
>>107402842
so yeah, I don't see the ovis image support.
>>
>>107403164
First time?
>>
>>107403169
https://github.com/Andro-Meta/ComfyUI-Ovis2
found this, but I doubt it can load the new one.
>>
>>107403164
That sounds retarded, how are you even supposed to prompt for something like that?
>>
>>107403164
tag prompts with many images add seed variance back to the model
>>
>>107403165
>advisor: we might have a vacancy for you, its volunteer wor-
>woman: no thanks
>advisor: ..at a school
>>
>>107403284
women should work at home. They can setup to sell stuff they make, and they can buy property.

Married women should cover their heads when out of the house, and should always obey their husbands and be cheerful in all things as possible.
>>
am i tripping balls or raising ModelSamplingAuraFlow value fucks loras up?
>>
>>107403061
That's the weirdest train of thought I've read today.
>>
>>107403284
yes Gennie, you do have to work even when you're on your period

"fuck that, i won't do it, you stupid bitch, i'm outta here"

*sigh*

welcome back to the table Gennie

"im sorry, hormones"
>>
>‘Post-Avatar depression syndrome’: why do fans feel blue after watching James Cameron’s film?

Didn't take long, and ai can cure it.
>>
>>107403032
make him shove her down and sit down in her chair
>>
File: ComfyUI_00002_.png (2.18 MB, 1504x1024)
2.18 MB
2.18 MB PNG
>>
>>107403330
Raising shift increases the scheduler curve's slope. So what's your scheduler (actually, don't, answer, just plot them yourself).
>>
>>107403061
if you get nsfw output you probably were prompting something riskey anyway, so straight to jail
otherwise for this to be a problem, you would need the future agi to turn into a basilisk that will bruteforce generate all images in accordance with the rest of the settings that would influence the output for every single user/gen found online

you would basically need to be posting on an account/ip tied to you irl known to the government, and also at some point in that chain on posting images you would have to post that exact workflow for those sets of images once so they have the setup parameters, and then you would have to post 2 more, and probably many more gens before they can do this
and even then you have a free option to after genning a boob switch to random noise and keep posting like it was a unrelated descision you made for no particular reason
>>
File: ComfyUI_00053_.mp4 (1.49 MB, 1280x720)
1.49 MB
1.49 MB MP4
women discussing the largest corporate merger in history
>>
File: WanVideo2_2_I2V_00583.webm (2.53 MB, 1056x608)
2.53 MB
2.53 MB WEBM
vfx artists in trouble.
>>
>>107403454
kek, how'd you do that
>>
File: ComfyUI_00003_.png (2.02 MB, 1504x1024)
2.02 MB
2.02 MB PNG
>>107403418
>>
>>107403442
>output you probably were prompting something riskey
nope, noob
>>
>>107403503
then you were at best using a model you knew had women in it and wasnt as safe and effective as SD3 was, so straight to jail
>>
>>107403460
A hole appears beneath the women and they fall into the flames.
>>
The Iranian/Turk is fine, he took my advice and uses RANDOM SEEDS.
>>
>>107403528
Furk?
>>
File: ComfyUI_00004_.png (1.92 MB, 1504x1024)
1.92 MB
1.92 MB PNG
>>107403495
>>
Now that you've had a chance to decide, which race do you prefer?

Yellow race:
>>107403418

Red race:
>>107403495

Black race:
>>107403543
>>
I like all the races. I like forests, deserts, and the seaside. It's hard to pick a race.
>>
>>107403061
lmao you're funny nameGOD
>>
What the fuck is this namefag's problem?
>>
File: 1764657755.png (1.67 MB, 1024x1024)
1.67 MB
1.67 MB PNG
>>
File: 1739903305934102.png (10 KB, 326x152)
10 KB
10 KB PNG
If my dataset images are mostly 1280*1280, is it worth also scaling to lower resolutions during training in ai-toolkit? i think that will scale all images to all those resolutions and train on those too, but is this better than just training at 1280*1280?
>>
>>107403613
lol, I don't care. His posts are mostly just noise to me. Rather have him than the schizo who samefags anonymously to shill his broken software.
>>
File: nwords_.jpg (837 KB, 1456x1840)
837 KB
837 KB JPG
praying for nsfw and danbooru tune of z-image
>>
>>107403635
There won't be a base model so no tune unfortunately.
>>
File: 1764654676.png (1.66 MB, 1024x1024)
1.66 MB
1.66 MB PNG
>>
>>107403621
If it's a tiny dataset since most lora datasets are, consider mix and matching between downscaling and crops instead. The goal is for the model to generalize better, it does it not just with scale but with different composition as well. Also, training the entire lora on 1280 is overkill. It will simply be faster with large batches relegated to lower res.
>>
>>107403637
That never stopped Lodestone, you know (should have, but it didn't)
>>
File: 00025-2684646096.png (2.25 MB, 1920x1080)
2.25 MB
2.25 MB PNG
>>
>>107402909
>>107402927
>>
>>107403667
I 80 images, and dont mind the time cost to train everything at max quality. But if downscaling helps the model generalize better, why isn't 256*256 also enabled by default? Is it always better to also do 512*512 at least?
>>
>>107403695
>I 80 images
I got 80 images
>>
Qwen just updated their app for image editing. Probably means Qwen Image Edit 2511 is coming soon
>>
File: 00028-1871753202.png (1.95 MB, 1080x1920)
1.95 MB
1.95 MB PNG
>>
>>107403695
>Is it always better to also do 512*512 at least?
Yes, most models train the bulk of it on 512, so you won't make a dent if it doesn't know 256
>>
File: chroma.png (3.06 MB, 1024x1536)
3.06 MB
3.06 MB PNG
>>
>>107403708
>so you won't make a dent if it doesn't know 256
you mean it wont matter if it doesnt know 256?
interesting, ill train it at all resolutions from 1280 to 512 then
>>
>>107403707
Can you make bokeh look like a real DSLR instead of this iPhone blur?
>>
File: 1764659026.png (1.71 MB, 1024x1024)
1.71 MB
1.71 MB PNG
>>
File: 00042-2833730172.png (2.24 MB, 1080x1920)
2.24 MB
2.24 MB PNG
>>
File: 00044-2214191117.png (2.17 MB, 1920x1080)
2.17 MB
2.17 MB PNG
>>
File: ZImg_00012_.png (2.14 MB, 1440x1152)
2.14 MB
2.14 MB PNG
the rt. hon chantelle
>>
File: ComfyUI_00007_.png (2.23 MB, 1504x1024)
2.23 MB
2.23 MB PNG
>>107403543
>>107403549
(or, if you are into bestiality, a subhuman BR*WN)
>>
File: 1764659927.png (1.7 MB, 1024x1024)
1.7 MB
1.7 MB PNG
>>
so it's over? No base?
>>
>less than a week from a model release
>IT'S OVER NOTHING HAPPENED WE GONNA STARVE
>>
File: ZImg_00031_.png (2.21 MB, 1440x1152)
2.21 MB
2.21 MB PNG
>>
File: file.png (637 KB, 1121x830)
637 KB
637 KB PNG
bros im trying nag out (using blue sky as nag) but I cant seem to get this shit working, are my nodes connected good? im using the eulerflow scheduler
cfg 1, shift 3
>>
File: 1753613476944086.png (3.91 MB, 1024x1536)
3.91 MB
3.91 MB PNG
>>
File: ComfyUI_00008_.png (1.28 MB, 1504x1024)
1.28 MB
1.28 MB PNG
>>107403945
Another asian 1girl has been defeated by the CROSS.
>>
If I want the style of an image but have full control of the character, its pose, scene composition, angle etc, without the use of a lora, what's my options in comfy?
>>
>>107403962
Nanobanana api node.
>>
https://www.reddit.com/r/StableDiffusion/comments/1pc2enz/z_image_turbo_controlnet_released_by_alibaba_on_hf/
uh oh. controlnets instead of edit model. bad sign.
>>
File: ComfyUI_00009_.png (1.59 MB, 1504x1024)
1.59 MB
1.59 MB PNG
>>107403961
boom. asians defeated again.
>>
>>107403977
>no comfy
>>
File: heh.jpg (20 KB, 317x265)
20 KB
20 KB JPG
>tell the llm to describe this womans naked body
>also tell it to keep the image sfw
>it gaslights itself that the image isn't of a nude woman
>>
File: ComfyUI_00010_.png (1.28 MB, 1504x1024)
1.28 MB
1.28 MB PNG
>>107403982
It's an exorcism.
>>
It's amazing to watch the latent try to be a gook, but the crosses keep banishing the demonic race.
>>
File: ComfyUI_00011_.png (1.16 MB, 1504x1024)
1.16 MB
1.16 MB PNG
>>107404009
nice try, GOOK-AI
>>
btw, to unburn these, I can just have more iterations. idk how many are needed, but these are 9, and usually I do 80 if I want an unburned look, when I'm fighting the gook devils.
>>
>generate a 1080x1920p image with zit
>32gb vram usage 100%
>>
File: ComfyUI_00014_.png (1.77 MB, 1504x1024)
1.77 MB
1.77 MB PNG
>>107404050
HEY YELLOW YIDS!
>>
the reality is they were only targeting the asian market, and so they intentionally were trying to patch whites out of the defaults.
>>
File: ComfyUI_00015_.png (1.78 MB, 1504x1024)
1.78 MB
1.78 MB PNG
>>107404140
>>
File: ComfyUI_00016_.png (1.84 MB, 1504x1024)
1.84 MB
1.84 MB PNG
>>107404181
>>
This from the op are the optimal settings for nag?
>>
File: ComfyUI_08370_.png (3.12 MB, 1280x2048)
3.12 MB
3.12 MB PNG
sooo where is the Base model?
>>
Is this thing dead or something?
The models are in the correct location and named properly.
>>
Where base now?
Where?!
>>
File: ComfyUI_00021_.png (1.78 MB, 1504x1024)
1.78 MB
1.78 MB PNG
the blank input for 3 steps guys are right.

https://www.reddit.com/r/StableDiffusion/comments/1p94z1y/get_more_variation_across_seeds_with_z_image_turbo/

I didn't use the wf, because I already know how to chain.
>>
File: ComfyUI_00022_.png (1.89 MB, 1504x1024)
1.89 MB
1.89 MB PNG
>>107404381
But also

I am blessed by the CROSS which SLAYS evil asian sluts.
>>
the nameGOD knows how to chain guys
>>
I'd rather chain girls
>>
>>107403900
there are months where nothing happens and there are days when months happen
>>
/ldg/ - local degenerates
>>
>>107404381
Your gen and his have high pass effect in common, though. Which is weird, because in my experience, z isn't prone to falling into high pass when messing with denoise levels. It is, unlike qwen, very high-pass tolerant (thankfully).
>>
noobs who can't adjust curves, need ai to do it for them.
>>
one thousand anons just got TOLD
[x] tolderino
>>
File: ComfyUI_00026_.png (1.72 MB, 1504x1024)
1.72 MB
1.72 MB PNG
noobs, man
>>
https://huggingface.co/alibaba-pai/Z-Image-Turbo-Fun-Controlnet-Union
>>
>>107404461
Hmm. Releasing another tool for the turbo model instead of the base. Interesting. Let me plot it on my chart.
>>
File: ComfyUI_00027_.png (1.81 MB, 1504x1024)
1.81 MB
1.81 MB PNG
>>
>>107404471
https://search.brave.com/images?q=round+tuit&source=web
the chart
>>
>>107404476
>>107404460
Are these with NAG?
>>
File: ComfyUI_00028_.png (2.07 MB, 1504x1024)
2.07 MB
2.07 MB PNG
>>107404485
no, the burned look is from cfg, which isn't necessary for this prompt. actually, more steps will make it much less burned. Here it is with neither cfg nor nag.
>>
>>107403050
You fools. The interest outmatched their expectations and they now decided to keep the better stuff behind lock and API key.
>>
>>107404506
>the burned look
Glad you picked up what I'm putting down.
>>
>>107404519
They literally just released a control net. Have patience you zoomer.
>>
>>107404519
I've been sounding the alarm bell for a few days now. They will not release the base model.
>>
>>107404528
Do you know what wan did before they pulled up the video ladder? Released wan animate. This is a consolation prize.
>>
File: ComfyUI_00030_.png (1.81 MB, 1504x1024)
1.81 MB
1.81 MB PNG
>>107404506

>>107404522
I'm kind of into the burned look.
>>
>>107404447
>Needing to postprocess his gens in Photoshop
Ngmi
>>
File: ComfyUI_00032_.png (2.23 MB, 1504x1024)
2.23 MB
2.23 MB PNG
>>107404553
>>
>>107404471
They're waiting for the hype to die down a bit so they can release base or edit to get it back up again, prolonging the period of time that they are relevant. Relevance leads to active discussion, which results in more exposure.
>>
>>107404565
git gud
>>
>>107404531
>base was supposed to come out last week
>bghira warned the chinese government
>orders to censor the dataset
>they are now rebaking base from scratch in a panic
>it will be offered api only so they can monitor and censor all the prompts
>damage control is starting
expect to see more and more "who needs base? turbo is all you need!" posts.
>>
>>107402646
it's alibaba to decide
and you know their decision for wan 2.5
>>
>>107404567
>>107404553
don't post every shitty output. maintain thread quality
>>
I want Z Base to be SaaS, does that make me a bad person? I think not
>>
>>107404602
this. it's only local until it's good. alibaba saw that it was a turbo 6b model and assumed it was generic slop, even the chinese leaker guy said it was only good at realism and wouldn't be as good as flux but rather an option for those who want speed.
now they see how popular it is and, just like with wan, want to lock it behind and api and monetize it through partner shilling with comfy API, which is why comfy is already mentioning "local and cloud" in every post about z-image. normalcattle will hear about 'uncensored lightning fast z image', search it, immediately see comfyorg results, and subscribe to the pro api thinking it's what everyone was talking about all along.
>>
>>107404631
Ever read Art Forum?
>>
Is there model good and cheap enough to compete against Gemini 3? They can release a good enough model for free to piss in Google's well though.
>>
>>107402410
>Z Image Edit
It's just called Z-Image, retard
>>
>>107404659
well screenshot it, and when it releases you can z image edit it.
>>
>A hybrid cross between a rat and cat.
Tried one of those meme animal prompts. Every other model seems to get at least some idea first try. Tried many gens on Z.

>A half cat half rat hybrid creature
>A hybrid of a rat and cat.
>A hybrid fusion between a rat and cat.
etc.
lol
>>
is there lightx2v lora for flux 2?
>>
was wan2.5 even a success? no one in the west uses it. why would they pull the same trick twice.
>>
File: ComfyUI_00035_.png (1.99 MB, 1504x1024)
1.99 MB
1.99 MB PNG
be sure to share something today!

#growth #empathy #lovethumpshate
>>
>>107404679
none succeeded, though.
>>
>>107404679
>sd 1.5
sovl
>>
File: ComfyUI_00351_.png (1.62 MB, 1432x1056)
1.62 MB
1.62 MB PNG
>>
File: ComfyUI_00353_.png (1.63 MB, 1432x1056)
1.63 MB
1.63 MB PNG
>>
File: ComfyUI_00355_.png (1.63 MB, 1432x1056)
1.63 MB
1.63 MB PNG
>>
File: 00043-39392.png (1.74 MB, 1664x2432)
1.74 MB
1.74 MB PNG
Hello real general~!
The new ZiT Controlnet is very good, did you guys used it?

https://huggingface.co/alibaba-pai/Z-Image-Turbo-Fun-Controlnet-Union/tree/main
>>
>>107404774
We are back anime bross!
>>
>>107404774
arigaToT
>>
>>107404774
thank you ^^
>>
Is there a local model or workflow that takes both starting and ending frame to generate a video transition between them?
>>
>>
Can you inpaint without issues with zimage yet?
No? Still ass?
>>
how do I make parts of the workflow run depending on a boolean?
my use case is prompt rewriting, I'd like to disable it on the fly, and I want to keep the logic inside the subgraph, so externally I'd just see a boolean toggle.
I was trying some logic if nodes, but the true/false shit still requires stuff before it to be processed.
>>
>>107404863
What issues are you having? I had no problem personally.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.