/g/ - Technology


Thread archived.
You cannot reply anymore.




Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107495506

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI:https://github.com/comfyanonymous/ComfyUI
SwarmUI:https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo:https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next:https://github.com/vladmandic/sdnext
Wan2GP:https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta:https://rentry.org/localmodelsmeta
Share Metadata:https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks:https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt:https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin:https://github.com/Acly/krita-ai-diffusion
Archive:https://rentry.org/sdg-link
Bakery:https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: z-image_00155_.png (1.7 MB, 1024x1024)
why does zimage look like balls sometimes
>>
>>107499062
why continue this farce?
>>
File: Barbarian II AI 02.mp4 (798 KB, 640x640)
Shiny disco ball.
>>
File: z-image_00751_.png (1.66 MB, 2048x1152)
>>
>>107499080
>why continue this farce?
Why are you lurking in /ldg/ threads?
>>
the amount of slop LORAs on civitai is at an all-time high. it's almost impossible to find good LORAs just by browsing there now. chroma especially is being spammed with pointless fucking LORAs.
>>
File: z-image_00754_.png (1.76 MB, 2048x1152)
>>
File: 1737308932818930.png (1023 KB, 896x1184)
>>107499062
>https://rentry.org/animanon
>>
>>107499092
>Why are you lurking in /ldg/ threads
Social media is garbage that spies on you, YouTube is garbage that spies on you, I don't have baldurs gate 3 downloaded yet so here I am NIGGER
>>
>>107499116
have you found any good loras? i've seen some really bad ones
>>
File: kk.jpg (110 KB, 1024x1024)
>>107499116
people don't all gen what you gen? what a surprise.
>>
File: ComfyUI_00321_.mp4 (891 KB, 640x640)
>>
>>107499116
train whatever you need yourself. i couldnt even imagine using a lora where i dont know the training data and captions
>>
>>107499116
Everybody's LoRA sucks except mine
>>
File: z-image_00758_.png (1.91 MB, 2048x1152)
>>
File: z-image_00760_.png (2.9 MB, 2048x1152)
>>
>>107499157
I looked at my chroma loras folder. the best ones were all from catboxes posted on this general.

here's one anon's dalle3 style lora for chroma.
https://files.catbox.moe/l713nm.safetensors
>>
Tongyi-MAI anus
>>
File: 1750751505473811.jpg (2.91 MB, 2048x2064)
>no z image base
>no wan 2.5
>no new qwen edit even
>not even ltx2
>>
I warned you about Chinese culture.
>>
>>107499135
Piss off somewhere else that isn't a farce then.
>>
>>107499127
So you really should just stay in Discord. You are as much of a problem as these namefags are.
>>
>>107499270
>not even ltx2
Chinese Culture might lie about giving you things, but you don't have to worry about that with Israeli Culture

>>107499281
Oh I wasn't that anon I haven't genned in a few days because I've been too busy but even if I wasn't apparently the evasi0n site was down so it wouldn't have mattered
>>
File: z-image_00167_.png (997 KB, 1024x1024)
>>
>>107499220
if you gen realism, you can tell if a lora on civitai is trained on synthetic data just from the thumbnail. And if you're unsure, just look at the prompts used and they'll inevitably be tags rather than natural language
>>
I'm done with this shit. I'm never going to goon to ai ever again
>>
File: cat.png (660 KB, 697x507)
What image would you show to people who still think AI image gen looks like this?
>>
>>107499397
One of your selfies.
>>
File: 1760244296528663.png (2.39 MB, 1152x1728)
>>107499397
this one
>>
>>107499397
Nothing, AI has been a pretty good litmus test of who is capable of taking in new information and who isn't. It seems that there simply are people who can't update their model of the world.
>>
>chinese tongyi mai ayynuss
>>
File: 1752491355756902.png (2.48 MB, 1152x1728)
>>
>>107499333
>the evasi0n site was down
anon there's 'p in your browser cache, the feds are gonna be knocking any minute now. Enjoy prison, stalker child.
>>
>>107499428
>lol it can't do will smith eating pasta AI is a nothing bu-ACK
>lol we will never have local sora
>lol AI will never replace us in j-ACK
>lol more cope from the AI crowd

its all so tiresome, good news is we are way ahead of those faggots same as we were when they called us anoraks for even using the net back in 1994 etc.

Those fucks will be so far behind
>>
>>107499116
If a lora has a low amount of sample images from the creator, chances are the lora is bad.
>>
>>107499127
You're an idiot, we've all figured out that you're the same schizo that's been shitposting for a while pretending to be Ani
>inb4 now im julien posting
And that's your circular reasoning tactic for trolling, you samefag and reply to yourself, pathetic, buy a life man, you're doing circus acrobatics for an empty audience
>>
File: Wanimate_00141.mp4 (1.07 MB, 720x720)
>>107499081
Thoughts?
>>
https://www.bbc.co.uk/worldservice/learningenglish/music/retroenglish/
>Geeks, nerds, anoraks

And they still hate the same people today because they are too retarded to read one page of instructions for more than 1 hour to fully understand. Nothing has changed, nothing!
>>
when i wake up tomorrow, the base model will still not be out,
>>
File: z-image_00765_.png (2.57 MB, 2048x1152)
>>
File: file.png (512 KB, 495x635)
jeets should stick to tutorials
>>
File: z-image_00767_.png (2.81 MB, 2048x1152)
>>
>>107499580
The current situation is as good as it will ever be. It only gets worse.
>>
>>107499071
Short story: probably not enough denoising effort for the given prompt. Try increasing ModelSamplingAuraflow shift slightly.
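For anyone wondering what the shift knob actually changes, here is a rough sketch. It assumes the SD3-style time shift, sigma' = shift * sigma / (1 + (shift - 1) * sigma), which as far as I can tell is what ComfyUI's flow sampling nodes apply. The function names, the linear base schedule, the step count and the shift values are made up for illustration and are not taken from ComfyUI.

```python
def shift_sigma(sigma: float, shift: float) -> float:
    """Remap a sigma in [0, 1] with the SD3-style time shift (assumed formula)."""
    return shift * sigma / (1.0 + (shift - 1.0) * sigma)


def shifted_schedule(steps: int, shift: float) -> list[float]:
    """Linear sigmas from 1.0 down to 0.0, each remapped by shift_sigma.

    The linear base schedule is an assumption; real schedulers differ.
    """
    return [shift_sigma(1.0 - i / steps, shift) for i in range(steps + 1)]


if __name__ == "__main__":
    # Higher shift keeps the sampler at high noise levels for more of the steps.
    for shift in (1.0, 3.0, 6.0):
        sigmas = shifted_schedule(8, shift)
        print(shift, [round(s, 3) for s in sigmas])
```

With shift 1.0 the schedule is unchanged. Raising it pushes the intermediate sigmas up, so more of the step budget goes to the noisy early part where composition gets decided.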
>>
File: z-image_00769_.png (2.71 MB, 1152x2048)
>>
>>107499530
floaty boaty
>>
File: 4chon.png (817 KB, 1146x1159)
>>
>>107499862
>Hot indian boys
You can edit the prompt right
>>
>>107499893
it just auto-runs with it, but sure, you could stop it and edit it
>>
>>107499862
ayo this dude genning "hot indian boys"
>>
>>107499062
I need an LLM that can take a photo of my friend drinking from a bucket and making that either into a giant cum bucket or a bunch of dicks getting rammed into his mouth.

How do I do this?
>>
File: z-image_00771_.png (2.87 MB, 1152x2048)
>>
How should I tag my data set for the character to be "good"?
Loras on civit only need the tag for the character to get the hairstyle right but i need every detail on my lora
>>
>>107499971
Everything that you want to appear every time with your trigger word, do not tag. Everything you want to only appear if prompted, tag it.
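A minimal sketch of that rule for tag-style captions, in case it helps. `build_caption`, the trigger word `mychar` and the example tags are all hypothetical. The only idea taken from the post above is "drop what should be baked in, keep what should stay promptable".

```python
def build_caption(trigger: str, image_tags: list[str], baked_in: set[str]) -> str:
    """Return a comma-separated caption: trigger word first, then only the
    tags that should stay promptable. Tags listed in baked_in are dropped so
    the trigger word absorbs those features."""
    kept = [tag for tag in image_tags if tag not in baked_in]
    return ", ".join([trigger] + kept)


if __name__ == "__main__":
    # Features that should show up every time the trigger word is used.
    baked_in = {"long hair", "blue eyes"}
    tags = ["1girl", "long hair", "blue eyes", "red dress", "smile"]
    print(build_caption("mychar", tags, baked_in))
```

By the same rule, an outfit that should only appear when asked for stays in the caption as a normal tag.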
>>
If hex we get z base tomorrow
>>
>>107499965
wood
>>
>>107499994
Okay but if I want to train outfits as well do I tag those somehow or just add them to normal tags?
>>
File: z-image_00215_.png (1.43 MB, 1024x1024)
>>
File: 1742661890094865.png (2.15 MB, 1120x1440)
>>
File: z-image_00220_.png (904 KB, 1024x1024)
>>
File: 1744645394526139.png (2.38 MB, 1120x1440)
>>
where do i find the z-image discord?
>>
comfyanon linked all my prompts to my real identity and now he's blackmailing me
>>
>>107500180
what were your prompts? and what does he want from you?
>>
File: z-image_00289_.png (1.21 MB, 1024x1024)
>>
File: z-image_00292_.png (1.18 MB, 1024x1024)
>>
>Warned you all about Chinese culture
>Pointed out the signs.
>Showed me screenshots from discord with blatantly obvious language pointing to no release and called me names

You all owe me an apology.
>>
I wonder how zimage does realism better/focus better but flux can't do the same.
>>
>>107500362
flux is trained on sd 1.4 gens
>>
File: z-image_00296_.png (1.17 MB, 1024x1024)
>>
>>107500361
I'm part of "all". Why do I owe you an apology?
>>
https://www.reddit.com/r/StableDiffusion/comments/1piugto/face_dataset_preview_over_800k_273gb_images/
please train your next model on my ai slop please saar
>>
File: z-image_00304_.png (1.47 MB, 1024x1024)
>>
File: Ocelotz_video.mp4 (2.27 MB, 736x944)
>>107500046
>>
>>107500433
lmfao.. what a retard.. he just wants to put it on a resume and try to get a job at meta with all the other saars
>>
>>107500430
Because I tried to make you all patently aware of what was about to happen and I was met with reticule. You not backing me up on my (correct) assertion about Chinese culture makes you complicit in letting faggots who think discord screenshots are a valid form of argument run wild.
>>
>>107500433
higher number == better
>>
>>107500454
>reticule
lmao
>>
>>107500433
>synthetic face dataset
what is he even doing?
>>
You guys purposefully trying to make bad images?
>>
>Face Dataset Preview - Over 800k (273GB) Images rendered so far
>all with similar bone structure
>he could have used chroma faces as the data set

please someone tell him...

https://www.reddit.com/r/StableDiffusion/comments/1piugto/face_dataset_preview_over_800k_273gb_images/
>>
File: image.jpg (15 KB, 555x556)
>>107500454
>if you're not supporting my own speculations, it's your fault!
>>
>>107500459
Yes because I was being targeted by reticules of bad faith actors. I do not make mistakes.
>>
>>107500454
i'm still waiting for base anon, although i would never reticule you for having an opinion <3
and you seem to care way too much about being right in some 4chan argument, chill out brother there are more important things in life than making more accurate predictions than some loners on the 4chans
>>
>>107500477
he quickled googled the word and tried to pretend that's what he meant.. hoping no one would find out he was a complete retard, but alas, everyone already knew
>>
File: z-image_00324_.png (1.21 MB, 1024x1024)
new screenshot from starfield 2, i am hype
>>
>>107500476
>speculations
Try scientific certainties.
>>
what's your best nose gen? I'm not satisfied with these results.

>artistic Korean fashion magazine photo from 1990s magazine scan, film photography, 400mm f1.8 lens, macro.
>extreme low-angle closeup of a pale beautiful elegant 30 year old French woman's nose, nose focus. it's a view showing inside her nostrils, some nose hair faintly peeking out.
>>
>>107499071
>>107499631
because you're genning at 1024x1024, do 1280x1280
>>
>>107500433
>Bulk will be rendered 512x512
they're all low res as well
>>
>>107500468
Yes, it feels good to get noticed
>>
>>107500513
>nya, what's up doc?
>>
File: 1760688660699052.png (403 KB, 860x744)
>>107499127
hoooly fucking based
>>107499287
>>107499517
picrel niggers
>>
File: z-image_00125_.png (1.87 MB, 1536x1536)
>>107500513
>>
File: Wanimate_00142.mp4 (1.54 MB, 544x960)
>>107500449
Thoughts?
>>
>>107500465
he's beginning to redeem
>>
File: z-image_00331_.png (1.59 MB, 1024x1024)
>>
>>107499127
it might also be good to edit
>AniStudio is an AI image gen frontend developed by him
so that it tells it like it is, that his project is a toy wrapper around an already existing backend
>>
>>107499862
I've just installed this, and yes we are approaching levels of awesome that shouldn't even be possible.
>>
These jeets, absolutely oblivious to norms, exposing their fetishes to the world.
>>
>>107500578
not usually
>>
>>107499862
it just needs context and conversational mode, if that would even be possible inside of comfy. It seems like streaming needs to be implemented to continue complex responses. i will have a look at the code and see what I can do. Just a button really that can be used to continue its responses and save them all.
>>
File: WAN_VIDEO.mp4 (906 KB, 960x640)
>>
File: z-image_00334__001.jpg (417 KB, 2048x2048)
>>
File: x1200-y630.jpg (80 KB, 902x630)
>>107500621
>537 views in 4 hours
I made one 2.5 second video that got more than that in same amount of time without cringe title.
>>
>>107500621
Just consider the fact it's only going to get worse from here.
>Every google search suggestion ending with "in Hindi"
>Every 2nd social media post just being an AI gen of some random Hindu god
>Every job search beginning with "I am an Indian male"

You will beg for the good old days.
>>
im so surprised why there hasnt been a charlie or erika kirk lora
>>
>>107500662
the fact that they copy each other is so cringe, they copy the thumbnails. these talent-less fucks won't last long. seriously watch one video on youtube these days and on the home page you will see the same thumbnail hook, same shitty text only different image.
>>
File: z-image_00335_.jpg (305 KB, 2048x2048)
>>
>>107500652
Neat.
>>
>>107500679
No one goons to Erika and Kirkinator memes are made with API models.
Plus trannies that hate both tend to be overwhelmingly anti-AI
>>
>>107500711
why is he so angular
>>
>>107500727
erika is still hot
>>
>>107499862
> maaaam give me a sex i am an indian
>>
They will have to up their game soon because while they pumped out tons of slop shitty videos using AI anons have been taking their time to learn.
>>
Fluxbros how are we holdin up??
>>
You see that is what i'm fucking talking about, its not perfect but its a good start.
>>
>>107500761
Is this for people that can't speak english? Why is it getting so much attention?
>>
>>107500761
Anon, we've been doing this for ages already.
>>
Made a qwen-edit lora for style transfer with natural language and danbooru tags (still early training)
>>
>>107500761
aren't the Qwen models crazy censored?
>>
>>107500789
Oops the image didnt upload (its trained with nudity and does it pretty well)
>>
File: output.mp4 (3.75 MB, 720x720)
>>107500578
>>107500584
I prefer this type of furry friend.
>>
>>107499121
skifree
>>
>>107500780
every custom node i used like this in the past wasn't good enough imo. perhaps it was just the model i used i donno.

but a lot more can be added to this, maybe be i'm late, maybe i spent too much time gooning my brains out. :)
>>
is chroma radiance any good?
>>
File: z-image_00339_.png (2.92 MB, 1536x1024)
>>
>>107499862
>>107500761
I don't know why this shit is catching on now but you shouldn't use AI prompt expanders. These LLMs are extremely aggressively RLHF'd, this tanks their variety and creativity. Regenerate the expanded prompt and it's almost exactly the same thing each time, the AI will tend to make the same choices every time it's choosing details. Just learn to think and write the details yourself.
>>
>>107500801
Here's another example. I hope it goes well it's my first time finetuning ;_;
>>
Based nature genner.
>>
File: z-image_00341_.png (2.61 MB, 1536x1024)
>>
File: z-image_00345_.png (2.64 MB, 1536x1024)
>>
>>107500868
sexo
>>
>>107500868
What are these workflows?! Share pls!
>>
how are chromasissies handling getting btfo by z image?
>>
>>107500931
Zussies are currently clenching their bussies because the base model is cancelled. Without it Z-image is basically dead in the water.
>>
File: z-image_00351_.png (2.4 MB, 1536x1024)
>>
>>107500907
>>107500900
>>107500835
Sweet to know Z knows this environment. SDXL (or at least its tunes) don't know it, compared to SD 1.5 tunes which did in my experience.
>>
File: z-image_00353_.png (2.67 MB, 1536x1024)
>>
File: 1736071812955988.png (1.63 MB, 1417x1080)
>>107500949
shrek 2 game remaster
>>
File: 1754711135421486.png (3.28 MB, 1024x1536)
>>
>>107500931
by not crying every thread about base and enjoying other cool releases, wan move just dropped for example
>>
>>
So what's the captioning meta for z-image lora training?
I assume we don't use tags but how should I write the description? Do short captions work better, or does it need a novel? Do I need to mention every minor detail or is the model smart enough to understand what it should be looking at (Say, unlike SDXL)
Can you give an example of how (You) caption for your loras?
>>
>>107501008
Also forgot to ask, can you train a decent style lora with low amount of images, say 25 or do you need 100+ for optimal results like SDXL.
>>
>>107501008
Should work even without tagging. I'd use short natural language captions, like two sentences max. 25 images works just fine. Quality comes first, then amount.
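If you want to sanity check a dataset against the "two sentences max" suggestion, something like this would do as a rough sketch. The sentence splitting is a naive regex, and `sentence_count`, `too_long` and the example captions are made up for illustration. Nothing here is specific to any trainer.

```python
import re


def sentence_count(caption: str) -> int:
    """Count sentences by splitting on '.', '!' or '?' followed by whitespace
    or the end of the string. Naive: abbreviations will be miscounted."""
    parts = re.split(r"[.!?]+(?:\s+|$)", caption.strip())
    return len([part for part in parts if part])


def too_long(captions: dict[str, str], max_sentences: int = 2) -> list[str]:
    """Return the names of captions with more than max_sentences sentences."""
    return [name for name, text in captions.items()
            if sentence_count(text) > max_sentences]


if __name__ == "__main__":
    captions = {
        "001.txt": "A watercolor painting of a fox. Soft pastel colors.",
        "002.txt": "A fox. It sits on a rock. The sky is red. Birds fly overhead.",
    }
    print(too_long(captions))
```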
>>
>>
>>107501004
>>107501070
>>107500912
Good stuff anon.
>>
https://files.catbox.moe/8zi9df.mp4
>>
File: ZiMG_01024_.png (3.82 MB, 1440x2160)
>>
>>107501090
Thanks
>>
From negative, black, to positive, white.
>>
>>107501156
wowee
>>
>>107500924
it's my own lora for anime editing. It can do danbooru tagged 1girls over a blank canvas, natural language, anthropomorphize animals, and style transfer, and much more.

Pic related for all the functions (60 image pairs each)

It's still training
>>
>>107501149
Kek
>>
>>107501171
damnson
>>
>>
Am I going crazy or has the Queue button completely disappeared from ComfyUI?
>>
>>107500454
>Chinese Culture
oh anon... how naive you are to keep this up...
>>
>>107501314
Click on the history symbol, it's very small but it exists.
>>
>>107501371
You've got to be kidding me. Who thought this change was a good idea?
>>
how does one reinstall comfy while backing up workflows/custom nodes and your config settings?
>>
>>107501414
I don't think the nodes will be saved if you're going for a clean install. But check the main folder, everything is there marked for you.
>>
>>107501437
What if I copy the custom nodes folder and repaste it back after installing a new version?
>>
>>107501164
It's pretty cool, I want more sliders, I think there is also a boob slider
I need a fat one and an age one
>>
>>
>>107501384
Cumfart's thoughts are too complex for us mortals understanding.
>>107501444
Yes you can do that but their requirements.txts need to be pip installed under your new venv again.
Your workflows are under user/default and can simply be copied after reinstall.
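A rough sketch of those steps as a script, assuming the layout described above (`custom_nodes` and `user/default` directly under the ComfyUI folder). `migrate_comfy` is a made-up helper. It copies the two folders and only returns the pip commands instead of running them, and it assumes you run it with the new venv's Python so that `sys.executable` points at the right interpreter.

```python
import shutil
import sys
from pathlib import Path


def migrate_comfy(old_root: Path, new_root: Path) -> list[list[str]]:
    """Copy custom_nodes and user/default from old_root into new_root and
    return the pip commands that still need to be run in the new venv."""
    for rel in ("custom_nodes", "user/default"):
        src = old_root / rel
        if src.is_dir():
            # dirs_exist_ok lets this merge into a fresh install (Python 3.8+).
            shutil.copytree(src, new_root / rel, dirs_exist_ok=True)

    commands = []
    nodes_dir = new_root / "custom_nodes"
    if nodes_dir.is_dir():
        # One install command per node pack that ships a requirements.txt.
        for req in sorted(nodes_dir.glob("*/requirements.txt")):
            commands.append([sys.executable, "-m", "pip", "install", "-r", str(req)])
    return commands


if __name__ == "__main__":
    old, new = Path(sys.argv[1]), Path(sys.argv[2])
    for cmd in migrate_comfy(old, new):
        print(" ".join(cmd))
```

Printing the commands rather than executing them lets you look over what would be installed before anything touches the new environment.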
>>
>>107501569
thanks
>>
>>107501384
>>107501371
the stop button is back where it was on the latest version! https://github.com/Comfy-Org/ComfyUI_frontend/issues/7108#issuecomment-3632267006
>>
>>107501578
I was talking about the queue window on the top left where you can see and scroll through your generated images and save/delete/reload them at your convenience.
>>
>>107501596
derp, i can't read!
>>
>>107500662
What I find particularly confusing is when their search terms always include India somehow even when it’s completely irrelevant. “Best stable diffusion model in India” what exactly is the relation between the two? Any saars can explain?
>>
File: file.png (13 KB, 423x187)
>>107501578
why isnt it following the fucking undocker RUN dock like holy fucking shit.
>>
>>107501674
https://github.com/Comfy-Org/ComfyUI_frontend/issues/7282
it's like they only use the ui on their tiny laptops

in other news, publishing nodes to the cumfart ui registry is ez pz, EASIER compared to submitting to comfyui-manager, as that requires you to checkout the project, edit a 2mb file and submit a PR (like fucking WHY again lol)
>>
>>107501674
You've been bitching for the last fifty seven threads about this. Give it a rest.
>>
File: StarTrekTNGHeader.jpg (64 KB, 1193x628)
>>107500842
| an elderly man with a red jacket, facial hair, and a wristwatch, sitting indoors with his long sleeves rolled up, gazing directly at the camera | the scene cuts to the man standing in a starship cockpit, his wrinkled hands gripping a control panel as explosions light up the ship's interior

| the man's weathered face shows determination as he barks orders to unseen crew members, his watch glinting under the chaos of battle | the scene cuts to the starship dodging laser fire, the man's upper body tense as he maneuvers the ship through a barrage of enemy projectiles

| the man's intense gaze shifts to a holographic display showing the ship's damage, his fingers flying across a keyboard as alarms blare | the scene cuts to the starship's hull scorched and damaged, the man's voice booming over comms as he coordinates a last-ditch escape maneuver

| the ship narrowly avoids collision with a massive asteroid, the man's face etched with focus as he adjusts thrusters mid-air | the scene cuts to the starship breaking through the enemy fleet's formation, the man's triumphant grin visible through the smoke and debris

| the man's hand rests on his chin, his red jacket slightly singed, as the ship emerges from the battle with a final burst of engine flame | the scene cuts to the man sitting again in the cockpit, now calm, his watch reflecting the fading light of the battle as the ship drifts into space

You're fucking dumb as fuck that's why.
>>
>>107501008
Captioning is pointless because the text encoder isn't trained.
>>
>>107501711
>Regenerate the expanded prompt and it's almost exactly the same thing each time, the AI will tend to make the same choices every time it's choosing details
that's just zit's seed variance being shit
>>
>>107501674
CHANGE IT YOURSELF YOU RETARD CAN YOU STOP WHINING ALL THE FUCKING TIME HOLY SHIT DUDE ARE YOU A WOMAN
>>
>>
>>107500362
Flux was lobotomized by its excessive censorship. It suffers from the same problem as SD3. A lot of data is going to be filtered out if even innocuous poses are deemed nsfw.
>>
how do you get proper text output from zimage? I get wrong text all the time unless it's a very simple one or two words prompt
is it just rng?
>>
>>107501764
There is a reason the sota models (sora/nano banana) censor the input text, and the output image, but not the raw model dataset itself.
>>
>>107501674
i don't even know what you are crying about and i don't fucking care, filtered bitch.
>>
>>107501765
You need the right aspect ratios and image dimensions. Separate lines of texts with quotes. You can use variation seed to get a slight variation of the image if a few characters are wrong.
>>
File: 178.png (1.25 MB, 1024x1024)
>>
>>107499499
I honestly didn't think video gen was going to get as good as it did as fast as it did. Even the advances in image gen in just a couple years are quite amazing.
>>
>>107500779
Sloppers have brainrotted themselves to the point where thinking about prompts is too much effort.
>>
>>107500820
I only tested it very quickly but the results were so bad that I didn't really feel the need to test more.
>>
>>107501814
Thanks anon, will try this.
>>
>>107501843
we still need full zimage, I wish they would just give us all the smaller tools, that is something we can use to work around limitations.
>>
>>107500519
You do realize that he would need even higher shift to escape the blotchiness if he switches to higher res, right?
>>
>>107501861
>too much effort.
i want to enjoy life not be a slave to some dumb fuck like you who measures their dick by how much they can destroy them selves. Guess what? You take nothing to the grave idiot, what are you gonna do in heaven or where ever we go when we die, still bitch about the lazy or what ever? I bet you're one of those 23 year old zoomers that looks 35 and thinks chasing 50 year old roasties and beasting yourself half to death in the gym every day while eating meat substitutes and a vegan diet (muh save the planet) is peak life.

your heart will explode one day you idiot.
>>
>>107501927
If you have time and energy to write that stupid post then you have the same to write your prompt yourself.
>>
>>107502000
kek
>>
I heard the Chinese have a particular culture. Has that changed?
>>
>>107502000
i write the prompt retard i use the image tagger to fill in the details to flesh out the prompt, i use wan 2.2 iv2 and often its more about describing motion in a kinematic way to make sure the characters actually move and do actions that do not need a cause. you wouldn't understand it really you wouldn't, i'm not the best writer in the world and i never will be. you just seethe that people don't need to focus so much on lighting and all that crap and can instead focus on getting everything else right.

do you understand that all content is essentially the same? For example movies are all the same, music is all the same. they all follow the same structure even though they are different. the structure is important other wise no one will watch it or listen to it.
>>
>>107502106
post hands
>>
>>107501927
like i wouldn't be that Indian guy that just pumps out some slop they copied from someone else, most notably the slop that they post when ever a new model comes out, they all trip over each other to be the first and its fucking pathetic. Their example video, the workflow they stole and modified like 1 node whatever i'm not that kind of person.

I like to learn how to do it, but to do it very good.

with the right custom workflow and setup i could just run 10 jobs over night, watch each of them and pick the one that most appeal to me and then upload boom 50k views, boom 150k views 7 days later and boom its taking off. I don't fucking waste my time more than i need because I've sat here and learnt all that already. you swine will complain till you're sat on your arse wondering wtf happened in your 40's go fuck yourself.
>>
>>107502158
>muh yt views
LMAO XDDDD
>>
tell me about hunyuan. does it have any advantage compared to wan as of now?
>>
BBC DICKS INSIDE CCP CHICKS
>>
>>107502210
No it's completely outclassed.
>>
you gonna sit there 6 month what ever writing prompts in a highly competitive world when cunts are already banging this shit out like 4 gens every 48 hour to meet the algo, to then glean from the comments what need to do in the next video.

fuck off, you will be done, out of work broke and depressed.
>>
>>107501751
>FIDDLE WITH THE SETTINGS EVERY TIME THERE'S A MINOR UPDATE
Go fuck yourself
>>
>>107502222
Quads of total truth
>>
>>107501674
So the "cancel the entire queue" button just doesn't exist anymore? Because that red one is the "cancel current gen" button. Do I click that 10 times now to cancel the queue?
>>
File: 1736168773428765.png (14 KB, 561x267)
>>107502350
>>
>>107502374
Are they fucking retarded? Do they not realize this is literally the second most commonly used button after run? Now I have to keep the que tab open at all times I guess, thanks for the massive improvement faggos
>>
We'd probably actually be getting the base model if you had all just kept your cool when turbo was released.
Act more aloof next time and then get excited when they give up the goods.
>>
Anyone been using the cachedit node for speed ups?
I'm trying it again, still seeing 25% faster gens on video.
>>
>>107502543
Post a comparison of outputs because I still believe cache fuckery always destroys quality too much to be worth it

>>107501927
>tfw you trigger a schizo so hard he accuses you of being Vegan Gains
>>
File: zimage base is OUT.png (1.05 MB, 896x1152)
https://huggingface.co/Tongyi-MAI/Z-Image-Base
IT'S OUT!
>IT'S OUT!
IT'S OUT!
>IT'S OUT!
They did! They really did it!
>>
>rtx pro 6000 blackwell
what can you gen with 96gb vram?
>>
>>107502570
If you lie I image to video your 1girl getting gored in the head
Let's see

>>107502585
>what can you gen with 96gb vram?
KandinskyVideo, as well as full precision large models like HunyuanImage

It's also going to mog all of the 32gb 5090 vramlets once audio+video comes out and will be able to generate 1080p 10 second clips with video and audio, maybe even 4k clips

But it's only 10% stronger than a 5090 so right now it's only good for training. Oh you can also run local llms for coding/erotica I guess
>>
>>107502585
Hunyuan 80B?
Upscale comically large images?
Run Wan 2.2 at fp32 as God intended?
>>
>>107502605
>I image to video your 1girl getting gored in the head

If you post a video I turn it into a busty fox women and ask your thoughts.
>>
>>107499062
Nice addition to the hall of shame
>>
>>107502618
It's alright, android sucks and you can't save a generated video from a hugging face space properly. It also doesn't want to share the address of the video on mobile

I'm also mostly normal in this regard so I don't know how to gore prompt properly and it just made rose petals the one time I tried so whatever
>>
>>107502614
>Run Wan 2.2 at fp32 as God intended?
It's still going to be slow, as slow as WAN is on a 5090. The 6000 pro isn't stronger than a 5090, it's basically a 5090 that just has 3x the VRAM
>>
>>107502682
It is actually 10% stronger but yes I was just shitposting.
>>
>>107502570
Perhaps we have ourselves a deal... no BBC once its trained on noobai and the likes
>>
>teehee we at /ldg/ are above everyone else so at least we know the base is never dropping
Said some faggot ITT
>>
>>107502725
This but unironically.
>>
>107502734
dilate
>>
>>107502740
Won't even give a (You) because of how mad you are that /ldg/ was right.
>>
>>107502570
Oh wait, its actually a dead link LMAO, the BBC shall keep pouring
>>
>>107502703
Yeah I know I said that too, that's why I said "basically", because you're not paying 4x a 5090 for the 6000 pro for more performance

I actually don't know what you'd go for to get more compute than that, I guess an H200

>>107502725
I still think the base is coming. Mostly because I have nothing else to really look for in the hobby right now. If it doesn't I think turbo can be wrangled it'll just be annoying waiting a few months for the community to catch up in that case
>>
>>107502759
>If it doesn't I think turbo can be wrangled it'll just be annoying waiting a few months for the community to catch up in that case
It has already been de-distilled for finetuning but de-distilled models have significantly lower quality than the actual base model before distillation. We can make finetunes without them releasing Z-Image Base but them fucking us over is still handicapping the true potential of Z-Image.
>>
>>107502803
>them releasing Z-Image Base but them fucking us over is still handicapping the true potential of Z-Image
Maybe it didnt have any to begin with, they already released the best possible version they could for clout
>>
>>107502803
>We can make finetunes without them releasing Z-Image Base but them fucking us over is still handicapping the true potential of Z-Image.
I completely agree with you but I think you're looking at it too glass-half-empty (I don't blame you because we were promised base). The history of consumer tech progress is mostly a history of getting the shitty/outdated versions of government/military tech

>>107502835
Its interesting how different everyone's feelings would be about this release if Z Base were never promised or alluded to being promised. Sometimes it's best to not tell people things, and I don't know how to reconcile that with my beliefs that all information should be free [to me]. Maybe I also don't like entering the mindset of "what could have been" because I live in the West in current_year.


Z Turbo seems to be able to handle higher cfg and negative prompts kinda as well so it's not the same situation as flux, a permissive license helps as well. If I had a 4090 or 5090 maybe I'd even try training my first lora ever on Z turbo since it fills me with determination in a way that SDXL never did
>>
>>107502876
Without the community Z-image is dead. If they dont release base, it will just be yet another model with no finetunes, no shitmixes, no merges, no nothing, it will be eclipsed by all the available resources that are continually being made for SDXL
>>
>Grok becoming the best AI model out there because it has less restrictions

Other Corpo models must learn from Elon
>>
>>107502942
They won't.
They don't care if some dudes seethe at "I am sorry as a language model..."s (I think they learned to change this response to something more organic sounding since GPT4 but regardless it's the same refusal shit.)
Even if you subscribe to 20$/month pay piggy tier you are in all likelihood costing more money than you bring.
Their real cashcow is getting corporations to fire their workers and charge them for the actually lucrative API use. Which requires gay censorship because that is just ESG and DEI infested western corpo culture since 2010s.
Forgot to add that most of the people seething will still use it anyway because they are addicted to their worthless sycophants.
>>
>>107502942
>Other Corpo models must learn from Elon
only Elon has the balls to handle the whiny anti-AI bitches, that's why I respect him
>>
File: 1749695503844956.jpg (1.14 MB, 1850x2625)
>>107502876
>Its interesting how different everyone's feelings would be about this release if Z Base were never promised or alluded to being promised.
I would completely be fine with it and I would understand that they would keep the base for themselves, but since they promised it they have to respect that or pass as a fucking asshole who plays with people's emotions
>>
>>107502980
>Forgot to add that most of the people seething will still use it anyway because they are addicted to their worthless sycophants.
truth nuke, there's a reason why LLMs always suck your dick, for those normies it must feel good to finally feel validated, they need this kind of gay shit, it's super cringe but it is what it is
>>
Saar, download lora, natural desi beautiful, please.
>>
>>107503105
>>
>>107501927
>i want to enjoy life not be a slave to some dumb fuck like you who measures their dick by how much they can destroy them selves
>says this and uses cumfart
>>
>>107502926
>Without the community Z-image is dead. If they dont release base, it will just be yet another model with no finetunes, no shitmixes, no merges, no nothing
I strongly disagree. It's Different This Time(TM) for Z image. I have never seen so much adoption from the generative AI community of any model. Even WAN and the original SDXL or its tunes don't compare. It sucks that the Z Base rug pull let out some of the hot air from the hype


Slightly unrelated but I also believe the rumor that Nvidia had their hands in the BFL pie and are very pissed off right now that Z image mogged them so hard. Hopefully this anger results in good things for local in some way

>>107503121
I would unironically pick left because architecture but also the round face covers the Indian genetics slightly more which helps me actually be attracted to her. I can squint my eyes and cope that she's SEAsian
>>
>>107503121
>Natural
>the skin is now completly slopped
well job jeets!
>>
>best way I can describe using comfyui moving forward is like sitting on a 12" dildo leaving it in and saying "fuck it, I'm gay now" instead of pulling it out with some dignity and saying "what's the next steps"
>>
File: ComfyUI_09160_.png (1.34 MB, 864x1280)
>>
File: ComfyUI_09118_.png (1.39 MB, 864x1280)
>>
>>107503231
he looks sassy on that image lol
>>
File: 1749199837371001.png (1.86 MB, 896x1152)
https://www.reddit.com/r/StableDiffusion/comments/1pj0q5l/ovisimage7b_first_images/
ovis is so slopped it looks like dalle3, I'm sure it'll get some fans lol
>>
>>107503216
Tasty
>>
is it safe to pull?
>>
File: ComfyUI_02234.jpg (3.07 MB, 1536x2048)
>>107503121
What are your favorite 1girls, /ldg/?

>disgusting fat bitches with highly exaggerated bodies
>disgusting old women with even more disgusting bodies
>interchangeable animu chicks with silly colored hair and ridiculous outfits
>1Mikus
>generic models
>Chroma trannies
>Other

>>107503297
That's a real throwback to the old AI look... but why?
>>
So what ended up being the most reliable method of seed variation? dual samplers? NAGger scale?

>>107503378
For me? it's 1clowngirl.
>>
>>107503157
You are one deluded retard if you think nvidia getting pissed at local is good, or that Z-image has any kind of future without releasing base, kek.
>>
File: 2259155365.jpg (1.84 MB, 2304x1792)
>>
>>107503288
>he looks sassy on that image
No he just looks a little Asianfaced, just like everyone else in Z

>>107503378
>What are your favorite 1girls, /ldg/?
>disgusting fat bitches with highly
Checking in
You also forgot to mention the actual two most popular 1girls which are furries and cunny but that is understandable since you're not into either

I think it's also about time to politely ask you to switch your oneitis to someone else. I've been seeing this butterface for months now
>>
>click to gen on comfy
>its queueing a previous workflow and not the one on the screen
what the fuck
>>
>>107503392
Yeah if that was what I said then that would indeed be delusional, but I never said that
>>
so, I've learned comfyorg just vibeslops the frontend which makes a lot of sense considering how fucking retarded everything is now
>>
File: Untitled.jpg (29 KB, 334x294)
>>107503378
>3d artist screaming internally
>>
>>107503533
Every org vibeslops now. The VP at my new job set a target for everyone to get 20% productivity improvement by 2027 with AI which is keklmao but on the bright side I get unlimited Claude Code
>>
Is NetaYume finally good?
>>
>>107503378
why is her hair and face of such poor quality? are you still using flux 1?
>>
>>107503575
that's bullshit but I believe it
Ai WILL be the future, come hell or high water. Gotta recoup those spent trillions somehow
>>
>>107503575
I just think they lost the plot and have no idea what the frontend architecture even looks like anymore. complete spaghetti code nightmare
>>
>>107503575
Isn't that demoralising? I always enjoyed being proud of my own work and being able to stand behind my own actions. If I messed up, I messed up, and vice versa.
AI is fun until it's not, but I could not give a fuck if some image looks retarded or not but I didn't create it etc.
>>
>>107503582
go back to sdg cuntface
>>
>>107503630
demoralizing for the user is the biggest impact. see: Nodes2.0
>>
>>107503582
No it got worse, not even joking.
>>
File: 1743091234290516.png (2.78 MB, 1920x1440)
>>107503121
This guy are sick.
>>
File: ComfyUI_00178_.png (1.07 MB, 1200x1056)
Im kinda sad that I was right about you faggots, no base and netayume isnt better than SDXL, its a boring year for hentai gooners.
>>
>>107502570
>IT'S OUT!
man i missed it now its a dead link
>>
>>107499062
can we remove the cumfart link? he should be punished for a bit for ruining the app with bloat, spyware, enshitification and vibe niggers?
>>
lol it's going to be rusttranny bloatware in the future
>>
>>107503836
I downloaded, I'm fine tuning it right now
>>
Bets on who will make the first real finetune after base releases? Noob? Pony? Chroma? BigAsp?
>>
File: ComfyUI_00017_.png (2.63 MB, 1120x1440)
>>107501164
7/10 not black enough
>>
File: ComfyUI_00241_.png (1.3 MB, 1200x1056)
>>107503946
I'll bet on it not releasing to begin with
>>
>>107503832
Ryona is so hard to produce in Wai-SDXL and Z-Image. I haven't tried controlnet though.
>>
>>107503946
The devs themselves. Remember they already reached out to the noob team and they're tuning on the entire dataset.

two more weeks, of course.
god i hope it releases this weekend, training on a distilled model is pure AIDS.
>>
>>107503892
yeah, comfy dies next year for sure
>>
Can't wait for Z-Image Noob to drop and get called a Zingger for using it
>>
File: ZiMG_01038_.jpg (382 KB, 1344x1728)
>>
>>107503964
Try making a lora or something? Idk, im more into artist mimicry than characters, so NoobAI and Illustrious just cant be topped for now
>>
>>107504015
why did you delete all your stuff from civitai?
>>
File: ZiMG_01043_.png (3.42 MB, 1344x1728)
>>107504032
>>
>>107504057
Have u seen them? It's trash, no wonder it's all gone.
>>
File: 1906548916.png (1.3 MB, 1152x896)
>>
>>107504057
they broke civitai so hard that metadata doesn't auto-detect properly anymore, everything gets flagged and needs ((manual review)). It's hardly worth maintaining. Now i mostly post to share some examples of z-image and get pointers on how to use comfyui properly.
>>
>modded 5090 96gb on alibaba
fake?
>>
>>107504091
C'mon dont be so harsh bro, the windshield stocking was very original and it was your idea entirely
>>
>>107503946
I would go for "if" rather than "when" but:
>Noob
Maybe I guess? I am not sure if they would bother. Come to think of it, I have no idea how they funded Noob training and what their incentive for working on anime coom models is.
>Pony
Nope. He takes forever to train shit plus has too much ego to let go of his miserable failure with V7. Still delusional enough to think that 7.1 will turn things around, so no definitely not the first
>Chroma
Similar to above, he won't move on at least until Radiance converges into another failbake.
>BigAsp
He still wants to do V2.6 as swansong for SDXL. He is actually competent unlike astra and lodestone so I would still go with him.
>>107504097
100% 5090 uses 2x16. So it's impossible.
>>
>>107504094
>how to use comfyui properly.
lmao even the devs don't know anymore
>>
>>107503559
https://civitai.com/models/2200962?modelVersionId=2483551
>>
>>107504097
same business has a 4080 32gb for 1k, honestly tempting
>>
File: 208232029.png (822 KB, 1024x1024)
>>107504069
>>
>>107504195
It's true! Now i get nags about my legacy backup every time i launch cumfart. I love that this scene moves so fast we get new cutting edge breakthroughs every year,
but we use those new cutting edge breakthroughs on the same shitty platforms that get worse every year too. How is this even maintainable?
>>
File: ZiMG_01055_.jpg (559 KB, 1344x1728)
>>
>>107503946
>>107504138
You guys think the anime finetune will follow the good, logical prompt pattern of z image or they will force the retarded tag system slop machine?
I think this is the main reason the based Chinese want to do the fine tune themselves.
>>
>>107504241
I think training on just tags would suck for a capable model like this.
Ideally you want to train on natural language descriptions, tags and a combination of both for maximum prompt flexibility. Alternating between these three throughout epochs.
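A minimal sketch of what that alternation could look like in a data loader, assuming each sample already carries both a natural-language caption and a tag list. The field names `caption` and `tags`, the helper `pick_caption`, and the simple per-epoch rotation are all made up for illustration; this is not how any particular trainer does it.

```python
import random

# The three caption styles to rotate through, one per epoch.
CAPTION_MODES = ("natural", "tags", "mixed")


def pick_caption(sample: dict, epoch: int, rng: random.Random) -> str:
    """Return the training caption for this sample in the given epoch."""
    mode = CAPTION_MODES[epoch % len(CAPTION_MODES)]
    if mode == "natural":
        return sample["caption"]

    # Shuffle a copy of the tags so the model does not latch onto tag order.
    tags = list(sample["tags"])
    rng.shuffle(tags)
    tag_str = ", ".join(tags)

    if mode == "tags":
        return tag_str
    # "mixed": natural-language caption followed by the tag list.
    return f"{sample['caption']} {tag_str}"


if __name__ == "__main__":
    rng = random.Random(0)
    sample = {
        "caption": "A girl in clown makeup juggling three balls.",
        "tags": ["1girl", "clown", "juggling"],
    }
    for epoch in range(3):
        print(epoch, pick_caption(sample, epoch, rng))
```

Rotating per epoch is the simplest reading of "alternating"; picking a mode at random per sample would be another option under the same assumptions.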
>>
File: ZiMG_01060_.jpg (462 KB, 1344x1728)
>>107504235
>>107504224
nice asuka
>>
>>107504241
You know damn well they will force the booru prompting for anime
>>
>>107504278
Adding on there are concepts that have no booru tag equivalents and as such you want natural language descriptions for them, and also there are specific booru tags that are tedious or difficult to express with natural language instructions.
So you want the model trained on both for best results.
>>
File: z_mod_00011_.jpg (750 KB, 1408x1952)
>>
there are still gullible retards itt who think theyre getting a heckin based loli hentai model for free from based china?? actual pajeet delusion. probably the same retards who thought sd3 would be good because emad told them it was better than dalle
>>
File: ZiMG_01067_.jpg (415 KB, 1344x1728)
>>107504279
>>
File: 871144259.png (1.23 MB, 1152x896)
>>107504279
thanks, you too
>>
>https://huggingface.co/Kijai/Kandinsky5_comfy/tree/main
better late than never, I wonder if it's worth using
>>
File: P E A K.png (2.46 MB, 1024x1024)
>>107504347
>probably the same retards who thought sd3 would be good because emad told them it was better than dalle
I still remember when emad said that sd3 would be so good it'll be perfect on 99% of cases and that it would be the end of everything that we've finally reached the peak, and this is what we got in reality
>>
new
>>107504471
>>107504471
>>107504471
new
>>
>>107504347
Please anon let me cope. I need my updated loli slop machine and the heavens gifted us a good model that my poorfag GPU can still run.
>>
>>107504241
Aren't you able to use natural language to an extent anyway with tag-trained models? I do sometimes, and it sorta works. Z-Image's natural-language prompt adherence also "sorta works", but wasn't as magical for me as people made it sound. It's still really stubborn against doing what I want a lot of times.
>>
File: ComfyUI_02275.png (3.29 MB, 2048x1200)
>>107503385
Anybody make a clown makeup LoRA? I'm sure the variety there is pretty poor.

>>107503488
>I think it's also about time to politely ask you to switch your oneitis to someone else
Sorry, Anon. You're just going to have to learn to love her like everyone else.
>>
>>107504513
clown/mime makeup variety is actually pretty good. though i think z-image is trained way more on mimes and pierrot than clowns.
>>
File: ComfyUI_02282.png (3.46 MB, 2048x1200)
>>107504528
Honestly surprising to hear. Usually you'd expect something like that to only have a couple of examples in the dataset.
>>
>>107504344
oh shiiii i forgot about this woman, man i had the hots for her so bad a long time ago
>>
>>107499562
>redhead
nice
>>
>>107501927
Holy shit did I hit a nerve. I'm sorry anon I didn't mean to trigger you.


