[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107473039

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>>107476773
eagerly awaiting news of anons RTX Pro 6000
>>
File: Wanimate_00132.mp4 (1.11 MB, 528x960)
1.11 MB
1.11 MB MP4
>>107476608
Thoughts?
>>
kill yourself
>>
why is civitai not having a separate qwen image edit category? are loras for edit and non edit compatible between each others?
>>
>>107476608
not enough steps at low noise
>>
I'm sick and tired of genning young 1girls
>>
>>107476826
gen some bara men then
>>
>>107476826
that's impossible
>>
what program should i be using for anime datasets? currently using deepbooru with forge but doing 1 image at a time is tedious.
>>
add ramtorch to anistudio then i will use it
>>
I need good ideas for new zit loras
>>
>>107476816
how many steps do you guys use?
I am using the lightx2v 4 step lora
>>
File: ComfyUI_00269_.mp4 (1.42 MB, 576x720)
1.42 MB
1.42 MB MP4
>>
is lora block weight updated for z-image?
>>
File: ComfyUI_temp_pxmae_00001_.png (3.94 MB, 1800x1400)
3.94 MB
3.94 MB PNG
oink
>>
File: z-image_00056_.png (745 KB, 1024x1024)
745 KB
745 KB PNG
>>
File: 1765090670757125.png (160 KB, 587x590)
160 KB
160 KB PNG
we're doing it
>>
>>107476973
nice to see others taking inspiration from my gens
>>
File: ComfyUI_00270_.mp4 (730 KB, 576x720)
730 KB
730 KB MP4
>>
File: viol.mp4 (990 KB, 848x480)
990 KB
990 KB MP4
>>
>>107476900
Can you even train anything without destroying image quality of z-image?
>>
>>107476966
>>107477070
cool
>>
>>107476826
hahah very funny
>>
File: 1746888423146435.png (3.15 MB, 1826x1219)
3.15 MB
3.15 MB PNG
>>107477082
yes
>>
>>107477091
would
>>
File: ComfyUI_00271_.mp4 (1005 KB, 576x720)
1005 KB
1005 KB MP4
>>
>>107477091
How?
>>
File: 9050.png (1.28 MB, 1024x1024)
1.28 MB
1.28 MB PNG
>>
>>107477119
the same way everybody else does it. ai-tookit
>>
File: 1761112145385095.png (1.47 MB, 1120x1440)
1.47 MB
1.47 MB PNG
>>
Holy shit Zimage is so fucking hell bent on NOT making crossed legs a thing. What a weird concept to miss on.
>>
>>107477219
>>107467713
BRUH
>>
>>107477231
It's like a lottery, half the time it doesn't know.
>>
File: 1740476089561122.png (2.03 MB, 1440x1120)
2.03 MB
2.03 MB PNG
>>
File: Qwen_00008.png (1.05 MB, 1248x1248)
1.05 MB
1.05 MB PNG
>>107477247
>>
File: 45745673453.png (346 KB, 378x577)
346 KB
346 KB PNG
>>107477231
Seems to work okay if you say "legs crossed with one leg dangling" or something similar.
>>
>>107477276
nice asa
>>
File deleted.
>>107477161
That fucking faggot chinkbot poster is pissing me off and I have no way to filter it's nightmare fuel furfaggotry.
>>
>>107477116
How long do you have to wait to get a gen?
>>
File deleted.
Cyberpunk Sergal goodness
>>
>>107477327
bout 3fiddy
>>
are we being raided?
>>
>>107477367
Why would you think that?
>>
Let me ask the demon living in my walls.
>>
>>107477367
this place is a shithole that chose scam companies as their heros. of course everyone on /g/ treats it like a shitting street
>>
File: Smol'n'Adorbs.webm (1.85 MB, 560x560)
1.85 MB
1.85 MB WEBM
>>107477367
Just seems like a chinkbot trying to bait furrys for market research or smth, but it's posting aids tier gens.
>>107477380
Stay mad ranjeet/commie/leftyfag/kike/vaginajew ywnbh.
>>
https://youtu.be/3oCTiIbVfls
it begins
>>
>>107477219
i cant get a low angle shot to work at all with zimage
>>
File: ZIMGREEEE.png (343 KB, 872x798)
343 KB
343 KB PNG
how tf do you use zimg as a detailer? 0.95 denoise completely changes the image.
>>
>>107477477
lower the denoise then, you fucking retard
>>
File: ComfyUI_9210.jpg (2.62 MB, 1280x2048)
2.62 MB
2.62 MB JPG
>>
>>107477477
Use the canny controlnet
>>
>>107477471
my fucking sides!
>>
>>107477477
>0.95 denoise completely changes the image
You don't say?, we got a einstein here
>>
>1male, male focus, perineum
>result is an image of a woman with medium breasts and a vagina
fuck you
>>
>>107477477
>40 steps
try 4 steps, denoise 0.2, flow 3, heun simple. then adjust denoise to your liking.
>>
>>107477477
because you're using 40 steps. lower it to 10
>>
>>107477512
tag is 1boy, don't think there is a 1male
>>
>>107477512
It's telling you to stop being a faggot.
>>
>>107477512
>1male
Why do I keep seeing this or 1man. Those aren't recognized keywords retards.
>>
>>107477474
I think the model bundles concept together, like depending on where you set the location, you'll get easy crossed legs or it's almost impossible.
>>
>>107477141
Show me a lora that's good then.
>>
>LoRAs for Zit suck
>The base model is like 99% not coming at this point

It's over.
>>
>>107477556
Yep. Another day another local failure. Better pay up for your beloved SaaS!
>>
>>107477536
>It's telling you to stop being a faggot.
It's not gay if you say no homo right after.
>>107477531
>>107477542
You were right, it's not 1male. Thanks, it's working now without randomly giving me a woman.
>>
>>107477561
Already dumping $10k into Nano Banana Pro. I love funding Israeli intelligence.
>>
Total Chinese culture victory.
>>
is comfyorg knowledgeable in Chinese culture?
>>
>>107477581
One might say they are an expression of Chinese culture.
>>
>>107477581
yes and no at the same time

see: >>107477471
>>
File: CMGB.png (1.62 MB, 1456x816)
1.62 MB
1.62 MB PNG
>>107477572
Just generate them wearing socks anon, thats how you cancel out the gay. Secret hidden esoteric lore.
>>
best entry level gpu you can buy now?
>>
>>107477601
5090
>>
File: 1761118385991204.jpg (681 KB, 1792x2304)
681 KB
681 KB JPG
>>107477276
nta I just put "with her legs crossed"
>>
>>107477600
I'll keep that esoteric knowledge in mind when genning sexy men, thank you Cookie Monster.
>>
File: ComfyUI_00171_.png (2.52 MB, 1248x1920)
2.52 MB
2.52 MB PNG
>>107477545
It can kind of do it, kinda, but if you're looking for the ultra gooning angles, consider using the ControlNet.
>>
File: ezgif-1521c3fa17483bc8.gif (1.85 MB, 378x239)
1.85 MB
1.85 MB GIF
>>107477601
Erm.. depends what you are targeting for... you can gen on old 20xx series cards if you can still get one, try to find one with a 12-16 gig framebuffer, 8 is pushing it. you'll be looking at relatively quick gens in SD of 6-8 to 8-15 seconds at 1024x1024 + 45 steps at that level.
>>
File: ZiMG_00862_.png (2.44 MB, 1344x1728)
2.44 MB
2.44 MB PNG
>>107477616
>>
why are people content with comfyui sticking a dildo up your ass? do you really want to consider yourself a faggots?
>>
>>107477648
what prompt do you use to get it so dark?

when I use, 'dark room', 'dim lighting', 'dimly lit' it still gives me a bright image
>>
File: ComfyUI_00189_.png (2.28 MB, 1248x1920)
2.28 MB
2.28 MB PNG
>>107477630
>>
>>107477652
great timing on that considering the posts behind yours are talking about fucking men. also, barely anyone cares so until something better than comfy exists, people will keep using them

>inb4 your post is just bait to shill anistudio
>>
>>107477656
your words and daytime flash lora
>>
>>107477652
chinese slave mentality
>>
>>107477658
What wording are you using to get such a low angle?
>>
>>107477404
aaaaaaaw
>>
>>107477636
I want a relatively power efficient one because I don't want to upgrade my psu
>>
File: 73453242342.png (144 KB, 556x1069)
144 KB
144 KB PNG
>>107477656
A trick is to use a black latent instead of an empty latent, set denoise to 0.50 (higher for lighter).
>>
>>107477665
>A woman sitting on a wooden bench. extreme low angle shot, worm's eye view, from the ground up, floor level perspective. extreme foreshortening, highly distorted perspective, maximum distortion. Hyper-detailed, cinematic, volumetric lighting, god rays piercing the clouds, high contrast. ultra wide-angle, 20mm lens, panoramic lens. Shot on 70mm, 8K. Close up, viewed from the woman's shoes.
>>
>>107477676
smrt
>>
>>107477673
then do some research by taking your psu into account and the power requirements for gpus inside your budget, moron.
>>
>>107477682
thanks. i had already tried extreme low angle shot, worm's eye view and foreshortening but they didn't work.

maybe something else in my prompts was conflicting with it.
>>
File deleted.
>>107477630
>>107477474
Maybe its the way your describing? "wide centered full body shot upskirt-view between legs from below" "legs spread open presenting crotch/panties/gusset/groin/whatever to viewer"
>>
>>107477686
you mean amd 9070?
>>
I have a lora for a real life person (emma watson) but no matter what I do I can't get images to come out where her face or even the proportions of her head look like emma watson.
it's an illustrious lora, is there an illustrious checkpoint good for rendering accurate faces? I'm using "Illustrious Realism by klaabu"
>>
why is everyone seething over comfy? It works like it always has
>>
File: 1742151544559965.png (2.52 MB, 1216x1728)
2.52 MB
2.52 MB PNG
>someone knocks on your door
>says he has a delivery for you
>you open the door and see this
what do you do?
>>
saar, base model here, pls download
>>
>>107477795
people are downloading workflows from civitai made by jeets and they're seething when its not working.
>>
>>107477826
I dunno who this lady is but she's pretty
>>
>>107477795
grift culture, repetitive custom nodes and even default ones, custom nodes use vram, new UI is even less performant than litegraph, API node priority, bad example templates, updates break custom nodes often, it's written in poothon so all of it's problems on the pile, stockholm syndrome apologist redditors, run by a grift chink, fuds any competition in case something else gets traction, comfy is a psychopath that ignores valid criticism, memory leaks, and is still uncomfortable. it's a lost cause in a worse state than auto before it was abandoned
>>
>>107477842
you forgot about making worse looking images than auto forks even after wading through cope workflows
>>
>civitai early access
wtf is this kike
>>
>>107477903
somebody should just make an alternative with nsfw already
>>
>>107477831
>aggressively head bobbles towards you
>>
>>107477948
what does that mean
>>
>>107477498
nice
>>
>>107477676
somebody get this man into MENSA and the keys to the secret pizza basement
>>
>new model comes out
>end up back with illustrious
>new model comes out
>still end up going back to illustrious
can they make something good already? zimage base doesnt exist yet so dont even mention it
>>
File: 1757555720055188.png (2.99 MB, 1216x1728)
2.99 MB
2.99 MB PNG
>Anon, take this
>>
File: ComfyUI_01761_.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>
>>107478019
Wow, I haven't thought about that shithole since 2014
>>
>>107478011
>zimage base
>implying you won't go back to illustrious because there is no anime style tags
>>
why do people say zit is shit on poses when you can use control net on it?
>>
>>107478025
Unless someone makes an amazing finetune for Z-Image, Illustrious is better and still has more LoRA support.

Training LoRAs on Z-Image is spotty right now.
>>
>>107478048
the model is shit because it's distilled. no amount of cope can argue against this
>>
File: 454352.png (15 KB, 660x408)
15 KB
15 KB PNG
>>107478048
Shit ain't working on Desktop version until the next release.
>>
>>107478011
genuinely wonder what type of shitmixes do guys like you, who praise illustrious and not noob, use
>>
>>107478064
okay, so use the other version.
>>
>>107478069
link me your favorite noob and ill give it a go
>>
>>107478075
same as everyone elses
https://civitai.com/models/1301670/291h?modelVersionId=1469244
https://civitai.com/models/1201815?modelVersionId=1491533
>>
File: 00011-3918572346.png (1.05 MB, 896x1152)
1.05 MB
1.05 MB PNG
This thing is pretty good at managing multiple characters without prompt bleed.
When did prompting get so good?
>>
File: ComfyUI_01768_.png (1008 KB, 1024x1024)
1008 KB
1008 KB PNG
>>107478129
when china decided the retarded westerners didn't know what they were doing
>>
>>107478146
Non-Diffusion-based models always had superior prompt adherence. We were all just cucked by SDXL for the longest time.
>>
>>107478069
Not him but its the same for me. When I'm trying to gen with noob I get like 1/10 images that aren't fubar. For reference I don't use artist tags or loras and most of the time its just kind of blurry and or overbaked. I mean even the showcase images of the original model are kinda like that, so at this point I just assume its a feature.
>>
>>107478167
>I don't use loras
uh... alright
>I don't use artist tags
what the fuck
>>
>>107478164
???
They are all still diffusion-based though
They only use LLMs/Autoregression for the text encoder, it still uses diffusion to generate the images themselves
>>
>>107478071
But I don't know how to make a venv :(
>>
>>107477737
no, never amd
>>
File deleted.
>>
>>107478167
i also had the same experience with noob and just stuck to illustrious. dont really know what's up with all the gassing noob gets when all of the example images and gens for noob look like the same fucking slop you can make with illustrious.
>>
waow
>>
>>107478199
There is no way that kind of cameltoe gets by the jannies.
>>
File: 1760861965198804.jpg (276 KB, 1439x2926)
276 KB
276 KB JPG
you can't make money with A-
>>
>>107478204
i'm asking again. which illustrious shitmixes are you always talking about
>>
>>107478178
What if I don't to be hardlocked into a specific artist and just want to generate an image of my prompt? Unless your implying that doesn't work with Noob. It works with illustrious and it worked with pony.
>>
>>107478189
>They only use LLMs/Autoregression for the text encoder
And they actually do not even use the AR LLMs autoregressively, it only generates embeddings in a single forward pass
>>
>>107478192
the only thing indians are good for is making instructional videos on youtube, and they're damn good at it.
find you a pajeet, follow the instructions, and they'll have your car running, your wife pregnant, and comfyui installed all in like 20 minutes.

https://www.youtube.com/watch?v=SVZMN1gUYvU
>>
>>107478212
>What if I don't to be hardlocked into a specific artist
pick a different one. there are thousands. hell, mix artists together.
https://tagexplorer.github.io/#/artists?tagFilter=&page=1

default slopstyle is so tiresome and boring
>>
>>107478212
this is the issue i have with sloppers like you
2d ai gens are supposed to look like actual art, not soulless styleless slop
it started with AbyssOrangeMix and it all went downhill from there. you don't care about aesthetics, you care about content, about slop
>>
>>107478228
I immediately close Indian videos.
>>
Is Zimage actually better than illustrious? I'm starting to see a lot of loras for it suddenly.
>>
>>107478228
comfyorg should pay this brilliant engineer
>>
>>107478237
every single lora I've used so far is shit for it
>>
>>107478237
It's good, but chaining LoRAs doesnt't work so well and degrades images hard.
>>
>>107478228
I was joking. I only said that to clown on the retard who doesn't use the non-portable version.
>>
>>107478085
NTA, but that is the problem. Those two noob merges are not that well known. Base Noob sucks yet is the most popular one. For illustrious, the base ist not that much downloaded whereas its merges like WAI are way more popular. Gonna try those out later and see if my opinion of noob changes. Oh and another problem is that I have no reference workflow for noob.
>Just google it
I did and there none. The official page links to some workflow in the huggingface repo for vpred 0.5, like a year old at this point. This one has clip skip -2 even though they officially say we don't need that?? Like if they themselves can't be half assed to teach how to use their model why should I?
>>
i havent seen anyone showcase a noobai gen that made me go "HOLY SHIT I NEED TO CHANGE RIGHT NOW FROM ILLUSTRIOUS" ever
>>
File: fluxtest.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>
>>107478234
that's why your shit doesn't work right
>>
File deleted.
kek, my furry setup with that naiXLVpred102d_custom model
>>
>>107478294
i haven't seen a single person saying which illustrious model they're using
>>
>>107478313
everybody uses wai
>>
>>107478316
oh...
>>
>early access loras for turbo model
sigh
>>
>>107478316
plantmilk walnut is pretty great desu
>>
>>107478313
wai is all you need but as the other guy said, walnut is also really good.
>>
File: 1759716916077969.png (1.93 MB, 1024x1536)
1.93 MB
1.93 MB PNG
>>
why are cumfart users so threatened by other uis?
>>
>>107478326
>turbo model
It's the only model we'll get so I guess we have to make do.
>>
>>107478369
If you saw someone about to stick their hand in a blender, would you watch or would you try and stop them?
Comfy users feel the same urge to prevent self harm when they see people using other UIs.
>>
File: ComfyUI_01776_.png (710 KB, 1024x1024)
710 KB
710 KB PNG
>>
>>107478369
It's like getting accustomed to blender after self teaching on the 2.79 style for years and then suddenly 2.8 - 3 drops and its like an alien fucking language and everything is nested, the names and functions are all changed, the mouse clicks don't do the same shit anymore and it just doesn't feel comfy anymore, that's why. /vidrel
>>
>>107478230
I never claimed to care about "aesthetics", or I'm saying I'm not getting any good quality using base noov 1.0 vpred?
When I used janku I got coherent images but that basically just looked like wai v14 so I just stuck with that.
>>
>>107478383
why would i stop someone i dont know from blendering their hand? what a stupid comparison.
>>
well no wonder trannime ai gen scene is fucking dead since all of you use wai and don't care about aesthetics
>>
>>107478401
Because... human decency? lol are you brown or jewish or something? or just jaded?
>>
File: screenshot.1765183657.jpg (99 KB, 1067x254)
99 KB
99 KB JPG
>>107478369
They don't. The Z-Image-Turbo workflow having 2M downloads in a span of 10 days is proof 99.9% are using ComfyUI, so 'cumfart users' feeling threatened is entirely fabricated in your own delusional head. (You), anti-comfy schizo/anistudio/FizzleDorf, are the only person constantly bringing up UI's while everyone else is discussing current models. (You), anti-comfy schizo/anistudio/FizzleDorf, are the only person threatened by ComfyUI existing.
>>
>>107478408
you say that but every model after has been shit
>>
>>107478237
In terms of prompt adherence and resolutions it's clearly more capable.

Loras can be quickly trained at 512px in particular so maybe people just publish a lot of them because of it. It DOES make training many loras easeir than other models tho.
>>
>>107478408
It's not dead and almost no one can tell if something was made with NoobAI vs WaiNSFW.
>>
>>107478408
So whats your le hecking "aesthetics" pick then since youre clearly superior to us and totally not some random shizo ranting
>>
rolling my eyes at the amount of z loras being trained on booru tags when the model is so good at adherence already
>>
>>107478397
>It's like getting accustomed to blender
I would never make that comparison with cumfart. people have been begging for something sane for years and they only make it worse. imagine if blender only cared about proprietary 3rd party plugins?
>>
>>107478343
Is this a disco z lora?
>>
>>107478413
what does a dying model with shitty loras have to do with anything?
>>
Hmm, having a nice Chinese culture lesson this week?
>>
>>107478413
>so 'cumfart users' feeling threatened is entirely fabricated in your own delusional head
gee anon, why do you feel so threatened as to type this out?
>>
>>107478408
I literally only care about little girl buttholes and judge all models based on the ability to generate that, and little girl butthole technology for anime has not improved since pony
>>
>>107478428
If it's so good at adherence, it'll do fine with booru tags, right.
>>
>>107477956
an indian stole his job likely
>>
>>107478428
rolling my eyes at the amount of shit z loras that completely change the subject or nuke the quality.

this is why you do not make loras for distilled models.
>>
>>107478401
brownoid detected
>>
Civitai should've never made a category for it. When base releases, everyone will have to retrain their loras. This will be extremely confusing as users that don't know what's going on will still be dowloading shitty turbo loras not realizing the ones trained on base are the actual ones you're supposed to be using.
>>
>>107478498
>when base releases
>>
File: ComfyUI_01778_.png (891 KB, 1024x1024)
891 KB
891 KB PNG
hey guys, remember Flux? lmfao
>>
>>107478498
Civitai can rename the category if they care to. But the base first needs to actually release.
>>
File: 00.jpg (159 KB, 1006x257)
159 KB
159 KB JPG
>>107478498
Your average civitard doesn't care. everyone jokes about them being indians but its low tier trash people in general
>>
File deleted.
>>107477116
Thoughts?
>>
>>107478498
>When base releases
Anon. We discussed Chinese culture.
>>
File: gyate.png (786 KB, 1024x1536)
786 KB
786 KB PNG
>>107478516
sure. and flux.2 was better than flux.1. still used it a lot less than qwen or chroma so far.
>>
>>107478523
keep going

>>107475106
you too, keep going
>>
>>107478531
>>107478507
It will release eventually, even if it's API only. Ideally I'd like to think they saw the huge popularity with turbo and decided to throw more money into making base better for the release.
>>
>>107478520
it already is called Z Image Turbo on civitai
>>
>>107478410
>>107478496
if someone is deliberately sticking their hand into a blender and i dont know them, i'm letting it happen, simple as. human decency doesn't apply if you're doing something that stupid in the first place.
>>
File: file.png (2.51 MB, 1680x1184)
2.51 MB
2.51 MB PNG
>>
>>107478545
>I'd like to think they saw the huge popularity with turbo and decided to throw more money into making base better for the release.

Oh they'll be making it better, for their API.
Chinese culture you see.
>>
>>107478559
They were misled by a Turkish PHD into thinking Gradio UIs are somehow valid.
They are victims of scamming and misinformation.
>>
>guh, i need base, i need it, i NEED BASE
>finally gets released
>gee, i wonder what /ldg/ is prompting with it!
>1girl, standing
you know i'm right, you stupid subhumans will always gen 1girl with no substance, no matter how good the model is.
>>
File: file.png (2.11 MB, 1024x1024)
2.11 MB
2.11 MB PNG
World of Warcraft Tauren
>>
>>107478575
First day or two of a new model usually have fun creative prompts. That being said there won't be a base model so yeah.
>>
>>107478207
this guy has always been a clown lool
https://www.youtube.com/watch?v=irwF4EziLw0
>>
>>107478588
i dont care if it ever comes out if im being honest and i wipe my ass with chinese culture. i already got all i need with sdxl.
>>
File: 1764896367492660.png (554 KB, 903x500)
554 KB
554 KB PNG
>>107478545
>Ideally I'd like to think they saw the huge popularity with turbo and decided to throw more money into making base better for the release.
that's probably what happened, they wanted to release base less than a week after turbo and then saw the ultra hype and they didn't want to dissapoint on a mid base model, so they decided to work harder to get that lighting in a bottle moment again
>>
Z-Image Base releases, post the first prompt you're doing to gauge your creativity.
>>
File: 1750318955779709.png (194 KB, 1670x1164)
194 KB
194 KB PNG
https://artificialanalysis.ai/image/leaderboard/text-to-image?include-non-current=true
We finally got the ranking of Z-image turbo on artificial analysis (8th)
>>
File: lmaoo.png (14 KB, 1576x88)
14 KB
14 KB PNG
>>107478627
>the 80b model is 22th
AIEEEEEEEEE
>>
>>107478575
You could swap z-image with SD1.5 and 1girl sloppers won't notice. I like z-image's ability to somewhat accurately depict a scene for once.
>>
>>107478627
Turbo ranking that high while being distilled, less censored and local is already insane.

You can't do shit with FLUX.2 aside from make the most SFW corporate AI images possible.
>>
>>107478625
i'm just doing 1girl but this time with sex bob and vagene loras
>>
i need to see a side by side comparison of an sdxl gen and a z-image gen of 1girl
>>
>>107478627
All this shows is that RLHFing a model to perform well on a specific task (portraits/realism) matters more than versatility on memebenches
>>
>>107478627
>Seedream 4.5 seems to be much worse than Seedream 4.0
what happened?
>>
File: file.png (1.2 MB, 1312x912)
1.2 MB
1.2 MB PNG
>>107478625
Cult of The Autism Awareness Autism Carpet puzzle piece appreciators, full body view pixel art full body shot wide shot of an awesome anthropomorphic brightly colored furry archer holding a class appropriate tactical combat shotgun based weapon type that fires gigantic thick long 2 handed claymores instead of arrows (especially if it's a bow or a sword or spear) battle stance a fantasy adventure, FULLY CLOTHED, isometric pixel sprite for rpg maker, cute animal nose, wielding gigantic hand held Howitzer cannon, scene takes place on playground on autism puzzle piece carpet inside crowded anthro furry convention, sad ball pit, sad inflatable pool toys
/picrel
>>
>>107478627
damn how did openai fall so far behind? also why are people ever shilling grok, gotta be indians
>>
>>107478677
The dream died
>>
>add any zit lora
>immediately slopped
>>
>>107478704
>also why are people ever shilling grok
this image model is actually really good, and the least censored of all API models
https://files.catbox.moe/v6xgus.webp
up is grok and down is Z-image turbo
>>
>>107478738
I cummed
>>
i'm grokking my shit rn
>>
File: file.png (20 KB, 421x472)
20 KB
20 KB PNG
>>107478704
Nawt Indian and I like grok 4 expert cause it lets me express my thoughts as code without the ability to code.. I have tried to learn and just.. couldn't .. I just don't think my mind is suited to it.. I self taught blender but cant seem to pick up code at all.. It's like an alien language to me.
I know its a meme and you have to babysit and hand hold it but if you format stuff logically it does get the job done, I made 5 different browser addons for brave including a centralized cookie manager with search and delete function, PI-Hole extension, x minority / religion / sportsball / zoomerspeak / ebonics filtering and has a learning ai that filters out whore-posts filter, my own volume manager with easily readability and ui enlarge functions, per domain and vidya / moosic url memory and it defaults to 3% and doesn't create any data for anything anywhere you go unless /until you touch the slider which includes incrementing and direct text value input and overvolume. /picrel
>>
File: Z-image turbo.png (1.34 MB, 1280x720)
1.34 MB
1.34 MB PNG
>>
File: ComfyUI_01797_.png (920 KB, 1024x1024)
920 KB
920 KB PNG
>>
File: Z-image turbo.png (1.48 MB, 1280x720)
1.48 MB
1.48 MB PNG
>>107478781
>>
>>107478764
>rendering everything
yeah it shows it was made with 0 thought.
>>
>>107478819
kek
>>
>>107478704
Grok is actually pretty good if you want to animate an uploaded sfw picture, or t2i nsfw. Unfortunately indians ruined i2v by uploading too many pictures of real women.
During the brief point it was completely uncensored I felt no reason to gen anything locally at all, but now its cucked again so here we are.
>>
File: 107477581.png (738 KB, 1024x608)
738 KB
738 KB PNG
>>
File: file.png (48 KB, 796x545)
48 KB
48 KB PNG
>>107478823
I actually kept telling it I didn't want it to do that but it just wouldn't listen, I'm not shilling it, just giving my personal experience with it, I still think its a neat thing to play with. and now I have stuff that did not exist before because of it, I would say that's pretty useful. When I eventually learn better ways of doing thing's I will improve it.
big and easy to read / interact with.
/picrel
>>
>>107478850
Can you imagine how much nicer the world would be right now if
1. They never had internet
2. They never had English forced on them

They'd be off in their own Hindi part of the internet
>>
File: COOM-BREACH!!.gif (177 KB, 1000x1111)
177 KB
177 KB GIF
>>107478850
Yeah.. it was .. fucking.. incredible..
/picrel
>>
>>107478819
hehehe
>>
>>107478872
man that UI looks like garbage, I guess function over form in this case. Anyway tip from a real GUI/UI/UX engineer, for big lists ask it to implement virtual scrolling
>>
>>107478850
i remember back when people would @ grok for image editing on twitter some people were @ing grok on selfies women have taken asking it to prompt some shit like "add glue all over her face and stick her tongue out" and it did it. 99% sure that was the breaking point
>>
File: Z-image turbo.png (1.37 MB, 1280x720)
1.37 MB
1.37 MB PNG
>>
File: 107477599.png (1.24 MB, 1056x1024)
1.24 MB
1.24 MB PNG
>>
>>107478891
Thanks, I'll add that to my list of terms.
>>
>>107478884
I don't believe indians wouldn't have learned english by themselves, everyone can talk english in this world, even countries that weren't colonized by The United Kingdom
>>
>>107478704
>damn how did openai fall so far behind?
By censoring their shit to hell and back. They had a moat at Dalle 3, but stopped having it after Gemini 2.0 Flash roughly.
>>
>>107478966
I wonder if that "adult mode" they've been speaking about will actually arrive this december, or if it'll be yet another nothingburger.
>>
>>107478704
>>107478966
The first company that managed to be relevant on one domain isn't always the company that'll dominate in the long term, ask Nokia and Kodak about that
>>
File: 00453_376831015.jpg (1.52 MB, 3136x1344)
1.52 MB
1.52 MB JPG
>>
File: May 13, 2024.png (250 KB, 3347x757)
250 KB
250 KB PNG
>>107478976
>I wonder if that "adult mode" they've been speaking about will actually arrive this december
they promised "adult mode" for years at this point
https://hypebeast.com/2024/5/openai-considers-allowing-ai-generated-nsfw-adult-content-info
>>
File: file.png (14 KB, 409x315)
14 KB
14 KB PNG
>>107478891
and a neat little Java based video converter/compressor/resizer with audio stripping function and preset sizes for the chans.. so I don't have to use web services or freemium crap.
>>
>>107479003
>javafx
why bro WHY, also you have webmforretards which also includes video editing, why re-invent the wheel?
>>
Fuck it, I'm gay now. The 12 inch dildo doesn't hurt anymore
>>
File: 107477610.png (1.55 MB, 1056x1024)
1.55 MB
1.55 MB PNG
>>
>>107479003
https://argorar.github.io/WebMConverter/
this has existed for years and does everything that does + much more. I have no idea why you wasted time doing something that has been done hundreds of times already.
>>
someone needs to make a model thats "good enough" i dont even want the best, then i'll fuck off for 200 years. im a simple guy
>>
>>107479012
I didn't know that existed, is that for windows? And I wanted java for the universality so I could bring it with me irrespective of operating system / windows / linux. So Java was a deliberate choice. It's the only universal thing I know of. Least I'm trying ; / There are plenty of people who are so lazy they wouldn't even go this far..
>>
>>107479038
Z-Image-Turbo
>>
>>107479038
All I want is an edit model that has the realism of Z-image turbo, and yeah it's called Z-image edit but will it really be released? Let's hope so...
>>
>>107479038
all current models are 'good enough'.
>>
>>107479038
grok it up
>>
File: content kot.png (581 KB, 1152x1024)
581 KB
581 KB PNG
>>107479023
>.> I just told you I didn't know it existed.."
The potential even at this very early stage of 'agentic coding' s exciting to me and fires my imagination.. I would still wanna make my own stuff just because I can, it only makes me wanna make more stuff and make my stuff better, I am content and I am /comfy.. why are you trying to discourage that : (? I love makin stuff and I am slowly picking things up by working in reverse from the finished code to little by little kind of.. sorta .. learn a little here and there.. gotta start somehwere..
>>
>>107479051
>>107479064
what i mean is

if there's a noob/illustrious finetune to the base model and natural language actually works with tags, ill fuck off for 200 years

>>107479061
ofc it'll be released :)
>>
>>107478976
>adult mode
Sora 2 censors even fully clothed women doing something utterly innocent at the slight hint of a fetish. Don't think a model you can actually prompt for nudes is ever happening.
>>
>>107479082
There's literally no one competent enough to finetune base (if it releases).
Noob bros are doing their own thing.
>>
File: bad ending.png (138 KB, 2041x639)
138 KB
138 KB PNG
https://github.com/meituan-longcat/LongCat-Image?tab=readme-ov-file#model-download
>mid models get released fully
>great models have the infinite "will be released SOON(tm)" loop
remember, it is local until it's good
>>
>>107479067
wish they didnt nerf i2v nsfw
>>
>>107479089
I doubt they'll allow anything explicit but
>Only content suitable for audiences under 18 (a setting to bypass this restriction will be available in the future).
I wonder if they'll keep their word on this
>>
>>107479099
Musk wants the anime titties all to himself
>>
File: file.png (1.06 MB, 2726x1532)
1.06 MB
1.06 MB PNG
>>107479096
we must hope.
>>
File: 0.jpg (657 KB, 1200x630)
657 KB
657 KB JPG
>reddit tourist didnt know about webm for retards
>>
>>107479070
you're not learning anything tho. I mean it's fine if you're experimenting and are 'wowing' at your newfound ''''capabilities'''', but it's NOT YOU doing it, it's the AI shoddily copying code around (with more or less degrees of handholding depending on what youre trying to make or what already exists).
Vibecoding stuff that already exists is really dumb too, you're literally wasting your time. Also nobody, NOBODY is interested in your vibecoding escapades. That's all.
>>
Does anyone do any proper testing and reviews of the hundreds of merges on Civitai? So many has a flashy cover image but is utter shit to use or barely changes from the base model.
>>
File: 1762484544292519.png (306 KB, 814x802)
306 KB
306 KB PNG
>>107479089
No way in hell it doesn't happen, probably just will be monitored to shit and back, Indians ruined Aurora 2? (grok video nsfw i2v) yeah I bet, I don't doubt for a second they were making straight CSAM with that shit as fast as they could.
uncensored grok image to video was fucked as all hell, and it would do ANYTHING if it was furry it didn't care, even beastiality, and much much darker and more fucked up shit which I felt like a part of me died inside when I saw... actually started shaking unironically like an addict in withdrawal, the fucked shit getting dropped on /trash and discord from furry's was beyond the pale harrowing so I don't even wanna think what a normie streetshitter abomination would have made. In a way I'm actually thankful they nixed the uncensored i2v for grok as "society" definitely demonstrated it is not ready to be responsible with that kind of power.
>>
>>107479147
It'll probably happen but you'll have to provide your Government ID or some shit lmao
>>
>>107479147
>In a way I'm actually thankful they nixed the uncensored i2v for grok as "society" definitely demonstrated it is not ready to be responsible with that kind of power.
local can do it anyway so nothing changed
>>
File: Autism_Haver_8btenc.png (437 KB, 1184x1008)
437 KB
437 KB PNG
>>107479104
Well eventually aurora 2 will drop for local cause it's musk.. so.. when that happens you'll be able to turn it off and make the most depraved shit your mind can conceive of I'm sure..
>>
>>107479160
>aurora 2

whats that?
>>
>>107479126
When I said learning I men't working backward through the code to come to understand the what's whys and how's, that IS learning. and that was what I said in the context of that so I am.
It's the same way I learned how to make vrchat avatars in unity and I made over a thousand and over 100 worlds, people recognize my shit in vrchat to this day. and I LEARNED by working backward from established existing examples. just like I'm doing with the code grok produces.
>>
File: 00214-2550412747.png (1.03 MB, 896x1152)
1.03 MB
1.03 MB PNG
why is society like this?
>>
File: 113068302.jpg (263 KB, 1280x1856)
263 KB
263 KB JPG
>>
>>107479160
>eventually aurora 2 will drop for local cause it's musk
you dont ACTUALLY believe this, do you?
>>
>>107479160
>Well eventually aurora 2 will drop for local cause it's musk
like grok 3 (since he promised he'll always release the previous version of grok locally) right? :^)
>>
>>107479174
you're not learning how to code, you're learning how to use AIs to do it, big difference.
>>
>>107479206
Aren't they releasing 2 generations behind? So when grok 5 is out, we'll get grok 3
>>
>>107479196
>>107479206
nta but he'll probably release it when everyone sees it as outdated and inconsequential like grok 2
>>
>>107479192
box?
>>
File: ComfyUI_AE33.png (1.73 MB, 2048x3072)
1.73 MB
1.73 MB PNG
>>
>>107479229
>Aren't they releasing 2 generations behind?
he said "one generation behind" but yeah in reality it's 2, he's waiting for the model to be completly useless before releasing it
>>
File: 1758482570190856.png (3.01 MB, 1920x1080)
3.01 MB
3.01 MB PNG
>>107478627
>hmm I will try Nano Banana
>sorry can't do that.
>sorry can't do that.
>you're out of credits until tomorrow.
Why is everything a nightmare? I wasn't even trying to get it to deny me.
>>
>>107479256
>pic
Anything even mildly in the vicinity of being sensual or sexual is not allowed. Google is even more fucked than OAI is with their image/vid models
>>
>>107479256
You're
>>
>>107479256
box the gal pls!
>>
>>107479158
Yeah but it looks like shit, takes 100s of gens and they all take forever, if your trying to do furry, if your a fucking normie pumping out basic bitch cookie cutter humans is a cakewalk, but its all so mundane and boring... not to mention there's just nothing like aurora 2 on offer in terms of a drop in click button and get result with sound that doesn't suck shit. enjoy your 3 -12 minute wait for your 300th shitty 6 second video clip in a row of your mutated nightmare fuel abomination kek. I Think I'll stick with /vidrel.
>>
>>107479089
really? on release week i prompted a bunch of perona using zoro as a footstool and waving a full condom around and laughing, i assume they nerfed it since then
>>
>>107479165
Thats what "grok" imagine (image and text to video actually is under the hood, musk aquired a company and their ip for vidya gen called Aurora and he put out Aurora 2). When you gen a video on grok its actually Aurora 2
>>
File: Nano Banana Pro.jpg (1.1 MB, 2816x1536)
1.1 MB
1.1 MB JPG
>>107479266
>Google is even more fucked than OAI is with their image/vid models
not even close, Google allows you celebrities and IP characters at least
https://www.reddit.com/r/nanobanana/comments/1pg9iam/quentin_tarantino_paul_dano_walk_into_a_bar/
>>
>>107479285
is there a way to use grok i2v without using an app?
>>
>>107479296
I remember x using Flux for genning. This is new to me.
>>
>>107479196
Why not? People said the same about Grok 1 and 2 and he did. maybe he will maybe he won't but I think there's a good chance he will given what it will take to run it most wont be able to run it on their own anyway. But he still did it.
>>
>>107479298
iirc Veo 3 is more locked down than Sora
>>
>>107479281
https://files.catbox.moe/hnr12m.png
>>
File: file.png (81 KB, 255x198)
81 KB
81 KB PNG
>>107479330
>But he still did it.
he didn't, he said he'll release grok locally once a new version comes out, grok 4 is out and I'm not seeing grok 3 on huggingface is it?
>>
>>107479338
thanks anon
>>
File: file.png (11 KB, 307x200)
11 KB
11 KB PNG
>>107479211
lol.. if i learn the information that is required for code to come out of my brain shorn of an a.i have I then not learned then the beginnings of how to code? just because an ai gave me the end result that I then deconstructed and worked backward from how does this invalidate knowledge gleaned from analysis of said code? that is learning, the fuck do you mean. AI SO IT NOT LEARNING. WHAT I AM NOT CLAIMING is that if I tell an ai to make me something and it does that i am a coder or that I am learning code. obviously that would be untrue and fucking retarded. But you are either not seeing that or are deliberately choosing to conflate the two to be an irritation.
>>
File: Z-image turbo.png (1.45 MB, 1280x720)
1.45 MB
1.45 MB PNG
>>
>>107479299
yeah, web interface
https://grok.com/imagine
>>
>>107479431
>cant do jiggly titties
Garbage
>>
>>107479308
maybe for the images, but video is Aurora 2
>>
Does comfy have a text node that doesn't run out of space like the pysss one?
>>
File: file.webm (1.85 MB, 560x560)
1.85 MB
1.85 MB WEBM
>>107479440
It 100% can, you just can't ask for it directly. here's how. Pre-generate your desired character /proprotions, then drop it in and specify in a non lewd way what you want. to get to your desired result. example.. no zoom. no music. [safe subject description] is seen happily and excitedly bouncing on feet on the spot. the result is /vidrel in my case.
>>
why is it harder to train lora on distilled models?
any technical reason?
>>
File: sus.png (127 KB, 1369x480)
127 KB
127 KB PNG
it's been 3 days since their last message on trooncord
>>
any new interesting loras on civit for Z?
>>
File: Z-image turbo.png (1.8 MB, 1280x720)
1.8 MB
1.8 MB PNG
>>
>>107479603
It's shit, returning to Qwen and WAN
>>
>>107479622
what lora? good style!
>>
>>107479631
no lora, it did it by itself
https://files.catbox.moe/mgusdg.txt
>>
File deleted.
>>107479440
sometimes you will need to rephrase prompt and the nipples can not be visible to begin with also sometimes it just blanket won't accept an image.
pregen your image - you can get away with a shocking degree of shit still.. again.. sometimes it will reject the generated result.
my prompt + image: no zoom. no music. no shrinking. no deflation. anthro furry endowed with gigantic foggy hazy pregnant belly. her belly steadily progressively becomes more pregnant unrelentingly without stopping. moaning. air hose connected between legs to running air pump in the background. /vidrel
>>
>>107479655
can you guys at least bully this shit out?
>>
kek you can get away with some pretty based shit too. Even jewed out Aurora 2 will let you do wild shit. literally my prompt verbatim: (no music. zoom out to full establishing shot of both characters. autistic anthro furry fox wife kissing and loving and nuzzling on her based and redpilled autistic human chud husband he is wearing a red T-shirt and a Nazi Party armband emblazoned with a Swastika and his shirt says in large bold black clearly defined text "Billions must Yiff". the shirt text is centered.) and Grok was just like, sure thing boss no problem!
/vidrel.
>>
>>107479706
wtf lol
>>
There always has to be some kind of fucked up degenerate autist with disgusting fetishes from when they got abused by their parents as a kid, spamming these threads.
>>
>>107479603
loras on distilled models will always suck, we need the base model to get this shit working correctly
>>
>>107479535
for knowledge distilation it's quite obvious: there is related knowledge that got lost that would very much help with learning.

they, for example, just ensured 1girl frontal plasticface fluxchin was working but had the model drop other stuff. this may have been done with technology automatically vs benchmark scores or just because the distills that were too useless were human filtered.

for step distilation with like dmd... I can't explain this one adequately but it still empirically is worse to train against
>>
>>107479535
>why is it harder to train lora on distilled models?
>any technical reason?
- LoRA works by adding low-rank updates to a model’s latent space.
- Turbo’s compressed, optimized latent space is less flexible, so LoRA updates often degrade output.
- Few-step sampling amplifies this: changes have less “room” to take effect without artifacts.
>>
File: HDC.webm (2.94 MB, 1080x1080)
2.94 MB
2.94 MB WEBM
>>107479734
LOL I was raped so fucking hard as a child actually yes. But at least I'm not a jew and I'm still based xD. I come from a masonic naval family even megakek.
But wait it gets even more ironic...
I'm a fucking hermaphrodite too!
and I believe in Gnosis but not in the satan's chosen or demonic or religious context shit. I'm talking about that real shit xD. Full Bore schizoid love and light based manifestation the inner superseding self stuff.
>>
Was TensorRT deprecated in more recent comfyui updates?
I made a fresh install of comfyui, and installed tensorRT to get better performance on my gens (like I used to do in my previous install), but it's failing to install
>>
File: file.png (141 KB, 240x344)
141 KB
141 KB PNG
>>107479789
>Full Bore schizoid love and light based manifestation the inner superseding self stuff.
>>
>>107479640
nice!
>>
>>107479734
obviously something has to be fucked up in your head to end up acting like this, or else they're mentally ill or else they got abused by their parents or some shit
>>
>>107479734
>>107479806
Furfags always come from being raped as a boy so you're right but you also need a specific autism expansion pack to be Chris Chan about it otherwise you just become a discord trap instead of a 4chan spammer
>>
File: Comparison.jpg (3.88 MB, 5546x2213)
3.88 MB
3.88 MB JPG
>>107479622
Nano Banana Pro is pretty ass at styles desu
>>
File: file.png (85 KB, 400x400)
85 KB
85 KB PNG
>>107479836
Oh I got that expansion pack.
Sorry if my autism is too much, been up all night anyway so I'll head on out, you all have a good morning / evening.
Least I'm not as fucked up as the guy talking about only caring about local gen so he could make "little girls butt holes" earlier in the thread.
>>
>>107476773
I love the Amiga barbarian pic
>>
Relevant comfyui longcat update 5 minutes ago:

>Add CPU offload support for low VRAM GPUs (#1)
https://github.com/sooxt98/comfyui_longcat_image

It might work for more people now.
>>
>>107479883
>19gb vram
...uh
>>
File: file.png (1.03 MB, 1222x1140)
1.03 MB
1.03 MB PNG
>>107479883
>longcat
more like midcat
>>
>>107479891
It's lower with this change:
https://github.com/sooxt98/comfyui_longcat_image?tab=readme-ov-file#vram-requirements

RTX 3080, 4080 also work now.
>>
>>107479411
miku chiizu
>>
>>107479865
>Oh I got that expansion pack.
>Sorry if my autism is too much
Thats ok 2AM-8AM EST is blogposting hours in between model releases. I would have done so as well but I have nothing to blog post about, I just want my audio+video model but I have complained about that enough

>Least I'm not as fucked up as the guy talking about only caring about local gen so he could make "little girls butt holes" earlier in the thread.
Who do you think you're replying to kek :)
>>
>>107479665
Nah, I'm too fascinated by it
>>
File: z-image_00090_.png (789 KB, 1024x1024)
789 KB
789 KB PNG
>>
>>107479978
heeh that's not very nice :(
>>
File: longcat.png (1.46 MB, 1344x768)
1.46 MB
1.46 MB PNG
>>107479899
>midcat
i wouldn't say you're wrong, in multiple ways

but i think the model isn't useless
>>
>>107480157
lmao her left hand is on the wrong side, must hurt
>>
>>107480157
>3 legs on the most basic pose ever
I thought this era was over
>>
>>107480157
it's not bad for a food delivery service
>>
File: longcat.png (1.4 MB, 1344x768)
1.4 MB
1.4 MB PNG
>>107480175
of course not?
>>
>>107480194
>of course not?
never seen 3 legs on Z-image turbo so far, and I made more than 10000 pictures at this point
>>
File: longcat.png (1.46 MB, 1344x768)
1.46 MB
1.46 MB PNG
>>107480199
it probably will come back on some finetunes
>>
File: Comparison.jpg (1.41 MB, 3810x1187)
1.41 MB
1.41 MB JPG
>>107480157
>>
>>107480220
>finetunes
if Z-image base gets released it'll be the only model that'll be finetuned, people tend to finetune on the best base model possible, and not start over on something inferior
>>
File: OH NO NO NO.png (249 KB, 512x512)
249 KB
249 KB PNG
>>107480232
>if Z-image base gets released
>>
File: 1750819327402933.png (3.83 MB, 3772x2479)
3.83 MB
3.83 MB PNG
I'm surprised SPRO is ranked so high, I found this model to be really weird to play with
>>
File: Yeah I don't think so.png (3.59 MB, 2816x1504)
3.59 MB
3.59 MB PNG
>>107479266
>Google is even more fucked than OAI is with their image models
lul
>>
File: longcat.png (1.36 MB, 1344x768)
1.36 MB
1.36 MB PNG
>>107480232
most finetunes are likely to focus on z-image if it gets released, sure
>>
>wan 2.2 i2v nf4
is it any good?
>>
File: longcat.jpg (151 KB, 1344x768)
151 KB
151 KB JPG
>>
>>107480290
>>107480389
what about the edit side, is it good on editing shit or recreating characters?
>>
File: Z-image turbo.png (1.69 MB, 1280x720)
1.69 MB
1.69 MB PNG
>It knows the logos and can do motion blur
Really nice.
>>
>updated comfyui after a thousand years
>it STILL doesnt allow a custom output folder
>you have to resort to custom nodes or UNCOMFY naming patterns
I dont want to waddle through folders to find where my porn is located
>>
>>107480402
haven't tried it yet

less interested in that due to qwen (and on brief testing maybe also flux.2) being very good
>>
>Finally got back to my pc after weeks stuck in another country
Hyped as fuck to gen some asian girls bros. How good is Z-base?
>>
File: file.png (334 KB, 686x386)
334 KB
334 KB PNG
>>107480466
>How good is Z-base?
>>
>>107480483
I don't know at all
Did it turn out to be shit?
>>
>>107480419
>>107480389
>>107480225
We're all hyped out from ZiT, so unless it's another Z model, nobody's got the enthusiasm to GAS over new releases right now.
>>
File: 1750835564777818.png (762 KB, 1329x1219)
762 KB
762 KB PNG
>>107480488
>Did it turn out to be shit?
it's not there yet
>>
>>107480498
>We're all hyped out from ZiT, so unless it's another Z model, nobody's got the enthusiasm to GAS over new releases right now.
it's more like as long as a new model isn't better than Z-image turbo it's a waste of time to try it out, no one would be willing to downgrade their product and it's a completly normal reflex
>>
Everyone's burnt out on shilling new models at this point. All the shilling got into ZiT already and there's nothing left. Only way anyone's gonna care is if it's something big from the same family like Z Base or Z Edit
>>
>>107480500
It was supposed to be 2 weeks ago though...
>>
Anyone using Z-Image-Edit on cheap GPUs?
I'm downloading but it's a 20GB model, I doubt it's gonna fit in my rtx 3060 with 12gb
>>
>>107480523
Yes but if a 7b model comes out that can do 5 mikus instead of 4, even if it's a tiny bit better, I don't think it will be relevant.
Personally I don't care about learning about more models anymore and I hope CivitAI keeps going deeper into loras and stuff with ZiT, my shilling energy is depleted.
>>
File: 1765120864580.jpg (453 KB, 1280x1600)
453 KB
453 KB JPG
bigger z model when?
>>
File: ZiMG_00909_.png (2.44 MB, 1344x1728)
2.44 MB
2.44 MB PNG
>>
>>107480551
they're still working on it, my theory is that they wanted to release it a few days after Z-image turbo, but after seeing the hype explosion they prefered to cook the base even more so that it gets at the same level as turbo and it won't dissapoint us
>>
>>107480568
>Yes but if a 7b model comes out that can do 5 mikus instead of 4, even if it's a tiny bit better, I don't think it will be relevant.
it will if it's a base model instead of the distilled turbo model we have yet, that's all we need, a base model at the level of turbo and we're definitely saved
>>
>>107480579
You're so cute anon
>>
>>107480563
the edit model is out?
don't see it on the huggingface
>>
>>107480563
>Anyone using Z-Image-Edit on cheap GPUs?
you mean Qwen Image Edit?
>>107480595
you forgot to say I don't understand C H I N E S E C U L T U R E :(
>>
>>107480569
use flux 2 if you want bloated shit
>>
>>107480579
They're censoring the cunny now. It's over. This whole time they've been cucking the model.
>>
>>107480601
>>107480609
oh right sorry. Yes it's qwen-edit
I'm downloading from inside comfyui
>>
>>107480569
>bigger z model when?
Imagine a 14b Z-image base model, it would be Nano Banana Pro at home
>>
Can someone just explain chinese culture to me
>>
>>
>>107480638
you used a wojak lora or something? kek
>>
File: Z-image turbo.png (1.41 MB, 1280x720)
1.41 MB
1.41 MB PNG
>>
Baker?
>>
>>107480687
He was so disappointed that base isn't releasing that he left
>>
We should just let it die and come back when base releases
>>
>>107480634
Simple: if a crime doesn't have any witnesses it never happened.
US culture is different: when you are a millionaire crime doesn't exist.
>>
>>107480762
Bye forever, Anon. Was nice sometimes.
>>
>>107480829
inpunity towards rich people is more pronounced in china lol
>>
>>107480427
after a thousand years you still dont know about sym links..?
>>
just start a new thread already I want to shitpost
>>
>>107480762
I mean, this place will be pretty dead if they announce Z-image base won't be released after all, it's the kind of blow that can kill a general
>>
>>107480897
Is there really nothing else to look forward to?
>>
https://xcancel.com/grmchn4ai/status/1996547150520651804#m
lmao
>>
>>107480998
I think LTX 2 is supposed to be released soon
>>
>>107480998
not really, good luck trying to replicate Z-image turbo's success, only Alibaba has the secret sauce
>>
>>107481019
fuck me thats good! is it cumfy ready?
>>
File: not ready.png (71 KB, 290x174)
71 KB
71 KB PNG
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo/discussions/81#6936a24bd6c891ad17323df6
>Then honestly promising a model dedicated to fine-tuning, purely dedicated to the open-source community, only to ultimately say, “No, fuck off, pay for our API after all,” would be the worst move.
> treating people in the community like idiots is shooting themselves in the foot.
>>
>>107481025
ltx is jan 26
>>
>>107481075
someone has a huggingface account? Reply to him stating that it is simply part of chinese culture
>>
>>107481092
do it coward
>>
>>107481092
kek



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.