[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


I Didn't Lose Respect Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107341067

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://huggingface.co/Comfy-Org/z_image_turbo

>WanX
https://rentry.org/wan22ldgguide
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>>107342183
>I Didn't Lose Respect Edition
you lost your mind
>>
>>107342173
not sure what you're saying. I'm saying chroma is better, that gen is from Z. here's how it should be from chroma.
>>
>>107342198
oh apologies i misread kek im sweepy
>>
File: ComfyUI_08973_.png (1.47 MB, 1152x1152)
1.47 MB
1.47 MB PNG
>>
File: gallows.png (1.34 MB, 1500x1500)
1.34 MB
1.34 MB PNG
I don't understand why everyone is circlejerking over the chink model. It's barely above dogshit
>>
>>107341954
Ridge racer 2025 looks good.
>>
>>107342179
>>107342183
no, that's not how it works, none of these models are being used in a way where it's possible for them to "refuse" a response, they only activate the "model understanding" state layers. There's no typical chat context going on at all. The "Thinking" version only might help if it happens that it's just trained better, that's why I was gonna try it.
>>
>>107342183
Yay, more coomerslop. Can’t wait!
>>
File: ComfyUI_08974_.png (1.43 MB, 1152x1152)
1.43 MB
1.43 MB PNG
>>
File: ComfyUI_08975_.png (1.33 MB, 1152x1152)
1.33 MB
1.33 MB PNG
>>
>>107342229
Art at large is driven by quote coomerslop
>>
Barely above dogshit is hype for the current state of local

>>107342214
no, that's not how it works, none of these models are being used in a way where it's possible for them to "refuse" a response, they only activate the "model understanding" state layers. There's no typical chat context going on at all
Do you have a resource that goes more into detail about this like a huggingface blog or otherwise or should I just talk with Claude a bit about this. It doesn't really make sense to me that the "refusal vibes" from reinforcement learning of the weights don't propagate to other layers even implicitly. I guess you'd need to test with an abliterated version of a model and the normal version to actually compare the effect of this in practice
>>
File: z-turbo_00047_.png (804 KB, 1152x896)
804 KB
804 KB PNG
>>
>>107342249
Yeah, that’s why I only care about the small percentage of good art then.
>>
please care about flux 2
>>
does flux 2 still feature the famous bfl buttchin?
>>
Can I finally delete my hundreds of SDXL finetunes?
>>
>>107342269
soon just a little longer
>hundreds
oh you mean mixes
yeah delete those
>>
>>107342256
>Do you have a resource that goes more into detail about this like a huggingface blog or otherwise or should I just talk with Claude a bit about this. It doesn't really make sense to me that the "refusal vibes" from reinforcement learning of the weights don't propagate to other layers even implicitly. I guess you'd need to test with an abliterated version of a model and the normal version to actually compare the effect of this in practice

the TLDR is they run the LLMs when repurposing them for image models in a pure processing state where the "personality" simply isn't there at all, the output aspect isn't active.
>>
>>107342260
>good
*boring
>>
File: z-i-t.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>107342263
I'd say no, but I only managed to do a bunch of dozen gens.
>>
File: ComfyUI_07625_.png (3.53 MB, 2048x1280)
3.53 MB
3.53 MB PNG
>>
>>107342263
It's already a miracle it knows what a woman is, I wouldn't even have been surprised if they decided they were too unsafe.
>>
File: it's okay.jpg (781 KB, 2048x2048)
781 KB
781 KB JPG
What is the recommend image render size for z-chad?
>>
>>107342309
did you prompt for the hindi writing? does the model know non english/chinese scripts?
>>
File: 1740488242326084.jpg (503 KB, 2048x2048)
503 KB
503 KB JPG
>>
>>107342317
i find that any kind of vertical resolution (3:4, 9:16) begins to heavily distort towards the bottom of the image
>>
File: z-i-t.jpg (806 KB, 2048x2048)
806 KB
806 KB JPG
>>107342317
it's relatively flexible actually. 2048x2048 isn't really a huge struggle yet. I think you can go larger than that?

nothing wrong with more than one image batched at 1024x1024 or w/e you like
>>
File: 1741556154582993.jpg (319 KB, 1088x1920)
319 KB
319 KB JPG
>Given that chickens and rabbits are in the same cage, there are a total of 35 heads and 94 feet. Find the number of chickens and rabbits.
wtf, zimage has LLM like reasoning abilities?
>>
>>107342313
it does hot realistic women better in every way then Qwen out of the box by like 10x, I dunno what your point is lmao. I don't think it looks as good for photographic gens as Flux Krea though.
>>
>>107342349
I meant reading their stance around safety, not that flux 2 is shit at it.
>>
File: ComfyUI_08979_.png (1.42 MB, 1152x1152)
1.42 MB
1.42 MB PNG
Mangled hands here, can't be fixed due to lack of seed variety. Now let's compare a subset of prompts.
>>
File: 1749877524155946.png (1.98 MB, 1024x1024)
1.98 MB
1.98 MB PNG
>>107342320
I think so, maybe
>>
>>107342317
what prompt is that?
>>
>>107342347
qwen 3
>>
File: 1734682893964765.mp4 (1.85 MB, 720x912)
1.85 MB
1.85 MB MP4
>>107342198
>>
File: file.jpg (834 KB, 1792x2304)
834 KB
834 KB JPG
>>107342338
true, it hallucinates down there
>>
how do you prompt different faces?
>>
File: z-i-t.jpg (109 KB, 1024x1024)
109 KB
109 KB JPG
>>107342347
i wonder how you managed to run qwen like that during imagegen
>>
>>107342405
Dont go above 2048 in one dimension.
>>
File: taystrip.png (2.65 MB, 1824x1248)
2.65 MB
2.65 MB PNG
>>
>>107342343
Me on the right
>>
File: z-i-t.jpg (95 KB, 1024x1024)
95 KB
95 KB JPG
>>
File: 1760708336120057.png (3.39 MB, 1939x1305)
3.39 MB
3.39 MB PNG
>>107342380
>>107342415
not me, zimage official showcase
>>
>>107342442
>official showcase
Stop looking at benches.
>>
I miss flux chins, hopefully someone will make a lora for newer models
>>
when gguf
>>
when zimage nunchaku
>>
please care about flux 2
>>
>obsess over "safety and ethics" when training a model
>someone else releases a completely uncensored model shortly after and everyone flocks to it
What's even the point. Just wasting GPU power
>>
File: z-i-t.jpg (599 KB, 3072x3072)
599 KB
599 KB JPG
>>107342435
me on the very left then
>>
>>107342347
They're running an LLM as a prompt refiner. It does a bunch of reasoning to come up with the final prompt. The prompt actually fed to the diffusion model has all that text in it, it's just an LLM came up with it. The diffusion model itself isn't magic and works like every other flow matching model.
>>
>>107342479
people say it's a business decision, I believe they just think safety is so important mangling anatomy is an ok price to pay
>>
>>107342391
nice i will cum to this tomorrow.
>>
>>107342479
>completely uncensored
I wish lol.
>>
>>107342489
oh I see, in that case this can already be done, you just need to ask any llm to give the reasoning
>>
>>107342489
is that different from Qwen which doesn't use diffusion? idk im tech illiterate
>>
>>107342479
it's way less censored than flux 2, but doesn't understand full porn, or male genitalia (yet)
>>
zimage was fun for a bit, but now im bored. my pattern recognizing brain is starting to notice similarities across all gens and there are major issues with certain aspect ratios.
when can i train a lora?
>>
File: 1754502798089804.png (1.58 MB, 1280x1280)
1.58 MB
1.58 MB PNG
>>107342423
>>
Is it finally time to retire SDXL-based finetunes?
>>
Has illustrious been surpassed yet?
>>
>>107342508
Not until there's an Illustrious/NoobAI finetune for z-image
>>
>>107342503
I don't think you are ever gonna get a corpo base model that can do full porn kek. That's gonna need the gooner within us to do it.
>>
>>107342505
btw it took a dozen tries for this chink model to write delete instead of derete lol
>>
File: z-i-l.jpg (279 KB, 2048x2048)
279 KB
279 KB JPG
>>107342442
i suppose we here are not / not usually doing that yet

>>107342510
in terms of prompt comprehension and various other capabilities, that happened quite a while ago

in terms of knowing the largest number of anime characters it's still either illustrious or noob
>>
>>107342522
grok imagine...
>>
so will the comfydevs neetmaxx a fix for Z tonight or will it have to wait until friday? im very interested in getting the FULLY WORKING setup going now that we know the fucked up implementation is good already.
>>
>>107342522
dalle3 was able to do porn (and the few unfiltered api keys showed it was possible), it's just that prompts are rewritten and a watchdog stops them from being ever shown to the public
they did the smart thing by actually adding everything they could but filtering the output
>>
>>107342320
no I didnt specify hindu writing, the idea was actually that the car drives in a western city and the jeets surround the car.
I think I didnt word it properly
>>
>>107342546
turkey day vacation :3
>>
>>107342550
oh ok
>>
WHERE ARE THE NEW MUSIC MODELS?????

WHERE???? EVERYONE IS GETTING WHAT THEY WANT EXCEPT US. IT'S NOT FUCKING FAIR. NOTHING HAS BEEN IMPROVED SINCE ACE-STEP.
>>
File: z-turbo_00086_.png (1.96 MB, 1152x896)
1.96 MB
1.96 MB PNG
>>
>>107342564
I'm here with you to pray about it anon, especially since the suicide of udio
>>
>>107342564
they are also shutting down suno. no more free songs.
I wish a suno nigga would leak 4.5 as a fuck you to warner.
please god please make this happen
>>
>>107342596
honestly, i'm more surprised it didn't happen sooner

RIP a legend
https://youtu.be/eMHqy54hM1s?si=-GoNy6hrosI2FQRB
>>
>>107342479
it's financially sensible is the point lol, they don't benefit in any way from us open source users, and businesses don't want to leverage things for commercial applications that are capable of deepfakes and / or CP
>>
File: ComfyUI_07667_.png (1.52 MB, 944x1280)
1.52 MB
1.52 MB PNG
>>
Civit already added a ZIT category. WHERE ARE THE LORA TRAINERS???
>>
>>107342596
Considering Universal partnered with Udio, it seems kind of dumb Warner didn't just partner with Suno as competition
>>
>>107342608
eggman won is better
>>
>sleepy as fuck
>wanna keep genning
>>
>>107342625
Wait nevermind. That is what they did apparently
>>
>>107342626
i bring you to the fire and throw you iiiiiiiin
>>
>>107342624
>Civit already added a ZIT category
they have agents ITT damn
>>
>>107342625
it not really a partnership bro. they used lawyers and weaponized the laws to strongarm suno and udio into "partnerships" aka shutting them down and crippling them.
their main problem is that they dont want every pleb to have access to making music. it threatens their creative control over what propaganda is distributed to the public.
these fuckings jews man
>>
File: file.png (3.08 MB, 1456x2128)
3.08 MB
3.08 MB PNG
>>107342624
are there even any trainers for this?
>>
>>107342489
Wait, is that how it works? I noticed before that it isn't just the encoder that's attached to the model, but I didn't think about it much more. Is it running prompts through a full Qwen3 LLM transformer sequence (encoder & decoder) before the final encode?
>>
>>107342637
Yep, first thing they did was to ban any music download on udio.
Your own generated music.
It was "funny" seeing retards probably too young to understand cheering for that shit on twitter just because it was anti ai.
Cheering for music majors is a new low.
I'm usually not into the bullshit about giving evil or good anthropomorphism to entities, but music majors are probably the few weaponized by law to be evil cartels.
>>
File: z-i-t.jpg (118 KB, 1024x1024)
118 KB
118 KB JPG
>>107342564
maybe you can eventually autotune a tts or something?

jokes aside, it would have helped if music had open models earlier, but that wasn't what suno and udio and others did
>>
>>107342624
That's crazy fast, how rare.
>>
>>107342637
the worst thing about situations like that is they made a huge fucking profit if jewood tried to fuck them over they could have just open sourced the model dissolved the company and started up a new in a jiffy but that never happens sin begets sin and everything goes to shit
>>
File: ComfyUI_276657_.png (1.39 MB, 1024x1024)
1.39 MB
1.39 MB PNG
>>
>>107342503
>it's way less censored than flux 2, but doesn't understand full porn, or male genitalia (yet)
I think Chinese gov could fuck them over since explicit sexual behavior can't be shown in any media. Legally it's not allowed in a public, commercial, or distributed sense.
>>
>>107342665
>Yep, first thing they did was to ban any music download on udio.
That sucks but couldn't there be a browser addon that just rips the music from the player?
>>
File: ComfyUI_276684_.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
>>
>>107342702
for now any AI related search has a pass, as most laws in China are enforced only when it's politically useful, so they close their eyes (this is why you can gen for nudity for example), but you are right that full porn is probably the limit
>>
>>107342708
That's what people did, and also there was 24h the old admins allowed download. It still sucked.
>>
File: ComfyUI_01614_.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
in future. AI will be use to fake crime
>>
Is there any info how they tagged images for Z training?
>>
>>107342726
ai generated evidence is already appearing in court
>>
anons where do I download the safetensor for zimage as 1 file instead of cut in pieces?
>>
>>107342317
>>107342343
Do 2048x2048 or equivalent 4MP, but no larger. 2048* mitigates the jpeg issue somewhat. If you go higher, it breaks quickly.
>>
File: 564456456451.jpg (341 KB, 2394x1645)
341 KB
341 KB JPG
>>
File: ComfyUI_01618_.png (1.87 MB, 1648x1024)
1.87 MB
1.87 MB PNG
>>107342738
>>
>>107342742
https://huggingface.co/Comfy-Org/z_image_turbo/tree/main/split_files
>>
File: neta4.png (66 KB, 801x685)
66 KB
66 KB PNG
NetaYume guy is gonna do Z after he releases NetaYume V4 I guess
>>
File: 445454514545.jpg (747 KB, 2394x2164)
747 KB
747 KB JPG
>>
>>107342292
Spiritually low-iq
>>
File: 454545451212.jpg (549 KB, 2394x1869)
549 KB
549 KB JPG
>>
File: ComfyUI_00011_.png (3.02 MB, 1360x2048)
3.02 MB
3.02 MB PNG
Is the A10G bad for image generation now? It feels a bit slow for z-image
>>
File: 1752979811806160.mp4 (2.73 MB, 720x720)
2.73 MB
2.73 MB MP4
>>107342365
>>
File: deDL_zi_00044_.png (3.02 MB, 2048x1216)
3.02 MB
3.02 MB PNG
>>107342596
>they are also shutting down suno. no more free songs.
source on this?
>>
File: 45544545464.jpg (591 KB, 2394x1869)
591 KB
591 KB JPG
>>
File: ComfyUI_19315_.png (1.47 MB, 1024x1024)
1.47 MB
1.47 MB PNG
>>
File: z-i-t.jpg (109 KB, 1024x1024)
109 KB
109 KB JPG
>>107342755
i hope he gets the base model with a suitable license to do this.
>>
File: 545554546545.jpg (754 KB, 2394x1942)
754 KB
754 KB JPG
>>
File: ComfyUI_01622_.png (1.84 MB, 1648x1024)
1.84 MB
1.84 MB PNG
>>
>>107342754
thanks anon
I guess there is no point in using the fp32 version from the main site vs fp16?
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo/tree/main/transformer
>>
File: 44455644564.jpg (1.04 MB, 2394x2983)
1.04 MB
1.04 MB JPG
>>
>>107342772
>in 2026, Suno will make several changes to the platform, including launching new, more advanced and licensed models. When the new models launch in 2026, the current models will be deprecated. Moving forward, downloading audio will require a paid account. Suno will introduce download restrictions in certain scenarios: specifically, in the future, songs made on the free tier will not be downloadable and will instead be playable and shareable. Paid tier users will have limited monthly download caps with the ability to pay for more downloads.

https://www.wmg.com/news/warner-music-group-and-suno-forge-groundbreaking-partnership
>>
File: z-i-l.jpg (187 KB, 1024x1024)
187 KB
187 KB JPG
>>
>>107342763
>>107342773
zimage bit more prudish
>>
File: ComfyUI_temp_hepqu_00003_.png (2.55 MB, 1088x1856)
2.55 MB
2.55 MB PNG
>>
File: 54454545844.jpg (760 KB, 2138x2554)
760 KB
760 KB JPG
>>
>>107342793
local competitive music models can't be released fast enough
>>
File: 45448415544.jpg (819 KB, 2138x2629)
819 KB
819 KB JPG
>>
>>107342801
stop generating trannys pls
>>
>>107342791
fp32 is only useful if someone decides to finetune the model
>>
>>107342797
Yep, for that first image, keep in mind other seeds in that case for Chroma follows prompt exactly (only showing her legs and panties). Z-Image much is locked in on showing her face.
>>
File: 1760410874008653.png (1.2 MB, 1200x675)
1.2 MB
1.2 MB PNG
>>107342726
>AI will be use to fake crime
They already tried that
>>
File: ComfyUI_07682_.png (1.29 MB, 944x1280)
1.29 MB
1.29 MB PNG
>>
File: elonswiftc.png (2.55 MB, 1824x1248)
2.55 MB
2.55 MB PNG
https://files.catbox.moe/yver26.png
>>
File: ZImg_00084_.png (2.65 MB, 1440x1440)
2.65 MB
2.65 MB PNG
exactly what I asked for
>>
File: z-i-t.jpg (113 KB, 1024x1024)
113 KB
113 KB JPG
>>107342793
kek, wtf are they doing removing the downloads?!
>>
>>107342793
Hey y’all, looking for some tips here. I like what I’ve made so far with Suno but now I’m kind of hitting a wall with ideas for prompts. Why doesn’t Suno also have a feature to write prompts for you? Like just hit a button the says “new prompt” and then hit make song when it comes up with something that sounds interesting! Thoughts?
>>
How does FLUX 2 handle making this realistic?
>>
File: ComfyUI_temp_hepqu_00014_.png (2.54 MB, 1088x1856)
2.54 MB
2.54 MB PNG
>>
weve never been more back
>>
>>107342854
or like
style swapping to this
>>
File: z-i-t.jpg (92 KB, 1024x1024)
92 KB
92 KB JPG
>>107342848
head over to /lmg/ and use one of the simulated text generation waifus/husbandos to write your prompt?
>>
>>107342857
promptwprtj
>>
>>107342854
sure, let me spend ten minutes generating that for you

syke
>>
File: ComfyUI_05125_.png (1.44 MB, 1360x768)
1.44 MB
1.44 MB PNG
>>107342881
well here's a qwen
>>
File: ZImg_00095_.png (2.39 MB, 1440x1440)
2.39 MB
2.39 MB PNG
looks more like todd howard
>>
>>107342793
THIS is why all local audio gen is dogshit. Music copyright holders do not fuck around
>>
>>107342899
It'd take a special kind of "person" to act like Music copyright holders do.
>>
>>107342899
All Suno and Udio needs to do is release their models to the public and copyright holders literally can't do anything. It would be like trying to ban piracy all over again. But honestly I don't get why no one tried to train a local model themselves yet. China could do it easily since they don't give a fuck about copyright.
>>
File: 1741496595654489.mp4 (1.64 MB, 720x1072)
1.64 MB
1.64 MB MP4
>>107342765
>>
>>107342923
and China will deliver. I believe in Chairman Xi and the Chinese Communist Party. they will bring us another top model.
>>
>>107342923
>All Suno and Udio needs to do is release their models to the public and copyright holders literally can't do anything.
The cartel would have fucked their lives in court.
>>
File: z-i-t.jpg (155 KB, 1024x1024)
155 KB
155 KB JPG
>>
>>107342953
They'll be the sacrificial lambs so we could all enjoy infinite free music
>>
File: 1753814365835408.jpg (579 KB, 2048x2048)
579 KB
579 KB JPG
It really wanted to give her a black nose
>>
>>107342377
https://files.catbox.moe/9q479m.png
>>
>>107342824
fuck does that meam?
>>
>>107342994
The penultimate evidence they tried to incarcerate Kyle Rittenhouse with was AI "enhanced", as in they made it up.
>>
File: ComfyUI_00122_.png (1.12 MB, 1024x1024)
1.12 MB
1.12 MB PNG
>>107342746
you do good work anon, ty
>>107342743
rgr thx
>>
>>107342884
holy shit.. i want to play this
>>
>>107342783
I like the exaggerated sense of perspective on Chroma better. I wonder if there's a way to prompt for that Z-Image. Maybe specifying a focal length? I think that affects that.
>>
>>107343014
it's interesting how neither followed the prompt (which would require the viewpoint to be from above her eye level), but both produced relatively strong foreshortening.
>>
>>107342978
thanks anon
>>
>>107342944
i doubt alibaba researchers thought about xi or communism ever
>>
>>107343004
they didn't need ai shit to incarcerate that infected little pube
>>
>>107343024
>which would require the viewpoint to be from above her eye level
Try "from above" and "high-angle view." That's the danbooru tag and the e621 tag respectively. Also your prompt seems to suggest that her shoe is close to the camera which is why that happened, maybe.
>>
>>107343038
you seem upset
did he shoot one of your pedophile friends or something?
>>
>>107343038
he was never incarcerated though?
nvm, incoming melty
>>
File: ZImg_00148_.png (2.38 MB, 1440x1440)
2.38 MB
2.38 MB PNG
>>
File: x.png (2.07 MB, 1024x1024)
2.07 MB
2.07 MB PNG
>>107342854
ages later (also I'm not sure if it was a good prompt)
>>
So is there an edit and undistilled model coming for Z or is ded?
>>
>>107343105
I don't see the realism outside of the hands
>>
>>107343108
>undistilled model coming for Z
yes, they're continuing the training apparently and will release it later
>>
>>107343118
The lighting is more realistic.
>>
>>107343088
>I turned myself into a pickle, Morty! Boom! Big reveal: I'm a pickle. What do you think about that?
>I'm pickle chiiiick!
>>
>>107343130
she's not a pickle
>>
File: z-i-t.jpg (138 KB, 1024x1024)
138 KB
138 KB JPG
>>107343118
i only prompted it once to ask it to change the provided image to a photorealistic style. i haven't gained good familiarity with flux.2 so this could simply have been a stupid way of doing it.

but also it reminded me how long this takes so I'm probably not even going to do more seeds with the same prompt for now, never mind other settings
>>
File: ComfyUI_05132_.png (1.3 MB, 1384x752)
1.3 MB
1.3 MB PNG
>>107343012
there's probably a reshade filter for sd 1.0 by now
>>107343105
wow it looks worse than sd 1.0 img2img
>>
>>107342923
Or a rogue employee needs to leak it. Unfortunately it is probably known who has access to the model and when.
>>
File: zimg_0026.png (1.13 MB, 832x1216)
1.13 MB
1.13 MB PNG
after almost a year of wrestling with models, training loras, debugging workflows, i'm actually kinda sad that the future is kinda pointless. it's great we'll have local models that can do this shit on consumer machines but there will very quickly be no need for any custom tooling. rip my hobby i guess
>>
File: z-i-t-ass.jpg (104 KB, 1024x1024)
104 KB
104 KB JPG
>>107343153
>wow it looks worse than sd 1.0 img2img
yes. i leave it to someone else to retry if they feel like it
>>
File: 1756530912365837.png (1.15 MB, 1152x896)
1.15 MB
1.15 MB PNG
>>107343188
She's pretty
>>
File: ComfyUI_00013_.png (1.03 MB, 1152x896)
1.03 MB
1.03 MB PNG
>>
>months of troonjeets botting the thread
>new chinese model releases
>suddenly one thread an hour

Makes me hmm.
>>
>>107342765
catbox plssssssssssssss
>>
File: file.png (1.92 MB, 1152x1152)
1.92 MB
1.92 MB PNG
>go to sleep
>wake up
>went through 6 threads
>wait is it because of flux2?
>check inside
>new 6B lumina2 model completely mogs flux2
UHMM ALIBABBER BROS???
WE WONNERED?!?!
>>
Just getting back into these threads and heard about Chroma. So if one were training a booru tune, would Chroma be the best base to use?
>>
LOL
>>
>>
File: 1764223990.png (1.03 MB, 1152x896)
1.03 MB
1.03 MB PNG
>>
File: file.png (1.9 MB, 1152x1152)
1.9 MB
1.9 MB PNG
>>107343314
>>
https://files.catbox.moe/7pz5hi.png

Z Image does the three ethnicities double blowjob prompt basically as well as Hunyuan Image 2.1
dicks slightly weird but pretty good
>>
File: ComfyUI_0001.png (1.36 MB, 1152x896)
1.36 MB
1.36 MB PNG
>>
File: ComfyUI_00962_.png (1.62 MB, 992x1456)
1.62 MB
1.62 MB PNG
>>
File: z-turbo_00116_.png (1.69 MB, 1024x1536)
1.69 MB
1.69 MB PNG
>>
File: file.png (1.74 MB, 1152x1152)
1.74 MB
1.74 MB PNG
>>
File: file.png (1.89 MB, 1152x1152)
1.89 MB
1.89 MB PNG
>>107343410
damn even better shitposts than qwen image
>>
File: ComfyUI_05144_.png (1.07 MB, 1384x752)
1.07 MB
1.07 MB PNG
>>107343153
here's an accurate one
>>
File: ComfyUI_09000_.png (1.78 MB, 1152x1152)
1.78 MB
1.78 MB PNG
>>107343014
Adding wide-angle perspective to the prompt on Z seems to produce a similar effect.

>>107343010
Thanks, trying no to be too biased as both models have their strengths and weaknesses. Z knows more photo styles and does them better than Chroma (T5 limitation), so what it does well it's pretty neat.

>>107342793
Those guys are just sealing their grave for once true open source competition arrives.
>>
>>107343410
>>107343423
I've been away from these threads too long.
Who's cumfart?
Did they replace the poop dick schizo?
>>
>>107343509
>they
xersilf/y'allself
>>
File: ComfyUI_temp_bqard_00032_.jpg (488 KB, 2400x1352)
488 KB
488 KB JPG
>>
File: ComfyUI_05148_.png (891 KB, 1384x752)
891 KB
891 KB PNG
>>107343461
>>
File: temp8.png (2.35 MB, 1248x1824)
2.35 MB
2.35 MB PNG
The edit version will be a most welcome addition
>>
File: ComfyUI_temp_00077.jpg (301 KB, 1344x768)
301 KB
301 KB JPG
>>
>>107343566
KEKW
>>
File: file.png (2.18 MB, 1152x1152)
2.18 MB
2.18 MB PNG
>>107343548
you, QIE is dead in the water
>>
File: z_mod_00008_.jpg (731 KB, 1264x1656)
731 KB
731 KB JPG
>>
File: 922008382573184.png (1.03 MB, 1152x896)
1.03 MB
1.03 MB PNG
>>
File: z_mod_00009_.jpg (566 KB, 1264x1656)
566 KB
566 KB JPG
>>
>>107343591
>QIE
?
>>
File: file.png (2.05 MB, 1152x1152)
2.05 MB
2.05 MB PNG
>>107343606
qwen image edit
>>
I wonder what the schizons at sdg are upto. do they know about Z?
>>
File: ComfyUI_09004_.png (1.43 MB, 1152x1152)
1.43 MB
1.43 MB PNG
>>
>>107342622
Underrated
>>
File: stonetoss.png (1.38 MB, 832x1216)
1.38 MB
1.38 MB PNG
>>107343610
Ah yes one would hope
>>
File: 1057444075842345.png (1.09 MB, 1152x896)
1.09 MB
1.09 MB PNG
>>
File: ComfyUI_09007_.png (1.78 MB, 1152x1152)
1.78 MB
1.78 MB PNG
What are the implications of a base model knowing these girls? Kek China is based but holy shit!
>>
File: Noob Z.png (111 KB, 821x375)
111 KB
111 KB PNG
[BREAKING NEWS]
WHAT THE FUCK???
>>
File: ComfyUI_09011_.png (1.68 MB, 1152x1152)
1.68 MB
1.68 MB PNG
>>
File: 662267018436111.png (1.16 MB, 768x1344)
1.16 MB
1.16 MB PNG
>>
File: ComfyUI_09010_.png (1.38 MB, 1152x1152)
1.38 MB
1.38 MB PNG
>>107343661
Ah, they're actually doing what any open-source AI team that wants to be #1 should've been doing in the first place.
>>
File: ComfyUI_00041_.png (3.56 MB, 1360x2048)
3.56 MB
3.56 MB PNG
>>
>>107343661
holy fucking shit
we wonned SO FUCKIGN HARD
artist tags + animeme knowledge, we FUCKING WONNED
SDXL BYE BYE
NETAYUME, BYEEE
WELCOME ZZZZZZZZZ
>>
File: z_mod_00015_.jpg (575 KB, 1264x1656)
575 KB
575 KB JPG
It doesn't handle lighting too good. Definitely needs a photo lora/tune

>>107343661
Awesome. Finally some good news
>>
>>107343661
No fucking way. Ok, if they do this, then I will admit that the west has fallen.
>>
>>107343661
naniiiiii
>>
File: 765809551560502.png (1.15 MB, 1024x1024)
1.15 MB
1.15 MB PNG
>>107343661
based
>>
File: 1744542396396765.png (2.85 MB, 1024x1536)
2.85 MB
2.85 MB PNG
>>107343661
Nice. I can't wait to make thick migus.
>>
File: ComfyUI_temp_00103_.jpg (171 KB, 1280x720)
171 KB
171 KB JPG
>>
>>107343661
Isn't porn illegal over there? There's no way it's gonna be uncensored.
>>
>>107343661
Alright yeah it's probably still going to be censored just a bit. The current Z still doesn't do photoreal sex well. But at least it'll be way easier and faster to train an uncensored booru tune with that as a base.
>>
>>107343661
I can't believe I'm going to have to leave Illustrious behind. Holy shit.
>>
>>107343703
I mean so was Illustrious but that still happened so maybe dream can come true
>>
File: ComfyUI_09015_.png (928 KB, 1152x1152)
928 KB
928 KB PNG
>>107343670
Ah, knows every K-pop idol under the sun, but falls apart with even the most famous JAV pornstars. Sad.
>>
File: base.png (1.03 MB, 832x1216)
1.03 MB
1.03 MB PNG
>>107343661
>>
File: ComfyUI_07512_.png (1.41 MB, 944x1280)
1.41 MB
1.41 MB PNG
>>107343661
Z Image Turbo are GODS
I kneel
>>
>>107343682
This is China's declaration of all out war on the West. Backup your models. Download them now before they're gone. Because they will be gone.
>>
>>107343661
they better point them to the non-reencoded one, not the webp dogshit they used
>>
z-image is quite biased toward Asian women. I find it difficult to generate anything other than Chinese or East Asian, unless they are already mainstream or popular enough.
>>
File: ComfyUI_07439_.png (2.85 MB, 2048x1280)
2.85 MB
2.85 MB PNG
>>107343741
>>
File: z_mod_00024_.jpg (502 KB, 1264x1656)
502 KB
502 KB JPG
>>
>>107343661
>noob training set
literally only scat and diaper furry shit
>>
I am retarded do text encoders load onto the VRAM as well as the main model?
>>
File: z_mod_00027_.jpg (741 KB, 1264x1656)
741 KB
741 KB JPG
>>
>>107343349
>dicks slightly weird
anon I
>>
>>107343773
they are tensor models too, yes
>>
File: ComfyUI_09019_.png (1.8 MB, 1152x1152)
1.8 MB
1.8 MB PNG
>>107343741
First try. Doesn't seem to be too biased.
>>
>>107343796
Anon that's a pretty rough ai-face, once you see it, you cant unsee it
>>
File: G6h756PaQAA9n0C.jpg (492 KB, 1205x2160)
492 KB
492 KB JPG
what is the point of using local shit nowadays when SAAS models are lightyears ahead of them?
>>
>>107343703
everything is legal as long as you grease the right hands and you don't walk over the local government/mafia business (and right now politically chinese authorities close their eyes for most ai stuff)
>>
File: zimage_00002_.jpg (947 KB, 2048x2048)
947 KB
947 KB JPG
In forge I use multidiffusioned tiles to upscale high resolutions.
What's the equivalent in comfy?
>>
>>107343661
the thing I like with chinese researchers is that they often are anime friendly
bfl team are too serious for that cartoon stuff, they would never do that, they're focused on true safety
>>
File: ComfyUI_09021_.png (2.26 MB, 1152x1152)
2.26 MB
2.26 MB PNG
>>107343802
True, here's another one
>>
File: ComfyUI_09024_.png (1.83 MB, 1152x1152)
1.83 MB
1.83 MB PNG
>>107343827
Basically when you want to gen any ethnicity of woman, just include in the prompt, and it won't struggle.
>>
File: 1738412650432333.png (1.5 MB, 1024x1024)
1.5 MB
1.5 MB PNG
An indian man on the cover of a James Bond movie poster. The indian man is wearing a suit and is pointing a black pistol at the camera. The text at the top says "You only poo twice". At the bottom include the James Bond 007 logo. Include film credits in the style of a movie poster. The background is the city of Mumbai, filled with trash and cartoonish poop.
>>
File: 1753652985160426.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>107343849
>>
File: 1736183028165817.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
how did this bypass China's model training?
>>
File: 1747365621191287.png (1.38 MB, 1024x1024)
1.38 MB
1.38 MB PNG
>>107343865
>>
>>107343865
china is based now, the west has fallen
>>
>>107343837
Cool, thanks. I guess my prompt just sucked.
>>
File: 1751010053371475.png (1.25 MB, 1024x1024)
1.25 MB
1.25 MB PNG
Xi Jinping is on his hands and knees, bowing in front of Winnie the Pooh who is sitting on a gold throne in a Chinese palace with a crown, and holding a pot of honey with the label "HUNNY" on it.

kek, amazing how it works so well for a fast model.
>>
File: 884270731020717.png (1 MB, 1024x1024)
1 MB
1 MB PNG
What's the best way to run a list of prompts in comfyui? Preferably in one text box with each prompts on a new line, like this:

prompt1

prompt2

prompt3
>>
>>107343661
They better focus on the full model and not the distill.
>>
>>107343903
Keep it on continuous sentence otherwise you'll get a absolute ton of artifacts.
>>
>>107343849
now do one in valhalla
>>
File: SKbundle.jpg (67 KB, 699x1073)
67 KB
67 KB JPG
>>107343903
Try SKbundle
>>
>>107343661
HOLY FUCKING MOLY I KNEEL (while praying they don't fuck it up somehow)
>>
>>107343865
anon, it's not the ccp that is creating the model, it's researchers in a company
>>
File: 1757854285029904.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
>>107343929
An indian man with big brown eyes on the cover of a James Bond movie poster. The indian man is wearing an FBI uniform and is pointing a black pistol at the camera. The text at the top says "i'll see you in Valhalla". At the bottom include the James Bond 007 logo. Include film credits in the style of a movie poster. The background is Valhalla from Norse mythology.
>>
>>107343957
this actually looks nice
>>
>>107343661
We get all these nice things while Sam Altman is busy sucking cocks and charging $1000 for 3 prompts.
>>
File: 07310207.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>
File: ComfyUI_temp_00180_.jpg (183 KB, 1344x768)
183 KB
183 KB JPG
>>
File: 1749411124112698.png (1.01 MB, 1024x1024)
1.01 MB
1.01 MB PNG
An indian man is typing at a computer in an indian call center, on the screen is the social media site X, and a message on screen says "you are Indian, not European".
>>
seedream... api nodes.... sora 2....
why did you forget about them???
>>
File: file.png (2.24 MB, 1024x1536)
2.24 MB
2.24 MB PNG
>>107343957
>>
chroma genning monstrosities more than often, only fags say otherwise
>>
>>107343984
nice

it's kinda wild how fast this model is, and it still makes non plastic people unlike flux.
>>
File: 1752477526936339.png (1.39 MB, 1024x1024)
1.39 MB
1.39 MB PNG
Hatsune Miku standing beside a crane game in an arcade in Akihabara, Japan. Inside the crane game are several Miku plush dolls.
>>
File: 1737229689595864.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>107344011
>>
File: zimage_00033_.png (1.99 MB, 1920x1080)
1.99 MB
1.99 MB PNG
Wanted to make the girl kick the viewer, but it's hellbent on doing foot fetish shit.
>>
File: ComfyUI_00092_.png (1.35 MB, 1088x1392)
1.35 MB
1.35 MB PNG
daxz's workflow is nice.. such a fun model
basic picture I know
>>
File: file.png (2.49 MB, 1024x1536)
2.49 MB
2.49 MB PNG
>>107343991
yeah im having a lot of fun, excited for the edit and anime models
>ayanami rei and souryuu asuka langley wearing their own signature plugsuits. They are embracing each other and looking at the viewer. they're joining their hands with each other to make a heart
this model is fucking incredible
>>
>>107344022
link?
>>
File: 1759903255882270.png (1.89 MB, 1024x1024)
1.89 MB
1.89 MB PNG
one more

A wall filled with dozens of Hatsune Miku plush dolls, in an arcade in Akihabara, Japan.

cute!
>>
>>107344028
no
>>
>>107344022
>needing a wf for this
how many neurons? post hands
>>
>>107343188
>but there will very quickly be no need for any custom tooling. rip my hobby i guess
I highly doubt this.
>>
>>107344022
nigga, who?
>>
>>107344028
>>107344061

https://huggingface.co/datasets/DaxRedding/DaxWorkflows/tree/main
>>
If training works well on Z-Image then it's going to totally own image generation, 6b means it won't just generate fast but also train fast.

What a time to be alive!
>>
>>107344067
Thanks anon.

>>107344035
eat shit faggot
>>
File: ComfyUI_00046_.png (1.89 MB, 1088x1392)
1.89 MB
1.89 MB PNG
>>107344038
gimme some prompts then
>>
>>107344022
unconvincing image but probably a prompt issue more than anything
>>
>>107343957
I don't like how it did the "background is a zoomed in, blurred, and darkened version of the image" thing
>>
>>107344085
Yeah, I'm new to this shit I don't know much about prompts
Only genned profile pics with inference in stability matrix.. Trying to move away from sm because I noticed it eating my ram
>>
>>107344067
unironically some of the worst workflows I've ever seen
>>
File: 1738241319114899.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
A PC game box of the game World of Warcraft, on a table in a computer store, in front of a computer with a white CRT monitor.
>>
>>107344107
post yours
>>
File: zimage_00026_.png (1.91 MB, 1024x1536)
1.91 MB
1.91 MB PNG
>>107344106
>new to this
>downloads bloatmaxxed wfs
take this, has some custom nodes but it's pretty much as simple as it gets. disable sage attention tho
>>
File: ComfyUI_00055_.png (1.67 MB, 984x1296)
1.67 MB
1.67 MB PNG
Nice.
>>
>>107343661
16channel vae at least?
>>
>>107343865
Sometimes I wonder if people think Chinese people see a picture of Xi as or with Whinny the pooh they'll turn to ash like a vampire exposed to a crucifix.
>>
>>107344106
>stability matrix
Have you tried SwarmUI? That's probably a good intermediate if you decide to go full Comfy.
>I'm new to this shit
Welcome, fren.
>>
>>107344106
>>107344124
oops forgot catbox
https://litter.catbox.moe/39si3vgqc9paw5us.png
>>
File: 9408064.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
>>
File: Z6d5.gif (3.71 MB, 430x242)
3.71 MB
3.71 MB GIF
>>107343661
LOCAL IS EATING TONIGHT
>>
>>107344124
are these base res or upscaled gens?
>>
>>107344113
Why does every model have the exact some shitty generic artsyle for blizzard art? It always looks like a cheap mobile game compared to anything blizzard actually makes.
>>
>>107344130
i'm sure 99% of people are normal but it's funny how the government itself acts. like most governments which are dumb, I guess.
>>
Do any of you anons use z-image FP8, and how’s the quality?
>>
>>107344139
base, I just posted the WF. new models arents SDXL tier requiring 200 detailers + hiresfix shit, you just one shot it
>>
>>107344113
this model has bad sense of perspective honestly
>>
Fresh

>>107344153
>>107344153
>>107344153
>>107344153
>>
>>107344152
pretty dope
i just dont like the +16 and +24-GBs-of-vram-minimum-requirements part that usually every model has nowadays
>>
>>107344133
I saw that name flying about a couple hours ago I've never tried it but open to
>>107344134
Thanks, why shouldn't I use sage? I have it
>>
>>107344174
sorry you mentioned you were new to this so I thought you didnt have sage attn installed, then keep it, anyway enjoy



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.