[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: 1761253481986025.jpg (2.65 MB, 3835x3471)
2.65 MB
2.65 MB JPG
Cautious GenFlare Anticipation Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107548966

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
1~h before the meltie starts.
>>
File: 1750304252349694.png (1.79 MB, 1536x1024)
1.79 MB
1.79 MB PNG
>>
File: 1749804876345632.png (209 KB, 2244x1009)
209 KB
209 KB PNG
https://github.com/huggingface/diffusers/pull/12839
Qwen Edit 2511 is comming
>t. don't care that much since I know Z-image edit will be better and unslopped
>>
>>107557330
what a faggy pr, I think ZIE is going to be after christmas, so this should tide us over for a bit
>>
File: 1752484342058259.png (2.14 MB, 3202x1422)
2.14 MB
2.14 MB PNG
>>107557330
>>107557344
Z-image base will be able to do edit and I bet even this unspecialized model will be better than QiE kek
>>
File: 00298-1393174673.png (1022 KB, 896x1152)
1022 KB
1022 KB PNG
>>
File: zimage dalle.png (2.29 MB, 1536x1024)
2.29 MB
2.29 MB PNG
>>
>>107557406
i actually vomited
fuck you
>>
>>107557360
>I bet even this unspecialized model will be better than QiE
Won't even be close. We're getting the pretrained Omni base before it goes through SFT and what makes it look photorealistic - the rlhf and grpo steps.
Refer to pages 21-24 of the paper for the full process
>>
>>107557413
>We're getting the pretrained Omni base before it goes through SFT
i surely hope so. it's been a while since we had a true non-distilled base model release that can be properly trained and used as a blank canvas. midwits that want something that produces photoreal 1girls out of the box don't understand
>>
>>107557413
it's a good news desu, a completly untamed Base model ready to be trained the way we want, now the question is, can we reach Turbo's level with community's finetuning?
>>
File: 1763277967267835.png (75 KB, 286x301)
75 KB
75 KB PNG
>>107557115
>>107557432
https://xcancel.com/bdsqlsz/status/1984268208594104544
holy
>>
>>107557413
wait, so they distilled z-image first and then applied RLHF? how did they do that? I thought finetuning a distilled model was almost impossible
>>
>>107557495
Agree. I do like a good polished model (was hoping they'd release something like turbo without distillation alongside the foundational omni) but on a larger scope to truly move away from SDXL for good, the foundation model is required.
>>
chroma z when?
>>
>>107557413
I wonder how great Z-image turbo would've been if it wasn't distilled before doing the RLHF stuff
>>
>>107557413
will whatever they're gonna be releasing be of any use to us normies or is it only for the vramchads who're gonna use to train their own checkpoitns?
>>
How do we fight the splitbaking menace?
>>
>>107557614
Normal seed variation, less sameface and pose, better uncommon concept understanding, though realism would suffer and it would take more steps. If they wanted to train it for higher quality output at fewer steps they could but:
>"It [turbo] achieves 8-step inference that is not only indistinguishable from the 100-step teacher but frequently surpasses it in perceived quality and aesthetic appeal."
>>
>>107557643
baking right now
>>
rebaking
>>
File: file.png (88 KB, 424x863)
88 KB
88 KB PNG
what's the deal with this?
i'm relatively new to comfyui, does it unload the model after every batch or something?
is there a way to keep it in memory if so?
>>
rabaking

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon

is off topic.
>>
>>107557302
you were right on the money
>>107557781
share your workflow
>>
>>107557730
>It [turbo] achieves 8-step inference that is not only indistinguishable from the 100-step teacher
I don't believe that, there was this new distillation paper showing that D-DMD is not the optimal distillation method, let alone something that would be equal to something not distilled
>>
https://rentry.org/ranfaggot
>>
>>107557815
Thanks for exposing yourself in the collage.
>>
>>107557820
see >>107557818
>>
>>107557820
ani is right. stop using troll bakes. just let ran shit his diapers i stead
>>
>>107557826
>>107557835
>the repost spam has started
another day of ani choosing to spend his valuable time on earth having a melty instead of turning off the computer. blessed
>>
>>107557781
you're probably caching your prompt.
>>
File: file.png (1.34 MB, 2241x1527)
1.34 MB
1.34 MB PNG
>>107557790
i made a customer DeepSeek API-connected conditioner node that fleshes out prompts and translates them to chinese automatically, but the issue was happening prior to that.
the Load LoRA (DistillPatch) is this https://modelscope.cn/models/DiffSynth-Studio/Z-Image-Turbo-DistillPatch -- just a normal Load LoRA node renamed.
i'm using an RTX 3060 12GB .
>>
>>107557850
>>107557824
>>107557831
>>107557820
>>107557815
>>107557790
>>107557788

go here this thread is radioactive
>>107557807
>>107557807
>>107557807
>>
>>107557850
plz read before posting in dramabakes
https://rentry.org/ranfaggot
>>
>>107557859
based
>>
>>107557850
My guess is that you don't have enough VRAM to hold both DeepSeek, Qwen and ZiT, so when it takes less time, it's because you're re-using the same prompt and don't need to load DeepSeek (which would dump ZiT from VRAM).
>>
Imagine working for a year to defeat Comfy with your own UI and you only get 33 stars
>>
File: file.png (142 KB, 1292x747)
142 KB
142 KB PNG
>>107557859
>>107557854
i'm gonna be honest, i don't give a fuck.
it's 4chan. there's no "official" threads.
the other thread is claiming this one is fake too.
who gives a fuck one way or another, really?
just post useful shit. what's the point in discouraging people to participate?
what are you trying to control?
are you upset that you're not getting the dopamine hit from your thread of choice or the one you started doing well?
people are so fucking weird.

you can't put "started many successful 4chan /ldg/ threads" on a job resume.
>>
>>107557891
that makes a lot of sense actually.
time to research a way to fix this shit without buying a new GPU.
>>
>>107557896
This is the local schizo. /ldg/ is, unfortunately, spammed to oblivion by him. Best to just ignore him. Use whichever thread you prefer.
https://rentry.org/animanon
>>
>>107557293
>>107557293
>>107557293

see >>107557859 and >>107549797

stop using dramafaggot bakes or this will continue
>>
>>107557922
what will continue?
>>
>>107557902
>chroma was a bust but what was the best version before it got bad? Is it v48?
New versions are working just fine. Mostly sidegrades to each other.
>>
>>107557894
when Comfy first started, i hated it
the creator and his simps were fucking annoying
constantly berating the pioneers of webui that came before them
advertising relentlessly on every AI thread
just being annoying shitters

i still don't like it compared to A1111 because it's still shit to use, but i conceded that it's where all the new shit is
Comfy is like the Windows of user interfaces.
>>
>>107557896
>i'm gonna be honest, i don't give a fuck.
then you allow it to continue by supporting a welfare nigger

>>107557929
you feeding the troll
>>
>>107557938
>Julien
>>
>>107557936
>then you allow it to continue by supporting a welfare nigger
i got my question answered already in this thread.
it's what i was here for.
so... yea, i don't give a fuck about your welfare nigger.
>>
>>107557413
are you sure it's pre SFT? it would be good for finetuning, but the community doesn't known how to finetune it correctly. overloading the model with booru 1girl, close up, esoteric background causes the model to be addicted to close ups and lose the ability of generating stable backgrounds
>>
>>107557933
>Comfy is like the Windows of user interfaces.
It really isn't. Comfy being the default UI for image gen is a disaster. It's as if Arch without install script somehow became the default OS for home use.
>>
Ok how do we move forward? What are our options to salvage the general from tRANny?
Option 1: Placate him, add rentries to OP, don't splitbake. Could work for a shortwhile but unfortunately ran's dramafaggotry is well documented with his long, cursed history in this general so he will find something else to have a schizo meltdown about and the peace will be very short-lived.
Option 2: Status quo, keep splitbaking whenever ran shits up the OP. Self-explanatory, inefficient and annoying, not really a solution but at least we would be standing up to him.
Option 3: Some fucking how convince the jannies (who notoriously give zero fucks about AI threads) to range ban ran. I don't think he is tech savvy at all so it should shut him up. Not gonna happen but one can dream I suppose.
Do we just like keep calling him a faggot here or is there anything we can actually do?
>>
>>107557954
Nobody is sure of anything, it's speculation and assumptions based on their naming - "Omni", and the position of "Omni" in the graphs from the paper that place it before the finetuning steps, and the wording from the blog post.
https://tongyi-mai.github.io/Z-Image-blog/
>A foundation model designed for easy fine-tuning
What concerns me is that all that amazing autistic prompt following capability comes from both SFT and, according to the paper, the GRPO, DPO and RLHF. If the community cannot do a basic SFT, I doubt many know how to implement the laterl RL stages.
>>
>>107557520
lmao chinks waste no time making videos of their women getting railed by blacks
>>
>>107557997
Why should I go there? I'm a tourist from lmg. Last time I was here was over a year ago
>>
>>107558009
You're talking to the local spammer. You can get the essentials about him in the OP rentry https://rentry.org/animanon
>>
fucking autist local community!!!!!!!!!!!!!!!!!
>>
I swear to G*d I'll apply as a janny because of you faggots. Nobody gives a shit about your autistic drama.
>>
>>107558022
Aaaa the friendly thread is no longer friendly. I just wanna bullshit with my imagen comrades
>>
>>107558021
at first i ignored it, but now i just report it for instigating a flame war. he is being too disruptive.
>>
>>107558057
>Aaaa the friendly thread is no longer friendly
Many such cases with 4chan generals, unfortunately.
>>
>>107558057
rentries in the OP is marked for trollbake. you were duped. stop posting in it because we agreed to splitbake every single one of tranfag's spitebakes from now on
>>
>>107558057
https://rentry.org/ranfaggot
you are being lied to. this is the problem
>>
>>107558086
If you're not Ani, chill out. Only Ani would post this kind of unhinged crap to himself to garner sympathy. There's no reason to be this upset. You're helping him if anything.
>>
>>107558102
option 2 is out only option left because of you ranfaggot. this is what you wanted. everyone pissed off at you
>>
>>107558075
Umm I don't know what to believe so I'll just stay here for a bit before I go. That other thread has a funny smell to it
>>
File: ZIT_00002_.png (3.37 MB, 1392x2400)
3.37 MB
3.37 MB PNG
Can we get these trannies baking threads into a gaschamber? It's so boring.
>>
File: 1743370256543673.jpg (839 KB, 1920x1088)
839 KB
839 KB JPG
why don't the j*nnies do anything about the obvious spammer?
>>
>>107558042
wtf why did that anon just copy my post verbatim, i am so confused
>>
>>
>>107558140
It's the spammer. He's doing it to make his thread look active. Check out the rentry if you want more context.
>>
>>107558138
>>107558145
please post in the non drama bake. we did this shit yesterday and giving tRan the keys to the thread means this will never stop
>>
>>107558155
kys
>>
"Natural curves emerging, but still youthful and unrefined. "

What the fuck, qwenvl?
>>
>>107558170
simulating a thread without dramanigging and the same nig concern trolling
>>
File: file.png (19 KB, 1031x534)
19 KB
19 KB PNG
>>107558166
Anything that isn't a perfectly straight line?
>>
>>107558166
something that should be discussed outside the drama thread discussion

>>107558198
go to the other thread
>>
File: 1735329780594177.mp4 (979 KB, 480x832)
979 KB
979 KB MP4
>>107558209
>>
>>107557807
>>107557807
>>107557807

ere if you don't want to be caught in the concern troll cycle this gay thread has going on. drama is not on topic for the linked thread like it is here
>>
>>107558224
die die die die die die die
>>
File: ZIT_00018_.png (3.14 MB, 1392x2400)
3.14 MB
3.14 MB PNG
>>
>>107558231
>placating the diaries of a mad black tranny
think. use your fucking brain
>>
File: eh.jpg (130 KB, 1024x1024)
130 KB
130 KB JPG
lmao the Z-Image lora broke so hard that instead of Len's face it drew two random capybaras, the one in the back about to slit the throat of the one in front. Maybe it was the "doggystyle" tag? I did not shop that in.
>an illustration, 1girl, 1boy, hetero, sex, hatsune miku, long hair, thighhighs, blush, twintails, sex from behind, kagamine len, doggystyle, aqua hair, blonde hair, necktie
>>
File: lubimiv.jpg (452 KB, 1024x1024)
452 KB
452 KB JPG
>>
>>107558260
Holy shit, that's crazy. Lora training on ZiT looks like it can go really wrong. I'm not sure what the sauce is to make it good. I'm just waiting to see how well Base responds to lora training before I invest time into this.
>>
File: 1747853687772222.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
>>
File: 1748567716745579.jpg (478 KB, 1024x1024)
478 KB
478 KB JPG
>>
File: ComfyUI_00365_.png (1.1 MB, 1280x720)
1.1 MB
1.1 MB PNG
>>
IS THERE ANY WAY TO MAKE DEEPSEEK 3.1 THE FREE ONE CONNECT TO COMFY TO WRITE PROMPTS?!?!
>>
>>107558307
>dalle gen
>>
File: 1749865969288043.jpg (577 KB, 1024x1024)
577 KB
577 KB JPG
>>
>>107558317
maybe try this node, openrouter has free deepseek providers afaik

https://github.com/gabe-init/ComfyUI-Openrouter_node
>>
>>107558335
fcuk yess thank you!
>>
>>107558333
he can't repost those in his spam thread
>>
File: 1751582472371078.jpg (321 KB, 1024x1024)
321 KB
321 KB JPG
>>
File: 1737844132717139.jpg (431 KB, 1024x1024)
431 KB
431 KB JPG
>>
>>107558335
nvidia moonshot has unlimited ds
>>
>>107558362
whats that?
>>
>>
>>107558333
>trani problem?
what do you mean? multiple anons are in on the splitbake decision to shut niggerjak up for good
>>
>>107558379
if that were the case you wouldn't need to copy every post from the real thread to make yours not look deserted
>>
File: 1747978036484415.jpg (194 KB, 1024x1024)
194 KB
194 KB JPG
>>
>>107558335

Goddammit I though ds 3.1 was uncensored. it isnt even doing base level nudity.. And I used it for silly tavern for the most degen shit
>>
I hate schizos. They're the same as niggers. They just ruin any community
>>
File: tr.jpg (498 KB, 1536x1024)
498 KB
498 KB JPG
>>
cozy and real thread
>>
File: file.png (31 KB, 453x315)
31 KB
31 KB PNG
>>107557891
so it turns out it wasn't a vram issue because i'm using API, not local with deepseek, but deepseek's API getting errors along with not using a quantized text encoder.

i switched my encoder out for one from here: https://huggingface.co/worstplayer/Z-Image_Qwen_3_4b_text_encoder_GGUF/tree/main

and i had chatgpt work out the errors in deepseek, and i'm down to picrel.

the 49 second one was changing the prompt and deepseek's API doing its thing. the first one was first run, which (i think) is normal for it to be longer.
>>
>>107558782
Good job anon, yes the first run will take longer because it takes some time to load models in VRAM. If all the models you use in one workflow fit into your VRAM then subsequent gens should be faster.
>>
File: 1752659164126606.jpg (3.21 MB, 2048x2048)
3.21 MB
3.21 MB JPG
>>
File: ComfyUI_00010_.png (981 KB, 1280x720)
981 KB
981 KB PNG
>>
>>107559196
kek
>>
File: img_00021_.jpg (1.05 MB, 1352x1776)
1.05 MB
1.05 MB JPG
>>107559196
I wanted lemon lime! Did you i2i or just prompt?
>>
File: file.png (1.19 MB, 1152x864)
1.19 MB
1.19 MB PNG
>>
File: file.png (2.1 MB, 1152x864)
2.1 MB
2.1 MB PNG
>>
>>107558446
istg there's a genotype that causes this dissonance
even more broad than jewry
>>
>>107559234
i2i and had a VLM caption the image for the prompt
>>
File: z_00024_.png (996 KB, 1280x720)
996 KB
996 KB PNG
>>
>>107559286
canadian
>>
File: z_00031.png (1.03 MB, 1280x720)
1.03 MB
1.03 MB PNG
>>
File: ran.png (1.21 MB, 1216x832)
1.21 MB
1.21 MB PNG
Fuck the lolcow catjak
>>
>>107557293
OT has ZiT support naow? Woaw
>>
File: 1758218947320017.png (1016 KB, 1280x720)
1016 KB
1016 KB PNG
>>
>>107559296
is there good small uncensored vision models? last time I checked everything needed 24gb+
>>
>>107559466
I was able to run quant (Q6? I don't remember) of joycaption with 12gb.
>>
>>107559466
I ran 4bit joycaption with 12g vram
>>
>>107559472
Wont quants destroy vision model quality?

>>107559480
Just fuck off
>>
>>107559510
a quant is still better than a tiny non quant desu mefinks
>>
how do i know which thread isnt the pozzed one
>>
>>107559518
>a quant is still better than a tiny non quant desu mefinks
There are so few that can do nsfw
>>
>>107559544
The other thread is full of reposts to not look dead. Just check the timestamps for the images in common between the two threads. For more information about the spammer read the last rentry in this thread's OP.
>>
File: file.png (416 KB, 800x563)
416 KB
416 KB PNG
>>107559270
anybody with a quarter of a brain can see you faggots are copying posts from here to make it look active.
>>
File: 1753952337995542.png (2.03 MB, 832x1248)
2.03 MB
2.03 MB PNG
>>
>>107559574
>you faggots
It's one guy. See the rentry.
https://rentry.org/animanon
>>
>>107559544
look at the first post up there with the bimbo with huge tits.
notice the time
go to other thread, compare the time
they're copying shit from this thread to put there.
>>
>>107559544
you know the real one because some schizo is spending hours trying to get anon to move away from it
>>
>>107559561
no i think those rentries are necessary
look at what sdg turned into. because of one person.
go to sdg you fucking infiltrating freak
please kys
>>
>>107559601
>if everyone isn't on board it will just keep happening
what will keep happening?
>>
>>107559556
yep...
>>
i though ani and trani was the same person
i didnt know there was a new guy in the mix
>>
>>107559634
nice attempt at blackmailing anons once again.
>>
>>107559634
4chan is fundamentally flawed
it caters to the bottom denominator
this is why some form of gatekeeping is necessary
>>
>>107559634
>if this nigger were in high school with me we would beat the shit out of him every fucking day
And this is why your software is unsafe to run. xx
>>
>>107559601
>and if everyone isn't on board it will just keep happening
you'll keep splitbaking, samefagging and reposting? k whatever, i'll just keep bumping the real thread
>>
How does one get harassed on an anonymous forum?
>>
>>107559656
im good bro
i already dealt with this kind of ai drama in other boards
jumping ship
>>
File: ricebunny.png (1.29 MB, 832x1248)
1.29 MB
1.29 MB PNG
>>
File: RICEBUNNY.png (1.25 MB, 832x1248)
1.25 MB
1.25 MB PNG
>>
File: zimg_0072.png (1.11 MB, 896x896)
1.11 MB
1.11 MB PNG
>model knows beyonce
>model knows beyonce's body
>can't prompt for someone else with beyonce's body without getting beyonce

anyone have a method for this? i need a stacked hapa
>>
File: ZIT_00025_.jpg (809 KB, 2400x1392)
809 KB
809 KB JPG
Fighting, punching, is still so awkward.
>>
>>107559692
nag could help with that, put dark skin etc to negative
>>
>>107559692
>anyone have a method for this?
a better prompt
>>
File: file.png (1.71 MB, 1024x1024)
1.71 MB
1.71 MB PNG
>>107559692
Try ControlNet. Unfortunately the first ControlNet model feels kind of bad. Not sure about the second one, I need to try it.
>>
File: zimg_0035.png (1.76 MB, 960x1280)
1.76 MB
1.76 MB PNG
>>107559725
is that eric andre?

>>107559741
nag hasn't been any help at all
>>
File: LastRiceBunny.png (1.26 MB, 832x1248)
1.26 MB
1.26 MB PNG
>>
>>107559510
>Wont quants destroy vision model quality?
You should proofread your captions anyway.
>>
>>107559544
Look at how upset he gets because anon is posting here and not there kek
>>
File: 1751385585445676.png (1.52 MB, 2854x1472)
1.52 MB
1.52 MB PNG
>>107559692
>anyone have a method for this?
ask for a llm to rewrite things for you
>>
>>107559784
Does the LLM fit inside 24gb VRAM on top of ZiT and Qwen? Can you share that workflow?
>>
>even more upset now
KEKDD
>>
>>107559799
>Does the LLM fit inside 24gb VRAM on top of ZiT and Qwen?
you can go for Q8 or less to make the llm smaller, and you can offload with that node
>>107559799
>Can you share that workflow?
https://github.com/BigStationW/ComfyUI-Prompt-Rewriter/blob/main/workflow/workflow_Z-image_turbo.json
>>
didnt see trAni having this big of a meltdown in a while lol
>>
>>107559799
Q8 of a 4B is 4G. Q8 8B is around 9 or so. Get 8B VL or higher if you can run it. You can keep the enhancer in a separate workflow.
>>
>makes direct terroristic threats
why would anyone listen to you? kys
>>
>>107559814
Thank you dude.
>>
File: zimg_0125.png (2.17 MB, 1080x1440)
2.17 MB
2.17 MB PNG
>>107559748
would love an example if anyone has had any success doing this

>>107559784
i'll try running it through an llm, it's been useful so far but i haven't thought to try it to capture a body type. (pic rel)


>>107559752
yeah i was hoping to get this with just prompting only, but controlnet/body loras are my next step
>>
>>107559855
your 1girl has a really bad case of orange skin, nice gen though
>>
Thx 4 da phree bumps btw :]
>>
>>107559877
but you are causing the drama? curious
>>
>>107559877
stop spamming
>>
how is it real breas if u haf 2 spam links 2 it here ????
>>
>julien crying again
>>
>>107559927
So if you are banned the drama is over
>>
File: 1762342319104685.png (983 KB, 1280x720)
983 KB
983 KB PNG
>>
File: 1754424770245112.png (989 KB, 1280x720)
989 KB
989 KB PNG
last one pinky promise
>>
>we all agreed
>t. julien and his phone
>>
>>107557302
And he's been going at it for 4 hours straight holy moly
>>
File: 1754606034852746.png (892 KB, 1280x720)
892 KB
892 KB PNG
not what i wanted but i'll take it
>>
>>107560001
You mean seven hours. Guy chose to spend seven hours reposting other people's pics and spamming. Lol.
>>
File: Untitled-1sdfsdfdsf.jpg (424 KB, 2784x2400)
424 KB
424 KB JPG
Qwenvl vs joycaption, both using the same guiding prompt to describe the image. Interesting.

>>107559761
Just random black guy, it was an ai generated image from a long time ago.
>>
>>107559955
How did you prompt the "partial pov" of the guy?
>>
>>107560060
>on the side of the image, in the blurry foreground, a man in a charcoal jacket is seen from behind, blocking the frame.
>>
>>107560057
female kamen rider villain vs Mori Caliope
>>
>>107557790
>you were right on the money
pattern recognition, nothing more
>>107560001
expect at least another 6 hours. SAD!
>>107560000
lmao
>>
>>107560057
left is fucking hot
>>
>>107560072
What frame?
>>
>>107560057
>guiding prompt
Which was?
>>
>>107560083
the image retard
>>
so anyway
>>
File: ZIT_00021_.png (3.25 MB, 2400x1392)
3.25 MB
3.25 MB PNG
Good riddance.

Zit does not know nic cage and kojima.

>>107560085
https://files.catbox.moe/jx9m9m.txt
>>
File: file.png (1.69 MB, 1024x1024)
1.69 MB
1.69 MB PNG
>>107560144
lol
>>
File: 1739621762438615.png (166 KB, 1075x1266)
166 KB
166 KB PNG
>>107559784
If only those models wouldn't yap for so long though, I get better results with thinking activated but still...
>>
File: ZIT_00026_.jpg (697 KB, 1392x2400)
697 KB
697 KB JPG
>>
mods=gods
>>
I love local diffusion general
>>
>>107557409
I got news for you...
>>
File: ZIT_00033_.png (3.3 MB, 1392x2400)
3.3 MB
3.3 MB PNG
I like taking simple abstract illustrations and try to make it realistic.
>>
>>
>>107557571
Training will likely start the second Base drops, lodestone has been salivating
>>
>>107560305
if he finetunes base or edit and makes it work only with the pixel space, he has the perfect occasion to end up as a hero, I let him a pass for chroma since it's just not possible to fully save a distilled model like schnell
>>
>>107557571
The only thing chroma did was push flux into SOTA realism. That's not needed for Z so what does he have to offer other than burning cash for experiments that go nowhere? Respectfully.
>>
>>107560327
he'll add NSFW and more anime characters though
>>
qwen edit having just a slight bit more anatomical knowledge than ZiT gives me such fucking chinese blue balls its insane
these xxx and enhancer loras only do so much, its a diceroll.
>>
File: 1744806155895406.png (988 KB, 1280x720)
988 KB
988 KB PNG
>>
>>107560362
based
>>
>>107560314
every vaeless model his and the others are still shit with details
>>107560341
i suppose yeah but im more interested in artist prompting which is certainly not going to come from him
>>
File: 1748534789811035.png (2.09 MB, 1120x1440)
2.09 MB
2.09 MB PNG
>>107560327
Z still needs nsfw, more diverse/flexible styles, and booru tags.

though honestly, tuning Z is mostly a waste of time, for impatient slop bakers. by the time they finish a finetune, an even better base model will have released.
>>
>>107560314
Dunno about the pixel space since it is slower to generate for not so much potential gain IMO, and Base will already be slower than Turbo

That said it's practically a given that he will do a finetune on Z-Image Base
>>
>>107560389
>it is slower to generate
not anymore with JiT X0
https://github.com/LTH14/JiT
>>
its going to be someone new who does a worthwhile z tune and i dont say that out of spite or hatred its just how this sort of thing goes
>>
>>107560369
Why do you need to reply with 'based'?
>>
>>107560380
>more diverse/flexible styles, and booru tags.
Does chroma have either of these? Lol
>>
>>107560428
uh, it has an extra chromosome.
>>
hello, let's say you're building a chatbot webapp. I'm doing a PoC that is in good shape but using the google Gemini API to do my prompt calls. I've already maxed out the free tier with just my own personal testing....what do you recommend to get around that besides paying? I'm not too invested in the model so it could be something else like ChatGPT api. I'm just tring to keep the cost low for me and eventually a few users (like 10)
>>
>>107560440
Gm
>>
>>107560421
What do you mean?
>>
>>107560449
Gm sir or madam.
>>
>>107560349
qwen image remains the GOAT for prompt comprehension and complex images and anatomy. neither Z nor SPARK Chroma can handle complex, detailed backgrounds, crowds, architecture, and technology like Qwen. I think this probably is a result of the high params Qwen has. but unfortunately, qwen also takes things too literally sometimes, and has crap styles/realism by default.

>>107560428
styles yes. tags, kind of, but half baked.
>>
>>107560449
based
>>
>>107560498
>qwen image remains the GOAT for prompt comprehension
to be fair z image turbo doesn't have a cfg so it can't compete, we'll see if base is better on that
>>
>>107560498
>styles yes.
It doesn't have artists though because lode is a moralfag for some reason
>>
>>107560428
Diverse/flexible styles, YES without a doubt
>>
File: 1737483147542963.png (138 KB, 1537x613)
138 KB
138 KB PNG
>>107560545
>lode is a moralfag for some reason
and a liar
>>
no one uses chroma for its "styles" lol its use case is coom
>>
>>107560515
even with 1 CFG and a crappy turbo lora, qwen easily beats the others. I suspect Flux 2 might beat it, but Flux 2 is just too fucking bloated.

>>107560545
no that's because he fucked up by training at fp16 instead of 32 bit or something like that. artist names apparently are too low frequency to be picked up by his broken training regimen. you can still prompt art styles using a detailed sentence including artist name.
>>
>>107560449
I’m not indian. You’re presumptuous
>>
>>107560580
>Someone being presumptious, on my /ldg/ ?
More likely than you'd think...
>>
>>107560563
It retains some undestanding of artists, but it's total crapshoot if it works or not.
>>
>>107560563
Did he really lie here?
Ponyfag said in its V7 autopsy that t5 struggles to separate style and content (Even though he moralfaged artist tags away his dataset still had synthetic style cluster tags)
Maybe one of lodestone's schizo training methods amplified the problem to the point that it can't learn styles at all.
>>107560624
>but it's total crapshoot if it works or not.
Like everything else chroma then.
Pure seed lottery. Unstable, unreliable, schizo model.
>>
>>107558415
kek
>>107558455
what model is that?
>>
>>107558782
you're paying for deepzeek?
>>
>>107560498
it is fucking crazy how a lot of my gens, when i ask it to nudify a character, it also goes ahead and autocompletes the rest of the image if the input is 4:3 and the output is a 16:9 image. Really neat stuff. I hope z-edit could come out just as good.
>>
>>107560639
>Ponyfag said in its V7 autopsy that t5 struggles to separate style and content
flux can do van gogh style so if you give the name so technically it should be able to do it a bit, on chroma there's 0 artist style you can make, it's obvious he removed them
>>
>>107560644
How do I not pay for deepseeker?
>>
>>107560224
ranfaggot wasn't rangebanned so no

please move to the non drama bake and do not fellate tRan or this will continue
>>
this is a spitebake and this won't stop if you keep posting in them. report for instigating a flame war instead
>>
>>107560659
>>107560660
>>107560680
go to the non drama bake. do not support spitebakes
>>
uh oh hes making another stinky...
>>
>>107560704
>spitebakes
and another schizo word to add to the filter lol
>>
>>107560498
also question while we're on the subject, is there a particular reason i shouldn't be going lightning 8 steps for qwen edit? does cfg 1 cripple what it can really do?
i really do not want to wait the full 6s/it for 20 steps but.. eh if it can give me quality nips and stuff i might as well.
>>
>>107560639
Why should we trust anything astra says? He's (also) a lying faggot.
>>
>>107560732
niggerbakes are the cancer killing /ldg/ do not support niggerbakes
>>
>>107560736
>jet set radio
based
https://www.youtube.com/watch?v=CwE2k0HMDfo
>>
>>107560736
>>107560741
you shouldn't trust rannigger bakes
>>
>>107560660
the pony model is a hodgepodge of slop never use that shit
>>
>>107560762
nothing based about niggerbake supporters (also samefag)
>>
File: get filtered.gif (1.45 MB, 400x225)
1.45 MB
1.45 MB GIF
>>107560761
>niggerbakes
>>107560774
>rannigger
those will be filtered too
>>
>>107560762
my first run of qwen edit i nudified her in that style, buckets were filled with my coom that night.

anyway got my answer, this garbage took 3 minutes. back to lightning kek
>>
>>107560785
kek
>>
File: img_00259_.jpg (421 KB, 1643x1232)
421 KB
421 KB JPG
>>
>>107560680
openrouter -- give them $10 and then just use the free providers for whatever models
>>
>>107560794
catbox?
>>
I don't know if you have realized, but you're getting trolled by a single person. The same person that created this thread, by the way. If you don't want to be held hostage by him, go back to /sdg/.
>>
>>107560774
holy fuck, would you just fuck off already? no one gives a fuck about your thread drama bullshit
>>
>>107559692
haven't tried [from:to:when] syntax in z-image before (hapa went too asian imo) but it seems to work but probably benefits from more steps than 8 or 9.
https://files.catbox.moe/0i56p8.png
>>107559752
the body's probably better here
>>
>>107560831
nice one anon
>>
>>107560813
Thank you big dick sir. The autism itt is higher power level than me, bordering on esoteric schizophrenia
>>
>>107560785
don't support the false flag splitbaker nigger that false flags
>>
File: FluxKrea_ErikaKirk.png (2.54 MB, 1344x1728)
2.54 MB
2.54 MB PNG
NetaYume link in OP still needs an update to V4:
https://civitai.com/models/1790792?modelVersionId=2485296
>>
>>107560639
>Ponyfag said in its V7 autopsy that t5 struggles to separate style and content (Even though he moralfaged artist tags away his dataset still had synthetic style cluster tags)
oh how convenient that he "found" this flaw that just so happens to align with his moralfag views. no way he lied to cover his ass, ha ha
>>
>>107560823
yeah I figured. catjack does this shit all the time by playing both sides. wouldn't be surprised if more than half of the debo rentry was just him being a faggot
>>
>>107560854
fair, but i'll note the creator actually came in this thread, posed as a regular user and evaded training questions, which i'd say is really bad form.
https://desuarchive.org/g/thread/107488160/#107489408

i don't mind if we update the link anyway.
>>
>>107560854
the aesthetics are nice but how on earth does a model like this have worse anatomy than illustrious/noob? Most of the hands in those gens are either mangled or have one or two crinkled fingers going in the wrong direction.
>>
File: catjak.jpg (96 KB, 736x919)
96 KB
96 KB JPG
>>
>>107560863
I highly doubt ani or debo would waste their lives being so pathetic. maybe we should listen to the "schizo" and just post in the drama free threads that he is trying to make us not do by telling us what to do? then again I'm confused. how do you get rid of a schizo for good? dox them or something?
>>
>>107560883
more like >>107559355
>>
>>107560563
He didn't lie, it's just the Chroma dataset was only 5 or 6 million images and not explicitly anjme focused, it was never going to somehow have super strong Booru artist tag recognition
>>107560639
The clusters were a retarded idea but they do work once you figure out which is which, I don't really get what Astralite's point was if he said that
>>
>107560876
>107560686
>107560699
>107560704
>>107560761
>>107560783
>>107560823
>>107560863
>>107560886
Stop ban evading, Julien
>>
>>107560902
you can leave your schizoid zone at any time
>>107557807
>>107557807
>>107557807
>>
File: ComfyUI_temp_ppfpo_00126_.png (1.88 MB, 1024x1536)
1.88 MB
1.88 MB PNG
>>107560854
Yes, this new Neta is very good. Has anyone tried it?
>>
>>107560854
>>107560876 (me)
here he is again >>107560919
>>
>>107560919
I will NOT download your model (again).
>>
>>107560894
>I don't really get what Astralite's point was if he said that
I believe his model struggles to draw certain characters, concepts etc. strongly associated with specific style clusters, in different style clusters.
Never bothered to check his junk model so can't comment on the veracity of the claim.
>>
>>107560877
IDK what you mean really, the anatomy is fine if you ask me, it's definitely picky about sampler / scheduler choice though. Without getting into res4lyf stuff I find DPM++ 2S Ancestral Linear Quadratic at about CFG 5.5 the best for NetaYume. It's also stronger at 1280ish and above base gen res than it is 1024ish.
>>
>>107560919
>shit hands
>very good

>>107560902
grow the fuck up
>>
>>107560938
you are the same person but you have no self control? schizoid behavior. I blame half nig genetics
>>
>>107560876
wat? You have to be trolling or something, I don't get wtf you think that link is supposed to prove lmao. Various people have been using Neta in this thread for a while now, there's a reason it's in the OP to start with.
>>
>>107560967
comfy already cursed the model like Wai and sd3 before
>>
File: img_00263_.jpg (623 KB, 1696x1272)
623 KB
623 KB JPG
>>
>>107560978
it really is the Wai of this gen

also fuck tran with this spam schizo nightmare dilation. he did this all the time in /sdg/
>>
>>107560894
>He didn't lie
he did, he said "artist tags will be preserved" and you want me to pretend that on his 5 millions images there was somehow 0 artist tags in there and we were just unlucky? gtfo
>>
>>107560876
How does that archive link prove it exactly? Lmaeo
>>
>my dumbass through qwen edit and over a year later of using comfyui realizing there's a FILENAME PREFIX field in save image extend
>THAT's how i can organize my gens by what model they were made in
>>
>>107560978
I have no fucking clue what this comment means
>>
>>107560978
>>107560996
wai at least gave us a vae. a very saturated one but it was at least something. lumina is a dead arch
>>
>he thinks WAI is anything more than a jeetmerge that only got popular because the "author" lied about it being a tune
>>
>>107561011
anything comfy says he outwardly likes is doomed to be a failure model. sad he said the zimage anime tube was good. guess we do have to wait after all
>>
>>107561012
huh? its literally just sdxl vae though?
>>
>>107561016
it was a tune. remember when the retard destroyed his drive because he did raid0 with ssds?
>>
>>107561023
it was a trained 1.x vae. the Wai sdxl never really released the project fell apart because of neggles
>>
>>107560967
>>107561001
you're right of course, it does not prove anything, but i find the whole situation fishy because
>he completely refused to post his own gens of the model, instead posting a bunch of gens made by the creator from the civit page
>the poster is at least an insider, because he says "apparently this model took the dev forever to make because he put a lot of care into the release" which i find no indication of on the civit page
>the poster is ESL/chinese in a way you don't really see in /ldg/ conversation ("I shared it last day", "Yes, it can do")
but yes, you are right and i have no issues with updating the link or anything
>>
>>107561001
I like how the link includes a guy hating on NetaYume while being excited about the Z anime tune being done by the same guy as NetaYume lol. I hope it's good but IDK how it'd get to the same knowledge level as NetaYume, unless he has access to the entire original Neta Lumina 1.0 dataset now or something.
>>
>>107561037
it has shitty style parity. wait for the noob team to do it because I'm not impressed
>>
none of these shills compares to the OG pixshart sexuals desu thobeit
>>
also fuck ranfaggot. your gens suck and you constantly ruin the threads with your jealousy bullshit
>>
>>107561046
What does? The Z-Image tune by Neta man can't possibly exist yet
>>
File: Z-image turbo.png (1.66 MB, 1280x720)
1.66 MB
1.66 MB PNG
>>
>>107561067
ouch!
>>
reminder to please not bake tranfag. we hate you enough already for pulling this shit again
>>107557807
>>107557807
>>107557807
>>
So much of this discourse is brought about by those who only got into this hobby recently.
>>
Migrate:
>>107556266
>>107556266
>>107556266
>>
>>107561060
the neta troone team sucks ass so I'm not expecting anything increadible. noob team will train uncensored
>>
Someone make a new thread, Julien's having a melty again
>>107561086
>>107561104
>>
>>107561104
nice try debo (tran)
>>
ill wait for the real bake
UGGG I KNOW I KNOW
but im just gunna wait hehe
>>
>>107561086
where's debo's rentry?
>>
>>107561120
Calm down. It'll be okay.
>>
NEW QWEN EDIT (I just woke up).
>>
>>107561120
>Julien's
the half black American tranny had a name? she baked both threads as a false flag
>>
>>107561136
>>107561140
Nice phone
>>
>>107561133
debo is off topic and hasn't been a problem in years
>>
>>107561025
>it was a tune.
So is it still a tune? Is there a way we can tell? I wish this """open source""" culture we have wasn't so hush-hush with training methods, datasets, merge components etc.
>>
>>107561151
>in years
in decades even
>>
>>107557807
>>107557807
>>107557807
fill this tran thread before going to the new tran thread he will probably splitbake in a moment
>>
New thread. Move when ready.
>>107561167
>>107561167
>>107561167
>>
>>107561164
>>107561171
thanks for supporting this behavior anons. great job at implementing option 2
>>
https://github.com/huggingface/diffusers/pull/12839

new edit soon
>>
>>107561222
>this PR bestows Qwen Image Edit with enigmatic new powers.
if the new power is not the removal of slop I don't care, Z-image edit will dethrone it
>>
File: zimg_0042.png (1.67 MB, 1000x1496)
1.67 MB
1.67 MB PNG
>>107560831
>>107560831
anon you are a saint


>https://files.catbox.moe/0i56p8.png
>>
>>107561217
You're welcome
It WILL continue, till you fuck off
>>
>>107557967
filtered



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.