[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>107800934

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
glad to be posting in the last general before base
>>
>>107802925
https://huggingface.co/AuraDiffusion/16ch-vae
https://huggingface.co/ostris/vae-kl-f8-d16
>>
>>107802907
top right and bottom row second from the left would be cool if this was the day z turbo dropped lol
boring
>>
Reminder about Chinese culture.
>>
>>107802946
you're courting death
>>
>>107802945
Desu they remind me of my first SDXL0.9 prompts
>>
File: ComfyUI_00016_.png (1.14 MB, 1276x958)
1.14 MB
1.14 MB PNG
>>
REMINDER THAT LTX2 IS JEET BASED
USE T2V WITH NO PROMPT
100% HITRATE ON BOLLYWOOD TRASH
>>
kino!

https://files.catbox.moe/1es9up.mp4
>>
>>107802996
amke?
>>
>>107802994
every model is either jeet or chinese based cause most of the worlds population and therefore content is from those. It is promptable to steer away from that so it means nothing
>>
>>107803005
People here are retarded, most images and videos on earth are from Indians and Chinese, of course a model trained on generic data is going to be filled with them.
>>
>>107802996
>https://files.catbox.moe/1es9up.mp4

I was hoping for the car to turn into a transformer.
>>
No more subgraphs
>>
Soon™
>https://github.com/modelscope/DiffSynth-Studio/commit/0efab85674f2a65a8064acfb7a4b7950503a5668
>>
File: WANI2V_INT_00007.mp4 (3.75 MB, 880x1176)
3.75 MB
3.75 MB MP4
ltx2 nsfw loras when?
>>
>>107803015
you can do that if you prompt it, still learning how ltx2 prompting works but generally describe it and it happens, same with dialogue.
>>
can i try ltx2 with 8gb vram
>>
What was this post referring to?

>>107799750
>given that I only gen lewds, I have made my decision: I'm going to wait and see if any loras come out of it before I bother running any ltx workflows. I'm sticking with Wan. Also, I can't say I'm entirely trusting of using anything from an israeli company on my machine
>>
File: 3242342334432.png (21 KB, 1447x145)
21 KB
21 KB PNG
>>107803021
Wonder how good they managed to get the base models output compared to Z-Image Turbo since they took extra time in training.
>>
>>107803040
In theory the average gen should be slightly worse but with greatly improved seed variety. With proper prompting and patience you'll get higher quality and more diverse results than you could with turbo (in theory).
>>
>>107803040
They made it extra safe and responsible
>>
>>107802907
>3/7 gens
>top right has mangled hand too
KINO collage, thanks for playing
>>
>>107803052
cant wait to prooompt (holds dick in hand)
>>
>>107803040
The base model is expected to be better trainable and more flexible. While the distilled shows good quality already but struggles with loras and variation, it is hard to get anything more out of it than what's already in the distill. So the base is expected to bring the same quality while being more flexible, since the distilled is an extract of the base.
However, and they took long enough to release it, the fear that the model got beaten to a clump for "safety" reasons and therefore will be an absolute bitch to be finetuned, is valid.
>>
>>>/wsg/6067835
>>
>>107803024
IIRC the first LTX2 lora out already were trained on rented non-consumer GPU, H200 or H100 or something.

Very often more LoRA start to appear in numbers a few days after the training ui and/or kohya_ss support training on consumer GPU.
>>
I feel fortunate the most dedicated idiots that have issues with me are alcoholic low functioning losers that have been exposed a lowcows that can't even match the average /ldg/ poster in skill. They also like to ERP pretending to be girls with other men and can't even read filenames to realize who's posting what
I mean they think they are more important than they really are but yet they never contribute with gens. Are we almost on year 3 of this?
I take month+ breaks from this thread yet here they are still with nothing to show for it. They are going to hate me once I can move from a laptop to a 5090, I know many of them have more fancy hardware and can only create garbage.
>>
>>107803096
i think it might be too dangerous to let people train for ltx, even with their guardrails loras could erode that
>>
>>107803081
I love how you're hyping it up like there actually will be a base. Chinese culture.
>>
File: file.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
>>107803097
can you fix this massah?
>>
>>107803098
It was released along with the LoRA training script. I don't know where the narrative that they don't want people to train it came from.
>>
>>107803104
they literally said on discord they spent a month censoring the model
>>
>>107803102
If you're upset over the way prompts are formatted on a free tool, I don't think local is for you atm. It's very easy to get and there's easy ways to remove undesirable aspects
>>
>>107803098
did lightricks even try to train the model to refuse whatever?

i was under the impression that they just didn't train porn or anime or various other materials that would ideally be trained
>>
>>107803106
oh. what a waste of effort.
>>
>>107803099
There will be a base. Only the time they take to release it after the turbo tease makes me worried that they destroyed it for "safety". Because the turbo version gives a glimpse into what it could be capable of.
>>
>>107803106
Doesn't really change the fact they released a training script. They can want you to train it and not want to release an overly explicit model at the same time.
>>
>>107803110
no they just didn't train it on nudity. gemma though will not let you encode correctly if it was a nsfw prompt but you can use abiliterated gemma
>>
>>107803106
proof? cause I saw nothing of the sort
>>
>>107803127
I think you should just fuck off because you're not having a actual conversation and you're not even posting gens to show anons WHY they should use the model
>>
File: gsdgfsd.gif (41 KB, 640x360)
41 KB
41 KB GIF
>>107803133
>>
File: 1737628355905949.jpg (1.12 MB, 3792x1472)
1.12 MB
1.12 MB JPG
Cool lora: https://huggingface.co/fal/Qwen-Image-Edit-2511-Multiple-Angles-LoRA

The left Miku is the original, the other two are made with this lora.
>>
>>107803152
proof?
>>
>>107803152
Hold up... are you telling me. (plsplspls let it be true) that this... THIS model knows *throat hitches in excitement* MIKU?!
>>
>>107803160
retard oh my fucking god how dumb can you be
>>
>>107803127
>gemma though will not let you encode correctly if it was a nsfw prompt but you can use abiliterated gemma
i hope ai-toolkit and the other trainers then agree on WHICH abliterated gemma-3 12b gets used? as far as I can see, there are variants
>>
>>107803159
Just install it.
>>
>>107803176
i dont have a pc
>>
>>107803165
Why are you being so evasive?
>>
>>107803180
im shy
>>
File: 1751478966031759.jpg (284 KB, 1911x969)
284 KB
284 KB JPG
>>107803178
Good enough for proof? lol
>>
>>107803188
>multiple angles
>look inside
>zoom
>>
File: 1767515502654391.jpg (1.16 MB, 2656x1944)
1.16 MB
1.16 MB JPG
>>107803188
>>
>>107803152
Pretty sure Qwen edit can do this without a lora
>>
>>107803204
benchod
>>
it looks like z image omni will have a "image to lora" function according to its commit?
>>
>>107803136
kys retard
>>
>>107803204
Maybe... but this lora definitely provides better control.
>>
>>107803188
>emilia
shit taste in anime kys
>>
>>>/wsg/6067871
>>
>>107803219
Rude
>>
>>107803240
are you friendly?
>>
File: 1752260999734382.jpg (585 KB, 1328x1328)
585 KB
585 KB JPG
>>107803197
A kind of camera angle rarely seen in ai gens... and it's struggling. But the lora still positioned the camera correctly.
>>
>>107803246
What you don't realize that everyone tried the above and instead of stopping he tripled down and started running ops making shit up pretending to be other anons and kept fucking up because he's mentally ill. From day 1 he went to the general hostile and hopes new posters don't know about his history before he starts to go after posters he has a vendetta against. First time the paste was posted he went on a week long general siege and went berserk on everyone. He even rage quit the discord because people told him to stop having schizo episodes while causing bait drama like posting his IP in the discord and in the thread.
He's retarded and vain and the pastebin hurts his online persona that he can't hide because he can't make anything decent.
he also linked multiple resources that contained viruses and didn't even bother to fix or apologize
>>
How can I set this up so that it does each prompt as a new gen?
Something that I can just click and go do something else and get back to all the shots.
>>
>>107803246
it might be an llm
>>
>>107803250
slop
>>
>>107803118
Even if they do (I'm sure BFL has offered their services to them in this regard) it can still be reversed, and it's easier when it's a undistilled base model we are talking about rather than distilled models like Flux.

That said I don't think they've bothered, it will be like Z-Image Turbo, just not trained on 'sex'. The extra time will have been used to bump up the base model(s) quality in terms of output.
>>
File: file.png (1.03 MB, 850x761)
1.03 MB
1.03 MB PNG
>>107803284
>>
File: sdfsdffsdd.png (209 KB, 1954x1314)
209 KB
209 KB PNG
Holy amateur hour, they STILL haven't fixed it
>>
File: 1737462130185713.jpg (206 KB, 1511x742)
206 KB
206 KB JPG
>>107803265
Get String Selector from impact pack, connect it to CLIP text, and use "Primitive INT - Increment" for string selection. Every new job you queue in Comfy will use the next string.
>>
>>107803173
Abliterated Gemma doesn't quite allow NSFW, in my experience it'll just deftly work around the subject, "seeing" clothing where it's not, describing things tweaked just a hair as to not be so crude. I don't know if you can get around those guardrails to make it follow a lewd/perverse prompt. Especially if you can't get at it with your own Sys Prompt.

Abliterated will stop outright refusing, but its guardrails are still lingering in the background.
>>
>>107803300
If i'm being honest, the temporal upscaler introduces more issues than it seems to fix.
>>
>>107803308
? it fixes it for me?
>>
>>107803293
Citation of what ? Everyone in this thread are speculating. Are you retarded ?
>>
>>107803315
uh oh melty
>>
if base actually comes out hopefully lodestone will pivot and abandon his frankenstein surgical dedistillation experiment
>>
>>107803313
You don't notice the weird warping background that waves over the screen and the weird timing that stretches out the video unless you double the frame rate?
>>
>>107803326
>he wants lodestone unbound
retard
>>
>>107803342
If lodestones doesn't get his daily brushing he gets angwy.
>>
>>107803302
I found this, saves you from spamclicking gens.
>>
>>107803040
I wish the delay was because they trained some of the noob and Neta's dataset in it, but I'm in for disappointment, right
Wonder what this whole deal about them asking for those dataset was for, in the end
>>
>>107803300
LTX official workflows don't have it either unless I am blind, which is possible I am sleepy lol. Could I get a sample flow on how to use it?
>>
>>107803373
Comfy said it's a separate anime specific model if I remember right.
>>
>>107803366
what is sks?
>>
File: 1741380680104773.jpg (737 KB, 3272x1328)
737 KB
737 KB JPG
>>107803366
Nice. I just prefer not to pile up custom nodes. Impact pack has adetailer, so I already had it installed.
>>
>>107803383
The authors use it in their examples. Perhaps lora activation word?
>>
>>107803389
what does adetailer have to do with promptline?
>>
WAN2.2 vs LTXV 2
https://files.catbox.moe/phogc1.mp4
I'm running LTX again with a rewritten prompt better following the guidelines but it's not looking good.
It is a sick beat though
prompt:
This black-and-white manga panel dramatically depicts a train performing the impossible — “MULTI-TRACK DRIFTING!!” The upper half shows a train speeding across the tracks, its cars angled sharply as sparks fly from the wheels, giving the impression that it’s drifting like a race car. The exaggerated motion lines and tilted perspective add to the absurd intensity of the scene. In the lower panel, a shocked character’s wide-eyed expression conveys disbelief, amplifying the over-the-top nature of the moment and highlighting the comedic, almost surreal exaggeration of the concept.
>>
>>107803382
Won't this make weeb finetuners back off from finetuning z base then? If you have a limited budget to finetune, I guess it's better to wait for the anime model, if they really do have one
And it will be a pain to retrain all future z base loras on the anime model
>>
>>107803405
ltx is just too good, they will have to release wan 2.5 to compete
>>
>>107803405
https://files.catbox.moe/lw1vf1.mp4
I really want this song now
>>
Damn, this new angle lora is so much better.

>>107803389
Fair enough. I make backups like every 5 custom node that I install..
>>
It's all soulless. Why are you faggots so addicted to generating this soulless crap? I can instantly see it's AI no matter how "good" it's considered. it's pure crap. Fuck off.
>>
>>107803423
>can generate anything
>generates pics of hitler
homo
>>
>>107803412
I haven't heard any weebtuner step up to say they will. Only pony and lodestone said they would, so just the furries for now.
>>
>>107803402
Nothing. I just already had Impact Pack when I wanted to do string selection, and therefore used impact nodes.
>>
no u
>>
>>107803424
benchod
>>
The ear rape music when you don't prompt something specific is hilarious
>>
File: Comfy_022.jpg (2.61 MB, 4284x5712)
2.61 MB
2.61 MB JPG
>>107803444
>>
What is LTX?
>>
>>107803472
Veo 3 at home. Emphasis on the "At home."

Lots of interesting stuff in it though.
>>
>>107803472
loser turds xcited (over nothing, as usual)
>>
>>107803472
finetune Wan2.2-TI2V-5B
>>
>>107803472
A technically impressive video model that can do video+audio but the quality leaves a lot to be desired
>>
https://files.catbox.moe/3vqp92.mp4

Can this model do camera transitions without the corny "whoosh" sound?
>>
>>107803514
https://files.catbox.moe/9pb4km.mp4

Another one.
>>
File: 1757743450190195.jpg (1.64 MB, 1248x1824)
1.64 MB
1.64 MB JPG
>>
>>107803305
I found 12b abliterated would dance around unless you specifically referenced NSFW aspects yourself, whereas 27b was a bit better, but I only had a 12b with vision so I'm not sure how exactly 27b performs for vision, and I don't know exactly if there is a difference for prompt text generation between using a vision model to analyse an image vs as input for the text encoding of an image model, and 27b sounds pretty big for an encoder so it probably doesn't make sense anyway.
>>
>>107803514
Probably not, camera control is fairly lacking in these video models.
>>
>>107803305
I think you’re confusing the models bias to put clothing on people with the text encoder. It’s the model itself that biases away from nudity, not the text encoder
>>
File: ZiMG_00113_.png (1.22 MB, 1440x1280)
1.22 MB
1.22 MB PNG
base status?
>>
I updated my nvidia drivers earlier and forge neo no longer works, something about torch or something. Was there a quick fix or do I gotta do the annoying stuff?
>>
>>107803591
delete venv so it reinstalls
>>
>>107803582
yes
>>
Everyone just turn off stretch sigmas in the LTXV scheduler node until they fix how their latent upscaling is supposed to work. That fixes the blur at the cost of some more generation time
>>
>>107803618
proof?
>>
>>107803582
Looks like it might be out later today, full support has just been added to DiffSynth-Studio, and Modelscope is tight with the Z-Image team
>>
>>107803426
haha look at this loser look how not-based he is i bet hes in love with indians too
>>
>>107803651
indians love hitler thoughtverbait
>>
File: file.png (3.38 MB, 1536x1536)
3.38 MB
3.38 MB PNG
>>
>>107803616
yeah rip, maybe it'll be faster now at least
>>
>>107803639
Do you think LoRAs will be cross compatible, or would the step distillation get in the way (only ever used step distillation with Wan, so no I've idea how they work with image gen)?
>>
>>107803618
honestly i'm gonna wait a few weeks cause it sounds like the release was rushed, the memory management in comfy was fucked, the workflows are a mess and someone will figure this out eventually
>>
File: QwenImg_00021_.png (2.8 MB, 1152x1440)
2.8 MB
2.8 MB PNG
>>107803489
can't deny it's fun to fuck with
adding audio and inbuilt lipsync is a new paradigm for narratives, not that you couldn't accomplish that with just an image, this for instance tells a story on its own but adding the sound of rain, footsteps and perhaps a line about a GPS on the fritz brings it to another level
but yeah, if you're just here for the 1girls then the benefits will seem insignificant
>>
>>107803670
>rip
I mean, you won't lose anything, its just the venv
>>
>>107803671
That's a good question, I have no idea. Best case scenario, base trained loras will work great with Z-image Turbo, but I have a nagging feeling they won't.
>>
LTX came out a couple of days ago, and now Z-Image Base is being released

Local is eating good in 2026
>>
File: ZiMG_00117_.png (1.3 MB, 1440x1280)
1.3 MB
1.3 MB PNG
>>107803639
>Modelscope is tight with the Z-Image team
tight. tight.
>>
>>107803727
>LTX
Complete shit
>Z-Image Base
Must be released two months ago
>>
>>107803727
>and now Z-Image Base is being released
proofs?
>>
>>107803685
Yeah I still had to download stuff though. Also my pc is chugging now so that didn't work out lmfao.
>>
>>107803582
>>107803021
>Soon™
>>
>>107803639
>Looks like it might be out later today
will be useless until finetunes and jailbreak come, due to safety reasons.
>>
>>107803901
nah, edit will be fun for sure. And it looks like they actually preference trained another model called z image as well but its not distilled so it will have more variance
>>
>>107803754
Using your brain. Modelscope just added full Z-Image Base support to their DiffSynth-Studio software, something they can only do if they have access to the model.

Also it would be pointless to add this unless the model release was imminent.

Add to this the Z-Image guys posted "Your patience will be rewarded" today on their Discord.
>>
>>107804003
>their DiffSynth-Studio software
literally who
>>
>>107803924
>nah, edit will be fun for sure
Doubt edit will be released today, it will most likely be the two base models (pre-training and SFT)
>>
People forgetting their Chinese culture lessons.
>>
>>107804023
https://github.com/modelscope/DiffSynth-Studio
>>
1000 years of SDXL darkness
>>
>>107804003
Thank you dear Chinaman if this is true. This would also mean that Comfyui needs an update perhaps... grim.
>>
How do I modify video length using ltx-2 in comfy?
And the workflow automatically adjusts the image resolution yes?
>>
>ltx2 releases
>tonguee to release base around same time
>ltx2 will be forgotten

fucking called it a few threads back, kek
>>
>>107804058
frames my dear boy, 24 per second
>>
>>107804055
To pull or not to pull, that is the question...
>>
>>107804059
They rushed to release the distill when flux was released when the rest was still training. Of course they would do it again lol.
>>
>>107804059
Not really, one is for video and one is for images

That said I have serious doubts BFL will ever release their Flux 2 Klein model. In fact I doubt BFL will survive 2026, they can't be bringing in a lot of revenue.
>>
>>107804003
>Z-Image Base
They added Omni base, not base, no?
>>
>>107804059
>OMG GUYS THE IMAGE MODEL IS COMING OUT AT THE SAME TIME AS THE VIDEO MODEL LE BASED CHINA?!?!
>>
>>107804087
https://github.com/modelscope/DiffSynth-Studio/commit/0efab85674f2a65a8064acfb7a4b7950503a5668

>Support Z-Image-Omni-Base and its related models
>and its related models
>>
ltx2 tongue my anus
>>
File: 1754318122369966.jpg (309 KB, 1125x843)
309 KB
309 KB JPG
>ltfart 2 requires h100 settings
>still xmas in china
...
>>
>>107804109
Have a feeling they'll only release omni base, not the actual base but we'll see.
Hoping for the best.
>>
>>107804003
>OH MY HECKING ALTMAN THEY ADDED PULL REQUESTS!!! THAT MEANS ITS COMING TOMORROW!!!
https://github.com/huggingface/diffusers/pull/12857
>3 weeks ago
they're only doing this to get retardcattle to tune in and salivate over every "surprise" announcement they make that turns out to be another shitty LLM.
>>
>yfw nothing gets released today
>>
File: ComfyUI_00051_.png (1.06 MB, 1472x640)
1.06 MB
1.06 MB PNG
Z base obviously has insane potential, but I somehow doubt that edit model's comprehension will be anywhere as good as qwen's with that few parameters
>>
>>107804187
who cares, qwen looks like plastic in comparison, if z edit can fix that then they win
>>
>>107803097
>>107803109
Imagine how rent free someone is in your soul that you periodically repost gens and post for word from over a year ago aimed at the pastebin schizos.
This is how you show how rent free someone has and will always be in your head.
>>
>>107804177
That had to be added early since they want it to be in the next Diffusers release (thus it needs to be reviewed), which will likely happen ASAP after Z-Image base drops

Adding it to a trainer which updates constantly means it is imminent

Don't be stupid
>>
>>107804143
Hopefully both, omni base will likely be best for large finetunes, with SFT being best for loras
>>
>>107802907
>>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
why is this still in the OP?
>>
>>107804301
Because those two faggots are cancer even the anime thread agrees.
Cope or go to another general, this isn't up for debate
>>
>still have 15.9gb left on the 96gb ssd
maybe base will fit somehow :D
>>
>>107804308
can someone else bake the threads?
>>
>>107804301
ranfaggot and debo are cancerous retards

>>107804308
only one of the faggots seethes at ani in the anime thread
>>
You retards still think base is coming. Chinese culture.
>>
New guy here what exactly is the difference between /sdg/ and /ldg/?
>>
>>107804359
Yes
>>
>>107804187
Her head needs to be smaller.
>>
>>107804003
But did they train the model on gelbooru as they promised?
>>
>>107804359
stable diffusion vs any (local) diffusion
>>
File: 1757774624027454.png (476 KB, 1635x4333)
476 KB
476 KB PNG
>>107804301
keep posting it, this surely is the time it'll work
>>107804359
/ldg/ was created because people were fed up with the avatartroon circlejerk that /sdg/ had become
>>
>>107804371
Okay I thought this thread is for videogen/Chinese models only.
>>
>>107804359
nothing other than the anon that bakes /sdg/ posts news and art the baker for /ldg/ is a spiteful troon and will only post images of her own false flags and seething in an attempt to gaslight anons into hating a developer for some unexplained reason
>>
File: ComfyUI_00045_.png (1.1 MB, 1472x640)
1.1 MB
1.1 MB PNG
>>107804364
Original gen was a lot more, well, faithful to the original
>>
>>107804382
Sounds like a weird situation.
>>
>>107804373
but you are an avatartroon that wasn't people to circlejerk you but you trained out too far and everyone hates your "art"
>>
>>107803412
The main issue is the fact that nothing was said or promised for the Z-Image anime tune other than the fact that we know that noob's database is being used for it. They could make it API for all we know.
>>107803428
The closest is Newbie which is using the same base architecture as Z-Image with Lumina 2 but with no scaling up of parameters and only doing replacing of component of the pipeline and models with different parts. The problem is how underbaked Newbie is and the utterly different style of prompting with XML at the moment. I really wish they release a 0.2 or something but not every model can be like how Chroma did their pre-releases.
>>
>>107804399
>wasn't
wants

>>107804402
>You have been crying for a long time
your record is approaching 4 years. I think your high score is safe
>>
base release will prob make this useless but this is legit just a better z turbo https://civitai.com/models/2264784/zit-khv?modelVersionId=2549250
>>
it drops when it drops. stop being retarded about zib already
>>
>ltxv2
>input picture of woman
>prompt her to say something and do a simple action
>every single gen it hangs on the static input image for several seconds while audio plays then the last second it cuts to show an unrelated woman doing the action I prompted (while also being garbled slop)
what the fuck gives?
>>
>>107804391
Head size didn't change at all but the skin texture got fucked up.
>>
>>107804446
are you doin something stupid like allotting 15 seconds for her to say "hi" and blow a kiss?
>>
File: 1740875021743794.png (266 KB, 687x890)
266 KB
266 KB PNG
>>107804410
fr
>>
>>107804410
Better how?
>>
>>107804454
this, you cant just give it 5 words for a prompt
https://ltx.io/model/model-blog/prompting-guide-for-ltx-2
>>
File: 1753974477733919.png (39 KB, 695x1016)
39 KB
39 KB PNG
>>107804466
nmp
>>
>>107804468
nai?
>>
>>107804442
this is why nobody has sympathy for your gaslighting ass troonjak. you constantly become this insufferable dramafaggot
>>
>>107804474
what does 6 and 1 mean
>>
>>107804468
even better details
>>
>>107804468
It has a better surface texture, but in return it loses details (like on the magic in the hand).
>>
>>107804502
worse*
>>
>>107804491
Everything in that post is completely true, and its subject is perfectly described by what was said in your previous one
>>
>>107804523
less schizobabble please
>>
>>107804539
I'm not from around here
>>
lay off the meth please
>>
come on alibaba. Release wan2.6 to beat ltxv 2, you know you want to
>>
is chroma better than lumina?
>>
>>107804591
lumina is like 10% baked, chroma is like 85%
>>
>>107804600
so chroma is the goat?
>>
>>107804566
>bro stop making money
retard
>>
>>107804608
yes of slow gens and weird striped artifacts
>>
>>107804566
Pathetic local user waiting for breadcrumbs
>>
>>107804622
then whats good
>>
>>107804608
for nsfw? yes, nothing comes close
>>
>>107804630
Why are you lying to newbies?
>>
it's almost midnight in china, i don't think we're getting the model today. maybe tomorrow
>>
>>107804638
duas semanas
>>
>>107804635
nta, but name a better model for realistic nsfw
>>
File: chroma_0345_.jpg (1.97 MB, 2048x3584)
1.97 MB
1.97 MB JPG
>>107804635
stfu troll
>>
>>107804653
Lumina is better for 2d
>>
>>107804649
*Unlike your adversary
>>
>they will release on Friday so we can spend our weekend gooning
Blessed Chink
>>
>>107804656
>eyes
slop
>>
>>107804638
Chinese don't have a weekend, they work 7 days a week. Could release later.
>>
>>107804656
Lumina can do better than that 100%
>>
>>107804679
then why does no one ever post images with it?
>>
>>107804677
>Chinese don't have a weekend
fake
>>
>>107804630
Which chroma though?
>>
Base is being uploaded right now but it's split into those sequential .safetensors files, it needs to be combined or whatever it is they do
>>
>>107804683
?
>>
>>107804684
qrd
>>
>>107804684
always the latest from silveroxides
>>
>>107804687
we have weekends off moron
>>
>>107804627
Nothing
>>
>>107804677
i'm not talking about weekends. i'm talking about the fact that it's almost 11PM there, no one works that late
>>
>base might actually come out on my birthday
>can't run it
>>
>>107804656
post workflow schizo
>>
File: lumina.png (644 KB, 568x926)
644 KB
644 KB PNG
way better
>>
>>107804707
you turned it into generic slop, retard
>>
>>107804720
The original was generic slop double retard
>>
File: chroma_deathclaw.jpg (699 KB, 1448x1448)
699 KB
699 KB JPG
>>107804707
1. you turned it into slop
2. image to image at low denoise does not count anyways, lumina cant do anything like that itself
>>
>>107804724
All right, drop a catbox for this and I will download chroma immediately
>>
>>107804724
If Lumina so bad, then why does it exist? You lost.
>>
>>107804705
https://files.catbox.moe/kr3svc.json
>>
>>107804566
>excited to see open source video model moving forward.
The implied tone here is that they are no longer open source. Chinese culture, you see.
>>
>>107804737
>A close-up POV shot of two hands holding a square of homemade fudge between the fingertips, right in front of the camera lens. The fingers are clean, well-groomed, with natural skin texture, slight wrinkles, and visible nails. Soft golden studio light catches the edges of the fudge and highlights the contours of the fingers. In the softly blurred background, a smiling woman in her 40s wearing a patterned dress stands in a cozy kitchen with sunlight streaming through a curtained window. The focus is razor-sharp on the hands and fudge, with fine detail in the skin and subtle shadows between each finger, showing lifelike anatomy and natural bending at the joints. 1990s American TV commercial style, slight film grain, warm nostalgic color palette.
Huh?
>>
https://files.catbox.moe/b1inkt.mp4

ltx video continuation.
>>
>>107804772
can you make patrick use datas bumhole?
>>
File: pose.jpg (237 KB, 1325x881)
237 KB
237 KB JPG
>>107804724
chroma is so damn nice once you find good settings
>>
>>107804685
why do ML people do that? split models into retarded arbitrary chunks like it's some autistic standard
are they stuck on FAT32 filesystems or something, i can't explain it
>>
>>107804804
blueboard bro
>>
File: Lumina.png (2.04 MB, 1920x1080)
2.04 MB
2.04 MB PNG
Show me chroma doing anime this good, you can't
>>
>>107804804
careful with the one zombie woman, blue board
>>
File: chroma_00034_.jpg (965 KB, 1920x1920)
965 KB
965 KB JPG
>>
>>107804816
"this good" meaning looking like a screencap from a 240p youtube video full of compression artifacts?
>>
>>107804817
>blue board
That's why he made her blue
>>
>>107804795
Well what are those good settings?
>>
>>107804831
do better then
>>
theoretically speaking what would base do that turbo can't?
>>
>>107804849
i don't generate anime i'm not interested in cartoons. that doesn't mean i cannot be an objective critic
>>
>>107804855
>i could do it if i wanted
lol lmao i accept your concession
>>
https://files.catbox.moe/c7k5sq.mp4
>>
>>107804854
It would be based
>>
File: chroma_zombies_mall.png (1.42 MB, 1248x832)
1.42 MB
1.42 MB PNG
>>107804817
>>
>>107804860
i never said i could do better. i said your gen was shit. learn some reading comprehension
>>
yo where base at
>>
File: chroma_00045_.jpg (1020 KB, 2205x2853)
1020 KB
1020 KB JPG
>>107804860
nta but here, I've posted this one before, time to put you in your place again
>>
>>107804882
China
>>
>>107804884
that's not anime, its some 3d model with cel-shading
>>
>>107804889
I just realized, if they use github Trump could release it whenever he wants
>>
>>107804854
cunny
>>
>>107804898
>moving goalposts
You have not posted even one thing in comparison cept this slop so just be quiet now >>107804707
>>
>>107804916
you can't make anime with chroma therefore chroma loses to lumina, period
>>
>>107804854
Being more flexible. It will also be twice+ as slow to generate anything.
>>
>>107804871
I want a game that looks like that. Why can't we have nice things?
>>
File: file.png (202 KB, 669x1187)
202 KB
202 KB PNG
Why does this workflow use 2 samplers?
>>
>>107804908
A friend of mine claimed that it can do cunny pretty well.
>>
>>107804919
>thinking
>>107804707
>looks better or even comparable to
>>107804884
you just have shit taste anon, just go enjoy your shit model
>>
>>107804943
its basically sdupscale like most WFs but just using clownshark samplers which are better
>>
>>107804951
>my car is better than your boat
retard
>>
>>107804960
you really do have super shit tastes if you thought >>107804707 was in any way a own. You just enjoy ai slop
>>
File: mlady.jpg (449 KB, 948x1264)
449 KB
449 KB JPG
>>107804838
lora stack and fast inpainting workflow
>>
>model tribalism
just post best settings for your favorite model to settle this once and for all
>>
LTX bergs are doing AMA on plebbit https://www.reddit.com/r/StableDiffusion/comments/1q7dzq2/im_the_cofounder_ceo_of_lightricks_we_just/
>>
>>107804973
>fast inpainting workflow
qrd?
>>
>>107804973
share it
>>
z chroma when?
>>
>>107804989
i use the adetailer auto inpaint
>>
>>107805006
Isn't that just a pretty standard workflow
Why do you even need that on chroma anyway
>>
Reminder you can gen 100 gens in the time chroma gens one
>>
>>107805013
better pussy
>>
>>107805014
actually comfy finally fixed it a while ago
>>
>>107805014
Using what model
>>
>comfyui updater just spins forever not utilizing my cpu or network
I'm so tired of this chinky piece of shit
>>
btw T2V LTX2 is perfect.
For I2V however, for some gens you have to run them at 48 fps and from 20 - 40 steps.
48 fps seems to be fixing the blury / deformity issues
so much better results
also i'm using the res_2s sampler as recommended by ltx. it's 2x times slower but gives me better results
>>
do this with anything not chroma. I'll wait. Extremely nsfw warning
https://files.catbox.moe/o69ltq.png
>>
>>107805032
>btw T2V LTX2 is perfect.
proof?
>>
>>107805004
lodestone is already experimenting with training Z-Image Turbo, but he is whining about getting Base so likely he will drop current experiments when it drops and start a large scale finetune in a hartbeat
>>
>I MUST USE CHROMA
>*look inside*
>>>107805043
>>
>>107805032
Ugh, this is the temporal upscaler schizo again, isn't it?
>>
Halp
Is there a way to generating consistance character?
>>
>>107805032
It's censored.
>>
File: zimg_00013.png (1.71 MB, 960x1280)
1.71 MB
1.71 MB PNG
to the anon who requested it, psxrodox is on civit
>>
>>107805043
But can it do dog dicks?
>>
>>107804973
imagine having horns like this - and then constantly get stuck at doors and stuff
>>
>>107805063
wan was censored when it first started as well. Apparently this thing learns faster than wan from what Ive heard so far
>>
>>107805072
your fucked up imagination is the limit. Chroma can do crazy fucked up shit. z chroma will be insane
>>
File: mlady.jpg (361 KB, 864x1152)
361 KB
361 KB JPG
>>107805065
very cool, downloaded!

>>107805074
yeah but horns + teeth = two extra unarmed attacks per round
>>
>>107805116
How do you get this style out of chroma? It's my biggest problem with the model
>>
holy shit its out
>>
>>107805043
>do this furry thing with anything but the model know to be made by a furry, I'll wait
>>
>>107805043
Best Chroma ad I've seen so far, unironically. But also >>107805141
>>
>>107805141
really, do anything not 1girl or in the same slop style everyone who uses illustrious or z image posts. Oh wait, you cant cause that is all they can do
>>
>>107805159
But chroma cannot do styles at all, so what's your point?
>>
>>107805116
thanks anon. fwiw, i had big hate for chroma until i sat down and wrestled with it for a few days. you're getting some nice gens out of it.
>>
>>107805159
illustrious can do way more than 1girl, fuck off
>>
>>107805148
>https://files.catbox.moe/o69ltq.png

You can do it with women as well too, did with an IRL lora forever ago but it definitely works with zoo on women. Don't know what the best way/workflow is to use chroma currently though.
>>
File: file.png (4 KB, 252x33)
4 KB
4 KB PNG
>>107805065
liar
>>
>>107805173
lol what, this thread has been greatly differing styles for chroma posts, you are actually trolling or you really are a slop enjoyer
>>
File: ComfyUI_temp_cuctu_00002_.png (2.66 MB, 1152x1728)
2.66 MB
2.66 MB PNG
>>
>being able to gen horse cocks is what defines a models worth
why are chromafags like this
>>
can chroma do niggario poppin a goomba with his glock while hittin a fat doobie tho lol
>>
>>107805195
see what I mean, first non chroma post is back to the the same slop plastic shit. That is why I never come here, you people are shit at this
>>
>>107805195
Looks like a 40 years old hag trying to look young
>>
>>107805208
>just spend 10 minutes per gen
fuck off
>>
>>107805116
>gets filtered by low hanging tree branch
>>
File: zimg_00007.png (1.6 MB, 960x1280)
1.6 MB
1.6 MB PNG
>>107805192
what the heck

civitai.com/models/2291079?modelVersionId=2578199
>>
>>107805204
I can make the nigger do all that while sucking a horsecock sure
>>107805213
comfy fixed it not too long ago, and you could fix it yourself before that
>>
File: mlady.jpg (358 KB, 948x1264)
358 KB
358 KB JPG
>>107805131
>How do you get this style out of chroma? It's my biggest problem with the model
I usually use 2 loras, one for style (brushtrokes etc.) and one for character shapes, details. My datasets are almost completely synthetic at this point (= made by me)
>>
>>107805194
Nah I'm just bad and I don't know how to prompt it to get a pretty style
Would appreciate any advice
>>
>>107805226
>to use chroma you need to create datasets
>>
>>107805230
just use my workflow
>>
>>107805230
4chan is such a shit site to use cause it strips meta, steal a image from the civitia page or discord
>>107804737
>>
>>107805244
All right, I appreciate it anon, it's a good idea to scour for some examples
>>
File: ComfyUI_temp_azqxo_00009_.png (3.31 MB, 1152x1728)
3.31 MB
3.31 MB PNG
>>
File: file.png (1.45 MB, 1024x1024)
1.45 MB
1.45 MB PNG
I actually bothered getting Chroma and I think the other anon was right, look at this shit
>>
File: r.jpg (220 KB, 848x1488)
220 KB
220 KB JPG
>>107805267
looks pretty neat
>>
>>107805264
>the leg
Okay that is NOT a good chroma ad
>>107805267
But did you intend for it to look like an old video game
>>
>>107805267
>intentionally fucking it up
I see right through you "other anon"
>>
File: zimg_00026.png (1.67 MB, 864x1280)
1.67 MB
1.67 MB PNG
>>107805264
i see u
>>
>>107805264
>more z image slop
>>
File: file.png (1.5 MB, 1024x1024)
1.5 MB
1.5 MB PNG
>>107805280
>>107805278
I'm just being honest, it looks like slop. Look at this zombie
>>
File: ComfyUI_temp_cuctu_00014_.png (3.05 MB, 1152x1728)
3.05 MB
3.05 MB PNG
>>
File: mlady.jpg (386 KB, 1260x840)
386 KB
386 KB JPG
>>
File: ComfyUI_temp_cuctu_00015_.png (3.01 MB, 1152x1728)
3.01 MB
3.01 MB PNG
>>
File: wan22_00304.webm (2.02 MB, 656x800)
2.02 MB
2.02 MB WEBM
>>107805032
But does it finally work with 2D animation, or do I have to struggle forcing anime style again?
>>
File: chroma.png (875 KB, 1024x1024)
875 KB
875 KB PNG
If Chroma can do better then show me how
>>
>>107805295
why are you trolling? If you are not then use the WF posted or on the civ site / discord instead of whatever shit you are doing.
>>
File: zimg_00026.png (1.46 MB, 960x1280)
1.46 MB
1.46 MB PNG
>>107805319
chroma can do great shit it just takes a lot more to dial it in, which is why i don't use chroma
>>
>>107805322
I'm using the WF you posted >>107804737
>>
>>107805307
Demon gf when
>>
File: file.png (523 KB, 1540x1004)
523 KB
523 KB PNG
>>107805322
Proof I'm not trolling
>>
File: zimg_00037.png (1.78 MB, 960x1280)
1.78 MB
1.78 MB PNG
>>107805226
>>107805311
>>
File: ChromaRadiance-4464.jpg (111 KB, 848x1488)
111 KB
111 KB JPG
>>107805313
it is my opinion that it struggles quite a lot with that.

it also has more of a tendency to fuck up on 2.5 and 3d type footage if the perspective shifts than, say, wan
>>
File: file.png (1.64 MB, 1024x1024)
1.64 MB
1.64 MB PNG
>>107805322
And here is the plastic slop output
>>
Nell Fisher lora when
>>
>>107805347
https://files.catbox.moe/kr3svc.json
and use silveroxides latest model
if your gens dont look like
>>107804656
>>107804823
>>107804804
>>107804884
then you fucked something up or are shit at prompting
If that is the case then steal images from civ / discord like I said
>>
File: 00268-3152309623.jpg (601 KB, 1344x1728)
601 KB
601 KB JPG
>>
File: ComfyUI_temp_cuctu_00020_.png (2.98 MB, 1152x1728)
2.98 MB
2.98 MB PNG
>>
>>107805014
zit also got a nvfp4
>>
Use this
https://huggingface.co/silveroxides/Chroma-Misc-Models/blob/main/Chroma-2K-QC/Chroma-2K-QC-fp8mixed-blockwise.safetensors
You will need this:
https://github.com/silveroxides/ComfyUI-QuantOps
>>
File: ComfyUI_temp_cuctu_00021_.png (2.86 MB, 1152x1728)
2.86 MB
2.86 MB PNG
>>
>>107805043
is that without any loras?
>>
>>107805378
yes, chroma can do even vore / super complicated sex positions and the like natively
>>
>>107805372
Just load the model normally with https://files.catbox.moe/kr3svc.json ?
>>
File: ComfyUI_temp_cuctu_00023_.png (2.94 MB, 1152x1728)
2.94 MB
2.94 MB PNG
>>
File: ComfyUI_005121.webm (2.4 MB, 720x960)
2.4 MB
2.4 MB WEBM
>>
File: zimg_00043.png (1.55 MB, 960x1280)
1.55 MB
1.55 MB PNG
>>107805374
>>107805366
this is great. i kinda hate that no one ever posts their gens with the loras i upload on civ because i have no idea how well they work for other people
>>
File: ComfyUI_temp_cuctu_00024_.png (2.85 MB, 1152x1728)
2.85 MB
2.85 MB PNG
>>
>>107805404
benchod
>>
where is it
WHERE is it
RELEASE IT
>>
>>107805388
QuantizedModelLoader
and
Load CLIP (Quantized)
IF you use the int8 clip

That is the best 2K model currently I think, though I have not checked the latest update, new ones are pushed every week or two as it trains. People who have plastic stuff might be using the base 512 res model like idiots
>>
>>107805394
scail?
>>
>>107805404
Heavy Metal lora? Looks great
>>
@OP include the rabbit x horse artpiece in next collage pls thx
>>
File: ComfyUI_temp_cuctu_00027_.png (2.83 MB, 1152x1728)
2.83 MB
2.83 MB PNG
>>107805404
too many indians on that site
>>
>>107805411
再过两周
>>
File: ComfyUI_temp_cuctu_00029_.png (2.99 MB, 1152x1728)
2.99 MB
2.99 MB PNG
>>
File: ComfyUI_temp_cuctu_00031_.png (2.97 MB, 1152x1728)
2.97 MB
2.97 MB PNG
>>
>>107805416
no kitten
>>
>>107805423
very cool. prompt?
>>
File: ComfyUI_temp_cuctu_00034_.png (2.75 MB, 1152x1728)
2.75 MB
2.75 MB PNG
>>
File: radiance.jpg (101 KB, 848x1488)
101 KB
101 KB JPG
>>
File: zimg_00049.png (1.63 MB, 960x1280)
1.63 MB
1.63 MB PNG
>>107805423
fair enough, just nowhere else to post

>>107805418
great eye anon, just the first chapter
>>
>>107805453
all of these are fantastic btw. very clean

nu thread
>>107805470
>>107805470
>>107805470
>>
>op without rentries
>immediately goes back to shilling his shitty UI posing as a 3rd person
lmao



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.