[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>107770086

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg
>>
Blessed thread of frenship
>>
summoning the basedjak lora baker
i have a question
>>
How can I keep ZIT from blurring the background?
And how can I make ZIT work properly for img2img?
ZIT is so good but so annoying at same time...
>>
>tranibake
>>
>>107772816
>How can I keep ZIT from blurring the background
describe the background better. nag helps because negative prompts are a thing with it on

>And how can I make ZIT work properly for img2img?
it works the same way as any other model. cumfart might just be fucking up
>>
File: file.png (1.64 MB, 895x1258)
1.64 MB
1.64 MB PNG
>>
>>107773001
every iteration has been better and better. nice work
>>
File: 00006-2362664526.jpg (806 KB, 1270x1672)
806 KB
806 KB JPG
>>
>>107773001
>>107773021

i made this on the third though?

someone is botting image reposts?
>>
File: 1763723743746133.jpg (613 KB, 1248x1824)
613 KB
613 KB JPG
>>
File: w.png (523 KB, 1024x1024)
523 KB
523 KB PNG
is this the real thread then?
>>
blessed thread of frenship
>>
>>107773182
nope, the posts above you are repost stolen from previous threads or from civitai examples. he doesn't actually gen stuff unlike the other schizo
>>
/aicg/ also got nuked.
I wonder if tran also...
But maybe not connected.
>>
>>
File: 1763723743746134.webm (512 KB, 1280x1408)
512 KB
512 KB WEBM
Does this make your pp tingle?
>>
>more reposts
>>
whats the best way to test several loras in comfy similar to the xyz plot in 1111? Either model agnostic or for sdxl
>>
File: 1766605656331.png (1.6 MB, 1368x1784)
1.6 MB
1.6 MB PNG
>posts this to advertise their lora
>get death threats from you know who
>removes it
really makes you think
>>
File: 253654.png (1.32 MB, 1024x1536)
1.32 MB
1.32 MB PNG
>>
File: 1737005802701752.png (22 KB, 873x100)
22 KB
22 KB PNG
this post will never not make me laugh
>>
>>107773351
qrd
>>
File: 35867896.png (1.92 MB, 1024x1536)
1.92 MB
1.92 MB PNG
>>
>>107773369
Why do you even want z base anyways? What can that do that you can't do right now?
>>
>>107773369
I made this.
I suppose I should take the L?
>>
File: zimg_00024.png (1.7 MB, 864x1280)
1.7 MB
1.7 MB PNG
>>107773378
just finished baking it
>>
>>107773387
training is uspposed to be more realiable
dunno

>>107773392
we're all taking the L...
>>
>>107773392
no you didnt it was me asshole
>>
Just remember 2 unemployed losers sat in a thread shitting on them just to change the OP which doesn't change the fact everyone laughs at them.
They will also lose sleep to defend their internet honor while everyone else lives life
>>
>>107773260
yeah tts would be fine. anyone know any text to speech voice ai local programs?
>>
>We want z base NOW
>Why?
>U-uh hmm yeah
Bravo
>>
File: 52.png (1.91 MB, 1088x1072)
1.91 MB
1.91 MB PNG
>>107773482
better quality + trainable, no?
>>
Hey, why is my ComfyUI crashing just after this:

VAE load device: cuda:0, offload device: cpu, dtype: torch.bfloat16
Requested to load ZImageTEModel_
loaded completely; 95367431640625005117571072.00 MB usable, 7672.25 MB loaded, full load: True
CLIP/text encoder model load device: cuda:0, offload device: cpu, current: cuda:0, dtype: torch.float16

I have a GTX 5060 TI 16gb, and I can use any other UI besides this one.
>>
>>107773501
buy a 6000, worked for me
>>
File: zit img2img.jpg (481 KB, 800x1140)
481 KB
481 KB JPG
>>107772919
>it works the same way as any other model. cumfart might just be fucking up
I use Forge Neo, not Comfy. ZIT gives bad results in img2img, the whole picture is blurry, dull, and too simplified even in high resolution, unless I set the denoising above 0.85, which is then not img2img anymore but just txt2img.
>>
>>107773501
>95367431640625005117571072.00 MB usable
damn, i'm jelly
>>
File: 0429.png (1.77 MB, 528x2000)
1.77 MB
1.77 MB PNG
>>107773525
>>95367431640625005117571072.00 MB usable
Using a botnet's worth of pcs to distributively generate 1girls
>>
>>107773525
i wish :( it simply crashes after it finds out i dont actually have all of this

and i ve been struggling for days to try and fix this sob
>>
File: fuck yiou ldg.png (1.04 MB, 896x1152)
1.04 MB
1.04 MB PNG
>>
>>107773501
You seem to be affected by this:
https://github.com/comfyanonymous/ComfyUI/issues/11332
>>
File: 00985-760194552.png (1.12 MB, 896x1152)
1.12 MB
1.12 MB PNG
>>
Have you found any real world practical applications after learning image diffusion or is it just something to kill time with?
>>
>>107773590
do you consider masturbation practical or something to kill time with?
>>
File: file.png (274 KB, 1291x1798)
274 KB
274 KB PNG
It's coming. Looks like they cooked.
>https://github.com/Lightricks/LTX-2
>>
>>107773566
this seems to be it, how weird

im surprised not a lot of people are having this problem, this has been bugging me for so long now
>>
>>107773590
generative ai should be leveraged by artists but they are too stupid and proud to use a tool that would multiply their productivity

some people post about making assets for video games
but they get autistic and fixated on making pixel character sheets
>>
>>107773413
are you that xixxix guy?
what kinds of images did you use in the data? The actual porn or just the ladies posing?
>>
>>107773590
i trained a zit lora on myself and generated professional headshots instead of going to a studio lol
the problem is that i have generated and looked at so many pictures that i cannot tell which ones are close to reality anymore. i'm addicted to creating the perfect picture
>>
>>107773608
yay new toy
can't wait to generate memes on it
>>
>>107773608
>19b
yikes
>>
>>107773605
Don't get me wrong, I had my fun generating that, too.

>>107773628
>generative ai should be leveraged by artists
That's exactly what I would do if I could draw. At least to get an outline of a character/objects as a starter. I think if you're a smart dude and drawing as a job, you would try to do this to save some time.
>>
>>107773661
wdym? it's smaller than flux 2 which is an image model
>>
>>107773608
>19b
>slop dataset
>realistic bias
how is this any different than what we got? this screams benchmaxxed slip
>>
>>107773662
>That's exactly what I would do if I could draw. At least to get an outline of a character/objects as a starter. I think if you're a smart dude and drawing as a job, you would try to do this to save some time.
Mediocre porn artists are already doing that. And have upped their game 100X as a result.
This truly is the age of the mediocre porn artist
>>
File: zimg_00061.png (1.85 MB, 864x1280)
1.85 MB
1.85 MB PNG
>>107773642
no xixxix is way funnier than i am. i just grabbed a bunch of high res scans without any nudity because i didn't want to deal with adding new concepts. i'll post this up on civit over the next few days.
>>
>>107773590
I've used it to repair/improve photos for myself, family and friends in ways I couldn't with the cloud models and I also don't want to share photos of myself and family with the big tech companies. I like to keep up and tinker with the latest advancements just so I have a deeper understanding of the tech and won't fall behind. I enjoy reading the papers along with the new releases.
>>
File: 1739267877318707.jpg (2.55 MB, 1872x2736)
2.55 MB
2.55 MB JPG
>>
Is there a guide for qwen edit image
>>
>>107773835
Who's dat?
>>
File: ComfyUI_00266_.png (1.49 MB, 896x1152)
1.49 MB
1.49 MB PNG
>>107773608
Qat-q4_0-unquantized is interesting, so does that mean the text encoder can remain usable even at lower quants?
I am not running this on my vramlet setup but maybe if they nunchaku it. (Yes I know copechaku is on life support.) Fuck ~9.5 b weights + whatever calculations being done might be too much for 12gb vram.
>>
>>107773857
https://docs.comfy.org/tutorials/image/qwen/qwen-image-edit-2511
did you try this one already? should get you started
>>
>>107773524
it has a style problem. you need loras and controlnets for restyling
>>
>>107773835
mods really need to do something about you already
>>
File: 1756782402066835.png (6 KB, 214x124)
6 KB
6 KB PNG
Okay so hear me out. Lets put a button, that brings up a button that lets you clear your job queue, and if you press that button, it brings up another button that asks you if you want to do that.
>>
>>107773762
I thought it was Bill Clinton for a sec
>>
File: file.png (1.84 MB, 1152x896)
1.84 MB
1.84 MB PNG
>thread has rentries, schizo A has meltie
>thread doesn't have rentries, schizo B has meltie
how do we solve this paradoxical situation?
>>
>>107773662
>>107773717
Even during the old days, artists cheated with this device:

https://en.wikipedia.org/wiki/Camera_lucida

There are other techniques too. The artists that say you must limit yourself to arbitrary techniques are gaslighting other people and are most likely also cheating.
>>
>>107774054
I only troll ranfaggot in threads with the rentries. it's just him screaming into the void when they aren't present. you can mail me the medal of honor I deserve
>>
>>107774054
Get mods to stop being useless sacks of shit and get them to range ban ani, debo and ran.
>>
>>107772643
I'm not going to count them all, but it looks like there must be over 12 gens in the OP. A participation trophy for literally nobody who posted is disgusting. Not a serious thread.

If you are a talented genner and you'd like to be actually judged in a thread for grown-ups, check out /adt/.
>>
>>107774111
don't think ani and debo have anything to do with it
>>
is it possible (without writing custom nodes, but fine to use custom nodes) to make a workflow that runs a object detection model (like qwen or something) to list objects in a scene, then somehow detect each object individually (sam?), then tag (anime so wd14), then inpaint the dected object using the generated prompt, iteratively over the same image so basically automaticall inpainting everything "significant" in the image... if yes, how, the bit idk is the inpainting each thing individually on the same image
>>
>>107774096
I love how you always make esoteric dog whistles using insults used on you 6 month to a year ago.
Talk about being made of glass
>>
File: file.png (2.22 MB, 1152x896)
2.22 MB
2.22 MB PNG
kinda reminds me of Settlers 3
>>
>>107774151
>detect each object individually (sam?)
You need to detect and make masks for them. I have no idea how but it should be possible.
>then inpaint the dected object using the generated prompt
What prompt? Generated from what?
>>
>>107773661
LTX VAEs tends to use higher compression that makes it faster.
>>107773678
They are our only hope to get a local video model with audio/video that doesn't rely on China's aesthetic benchmaxxing.
>https://investor.shutterstock.com/news-releases/news-release-details/lightricks-partners-shutterstock-video-training-data-advance
>>
>>107774278
>They are our only hope to get a local video model with audio/video that doesn't rely on China's aesthetic benchmaxxing.
Zai is throwing their hat into AR diffusion models. glm is a masterpiece
>>
Wings are a pain in the ass
>>
>>107773887
>38gb
Can this run on 12gb vram and 32gb ddr5?
>>
>>107774278
wake me up when they partner with pornhub
>>
>>107774361
just pick a model that works for your hardware.
as a 8gb ramlet for example, i use qwen-image-edit-2511-Q6_K.gguf (16gb)
https://huggingface.co/vantagewithai/Qwen-Image-Edit-2511-GGUF/tree/main
>>
hi chat why did last thread had 300 posts and 30 images
>>
>>107774307
>https://huggingface.co/BAAI/Emu3.5-Image
The problem is that nobody can run them. Zai makes big models so don't get your hopes up.
Right now AR image models need to be big or else it turns into a slopfest.
>>
>>107774430
Thank you anon, I still have a hard time figuring out stuff
I mean it's running but damn it is it slow
>>
>>107774437
yeah, qwen is bloatmaxxed. still have to wait for something smaller and optimized
>>
File: file.png (1.58 MB, 1152x896)
1.58 MB
1.58 MB PNG
it's summer 2026
z-image base now has hundreds of finetunes
wan 2.6 weights released because they found no commercial use for it and it's been so long
imagine the future we could have
>>
>>107774507
>no commercial use for it and it's been so long
that's all saas models tho and that didn't stop the other companies
>>
So has anyone tested LTX-2 on the API before? How does it compare to 2.2? (But I suppose it might make more sense to compare it to 2.5) Is the audio robotic or good?
I assume it doesn't know what's inside the pants as always.
Or should I give it a go at Fal?
>>107774507
Is Wan 2.6 particularly good?
2.5 wasn't exactly mindblowing.
>>
>>107774532
>Is Wan 2.6 particularly good?
idk lol, i didn't even check what it looks like since they said it's API only
>>
>>107774437
are you using the 20 step or the 4 step workflow? i prefer the latter
>>
>>
guys beware, do not use software called anistudio, I installed it and it formatted my PC, it's literal malware.
>>
>>107774497
its not like im gonna post my gens in this dogshit thread too btw
>>
File: 1766102423868319.jpg (76 KB, 750x694)
76 KB
76 KB JPG
bros, ltx 2 almost ready. goodbye wan 2.5
>>
>>107774789
i hope it's going to be good and run on consumer hardware... but do we already know this is the case?
>>
>>107774813
nta. it's 19B so it will run fine, but whether it's actually good.. no one knows until we try it
i'm reserving my optimism until i see results
>>
File: 291.png (977 KB, 1024x1024)
977 KB
977 KB PNG
>>
in anticipation of ltx2 release, do the mp4s posted to 4chan support audio as well? or will we have to host our meme gens on catbox
>>
>https://xcancel.com/BrentLynch/status/2008030132083290309
>>
>>107774827
let's hope it'll do some thing(s) well then. it's not like the LTX team was bad at what they're doing.
>>
>>107774934
I don't believe that you don't know the answer to your question already.
>>
>>107774934
most boards including /g/ won't do audio and that is probably a wise decision
>>
>>107774952
>this month
so two more weeks eh. i miss the times when shit just dropped instead of months of edging and promoting and github repos with huggingface links that 404
>>
>>107774952
sex
>>
>>107774961
>>107774968
shame. i'll be honest i am a bit of a boomer when it comes to 4chan i only come here to ldg occasionally to post some gens
>>
>>107774789
>>107774813
>israeLTX2 releases
>chinese dragon rises and drops z base
>chinese decide to finally release a gimped version of wan 2.5
>LTX forgotten again within the week
>>
>>107774952
So it's basically wan with voice, and a better animation style?
Once nsfw lora are out (if it's even possible), we can maybe finally ditch the two samplers of wan2.2 to this.
>>
>>107774952
I just hope they don't have a "WE WILL DMCA STUFF WE DONT LIKE" like the stupid BFL license allows them to.
>>
don't give a shit about LTX where base
>>
>>107774995
i don't really care for audio that much desu. also did they say how long the genned videos can be?
>>
>>107775012
>i don't really care for audio that much desu
It's a nice bonus, honestly I thought I wouldn't care but it was really nice whenever I've seen grok or early sora 2 do it.

>also did they say how long the genned videos can be?
No.
>>
>>107774952
>you don't need a monster rig to run models anymore
>19b

>>107775029
looks like 6 seconds
>>
>>107774990
you can post with audio on catbox or the fediverse or wherever

also i think the TTS we hear in LTX2 isn't that pleasant yet, so in that sense I'm not sure we're missing out on much... even disregarding that for most 4chan boards it'd be so much trolling
>>
File: file.png (2.58 MB, 1280x1440)
2.58 MB
2.58 MB PNG
>>107775042
19B means it should fit < 24GB in fp8, and probably work on less than that with offloading. i don't see the issue, anons want both quality but also vramlet compat, you can't have both

>>107775009
>>
>>107774118
/adt/, their thread is literally dead and some weeb schizo is shitting here in vendetta. When you think about it, it all makes sense. We need to tell the mods to ban /adt/ from /g/ and move it to /jp/ with the 2hutroons
>>
>>107774995
>So it's basically wan with voice, and a better animation style?
With audio. It does music, sound effects, etc.
>>
>>107775042
>looks like 6 seconds
if that's the limit then it's a nothingburger. Wan is already good enough quality and has tons of loras, the only drawback is length, and we are starting to have workarounds for that like SVI 2.0 Pro
>>
File: 1717027845733.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>107775009
Dumb cat, they said if you build it, it will come, just fucking build it.
>>
>>107775009
we already have an interesting z model, but nothing serious on the audio side
>>
>>107775241
vibevoice is a cool albeit dead idea
>>
File: 1635091041253.png (1.51 MB, 1024x1024)
1.51 MB
1.51 MB PNG
What I really want is for someone to train the golden noise NPNet for ZIT. It really improved SDXL, so I expect it would do the same here.
>>
>>107775259
>To mitigate deepfake risks and ensure low latency for the first speech chunk, voice prompts are provided in an embedded format. For users requiring voice customization, please reach out to our team. We will also be expanding the range of available speakers.
yeah nah i'm good
>>
>CES Nvidia presentation
>no consumer gpus announced
>cloud shilling
>boasts the most models released
does anyone here seriously use Nvidia models or their cloud services? I feel like the entire presentation is just blowing hot air and trying to usher in the end times of home computing
>>
File: AiSlop Dale.png (789 KB, 778x816)
789 KB
789 KB PNG
Well, I think I proompted enough AiSlop Dale portraits.
Maybe it's time to play the game.
Still stuck in XL...
>>
>>107775418
Next project: Low brow sitcom with heroes III characters? If the LTX-2 thing is good.
>>
>>107775290
what model(s) are using that?
>>
>>107775374
it was never a presentation for us, it's for investors
>>
>>107775241
there are a number of good tts models at least
>>
>>107775374
>no consumer gpus announced
We knew the TI/Super were cancelled anyway.
Next CES is when they'll probably announce the 6xxx.
>>
>>107775374
>I feel like the entire presentation is just blowing hot air and trying to usher in the end times of home computing
It is the end times of home computing. And anyone pretending it is not is a retard.
Consumer products will fade out. All great manufacturers are focusing on server markets now. Why? Because there's so much more money. And servers update frequently in order to always bring the best service to consumer, while there are consumers that use a 1070 for 15 years and are happy about it. You can maybe earn billions with consumer products, but with servers and clouds and hardware as a service you can earn trillions. That's a thousand times more.
So yeah guess what, you will own nothing and you will be happy so open up your wallets and start renting.
>>
>>107775446
Diffusion Models. I haven't heard of somebody training any more checkpoint specific versions so there's only the ones made by the researchers for SDXL, DreamShaper-xl-v2-turbo, and Hunyuan-DiT. I used it with both 3d SDXL models and Illustrious and it helped.
>>
THE END TIMES OF HOMO COMPUTING
>>
>>107775544
just like when phones replaced all pcs, and then tablets replaced all pcs, oh yeah
>>
>>107775600
You don't need more than a phone to access cloud. However, of course, you will pay as much for your -access-device- as you did for a full featured machine.
>>
when will local vid gen get good?
>>
https://vocaroo.com/117ySyp9NOQH
>>
>>107775656
never. it's all about CLOUD now
>>
>more than 50% of humans have pcs
good luck, cloud fart kek. openAI is already in a panic, and is trying by all means to ban users, just to gain bandwidth
>>
>>107775675
what model is that and can i easily take someones voice and make joi's?
>>
>>107775737
>and is trying by all means to ban users, just to gain bandwidth
what
>>
>>107775773
VibeVoice 7B
>can i easily take someones voice and make joi's?
probably. you can feed it a short wav of someone's voice and it emulates it quite well. for non standard voices and accents you might need to train a lora
>>
>>107775773
>>107775818
Personally I find chatterbox much better and faster.
>>
>>107775878
thanks for the tip. i've honestly not paid too much attention to the TTS space lately, just already had VibeVoice downloaded so that's what i used
i will check out chatterbox
>>
File: 00987-1875429724.png (1.21 MB, 896x1152)
1.21 MB
1.21 MB PNG
>>107775675
>two more weeks
kek
>>
>>107775818
is 7b the forbidden one or the smaller one that's still up
>>
>>107776154
the bigger one, they're still up but just moved to a community repo instead of being under microsoft
>>
it autosages at 300 posts right
>>
>>107776161
jeets are good for something at least kek. imagine if we never got the leak
>>
basewhenehwesab
>>
File: 3593.png (1.7 MB, 1056x1024)
1.7 MB
1.7 MB PNG
Did they intentionally avoid copyrighted stuff(zit)? though I feel something similar always happens with chinese models where they include a much greater proportion of chinese stuff so maybe some stuff gets neglected idk
>>
File: 1760081880370762.png (39 KB, 598x744)
39 KB
39 KB PNG
I don't feel like the seed is really randomizing because it doesn't change here, am I wrong
>>107774679
I am using the workflow on this >>107773887 which is set to 40 steps out the box but also says you can try with 20
>>
>get new GPU
>figure ill try full Qwen bf16 instead of fp8
>wait for output
>no difference
>at all

is this always the case with quantization? you barely notice a difference
>>
File: 8280282.png (1.47 MB, 1312x864)
1.47 MB
1.47 MB PNG
>>
>>107776538
New fp8 quantizations ("scaled") are very good at being near fp16 quality.
People are a bit outdated on this sadly.
>>
File: 1677880274397050.jpg (4 KB, 249x157)
4 KB
4 KB JPG
>civitai's top nsfw creators are working on zit models
>still weird nudity
...
>>
>>107776566
context? not up to date with current ai trends
>>
>>107776566
I thought I was only one having weird dicks and innie pussies
sad
>>
>>107776566
ppl are testing and coping
>>
difference between qwen edit image, flux 2 and z image turbo?
which one is better to edit images
>>
>>107776566
you cannot teach it good looking genitals/nipples without a finetune
>>
>>107776566
>>civitai's top nsfw creators are working on zit models
wow im so excited for the bobvagene cancer mixes
>>
>>107776402
It knows a handful of brands. More than other local model probably.
>>
>>107776566
It's distilled and they didn't focus on genitals for the distillation.
Wait for the base.
Just two more weeks.
>>
https://files.catbox.moe/bqh7en.flac

Enjoying your base model you fucking retards?
>>
>>107776852
tomar is that you?
>>
File: 3024700.png (1.67 MB, 1024x1024)
1.67 MB
1.67 MB PNG
prompt:
>>
>>
>>107775009
Prompt for that cat?
>>
File: 793881385.png (1.1 MB, 480x1408)
1.1 MB
1.1 MB PNG
>>
>>
File: 64.png (1.55 MB, 1456x816)
1.55 MB
1.55 MB PNG
>>
File: 600615.png (1.62 MB, 704x1632)
1.62 MB
1.62 MB PNG
>>
File: ComfyUI_00054_.png (1.45 MB, 832x1216)
1.45 MB
1.45 MB PNG
>>
yo guys im debo lol
>>
>>
>>
>>107777032
this solves the hand problem
>>
>>
>>
>>
File: 962.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>
>>107777159
whats behind the door
>>
File: 851.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>
comfyanon here, just wanted to say I hate minorities
>>
https://files.catbox.moe/1r5oit.flac

Stardate: 46359.2

Scene: The Ready Room of the U.S.S. Enterprise-D. Captain Picard stands by the replicator, a steaming cup of Earl Grey in hand. He turns as the door chimes and slides open to reveal Lt. Commander Data, who enters holding a PADD containing the latest subspace analytics from the Federation Archaeology Council's digital archives.
>>
File: 692.png (1.42 MB, 1024x1024)
1.42 MB
1.42 MB PNG
>>107777171
free food
>>
>>107777200
god I wish because then comfy wouldn't hire them
>>
File: AnimateDiff_00010.webm (274 KB, 576x1024)
274 KB
274 KB WEBM
>>
>>107777236
chinese and indians are world majority chud
>>
File: ComfyUI_278920_.png (1.48 MB, 1024x1024)
1.48 MB
1.48 MB PNG
https://huggingface.co/black-forest-labs/FLUX.2-dev-NVFP4/tree/main
>>
>>107777403
fp4 is 50 series stuff right? the one series that doesn't fucking need it
>>
>>107777403
buy an add
>>
Does any forge support multi gpu?
>>
>>107777100
>>107777112
i member
>>
File: ComfyUI_09962_.png (1.23 MB, 944x1280)
1.23 MB
1.23 MB PNG
>>
>>107777403
Who is Flux.2 for anyway
>>
File: ComfyUI_09941_.png (1.51 MB, 944x1280)
1.51 MB
1.51 MB PNG
>>
>>107777530
not true
my penis was in a vagina for 30 seconds
and then i went soft
>>
File: 1762979644431766.png (1.73 MB, 1152x1472)
1.73 MB
1.73 MB PNG
>>
>>107777499
nice
>>
I randomly found gens i posted here on other websites which blew my fucking mind when i saw it
i posted those gens a year prior
you never know who got your gens saved
>>
>>107777621
I once saw someone use my gens as their wallpaper in a desktop thread. That was pretty cool, I thought the gens weren't very good though kek.
>>
My gens get reposted all the time on other boards because I'm a meme master.
>>
>>107777403
how about your fix your memory leaks you dumb fuck
>>
>>107777697
don't use custom nodes :^)
>>
File: 1740341938919074.png (1.02 MB, 1000x1504)
1.02 MB
1.02 MB PNG
>>107777530
>>
>>
What is the best light video workflow?
I had one decent before re installing but can't find it didn't survive the wipe sadly
>>
>>107777750
kek what model is this
>>
base now
>>
>>107777762
https://civitai.com/models/1601811/let-him-cook-i-meme-pose-concept-i-illustriousxl-and-noobai?modelVersionId=1812701

WAI
>>
>>107777741
I don't use custom nodes, just like you recommend, I'm tired of comfyui crashing without error logs anytime the vae is decoding a video, whats your excuse next? buy more ram? lol

your piece of shit repo will be replaced this year :)
>>
>>107777403
who gives a fuck? no one cares about flux 2 and they believe they now want flux 2 lobotomized in 4bit?
>>
>>107777834
>I'm tired of comfyui crashing without error logs anytime the vae is decoding a video
use Vae decode (Tiled), you simply don't have enough vram to decode all at once
>>
>>
>>107777621
Google images used to scrape images hereabouts. ( long time ago) They stopped. I found some of me images theree.
>>
File: ComfyUI_temp_cktom_00001_.png (2.16 MB, 1152x1920)
2.16 MB
2.16 MB PNG
>>
>>107777745
>2027
too soon. our technologies are shitty and our local models are bad. add at least 20 years, 2047, lol.
>>
why would it ever take 2 months to release the base model? there is no way in hell they're still training and working on it. i was only memeing before when i said it was never coming out but now it seems like the obvious truth.
>>
File: ComfyUI_temp_tpsca_00002_.png (2.56 MB, 1152x1920)
2.56 MB
2.56 MB PNG
>>
Can I run this on amd?
>>
finally got SVI working and it's good but there's a flaw. if one of the middle videos is messed up you'll need to gen it again and everything after that.. so basically you need to gen videos one by one and add more nodes as you go.
>>
>>107778038
Xl?
>>
>>
>>107778063
can you export/import latents?
>>
Am I (doubly) retarded, or is there no way to add more training to an existing LoRA with ostris? My plan was to train on narrow cases of my concept first and then branch out into the general case. Resuming from good checkpoints since some later ones can end up shit.
>>
>>107778026
It's not coming. The only thing coming down the pipeline related to z-image is lodestones' wip dedistilled scrambled egg version.
>>
>>107778119
cute tranny
>>
>>107776538
Yes, and it's been that way for ages. Quantschizoes are obsessive gaslighters. Also, ggufs are thrice slower.
>>
>>107778174
I am looking forward to only using furfag artists
>>
>>107778121
idk
desu it's not that big of a deal. logically you'd end up doing it this way anyway.
>>
>>107778167
Should just be edit job, increase steps, and or change/add dataset in AI Toolkit as long as you still have the optimizer.pt and the latest trained LoRA.
>>
>>107777991
>>107778038
low-cut panties >> standard panties >> shit >> high-rise panties
>>
>>107778291
The problem is that it only lets me resume from the last checkpoint, which in my case isn't the best the one since it's gotten a little crunchy.
>>
File: ComfyUI_temp_tpsca_00009_.png (2.36 MB, 1248x1824)
2.36 MB
2.36 MB PNG
>>107778174
We're never getting anything good again, just take a look what "normal" people have done with generative AI image/video so far, viral slop crap, misinformation, scam, deepfaking porn like crazy, stealing content, grifting. I have yet to see someone put good use of AI, because so far I have failed to see it.

Every time there is a new AI tool that gets released and allows the public to do something fun, people go crazy trying to do the most wrong stuff ever, Elon gave people an edit model, all they did was try to generate porn of other people public photos, no wonder big corpos don't want people to own GPU, RAM, whatever. Just compare what people were posting on AI subreddits a year ago vs now, especially the nsfw ones, now is just people advertising stuff, spamming comments with their affiliate links, just crap after crap, No wonder these AI labs think twice before releasing stuff, especially open source stuff.
>>
File: ComfyUI_temp_tpsca_00010_.png (3.37 MB, 1248x1824)
3.37 MB
3.37 MB PNG
>>
>>107778305
this anon makes a good point
>>
did not read all that doomerism
>>
File: ComfyUI_temp_tpsca_00011_.png (3.93 MB, 1248x1824)
3.93 MB
3.93 MB PNG
>>
>>107777403
I don't think I have a compatible gpu.
>>
I need them to release z-base before the AI bubble pops and they go out of business
>>
File: ComfyUI_temp_tpsca_00012_.png (3.63 MB, 1248x1824)
3.63 MB
3.63 MB PNG
>>107778372
Compare what these threads used to be a year or two ago, anons sharing prompts and tricks, and advice, it was a kind of nice community desu, a1111 was born here, so was comfy... it was good until the schizos avatar fags took over? now is just a cesspool of shit with anons shitting up the threads with their drama, people complaining when a new model get released and they cannot do anything with it. Things aren't get any better anon, if you got into AI in the last months, you're fucked, computer parts are expensive, cloud solution suck ass, there is no room for you, you're late, sorry
>>
>>107778337
Could probably just rename the checkpoint you want to work off of to the name of the last checkpoint. Things may get fucky with training because the optimizer state is for the latest model and not a previous step count though.
>>
File: ComfyUI_temp_tpsca_00013_.png (3.89 MB, 1248x1824)
3.89 MB
3.89 MB PNG
>>
Anons anons the sky is falling
>>
>>107778455
kek
>>
>>107778372
Sounds like Elon is getting the medal of freedom for giving the USA AI dominance and an improved GDP for now?
>>
>>107778167
Rename the finished lora to the last step it finished at. Example:

1girl_000001000.safetensors

Create a new job with steps greater than the steps of that lora. For example, 2000 steps. Start the job, then pause it immediately after the folder is created. Copy that lora into the folder and resume the job. Training will start at the last step of that lora.

Alternately, you can use a hex editor or lora metadata editor to edit the steps in the lora if you want to start at an earlier step. For some odd reason, Ostris' developer decided to store the step information and other Ostris-specific info in the metadata of the lora.
>>
>>107778119
just make a woman. it's ai. why respecting reality, eh
>>
>>107778483
Iirc even if the filename is the expected with the highest step count in a new job the trainer will start from there and run until total steps.
>>
File: ComfyUI_temp_cktom_00017_.png (2.4 MB, 1520x1040)
2.4 MB
2.4 MB PNG
Chroma is such a good model, too bad it filters retards like you
>>
>>107773608
chunky bastard
>>
File: ComfyUI_temp_tpsca_00018_.png (2.12 MB, 1520x1040)
2.12 MB
2.12 MB PNG
>>
You will never get Z-base. You will be stuck with 200T models that you can only run at Q1 and that have been trained entirely on synthetic data. You will endure another 1,000 years of SDXL.
>>
File: ComfyUI_temp_tpsca_00019_.png (2.65 MB, 1520x1040)
2.65 MB
2.65 MB PNG
>>107778556
kek, and your only hope is a furry discord tranny, you're fucked, AI was never for you
>>
>>107778511
chroma or spark?
>>
Not officially out yet but: https://huggingface.co/Lightricks/LTX-2/tree/main
>>
>>107778305
stick your tier lists up your ass, homo
>>
>>107778566
Lodestones has been hired by Alibaba to improve their API services. Zeta Chroma isn't coming.
>>
>>107773297
That's just a real image dude
>>
File: ComfyUI_temp_tpsca_00020_.png (2.57 MB, 1520x1040)
2.57 MB
2.57 MB PNG
>>107778568
chroma of course, spark is just a glorified lora
>>
File: ComfyUI_temp_cktom_00021_.png (2.2 MB, 1520x1040)
2.2 MB
2.2 MB PNG
>>
>>107778524
I know it's been almost 6 months since wanx2.2, but I still can't see ltx catching up
>>
File: ComfyUI_temp_tpsca_00022_.png (2.29 MB, 1520x1040)
2.29 MB
2.29 MB PNG
>>
https://blogs.nvidia.com/blog/rtx-ai-garage-ces-2026-open-models-video-generation/?linkId=100000401205054

Up to 3x performance and 60% reduction in VRAM for video and image generative AI via PyTorch-CUDA optimizations and native NVFP4/FP8 precision support in ComfyUI.
RTX Video Super Resolution integration in ComfyUI, accelerating 4K video generation.
NVIDIA NVFP8 optimizations for the open weights release of Lightricks’ state-of-the-art LTX-2 audio-video generation model.
>>
feeling uninspired desu
>>
File: ComfyUI_temp_tpsca_00023_.png (2.68 MB, 1520x1040)
2.68 MB
2.68 MB PNG
>>
File: ComfyUI_temp_tpsca_00024_.png (2.93 MB, 1520x1040)
2.93 MB
2.93 MB PNG
>>107778633
tbqh comfy has been running faster after the latest updates, I have totally dropped quants unless I can't fit the model (flux2)
>>
>>107778674
on older hardware it's slower than neoforge still. this is a specific node too so I doubt you were using that
>>
>>107773235
Morphing is back? 90s flashbacks
>>
LTX2 will be censored as shit, so absolutely irrelevant.
>>
https://www.reddit.com/r/StableDiffusion/comments/1q5a66x/ltx2_open_source_is_live/
Comfyui not updated yet though it seems

It is made for 5000 series btw, nvfp4 support
>>
File: 1757841445181668.png (34 KB, 720x262)
34 KB
34 KB PNG
ayo you cant be fr comfyniggs...
>>
File: 1745691635998618.png (56 KB, 747x405)
56 KB
56 KB PNG
>>107778732
https://www.reddit.com/r/StableDiffusion/comments/1q4l42p/if_youre_getting_different_zimage_turbo/

funniest shit this is still not fixed nor accounted for despite the popularity of both zimage and comfy lmaooooooooooooooooooooooooooo
>>
>>107778732
BUT MUH NVIDIA NODE!!! BUT MUH AMD ON BINBLOWS!!!!!
>>
>>107778732
>AI-jeetdditor post
>Provides no image comparission of his claim whatsoever
>probably is just a shitty trained lora
>>
>>107778732
>>107778739
>this applies to a small amount of people
>people who update comfy
Lmao nice roast
>>
>>107778754
dont engage with shitposters
>>
>>107778732
>>107778739
>fix a bug
>no he actually bugged it!
are you fucking retarded? oh right I forgot, you just want to discredit to push your shitty UI lmao.
>>
>>107778764
ani isn't a plebbitor
>>
File: ComfyUI_temp_tpsca_00027_.png (2.85 MB, 1824x1248)
2.85 MB
2.85 MB PNG
>>
>>107778777
he's present in socials, as the failed dev he is searching for clout in any space he can.
>>
>>107778788
reddit profiles are public. you can see for yourself
>>
LTX 2
>Use gradient estimation - Reduce inference steps from 40 to 20-30 while maintaining quality (see pipeline documentation)
https://github.com/Lightricks/LTX-2/blob/main/packages/ltx-pipelines/README.md#denoising-loop-optimization
>>
File: ComfyUI_temp_tpsca_00028_.png (3.47 MB, 1824x1248)
3.47 MB
3.47 MB PNG
>>107778788
I just feel bad for him honestly, I'm an old fag in these threads, and I remember when Ani was posting how animatediff was going to revolutionize anime for ever, how (he scammed) some japanese business men to invest into his crap and of course he never delivered, he just the kind of guy who starts a lot of stuff that seems like a good idea but always fails to deliver
>>
>>107778807
yeah sure retard. why haven't you pushed a commit in the last 2 weeks? bad looks for your shitty imgui project lmao
>>
Once a video clip is generated, videos are upscaled to 4K in just seconds using the new RTX Video node in ComfyUI. This upscaler works in real time, sharpens edges and cleans up compression artifacts for a clear final image. RTX Video will be available in ComfyUI next month.

To help users push beyond the limits of GPU memory, NVIDIA has collaborated with ComfyUI to improve its memory offload feature, known as weight streaming. With weight streaming enabled, ComfyUI can use system RAM when it runs out of VRAM, enabling larger models and more complex multistage node graphs on mid-range RTX GPUs.

The video generation workflow will be available for download next month, with the newly released open weights of the LTX-2 Video Model and ComfyUI RTX updates available now.
>>
>>107778764
if there was a bug which initially showed people one thing and made them set up workflows expecting certain results, then you cant just silently fix that bug some time later and not notify people within the UI properly while fucking up the end result of all previous workflows
>>
File: 1736917471489827.png (811 KB, 1504x1000)
811 KB
811 KB PNG
>>107778816
>>
>>107778835
how is it silent? is he supposed to call all of us individually?
>>
>>107776561
>New fp8 quantizations ("scaled") are very good at being near fp16 quality.
Oh wow when did this change? I wonder how fp8 gguf on Chroma compares in speed
>>
>>107778835
you could argue for a 'what's new' popup/modal once you open comfy that checks against what version you run last and gives you an overview of new features/bug fixes (would actually be nice and is a standard in many other places), but there's no such feature currently, and it doesn't make sense to give a big warning just for a small feature.
>>
fresh when ready
>>107778862
>>107778862
>>107778862
>>
>>107772643
what's the best model / lora for making Asian girls? I'm downloading a few right now, but maybe you guys have a preference.
>>
>>107778853
>Chroma1-HD-fp8_scaled_original_hybrid_large_rev2.safetensors
i'm using this one and it's great. combined with one of the lighting loras, rank 64 i think
>>
>>107778865
https://civitai.com/models/2168935/z-image-turbo
>>
>>107778863
why this early?
>>
>>107778863
thanks 4 bake bro
>>
>>107778875
we're competing against (you) to have a proper thread
>>
File: Untitled.png (70 KB, 1014x573)
70 KB
70 KB PNG
>>107778861
yeah if only there was some sort of modal
>>
File: ComfyUI_temp_tpsca_00030_.png (3.38 MB, 1824x1248)
3.38 MB
3.38 MB PNG
>>107778821
Don't believe anything until you test it, remember comfy shilled and hyped SD3 for MONTHS and it was fucking shit, he's a corpo sellout, of course he wants you to use comfyui

Nvidia also kept posting about Tensor RT and it was also shit
>>
>>107778875
this thread we are posting in was baked much much earlier
>>
>>107778441
>>107778483
>>107778499
Thanks frens. So far as it seems, this "works". I'll report back when I've had more than just 250 new steps trained.
>>
>>107778887
the manager is optional retard
>>
>>107778889
>Tensor RT and it was also shit
it was great its just that nobody wants to make it work
>>
>>107778887
ironic considering the lora fix is in the same screenshot you just posted lmao
>>
>>107778875
it's ran, she wants to bring drama back to ldg and she will probably get it. we had such a cozy bread
>>
>>107778924
>we had such a cozy bread
obviously since you weren't screeching about ran and defending ani the whole thread
and now you're going to proceed to screech about ran and defend ani the whole next thread
thanks for spamming
>>
>i will shit up the thread if it has the rentry links
mental illness as detailed in said links
>>
>>107778924
youre a literal nigger
>>
>>107778866
>Chroma1-HD-fp8_scaled_original_hybrid_large_rev2.safetensors
is it the same as Chroma1-HD-fp8mixed-final.safetensors ? Looks like the guy deleted that one
>>
File: zimg_00004.png (1.55 MB, 864x1280)
1.55 MB
1.55 MB PNG
>be me
>update my software since something changed
>open source btw, can see all changes line-by-line
>ignore all documentation and changelogs
>miss what has changed in the application

why is the developer retarded
>>
>>107778946
its (((him))) trying to stir up drama as usual
>>
>>107778957
Fuck off Ville Valo
>>
>>107778962
>>107778962
>>107778962
proper bake
>>
>>107778939
https://huggingface.co/silveroxides/Chroma-Misc-Models/tree/main/Chroma1-HD-flash-heun
found it here

run at cfg 1 with this lora:
https://civitai.com/models/2032955?modelVersionId=2300965
>>
>>107778863
when are we updating the ani rentry with his death threats?
>>
>>107778946
>millions of users have to read every single line of code in all software they use in all languages forever as it updates instead of a single dev detecting if you are on an old version with a huge bug and poping up a dialog box to notify you and save all that culumative time from millions of users by simply saying "hey, this shit works completely differently now than from what you came to expect from the previous versions"
Almost like /g/ is full of nocoding niggers who dont know what breaking change means and how it should be handled

And no, nigger, I'm not TrAni
>>
>>107779004
by definition this is not a breaking change
>>
>>107778977
nevermind that's the wrong one
>>
>>107778727
>Comfyui not updated yet though it seems
Just use their custom plugin, as documented?
I'm still downloading the model.
>>
>>107776497
there should be another subgraph with 4 steps and the lighting lora, which is way faster and should be good enough for testing prompts first
>>
>>107776561
>>107778853
You're being lied to.
fp8 scaled is 5x worse than Q8_0 GGUF
rentry.org/QUANTIZATION_ANALYSIS



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.