/g/ - Technology


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107510691

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>best way I can describe using comfyui moving forward is like sitting on a 12" dildo leaving it in and saying "fuck it, I'm gay now" instead of pulling it out with some dignity and saying "what's the next steps"
>>
>>107512146
>>Maintain Thread Quality
your post is full of troons. Literally shit tier "women".
>>
>>107512205
noone can spitebake as fast as tRANny
>>
to understand how worthless troons are, I've had troons hit on me. Actual females look at me with contempt.
>>
I love how trashy Ovis is.
>>
File: Ovis_00008_.png (1.34 MB, 1024x1024)
zit is way better at feet.

I still love the campy shittiness of ovis. It's awe-breaking and jaw-some.
>>
File: 1754924866305420.png (2.5 MB, 1920x1088)
>>
Wretched thread of mental illness
>>
File: 00018-2989516251.png (1.03 MB, 1216x832)
too sleepy. here's the catbox in case anyone asks
https://files.catbox.moe/4y2gbl.png
https://files.catbox.moe/jkzi5k.png
>>
>>107512246
muchas gracias senor
>>
File: 1749113318837660.png (403 KB, 1080x993)
>>107512239
Russian Favelas can be pretty cozy
>>
>>107512246
hands.

always check the hands.
>>
I still stand by auto forks just straight up looking better than cumfart outputs
>>
So ... now what?
Base confirmed never ever.
>>
>>107512198
>>107512234
>>107512239
>>107512246
>>107512254
no fucking way this is ai
>>
what's rk beta?
>>
>>107512259
it's ml actually
>>
>>107512270
makes since
>>
>>107512246
Thanks anon
>>
File: Ovis_00009_.png (1.22 MB, 1024x1024)
>>107512254
Not dachas?

>>107512234
>>
>>107512290
Can any model match this level of trashiness?
>>
does comfy really call his ceo to read mean messages on 4chan? is he retarded or something?
>>
File: 1743562786758692.png (365 KB, 1080x1021)
>>107512304
autism anon....
>>
File: 00019-1875320081.png (1.12 MB, 832x1216)
>>107512276
>>107512252
enjoy :)
>>
>>107512198
kill yourself
>>
>>107512350
nice artstyle
>>
>>107512276
what's the prompt, cba to dl
>>
File: Ovis_00013_.png (956 KB, 1024x1024)
>>107512290
wild how much different samplers can change the output.
>>
comfyui

is it a bug or a feature?

if I am inside the subgraph, the seed never changes despite being set to "randomize"
>>
>>107512304
the cope tweet screenshot was pretty funny. grift chink is very sensitive
>>
>>107512254
they are in a depressing way.
>>
>>107512385
bug. you have to expose the seed as an input/widget, then wire an Int node to your subgraph's seed input/widget
>>
File: 1747766694147477.png (2.15 MB, 1280x1856)
>>107512350
This is pretty cute
What prompt / model / lora are you using?
>>
>wan2gp expects you to write up a json file to point to checkpoints so you can select it in the dropdown menu
>that weird ass folder structure
eeehhhh back to comfy
>>
why would comfy call ani retarded? I thought they were friends
>>
File: ComfyUI_02481_.png (2.28 MB, 1504x1504)
Their github says to be released that means we'll get it right?
>>
>>107512401
thanks. The seed doesn't propagate fully inside. I tried putting a preview-as-text node inside the subgraph, but it only works if there is no current gen (the preview-as-text node is frozen, but it's actually getting a fresh seed).
>>
>>107512497
because "he" keeps referring to himself in the third person like a child with a developmental disability, exhibit a
>>
>>107512504
Yeah, right after Ace Step 1.5
>>
>>107512504
can be released as an API and linked in the repo
>>
>>107512512
I don't know what this means, I don't speak schizo
>>
DROP EVERYTHING! REPORTS INDICATE BASE IS COMING IN 2 WEEKS!
>>
>>107512524
okay then ``anon'', catbox any gen that you've posted to this general in the past six months, with post id, that isn't ani's avatar of course :)
>>
File: Ovis_00017_.png (1.11 MB, 1024x1024)
>>107512369
>>
>>107512539
looks like SD1.5
>>
>>107512536
https://files.catbox.moe/7auwq1.mp4
>>
>>107512555
I'm sure he'll find traces of ani in there. just wait until he's watched it a few thousand times
>>
>>107512555
>no metadata
>no post id
yep, that's what i thought. hasn't done anything but avatarfag this entire year.
>>
>>107512574
>jump through all these arbitrary hoops to satisfy my schizo brain
just turn the computer off bro
>>
>>107512584
you can call me schizo all you want, people are still not going to use OPs with anistudio in it
>>
>>107512593
that wasn't even my question?
>>
I'm glad you all learned about Chinese culture this way rather than through some expensive business venture that left you destitute. It sucks we didn't get our base model though.

This is why I refuse to buy modified 4090s btw. There is zero chance a warranty is honored in the event of an issue.
>>
>>107512603
you didn't learn with wan2.5?
>>
>>107512165
35 really is the new 25, but that's because weak men are so infantilized it takes until they become 35 to move out and start living like a 25 year old
>get into AI research if you want to get bitches
You will get an Asian girl, or if you're lucky and in the right place, a Jewish girl. They will be above average intelligence and anywhere from repulsive to above average in looks, with a median at "below-average"
>>
>>107512602
;)
>>
>>107512610
I draw from experience far more ancient than Wan 2.5
>>
>>107512603
Expecting a warranty from a modded Uber charged cyberpunk export-controlled weapon is the faggiest thing I've read today

The reason I was never interested in a 4090D was just because the 5090 mogged it and if you're not on Blackwell you're already behind
>>
>>107512620
6 tons of pig iron!
>>
>>107512584
>just turn the computer off bro
Maybe that's what Ani should do if she can't handle 4chan's "cyberbullying", as she says.
>>
>>107512623
>the faggiest thing I've read today
I've unironically seen people here tell me it's safe because they come with a warranty.
These idiots really think pingpong pong working out of a Chinese restaurant basement in fuck knows where China isn't just gonna say your 4090 got lost in the mail and then strip it for parts and sell it to the next idiot.
>>
>>107512654
>China isn't just gonna say your 4090 got lost in the mail
Then you chargeback the chink because you paid with a credit card.
>inb4 what if the bank doesn't take my side?
If this was even slightly a genuine concern then you either need to look into anti-anxiety medication or stop being a serf
>>
>>107512603
>this is why I refuse to buy modified 4090s btw
Not because it was a loud ticking housefire?
>>
>anon is unironically sitting there thinking "hmmm, maybe, maybe it's gonna come out tomorrow..."
LMAO
>>
>>107512699
There's no way a modded 4090 is less of a fire risk than a 5090. Are people already forgetting the burnt connectors?
>>
>>107512716
I just get a regular 4090.
>>
>>107512716
They're all housefires
>>
>>107512716
More of a fire risk* fuck I should sleep

>>107512714
I don't come back here every day because of z base, I come back here every day because I like generative AI and also the rest of the internet kinda sucks

>>107512720
Are 4090s still around? Some guy on hacker news described selling his on eBay recently and a bunch of shady Chinese named buyers with locations in Australia had a bidding war for it

I'd recommend a 5090 if you can get one right now. I also am becoming more convinced every day that Taiwan will be invaded. 5070ti + 64gb ddr5 is an ok king of the vramlets suicide stack I guess
>>
>>107512739
I got mine secondhand shipped early this year on a local peer to peer marketplace for ~$2400. Seller voluntarily sent a video of a benchmark. I admit that I was a little spooked using a peer to peer marketplace, although the website does use escrow.
>>
>>
>>107512722
someone should invent automatic Co2 extinguishing and auto power off switches inside of pc cases. bit like someone should totally invent steel reinforced glass food and drinks packaging so that we never need to use plastic and not risk an idiot using them as an offensive weapon, steel reinforced would mean you could make the glass thinner reducing the weight, also easy to recycle both materials.

but we live in a world run by complete retards who can't solve problems.
>>
>another ranfaggot bake
so tiresome
>>
https://github.com/comfyanonymous/ComfyUI_TensorRT
>NVIDIA TensorRT allows you to optimize how you run an AI model for your specific NVIDIA RTX GPU, unlocking the highest performance. To do this, we need to generate a TensorRT engine specific to your GPU.
Really? Can I make it faster than torch compile?
>>
>>107512255
>hands.
https://youtu.be/n-zqjplxN1o
>>
>>107512844
it broke ages ago and nobody fixed it
>>
>>107512844
goddamn, i forgot about this one. it's a relic at this point. remember trying to hook up animatediff to this thing kek
>>
>>107512553
no.
>>
>>107512844
It makes SDXL run twice as fast on my machine
Tedious to make LoRAs work together with it though.
>>107512861
Skill issue
Works on my machine
>>
>>107512921
faster than torch compile?
>>
>>107512926
I never used torch compile but I would assume so.
It literally runs twice as fast.
Some images are almost pixel perfect, some images only have minor changes, sometimes it changes images more but doesn't really deform them.
>>
File: Ovis_00007_.png (1.05 MB, 1024x1024)
This is it, the tackiest model ever
>>
>>107512844
I use tensorrt with onnx models (not for gen ai) and it's very fast. It's a bitch to set up though, and documentation beyond the basics is non-existent, so I'm not surprised it hasn't been updated at this point
>>
>>107512943
is it compatible with wan 2.2?
>>
>>107512968
No.
Which models it works with are written there.
>>
>>107512965
I remember aitemplate being the better option for ease of use and flexibility
>>
>>107512959
>wow holy shit i'm so cool hatting gays and trans! do I fit in? well, do I? Am I cool 4chnis like you guys??
>>
>>107513018
ywnbaw
>>
>>107513018
this but unironically
>>
File: seething tranny.png (1.19 MB, 896x1152)
>>107513018
>>
>>107512844
>>107512921
That sounds sweet. I don't suppose there's something similar for AMD? I'm reminded of having to convert safetensors to ONNX for DirectML, but I think that was more of a stopgap until AMD could expand ROCm support to more hardware. Running safetensors directly has way better ecosystem support than onnx, at least for image gen.

>>107512965
Oh, so it IS an onnx thing... Yeah, getting Olive to work was a pain. The SDXL example was broken on multiple levels. I tried merging my fixes at one point, but didn't seal the deal before they moved the examples to a separate repo. Then AMD released ROCm support for my hardware, which worked better anyway.
>>
>>107512943
>torch compile
I think that is meant to reduce quality though, isn't it? Especially with things like wan.
>>
>>107512959
The ironic thing is unless you are closeted homosexual who's probably also trans, you wouldn't even dream of genning shit this cringe.
>>
>>107513018
we need to bully you people off the fucking internet as you have aided in the destruction of it and the entire western world. We do not fucking like you...
>>
>>107513067
TensorRT isn't just onnx conversion. It then applies a bunch of optimizations depending on your hardware, running a bunch of tests on your GPU to determine what works fastest with minimum degradation, and modifies the model accordingly.
I don't think it has an amd equivalent, and I wouldn't expect there to be any time soon.
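
The "run a bunch of tests and keep whatever is fastest" idea can be sketched in plain Python. This is only a toy illustration of autotuning, not TensorRT's API: the three candidate functions and `pick_fastest` are hypothetical names made up for the example, and the real thing times GPU kernels rather than Python functions.

```python
import time

# Three interchangeable ways to compute 0^2 + 1^2 + ... + (n-1)^2.
def sum_squares_loop(n):
    total = 0
    for i in range(n):
        total += i * i
    return total

def sum_squares_generator(n):
    return sum(i * i for i in range(n))

def sum_squares_formula(n):
    # Closed form for the same sum.
    return (n - 1) * n * (2 * n - 1) // 6

def time_once(fn, arg):
    start = time.perf_counter()
    fn(arg)
    return time.perf_counter() - start

def pick_fastest(candidates, arg, repeats=5):
    """Time every candidate on this machine and return (name, best_seconds).

    Candidates whose output differs from the first one are skipped,
    loosely mirroring the "minimum degradation" constraint.
    """
    names = list(candidates)
    reference = candidates[names[0]](arg)
    best_name, best_time = None, float("inf")
    for name in names:
        fn = candidates[name]
        if fn(arg) != reference:
            continue
        elapsed = min(time_once(fn, arg) for _ in range(repeats))
        if elapsed < best_time:
            best_name, best_time = name, elapsed
    return best_name, best_time

CANDIDATES = {
    "loop": sum_squares_loop,
    "generator": sum_squares_generator,
    "formula": sum_squares_formula,
}

winner, seconds = pick_fastest(CANDIDATES, 100_000)
print(f"fastest on this machine: {winner} ({seconds:.6f}s)")
```

The winner depends on the machine it runs on, which is the same reason an engine has to be built for your specific GPU.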
>>
>>107512844
Could be helpful in the future but without lora or controlnet support and no wan support it sounds mostly useless since basic text2image doesn't take that long.
>>
>>107513119
LoRAs can be used, just very tediously, by compiling the model with them.
Controlnets work if they are just conditioning without a model patch.
No wan support is a bummer though.
But also, the model needs to fit inside your GPU at fp16 without offloading for you to be able to compile it with tensorrt. So it would only help people with 16, or perhaps only 24gb vram (I don't know how much overhead it needs)
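
Rough arithmetic for the "fits at fp16" requirement, as a sketch. fp16 is 2 bytes per parameter, so weights alone are roughly `params * 2` bytes. The parameter counts below are ballpark assumptions for illustration, and `overhead_gib` is a made-up placeholder, since the actual overhead TensorRT needs while building is not known here.

```python
def fp16_weight_gib(num_params):
    """Approximate size of the weights alone at fp16 (2 bytes per parameter)."""
    return num_params * 2 / 1024**3

def fits(num_params, vram_gib, overhead_gib=2.0):
    """True if weights plus an assumed fixed overhead fit in VRAM.

    overhead_gib is a placeholder guess, not a measured number.
    """
    return fp16_weight_gib(num_params) + overhead_gib <= vram_gib

# Assumed, approximate parameter counts (not taken from the thread).
ASSUMED_PARAMS = {
    "SDXL UNet (~2.6B)": 2.6e9,
    "Z-Image-Turbo (~6B)": 6e9,
    "Wan 2.2 14B (~14B)": 14e9,
}

for name, n in ASSUMED_PARAMS.items():
    size = fp16_weight_gib(n)
    verdict = ", ".join(
        f"{vram}GB: {'ok' if fits(n, vram) else 'no'}" for vram in (16, 24, 32)
    )
    print(f"{name}: ~{size:.1f} GiB of weights ({verdict})")
```

Under these assumptions a 14B model is about 26 GiB of weights, which is already over a 24GB card before any overhead is counted.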
>>
>>107512981
>I remember aitemplate being the better option
true. it sped up seed gen, sampling, vae encode/decode and tokenization. trt only does sampling
>>
>>107513151
Oh I had a brainfart there.
It would only work for 5090 users lol
>>
File: Ovis_00008_.png (1.09 MB, 1024x1024)
>>107512959
this is a chubs safe space.
>>
>>107513151
>But also, the model needs to fit inside your GPU at fp16 without offloading for you to be able to compile it with tensorrt.
That is some chicken-and-egg sadness. Unless... iGPUchads, rise up!
>>
>wan 2.2 tensorrt compiled model
>https://huggingface.co/JoeDengUserName/WAN2.2_TensorRT_Collection/tree/main
scam?
>>
>>107513218
Well, the comfyui tensorrt extension doesn't support compiling it, nor inferencing it, but it might be possible that some guy modified the code enough on his own to get it working. On a surface level it's believable that he did this with an RTX 6000 Pro. The questions in my mind are:
Can you hide anything malicious in an .engine file?
Since this is done without official nvidia support, how stable is it? Does it improve the performance as much as you'd expect it to? I expect the answer to the latter to be no. But that doesn't mean it's useless.
Also, you can't run this without an RTX 6000 PRO and whichever specific version of the tensorrt library he made it with, nor without making modifications to the inference code.
>>
>>107513097
Kys chud
>>
Hehe, you guys still doubt me regarding Chinese culture?
>>
These wan animate, steadydance etc models that edit videos, how come these workflows can't be run with any of the base models, 2.2 for example?
Isn't the technique inside the nodes that edit and merge the frames?
How can a video model act as an editing tool?
>>
File: Ovis_00012_.png (814 KB, 1024x1024)
>>107513202
>>
>>107513250
seems doable
I was trying to trt convert zit but the repo failed to convert because it's not supported
https://github.com/comfyanonymous/ComfyUI_TensorRT/blob/5bcc3f1e5c2424bb20bcb586e340c25ebe4a954f/tensorrt_convert.py#L168
but I guess the code can be modified to support it
>>
guess anon got banned for posting cum lol
>>
>>107513346
>/ldg/, the homo sexual-free
Accurate
>>
>>107513397
Yes but nvidia themselves put some work to enable good performance with the models currently supported. You can google "nvidia tensorrt diffusion" and read some of the blogs they wrote about it.
So with sufficient free time and dedication you might be able to spaghetti something that works but it might not be optimal.
Z-Turbo works fast enough for me, but I am interested in tensorrt for Z-Base, of course if that actually arrives.
>>
File: 1739703284746.png (2.17 MB, 1168x1752)
>>
>noise injection and seed variance works on qwen too
neat
>>
Never ask a chroma girl how many fingers she has
>>
BBC DICKS
INSIDE
CCP CHICKS
>>
File: cat question.jpg (32 KB, 382x417)
So I got ai-toolkit. Coming from sd-scripts I wanted to do a test run with SDXL but I have no idea how this shit works
Does this use lycoris by default? Seems so based on conv rank parameter and the fact that it pip installed lycoris-lora. How do I use the normal lora training method? Also the only other lycoris option is lokr, how do I get the rest of the crew?
Ostensibly there is no way to shuffle captions with this? WTF.
How do I use xformers? It's not in requirements.txt, so it doesn't seem to be applied automatically.
I am only seeing AdamW8bit and Adafactor as optimizers. I can just install prodigy and hopefully it works when editing advanced, but it still seems weird.
I assume it is doing bucket resizing automatically based on resolutions I pick on the right?
How do I change lr scheduling options, like cosine or restarts?
So are optimizations like min_snr_gamma 5 prior_loss_weight 1.0 max_grad_norm 1.0 etc applied automatically? How do I apply them?
I don't expect a wall of text answer, but I do feel like I missed something important. This is extremely confusing.
>>
File deleted.
>>107513704
never ask a chinese man where his wife was last saturday
>>
I love China
>>
I love porcelain
>>
>>107513734
Ai-toolkit is a barebones trainer. The only reason to use it is because it has support for models that other trainers don't.
>>
Just opened my ComfyUI after 2 months to edit a meme with kontext (font), everything was broken
All I had to do was run
>update_comfyui_and_python_dependencies.bat
and boom everything got fixed
What a great piece of open source software
BTW, what's the latest model you guys are working with?
>>
>>107514093
>he doesn't use stability matrix to avoid unwanted updates
NGMI
>>
File: 1739610779474586.png (1.68 MB, 1440x808)
>>
>>107514039
I wondered if I fucked up the installation or if it was as barebones as it seemed.
Since it is the latter I think I will see if I can get musubi-tuner working for z-image.
This was disappointing overall, after seeing it be the first to announce support for z-image.
>>
File: 1758140497112591.png (687 KB, 1917x775)
lmao, fucking chinks!
>>
File: it's over.png (162 KB, 2743x611)
>Tongyi is ignoring all the questions related to base
>They Only talk when asking for people to shill Z-turbo on twitter
owari da...
>>
How long does it typically take you to generate an image worthy of posting on /g/?
>>
>>107514442
>>107514485
I didnt expect this much of a rugpull.

Taiwan numbah 1
>>
>>107513734
Use onetrainer
>>
>>107514442
>it's Lao Bai's anti-distillation base
even the regular chinks don't believe in the base release anymore and are trying to cope with undistilling turbo lool
>>
comfy should be dragged out on the street and shot
>>
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo/discussions/78#693aa3f4c24470480cb784fa
Why is this blatant autofellatio allowed?
>>
>>107514485
Because it's not illegal? But you and I know what the end goal is. To get some sort of advantage or code in DMs or etc. with regards to AI application so he can continue grifting.
>>
>>107514485
yes anon we care very much about your discord screenshots, thank you very much for posting them here
>>
>>107514659
Meant to quote >>107514645
>>
File: spooked.jpg (64 KB, 964x912)
>Tried to unpin tensor not pinned by ComfyUI
WTF does this mean local diffusion bros?
>>
>>107514485
Don't listen to sarcastic Ani seethe, those are useful screenshots.
>>
>>107514664
you're welcome
>>
File: da goat.jpg (497 KB, 1920x1920)
>>107514645
>https://huggingface.co/Tongyi-MAI/Z-Image-Turbo/discussions/78#693aa3f4c24470480cb784fa
>MonsterMMORPG
obviously the turkish god is a MMORPG nerd
>>
>>107514708
he is fucking EVERYWHERE HOLY SHIT
>>
The following custom nodes have been updated:

ComfyUI-HunyuanVideoWrapper
ComfyUI Image Saver
ComfyUI Impact Pack
ComfyUI-JakeUpgrade
ComfyUI-KJNodes
ComfyUI-Manager
ComfyUI-MultiGPU
ComfyUI-VideoHelperSuite
tinyterraNodes
gguf
rgthree-comfy
WAS Node Suite (Revised)
ComfyUI-NAG
ComfyUI-nunchaku


The update for the following custom nodes has failed:

ComfyUI-GGUF
>>
File: 1742825259529838.png (1.13 MB, 1452x1678)
https://xcancel.com/LodestoneRock/status/1999053535988928830#m
he's right, I tested Radiance and it was like 2x faster than regular Chroma, and you can go for really high resolution without slowing down, this is the future
https://github.com/LTH14/JiT
>>
>>107514750
>it was like 2x faster than regular Chroma
no it fucking isn't. The last checkpoint would have to be black magic to get such a speedup compared to the previous versions
>>
>>107514765
>no it fucking isn't.
it fucking is, did you even try? the point of this method is to get the same vector size no matter the resolution, so you don't get a compute explosion at high resolution
>>
>>107514442
>>107514485
omg stop dooming, Comfy said they'll probably release the model, trust the funnec!
>>107511837
>yes, I think they will release something. I don't think z image is good enough to compete against other api models so they probably won't try that.
>>
Oi, Comfy Anon, what does this mean?
>Tried to unpin tensor not pinned by ComfyUI
I didn't donate for this shit
>>
File: radiance_arch.png (608 KB, 2854x1945)
>>107514750
What books do I read to get into this?
>>
>>107514809
Your gpu lost tensor cores, it's now permanently crippled.
>>
>>107514822
I just generated some faceswap porn with it few hours ago and it was working fine
>>
do tensor cores turn into shader cores if you're playing a video game?
>>
>>107514812
basically it does this (this image is for training):
- you have an image you want to train on
- you cut it into little squares
- you transform each square (tensor) into a vector
- the model has a part dedicated to each vector, so one part is specialized on vector 1, another part is specialized on vector 2...
- for example, for a 1024x1024 image, if you cut it into 16x16 squares you'll get 4096 vectors, and the model knows how to denoise each square separately

Basically the idea of this paper is that you don't need to know much about the image, there's a lot of pixels that are useless, like our eyes, we don't look at individual pixels to make sense of images, we look at more simple shit like the overall shape, the texture etc...
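
The cutting-into-squares step above can be sketched in plain Python. This only illustrates the patch arithmetic (1024x1024 with 16x16 squares gives 4096 vectors). It is not the Radiance or JiT code, `patch_count` and `patchify` are hypothetical helper names, and the image is a plain list of rows with one value per pixel.

```python
def patch_count(height, width, patch):
    """Number of non-overlapping patch x patch squares in the image."""
    if height % patch or width % patch:
        raise ValueError("image size must be a multiple of the patch size")
    return (height // patch) * (width // patch)

def patchify(image, patch):
    """Cut a 2D image (list of rows) into flattened patch vectors.

    Patches are returned in row-major order; each vector lists the
    pixels of one square, row by row.
    """
    height, width = len(image), len(image[0])
    patch_count(height, width, patch)  # validates the dimensions
    vectors = []
    for top in range(0, height, patch):
        for left in range(0, width, patch):
            vectors.append([
                image[y][x]
                for y in range(top, top + patch)
                for x in range(left, left + patch)
            ])
    return vectors

print(patch_count(1024, 1024, 16))  # 4096
print(patch_count(2048, 2048, 16))  # 16384

tiny = [[0, 1, 2, 3],
        [4, 5, 6, 7],
        [8, 9, 10, 11],
        [12, 13, 14, 15]]
print(patchify(tiny, 2)[0])  # [0, 1, 4, 5]
```

With a fixed patch size the vector count grows with the pixel count, so doubling both sides of the image quadruples it.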
>>
>>107514809
stop using comfy. literal malware
>>
>>107514809
comfyui destroyed your gpu
>>
the thinking man's LoRA of choice
https://civitai.com/models/2211738/
>>
>>107514809
You ran into a bug with the async offload code, the code that offloads some of the model to CPU. You should either restart the application or your PC, since your memory space is fucked and ComfyUI thinks part of the memory you have in GPU is theirs when it isn't.
>>107514840
No, it's fixed function hardware, lives right next to your CUDA cores and is now in fact bigger than them.
>>
File: 1742205293721649.png (1.07 MB, 1634x737)
>>107514886
>https://civitai.com/models/2211738/
the same face phenomenon is much worse than I imagined, even between men and women they look really similar, like they're brothers and sisters
>>
>>107514840
no they turn into gamer cores
>>
>>107514846
No i mean give me books that teach me to be ai god
>>
>>107514778
That sounds too good to be true, it has to lower coherency (even when the model is trained for higher resolutions) or something
>>
File: image.jpg (181 KB, 1944x1026)
At last! The latest comfy update gets rid of nodes, directly from comfy's official discord
>>
>Finished. Loaded 220 nodes successfully.

> "The future depends on what you do today." - Mahatma Gandhi

umm, why are quotes by pedophiles in comfyui? Do you want to confess something Comfyanonymous?
>>
>>107515129
#releasethetranifiles
>>
>>107515129
gandhi is not dalai lama
>>
>>107515129
thats done by a custom node, so you are going out of your way to support pedos
>>
>>107515129
retard
>>
>>107515098
>That sounds too good to be true, it has to lower coherency
we'll have to wait and see, so far his radiance X0 is really undertrained so it's complete ass kek
>>
>>107515139
>>107515143
Gandhi used to sleep naked with his underage niece to test if he'll penetrate her or not
>>107515141
which node?
>>
>>107515153
>>which node?
anistudio webhook
>>
>>107515125
wtf are they doing lmao, and they're fucking things up during the "Base release" waiting timeline, not the right moment to make people angry desu
>>
>>107514708
I still am very proud to have made him watch himself sucking a BBC
>>
File: file.png (183 KB, 1907x1161)
>>107514804
The main thing is that they are on top for open weights in a variety of leaderboards just with Turbo and therefore wouldn't need to try that hard. Unless BFL felt blindsided enough to actually go toe to toe and improve their Dev distill, the chances are slim and I think they have their userbase already and proved enough with Flux.1 for users to stick around.
>>
>>107515213
>I still am very proud to have made him watch himself sucking a BBC
How?
>>
>>107515213
I made a D level Hollywood actor watch himself in a blowbang surrounded by black men
He sent me email from his lawyer and shit lmao
>>
>>107515220
on one of his reddit posts, replied with a stealth link to asking him to see my results after folowing his tutorial. and WAN did the rest lol

Hes blocked me since
>>
>>107514708
what the hell, I know this guy from twitter, he hates israel
>>
File: 1754760140074169.png (306 KB, 500x500)
>>107515231
BASED!!
>>
>>107515218
Oh it went from 8th to 10th :(
>>
>>107515125
I went for several hours before my noodles decided to even render lol.
>>
>>107515245
Based
I follow him for those posts, idgaf what he yaps about when it comes to slopping.
The algorithm is smart enough to show me only his anti-israel posts too, since I follow dozens of groyper accounts.
>>
File: ComfyUI_temp_fyhqs_00001_.jpg (411 KB, 1440x1600)
>>
>>107515218
I just realized Seedream 4.5 released. Why is it worse than 4 though?
>>
>>107515218
just learned flux 2 released
how pozzed is it?
>>
>>107515330
check the model card on HF
>>
File: mental illness.png (346 KB, 649x1544)
>>107515330
>how pozzed is it?
really pozzed, those guys are safety freaks
https://huggingface.co/black-forest-labs/FLUX.2-dev
>>
man i cant believe nunchaku fag rugpulled this hard too
still waiting for wanchaku
>>
>>107515330
it's generated tits for me, but you really don't want to see it
>>
>>107515372
I wouldn't really call that a rug pull, it was an amateur non-profit research project.
The team moved on to other shit and the one guy still nominally working on it is busy with other crap in his life.
>still waiting for wanchaku
It's not happening.
>>
File: combined_image.jpg (1.92 MB, 2760x3508)
>>
>>107515372
>he thinks we'll get wanchaku
>>
>>107515393
what was the prompt?
>>
File: Z-image turbo.png (1.65 MB, 1280x720)
>>
Does Z image work for Forge Neo well yet? What model should I get? I am almost finished with my procrastination.
>>
>>107515404
https://files.catbox.moe/iigzij.png
>>
>>107515436
>she wears white satin mittens retaining her freedom of movement but preventing her from grabbing and being self-sufficient on her own
what did he mean by this
>>
When I use a black image to i2i for zit to control the brightness, there's a threshold at 90-91 denoise where it's just normal to almost entirely black, no inbetween.

Why?
>>
Does a distilled model mean it's a model that's been trained on the outputs of a "base" model?
>>
>>107515575
basically yes
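
A toy sketch of "trained on the outputs of a base model", in plain Python. The `teacher` here is a hypothetical stand-in function, and the student is a two-parameter linear model fit by gradient descent. Distilling a real image model is far more involved, and nothing here reflects how Z-Image-Turbo was actually made. The only point is that the student never sees ground-truth labels, only what the teacher outputs.

```python
def teacher(x):
    # Stand-in for an expensive base model (hypothetical toy function).
    return 2.0 * x - 1.0

def distill(inputs, steps=500, lr=0.1):
    """Fit student(x) = w * x + b to the teacher's outputs with MSE."""
    targets = [teacher(x) for x in inputs]  # the only supervision used
    w, b = 0.0, 0.0
    n = len(inputs)
    for _ in range(steps):
        # Gradients of the mean squared error with respect to w and b.
        grad_w = sum(2 * (w * x + b - t) * x for x, t in zip(inputs, targets)) / n
        grad_b = sum(2 * (w * x + b - t) for x, t in zip(inputs, targets)) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

inputs = [i / 10 for i in range(-10, 11)]
w, b = distill(inputs)
print(f"student: {w:.3f} * x + {b:.3f}")
```

The student can only get as good as the teacher's outputs on the inputs it was shown, which is one reason a base model is usually the better starting point for further training.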
>>
>>107515595
ayy that means Z-Base, if it ever comes out, will be much, much superior to the turbo. Right?
>>
Does sage-attention work fine for lora training (not wan)?
Or do I want xformers or flash attention 3?
>>
Wanted to try a zit thing, forgot I had some loras turned on.
Came out nice.

https://files.catbox.moe/52yjap.jpg nsfw
>>
>>107515330
less bad than flux.1, still not good.

>>107515605
"much" superior for inference is not a given, but it could be. for training it is very typical that the base model is significantly better.
>>
>>107515605
You can try to hobo version de-distill. And do same seed comparisons. It's already better.
>>107515618
Isn't sage a speedup with a quality hit?
>>
>>107515683
Thanks anon

>>107515685
I don't particularly care for the generations, I'm kinda satisfied with how well the turbo does, but if it's better, then I'll take it.

But more importantly, I care about the lora training, because any lora applied to turbo almost always ruins the gen quality
>>
>>107515685
>Isn't sage a speedup with a quality hit
Yes, at least for inference.
I am asking if it gives decent speedup without too much quality hit for lora training.
I wonder if anyone here used it and compared the results.
>>
>>107514708
>>107514725
furk tube when

>>107514748
lmao get C U L T U R E D
>>
>>107512603
>I'm glad you all learned about Chinese culture this way rather than though some expensive business venture and be left destitute.
Ways That Are Dark should be required reading for gweilo in 2025.
https://en.wikipedia.org/wiki/Ways_That_Are_Dark
>>
>>107515759
>1933
>>
File: xjlaw50vzsu31.png (1.14 MB, 1400x5552)
>>107515759
sucks to be cumfart
>>
>>107515218
If it falls another place, I think that is enough impetus to release Base since they want to be definitively better than Flux.2 Dev. Most likely it will stay there though.
>>
>>107515848
>sucks to be cumfart
yeah he is going to be thrown to the side as soon as a Chinese UI is made and all Chinese people put it on a pedestal
>>
>>107515877
Cumfy is definitely gonna be dragged and shot once Xi makes his own UI
>>
can any of you lads running Kontext do me a favor making a meme? Please
I'll do one NSFW Wan request of any girl for you too
Quid pro quo
Make it say:
"Rape 1992"
"The Kevin Nash Story"
>>
>>107516023
>Kontext
I'm not downloading that cucked bfl model
>>
>>107516031
then edit it however you wish mate
My comfy is broken and I can't be arsed fixing it right now
rather busy
>>
>>107516051
>My comfy is broken
join the club
>>
>disney made a deal with openAI to make disney stuff
absolute slop.
>>
File: 00037-734741575.png (904 KB, 832x1216)
>>107515367
normies aren't even interested in flux2. They have nonbanana and seedream 4.0 that gets the job done for far fewer credits.
>>
comfyui is a wrapper. that's it
it's not hard to create a replacement.
>>
>>107516023
go to llmarena and ask nano banana pro if it can do it
>>
>>107516160
>they made a deal to consult the compiling of data and training of models to create highly specialized (private! mind you, your brown hands will never touch these) models pertaining to the disney aesthetic
ftfy
>>
>>107516221
I'm sure it'll write "RAPE"
>>
>>107515326
4.5 is barely an upgrade for 4.0 in the photorealism department.
here is a lewd nsfw 4.5 image:
https://files.catbox.moe/7ojow0.jpeg
here's a 4.0 image with the same prompt:
https://files.catbox.moe/8m0c8u.jpeg
>>107515330
a censored and cucked model not even worth using online via api or locally offline.
>>
>>107516218
Sounds like you are pretty technical? I'm quite humbled to be in the same thread with people of your caliber.
>>
File: 1735083635357203.png (1.31 MB, 1368x760)
qwen edit's ability to manipulate and copy styles is so fun.
>>
>>107516290
can you edit >>107516023
It's to pop the boys on 4chan
>>
File: 1742056781352150.png (1.55 MB, 3135x1552)
>>107516259
>I'm sure it'll write "RAPE"
it did lol
>>
>>107514750
Hands off the cat!
>>
File: hug.jpg (29 KB, 746x512)
>>107516314
share it fren
>>
File: based google.jpg (639 KB, 2048x2048)
>>107516355
>>
>>107516372
Thanks
go ahead and request any 18+ girl and I'll do a NSFW Wan video for you
>>
>>107516283
Alright thanks for reporting.
Disappointing. Seedream isn't that bad for an API model, and it's definitely less censored than many of its competitors.
I want it to do well.
>>
File: that's me.png (27 KB, 200x200)
>>107516379
nah I'm good, I did it for the love of the game, nothing else
>>
>no point in making flux.1 gens
>can't stomach making flux.2 gens
>no point in making ZIT loras
>civit overrun with buzz grabs

non-coomer realgen frens, what are you doin these days?
>>
>>107516410
>>
File: 1757528974149725.png (1.12 MB, 1448x720)
>>
>>107516414
im vibecoding some cumfartui nodes, wbu?
>>
File: ZIT_00006_.jpg (1.12 MB, 1600x2288)
Damn, it's difficult to get a lot of trash.
>>
>>107516496
the indian train or the chinese elevator... who wins?
>>
>>107516504
American pitbulls.
>>
>>107516508
post hands
>>
>>107516496
>>107516504
fuck you bloody that's a bangladesh train
Indian trains electric only
>>
File: bruh.png (24 KB, 385x275)
>>
imagine not understanding the culture
>>
ComfyAds coming soon! (subscribe to Comfy Cloud to disable them!)
>>
>>107516548
If fuckin' only. Still says TBD on Github and HF.
>>
File: 1744353991869251.png (87 KB, 1235x670)
>>107516548
>>107516558
Stop the trolling, it didn't say that at all!
>>
File: zimg_0013.png (2.84 MB, 1440x1080)
>>107516495
not a bad idea at all, i'm digging through old gens, trying to apply some of the Z tricks to FLUX gens, trying old FLUX prompts on Z
>>
>>107516504
Can a train ride up an elevator going down?
>>
File: Autism_Haver_and_yeenV2.png (1.02 MB, 1456x816)
>>107512146
This new model is pretty cool, but it takes a lot of finagling, especially with non-generic anthros like Tauren. /picrel.
>>
>>107516383
the seedream model itself isn't censored, but it can be censored by api-side filters depending on the website you access it through. i found it uncensored on akool and social sights ai.
>>
Hey, what's the best open source sora like nsfw AI that I can use to transform sketches to 3d Pixar style images?
>>
File: MooMoo Loffs yuu Ecin.webm (2.43 MB, 1504x832)
>>107512146
>>107516655
>>
>>107516655
Is this supposed to be ZIT
>>
>>107516693
It was genned with Z-image.
>>
>>107516681
>sora like
None
>that I can use to transform sketches to 3d Pixar style images
Edit models like qwen image edit.
>nsfw AI
None that fits the former criterion.
>>
>onetrainer still hasn't implemented ZiT lora training
HURRY UP YOU LAZY FUCKS
>>
File: file.png (2.46 MB, 1024x1024)
I'm actually having trouble getting it to NOT make things NSFW, and it keeps inserting 3 characters in the foreground of the scene despite me explicitly telling it I want solo. /picrel with the trio problem.
>>
>update comfy
>my sampler doesn't preview anymore
>my queue and history is gone
aaaah why does he keep changing the interface
>>
does anyone have, or has anyone made, a Donggeuran lora for ZIT?
>>
>>107516823
just be a guinea pig for nodes 2.0. you don't need those pesky custom nodes
>>
>>107516823
>why does he keep changing the interface
I have no idea man, who asked for this??
>>
>>107515330
>>107515367
it's not actually true that flux 2 is so bad despite the screed they posted. it's not meaningfully worse than z, and better than flux 1. however, general anatomy is worse than z.
>>
>>107516849
>it's not meaningfully worse than z
saying this when you know flux 2 is a 32b model and z-image turbo is a 6b model is crazy
>>
>>107515093
ideas in ML are generally pretty straightforward, so check the code implementations. The Karpathy tutorial on how to build GPT really helped me back in the day.
https://www.youtube.com/watch?v=kCc8FmEb1nY
Also a more basic ML introduction would be VAEs, here's the video I used to help myself back in the day
https://www.youtube.com/watch?v=zp8clK9yCro
If you want to know the math, like the stochastic differential equations that motivate some of the models (and statistics in general), then I'd suggest learning basic linear algebra up to kernels and linear transformations (about 30 hours of coursework starting from vectors), then calculus up to partial differential equations (another 20 hours if you skip analysis and focus purely on application), and then statistics and inference (up to multivariate inference and modelling). Here's a book list:
- Calculus (Stewart) -> Differential Eq. (Boyce) -> Partial Diff. Eq. (Evans)
- Linear Algebra (Axler)
- First Course In Probability (Ross) -> Mathematical Statistics (Hogg, McKean and Craig)
>>
>>107516849
>it's not meaningfully worse than z
>however, general anatomy is worse than z
Thx for the raff keeeeek
>>
File: 1759573318148270.jpg (939 KB, 1248x1824)
>>
>>107516849
>that 32b model is slightly worse than a 6b model, not a big deal
KEK
>>
>>107516853
>>107516872
>>107516891
it has a vastly wider native style range, better prompt adherence, and 10-channel edit image input. the extra parameters aren't doing nothing. unfortunately it is worse for manufacturing basic 1girl.
>>
>>107516709
What's wrong with ai-toolkit? I guess they don't have fancy optimizers and learning rate schedulers... but that's it
>>
>>107516917
buggy mess
>>
File: do not feed.jpg (457 KB, 2371x1434)
>>107516906
>he is saying that Z-image turbo can only do 1girl
kek that's a troll, don't feed
>>
>>107516479
I would pay actual money for a migu skin in raiders.
>>
>>107516930
you're right. 4chinaman is a totally different case.
>>
>>107516952
>when you write "men" it shouldn't be chinese people since they're not humans
based?
>>
>>107516952
>the european model defaults to european people
>the chinese model defaults to chinese people
call me crazy but that sounds logic?
>>
File: zimg_0044.png (2 MB, 1440x1080)
it's kind of crazy to think how much prompt wrestling you have to do with flux that you can just drop a prompt in Z and it's 100% right
>>
>the indian model only generates perfect 10/10 blue eyed blonde haired Caucasian women
call me bloody bastard bitchod but that sounds logic?
>>
>>107516906
I'm assuming you found out about Z only recently?
>>
File: zimg_0005.png (2.56 MB, 1440x1080)
i had to train a whole ass lora for this in flux
>>
>>107516987
Have you seen Albino Indians?
>>
File: I need to know.png (179 KB, 600x600)
>>107516987
>the indian model
there's an indian model?
>>
File: 1653187409246.jpg (182 KB, 1280x1266)
post yfw you were here for the first threads discovering z-image and the subsequent tinkering to fix the jpeg noisyness
>>
>>107516565
Please mr schizo don't make me coom, that with 30 second ads to unlock more steps, samples, and nodes in the workflow will be my dream
>>
I swear to fucking God Comfy do something about these retards coding your UI.
>>
File: zimg_0066.png (2.09 MB, 1440x1080)
>>107517007
i think i might have posted the first zimg gen to this board kek
>>
>>107517007
Imagine the impregnation
>>
>>107517048
we gave them a raise with all the enterprise deals we signed :)
>>
File: 1746781423008370.png (2.84 MB, 3105x2168)
WHY DID THE HECKIN DIDNEY BETRAY ARTISTS??????
>>
>>107517091
UGC is free money, duh
>>
>>107517091
>boycott disney!
kek that's even less likely than getting z-image base locally
>>
>>107517091
Those who still don't realize corpos are actively working against them in relation to AI are more retarded than those who think Tongy will release Base.
>>
File: V for Victory.mp4 (801 KB, 624x624)
>>107517007
>>107517057
>>
>>107517091
It's wild to me that people are so anti-AI that they're basically advocating for making nearly all fanart illegal. Like, you don't own that IP. Did you get permission from Disney or the anime studio to draw that character? No? Straight to jail.
>>
>>107517091
>Boycott Disney
>Implying manchild redditors are capable of living without their dose of Disney+ zogchow
>>
>>107517091
how do these "people" not understand that requiring compensation for use of works in training sets only benefits globohomo its unquestionably bad for gay ass independent artists
i dont fucking get it
>>
>>107517115
holy molyvey anuddha shoah for my six gorillion sperm cells im about to spill to this

i need to throw some abby pics into the qwen furnace asap
>>
>>107517141
first they're like
>Ban AI, muhh copyright, must protect those multibillion dollars companies
then disney shows it's ok with AI and they're now pretending they never cared about the consent of companies in the first place :^)
>>
>>107517007
>>107517115
i wish to drill my member deep inside her flower and pollinate her until she explodes
>>
>>107517091
oai models via comfy API nodes :)
>>
You can't use ZIT as a refiner for sdxl?
>>
>>107517216
the latent is in a different format. we used to have a node that would convert noise but no idea if there is one for unet -> dit. for now you can convert to an image then put it through the zit vae and it will work
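rough sketch of why the decode -> re-encode route works, in case it helps. these are toy classes that only track shapes, not ComfyUI or diffusers API, and every name in it is made up for illustration. sdxl latents being 4 channels at 1/8 resolution is the usual figure; the 16 channels for the zit vae is my assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Image:
    height: int
    width: int


@dataclass(frozen=True)
class Latent:
    vae: str       # name of the VAE whose latent space this lives in
    channels: int
    height: int
    width: int


@dataclass(frozen=True)
class ToyVAE:
    """Stand-in VAE that only tracks shapes (hypothetical, for illustration)."""
    name: str
    channels: int
    downscale: int = 8

    def encode(self, img: Image) -> Latent:
        return Latent(self.name, self.channels,
                      img.height // self.downscale,
                      img.width // self.downscale)

    def decode(self, lat: Latent) -> Image:
        # A latent only makes sense to the VAE that produced it.
        if lat.vae != self.name or lat.channels != self.channels:
            raise ValueError(f"{self.name} VAE cannot decode a {lat.vae} latent")
        return Image(lat.height * self.downscale, lat.width * self.downscale)


def reencode(lat: Latent, src: ToyVAE, dst: ToyVAE) -> Latent:
    """Go through pixel space: decode with the source VAE, encode with the target."""
    return dst.encode(src.decode(lat))


sdxl_vae = ToyVAE("sdxl", channels=4)
zit_vae = ToyVAE("zit", channels=16)  # channel count is an assumption

sdxl_latent = sdxl_vae.encode(Image(1024, 1024))
zit_latent = reencode(sdxl_latent, sdxl_vae, zit_vae)
print(sdxl_latent)
print(zit_latent)
```

feeding `sdxl_latent` straight into `zit_vae.decode` raises, which is the toy version of the mismatch. pixels are the only format both sides agree on.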
>>
File: zimg_0099.png (2.48 MB, 1080x1440)
>>107517244
>>107517216
there's a re-encode latent node in impact pack or you can do as anon suggested
>>
>>107517216
Refiners are either supposed to pick up leftover noise (disable add noise on top) or do low denoising with a different seed.
Also bad idea unless the initial model can gen something halfway decent with just 4 steps.
I honestly wouldn't try to use half-denoised leftovers with different models, I would let the first sampler finish properly, and then do low denoising with Z.
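Rough sketch of the step math for the two refiner setups above, in case it is unclear. Function names are made up for illustration, and the denoise rule (run only the last int(steps * denoise) steps of the schedule) is one common convention I'm assuming here, not necessarily what every sampler does.

```python
def handoff_plan(total_steps: int, handoff_at: int) -> dict:
    """Setup 1: the first sampler stops early and passes its leftover noise on.

    The refiner continues the same schedule and must NOT add noise on top.
    """
    if not 0 < handoff_at < total_steps:
        raise ValueError("handoff_at must fall strictly inside the schedule")
    return {
        "base_steps": range(0, handoff_at),
        "refiner_steps": range(handoff_at, total_steps),
        "refiner_adds_noise": False,
    }


def low_denoise_plan(base_steps: int, refiner_steps: int, denoise: float) -> dict:
    """Setup 2: the first sampler finishes, then the refiner does a light img2img pass.

    Assumed convention: the refiner adds fresh noise (its own seed) and runs
    only the last int(refiner_steps * denoise) steps of its schedule.
    """
    if not 0.0 < denoise <= 1.0:
        raise ValueError("denoise must be in (0, 1]")
    run = min(int(refiner_steps * denoise), refiner_steps)
    return {
        "base_steps": range(0, base_steps),
        "refiner_steps": range(refiner_steps - run, refiner_steps),
        "refiner_adds_noise": True,
    }


handoff = handoff_plan(total_steps=20, handoff_at=16)
light_pass = low_denoise_plan(base_steps=20, refiner_steps=8, denoise=0.25)
print(len(handoff["refiner_steps"]), len(light_pass["refiner_steps"]))
```

Setup 1 only makes sense when both models share a latent space. Across model families the handoff has to go through pixels anyway, which is another reason to prefer setup 2.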
>>
Is there anything better than ancient DeepDanbooru for tagging anime images with booru tags? I tried JoyCaption, but it's bad at booru tags.
>>
File: ZIT_00024_.png (1.64 MB, 888x1280)
>>107517244
>>107517268
>>107517270
Cool, I had that node and it's working, but results are scuffed.
It's so annoying that the decode node takes twice as long as both ksamplers combined.
>>
File: zimg_0127.png (2.48 MB, 1080x1440)
>>107517365
>>107517270
anecdotally, i have used this the other way around to do nsfw. gen with z then do an sdxl denoise on the parts it doesn't know.

files.catbox.moe/hjlw43.jpg
>>
>>107516655
>new model
Sorry, I'm off the dopamine train. I'm sticking with ZiT, Qwen, and WAN for videos. The rest of models can suck my dick.
>>
>>107517363
https://huggingface.co/spaces/SmilingWolf/wd-tagger
>>
>>107517216
>>107517268
it still acts as an img2img hack
easier to decode and encode again, gives the same result without needing yet another schizo node pack
>>
>>107517454
I've done something similar, gen nudes with ZiT, and inpaint the pploni and crotch with lora.
>>
File: ComfyUI_00147_.png (1003 KB, 752x1392)
remember the six gorillion sperms spilt into these tissues
>>
>>107517471
>>107517471
>>107517471
calm and reasonable new thread
>>
>>107516839
no search bar in nodes 2 for lora and models under the drop down is stupid...
and I want my X and square dedicated cancel buttons back ; /
>>
>>107512776
kino
>>
File: untitled.png (934 KB, 1538x859)
>>107515367

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.