[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: 1710346023636211.jpg (1.47 MB, 3264x3264)
1.47 MB
1.47 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>101601667

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://www.modelscope.cn/home
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Kolors
https://gokaygokay-kolors.hf.space
Nodes: https://github.com/kijai/ComfyUI-KwaiKolorsWrapper

>AuraFlow
https://fal.ai/models/fal-ai/aura-flow
https://huggingface.co/fal/AuraFlows

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
blessed thread of frenship
>>
bigma will keep us save from the space pee aliens
>>
official pixart bigma and lumina 2 and also the hunyuan finetune waiting room
>>
Bigma status?
>>
time to start genning some shit.
>>
>>101616817
stuck in 256 x 256 latent space
>>
File: Sigma_11518_.jpg (2.09 MB, 1408x2688)
2.09 MB
2.09 MB JPG
>>101616767
Thank you baker for providing us with fresh bread

>>101616777
And thank you blesser of the bread for saving us from the evils of the latent world
>>
>>101616867
this is where the monsters live anon....
>>
File: ComfyUI_00056_.png (583 KB, 512x512)
583 KB
583 KB PNG
Jungle time I think, sayt anything you like i will gen but has to be in the jungle, imagine your place is arriving towards the crude land sight.
>>
>>101616925
try asking the latent space for a "1girl" i want to see what it does
>>
File: kekman.jpg (101 KB, 1280x720)
101 KB
101 KB JPG
>>101616941
ok, i was having a real problem just there with it not genning 1girl... but you make it easy for me now anon i can give you 1girl in jungle no problem for now. But I'd like to control it.
>>
File: ComfyUI_00058_.png (493 KB, 512x512)
493 KB
493 KB PNG
>>101616941
She is not yet detailed, i'm still setting up.
>>
>>101616941
>"1girl" i want to see what it does
depends on what you want? younger i'm doing that here even if fully clothed so fuck off.
>>
>>101616864
lets see it
>>
>>101616993
pp samplers are interesting
>>
File: Sigma_11537_.jpg (2.28 MB, 2048x2048)
2.28 MB
2.28 MB JPG
>>101617012
They are indeed. Also deis and the new beta scheduler

Good night anon!
>>
>>101617012
no i hate them for most cases, ddpm works
>>
>>101617078
WAAAAAAAAHHHHHHHHHHHHH!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
>>
>>101617078
maybe lower cfg will work, be right back.
>>
File: ComfyUI_00064_.png (466 KB, 512x512)
466 KB
466 KB PNG
0.4 cfg

euler_cfg_pp

interesting but not entire to spec.
>>
>>101617108
hmm time to try with pony model at 1024
>>
>>101617049
>beta scheduler
looks incredibly similar to simple and sgm_uniform mixed 50/50 with exponential 50/50. im not sure what alpha and beta values do
>>
>>101617130
was already pony, just forgot to up res, the resulting image is a naked women though and i'd get banned, its decent mind you.
>>
>>101617144
putting "nsfw, nipples" in the negatives will usually net you child friendly images
>>
>>101617155
i don't care, why care? I',m a fucking man i don't care what these idiots say...
>>
>>101617151
WWWWAAAAAAAAAAAAAAAAAAAAHHHHHHHHHHHHHHHHHHHHHH!!!!!!!!!!!!!!!!!!!!!!!!!
>>
bye anons, good night.
>>
>>101617174
gn
>>
File: file.png (162 KB, 256x256)
162 KB
162 KB PNG
>>101616817
in latent space
>>
>>101616867
>>101617296
Hype tho
>>
>>101616767
can someone make me a picture of a
>>
>>101617308
>an astronaut is riding a brown horse on the moon, in the background is the planet Earth, double exposure

one day it'll be the whole prompt
>>
>>
>pos: zebra pattern, ...
>neg: animal face horse, ...
>>
File: 00003-3832605959.jpg (342 KB, 1208x1568)
342 KB
342 KB JPG
>>
>>101617570
this is cool
>>
>>101617575
nice patchwork
>>
File: Image.jpg (3.7 MB, 2304x1280)
3.7 MB
3.7 MB JPG
>>
File: Image.jpg (2.1 MB, 2304x1280)
2.1 MB
2.1 MB JPG
>>
File: absol moon street_00007_.png (3.66 MB, 1728x1344)
3.66 MB
3.66 MB PNG
>>
>>101618090
i don't like the way it's looking at me
>>
>>101618090
I really like the way it's looking at me. Catbox?
>>
i sleep till bimimimimigma... zzzzzzzzzzz.... mimimimimi.....
>>
mabig
>>
>>101617575
Little big planet
>>
File: 00000-3335085276.jpg (389 KB, 1512x1328)
389 KB
389 KB JPG
>>101618928
Hansel and spagrettel
>>
File: 950181967.jpg (72 KB, 768x768)
72 KB
72 KB JPG
>>
cough
>>
>>101616767
cool gens
>>
File: Sigma_11539_.jpg (2.24 MB, 2048x2048)
2.24 MB
2.24 MB JPG
Good morning anon

>>101619135
Nice house
>>
>>101620339
gm
>>
File: Sigma_11542_.jpg (2.83 MB, 2048x2048)
2.83 MB
2.83 MB JPG
>>101620383
gm
>>
File: Sigma_11545_.png (914 KB, 2048x2048)
914 KB
914 KB PNG
Closer
>>
File: Sigma_11547_.jpg (3.53 MB, 2048x2048)
3.53 MB
3.53 MB JPG
>>
>>101620502
this one is cool
>>
File: Sigma_11549_.jpg (3.24 MB, 2048x2048)
3.24 MB
3.24 MB JPG
>>
File: Sigma_11550_.jpg (3.87 MB, 2048x2048)
3.87 MB
3.87 MB JPG
>>101620502
ty
>>
File: Sigma_11551_.jpg (2.15 MB, 2048x2048)
2.15 MB
2.15 MB JPG
MFW I was born with too many fingers
>>
>>101616767
I got me a question for everyone of the thread. I do both local text chatting and image gen, and will be building a full blown desktop for it. I have the money for the 4090, but part of me is curious about the jump between the 4080 super and the 4090.
Keep cost out of it, that's not the important part. Would the 4080 super be the better choice? Part of me feels that way.
Also, is there a big difference between the ryzen and Intel on generation? I know trying to image gen on an amd GPU is a pain in the cock and I won't be doing that.
>>
>>101620572
>4080 super be the better choice
nooope, vram is king here, especially in text gen. and i think unless you're planning on training/finetuning your own models a 3090 instead of a 4090 would be a better deal.
>>
File: Sigma_11554_.jpg (1.28 MB, 2048x2048)
1.28 MB
1.28 MB JPG
>>101620572
16GB vs 24GB.. you always need more VRAM but 4090 is still expensive.
>>
File: Sigma_11552_.jpg (2.04 MB, 2048x2048)
2.04 MB
2.04 MB JPG
>>
File: Sigma_11555_.jpg (1.72 MB, 2048x2048)
1.72 MB
1.72 MB JPG
>>
>>101620603
Unfortunately I live in SEA and the cost of a 3090 is just as high as a 4090. Does the 4090 still have the melting port issues?
>>101620610
Not even the price being an issue (not trying to brag, I saved up all my funny money specifically for a badass beast of a PC). I originally wanted to *just wait* for the next iteration to drop, but unfortunately being in SEA the first run of 5090s gunna be double the price due to scalping. I am not paying nvida yacht tax and some jackasses scalp tax because he fucks the ladyboy doing inventory
>>
File: 00015-2124170760.png (892 KB, 720x1000)
892 KB
892 KB PNG
>>101620657
Also, to contribute, here's something I made.
>>
File: Sigma_11559_.jpg (3.7 MB, 2048x2048)
3.7 MB
3.7 MB JPG
>>101620657
>4090 still have the melting port issues
Power-limit and you won't have to worry. The connector is only designed for so many inserts and running full tilt is what gets you in the fire zone. I set mine to 330w
>sudo nvidia-smi -pl 330

>saved up all my funny money specifically for a badass beast of a PC
That's cool. You have time to wait for sales then! Buy it piecemeal and set price alerts
>>
>>101620657
>Does the 4090 still have the melting port issues?
sorry i have no clue. i think a 16gb card should be fine for image gen but from what i've heard 16gb is kind of useless in text gen due to the available model sizes. basically any model you can fully run off your 16gb card you can do so with a 12gb card. maybe it might be handy if you want to run an llm and a image gen model at the same time.
>>
File: Sigma_11563_.jpg (2.62 MB, 2048x2048)
2.62 MB
2.62 MB JPG
>>
>>101620684
Friend, I've done everything including dropping bribes to shop owners to alert me first at sales of the PC parts. The Asian market (moreso sea) is absolutely brutal towards anything electronic.

That being said, knowing about the 4090 and how to handle it, I'll probably grab that.
>>101620692
Well, with a kobold+GPU setup, I can run 20b models at rather nice speeds. And with 64gh of ram, plus the 16 or 24 from the GPU I choose, it would be more than enough to suit my needs. Especially with the new IMAT models, which are 25 percent smaller.
That being said I'm an amateur at all this and this may be entirely wrong
>>
File: Sigma_11565_.jpg (2.84 MB, 2048x2048)
2.84 MB
2.84 MB JPG
>>101620743
Seems like the best move. Good luck anon
>>
>>101620743
>https://huggingface.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
you can use that to see if the quant of the model you're using will fully fit into your gpu. keep in mind that when you offload the model to cpu speed goes down drastically till hits pure ram speeds.
>That being said, knowing about the 4090 and how to handle it, I'll probably grab that.
i think this is the best move as well
>>
File: Sigma_11567_.jpg (3.54 MB, 2048x2048)
3.54 MB
3.54 MB JPG
>>
>>101620787
>>101620797
Thanks frens.
>>
File: Sigma_11566_.jpg (666 KB, 2048x2048)
666 KB
666 KB JPG
>>
>>101620826
this is cool, prompt?
>>
File: Sigma_11571_.jpg (2.46 MB, 2048x2048)
2.46 MB
2.46 MB JPG
>>
File: Sigma_11572_.jpg (3.29 MB, 2048x2048)
3.29 MB
3.29 MB JPG
>>101620834
"A bronze sculpture of a decaying hand reaching upward, fingers outstretched towards the heavens, as moss and small flowers sprout from its open palm. It symbolizes the impermanence of life and the enduring power of nature."
>>
File: Sigma_11573_.jpg (1.7 MB, 2048x2048)
1.7 MB
1.7 MB JPG
>>
File: Sigma_11576_.jpg (3.81 MB, 2048x2048)
3.81 MB
3.81 MB JPG
>>
File: 0.jpg (666 KB, 2048x1024)
666 KB
666 KB JPG
>>
File: Sigma_11581_.jpg (1.1 MB, 2048x2048)
1.1 MB
1.1 MB JPG
Work came early today. I'll be back and forth

Breathe in, gen out
>>
File: Sigma_11583_.jpg (3.19 MB, 2048x2048)
3.19 MB
3.19 MB JPG
Screw it, one more before I go
>>
>>101621357
>>101621382
have fun at work anon, i'll cough here in the mean time to keep the thread nice and warm
>>
File: Untitled.jpg (1.06 MB, 3820x1832)
1.06 MB
1.06 MB JPG
can someone interpret for me what the fuck the emotes mean on civitai?

why are people crying at Misty? are they uohhh'ing ?
is the crying face actually the downvote button?
if so, what's the laughing face? it can't be the upvote button because there's already a thumbs up emote.

this is fucking stupid
>>
>>101621466
i have zero clue honestly. clicking an emote gives you buzz, so i assume it's just people clicking whatever they think looks the funniest.
>>
File: Sigma_11584_.jpg (2.69 MB, 2048x2048)
2.69 MB
2.69 MB JPG
Sneak gen before meeting starts
>>
File: Sigma_11585_.jpg (2.97 MB, 2048x2048)
2.97 MB
2.97 MB JPG
>>101621400
ty anon, have a good day yourself
>>
cough
>>
>>101620517
Water should be more blue low left corner. Great pic
>>
>>101620445
v1.0?
0.o
>>
>>101621207
Cool.
>>
File: 0.jpg (138 KB, 1024x512)
138 KB
138 KB JPG
>>101622464
>>
seems like the model checkpoints that have "mix" in the title are usually better than others. probably because the person making it has already identified the best models that they're mixing. they've obviously done a lot of comparisons.
>>
>>101623005
kek good one anon
>>
>>101623032
i know it sounds like they're just amateurs mixing shit together, that's what i thought, but if you think about the kind of person who's mixing things together they'd be the kind of person who is doing comparisons.
>>
>>101623005
merges > mixes
>>
File: Sigma_11587_.jpg (3.71 MB, 2048x2048)
3.71 MB
3.71 MB JPG
>>101622002
ty

>>101622455
2k validation after another epoch

>>101622962
Nice
>>
cough
>>
File: Sigma_11586_.jpg (3.73 MB, 2048x2048)
3.73 MB
3.73 MB JPG
>>101623355
>cough
When sickness heals the thread
>>
>>101623628
that's the phlegm i coughed out
>>
File: Sigma_11588_.png (3.89 MB, 2048x2048)
3.89 MB
3.89 MB PNG
>>101623661
The feeling of clarity
>>
File: absol_moon_street_00004_.png (3.4 MB, 1728x1344)
3.4 MB
3.4 MB PNG
>>101618133
stares at u artistically
>>101618199
i dunno what that means
>>
File: file.png (166 KB, 256x256)
166 KB
166 KB PNG
first time it actually did an astronaut and horse shaped blob
>>
>>101623852
Yeah, I can kind of see it.
>>
File: Sigma_11589_.png (3.02 MB, 2048x2048)
3.02 MB
3.02 MB PNG
>>101623852
>>101623950
LGTM
>>
So did any of these new models crack the hot tub full of sausages problem yet or are we still at babby tier levels of comprehension?
>>
File: Sigma_11595_.jpg (2.77 MB, 2048x2048)
2.77 MB
2.77 MB JPG
>>
>>101624166
Image models will not get perfect until they progress from color blob hallucinations. AI will need to be able to construct the scene in 3D space before it can have great comprehension.
>>
>>101623793
>i dunno what that means
he wants you to upload your image to catbox.moe and post the link here so he can peek the metadata
>>
File: Sigma_11596_.jpg (2.07 MB, 2048x2048)
2.07 MB
2.07 MB JPG
>>101624166
Prompt?
>>
File: Sigma_11598_.jpg (3.74 MB, 2048x2048)
3.74 MB
3.74 MB JPG
>>
File: Sigma_11602_.jpg (2.51 MB, 2048x2048)
2.51 MB
2.51 MB JPG
>>
File: 116775249263549057-SD.jpg (3.48 MB, 1648x1776)
3.48 MB
3.48 MB JPG
>>
File: Sigma_11603_.jpg (2.46 MB, 2048x2048)
2.46 MB
2.46 MB JPG
>>101624658
Really wants text when it sees quotes. Sorry didn't notice
>>
I can never decide between 20, 25, or 30 steps, or even 35. Has anyone scientifically determined the diminishing returns?

(typically use euler_a or euler_smea_dy)
>>
File: Sigma_11606_.jpg (965 KB, 2048x2048)
965 KB
965 KB JPG
>>101624757
Good question. I'd like to see a plot of fidelity vs steps
>>
>>101624686
nice
>>
File: Untitled.png (43 KB, 2818x346)
43 KB
43 KB PNG
Is there a comfyui method to save the result multiple times as it's generating?
Instead of needing to regenerate the whole image again for different step counts?
>>
>>101625122
you can chain samplers to do exactly that
>>
>>101625122
From what I understand the steps is the model's goal point, step 10 for 20 steps is not the same step 10 for 40 steps. It's like telling someone to draw something in 20 minutes vs telling someone to draw something in 40 minutes.
>>
>>101625171
>>101625220
if you chain 20 samplers together that each do 1 step, is that the same as 1 sampler that does 20 steps?
i'll be testing it myself shortly
>>
>>101625252
The only accurate way to test is running each step count separately. I'm assuming chaining is just running steps on top of a finished image. Like I said, the denoising schedule is dependent on the target end step.
>>
>>101625252
If you correctly set each nth sampler to do just the nth step of 20 it should be the same
>>101625289
> I'm assuming chaining is just running steps on top of a finished image
It is not, the advanced ksample node lets you control all that.
>>
>>101625298
Then you're just short cutting running multiple denoising schedules in parallel and the time to do your tests would be the same as doing three separate images.
>>
>>101625317
what? no.
It is sequential and lets you grab the latents at step 20/30/etc
three separate images means doing steps 0 to 10 three times
>>
>>101625332
I ALREADY TOLD YOU THE NOISE SCHEDULE IS DEPENDANT ON THE TARGET STEP COUNT
>>
File: file.jpg (34 KB, 335x382)
34 KB
34 KB JPG
>>101625356
YOU CAN DEFINE THE STEPS TO RUN SEPARATELY FROM THE TOTAL STEPS WITH THE ADVANCED KSAMPLER NODE
DON'T DOUBT ME EVER AGAIN
>>
>>101625414
I'm telling you you're a fucking moron.
Which step is 50% of 20
Which step is 50% of 40
You do know how denoising works, right?
>>
>>101625433
You think the anon cares about that? He just wants the image at step 20/30/etc.
>>
>>101624489
Trippy texture
>>
File: Sigma_11609_.jpg (1.77 MB, 2048x2048)
1.77 MB
1.77 MB JPG
>>
File: Sigma_11610_.jpg (1.62 MB, 2048x2048)
1.62 MB
1.62 MB JPG
>>
>>101625811
>>101625875
Very cool
>>
File: 0.jpg (294 KB, 2048x1024)
294 KB
294 KB JPG
>>101625875
>>
>>101625811
>>101625875
nice
>>
>>101624166
did >>101624250 pass?
>>
>>101625968
no, it's full of water
>>
File: 0.jpg (167 KB, 768x1024)
167 KB
167 KB JPG
>>
File: Sigma_11597_.jpg (3.36 MB, 2048x2048)
3.36 MB
3.36 MB JPG
>>101625883
ty

>>101625911
Ghost fren

>>101625914
ty

>>101625968
>>101626017
Seems like this is one of those English being vague things. "entirely filled" != "full of" as something "full of sugar" is not 100% sugar.

It does struggle specifically with stuff like making an Orange a different color (if you can guess why).
>>
File: Sigma_11615_.jpg (1.99 MB, 2048x2048)
1.99 MB
1.99 MB JPG
>>101626017
>>101626314
kek "entirely filled" just made the hot tub a sausage. Fail
>>
>>101626314
Let's see the tub filled 100% with 100% real sausages.
>>
>>101626418
how about an empty hot tub
>>
File: Sigma_11619_.jpg (1.94 MB, 2048x2048)
1.94 MB
1.94 MB JPG
>>101626427
>hot tub filled 100% with 100% real sausages
>>
File: Sigma_11620_.jpg (1.67 MB, 2048x2048)
1.67 MB
1.67 MB JPG
>>101626444
>an empty hot tub
"Empty" can also mean without patrons, trying out "dry hot tub"
>>
>>101626418
Water in the negatives?
>>
File: Sigma_11621_.jpg (3.12 MB, 2048x2048)
3.12 MB
3.12 MB JPG
>>101626496
Didn't want to cheat, but I'll try it. A dry hot tub everyone
>>
File: Sigma_11622_.jpg (1.75 MB, 2048x2048)
1.75 MB
1.75 MB JPG
>>101626496
Did you want blue foam? That's how you get blue foam
>>
File: Sigma_11614_.jpg (2.56 MB, 2048x2048)
2.56 MB
2.56 MB JPG
>>
File: Sigma_11624_.jpg (2.17 MB, 2048x2048)
2.17 MB
2.17 MB JPG
>>
File: Sigma_11625_.jpg (3.06 MB, 2048x2048)
3.06 MB
3.06 MB JPG
>>
File: Sigma_11626_.jpg (3.49 MB, 2048x2048)
3.49 MB
3.49 MB JPG
>>
File: Sigma_11627_.jpg (3.5 MB, 2048x2048)
3.5 MB
3.5 MB JPG
>>
File: Sigma_11630_.jpg (2.8 MB, 2048x2048)
2.8 MB
2.8 MB JPG
>>101626715
>>
File: Sigma_11632_.jpg (3.1 MB, 2048x2048)
3.1 MB
3.1 MB JPG
>>
File: Sigma_11637_.jpg (2.41 MB, 2048x2048)
2.41 MB
2.41 MB JPG
>>101626796
>>
File: Sigma_11639_.jpg (1.89 MB, 2048x2048)
1.89 MB
1.89 MB JPG
>>101626796
>>101626830
Last
>>
File: Sigma_11640_.jpg (2.83 MB, 2048x2048)
2.83 MB
2.83 MB JPG
>>
File: Sigma_11642_.jpg (2.55 MB, 2048x2048)
2.55 MB
2.55 MB JPG
Later
>>
>>101626913
Good gens
>>
>open /ldg/
>post single gen
>leave
Why?
>>
>>101627167
Why do anything
>>
serious question, what's the best method for setup and os for automatic1111? took me a while to get the shit running on ubuntu 2204, cuda 12.1 , then tried to setup on debian 12, cuda 11.8 (no xformers available) and the it/s dropped like 15% , don't want to download 15gigs in python libraries everytime for a new install, is nvidia base docker image the way to go ? I am at a point where I am thinking of just creating a separate install only for this and then use clonezilla to create a disk image to never have to setup this shit again....
>>
>>101627406
if you want current kernels/graphics stacks you might be better off with a rolling release distro.
>>
File: grid-0013.jpg (2.61 MB, 2304x1792)
2.61 MB
2.61 MB JPG
why wouldn't you just do a tub and then inpaint so it looks more like a hot tub? Is this more than a flex or is there some purpose here?

>>101627406
new upgrades are needed and you can't dodge the pain of the update cycle. cuda 11 is slower than cuda 12. If you try to use a docker image then you are just going to have to manage more.
>>
>>101627406
arch and learn to use venv
>>101627658
>inpaint
i think that anon would consider it "cheating"
>>
>>101627688
so anon is just finding weakness in a product, bitching it doesn't cover the edge case and claiming massive changes are needed.

I was hoping for something fun. I'll continue with my life.
>>
>>101627688
>learn to use venv
how risky is it to use that and fuck up your system due to a mistake?
Is it safer to put all the ai stuff in a container?
>>
>>101627853
venv is a folder, it runs from the folder, it's basically a container
>>
>>101627869
I mean, how easy is to to call something outside the venv that should have been in the venv.
But maybe I'm overthinking it.
>>
>>101623005
>>101623049
As >>101623115 alludes to, block merges imply greater quality considering one has much more control than simple mixing. Both creators likely spend equal time comparing however.
Block merging is the way to go for a multitude of reasons.
>>
>>101621466
It's contextual but sometimes random click too for buzz points
>>
>>101627936
well if you're afraid that a python file is going to read your illegal porn then no, venvs are not safe
but if your afraid of your dependencies getting mixed up, yes venvs are safe
>>
>>101627999
I'm afraid to accidentally pip install things into my main system. I'd like to keep it clean in that regard.
>>
>>101628035
When you activate the venv from bin it stays there. I've never had a problem with it mixing dependencies.
>>
>>101625122
My thought is by adding ksampler advanced nodes manually and set start/end according to your need.
Node1 : 40 steps, start 1, end 21
Node2 : 40 steps, start 21, end 31
Node3 : 40 steps, start 31, end 40
Must use exactly same sampler and scheduler.
>>
>>101628060
Will have to read it up I guess.
>>
>>101628082
The basics are simple, when you make a venv it makes a folder with a portable python version you used to make the venv. To use a venv you activate it (/venv/bin/activate). When you do any pip stuff it puts it in /venv/blah//lib or whatever.
>>
>>101628114
thx for your input
>>
Babe wake up, CogVLM2 just got released
https://github.com/THUDM/CogVLM2
>>
File: 0.jpg (685 KB, 2048x1024)
685 KB
685 KB JPG
>>
>>101628061
depending on the sampler and scheduler, this does nothing.

What do you think this is doing?
>>
>>101628551
Did you chain it? Connect latent from node1 to node2, node2 to node3
>>
File: BAD.png (2.15 MB, 3549x1695)
2.15 MB
2.15 MB PNG
>>101628227
Meh, still worse than GPT4V
http://cogvlm2-online.cogviewai.cn:7861/
>>
File: Sigma_11644_.jpg (2.39 MB, 2048x2048)
2.39 MB
2.39 MB JPG
>>101627001
ty

>>101627167
It's called "The one and done"

>>101627406
The best way to go is arch linux with podman (same as docker without bs) and the nvidia container runtine. You can run your docker container and map your models over with -v /localpath:/dockerpath

>>101627658
We were testing if the model was coherent in that fashion without trickery. One shot straight out of the model is still not feasible for a hot tub filled with sausages instead of water.
>>
File: Sigma_11648_.jpg (2.59 MB, 2048x2048)
2.59 MB
2.59 MB JPG
>>101628227
thx for news

>>101628666
thx for testing
>>
File: Sigma_11650_.jpg (3.02 MB, 2048x2048)
3.02 MB
3.02 MB JPG
>>
File: same_pic.png (551 KB, 512x768)
551 KB
551 KB PNG
>>101628621
yup. it is the same thing. If it isn't then your sampler/scheduler choice is doing something or settings are just wrong. If you are noise injecting at every 10 steps then I will ask again what you are attempting to do.

https://litter.catbox.moe/t5zv5n.png
>>
File: Sigma_11655_.jpg (3.12 MB, 2048x2048)
3.12 MB
3.12 MB JPG
>>101628666
The GPT4V caption seems kinda short to me, but it is more correct. Good details about the clothing, etc. in CogVLM2. Sigma's a 300 token model
>>
File: Sigma_11656_.jpg (3.45 MB, 2048x2048)
3.45 MB
3.45 MB JPG
>>101626913
>>
File: Sigma_11659_.jpg (2.9 MB, 2048x2048)
2.9 MB
2.9 MB JPG
>>101626913
>>101629046
>>
>>101629028
Your workflow only shows image from last node which is the 40 steps. Just add preview/save image node on node1 and node2.
>>
File: Sigma_11661_.jpg (939 KB, 2048x2048)
939 KB
939 KB JPG
>>
>>101623170
How many epochs total?
>>
File: Sigma_11663_.jpg (2.47 MB, 2048x2048)
2.47 MB
2.47 MB JPG
>>101629143
This is 5 epochs. I think training with two sets of captions gives it way better knowledge when doing multiple epochs. The official trainer picks between two by default and I have both filled to the brim (300 tokens) from two separate VLM's. I'll continue training until it starts validating worse, which could be a while.
>>
>>101629115
I don't want intermediate trash. The entire workflow is there. If you don't like the result there is nothing I can do for you at this point.
>>
>>101629267
I think you're replying to wrong person. The guy asked how save image per specific steps. So I gave him idea to use multiple ksampler modes.
>>
>>101629317
I do have the thread confusion. Sorry
>>
File: Sigma_11665_.jpg (1.12 MB, 2048x2048)
1.12 MB
1.12 MB JPG
>>
File: Sigma_11658_.jpg (2.92 MB, 2048x2048)
2.92 MB
2.92 MB JPG
>>101629246
>>101628860
>>
File: tmpgdghhu91.png (1.31 MB, 896x1152)
1.31 MB
1.31 MB PNG
Apparently the "lore" if you will is t hat I want to kill myself at work or whatever? Where the fuck did that come from?
>>
>>101621777
Prompt?
>>
I'm surprised there's not a custom ksampler that allows you to define an arbitrary number of step counts and then once a given is reached, outputs the image.
I don't wish to chain 50 samplers in order to see the denoising progression.
>>
File: 0.jpg (819 KB, 2048x1024)
819 KB
819 KB JPG
>>
File: Sigma_11669_.jpg (2.58 MB, 2816x1408)
2.58 MB
2.58 MB JPG
>>101630367
A charcoal drawing on a charred canvas depicts a solitary figure walking away from a burning city, their silhouette fading into the smoke and ash. It symbolizes resilience in the face of destruction and the cyclical nature of life and death.
>>
File: Sigma_11670_.jpg (2.47 MB, 4096x1024)
2.47 MB
2.47 MB JPG
>>101621777
>>101630367
>>101630504
Wide
>>
File: Sigma_11671_.jpg (2.68 MB, 1024x4096)
2.68 MB
2.68 MB JPG
>>101630551
Narrow
>>
File: 0.jpg (121 KB, 1024x512)
121 KB
121 KB JPG
>>
File: Sigma_11649_.jpg (1.02 MB, 2048x2048)
1.02 MB
1.02 MB JPG
>>101630572
Simple and clean, but complex and messy at the same time. Very nice
>>
>>101630335
who are you?
>>
File: Sigma_11674_.jpg (1.79 MB, 2048x2048)
1.79 MB
1.79 MB JPG
Sleep
>>
>>
File: s00-6.png (2.06 MB, 1208x1208)
2.06 MB
2.06 MB PNG
>>101620838
Nice
>>
File: Image.jpg (2.38 MB, 2304x1280)
2.38 MB
2.38 MB JPG
>>
>>
File: tmpdst942yi.png (1.1 MB, 768x1024)
1.1 MB
1.1 MB PNG
>>101631095
"Centauranon", apparently
>>
>>
File: ComfyUI_18876_.jpg (1.18 MB, 2200x2621)
1.18 MB
1.18 MB JPG
https://civitai.com/models/566526/kolors
I feel like Kolors is the only model that is really great at photorealism, remind me of Midjourney a bit, wonder why people sleep on it
>>
>>101632142
>wonder why people sleep on it
Likely the "its architecture is outdated" meme. I presume anons understanding is that it's merely a hypertuned XL.
>>
>>101632230
desu if they made the same training on a DiT model it would've gotten an insane result yeah, and also the fact that kolors fucking sucks at prompt comprehension because it has been trained with chink doesn't help either
>>
>>101632249
>chinaspeak
I don't necessarily mind that, indeed the thought of what a buger could pull from semi-chinese latent space is intriguing.
I'll give it a shot if and when I have the urge to use XL.
>>
File: image.png (3.35 MB, 1536x1280)
3.35 MB
3.35 MB PNG
>>
awesome another Laura gen gets into the collage
>>
>>101632142
https://civitai.com/images/21848019
kolors wins this one, picrel is pixart
>>
>>101620838
>>101631475
memento mori
>>
File: hunyuan_dit_1.2_00042_.jpg (425 KB, 832x1024)
425 KB
425 KB JPG
>>101632142
>wonder why people sleep on it
for me because it doesn't work on 6GB vram
>>
>>101630456
Isn't this a feature in auto, generation preview? If you're expecting legit use cases in comfy instead of pure autism you're expecting too much. Comfy provides the illusion of choice.
>>
Newbie here, Why I'm getting this results?

Model Juggernaut XL with the requirements at civitai.
>>
cough
>>
File: wayne_bat_00001_.png (3.19 MB, 1728x1344)
3.19 MB
3.19 MB PNG
>>101634730
>Why I'm getting this results?
*Why am I getting these results?

Low resolution, also a strange resolution. You want a resolution that adds up to 2048, the most common being a square of 1024 by 1024.That resolution will give you better results, though not incredible.
For most good diffusions you would take that image and upscale it before sampling it again with very low noise to increase detail.
>>
File: Sigma_11677_.jpg (2.96 MB, 2048x2048)
2.96 MB
2.96 MB JPG
>>101631475
ty

>>101631659
>>101631751
>>101631895
>>101632299
MFW I ran out of air

>>101634730
Are you using a fancy VAE or under 10 steps?
>>
File: Sigma_11679_.jpg (2.63 MB, 2048x2048)
2.63 MB
2.63 MB JPG
>>
File: Sigma_11680_.jpg (3.09 MB, 2048x2048)
3.09 MB
3.09 MB JPG
>>
File: Sigma_11681_.jpg (3.15 MB, 2048x2048)
3.15 MB
3.15 MB JPG
>>101635122 plus >>101635141
>>
>>101635141
this is sick, prompt?
>>
File: Sigma_11682_.jpg (2.08 MB, 2048x2048)
2.08 MB
2.08 MB JPG
>>101635168
Mixed media made of straws and colorful paper of a house on a sunny day
>>
>>101634984
Sorry for the misspelling.

Thanks for the tip.

I will try with another Sampler with less noise and change the resolution.
>>
>>101635255
ty, you always impress me with the sheer variety of your gens. keep it up anon.
>>
File: Sigma_11689_.jpg (2.69 MB, 2048x2048)
2.69 MB
2.69 MB JPG
>>101635374
ty. It would just be spam if they weren't interesting!
>>
>>101635021
8 steps, is the recommended for hyper version.

Don't know how change VAEs.
>>
File: ComfyUI_temp_ccufe_00023_.jpg (2.51 MB, 2560x1440)
2.51 MB
2.51 MB JPG
been a hot minute since I played with SD
>>
File: Sigma_11691_.jpg (2.16 MB, 2048x2048)
2.16 MB
2.16 MB JPG
>>101630335
>>101631777
Hello Centauranon. Let's skip the lore of you killing yourself. You're going to die anyways. Why not let it be a surprise how? Plus, who is going to make the centaurs if you check out early?

>>101635533
What do you use?
>>
File: ComfyUI_temp_zkxta_00008_.jpg (2.24 MB, 2560x1440)
2.24 MB
2.24 MB JPG
>>101635533
>>
File: ComfyUI_temp_zkxta_00012_.jpg (2.65 MB, 2560x1440)
2.65 MB
2.65 MB JPG
>>101635596
50/50 merge of AbyssOrangeMix2 and LoliDiffusion
20 steps on euler a and a 2 step ultimate sd upscale

>that smirk
>>
>>101635517
>8 steps, is the recommended for hyper version.
Yep, that's fine. Other models you need more steps. I've seen that kind of degradation from custom VAE, poor scheduler+sampler combo (GPU samplers usually), too low of steps on a normal model, and from >>101634984
>Low resolution, also a strange resolution

ComfyUI is complicated at first but keeps you in the driver's seat. https://github.com/comfyanonymous/ComfyUI?tab=readme-ov-file#installing

>>101635674
> Base Model SD 1.5
So you mean you're back, and not that you're using something else. Mistook it for a tuned Kolors gen. You should try that one out if you haven't
>>
File: Sigma_11688_.jpg (899 KB, 2048x2048)
899 KB
899 KB JPG
>>101635729 (You)
Forgot pic
>>
has anyone heard of noise playing while generating?
got a new pair of headphones (sennheisers) since my old ones broke and now I notice when I generate stuff there's this weird quiet white noise that plays through the headset. Whatever's going on it's not through the PC since Audacity doesn't pick it up. It's really weird.
>>
File: ComfyUI_temp_zkxta_00018_.jpg (2.35 MB, 2560x1440)
2.35 MB
2.35 MB JPG
>>101635729
Just been busy the last couple months troubleshooting my pc.
Turns out my pc is fine and this issue I am having is happening on every pc I have tested, all with Nvidia gpus.
So just taking a breather.
Also got sick of having 6 or more computers in pieces cluttering up my room.

>tuned Kolors gen

I will check it out, thanks.
I am assuming I need to download all of it and not just the safetensors file? (On HF btw)
>>
File: Sigma_11696_.jpg (2.06 MB, 2048x2048)
2.06 MB
2.06 MB JPG
>>101635805
Let the latents speak. Also that probably shouldn't be happening. Are you corded?

>>101635956
Odd.. 6 computers.. have you tried arch linux and docker containers (with podman)?

>I am assuming I need to download all of it and not just the safetensors file? (On HF btw)
Yeah the ChatGLM3 model is necessary as well as the SDXL VAE

For comfy support until it's native https://github.com/kijai/ComfyUI-KwaiKolorsWrapper

And hunyuandit is native in comfyui now but you 100% need to translate to Chinese at it
>>
>>101636031
>Are you corded?
yeah. Difference is my previous headset went through USB while this one goes straight into the audio port so I assume something's going on there
>>
File: Sigma_11697_.jpg (1.72 MB, 2048x2048)
1.72 MB
1.72 MB JPG
>>101635956
And for even more spice in the pot.. all my recent gens are from Pixart Sigma
>>
File: Sigma_11699_.jpg (2.36 MB, 2048x2048)
2.36 MB
2.36 MB JPG
>>
File: Sigma_11700_.jpg (2.44 MB, 2048x2048)
2.44 MB
2.44 MB JPG
>>
File: jpgjpg.jpg (2.04 MB, 2560x1440)
2.04 MB
2.04 MB JPG
>>101636031
>have you tried arch linux and docker containers (with podman)?

no, but I have done a test in Mint 21.1 with gpu passthrough and a windows vm and the issue remains, so I know that it has something to do with the Nvidia drivers.
It's definitely a software problem, got a couple experiments to try but that's for tomorrow me to deal with.
>>
File: Sigma_11701_.jpg (2.53 MB, 2048x2048)
2.53 MB
2.53 MB JPG
>>
File: Sigma_11702_.jpg (2.32 MB, 2048x2048)
2.32 MB
2.32 MB JPG
>>
>>101634960
bless you
>>
File: file.png (11 KB, 808x77)
11 KB
11 KB PNG
>>101628227
>just got released
what fucking rock you been living under?
CogVLM1 was good because it wasn't trained on GPT4 slop. 2 is trained on endless amounts of slop, like every other model.
>>
File: 00548_.png (2.89 MB, 1192x1792)
2.89 MB
2.89 MB PNG
>>101635729
Thank again

I'm working with ComfyUI and there are some improves.

Swap the model for RealVisXL and add some changes at the workflow. As you say.
>>
>>101634984
Nice.
>>
File: Sigma_11703_.jpg (1.85 MB, 2048x2048)
1.85 MB
1.85 MB JPG
>>101637628
yw anon. Nice job. Here's some inpainting examples for more advanced stuff https://comfyanonymous.github.io/ComfyUI_examples/inpaint/
>>
File: Sigma_11707_.jpg (1.62 MB, 2048x2048)
1.62 MB
1.62 MB JPG
>>
>>101637889
oh this is amazing, Now I know how some IA influencers stay at real world.

Thanks
>>
File: Sigma_11736_.jpg (1.58 MB, 1408x2688)
1.58 MB
1.58 MB JPG
>>101638103
yw, keep it up and posts some gens!
>>
how many images do i need to make a pony lora? i am giving up on trying to recreate my 1.5 style, i need to use my 1.5 generations to make a lora for pony
>>
File: Sigma_11737_.png (3.04 MB, 2560x1536)
3.04 MB
3.04 MB PNG
>>
bigma
>>
File: Sigma_11744_.jpg (1.86 MB, 2688x1536)
1.86 MB
1.86 MB JPG
>>
Even though it's 100 degrees outside it's never too hot to bake some new...

>>101639278
>>101639278
>>101639278
>>
File: Sigma_11749_.jpg (3.66 MB, 2688x1536)
3.66 MB
3.66 MB JPG
>>101639309
ty baker!
>>
File: Sigma_11758_.jpg (3.47 MB, 2688x1536)
3.47 MB
3.47 MB JPG
Filling thread
>>
File: Sigma_11760_.jpg (2.87 MB, 2688x1536)
2.87 MB
2.87 MB JPG
>>
File: Sigma_11754_.jpg (2.6 MB, 2688x1536)
2.6 MB
2.6 MB JPG
>>
File: Sigma_11768_.jpg (3.07 MB, 2688x1536)
3.07 MB
3.07 MB JPG
>>
File: Sigma_11770_.jpg (1.43 MB, 2688x1536)
1.43 MB
1.43 MB JPG
>>
File: Sigma_11774_.jpg (3.55 MB, 2688x1536)
3.55 MB
3.55 MB JPG
Full
>>
final cough



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.