/g/ - Technology




File: tmp.jpg (1.1 MB, 3264x3264)
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102059183

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
Training another pepe lora just because
>>
quality of gens is definitely getting better. what changed ldg?
>>
File: file.png (2.12 MB, 1024x1024)
>>
File: perfchance.jpg (272 KB, 1532x768)
I'm not a tech-savvy guy. How the fuck do I generate anime girls that aren't shit? It already takes 3 hours to generate a single image whereas a random website generates a 2000% better result in 3 seconds

I downloaded EasyDiffusion
>model used: Anything
>negative embedding: EasyNegative, Badartist

Surely there's something wrong with what I'm doing.

Pic related. Left is default model, middle is with Anything, and third is with a random website.
>>
knowyourmeme seems to be blocked in my country. it's scary to think how the government can do this without saying anything and most people would never notice the internet getting compartmentalized
>>
If Flux prompting is similar to what joy-caption is spitting out, we can kinda figure out how to structure prompts better to output what we want, or how to caption better. The only pain point now is things like "character raises their right hand"
>>
>>102061572
let me guess you're in a turd world country that routinely ddoses and spams
>>
File: 00114-3666841939.jpg (539 KB, 1248x1864)
>>
File: 2883286017.png (1.01 MB, 1024x1024)
>>
>>102061568
bro why are you still in 2023? how is that possible?
>>
>>102061582
I'm in spain...
>>
Blessed thread of frenship
>>
>>102061586
In what way am I in 2023?
>>
>>102061623
You should be either using Pony or Flux
>>
>>102061568
If you don't care about local, NovelAI has the best anime gen right now.
>>
it would take 16 hours to caption all 4500 pics I have with llama
is florence faster? I know it's not as good but I don't really care
>>
>>102061640
florence is more than good enough and it takes 1 second per image
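
To put the two numbers side by side, here is a rough back-of-the-envelope sketch in Python. It only uses the figures quoted in these posts (4500 images, 16 hours for llama, about 1 second per image for florence); the function name is made up for illustration and real throughput will depend on the GPU and the caption length.

```python
def caption_hours(num_images: int, seconds_per_image: float) -> float:
    """Total captioning time in hours for a dataset."""
    return num_images * seconds_per_image / 3600


num_images = 4500

# 16 hours for 4500 images works out to this many seconds per image.
llama_seconds_per_image = 16 * 3600 / num_images

# Florence at the quoted ~1 second per image.
florence_hours = caption_hours(num_images, 1.0)

print(f"llama: {llama_seconds_per_image:.1f} s/image")
print(f"florence: {florence_hours:.2f} hours for the whole set")
```

So if the 1 second per image figure holds, the whole set is done in a bit over an hour instead of overnight.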
>>
>>102061640
You could pay an API to do it faster I guess.
Are you sure you need that many images? You only need as many as it takes to showcase all the different variations/poses/examples you need to show the model.
There's also wd-tagger.
>>
File: ComfyUI_10508_.png (2.05 MB, 800x1400)
>>
>>102061640
florence is much faster
>>
File: file.png (29 KB, 1775x135)
>but the beauty lies in her unique combination of curves; her breasts being a beautiful example of Breastiness.

kek, LLMs truly are a treasure
>>
File: ComfyUI_00771_.png (973 KB, 1024x1024)
>got a warning because accidentally the wrong picture where Kasias deformed nipple slipped through

kek
>>
I should be masturbating to VR games, but I'm here generating arrays of pixels endlessly.
>>
File: 00035-534123401.jpg (841 KB, 1920x1080)
My new fun experiment, putting Visual Novel summaries from VNDB into the prompt
>>
>>102061692
We're really not that far from generating 3D images to admire in VR
>>
>>102061640
4500 pics for a lora seems overkill no?
>>
>>102061650
joy caption with the nf4 bnb llama is also extremely fast. too fast even, it takes several minutes when using the fp8 version for some reason with terrible GPU utilization
>>
>>102061692
Any good VR game for that?
>>
>>102061695
Let's rig up a fully local endless AI VN experience that isn't shit.
>>102061705
Unless you're doing an all encompassing porn finetune, you probably never need more than 100.
>>102061709
In my case it's virt-a-mate, but there's more "game"-like ones like virtual kanojo I guess.
>>
>>102061695
which one is this?
>>
>>102061706
anon I don't need pussies obsessively described at the expense of everything else and I doubt it's faster than florence which writes 300 tokens per second
>>
>>102061568
what even is your hardware? what are you doing here if you want easy, quick, good results? go to a website if you want to put zero effort in
>>
>>102061722
The story centers around Fuminori Sakisaka, a medical school student involved in a vehicle accident that claimed the lives of both his parents and left him with critical injuries.
He was rescued by experimental brain surgery that coincidentally altered his perception of the world dramatically.

Everything now seems to be composed of slimy intestines and gore to him. In addition, the rest of his senses: touch, hearing, smell, and taste, are also impaired, similar to his sight, further damaging his mental health.

Fuminori's desire to live dwindles, and one night, while still in the hospital, he contemplates suicide. However, a girl in a white dress named Saya appears before him. Compared to the horrible surroundings, she looks completely normal, if not downright gorgeous. Fuminori soon falls in love with Saya, and she becomes his raison d'être.

As time passes, Fuminori gets increasingly secluded from the rest of his normal life as he embarks on a mission to find a specific person Saya is looking for. Fuminori's friends and doctor get more concerned as he acts strangely over time. Nonetheless, they will soon cross paths, for better or worse...
>>
>>102061730
>I doubt it's faster than florence which writes 300 tokens per second
the nf4 is that fast
>>
>>102061556
What do you use? How long does it take?
>>
>>102061739
then it's probably schizo tier in its writing, start pasting images with their captions
>>
>>102061730
It's not the text generation that you are experiencing as slow, it's the analysis of the image.
>>
>>102061721
One day with AI simulated worlds I think that will be possible.

It will be so hard not to be fully immersed and to actually leave VR
>>
someone should make a gameboy lora where it turns everything into gameboy pixels
>>
>>102061745
why does speed mean schizo for llama but not for florence?
>>
>>102061752
I am angry at you
>>
>>102061759
because florence is optimized for its parameter size and not running at 4bit
>>
>>102061744
3090, ai-toolkit, default lora training preset, 1000 steps, 48 joy captioned, manually reviewed images. should take about an hour
>>102061752
and that someone could be YOU
>>
>civitai wants 5$ for 100 images generated

>runpod can generate 100 images per hour for around 50 cents

???
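
For a sense of scale, a quick per-image comparison in Python using only the prices quoted above. The helper name is invented for this sketch, and it assumes the rented GPU is busy the whole hour, which ignores setup, model downloads and idle time.

```python
def cost_per_image(total_cost_usd: float, images: int) -> float:
    """Average cost of one generated image in US dollars."""
    return total_cost_usd / images


civitai = cost_per_image(5.00, 100)  # $5 for 100 images
runpod = cost_per_image(0.50, 100)   # ~$0.50/hour at 100 images per hour

print(f"civitai: ${civitai:.3f} per image")
print(f"runpod:  ${runpod:.3f} per image")
print(f"ratio:   about {civitai / runpod:.0f}x")
```

On these numbers the hosted option is roughly ten times the raw GPU rental price per image.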
>>
>>102061721
Ah yes I know virt-a-mate, I'm waiting until part 2 is fully made and supported before getting a proper VR headset, hopefully by that time some cool VR headsets are around
>>
>>102061783
overhead
>>
File: file.png (2.5 MB, 1024x1024)
>>
>>102061732
Doesn't need to be quick, but it could at least be good. I'm using a 1660 Super
>>
>>102061786
I have a quest 2, and you should get a quest 3 (or something more expensive that's pcvr only), and it absolutely is worth getting vamx right now. Especially if you have never experienced anything like it and are not afraid of its absolutely terrible UI. I will be getting 2 when it comes out.
>>
>>102061798
Is

A40
48GB VRAM
50GB RAM
9 vCPUs

good for flux dev for 35 cents per hour?
>>
>>102061826
Remember someone is possibly storing everything you gen and selling it for training.
>>
File: ComfyUI_04315_.png (1.37 MB, 832x1216)
>>
>>102061834
>selling AI sloppa for training
>>
>tfw no lora trained on old german nudist magazines
>>
>>102061852
nude loras are useless until the baseline model understands nudity, at best you have to overbake the model to get some pubic hair and maybe a slit
>>
>>102061814
Yeah I tried Vam 1 a few years ago now, but mostly without a VR headset. I think I somehow connected to a crappy VR headset and it was cool, but should be much better now
>>
>>102061873
For me it's the physical sensation of having someone standing next to you for real. Desktop mode does not do it justice at all.
>>
>>102061838
Nice dress homie
>>
>>102061572
>>102061592
Are you sure it's not your ISP?
>>
File: 00039-1482918790.png (1.28 MB, 832x1216)
>>102061883
yeah I know, and the sense of scale is amazing. Big and small things are actually big and small.

>>102061695
Ever17 is the tale of seven individuals who become trapped 51 meters below the surface in the underwater marine theme park 'LeMU'. After an incident, almost half of LeMU becomes flooded, and the path to the surface and the communication lines are cut off. In addition, LeMU is under constant assault by severe water pressure, limiting time to find a means of escape to 119 hours. Escape is not the only concern, however; many questions arise as to the legitimacy of the accident and whether or not those trapped there were brought there for a purpose.
>>
>>102061889
Yes, here ISPs are routinely mandated to block certain sites for copyright reasons.
>>
>>102061852
There is a nudist lora on civitai trained at 7000+ steps that is pretty good. Modern pics tho
>>
File: ComfyUI_212457_.png (2.37 MB, 1920x1080)
>>
>>102061907
It's probably just a DNS block, you can easily get around it
>>
I wish I had like 50 4090s so it took few seconds for genning instead of this long fucking wait.
>>
>>102061990
Nah, I checked. DNS resolves.
>>
>>102062004
that's not how it works
>>
I assume most people here use comfyui but does anyone use forge? Regional prompter doesn't work on latest version and forge couple just doesn't work that well for me. Any decent alternative?
>>
can you make me some real cool sneakers
>>
how does civitai even exist? just playing devils advocate here but there's a lot of "copyrighted" stuff on there, like actual characters from movies, how are they not able to take these people down?
>>
File: 10327093547.png (1.24 MB, 1024x768)
>>
File: ComfyUI_04321_.png (1.49 MB, 960x1088)
Why do people put keywords/activation phrases in their LoRAs? Not sure why it's necessary.
>>
>>102062040
Parody
>>
>>102062040
Because it's all AI output to which copyright law doesn't apply
>>
>>102062040
Parody and caricature of someone fall under fair use
>>
>>102062040
Same way deviant art exists
>>
How much does it cost to train a lora on civitai?
>>
>>102061688
But where is the lora anon
I want to recreate my childhood
>>
File: ComfyUI_Flux_10619.jpg (325 KB, 768x1344)
>>
>>102062162
2000 for flux
>>
File: 1098785685.jpg (125 KB, 1024x1024)
I took a break for a while and now FLUX developments are happening way faster than I thought they would, so now I'm trying to play catch up.
What's the go to model for higher tier cards like 3090s/4090s? Should I just stick with Flux Dev non-GGUF or go with the GGUF models? If the latter, which is the most recommended?
>>
File: 2024-08-24_00297_.jpg (2.31 MB, 5120x2880)
flux can go into some insane fucking detail if you drive the resolution high enough
>>
>>102062210
Flux-dev FP8 can currently be used with the --fast flag in ComfyUI for a 40% extra speed boost on 4000 series cards, so that's what I'm using.
>>
>>102062188
ok I will upload it
>>
>>102062236
thats the 16ch magic
>>
>>102062210
Same boat here. followup question, too: is there a way to use two GPUs or is it still limited to 1
>>
>>102062247
the best part is that VAEs aren't that big on your VRAM, imagine a 32ch now, the details would be fucking top tier
>>
Any VRAMlets tried training a Flux LoRA using a Google Colab subscription? Wondering if it's viable for style training
>>
>>102062247
yea :D just gotta wait another
>Prompt executed in 2062.73 seconds
for the next one
>>
>>102062263
16ch is a nice sweet spot. after 32 the visual benefit becomes negligible. and if you're not pixel peeping 16ch is plenty
>>
>>102062238
Is there any particular decrease in quality or understanding with GGUF models when compared to the original?
I got a 3090 so I'm curious if I'm giving up anything.
>>
File: file.jpg (2.04 MB, 7961x2897)
>>102062296
If you go for a more aggressive GGUF quantization, yes - but Q8 is almost indistinguishable from the native FP16, at almost half the VRAM cost.
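
The "almost half" figure can be sanity-checked with a weights-only estimate. This is a sketch under stated assumptions, not a measurement: it assumes roughly 12 billion parameters for the Flux transformer and about 8.5 bits per weight for Q8_0 (32 int8 values plus one fp16 scale per block), and it ignores the text encoders, the VAE and activations.

```python
def weight_gb(num_params: float, bits_per_weight: float) -> float:
    """Size of the weights alone, in decimal gigabytes."""
    return num_params * bits_per_weight / 8 / 1e9


FLUX_PARAMS = 12e9  # assumption: ~12B parameters in the transformer
FP16_BITS = 16.0
Q8_0_BITS = 8.5     # assumption: 34 bytes per block of 32 weights

fp16_gb = weight_gb(FLUX_PARAMS, FP16_BITS)
q8_gb = weight_gb(FLUX_PARAMS, Q8_0_BITS)

print(f"FP16: {fp16_gb:.2f} GB")
print(f"Q8_0: {q8_gb:.2f} GB ({q8_gb / fp16_gb:.0%} of FP16)")
```

Under those assumptions Q8 lands at a little over half the FP16 footprint, which is what leaves headroom on a 24GB card.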
>>
>>102062197
I love 3 breasted alien women.
Is the style a Lora or good prompt?
>>
File: file.png (286 KB, 1795x1059)
>>102061535
how does SD work on linux?
my specs are in the pic, i don't think the hardware will be a problem, but what about the software side?
>>
>>102062322
Even Q6-k is good enough
>>
File: 00043-2070780825.png (1.68 MB, 896x1152)
I heard someone singing, the sound tearing through the chilly wind.

As if to match the guitar melody I played in the dusk-lit music room.
As if to match the piano someone played in the room next door.
The pure singing voice from the rooftop connected the three disconnected melodies.

It all started on such a day in late autumn.
When someone fell in love.

Everyone did their best. Everyone pushed on. Everyone was intent and earnest…
A bond was formed deep in our hearts, and we obtained a moment that could not be replaced.
That’s why someone fell in love. A love that came too late, a love that shouldn’t have occurred.

Then comes winter — the falling snow covering up all sins.
Then comes spring — delivering punishment along with the melting of the snow.
>>
File: fp16-vs-q8-vs-fp8.jpg (742 KB, 3648x1260)
>>102062296
this >>102062322
and for the final step fp16 the king vs Q8 the queen vs fp8_e4 the bastard prince, check my picture you can see why the quantized versions are pretty neat
>>
>>102062344
I agree, I ran Q6_K on my 4090 (until the speed increase of FP8 came about) just to give myself more headroom for running LLMs/ControlNETs/LoRA in conjunction with the main model.
>>
>>102062335
Install drivers
Install cuda toolkit
Install python (or miniconda)
You're set for AI shit
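
A small Python sketch for checking that those three pieces are actually on your PATH after installing. It assumes the usual command names (`nvidia-smi` ships with the NVIDIA driver, `nvcc` with the CUDA toolkit); the function name is made up, and a missing entry only means the tool was not found on PATH, not that the install is broken.

```python
import shutil


def check_tools(tools=("nvidia-smi", "nvcc", "python3")):
    """Map each command name to True if it can be found on PATH."""
    return {tool: shutil.which(tool) is not None for tool in tools}


if __name__ == "__main__":
    for tool, found in check_tools().items():
        print(f"{tool}: {'found' if found else 'missing'}")
```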
>>
File: ComfyUI_Flux_0215.jpg (931 KB, 1536x2688)
>>102062333
the Gerry Anderson lora should be doing most of the style lifting
>>
File: 148987300.jpg (157 KB, 1024x1024)
>>102062322
>>102062238
>>102062368
Thanks anon(s). Legitimately helped me get up to speed a little bit.
>>
>>102062374
Why the hell so many people like that style?
>>
File: 2198185726.png (1.05 MB, 1024x1024)
>>
File: ComfyUI_Flux_0211.jpg (976 KB, 1536x2688)
>>102062391
idk
>>
>>102062368
What are the base requirements for not crashing at default weights? I'm assuming the last image refers to weight_dtype when loading the model. I crash every time.
>>
is mickey mouse inside flux dev?
>>
File: 4step_up_00060_.png (1.01 MB, 750x750)
>>102062431
>>
File: 00044-2138379031.png (1.5 MB, 896x1152)
>>102062365
>>
>>102062371
>until the speed increase of FP8 came about
what speed increase?
>>
>>102062427
Amateur photography lora?
>>
File: file.png (27 KB, 567x329)
>>102062472

>>102062238
>>
>>102062435
16-20GB VRAM, 32GB system RAM, and a swap file that isn't size-limited (because on loading you will probably push your swap file to ~50-70GB)
>>
>>102062472
>>102062489
yea but be careful, in my experiments the result with --fast turned out to be non-deterministic too
>>
>>102062515
what about quality loss?
>>
File: 1453214868.png (1.24 MB, 1024x1024)
>>102062480
You got it.
>>
File: 00046-3356989399.png (1.29 MB, 896x1152)
>>102062468
>>
File: file.png (1.31 MB, 1024x1024)
>>
File: 00048-3356989401.png (1.6 MB, 896x1152)
>>102062559
>>
>>102062536
slight, but since it's already just fp8 .. it doesn't matter much, but I ditched it for q8/fp16 .. and it's more like a 20% speed increase .. not 40% .. it's minor details that are different, but in an overall composition this could be catastrophic (like text) .. if you just gen nature shots it won't matter much
>>
File: 2024-08-24_00301_.jpg (1.87 MB, 5120x2880)
more insane resolution extravaganza .. lowering steps made it take just
>Prompt executed in 1415.48 seconds
>>
File: ComfyUI_01209_.png (943 KB, 1280x720)
>>
>>102062586
>but I ditched it for q8/fp16

Unless I'm reading it wrong, the >>102062489 optimization works on FP16, when the weight_dtype is set to fp8_e4m3fn.
>>
>>102062611
looks great
>>
File: 00049-777982752.png (1.91 MB, 896x1152)
>>102062611
looks great

>>102061695
Hinamizawa, a small rural village in Japan, circa 1983. The village is known for its annual Watanagashi Festival, which honors the local deity, Oyashiro-sama.

The story follows Maebara Keiichi, a teenage boy who has recently moved to Hinamizawa with his family. He quickly becomes friends with a group of local girls: Ryuuguu Rena, Sonozaki Mion, Houjou Satoko, and Furude Rika. On the surface, Hinamizawa is a peaceful, tight-knit community. However, Keiichi soon discovers that the village harbors dark secrets and a history of mysterious disappearances and gruesome murders that occur each year around the time of the Watanagashi Festival
>>
>>102062626
but then you're using fp8, not fp16
>>
File: file.png (1.85 MB, 1024x1024)
>>102062611
nice though
>>
File: ComfyUI_01194_.png (899 KB, 1280x720)
I'd just like to interject for a moment.
What you're referring to as a Catgirl prompt, is in fact, ConditionConcat/Prompts, or as I've recently taken to calling it, multiple prompts plus condition concatenating.
Prompting is not a solution to LoRA unto itself, but rather another free component of a fully functioning MODEL made useful by the Condition Concatenating.
>>
>>102062626
if you use weight_dtype fp8_e4m3fn with the default fp16 model you ARE using fp8 .. the fp8 model file just skips the step that ComfyUI otherwise does every time it loads the default fp16: it cuts it down to fp8 and then uses that ..
>>
How much money do I need to spend on a computer to make this stuff?
>>
CUT MY FLOAT INTO PIECES
>>
File: ComfyUI_00823_.png (1.56 MB, 1024x1024)
I wanna upload my Kasia LoRa to civi.

what does
>Training Params:
>Epochs
mean?
and wtf do I write in there?
>>
>>102062643
ching chong
>>
>>102060253
nice, it gets the reflection from both sources, great quality
>>
File: dd.jpg (352 KB, 1024x1024)
>>102062672
a niggilion
>>
File: 00052-777982755.png (1.88 MB, 896x1152)
>>102062680
>>
>>102062672
40 cents per hour
>>
File: 0.jpg (117 KB, 1024x1024)
>>
File: file.png (1.78 MB, 1024x1024)
>>102062675
How did you train it
>>
>>102062672
32GB of RAM
any desktop cpu from the last 4 years
a RTX 3060 12GB minimum
a keyboard
monitor optional
>>
>>102062675
Do you know that famous blonde that did only one shoot for some nude magazine like Kasia and then went away?
>>
>>102062737
Jenna Jameson
>>
>>102062736
>monitor optional
How is he gonna see it, with a brain implant?
>>
>>102062737
Tera Patrick
>>
>>102062751
No, for metart, it was some nobody european
>>
>>102062644
>>102062660

Thanks for letting me know
>>
File: ComfyUI_00814_.png (808 KB, 1024x1024)
>>102062733
put 59 images and the txt files with the descriptions into a folder and then did this https://github.com/ostris/ai-toolkit

used the example config and just modified the samples. it doesnt say anything about any epochs there tho.
>>
>>102062764
he does it for others, not himself, a true hero
>>
File: file.png (1.87 MB, 1024x1024)
>>102062764
there are many options to interact with a headless computer anon
>>
>>102062611
a 4x upscale with flux takes me like 5min on a 3090
>>
>>102062672
$200 Lenovo p520 + ~$700 Nvidia gpu, most likely a used 3090
>>
>>102062784
Yeah I love hearing the sound of my image, or viewing it on my dick long phone display
>>
>>102062825
the absolute state of /g/
rdp or ssh or local proxy, from another normal pc
>>
File: 706813000.png (1.09 MB, 896x1152)
>>
File: ComfyUI_Flux_0225.jpg (1.54 MB, 4096x3072)
>>
File: 00055-1665524069.png (1.63 MB, 1152x896)
>>
>>102062798
depends on the settings I guess.. if I push down the steps and the used upscaler I can probably cut it to 5 minutes too .. but comparing fast methods with nf4 vs full fp16 with 4x_nmkd I lose so much detail .. I don't just want the original picture .. I want each tile to pop up new interesting details
>>
>>102062837
that still requires a monitor, believe it or not
>>
>>102062842
prompt?
>>
>>102062869
debo horns lady sexi feet in leatherette chair dark maroon curtains FEET saar
>>
>>102062847
>>102062848
Prompt?
>>
>>102062884
>>102062848
You awaken in a decrepit old mansion.

A woman with eyes of jade stands before You, informing You that You are the Master of the house, and she Your Maid. However, You have no memories, no concept of self—or, indeed, any certainty that You are even alive.

The Maid invites You to join her on a journey through the mansion's lifeless halls, to behold the numerous tragedies that have befallen its residents. She suggests that among them, perhaps You will find some trace of Yourself.

Beyond the first door lies the year 1603.
It is an era of unparalleled beauty, where art and theatre flourish. Roses bloom abundantly in the garden where the inseparable Rhodes siblings play, and though they appear to be free of worry and strife... not everyone is content to see them happy.

Beyond the second door lies the year 1707.
In this era, the mansion lies in ruins, and a beast dwells within. He claims to yearn for a life of serenity, but it is not long before he yields to his innate savagery and a massacre ensues.

Beyond the third door lies the year 1869.
In this technologically advanced era, people are always on the move. The mansion's master is an ambitious businessman who has invested in the rail industry. However, his obsession with wealth and power leads him to neglect and mistreat his wife.

Beyond the fourth door lies the year 1099.
The Maid tells You that this is the final tale. In this era, You see a young man who claims to be cursed and a girl with white hair, called Giselle, who has been branded a witch and marked for death.

Having borne witness to these four tragedies, each set in a different time and place, You are now free to choose whether You wish to end Your story here... or press on.

But there are those who would say, "You were able to bear them because they weren't your tragedies."
>>
Is it worth hacking in negative prompts for Flux or is the time taken not worth it? Also, if you do use negative prompts, do you prompt the same way as positive prompts aka boomer posting?
>>
File: file.png (1.77 MB, 1024x1024)
>>102062780
https://pastebin.com/KJeGJDmC
>>
>>102062737
dunno, who you mean?
>>
>easydiffusion
>fooocus
>metastable
Which is best and most trusted? I just wanna make some anime art for tavern cards.
>>
>>102062923
you can only trust Comfy with your computer
>>
>>102062923
If you want to use Flux, Forge and ComfyUI are your only options right now
>>
>>102062933
I've heard Swarm has good performance with Flux too
>>
>>102062904
boomer time
>>
File: flux_202408241.jpg (134 KB, 896x1152)
Hello fellow genners. Giving flux a shot.
>>
>>102062907
Negative does not work
>>
File: ComfyUI_00847_.png (1.25 MB, 1024x1024)
>>102062908
thanks bro.

>To determine the number of epochs from the steps, you need to know the total number of images in your dataset and use the following formula:
>\[ \text{Epochs} = \frac{\text{Total Steps}}{\left(\frac{\text{Total Training Images}}{\text{Batch Size}}\right)} \]
wat
ok so 2000 steps / 59 images = 33.89 Epochs?
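
The quoted formula is easy to check in a few lines of Python. This is a sketch that assumes a batch size of 1 (the post does not state it) and uses the 2000 steps and 59 images mentioned above; the function name is invented for illustration.

```python
def epochs_from_steps(total_steps: int, num_images: int, batch_size: int = 1) -> float:
    """Epochs = total steps / (images / batch size)."""
    steps_per_epoch = num_images / batch_size
    return total_steps / steps_per_epoch


epochs = epochs_from_steps(total_steps=2000, num_images=59)  # batch size 1 assumed
print(f"{epochs:.2f} epochs")
```

With a larger batch size each step sees more images, so the same 2000 steps would count as more epochs, for example about 68 at batch size 2.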
>>
File: 00175-2720236127.jpg (107 KB, 528x744)
>>
File: ComfyUI_Flux_0227.jpg (1.8 MB, 3072x5376)
>>102062884
https://files.catbox.moe/ph2dsu.webp

>>102062857
I'd be using fp16 too if it wasn't so inconsistent in performance (24GB card). I'm doing 4x4 slices with NF4 for now. might move to Q8 or something
>>
>>102062945
Schnell

>>102062961
webp?
>>
>>102062949
I was told it did though if you use custom nodes to mitigate the effects of high CFG like burn-in.
>>
>>102062945
can't look worse than nick feldman's gens fr
>>
any way to make images sharper with flux without using a prompt?
>>
>>102062971
>webp?

yup
>>
>>102062972
High cfg and the nodes for it increase gen time

>>102062981
upscale
>>
>>102062950
Yep, seems that way. Is it overfit in your experience? Seems pretty flexible with those two images at least.
>>
>>102062976
stop posting my name on here, it is dox, and harassment. dont post it again.
>>
Which produces better results on average, fp8_e4m3fn vs fp8_e5m2?
>>
wah wah
>>
>>102062944


>>102061695
>>
>>102062004
I wish I had 9 wives so I could make a baby in 1 month
>>
>>102062990
Right so is it worth it to get negative prompts? That is what I am asking. You're implying the only benefit I'd be getting is the high CFG effect on the prompt itself.
>>
shoulda thought about that instead of boardhopping and threadhopping you samefaggy samegens
>>
If my computer shuts down on its own while genning and downloading stuff at the same time, my PSU is too weak, right?
>>
File: file.png (1.69 MB, 1024x1024)
>>
>>102063029
who are you talking to?
>>
>>102062929
anon told me comfy has malicious code
>>
>>102063033
Also to add to this does it still gen and download even when it shuts down?
>>
>>102063013

I think it was proven that it was Nvidia being the jews that they are.
>>
File: flux_cyber-env14.jpg (3.96 MB, 2896x2272)
>>102062976
So its bad? What dont you like about it
>>
he knows
>>
>>102063045
anon...
>>
Yeah well at least I'm only boardhopping and threadhopping unlike you who is doing physical violence to your wife
>>
>>102063019
It's not worth it, the positive prompt is enough

If you can wait for higher gen time then try the negative hack
>>
>>102063045
>computer shuts down from overdraw
>shuts down
>continue genning and downloading
Only my gens are magical enough to achieve this feat, anon. Your measly computer can only run when not shut down.
>>
>>102063072
But when my PS5 is turned off with orange light it still downloads stuff, and the game is still there when I turn back on
>>
File: ComfyUI_00975_.png (2.45 MB, 1536x1152)
>>
>>102063045
No, one of the leds stayed on tho.
I switched PSUs to a newer one. I just want to make sure it was that.
>>
good one, very funny
>>
>>102063068
How much longer are we talking per iteration? Surely it's like a 20-30% increase in gen time, right?
>>
>>102063077
say sike right now
>>
>>102063094
Sike!
>>
File: ComfyUI_01129_.jpg (113 KB, 633x720)
>>102063077
This better be some hardcore baiting
>>
>>102063088
CFG is literally double the work, it's a 100% gen time increase
there are nodes that stop CFG after some threshold but it's still a >60% gen time increase if you want it to have proper effect
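
The "double the work" point follows from how classifier-free guidance is usually implemented: each step runs the model once on the positive prompt and once on the negative one, then extrapolates between the two predictions. A toy Python sketch of that, assuming a hypothetical `CountingModel` stand-in (not Flux and not a real ComfyUI node) and made-up step counts:

```python
from typing import List, Optional

Vector = List[float]


class CountingModel:
    """Toy stand-in for the diffusion model; counts forward passes."""

    def __init__(self) -> None:
        self.calls = 0

    def __call__(self, latent: Vector, prompt: str) -> Vector:
        self.calls += 1
        bias = 0.01 * len(prompt)  # toy dependence on the prompt
        return [0.5 * x + bias for x in latent]


def denoise_step(model, latent, positive, negative, cfg_scale) -> Vector:
    cond = model(latent, positive)
    if cfg_scale == 1.0:
        return cond  # CFG 1: the negative pass is skipped entirely
    uncond = model(latent, negative)
    # Standard CFG combination: push away from the negative prediction.
    return [u + cfg_scale * (c - u) for c, u in zip(cond, uncond)]


def run(model, steps: int, cfg_scale: float, cfg_until: Optional[int] = None) -> int:
    """Run a toy sampling loop and return the number of model calls."""
    latent = [1.0, -1.0, 0.5]
    for i in range(steps):
        active = cfg_until is None or i < cfg_until
        scale = cfg_scale if active else 1.0
        latent = denoise_step(model, latent, "a cat", "blurry", scale)
    return model.calls


print(run(CountingModel(), 20, 1.0))                # 20 calls
print(run(CountingModel(), 20, 3.5))                # 40 calls, +100%
print(run(CountingModel(), 20, 3.5, cfg_until=12))  # 32 calls, +60%
```

In this toy setup, turning CFG off after 12 of 20 steps still costs 60% more model calls than running at CFG 1 throughout.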
>>
Anyway, after changing PSUs, back to genning naked women. Let's see what happens.
>>
>>102063045
Yeah probably it used to do that when I played games, but was fine with a better PSU
>>
>>102063106
CFG is not needed with proper prompting
>>
File: file.png (1.59 MB, 1024x1024)
>>102063077
that is called sleep mode. Computers can do this, but that's not what's happening to you. Hope it's your PSU and not your processor if you have a 13th or 14th gen Intel
>>
>>102063088
50% more
https://www.reddit.com/r/StableDiffusion/comments/1enxcek/improve_the_inference_speed_by_25_at_cfg_1_for/
>>
>>102063103
Yeah sorry couldn't help myself
>>
File: 2024-08-24_00306_.jpg (443 KB, 2560x1440)
>>102063050
very nice gen
>>
File: ComfyUI_00669_.png (629 KB, 768x512)
I managed to put together both a SD1.5-to-Flux workflow and a Flux-to-SD1.5 workflow, but the usefulness in both cases is limited.
SD1.5 can do better compositions and art styles, so I thought it'd be good to generate the initial image on SD1.5, upscale it, and then refine it with Flux, which is better with details. However, given how badly Flux handles art styles without elaborate LLM descriptions, much of the style is lost, and Flux's prompting comprehension goes to waste somewhat because most things are already in place.
The other way round, Flux to SD1.5, benefits from Flux being able to generate at much higher resolutions, so you can then do a second pass with SD1.5 to modify the art style and better define characters that have SD1.5 LORAs. However this loses some of the coherence of Flux's details and doesn't benefit too much from SD1.5 models' stronger styles.

Images in /lmg/ because I got a letter wrong.
>>102062823
>>102062865
>>
Anyone else think in the future people will just wear goggles that do on-the-fly genning for everything in their life, basically turning your shit studio apartment into a mansion, or removing the etching you left in your toilet, basically living a fake baller life and jerking off all day to gens. in the end this will turn very evil and satanic.
>>
>>102063118
>>102063033
Oops meant for this
>>
>>102063144
>wear goggles
nobody wants to wear some shitty goggles.
>>
>>102063144
>basically turning your shit studio apartment into a mansion
you don't need a ML model for this
>>
>>102063144
>skibidi toilet
prophetic
>>
>>102063144
Yes and also the same for old games, they will just remaster games into graphics you want, basically real time mods.
>>
>>102063161

what if you barely felt you had them on?
>>
>>102063106
>>102063134
Oh okay, I see why now. Yeah, that's an unfavorable tradeoff and it makes sense you are all down on it. Hopefully a competitor to Flux will make that possible then.
>>
>>102063144
If you want to entertain the simulation theory for a moment, you will find there are many parallels between Maya as described in Hinduism and the paradigm of generative so-called AI models. If The Matrix or something like it ever exists, it will probably rely heavily on "latent space collapse" content generation.
>>
File: flux_cyber-env05.jpg (2.91 MB, 2080x2720)
>>102063139
Thank you. Was the movie poster bad? If so how would you improve it?
>>
Is the pepe n64 a lora or base flux?
>>
>>102063201
Two loras.
https://civitai.com/models/679189/apu-apustaja
https://civitai.com/models/660136/flux64-n64-and-ps1-game-screenshot-lora
>>
>>102063177
what if my dick travelled back in time and was suddenly attached to Kasias Bunk bed at the Face up Ass Down University?
>>
>>102063198
You're using schnell?
>>
File: file.png (1.41 MB, 1024x1024)
Does Flux work well with tiled upscale?
>>
File: 2024-08-24_00310_.png (1.12 MB, 1280x720)
>>102063198
nah it's fine, but I guess you're using a cut-down model? It loses some details, just depends on what you're aiming at. I'm also wondering what >>102063230
asked.. schnell is okay, but dev really pulls the rug out from under it
>>
>>102063196

I think they are the same entities people talk about when they take dmt hits, the machine elves, its all the same department.
>>
File: 2024-08-24_00309_.png (996 KB, 1280x720)
>>102063233
yes.. see >>102062236 and >>102062611. I made these with SDUltimateUpscale and Flux
>>
File: bloom.jpg (1008 KB, 1400x1112)
>>102063230
>>102063251
I'm using NF4. I only have an 8GB card :/
>>
File: 2024-08-24_00312_.png (1.02 MB, 1280x720)
>>102063274
then those are fantastic results.. glad you can even use it
>>
File: file.png (1.63 MB, 1024x1024)
>>102063272
thanks anon
>>
>>102063274
Try a lora
>>
File: flux_cyber-env03.jpg (3.12 MB, 2080x2720)
>>102063300
Thank you!
>>
so does debo even come around ldg or is he still stuck in sdxl world i cant tell
>>
DO NOT respond to this anon.
>>102063329
He is trying to bring /sdg/ drama here
>>
>>102063329
I'm debo.
>>
how many steps is too many for 50 images of a person including entire body?
>>
>>102063367
No I am debo
>>
>>102063384
Okay you can be debo.
>>
Comfy stopped showing ksampler previews and they never came back
>>
>>102063395
Can't we all be debos?
>>
File: ComfyUI_04342_.png (1.66 MB, 768x1344)
>>
not very comfy
>>
File: file.png (1.63 MB, 1024x1024)
>>
I don't care what you say, SDXL is still great and so is Flux. What a time to be alive.
>>
>>102063223
Thanks
>>
File: 00199-2957302667.jpg (163 KB, 512x768)
euler sampler, best sampler?
>>
>>102063144
>>102063177
what if you're already wearing them, but it's been so long you've forgotten? do you feel it, anon? that tension between your eyes, a slight pressure just below the bridge of your nose, at the sides of your temple? take them off, anon... come back .. remember...
>>
>>102063448
Dunno
>>
>>102063384
No you are not
>>
>>102063458
Go away, I'm batin'!
>>
>open civitai flux lora page
>fuuuuuuuck
some good new shit there tho, obra dinn, miles aldridge, KAREN GILLAN
>>
>>102063013
you can just rent 5 cloud gpus and then just enter prompt and queue on all, you will have gens in no time.
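A rough sketch of what "queue on all" could look like against ComfyUI's HTTP API (POST /prompt with the API-format workflow JSON). The host names, the node id "3", and the one-node workflow are placeholders made up for illustration; a real workflow would be exported in API format, and the seed input may be named differently (e.g. noise_seed) depending on the sampler node. Each host gets its own seed so you don't end up with five copies of the same gen.

```python
import copy
import json
import urllib.request

# Hypothetical rented boxes, each running its own ComfyUI instance.
EXAMPLE_HOSTS = [f"gpu{i}.example:8188" for i in range(5)]

# Placeholder stand-in for a workflow exported in API format.
EXAMPLE_WORKFLOW = {"3": {"class_type": "KSampler", "inputs": {"seed": 0}}}


def build_queue_requests(workflow, hosts, sampler_node_id, base_seed, seed_key="seed"):
    """Build one POST /prompt request per host, each with a distinct seed."""
    requests = []
    for offset, host in enumerate(hosts):
        wf = copy.deepcopy(workflow)  # leave the caller's workflow untouched
        wf[sampler_node_id]["inputs"][seed_key] = base_seed + offset
        body = json.dumps({"prompt": wf}).encode("utf-8")
        requests.append(
            urllib.request.Request(
                f"http://{host}/prompt",
                data=body,
                headers={"Content-Type": "application/json"},
                method="POST",
            )
        )
    return requests


def queue_all(requests):
    """Send every prepared request; needs the hosts to actually be reachable."""
    for req in requests:
        with urllib.request.urlopen(req) as resp:
            print(req.full_url, resp.status)
```

Building the requests is kept separate from sending them, so you can inspect what would be queued before paying for GPU time.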
>>
>>102063448
im not sure between euler and ipndm
>>
File: ComfyUI_00657_.png (1.41 MB, 1024x1024)
>>102063533
>buttchin
>>
File: file.png (1.58 MB, 1024x1024)
I think "black velvet painting" is reducing the quality of the image. Also need to tweak or use a prompt concat on the sd upscale. What's the best way to have comfyui save a higher res image in an immediately postable format (filesize limit)?
>>
>>102063555
it's a flux-ism
>>
>>102063555
just accept them, isnt upscaled yet, I FUCKING AIRBRUSH THEM OUT SOMETIMES HAPPY NOW BIRD
>>
>>102063565
You increased the steps?
>>
I hope there is someone out there training a full finetune with 5000 porn images of varied styles, encompassing all manner of obscure fetishes and including amateur pictures from before the great purge that started with pornhub that removed so much good shit from the internet forever.
>>
File: ComfyUI_01273_.png (1.04 MB, 1280x720)
>>102063448
For me it's ipndm and ays
>>
File: gentleman refueling.jpg (210 KB, 1024x1024)
Almost but not quite there.
>>
>>102063618
ays?
For me it's ipndm/sgm_uniform
>>
>>102063633
Dealing with all those niggas made him age faster from stress
>>
File: file.png (3.2 MB, 1991x979)
>>102063605
Should I? I actually bumped them down and denoise up a bit.
>>
>>102063633
Is there a fucking lora for this?
>>
File: 0.jpg (250 KB, 1024x1024)
>>
I have 12GB VRAM is there any hope for using flux with it? Ideally at like 7 seconds like SDXL
>>
>>102063668
Schnell works best on 4 steps only, try a schnell dev merge
>>
>>102063688
No this is just flux dev.
>>
>>102063711
>Ideally at like 7 seconds like SDXL
you'll get 7 seconds per iteration if you reduce the resolution, is that okay?
>>
File: file.png (1.57 MB, 1024x1024)
>>102063714
I have the gguf one but the results seem terrible in comparison. I may be doing something wrong. I meant for the sd upscale, I'm running base gens at various settings, dev fp8, it feels like I'm losing detail with sd upscale rather than gaining it, but I'm sure this is user error.
>>
>>102063711
Not if you want your image to not suck
>>
>>102063720
What's the prompt?
>>
Why is no one who has time to train LoRAs teaching it concepts it struggles with?
E.g. stabbing
Gore
Middle fingers
Etc.
That would improve the model so that it's dalle tier.
>>
>>102063730
Doesn't seem comfy. I guess it's fine if I'm doing something else and genning in the background. Is Flux that slow even with high VRAM?
>>
File: ComfyUI_00981_.png (2.51 MB, 1408x1408)
>>
File: ComfyUI_04361_.png (1.55 MB, 1024x1024)
>>
File: 2024-08-24_00344_.jpg (680 KB, 2560x1440)
The eternal road of all 1girls goes only into one direction and the goal is VRAM. Praise VRAM eternally!
>>
>>102063744
Dev needs 20 steps at least, yours looks like schnell
>>
File: file.png (15 KB, 517x194)
>>102063818
I'm doing 20-30 steps and trying various samplers. I think it's a prompting issue.
>>
>>102063767
This photograph captures a scene at a gas station at night, featuring a man dressed in a blue suit with a white shirt and no tie, which is unbuttoned halfway down, giving a casual, disheveled look. He has a cigarette in his mouth. The sleeves of his suit are rolled up. He is wearing glasses. He has short, slicked-back blonde hair and is holding a gas pump nozzle with his left hand as he is refueling his car while his right hand is on top of his car. He is looking to the left. Half of the backside of his grey car is visible, the car's tail lights are illuminated. The car is to the right and the man is to the left of it. The man is standing in front of a gas station sign displaying prices for different fuel types, including "e85" for "1.799", "regular" for "2.249 , and "MILK skim" for "2.99". The top of the gas station sign reads "Family Express" in large, bold letters, with a blue background. The background is dark, indicating it is nighttime, and the gas station lights are visible, casting a soft glow. The photograph has a candid, spontaneous feel, with the man's expression appearing slightly confused or determined. The image is taken from the perspective of someone sitting in a car, with the car's interior visible in the foreground, including part of the dashboard and steering wheel.

I used joycaption on the original image and tweaked it. It's not fully accurate, but I gave up on it since I can't get good compliance on all the elements.
>>
>>102063831
What's the prompt?
>>
>>102063775
Flux is 4 times larger than XL, it's a lot of numbers to crunch through.
For comparison, a 4060 Ti 16GB gets ~2.4s/it at the default 1024x1024 resolution. And there is no CFG by default, so no negative prompt; enabling it would double the time.
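To put rough numbers on that, a back-of-the-envelope sketch: time per image is just steps times seconds per iteration, and real CFG means two model passes per step. The function name is made up for illustration, and the 20 steps figure is the "dev needs 20 steps at least" rule of thumb from earlier in the thread, not a benchmark.

```python
def seconds_per_image(sec_per_it: float, steps: int, use_cfg: bool = False) -> float:
    """Estimate wall time for one image, ignoring model load and VAE decode."""
    passes = 2 if use_cfg else 1  # CFG runs a cond and an uncond pass each step
    return sec_per_it * steps * passes


# 4060 Ti 16GB figure quoted above: ~2.4 s/it at 1024x1024, 20 steps.
print(f"no CFG:   ~{seconds_per_image(2.4, 20):.0f} s")
print(f"with CFG: ~{seconds_per_image(2.4, 20, use_cfg=True):.0f} s")
# The 12GB case mentioned earlier: ~7 s/it at reduced resolution.
print(f"7 s/it:   ~{seconds_per_image(7.0, 20):.0f} s")
```

So even on the faster card a 20-step gen is closer to a minute than to the 7 seconds people are used to from SDXL.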
>>
File: file.png (1.22 MB, 1280x896)
>>102063767
Flux is a natural language model, you type like it's a person with severe autism.
>>
>>102063845
This is a captioned image - there is a bar across the bottom with text, and no text anywhere else.
The image is a vivid 1960s fantasy painting of a vivacious wizard with a long, flowing beard and mustache wearing a touristy getup, sun visor, reflective sunglasses, hawaiian shirt, shorts, sandals, reclining in a chair on the beach. He looks extremely happy. Letterpress text at the bottom reads: I cast "Conceal Erection"

just having fun, in case that wasn't clear. I think it's the 1960s fantasy thing that's doing it. Removing 'black velvet painting' did help somewhat, even if it made the style a bit more samey.
>>
>>102063868
>and no text anywhere else.
negatives don't really work, were you getting text "anywhere else" before you put this in?
>>
File: ComfyUI_00861_.png (1.1 MB, 1024x1024)
ok I uploaded the Kasia LoRa

https://civitai.com/models/682483?modelVersionId=763948
>>
>>102063846
SDXL bloats its parameters, it's like 8x smaller than Flux.
>>
>>102063889
May be placebo, but yeah, watermarks. Flux/t5xxl seems much less prone to not understanding negatives.
>>
File: ComfyUI_01437_.png (982 KB, 768x1024)
>>
File: ComfyUI_Flux_Dev_00374_.png (2.7 MB, 1536x1024)
>>102063857
niggers could be anywhere.
>>
>>102063909
Didn't on rerunning that particular prompt, but also I don't think it's harming the image at all. Certainly not making more text like it would with really stupid/naive text encoders
>>
>>102063448
Yes, the default sampler just works, but some models require a different sampler to not get overcooked results.
>>
File: file.png (1.77 MB, 1024x1024)
>>102063890
>>
>>102063890
not downloading this, teens shouldn't be sexualized
>>
>>102063932
Where should that go in the image?
>>
File: ComfyUI_00859_.png (1.38 MB, 1024x1024)
>>102063991
She isnt a "Teen", thats just her Stage name.
>>
>>102064013
It's still sexualizing the idea of "teen" and therefore wrong.
>>
>>102064013
>St(((age))) name
>>
File: ComfyUI_01442_.png (828 KB, 768x1024)
>>
>>102064037
no it isn't d*bo moralfag
>>
After extensively trying out every single promising anime model, I believe neta v2 is the best one.
>>
File: ComfyUI_00984_.png (2.17 MB, 1536x1152)
>>
>>102063144
The world is already very evil and satanic right now? What's a little VR added to the foundations of Satan's empire?
>>
File: 0.jpg (351 KB, 1024x1024)
The Seven Seals.
>>
File: 00229-1563566622.jpg (37 KB, 256x384)
>>102063991
agreed. and it has no place here.
>>
File: ComfyUI_00690_.png (3.38 MB, 2048x1536)
>>102063143
Messing around with it a bit more, the SD1.5 -> Flux thing isn't so bad. Still probably not worth it.
>>
>>102064078
yes and the kids playing forknite and listening to black sabbath backwards don't help
>>
says the degen hopper koff
>>
Hit that next fresh loaf of...
>>102064103
>>102064103
>>102064103
>>
File: 2024-08-24_00316_.png (1.32 MB, 1280x720)
>>102064095
Biblically accurate.
>>
>>102064037
Then dont sexualize her, you can create anything you want with that LoRa.
>>
File: 4step_up_00065_.png (1.71 MB, 1536x1536)
Turned out better than I expected
>>
File: ifx192.jpg (236 KB, 1024x1024)
>>
File: file.png (1.9 MB, 1024x1024)
>>102064065
thanks for the rec
>>
>>102064134
maybe you go there, not i, you sickie
>>
>>102063890
Thanks fren. A piece of internet history.
>>
lies, i shouldn't be surprised really
disingenuous for 2 years
>>
>>102061907
Hello from Valencia, knowyourmeme works fucking great. Telefónica, which of course is government controlled because socialist shithole, only blocks download stuff and RT
>>
File: 1724496449.png (1.54 MB, 1024x1024)
>>
File: 1724494884.png (1.35 MB, 1024x1024)
>>
>>102062236
Still suffers from the "must make cfg happy" repetition slop. The same shapes repeated over and over
>>
>>102064325
there's a fucking harlequin baby lora?
>>
>>102064410
No, it's my own lora of a woman with just a mouth on her face. The harlequin baby look was unintentional.
>>
File: ifx218.jpg (342 KB, 1024x1024)
>>
>>102064038
She's almost 40
>>
File: ifx230w.jpg (526 KB, 1600x1600)
>>
File: ifx233.png (1.24 MB, 1024x1024)
>>102062031
>>
>>102064838
Cool




All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.