[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: the tallest dick general.jpg (3.62 MB, 1954x3264)
3.62 MB
3.62 MB JPG
Discussion of free and open source text-to-image models

Previously baked bread : >>103016063

Tallest Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://tensor.art/models
https://liblib.art
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3

>SD3.5L/M
https://huggingface.co/stabilityai/stable-diffusion-3.5-large
https://replicate.com/stability-ai/stable-diffusion-3.5-large
https://huggingface.co/stabilityai/stable-diffusion-3.5-medium
https://huggingface.co/spaces/stabilityai/stable-diffusion-3.5-medium

>Sana
https://github.com/NVlabs/Sana
https://sana-gen.mit.edu

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
File: ComfyUI_02781_.png (1.46 MB, 1024x1024)
1.46 MB
1.46 MB PNG
>>
Blessed thread of frenship
>>
File: ComfyUI_02763_.png (1.28 MB, 1024x1024)
1.28 MB
1.28 MB PNG
>>
Last night was a laugh riot, new extra venv, new comfy install with all the trimmings.
Could not figure out why it wasn't saving gens into my used-for-a-year nice neat subfolders directory setup....
yeah, took me a frustrating hour to figure out. I am an idiot. (it was in the new comfy folder ofc).
Mochi doesnt know Hatsune Miku :(
>>
>>103024409
>Mochi doesnt know Hatsune Miku :(
I think that's the first model in existance that doesn't know Migu, that's impressive lol
>>
>tallest edition
ok
>>
ok I'm retarded so excuse the dumb question but is turning images into tensors with rbg values scaled down by 255 a normal thing for apps to do or is it specifically a comfyui thing? or is it somehow necessary for ai image gen

because for writing comfy nodes to do a little image processing it doesn't feel easier
>>
File: ComfyUI_02790_.png (1.43 MB, 1024x1024)
1.43 MB
1.43 MB PNG
>>
>>103024437
I recall you were trying to get the perspective from the feet level but not having success, have you considered them "standing on a glass floor" and describing the camera rising up through it?
>>
>>103024458
>have you considered them "standing on a glass floor" and describing the camera rising up through it?
I have not considered that but now I will try this, thanks
>>
File: ComfyUI_02791_.png (1.4 MB, 1024x1024)
1.4 MB
1.4 MB PNG
>>
File: ComfyUI_02702_.png (1.06 MB, 1024x1024)
1.06 MB
1.06 MB PNG
>>
doesn't really work. It's ok all models struggle with extreme from below or birds eye view shows. I tried to do a couple of prompts or them lying down from above and it was body horror. I have enough good from below shots anyways I think
>>
>>103024661
I hope the HD version will be better
>>
>>103024661
That's a shame, but models probably dont have much training data on "shots from foot level", but one has to try these things.
>>
what happened to /sdg/, were all the regulars culled?
>>
File: file.png (3.3 MB, 2048x1024)
3.3 MB
3.3 MB PNG
https://github.com/Jonseed/ComfyUI-Detail-Daemon
Really cool
>>
>>103024694
anon, this is /ldg/
>>
File: 091956_00001.webm (1.45 MB, 848x480)
1.45 MB
1.45 MB WEBM
>>103024431
reposted with better decoding settings
>>103024431

>High definition video of Hatsune Miku skateboarding down a street towards the camera in New York

The model needs to have her added.
>>
This was so close to great damn. On another note, I'm getting gens that just say "Failed" and they use one of my credits. This is a new thing they updated the site with, I'm assuming to keep girls from being generated too young and skimpy

>>103024668
>I hope the HD version will be better
This is the HD version, I'm using the website

>>103024682
>That's a shame, but models probably dont have much training data on "shots from foot level", but one has to try these things.
Yeah but limitation breeds creativity. Maybe I get get step on me mommy energy in a different way
>>
>>103024722
>This is the HD version, I'm using the website
are we sure this is the HD version, maybe they just upscaled their model, it's not supposed to be at 960x resolution, the HD version will be at 720p, they're probably using the same model as ours except that they don't use tilted VAE because they have more vram, that's why it looks better to them than us
>>
>>103024722
>mommy
Hmm, perhaps the foot crushing something small, like a grape or olive if it's a bar scene at the start of the prompt?
>>
>>103024738
>the HD version will be 720p
Source? It makes sense that it would be double the resolution of the 480p version, and they can't quite call it full HD/1080p since it's not there yet

>>103024739
No anon I just meant the energy, I'm not trying to generate crushing porn lol this is for a music video/cyberpunk world building
But thank you for engaging and for the suggestion. Id be interested in seeing how well mochi can do crushing content for curiosity's sake
>>
>>103024747
>Source?
https://www.genmo.ai/blog
>Mochi 1 HD will support 720p video generation with enhanced fidelity and even smoother motion, addressing edge cases such as warping in complex scenes.
>>
>>103024751
thanks for the source, hopefully the website's model is indeed 480p plus an upscale so I have something to look forward to
>>
File: file.png (77 KB, 767x813)
77 KB
77 KB PNG
>>103024700
I like that, really cool node if you feel Flux displays oversimplistic images, I gave you my settings but I'm just getting started, can be definitely improved
https://imgsli.com/MzEzOTc4
>>
File: 093838_00001.webm (435 KB, 848x480)
435 KB
435 KB WEBM
>>103024722
Test run: 50 steps fp8 (went a bit overboard on the juice)

>Under the ultraviolet pink and blue glow of holographic advertisements in a futuristic bar, a womans shiny latex boot crushes a grape beneath her boot, spurts of juice, the camera pans up her athletic latex and cybernetically-enhanced body to her smiling face

From this 1 sample size, i dont think it understands in the way we do.
>>
>>103024928
spurting in general is a challenge for video models
>>
looks like mochi is aggressively banning youthful prompts now and still counting it as a gen, so i guess I'm done for now. I think I have 40ish seconds worth of video to make a proof of concept for my art project at least
>>
File: 1699482991902839.png (1.57 MB, 1227x940)
1.57 MB
1.57 MB PNG
flux chuds won

https://civitai.com/models/743311/forever-flux-or-andrea-botez-or-the-100-most-beautiful-faces-no8
>>
File: trick.gif (1.44 MB, 454x212)
1.44 MB
1.44 MB GIF
>>103025148
Yes, flux is perfect and this woman is so beautiful that we have to celebrate her lora.
We will never find a woman ten times better just by walking down the street.
>>
>>103024250
Epic
>>
>>103024694
turns out ""people"" acting like attentionwhoring retards in an anonymous imageboard drives said imageboard's userbase away; who would have thought?
anyway, when that shithole inevitably dies, do not give the avatartroons a single inch; let them rot in the festering, stool-filled abscess they made for themselves
>>
File: spooky.png (1.51 MB, 2240x655)
1.51 MB
1.51 MB PNG
Prompter beware! You're in for a scare! Be careful with your choice of Halloween costume...
>>
File: bliss.png (1.34 MB, 1365x1024)
1.34 MB
1.34 MB PNG
kek
>>
File: i724.png (1.07 MB, 1024x1024)
1.07 MB
1.07 MB PNG
>>
File: 325.jpg (169 KB, 1024x768)
169 KB
169 KB JPG
been away for a few months, anything new and exciting happen?

Seems like no? some crappy new UI (REforge) and SD 3.5 which looks marginally better
>>
>>103026440
>SD 3.5 which looks marginally better
https://www.youtube.com/watch?v=OJy6bJ_RxXg
>>
>>103026440
>been away for a few months, anything new and exciting happen?
Not much, I feel it'll take years to surpass Flux dev, that's how good this model is
>>
>>103026440
SD 3.5M is an upgrade to SDXL and it's definitely much faster to train, at least twice as fast just at a raw efficiency level, 1024px images can be trained at batch 2, 4 seconds per batch on a 4090 (no cache). You can expect real porn models on it soon.
>>
>>103026523
i don't believe it, the chink video maker for one is something that should get an equivalent soon
>>103026525
not really interested in training my own models but cool to hear
>>
>>103026564
>the chink video maker for one is something that should get an equivalent soon
you talk about the Mochi guys?
>>
>>103026583
nay I meant this
https://hailuoai.video/
>>
>>103026601
what makes you believe they'll make a local image model? MiniMax is API only
>>
>>103026150
my wife
>>
>look at llm fags (who im totally not one of btw), they think about it like ABC while we, img fags, think about it like XYZ (which is vastly inferior). if only you I mean we thought about it like they do, we would be in a much better spot, because we I mean llm fags are just so much better you know?
>>
File: i727.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
>>103026817
kek
>>
>>103026969
this, unironically
>>
>yeah so what if LLMs can be used anywhere that text is used
>why isn't Meta training a 70B porn image model?
>>
File: ComfyUI_temp_hadmj_00001_.png (3.49 MB, 1152x1920)
3.49 MB
3.49 MB PNG
>>
>>103027463
I wanna make sex with chinamen
>>
>>103027479
he's obviously japanese, retard
>>
File: ComfyUI_00435_.jpg (2.72 MB, 1120x4520)
2.72 MB
2.72 MB JPG
>>
>>103027502
Ni hao my nigger
>>
File: ComfyUI_temp_hadmj_00004_.png (3.25 MB, 1152x1920)
3.25 MB
3.25 MB PNG
>>
For training SD 3.5M seems like 4e-6 is the sweet spot
>>
>>103028045
post test results?
>>
>>103028064
I'm still training, but I was mostly observing the rate of change between different learning rates. 1e-6 is way too slow, 1e-5 is way too rapid and will make monstrosities.
>>
File: ComfyUI_13534_.png (1.2 MB, 776x1024)
1.2 MB
1.2 MB PNG
Honestly already bored of video models, looking forward to interactive world models.
>>
File: 1711317792483965.jpg (1.26 MB, 1304x1904)
1.26 MB
1.26 MB JPG
>>
File: 1709355031941538.jpg (1.11 MB, 1304x1904)
1.11 MB
1.11 MB JPG
skeet
>>
>>103028082
good luck anon :3
>>
Is flux hard to set up to run at home? I'm a civitai proompter but flux is too expensive to run there so I thought I'd try some gens on my rtx2070.
>>
>>103025782
Desu appropriate
>>
>>103028601
>rtx2070
that's a 8gb vram card, you can run flux if you go for a Q4 quant I guess guess
>>
File: file.jpg (1.6 MB, 3647x1232)
1.6 MB
1.6 MB JPG
Midjourney-Niji il really a special model, so far no one even got close to it in terms of making an anime image that doesn't look like AI slop
>>
>>103028601
You'll have more fun with Pony/SDXL on Forge UI. Flux is a tad bit too slow on 8vram, but it's doable.
>>
>>103028744
skill issue
>>
Man, even after huge new model release, sd /g/ threads are fucking dead. Guess SD really is only good for coom...
>>
>>103028744
ngl, based on this comparison they really have something good going on behind the scenes
>>
>>103028766
>huge
debatable
>>
>>103028766
Always been. Quite some time ago I used to look it up just to check up on how bad the situation is, but even that's gotten boring.
>>
>>103028769
>they really have something good going on behind the scenes
they just train their models with every good anime images they can find, the rest of the field go for """ethical datasets""", MJ is like the chinks, they know that to get a good model you need good images, and they don't give a fuck about the artists fee fees, as it should
>>
>>103028769
The secret is they used screenshots from real anime.
>>
>>103028797
>>103028793
i don't think training quality foundational models is as easy as "just pick good images bro"
>>
>>103028813
it is though, we're talking about millions of good quality pictures, if you train a model well, it'l be good, if you train your model with slop, it'll be good at slop, garbage in, garbage out
>>
>>103028813
Actually it is. If you do fps=1 screenshots of every anime, threw them into any VLM (ideally with the name of the movie in the prompt) and then finetuned it you'd be shocked how good the end images look.
>>
>>103028839
imagine you train a model with every single frames of every animes that exist, how many frames would that be? I wonder how many images Flux can eat before saturation
>>
>>103028744
Pixelwave looks better

that man in the MJ looks horrible, look at hands, legs, etc, if you added noise and lower the contrast in the pixelwave it will look same as the MJ one, don't lie to yourself anon
>>
>>103028793
I'm not sure that's entirely the case. Based on this one comparison it seems to be more coherent, it could be better trained parameters, or it could be more of them. Whatever they're doing is doing a similar jump to that of comprehensible text, except here it's about composition.

I'm often testing new and niche models/mixes for Pony, and even there I'm seeing huge jumps in this "compositional coherency", so it might be a matter of better trained parameters indeed. For example I'm seeing way better compositions with something like Hadrian Delice Stylized. Other models and mixes tend to be just a different style flavour of the same base composition you'd see in something like Autismmix, meaning their fine-tuning mostly touches just the surface and doesn't influence it's internal logic or understanding of pixel context.
>>
>>103028871
It's a lot of frames (24 FPS * 3600 is 86400 images per hour) and likely the model would get a lot smarter because it'll be exposed to a lot of spatially changed similar images. Videos are a goldmine for extremely high quality and high volume images.
>>
File: file.png (9 KB, 646x136)
9 KB
9 KB PNG
>>103028875
let's not pretend that the MJ niji style isn't appealing, would be disingenuous to do something like that
https://civitai.com/search/models?sortBy=models_v9&query=niji
>>
>>103028912
meme model overhyped by social media
>>
>>103028904
>86400 images per hour
>86400 images per 3 episodes
>let's say a regular anime has 3 seasons + 24 episodes
>2073600 images per anime
>there are roughly 12000 animes that existed
>24883200000 images total (24.8 billions images)
Holy fuck kek
>>
>>103028170
can i skeet on her?
>>
File: file.png (1.64 MB, 1280x720)
1.64 MB
1.64 MB PNG
>>103028941
it's not overhyped, it's legit good, I love MJ niji style, it looks like real anime drawings
>>
>>103028970
Throw 20,000 high quality anime images at SD 3.5M and you'll get the same model.
>>
>>103028970
What's the purpose of your posting? Do you think you'll get any answers that are not "just train a lora, skill issue" here?
>>
>>103029002
He's the resident MJ shill.
>>
>>103028970
the image on the left is very bad, just lower the contrast of your gens and add some blurry filter, some noise and you will get a similar aesthetic
>>
File: tmprfaek1pi.png (1.06 MB, 896x1152)
1.06 MB
1.06 MB PNG
>>
I'm the resident Pony shill, just not vanilla and most finetunes/mixes. It's a rough gem alright, just needs some elbow inpaint grease.
>>
>>103029009
What's wrong with saying that MJ is good? You can enjoy local and at the same time wishing it could be as good as its API rivals, that's not possible?
>>
>>103029029
Because you're actually a bad faith concern troll.
>>
>>103028952
They don't use all frames because there a high chance that the frame x and frame x+1 are too similar. They program checks to see if there is enough variation before getting the frame.
>>
>>103029009
I don't even disagree with him, but this is literally /local/ general. Whining won't get you a better local model.
>>
bullshit ass fuck dumb cunt journey, more like
>>
>>103029050
The problem is a lot of bullshit he says is subjective opinion about what good anime looks like. It's just a flavor of ice cream, not some impossible standard.
>>
>>103029023
Vanilla Pony is somehow still the king of proompting.
>>
>>103029062
>It's just a flavor of ice cream, not some impossible standard.
Bullshit
>>
>>103029068
Never managed to tardwrangle vanilla, though I've had issues with vanilla autism too. DPO did the trick for me, but at this point I can sometimes find a finetune or mix that on average does a better job than either of the two.
>>
>>103029079
There is nothing special in those images that Flux for example can't do with a Lora. Don't confuse aesthetic flair with technicals.
>>
>proompting
>>
>>103029089
>There is nothing special in those images
there is
>>
>>103029062
Honestly, I think local was extremely behind with anime up until Illustrious and Noobxl and if you think Pony was nearly as good as NAI or Niji, you're pretty much coping.
>>
File: 1716166545443056.jpg (1.09 MB, 1304x1904)
1.09 MB
1.09 MB JPG
>>103028959
yes
>>
>>103029096
No one said Pony was on part with MJ because we all know that MJ is at least a 3B model and SDXL is a literal piece of shit.
>>
>>103029090
Anon's got a point, it's a coomer infested environment.
>>
>>103029106
>we all know that MJ is at least a 3B model and SDXL is a literal piece of shit.
SDXL is a 2.7b model though, it's not that small
>>
>>103029106
Pony's issue wasn't SDXL though. SDXL is crap but it can do much better than Pony.
>>
>>103029114
It's almost like parameters aren't created equal.
>>
>>103029106
>we all know that MJ is at least a 3B model
>>103029121
>It's almost like parameters aren't created equal.
so parameters matters or not?
>>
>>103028871
I worked it out according chatgpt earlier this year, it estimated 60,600 hours of anime has been produced.
>>
>>103029114
If that's the case, the parameter count could make for a huge difference in coherence/complexity, even if it's arguably not that big a size difference. I've seen the jumps between smaller LLMs, so I wouldn't be surprised if the same applied here.
>>
>>103029132
MJ is a modern architecture 3B model. SDXL is a shitty unet CLIP model.
>>
>>103028656
>>103028748
I guess I will give it a shot.
>>
>>103029140
>MJ is a modern architecture 3B model. SDXL is a shitty unet CLIP model.
Also noteworthy. SD had an awful background in terms of encoders, I've seen Laion and it was a fucking mess, so I wouldn't be surprised if CLIP wasn't much better either.
>>103029155
If you go for Forge, linkrel might be the simplest to run:
https://civitai.com/models/638187/flux1-dev-hyper-nf4-flux1-dev-bnb-nf4-flux1-schnell-bnb-nf4
You pretty much just paste it into the model folder, switch to Flux mode up top and off you gen at roughly 1 image a minute.
>>
File: ComfyUI_02796_.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>
>>103029140
>MJ is a modern architecture 3B model. SDXL is a shitty unet CLIP model.
Idk man, Midjourney Niji v6 was released in december 2023, it's probably a unet model, DiT wasn't a thing back then
>>
How much have they really proven to us about MJ? Can't we only guess at it's innerworkings? It's not like they've released any papers or anything, right?
>>
Speaking of DiT, anyone bother to share some insight on it? I know nothing, other than it having been used back in Hunyuan.
>>
>>103029179
Pixart Alpha used DiT and it released its paper Sept 2023. Maybe you're just ignorant and a retard. It's okay not to talk when you have no clue what you're talking about.
>>
>>103029208
Ok so you're a 2 digit IQ retard, I said that DiT wasn't a thing, meaning that it wasn't mainstream, your anecdotical Pixal Alpha release don't change that fact, that field was dominated by unet in 2023, whether your monkey brain likes it or not
>>
File: 00016-2785730321.png (1.62 MB, 896x1152)
1.62 MB
1.62 MB PNG
anyone else just typing random things, quotes or movie titles so see what they get?
>long live the new flesh
>>
>>103029231
Sometimes I prompt with song lyrics I'm currently listening to, though mostly a random bunch of it, or the most catchy ones.
>>
>>103029231
Song lyrics and bible verses have been and will always be kang
>>
>>103029200
It's just a Transformers layers model. It's basically a stack of Attention and Cross Attention layers that process a stream of tokens that represent parts of the image and the caption.
>>
File: ComfyUI_13572_.png (1.36 MB, 776x1024)
1.36 MB
1.36 MB PNG
>>103028744
The magic of Midjourney is that they have an LLM to refine your prompt to get the best possible image with their model, meanwhile for SD and Flux it takes exactly what you put in
>>
>>103029191
illustrious and noob show us that properly training your model with a good dataset goes a long way, that and maybe they have some external methods to help with details like that detail daemon thingy for flux
>>
File: ComfyUI_02800_.png (1.28 MB, 1024x1024)
1.28 MB
1.28 MB PNG
>>
>>103029263
Kudos.
>>
>>103029224
DiT was originally published in 2022. I know you're a fucking faggot, but here's a clue: often multiple teams can work on similar ideas. I know you're desperate to be right, but you're wrong. Also three months is enough time to train a model from scratch if they saw the Pixart paper and said WOW THAT IS GOOD.

USE YOUR BRAIN.
>>
File: 00019-335776529.png (1.29 MB, 896x1152)
1.29 MB
1.29 MB PNG
>I've seen things you people wouldn't believe. Attack ships on fire off the shoulder of Orion. I watched C-beams glitter in the dark near the Tannhäuser Gate. All those moments will be lost in time, like tears in rain. Time to die.
>>
>>103029264
>The magic of Midjourney is that they have an LLM to refine your prompt to get the best possible image with their model
is it? I often do some boomer prompting with chatgpt, and it doesn't suddently makes the image more aesthetic, it just helps to make the model follow my initial prompts better
>>
At this rate I'm getting popcron, it's kino /ldg/ banter hours.
>>
COZY thread
>>
back in the pixart days i'd use the banter as prompts to get the most kino outputs
>>
>>103029283
>I know you're desperate to be right, but you're wrong.
How can I be wrong? You and I have no idea what architecture was used on MJ, you're just sperging on assumptions, not facts, that's expected from a 2 digit IQ monkey though so not that surprising.
>>
>>103029304
I am literally correcting your literal ass backwards assertion that DiT didn't exist in Dec 2023. Kill yourself.
>>
pixart sexuals live on in eternity
>>
>>103029317
>your literal ass backwards assertion that DiT didn't exist in Dec 2023
not my fault if you can't proprely read english, I already explained that saying that "DiT wasn't a thing" meant "not mainstream"
>>103029224
>I said that DiT wasn't a thing, meaning that it wasn't mainstream
And yet you continue to pretend that I meant "it doesn't exist", how much of a retard are you?
>>
>>103029014
good image
>>
File: 00021-3647112580.png (1.68 MB, 896x1152)
1.68 MB
1.68 MB PNG
>It Doesn’t Feel Pity, Or Remorse, Or Fear, And It Absolutely Will Not Stop, Ever, Until You Are Dead!
kek wtf
>>
>>103029341
me begging for anon to post more gens
>>
>>103029339
I know you only do things if they're mainstream but someone running an AI company might be a little ahead of the curve. I don't even know why I talk to you, I'm pretty sure you're debo, he always had ass backwards opinions.
>>
>>103029351
>I'm pretty sure you're debo, he always had ass backwards opinions.
looooool, I thought you were debo too, so you're just someone as retarded as him, god, why did you create a clone of that retard in this place...
>>
>>103029341
what i imagine the average flux 1girl poster to look like
>>
>>103029281
Does she come with a self-cleaning hole?
>>
From now on /niji/is
>>
File: 00022-4074244207.png (1.76 MB, 896x1152)
1.76 MB
1.76 MB PNG
>to the last I grapple with thee; from hell's heart I stab at thee; for hate's sake I spit my last breath at thee.
>>
>>103028744

regarding pixelwave + loras:
https://civitai.com/articles/8505
>>
>>103024700
>(but also works with SDXL, SD1.5, and likely other models).
very interesting ty anon
>>
File: ComfyUI_13977_.png (1.05 MB, 776x1024)
1.05 MB
1.05 MB PNG
>>103029290
That's because ChatGPT doesn't know the best prompting method for Flux or SD. It doesn't know how the images were captioned, it doesn't know the training data, it doesn't know anything about the model. Meanwhile midjourney trains their own LLM with exactly the same data their image model is trained so it knows everything about the model and how to prompt it to get the best results, the image model and the LLM work in tandem.
>>
>>103029383
what else do you put in your filter?
>>
>>103029495
If I told you you would try to circumvent it. But over time there are certain key phrases that guarantee it's a low IQ phoneposter.
>>
>>103029377
of course
>>
File: 00028-3960327928.png (1.86 MB, 1152x896)
1.86 MB
1.86 MB PNG
>All is forgotten in the stone halls of the dead. These are the rooms of ruin where the spiders spin and the great circuits fall quiet, one by one...
>>
>>103029384
would've been better if it was a love-hate relationshop where they caress eachother's cheek with one hand, and hold weapons in the other
>>
File: 00033-3626336574.png (1.5 MB, 1152x896)
1.5 MB
1.5 MB PNG
>We may ask what is relevant but anything beyond that is dangerous. He is a liar. The demon is a liar. He will lie to confuse us. But he will mix lies with the truth to attack us. The attack is psycological, Damien, and powerful.
>>
>>103029440
fair enough, I'm sure there's some models where we know what caption models they used, I don't remember which one though
>>
File: 00034-2851984593.png (1.48 MB, 1152x896)
1.48 MB
1.48 MB PNG
>A picture of the entity that lives within the dataset of flux
>>
File: 0.jpg (279 KB, 896x1312)
279 KB
279 KB JPG
>>
File: 0.jpg (454 KB, 864x1248)
454 KB
454 KB JPG
>>
>>103024977
what's with all these grandmothers ITT
>>
>>103029752
The buttchin entity
>>
File: 1722748677725025.jpg (511 KB, 1920x1259)
511 KB
511 KB JPG
>>103029902
Looks like the Brazil flag rotated. It even this white streak going across the blue circle.
>>
>>103029947
kek
>>
File: ComfyUI_04860_.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
tried to generate Neuro sama just by describing her with a wall of text,

SD 3.5 large model
>>
File: tmpcsxzz2uh.png (1.17 MB, 1152x896)
1.17 MB
1.17 MB PNG
>A picture of the entity that lives within the dataset of this pony finetune
should've expected it
>>
File: 1700673220250765.jpg (225 KB, 1024x1024)
225 KB
225 KB JPG
>>103029440
>>103029264
>>103028119
would you box one or more of these up, please?


Kind anons, I'm looking for box info on picrel. I saved it nov 22, 2023. I need the box info to gen the same character, but a mouse.
>>
File: 0.jpg (310 KB, 864x1248)
310 KB
310 KB JPG
>>103029948
yeah, kind of does look like it a litte.
>>
>>103030070
sovl
>>
File: tmpcsxzz2uh.png (1.32 MB, 1152x896)
1.32 MB
1.32 MB PNG
>A picture of the entity that lives within the dataset of this SDXL finetune
a bit better
>>
>>103030112
nice
>>
File: file.png (8 KB, 489x83)
8 KB
8 KB PNG
see you on the other side
>>
File: ComfyUI_04872_.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
And this is made with flux, same textwall
>>
>>103030259
Godspeed
>>
File: tmpt3iz62j2.png (1.05 MB, 1152x896)
1.05 MB
1.05 MB PNG
>A picture of the entity that lives within the dataset of this different SDXL finetune
>>
File: ComfyUI_04875_.png (1.55 MB, 1024x1024)
1.55 MB
1.55 MB PNG
And this one with the flux oil painting lora
>>
>>103030333
horrifying artstyle, feels like the corporate flavour of aigen, devoid of soul, as if we didn't have to argue about it being capable of sovl to begin with
>>
>>103030357
I think the intention is to be creepy
>>
File: ComfyUI_14195_.png (925 KB, 776x1024)
925 KB
925 KB PNG
>>103029947
The Pixelwave 3 finetune has pretty much fixed the buttchin problem

>>103030068
https://files.catbox.moe/j61xif.png
>>
>>103029936
>what's with all these grandmothers ITT
Robots cannot reproduce and therefore cannot be grandmothers
>>
File: 812219896.png (1.37 MB, 1216x832)
1.37 MB
1.37 MB PNG
>>
>>103030386
Instead it's just generic AI Flux face.
>>
File: 00063-2370275969_cleanup.png (1.21 MB, 832x1216)
1.21 MB
1.21 MB PNG
>>
File: 1286847866.png (1.65 MB, 1216x832)
1.65 MB
1.65 MB PNG
>>
>>103030476
nice
>>
File: ComfyUI_14234_.png (908 KB, 776x1024)
908 KB
908 KB PNG
>>103030480
Nah, it's actually pretty good at face variety as well. It basically fixes all the problems flux had.
>>
>>103030386
>>103030511
You can tell it's yearning to give her a buttchin desu
>>
>>103030511
Generic AI Face #2
Are you going to give me an actually different face?
>>
File: ComfyUI_14243_.png (987 KB, 776x1024)
987 KB
987 KB PNG
>>103030561
There you go, enjoy
>>
>>103029974
"Heart!", "well done"
>>103030261
>>103030333
"filtered"
>>
File: 809.png (1.92 MB, 1024x1024)
1.92 MB
1.92 MB PNG
>>
>>103030690
nice
>>
File: 540825733.png (1.43 MB, 1216x832)
1.43 MB
1.43 MB PNG
>>103030509
thx
>>
>>103030614
Face #2 with black skin tone
AN ACTUALLY DIFFERENT FACE, PLEASE
>>
>>103030757
I think you might have autism it's clearly different faces.
>>
>>103030768
You must have autism because they're clearly not.
>>
File: 811.png (1.91 MB, 1024x1024)
1.91 MB
1.91 MB PNG
>>
>>103030690
sovl
>>
>>103030776
A telltale sign of autism not being able to perceive social aspects like facial recondition, to anyone normal they can see the variety and differences, for you they somehow look the same. I think you need to get tested or something, my condolences.
>>
>>103030784
what is this kino model?
>>
its midjourney. dont worry, sd5 will catch up
>>
File: 00078-1794760280.png (1.24 MB, 832x1216)
1.24 MB
1.24 MB PNG
>>103030776
>>103030802
>autists arguing over who's more autistic
>>
>>103030802
I knew that a 1girl slop poster is face blind. It's actually quite sad that you can't tell they have the same skull shape.
>>
holy fuck why is Verus Vision so fucking SLOW
>>
>>103030813
Again, you lack the social awareness to properly differentiate faces, for your sake get tested and see how severely you rank on the spectrum.
>>
>>103030843
>Says the retard autistically posting 1girl, standing images each of which look like they're 90% related.
>>
>>103030839
Trained from dedistilled
>>
File: 812.jpg (2.55 MB, 1075x2150)
2.55 MB
2.55 MB JPG
>>
>>103030860
>>103030843
I think you two should kiss already
>>
File: 1716426281480876.png (679 KB, 718x520)
679 KB
679 KB PNG
Let me call my buddy who's an expert on this thread.
>>
>>103030511
did it fix the flux loicense problem?
>>
File: ComfyUI_14096_.png (1.04 MB, 776x1024)
1.04 MB
1.04 MB PNG
>>103030860
Are you actually retarded? I was posting cars, I did a single post to show that the pixelwave model fixed the buttchin problem. You can't even make a coherent and consistent argument.
>>
File: ComfyUI_04901_.png (1.42 MB, 1024x1024)
1.42 MB
1.42 MB PNG
>>
File: node progress.png (223 KB, 2182x987)
223 KB
223 KB PNG
w00t I got it working

soon I will be able to make many variations on starter pics for img2img. much to do still however.
>>
Is there a way to get flux to gen adult women with flat chest? All my attempts have failed so far
>>
File: 630316851.png (1.09 MB, 776x1024)
1.09 MB
1.09 MB PNG
>>103030757
Here you go.
>>
File: file.png (895 KB, 768x768)
895 KB
895 KB PNG
>>
File: ComfyUI_14095_.png (1.04 MB, 776x1024)
1.04 MB
1.04 MB PNG
>>
File: 1412058974.png (990 KB, 1216x832)
990 KB
990 KB PNG
>>103031016
nice
>>
>>103030920
if only I had a beard this glorious
>>
>>103031024
SD 3.5M scratches my Pixart itch. Just waiting for Sana now to do the true training head to head.
>>
File: 133707991.png (1.37 MB, 1216x832)
1.37 MB
1.37 MB PNG
>>103031040
good luck
>>
File: ComfyUI_04903_.png (2.03 MB, 1024x1024)
2.03 MB
2.03 MB PNG
>>
Ok /g/ I’ve been schlubbing it with Pony Realism and the OG YAPM for anime grills. What newer models should I run to blow my mind for realistic and anime gens? I saw something about Flux, how is that? The marketing sheet looks just like Pony’s did back when it came out.
>>
File: 2244899110.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>
File: tmp7fa8n8zy.png (2.19 MB, 1680x1000)
2.19 MB
2.19 MB PNG
>>
File: 2707444658.png (1.14 MB, 1536x640)
1.14 MB
1.14 MB PNG
>>
File: ComfyUI_04909_.png (1.85 MB, 1024x1024)
1.85 MB
1.85 MB PNG
>>
>>103031024
I like the parking garage, esp. the hint of a view and the blue sky
>>
File: ComfyUI_04896_.png (1.97 MB, 1024x1024)
1.97 MB
1.97 MB PNG
>>
File: 2235269820.png (1.57 MB, 1024x1024)
1.57 MB
1.57 MB PNG
>>103031266
It's a nice setting.
>>
File: 1252899795.png (1.03 MB, 832x1216)
1.03 MB
1.03 MB PNG
>>
File: ComfyUI_04931_.png (1.99 MB, 1024x1024)
1.99 MB
1.99 MB PNG
>>
>>103031399
sovl, if it didn't stink so much of incase (dw i actually like their artstyle & comics)
>>
This thread has very good gens
>>
File: 4144779337.png (1.44 MB, 832x1216)
1.44 MB
1.44 MB PNG
>>
>>103030933
what kinds of variations do you mean? cool node tho
>>
File: ComfyUI_14332_.png (1.07 MB, 776x1024)
1.07 MB
1.07 MB PNG
>>
File: 811331052.png (1.42 MB, 832x1216)
1.42 MB
1.42 MB PNG
>>
File: 520978994.png (1.27 MB, 832x1216)
1.27 MB
1.27 MB PNG
>>
>>103031714
Is that layerdiffusion or something new?
>>
Has anyone managed to successfully train a lora on sd3.5? I'm using kohya with the 3.5 branch. My results are garbage, shitty noisy details, body horror, everything looks like melted wax. Basically the model gets fried, not in an oversaturated sense, but in a "it just looks like shit" sense.

Doesn't matter if it's large or medium. Learning rate anywhere from 1e-4 down to 1e-5. max_grad_norm at the default of 1, or 0.01. Different optimizers. Leaving the prediction target at the default, or changing it to match AI toolkit / SimpleTuner (which is the same as flux). Fucking garbage results no matter what.

Meanwhile flux literally just werks, pretty much no matter what hyperparameters I choose. It always looks at least decent. The base SD3.5 model looks good, especially large, so obviously stability trained it *somehow*. Maybe there's some weird training detail everyone is missing that's required for it to work.
>>
>>103031130
NoobAI-XL
>>
File: 1972126648.png (1.42 MB, 896x1216)
1.42 MB
1.42 MB PNG
>>
File: 1girl.jpg (623 KB, 1344x1728)
623 KB
623 KB JPG
>>
>>
File: awawa.png (1.06 MB, 1024x1024)
1.06 MB
1.06 MB PNG
>>
File: ComfyUI_Flux_15221.jpg (202 KB, 704x1472)
202 KB
202 KB JPG
>>
>>103031801
Ask the anon who was here earlier that posted >>103028045 and >>103028082. Seems like your learning rate is too fast but your results are the opposite of his.
>>
>>103031801
>>103032165
It's possible Lora training is broken or your learning rate is too high and you're blowing up the weights which is normal for body horror and blown out colors.
>>
>>103024508
wait a second...
>>
>>103031801

Literally in the same thread:
>>103028045
>>103028082
>>
>>103025148
>100 most beautiful faces
I've seen prettier women on the fucking bus, who the fuck rates these women? 70yo pajeets?
>>
>>103032476
For the record I'm doing a full finetune, not a Lora
>>
File: ComfyUI_02234_.png (1.08 MB, 1024x1024)
1.08 MB
1.08 MB PNG
>>103032409
wait for what?
>>
>>103028970
by real anime do you mean basic bitch studio ghibli style? go gen some more styles and come back with results
>>
File: 1712448008772607.png (2.4 MB, 1024x1024)
2.4 MB
2.4 MB PNG
Thoughts on Pinokio?
>anybody even using it?
>>
>>103032599
That pistol is way too perfect, what the fuck
>>
File: 1727224196280401.png (833 KB, 1024x1024)
833 KB
833 KB PNG
>>103028744
>not even generating the right anime
>>
File: ComfyUI_02229_.png (950 KB, 1024x1024)
950 KB
950 KB PNG
>>103032626
its Kino isnt it?
>>
>he fell for the MJ b8 again
>>
>>103031016
vased
>>
File: 0.jpg (271 KB, 896x1312)
271 KB
271 KB JPG
>>
File: 1702958416825984.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
>>103031586
sick
>>103031219
v nice
>>103030892
oooooooo
>>
>>103031569
I just mean I have the framework in place now to do more than just adding borders. Certainly also gonna do adding black bars at top and bottom of the image, maybe also flipping and cropping and such. The most important thing though was being able to get the image framed in some way. E.g. the black bars coupled with an appropriate image resolution will reliably tell the model to expect something that looks like a film still. And images framed with white tend to turn out a bit more 'artistic'
>>
>>103033382
neat
>>
>>103024722
remember all the posts of people saying we wouldn't have local video generation capable anything better than the mess that is animatediff? I said we would have something by the end of the year, I was right.

So next time some no gen fag comes into one of these threads puking verbal diarrhea out of their mouths you can simply ignore them. They are elitist glowie shills that do not wish for us to have nice things. I predict within a few months we willd have something really worth talking about that is more efficient and runs on lower hardware. I honestly think that the key is how 3D games animate things, only with photo realistic textures exact to real life, no inference needed for generating frames, the animation data is already there, the model would just need to adhere to our prompts.
>>
File: 0.jpg (242 KB, 1376x896)
242 KB
242 KB JPG
>>
File: 651.png (2.71 MB, 1440x1440)
2.71 MB
2.71 MB PNG
>>
File: 1722554278045297.png (1.68 MB, 1080x2000)
1.68 MB
1.68 MB PNG
>>103033502
Designers needed for ongoing meme wars.
>>
>>103033184
Me on the left
>>
>>103031586
I'm liking this anon. gives me inspiration, I too enjoy genning military things, but my gens are never this accurate, this one you done is creepy realistic.
>>
File: file.png (821 KB, 768x768)
821 KB
821 KB PNG
>>
File: Red_Panda.png (1.05 MB, 2922x1850)
1.05 MB
1.05 MB PNG
So this is red_panda...
https://www.recraft.ai
>>
>>103033726
I wish someone just would hack them with a 0day like some anon did with NAI
>>
>>103033726
better demo link https://replicate.com/recraft-ai/recraft-v3
>>
>>103033760
nevermind you have to pay to use it on replicate kek
>>
>>
>>
>>
>>103033536
In my country I can goto jail for those images.
>>
I don't know what to gen today, the models are pissing me off also.
>>
File: ComfyUI_14499_.png (1.08 MB, 776x1024)
1.08 MB
1.08 MB PNG
>>103033552
It's really just Pixelwave 3, no lora's or anything special. The finetune model is just that good.
>>
>>103033956
i hate flux its slow.
>>
File: 308203657634836489.webm (1.23 MB, 720x1072)
1.23 MB
1.23 MB WEBM
>prompt “stroking a cucumber with both hands”
>this is the best it comes up with
>>
File: Untitled.png (320 KB, 1024x1024)
320 KB
320 KB PNG
>>103033903
Kek, generating text with pony test, it could work. depth map was used for controlnet.
>>
>>103033956
shnell 3 or dev 3?
>>
>>103033910
In my country I can email them to your politicians, as long as it's not a threat.
>>
Are there any good recommendations for an SDXL or pony model that isn't just trained on 1girl images? Because You know after a few month you kind of get bored of genning 1girl... All the models i have sitting on my drive trend to put same face 1girl where ever they can...
>>
>>103034098
In my country they make up the rules as they go along without public approval, they also believe they can just extradite people like you anon for sending them problematic images.

You can also go to prison for calling a Islamic extremist that killed 3 little girls an Islamic extremist. Get gas lit to fuck by the media for months when they knew for a fact he was the exact thing you called him.
>>
>>103034120
there isnt one. people act like local is this bustling scene of dynamic finetunes each with their own purpose but the blackpilled reality is that it's a bunch of shitty civitai jeetmixes of the same models over and over again and there really isn't anything unique or gamechanging when it comes to finetunes. more of the same. you can count the number of actual finetunes on one hand
>>
File: ComfyUI_temp_qopsx_00067_.png (1.93 MB, 1024x1280)
1.93 MB
1.93 MB PNG
>>
>>103034156
or maybe we don't release every finetune because the scene is full of faggots like you
>>
he who only sees muck is himself muck
>>
File: u90JSuMuZ0o.jpg (140 KB, 800x773)
140 KB
140 KB JPG
>>103034156
I know anon, and its depressing, we will be forever stuck with just genning 1girl till our dicks fall off.
>>
>>103034173
stop being poor and train your own model
oh wait
>>
>>103034173
maybe i will just take this pepe and use ipadaptor and do something with him.
>>
File: ComfyUI_14631_.png (1.31 MB, 776x1024)
1.31 MB
1.31 MB PNG
>>103034086
dev 3, specifically the bf16 version. You'll need a 3090 or 4090 for that specific version. You can try the smaller quants though, I hear they're good too.

>>103033991
It's the tradeoff for the quality and capabilities
>>
File: AHHH_00007_.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
>>
>>103034206
Downloading, will try it on my rx 6950xt. Wish me luck.
>>
>>103034206
i got 3060 it works fine, just slow and annoying. I got various flux models, i fucking hate how slow it is when ever I change the prompt. Changing seed only its not so slow, its only when clip needs updating i guess.
>>
>>103034206
ignore me though anony i just feel fucking deflated today, have been feeling like this for 3 days, don't know whats up with me, sleeping lots also. perhaps i'm run down or coming down with some flu virus (I hope not)
>>
now youtube is like
"Are you still watch this video?" to some background music I was listening to after only 5 minutes

GOD WHY? WHY? WHAT DO THEY CARE?

God please just burn it all with fucking fire
>>
File: ComfyUI_temp_qopsx_00076_.png (2.03 MB, 1024x1280)
2.03 MB
2.03 MB PNG
>>
lmao oops
>>
>>103033910
>>103034098
>In my country I can email them to your politicians, as long as it's not a threat.
Yeh same

>>103034150
But would they kill you for merely downloading untraceable social apps such as SimpleX?
>>
>>103034256
i messed with some gore loras one day and holy shit, bodies chopped in half the lot, no i don't think I have the loras and no i wont tell you where to find them. The stuff would be way to bad to post here even, but nothing that Hollywood never produced. Just people get triggered when its AI generated by one mad man on his home computer...
>>
File: 00008-2021511344.png (2.13 MB, 1440x1440)
2.13 MB
2.13 MB PNG
>>
>>103034259
yeah but the quality of those models are all over the place most of the time I download and try at least, plus they seem censored, mostly created by idiots imo.

I might get into tuning soon, i'd have to do it on a remote cloud based system though.
>>
I kind of want to train models for buildings of various cities around the world using my own camera. I guess you would call that a data set. I just need to buy the camera first.
>>
File: ComfyUI_14652_.png (1.21 MB, 776x1024)
1.21 MB
1.21 MB PNG
>>103034215
Good luck anon, not sure how AMD GPUs will work out but with 16GB VRAM you should be in a pretty solid spot to use the 8fp version if you don't want the model to overflow into RAM

>>103034227
Yea, the CLIP issue is even a thing on a single 4090 or 3090. Flux with everything loaded is around 29 GB so generally the CLIP model overflows into RAM. I have a 4090 and 3090, load the flux model on the 4090 and the clip model on the 3090. I can pump these out pretty quickly since it removes most of the bottlenecks with flux and allows me to generate 9 of these images at a time in this resolution and takes a little over a minute. I tried doing it just on the 4090 without clip offloading and it was much slower.

>>103034241
hope you feel better soon
>>
>>103034321
I do also think there is a market in this also, just gonna put that out there for any anons looking for a way to profit from it.
>>
File: 002471.jpg (3.29 MB, 1536x2560)
3.29 MB
3.29 MB JPG
>>103034120
>after a few month you kind of get bored of genning 1girl
try a new style
>>
>>103034346
I'm actually thinking about getting a drawing pad and writing some hook to gimp into comfyui, ipadator is extremely powerful. AI assisted art would be cool.
>>
File: ComfyUI_14667_.png (922 KB, 776x1024)
922 KB
922 KB PNG
>>
>>
File: ComfyUI_temp_psddj_00014_.png (1.84 MB, 1024x1512)
1.84 MB
1.84 MB PNG
>>
File: ComfyUI_14666_.png (1.08 MB, 776x1024)
1.08 MB
1.08 MB PNG
>>
I'm just working on workflow design strategies desu, I'm sick of things ending up in a confusing mess.
>>
File: file.png (690 KB, 768x768)
690 KB
690 KB PNG
>>
File: ComfyUI_14760_.png (1009 KB, 776x1024)
1009 KB
1009 KB PNG
>>
>>103034440
nice, can you do a broken mirror reflecting a face?
>>
>>103034452
face detector fail? I had a similar problem the other day with reactor node, it was driving me mad...
>>
>>103034719
>fail
Looks like a win to me
>>
>>103034227
3060's faster than my 6950xt at image gen

iirc
>>
>>103034740
hmm if you say so anon, reactor node what ever its called is broken on my machine, keeps complaining about corrupt models but they have been redownloaded and hash checked many times. Only the face swap model works, so I don't even other with that node as I think some shady shit is going on with models being altered, they are pickel and not safe.

I use some other more easy nodes for faceswapping and shit, had some funny fun with those.
>>
File: file.png (2.61 MB, 1365x1024)
2.61 MB
2.61 MB PNG
https://x.com/recraftai/status/1851706399631224939
red_panda turned out to be another closed source model, sigh...
>>
>>103034788
>another stock photo model
Who cares
>>
>>103033757
imagine
>>
>>103034794
that's the thing, I don't know why it's ranked so high, do people really prefer stock photo models over something actually good like Midjourney or dalle?
>>
>>103034805
Yes, because people are actually tasteless retards. That's why we keep getting bokeh = good models.
>>
File: ComfyUI_00008_.png (1011 KB, 1024x1024)
1011 KB
1011 KB PNG
Throwing random shit art done in gimp into ipdaptor to see results
>>
File: 00031-349319049.png (2.1 MB, 1024x1440)
2.1 MB
2.1 MB PNG
>>
File: ComfyUI_00009_.png (1.07 MB, 1024x1024)
1.07 MB
1.07 MB PNG
>>
File: ComfyUI_00011_.png (1.29 MB, 1024x1024)
1.29 MB
1.29 MB PNG
>>
File: ComfyUI_00014_.png (1.42 MB, 1024x1024)
1.42 MB
1.42 MB PNG
>>
File: 002481.jpg (2.66 MB, 1536x2560)
2.66 MB
2.66 MB JPG
>>
File: ComfyUI_00022_.png (1.5 MB, 1024x1024)
1.5 MB
1.5 MB PNG
>>
>>103035017
think it needs more ksamplers
>>
File: ComfyUI_00024_.png (1.83 MB, 1024x1024)
1.83 MB
1.83 MB PNG
>>
File: 1728571890471290.png (1.71 MB, 1024x1024)
1.71 MB
1.71 MB PNG
powerful
>>
>>103034453
Cool, give me an example prompt from one of yours, let's see what I get with cfg 1 in dedistilled.
>>
File: ComfyUI_00039_.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>
File: ComfyUI_00052_.png (977 KB, 1024x1024)
977 KB
977 KB PNG
that feel when the drugs are wearing off at 5AM during a summer morning. WTF just happened and why am i in this pond?
>>
>>103035191
It reminds me of being up way too late reading my Bible, then I wake up dreaming in a cold sweat about something crazy.
>>
File: 002483.jpg (245 KB, 1536x2560)
245 KB
245 KB JPG
>>103024700
not sure if i like it with sd1.5
>>
>>103035111
All my prompts are simple, that one specifically is "photo of 1990s special forces during desert storm".
>>
Baker...
>>
Hold....
>>
File: 2024-10-30_00015_.png (1.29 MB, 720x1280)
1.29 MB
1.29 MB PNG
>>103035497
First of all, thanks for your review. Glad I downloaded Pixelwave. It really adheres to prompts better than Flux.

>photo of 1990s special forces during desert storm
Doing it with some anti-detail with dedistilled cfg 1. 30 steps, euler, beta.

This is the result of the simple prompt. I asked a llama LLM to juice up my prompt:
>Capture the gritty intensity of a 1990s special operations forces unit during Operation Desert Storm in a photograph. The image should convey the harsh conditions and rugged terrain of the desert landscape, with the soldiers' worn and weathered uniforms and equipment a testament to their unwavering dedication and resilience. Incorporate subtle hints of the era's distinctive military gear, such as PASGT helmets, ALICE packs, and M16A2 rifles, while also emphasizing the soldiers' unyielding focus and camaraderie in the face of adversity. Consider a dramatic, golden-hour lighting scheme to evoke the sense of a moment frozen in time, with the desert sun casting long shadows and accentuating the textures of the soldiers' gear and the surrounding environment.

I'll show this next.

>>103024700
>Detail Daemon
(demon?)

Anyway, how does it differ from using LyingSigmaSampler? You can chain LyingSigmaSamplers (if you are careful you can therefore set the lie amount for each step). I am anti-detailing.
>>
File: file.png (103 KB, 644x947)
103 KB
103 KB PNG
>>103035607
>Anyway, how does it differ from using LyingSigmaSampler?
you have more options on how to change the sigmas
>>
File: 2024-10-30_00016_.png (1.28 MB, 720x1280)
1.28 MB
1.28 MB PNG
>>103035607
>>
>>103035607
>>103035640
something important with pixelwave 3, make sure to use dpmpp_2m for the sampler and sgm_uniform for the scheduler, do at least 25 steps
>>
Fresh

>>103035679
>>103035679
>>103035679



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.