/g/ - Technology






File: tmp.jpg (950 KB, 3264x3264)
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>101842860

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
File: Capture.jpg (76 KB, 880x1084)
Is there a way to make both of them work at the same time?
>>
>>101847080
Do you use fotor for the collages?
>>
>>101847097
They have the same output. Is there a node that accepts multiple guidance inputs?
>>
Philosopher's Stone era Emma Watson lora where?
>>
File: ComfyUI_31329_.png (1.34 MB, 1024x1024)
>>
>>101847121
In your gpu waiting to be unlocked you sick fuck.
I don't think anyone is going to publicly make that lora.
>>
File: file.png (2.46 MB, 1024x1024)
>>
Blessed thread of frenship
>>
>>101847144
not for NSFW, you're sick in the head
>>101847164
go fuck yourself
>>
>>101847144
>>101847171
Calm down, faggot
Go be mad somewhere else
>>
>>101847177
hes not being mad, he's being fun
>>
>>101847121
We're definitely getting an age slider lora that works both ways like it was with sd1.5 and sdxl
>>
>>101847171
>not for nsfw
>>
File: ComfyUI_00507_.png (2.23 MB, 1024x1024)
Do we know if finetunes can be made on consumer hardware, or if it's just Loras?
>>
>>101847227
nice one anon, prompt and settings?
>>
anybody have lewd chat gpt?
>>
>>101847097
Whats that for?
>>
File: ComfyUI_31333_.png (1.51 MB, 1024x1024)
>>
File: ComfyUI_00513_.png (1.81 MB, 1024x1024)
>>101847235
https://files.catbox.moe/n7mpx2.png
I've been asking chatgpt to describe the artstyles of images I like, then getting it to generate a T5 prompt to generate an image in that style
>>
>>101847227
It looks very unlikely at this stage, but this time a week ago we thought any training was literally impossible, so who knows what will happen next week.
>>
File: Capture.jpg (131 KB, 1581x685)
>>101847252
I'm testing some shit to limit the burn of the CFG, but I like adaptiveGuidance and I'd want to keep it

CFGLimiterGuider is this shit btw: https://arxiv.org/pdf/2404.07724
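
For anyone wondering what "limiting" CFG looks like in practice, here is a minimal Python sketch. It assumes the linked paper's idea is to apply guidance only inside a limited range of noise levels and to fall back to the plain conditional prediction everywhere else. The function names, the sigma bounds and the toy numbers are all hypothetical; this is not the CFGLimiterGuider node's actual code.

```python
def cfg_combine(cond, uncond, scale):
    """Standard classifier-free guidance: uncond + scale * (cond - uncond)."""
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


def limited_interval_cfg(cond, uncond, scale, sigma, sigma_lo, sigma_hi):
    """Apply guidance only while sigma_lo < sigma <= sigma_hi.

    Outside that window the plain conditional prediction is returned,
    which is equivalent to running at a guidance scale of 1.
    """
    if sigma_lo < sigma <= sigma_hi:
        return cfg_combine(cond, uncond, scale)
    return list(cond)


# Toy denoiser outputs (hypothetical numbers, not from a real model).
cond = [1.0, 2.0]
uncond = [0.5, 1.0]

# Inside the window: guidance is applied.
print(limited_interval_cfg(cond, uncond, 4.0, sigma=3.0, sigma_lo=0.3, sigma_hi=5.0))
# Outside the window (very noisy step): guidance is skipped.
print(limited_interval_cfg(cond, uncond, 4.0, sigma=9.0, sigma_lo=0.3, sigma_hi=5.0))
```

The appeal is that the high scale only acts on the steps where it helps, which is one plausible way to reduce the "burn" without lowering the scale everywhere.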
>>
>>101847275
I see
>>
File: ComfyUI_Flux_50.png (1002 KB, 1216x832)
>>101847275
>>
>>101847275
this nigga taking notes, more effort than 99.99% of image gen users
>>
This is what I love about image and text gen. When things get stale you can just take a few months off and suddenly ten years have passed
>>
File: 00002-394419179.png (2.2 MB, 1432x1432)
with the latest update to forge, i'm unable to use PonyRealism, recreating the same seed gives me a 2.5d image. wtf. anyone have ideas? settings seem identical.
>>
>>101847413
the calculations, anon, they...different
>>
>>101847377
>mouse on left side
does miku constantly flick the bean, do ya reckon?
>>
File: ComfyUI_00522_.png (1.03 MB, 1024x1024)
>>101847275
>>
File: flightreactionsyell.gif (192 KB, 220x220)
>>101847421
no no NO NO NO EVERYTHING WAS PERFECT BEFORE PLEASE DON'T DO THIS TO ME PLEASE SOMEONE DELIVER ME FROM UPDOOOTER EVIL
>>
File: Capture.jpg (303 KB, 2711x1321)
Oh shit, Euler CFG++ actually works on flux, you just need this node
https://github.com/pamparamm/ComfyUI-ppm
https://imgsli.com/Mjg2NzE2
>>
>>101847391
kek
>>101847426
don't worry, me neither, I just test out every single node that has "Guidance" in it, maybe with some luck I'll find something good kek
>>
File: ComfyUI_00526_.png (1.56 MB, 1024x1024)
>>
File: Capture.jpg (464 KB, 3189x1380)
>>101847454
it doesn't work well with the current values of DynamicThresholding though at high CFG
>>
File: ComfyUI_00527_.png (1.24 MB, 1024x1024)
>>
File: file.png (156 KB, 278x320)
>>101847492
>>
Oh wait, do the ((new calculations)) have to do with all those new features like being able to choose a specific scheduler and distilled cfg? Do they make a huge impact on PonyXL? I never used them because obviously forge didnt have them before.
Shit how would i even find out which scheduler it was picking before? Do the specific samplers have a preferred scheduler? For example im trying to make these peach gens with DPM++ 3M SDE..
>>
>>101847492
>>101847454
Is there some documentation about that "cfg_++" or a paper or something?
>>
>>101847532
you're just fucked, man, time to test settings to find the sweet spot again
>>
File: 1718581264018294.jpg (128 KB, 1024x1024)
>>101847557
no fucking way i have so many god damn good gens i liked to use as foundations i can't just start over again
>>
File: Untitled-1.png (50 KB, 1194x678)
I noticed that sometimes the negative prompt node takes up to 20 seconds to run (while obviously doing nothing useful), apparently because of t5xxl, here's a hack to speed things up.
I hope we get a ksampler without a mandatory negative input.
>>
>>101847431
Just install the older version
>>
>>101847574
>that picture
lmaoooooo
>>
>>101847574
>dall-e gen
>>
>>101847575
>I hope we get a ksampler without a mandatory negative input.
that's what SamplerCustom is for, the ComfyUI Flux example workflow uses it. go check it out
>>
>>101847574
why not just downgrade?
>>
>>101847621
No real point if Flux is the next big thing. Not counting my chickens on getting FluxPony this year but it'll happen.

and no fucking way am i bloating my C: with another 5 gigs worth of python dependencies to run two forges. Oh well, dems the fucking breaks i guess ill have to spend the rest of my morning getting good again.

>>101847599
that's not a gen that's just a picture of Lennon and his wife.
>>
>>101847636
>and no fucking way am i bloating my C: with another 5 gigs worth of python dependencies to run two forges
...you cant dynamically link two forge instances to a single package of dependencies?
>>
File: file.png (2.63 MB, 1024x1024)
>>101847574
one has to imagine anon happy
>>
>>101847377
>tattoos
>>
>>101847674
Even if I knew you could do that, and how, wouldn't that defeat the purpose of using different instances? Pretty sure new forge has new dependencies.
>>
>>101847611
Oh nice, I saw SamplerCustom but it had negative input too, SamplerCustomAdvanced is exactly what I wanted.
>>
>>101847693
surely you could just have a single folder loaded with dependencies, and have programs link to that directory. right? SURELY coders arent so fucking retarded that the concept of libraries makes them sperg out, right!!!
>>
File: ComfyUI_Flux_53.png (993 KB, 1216x832)
>>101847687
Anon can't even handle ai-generated women
>>
File: ComfyUI_Flux_02068_.png (1.93 MB, 1024x1024)
Even LoRAs are looking a little slopped to me. I hope it's just because he didn't take enough steps, otherwise a finetune may be needed.
>>
File: ComfyUI_00549_.png (2.08 MB, 1024x1024)
>>
>still plain miku gens
I thought this general would be the more creative one. just a circlejerk for the most updooted meme char
>>
File: MarkuryFLUX_00133_.png (1.36 MB, 1280x1536)
made my first flux lora. It's cool i guess, but the real thing is how little it takes to train! 2700 steps, 22 images, and only one 3090. https://civitai.com/models/640342/shamiko-or-machikado-mazoku-fluxd
>>
File: 1717848564508293.png (869 KB, 1024x1024)
the power of a text encoder:
>>
File: 1712416419815786.png (1.21 MB, 896x1152)
>>
>>101847791
No way the first loras that come out are sloppy!?
I can't believe it. I won't believe it
>>
>>101847872
>>101847791
He just didn't train it right. Mine turned out fine.
See here -> >>101847824
>>
>>101847824
You wrote 3x3090 in the description, did it take 3 hours on a single one?
>>
>>101847872
Not sloppy, I mean AI slopped, which is baked into the base model.
>>
>>101847828
You are worse than the hunyan shills.
>>
>>101847886
That is also what that anon meant.
>>
>>101847887
im just pointing out that it's nice that a model can do text well now
>>
File: file.png (711 KB, 482x640)
>>101847879
I have 3x3090s, so three versions of the lora were made, each with different params.
Distributed training is broken at the moment for fp8, so I just did three runs on three separate GPUs.
>>
wonder if there's still gonna be retards staying on 1.5 like there was when SDXL came out, or is flux so much better that they can't fool themselves anymore
>>
File: ComfyUI_00557_.png (2.01 MB, 1024x1024)
If I'm getting fucked hands should I increase or decrease CFG?
>>
File: 1710476168329974.jpg (41 KB, 427x640)
Sirs cant get it to generate this pose help .
>>
is there a good comfy workflow for loras that 100% works with the converted comfy lora samples
>>
>>101847887
>worse than the hunyan shills.
I'm not shilling Hunyuan. I will simply not stop using it. It takes knowing what SOTA closed aesthetic models like Niji output to know that Hunyuan is the most aesthetic model rn even with Flux being the best at prompt following.
>>
>>101847941
technically increasing cfg would be better but your best bet is to just regen with 5000 different seeds and hope you get a better one
>>
>>101847941
I don't think CFG is fucking the hands, it fucks the color saturation and contrast but the anatomy is always fine
>>
>>101847824
Could you please show me some pictures from the dataset and txt used for them, do you need to write a fluid description of the picture in the txt now? I wonder if the old simple tags from my dataset for SDXL training will work with flux.
>>
File: ComfyUI_00562_.png (2.07 MB, 1024x1024)
>>101847968
>>101847967
Yeah, I think Flux just struggles with dynamic poses or something, it's the first time I've run into a bad anatomy issue.
>>
>>101847454
>Euler CFG++
>++
What's that?
>>
File: file.png (827 KB, 2458x1146)
>>101847994
One of the runs I did was with danbooru tags, but I also set the learning rate waaay too low so the danbooru tag part of the experiment got shafted. I was going to try it again with danbooru tags sometime.
As for what I used, it's a danbooru tag based full description of the image. I've provided a refined example from my current project.
I feed the image, and the danbooru tags to either GPT-4o or Claude and have them write captions. It works good enough.
>>
File: 1699163288114758.png (1.94 MB, 1024x1024)
>>101847080
very impressive 2B!
>>101847824
thanks! love thad adorable retard

on a side note, is pony supposed to look washed out without style loras? I am using the sdxl vae. Is there a magic word I'm missing? (Euler A)
>>
>>101848094
Thanks!
>>
File: 1720330185897564.png (935 KB, 1024x1024)
>4 panel manga, can specify what happens in first/second/third/fourth panels
neat
>>
File: ComfyUI_00564_.png (2.09 MB, 1024x1024)
>>
>>101848108
get the autismmix checkpoint off civitai, also the confetti variation of it, does really good anime stuff

if you think it's washed out make sure you have a new copy of the sdxl vae. but appearance varies by model.
>>
File: Capture.jpg (70 KB, 1080x790)
Oh fuck, I got some new schedulers after downloading dozens of nodes that night kek
>>
Any advice on a model/prompting to generate something without people? I'm trying to generate an empty wrestling ring, no crowd, from the point of view of inside of the ring. The more I ask for no people, the more the model circumvents me by injecting horrors beyond my comprehension.
>>
File: 1708431897616861.jpg (89 KB, 744x992)
>>101848129
Yeah this is one image with one prompt
>>
>>101848129
ヤにら
イうこたしっこ
テーリ...
ラド...
>>
File: 1710177127859294.png (951 KB, 1024x1024)
>>101848129
>>
>>101848157
AlignYourSteps seems kinda broken with flux, but it's fantastic for sdxl
>>
>>101847080
>Local Diffusion General
can i post pics prompted on paperspace lol
>>
Loras are flowing in now. Most of them are garbage
>>
I'm looking to buy a used 3090, what price is good these days?
>>
File: ComfyUI_00570_.png (1.9 MB, 1024x1024)
Couldn't get this one to look like Miku despite putting her name in like a dozen times, but I still like it
>>
File: 2024-08-12_00045_.png (1.71 MB, 1024x1280)
okay .. flux LORAS starting to get good
>>
>>101847935
People who are staying on 1.5 are poorfags, there's even less chance they'd switch to flux lol.
>>
>>101848237
>Couldn't get this one to look like Miku
How is that possible? If Flux is the best at anything, it's drawing Miku kek
>>
File: 1720962723940276.png (1.26 MB, 896x1152)
>>
File: ComfyUI_Flux_02056_.png (1.65 MB, 1024x1024)
>>101847957
However LoRAs are coming out surprisingly well, it might be possible to undo the sloppiness after all.
>>
File: 2024-08-11_00012_.png (2.25 MB, 1280x1024)
>>101848260
I wholeheartedly agree
>>
File: ComfyUI_flux__00500_.png (2.83 MB, 1920x1080)
>>101848221
>>101848241
hooo boy we're gettin somewhere
https://civitai.com/models/638052/ps1-ps2-old-3d-game-style?modelVersionId=713553


already leagues above 1.5 and XL/Pony's equivalent LORA. Could use improvement, but wow.
>>
>>101848241
what is the recommended lora workflow cause there are a billion different options
>>
>>101848279
>Flux.1 S
It's worthless
>>
>>101848289
Don't the loras on Schnell also work on Flux?
>>
>>101848289
shut up faggot you're worthless.
>>
>>101848295
no
>>
>>101848298
What about Dev x Schnell schizomixes?
>>
File: ComfyUI_00578_.png (1.64 MB, 1024x1024)
>>101848260
https://files.catbox.moe/giyxv9.png
https://files.catbox.moe/yvfn90.png
Beats me, It's a massive prompt but so is pic rel and it worked fine here
>>
File: 1723326811725825.png (399 KB, 1685x832)
>>101848146
I am actually using it, and I've tried both new VAEs, spent half a day yesterday figuring it out and realized that I might be extremely retarded while taking a screenshot for you.
I just didn't notice I'd left CFG where it was after fucking around with SDXLTurbo. God damn it, I didn't notice anything wrong since the images produced were so grounded in reality, they were exactly what I was looking for if not for the washed out part
>>
File: 2024-08-12_00049_.png (1.9 MB, 1024x1280)
>>101848295
ya they work on both, but a lot of testing should be done on schnell, cause I think it will ignore a lot since it converges so fast
>>101848304
that's just stupid attention grabbing.. they saw it's possible to merge em, but the results are just mediocre
>>
>>101848165
Negative prompting is what you need.
>>101843586
>>
>>101848311
yeah that will do it, default is like 7. but that checkpoint is very very good for anime I have found.

https://github.com/DominikDoom/a1111-sd-webui-tagcomplete

this extension lets you add booru tags for anything fast and the model will yield good stuff for most booru tags for anime characters or whatever, it's really useful even if not using a lora.
>>
File: 2024-08-12_00051_.png (1.96 MB, 1024x1280)
>>
>>101848283
I second this
>>101848279
>>101848241
is it feasible to run/train them on 16gb?
>>
https://civitai.com/models/640459/impressionist-landscape-lora-for-flux?modelVersionId=716306
First art style lora. Impressionist landscapes.
I kind of want to train one of Greg Rutkowski just so he has an actual seizure irl.
>>
where's this from
https://huggingface.co/datasets/la-ji/sd-prompt-image-in-the-wild-counterfeit
>>
File: ComfyUI_00584_.png (1.39 MB, 1024x1024)
>>
File: 2024-08-12_00052_.png (1.95 MB, 1024x1280)
>>101848374
>is it feasible to run/train them on 16gb?
not that I know of.. I mean you can, but it will take ages, even a 4090 can only do it in fp8.. if you want to do an fp16 lora you need 40GB VRAM https://twitter.com/ostrisai/status/1819802556261863925
>>
File: fp059.jpg (385 KB, 1024x1024)
>>
>>101848279
Definitely more ps2 than ps1. The low res warped PS1 textures is something that Flux simply cannot do, so I really can't wait to see a proper PS1 and N64 LoRA
>>
File: ComfyUI_00588_.png (1.32 MB, 1024x1024)
>>
File: 2024-08-12_00054_.png (2.11 MB, 1024x1280)
>>
>>101848451
Would be more fitting to describe those renders as prerenders rather than ingame shots. phong/gouraud shaded.
>>
File: 2024-08-12_00055_.png (2.06 MB, 1024x1280)
>>
>>101848451
>Definitely more ps2 than ps1. The low res warped PS1 textures is something that Flux simply cannot do, so I really can't wait to see a proper PS1 and N64 LoRA
this, I'm also waiting this one hard
>>
>>101848461
Nice SVD
>>
File: ComfyUI_31343_.png (1 MB, 640x1280)
>>
File: FD_00167_.png (2.06 MB, 1024x1024)
>>101848306
>It's a massive prompt
Using up all the tokens produces significantly better results in Flux I have found. The more schizo your prompt the better.
>>
File: ComfyUI_31344_.png (1.08 MB, 640x1280)
>>
File: uuuuuuuuuuuuuuuuhh.png (89 KB, 200x166)
what causes this again?
>>
File: 2024-08-12_00015_.png (1.39 MB, 1024x1280)
>>101848476
>low res warped PS1 textures
you mean unfiltered.. but yea FLUX refuses to go back in time to the 90s.. the 80s it can do.. but 90s computer graphics is not there
>>
>>101848399
Do fp16 loras work on fp8? Logically I think yes, but I genuinely don't know.
>>
File: 1698724583204832.png (2.79 MB, 1536x1536)
>>101848399
oh damn, was really impressed with FLUX but it takes so long on my poorfag 4080s (and you cant preview the output while it calculates steps, unlike SD). Well I guess no LORAs for me for the next 10 years
>>101848344
thanks, I'll give it another shot! used to use it on 1.5 but more often than not I had to change the words back
>>
>>101848497
It's what happens when skinwalkers forget to moisturize.
>>
File: Capture.jpg (17 KB, 664x268)
Found a node that works at high CFG without requiring DynamicThreshold: https://github.com/comfyanonymous/ComfyUI_experiments
https://imgsli.com/Mjg2NzUw
>>
>>101848518
Interesting, can you please try a few more examples in different styles?
>>
File: 1708994434760867.png (642 KB, 640x640)
>>101848508
>god damn 4chan glitched out and didnt let me finish my message
I wanted to ask if anyone managed to replicate picrel on 1.5/XL/Pony/FLUX? I've been trying since winter and failed miserably
>>
File: fp060.jpg (384 KB, 1024x1024)
>>
>>101848533
>Interesting, can you please try a few more examples in different styles?
give me a prompt I'll see what I can do with it
>>
https://civitai.com/models/640459/impressionist-landscape-lora-for-flux?modelVersionId=716306

Wow this is a very impressive LoRA, added a couple of artists too.
>>
>>101848539
DALL-E gen, too advanced for Flux
>>
File: FD_00230_.png (107 KB, 256x256)
>>101848518
you bitch >>101848531
Just do what I do and call yourself a retard and attach the pic in a reply.
>>
>>101848320
cool image i like the pyramid
>>
File: 2024-08-12_00057_.png (1.99 MB, 1024x1280)
>>101848503
they do.. it's just fewer numbers.. every weight is a floating point number, if you have 16 bits you get 65,536 possible values.. if you have 8 bits you get 256.. it will work.. and yea pic related was done in fp8
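
A quick sanity check on how many distinct bit patterns a weight can take at each width. This is a sketch that is not tied to any particular fp8 or fp16 layout; real float formats reserve some patterns for things like NaN and infinity, so the count of usable values is slightly lower.

```python
def bit_patterns(bits):
    """Number of distinct bit patterns that fit in `bits` bits."""
    return 2 ** bits


for bits in (8, 16, 24):
    print(f"{bits} bits -> {bit_patterns(bits):,} patterns")

# 8 bits -> 256 patterns
# 16 bits -> 65,536 patterns
# 24 bits -> 16,777,216 patterns
```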

>>101848508
I actually ran OOM earlier when doing FLUX with a lora.. on my 4090 it works fine in either fp16 model or text encoder.. but If I load a lora it wants just that little bit more than 23.4GB and pushed to OOM if I ran it on FP16 model
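
Rough arithmetic for why an fp16 copy of the model leaves so little headroom on a 24 GB card. The sketch assumes the Flux transformer has about 12 billion parameters (a figure not stated in this thread) and counts weights only, ignoring the text encoder, VAE, activations and any LoRA; the function name is made up for illustration.

```python
def weight_memory_gib(n_params, bytes_per_param):
    """Memory needed just to hold the weights, in GiB."""
    return n_params * bytes_per_param / 1024**3


FLUX_PARAMS = 12e9  # assumption: ~12B parameters, not taken from this thread

print(f"fp16: {weight_memory_gib(FLUX_PARAMS, 2):.1f} GiB")  # fp16: 22.4 GiB
print(f"fp8:  {weight_memory_gib(FLUX_PARAMS, 1):.1f} GiB")  # fp8:  11.2 GiB
```

Under those assumptions the fp16 weights alone sit close to the card's limit, so even a small extra allocation could plausibly tip it over.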
>>
>>101848556
who dat?
>>
File: ComfyUI_31345_.png (1.27 MB, 640x1280)
>>
File: ComfyUI_00595_.png (1.68 MB, 1024x1024)
>>
there's something terribly wrong with forge's image quality
images are nowhere as crisp as with A1111 or comfy
thoughts?
>>
>>101848544
I will give you some catboxes I did with DT.
https://files.catbox.moe/g9getp.png
https://files.catbox.moe/s98qof.png
https://files.catbox.moe/0cpx1d.png
https://files.catbox.moe/0fl91b.png
>>
>>101848611
>that last gen is miku finding out about the mikuposters on the /g/ ai generals
>>
>>101848554
yeah I know. Yet my dream of local models achieving this greatness at some point lives on
>>101848564
I guess until chinese would release old H100 on custom PCBs with hacked drivers or maybe RTX7000, this is not the most entertaining model to play gacha at home
>>101848605
check samplers and settings. my gens are 1:1 on both. Though pony models RANDOMLY output colorful fog instead of images on random tag combinations, still didnt figure out what triggers it
>>
File: ComfyUI_00599_.png (1.13 MB, 1024x1024)
>>101848615
>They gen me doing what?
>>
File: 00046-0.png (1.5 MB, 1024x1024)
>>101848605
post examples, I have had no issue with Forge
>>
File: 2024-08-12_00063_.png (2.07 MB, 1024x1280)
>>101848611
all of em are good, but why do they have such a grainy texture?
>>101848620
>I guess until chinese would release old H100 on custom PCBs with hacked drivers or maybe RTX7000, this is not the most entertaining model to play gacha at home
yaa if moores law holds we got full flick making AI in 10 years

>>101848560
thank you
>>
File: ComfyUI_Flux_02092_.png (1.96 MB, 1024x1024)
>>101848550
>By Frank W Benson. An impressionist landscape of a couple fine dining at a French restaurant

Not sure if following his style much but not bad
>>
>>101848611
>GuidanceNeg 10
Sorry but it doesn't seem like this node can handle guidance 10, there's something special about CFG + DynamicThresholding that just makes it work somehow
>>
>>101848620
>>101848629
not just with flux
shit's eternally blurry, and never quite the same
>>
>>101848620
>yeah I know. Yet my dream of local models achieving this greatness at some point lives on
Cook a lora from your shitting indians folder. You must have thousands by now
>>
>>101848387
i think it's from stable horde, field names seem familiar, karras=True being default, the available post_processing options
>>
File: 00022-3634275900.jpg (192 KB, 1200x1200)
>>101848650
>Cook a lora from your shitting indians folder. You must have thousands by now
someone do this right fucking now, make it blow up as hard as that one indian documentary that flooded the internet from /pol/ a while ago.
>the next best FLUX lora could be one memeing on indians
could be big!
>>
>>101848644
Ah fair enough, will be interesting to see the prompt none the less, even without the 10neg. Particularly the Boris Valejo one
>>101848636
>why do they have such a grainy texture
They are over cooked with high cfg. Was early tests from anons Dynamic Thresholding workflow, settings aren't dialed in
>>
File: ComfyUI_31346_.png (856 KB, 640x1280)
>>
>>101848669
I have no idea how to do it though and my GPU is shit, I'll start by collecting shit rocket memes I guess. I'd love if someone release the lora for XL/Pony
>>
File: 2024-08-12_00065_.png (2.09 MB, 1024x1280)
>>101848655
funny fact about FLUX.. DDIM does not work at CFG == 1 .. but if you use the hacks .. it works perfectly fine.. all the pictures like this one and the ones above, I made em with DDIM
>>
>>101848676
>Ah fair enough, will be interesting to see the prompt none the less, even without the 10neg. Particularly the Boris Valejo one
Ok then I'll get back to you once it's done
>>
>>101848682
>2nd panel
Finally, after all these years I have found it... the clitorus
>>
>>101848676
>They are over cooked with high cfg. Was early tests from anons Dynamic Thresholding workflow, settings aren't dialed in
you got em like this? I am running CFG == 8 no problem
>>
File: fp061.jpg (414 KB, 1024x1024)
>>
File: ComfyUI_31347_.png (935 KB, 640x1280)
>>
>>101848706
>Was early tests from anons Dynamic Thresholding workflow, settings aren't dialed in
I'm that anon, so you think that changing "interpolate_phi" from 0.7 to 0.87 makes it better?
>>
>>101848706
Neg CFG was set to 10
>>
>>101848725
pure superstition, I like to nudge the known settings a bit.. but it won't make much of a difference, it's about color depth and the like
>>101848727
kek, pic related
>>
>>101848641
Nvm seems overcooked
>>
File: ComfyUI_00604_.png (1.35 MB, 1024x1024)
>>
>>101848743
now swap positive and negative prompts
>>
>>101848676
>stylized abstract modern artwork featuring bright colours, complex geometric shapes, highly detailed, intricate visible brush strokes, of Hatsune Miku
https://imgsli.com/Mjg2NzUx
you said the "Boris Valejo" one but I didn't find any workflow that has his name on the prompt
>>
>Upon final inspection of the FLUX checkpoints, they discovered trace amounts of E.COLI
S-Sars?!
>>
>>101848743
if you are writing the same text for the both of them, why won't you just fuse them, would be less a pain in the ass lawl
>>
File: 00012-1558883101.png (2.71 MB, 1536x1536)
Oh yeah almost forgot to mention, i fixed my problem earlier with forge;
Now loading an old forge for anything not Flux, and keeping the latest forge as Flux develops to the point i can actually feasibly use it.

>now someone please share some funny indian gens i dont have mine even named so i cant find em
>>
File: ComfyUI_Flux_02102_.png (1.15 MB, 1024x1024)
Testing https://civitai.com/models/639820/kazuma-sketchbook-flux?modelVersionId=715659
>>
File: ComfyUI_31350_.png (887 KB, 640x1280)
>>
File: 2024-08-12_00074_.png (1.68 MB, 1024x1280)
>>101848775
cause I sometime like clip more than txxl .. depends on the prompt

>>101848760
here you go.. pic related
>>
they should have trained it with T5 only, and proper unfiltered captions
>>
>>101848764
>you said the "Boris Valejo" one but I didn't find any workflow that has his name on the prompt
It's the demon one. I must have asked an LLM to describe a BV painting and slapped miku into the prompt. Was a few days ago and I am quite retarded
>>
>>101848832
>cause I sometime like clip more than txxl ..
I wonder if there's a node that does that for us already, just pulls out key words for clip
>>
File: ComfyUI_Flux_02106_.png (1.94 MB, 1024x1024)
>>101848809
>>
File: 2024-08-06_00003_.png (2.62 MB, 1280x1024)
>>101848841
if you want any type Miku just follow this workflow, replace artist with the one you want and replace cubism with the style the artist paints in

>https://files.catbox.moe/cbym12.png
>>
File: 1701383280014507.png (967 KB, 1024x1024)
Miku wearing a business suit and using two pistols, in the style of john wick:
>>
File: 2024-08-06_00027_.png (2.5 MB, 1280x1024)
>>101848869
havent seen one yet.. but as Black Forest anon said a few days ago when he visited us: "There was a reason why we kept clip"
>>
File: ComfyUI_Flux_02107_.png (1.89 MB, 1024x1024)
>>101848876
>>
>>101848692
interesting. the ddim_steps field name was just an artifact from past development, they supported other samplers
>>
>>101848899
DALL-E 3 didn't and it beats Flux
>>
File: ComfyUI_Flux_02108_.png (1.61 MB, 1024x1024)
>>101848913
>>
File: 1723438909917248.jpg (165 KB, 1402x602)
>>101848636
>all of em are good, but why do they have such a grainy texture?
that's because of the DynamicThreshold, it fixes CFG but also gives this white/grainy effect, there's supposedly a better alternative than that but it's "very slow" according to its authors
https://github.com/scraed/CharacteristicGuidanceWebUI
>>
File: ComfyUI_Flux_02109_.png (1.82 MB, 1024x1024)
>>101848939
Overall not a bad LoRA
>>
>>101848876
>>101848913
>>101848939
>>101848953
soul
>>
>>101848841
Here's the comparison with Boris Valejo
https://imgsli.com/Mjg2NzYz
>>
File: fp063.jpg (406 KB, 1024x1024)
>>
File: ComfyUI_31354_.png (741 KB, 640x1280)
Okay, flux paired with gemma2b is better at iyashikei slice of life 4koma than most human authors.
>>
>>101848899
>There was a reason why we kept clip
probably for training purposes so people can use their existing data sets
If Flux becomes the default they get more funding, more recognition, more money.
They are out the gate really fucking strong, and we have 2 years of autism knocking out loras really quickly now.
>>
File: ComfyUI_31355_.png (1.17 MB, 640x1280)
>>
>>101848876
since I can't read japanese to me this just looks like someones real drawing.
>>
File: Capture.jpg (501 KB, 3158x1336)
>>101848841
Take back what I said, it seems like TonemapNoiseWithRescaleCFG can handle CFG + Guidance 10
https://imgsli.com/Mjg2Nzgz
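
Going only by its name, the node seems to pair noise tonemapping with CFG rescaling. Below is a minimal Python sketch of the rescale half as it is commonly described: after normal CFG, scale the guided prediction so its standard deviation matches the conditional prediction, then blend with the unrescaled result. The function names, the `multiplier` default and the toy inputs are assumptions, and this is not the node's actual code.

```python
from statistics import pstdev


def cfg(cond, uncond, scale):
    """Standard classifier-free guidance."""
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


def rescale_cfg(cond, uncond, scale, multiplier=0.7):
    """CFG followed by a std-matching rescale.

    multiplier=0 returns plain CFG, multiplier=1 returns the fully
    rescaled prediction. Assumes the guided prediction is not constant
    (otherwise its standard deviation is zero and the division fails).
    """
    guided = cfg(cond, uncond, scale)
    factor = pstdev(cond) / pstdev(guided)
    rescaled = [g * factor for g in guided]
    return [multiplier * r + (1.0 - multiplier) * g
            for r, g in zip(rescaled, guided)]


# Toy predictions (hypothetical numbers, not from a real model).
cond = [1.0, -1.0]
uncond = [0.5, -0.5]

print(cfg(cond, uncond, 10.0))          # plain CFG blows up the range
print(rescale_cfg(cond, uncond, 10.0))  # rescaled and blended back
```

The point of the rescale is that a high guidance scale inflates the magnitude of the prediction, which is one source of the burned, oversaturated look; pulling the spread back toward the conditional prediction keeps the direction of the guidance while taming its size.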
>>
File: 1697893684696146.png (935 KB, 1024x1024)
>>
>>101848970
Thanks, still doesn't beat DT, looked a bit burned.
>>
File: 2024-08-12_00075_.png (1.63 MB, 1024x1280)
>>
File: ComfyUI_31356_.png (1.26 MB, 640x1280)
>>
File: Capture.jpg (20 KB, 627x232)
>>101849048
>looked a bit burned.
the parameters can be changed, I used the default one, DynamicThreshold looked like shit when I used the default one, took me days to figure out the sweet spot kek
>>
>>101849076
>the parameters can be changed, I used the default one, DynamicThreshold looked like shit when I used the default one, took me days to figure out the sweet spot kek
can't really tell from your poogens
>>
File: ComfyUI_31357_.png (1 MB, 640x1280)
>>
>>101849057
This watchmen adaptation was fucking weird.
>>
has upscaling always been this slow compared to genning? My gens could be taking half the time if upscaling didn't take forever. Where I sit at 1.35 s/it for gens, upscaling takes 6.83 s/it, sometimes as high as 9 s/it.
>>
File: ComfyUI_31339_.png (1.5 MB, 1024x1024)
>>
File: ComfyUI_31340_.png (1.35 MB, 1024x1024)
>>
>>101849200
upscaling has always been slower, bigger image means more work to do in each iteration
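Rough numbers for why, as a back-of-the-envelope sketch. It assumes an 8x VAE downscale and 2x2 latent patches (typical for DiT-style models, but an assumption here, not something stated in the thread), and the function names are made up for illustration.

```python
def latent_tokens(width: int, height: int, vae_factor: int = 8, patch: int = 2) -> int:
    """Latent tokens for an image, assuming an 8x VAE and 2x2 patches."""
    step = vae_factor * patch
    return (width // step) * (height // step)


def relative_cost(base, upscaled):
    """Return (per-token work ratio, self-attention work ratio)."""
    linear = latent_tokens(*upscaled) / latent_tokens(*base)
    attention = linear ** 2  # self-attention scales quadratically with tokens
    return linear, attention


linear, attention = relative_cost((1024, 1024), (2048, 2048))
print(f"tokens: {linear:.0f}x, attention: {attention:.0f}x")
```

Doubling each side gives 4x the tokens and up to 16x the attention work, so a per-iteration slowdown somewhere between those two is plausible for a 2x upscale.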
>>
>>101849244
what's this I see about model switching at different phases? Should I try a model meant for upscaling to get a bit of a boost?
never really saw anyone talk about this in SDG so I'm pretty ignorant.
>>
File: ComfyUI_31331_.png (1.6 MB, 1024x1024)
This one may be a bit too spicy for /g/: https://files.catbox.moe/ehcgqs.png
>>
File: fp064.jpg (182 KB, 1024x1024)
>>
File: 1561816457.png (2.07 MB, 1920x1080)
>>
File: ComfyUI_00628_.png (1.46 MB, 1024x1024)
>>101849303
haha just kidding haha
>>
So are we back with LoRAs for Flux? Something change?
>>
File: FLUX_00011_.png (1.01 MB, 896x1152)
>>
File: ComfyUI_31365_.png (1.3 MB, 848x1200)
>>101849441
Heh
>>
File: FLUX_00077_.png (1.36 MB, 1024x1024)
the humble peugeot 206
>>
File: 231930219.png (1.38 MB, 1216x832)
>>
File: ComfyUI_31368_.png (1.49 MB, 848x1200)
>>
File: fp065.jpg (414 KB, 1024x1024)
>>
File: 1275099653.png (1.49 MB, 1344x768)
>>101849642
the not-so-humble 106 GTI
>>
File: FLUX_00074_.png (1.14 MB, 896x1152)
>>101849754
not enough rice
>>
File: ComfyUI_31370_.png (1.58 MB, 848x1200)
>>
File: fp067.jpg (256 KB, 1024x1024)
>>
>>101849765
Gas gas gas!
>>
File: ComfyUI_31371_.png (1.65 MB, 848x1200)
>>
File: 141543577.png (1.4 MB, 1344x768)
>>101849765
naisu
>>
>>101848486
Yes. Because CLIP goes up to 77 tokens and T5 goes up to like 500.
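A toy illustration of what that limit means for a long prompt. The whitespace "tokenizer" is a stand-in (real tokenizers split into subwords and reserve slots for start/end tokens), and the 512 figure for T5 is an assumption, since the post only says "like 500".

```python
CLIP_MAX_TOKENS = 77   # CLIP text encoder context length
T5_MAX_TOKENS = 512    # assumed T5 limit; the post only says "like 500"


def toy_tokenize(prompt: str) -> list:
    """Whitespace split, a hypothetical stand-in for a real tokenizer."""
    return prompt.split()


def fit_to_encoder(prompt: str, max_tokens: int) -> tuple:
    """Return (tokens the encoder would see, number of tokens cut off)."""
    tokens = toy_tokenize(prompt)
    kept = tokens[:max_tokens]
    return kept, len(tokens) - len(kept)


prompt = " ".join(f"word{i}" for i in range(200))
for name, limit in (("CLIP", CLIP_MAX_TOKENS), ("T5", T5_MAX_TOKENS)):
    kept, dropped = fit_to_encoder(prompt, limit)
    print(f"{name}: kept {len(kept)}, dropped {dropped}")
```

With a 200-word prompt, the CLIP-sized window cuts off everything past position 77 while the T5-sized window keeps all of it, which is why long descriptive prompts only really reach the T5 side.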
>>
File: fp066.jpg (427 KB, 1024x1024)
>>
File: ComfyUI_31375_.png (1.62 MB, 848x1200)
>>
File: ComfyUI_31376_.png (1.42 MB, 848x1200)
I love flux manga.
>>
Is Flux any good for photographic-like images? The online version seems meh.
>>
File: FLUX_00116_.png (1.06 MB, 896x1152)
>>
>>101850118
There was only one, now there's two. Just like Agent Smith.
>>
qrd on flux prompting? is it just boomer prompting or is there something else going on?
>>
>>101848306
Prompt for pic rel? Looks great.
>>
File: fp068.jpg (354 KB, 1024x1024)
>>
File: FLUX_00072_.png (1.28 MB, 1152x896)
>>101850218
same as sd3
>>
File: 00005-75558537.png (2.07 MB, 1432x1432)
cyberrealistic pony is surprisingly good, was having issues getting pony realism to do unique poses and more than just 1girl portrait, though having issues getting consistent quality. ez to get passable and consistent results though.


>I wonder if her brooch getting duped sometimes is a result of bad tagging?
>>
>>101850218
It follows the prompt well unless the photo involves an adult human female. If you don't feel like writing a boomer prompt, just get ChatGPT to write it for you.
>>
File: ComfyUI_Flux_90.png (1.85 MB, 1344x768)
Huh, I thought there was a fix for dynamic thresholding producing white noise when used with adaptive guidance with last steps at cfg 1. I even reinstalled it (and updated everything else) and triple-checked to use the same settings that were posted here many times but alas
>>
>>101850415
does your prompt contain a stray ":" by any chance? Also, every recent model inserts eyes or proto eyes in places where they don't belong. I'm not sure why.
>>
post your gen when /ldg/ won
>>
Will I ever be able to train Hunyuan Loras on a 12gb?
>>
File: ComfyUI_Flux_6999.jpg (164 KB, 1024x1024)
>>
File: gen_tmp_04.jpg (249 KB, 1312x1312)
>>
>>101850500
lol no
>>
>>101850480
No but i just noticed my current gen accidentally left a comma in lmao oops
>>
File: ComfyUI_Flux_6977.jpg (177 KB, 1024x1024)
>>
>>101850415
One sign of a braindead model (most fine-tunes are like that, a sure sign is when its author makes a new version every two months...) is the inability to produce anything other than Instagram-style human portraits.
One way to gauge a model is to test common painters.
>>
>>101850177
I don't get anything even close to that.
>>
>>101850558
Yeaaahh I just wanted to give that model a break because it's the best of its kind, that one amateur realism lora is shockingly BAD. looks like a deepfried amalgamation nightmare and it's highly rated.
Cyberrealistic was always hit or miss: it was one of the best models for 1.5 and a kinda bad XL model. In this case for Pony I still can't judge because I'm using it for the first time.
It's certainly higher quality than any of the other cyber realistics.
>>
>>101850526
:( vramlet life sucks
>>
>>101850627
What model are you using, that's not Flux.
>>
File: 4073191605.png (1.27 MB, 896x1152)
>>
>>101850629
Seems alright.
>>
File: image-3.jpg (91 KB, 1024x1024)
>>101850675
No. That's SD1.5 Humu, as in the filename. This is what I get on Flux schnell
>>
File: glumlot.png (408 KB, 472x840)
Hey guys, I need help with a question about how some AI videos are made

First, are you familiar with the "dreamcore/analog horror/liminal" aesthetic?

Well if you are not here are a few examples

https://www.youtube.com/watch?v=2vtn9EjSlJ0

This is DeadTempo visions he does dreamcore stuff mostly

https://www.youtube.com/watch?v=G915hZm18qM

Glumlot, same dreamcore aesthetics

https://www.tiktok.com/@fearstfilm/video/7368984349745679658?q=fearstfilm&t=1723469007140

FearstFilm, he does analog horror liminal spaces stuff

so, my question is how are those videos made? I mean, what AI app or software is being used... At first I was thinking a MidJourney picture that is then animated using video AI software like KLING, but I have been trying to achieve the right aesthetic and I just can't, so there must be something I'm missing. Since DeadTempo and Glumlot achieved almost the same aesthetic, I guess the procedure is something entirely different

any ideas? much appreciated (and if not well maybe direct me in the right direction where i could find some answers and i'll be on my way)
>>
>>101850787
use the realism lora, or find the right keywords
>>
File: ComfyUI_00653_.png (1.46 MB, 1024x1024)
>>
>>101850629
In my experience pony is horrible, it does characters well but even then it needs wrangling. Have never tried any photographic pony fine-tunes (in my logic that's useless because pony is a cartoon model anyway but what do I know).
>>
>>101850787
Humu is actually very good with artists and art styles, better than some other models. It's something you wouldn't believe.
>>
File: image-4.jpg (110 KB, 1024x1024)
>>101850799
or this one

I'm using the same prompt. It's not about the prompt, it's about the image quality. It's not the same as >>101850177
So wondering what's the trick. I'm installing locally. Maybe the only version is nerfed.
>>
File: 1718686885824652.jpg (53 KB, 600x600)
>>101847080
Can someone please try generating the following with FLUX:
"backgammon board, view from above"
or just
"backgammon board"
None of the other image generators, not even Midjourney, ever got this right.
Wondering if they have progressed anywhere on details with FLUX
>>
>>101850820
welcome back
>>
>>101850853
>It's not about the prompt, it's about the image quality
yeah, use the realism lora
>>
>>101850853
*on-line
>>
File: ComfyUI_Flux_95.png (1.81 MB, 1344x768)
>>101850446
And this >>101849076 thing gives me the same shit. Guess it's the problem with adaptive guidance then, even though I have the latest updated version. Does it need any fixes or special settings applied? Using adaptive guidance with thresholding of 0.994, cfg of 6 and uncond thingy at 0. Tried tinkering with values but nothing changes. Removing adaptive guidance altogether gives me overcooked shit
>>
It's already heating up and that's before I even opened up the oven door to find some piping hot fresh bread...
>>101850883
>>101850883
>>101850883
>>
>>101850830
pony blows every other model out of the water. It requires wrangling but you know that going into it. You can do things with pony that are outright impossible with flux. Flux is good for writing and memes.
>>
>>101850925
I think you should reinstall AdaptiveGuidance and DynamicThreshold, because it's working for me
>>
>>101850861
thanks, but I am not a regular
>>
>>101850925
>And this >>101849076(You) thing gives me the same shit.
that's normal for that one, that TonemapNoise shit needs to be deactivated when the CFG returns back to 1, a bit like DynamicThreshold did a few days ago
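For anyone wondering what "deactivated when the CFG returns back to 1" means in practice, here is a minimal sketch. It assumes the adaptive guidance threshold (0.994 above) is a cosine similarity between the conditional and unconditional predictions, and that once it is crossed the sampler stays at CFG 1. That is a reading of the technique, not the actual node code, and `clamp` is a hypothetical stand-in for DynamicThreshold/TonemapNoise-style post-processing.

```python
import math


def cfg_mix(cond, uncond, scale):
    """Classifier-free guidance: uncond + scale * (cond - uncond)."""
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


def guided_step(cond, uncond, scale, threshold, post_process, cfg_off):
    """One sampling step. Returns (prediction, cfg_off).

    Once cond and uncond agree more than `threshold`, CFG stays at 1 for the
    remaining steps and the post-processing hook is skipped entirely.
    """
    if not cfg_off and cosine_similarity(cond, uncond) > threshold:
        cfg_off = True
    if cfg_off:
        return list(cond), True  # scale 1 is just the cond prediction
    return post_process(cfg_mix(cond, uncond, scale)), False


def clamp(pred, limit=2.0):
    """Hypothetical stand-in for a CFG fix that tames large values."""
    return [max(-limit, min(limit, p)) for p in pred]


early = guided_step([1.0, 0.0], [0.0, 1.0], 6.0, 0.994, clamp, False)
late = guided_step([1.0, 0.5], [1.0, 0.49], 6.0, 0.994, clamp, False)
print(early, late)
```

In the `early` call the predictions disagree, so CFG 6 plus the clamp is applied. In the `late` call they are nearly identical, so the step falls back to the plain conditional prediction and the clamp never runs. If the fix keeps running during those CFG 1 steps, it is operating on output it was never tuned for, which would fit the white noise described above.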