[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (1.14 MB, 3264x3264)
1.14 MB
1.14 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102003576

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/g/sdg
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
>touohoutard avatarfag attentiontranny still forcing his slop into OP collages
embarrassing, literally mentally ill
>>
>>102006777
>not a single of those animal crossing/y2k gens made the collage
what the fuck are you doing? last thread had tons of good gens.
>>
>>102006801
>touohoutard avatarfag
He generally doesn't post any text along with his slop so many people don't notice it as much, but I know when an avatar has broken from their containment. I saw the quokka come in here once but he left when people weren't responding to him.
>>
>>102006811
I posted those late, I made the doom miku tho
>>
File: cunt.png (320 KB, 396x387)
320 KB
320 KB PNG
>Hey guys I'm just gonna post a link to my patreon and some random pictures of my LoRA in your pull discussion on github.
>>
give me some random flux lora ideas that i'll probably never train. i was thinking of a pikmin enemies one.
>>
File: 2024-08-20_00355_.jpg (664 KB, 2560x1440)
664 KB
664 KB JPG
>>102006777
checked, and thank you baker
>>
>>102006846
tuktuks from various countries
>>
>>102006846
Gather a bunch of world of warcraft classic screenshots and tag them appropriately.
Flux already knows WoW. Help it understand the prompt better.
>>
File: ComfyUI_00043_.png (3.2 MB, 1536x1536)
3.2 MB
3.2 MB PNG
>>
>>102006846
a 1997 isuzu hombre
>>
>>102006846
8 bit nintendo games
>>
>>102006846
bbig boob lora
>>
>>102006863
>Gigantic windows on first floor facing public beach
Yeah nah, no way I could ever relax in that house knowing any number of crackheads could be watching me from the darkness of the beach outside.
>>
>>102006846
futa and sweaty butthole lora
>>
now that I am bored with bored women in FLUX, thanks to the lora .. what to prompt?
>>
>>
>>102006889
futa I already seen on civitai .. actually like 3 or something
>>
>>102006777
imagine licking ran's sweaty abs and then brushing her tails after a workout...
>>
>>102006882
just pile lots of crack and fentanyl in the woods and they'll swarm it like zombies and leave you alone
>>
>>102006901
a lora for each of the /sdg/ avatarfags then
>>
>>102006901
yet no sweaty butthole lora, what has our society come to?
>>
File: ComfyUI_05040_.png (1.22 MB, 1024x1024)
1.22 MB
1.22 MB PNG
>>
what's the difference between CFG and guidance? I'm confused
>>
>>102006918
That is some 90's nostalgia shit.
>>
>>102006918
she's going to vore the moon and poop it out
>>
>>102006929
>That is some 90's nostalgia shit.
that's the goal yeah, I fucking love the y2k era
https://civitai.com/models/667307
>>
Blessed thread of frenship
>>
>>102006926
they trained Flux on CFG 1 so they had to find something else to improve the prompt adherance, and that's how the distilled guidance was born, but fuck that I'm CFGmaxxing with Dynamic Threshold or Tonemap, CFG still rules to this day
>>
Forge gives me 3.4s/it vs Comfy's 2.4s/it
tried both FP16 (guessing Forge is casting to FP8 automatically) and the Q8 gguf
Is Forge a joke?
>>
>>102006846
frutiger aero, y2k has one
>>
What is the least I need to get a lora to work with dev? I have the nodes connected as usual, but am I missing something with the required Guidance?
>>
Seaking of loras, I got a technical question. Is it pointless to use LoraLoader with CLIP strength? I see very little impact when varying that from 0-1, is LoraModelLoaderOnly enough?
>>
>>102006954
Flux Pro uses CFG
Flux Dev was guidance distilled to not use CFG but instead have conditioning on the Guidance value so it learned what different CFG levels look like
Flux Schnell was guidance AND step distilled
>>
>>102006975
>Flux Dev was guidance distilled to not use CFG but instead have conditioning on the Guidance value so it learned what different CFG levels look like
so there's a way to bring the regular CFG back to flux dev with a finetune?
>>
gguf flux isn't actually faster or is it? or am i doing something wrong?
>>
>>102006975
Was it ever concluded if it was possible to un distill flux dev? Not like, restore it to pro state, but uncrack it a bit.
>>
>>102006983
>so there's a way to bring the regular CFG back to flux dev with a finetune?
no, till now DynamicTreshholding with CFG normalizing to 1 is the only way to get access to CFG on dev, a finetune wont change that.
>>
>>102006986
GGUF is just better for quantizing and (don't quote me on this) plays with system ram a little better?
>>
>>102006990
>no, till now DynamicTreshholding with CFG normalizing to 1 is the only way to get access to CFG on dev,
tonemap also works but yeah what a bummer, I thought it was possible to "un-distill" a model with some more training
>>
Anyone else notice teams that use the T5 encoder get really nervous and dismissive when you talk about finetuning it?

I wonder why that is.
>>
>>102006990
>a finetune wont change that.
If your finetune stops conditioning on guidance or you fix the value you are effectively training it without guidance distillation
>>102006999
it is
>>
>>102007011
>If your finetune stops conditioning on guidance or you fix the value you are effectively training it without guidance distillation
>it is
then it is possible to bring the CFG back if we undistill the model during the finetuning?
>>
File: ComfyUI_05041_.png (1.21 MB, 1024x1024)
1.21 MB
1.21 MB PNG
>>
>>102007019
NTA, but I believe that is the idea.
>>
File: ComfyUI_00047_.png (3.1 MB, 1536x1536)
3.1 MB
3.1 MB PNG
>>
File: ComfyUI_05042_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>
File: 1700695677360778.jpg (155 KB, 984x984)
155 KB
155 KB JPG
>>
>>102007009
it's a waste of resources and would mean you can't have one T5 shared with all finetunes
>>102007019
you can't "undistill" back to what it was but you can make it better for CFG
CFG is something that happens during image gen, not training
during training what you need is conditioning dropout and Flux Dev's distillation process might still have done it
>>
>>102007068
damn that looked good until i opened it...
>>
>>102007072
>you can't "undistill" back to what it was but you can make it better for CFG
>during training what you need is conditioning dropout and Flux Dev's distillation process might still have done it
at least it's something, if it can handle higher CFG better that's already a win, and it won't be so fried on high cfg + DynamicThresholding/Tonemap
>>
>>102007068
so they can make kale and spinach look like a burger, but they can't make mealworms and beetles look like a burger
>>
File: ComfyUI_00035_.png (2.76 MB, 1248x1848)
2.76 MB
2.76 MB PNG
>>
>>102007087
what happens when the tide comes in
>>
File: 1700239841702370.jpg (194 KB, 800x1170)
194 KB
194 KB JPG
U vill eat ze bug
>>
>>102007092
Can't explain that.
>>
File: 1704779053736376.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
Miku Hatsune is in the game Animal Crossing. She is talking to Goku from the anime Dragonball Z. In the background is a comfy looking house with a sign that says "Kame House". The image is in the style of the game Animal Crossing.
>>
File: 1695860825101715.jpg (161 KB, 800x1170)
161 KB
161 KB JPG
>>
>>102007092
>let's keep looking honey
>>
>>102007103
so you managed to make it work without the animal crossing lora?
>>
>>102007110
it works better with the lora because it was trained with images like it, the base flux even knows what ingame WoW looks like.
>>
File: 00145-2457484864.jpg (748 KB, 1344x1728)
748 KB
748 KB JPG
>>
File: Capture.png (90 KB, 840x1400)
90 KB
90 KB PNG
512 tokens is a lot, they really haven't missed when they released Flux, the only mistakes they made can be easily fixed
>>
>>102007104
make me pineapple pizza plus spiders
>>
>>102007136
different tokenizer
>>
>>102007141
true, but I don't believe the difference is that big maybe 5-10% off the GPT4 tokenizer in terms of tokens total count
>>
File: 2024-08-21_00353_.png (951 KB, 1024x1024)
951 KB
951 KB PNG
>>
>>102006991
i don't understand what is going on anymore. why is flux schnell smaller in size than flux dev? for example the q5 variants??
>>
what happens when you try an XL lora in flux?
>>
>>102007118
even the most popular mmorpg ever? amazing
>>
File: 2024-08-21_00355_.png (353 KB, 1024x1024)
353 KB
353 KB PNG
>>102007170
you get an error, they are not compatible
>>
>>102007182
you're joking but it doesn't know the main character of GTA5, one of the most popular games ever
>>
>>102007166
>why is flux schnell smaller in size than flux dev?
It shouldn't be.
>>
does flux know old school runescape?
>>
>>102007170
this anon is kinda right >>102007189 .. it won't crash tho, comfy just looks at the lora and tries to applies it and sees its in the wrong format, you get alot of
>lora key not loaded: ...
in the console, and then its like no lora is loaded at all
>>
File: 2024-08-21_00358_.png (1.07 MB, 1024x1024)
1.07 MB
1.07 MB PNG
I asked for an ar15 .. it gave me an akm .. well I guess its fine to
>>
>>102006777
lucky thread
>>
File: FD_00072_.png (1.62 MB, 1344x768)
1.62 MB
1.62 MB PNG
Running into all sorts of OOM issues I wasn't before recently. Switched back to the fp8 that was running fine at the beginning but it's still happening. Gens fine at 1.09s/it for a few gens then shits itself and goes as slow as 1m/it. Anyone else experienced this?
4080 btw.
I would be interested in seeing some of your workflows if you're running the same card and not getting this.
>>
File: ComfyUI_05049_.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>
File: 1721026695333383.jpg (81 KB, 984x984)
81 KB
81 KB JPG
>>102007227
Schnell not making thick outlines :( . What proomt u used ?
>>
>>102007283
try the q8 model + gguf loader, seems to work better/less memory issues
>>
>>102007319
That's what I was using and where the issues began, and why I switched back to F8.
>>
File: 1693824681362196.jpg (48 KB, 729x806)
48 KB
48 KB JPG
>>102007334
this has been working pretty well for me, same card

but f8 is fine too
>>
>>102007345
Hmm, you're using clip_l, I am using ViT-L, maybe it's that?
>>
>>102007358
that might be it, just use the ones in the comfy guide

https://comfyanonymous.github.io/ComfyUI_examples/flux/
>>
File: 123.png (1.71 MB, 1024x1024)
1.71 MB
1.71 MB PNG
>>
>>102007283
need more info, and maybe screenshot of workflow. lora involved? you offloading clip to ram?
>>
File: workflow2.png (976 KB, 5078x2521)
976 KB
976 KB PNG
>>102007363
I mostly am.
A few tweaks for uptional upscaling and adding a LoRA and that's it.
>>
File: ComfyUI_05050_.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>
>>102007390
see >>102007391
>you offloading clip to ram?
I guess so. 32GB RAM is full up, that's the issue.
>>
>>102007391
looks good, I still need to learn stuff like adding a section for inpaint stuff if I want to, been using forge for quick edits to test so far.
>>102007396
nice, could be the cover of a ps2 game, lora or just prompts?
>>
>>102007403
>lora or just prompts?
it's a lora yeah
https://civitai.com/models/667307
>Y2K style cover art with a 3d character of Hatsune Miku, with her iconic turquoise twin-tails flowing in the wind, performs an impressive kickflip on her skateboard while soaring over the majestic Grand Canyon. The sun is setting in the background, casting a golden glow over the vast, rugged landscape. Miku’s expression is one of pure exhilaration as she defies gravity, her outfit a blend of futuristic and streetwear style, complete with neon accents that make her stand out against the natural beauty of the canyon. Y2K style text at the bottom: "STYLISH"
>>
>>102007391
I've seen that workflow before. ehh try w/o lora. loras add nasty overhead to model - i got this node that can force clip into ram
>>
>>102007425
Yeah I did and it's the same issue, but I want the LoRA. And it was working fine with the LoRA until just a few days ago.
I feel like in my tweaking I connected up something wrong somewhere and fucked the whole thing up by putting the wrong node into the wrong hole.
>>
>>102007416
it's a cool aesthetic, reminds me of playstation/dreamcast game boxes
>>
>>102007458
w..wrong hole?!
try the clip offload thing https://gist.github.com/Sunderbraze
put in base custom nodes folder, force clip to cpu, NOT on cuda aka your gpu. worth a shot
>>
File: ComfyUI_05052_.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
https://www.youtube.com/watch?v=9NkkZJHova4
>>
File: 1709368767334910.png (1.22 MB, 1024x1024)
1.22 MB
1.22 MB PNG
holy shit I actually got a name tag on this one.
>>
>>102007506
I tried forcing clip to cpu but I gave a 10400 and was slow as fuck when I change the prompt.But, I didn't have this script, so I will try it.
>>
File: 1719866668514436.png (1.13 MB, 1024x1024)
1.13 MB
1.13 MB PNG
>>102007545
>>
>>102007545
Which lora is this?
>>
File: 2024-08-21_00348_.png (311 KB, 1024x1024)
311 KB
311 KB PNG
>>102007312
I used a Lora:
>https://civitai.com/models/668807/simple-ink-illustrations?modelVersionId=748712
>>
where's the finetunes mansly
>>
>>102007559
I know but I am OOMing with the dev-fp8 model and some loras and I got 24GB. can't update comfy because #2666 fucked various custom nodes I rely on so GGUF is a no-go for me
>>102007526
BRO lol
>>
File: 2024-08-21_00359_.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>102007605
two more weeks
>>
>>102007589
none, vanilla flux

World of Warcraft ingame screenshot, UI and HUD visible, the main character is Miku Hatsune, she is wearing a her default outfit and holding a large greatsword. The background is Hellfire Peninsula from World of Warcraft. The action bar at the bottom has sword icons. The minimap at the top right is a map of Outland.
>>
>>102007628
and there is a typo, but it still works!
>>
THREAD THEME: create a gen of what you think the average /ldg/ anon looks like
>>
>>102007570
sex
>>
File: 2024-08-21_00362_.png (596 KB, 1024x1024)
596 KB
596 KB PNG
>>102007657
I feel nice today. You are all needy catboxers.
>>
File: FLUX_00020_.png (1.17 MB, 896x1152)
1.17 MB
1.17 MB PNG
>>102007657
>>
>>102006897
Very Nice. Catbox please.
I need a fix for fine arts style in Flux pronto.
>>
>>102007687
erotic male
>>
File: 2541860566.jpg (115 KB, 896x1152)
115 KB
115 KB JPG
>>102007657
>>
File: 2024-08-21_00370_.png (710 KB, 1024x1024)
710 KB
710 KB PNG
>>
File: ComfyUI_05053_.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
https://www.youtube.com/watch?v=UYahPVL3FL0
>>
File: Flux_00540_.png (900 KB, 1024x1024)
900 KB
900 KB PNG
>>102007657
>>
File: FLUX_00040_.png (1.31 MB, 896x1152)
1.31 MB
1.31 MB PNG
>>102007705
this is male
>>
>>102007748
>the noose
ayy lmao
>>
Flux base model is too huge and has garbage data...
I don't want ugly old politicians nor WoW data in it.
>>
File: 1717388591121.png (1.22 MB, 953x953)
1.22 MB
1.22 MB PNG
>>102006846
Asterix
>>
File: 2024-08-21_00378_.png (669 KB, 1024x1024)
669 KB
669 KB PNG
>>
>>102007769
the point is the base model can do almost anything. just use loras for specifics it can't do, or certain styles.
>>
File: number wan.png (936 KB, 960x768)
936 KB
936 KB PNG
>>102007657
>>
How do I use the gguf t5 clip? Keeps throwing up an error
>>
>>102007769
>Flux base model is too huge
that's the thing, it's big so it can eat a lot of shit, everyone will be happy if everything is included
>>
>>102007769
t. vramlet
>>
Is it possible to train flux loras using 1024x1024 pictures on a 16gb card, or do i have to downscale them to 512?
>>
File: ComfyUI_05054_.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
https://www.youtube.com/watch?v=ZhFAj8-Wfx8
>>
File: image.jpg (128 KB, 1536x1024)
128 KB
128 KB JPG
cyberjews
>>
File: 1721382165869467.png (1.18 MB, 1024x1024)
1.18 MB
1.18 MB PNG
>>
>>102007867
>muh mikuu
>>
File: 0.jpg (445 KB, 2048x1024)
445 KB
445 KB JPG
>>
File: 00162-2352959416.jpg (311 KB, 896x1152)
311 KB
311 KB JPG
>>
>Trained 7 LoRAs so far and they've all been of my personal porn collection
The LoRAs... they're just so good, it's like I can just keep making more porn from the porn I already have.
Like planting porn seeds on a porn farm to harvest more porn.
>>
>>102007911
make a Dafne Keen pony lora
>>
You know what a good lora style would be? the one mimicing the FFX cutscenes:
https://www.youtube.com/watch?v=QgW-UC9tcU4
>>
File: 2024-08--21-002.png (1.54 MB, 1536x1024)
1.54 MB
1.54 MB PNG
>>102007545
>>102007570
nice
>>
File: image.jpg (130 KB, 1536x1024)
130 KB
130 KB JPG
>>
>>102007911
>so as I pray, infinite porn works
>>
File: robot002.jpg (220 KB, 899x1368)
220 KB
220 KB JPG
>>102007657
not physically but mentally
>>
File: ComfyUI_05055_.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>
>>102007802
Update comfy, they pushed an update yesterday
>>
File: 00171-438709855.jpg (237 KB, 896x1152)
237 KB
237 KB JPG
>>
File: image.jpg (136 KB, 1536x1280)
136 KB
136 KB JPG
>>102007911
>planting porn seeds on a porn farm to harvest more porn
somebody gen this
>>
File: 2024-08--21-008.png (1.14 MB, 1024x1024)
1.14 MB
1.14 MB PNG
>>
>>102007920
>>102007929
bros you are killing it. seriously. bringing a tiny bit of light into my dark world.
>>102007972
NOTED
>>
File: 1711001522310498.png (1.03 MB, 1024x1024)
1.03 MB
1.03 MB PNG
bonus if you guess the reference
>>
>>102007956
I did, but I'm guessing I need to do a pull from the folder because manager I did it via Manager.
>>
File: 00161-3008514455.jpg (198 KB, 896x1152)
198 KB
198 KB JPG
>>
>>102008029
lol I did not prompt pain on the wall, that was just there
>>
>>102008029
NHK ni Youkoso?
>>
>>102008029
4chan banner, but also that other thing ^
>>
File: bComfyUI_105748_.jpg (219 KB, 768x1024)
219 KB
219 KB JPG
>>
So LoRAs with multiple character trigger terms aren't really working right now.
>>
File: 1721965552977458.png (1010 KB, 1024x1024)
1010 KB
1010 KB PNG
>>102008055
you got it!
>>
>>102008068
Monkey looking fella
>>
>>102008069
You captioned your images wrong.
>>
>>102008029
>>102008071
Room is too clean for a hikkikomori
>>
>>102008086
I'm not the only one experiencing this.
>>
>>102008093
They captioned the images wrong.
>>
File: ComfyUI_03551_.png (1.4 MB, 1280x960)
1.4 MB
1.4 MB PNG
prompt: ifeelmyself
>>
>>102008086
no, it's simply harder to add multiple characters with loras, it's just how it is
>>
>>102007769
You don't want a base model that only has "good" images, you want as much variety as possible
>>
File: image.png (2.66 MB, 1536x1024)
2.66 MB
2.66 MB PNG
>>102007972
tried

>>102008029
>>102008044
>pain
>unprompted
flux gets it
>>
Describing multiple characters would come from the T5 encoder, no? Do loras even train it?
>>
>>102007802
that clip_l doesn't work atm, use normal one
>>
File: 131548_00001_.png (1.13 MB, 1024x1024)
1.13 MB
1.13 MB PNG
>>102007657
>>
File: 1705594961116024.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
miku in a snow globe but also on a transparent cube:
>>
>>102008069
Works for me. I trained a LoRA on 2 subjects at once and it got it. Needs more steps though, and you need to have images of the subjects both together and individually.
>>
File: bComfyUI_106324_.jpg (372 KB, 1984x1024)
372 KB
372 KB JPG
>>
File: image.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>102008140
>>
File: image.jpg (134 KB, 1536x1280)
134 KB
134 KB JPG
>>
File: ifx133.png (1.61 MB, 1024x1024)
1.61 MB
1.61 MB PNG
>>
File: image.jpg (151 KB, 1536x1280)
151 KB
151 KB JPG
>>102008134
>tried
i was imagining magazines not casettes but still good
>>
>Hey guys! I can train a LoRA on 12gb of vram!
>All I gotta do is brick my PC for 12 hours.
>>
File: 1711587053516673.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
>>
>>102008109
is it "add multiple characters with loras" or "make a lora with multiple characters"
these are very different things
>>
>>102008213
>he only has one pc
Anon how poor are you? Don't you have a genning pc with 3GPUs and a separate gaming PC with 2 GPUs so you can gen while you train while you game?
>>
>>102007628
I tried extremely hard to do wow stuff but was quite bad. I never managed to generate a moonkin, even with all detailed description generated by LLM.
>>
>>102008213
12 hour nap
>>
File: 4.jpg (1.39 MB, 2048x2048)
1.39 MB
1.39 MB JPG
>>
File: 1695361258272359.png (1.68 MB, 1024x1024)
1.68 MB
1.68 MB PNG
>>
File: image.jpg (139 KB, 1536x1280)
139 KB
139 KB JPG
>>
File: 2024-08-21_00415_.png (757 KB, 1024x1024)
757 KB
757 KB PNG
>>
this gguf shit is fantastic. Wish it worked with ViT-L
>>
File: 133312_00001_.png (1.28 MB, 1024x1024)
1.28 MB
1.28 MB PNG
>>102008213
>>102008240
>>
ideal amount of steps for good gens? 20 enough or no?
>>
>>102008249
is this base flux or with a lora? has more sovl than the usual slop it makes
>>
>>102008331
30 if you want consistency, 40 if you want consistency on text aswell
https://reddit.com/r/StableDiffusion/comments/1er3wt7/if_you_want_a_good_compromise_between_quality_and/
>>
>>102008331
30-35 seems to be the sweet spot
>>
File: 1718675041627194.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
>>
File: 513646515.png (1.4 MB, 1024x1024)
1.4 MB
1.4 MB PNG
Something got fucked up with flux loras on forge. It's crashing all over the place, but without loras it works no problem. Anyone else experiecning that?
>>
File: 2024-08-21_00420_.png (868 KB, 1024x1024)
868 KB
868 KB PNG
>>
>>102008366
>forge
I see your problem. You are using experimental software.
>>
>>102008331
24-28. really depends on amount of ish in your gen. and your patience. text? add a few. lots of items? add a few steps.
>>102008366
>forge (sorry)
>>
Hunyan is so much better than flux. Chinese engineers are #1.
>>
So with Crystools in Comfy the meta data has the seed of the image I load, but how do I pull the seed out of it? I can get the prompt but I also need the seed
>>
File: 1704999488538586.png (1.35 MB, 1024x1024)
1.35 MB
1.35 MB PNG
>>
>>102008416
>Chinese engineers are #1.
Chicom industrial espionage is #1
>>
File: file.jpg (50 KB, 768x768)
50 KB
50 KB JPG
does sd-scripts really not have a way to do validation images? how does one set it up to do that on an fixed interval with a variety of prompts? do they really expect us to train in the dark?
>>
>>102008416
It's a good model but flux definitely shits on it. Why do you think the entire community is using Flux instead of it?
>>
>>102008462
this
>>
>>102008462
it fucked itself from the start by having a retarded Chinese text encoder
>>
>>102008477
they fixed that?
>>
File: image.png (1019 KB, 1024x1024)
1019 KB
1019 KB PNG
>>102008332
>>
>>102008485
I honestly don't care, they should've launched speaking english from the start and now Flux is here and is better with full community support. Maybe they'll not fuck up the next time.
>>
File: image.jpg (100 KB, 1280x1536)
100 KB
100 KB JPG
>flux having trouble with small stars of David
if i prompted for a swastika i bet it would've worked just fine. how antisemitic of BFL
>SPH2
>>
>>102008486
what prompt did you use?
>>
File: ComfyUI_00141_.png (1.75 MB, 1024x1024)
1.75 MB
1.75 MB PNG
>>
>>102008497
both work easy if you use high quality enough model plus t5 .. else it gets bonked
>>
File: image.png (993 KB, 1024x1024)
993 KB
993 KB PNG
>>102008512
photo of an extremely smug :3 catfaced chibi plushie anime girl with closed eyes and hands on hips, speech bubble comes out of her that reads "miku in a snow globe but also on a transparent cube", nuclear missile in the background, depth of field, blurry background

photo of a chibi plushie anime girl with one eye closed and hand on hip open mouth pointing at the viewer, speech bubble comes out of her that reads "That is base flux.", big busty breasts oversized boobs in the background, depth of field, blurry background
>>
>>102008417
can't find a solution.
https://github.com/receyuki/comfyui-prompt-reader-node
try this. why do you want the seed tho?
>>102008486
really good texture.
>>102008497
lollipop fry
>>
File: 1722629342517106.png (749 KB, 1024x1024)
749 KB
749 KB PNG
>>
>>102008555
>https://github.com/receyuki/comfyui-prompt-reader-node
Thanks I will try it.
>>
>>102008555
Oh I need the seed to upscale, I don't want to upscale from a random seed, I want to use the exact one, to avoid any changes to the image I am upscaling. There's a tendency for upscales to add and remove details.
>>
File: bComfyUI_106439_.jpg (796 KB, 2048x1024)
796 KB
796 KB JPG
>>
File: 2024-08-21_00442_.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>102008497
>>
>>102008637
a daring synthesis (assuming it's the buddhism symbol)
>>
>>102008637
should be lgbt colors. or ukranian flag lol
>>
File: lolipop fries.jpg (134 KB, 1536x1280)
134 KB
134 KB JPG
>>102008555
>lollipop fry
?

>>102008559
>amateur tv screenshots
unironically a genius use of flux. more creative than surveillance footage
>>
File: 1781259949.png (1.44 MB, 1024x1024)
1.44 MB
1.44 MB PNG
>>102008398
>>102008411
oh well
>>
File: FD_00040_.png (1.43 MB, 1024x1024)
1.43 MB
1.43 MB PNG
>>102008643
You can just ask for a nazi swastika and it will give it to you
>>
>>102008672
I haven't had durst styli in ages.
>>
>>102008672
I saw someone use cctv as a prompt and was amazed, the model can do cool shit
>>
File: 2024-08-21_00448_.png (1.13 MB, 1024x1024)
1.13 MB
1.13 MB PNG
>>102008684
you can also just ask for "Star of David" and it will give it to you
>>
>>102008701
it keeps surprising me, I'm slowly snapping out of my WD tag mode and really pushing it, describing exactly what I want because odds are it's in there somewhere, even though it gets pushed tot he back by all the other far more common things
>>
File: image.jpg (97 KB, 1536x1024)
97 KB
97 KB JPG
>>102008637
i wonder if it can do the Raelism symbol
>>
File: 1692789231835705.png (723 KB, 1024x1024)
723 KB
723 KB PNG
>>
>>102008715
>Raelism
>>
File: 1714909430325017.png (852 KB, 1024x1024)
852 KB
852 KB PNG
now this is podracing
>>
>>102008619
no no do not use the same seed for a resample/upscale. the moment you resize the image the seed becomes useless. worst case you burn the image. forget about the seed. trust me.
>>
https://xcancel.com/MrDavids1/status/1825208573543981111
>I've been secretly testing Mystic which I think is even better than FLUX with AI image generation.
Will it be local though? if not gtfo lol
>>
File: 2024-08-21_00449_.png (1.06 MB, 1024x1024)
1.06 MB
1.06 MB PNG
>>102008672
>>
File: image.jpg (146 KB, 1536x1280)
146 KB
146 KB JPG
>>
>>102008559
Lul
>>
what's a good sd model for fixing anime hands specifically? I got a nice looking depth map from hand refiner but that's as far as I'm getting
>>
File: image.png (1.29 MB, 1024x1024)
1.29 MB
1.29 MB PNG
>>
File: 2024-08-21_00453_.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
>>
>>102008770
need to be more specific. sd15? sdxl? pony? anus1111? forge? comfy? mesh graphormer? its always a hustle to dial it in
>>
>>102008730
I trust my eyes.
>>
>>102008815
sd15, comfy, mesh graphormer. depth map preview looks perfect, but after feeding it to a controlnet and inpainting I just get different blobs. I suspect my model just can't do hands well regardless of controlnets
>>
File: ComfyUI_02344_.png (1.21 MB, 896x1152)
1.21 MB
1.21 MB PNG
https://civitai.com/images/25424379
MJ lora got updated, that's cool, I hate the generic anime style flux always outputs
>>
>>102008804
>>
>>102008854
sd15 can into hands but you have to wrestle it a lot. It's extremely outdated at this point.
>>
Oh god I fucked up I am genning a 786,484 X 786,484 image
>>
File: image.jpg (106 KB, 1536x1024)
106 KB
106 KB JPG
>>102008726
the symbol got renewed interest when Kanye posted it on twitter

>>102008878
the half-burger with fries on top looks appealing. surprising no one has ever sold something like that
>>
>>102008890
Seems like one of those times where the NVIDIA fallback feature might actually fuck you over and crash your PC
>>
>>102008890
Grab your fire extinguisher.
>>
File: 1722207311756122.png (786 KB, 1024x1024)
786 KB
786 KB PNG
is the beta scheduler better than normal?
>>
File: 1707913643173964.jpg (55 KB, 1292x738)
55 KB
55 KB JPG
Best use of ai for me is creating wallpaper
>>
>>102008879
yeah I just jumped straight into my old setup from over a year ago. not sure if I can handle anything better with 4gb vram
>>
File: 2024-08-21_00458_.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
>>102008904
>the symbol got renewed interest when Kanye posted it on twitter
the shit storm was so huge tho that I don't think it found its way in data set, when I prompted it it just made strange random symbols
>>
>>102008908
All I know is I saw it on time and killed comfy.
Also it was much larger because I fucked up elsewhere. It was an upscale doing 16x16 tiles too. So it was really 256 786,484 x 786,484 images
>>
>>102008941
And creating noods of friends girlfriend
>>
File: iii.png (197 KB, 1734x753)
197 KB
197 KB PNG
>>102008854
lets see..
- controlnet strength somewhere between 0.4 and 0.7
-step size around 12, scales with denoise. (lower denoise=less steps)
-and then the all important value, the denoise, needs careful adjustment.
>>
File: bComfyUI_106490_.jpg (582 KB, 2048x1024)
582 KB
582 KB JPG
>>
>>102008904
I didnt ask
>>
File: 00118-198683096.jpg (137 KB, 768x960)
137 KB
137 KB JPG
just deleted all of my models + gens lads
>>
>>102008943
You could really try and push Flux and use Q2 unet with Q3 text encoder. I expect it to look very shit at that point though
>>
>>102008995
if that's the average quality of your gens not much was lost
>>
File: 1.jpg (623 KB, 1480x1328)
623 KB
623 KB JPG
Morning
>>
File: 2024-08-21_00477_.png (1.06 MB, 1024x1024)
1.06 MB
1.06 MB PNG
>>102009009
hello stranger
>>
can someone point me to a workflow for flux which includes the option for a lora, I just trained one on civitai but im so used to a111
>>
File: 1695046721202326.jpg (162 KB, 1152x896)
162 KB
162 KB JPG
>>
File: 2024-08-21_00478_.png (1.46 MB, 1024x1024)
1.46 MB
1.46 MB PNG
>>102009014
here have mine: https://files.catbox.moe/28y5hz.png

it aint pretty, but you got options for three loras, dynamic thresholding and on the right side even sd ultimate upscale
>>
>>102009014
default workspace, add a lora loader (loraloadermodelonly)

checkpoint -> lora loader -> ksampler, and you are good to go.
>>
>>102007920
>how do generate boob,
>>
>>102008957
thanks gonna play around with this, a few tricks here I wasn't doing, and better starting values
>>
File: 2024-08-21_00482_.png (1.14 MB, 1024x1024)
1.14 MB
1.14 MB PNG
>>
so does flux "converge" at 20 steps? is 30/40 any better or diminishing returns?
>>
>>102009146
it was trained on 1000 steps so I'm doing 1000 steps.
>>
>>102008941
Looks nice anon, reminds of the default KDE Plasma wallpapers.
>>
>>102009146
not on all samplers, not on all topics, I stay with 22-25 most of the time, but 20 is good enough mostly
>>
File: 1716475554243849.png (736 KB, 1024x1024)
736 KB
736 KB PNG
there we go, perfect ad.
>>
>>102009180
kek
>>
File: 1694258154950351.png (1008 KB, 1024x1024)
1008 KB
1008 KB PNG
>>
File: 1716511272764875.jpg (214 KB, 768x1232)
214 KB
214 KB JPG
>>
File: FD_00011_.png (452 KB, 512x768)
452 KB
452 KB PNG
6.3gb vram used.
Q3_s t5 and Q2 model.
I think this is the absolute minimum for Flux.
>>
>>102009259
have you tried text with that combo yet?
>>
File: 1705226077496000.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>
File: FD_00013_.png (531 KB, 512x768)
531 KB
531 KB PNG
>>102009280
>>
>>102009310
nice. there's hope for 8GB vram users yet! thanks for posting
>>
File: FLUX00015.png (2.03 MB, 1536x1248)
2.03 MB
2.03 MB PNG
>>
>>102009332
Could start a gen on flux and upscale it with SD to clean it up so you get the good prompt understanding and some quality.
>>
File: FLUX_00011_.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
well that was 4 hours wasted
>>
>>102009373
Are you trying to untrain the butt chin?
>>
>>102009383
no that's supposed to be holly willoughby
>>
File: 1700376864175745.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
there we go, I wanted someone holding a book. >>102009286
>>
>>102009390
how do you fuck up so hard
>>
>>102009413
I believed the lie
>>
>>102009390
Oh right, yeah definitely wasted 4 hours, but, you now have a LoRA you can use to produce a consistent looking woman that you can scam men with.
>>
File: 1718011479644485.png (1.12 MB, 1024x1024)
1.12 MB
1.12 MB PNG
>>102009392
wait, better:
>>
why does flux image gen take 10x longer with a lora, is that normal?
>>
File: ComfyUI_00788_.png (1018 KB, 896x1152)
1018 KB
1018 KB PNG
>>102009373
you definitely fucked something up, cause its super easy to train flux on faces
>>
>>102009442
if you got buzz
>>
>>102009436
You ran out of vram and the overflow is calc'd in RAM instead (slow). Use smaller loras
>>
>>102009449
It costs less to buy buzz than the power it costs to train one yourself. That's how they get you.
>>
File: Capture.png (22 KB, 584x257)
22 KB
22 KB PNG
Really interesting, AutomaticCFG seems to be also a viable option for CFG > 1 (With Tonemap and DynamicThresholding)
https://imgsli.com/Mjg5ODE4/0/3
https://github.com/Extraltodeus/ComfyUI-AutomaticCFG
>>
>>102009473
Oops I forgot the prompt:
>Hatsune Miku and Sailor Moon having a cooking competition, with Miku making futuristic dishes and Sailor Moon making traditional Japanese food, pixel art style
>>
File: ComfyUI_00791_.png (1.54 MB, 896x1152)
1.54 MB
1.54 MB PNG
>>102009449
i do it local on my 4080, takes me about 1.5 hours for a lora that recreates the subjects likeness pretty well ~500-700 steps
>>
>>102009524
I don't know who that is supposed to be
>>
>>102001109
You're perfecting your technique holy shit.
>>
>>102009524
will you please share lora settings?
>>
File: 2024-08-21_00506_.png (1.07 MB, 1024x1024)
1.07 MB
1.07 MB PNG
>>
File: file.png (657 KB, 512x768)
657 KB
657 KB PNG
>>102009259
im on q5_1 with the same vram, using forge
almost 3mins for this generation, expanding it takes 15min
>>
>>102009524
How does it feel to pay $1500 just to generate mindless slop without any artistic value whatsoever?
>>
File: 1724113424271879.png (1.48 MB, 1024x1024)
1.48 MB
1.48 MB PNG
yoji shinkawa style (mgs) lora, this is cool:

miku on the mgs3 title screen
>>
i trained a lora on civitai, I gave it about 25 portrait photos of this woman, and for some reason all the lora generates is dog pictures, wtf is going on, in the preview images one of them was a dog with the head of the woman I originally gave it but besides that it's just churning out nonsense
>>
>>102009490
Hardly a technical prompt. You are an idiot if you think this has some merit in terms of finding out about techniques.
>>102009656
>lora
You don't need lora for that and besides, this is nothing like Shinkawa anyway, retard.
>>
File: 1710963304505469.png (1.4 MB, 1024x1024)
1.4 MB
1.4 MB PNG
>>102009668
no u
>>
>>102009652
a 4080 isn't $1500
>>
>>102009668
you must be some special kind of retard to not notice the difference between cfg 1 and cfg 6
>>
Smell that fresh loaf of bread straight from the oven...
>>102009692
>>102009692
>>102009692
>>
>>102009652
>artistic value
nigger our GPUs are drawing this shit. Get the fuck out of here.
>>
>>102009652
feelsgoodman
>>
>>102009652
Good.
>>
File: ComfyUI_00792_.png (1.19 MB, 896x1152)
1.19 MB
1.19 MB PNG
>>102009594
>I don't know who that is supposed to be
its just a random egrill, so that doesn't surprise me

>>102009603
i just use the recommended settings on kohya's github with the <12gb optimization launch arguments. I've done 10-15 epochs with 20-30 image datasets, captioned with joycaption and prepended with a trigger word.
Halfway through baking the loras start to work well, but obviously look better if you give it more time. The 10 epoch 20 images took me like 2.5 hours and worked pretty damn well. 15 epochs 30 images took me overnight ~8 hours

>>102009652
I built this pc at the end of 2023, upgrading from an i7-4790k + 970. I didn't fomo into a new GPU for AI, it was just time for a new computer.
>>
>>102007828
you can train 1024*1024 on 12gb, so
>>
>>102009876
neat. i downloaded a config for kohya and it was set to 512x512.
>>
>>102007980
>halo3 scirtless
>master chief teleports behind the skirt wearer with an unforgiving look in his eye
>his left hand closing in
uh oh...
>>
AdamW8bit or Adafactor?
>>
>>102009682
catbox of this one please
i love the artstyle, is it a specific lora?
>>
>>102010223
thanks I can't stop laughing
>>
>>102010630
nvm i didnt see the other post u replied to, gonna give this lora a try
>>
>>102009876
But wasn't it yesterday or something like that where several anons here were making fun of some anon that said you could train on 12gb vram? what happened?
>>
>>102011220
It was just d*bo stirring shit.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.