[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (1022 KB, 3264x3264)
1022 KB
1022 KB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102033918

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
File: delux_flebo_00096_.png (1.55 MB, 1216x832)
1.55 MB
1.55 MB PNG
>mfw
>>
File: file.png (407 KB, 1216x832)
407 KB
407 KB PNG
>>102036631
I think so, not sure which one I used though

What do you mean by space? As in HF?

Anyways I might just fold an upload it to CivitAI also
>>
File: ComfyUI_09422_.png (1.6 MB, 800x1400)
1.6 MB
1.6 MB PNG
>>
File: ComfyUI_00004_.png (1.21 MB, 1024x1024)
1.21 MB
1.21 MB PNG
>>
File: ComfyUI_13487_.png (1.03 MB, 1024x1024)
1.03 MB
1.03 MB PNG
>>
File: delux_hh_00065_.png (2.28 MB, 1024x1344)
2.28 MB
2.28 MB PNG
>>102036677
nice afro
>>
File: file.png (1.54 MB, 864x1280)
1.54 MB
1.54 MB PNG
>>
File: ComfyUI_00006_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>
File: long dick general.jpg (1.98 MB, 3264x2030)
1.98 MB
1.98 MB JPG
>>102036630
>>
File: ComfyUI_13492_.png (917 KB, 1024x1024)
917 KB
917 KB PNG
>>102036692
Ah yes, SDXLv0.9 default anime style.
>>
>>102036675
yeah, HF, civit, any space where you can build a catalog for these.
>>
>want to see if I can train my WoW character into the WoW LoRA
>Dimension missmatch error
>Huh?
>Lower it 8
>Still get the error
>read the logs
>2

You trained that LoRA at 2?!
I mean.. it works, but 2?
>>
>>102036751
Yeah does sound like a good idea
Anyways I uploaded the lora onto civit:
https://civitai.com/models/676564?modelVersionId=757365

Enjoy guys
>>
File: FD_00126_.png (1.05 MB, 1024x1024)
1.05 MB
1.05 MB PNG
>>102036618
Whatever the default on Civit is.
The only settings I changed were the repeats, epochs, and training resolution.
>>
>>102036734
thats an awful collage
>>
File: file.png (1.29 MB, 864x1280)
1.29 MB
1.29 MB PNG
>>
>>102036771
It was 2.
For reference, I usually stick around 16 - 32 for flux.
>>
File: ComfyUI_02117_.png (1.4 MB, 1376x800)
1.4 MB
1.4 MB PNG
>>102036734
>>102036630

How did my work of art not make the collage?
>>
File: file.png (1.44 MB, 864x1280)
1.44 MB
1.44 MB PNG
>>
File: FLUX_00663_.png (1.17 MB, 832x1216)
1.17 MB
1.17 MB PNG
Why does the Pony Guy care that Shneil being distilled?

he has 10 milions images. He will retrain it.

he only needs the base architecture to be good.
>>
>>102036805
Pony guy should just lawyer up, retrain flux and argue it's a different model based on his extensive input.
>>
>>102036805
he's actually a retard and is about to pull a Summertime Saga and/or Yanderedev

he's going to take another 4 months of no results and have another update when Pixart Next comes out
>>
It's so over for Stable Diffusion, NovelAI, Dall-e, and OpenAI
Flux has won
https://files.catbox.moe/97s9g3.png
>>
File: A3G.png (2.93 MB, 3072x1024)
2.93 MB
2.93 MB PNG
>>102036737
Huh, stacking 3 LoRAs didn't kill the s/it as much as I thought it would. 1.8s/it down to 2.9
>>
>>102036821
Too bad she has the anatomy of a barbie doll.
>>
>>102036821
Until another diffusion model architecture is made up, Flux is probably the king of local for the next year or more. Maybe we'll see a 6B from someone. What's nice is we now have something that we can grow into as consumer AI hardware improves.
>>
>>102036831
The only problem with stacking LoRAs is it does seem to have a compounding effect of shitting up everything that makes Flux good.
>>
File: ComfyUI_00874_.png (1.82 MB, 1536x1152)
1.82 MB
1.82 MB PNG
>>
>>
>>102036835
that's only because I don't have my GPU set up and I didn't use a local model. This is just Flux Schnell on huggingspaces.co
Very easy to get past the filter but not everything will show up
I'd probably get even more if I fixed my gaming PC and ran Flux locally
>>
File: 1718957652388317.png (1.78 MB, 896x1152)
1.78 MB
1.78 MB PNG
>>
>>102036847
Well yeah, prompt comprehension is already in the dumps and the halfassed screenshot one adds random artifacts since it was trained on scraped steam community screenshots and booru tags. I assume most civitai LoRAs would have a similar effect.
>>
File: syuen flux dev.jpg (216 KB, 1587x1024)
216 KB
216 KB JPG
Another successful character replication via Flux Dev LoRa. Min S/R 5 is definitely better for details than Min SR/10. Min S/R 15 gave me noisier LoRa output that was unusable. Not sure if training in higher resolution would help get the small minute details to gen more accurately. Sadly, I am OOMing at 1024x1024 and have to settle at 768x768. Maybe skimming the OOM border at 800x800 or something would yield better details.

Used this config as a base:

https://github.com/bmaltais/kohya_ss/issues/2701#issuecomment-2294611159

Changes
>768x768
>Learning Rate 0.0002
>Min S/R 5
>>
>>102036848
Whats up with this demon slayer cosplayer, and whyyou keepnspamming her? Is she soneone famous?
>>
>>102036869
Fucking cool. Prompt?
>>
>>102036885
minute details are likely being butchered by fp8 training
>>
>>102036890
"will AI replace real women?"
>>
>>102036805
Lol, the AI turned water on her face into cum.
>>
>>102036831
>>102036847
we need embeddings for characters
>>
File: FD_00124_.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>102036908
Already has
>>
>>102036908
>>102036918
Not yet, we still need real women to train the AI on.
>>
>>102036885
I invite anyone to scroll down that github link for some jump scares.
>>
>>102036926
We have enough dreamshaper faced women to go infinite.
>>
>Bought a 4070 last year
>11gb VRAM is already obsolete
it's over for me bros
>>
File: 00029-3437220076.png (1.44 MB, 1440x1440)
1.44 MB
1.44 MB PNG
>>102035997
Kewl lora
>>
File: ComfyUI_00012_.png (2.71 MB, 1536x1536)
2.71 MB
2.71 MB PNG
>>
I'm using the WoW LoRA to see if it's possible to train my character back into it after the fact
>>
>>102036966
the lights give the ship a nice sense of scale unlike most scifi ship gens
>>
File: 7..png (1.09 MB, 787x830)
1.09 MB
1.09 MB PNG
My character
>>
File: FD_00139_.png (1.15 MB, 1024x1024)
1.15 MB
1.15 MB PNG
>>102036977
>>102036987
Is this V1 or V2 of the LoRA?
>>
>>102036885
cool. now generate Syuen pregnant with the commander's baby and her claiming Nikkes can never experience this.
>>
File: cut u.png (811 KB, 960x1088)
811 KB
811 KB PNG
>>
>>102036630
My first time being in the bread pic.

Makes me a happy prompter
>>
>>102037022
Which one are you?
>>
>>102037001
The one you put on civit so 2?
>>
File: FD_00140_.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
>>102037051
Weird it shows no downloads. I thought nobody liked it but happy to know people are using it.
>>
File: ComfyUI_00014_.jpg (769 KB, 1536x1536)
769 KB
769 KB JPG
>>102036981
thanks, I've been trying to wrangle it to make stuff that looks real
>>
File: Untitled.png (11 KB, 847x88)
11 KB
11 KB PNG
>>102037060
I think the widget just takes time to update
>>
File: ComfyUI_00879_.png (1.2 MB, 896x1152)
1.2 MB
1.2 MB PNG
>>
File: ComfyUI_05122_.png (957 KB, 1024x1024)
957 KB
957 KB PNG
>>10203701
>>
>>102036847
>The only problem with stacking LoRAs is it does seem to have a compounding effect of shitting up everything that makes Flux good.
this, that's why the real solution is a finetune that will add more concepts, Loras can't fix everything
>>
At CFG > 1, AdaptiveGuidance gives different pictures than CFGGuifance, is this normal? I thought AdaptiveGuidance was basically CFGGuidance but will add cfg = 1 at the end of the steps and nothing else
https://imgsli.com/MjkwMjA0
>>
File: magic boots.png (611 KB, 960x1088)
611 KB
611 KB PNG
>>
>>102037090
Nah those downloads are from V1.
Doesn't matter, just happy I can add to the community.
Still haven't decided on my next project.
>>
File: 1702307926455585.png (402 KB, 845x501)
402 KB
402 KB PNG
>>
Imagine using clip_l
So glad the gguf one is working again
https://imgsli.com/MjkwMzI4
>>
>>102037229
Ikr, the man who finetuned clip_l did a great job, now I'm wondering if the same can be done for t5
>>
>>102037229
You say it's a WoW aesthetic but i'll be damned if that isn't fable 1.
>>
File: based-so-fucking-zased.gif (2.25 MB, 636x640)
2.25 MB
2.25 MB GIF
>>102037121
>Syuen Flux training anon delivered
I kneel
>>
>>102036630
Why are there seperate generals for SDG and LDG, when SDG also supports local installs of SD?
>>
>>102037265
/sdg/ turned into an avatar social club. They don't discuss many aspects of stable diffusion any more, they just generate images and talk to each other about their bottom surgeries. But you can't really get rid of them either, so /ldg/ had to be made.
The avatars are terrified of losing their position on the pecking order of their containment thread so we're pretty much safe from them here.
>>
>>102037229

So Vit L Best Smooth is better than Clip L? Is this the one?

https://huggingface.co/zer0int/CLIP-GmP-ViT-L-14/tree/main
>>
>>102037237
Quantized ones work great so far
>>102037245
Depends on how you drive it. Sonic is a pretty strong prompt in the model, but you can get some non-WoW early 2000s 3D out of it. It skews heavily to WoW though.
>>
File: ComfyUI_00018_.png (2.33 MB, 1536x1536)
2.33 MB
2.33 MB PNG
>>
>>102037285
>The avatars are terrified of losing their position on the pecking order of their containment thread so we're pretty much safe from them here.
I hope that's true, I don't want them to ruin this comfy place either

>>102037303
it's this one precisely
https://huggingface.co/zer0int/CLIP-GmP-ViT-L-14/blob/main/ViT-L-14-BEST-smooth-GmP-ft.safetensors
>>
>>102037310
>I don't want them to ruin this comfy place either
Rest easy, they are trapped in a prison of their own creation.
>>
What a faggot
>>
>>102037347
I don't know if it's "legal" to make money out of your Lora training on Flux Dev, I thought it was non-commercial shit
>>
>>102037308
fuck me bro this is grand, and as other anon said, scale comes through nicely. clean
>>
>>102037347
64,420 because n64 haha i get it
>>
>>102037360
He isn't, he can't cashout. Civit is though, and they probably have a license with BFL
>>
>>102037161
I think AdaptiveGuidance is ruining the image a bit and sloppify the image even more, we don't need AG to be an antiburner by itself, we just asked it to add cfg = 1 at the end, nothing more, nothing less
>Hatsune Miku skateboarding, her text speech says: "I'm loving it", 50's comic book style
https://imgsli.com/MjkwMzMx
>>
File: Untitled.png (193 KB, 859x837)
193 KB
193 KB PNG
>>
>>102037375
>He isn't, he can't cashout.
really? so what's the point of having buzz on civitai then?
>>
>>102037385
there's no way it's not his alt-account, I know there can be some serious fanboys but writing a fucking bible to protect your daddy is a bit too far to be genuine
>>
>>102037387
You can use it to gen and train LoRAs, but the ratios are fucked. It's 2k to train a Flux LoRA, or about $2.
Meanwhile a single gen is 200 a pop.
>>
>>102037401
>It's 2k to train a Flux LoRA, or about $2.
>Meanwhile a single gen is 200 a pop.
wtf? why making one single image is only 10x less expensive than training a LoRA?? Should be easily 10000x less
>>
>>102037385
>>102037394
It's transparently just some troll from here, you can even see the 4chanisms seeping out of his writing style. he's not even trying
>>
>>102037394
The problem is, they all have the same general thread
>It's only 5 bucks a month
etc, etc,
It weird that all the shills keep pinning the value on the small price and the hard work cerfuckin does to obtain that knowledge.
>>
>>102037407
I agree but don't tell them. It's how I've been genning my LoRAs for free.
Most people just use it to gen because they are vramlets.
>>
>>102037401
I thought they added some creator payment system or whatever
>>
>>102037412
why do you all talk about reddit so much
>>
>>102037407
Yeah but the LoRAs are trained at rank 2, they take as little vram as you possible could take when training a LoRA.
>>
>>102037432
You can't really avoid it when keeping up with developments.
>>
>>102037427
No partnership options that I can see.
All I know is any time someone uploads an image to my LoRAs I get 50 buzz, and I use that to train more LoRAs.
>>
>>102037432
same reason we talk about GitHub or where ever else there are relevant happenings to flux/AI tech, anon. pretty obvious
>>
>>102037412
its pretty funny to see obvious 4chan posters code-switching to uwu reddit style
>>
>>102037451
Is it me?
>>
File: ComfyUI_00023_.png (2.22 MB, 1536x1536)
2.22 MB
2.22 MB PNG
>>102037361
glad you like it anon
>>
>>102037434
>Yeah but the LoRAs are trained at rank 2
rank 2? how can the result be any good?
>>
>>102037432
What do you mean "you all"?
>>
>>102037464
Well, the WoW LoRA was rank 2 and it was okay, but I agree, Civit is borderline scamming with those weak ass LoRA parameters. Doesn't seem to be a fatal issue for the jeets though, simply because Flux does respond so well to LoRA training.
>>
>>102037460
a) erect b) wanna fire up flux right now. just keep em coming lol. this is lora-ed?
>>
>>102037434
>>102037464
I don't believe that. Every Flux LoRA I have trained on there have been perfect. They have an incentive to produce good LoRAs. People downloading LoRAs and Checkpoints is the entirety of their business. If they only produced dogshit they'd be bankrupt.
>>
>>102037446
Lmao nevermind I just looked at it and saw they're only accepting the "top 50 creators" for their payment program, where 1,000 buzz = $1 and they take 30% (with a min of 100,000 buzz payout). Wtf, fuck that shit, literally just letting shitvit resell your work if you aren't training on there
>>
https://civitai.com/articles/6309
>Some additional work remains to confirm that all data is compliant with our safety framework, but at this point, largely everything has been completed. We'll release safety classifiers and a character codex post-V7 as part of our safety commitment.
>Overall, the dataset has been balanced to be slightly less NSFW. I've also added experimental features like scene color palette tags for better color control, and the artist blocklist has been updated to catch more instances where character names are detected as artists and removed.
Isn't is always funny to see a pony fucker talking about safety and morals?
>>
>>102037486
>I don't believe that. Every Flux LoRA I have trained on there have been perfect.
So what is it? Are they lying about rank 2 and it's a higher number or Flux is so good it even works at rank 2?
>>
File: 0.jpg (307 KB, 1792x1296)
307 KB
307 KB JPG
https://civitai.com/models/675581
Finally, anti blur lora
>>
>>102037486
Why would I lie about the LoRA being trained at rank 2? I'm literally interacting with one right now and that's how I found out.
>>
>>102037496
Where do they say it's 2? I thought it was 4?
>>
>>102037490
>no flux
>safety commitment
>less nsfw
Is xi/ximself trying to fail?
>>
>>102037503
???
left is flux's default look, right is something you have to really try to accomplish
>>
>>102037510
I think xe/xir does, he's slowly talking the cucked SAI route, and we all know how it ended up, other people will more balls replaced them, I think their ego got the best of them, as if the world wouldn't turn right without them.
https://www.youtube.com/watch?v=pEEhpmhBwq0
>>
>>102037490
He wants to be respected by the other local trainer tards and he's still seething from Sai dunking on him for doing a literal pony porn model. He added realism to his new dataset purely because Lykon tard fought him and said they didn't need Astratroons help when he can't even make a model that does everything (ie. realism) vs just stylized outputs. It's all just one, big, homosexual dick sucking contest of mentally ill narc egos, furrys and a brony. That's local AI
>>
>>102037515
what? left isn't flux default look, it doesn't have the high intensity blur flux does
>>
>>102037496
>>102037504
>>102037475
Rank 2 is fine for Anime which is 99% of their content so that's probably why.
WoW is simple, so that's fine.
But it could possibly be that Flux already understands the concepts and just needs a little prodding to produce good outputs.
>>
File: ComfyUI_01427_.png (1.14 MB, 1216x832)
1.14 MB
1.14 MB PNG
>>
>>102037529
It do be really weak though. And training on Civit is NOT cheap.
>>
File: ComfyUI_00025_.png (2.25 MB, 1536x1536)
2.25 MB
2.25 MB PNG
>>102037480
yep, just the official realism lora
if you want to go for a similar look, here is my latest prompt, using deis sampler:
>A photo of a colossal spaceship in orbit around Saturn. The ship has an amorphous, irregular, brutalist design that disregards art and beauty entirely and concerns itself only with functionality. There are various cubic and rectangular segments of the vessel. The overall shape of the ship is simply a long rectangle with various blocky regions protruding from it. Coutless tiny lights adorn its hull, and there are segments with large solar panels and vast sets of antennae. The ship has a huge flat segment on the side where it is labelled "HGH-42" in glowing letters. The rest of the ship is dark black with small gratings and other structures seen throughout the long, roughly rectangular vessel. The image is very dark, likely because it is taken in space and the vessel has very little lighting. The ship may be in Saturn's shadow, causing it not to reflect any sunlight. Saturn in the background appears to be at nighttime, as it is very dark and barely any details can be seen. The image conveys a sense of mystery as the unknown, highly futuristic vessel is seen near Saturn for unknown reasons.
>>
File: FD_00143_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
>>102037529
All I know is I trained the wow lora on civit and now I am making wow shreks so I don't really care what rank it is.
>>102037543
When someone gets a good training script for a 4080 I'll stop using it and do it locally but it cost me literally $0
>>
>>102037529
>But it could possibly be that Flux already understands the concepts and just needs a little prodding to produce good outputs.
that's probably that, and that's why finetuning Flux won't be as expensive as we thought it to be, it has already seen all the pictures, all it needs is the proper word attached ot it, and you don't need a lot of picture to get that extra boost, Flux is literally a Lora winning machine, if I was taking my aluminium hat I would even say that they did that on purpose to make civitai even more relevant
>>
File: 1724392803.png (1.33 MB, 1024x1024)
1.33 MB
1.33 MB PNG
>>
>2 years ago
>NAI leaked! Let's mix it into all our fine tunes and do whatever the fuck we want with it it.
>Now
>*Adjusts reading glasses* Hmm, I don't know about training Mario futa characters on this Model, the license dictates we can't generate over 10 images on this model without owing licensing fees to the company

When did everyone get so faggy?
>>
File: file.png (775 KB, 1216x832)
775 KB
775 KB PNG
>>102037549
>>
>>102037553
it's good enough, dataset and tagging matters more than the rank for most things.
>>
>>102037549
very much appreciated.
>>
>>102037591
For sure, but don't expect to see any high definition realism LoRAs coming out of civit
>>
>>102037579
damn, you can really mod the shit out of that game now
>>
File: ComfyUI_00027_.png (1.81 MB, 1536x1536)
1.81 MB
1.81 MB PNG
>>102037579
sick, always good to see a new style
>>
File: file.png (847 KB, 1216x832)
847 KB
847 KB PNG
>>102037602
That seems to be the next step yes. May have to find a way to make them perfect pixels though

>>102037604
I personally like this one more and yours look really sick ngl
>>
File: file.png (1.2 MB, 1216x832)
1.2 MB
1.2 MB PNG
Hellstar Remina vibe
>>
>>102037612
even if it isn't perfect, it's in the more than good enough range. there are so many games that use static 2d assets, there is whole new field here thanks to flux.
>>
>>102037612
this one makes me think of that super old film where they land a rocket in the moon's eye
>>
File: FD_00145_.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
>>102037601
They don't need to. Their niche is character LoRAs and porn and that's easy enough to do on site. Not that we will need realism LoRAs once we get fine tunes.
>>
>>102037161
>>102037379
https://imgsli.com/MjkwMzM1
Going for AdaptiveGuider definitely removes detail
>>
I have come to the conclusion that there's a memory leak in either comfy or flux.
>>
File: file.png (3.63 MB, 2048x1222)
3.63 MB
3.63 MB PNG
https://civitai.com/models/652699/amateur-photography-flux-dev
that's really impressive, it even removes the onobxious blur
>>
File: ComfyUI_01465_.png (517 KB, 832x1216)
517 KB
517 KB PNG
>>
>>102037773
Is that even possible when flux is just weights and comfyUI is python? Unless there's a memory leak in some deep pytorch/cuda C++ stuff.
>>
>>102037805
I don't know enough about anything but I interface with comfy and I am using flux so therefore it's one of them.
>>
>>102037800
Do dev LORAs work with schnell?
I assume not, but a man can dream
>>
>>102037852
why do you go for schnell anon? dev works fine at 20 steps
>>
Speccy lora training in-progress
luv me speccy
>>
>>102037857
I only have an RTX 4080 with 16gb of RAM, I run schnell with low vram settings
>>
>>102037872
>16GB ram is considered low vram size now
bros...
>>
>>102037872
buy more ram anon... it's not that expensive...
>>
>>102037872
use Q6_K unet and Q6_K T5
>>
File: ComfyUI_02133_.png (1.33 MB, 1024x1024)
1.33 MB
1.33 MB PNG
Something really weird happened with my training a LoRA on top of a LoRA experiment. If I load this LoRA at a weight any higher than .13, it just produces white noise. Like the LoRA is 10X stronger than it should be.
>>
>>102037895
>collapse
you are better off training a character lora of your oc using real screenshots, then using both loras together after
>>
>>102037900
>collapse
It that what it's called and yeah, I figured, this whole thing was just a silly experiment.
>>
>>102037867
Rad. Good speed, anon.
>>
>>102037911
not likely in this case, but it doesn't help. use real ss when possible.
>>
File: ComfyUI_02135_.png (1.05 MB, 1024x1024)
1.05 MB
1.05 MB PNG
I also regret to inform everyone who wanted to know that no, it seems like training in new characters after the LoRA is complete is a huge waste of time, now every character is my character.
>>
so.. flux lora sizes. why the large variances? sub 20 - 600+ mb, that is quite the difference.
>>102037803
found ze lora, & dope.
>>
>>102037758
>Going for AdaptiveGuider definitely removes detail
Is there an alternative to Adaptive Guidance? I like the fact it's putting CFG = 1 at the end steps, but it should behave like CFGGuider before, and it's not doing that at all
>>
>>102037927
it depends on how you do it, the og lora is 105 images, if your oc is eating more image slots, he will be burned in there. also you train from the start again, but with the extra images added, if it's a v3 of the lora, you probably need to insert other characters with unique characteristics to compensate.
it's more of a hit your head against the wall until you figure it out approach. the good news is, once you figure this out, you can apply it to other similar types of loras and get consistent results
>>
File: ComfyUI_02141_.png (1.29 MB, 1024x1024)
1.29 MB
1.29 MB PNG
Oh well, at least I have a fully sick LoRA for my WoW character.
>>
>>102037929
Do they have one that does penises yet?
>>
>>102037929
Higher lora sizes supposedly have higher quality.
>>
>>102037867
>>
File: ComfyUI_00041_.png (1.85 MB, 1536x1536)
1.85 MB
1.85 MB PNG
goodnight anons
>>
File: medusen.png (981 KB, 960x1088)
981 KB
981 KB PNG
>>
>>102037998
the wow lora is pretty cool, I was showing some of the gens to my friend to show how good imagegen has gotten at reproducing unique styles
>>
>>102037931
That's really interesting, AdaptiveGuider is responsible on the blur background on anime drawings as if it was a photo, CFGGuider doesn't do that
https://imgsli.com/MjkwMzM4
>>
>>102036777
more sovl desu
>>
Pixart bigma status?
>>
>>102038094
You also have a better prompt adherance in CFGGuider than on AdaptiveGuider
>Y2K style cover art with a low poly 3D render of: Hatsune Miku as a sleek, robotic samurai in chrome armor is slicing through waves of pixelated sushi rolls flying through the air. Each slice sends colorful sparks flying. Behind her, a giant koi fish swims through the sky as if it were water, creating ripples of light.
Y2K style text at the bottom: "Sushi Master."
https://imgsli.com/MjkwMzQw
>>
>>102037360
>I don't know if it's "legal" to make money out of your Lora training on Flux Dev, I thought it was non-commercial shit

>He thinks they will enforce that shit

If they do their reputation will tank.
>>
>>102038289
If it's 100% certain they won't enforce this, the pony dev would've jumped the flux-dev bandwagon already
>>
>>102038298
Pony is a large scale commercial dev who could afford to purchase a license, not a little guy.
>>
>>102038301
but it's not just one little guy, it's thousands of them on civitai, lots of money they loose if they don't enforce this
>>
I tried baking a lora with 30k images and it's going at 40s/it compared to my usual 2s/it. Does it scale poorly with data size or something? Not sure what's causing it really.
>>
>>102038289
If they didn't intend on enforcing it they wouldn't have added that restriction to their custom license.
>>102038301
There's no information regarding cost. BFL are very likely pulling a number out of their ass depending on who contacts them, whether they like them and the size of their wallet.
>>102038310
That's just Civit because you can't cash out buzz.
>>
>>102038312
Probably. Check I/O usage.
>>
>>102038312
Check the nvidia memory setting from control panel
>>
>>102038314
>If they didn't intend on enforcing it they wouldn't have added that restriction to their custom license.
this, they need to keep this non commercial licence or else the first random could finetune dev, make it better than pro and make an API site from that, it would've destroyed their business plan
>>
Looks like derrian distro's lazy lora training gui updated for flux. superior alternative to bmaltais shit show usually, still installing new requirements.txt before I can say if it works or not
>>
>>102038322
>>102038324
i restarted the trainer and updated drivers and it seems stable now. i started it when i was already using vram for something else so maybe it allocated poorly or something, idk
>>
Other boards stopped linking /sdg/ and instead links to /ldg/. Calling /sdg/ schizo general.

How can we make sure /ldg/ does not suffer the same fate?
>>
>>102038481
Don't give attention to avatar fags. That includes being angry towards them, just ignore and filter
>>
>>102038490
this
>>
File: tall6.jpg (280 KB, 1624x1120)
280 KB
280 KB JPG
>>
>>102038386
yep, works. it's set up kind of weird with half the model input under the "flux Arg" drop and you have to manually put in the extra ARGs to prevent oom, but much comfier than bmaltais for those who prefer a GUI
>>
>>102038481
They won't leave /sdg/ or risk losing their place in their social avatar fags hierarchy.
Sometimes they leak into here but leave because nobody even acknowledges them. We need only keep it that way.
>>
>>102038501
looks like she didn't follow her own advice
>>
>>102038571
does prodigy work with flux?
>>
>>102037407
I think they have a limited gpu space. And they thought allowing people to c4eate Loras is more important than pics
>>
File: Joy_cap Comfyui.png (173 KB, 1232x781)
173 KB
173 KB PNG
Might be of use to someone, I don't have the access to the .json right now.
>Purpose
Comfyui sheet for using Joy_caption on a directory of images outputting txt files and a copy of the source images in a new folder and tweaking token count etc and formatting the LLM prompt to change the out put of captioning etc.

https://github.com/StartHua/Comfyui_CXH_joy_caption
https://old.reddit.com/r/StableDiffusion/comments/1egwgfk/joycaption_free_open_uncensored_vlm_early/
>>
>>102038616
yes, but I've not personally used it. testing came-rex on this run desu
>>
>>102038645
oh wow. >erect
>>
So is forge back from the dead?
>>
>>102038670
yeah
https://github.com/lllyasviel/stable-diffusion-webui-forge#quick-list
>>
File: 00068-2686490579.png (1.44 MB, 1024x1024)
1.44 MB
1.44 MB PNG
>>102037867
Kino
>>
File: 2024-08-23_00063_.png (1.34 MB, 1280x720)
1.34 MB
1.34 MB PNG
>>
can forge pin T5 to the CPU yet?
>>
>>102038616
Yeah it works, I used it for
>>102036799
>>
>>102036912
Its dolphin cum
>>
File: 00063-2686490579.png (878 KB, 1024x1024)
878 KB
878 KB PNG
>>102038679
>>
>>102038670
Yeah every time I git pull out of habit which is every few hours Theres updates
>>
File: FLUX013.jpg (196 KB, 1448x1280)
196 KB
196 KB JPG
>>102037867
PLEASE POST WHEN ITS DONE THIS IS GREAT
>>
File: ComfyUI_05354_.png (975 KB, 1024x1024)
975 KB
975 KB PNG
>>
>>102038679
magnifico
>>
>>102037998
Your blood elf is better than what the base lora can do too. Keeps giving them double ears for me.
>>102038090
A cool thing is if you daisy chain in a realism lora with it and play with the weights you can get some mid 2000s vidya aesthetics going

Glad you guys like it though. Don't forget to subscribe to my patreon. Only $5 and it also includes a lora of my face.
>>
File: 2024-08-23_00095_.png (857 KB, 1024x1024)
857 KB
857 KB PNG
>>
Has anyone here tried the single gpu fine tuning yet? Id like to get a look at a json file if you have one.
>>
I know you all are into flux, but someone tell me what is the best controlnet for sdxl/pony.
>>
>>102038894
https://huggingface.co/xinsir/controlnet-union-sdxl-1.0

Definitely this one by a good margin.
>>
>>102038894
xinsir, by far. its stronk tho so dial in carefully.
>>
>>102038894
ponycanny, mistoline, you don't have a lot of options like 1.5 because pony is overtuned
>>
Speaking of controlnet. Why is instant X so shit. Like how can they claim their control nets work when they exclusively pump out bullshit?
>>
>>102038907
>>102038922
>>102038927
Thanks bros
>>
>>102038891
What is the best rank for person face? Rank 2 or rank 4?

What about concepts? Rank4 or rank 8?

Anyone run test?
>>
>>102038894
Mistoline has an undervalued quality of letting lora effects through.
>>
File: ComfyUI_temp_eeikv_00053_.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>
File: 949371462.png (1.02 MB, 1344x768)
1.02 MB
1.02 MB PNG
>>
File: ComfyUI_02147_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>
File: 2024-08-23_00105_.png (1.35 MB, 1024x1024)
1.35 MB
1.35 MB PNG
>>102038801
clean, I like it
>>
flux doesn't know akira? oh boy
>>102039008
ty for the mistoline tip, and actual usage tips on the git, look at that.
>>
File: ComfyUI_02148_.png (1.98 MB, 1488x1024)
1.98 MB
1.98 MB PNG
>>
>>102039112
lmaoo, please tell me you made a lora of that grifter?
>>
>>102039067
I keked
>>
File: ComfyUI_02149_.png (1.64 MB, 1024x1024)
1.64 MB
1.64 MB PNG
>>102039120
I did. I'm not game to upload it onto civit, but if you want it and know a place to quietly share it, I'd be happy to share.
>>
>>102039112
pls stop spamming him, don't give him more credit than he is worth, I am getting sick seeing his face
>>
>>102039148
>t if you want it and know a place to quietly share it, I'd be happy to share.
you can share it here if you want, the rest is gonna make history from it kek
>>
>>102039150
same. & reported, even if it costs me a ban.
>>
>>102038661
>>102038709
Ty bros
>>
>>102039148
just drop a link, what are you afraid of
celebs have a 101 loras on civitai and they haven't sued it yet, what's this guy gonna do?
>>
>>102039172
>>102039156
Actually, I've decided against sharing. Feels like it'd be opening a can of worms I can't put back in. his face is out there for training if you want to do it yourself.
>>
>>102039150
This is quite funny though
>>
>>102036630
>>Maintain thread quality
>https://rentry.org/debo
Who's debo?
>>
>>102039163
It would be an actual travesty if I somehow got punished and he got to keep spamming his patreon all over the place.
>>
>>102039190
You're a faggot
>>
>>102039190
>Feels like it'd be opening a can of worms
You wouldn't.
>>
>>102039190
>lack of balls on an anonymous forum
holy faggot
>>
File: 142917_00001_.png (1.17 MB, 1024x1024)
1.17 MB
1.17 MB PNG
>>
>>102039196
everytime I try to look at those links because debo bad I end up looking only at the gens. shrug.
>>102039199
it's just.. for some reason his fuckface triggers me. I dunno why lol
>>
>>102039202
>>102039204
>>102039211
I'm sorry, I just don't want to bear that cross if the guy got upset over someone misusing a thing I made. I don't like him, but the worst I'll do is whine and post silly pictures of him.
>>
>>102039230
still a pussy
>>
File: ComfyUI_00733_.png (2.43 MB, 1024x1024)
2.43 MB
2.43 MB PNG
>>
>>102039221
>for some reason his fuckface triggers me.
Yeah that's why I made a LoRA of it. it's art in its purest form.
>>
>>102039230
pls no. I beg you. I give you a nice upscaled porsche. for only 5 euros, to my patreon! wait
>>102039258
I'll try to not get triggered. TRIGGER WARNING
>>
I gotta say, the latest gguf loading speed and LoRA loading speed has gotten really good.
>>
>>102037857
5x the steps, but not 5x better
>>
>>
>>102039392
hard to do 5x better when we're reaching a celling, don't ask a model that is 90% accurate to go 270% accurate, not gonna happen ;'(
>>
I have a 10GB 3080, if I want to train Flux loras am I doomed to create trash because I'll be training on fp8 or quants and should just use civitai or cloud compute, or is this not a big deal and my only disadvantage will be that training takes longer than a 12/16/24GB+ setup? I never made any pony/xl/1.5 loras
>>
>>102039424
how long does training take anyway?
>>
File: 00094-746884511.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
>>102038787
It's not to great, but here
https://civitai.com/models/677322
>>
>>102039422
if we're reaching a ceiling, why would you spend 5x more time on it?
>>
>>102039067
is this Sammy D ?
>>
>>102039446
because Dev is reaching a ceiling, not Schnell
>>
>>102039445
Thank you sir
>>
>>102038703
Yes
https://gist.github.com/Sunderbraze/d0b0f942256965b40f54247344fea37f
>>
>>102039474
that script is for ComfyUi, not Forge, and that's why I'm stuck at ComfyUi desu, because of that great script
>>
>>102039474
Sorry, I saw that you wrote Forge. But you can just build off that easily.
>>
>>102039474
that's for Comfy you fucking retard
>>
>>102039482
no, because you have to work with Forge's memory management
it's not a (hacky) one line change like for Comfy
>>
>>102039483
Get well soon
>>
>>102039505
Learn to read soon
>>
>>102039503
>(hacky)
hacky maybe, but it's working fine and it's a really important feature to have
>>
>>102039458
yes. if dev is reaching a ceiling, why would you spend 5x more time on it?
>>
>>102039524
okay? who asked
>>
>>102039537
and who asked about your opinion about the hacky script?
>>
>>102039515
No need to be mad
>No need to be fucking dyslexic
>>Take care bro
>>>Kys retard
>>>>Alright, have good one
and so on, and so on
>>
niggas, post gens, stop sucking each other off. thank you. 5$
>>
>>102039552
I'm not mad I just want you to fucking die already you fucking retard
>>
>>102039527
I think you have trouble understanding english, Schnell isn't reachng a ceiling, so it's a good idea to go for a superior model that is actualling reaching a ceiling
>>
>>102039537
>>102039547
"who asked" is a fundamentally retarded question on an anonymous forum where anyone can post their opinion freely
>>
>>102039547
the guy that linked it, are you having trouble following a simple thread?
>>
>>102039585
ne never asked your opinion about that script and on whether it's hacky or not, you decided to add that trash opinion by yourself, and yeah, no one asked for that
>>
>>102039563
>english
English

>reachng
reaching

>actualling
ac... no, nevermind, this is getting silly. are you drunk or stoned? lmao

anyway: I can try out prompts and seeds five times faster, and use the best stuff for i2i and other things. much more worth it to me
>>
>>102039601
you have a fundamental issue with understanding things, don't you?
I mentioned it is hacky for a reason that is obvious to anyone with more than two brain cells
please unalive yourself
>>
>>102039609
>>
>>102039601
Can I share my opinion?
>>
>>
File: new album by t-pose.png (1.6 MB, 1024x1024)
1.6 MB
1.6 MB PNG
>>
>>102039614
>I mentioned it is hacky for a reason
there's no reason, it's working fine on ComfyUi and has no bugs in it, you're just hating for this script for absolutely no reason, so what I need you to do is to take the nearest rope that you got, and hang yourself, period, why are you alive?
https://www.youtube.com/watch?v=tajKWkR0TtI
>>
>>102037303
>>102037310
How do you run that clip vit on gguf clip loader? Still getting an error
>>
>>102039631
>you're just hating for this script
you're retarded, sorry, there is no way around it, you lack the ability to think things through
>>
>>102037303
Has no one made a vit l gguf yet?
>>
>>102039585
Nope, I'm not the anon here: >>102039537
(I linked it to you. I'm, you know, the person who tried to help, apologized to you, and provided an extra tip. Try to think about that there are multiple posters itt, kek)
>>
>>102039650
Sure thing retard, keep calling genuine working scripts "hacky". You've really shown your double-digit IQ to everyone here.
>>
>>102039664
multiple posters changes absolutely nothing about what I wrote, holy fucking shit you people are dimwits
>>102039667
>confusing "working" with "not hacky"
just fucking die already
>>
>>102039620
and the moon here is the fact that you brought up English comprehension in the same comment
>>
>>102039445
We're in good company speccybros
>>
>>102039445
rank too high
>>
>>102039697
>and
*And

>comment
*comment.
>>
>>102039707
I'll probably redo it at some point with a larger dataset. What rank should I go for?
>>
>>102039714
yes. here is my finger.
>>
>>102039738
>yes. here is my finger.
that finger? >>102039609
>>
File: 00068-81057407.png (861 KB, 1216x832)
861 KB
861 KB PNG
>>
>>102039647
>>102039658
Works now, had to update the node

Anyone knows why you have to load the xxl encoder first and vit l second and not the other way around?
>>
File: 00071-81057410.png (924 KB, 1216x832)
924 KB
924 KB PNG
This movie was shit
>>
>>102039735
16 tops
>>
File: 00070-81057409.png (925 KB, 1216x832)
925 KB
925 KB PNG
>>
>>102039658
you can gguf CLIP vit-l but you shouldn't quant it so no point ggufing it anyway
>>
>>102039770
It is 16
>>
File: 00060-81057399.png (1016 KB, 1216x832)
1016 KB
1016 KB PNG
>>
>>
>>102039781
Why in dual clip loader you have to load first the t5 and then the clip_l ?
>>
>>102039814
you don't
>>
>>102039826
Then? It's first clip l and then t5?
>>
>>102039837
it doesn't care about the order, it knows which is which
>>
>>102039749
Yes! And its pointing at the moon >>102039563
See?
>>
File: 2024-08-23_00172_.png (1.85 MB, 1024x1024)
1.85 MB
1.85 MB PNG
>>
>>102039848
Ok thanks

Any idea why after updating comfy the gen time doubled with same settings?
>>
>>102039751
cute bug, catbox?
>>
File: 00077-355600679.png (908 KB, 1216x832)
908 KB
908 KB PNG
>>
File: 00104-2405011170.png (1.51 MB, 1024x1024)
1.51 MB
1.51 MB PNG
>>
>>102039358
you talking about Forge?
With Comfy using the GGUF Q8 is slightly slower than FP8 and with a lora enabled it is whole fucking 60% slower.
>>
>>102039874
lmao, nice one anon
>>
>>102039893
Before the last comfy update gguf was 50% faster
>>
>>102039878
sure https://files.catbox.moe/o1qua7.png

it's a simple prompt, I forgot to remove a prompt for another lora too
>>
>>102034493
how did it go
>>
File: 2024-08-23_00175_.png (1.81 MB, 1024x1024)
1.81 MB
1.81 MB PNG
>>102039898
thanks
>>
>>102039805
8 then
>>
Oven fresh piping hot bread ready for you...
>>102039916
>>102039916
>>102039916
>>
>>102039887
BRO
I got one too. one last rapeman.
>>
>>102036767
I train all my Loras at 4 so why shouldn't 2 work. It's still huge for a handful of 1024 jpgs converted to weights
>>
>>102039813
Close, but this is not eva elfie

Should a rank4 be used in place of rank2?
>>
>>102037881
You new here?
>>
>>102038013
Only they don't



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.