/g/ - Technology

File: tmp.jpg (776 KB, 3264x3264)
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102162827

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
>26 minutes ago
>>
File: frame_00012.png (1.32 MB, 1920x1080)
I had to check the catalog to find the new one.
>>
File: 2024-08-31_00135_.png (1.31 MB, 832x1216)
>>102166301
ty baker
>>
File: FD_00209_.png (1.9 MB, 1024x1024)
Thanks for pinching off a fresh loaf
>>
thx mods for removing the pedo slop
>>
Nobody posting gens so here's how my LoRA is progressing.

I guess it makes sense with over 1300 images it needs more time in the oven but I hope the outputs aren't all this shitty on inference.
>>
>>102166872
A lot of the images have 'cloudy' output.
>>
>>102166872
Why so many images for a LoRA? Your character looks simple (idk who the fuck that is). You should be able to do it with 20-50.
>>
File: 2024-08-31_00152_.jpg (707 KB, 2496x3648)
>>102166872
her basic features are recognizable as Senjogahara, the style is not tho .. yea it needs more baking
>>
>>102166887
It's a style LoRA

>>102166896
I have the training set to 7000 steps and it will probably need it. I should have opted for epochs instead for ease of tracking. This one just popped out right after the new epoch began.
>>
File: 2024-08-31_00158_.png (1.43 MB, 832x1216)
>>102166919
you making a general monogatari style lora or just a character lora? if it's the latter >>102166887 is right, 1300 images is too many .. if it's a style lora, it's fine
>>
>>102166919
I did a style LoRA with only 103 images. 1300 seems extremely excessive.
>>
>>102166933
Style LoRA.
The images are tagged with the characters, but that was secondary to the overall style. It's doing an okay job of the background imho, here's a much earlier example. It's stopped giving me wrongly proportioned bodies so far.
>>
>>102166936
I don't think it's excessive, but more training than usual is necessary to get an acceptable result.
>>
>>102166949
what data set you using? screencaps?
>>
>>102166872
what base model?
>>
>>102166966
I downloaded videos of all the character PVs throughout the series and made a script that extracted an image every 0.5 seconds from the video, cleaned up the overly samey images and tagged them.
I think the nature of the videos kind of dooms the LoRA to having compression artifacts baked into the LoRA, though.
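
For anyone who wants to try the same frame grab: a minimal sketch, not the actual script used here. It assumes the `ffmpeg` binary is on PATH; the function names, the `frame_%05d.png` pattern and the file names are made up for illustration. The `fps` filter at 2 frames per second gives one image every 0.5 seconds.

```python
import subprocess
from pathlib import Path


def build_ffmpeg_cmd(video: str, out_dir: str, interval_s: float = 0.5) -> list[str]:
    """Build an ffmpeg command that saves one frame every `interval_s` seconds."""
    fps = 1.0 / interval_s  # 0.5 s between frames -> 2 frames per second
    pattern = str(Path(out_dir) / "frame_%05d.png")  # hypothetical naming scheme
    return ["ffmpeg", "-i", video, "-vf", f"fps={fps:g}", pattern]


def extract_frames(video: str, out_dir: str, interval_s: float = 0.5) -> None:
    """Create the output folder and run ffmpeg (needs ffmpeg installed)."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    subprocess.run(build_ffmpeg_cmd(video, out_dir, interval_s), check=True)


# Show the command for a hypothetical file without running ffmpeg.
print(build_ffmpeg_cmd("character_pv.mp4", "frames"))
```

Saving as PNG avoids adding a second round of lossy compression, but it can't undo artifacts already in the source video. Culling the near-duplicate frames is still a separate step.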
>>
>>102166966
Flux
>>
>my lora: 0 buzz
>some pics some dude made with my lora: 120 buzz
I'm not bitter, no sir
>>
File: 2024-08-31_00163_.jpg (1.08 MB, 2496x3648)
>>102166991
>compression artifacts baked into the LoRA, though
ya that was my thought too, did you use the blu rays or some other source? cause streams especially are pretty bad for making loras
>>
File: 2024-08-31_00016_.png (1.55 MB, 832x1216)
>>102167038
die in a fire
>>
>>102167034
Did he at least post the pic to your lora page?
>>
>>102167041
Nope, it's about as shitty of an image scrape as you could hope for.
At least when it's finished we can all take a good look at how shitty or actually okay it turns out.
>>
>I was going to stop
No you weren't lol. Why would you tell such an obviously false lie?
>>
File: 2024-08-31_00166_.png (1.24 MB, 832x1216)
why do diffusion generals attract the most horrible kind of people
>>
>>102167109
because you keep talking about them and feeding them
>>
>>102167109
it's not just diffusion generals, you can find a schizo like that on every board.
>>
>>102167144
Im sorry, ill stop.
>>
>>102166991
yeah, flux picks up on the small details way more, im noticing too

on doing 7000 steps, assuming your 1300 images have no repeats set, you're looking at a bit over 5 epochs (an epoch is one complete trip through all the images)

i would say go with epochs next time, set it to 16 epochs and save each one, can cancel training if it seems good by epoch 6 or 8. It's also possible to resume from a completed epoch (trying that now)

batch size also factors in.. if you've got the memory i always trust batch size 4. too high a batch size can reduce quality because it cuts the number of steps you get for the same number of epochs
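
The step/epoch math above, as a rough sketch. It assumes a "step" is one update on one batch and an epoch is one pass over images × repeats. Check how your trainer actually counts, because the function names here are just for illustration.

```python
import math


def steps_per_epoch(images: int, batch_size: int = 1, repeats: int = 1) -> int:
    """Number of batches needed for one full pass over the dataset."""
    return math.ceil(images * repeats / batch_size)


def epochs_for_steps(steps: int, images: int, batch_size: int = 1, repeats: int = 1) -> float:
    """How many passes over the dataset a fixed step budget amounts to."""
    return steps / steps_per_epoch(images, batch_size, repeats)


def steps_for_epochs(epochs: int, images: int, batch_size: int = 1, repeats: int = 1) -> int:
    """Total steps needed to train for a given number of epochs."""
    return epochs * steps_per_epoch(images, batch_size, repeats)


# 7000 steps over 1300 images, batch size 1, no repeats
print(round(epochs_for_steps(7000, 1300), 2))    # 5.38
# 16 epochs over the same set at batch size 4
print(steps_for_epochs(16, 1300, batch_size=4))  # 5200
```

Same 16 epochs at batch size 1 would be 20800 steps, which is the "higher batch size means fewer steps" tradeoff.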
>>
>>102167155
I generally keep a batch size of 3 at 512 but also do a second pass on the same dataset at 1024 with a batch size of 1 with fewer repeats. Idk, I just think mixing a few 1024x1024 images in might be good for output. I have no proof of that though.
>>
I'm gonna do it. I'm gonna pull comfy.
I can't stand having to clear my model cache manually every time I change the prompt.
Surely this is fixed in a new pull, right?
>>
>>102167290
no. It is doing everything properly. An update won't stop it from doing what it is doing. You can't seem to get what is actually happening through that thick skull.
>>
>>102167301
What is happening, Anon?
>>
>>102167309
I have explained it twice at this point.
>>
>>102167320
didn't ask for your life story
>>
>>102167320
You have not. You claimed it's something that doesn't happen on gguf, which I am using so it isn't that. This is all you have said on the matter (providing that was even you) because I have brought it up exactly once before this.
>>
>>102167329
I assumed you were also the guy asking for optimizations and started up with that guy who was acting as city.
>>
hello 1girls
>>
Oh no
>>
>>102167351
Not me. I don't pretend to be anyone, not even myself.
It only started happening recently. Prior to this I was using wildcards with no issue. Now if I change a single word I go OOM unless I clear the model cache.
>>
>>102167301
Calm down Comfy
>>
>>102167368
I look like that
>>
>>102167392
alright, short version, you are probably still fucked. Although I hear the new update isn't bricking shit so there is that.

>>102167395
as comfy I have a confession. lllyasviel is a better programmer than I and after some soul searching I wanted everyone to know.
>>
>>102167290
I hope for you they fixed the OOM issue with multiple loras, I had to go back to an older release after I pulled two days ago
>>
>>102167448
Multiple LoRAs is fine, that was fixed a few days ago. My issue is something else.
I pulled and updated the GGUF nodes and still the same problem so I don't know what's happening.
>>
>>102167460
Changing to Q6_K over Q8_0 seems fine. I guess I will just take the minor quality hit for convenience. Still annoying when it was all working perfectly less than a week ago.
>>
>>102167460
ookay .. ill pull then and try. Was very annoying that it would just OOM on my 4090 randomly on big loras
>>
>>102167497
Just update the gguf nodes, that's where the fix is. Update comfy too if you want I guess but it's the nodes that matter.
>>
>>102167501
nah I am using fp16 .. was a problem regardless
>>
File: FD_00276_.png (920 KB, 1024x1024)
Why is the thread so dead? Yanks asleep?
>>
>>102167519
Oh then I don't know what your issue is. City was in here a few days ago saying he fixed the LoRA issue and he did. Yours must be something else.
>>
>>102167523
nothing interesting going on, everybody either waiting for a good flux finetune or another model to come out
>>
File: Untitled-1.jpg (1.91 MB, 3768x2512)
>>
File: ComfyUI_00125_.png (800 KB, 1024x1024)
flux is fucking insane
it's easily better than DALL-E 3
i've been coming up with dozens of items for my RPG
even on a Q4 quant, CFG1, 20 steps, i am getting good shit 90% of the time

i think this is the model that's going to wind up causing congress to push legislation and regulation
>>
>>102167535
It's better without so much rampant faggotry desu. Just the normal amount of faggotry from the usual suspects.
>>
File: 2024-08-31_00175_.jpg (1.07 MB, 2496x3648)
>>102167531
I don't know what it was either.. but it's fixed, can load 4 big loras again now with fp16 and no OOM.
>>
>>102167553
>i think this is the model that's going to wind up causing congress to push legislation and regulation
And what will this even do? Cuck America in the AI race? The world is bigger than the USA.
>>
>>102167565
>And what will this even do? Cuck America in the AI race? The world is bigger than the USA.
this, if the US doesn't want to improve AI, other countries will gladly do it instead
>>
>>102167553
what kind of RPG has gamer girl bathwater?
>>
Whatever the model unloading issue is, it seems to be related to the quant models. Removing them from the equation and now it's not happening.
Sucks because the quality is so much better on the Q8 than FP8.
>>
Getting a little better at 4700 steps
>>
>>102167693
A meme game.
But I will guess it's an e-girl visual novel.
>>
>>102167109
And Dunning-Kruger sub 70 IQ redditor morons.
>>
>>102167726
I will have you know my IQ is in the 4th percentile.
>>
File: city.jpg (74 KB, 960x540)
>>102167713
shitty code
>>
>>102167109
the ai venn diagram overlaps a lot with the nft/crypto space, that and it tends to attract a lot of lonely gooner types, like me! my nuts hurt!
>>
>>102167746
>le funny uboat farming comment
>>
>>102167713
>Whatever the model unloading issue is, it seems to be related to the quant models. Removing them from the equation and now it's not happening.
>Sucks because the quality is so much better on the Q8 than FP8.
yeah, Q8 (or any GGUF quant) has some bugs that FP8 doesn't, let's hope city will fix them
https://github.com/city96/ComfyUI-GGUF/issues/84
>>
>>102167786
you've linked that before, it was 404 then and it is still 404
>>
>>102167797
wtf you're right, why is that 404 when I log off?
>>
>>102167563
grrr.. was too happy too soon.. it still goes OOM. back to 9230f658232fd94d0beeddb94aed093a1eca82b5 it is
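
Rolling back to a known-good commit can be scripted. A minimal sketch, assuming `git` is installed; the post doesn't say which repo the hash belongs to, so the `ComfyUI` folder name below is a hypothetical placeholder, as are the function names.

```python
import subprocess

# Commit hash quoted in this post; which repo it belongs to is an assumption.
KNOWN_GOOD = "9230f658232fd94d0beeddb94aed093a1eca82b5"


def checkout_cmd(repo_dir: str, commit: str = KNOWN_GOOD) -> list[str]:
    """Build the git command that checks out `commit` inside `repo_dir`."""
    return ["git", "-C", repo_dir, "checkout", commit]


def pin_repo(repo_dir: str, commit: str = KNOWN_GOOD) -> None:
    """Run the checkout; this leaves the repo in detached-HEAD state."""
    subprocess.run(checkout_cmd(repo_dir, commit), check=True)


# Show the command for a hypothetical install folder without running git.
print(checkout_cmd("ComfyUI"))
```

To get updates again later, check out your normal branch and pull.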
>>
File: FD_00299_.png (1.8 MB, 1024x1024)
>>102167775
>le funny uboat farming
>>
File: file.png (147 KB, 1778x1145)
>>102167786
>>102167797
the fuck is goin on? I tried to make this issue 3 times but no one can see it but me?
>>
>>102167838
city96 doesn't like you
>>
>>102167847
but I never talked to him on Github ;_;
>>
>>102167838
NTA but I can't view your profile at all, I'm not logged in though. Does Github shadowban people?
>>
>>102167838
probably moderator approval mode .. the comfy team is full of weirdos
>>
>>102167869
it's not a comfyorg repo
>>
>>102167863
why would I be shadowbanned? I absolutely did nothing special, I created this account a week ago and made 2 regular issues, that's all lol
>>
>>102167838
I hope your issue gets approved cause this seems to be exactly my problem
>>102167448
>>102167563
>>102167823
anything past the 92... release will randomly crash into OOM with big loras or multiple loras, while the 92.. release works perfectly fine regardless of how many loras I load
>>
>>102167838
I just submitted an issue and yours aren't there. Skips #87 and #88
>>
>>102167939
fuck, how am I gonna reach city now?
>>
>>102167955
city96 could be anywhere
>>
File: FD_00288_.png (1.03 MB, 1024x1024)
>>102167892
One of the changes he made was an attempt to fix the LoRA reloading issue, maybe a regression bug
>>
File: shamiko1.png (2.8 MB, 2048x2048)
>>
>>102167957
city, if you're reading this thread, can you please fix this?
https://files.catbox.moe/ox8hip.txt
>>
File: shamiko2.png (3.24 MB, 1792x2304)
>>
File: shamiko3.png (3.05 MB, 1792x2304)
>>
File: shamiko4.png (2.48 MB, 1664x2432)
>>
>>>/e/edg is over there
>>
>>102167971
>>102167976
>>102167985
>>102167991
You post the same shit every time. Does it not bore you?
>>
File: chie2.png (2.77 MB, 2048x2048)
>>102168018
>You post the same shit every time.
Nah, that's wrong.
Sometimes I post armpits.
>>
>>102168018
Go in to /sdg/. People have been prompting the same shit over and over for two years now. Actual mental illness. I even saw the angry hamburger guy here earlier today.
>>
foot fetishists can post their porn everywhere they go with no consequences but i can't?
>>
File: FD_00311_.png (1.88 MB, 1024x1024)
>>102168028
>Go in to /sdg/.
No.
>>
Fuck yeah
>>
>>102168031
correct.
>>
>>102166301
>https://imgsys.org/rankings
No AuraFlow, no Kolors, what is this, an advert for Flux Pro?
>>
File: 1167752492635494163-SD.png (1.24 MB, 896x1152)
whats the current best flux finetune anons?
>>
File: FD_00316_.png (1.05 MB, 1024x1024)
Oh shit this cunt knows transformers
>>
>>102168101
are there any REAL finetunes of Flux yet?
>>
>>102168101
None, they are all shit
>>102168098
>you don't list every random model available
wow what a load of shills
Here's why
https://civitai.com/search/models?baseModel=AuraFlow&baseModel=Kolors&sortBy=models_v9
>>
>>102168126
The Ponyfucker is gonna make Aura Flow relevant.
>>
File: 1167752492635494131-SD.png (1.94 MB, 896x1152)
>>102168120
I dont know, hence the q

>>102168126
thought so
>>
>>102168126
>Here's why
nta but i don't get what you were trying to prove with that link, they all look pretty good.
>>
>>102168145
>The Ponyfucker is gonna make Aura Flow relevant.
I'm not sure of that, recently there was a giant Hunyuan hentai finetune and no one gave a fuck

>>102168156
I think the point is that there are only ~10 contributions on civitai for AuraFlow, that's what Flux gets in one hour, no one gives a fuck about that model
>>
File: FD_00320_.png (1.36 MB, 1024x1024)
>>102168145
And someone smarter is going to mog him on Flux.
>>102168156
>25 models combined
>of those only 9 are LoRAs
You're right it proves nothing. Very good models. Very popular.
>>
>>102168168
>no one gives a fuck about that model
this mentality is poisonous, one of the reasons /ldg/ split from /sdg/ is to avoid that train of thought
>>
>>102168184
the reality is here, if Auraflow was a hype model there would be a lot of loras on civitai, there's nothing because there are better toys to play with, like Flux
>>
File: FD_00303_.png (1.06 MB, 1024x1024)
>>102168184
Sorry Anon but Aura Flow had potential and got completely shit on by the release of Flux. That's just how the cookie crumbles sometimes.
Sigma had potential too, so did Lumina, but they don't hold a candle to Flux.
>>
SDXL was fairly irrelevant compared to SD 1.5 until Pony came out.
>>
>>102168101
the flux version of copax is alright... but the bar is pretty low for finetunes currently, https://civitai.com/models/118111?modelVersionId=778112
>>
>>102168201
XL had a shit load more LoRAs and fine tunes before Pony too. More than 1 in 2 months.
If Aura Flow becomes relevant it will go into the OP. But it's just a waste of space right now. Bitching about it is dumb.
>>
>>102168126
>Here's why
AuraFlow and Kolors are outperforming Flux around here, except in text and prompt adherence.
Is that it? Do people really value text and where things are in a picture that much? I beg people to send the same prompt to Flux and Kolors and compare the outputs. Getting better text and adherence into Kolors would have gotten us to where we want to go faster than creating a LoRA for every single thing Kolors can't do.


