/g/ - Technology

File: tmp.jpg (1.24 MB, 3264x3264)
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102166301

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
Blessed thread of frenship
>>
pixart newma month tomorrow
>>
File: 00029-1337414476.png (882 KB, 1152x896)
lets see some beaves munching on wood itt
>>
File: file.jpg (585 KB, 1440x2560)
nic you said no underage beavers, how could u do this?
>>
Imagine if we had longer threads with more than a 300 reply bump limit.. I feel like ldg would have better discussions this way as more people would comment on tech or info that isn't lost to the previous threads. convo almost never carries over
>>
>>102169726
Yeah makes me miss old bbs style forums
>>
>>102169771
but you whipped azuki thats not nice.
>>
File: FD_00327_.png (1.33 MB, 1024x1024)
I am turning into an avatarfaggot with the amount of liliths I am genning. I need a new subject.
>>
>>102169775
>old bbs style forums
to think so many of them have been lost to time now that they're a relic of the modern age. we need to go back
>>
>>102169802
Based. Is the pixel art just from prompting or is it an additional LoRA?
>>
>>102169726
Imagine if we had an AI board
>>
>>102169810
Lisa Simpson playing the saxophone with characters from different shows.
>>
>>102169814
now it's all in Discord, unindexable, and it too will implode some day.
>>
>>102169814
Many went offline because of insecurities of phpBB allowing hackers to unsalt passwords and hack users that used the same passwords in several places.
>>
>>102169836
Don't prompt this it creates mouse turd gas
>>
File: file.png (653 KB, 512x512)
>>
File: ComfyUI_02744_.png (820 KB, 1024x1024)
Well, LoRA is done cooking at 7000 steps, like we all suspected the sample images were garbage compared to the final product.
>>
>>102169841
yeah well, what do you expect for people who do it for free.
>>
File: BlitzBallCover.png (1.11 MB, 768x1280)
Who said Flux couldn't do this?
>>
>>102169856
>Many went offline because of insecurities of phpBB allowing hackers to unsalt passwords and hack users that used the same passwords in several places.
I remember it, complete shitshow. Kiwifarms style ai board would be great
>>
>>102169872
Man I want to play Blood Bowl now
>>
>>102169856
>>102169848
sad times we live in...
>>
>>102169861
sovl
>>
>>102169865
nice, happy to see it worked out anon. I really wonder what the fuck goes so wrong with preview images
>>
File: ComfyUI_02749_.png (867 KB, 1024x1024)
>>102169910
No idea, I think the dataset still needs work though. It needs more environment shots or I gotta figure out what kind of prompting brings out the monogatariness of it.
>>
File: ComfyUI_Flux_25.jpg (2.33 MB, 2432x1664)
How much memory does xlabs ipadapter require? I can run Q4_K_S model with as many loras as I want without any issues but ipadapter alone instantly crashes with OOM (and that is on Xlabs' own Ksampler, using a regular one throws an error "Expected query, key, and value to have the same dtype, but got query.dtype: struct c10::Half key.dtype: struct c10::BFloat16 and value.dtype: struct c10::BFloat16 instead")
>>
Do people train with batch sizes other than 1?
>>
>>102169944
can anyone actually train flux with a higher batch size and not oom
>>
>>102169944
I think more than one is considered better.
>>
>>102169935
>tattoo
>>
File: NotLisa.png (512 KB, 1024x1024)
>>102169858
What if it's not Lisa?
>>
File: Flux_01763_.png (393 KB, 640x480)
>>102169865
nice work, do you get any occasionally pixelated gens from flux when using the LoRA ?

tryin to figure out why flux does that, if it is my lora or something else.. picrel
>>
File: FFLUX_00919_.png (846 KB, 1024x768)
>>102169643
>>
File: ComfyUI_02752_.png (712 KB, 1024x1024)
>>102169982
I have not noticed any pixel issues thus far.
>>
>>102169962
Don't people train the model in FP8?
>>
Finally got to see one of those pictures before the janny got it.
So it turns out there's nothing wrong with the pic, except that it's a blue board, I suggest catboxing.
>>
>>102170007
>I suggest catboxing.
Charitable to think he's doing this out of ignorance.
>>
File: FD_00063_.png (1.38 MB, 1024x1024)
>>102169878
We can do better than that. retardai.com. Just do what civit does except with nazi propaganda.
>>102169935
That's a pretty coherent keyboard. The double F row is a nice touch.
>>
>>102169944
I've seen anons here recommend it but I've always been told batch size 1 = most detail/similarity to the dataset and higher is just cope to speed up the bake time... I saw someone in an older thread say that was stupid and people don't understand what it does properly, but they didn't expand on that. it makes sense in my smol brain you'd want 1:1 accuracy to the style/subject/whatever unless you were doing an obscene amount of varied images (like a finetune)
>>
>>102169834
2chan has it
>>
>>102170022
I don't speak moon runes
>>
>>102170001
yeah, I still oom if I try to increase batch size on a rented 24gb though so idk, maybe bad settings on my part
>>
File: zuhair.png (1.31 MB, 1156x1200)
Can someone add a good hairstyle to this photo? I can't seem to get it to work.
I need to know my theoretical looksmaxxed appearance.
>>
>>102169999
were all your training images greater or equal to 1024 x 1024 ?

maybe it is because some of my training images are lower res but i'm training at 1024 x 1024
>>
File: 00038-14886969420.png (656 KB, 768x768)
me irl
>>
>>102170046
Training with batch size 1, in FP8, and with a resolution of 512 is only using 14GB for me.
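A rough sanity check on that 14GB figure, as a back-of-the-envelope sketch only. It assumes the Flux transformer has about 12 billion parameters (not stated in this thread) and counts weights alone, so activations, LoRA weights, optimizer state and the text encoders are all left out.

```python
# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"fp32": 4, "bf16": 2, "fp16": 2, "fp8": 1}

# Assumption for illustration: ~12B parameters in the Flux transformer.
FLUX_PARAMS = 12e9


def weight_gib(num_params, dtype):
    """Memory for the raw weights only, in GiB."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


for dtype in ("fp8", "bf16", "fp32"):
    print(f"{dtype}: {weight_gib(FLUX_PARAMS, dtype):.1f} GiB")
```

Under those assumptions the FP8 weights alone take roughly 11 GiB, which leaves only a few GB for everything else at 512 resolution and batch size 1.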
>>
File: 00304-2600149133.png (1.72 MB, 960x1440)
>>102170018
>retardai.com
lul
>>
>>102170020
Batch size is purely for performance. Everything else around it is coping vramlets. All batch size is, is how many images are processed in parallel. This is faster because getting the image from memory to the GPU is one of the slowest parts of training.
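For anyone who wants to see mechanically what batch size changes, here is a toy sketch in plain Python. It is a one-weight linear regression, not a LoRA trainer, and every name in it is made up for illustration. Each optimizer step averages the gradient over the batch, so at the same number of epochs a bigger batch means fewer weight updates.

```python
import random


def make_data(n, seed=0):
    """Toy dataset: y = 3x plus a little noise."""
    rng = random.Random(seed)
    xs = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    return [(x, 3.0 * x + rng.gauss(0.0, 0.1)) for x in xs]


def train(data, batch_size, lr=0.1, epochs=20, seed=0):
    """Mini-batch SGD on a single weight w; returns (w, optimizer_steps)."""
    rng = random.Random(seed)
    w, steps = 0.0, 0
    for _ in range(epochs):
        order = data[:]
        rng.shuffle(order)
        for i in range(0, len(order), batch_size):
            batch = order[i:i + batch_size]
            # Gradient of 0.5 * (w*x - y)**2 w.r.t. w, averaged over the batch.
            grad = sum((w * x - y) * x for x, y in batch) / len(batch)
            w -= lr * grad
            steps += 1
    return w, steps


data = make_data(64)
w1, steps1 = train(data, batch_size=1)
w4, steps4 = train(data, batch_size=4)
print(f"batch 1: w={w1:.3f} after {steps1} steps")
print(f"batch 4: w={w4:.3f} after {steps4} steps")
```

Both runs see the same 64 samples for 20 epochs, but batch size 4 takes a quarter of the steps. That is one reason step counts and learning rates tuned for one batch size may not carry over unchanged to another.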
>>
>>102170077
ywnbarw
>>
File: BlitzBall64.png (552 KB, 1280x768)
>>
>>102170075
I let bucketing do the work. I trained at both 512 and 1024 but the images themselves were 1920x1080
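A minimal sketch of what aspect-ratio bucketing does with a 1920x1080 source, assuming buckets are multiples of 64 pixels with an area capped at the training resolution squared. The function name, the 64-pixel step and the selection rule are illustrative assumptions, not the exact algorithm of any particular trainer.

```python
import math


def pick_bucket(width, height, target=1024, step=64):
    """Pick a (w, h) bucket close to the image's aspect ratio.

    Both sides are multiples of `step` and w * h never exceeds target**2.
    """
    aspect = width / height
    max_area = target * target
    best = None
    for w in range(step, 2 * target + 1, step):
        # Tallest height (multiple of step) that still fits the area budget.
        h = (max_area // w) // step * step
        if h < step:
            continue
        # Compare aspect ratios on a log scale so wide and tall are symmetric.
        err = abs(math.log((w / h) / aspect))
        if best is None or err < best[0]:
            best = (err, w, h)
    return best[1], best[2]


print(pick_bucket(1920, 1080, target=1024))
print(pick_bucket(1920, 1080, target=512))
```

With these assumptions a 16:9 image is downscaled into a wide bucket instead of being cropped square, so the same 1920x1080 files can be used for both the 512 and 1024 runs.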
>>
>>102170065
i look like this
>>
File: 00307-2600149134.png (1.73 MB, 960x1440)
>bro just git pull bro
>>
>>102170069
gotta think about those advertisers.
>>
>>102170022
Don't they delete everything after a while with nobody archiving it? I only post here because I know my posts will live forever and entertain future generations (that's why we archive them, right?)
>>
>>102170083
oh, that actually makes sense too. why the fuck were faggots saying it had to do with accuracy then, I'm certain I read that misinfo on multiple "guides"
>>
>>102170114
please don't reply to the thread schizo
>>
God, this thread is a celebration of ugliness. Gen something pleasing to the eye!
>>
File: 1721985286896239.png (783 KB, 2918x1806)
>>102170083
What about pic rel?
>>
File: 00308-161711749.png (1.7 MB, 960x1440)
>>102170119
there's archive site arhivach . ng
>>
>>102170142
No one is claiming the results are 1:1 or that you can use exactly the same training settings no matter what. But the advantages of using the highest batch size you can outweigh batch size 1 for 100,000 years.
>>
File: 00039-14886969420.png (797 KB, 688x1008)
>>
File: looksmaxxed.jpg (300 KB, 1156x1200)
>>102170065
Here you go mate
>>
>>102170168
so batch size 1 is more accurate and higher batch size is literally just for speed and really not necessary for loras with their smaller datasets?
>>
>>102170175
kek'd
>>
>>102170198
interesting conclusion lmao
feel free to compare batch size 1 and batch 4 and do a real test and not trust some retard that probably posted the graph about how SD3 is the best looking model ever
>>
File: 1718293103308578.webm (117 KB, 672x672)
>>
File: BlitzBoardg.png (888 KB, 1280x768)
>>
>>102170231
where's the naked catgirl?
>>
>>102170150
>arhivach . ng
That gives me a 404 error and archivach loads an ad stopped by ublock o_o
>>
File: FD_00077_.png (1.26 MB, 768x1344)
>>102170273
Enjoy your crypto miner
>>
>>102170219
I was just trying to confirm what you just said because it sounded conflicting with your previous post, but ok
>>
File: 00010-1287077561.png (1.63 MB, 936x1376)
>>102170273
it's .top my bad
>>
>>102170256
here you go
>>
>>102170295
kek
>>
>>102170292
no you weren't so let's start with you not lying
let me give you a protip: when a dumbass researcher posts a "EVERYTHING YOU KNOW ABOUT SOMETHING IS WRONG" it is typically bullshit. I bet everything about that research is full of holes and bunk, kind of like you.
but go ahead anon, train at batch size 1 lmao
>>
>>102170295
>8 string guitar with only 4 tuning pegs
As a guitarist I find this image highly offensive.
>>
>>102170309
ok Debo I'll be ignoring you again now. we could've had a conversation but you did this to yourself
>>
File: 000000_17107_.png (1.93 MB, 952x1587)
>>
>>102170347
esoteric Donnie Darko
>>
File: 1718772662382.jpg (187 KB, 1024x1024)
>>102170022
https://dec.2chan.net/85/res/70127.htm
いいスレ ("good thread")
>>
crooks the shooter was a mod on here?
>>
File: PlayingBlitzball.png (1.14 MB, 1280x768)
>>
File: ComfyUI_02783_.png (838 KB, 1280x720)
>>
File: ComfyUI_01476_.jpg (1.07 MB, 2112x2112)
>>
>>102170443
will you share the lora once the training finishes?
>>
>>102170295
Ah, that works.
...
AGH MY EYES, WHAT IS WRONG WITH 2CHAN
>>
>>102170440
made me think of an idea for a prompt, 2 subjects playing whatever boardgame and one of them brandishes a gun at the opponent. someone can gen that. just idea-guying
>>
>>102170363
>need to watch,, time-traveling Trump
>>
File: ComfyUI_02785_.png (757 KB, 1280x720)
>>102170462
I'm gonna do some less aggressive training overnight and see what pops out then I'll look at sharing it. I'm not entirely happy with how overbaked it seems.
>>
File: ComfyUI_01483_.png (1.71 MB, 1024x1024)
>>
File: fail.png (860 KB, 1280x768)
>>102170478
I pick an existing picture and use Joy Caption to turn it into a prompt that I then modify, otherwise I get stuff like picrel.
>2 Panel comic. The left panel shows 2 girls playing blitzball boardgame. In the panel of the right one of them brandishes a gun at the opponent.
>>
What's the verdict on Hyper-SD?
https://huggingface.co/ByteDance/Hyper-SD
>>
File: 00000-992777439_cleanup.png (3.06 MB, 1280x1920)
>>
>>102170618
It's like schnell, just do the steps required for a better picture, what's the point of being able to use fewer steps if the quality never gets as good as regular with more steps?
>>
File: 00366-429861477.png (1.63 MB, 960x1440)
>>
File: 00043-3818405146.png (1.48 MB, 896x1152)
using wildcard of vibrant colors on all the various things in the prompt.
sadly it doesn't know Telecaster.
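For anyone unfamiliar with wildcards, the idea can be sketched in a few lines. This assumes the `__name__` placeholder convention used by common wildcard extensions. The function, the color list and the prompt are made up for illustration and are not the actual setup used for this image.

```python
import random
import re


def expand_wildcards(prompt, wildcards, rng):
    """Replace every __name__ placeholder with a random entry from wildcards[name]."""
    def pick(match):
        options = wildcards.get(match.group(1))
        if not options:
            return match.group(0)  # unknown wildcard: leave the placeholder as-is
        return rng.choice(options)

    return re.sub(r"__([A-Za-z0-9_-]+?)__", pick, prompt)


# Hypothetical wildcard list; real setups usually load these from text files.
wildcards = {"color": ["crimson", "teal", "electric blue", "neon green", "magenta"]}
rng = random.Random(42)  # fixed seed so a run can be reproduced
template = "a woman with __color__ hair playing a __color__ guitar on a __color__ stage"
print(expand_wildcards(template, wildcards, rng))
```

Each placeholder is drawn independently, which is how a single template ends up with a different color on every object in the prompt.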
>>
>>102170681
Can Schnell use LoRAs made for Dev?
>>
Testing loras is tedious. What are good Loras for classic painterly styles.
I am using these and they mix well.
https://civitai.com/models/695276/mucha
https://civitai.com/models/672567?modelVersionId=752954
https://civitai.com/models/678853/sxz-dark-fantasy-flux
>>
File: 2024-08-31_00274_.png (1.02 MB, 832x1216)
Ahh yes, this is a good lora.
>>
>>102170742
You have a 12 year old's concept of rebellion. You're basically a poser.
>>
File: ComfyUI_01485_.png (1.54 MB, 1024x1024)
>>102169642
Hype
>>
File: FD_00085_.png (1.42 MB, 768x1344)
>>
>>102170801
Stop engaging
>>
>>102170775
I normally just do an xy in forge or wildcards to test them
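The same XY idea can be sketched outside the UI: build every LoRA x weight combination with a fixed seed so the only thing that changes between cells is the LoRA. The `<lora:name:weight>` prompt tag is assumed to match what the webui expects, and the LoRA names below are hypothetical placeholders.

```python
from itertools import product


def xy_jobs(base_prompt, loras, weights, seed=1234):
    """Return one generation job per (lora, weight) cell of the grid."""
    jobs = []
    for name, weight in product(loras, weights):
        jobs.append({
            "prompt": f"{base_prompt} <lora:{name}:{weight}>",
            "seed": seed,  # same seed everywhere so cells are comparable
            "cell": (name, weight),
        })
    return jobs


# Hypothetical LoRA file names, for illustration only.
jobs = xy_jobs(
    "portrait of a woman, oil painting",
    loras=["mucha_style", "dark_fantasy"],
    weights=[0.6, 0.8, 1.0],
)
for job in jobs:
    print(job["cell"], "->", job["prompt"])
```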
>>
File: FD_00095_.png (1.18 MB, 768x1344)
>>102170811
>>
File: 2024-08-31_00280_.png (1.31 MB, 1024x1024)
>>
>>102170926
If he announces the 5090 dressed like that I will buy one, regardless of the VRAM
>>
>>102170926
a, historical dictator, miku might be amusing.
>>
File: 2024-08-31_00284_.png (1.04 MB, 1024x1024)
>>102170933
kek
>>102170962
we had that several times in past threads.. I think I seen Migu versions of Hitler, Stalin and Lenin at the least
>>
12gb vram lora training guy that had an issue a while ago with the training results. The issue turned out to have nothing to do with the training. It was something weird with forge, I had to change Diffusion in Low Bits from automatic to automatic fp16 lora.

Not sure if an update caused this.

So anyone else with forge with weird lora issues, this might be the problem.
>>
File: FD_00106_.png (1.34 MB, 768x1344)


