[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: PW_79377_.png (1.57 MB, 1024x1280)
1.57 MB
1.57 MB PNG
Previous /sdg/ thread : >>101697538

>Beginner UI local install
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SD.Next: https://github.com/vladmandic/automatic
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>Try online without registration
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://aitracker.art
https://openmodeldb.info

>Black Forest Labs: Flux
https://huggingface.co/black-forest-labs/FLUX.1-schnell
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Discord
6wUwtcJsr2

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
>>101700396
Curt Cobain version please
>>
rip porn models
>>
Where are the controlnets? IP adaptor, xinsir unified controlnet, finetunes?

DEAD
ON
ARRIVAL
>>
File: image(264).jpg (26 KB, 512x512)
26 KB
26 KB JPG
scribble is hard
>>
File: PW_79408_.png (1.28 MB, 1024x1280)
1.28 MB
1.28 MB PNG
>>
File: 1721764924544679.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
It's no censorship DALL-E, open source won.
>>
>>101700504
you can take that and do openpose on it, and it should give the pose info and then you prompt whatever you want. scribble is tricky but you can get a lot of cool outputs with it, and then do other stuff with that.
>>
>>101700485
it would take an ungodly amount of compute to finetune, but it's not impossible
pony freaks have already died on this hill, they have nothing left to lose
>>
>>101700519
and their work yielded the best models for making anime characters, even if it's good at one thing it can also be good at another
>>
>>101700519
>it would take an ungodly amount of compute
are you niggers seriously that poor and cant rent 4xA100s for a day or two to get decent results
>>
File: ComfyUI_00009_.png (917 KB, 1024x1024)
917 KB
917 KB PNG
just tried out flux for the first time and it's crazy good for a base model

this is a very basic gen but the clearly readable hiragana is a million times better than you get with any SDXL model...

but idk what to do with it until there's shit like ipadapter and controlnet
>>
>>101700544
I don't have hundreds of terabytes of training datasets
>>
File: 1711264915514003.png (1011 KB, 1024x1024)
1011 KB
1011 KB PNG
>>101700545
you exploit the good text and prompt understanding to make memes
>>
File deleted.
>>101700545
anything you want, that's the beauty of it
stop thinking in 1girl, hands on hips terms and go wild
>>
File: 1691303153188510.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>101700565
better:
>>
File: ComfyUI_00328_.png (1010 KB, 1024x1024)
1010 KB
1010 KB PNG
>>101700485
They gave us the drug before taking it back from us, what a toxic poison gift lmao
What is the reason we can't ? I mean if you have enough GPU you should be able to finetune it right ? Or did they use some ancient black magic to protect their virtual data across the globe ?
>picrel: miku live reaction
>>
File: FD_00477_.png (1.29 MB, 1024x1024)
1.29 MB
1.29 MB PNG
>>
File: F_00013_.png (1.42 MB, 1024x1024)
1.42 MB
1.42 MB PNG
>>101700484
I tried kurt cobain as a muppet
>>
>>101700582
there is absolutely nothing stopping you from inpainting gens or img2img in ponyXL which excels at porn
>>
>>101700591
That doesn't explain why we can't still
Why especially this model ?
For LLM, we can train big model across multiple GPU or load them in 8bit/4bit for finetune to use less VRAM, so why can't we here ? What is the catch ?
>>
>>101700583
Very nice
>>
>>101700608
I wouldn't worry about it, there's a lot of buzz around it so we'll find out pretty soon
>>
>>101700608
it's day 1, I guarantee you people will add stuff like controlnets or what have you. but even as-is, it's very very good at depicting a range of prompts/styles. The text depiction and range of text is also amazing (it can do cursive even)
>>
>>101700608
>some obvious tranny with a fake ass Japanese name says you can't fine tune the model in a random screenshot of a discord chat
>Suddenly call into question the well established fact that all models up until now have been trainable.

Think about it.
>>
>>101700617
I just want a little bit more lewd/erotica, not even porn. It's so frustrating. But I guess you're right. I mean a small LoRA would do the trick I'm pretty sure.
>>
Can someone drop a catbox using flux? I cannot get the fucking UI to work and I wanna kill myself.
>>
File: ComfyUI_00237_.png (855 KB, 1024x1024)
855 KB
855 KB PNG
>>101700623
>>101700625
Yeah y'all right we can only wait for now
>>
File: FD_00487_.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
>>101700485
literally who?
>>
File: 00081-3613429703.jpg (846 KB, 1560x2328)
846 KB
846 KB JPG
will try to sleep, though i stayed up way too late and will struggle to fall asleep.
today turned into clip caping random pics and pasting random tokens into the prompt, and... all bad results!
goonite
>>
File: 1707414248227601.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
the stuff that dalle will reject is now fine

OPEN SOURCE BABY
>>
I take it a VRAMlet like myself won't ever be able to run flux, right?
>>
Guys whats the most cheapest most bargain bin card that can run current SDXL models, no im not looking for middling or average performance I want an absolute bargain bin poorfag card that can just barley cross the threshold to generate images, Im asking for science I guess.
>>
File: FD_00491_.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
>>101700705
if you can run other FP8 shit you can run this. Just not at it's full glory (unless you don't mind waiting 30 minutes per image)
>>
File: 1696667671181296.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
>>
>>101700721
Any card made after 2016 with 4 GB supporting a high enough cuda version for your UI of choice. Use Forge or SD.Next or Fooocus, they should have enough optimizations to run it fine as long as you have 16 GB RAM

See https://developer.nvidia.com/cuda-gpus
>>
File: PW_79346_.png (1.21 MB, 1024x1280)
1.21 MB
1.21 MB PNG
>>101700681
Goodnight, sleep well!!

I think it's about time for me to sleep too haha
Goodnight, anons! :]
>>
File: FD_00496_.png (1.07 MB, 1024x1024)
1.07 MB
1.07 MB PNG
What celebrities does Flux know other than Trump and Biden?
Joe Rogan isn't one of them
>>
File: Poyo.png (1.01 MB, 1024x1024)
1.01 MB
1.01 MB PNG
>>101700761
Alright thanks, I'll take a look & see what has later CUDA ver. and at least 4 GBs or VRAM, all I needed to know thanks again.
>>
>>101700782
I'm personally using a Quadro P2000 with SD.Forge UI and I get acceptable times on sdxl/pony, with 16 GB RAM, of which 3/4 are occupied by the genning (depending on resolution)
>>
File: 1721960801629704.png (1.4 MB, 1024x1024)
1.4 MB
1.4 MB PNG
>>
File: 1702207801262222.png (1.72 MB, 1024x1024)
1.72 MB
1.72 MB PNG
>>
File: 1721897646695924.png (1.77 MB, 1024x1024)
1.77 MB
1.77 MB PNG
>>
File: ComfyUI_Flux_1271.jpg (138 KB, 1024x1024)
138 KB
138 KB JPG
>>
File: 1699123544270927.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
lmao it even knows styles like 1940s comics, this is great

for this prompt I didn't specify dialogue though.
>>
>>101700846
That's an actual Joe Biden quote too.
>>
File: 2024-08-02_00118_.png (1.85 MB, 1280x1280)
1.85 MB
1.85 MB PNG
>>101700631
be more creative with your prompt if you just want erotica

also good morning anons
>>
>>101700782
you want 8 at the bare minimum, better 10 or more
>>
File: ComfyUI_Flux_1279.jpg (123 KB, 1024x1024)
123 KB
123 KB JPG
>>
File: 1692383992830071.png (1.05 MB, 1024x1024)
1.05 MB
1.05 MB PNG
>>
File: FD_00506_.png (1.05 MB, 1024x1024)
1.05 MB
1.05 MB PNG
>>101700867
Imagen doesn't normally draw out the creative types
>>
File: 1722212989943549.png (1.65 MB, 1024x1024)
1.65 MB
1.65 MB PNG
specifically mentioning white speech bubble did the trick.
>>
File: ComfyUI_hsfb_00001_.jpg (2.48 MB, 2560x1440)
2.48 MB
2.48 MB JPG
>>101700846
>>101700952
neat
>>
File: 1713343652414917.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>
File: 1702330889521234.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
lmao, this one worked well

>Batman punching Joe Biden in the face with his fist in the oval office, Joe Biden looks confused and not sure where he is, Batman is smiling and saying "get out, old man" in a white speech bubble, Joe Biden is saying "Hello, Spiderman." in a white speech bubble, the image is in the style of comics from the 1940s
>>
>>101700976
>>101700976
we have come so far yet have a long way to go on hands
>>
File: 1720970713872513.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>101700987
we've come very far with ponyXL/SDXL and now this
>>
File: sd_mannequin.png (281 KB, 1475x456)
281 KB
281 KB PNG
>>101700517
now I'm trying 3D mannequin image + scribble and it works
mannequin is good when you like the body
>>
>>101700996
neat, theres something called openpose editor where you can make a pose for controlnet, but that looks even more detailed
>>
File: FD_00522_.png (899 KB, 1024x1024)
899 KB
899 KB PNG
>>101700992
XL is a pile of garbage compared even with Pony
>>
>>101701021
well pony is hyper trained on everything so it works great for anime or even realism with the right model

autismmix-confetti is top tier for anime gens for example.
>>
File: 1701253871019715.png (1.42 MB, 1024x1024)
1.42 MB
1.42 MB PNG
>Joe Biden is in the oval office, Joe Biden looks confused and not sure where he is. Joe Biden is watching a TV with CNN on the television.
The television has a chyron saying "why you are racist" with a black news reporter holding a mic. The image is in the style of comics from the 1940s

the prompt understanding is really good. i'll still be using sdxl/ponyXL though for characters and controlnets, but this is a very good open source model.
>>
File: FD_00524_.png (835 KB, 1024x1024)
835 KB
835 KB PNG
>>101701028
Pony is the most fine tuned model out there, and it's getting mogged by a foundational model.
SAI is dead. There's no way SD3.1 can compete. I feel bad for whomever is still giving them money.
>>
>>101700770
Take it easy dude!
>>
>>101701044
all sd3 needs is not to be cucked in the training stage: if it's censored then it cant do anatomy well.

pony is trained on EVERYTHING so it can make hands and figures very well, clothed or not, porn or not.
>>
>>101701028
anon what's better as an anime non porn model, autismmix models or animagine?
>>
>>101701059
animagine, all pony models are made for porn first in mind
>>
File: 1711043615132410.png (1.38 MB, 1024x1024)
1.38 MB
1.38 MB PNG
>>101701059
animagine is good but autismmix is the best so far, even without loras you can prompt random characters by their booru tag and get great results. and then WITH loras it's even better.

I use this extension which is really useful:

https://github.com/DominikDoom/a1111-sd-webui-tagcomplete

if I type fate for example, i'll see stuff like artoria/nero/etc pop up and it shows the popularity of the tag, so if I use the nero tag i'll get a nero, even without a fate LORA.

for example here is a mordred I got with a simple mordred prompt:
>>
File: ComfyUI_hgsh_00001_.jpg (2.27 MB, 2560x1440)
2.27 MB
2.27 MB JPG
>>101701037
what model is that?

>posting old gens while I wait for my 1060 to struggle through the current one
>>
>>101701079
>>101701087
super helpful, thx anons
>>
Pony founder is discussing finetuning w BFL
>>
>>101701111
Both models are good, the great thing about having multiple models is you can get different styles with one click, just like a Lora.
>>
>>101701054
>all sd3 needs is not to be cucked in the training stage
it will be. I promise you they have no idea what they are doing. Flux was made by the people who invented Stable Diffusion. The only people left at SAI are retarded middle managers.
>>
tl;dr flux? will it kill cokejeets company?
>>
>>101701160
open source dall-e, in terms of functionality: demanding on VRAM but good
>>
File: FD_00538_.png (910 KB, 1024x1024)
910 KB
910 KB PNG
>>101701160
It can into hands.
>>
File: 1720537095916596.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
it's joever
>>
File: 1699149611960040.png (1.05 MB, 1024x1024)
1.05 MB
1.05 MB PNG
lmao the prompt comprehension is great
>>
>>101701160
considering the brain drain and general retardation from cokejeets company I don't think they can redeem
>>
>>101701160
who's cokejeets? emad?
>>
File: FD_00583_.png (1.77 MB, 1024x1024)
1.77 MB
1.77 MB PNG
>>
>>101700545
>clearly readable hiragana
>>
3060 12GB. Thankfully, the biggest slowdown is caused by the lack of VRAM, increasing resolution has much less impact than I expected.
>>
>deleted image of Hamster it was labeled incorrectly as a Kolors gen wheni t was FLux kek..
>>
File: 1555123263242.jpg (268 KB, 450x1398)
268 KB
268 KB JPG
>>101701841
>>
File: Flux_00199_.png (752 KB, 1024x800)
752 KB
752 KB PNG
>>
>>101701854
yeah. mine is 1060 6GB and it works,
now 1060 24GB is needed.
>>
>>101700608
There's not a technical reason preventing it. There just hasn't been the need for that sort of infrastructure since all of the local image models worth using have been so tiny. Now that we're into the 10B+ range you'll see the tools start adapting to it.
>>
File: 000000_15748_.png (2.02 MB, 1075x1434)
2.02 MB
2.02 MB PNG
>Flux w/SD3 USDU
>>
File: 19.png (1.44 MB, 720x1448)
1.44 MB
1.44 MB PNG
>>
File: MIKU.png (473 KB, 752x1459)
473 KB
473 KB PNG
>>
File: scribble04.png (905 KB, 908x1595)
905 KB
905 KB PNG
improved
>>
does comfyui have merge / combine nodes out of the box yet or will I have to risk downloading malware?
>>
>>101700396
Why is there a discord in OP?
>>
File: ComfyUI_temp_dzruo_00006_.jpg (2.74 MB, 2560x1440)
2.74 MB
2.74 MB JPG
>>101702855
pretty sure it does
>>
>>101702717
NOO THIS IS LITERAL GENOCIDE, THINK OF THE POOR SKIBIDI MIKU ARTISTS
>>
File: modelandclipmerging.png (59 KB, 753x474)
59 KB
59 KB PNG
>>101703176
bah, wrong image
>>
>>101703176
>>101703201
still doesn't, sad
>>
>>101703201
just realized the select menus weren't highlighted

add node - advanced - model merging
>>
>>101702855
I wouldnt download anything linked / mentioned here after what debo tried to pull off when he wanted to catch schizo anon
>>
>>101703199
>>101702717
this image not being done by some twitter folx cost minority artists 100s of dollars
>>
File: swimsuit_shoes.jpg (127 KB, 1024x1504)
127 KB
127 KB JPG
almost perfection
>>
Is this the tranny thread?
>>
>>101703339
this is the verification thread for the doxcord silly
>>
Does flux work on a1111?
>>
>>101703583
no, they hate voldy for some reason, did not give him access to preview release, so a1111 bros need to wait
>>
>>101703605
>former stability employees hate voldy
Imagine my shock!
>>
>>101703621
why do they have him doebeit?
>>
File: 000000_15752_.png (2.08 MB, 1075x1434)
2.08 MB
2.08 MB PNG
>FLUX ftw
>>
SKIBIDI MIKU
>>
File: 1722617569961260.jpg (1.64 MB, 2576x1944)
1.64 MB
1.64 MB JPG
I came out as trans to my parents today.
It did not go well.
>>
>>101703605
comfy wasn't given any special access. he just downloaded the model and integrated it. nothing is stopping voldy from doing the same, apart from his crippling video game addiciton
>>
File: 683299.png (881 KB, 1172x441)
881 KB
881 KB PNG
Someone please implement this on comfy/auto
https://github.com/SusungHong/SEG-SDXL
>>
>>101703841
>comfy wasn't given any special access
he was though
>>
>source: my ass
>>
>>101703882
>>101702282
>we have a partnership with comfy org which, amongst other things, lets us ensure that comfyui and swarmui have day-0 (or day-1 when mcmonkey manages to sleep through the entire release day) support for our models
>>
>>101703908
LARP
>>
>>101702717

Someone with A. I. D. S. on the internet again.
>>
File: delux_chi_00007_.jpg (216 KB, 1024x1024)
216 KB
216 KB JPG
>>
verifying a sandwich
>>
File: delux_chi_00008_.jpg (209 KB, 1024x1024)
209 KB
209 KB JPG
>>
>>101703851
this looks promising
>>
File: delux_chi_00009_.jpg (228 KB, 1024x1024)
228 KB
228 KB JPG
>>101704148
>>
>>101703851
Will try on comfy, can't promise auto tho
>>
What is the thread theme?
>>
File: delux_chi_00010_.jpg (274 KB, 1344x768)
274 KB
274 KB JPG
>>101704355
chibis
>>
File: delux_chi_00012_.jpg (197 KB, 1024x1024)
197 KB
197 KB JPG
>>
File: 00092-3239984942.jpg (366 KB, 1000x1328)
366 KB
366 KB JPG
https://files.catbox.moe/4cwqpo.jpg
nearly forgot, today is caturday... je suis expecting beaucoup cats today..
>>
File: delux_chi_00014_.jpg (157 KB, 1024x1024)
157 KB
157 KB JPG
>>101704481
>>
File: when-peace-2.jpg (224 KB, 1344x768)
224 KB
224 KB JPG
>>101704368
>>
File: file.png (2.9 MB, 1024x1792)
2.9 MB
2.9 MB PNG
cool, 1024x1024 on 3060 12gb
this is with modules that contain weights, its loading and unloading each module as needed, my ssd is slow which doesnt help
ill experiment with loading weights to ram and using set_constants, that method would only need 1 module for each block type too, might be faster
also with the planned workspace changes for lazy loading/free after use (nearly, minus 1 or 2 for FluxTransformerBlock) every module for a block type could be loaded at once, then the layers would run much faster because it would only need to allocate the workspace, loading the next module is slowing it down
see the second progress bar for a demonstration of the potential speed by loading only 8 of the modules for each block type, about 4x faster than it is atm
>>
File: file.jpg (285 KB, 1024x1792)
285 KB
285 KB JPG
>>101704670
fp8 weights will help too
also inb4 anon starts moaning again
>>
>Julien
>>
File: delux_chi_00018_.jpg (252 KB, 1024x1024)
252 KB
252 KB JPG
>>101704670
sounds too good to be true. what's the catch?
>>
>>101704730
Anon, I'm gonna cooom so hard thinking about those juicy modules and thicc VRAM. Mmm yeah, load those weights into my tight RAM daddy. Fuck my SSD harder with those workspace changes. I wanna feel every inch of those FluxTransformerBlocks inside me. Pound my progress bars with your fp8, you dirty l33t h4x0r. AHHH AHHH I'M ALLOCATING!!!!1! >_<
>>
>>101704670
so this and AIT could mean vramlets could access flux without having to consider the heat death of the universe?
>>
>>101704781
>53s/it
>>
File: file.jpg (287 KB, 1024x1792)
287 KB
287 KB JPG
>>101704773
idk none i guess
>>101704781
yeah, it would need someone to develop the ait comfy node again though
>>101704855
thats about the same as 3060 users are getting atm with shared memory. also you're ignoring everything about how im going to make it faster. idk why people like you have to moan at everything
>>
>>101704921
>moan
ctft?
>>
File: delux_chi_00024_.jpg (286 KB, 1024x1024)
286 KB
286 KB JPG
>>
File: blackhair_greyeyes.jpg (145 KB, 768x1696)
145 KB
145 KB JPG
PE teacher
>>
>>101705061
why so much leg? too much leg
>>
File: 00_08.jpg (238 KB, 1552x1200)
238 KB
238 KB JPG
If an argument goes on for more than 5 minutes, neither person has any idea what they are talking about
>>
any good tags for someone who's working or focusd on work? there's no danbooru tags for that sadly.
>>
File: 00117-2017815449.jpg (1.59 MB, 1942x2479)
1.59 MB
1.59 MB JPG
>>101705132
so true
>>101705156
at work, maybe
>>
File: delux_chi_00026_.jpg (212 KB, 1024x1024)
212 KB
212 KB JPG
>>101705156
what do you consider a 'focused' expression though?
>>
>>101705156
(subject) working on a (whatever)
>>
>sdg and ldg both using lux
>>
>>101705302
ze 'lux
>>
>>101705302
flux*
>>
>>101705187
>>101705296
thanks
>>101705249
like someone foccused on work, oblivious to things other than it
>>
>>101705302
maybe it's time for /ldg/ to come home, it's not really even an 'alternative' like pixart and whatever other shit they were using since flux is where the sd devs went
the other thread has no reason to exist anymore
>>
File: tmptkb4jgat.png (1.16 MB, 768x1024)
1.16 MB
1.16 MB PNG
>>
>>101705340
in two weeks most of you will be doing coomer 1girls from pony
>>
File: blackhair_greyeyes2.jpg (150 KB, 768x1696)
150 KB
150 KB JPG
>>101705069
Evolutionarily, to run long distances.
Currently, to please me.
>>
File: delux_chi_00029_.jpg (248 KB, 1024x1024)
248 KB
248 KB JPG
>>101705328
>what do you consider a 'focused'
>>like someone foccused on work
uhh...

>>101705340
they didn't split because they're a fundamentally different topic. they split because they wanted their own insular thing.
>>
File: file.png (203 KB, 316x1076)
203 KB
203 KB PNG
>>101705516
gee anon I wonder
>>
File: 00009-2152262942.jpg (298 KB, 1280x1384)
298 KB
298 KB JPG
How many people you know
Who can name every serial killer who ever existed in a row
>>
>>101705340
they wanted to get away from the constant rulebreaking, avatarfagging, julien & comfy public homosexual ERPing and the thread schizo
>>
>>101705605
lets be fair the drama is the best part of /sdg/
>>
File: 00133-3305710020.jpg (1007 KB, 1560x2328)
1007 KB
1007 KB JPG
>>101705605
thankfully we still have you, (and me~)
>>
File: ComfyUI_Flux_1767.jpg (135 KB, 1152x864)
135 KB
135 KB JPG
>>
>>101705340
kek, is this the new cope? SD itself is based on research done by these guys beforre they joined stability. are you going to say that everything is SD then?
>>
how to prompt The Matrix style green 1s and 0s raining in the background?

I wanna do waifu in leather coat and bikini and shades with that as a background
>>
>>101705756
matrix digital rain, or matrix code
>>
>>101705792
thanks
>>
File: file.jpg (384 KB, 1024x1792)
384 KB
384 KB JPG
>>
>>101705813
>1024x1792
>>
>>101705639
I hope you're doing well friend
>>
File: delux_chi_00031_.jpg (700 KB, 1024x1024)
700 KB
700 KB JPG
>>101705756
"matrix code rain" or just "code rain" is what I've seen it referred to as. but its not 0s and 1s, its complex characters
>>
>>101705883
that's fine, thanks anon
>>
>>101705340
/sdg/ is the SAI shilling thread and is about stable diffusion only
the drama and schizos are contained here too
/ldg/ welcomes all local models and is very friendly in general
>>
>>101705823
the sign of a good model that does exactly what you ask for
>>
File: fs_1069.jpg (91 KB, 1152x784)
91 KB
91 KB JPG
>>
File: anime_real.jpg (143 KB, 768x1696)
143 KB
143 KB JPG
modified fingers with mspaint
>>
File: FLUX__00125_.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
not sure where the straw came from
>>
File: 000000_15754_.png (1.87 MB, 1640x937)
1.87 MB
1.87 MB PNG
>>
>>101706162
>Anon thinks parasitic crystalline growth structures is straw, cute.
>>
File: ComfyUI_10275_.png (1.66 MB, 1120x1440)
1.66 MB
1.66 MB PNG
>>
>>101706212
blue board
>>
File: ComfyUI_temp_brsxd_00006_.png (3.21 MB, 1120x1440)
3.21 MB
3.21 MB PNG
>>
File: delux_chi_00032_.jpg (773 KB, 1024x1024)
773 KB
773 KB JPG
>>101706162
>code rain
>its just rain
>>
>>101706289
so code rain for background?
>>
>>101704309
<3
>>
File: ComfyUI_10281_.png (1.35 MB, 1120x1440)
1.35 MB
1.35 MB PNG
>>
File: fs_1135.jpg (146 KB, 1280x960)
146 KB
146 KB JPG
>>
File: ComfyUI_10284_.png (1.4 MB, 1120x1440)
1.4 MB
1.4 MB PNG
>>
File: delux_chi_00036_.jpg (681 KB, 1024x1024)
681 KB
681 KB JPG
>>
File: FLUX__00137_.png (856 KB, 896x1152)
856 KB
856 KB PNG
>>
XL vae still runs in lowvram mode with 8GB VRAM after the commit that changed memory management for flux. is it possible to revert a portable install?
>>
What addons do you guys use for alternate methods of downweighting prompt terms? I used to use BlenderNeko's addon but it hasn't been updated in 4 months and doesn't work with SD3 last I checked
>>
>>101705631
Enjoy the dead general
>>
might do a ten thread recap, I haven't done one in ages since I haven't been around
>>
>>101706706
I can't wait until you do one of those
>>
>>101706634
it now also goes into lowvram mode whenever i use ipadapter, damn, i am going to just paste the 27 build on top of my current instal and hope for the best
>>
File: delux_chi_00039_.jpg (460 KB, 1024x1024)
460 KB
460 KB JPG
>>101706706
did you see a waifu diffusion V demo released?
>>
>>101706747
waifu diffusion is still a thing? woa didnt hear from them since late 2022/ early 2023
>>
>>101706747
No, what's the base? Sounds exciting but can't get distracted rn I have another 1200 images to look through still
>>
File: delux_chi_00040_.jpg (357 KB, 1024x1024)
357 KB
357 KB JPG
>>101706768
and its still just as bad too
>>
Morning anons
>>
File: delux_chi_00041_.jpg (447 KB, 1024x1024)
447 KB
447 KB JPG
>>101706880
>>
File: Chen.jpg (236 KB, 1280x1856)
236 KB
236 KB JPG
>>101706880
Morning
>>
>>101706722
that fixed it, guess i just won't update this install ever again
>>
File: ComfyUI_00076_.png (2.07 MB, 1024x1536)
2.07 MB
2.07 MB PNG
schnell ain't worth it, has anyone figured out how to prompt styles consistently with flux? i try for something like monet and it just does a generic painting and the facial features end up being photo-like
>>
File: ComfyUI_10287_.png (1.35 MB, 1120x1440)
1.35 MB
1.35 MB PNG
>>
File: delux_chi_00043_.jpg (466 KB, 1024x1024)
466 KB
466 KB JPG
>>101706926
>prompt styles consistently with flux
I've seen people say to use the clip encoder for style input. idk how well it works, but its something you can try out
>>
File: 000000_15764_.png (2.19 MB, 1498x1165)
2.19 MB
2.19 MB PNG
>>101706880
G'mornin Quokkas, Chibis theme
>>
File: ComfyUI_00077_.png (1.67 MB, 1024x1536)
1.67 MB
1.67 MB PNG
>>101706951
i've tried that but it honestly doesn't seem to do a very good job (flux d on the same prompt as above), shit takes 5 minutes to gen tho vv sad

i hear flux is still good at small resolutions so i might try that and then upscale with SDXL
>>
>>101706926
>Impressionist traditional painting, oil paint on canvas, in the style of Claude Monet. --YOUR PROMPT--
set guidance to 1.5-2 run 35 steps
>>
File: ComfyUI_Flux_1873.jpg (200 KB, 1152x864)
200 KB
200 KB JPG
>>
File: tmpoodwqr4c.png (1.03 MB, 768x1024)
1.03 MB
1.03 MB PNG
>>
File: ComfyUI_10294_.png (2.19 MB, 1024x1440)
2.19 MB
2.19 MB PNG
>>
File: tmpb_hfecsi.png (1.03 MB, 768x1024)
1.03 MB
1.03 MB PNG
>>101707047
Been a while since we've seen a chimeric creature
>>
File: ComfyUI_00079_.png (1.64 MB, 1024x1536)
1.64 MB
1.64 MB PNG
>>101707007
thanks i'll try that, though i think i've done similar in the past i usually put it at the end of the prompt
>>
File: ComfyUI_Flux_1865.jpg (230 KB, 1152x864)
230 KB
230 KB JPG
>>101707087
>>
>>101707121
there is some debate on how txxl5 sees prompts, but, yes style as infront helped alot
>>
>>101707137
>>101707047
can you share prompt?
>>
>>101707245
A faded dark fantasy film still captured on a worn-out VHS tape, exhibiting desaturated hues with a subtle blue tinge. The image is characterized by grainy texture, visual noise, and distracting tracking errors causing misaligned video signals, wobbles, and jarring jumps. Blocky compression artifacts, pixelation, and ringing add to the vintage aesthetic, while cable noise and interference introduce snow, fuzz, and ghostly apparitions floating across the screen, enhancing the VHS unique aesthetic.

Being a hybrid of different creatures, the Sanctuary Guardian possesses various abilities. Its body is part lion, part goat, and has a long, spiked scorpion tail, in addition to two sets of wings.

Its red-eyed lion half has white fur and large paws, which it uses to strike its opponent. Its goat legs have a dark purple color and can perform powerful kicks, and it uses its large set of curved horns to ram its opponent. Its four large wings give it the ability to fly for short periods of time and to release powerful blasts of wind, and its scorpion tail has poisonous properties. In addition to these deadly abilities, the beast can spew lightning projectiles from its mouth.
>>
are we still BREAKing our prompts
>>
File: ComfyUI_00081_.png (2.07 MB, 1024x1536)
2.07 MB
2.07 MB PNG
>>101707142
i wish we could just train one of these modern models to do artist styles as well as SD 1.5 could

this is the same prompt but with your monet thing in front
>>
File: 7rLhoVQJ-gpQMXsWeqiLn.jpg (519 KB, 768x1024)
519 KB
519 KB JPG
ok 1.5cfg FLUX is insane
>>
File: tmpgmfjzf7_.png (993 KB, 768x1024)
993 KB
993 KB PNG
I'm not even sure how this one happened, but either way I'm here for it

>a (centaur) woman outside at a farm, tied shirt, midriff, side slit skirt, wiping sweat

https://litter.catbox.moe/fjomwa.png
>>
>>101707279
huh ok, t5 prompting is wild, you gotta just tell the model a whole ass story now?
>>
>>101707343
or have a llm do it for you. half of it is a dark souls character description from a wiki btw.
>>
File: tmpxpmfbo2e.png (1000 KB, 768x1024)
1000 KB
1000 KB PNG
>>
File: ComfyUI_00082_.png (2.28 MB, 1024x1536)
2.28 MB
2.28 MB PNG
>>101707353
yeah if i could fit an LLM *and* Flux into my GPUs T__T

I might get a $50k grant to work on AI stuff, then i'll get myself a couple A6000s to make flux waifus
>>
>>101707376
ollama with a 7-9B parameter llm doesnt take much to run
>>
>>101706926
Kinda shitty that they made the dev version non commercial (so the big finetuners will not touch it). Schnell is WAY worse than dev despite being the same size.
>>
File: ComfyUI_Flux_1907.jpg (180 KB, 1024x1024)
180 KB
180 KB JPG
>>
File: ComfyUI_Flux_1909.jpg (170 KB, 1024x1024)
170 KB
170 KB JPG
>>
>>101707376
What model and prompt anon? She's lovely.
>>
File: tmp67fen61r.png (1 MB, 768x1024)
1 MB
1 MB PNG
>>
>>101707448
flux dev 3.5 guidance, same prompt in clip and t5, 285 seconds to gen

>Impressionist traditional painting, oil paint on canvas, in the style of Claude Monet. a close-up half-portrait photo of a gorgeous young hapa (half korean) woman wearing a tight pink satin gown with lace trim, she's wearing dark eye makeup and shimmery lip gloss, she has a knowing seductive smirk on her face, she is thicc with large breasts, she is leaning forward showing ample cleavage and a sparkling necklace hanging low between her breasts, she looks elegant and wears fine jewelry, she is sitting at a bar, lit with colorful neon lights
>>
Can someone explains the difference between SDXL, SD Cascade and SD3?
I've been out since sd1.5 and didn't keep up with all the new base models.
>>
>>101707489
>Impressionist traditional painting, oil paint on canvas, in the style of Claude Monet
Not denying the result is kino, but it's concerning how none of this is reflected in what got generated.
>>
File: VUM886yfgxgIvLaip2lYx.jpg (541 KB, 1024x768)
541 KB
541 KB JPG
It's over. FLUX won.

It feels like a true successor to 1.x stable diffusion.

I haven't put it through some of my prompt comprehension tests (where Dall-E failed badly) but its performance on normal photos is amazing.
>>
>>101707511
SD3 is garbage. Don't bother with it. The development was a mess. The SD3 that was released to the public is unusable. Cascade was made by another team and released by SAI. It's kino and really fast, but the community didn't really supported it. SDXL is like a better 1.5 for 1k images. At first it was kind of lackluster, but the community has worked a lot on it and by now there are some very good fine tunes of it.
>>
>>101707548
that kind of photo stopped being normal in 2003
>>
>>101707489
Thanks, I didn't even think flux could make anything outside of uggos or ordinary looking women.
>>
>>101707559
Alright, SDXL is it then, I'll see what it can do.
>>
File: FLUXD_00002_.png (540 KB, 512x768)
540 KB
540 KB PNG
>>101707548
it's great at realism and anime but if you want anything in between like an oil painting it's trash, just straight up ignores that part of the prompt esp at smaller sizes (this was supposed to be an impressionist oil painting), also just SO slow

i miss 1.5 ability to recognize and emulate artist styles and nothing since has come even remotely close
>>
File: tmpkbv6pmnj.png (916 KB, 768x1024)
916 KB
916 KB PNG
>>
>>101707548
prompt?
>>
>>101707559
why does no one talk about cascade anymore, what was it's issues?
>>
File: segs.jpg (76 KB, 1360x450)
76 KB
76 KB JPG
I'm generating scenes with +2 people and using pic related to try to detect the people. It's working good and properly detects each person with a proper silhouette. The issue I'm having is that they come out as one entire segment rather than separate segments. Is there anything that can be done to either separate the people into individual segments or just have the detector work on a specified region of the image?
>>
>>101707601
Can't say for sure, because I haven't used it myself, but some anon was getting some good painting-like results by changing the sampler. I think they were using Euler Karras.
>>
>>101707617
I lost it, but it was something like

>My super busty cousin Winnie Liu's 21st birthday at uncle David's flat in San Francisco. 1998. She always had a very big chest. That weekend she was rocking a bustier bodycon midi with tons of cleavage. grainy photo from an old family photo album.
>>
>>101707674
Thx anon. T5 prompting is so schizo damn
>>
>>101707674
>>101707548

would
>>
>>101707740
That's what my prompts have looked like since the SD1.4 days. Nothing has changed
>>
im using a 3090 and fp8 and this shit still takes a minute per image
>>
>>101707820
45-50s in fp16 for me, but it's on its own dedicated machine.
I'll get a second 3090 sometime next week to test if the trick about running vae and clip will speed up stuff.
>>
File: ComfyUI_00084_.png (256 KB, 512x512)
256 KB
256 KB PNG
NGL this model is amazing. I'm running flex dev on a 3090; it IS a little slow but quality >>>> quantity all the way.

"A still screenshot from a visual novel from the mid 2010's. A cute naked (black:1.1) girl with (massive breasts:1.5) is on the left yelling angrily at a scared Japanese child on the right, with pink cute text on the bottom bar saying "Stop being a little bitch and suck my nigger tits!" "
>>
so /ldg/ won?
Sad.
>>
File: ComfyUI_00093_.png (592 KB, 768x768)
592 KB
592 KB PNG
Here's another version, slightly longer:

>A still screenshot from a visual novel from the mid 2010's. A cute (black:1.1) girl with (massive breasts:1.5) is on the left yelling angrily with a wild expression at a scared Japanese child on the right, with pink cute text on the bottom bar saying "Stop being a little bitch and suck my nigger tits!". Very high quality, extremely detailed, 8k style of anime like a dedicated still CG. The background is a high school classroom with windows on the left and a beautiful ocean view outside. thin transparent menu bar on top and option b8uttons on right.
>>
File: 757815192609401867.png (985 KB, 832x1216)
985 KB
985 KB PNG
For some reason Flux like to put heart pasties on nips, but when nips shows up they look deformed, rarely a good one
https://files.catbox.moe/so79rl.png
>>
File: tmpj_fig4fv.png (1000 KB, 768x1024)
1000 KB
1000 KB PNG
>>
>>101708155
I have also noticed this. Even when I'm explicit about the nudity (and often when I'm not and don't even specify it!) it seems to heart pastie just as often as it has normal nipples just as often as it removes nipples entirely! Weird trait, but I can fap to it.
>>
>>101707963
>>
>>101705249
>>101704773
>actual readable text in /sdg/ on local
Neat.
Are these first tries or is there inconsistency?
>>
File: ComfyUI_00102_.png (1.25 MB, 1536x768)
1.25 MB
1.25 MB PNG
Had to do like five gens, but it got the gist of it.
>Jesus working at a depressing booth with papers and pencil in hand, and a line of niggers coming from the foreground to him. He looks exasperated and tired. Above the booth is a sign saying "Make A Wish Foundation" and a small star of david on the right of the sign. In the background there are coca cola and mcdonalds posters with bold black caption "CONSUME." on the left. On the right wall there are Amazon and Google posters with bold RED caption "OBEY".
>>
File: delux_chi_00045_.jpg (421 KB, 1024x1024)
421 KB
421 KB JPG
just got back from my run. mass reply time

>>101707296
I've been wondering that as well. does flux support conditional operations? I could try, but I prefer the mystery

>>101707326
that doesnt look fully denoised

>>101707376
>yeah if i could fit an LLM *and* Flux into my GPUs T__T
I have a lot of gpt credits but I'm afraid to plug in an LLM node now....

>>101707548
>It's over. FLUX won.
lots of people are saying it cant be trained. that will determine whether its a long term option or not
>>
>>101707657
>>101707601

yeah i have problems getting it to do art styles too, i'll try karras
>>
File: 00198-1335051936.jpg (647 KB, 1022x1862)
647 KB
647 KB JPG
>>
>>101708301
I don't understand this argument. How could it possibly not be trainable? How do you think BFL made the model in the first place? Of course it can be trained; people are just butthurt that they have to rent out an A100 instead of making garbo-tier LoRA's on their 980
>>
File: 1721261055894233.png (1.12 MB, 1024x1024)
1.12 MB
1.12 MB PNG
>>
File: delux_chi_00049_.jpg (618 KB, 1024x1024)
618 KB
618 KB JPG
>>101707963
>visual novels
oooo, good idea

>>101708155
>pasties on nips
damn if only pasties anon were here

>>101708197
they're mostly all first tries
>>
>>101708310
ok interesting, same prompt same seed, just set to karras, definitely more art style
>>
>>101708355
next two seeds just seems kinda noisy and fucked up tho, not really doing the impasto thing like the first one
>>
>>101708377
30 steps instead of 20, fml whyyy not it's going to take even longer
>>
>>101707909
>>101707963
kek. do one with shinji?
>>
>>101707963
Prompt please
>>
File: delux_chi_00053_.jpg (616 KB, 1024x1024)
616 KB
616 KB JPG
>>101708338
>How could it possibly not be trainable?
some people say the liscence, some people say the hardware reqs, some people say the architecture, some people say the available weights. idk how much of it is truth and how much of it is FUD
>>
>>101708431
mixed signals from BFL. The license is permissive to finetune, but the devs say it can't be done
>>
>>101708429
It's the greentext there. That's the whole prompt. I did fp16, euler, and simple.
>>
>>101708429
>im a retard
>>
>>101708431
Schnell is open source and allowed to be tuned, so some effort will be spent to reverse the distillation and then someone is going to run on epoch on top of it with 2 million images and voila.
>>
File: its_flux_time_uwu_00011_.png (1.92 MB, 778x1167)
1.92 MB
1.92 MB PNG
>>101708423
40 steps euler karras
>>
>>101708463
>>101708467
I'm really retarded today, sorry, thanks
I don't know how I missed this wall of text omfg
>>
>>101708473
I've been out of the space for almost a year now so I'm a little out of the loop. What exactly does BFL mean when they say it's a distillation? It doesn't sound like a pruning since dev and schnell are both 12B (and we don't know how large Pro is). What's going on architecturally?
>>101708424
I can excuse pedophilia and embrace racism, but I draw the line at mecha anime.
>>101708455
Do not underestimate 4chan autism
>>
File: its_flux_time_uwu_00013_.png (3.92 MB, 1229x1843)
3.92 MB
3.92 MB PNG
>>101707489
>>101708505
tried with this prompt and it loses a lot the style, i'm going to ask claude to write a 2 paragraph description of impressionist techniques to see if it balances out
>>
so how exactly do i install flux locally from the hf repo? is there a tldr for dumbretards for where to put each file?
>>
>>101708473
simply not happening
>>
>>101708558
the same place you put your other models
>>
>>101708567
simply will happen, sorry
I know your plan was to have a monopoly behind a paywall, but it's happening and you have PHDs that want to fap locally too
>>
>>101708558
flux1-dev.sft/flux1-schnell.sft into models\unet
ae.sft into models\vae
clip_l.safetensors and t5xxl_fp16.safetensors or t5xxl_fp8_e4m3fn.safetensors into models\clip
>>
File: 1707519203270197.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>
File: 1709305556492075.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
reminder open source always wins, dall-e wont even let you prompt certain things cause "controversial".
>>
>>101708431
It's all of those combined. Biggest obstacle: It was never meant to be finetunable and BFL went to great lengths to ensure that you can't simply take the schnell model, finetune it a bit and spin up a competing firm that offers better product than their Pro model. Everything they have done is to ensure that it will never happen and them releasing it under Apache 2.0 means they are pretty damn confident that someone like NAI can't take their model, train it with anime tits and then start competing with them.
>>
File: FLUX__00160_.png (1.04 MB, 896x1152)
1.04 MB
1.04 MB PNG
>>
>>101708515
You have the Pro model which is what the training happened on. That model becomes a teacher. You then have the Schnell model, it's designed to run fast. Instead of training on 100 steps per image, you do 4 steps on Schnell and the training is strictly comparing the output of the Teacher with the Student. In the process the Student gets crippled and loses a lot of the trainability but it still works for the target goal (fast and quality image generation).
>>
File: tmp8t1q9ovh.png (1.13 MB, 896x1152)
1.13 MB
1.13 MB PNG
>>
File: ComfyUI_00120_.png (1.62 MB, 1024x1536)
1.62 MB
1.62 MB PNG
>>101708567
What makes you think it wouldn't? The community has been waiting for over a year for an SD 1.5 replacement, never got it, and has been hoarding compute and desire until Flux hit. Only real achievement of note in the interim was Pony.
>>
File: 1705648547980015.png (1.72 MB, 1024x1024)
1.72 MB
1.72 MB PNG
>>101708607
also it can do a wide range of styles, like this: "in the style of a 1940s comic":
>>
>>101708622
Read: >>101708609
>>
File: tmpuyez20sj.png (1.18 MB, 896x1152)
1.18 MB
1.18 MB PNG
>>
>>101708609
It's hilarious hubris honestly, everything can be reversed. The question is how difficult it will be to get the compute but honestly when you have a very obvious end goal and target it makes research easy because the research simply is
>we want to train Schnell on a 24 GB GPU
Many people will be taking a crack at it and we already know the power of boners and there are many PHDs who literally have access to H100s provided by their universities.
>>
>>101708600
oop i focused too much on the folders while sft was in main
thanks anon
>>
>>101708640
We're talking about computers not magic spells.
>>
>>101708633
>>101708633
>>101708633
>>
>>101708649
I think flux has done more harm to open source than good. People are now gonna wait for something that will never come and will not move on.

BFL is not and does not want to be a Stability. They have different goals. They are looking into competing with MJ and Dall-E, Sora and any video model MJ eventually makes. Their business model is also not similar to what Meta is doing with llama. These open access models are pure marketing and never meant to be the models open source community adopts as the new base to build upon.
>>
File: 00000-1685000406.jpg (2.67 MB, 2016x2592)
2.67 MB
2.67 MB JPG
>>
File: de_fl_00004_.jpg (816 KB, 1344x960)
816 KB
816 KB JPG
>>101708744
>I think flux has done more harm to open source than good
the greatest harm was showing people what a good local model can do. when SAI releases SD3.1 and it can't compete with flux, a lot of people are gonna write off SAI for good at that point.
>>
File: 1720310893245641.jpg (7 KB, 128x112)
7 KB
7 KB JPG
>>
>>101708744
>>101708744
>>101708744
Then we, the community, will have to adapt, because what you're arguing -- in essence -- is that instead of waiting for a crowdfunding or enthusiast to spend a couple grand a hundred hours on a pro bono finetuning project, we should instead wait for a nonexistent new tech startup to spend a couple hundred million and tens of thousands of PhD man hours training a new model from scratch, pro bono.

What is with this fatalistic "full open source or bust" nonsense? SAI is not coming back to save us, and if that's the case we might as well adapt to the new environment and be grateful for the amazing models we're given, and collectively organize to pool our efforts into fewer high quality fine-tunes instead of many, low-quality ones.
>>
>>101708609
I find it difficult to believe "competitive" means anything less than attempting to train a full foundational model. We have the weights. Time will tell.
>>
>>101708864
>1708744
>>I think flux has done more harm to open source than good
Skill issue.
>>
>>101708338
Dev and schnell are distilled models. They were basically trained to imitate Pro. They can only be trained with access to the Pro weights, because Pro acts as a "teacher" model. See this github thread: https://github.com/black-forest-labs/flux/issues/9
>>
>>101709169
>https://github.com/black-forest-labs/flux/issues/9
See https://x.com/ostrisai/status/1819802556261863925?t=NaBwMCu3sz4_fibQ8p2ATQ&s=19
>>
>>101707928

Who fucking cares who "won"?



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.