/g/ - Technology






File: cover.jpg (3.37 MB, 3652x2226)
Discussion of free and open source text-to-image models

Previous /ldg/ bred : >>102779929

No Collage Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/kohya-ss/sd-scripts/tree/sd3

>Flux
https://replicate.com/black-forest-labs/flux-1.1-pro
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
>>102788863
Troll op, 0/10
>>
>>102788863
that's just tragic. /ldg/ truly has fallen
>>
>no collage
so we're just /sdg/ now?
>>
>>102788904
Go back.
>>
>>102788896
Come on! When was I going to get the chance again to not post a collage but making the cover still look like one?
Also, really, that was the only pic I liked in the entire last thread.
>>
honest question why is comfyui buried in the OP as if it's merely one among many valid choices for GUI when really it's the state of the art GUI which should be listed first..? Are we pretending it's second-rate due to a personal quarrel with its maker?
>>
>>102788941
if you're reading the op then you're new to image gen, and if you're new then comfyui generally isn't the best first option. it's something people switch to if they know they need it.
>>
File: comfy.png (918 KB, 1280x720)
>>102788941
If it was first, people would use it, find all the noodles filling their room, spaghetti everywhere.
They would think local image generation was like that, that all UIs were like this, and would run away horrified.
We don't want that.
>>
File: 00116-1553071417.jpg (456 KB, 1280x1280)
>>
>>102788939
Xd I think it’s funny too
>>
Can we get the Trump painting on the moon Ty?
https://youtu.be/zPwMdZOlPo8?feature=shared
>>
>>102785398
Replying to the old thread: no, I was talking about the license restriction applying only to commercial use, i.e. if you *host* their models for a fee like what Pixai/tensorart/whatever is doing with SD models. If you run the models locally, or use images from Flux (whether generated locally or by a commercial host), you won't be affected
>>
File: 00149-1553071421.jpg (785 KB, 1490x1987)
>>
File: 00150-1553071422.jpg (819 KB, 1490x1987)
>>
File: 00171-1553071421.jpg (816 KB, 1490x1987)
>>
File: grid-0655.jpg (697 KB, 3328x2432)
>>
File: 1724649644342624.jpg (640 KB, 2072x2072)
>>
REPA can be combined naturally with microDiT's masking training https://github.com/sihyun-yu/REPA/issues/1
>>
>>102789948
I still don't know if REPA can be used for finetuning or Loras though
>>
>>102789965
>I still don't know if REPA can be used for finetuning or Loras though
https://github.com/sihyun-yu/REPA/issues/2#issuecomment-2408437619
>We haven't tried but I think it can also work with fine-tuning setup.
>>
>>102789965
Can't wait for Prodigy + REPA 1 minute loras
>>
>>102789979
REPA lowers the final loss even further, so you'd get even better quality LoRAs, if it can be applied to them at all
>>
File: 00019-1553071423.jpg (696 KB, 1987x1490)
>>
>>102789965
I'm not sure it will work, as REPA induces a different latent space (from DINOv2)
But hasn't anon here 2 threads ago been finetuning Pixart 1B, not training it from scratch?
>>
>>102790056
>But hasn't anon here 2 threads ago been finetuning Pixart 1B, not training it from scratch?
I think he was training from scratch
>>
>>102790131
and he was already 80% there!
>>
>>102790190
if it only works on scratch it's not that useful desu, yeah sure we'll be able to make quick VAE but I wish it could be used for finetune aswell, so that we wouldn't have to rely on a cucked horse fucker to get our goods kek
>>
File: the longest dick general.jpg (2.45 MB, 1335x2000)
>>102788863
>>
>>102790190
if you were also doing something like that then i wouldnt say you simply sound jealous, but you do
>>
>>102790370
>jealous of a slow cooking tiny model made by a delusional tard
>>
>>102790446
>slow cooking tiny model
show us yours
>>
>>102790471
you're almost there, buddy, only 20% left! I believe in you!
>>
>>102790491
>not slowly cooking his own tiny model
>lashes out at the single anon who is
not even that anon kek
>>
File: 00274-1553071420.jpg (406 KB, 1296x1728)
>>
>>102790507
criticism isn't lashing out, the slow cooker lashed out because he couldn't accept his model wasn't actually "80% there"
>>
>>102790531
still waiting for updates on your model oh wait
>>
>>102790549
>still thinking it is about the model
>>
>>102790578
>noooo!! you cant just say its 80% there!! stahp lying!!! >:(
>>
>>102790595
this but unironically, don't be delusional about your own work
>>
>>102790605
>he doesnt even have work to show
>>
>>102790611
>he needs to train a model to be able to tell if some text matches some image
>>
>>102790627
>still not training his own model from scratch
>>
File: 00283-1553071425.jpg (590 KB, 1296x1728)
>>
>>102790635
>same old tired nonsense comeback
>>
>>102790642
meant to reply to >>102790190
>>
>>102790228
thanks champ
>>
>>102788941
What you should be asking is why are Forge and Automatic1111 still there when reForge is the only one anon talks about desu
>>
File: 00331-1553071426.png (1.27 MB, 1152x864)
>>102790827
They still work? A1111 is great for extensions, Forge for Flux and reForge for everything else
>>
File: 0.jpg (317 KB, 1024x1024)
>>
File: 0.jpg (148 KB, 1024x1024)
>>
File: 00370-1553071420.jpg (517 KB, 1728x1296)
>>
How do I run StableDiffusion with an AMD card?
>>
>>102790446
he's training a whole model from scratch not finetuning, the fact that it even got that far is very impressive. it's an interesting little experiment, not sure why you need to hate on it.
>>
File: 00425-1553071425.jpg (380 KB, 1296x1728)
>>
File: Detailed_00385_.png (1.37 MB, 832x1216)
Any tips for non-green mermaid tails? It feels like gacha to get any other color, even when I prompt for it.

>1girl, monster girl, mermaid girl, mature, adult woman, (green eyes), ({red|pink|purple} mermaid tail:1.5), tail fins, hip fins, fins
blah blah, there's more to the prompt but I want green eyes and non-green tail
>>
>>102790228
hooray! /ldg/ lives after all
>>
hmm no I will not be trying out on my own computer something with a name that reads like a cross between "reaper" and "rape"
>>
pyramidflow sucks
>>
>>102791614
I'd train an S model to test it but I can't find a decent t2i dataset
>>
>>102791562
regional prompting maybe? separate body and face, works for me with color sensitive gens and multiple characters
>>
>>102791562
try doing BREAK before describing the tail to separate it from the green eyes bit. helps with color bleed
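
For reference, the `{red|pink|purple}` part of the prompt quoted earlier in this exchange is variant syntax: one option is meant to be picked per generation. Below is a minimal sketch of how such an expansion could work, assuming flat (non-nested) groups. The function name `expand_variants` is hypothetical and this is not any UI's or extension's actual code.

```python
import random
import re

# Matches one flat {a|b|c} group; nested groups are not handled in this sketch.
_VARIANT = re.compile(r"\{([^{}]*)\}")


def expand_variants(prompt, rng):
    """Replace each {a|b|c} group with one randomly chosen option."""
    def pick(match):
        options = match.group(1).split("|")
        return rng.choice(options).strip()
    return _VARIANT.sub(pick, prompt)


rng = random.Random(0)  # seeded only so the sketch is repeatable
prompt = "(green eyes), ({red|pink|purple} mermaid tail:1.5), tail fins"
expanded = expand_variants(prompt, rng)
print(expanded)
```

If the UI in use does not expand this syntax, the braces and pipes would presumably reach the text encoder as literal characters, so it is worth checking the final prompt that gets logged.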
>>
File: 00564-1553071431.jpg (361 KB, 1296x1728)
>>
Are those jews actually not gonna release flux 1.1 in the end?
>>
File: 1576300769.png (1.32 MB, 896x1152)
>>
>>102789965
# Compute CLIP-based loss
if clip_encoder is not None and zs_tilde is not None:
    # Target features are computed without gradients.
    # Indentation was lost in the post; the no_grad scope is assumed
    # to end after the CLIP forward pass.
    with torch.no_grad():
        vae_dtype = next(vae.parameters()).dtype

        scaling_factor = torch.tensor(
            vae.config.scaling_factor,
            dtype=vae_dtype,
            device=x_start.device
        )

        # Undo the latent scaling before decoding back to pixels
        x_start_scaled = (x_start / scaling_factor).to(vae_dtype)

        vae_device = next(vae.parameters()).device
        x_start_scaled = x_start_scaled.to(vae_device)

        decoded = vae.decode(x_start_scaled).sample

        decoded_images = (decoded + 1) / 2  # Scale images to [0, 1]
        decoded_images = decoded_images.clamp(0, 1)

        decoded_images = decoded_images.to(torch.float32)

        # Resize to the 224x224 input size used for the CLIP encoder
        decoded_images = F.interpolate(
            decoded_images,
            size=(224, 224),
            mode='bicubic',
            align_corners=False
        )

        # The CLIP encoder is run on the CPU here
        decoded_images = decoded_images.cpu()

        zs = clip_encoder(decoded_images)

    zs_tilde = zs_tilde.to("cpu")

    # Unit-normalize both feature sets so the dot product is a cosine similarity
    zs_tilde = F.normalize(zs_tilde.to(torch.bfloat16), dim=-1)
    zs = F.normalize(zs.to(torch.bfloat16), dim=-1)

    # Negative mean cosine similarity, scaled by the loss weight
    proj_loss = -torch.sum(zs * zs_tilde, dim=-1).mean()
    proj_loss *= clip_loss_weight

    # Device is hard-coded, as in the original post
    proj_loss = proj_loss.to("cuda:0")
    terms['proj_loss'] = proj_loss
    terms['loss'] += proj_loss
>>
>>102790446
it must be sad that you don't do things because they might require more than 10 minutes of work
you know there are people who do hobby projects that take literal years
and you're filtered by a script that takes a couple of months to run
>>
File: 880545030.png (1.31 MB, 896x1152)
>>
>>102791334
because it reminds him in that time he has not become a better person
someone doing something productive is an existential threat
>tall poppy syndrome
>crab in a bucket
>>
File: 00643-1553071442.jpg (351 KB, 1728x1376)
>>
>>102792579
>why make a 3D movie in Blender, don't you know it'll take you months to render all the frames?
>don't you know it'll take you months to animate it all?
>just do nothing
>>
File: 821173391.png (1.27 MB, 896x1152)
>>
>>102792815
Neat
>>
>>102792557
Ultimately this just fulfills the same purpose as using perceptual losses rather than more mathematical loss methods. You nudge the direction by using CLIP's ability to determine how similar two images are; this gives the training more context on what needs to change. Where perceptual loss focuses on the visual features of an image, CLIP focuses on the contextual/subject features of the image. In an ideal world you'd use both, but sadly they're both very expensive to compute when most of the time you're operating with limited available VRAM.
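
To make the "how similar are two feature vectors" part concrete, here is a minimal pure-Python sketch of the negative cosine similarity term that the snippet above computes with `F.normalize` and a dot product. The helper names (`normalize`, `alignment_loss`) and the toy vectors are made up for illustration. Real features would come from an encoder, and this sketch does not guard against zero-length vectors.

```python
import math


def normalize(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def alignment_loss(pred_batch, target_batch, weight=1.0):
    """Mean negative cosine similarity between paired feature vectors."""
    sims = []
    for pred, target in zip(pred_batch, target_batch):
        p, t = normalize(pred), normalize(target)
        sims.append(sum(a * b for a, b in zip(p, t)))
    return -weight * sum(sims) / len(sims)


# Same direction: loss approaches its minimum of -1.
same = alignment_loss([[1.0, 2.0, 3.0]], [[2.0, 4.0, 6.0]])
# Orthogonal features: loss is 0, i.e. no alignment at all.
orthogonal = alignment_loss([[1.0, 0.0]], [[0.0, 1.0]])
print(same, orthogonal)
```

Minimizing this term pulls the predicted features toward the direction of the target features, and the weight decides how hard it pulls relative to the main diffusion loss.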
>>
>>102792375
what model?
>>
File: 00565-1553071435.jpg (330 KB, 1296x1728)
>>102792882
thanks

>>102792908
1girl printer sdxl
>>
File: 1070221415.png (1.41 MB, 896x1152)
>>
File: 0.jpg (81 KB, 1024x1024)
>>
File: 00733-1553071420.jpg (489 KB, 1728x1376)
>>
File: FluxLorasTest.jpg (3.13 MB, 9251x1157)
trying out some flux digital art style lora training
>>
File: 0.jpg (229 KB, 1024x1024)
zero. get it?
>>
>>102793366
Cool, who's the artist?

>>102793401
From zero to hero
>>
requesting toned Korra wearing transparent lingerie, stockings, choker, and a cropped leather jacket, standing, erotic pose, arcade, arcade cabinets, neon lights, dynamic lighting
>>
>>102790228
want that cobain prompt/method
>>
>>102793471
no specific artist, just a big mix of polished digital artwork styles like riot splashes and some big name artists
>>
Amaze supports flux now. It's inference from AMD for Windows. Has anyone tried it? I found it doesn't like Wine.
>>
File: 00779-1553071424.jpg (340 KB, 1728x1296)
>>
File: not2B.png (2.69 MB, 1088x1920)
Is there any way to use two different character LoRAs for Flux at the same time? Whenever I generate two different characters, it just blends the facial features of both.
>>
File: 00037-4210461022.png (1.27 MB, 832x1216)
>>102793474
>>
What a fucking nutjob.
Keep your kids away from this man.
complaining that Openai wont code Triton for Windows (when it's already been done 5 months ago as a branch, that works and provides speedup)
But NO! IT MUST BE OPENAI WHO DO IT.

Picrel.
>>
File: ComfyUI_241601_.png (1.09 MB, 1024x1024)
>>
Hi anons, been missing for a bit
any good flux nsfw loras for buttholes?
>>
File: ComfyUI_temp_guqkf_00002_.png (3.41 MB, 1344x1600)
>>102793747
just switch to linux if you want triton so bad, he just wants windows support because that's what most people use and he needs it for his paywalled content
>>
>>102794118
it's already supported in windows, just use the branch. He's calling on his followers to aggressively harass devs, they banned this shit on twitter years ago because it's targeted harassment.
I hope OpenAI sic lawyers on the fuck.
>>
File: 00836-342464344.jpg (340 KB, 1728x1296)
>>
>>102793747
>complaining that Openai wont code Triton for Windows (when it's already been done 5 months ago as a branch, that works and provides speedup)
where? I need the latest binaries
https://github.com/jakaline-dev/Triton_win/issues/2
>>
>>102793317
>>102793586
Very cool
>>
>>102794223
feeding the troll using lawyers is not the answer. They should be smart enough to just not care and let it blow over. The no information for issues has been very prominent throughout all the scandals. Not that anyone would call this a scandal.
>>
Flux got a second version? Can't find it on HF
>>
>>102794223
>I hope OpenAI sic lawyers on the fuck.
because he expressed disappointment on a github page even though you can easily block his ass and call it a day? are you serious?
>>
>>102794399
we won't get anything from BFL anymore, they got the free publicity from us we're not useful to them now
>>
Is there a new way to do bypasses in Comfy? It only works half the time, the other half it works like mute.
>>
>>102793366
A few more steps and you can call it an eldritch lora.
>>
>>102793476
First reference of it is from /lmg/ >>101712018
Then later /ldg/ >>101712800 >>101713541
>>
>>102793476
>>102794650
>A professional real estate photograph selfie in a living room, 24mm, f/16 lens. The background is sharp and in focus. An anime cutout of Hatsune Miku is edited into the photo. There is a photogenic man standing beside her with his hand around her shoulder.

>A professional, high-resolution photograph inside an opulent mansion, 24mm wide-angle lens, f/16 aperture for deep focus. President Donald Trump and Russian President Vladimir Putin standing side by side, appearing photogenic and formal. Behind them an anime cutout of goku from dragon ball Z is edited into the photo. A thoughtbubble above their heads that reads "Flux is cool..."

Just make it cobain instead or whatever
>>
File: file.png (216 KB, 1375x1408)
>>102794300
>https://github.com/jakaline-dev/Triton_win/issues/2
>Then I patched triton/runtime/build.py
Where do I find that? I don't see any "runtime"
https://github.com/wkpark/triton
>>
>>102794701
>Where do I find that? I don't see any "runtime"
It's here anon.
https://github.com/wkpark/triton/tree/main/python/triton/runtime
>>
File: 00870-342464345.jpg (525 KB, 1296x1728)
>>
>>102794568
It works 100% of the time for me. I just attach the left side of rgthree's bypasser to the output of a node while the right side isn't connected to anything.
>>
>>102794887
my issue is that the CLIP is being passed through. I want the right side connected to something.
>>
>>102792557
>>102792893
So... with this code you can make loras with REPA, right? No catch at all?
>>
File: 1999519294.png (1.31 MB, 832x1216)
>>
File: bypass_error.png (158 KB, 1695x1343)
>>102794897
I should have just done a pic to start. Here is a model passthrough failing. Top is bypassed bottom is not. Bottom works.
>>
>>102794944
The catch is it takes time and no small amount of VRAM to do and you would likely need to tune it just like any other way to guide losses.
>>
File: file.png (2.28 MB, 3491x1337)
>>102794988
>The catch is it takes time
but I thought it would make shit faster :(
>>
>>102795023
Your time per step would go up, theoretically your total training time would go down
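
A quick way to see the trade-off: total training time is steps times seconds per step, so a slower step only pays off if the step count drops by more than the slowdown. The sketch below uses made-up numbers (10,000 baseline steps, a 1.4x slower step, 6,000 steps with the extra loss). None of these figures come from REPA or from anyone's measurements in this thread.

```python
def total_hours(steps, seconds_per_step):
    """Total wall-clock training time in hours."""
    return steps * seconds_per_step / 3600


def break_even_steps(baseline_steps, slowdown):
    """Step count at which the slower-per-step run takes as long as the baseline."""
    return baseline_steps / slowdown


# Hypothetical figures, for illustration only.
baseline = total_hours(steps=10_000, seconds_per_step=1.0)
with_extra_loss = total_hours(steps=6_000, seconds_per_step=1.4)
limit = break_even_steps(baseline_steps=10_000, slowdown=1.4)

print(f"baseline: {baseline:.2f} h, with extra loss: {with_extra_loss:.2f} h")
print(f"extra loss only wins below about {limit:.0f} steps")
```

With these assumed numbers the run with the extra loss finishes sooner, but only because it needs fewer than roughly 7,143 steps to reach the same quality.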
>>
File: bypass.png (5 KB, 416x608)
>>102794970
I don't see the bypass node. How do you bypass your nodes? I just do it like this.
>>
>>102795088
I highlight and press CTRL+B. It makes it purple.
>>
File: 00019-1512572424.png (670 KB, 1024x576)
I was gonna get a real gf... turns out my delusions of destiny was just that... delusions
now all I have left is 1girls.
I guess this is the universe paying me back for all those times girls came on to me and I just didn't feel it.
Why does life have to suck so much bros?
>>
>>102794765
>>102794701
>>102794300
LMAO
this faggot has to lurk here there's no way jfc
What an entitled little shit. Begs and bullies to try to get others to do work for him so he can repackage it to sell to his Patreon. I hope more places ban this retard or at least have it become common place to fervently ignore him
>>
>>102795332
anyone want to do this with no code changes? Might be funny to see his entire community get poisoned because of his dumb.
>>
>>102795293
>Why does life have to suck so much bros?
I feel good about myself not having a girlfriend, because all women vote for Kamala so for me they are the ennemy lol
>>
What's the difference between this and SD thread?
>>
>>102794765
What branch do you have to patch though? this one?
https://github.com/wkpark/triton/tree/windows-build-matrix-rework3
And his repo is 8 months old, what's the point of making binaries from such a old thing? Are you sure this is gonna work?
>>
>>102795400
>What's the difference between this and SD thread?
/sdg/ thread is the extrovert thread, they just do small talk about bullshit no one cares about, so it can be active

/ldg/ is the introvert thread, we only talk when there's something interesting going on
>>
>>102795422
hey introvert, don't feed the troll. I understand it may be hard to tell.
>>
>>102795116
You can control many nodes at once with the bypasser though.
>>
>>102795382
I was there but then I met this girl. I was pretty sure I was happy being alone. But then she moved me in ways I never knew I could be moved. But I was wrong. I was dead wrong, Or I was right, but the universe is denying me this thing that I wanted so much.
>>
I thought up some new ways to refine joycaption for better natural language tagging I really want to test but I woke up randomly so sick it's taking everything just to lurk on my phone
it's probably a sign my plan is genius and the powers that be are taking me out..
>>
>>102795489
I use the group bypasser node. It doesn't change that I want to be able to bypass quickly without messing around with extra nodes.
>>
Has anybody ever used StableCascade? I just turned all the settings to max and now it's busy generating a 7.18 GB image that'll take approx 30 mins.
>>
File: 1152032930.png (1.54 MB, 896x1152)
>>
>>102795538
Yeah it's shit, even with finetuning the outputs have somewhat blurry low res details especially around the eyes because of the way it makes images
>>
>>102795558
aaaaw. I was hoping to get blown away by something magical.
(also fuck cloudflare)
>>
>>102795584
Sadly it was right to be abandoned by all
>>
>>102795503
Men have fallen for the disney love story more than women. You gotta shake it anon, it's simply not reality for the overwhelming majority of people.
>>
>>102794765
>https://github.com/wkpark/triton/tree/main/python/triton/runtime
it's more specifically this branch
https://github.com/wkpark/triton/blob/windows-fix/python/triton/runtime/build.py
>>
>>102795538
I remember some fanboys raving about it. I consider it a BFL project. I don't think SAI is going to relaunch it given they shit all over it with SD3.
>>
>>102795584
>aaaaw. I was hoping to get blown away by something magical.
anon, if StableCascade was good, we would be using it
>>
>>102795621
It was made by a separate company in a rush and even that company shit on it saying they didn't have time and it sucked dick
then they slapped a non-commercial license on it so SAI couldn't monetize it either and it quickly got wiped from SAI's memory
Nothing to do with bfl, though I think the bfl team are a bunch of grifters anyway
>>
>>102795600
It felt so real. It was like I found somebody that was made just for me!
It was like everything was finally coming together finally starting to make sense.
But then I hit the ground. Hard. I can't believe this happened to me. I never believed any of that shit how could this happen to me?
I guess I should be grateful. It was absolutely magical while I was ignorant of reality but fuck... what a let down
>>
File: file.png (31 KB, 1107x307)
>>102795612
I don't get it, he wants to modify a line that is already modified
https://github.com/wkpark/triton/blob/c1878dcaf8af6ab5292048aea00a84181f71c61d/python/triton/runtime/build.py#L21
https://github.com/jakaline-dev/Triton_win/issues/2#issue-2583344609
>>
>>102795649
>I think the bfl team are a bunch of grifters anyway
of course they are, they just gave to us the shit distilled versions so that we can talk about their API models and the company, it's just free publicity, now it's up to us to save flux dev, we're on the right path with the undistilled version though
>>
File: 00014-3066860572.png (1013 KB, 1024x1024)
It does kinda suck for half an hour of gen time
>>
>>102795722
This line seems to be for the windows-build-matrix-rework2 branch, Idk why he didn't go for the rework3 or the windows-fix branch, this is so confusing
https://github.com/wkpark/triton/blob/7f35df7f65e427ad04ac1dba4167760c2f7e97ae/python/triton/runtime/build.py#L21

>>102795819
>It does kinda suck for half an hour of gen time
that's why I'm trying to make triton work on windows, it speeds shit up
>>
>>102795855
>that's why I'm trying to make triton work on windows, it speeds shit up
i'm tired bros...
>>
File: file.png (100 KB, 2341x838)
>>102795855
>This line seems to be for the windows-build-matrix-rework2 branch, Idk why he didn't go for the rework3 or the windows-fix branch, this is so confusing
I think he went for the matrix-rework 1 because that was the branch used to make those binaries back then
https://github.com/wkpark/triton/actions/runs/7518654030
>>
File: ComfyUI_241638_.png (1.18 MB, 1024x1024)
>>
that new flux turbo lora seems to let you use the lcm sampler effectively with flux, which is great
lets you benefit from lcm's superior coherence
>>
>>102796799
Looks very noisy.
>>
Where can I find people discussing the inherent rules of Flux prompting? Like Youtube videos (or articles, whatever).

Surely we aren't just accepting that the Flux LLM can perfectly understand everything you write? So what are its limitations? Is anyone exploring and documenting it?
>>
>>102796799
where can i find it?
>>
>>102797211
i usually just watch this general and mentally note down the occasional tip
this is from last thread: >>102780699
other than that, i can say ive noticed that despite T5 having quite a bit of context, clip has a much shorter context, and the first sentence or so of your prompt is really important for conveying the core idea of what you want to see, after which you can flesh it out with more details as needed
look at how it interprets your concept and rewrite it in a few different ways to see what it understands best, it is made to understand human language but some things it can relate better than others so you just have to trial and error it
>>
>>102797287
The tip about boomer prompting is only relevant for Schnell.
>>
File: 00320-3729972195.png (1.16 MB, 896x1152)
Frozen IV
>>
>>102797088
nta but Flux just does that on some gens even with zero loras or modifications whatsoever
you must be new
>>
File: 02504-1444409177.png (933 KB, 896x1152)
>>102797493
>>
>>102797509
Doesn't on my machine. Check and mate.
>>
>>102797513
truly weird lie to tell
>>
>>102797509
>nta but Flux just does that on some gens even with zero loras or modifications whatsoever
the fuck you talk about? never happened to me
>>
>>102797513
>>102797658
samefag
>>
File: 1581829406761.png (38 KB, 678x525)
OOTL nigga here. XL loras don't work with illustrious checkpoints, correct?
>>
>>102797694
Ye
>>
>>102797418
I gave that piece of advice knowing full well that nobody really 'gets' boomerprompting, so it was reckless of me. Little or no chance of it being helpful.

It's actually the most worthless piece of advice I could give. Since "boomerprompting" was coined to describe my prompts way back when, telling people to boomerprompt is my lazy way of saying "prompt like I do". Really the only thing unique to boomer prompting is the idea that expressions in common parlance are useful and prompts aren't an itemized list of things to be included in the image. I used to illustrate this to people by saying they should try prompting "girl", then prompting "the it girl", and see how they compare—since most assume "the" and "it" are meaningless noise tokens.
>>
>>102798418
>>102797418
also I forgot to say I don't use schnell, I use dev nf4. So my advice is tailored to flux-dev.
>>
File: edd thumbsup.png (535 KB, 560x840)
>>102798391
ty
>>
>>102797694
They work
>>
>>102795332
>this faggot has to lurk here
you just blow in from another general? he paywalled another anons script awhile ago
>>
File: ComfyUI_06266_.png (2.89 MB, 1280x1280)
>>102798418
>aren't an itemized list of things to be included in the image
Bulleted list proompter on flux dev here, running the hyper 8 step and still get it to adhere a lot better than SDXL
>>
>>102798727
>>102798391
ok, ill need a 3rd opinion now
>>
>>102798819
my opinion is worth at least two so you should trust it more
>>
>>102793720
She looks uncomfortable
>>
>>102798443
are you running nf4 on comfyui or on forge? i was using it before on forge but now that im trying out comfyui i just get errors using the nf4 loader node
>>
File: RETARD words can kill.jpg (49 KB, 827x498)
so im stupid and dont follow how all this works, has flux been further optimized with like, updates and stuff at all? with the nf4/ggoofs has it been steadily running quicker and quicker?
would be nice if i could at least try it at like, 15s/it on my 1080 instead of 28s/it like the last time.
>>
>>102798911
comfy. I'm also using CheckpointLoaderNF4, it works for me
>>
>>102798825
oh that will make this easier. which one of those post are you tho
>>
>>102798942
do you use loras with it?
i think it's either that or the fact that im using the windows portable installation
>>
>>102799016
no, loras do not work
>>
File: 00033-427894485.jpg (181 KB, 896x1152)
>>102799030
wow thats a bummer, i liked nf4 with loras but im done with forge
>>
>>102799131
wow he's literally me
>>
File: file.png (40 KB, 2760x276)
wtf, this is way faster on WSL2 compared to running on regular windows, I usually had 3.7s/it now it goes under 3s/it
>>
>>102799177
why does that make it faster? if i do it will it be faster too?
>>
>>102799177
>wtf, this is way faster on WSL2 compared to running on regular windows, I usually had 3.7s/it now it goes under 3s/it
kek my b, I forgot I'm running on the new torch 2.6 + cu124 on wsl2, that makes shit faster of course
>>
File: 0.jpg (82 KB, 1024x1024)
>>
>>102799207
>why does that make it faster? if i do it will it be faster too?
no I don't recommend it at all, loading models takes ages somehow and I thought it would be useful for the TorchCompileModel node (it only works on Linux and supposedly WSL) and yet I still have an error lol
>>
Bigma status?
>>
>>102799522
Bigma balls
>>
File: 0.jpg (131 KB, 1024x1024)
>>
Amuse is the name. I can't post a duplicate image. But anyway, Amuse, provided by AMD, and for 7900 XT/XTX. It supports Flux. but is it faster than ComfyUI?
>>
>>102795293
You stop caring about your past mistakes once you realize that even when women do come onto you and you seem to do everything right they'd rather play games and toy with your emotions than have a serious relationship. It does not matter what you do. Men who have gfs are just winning the lottery and getting lucky. No such thing as doing anything right.
>>
File: file.png (5 KB, 743x55)
>>102799177
>>102799208
yeah I went back to regular windows and the speed is virtually the same, but the improvement is quite impressive from torch 2.3.1 (3.8s/it) to torch 2.6.0 nightly (2.95s/it)
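
Putting those two numbers side by side, a small sketch of the arithmetic. It uses only the 3.8 s/it and 2.95 s/it figures quoted above. The 20-step run length is an arbitrary example, not something from the post.

```python
old_s_per_it = 3.8   # torch 2.3.1, as reported above
new_s_per_it = 2.95  # torch 2.6.0 nightly, as reported above

speedup = old_s_per_it / new_s_per_it
time_saved = 1 - new_s_per_it / old_s_per_it

steps = 20  # arbitrary example run length
seconds_saved = steps * (old_s_per_it - new_s_per_it)

print(f"{speedup:.2f}x faster, {time_saved:.0%} less time per step")
print(f"about {seconds_saved:.0f} s saved on a {steps}-step gen")
```

That works out to roughly a 1.29x speedup, or about 22% less time per iteration.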
>>
>>102799600
Women were created by satan. Knowing this, nothing about the world is strange or surprising.
>>
>>102799611
>Women were created by satan.
true, and I'm sad I'm not a faggot because I'm attracted to those devils, fok
>>
>>
>>102799734
truly the only interesting technique to come out of flux
>>
If you're in Windows, are there any advantages to running Comfy in WSL instead of windows native?
>>
>>102799784
none >>102799323
>>
File: file.jpg (407 KB, 2338x1518)
Why the fuck Karras is so weird on Flux kek
>>
File: 2024-10-12_00004_.png (1.16 MB, 720x1280)
>>102799756
>>
>>102799797
blah, thanks
I was interested because of torch compile too
>>
>>102799839
>I was interested because of torch compile too
there's a way to get Triton installed on windows to get torch compile but I couldn't do it, got random errors during build, I fucking hate building wheels, it never works >>102795991
>>
>>102799860
>Triton
wat
>>
>>102799874
you need triton to get torch compile, but it's not "supposed" to work on windows because OpenAI hate us, didn't deter some people from trying though
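
Before fighting with wheel builds, it can help to check what the current Python environment actually has. A minimal sketch using only the standard library follows. The function name `torch_compile_prereqs` is made up, and it only tests whether the `torch` and `triton` packages are importable, not whether `torch.compile` would succeed on a given GPU.

```python
import importlib.util
import platform


def torch_compile_prereqs():
    """Report the OS and whether torch and triton can be found by the import system."""
    return {
        "platform": platform.system(),
        "torch": importlib.util.find_spec("torch") is not None,
        "triton": importlib.util.find_spec("triton") is not None,
    }


status = torch_compile_prereqs()
print(status)
```

Run it with the same Python interpreter the UI uses (for example the one bundled with a portable install), otherwise it reports on the wrong environment.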
>>
File: file.png (728 KB, 771x1252)
>>102799829
k nevermind, exponential is the worst one
>>
>>102799734
>>102799836
Now upscale
>>
>>102799921
what's the best ai upscale model right now?
>>
>>102799921
I should. But first, have this delight :^) 10 iterations, so I'm trying 20. Yes I don't know why it's so bad.

https://files.catbox.moe/q9s5c3.png
>>
File: image.png (875 KB, 640x1040)
I tried using this trendy new face restoration model some anon linked a few threads ago
https://huggingface.co/spaces/ohayonguy/PMRF
>>
>>102799895
>exponential
why?
>>
>>
>>102799963
why not? I'm trying some stuff
>>
>>102799972
Did anything happen?
>>
File: image (1).png (323 KB, 512x512)
>>102799957
here we go again.
>>
>>102799989
>Did anything happen?
yeah, I'm not sure if I'll be able to sleep that night, that picture will haunt me for long kek >>102799895
>>
can flux be used as an upscaler?
>>
btw it looks like Flux (dev) isn't very smart at 2048x2048.
>>
>>102800015
It's the same situation as sdxl where you don't want to use anything but 1024x1024. The difference is you get different kinds of flaws instead of body horror cloning.
>>
>>102800080
>It's the same situation as sdxl where you don't want to use anything but 1024x1024.
I'll generously assume you mean 1024x1024 equivalent resolutions in other aspect buckets, but it's still wrong because FLUX can do lower resolutions whereas SDXL couldn't.
>>
File: downscaled trash gen.png (1.04 MB, 1024x1024)
>>102800080
You actually may escape flux's skinnygirl plague, but other problems emerge. My card is too slow for very much experimentation.
>>
File: 2024-10-13_00001_.png (309 KB, 512x512)
>>102800131
idk bro. flux gave me finger problems at 512x512
>>
>>102800131
>but it's still wrong because FLUX can do lower resolutions whereas SDXL couldn't.
no it makes sense to want to render as high a resolution as you can because AI doesn't handle tiny details well.
(so you can't go over 1024x1024, but you shouldn't use smaller resolutions either)
>>
>>102800143
>flux gave me finger problems at 512x512
512 is small though so I can't blame the model
>>
>>102800154
How's it work? Why do some things only work at certain resolutions?

many oddities to ai
>>
File: 00101-2494760407.png (2.13 MB, 896x1152)
>>
>>102800168
probably because of the VAE: the more channels it has, the more it can do with fewer pixels (Flux's VAE has 16 latent channels, SDXL's has 4)
>>
>>102800176
Don't you think it's ironic that I'm the only one on here to explore the effects of large gens, despite having a way slower card? It's pretty weird.
>>
>>102800197
I did it too, and the result was shit so I stopped doing it lol
>>
>>102800213
some of the secrets of how flux blocks tits are to be found in image size differences.

I suspect there are aspects of advanced ai creation that are different from what we have been told.
>>
>>102800143
the actual explanation is, the base Flux model was trained on BOTH low resolution and high resolution images, whereas SDXL was not. However, you COULD train a Lora for a new concept on SDXL at 512px and it would work fine if you actually used it at 512px. Similarly, if you train a 512px Flux Lora, it will work fine at that resolution, but likely not work fine at any resolution that goes above its maximum buckets, for obvious reasons.

TLDR Flux does not have some kind of magic internal upscaler of any sort, it works exactly the fucking same as every model ever, the only difference is the presence of multiresolution training in the base model. It will not magically make your 512px Lora for a random NSFW concept it had no prior knowledge of work properly at 1024px, or anything like that.
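For anyone wondering what "buckets" means here, a rough sketch of aspect-ratio bucketing in the style trainers commonly use: every bucket keeps the pixel count at or below a cap, and each training image goes to the bucket with the closest aspect ratio. The 1024x1024 cap, the 64px step and the side limits are assumptions for illustration, not the actual Flux training recipe, which isn't public.

```python
def make_buckets(max_area=1024 * 1024, step=64, min_side=512, max_side=2048):
    """Enumerate (width, height) pairs whose area stays at or below max_area."""
    buckets = set()
    for width in range(min_side, max_side + 1, step):
        # Largest height (a multiple of step) that keeps width * height <= max_area.
        height = (max_area // width) // step * step
        if height >= min_side:
            buckets.add((width, height))
            buckets.add((height, width))  # mirrored orientation
    return sorted(buckets)


def nearest_bucket(width, height, buckets):
    """Pick the bucket whose aspect ratio is closest to the image's."""
    ratio = width / height
    return min(buckets, key=lambda b: abs(b[0] / b[1] - ratio))


if __name__ == "__main__":
    buckets = make_buckets()
    print(len(buckets), "buckets")
    print(nearest_bucket(1920, 1080, buckets))
```

With these settings the list includes 896x1152 and 832x1216, which is why those sizes keep showing up as "1024x1024 equivalent" resolutions. Drop max_area to 512 * 512 and you get the bucket set a 512px Lora would have seen, and nothing above it.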
>>
>>102800307
so 512x512 training exists in a separate space in the ai? This is extremely interesting.

This means Flux was for some reason also trained on some very low resolution images.
>>
>>102800307
Also, I don't think any of this actually explains the nature of hands getting fubar
>>
>>
>>
File: ComfyUI_241829_.png (1.16 MB, 1024x1024)
>>
the traitors and quislings in this thread have been NOTED. it is going down in your permanent records.
>>
>>102800762
>quislings
I learned a new word today, kek
>>
File: file.png (2.15 MB, 1024x1024)
damn Migu is tall, let's not forget that Trump is a 6'3 guy
>>
I hate when people take note of my treachery
>>
>>102800844
you have such a way with words anon uwu
https://www.youtube.com/watch?v=7mBqm8uO4Cg
>>
>>102800844
thank you for your feedback.

regards,
hr
>>
File: Saveme_00047_.png (1.04 MB, 1024x1024)
flux doesn't want to be an upscaler.
>>
File: file.png (1.43 MB, 1024x1024)
>>
File: 2272341952.png (915 KB, 896x1152)
>>
i sperg, therefore i am
>>
>>102790219
I think you can finetune the published result. But PA-S is just fast enough to train from scratch on plebeian hardware, so anon did.
>>
>>102801279
Oh Hi Mark!
>>
>>102801343
oh hi, uh, you! well met!
>>
>>102801355
don't tell me you don't have the reference?
https://www.youtube.com/watch?v=zLhoDB-ORLQ
>>
>>102801361
no, i've not watched anything even remotely related to the Room, nor Niel Breen or anything similar. take that faggotry to /tv/. i'm not watching your clip, either. but i know what it is, from the thumbnail. and it is cancer
>>
>>102801390
>ake that faggotry to /tv/. i'm not watching your clip
are you a retard? that's a reference to your gif moron >>102801279
>>
File: 818477066.jpg (2.52 MB, 2688x2304)
>>
File: scale_1200.jpg (81 KB, 1200x601)
>>102801410
that gif was sourced via bing image search "consider suicide gif", not based on esoteric knowledge of shit cinema. i found an image of some nigga putting a gun in his mouth, something you should probably consider, and instead of blowing your useless brains out you're pestering me about knowledge of obscure cinema auteurs. as i said, i know nothing about the room, save /tv/ memes about same, and those... well, if you believe anything you read on 4chan, you're a goddamned retard.
since you're a retard, write a 2000 word essay about the movie in pic rel, which i also haven't seen, either.
>>
File: file.png (277 KB, 600x600)
>>102801449
>the room
>obscure
>>
>>102801465
yeah, all the normies have seen that flick. its a normie classic. i, personally, have had deep discussions around the water cooler about Tommy Wiseau, and all his classic filmography. they, to a man, love his stilted delivery and otherwise terrible acting. they even say it's better than Game of Thrones, which as we all know is the peak of TV cinematic experiences!
>>
it amuses me that there's a The Room enjoyer here, how many other /tv/ fags are lurking here? we welcome you to this safe space, free from joker rape threads
>>
File: file.png (2.17 MB, 1024x1024)
It's all right ladies, I identify as a woman
https://www.youtube.com/watch?v=3OR_D2EEPS4
>>
>>102801551
5 dollah sucky sucky, me love u long time
>>
>>102801551
niggers really be out here generating shovelhead whores like we're supposed to be impressed.
>>
>>102801298
>PA-S is just fast enough to train from scratch on plebeian hardware, so anon did.
What's PA-S?
>>
File: 58000.png (3.94 MB, 1440x3120)
don't be aroused, by my confession
unless you don't give a good goddamn about redemption. i know...
christ is coming, and so am i.
and you would too if the sexy devil
caught your eye
>>
File: file.png (89 KB, 498x281)
>>102801642
nice poem anon
>>
>>102801652
credit goes to james maynard keenan
>>
>>102801677
sorry, maynard james keenan. my bad
>>
File: file.png (2.77 MB, 2492x1273)
I think for de-distill, Dynamic Threshold works the best
>>
>>102801697
ur noodles are shit and you are shit for usijng noodles. real niggers use slapdash, quasi-abandoned russian based software. while it may suck that voldy died valiantly in ukraine, for basically no reason, if he had only lived auto1111 could have been something great. alas, the department of state got it's wicked way, yet again
>>
at least auto has a stable version. perhaps one day cloning will bring back those who died in the Ukrainian civil war. slava whatever and such like.
>>
So I am getting SD Forge to finally try out this Flux thing.
I am creating its venv, what python version does current SD Forge run on? 3.10 like the rest?
>>
>>102801745
>3.10 like the rest?
ComfyUI uses 3.11 now
>>
>>102801750
Okay that's nice to know I guess but what does Forge use?
>>
>>102801764
I have no idea, I guess you have to install it to know the answer
>>
How is flux keeping the subject and background separated?

It has to have a way of enacting the blur.

My take is somehow blur is a program they've embedded.

What other programs can they embed? Can they embed a whole trojan?
>>
>>102801835
what? the model just learned how to handle blur correctly, that's all
>>
>>102801745
It's downloading CUDA 12.1 instead of 12.4 so I am assuming Forge too moved to 3.1 but whatever, as long as it works I don't care too much.
>>
>>102801852
It has to have a way of knowing where the edges of subjects are.

Does flux know about foreground blur?
>>
File: file.png (2.12 MB, 1024x1024)
>>102801908
>Does flux know about foreground blur?

>A photo of a woman picking flowers, tree, foreground blur
Lawl it blurred everything
>>
any happenings? china models?
>>
Since I am using an AMD card, Flux can't run the Nvidia trojan on my machine. I feel so chad.
>>
>>102801960
a crazy face detailer/unblurer. it didn't work on the photo of Lincoln I gave it.
>>
>>102801973
>a crazy face detailer/unblurer
can you share a link? that sounds interesting
>>
File: 367146048.png (1.69 MB, 896x1152)
>>
>>102801957
lol
>>
-- SPOILER ALERT --
-- SPOILER IMMINENT --
-- THIS POST SPOILS FROZEN --
-- SKIP THIS POST IF YOU DON'T LIKE SPOILERS --

-

-

-

-

>>102795719
Just like Anna in Frozen.
>>
>>102791334
>not sure why you need to hate on it.
I'm hating on the retard that called it "80% there", not the work. Did you even see the image he claimed was 80% of the prompt? Cause it was a vague 1girl shape and had nothing else of the prompt, he is a retard. He'll never live down that retarded figure.
>>
File: 3721939222.png (1.6 MB, 896x1152)
>>
>>102802003
https://huggingface.co/spaces/ohayonguy/PMRF
>>
File: FIRST.png (1.04 MB, 896x1152)
First shit prompted to flux
On a related note: is it normal that flux takes three times as long to generate as SDXL on my system (dev-bnb-nf4)? I guess I may need to tinker with some settings.
>>
File: file.png (2.29 MB, 1024x1024)
>>
>>102802110
>Is it normal that flux is taking three times as long to generate than SDXL on my system?
Anon, Flux is a 12b model, SDXL is a 2.7b model, that's more than 4x as big, of course it's slower to run it
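Back-of-the-envelope version of that, as a sketch: the parameter counts are the ones quoted in this thread (not measured here), and the size estimate covers the weights only, ignoring quantization overhead, text encoders, the VAE and activations.

```python
def weight_gib(params_billion, bits_per_param):
    """Approximate size of the weights alone in GiB (no activations, no text encoders)."""
    return params_billion * 1e9 * bits_per_param / 8 / 2**30


# Parameter counts as quoted in the thread, not measured here.
FLUX_B = 12.0
SDXL_B = 2.7

size_ratio = FLUX_B / SDXL_B

if __name__ == "__main__":
    print(f"Flux is about {size_ratio:.1f}x the size of SDXL")
    for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
        print(f"{label}: Flux ~{weight_gib(FLUX_B, bits):.1f} GiB, "
              f"SDXL ~{weight_gib(SDXL_B, bits):.1f} GiB")
```

So even a 4-bit Flux is pushing several GiB of weights through every step, and anything that doesn't fit in VRAM gets offloaded, which slows things down further.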
>>
https://reddit.com/r/aivideo/comments/1g2lxq5/dreamina_20_a_new_ai_gamechanger_is_coming/
Why are the chinks so good at video model?
>>
File: 1499726074.png (814 KB, 896x1152)
>>
>>102800343
>This means Flux was for some reason also trained on some very low resolution images.
They just used the same images they trained on bigger resolutions and shrunk them to smaller sizes and trained them as well.
The result is that the smaller resolution you use, the higher the detail, if you make a pic at 1024x1024, and another at 512x512, and shrink the 1024x1024 down to that size, the detail of the native generated 512x512 image will be twice as high.
>>
How much VRAM do I need to make AI gens without problems?
>>
File: 2042460918.png (1.17 MB, 896x1152)
>>
>>102802067
Prompt? I had to take a closer look to see if real or ai.
>>
File: 1058085040.png (1.05 MB, 896x1152)
>>102802177
Pretty sure that was an empty prompt, flux1-dev-Q8_0
>>
>>102802110
Note you may not need more than 21 steps with Flux unless you're generating difficult text.
>>
>>102802159
64GB.
>>
File: 3764305159.png (896 KB, 896x1152)
>>
>>102802123
Glad to hear that nothing is off.
>>102802184
Is there a quick start flux transition guide for people who already know SD?
For example, I believe flux doesn't have negative prompts (a quirky extension is needed for them).
>>
File: 1245795964.png (921 KB, 896x1152)
>>
File: 1706822531.png (1.22 MB, 832x1216)
>>
>>102802234
>Is there a quick start flux transition guide for people who already know SD?
people can't agree on anything right now so I doubt you can find anything.

To answer your question about negatives: a CFG of 1 disables negative prompts. CFG 1 is recommended for flux unless you have a node for it or are using an undistilled flux. The different flux versions (schnell, dev, undistilled) were created using different methods, so a lot of advice falls short: most people just assume dev, and a few important things don't carry over between versions.
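Why CFG 1 kills negatives, as a sketch: classifier-free guidance mixes the positive-prompt prediction with the negative/unconditional one, and at scale 1 the negative term cancels out. The function name and toy numbers are made up for illustration; real UIs do this on latent tensors, not Python lists.

```python
def cfg_combine(cond, uncond, scale):
    """Classifier-free guidance: uncond + scale * (cond - uncond), elementwise."""
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


if __name__ == "__main__":
    cond = [0.5, -1.25, 2.0]      # toy prediction for the positive prompt
    neg_a = [0.25, 0.75, -1.0]    # toy prediction for one negative prompt
    neg_b = [-2.0, 0.125, 4.0]    # toy prediction for a different negative prompt

    # At scale 1 both negatives give back exactly the positive prediction.
    print(cfg_combine(cond, neg_a, 1.0), cfg_combine(cond, neg_b, 1.0))
    # At a higher scale the choice of negative changes the result.
    print(cfg_combine(cond, neg_a, 3.5), cfg_combine(cond, neg_b, 3.5))
```

As far as I understand it, the "distilled CFG" value on dev is a different thing: it's fed to the model as an input rather than applied through this formula, which is why it still does something at CFG 1.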
>>
https://github.com/eloialonso/diamond/tree/csgo
This is pretty cool but
>The provided configuration took 12 days on a RTX 4090.
is really impressive, I myself could have trained this. Anyone with 3-4 4090s or 3090s could train something much better in a couple weeks
>>
Does Flux not like Karras or something? I either get blur or deformed shit when I try it. It also refuses to work with a lot of other samplers (the DPM bla bla ones).
Euler Simple seems to work but sometimes results aren't the best so I would like to have other options. I can use some other schedulers under Euler too but the only other sampler that seems to work and bring somewhat decent results is [Forge] Flux Realistic, which is maybe limited to realistic images as the name implies? Does anyone know other options?
Asking for Flux1-dev-bnb-nf4.
>>102802941
Well thanks anyway anon. Lots of trial and error and asking shit here it is then.
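For reference, a sketch of what the Karras scheduler does differently: it uses the spacing formula from Karras et al. (2022) with rho = 7, which packs most of the steps into the low-noise end. The sigma range below is an SD-style assumption picked for illustration, not Flux's actual range. Flux is a flow-matching model with its own timestep spacing, so a schedule built for a different noise parametrization may put steps in odd places, but that part is a guess, not a confirmed explanation.

```python
def karras_sigmas(n, sigma_min=0.03, sigma_max=14.6, rho=7.0):
    """Noise levels per Karras et al. (2022), descending from sigma_max to sigma_min."""
    lo = sigma_min ** (1 / rho)
    hi = sigma_max ** (1 / rho)
    # Interpolate linearly in sigma^(1/rho) space, then raise back to the rho-th power.
    return [(hi + i / (n - 1) * (lo - hi)) ** rho for i in range(n)]


def linear_sigmas(n, sigma_min=0.03, sigma_max=14.6):
    """Evenly spaced noise levels over the same range, for comparison only."""
    return [sigma_max + i / (n - 1) * (sigma_min - sigma_max) for i in range(n)]


if __name__ == "__main__":
    for k, l in zip(karras_sigmas(11), linear_sigmas(11)):
        print(f"karras {k:8.3f}   linear {l:8.3f}")
```

Halfway through the steps the Karras schedule is already far down in noise level compared to the even spacing, which is the whole point of it on the models it was designed for.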
>>
hm
>>
I love when windows restarts my pc in the middle of the night and I wake up to 150 gens instead of 1000
>>
File: 1392459546.png (1006 KB, 832x1216)
>>102804240
What are you doing with so many gens?
>>
File: 3260348113.png (1.15 MB, 832x1216)
>>
>>102801960
bigma soon
>>
>>102802082
Neat
>>
>>102804240

I used to do that with my old 6gb card two years ago when 1.5 was king.
>>
Fresh

>>102804738
>>102804738
>>102804738
>>
File: 1172453123.png (1.48 MB, 1024x1024)
>>102804524
Thanks
>>
>>102803014
for samplers use euler mainly, and ipndm as an alternative
for schedulers, i usually prefer beta over simple
some other schedulers like sgm uniform deliver the same result as simple with only minor differences
as for cfg, yeah set it to 1 and use distilled cfg around 3.5 to start with
>>
>>102805347
Thanks for the response anon




All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.