/g/ - Technology






File: the longest dick general.jpg (1.58 MB, 1526x2000)
Discussion of free and open source text-to-image models

Previous /ldg/ bred : >>102764387

You Need to be Over 20 (sampling steps) to Post Here Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/kohya-ss/sd-scripts/tree/sd3

>Flux
https://replicate.com/black-forest-labs/flux-1.1-pro
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
at least baker found it funny
>>
Blessed thread of frenship
>>
I need to figure out exactly what year baker came of age and where he's from so I can pander directly to his taste in women
>>
File: 01312-3243779270.jpg (607 KB, 1361x1814)
>>102779929
Nice collage. 'Je ne sais quoi' like smelly people like to say.
>>
>>
+1 for flux hate!
>>
>>102780017
I hated flux at first but I've grown to love it
>>
>>102779398
>tried out Aria locally, in bf16, for captioning primarily NSFW images.
>It fucking sucks. First off, most notably, by default it will exclusively use gender neutral language (is this a ChatGPT thing? qwen also does it...). "A person", "an individual", "a character". Will never say man or woman.
for me everytime a model does this shit it's eliminatory, there's no point in using a caption model that will completely destroy the concept of men and women
>>
So what schedulers and samplers should I use for flux
>>
>>102780050
Models refuse to acknowledge any diverse characteristics of people which is highly ironic given how that's all woke people think about.
>>
Imagine still using Flux in October of the year of our lord 2024
>>
>>102780072
>So what schedulers and samplers should I use for flux
there's no magical schedulers and samplers that'll make everything consistently better anon, that's why I went for the basic one, euler simple
https://reddit.com/r/StableDiffusion/comments/1em7zy3/testing_the_samplers_schedulers_with_a_xy_plot/
>>
>>102780106
what do you want us to use instead?
>>
>>102780124
nothing, it's all shit. fuck this shit hobby,
>>
>>102780145
why are you here then?
>>
>>102780149
to cope
>>
>>102780124
For anime: NoobAI-XL
For everything else:
>>
What's an effective browser based text-to-image model for making background patterns?
>>
imagine still hating Flux in current month with all our new prompting tech
>>
>>102780183
>browser based text-to-image model
>>
>>102780180
>For anime: NoobAI-XL
so it surpassed pony? damn I should try that shit then
>>
>>102780204
I'll use it locally in my browser I promise
>>
File: 01385-534283553.jpg (334 KB, 972x1296)
>>
>>102779783
i posted >>102767420 and I thought it was funny.
>>
Is it normal that forge constantly crashes when working with Flux?
>>
>>102779984
this is not a joke btw, if anyone is an occasional baker please immediately tell me where you're from (location & race) and approximately how old you are plus any proclivities you have so I can immediately start making special gens just for you
>>
>>102780362
not a baker but you can start by making something visible to the naked eye
>>
Anyone else had issues with "prompt ghosting", lingering prompts and seemingly maybe even seeds? Haven't found a fix, I've tried restarting, switching loras, models, etc.
>>
someone should put his gens through one of those new hyped image enhancement models
>>
>>102780424
Should clarify I'm using pony diffusion xl and Automatic1111

Nvidia gpu
>>
>>102780189
>new prompting tech
I only know of one and that's not really prompting it's more latent manipulation
>>
>>102780442
A1111 is a lousy piece of software with terrible memory management. Use something else.
>>
>>102780424
>lingering prompts
this is a real thing and no one's figured out a solution other than restarting your UI. maybe reloading the model would fix it as well.
>>
File: 1553392500.png (1.49 MB, 896x1152)
>>
File: grid-0434.jpg (462 KB, 1728x2304)
>>102780477
Could be the old gpu memory thing. Restart gpu driver with Win+Ctrl+Shift+B
>>
>>102780528
e-girl goth thoth her
>>
finally a 1girl bread
>>
File: file.jpg (257 KB, 2363x1454)
https://civitai.com/models/843551/fluxdev-dedistilled?modelVersionId=943939&dialog=commentThread&commentId=560467
is undistilled dev that big of a deal?
>>
>>102780583
first non 1girl image of the thread
>>
>>102780573
>finally a 1girl bread
/sdg/ exists
>>
File: 1973929457.png (854 KB, 896x1152)
>>
>>102780655
why the long face?
>>
>>102780451
The things I've learned are
- avoid saying "woman"/"girl"/etc (prominently, that is) since these are overtrained
- boomerprompt like in the SD1.5 days
- always supply some key details T5 needs, like a location and some info about clothing
- Don't prompt more than one dense short paragraph. Don't concat multiple prompts

>>102780382
The tension is that you can't quite have her. Her hazy form you can't quite make out, features of her face scantly descried, blemishes kindly veiled under blur and artefacting, the mind makes of this uncertain thing a hidden unearthly perfection and gropes toward it...
>>
File: grid-0439.jpg (393 KB, 3456x1376)
>>102780553
>>
>>102780424
>>102780442
Stop using A1111. The author has already abandoned it.
>>
File: 00001-534283553.jpg (2.02 MB, 2700x2150)
>>
File: 1531996407.png (825 KB, 896x1152)
>>
>>
>>102780702
I'm not satisfied, but I appreciate and I'm thankful for your effort
>>
File: img-2024-10-11-22-11-10.png (2 MB, 1400x1000)
>>
>>102780702
cool, model prompt ?
>>
>>102780955
i did an ella lora once
>>
Help me understand Flux prompting
>>
File: 01447-2537506100.jpg (597 KB, 1445x1814)
>>102780929
ty mang

>>102781018
model: 1girl printer sdxl

https://files.catbox.moe/3okqxu.jpg
>>
File: 3376415182.png (949 KB, 896x1152)
>>
File: 01466-2537506103.jpg (477 KB, 1445x1814)
>>
>>102781123
>>102780699
>>
>>102781123
Schnell is better with short prompts. Dev requires longer and more specific prompts. Both react differently to loras.
>>
>>102781264
oh my fuck
>>
File: grid-0450.jpg (407 KB, 2760x2304)
>>102781347
made lora of monica corgans ass
>>
File: 3998469827.png (1.44 MB, 832x1216)
>>
File: 1708529800847477.jpg (155 KB, 811x912)
>>102781327
>Both react differently to loras.
is it normal for them to crash when you try to use a lora
>>
File: img-2024-10-11-23-11-14.png (1.51 MB, 1152x920)
>>
>>102781601
Sounds like you don't have enough VRAM.
>>
File: grid-0002.jpg (1000 KB, 2752x3456)
>>
>>102781655
12GB Vramlet yeah, but shouldn't it work with swap?
>>
>>102780362
have you tried reviewing the old ones and targeting. I went back and saw that flux with the blur generally gets ignored. So impress a stranger or continue to make what you like. You should pick the latter.

>>102723260


>>102781327
fuck. Yet another thing to test.

>>102781403
the nerd in me loves how specific that is.
>>
>>102781675
If you're using Comfy, you can try using the Force/Set CLIP Device node to force the text encoder to be processed by the CPU, which would save some VRAM.

https://www.reddit.com/r/comfyui/comments/1flmav0/flux_running_out_of_vram_when_changing_prompts/
>>
>>102781679
>flux with the blur generally gets ignored
The baker obviously isn't blown away by the gimmick alone. That's fine. It's not everyone's cup of tea. But I've been in enough OPs that I have no reason to be salty about it.

So I see it as a challenge. Pander to the baker(s?) somehow, without giving up my style. I need bakerdox* pronto. Who is this guy?

*not actual doxxing obviously, don't ban me pls
>>
awful gens today
>>
>>102781758
thx u too
>>
>>102781763
you arouse me
>>
File: ComfyUI_temp_movuq_00018_.png (2.53 MB, 1177x1518)
>>
File: grid-0004.jpg (863 KB, 2752x3456)
le arousing face
>>
>>102781791
would be arousing if she didn't have flux wax skin
>>
>>102781799
that and buttchins
>>
>>102781675
it might crash due to virtual memory too, check your pagefile
>>
>>102781769
Same, I'm getting horny thinking about you. <3
>>
baker has a nuanced and unique taste this is true
>>
>>
i remember back when /ldg/ was considered the SOVL general. what happened?
>>
>>102781836
Flux
>>
>>102781843
tragic.
>>
>>102781834
interesting style
>>
File: file.png (1.73 MB, 1024x1024)
>>
File: grid-0008.jpg (327 KB, 2752x1728)
>>102781799
too lazy to edit, pardon me sir

>>102781808
It's just her normal chin
>>
>>102781758
when /sdg/ was the most fun it has ever been two years ago it had more awful gens than you can imagine. Lots of awful gens today is a good thing.
>>
>>102781826
I know baker's no simpleton because when I post a batch of post-worthy images and I know in my heart one of them is really the best, even if I can't explain it, that's always the most likely one to end up in the collage
>>
>>102781758
if I posted my current gens I'd get jannied
>>
>>102781888
Thanks
>>
File: 686286_.png (1.11 MB, 832x1216)
I am using a Realistic Chinese targeted checkpoint. Are the boobs too much? Am I doing this right? It seems plastic, but that seems to be what is supposed to happen.

>>102781741
>The baker obviously isn't blown away by the gimmick alone. That's fine. It's not everyone's cup of tea. But I've been in enough OPs that I have no reason to be salty about it.
not salty, just tired of the "joke". Correct blur is fantastic for immersion.

>So I see it as a challenge. Pander to the baker(s?) somehow, without giving up my style. I need bakerdox* pronto. Who is this guy?
There are at least two given what they like. I assume one is early twenties based on preferences for characters. I highlighted that OP as I thought it was the most diverse. Baker is limited by the thread though so it could be that.

>I see it as a challenge
then continue having fun. It gets rough in here sometimes and I want it to stop.

>>102781781
mind if I join you sir? Maybe some cammys or more chun lis. That pluralization seems so wrong. list of mouse being mouses has completely destroyed my brain.
>>
File: 00025-3615055684.jpg (1.08 MB, 1720x2160)
>>
i guess the g in /ldg/ stands for gooning now
>>
>>102782040
always has
>>
>>102782040
Tranny cries out in pain as he strikes you
>>
>>102782040
how dare you. 1girl army get ready to launch missiles.
https://litter.catbox.moe/wa8lz3.png
>>
>>102782133
finally... sovl..
>>
>>102782040
no the "g" still stands for general. lascivious degenerates general
>>
File: file.png (188 KB, 1768x1004)
https://github.com/G-U-N/Rectified-Diffusion
https://arxiv.org/abs/2410.07303
>Diffusion models have greatly improved visual generation but are hindered by slow generation speed due to the computationally intensive nature of solving generative ODEs. Rectified flow, a widely recognized solution, improves generation speed by straightening the ODE path. Its key components include: 1) using the diffusion form of flow-matching, 2) employing v-prediction, and 3) performing rectification (a.k.a. reflow). In this paper, we argue that the success of rectification primarily lies in using a pretrained diffusion model to obtain matched pairs of noise and samples, followed by retraining with these matched noise-sample pairs. Based on this, components 1) and 2) are unnecessary. Furthermore, we highlight that straightness is not an essential training target for rectification; rather, it is a specific case of flow-matching models. The more critical training target is to achieve a first-order approximate ODE path, which is inherently curved for models like DDPM and Sub-VP. Building on this insight, we propose Rectified Diffusion, which generalizes the design space and application scope of rectification to encompass the broader category of diffusion models, rather than being restricted to flow-matching models.
So they basically said that the Flow architecture used on Flux for example is a bullshit one? Damn that's crazy...
>>
you can't spell "local diffusion general" without "OC u gen" so please limit your blathering and start making cool images I've never seen before. like this: >>102781165
>>
>>102782231
stupid ass monkey faced 1girl
>>
>>102782193
>Flow architecture used on Flux for example is a bullshit one
Who woulda thought
>>
File: 00039-924880474.jpg (653 KB, 1445x1814)
>>102782133
offblast! https://www.youtube.com/watch?v=TQhyDYQ081U
>>
>>102782193
>>102782248
Why did BFL go for rectified flow in the first place? Is it because they noticed that it would take too long to generate pictures with a 12b model?
>>
>>102782193
I am trying so hard to understand this stuff. Why can't authors just once say what their abbreviations stand for so I can catch up.

What is an ODE?
>>
>>102782280
>What is an ODE?
it's the diffusion equation, we can't really solve it mathematically so we're doing approximations on the computer, that's why you have a lot of ways of doing it with the samplers
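
To make the "approximations on the computer" point concrete, here is a minimal sketch of the simplest such approximation, a fixed-step Euler solver. The ODE used here (dx/dt = -x) is a toy stand-in chosen only because its exact answer is known; it is an assumption for illustration and not the ODE any image model actually solves. The names `euler_solve` and `toy_ode` are made up for this example.

```python
import math

def euler_solve(f, x0, t0, t1, steps):
    """Integrate dx/dt = f(t, x) from t0 to t1 using fixed-step Euler."""
    x, t = x0, t0
    h = (t1 - t0) / steps
    for _ in range(steps):
        x = x + h * f(t, x)  # follow the current slope for one small step
        t = t + h
    return x

def toy_ode(t, x):
    # Toy right-hand side (hypothetical, for illustration only).
    return -x

exact = math.exp(-1.0)  # known exact solution of the toy ODE at t = 1
for steps in (5, 20, 80):
    approx = euler_solve(toy_ode, 1.0, 0.0, 1.0, steps)
    print(steps, approx, abs(approx - exact))
```

The error shrinks as the step count goes up, which is the same trade-off as picking more sampling steps. Other samplers are, loosely speaking, different step rules for the same kind of problem.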
>>
>>102782275
well, it was a great way to prevent full finetuning without first de/undistilling
desu we can only guess at the reasoning
>>
>>102782280
Ordinary Differential Equation
You need to get into maths my man
>>
>>102782018
Bigger boobs never hurt no anon
>>
>>102781781
you can't give chun-li big granny boobs. It detracts from the legs, which are the whole point. Chun-li should be 99% legs and everything else is just there to be a foil to the beauty and power of her legs.
>>
>>102782496
now, to be fair, the rest of her body should 'match', ie, her upper body shouldn't look bizarrely girlish and weak. And her midsection should look very robust, very strong.

Also it doesn't need to be bodybuilder-muscular. It can be tastefully obscured under a little fat. But her legs should be formidable and graceful and sexy, and they should draw attention to themselves. Breasts make her look old and top-heavy, neither of which is appropriate to her character or her unique appeal.
>>
File: p1.png (82 KB, 1246x330)
Is pic related valid? If your optimization of the next step of the cost function fails then either you don't converge (which flux does) or it takes longer (which is the claim). Assuming that t is equal to 1 on every prediction is a bold claim. I am not seeing support for this. If that was the case then flux would do that thing XL does where the image is mostly a solid color with some static.

I know this doesn't change the paper's findings. It seems wrong though.

>>102782359
trying my best. Thanks.
>>
>>102782496
>dont make changes to an established character
that's half of the whole point of AI
>>
>>102782496
>>102782537
i think he should give her armpit hair
>>
File: IMG_0526.png (1.07 MB, 1024x1024)
I’ve decided that I love big brother and not being able to do full nudity is fine actually
>>
File: t ranny.png (2.78 MB, 1536x1536)
>>
>>102782757
Trollier pls
>>
>>102782563
not being able to do full nudity is like not being allowed to be racist and antisemitic on twitter before Elon bought it. It's frustrating, it's all you want to do, and you find little ways to get around the system which are very satisfying—the limits make you more creative, and what you come up with is better.

When Elon came in, antisemitism in its dumbest form immediately became normal and boring. Likewise, when a model lets you prompt nudity and sex your prompts become pornographic and dumb. My worst ever prompting was on Pony.
>>
>>102782563
just post on other boards anon. It is all good. I find the midway stuff much more annoying. With all the i2i and adjustments I have removed at least a dozen nipples from this image. This image not included because after all that I noticed that the eyes didn't turn out.
>>
File: ComfyUI_01005_.png (1.48 MB, 1024x1024)
>>102781791
>buttchin
>>
any good local AI upscale for video?
>>
>>102783369
the minimum is 40gb vram for 360p vids
so just forget it unless you're a richfag
>>
File: 0.jpg (176 KB, 1024x1024)
>>
>>102783493
tf..
I used to cheat topaz.ai demo and it worked alright. 5sec took 5-10min on 2080
Thought technology would have advanced since then
>>
File: ComfyUI_temp_gvlrp_00029_.png (1.44 MB, 1024x1320)
>>102782496
Is not chun-li but someone dressed as chun-li
>>
>>102779929
>>102776791
That Chun is incredible, don't suppose you would catbox it?
>>
File: 2024-10-11_00001_.png (1.33 MB, 720x1280)
>>102783766
>flux nip destroyer strikes again
>>
File: 2024-10-11_00002_.png (1.37 MB, 720x1280)
>>102783996
>>
File: 2024-10-11_00003_.png (1.25 MB, 720x1280)
>>102784006
>>
File: 2024-10-11_00004_.png (1.33 MB, 720x1280)
>>102784090
>>
>>102783270
Just put chin in the negative. Literally all it takes.
>>
File: 2024-10-11_00005_.png (1.23 MB, 720x1280)
>>102784105
>>
My theme here is "AWESOME".
>>
>>102783996
Give me your best nipple inpainter
>>
File: 2024-10-11_00006_.png (1.27 MB, 720x1280)
>>102784145
>>
File: a26.png (1.02 MB, 1120x1536)
>>102783766
we love hags and their saggy tits here
>>
File: 1707710998624917.png (45 KB, 842x766)
>>102780699
Is this correct? I'm gonna feed it to LLM node
>>
File: 00020-3620358296.png (1.48 MB, 912x1336)
>>
>>102784239
let us know
>>
>comfyui only utilizes 76% of vram
uh
>>
>>102784130
>Just put chin in the negative. Literally all it takes.
it's only working for de-distill right?
>>
>>102784280
Now 86% vram utilization. Why isn't it faster???

Maybe my system ram is super fast?
>>
>>102784440
Mine has dynamicthresholdingfull. It works, it's slow, it's half as fast. for me, 20 sits.
>>
File: file.png (656 KB, 1080x606)
>>102784462
>20 sits
holy shit bruh, buy a better GPU, you're torturing yourself
>>
>>102784481
My gpu is very good, at gaming lol. amd cards are extremely exotic, zero of the ai coders ever owned one.
>>
>>102784481
Also, you are forgetting we are experiencing a miracle that didn't even exist in 2016.
>>
File: file.jpg (512 KB, 1899x1473)
>>102784524
>didn't even exist in 2016.
I don't want to sound like a nerd but it was possible to do it in 2016
https://newatlas.com/ai-art-film-writing-review/46891/
>>
>>102784548
I also installed - and ran - deep dream.

Was it actually possible to use prompts?

Can you make flux do something similar?
>>
>>102784613
>Can you make flux do something similar?
if you make a deep dream lora then yeah probably
>>
>>102784548
Deep dream was the first last and only time ai art had soul
>>
>>102784630
>Made by Google
>soul
anon...
>>
File: ComfyUI_34324_.png (1.14 MB, 848x1024)
>>
>>102784641
Deepmind used to operate semi autonomously back when Google was the super cool place to work
>>
I don't know why my amd card literally* does not benefit from having greater ram utilization.

*only kinda

--highvram
oom

--normalvram (actually default)
86%
Prompt executed in 635.37 seconds

--lowvram
76%
Prompt executed in 642.00 seconds

--novram
22%
Prompt executed in 706.31 seconds
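
For scale, the three timings listed above can be turned into relative slowdowns against the default mode. This is only arithmetic on the numbers already quoted; the dictionary keys mirror the flags as written in the post, and `slowdown_pct` is a made-up helper name.

```python
# Run times in seconds, copied from the post above (--highvram ran out of memory).
timings = {
    "--normalvram": 635.37,
    "--lowvram": 642.00,
    "--novram": 706.31,
}

def slowdown_pct(seconds, baseline):
    """Percent extra time compared to the baseline run."""
    return (seconds / baseline - 1.0) * 100.0

baseline = timings["--normalvram"]
for flag, seconds in timings.items():
    print(f"{flag}: {seconds:.2f}s, {slowdown_pct(seconds, baseline):+.1f}% vs default")
```

On these numbers the low-VRAM mode costs about one percent and the no-VRAM mode about eleven percent, so VRAM utilization does not look like the main bottleneck in this setup.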

>>102784548
I want to run it again. I hate docker, is it only a docker thing? iirc it genned on my old cpu, it can't need much.
>>
https://huggingface.co/terminusresearch/flux-booru-v0.2
the guy who made flux booru updated his shit, too lazy to try it out but here you go
>>
>>102784205
model?
>>
>>102784854
maybe because my memory clock speed is about 1ghz?
>>
File: 00064-2065459701.png (1.79 MB, 896x1152)
>>
Anons, what's the best model currently for specifically generating pixel art?
>>
>>102780583
I'm trying it right now. 30 steps is a bitch with flux but 60 steps is a motherfucker, 8 minutes on my 3060 12gb for an 832 x 1216.

Is the prompt adherence and quality amazing? I don't know, given that it takes so long for 1 gen, I might need a week to find out. It hasn't blown my socks off so far

Also, is it bad if your card spends hours at a temp of 65c/150f degrees?
>>
File: file.jpg (3.37 MB, 3652x2226)
https://huggingface.co/aleksa-codes/flux-ghibsky-illustration
Holy shit this is beautiful, you're missing out a lot of good stuff if you only go on civitai
>>
>>102785058
>Also, is it bad if your card spends hours at a temp of 65c/150f degrees?
no it's all right, GPUs can handle up to 80°C fine, they were made for this
>>
File: file.png (2.78 MB, 2342x1238)
>>102784613
>Can you make flux do something similar?
you absolutely can if you go for cfg 1 on de-distill kek
>>
>>102784854
because those things are mostly written for cuda. Without checking I assume that most of those options change how memory gets allocated in the background. If you are using forge (or related) you can mess around with the memory used. Protip: sometimes less memory runs faster. Watch a GPU monitor to confirm what is happening.
>>
>>102785137
I apparently missed the memo on de-distill. idk what this is.
>>
>>102785219
Can Forge work on Linux?
>>
>>102785102
fuck civitai, I just searched for older sdxl models I downloaded sometime ago to check out if there are any versions or if the author created another checkpoint and to my surprise they were mostly 404, civitai deletes stuff like crazy
>>
>>102785057
Some models are quite good at pixel art but none are better than just using a pixelize filter. The one for comfy (some custom node) has various settings to tweak as well.
>>
>>102785222
>I apparently missed the memo on de-distill. idk what this is.
it's basically flux dev but without the distillation guidance, it's working with the regular CFG like regular SD models
https://huggingface.co/nyanko7/flux-dev-de-distill
>>
File: ComfyUI_05617_.png (1.02 MB, 1024x1024)
>>102785057
flux dev can do pixel art right out of the box
>>
>>102785246
>23.8gb
will this raep my machine?
>>
>>102785259
you're not obligated to run the bf16 model though, you can go for quants
https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF
>>
>>102785268
I run dev, because the speed difference between dev and fp8 is minimal on my amd card. It has to do probably with comfyui being coded wrong for amd cards.

I'll try the fat version first, thanks!
>>
>>102785294
there's also the fp8 version
https://huggingface.co/MinusZoneAI/flux-dev-de-distill-fp8
>>
>>102785306
Thanks, I'm downloading the 23gb one first. What changes do I need to make to my workflow?
>>
>>102785327
you remove the distilled guidance scale and you go for CFG > 1
>>
they said the flux shills left...
>>
>>102785355
>every fan I encounter is a shill
No anon, it doesn't work like that.
>>
Hey anons I have a stupid question
How come Stability or BFL don't use the same technique as Unity for people using their models commercially? Charge a small percent of the sales they make so they gain some profit?
>>
>>102785369
Anon it's bait, don't engage with the troll further
>>
>>102785379
that's what they do with their commercial licence though? I'm pretty sure that if you're making money out of your AI pictures, you have to give some % to them
>>
>>102785398
AI images cannot be copyrighted, that's why AI outputs aka gens from models don't have/need a commercial license
>>
>>102785355
I didn't learn the art of sd prompting. As a promptlet, Flux is really fantastic.
>>
File: ComfyUI_temp_cqfpx_00001_.png (1.89 MB, 1472x1251)
>>
>>102785440
>AI images cannot be copyrighted
I'm pretty sure I've seen somewhere that Ideogram and SAI wrote on their licence that we can't train models with their AI pictures
>>
File: ComfyUI_temp_oxzpx_00001_.png (1.69 MB, 1472x1248)
>>102785448
>>
>>102785440
*have thus far failed to gain copyright status
You need faith, anon
>>
>>102785451
Flux has the same item in their licenses
>>
>>102785451
>>102785467
that's so hypocritical of them, they didn't ask anyone's permission to use real pictures and drawings to train their model and now they are gatekeeping their output? loooool
>>
>>102785465
AI images shouldn't be copyrighted anon, why would you want that?
>>
>>102785474
Flux was trained with ai images too
>>
>>102785441
this is much better b8 i like it
>>
File: file.jpg (701 KB, 3818x1612)
>>102785102
that's probably an undertrained Lora, when you ask something completely different it goes back to its regular style, what a shame, I wonder what pictures he used to make his Lora in the first place, the quality is amazing
>>
>>102785507
surely increasing the lora strength would mitigate that
>>
>>102785336
So it doesn't work with loras?
>>
>>102785532
it works, CFG has nothing to do with Loras
>>
>>102785537
It says it doesn't work with diffusers, and I don't know what those are, sorry.
>>
>>
>>102785555
I don't really know what you're talking about, but the regular dev loras work fine on de-distill dev
>>
File: 02229-2690418976.png (1.18 MB, 896x1152)
>>
>>102785566
thanks, why do some comments have this?

>use X-Labs's "x-flux-comfyui" node sample workflow
>>
>>102785611
>why do some comments have this?
what comments?

>use X-Labs's "x-flux-comfyui" node sample workflow
this has nothing to do with de-distill, it's some controlnet shit, completely unrelated, like I said, you remove the distilled guidance scale node, you go for CFG > 1 and there you go you can run this model
>>
File: 02284-2300883834.png (1.1 MB, 896x1152)
>>102785569
>>
File: 02248-2300883796.png (1.09 MB, 896x1152)
>>102785699
>>
File: 02285-2300883835.png (1.03 MB, 896x1152)
>>102785702
>>
>>102780180
what makes this noobAI model so good?
>>
File: ComfyUI_temp_vrraj_00007_.png (1.89 MB, 1072x1880)
>>
>>102785712
>what makes this noobAI model so good?
a shit ton of pictures, no cucking?
>>
>>102785733
how is it compared to autismmix (pony, anime focus) as that is really good in general.
>>
File: ComfyUI_temp_uyzyp_00053_.png (2.05 MB, 1072x1880)
>>102785712
its a tad wonky, i think theres another version that tries to fix that, but its pretty promising regardless
i like it more than other XL anime models desu (i dont care for extreme porno)
>>
>>102785747
NoobAI is a finetune of Illustrious XL which is a finetune of Kohaku Beta Revision 5 which is a finetune of Base SDXL, to be clear.
>>
>>102785268
Cool, finally the Q6 version.
>>
File: grid-0007.jpg (3.62 MB, 4992x5472)
>>102785712
it has a painterly quality that is interesting. It feels more like OG nai. most XL models have a very narrow range, noob feels like it has possibilities.
>>
>>102785790
I'm trying to upload them for 3 days straight but fucking huggingface is giving me errors almost everytime, the best strategy I found was to upload them one by one, which is retarded but hey it works I guess :(
>>
How much longer until we have a pixel perfect translation of latent space?
>>
>>102785815
Good job nonetheless
>>
>>102785844
thanks :3
>>
File: CogVideoX-I2V_00019.webm (173 KB, 720x480)
>>102785493
>>
File: ComfyUI_temp_vrraj_00014_.png (1.63 MB, 1072x1880)
>>
>>102785857
have you also tried Pyramid Flow?
>>
>>102785875
yeah but couldn't get anything out of i2v, still too early for proper testing, it needs some updates
>>
File: file.png (453 KB, 2432x1165)
https://arxiv.org/abs/2407.15811
> using only 37M publicly available real and synthetic images, we train a 1.16 billion parameter sparse transformer with only $1,890 economical cost and achieve a 12.7 FID in zero-shot generation on the COCO dataset. Notably, our model achieves competitive FID and high-quality generations while incurring 118× lower cost than stable diffusion models and 14× lower cost than the current state-of-the-art approach that costs $28,400.
Really impressive
>>
>>
>>102785908
Everyday we inch closer to a slew of bespoke anon models
>>
>>102785926
what lora are you using to get that retro vhs style?
>>
File: file.png (2.28 MB, 3491x1337)
>>102785956
you combine this with REPA and there we go we're already there
https://github.com/sihyun-yu/REPA?tab=readme-ov-file
>>
>>102785908
Wtf, so that's it? You rent 8xA100 gpus, you wait for a week and you have a fully pretrained model? That's crazy...
>>
>>102777600
REPA anon, how is your progress?

>>102785908
it's kinda old news...

>>102785976
I've seen questions about combining REPA and this micro thing in REPA github issues
But I doubt it will be good as microDiT uses masking heavily during training to optimize cost
>>
File: ComfyUI_temp_vrraj_00033_.png (2.21 MB, 1880x1072)
>>
Requesting help to fix a problem in forge

The built-in controlnet is partially broken.

When I try to use depth, it wants to download some stuff, but then fails and complains about PYTHON CERTIFICATES which need to be updated as far as I can understand

Any thoughts?

P.S. Some other cn models are working though
>>
>>102786286

ssl.SSLCertVerificationError: [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:997)
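
One way to start narrowing this down: the error means Python could not find a CA certificate it trusts for the download host. The sketch below only uses the standard library `ssl` and `os` modules to print where the running interpreter looks for CA certificates. Whether Forge's bundled Python uses the same paths is an assumption, so it would need to be run with the same interpreter Forge uses; `describe_ca_paths` is a made-up helper name.

```python
import os
import ssl

def describe_ca_paths():
    """Report where this Python looks for CA certificates and whether they exist."""
    paths = ssl.get_default_verify_paths()
    return {
        "cafile": paths.cafile,  # None when no CA bundle file was found
        "cafile_exists": bool(paths.cafile) and os.path.exists(paths.cafile),
        "capath": paths.capath,  # None when no CA directory was found
        "capath_exists": bool(paths.capath) and os.path.exists(paths.capath),
        "openssl_cafile": paths.openssl_cafile,  # compiled-in default location
    }

for key, value in describe_ca_paths().items():
    print(f"{key}: {value}")
```

If both the file and the directory come back missing, a commonly suggested workaround is to point the `SSL_CERT_FILE` environment variable at a known-good CA bundle (for example the one shipped by the `certifi` package, if it is installed) before launching the UI. That is a suggestion to try, not a confirmed fix for this particular Forge setup.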
>>
>>
>>102785624
Thanks. Trying Cfg 2.
>>
File: file.jpg (1.79 MB, 9080x1796)
>>102786589
looks like its sweet spot is at cfg 3, after that it starts burning
>>
>>102785624
btw, does this one:
>>102785306
or these
>>102785268
support loras & negative prompting?
>>
>>102786604
Why is 1 a painting?
>>
>>102786680
>support loras & negative prompting?
yes, it supports loras and negative prompting

>>102786688
>Why is 1 a painting?
cfg 1 is always fucked on undistilled models, try that on SD1.5 you'll see lol
>>
>>102786688
its called sovl
>>
File: 2024-10-12_00002_.png (1.17 MB, 720x1280)
a 1 girl, because the world needs these
>>
so realistically what are the best interfaces and models for realistic uncensored gens?
did everything go down the drain after automatic1111 and sd1.5?
haven't checked these threads in over a year
>>
>>102786711
idk. Maybe Flux, then inpaint with Pony? I have only inpainted a little bit with Flux, and it's fun, but often it bucks you.
>>
>>102786711
Flux + amateur lora = Kino realism
>>
File: 2024-10-12_00003_.png (1.17 MB, 720x1280)
>>102786700
Just changed cfg to 1.
>>
>>102786728
legit amazing, wondering what's going on, why is it like this? Has anyone plotted out cfg 1 with different schedulers*samplers?
>>
>>102786728
>1 girl
>1 CFG
1 is officially the kino number
>>
File: file.png (2.41 MB, 1024x1024)
>>
>>102786735
>why is it like this? Has anyone plotted out cfg 1 with different schedulers*samplers?
cfg 1 always produced monstrosities, even on previous models such as SD1.5, SDXL... the model has too much freedom so it does whatever it wants
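
For reference, the usual classifier-free guidance mix can be sketched like this. It's a toy version on plain lists rather than real latents, and `cfg_combine` is a hypothetical name, not any UI's actual function. At cfg 1 the unconditional/negative branch cancels out completely, so the sampler follows the raw prompt-conditioned prediction with no extra push; below 1 it drifts back toward the negative branch.

```python
def cfg_combine(cond, uncond, scale):
    """Classifier-free guidance: start at the unconditional (or negative-prompt)
    prediction and move `scale` times the distance toward the conditional one."""
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


# Toy per-element "noise predictions" (made-up numbers for illustration).
cond = [1.0, 2.0]    # prediction with the positive prompt
uncond = [0.0, 0.0]  # prediction with the empty / negative prompt

at_one = cfg_combine(cond, uncond, 1.0)   # identical to cond, negative has no effect
at_half = cfg_combine(cond, uncond, 0.5)  # halfway between uncond and cond
at_three = cfg_combine(cond, uncond, 3.0) # extrapolated past cond
```

This also lines up with negative prompts only doing something once cfg is above 1.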
>>
>>102786771
THIS
>>102786728
is NOT a monstrosity
>>
>>102786604
can you post cfg 1 Trump on the bicycle on the moon?
>>
>>102786794
it's already there, those 8 pictures are from cfg 1 to cfg 8
>>
hello r/ldg, i'm requesting a wife
>>
>
>>
>>102786680
https://imgsli.com/MzA1ODc3
Negative prompting works on de-distill, but its effects are strong, maybe "red" was overkill lol
>>
>>102786830
Neat, which quant are you using?
>>
>>102786858
Q8_0
>>
File: 2024-10-12_00004_.png (1.16 MB, 720x1280)
>>102786728
cfg 0.5
>>
File: file.png (615 KB, 806x452)
>>102786871
Holy fuck...
>>
>>102786880
Yeah, it's an abomination at that point lmao
>>
>>102786814
NO!

The use of gpu is to be exclusively for holy deeds.
>>
>>102786801
Why does the first image have a basket?
>>
>>102786916
Why not? this was my prompt
>Donald Trump on a bike to the moon
a bike can have a basket
>>
File: 2024-10-12_00006_.png (1.6 MB, 720x1280)
>>102786871
cfg 2

but with no positive now, just negative.
>>
>>102786944
>cfg 2
>but with no positive now, just negative.
like you wrote "1girl" on the negative and nothing on the positive?
>>
>>102786959
kind of, this is my negative:
>chin, jaw, beautiful, pretty, cute, bra

guidance 3.5 for positive & negative.
>>
>>102786972
>guidance 3.5
there's no guidance anymore on undistill, that's the point, it's to remove guidance and only get CFG
>>
what do i use to caption my dataset for lora training sdxl models?
>>
File: file.png (2.27 MB, 1024x1024)
>>
File: 1724034796149606.png (1.35 MB, 1024x1024)
big city miku
>>
>>102787015
>no blur
how did you get that?
>>
>>102787015
where's the blur
>>
File: 1721539165147531.png (1.43 MB, 1024x1024)
>>102787020
just flux default settings, in forge

"An street view of Akihabara, Tokyo. Miku Hatsune is on a large anime style billboard on a building wall."
>>
>>102786983
I guess guidance is ignored? I am using ClipTextEncodeFlux for negative and positive prompts.
>>
>>102787041
>I am using ClipTextEncodeFlux for negative and positive prompts.
I use the regular CLIP Text Encode and I don't see any difference
>>
File: 2024-10-12_00007_.png (1.19 MB, 720x1280)
>>102786944
cfg 1.5, with the prompt intact
>>
File: ComfyUI_241340_.png (1.1 MB, 1024x1024)
>>
>>102787072
cute demon
>>
>>102787035
without blur it's still impressive, Flux is really great at details. not perfect though, maybe this could be fixed with a 32 channel VAE or something?
>>
>>102786944
do 1girl in the negative, cfg 0.5
>>
File: 2024-10-12_00008_.png (1.37 MB, 720x1280)
>>102786944
Removing the negative prompt just changes the color a little. Very strange. Still cfg 2, but now positive & negative are blank. same seed, obviously.
>>
>>102786871
cfg 0.1
(prompt intact)

.5 is a sweet spot for whacky stuff? maybe?
>>
File: ComfyUI_241389_.png (1.12 MB, 1024x1024)
>>
File: 2024-10-12_00009_.png (1.14 MB, 720x1280)
>>102787180
>>
File: 1702118040945754.png (1.44 MB, 1024x1024)
>>
https://reddit.com/r/aivideo/comments/1g1u6aa/balloon_birds/
lawl, I wonder why there's no video AI thread already? There's some kino in there
>>
File: 2024-10-12_00010_.png (1.35 MB, 720x1280)
>>102787122
>>
File: 1697222890317902.png (1.43 MB, 1024x1024)
>>102787241
>>
>>
File: ComfyUI_241407_.png (1.13 MB, 1024x1024)
>>
>>102786314
manually download what you want. Your browser is causing the issue and it will just fix itself after some time.
>>
Been away for a month. What did I miss.
Last time I was here flux LoRAs were being made every five seconds.
>>
>>102787536
>Been away for a month. What did I miss.
We undistilled flux dev
https://huggingface.co/nyanko7/flux-dev-de-distill

So now we can do more than just simple loras, we can do real finetunes. Now we're waiting for one, and this guy is working on it:
https://huggingface.co/SG161222/Verus_Vision_1.0b
>>
File: 2024-10-12_00011_.png (1.3 MB, 720x1280)
>>102786871
>>
>>102787554
Very cool. Is the Turkish grifter still around?
>>
File: ComfyUI_241511_.png (1.08 MB, 1024x1024)
>>
>>
ALL I NEED IS FLUX

AND SOME BOTTLES TO PEE IN

AND TENDIES
>>
>>102787790
zoomers will never know the feel of getting one of these.

It happened to me some, but I didn't even know what focus WAS.
>>
File: 00092_.jpg (665 KB, 2387x3410)
>>102787536
itercomp was released. New method. Seems mid to me.

>>102787643
he was just wasting time on the new text2video project
>>
Time is a valuable thing
>>
>>102787817
boomers would realize that 512x832 would destroy a data plan, if you could call it that. Nobody would send that and get a better pic. 8 3886
>>
>>
>>
>>102787827
watch it fly by as the pendulum swings.
>>
>>102787809
There's no promptan without poopsock
>>
https://replicate.com/blog/flux-is-fast-and-open-source
>FLUX is now much faster on Replicate, and we’ve made our optimizations open-source so you can see exactly how they work and build upon them.
>FLUX.1 [dev] at 1024x1024 and 28 steps: 3.03 seconds (P90: 3.90 seconds)
HOLY FUCK ARE WE BACK??
>>
>>102787597
ok here's an idea. what if the prompt says "bikini sex mommy awooga" and the cfg is turned up to 11
>>
>>102787967
uhhhhhh it literally says all they did is use pytorch nightly and quantized flux
I hope they didn't pay cerfuckhim for that revelation
>>
>>102787967
https://fast-flux-demo.replicate.workers.dev/
wtf this is instant kek
>>
>>102787967
seems like they didn't do anything. The github is an API and video card injection for your kernel is pretty standard.
>>
>>102787967
they didn't do anything that actually helps local because we were already using these things just to run the model kek
>>
>>102787967
https://github.com/replicate/cog-flux
>Compilation with torch.compile
>Optional fp8 quantization based on aredden/flux-fp8-api, using fast CuDNN attention from Pytorch nightlies
Sounds pretty standard no? Aren't we already doing that?
>>
waow
>>
File: ComfyUI_241537_.png (1.1 MB, 1024x1024)
>>102787967
>>102788041
yeah this has been possible in comfy for over a month now. Before you had to use the --fast argument but now you can just select weight_dtype fp8_e4m3fn_fast in the Load Diffusion Model node and you can use the TorchCompileModel node for the torch.compile part.
>>
>>102788068
>comfy not mentioned
brings up comfy

go back >>>/sdg/
>>
File: file.png (217 KB, 2811x1370)
>>102788068
>you can use the TorchCompileModel node for the torch.compile part.
>Cannot find a working triton installation.
It won't work on windows innit?
>>
>>102788068
thank you comfy
>>
china modal
>>
File: file.png (65 KB, 1274x480)
>>102788100
>It won't work on windows innit?
nope, you're fucked, even the turkish grifter found this ridiculous lol
https://github.com/triton-lang/triton/issues/1640
>>
File: file.png (122 KB, 2486x608)
>>102788182
https://github.com/triton-lang/triton/pull/4045#issuecomment-2253346967
Dare I say, Based?
>>
>>102788068
>Before you had to use the --fast argument but now you can just select weight_dtype fp8_e4m3fn_fast in the Load Diffusion Model node
It's only working for the 4090 right?
>>
>>102788068
>weight_dtype fp8_e4m3fn_fast
I don't have a fast version. That's only with schnell?
>>
File: file.png (49 KB, 1031x452)
>>102788333
did you update ComfyUi?
>>
>>102788341
Scared to. May as well, ok
>>
>>102788341
Using it. It's not any faster lol.
>>
>>102788381
I think the fast thing only works on the 4090, so if you don't have that card nothing will happen lol
>>
>>102788272
apparently. If there's an uplift, it's minute.
>>
File: evil.png (103 KB, 506x376)
>>102788384
AMD cards are super exotic. Nobody who writes cpu code ever owned one.
>>
File: ComfyUI_09499_.png (3.65 MB, 2048x2048)
When can I take a photo of me and my waifu?
>>
>>102788041
>>102787967
>>Compilation with torch.compile
asks for triton and Windows doesn't handle that
>>Optional fp8 quantization
I'm not leaving my beloved Q8_0

So I guess that's a nothingburger huh?
>>
does ram speed matter for flux?
>>
File: ComfyUI_241560_.png (1.03 MB, 1024x1024)
>>102788381
You need pytorch 2.4.1 or newer and Nvidia ADA/40 series.
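
A rough way to check both requirements before blaming the node. This is a sketch with hypothetical helper names (`parse_version`, `fp8_fast_supported`); it assumes "ADA/40 series" corresponds to CUDA compute capability 8.9 or higher, and it takes plain values so it runs without a GPU:

```python
def parse_version(version):
    """Turn a version string like '2.4.1+cu124' into a comparable tuple (2, 4, 1)."""
    core = version.split("+")[0]       # drop the local build tag, e.g. '+cu124'
    parts = core.split(".")[:3]        # keep major, minor, patch only
    return tuple(int(p) for p in parts)


def fp8_fast_supported(torch_version, capability):
    """True if the version is at least 2.4.1 and the GPU reports
    compute capability 8.9 or higher (assumed threshold for Ada)."""
    return parse_version(torch_version) >= (2, 4, 1) and tuple(capability) >= (8, 9)


# With a real install the inputs would come from:
#   torch.__version__ and torch.cuda.get_device_capability()
print(fp8_fast_supported("2.4.1+cu124", (8, 9)))  # 40 series on a new enough build
print(fp8_fast_supported("2.5.0+cu124", (8, 6)))  # 30 series: too old for this path
```

If it comes back False, selecting the fast dtype shouldn't be expected to change your speed.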
>>
>>102788407
wait there's a Kurt Cobain lora on flux?
>>
The irony is AMD cards are vastly more powerful than nvidia ones. But, nobody ever heard of AMD.
>>
>>102788422
Yeah, I will never buy anything from nvidia, because their chips are low quality.
>>
>>102788250
shitting on an open source project is never based. Both private companies could decide this shouldn't be MIT licensed. Additionally, the offer was put to the guy to maintain the PR.

tldr: This guy is still a d-bag.

>>102788414
if you are offloading to prevent OOM, then yes.
>>
>>102788425
>The irony is AMD cards are vastly more powerful than nvidia ones. But, nobody ever heard of AMD.
because CUDA rules the world
>>
ffs baker let's go
>>
>>102788446
gens this thread are shit so he's probably struggling to find any good ones for the collage
>>
File: ComfyUI_241580_.png (1.06 MB, 1024x1024)
>>102788425
That was true years ago but it stopped being true after vega.
>>
>>102788438
Yep

>>102788459
Just sell your card and upgrade ad infinitum, let someone else roll the dice.
>>
>>102788456
Every thread cannot cater to your homosexual needs, get over it.
>>
File: file.gif (1.57 MB, 480x270)
>>102788478
lmao
>>
>>102788478
so true saar, need more blurry 1girl
>>
File: file.png (598 KB, 718x1000)
>>102788492
>need more blurry 1girl
>>
>>102788492
censorship fetish is usually homosexual.
>>
>>102788504
blurring an image is literal censorship
>>
>>102788513
you can blur for censorship, but you can also blur for the sake of art, it's all about the context anon
>>
>>102788523
saar i'd appreciate your art if i could see it
>>
>>102788523
nothing about the blurs in this thread has been about art.
>>
>>102788531
>>102788534
>t. have no clue about art
>>
>>102788523
>>102788545
what fucking art lol, i'd understand if he was going for a low res early 2000s photo aesthetics but the shit he's going for is average sd3m api outputs. the blur is way too much
>>
>>102788552
it's also wrong. upscale from 0.1 megapixels doesn't look like anything in this thread.
>>
baker is taking so long because he's struggling to narrow it down with so many good gens to choose from
>>
File: file.png (2.24 MB, 1024x1024)
>>
hey guys, baker here, sorry it's taking so long. currently at the hospital getting my eyes checked because most of the thread just looks all blurry and unclear to me. once i'm done i'll be baking soon so please bear with me!
>>
What's the current consensus on captioning VLMs anons?
My images are SFW, and I don't need booru tags
>>
>>102788663
>My images are SFW
GPT4V is still the goat of SFW captions
>>
>>102788663
i think InternVL2 is the current local SOTA but someone correct me if i'm wrong
>>
>>102788668
Figures, I should have specified local models
>>
baker SHOCKS, STUNS /ldg/ with collage of TEXT-ONLY posts, cementing NEW META of NEVER posting gens
>>
>>102788680
>i think InternVL2 is the current local SOTA but someone correct me if i'm wrong
I think it's molmo
https://molmo.allenai.org/
>>
>>102788663
it's important to tag all images "crisp, HD" or similar if they're sharp and clear.
>>
uglymaxxing is important too, especially with FLUX. never prompt beautiful girls, gorgeous, etc. always prompt "she's fugly and desperate", "her nose is unfortunate", "eww I hate her"
>>
hey gaize, check out my 1girl. what you think?
>https://pastebin.com/aRr9pSQa
>>
NEXT ONE

>>102788863
>>102788863
>>102788863

NEXT ONE
>>
>>102788726
That would be teaching a model that by default an image should be blurry or unclear unless instructed otherwise. Same old mistake some anime models made with tagging "masterpiece", "score_9", and other quality tags, it meant that by default gens always came out bad so that the quality tags became mandatory for the model to function normally.
>>
>>102780955
Literally Maxine Caulfield with a more punchable face.
>>
>>102787597
eerily primordial kind of cave art
>>
>>102786752
Ty miku


