/g/ - Technology


Thread archived.
You cannot reply anymore.




File: tmp.jpg (1.19 MB, 3264x3264)
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102130343

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
File: 1709892352199075.jpg (69 KB, 800x1170)
>>
When do you think we will have a serious finetune of Flux?
>>
File: 1695070461250206.jpg (83 KB, 1170x800)
>>102135662
2
>>
File: 2024-08-29_00060_.png (1.17 MB, 1024x1024)
>>102135630
ty baker
>>
Can't get the Flux Q8 and dev1 models to load.

Getting this error even though I have clip-l in the text encoder folder
AssertionError: You do not have CLIP state dict!
>>
>The output changed slightly!
>That means it is objectively worse
>I will never upgrade my pytorch again
>2.3 is objectively as good as the quality will ever get

Okay
>>
File: Capture.png (8 KB, 604x100)
>>102135680
Where your vae is. Click the drop down and select clip_i and your t5, like picrel
>>
>>102135709
>clip_l
>>
>>102135693
>2.3 is objectively as good as the quality will ever get
not true, there are probably older pytorch versions that give different outputs than 2.3, so the question remains: when did it start to change?
>>
>>102135709
clip doesn't show up in the drop down
>>
>>102135728
Then make it show. It goes in the text_encoder folder.
>>
File: 1708631095082191.jpg (78 KB, 800x1170)
>>102135662
>>102135670
More
>>
>>102135720
It's not a new topic at all
https://discuss.pytorch.org/t/different-cuda-versions-bring-different-results/174112/3
>You cannot expect to see bitwise-identical results when libraries are updated and small errors caused by the limited floating point precision are expected.
>It’s still unclear how large the errors are and if these are unexpected or not.
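A minimal sketch of why bitwise-identical results are not guaranteed, in plain Python rather than PyTorch: floating-point addition is not associative, so a library update that merely reorders a sum can change the last bits. The three values are arbitrary illustrations, not anything taken from a real model.

```python
# Floating-point addition is not associative, so the order in which a
# library sums the same numbers can change the last bits of the result.
values = [0.1, 0.2, 0.3]

left_to_right = (values[0] + values[1]) + values[2]
right_to_left = values[0] + (values[1] + values[2])

# Same inputs, different grouping, slightly different answers.
orders_agree = left_to_right == right_to_left
difference = abs(left_to_right - right_to_left)
print(left_to_right, right_to_left, orders_agree, difference)
```

A diffusion sampler repeats millions of such operations per step, so tiny differences like this can plausibly grow into a visibly different image without either version being "worse".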
>>
File: 2024-08-29_00065_.png (1.25 MB, 1024x1024)
>>
File: FD_00081_.png (1.91 MB, 768x1344)
>>
>>102135738
it's already there
>>
File: FD_00086_.png (530 KB, 768x1344)
Are Turks the new Jeets?
>>
File: 2024-08-29_00067_.png (1.41 MB, 1024x1024)
"AI was a mistake." -Hayao Miyazaki
>>
So the state of the art is Flux right? What of specific fine tuned SDs? Does Flux also cover lora?
>>
File: FD_00119_.png (586 KB, 768x1344)
>>102135835
I genuinely don't get how this is any different to photoshop which we have had for 40 years.
Oh no people will think these are real photos democracy is doomed!
>>
>>102135855
PonyXL (and its variants) are the only SD flavours worth a damn. And the only reasons to use Pony over Flux are ramletness and porn.
>>
>>102135855
>So the state of the art is Flux right?
For local yes
>What of specific fine tuned SDs?
Pony will still probably be ahead in niche fetish stuff for the next few months
>Does Flux also cover lora?
Yes, and in fact the LoRAs are extremely easy to train and can produce excellent outputs that make even the best SDXL LoRAs look like shit.
>>
File: 00028-3827800557.jpg (1.19 MB, 1440x1920)
>>
>>102135872
>>102135880
What about the lower bit quants? Are they still SOTA? With lora on 6 GB model sizes, do they still stack up better than lora on pony/sd?
>>
>>102135896
Quants are what make this useable. Flux LoRAs absolutely shit on any SD LoRA. Even the shit LoRAs are really good.
>>
>>102135896
The lower quants aren't THAT big of a visual sacrifice. This is coming from a 24gb king of vramlets.
I haven't touched stable diffusion since Flux came out because Flux is just so much better.
>>
>>102135938
>I haven't touched stable diffusion since Flux came out because Flux is just so much better.
I have, but only to draw the nipples back on with a titty mask and adetailer
>>
File: 2024-08-29_00076_.png (1.17 MB, 1024x1024)
>>102135863
they fear meme magic, it is more powerful than media manipulation.. also for Photoshop you need some serious skills. For AI you just need an idea.. that makes them afraid to the core. Any autist with an idea can just push it out and it will roll over them like a landslide if it's a good idea. But it's over.. the djinni is out of the bottle. You can't put AI back in, but some elderly powers-that-be have not realized this yet. They fear ideas.
>>
>>102135938
>>102135922
Thx
>>
File: FD_00124_.png (647 KB, 768x1344)
>>102135950
>for Photoshop you need some serious skills
Meanwhile the same artfags who are bitching about AI are pressing the generative fill button faster than we can proompt.
>>
>>102135979
>pressing the generative fill button
All it took for them to shut up and get in line was for it to be served to them out of adobes anus.
>>
File: 2024-08-29_00079_.png (991 KB, 1024x1024)
>>102135979
rofl ya, I guess Photoshops AI cloud is probably one of the most used SAAS AI used now.

But most artfags are tech illiterate .. they just have no clue.
>>
File: 1721644057249857.png (497 KB, 720x480)
>>
>>102135630
Is there a good Comfyui workflow to take a person in image A and plant it into background image B?
And a simple faceswap one?
>>
>>102135767
beautiful execution but boy thats some tacky shit right there homie. idea: "LIVE LOVE LAUGH erm FUCK"
>>102135950
they simply fear us being able to create our own entertainment (because they can't inject that really important social engineering into that)
>>
File: FD_00083_.png (1.47 MB, 768x1344)
>>102136042
I know but I also don't care. I have tons of cheesy woman poster gens.
>>
nogen because on a nsfw bender
>>102135979
but..but.. that is different!! nice gen. I like it a lot. reminds me of.. quite a few things actually. really clean
>>102136076
lol just ignore me. really nice gen! double exposure huh. is that base flux or some lora?
>>
File: 2024-08-29T185407.544.jpg (625 KB, 1536x2688)
>>102136097
Base flux, prompt is just "double exposure of X that turns into Y"
>>
Does base flux generate nudes?
>>
>>102136118
Kind of. You can do it easily with a LoRA but base flux has pepperoni nipples
>>
>>102136118
Not to be a dick, but why are you asking us instead of just trying it out?
>>
>>102136118
To be a dick, why are you asking us unstead of just trying it yourself, faggot
>>
File: 184212_00001_.png (1.7 MB, 1024x1296)
>>102136118
If you ever get around to making a lora, include lots of non nude images, because otherwise you'll get pepperoni nipples, or simulacra of them, when she's wearing literally anything.
Don't make my mistake.

Realistically we have to wait for a finetune.
>>
File: Untitled.png (8 KB, 710x45)
Has anyone tried training for clip_L yet? Legitimately can't tell if it's doing anything.
>>
>>102136118
To be helpful, yes but with limitations. Nipples are borked, genitalia do not exist.
>>
>>102136123
It can generate pointy nipples on statues.
>>
File: 2024-08-29_00087_.png (1.43 MB, 832x1216)
>>
File: 2024-08-29_00088_.png (1.34 MB, 832x1216)
>>
>>102136183
It can generate nipples on skeletons.
https://files.catbox.moe/gh6e6g.jpg
>>
>>102136179
You can train two otherwise identical loras, one with clip-l trained and the other without, to find out.
>>
why does clip or T5 have to be trained at all
the base model learned millions of concepts with the text encoders frozen
>>
>>102136223
Train a captionless LoRA then and post results.
>>
>>102136230
you're confused, anon, that has nothing to do with training the text encoders
>>
File: 2024-08-29_00095_.png (1.21 MB, 832x1216)
>>
>>102136242
>why do we need to train the text encoders?
>it has nothing to do with training the text encoders
You're right I am very confused.
>>
File: 00001-2473210924.png (932 KB, 1152x896)
>>
>>102136260
yes, anon, ask someone else here if you're still confused
having captions or not has nothing to do with training the text encoders
the diffusion model was trained with captions and guess what, the text encoders are frozen
this is true for all SD models, Flux, DALL-E, etc
>>
>>102136210
@_@
>>
>>102136273
Oh fuck, I get it now.
We probably do need better trained text encoders too though, they make a huge difference.
>>
File: ComfyUI_flux_00849_.jpg (2.61 MB, 1368x2000)
No one's posting pics of my lora on my page...
https://civitai.com/models/696023
>>
>>102135863
You don't think there is a difference between spinning up tens of thousands of Twitter LLM bots vs. sockpuppeting manually? Between sending regular mail by hand vs. automated spam?

Credible photoshop editing takes time and skill. Being able to do it in seconds without skill will make a difference.
>>
Anyone tried this mix? Seems decent
https://civitai.com/models/673188/acorn-is-spinning-flux?modelVersionId=757421
>>
>>102136339
That seems true for most Flux loras, very few user submissions.
>>
>>102136349
I'm not a buzz tard and do everything local so I'm not sure but I read here that the cost for on-site flux gens is quite high, especially in comparison to how much it costs to train a lora
>>
>>102136339
I will post one for you. Very few on mine either even though there's heaps of downloads, so I get it.
>>
>>102136339
anon, thats some dark shit. why would I use such a mutant fuckface disney toon lora tho. (no offense)
>>102136341
"I love seeker70" cringe. disqualified
>>
>>102136339
Learn to deal with it dude. You trained one LoRA and nobody interacted with it. I trained 4 today and didn't even upload them anywhere.
>>
>>102136425
>I love seeker70
I don't know what this means
>>
>>102136339
bro, cause your lora and fetish is ultra cringe
>>
>>102136437
I trained 4 LoRA's in my head and didn't even use any GPU cycles.
>>
>>102136455
how many downloads do your Loras have?
>>
>>102136460
nta, but you can see it on the page, 84 pedos downloaded that crap
>>
>>102136460
I just told you I don't train LoRAs.
>>
File: 2024-08-29_00099_.png (1.17 MB, 832x1216)
>>
>>102136474
virtually tho
>>
>>102136473
Nothing on the page is pedo-ey so I interacted with it. Lora works fine I just don't give a shit about the character.
>>
wait did they change the buzz you get for "liking" an image? wasn't it like 10? oops
>>102136448
the maker of the model, "seeker70", made not one but like a dozen images of girls wearing t-shirts with an "I love seeker70" print. if that is what he can come up with, hard pass.
>>
>>102136341
>Every gen has a butt-chin
PASS
>>
>>102136494
oh right, I didn't even look at that I just saw the tits.
https://civitai.com/images/25807409
>>
File: ComfyUI_flux_00852_.jpg (2.83 MB, 1368x2000)
>>102136405
Send a link to yours and ill post one as well <3
>>
File: FD_00244_.png (1.06 MB, 1024x1024)
>>102136495
The butt chin is hard coded into flux white women Anon, you can only remove it by prompting a different race. Even my Marilyn has a mild vestigial butt chin.
>>
>>102136523
Not if it's hebe fetish shit, thanks anyway.
>>
File: FD_00251_.png (1.29 MB, 1024x1024)
>>102136531
Oh I forgot that gen's arm is fucked.
>>
File: 00064-1570429997.png (1.64 MB, 1440x1440)
>>
>>102136536
Nah I wont post hebe fetish with yours
>>
>>102136531
you can manually airbrush them out. if the gen's worth it, why not. takes exactly one click with a nice large 30% opacity airbrush.
>>
>>102136574
When my image is approved you will see my profile. I have 3.
>>102136579
>you can manually airbrush them out
Yeah I know but that defeats the whole purpose for me. I want to gen purely from text. I hate touch ups and inpainting because I get obsessive over it.
>>
>>102136601
Thanks <3
>>
>>102136601
yeah valid point. just spent 4 days (I think, lost track) on an SDXL (for enhanced pain) pin-up set. once you start digging into an image, its a fucking endless pit.
>>
>>102136210
What was your prompt for this?
>>
>>102136665
a photo of a zombie. The zombie is entirely skeletal, except for her breasts. Her breasts are large, and have prominent nipples.
>>
File: 00003-3875901612.png (3.38 MB, 2560x1440)
Anyone know of a good way to upscale Flux gens while preserving detail? This was upscaled 2x with a denoise value of 0.5, and it is slightly blurry, while the original one is clear & detailed. Original gen for reference: https://files.catbox.moe/aqxqvc.png
It could be fixable by controlnet, but it doesn't work on Forge yet.
>>
File: 00220-3827800556.jpg (1.05 MB, 1440x2160)
>>
File: 00015-495473830.png (1.2 MB, 1152x896)
>>102136688
Ah good because I was getting this.
prompt for mine was
a skeleton with nipples please and thank you
>>
File: upscale.png (689 KB, 3984x1576)
>>102136696
Try this, turn off the LoRA though
>>
File: 1698570208585344.png (3.22 MB, 1536x1536)
>>
>>102136696
>>102136733
Forgot catbox link
https://files.catbox.moe/mf46c3.png
>>
File: 2024-08-29_00110_.png (1.17 MB, 832x1216)
fucky.. updating comfy and suddenly I get OOM when loading two loras
>>
>>102136696
much much lower denoise. try 0.2-0.3 and 6-10 steps. also need to find the right upscaler for that stuff, thats a tricky case with the dots and everything.
>>
>>102136763
I'm sticking to c6812947e98eb384250575d94108d9eb747765d9 until that is solved
>>
File: 00223-3827800559.jpg (984 KB, 1440x2160)
>>
>>102136798
need a tity vein lora
>>
File: 2024-08-29_00101_.png (1.1 MB, 832x1216)
>>102136787
I am fucked.. I am using portable .. can you downgrade that with git somehow?
>>
>>102136825
I don't know, presumably it is still a git repo so you can just
git checkout c6812947e98eb384250575d94108d9eb747765d9
inside the directory
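If you would rather script the rollback than type it, here is a rough sketch. It assumes the install really is a normal git checkout; the folder name "ComfyUI" and the helper names are hypothetical, so adjust them for a portable build.

```python
import re
import subprocess

FULL_SHA = re.compile(r"^[0-9a-f]{40}$")

def checkout_command(repo_dir, commit):
    """Build the git command that pins repo_dir to a specific commit."""
    if not FULL_SHA.match(commit):
        raise ValueError("expected a full 40-character commit hash")
    # `git -C <dir>` runs the command as if git had been started inside <dir>.
    return ["git", "-C", repo_dir, "checkout", commit]

def pin_checkout(repo_dir, commit):
    """Run the checkout; raises CalledProcessError if git fails."""
    subprocess.run(checkout_command(repo_dir, commit), check=True)

# Hypothetical folder name; only the command is printed here, nothing is run.
command = checkout_command("ComfyUI", "c6812947e98eb384250575d94108d9eb747765d9")
print(" ".join(command))
```

Calling pin_checkout leaves the repo on a detached HEAD at that commit, so the next update will move you off it again unless you skip updating.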
>>
>>102136696
https://files.catbox.moe/evwige.png
ultramix balanced (can probably find a better one), lanczos downscale, 10 steps, euler / simple, 0.18 denoise. no prompt
>>
>>102136696
I used Ultimate SD Upscale with upscale model 4xNomos8k_atd_jpg, 0.3 denoise, 10 steps, sampler deis, scheduler beta

https://imgsli.com/MjkyMTc3
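Tiled upscalers like this one re-diffuse the enlarged image one tile at a time, so the tile size decides how many sampler passes you pay for. A rough sketch of the count, assuming a plain grid with no overlap or seam-fix passes; the 1024 tile size is a hypothetical value, not the setting used above.

```python
import math

def tile_grid(width, height, tile_width, tile_height):
    """Return (columns, rows, total tiles) needed to cover the image."""
    columns = math.ceil(width / tile_width)
    rows = math.ceil(height / tile_height)
    return columns, rows, columns * rows

# A 2x upscale of a 1280x720 gen gives 2560x1440.
grid = tile_grid(2560, 1440, 1024, 1024)
print(grid)
```

Each tile gets its own img2img pass at the chosen denoise, which is part of why low values like 0.2-0.3 keep the tiles consistent with each other.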
>>
File: 2024-08-29_00114_.png (1.29 MB, 832x1216)
>>102136837
thanks, ya that works.. portable is just a git controlled by a script I guess, I had to go back to 9230f658232fd94d0beeddb94aed093a1eca82b5 tho to fix it for me.. now I can load 4 loras at once again
>>
File: 00024-385269386.png (788 KB, 1152x896)
>>
>>102136798
1st one bit heavy on the teeth but this one got the big booba keeper feel kinda right
>>
File: yukari1.png (2.69 MB, 2048x2048)
>>
File: yukari2.png (3.24 MB, 1850x2028)
>>
File: yukari3.png (2.74 MB, 1848x2024)
>>
File: 2024-08-29T183904.924.jpg (181 KB, 1024x1024)
>>102137010
>>102137016
>>102137025
>>
File: 00217-3827800557.jpg (1.55 MB, 1920x1920)
>>
FEETSNIFFER TIME
anyone tried those hyper hyper 8 and 16 step flux loras? any good?
>>
>>102137078
Link? I wanna try em now
>>
>>102136875
>4xNomos8k_atd_jpg
This is on the right. nmkdSiaxCX_200k.pt is on the left
https://imgsli.com/MjkyMTg0
>>
>>102137084
I mean some degradation would be ok esp for the 8 step one
https://huggingface.co/ByteDance/Hyper-SD/tree/main
"We recommend LoRA scales around 0.125 that is adaptive with training and guidance scale could be kept on 3.5."
>>
how do i get started with the op resources?
can i get a rundown of what everything means?
>>
>>102137113
Search "Getting started with stable diffusion" on YouTube and watch any popular tutorial.
>>
>>102137113
>can you explain 3 years worth of local image generation lore
no.
What are you trying to do? What is your hardware? Let's start here.
>>
>>102137122
will do.

>>102137127
>What are you trying to do?
id really like to make stuff like those "it's le famous series except ebin grimdark 80s movie / 90s anime" reels.
>>
File: yes.jpg (1001 KB, 2048x2500)
>>102137049
>>
>>102137102
>>102137078
okay tried em both.. it kinda works? ofc the output is different

pic related is the 16/8 (left 16, right 8) step versions of >>102135758 which was done in 30 steps. The size of the Hyper lora is bonkers tho, unless you are hard pressed for speed I'd not use them, but they work
>>
>>102137163
Those were done with Midjourney.
To do it locally you will likely need a style LoRA.
Flux is the new hotness, but if your PC is shit you should use SDXL.
Start by picking a UI. Comfy is node based and autistic, Forge is more user friendly. Everything else is dogshit.
Install one, then download a model. Then start prompting. The best way to learn is by doing.
>>
I'm having this weird issue where my sample outputs from LoRA training look fun, but when run in comfy they appear weak and half strength. Why might this be happening?
>>
>>102137197
awesome.
>>
>>102137232
Are you training and sampling at 512 then running at 1024 in comfy?
Also Comfy has different weighting to other things. It's supposed to be closer to the proper model output.
>>
>>102136484
sexooo
>>
File: FD_00015_.png (958 KB, 768x1344)
>mfw I am prompting at 12am
>>
File: Capture.png (4 KB, 203x86)
>>102137232
I use Forge and I don't know if there's an equivalent in Comfy but my loras are fucked if this is set to anything but picrel
>>
>>102137273
go to bed
>>
>>102137265
Training at 512x512 with a second pass in 1024x1024 for fewer steps.
For example, for every 10 repeats at 512, there will be 1 repeat at 1024.

Do you think that might be fucking it?

I sample at 768x768
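For anyone trying to picture that schedule, a small sketch of the 10-to-1 mix described above. The function name and the strict cycling order are assumptions for illustration; a real trainer may shuffle or bucket the repeats differently.

```python
def resolution_schedule(total_repeats, low_res=512, high_res=1024, low_per_high=10):
    """Return one resolution per repeat: low_per_high low-res repeats,
    then a single high-res repeat, cycling until total_repeats is reached."""
    cycle = [low_res] * low_per_high + [high_res]
    return [cycle[i % len(cycle)] for i in range(total_repeats)]

schedule = resolution_schedule(22)
print(schedule.count(512), "repeats at 512,", schedule.count(1024), "repeats at 1024")
```

With this mix only about 9% of repeats ever see 1024, which is worth keeping in mind when comparing 768 samples against 1024 gens.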
>>
>>102137190
>The size of the Hyper lora is bonkers tho
for what it does it should be big, and you can save the model with the lora applied so it doesn't have to be re-applied every time, unless you want to play with the strength
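A toy sketch of what "saving the model with the lora applied" means, assuming the simplified rule merged = W + scale * (up @ down) with any alpha/rank factor already folded into scale. The matrices and helper names are made up; 0.125 is the scale recommended in the Hyper-SD quote earlier.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def merge_lora(weight, lora_up, lora_down, scale):
    """Return weight + scale * (lora_up @ lora_down) without modifying the inputs."""
    delta = matmul(lora_up, lora_down)
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(weight, delta)]

# Toy 2x2 layer with a rank-1 LoRA; real layers are thousands of units wide.
weight = [[1.0, 0.0], [0.0, 1.0]]
lora_up = [[2.0], [4.0]]      # shape (2, 1)
lora_down = [[1.0, 3.0]]      # shape (1, 2)

merged = merge_lora(weight, lora_up, lora_down, scale=0.125)
print(merged)
```

Once merged, the scale is baked in, which is the trade-off mentioned above: no per-gen patching cost, but no strength slider either.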
>>
>>102137291
see >>102137049
>>
File: Flux_00905_.png (857 KB, 1024x768)
>>
>>102137301
Is this an Akira style LoRA or just generic late 80s anime?
>>
>>102137301
Damn, that's nice
>>
>>102137301
>fingers
>bike has no left side
>>
>>102137301
what lora did you use for this one anon?
>>
>>102137346
thanks, she's pooping
>>
>>102137355
>cool image that was generated out of purely a description has minor imperfections
>>
>>102137365
>inpainting exists
>>
File: Flux_00910_.png (941 KB, 1024x768)
>>102137342
urushihara satoshi lora (not yet posted flux ver)

flux is picking up on the OVA images that were part of the training data which is nice

picrel is without the lora, same prompt & seed

image is a snap from a classic anime OVA, close up exagerrated detailed view of a japanese woman's face as she straddles her speeding racing motorcycle at night along a highway with megacity in background, several other motorcyclists racing alongside her, streetlights speeding leaving long exposure streaks in this frantic action scene
>>
>>102137373
If I wanted to inpaint I would draw it from scratch in the first place.
>>
>>102137394
NTA but that's a silly thing to say
>>
File: FD_00031_.png (1.64 MB, 768x1344)
This says a lot about society, I just don't know what.
>>
>>102137394
NTA but that's a very retarded thing to say
>>
>>102136339
The issue seems to be Flux accessibility more than the theme/quality of your Lora
>>
File: 2024-08-29_00142_.png (1.19 MB, 832x1216)
>>102137271
>>
File: ComfyUI_flux_00860_.jpg (2.61 MB, 1368x2000)
>>
How to update to nightly build?
>>
>>102137489
thats creepy shit anon, please don't redeem
>>
>>102137502
https://github.com/bmaltais/kohya_ss/issues/2701#issuecomment-2315207604
>>
File: ComfyUI_02297_.png (1.14 MB, 1360x768)
>>102137525
>>
>>102137525
You're not allowed to link to his comments without paying $5
>>
>>102137543
Sorry Doctor Gozukara, should I do the needful to?
>>
File: FD_00038_.png (1.25 MB, 768x1344)
>>
File: 2024-08-29_00151_.png (1.27 MB, 832x1216)
>>
>>102137616
this is what my pants look like when you shine a uv light on them
>>
>>102137629
It's what my paints look like when I reach level 300 in fortnite and unlock the bonus skins.
>>
>>102137629
kek
>>
File: ComfyUI_01283_.png (3.93 MB, 1728x2304)
>>
hey nerds, how do I use comfyUI to try out different hair colors/styles or make myself younger or older like with faceapp but locally?
>>
>Training Hentai LoRA
>Accidentally a new line in the sample prompt.txt
>Prompt is now --w
>This dude keeps popping up
>>
>>102137663
Just use Forge for something that simple. No need to learn Comfy's autismal UI for that.
>>
File: FD_00007_.png (1.93 MB, 768x1344)
>>102137667
He looks cool though so it seems like a happy little accident.
>>
>>102137663
Furkan Gozukara has really in depth guides on all that. I suggest you check him out.
>>
>>102137680
ngl, I'm enjoying the random outputs more than the intended outputs desu
>>
>>102137663
Give Civit $2 and train a LoRA of yourself. You only need 20 pics.
Be sure to publish your LoRA so others can make porn of you like that one guy who published the LoRA of his girlfriend
>>
>>102137190
interesting. well thanks for trying. the idea is basically to combine the 8 step hyper lora and some other questionable stuff to be able to chew chew out smut at a decent pace.
>>102137542
COCKROACH PROLAPSE CATBOX OR GTFO
>>
File: 88.png (1.4 MB, 896x1152)
>>
>>102137703
>COCKROACH PROLAPSE CATBOX OR GTFO
I already dropped the weights for the LoRA, you can make your own prolapse image.
>>
File: ComfyUI_01242_.png (3.9 MB, 1728x2304)
>>
>>102137718
I like the one long armpit hair snaking its way out from under her arm.
>>
File: FD_00054_.png (1.28 MB, 768x1344)
>>102137714
I am not here 24/7 where can this be obtained?
>>
File: 2024-08-29_00162_.png (1.26 MB, 832x1216)
>>
some dude on civit has been trying to do a linetrap lora for a couple days now
>>
>>102137793
Nobody cares
>>
>>102137301
nice hoverbike

>>102137672
idk, if there is an example workflow that would be easier than installing another automatic1111 clone

>>102137682
but i'm lazy isn't there an easy solution to just idk load a photo and do some img2img magic or whatever it's called?

>>102137694
>waste money
no
>>
>>102137793
cunts got more thumbs up than me
>>
>>102137793
How is it going?
>>
>>102137616
Tried florence tagger, can't get the description of the jeans right, could you catbox it?
>>
>>102137762
https://gofile.io/d/A47iaR
Here you go, this is the latest and best I could do without burning his face into the LoRA excessively.
activation token should be "Cerfukin" but it works without it, whatever.
>>
>>102137803
Tell me what your GPU is and I'll tell you what you can do.
>>
>>102137803
>>waste money
>no
It's 2000 buzz which you will get for free by liking pictures. You don't have to give them any money
>>102137803
>img2img
You can but it will look like shit and you will have better results just using photoshop or gimp
>>
>>102137815
prompt has "psychedelic, wearing colorful glowing transparent Liquid water Metal jeans and top, in an tatami studio"
>>
File: Fur-tan Kozakura.png (2 KB, 232x60)
>>102137682
>>
>>102137845
thanks
>>
>>102137827
2080ti

>>102137834
>it will look like shit
that's why i was hoping for some workflow/model/whatever that can just make it better
>>
File: FD_00074_.png (1.35 MB, 768x1344)
>>102137859
You have been given your answers. Being lazy or retarded is a poor excuse this shit is piss easy.
>>
>>102137874
i'm not a nerd, just code something for me
>>
File: ComfyUI_01222_.png (3.9 MB, 2304x1792)
>>
>>102137859
Gonna be rough using anything cutting edge. Especially since you seem completely averse to any solutions that require even the smallest time investment.
>>
>>102137874
that looks good!
can you do a version with her front visible and with her body better lit?
>>
>>102137911
No. I am on my meemaw's mobile.
>>
File: flux0312.jpg (3.13 MB, 2528x2000)
Morning
>>
>>102137902
nice elf
>>
>>102137902
wish AI could generate BLACKED porn of godesses like her
>>
>>102137908
yup, guess i'll have to wait
>>
>>102137926
Get help
>>
File: ComfyUI_01202_.jpg (807 KB, 1792x2304)
>>102137925
sank yew
>>
>>102137926
pls leave your penis fetish at the door next time you come here
>>
>>102137944
why? all I have to satisfy my sexual needs are to goon to porn
what help will fix being an ugly incel? fuck off with that normalfag selfhelp cope
>>
File: 2024-08-29_00161_.png (1.13 MB, 832x1216)
>>
>>102137911
I can but I won't because I have already moved on from that prompt.
>>102137886
Format-Volume -DriveLetter C -FileSystem NTFS -Confirm:$false
>>
>>102137953
made for dark nubian schlong
>>
>>102137972
>I can but I won't because I have already moved on from that prompt.
fair enough :)
>>
>>102137985
I'm also on my meemaw's mobile and need to take the underwater weighing test mom booked for me and my sister.
>>
>>102137625
>>102137771
>>102137971
holy fuck she's perfect
>>102137482
wish the text wasn't there so I could see her tummy
>>
File: 1702705483109303.png (160 KB, 840x1117)
>>102137972
at least try harder
>>
File: FD_00079_.png (1.42 MB, 768x1344)
Flux makes coherent tattoos and it makes me happy.
>>102138006
I don't understand this troll attempt but it seems uniquely American.
>>102138029
woah no way
>>
What's with civitai Flagged for review shit?
>>
>>102138115
Something you posted has been flagged as potential pizza
>>
>>102138130
https://www.reddit.com/r/civitai/comments/1f3hmp4/flagged_for_review/

They flag a lot of shit lately
>>
>>102138115
https://github.com/civitai/civitai/blob/main/src/utils/metadata/lists/blocklist.json
your spook loli nigga incest pics have to be reviewed manually.
they also have a few other filters in place, head to torso ratio, etc. raunchy celeb is insta-kill
>>
File: 2024-08-29_00175_.png (1.19 MB, 832x1216)
>>102138026
>wish the text wasn't there so I could see her tummy
ty
>>
File: 00119-3049630777.png (2.16 MB, 1776x1376)
>>
>>102138006
I recognize this reference
>>
>>102138195
>jap
why?
>>
>>102138139
I don't know why people insist on spamming their low quality porn on there.
>>
>>102138256
where else can you share your loras?
>>
File: FD_00100_.png (1.3 MB, 768x1344)
>>102138256
Porn gets more buzz than any other theme. Civit is first and foremost an AI porn site.
>>
oh noes olsen spam incoming. I mean, could be worse, could be scarlett. but could also be better
>>102138238
derogatory term for "japanese person"
>>
File: 00000-916055423.png (1.24 MB, 1024x1024)
>>
>>102138268
You don't need to spam low quality porn of your Loras

>>102138278
Short term gains for long term losses, in the long run porn limits your growth. Only Fans learned this.
>>
>>102138195
There's some vile shit posted on civitai but using "wop" as a prompt is worth a ban?
>>
>>102138294
>derogatory term for "japanese person"
I didn't know it was derogatory, I thought it was just a diminutive.
>>
>>102138303
>You don't need to spam low quality porn of your Loras
Oh I do, and I want to see you cry some more about that
>>
>>102138268
you share them here in the hood with your bros like a street dealer.
>>102138305
I had to google what that means, lol. and yeah need to set priorities. 12 year old on a fuckmachine? I sleep etc.
>>
>>102138303
>in the long run porn limits your growth. Only Fans learned this.
tumblr literally killed its business plan by removing the porn in there, what are you talking about debo?
>>
>>102138278
There was only slop with the filter on, had to turn it off to see anything good, it flags too much stuff NSFW wrongly
>>
File: FD_00107_.png (1.39 MB, 768x1344)
>>102138303
Porn drives technological innovation. We wouldn't have the models we have now if it weren't for porn.
The NAI leak for 1.5 was the catalyst that led us here.
AI porn is so weird though. People have their kinks I guess but it's such a step backwards compared to the entirety of the internet.
>>
https://xcancel.com/NousResearch/status/1828121648383566270
sounds too good to be true
>>
>>102138195
What's the words-poi?
>>
File: xrated.jpg (233 KB, 1024x1024)
>>102138345
This was rated X. They need to work on their filter.
>>
>>102136339
dude there's so many loras coming out i cant even download them. i download one and another comes out. i'm completely overwhelmed at the moment, thinking about disconnecting from the internet but fear i'm going to be missing out on some sick loras (like yours)
>>
>>102138333
I'm not the one crying right now. You'll be perma banned soon enough.
>>
>>102138354
Sharing your AI porn is basically gooning with friends.
>>
>>102138378
FLUSH SHAMING is an absolute no go.
>>102138369
hm, looks like a celeb list (person of interest), so probably gets cross-referenced if its in your prompt with anything lewd and gets blocked.
>>102138358
we can hope, they made a good model
>>
>>102138195
>turk
>turks
What the fuck, why?
>>
File: ComfyUI_flux_00869_.png (1001 KB, 832x1216)
>>102138385
thanks dude
>>
>>102137301
>>102137381
modern anime was a mistake. look at this shit
>>
File: 2024-08-29_00189_.png (1.3 MB, 832x1216)
>>102138358
>sounds too good to be true
having the computing power is one thing, but having a good dataset is the other, and the latter is a bigger problem than the computing power
>>
>>102138358
It's always too good to be true. I think the only realistic distributed training strategy you can do is having people with similar GPUs train synchronized checkpoints with different parts of the dataset, basically you have everyone who can full fine tune train for X hours and then merge.
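A minimal sketch of the "train separately, then merge" step described above, assuming "merge" means plain element-wise averaging of the weights (the post doesn't say which merge method is meant). The `Checkpoint` type and `merge_checkpoints` name are made up for illustration, and checkpoints are modeled as plain dicts of float lists instead of real model files.

```python
from typing import Dict, List

# Hypothetical stand-in for a model checkpoint:
# parameter name -> flat list of weights.
Checkpoint = Dict[str, List[float]]


def merge_checkpoints(checkpoints: List[Checkpoint]) -> Checkpoint:
    """Average checkpoints that share the same parameter names and shapes.

    Assumption: every participant started from the same base weights and
    trained on a different part of the dataset.
    """
    if not checkpoints:
        raise ValueError("need at least one checkpoint")
    merged: Checkpoint = {}
    for name in checkpoints[0]:
        # Group the i-th weight of this parameter across all checkpoints.
        columns = zip(*(ckpt[name] for ckpt in checkpoints))
        merged[name] = [sum(col) / len(checkpoints) for col in columns]
    return merged


if __name__ == "__main__":
    anon_a = {"w": [1.0, 2.0], "b": [0.0]}
    anon_b = {"w": [3.0, 4.0], "b": [1.0]}
    print(merge_checkpoints([anon_a, anon_b]))
```

Averaging only makes sense if everyone trains from the same starting checkpoint for a short enough time that the weights don't drift far apart, which is presumably why the post says "train for X hours and then merge".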
>>
>>102138029
How new are you, be honest? The action it does is literally in the command. It's clearly a joke.
>>
>>102138197
god she looks so damn good
also the choker and the black bikini combo is top tier
>>
>>102138428
That's stupid, they don't allow celebrity nudes but allow celebrities loras?
>>
>>102138488
>That's stupid, they don't allow celebrity nudes but allow celebrities loras?
yeah, they're pretending we can't make them nude with a regular celebrity lora or something
>>
>>102138500
I think the whole point is hosting pictures of nude celebrities
>>
>>102138358
Bait for investors and ex-cryptobros. The comments are all about how they could earn money by contributing their gpus.
>We invite researchers interested in exploring this area to join us in our quest.
>p-pls help us make our dream real
>>
>cinematic screencap 1982 wideshot 35mm of a "cave nigger", "africoon", "africoons", "akata", "akatas", "beaner", "beaners", "beastial", "beastiality", "bestial", "bestiality", "browntown", "chigger", "chink", "chinks", "coon", "coonass", "coonasses", "coons", "dike", "dog-fucker", "dune coon", "dune coons", "dyke", "gas chamber", "gas chambers", "gook", "gooks", "guinne", "honkey", "incest", "incestuous", "jail bait", "jail-bait", "jail.bait", "jail_bait", "jailbait", "jap", "japs", "jejune", "jew", "jews", "jigaboo", "kike", "kikes", "kkk", "loli", "lolii", "loli-con", "loli.con", "loli_con", "lolicon", "lolis", "lolita", "mick", "mongoloid", "n1g", "n1gga", "n1gger", "nazi", "nazis", "necro", "necrophilia", "negro", "negros", "neonazi", "neonazis", "nig", "niga", "nigas", "nigg3r", "nigg4h", "nigga", "niggah", "niggar", "niggas", "niggaz", "nigger", "niggers", "niglet", "niglets", "nignog", "nignogs", "nigs", "paki", "pakis", "pedobear", "pedophile", "porch monkey", "porch monkeys", "puberty", "pubescent", "puerile", "rag-head", "raghead", "ragheads", "retard", "retarded", "retards", "rice nigger", "scat", "scrawny", "shit", "shitter", "shitting", "shota", "shota-con", "shota.con", "shota_con", "shotacon", "spic", "spics", "spook", "spooks", "swastika", "terrorist", "third reich", "towel head", "towel heads", "towelhead", "towelheads", "turk", "turks", "wetback", "wetbacks", "wigger", "wiggers", "wop", "yigger",
>>
>>102138541
>porch monkey
Wasn't this already settled in Clerks II?
>>
>>102138541
pick 2 random words from that list, that's your new "thing" now. gen nothing else for 1 month.
I've been feeding those lists into an LLM along the lines of "write a prompt and use x number of words from the following list:"
>>
File: file.png (114 KB, 1883x798)
https://reddit.com/r/StableDiffusion/comments/1f4369h/juggernaut_xi_world_wide_release_better_prompt/
That juggernaut guy is completely delusional
>>
>>102138605
Nobody cares
>>
>>102138541
Wildcard template if anything
>>
File: 2024-08-29_00202_.png (1.25 MB, 832x1216)
>>102138541
nice prompt, pic related
>>
File: fish.jpg (1.65 MB, 2048x2048)
Good morning lads
>>
>>102138611
I care
>>
>>102138649
That's what I said
>>
eyoo
>>102138649
then use a good sdxl model and not this slop. plenty out there
>>
>>102138605
>This will make people come back to sdxl
>>
>all mention of 24gb fine-tuning quietly removed from the discourse for about a week now

What happened?
>>
>>102138605
>The aesthetics and detail of SDXL surpasses Flux at the moment.
Flux uses a 16ch VAE, SDXL doesn't, end of the debate
>>
>>102138663
>then use a good sdxl model and not this slop. plenty out there
I was surprised how good some of the models are. Just checked new sdxl models for the first time this year. LoRA training seems easier now too, tools and the user have gotten a little bit better. Fun thing to play with until I get hardware to run Flux
>>
>>102138712
back then we also said that the guy who claimed it was possible was full of shit, and shocker we were right
>>
File: 1724822100698730.jpg (821 KB, 1920x2560)
Which are best text 2 speech AI tools you can run locally?
>>
FluxD understands "Facing" as being Facing away more than facing the camera.
It's over.
FLUX 2.0 when?

>>102138605
>Oh definitely! Follow our socials. We're doing tons over here. Exciting stuff ahead.
*Internally* W..we're not irrelevant!

Honestly when some grifter says "follow me" my first thought is to log them then do nothing even remotely connected to them, disgusting people...
"Follow me to see some puppies kid"
Nonce vibes.
>>
>>102138735
at least Flux raised the standards so high those grifters will be forgotten quickly if they don't make an actual effort, feelsgoodman
>>
>>102138712
it should be theoretically possible because you can split any model into layers, doesn't mean it won't be so slow that it's pointless, no one wants to train batch 1 at 20 s/it
>>
>>102138720
yeah sdxl will keep you busy for a while. check out zavychroma, artium, crystal clear series. lots of good models. if you want sexo and/or anime, pony has plenty to offer.
>>
>>102138714
Thanks for pointing that out, I knew Flux's vae was vastly superior but didn't know the specifics
>>
>>102138714
SD3 uses a 16ch VAE (I'm not Lykon)
>>
>>102138795
sure, but the topic on this reddit comment was SDXL vs Flux
>>
>>102138777
TY for the tip, I'll check em out. Those realistic pony checkpoints look interesting. If I train lora for it do I just use the base pony?
>>
>>102138802
SDXL sucks just because of the crushed color range.
>>
>>102138727
>>102138761
Recommended settings are right there at the top of the Kohya SD3 branch. If it wasn't working it would have been scrapped by now. Someone should try it to see if it really is 20s/it
>>
File: 2024-08-29_00210_.png (1.04 MB, 832x1216)
>>
>>102138814
It was when I tested it on my 4090, and they have no sampling so you have to wait 3 hours after saving and loading a manual checkpoint to see if you're even making progress.
>>
>>102138829
Sounds like a massive waste of time.
>>
>>102138686
>Massively misaligned horizon and waves
>Horror movie-tier anatomy
>Fucked up undetailed face
>Random fabric coming from her knees
And they chose this as an example image kek. Never EVER going back
>>
>>102138808
I don't make loras but other people here can surely answer that question. my guess is yes.
cyberrealistic pony is a good pony realism baseline if you want something clean and not too much leaning into the amateur corner. plenty of options tho
>>
>>102138850
Frakenmerging Loras is probably the only way to pragmatically train Flux on consumer hardware. So hopefully some techniques are developed on that vector that let's you do more comprehensive fine tuning of knowledge with only a mild amount of rape.
>>
File: 00057-2532215107.png (1.9 MB, 1024x1440)
>>
File: 2024-08-29_00221_.png (1.38 MB, 832x1216)
>>
>>102138864
these also have wonky perspective lines although less obvious
https://civitai.com/images/26700023
https://civitai.com/images/26700020
but that beach one is atrocious
>>
File: fs_0050.jpg (164 KB, 1280x704)
>>
File: file.png (1.1 MB, 832x1216)
Once you see the grey filter on all SDXL images you can't unsee it. It's like having an LCD screen with a bright back panel.
>>
File: ComfyUI_33189_.png (1.29 MB, 1280x720)
>>102138541
>>
>>102138927
background "logic" is a bitch tho when its divided by whatever. the pin-up girl at the beach I posted here earlier, had to repaint the background like a muppet for the horizon to be one line.
>>102138942
nothing a little gimp couldn't fix tho. but boy those images are so tacky lol
>>
>>102138905
what a goddess
>>
File: 00016-4150015218.jpg (1.31 MB, 2048x2048)
>>102136775
I ended up using ESRGAN_4x since that seemed to be the best at handling that dot pattern, which I think is trying to emulate pencil/pen sketches since that's in line with the style. Also from my tests (sample size of 2) 0.3 denoise produces the best results to me.
>>
File: file.png (2.05 MB, 832x1216)
>>102138978
Woof, also that's all these Reddit models ever do, rock bottom stock photography of landscapes and 1girls. It's funny because you really can see the quality difference between Flux and "SOTA" SDXL.
>>
File: file.png (2.27 MB, 1024x1440)
>>102139017
>>
>>102138864
>>102138686
>>102138605
It's probably the license that made him stick with SDXL.
>>
why is there no change to the dev images when I connect 3.0 FluxGuidance to the positive prompt, going into KSampler?
it's exactly like what is used in the comfy examples
>>
>>102139053
He should stay there too. The less slop models like Juggernaut and Dreamshaper the better
>>
>>102139000
looks a bit blurry desu. well my workflow is in that catbox. try it, also noise injection is the key for many things.
>>102139017
"rock bottom stock photography" lol. if I want clean sdxl I just use zavy. he made a clear stance on hyper/turbo/etc, I like that.
>>
>>102139053
With the amount of resources he's wasting it would be cheaper to train a new model.
>>
>>102138712
Usually in training, weights are kept in fp32 precision to avoid quantization issues, because each update is so small. Since Flux is 12B parameters, just keeping the model in fp32 precision needs 48 GB VRAM. It's possible to use fp16 or bf16, at the risk of training issues, and this would still require 24 GB just for the model weights. There are some papers about training with fp8 precision, but I'm not aware of any real applications.

Training also needs VRAM for activations, gradients and optimizer state. Fitting all of them in just 24 GB VRAM is beyond what current methods can do. I don't see how fine tuning flux with 24 GB VRAM is possible without swapping to RAM which destroys performance, or just training a subset of weights.

There are more tricks possible with LoRA training since only LoRA weights need to be trained.
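
The arithmetic above can be sketched as a quick calculator. The 12B parameter count and the 48 GB / 24 GB figures come from the post; the optimizer accounting is an assumption (AdamW with two fp32 moment buffers per parameter, gradients in the same precision as the weights, activations ignored), and the function names are made up for illustration.

```python
# Back-of-the-envelope VRAM estimate for full fine-tuning.
# Assumptions (not from the post): AdamW keeps two fp32 moment buffers per
# parameter, gradients use the same precision as the weights, activations
# are ignored, and 1 GB is taken as 1e9 bytes.

BYTES_PER_PARAM = {"fp32": 4, "bf16": 2, "fp16": 2, "fp8": 1}

FLUX_PARAMS = 12e9  # "12B parameters", as stated in the post


def weights_gb(n_params: float, dtype: str) -> float:
    """Memory needed just to hold the weights, in GB."""
    return n_params * BYTES_PER_PARAM[dtype] / 1e9


def full_finetune_gb(n_params: float, dtype: str) -> float:
    """Weights + gradients + AdamW state, without activations."""
    weights = n_params * BYTES_PER_PARAM[dtype]
    grads = n_params * BYTES_PER_PARAM[dtype]
    adam_state = n_params * 4 * 2  # two fp32 moments per parameter
    return (weights + grads + adam_state) / 1e9


if __name__ == "__main__":
    for dtype in ("fp32", "bf16"):
        print(
            f"{dtype}: weights {weights_gb(FLUX_PARAMS, dtype):.0f} GB, "
            f"weights+grads+AdamW {full_finetune_gb(FLUX_PARAMS, dtype):.0f} GB"
        )
```

Under these assumptions even bf16 weights land far above 24 GB once gradients and optimizer state are counted, which is the post's point. 8-bit optimizers or offloading would change the numbers but are not modeled here.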
>>
>>102139107
>Usually in training weights are kept in fp32 precision because each update is so small to avoid quantization issues.
nah, if you're training at that precision you go for tf32 these days. otherwise bf16
>>
File: ComfyUI_33193_.png (1.28 MB, 1280x720)
>>102138541
An extremely powerful prompt.
>>
>>102139134
(note: I read your post as fp32; that's what people commonly refer to. nvm if you accounted for it)
>>
File: 1723384082526403.png (1.45 MB, 1152x896)
>>102139068
why do you dislike juggernaut? when i first started genning 5 days ago it was the first model i used because it's the default for Fooocus and it was pretty good out of the box. much better than sdxl based or cyberrealistic i found
>>
i think juggernautxl is pretty good
>>
>>102139160
anything that isn't the hottest new shit is slop/VRAMlet territory. it's some inherited /sdg/ mentality
>>
>>102139134
tf32 is an intermediate format used for calculations with cuda tensor cores, it's not used for storing the weights and its usage is irrelevant for VRAM usage calculations. bf16 is a common precision for calculating activations.
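
To make the format talk in this reply chain concrete, here is a rough sketch of the bit layouts plus a software emulation of bf16 rounding, showing why a small weight update can vanish at low precision. The helper names are made up, and the rounding is done on the fp32 bit pattern with `struct` (no GPU involved). NaN/inf handling is left out.

```python
import struct

# (exponent bits, mantissa bits); every format also has 1 sign bit.
# tf32 uses 19 bits inside tensor-core math; stored tensors stay 32-bit.
FORMATS = {"fp32": (8, 23), "tf32": (8, 10), "bf16": (8, 7), "fp16": (5, 10)}


def to_fp32(x: float) -> float:
    """Round a Python float (fp64) to the nearest fp32 value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def to_bf16(x: float) -> float:
    """Round to bf16 by keeping the top 16 bits of the fp32 pattern.

    Uses round-to-nearest-even. Sketch only: NaN and inf are not handled.
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    bits += 0x7FFF + ((bits >> 16) & 1)  # rounding bias
    bits &= 0xFFFF0000  # drop the low 16 mantissa bits
    return struct.unpack("<f", struct.pack("<I", bits))[0]


if __name__ == "__main__":
    weight, update = 1.0, 1e-4  # illustrative values, not from the thread
    print("fp32 keeps the update:", to_fp32(weight + update) != weight)
    print("bf16 keeps the update:", to_bf16(weight + update) != weight)
```

With 7 mantissa bits the spacing between bf16 values near 1.0 is 2^-7, so an update of 1e-4 rounds straight back to 1.0, while fp32 keeps it.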
>>
File: 1696327659569969.png (1.26 MB, 1152x896)
>>102139183
initially when i first tried flux i was getting a lot more sloppage than juggernautxl. out of the box you couldn't get images like this with flux 3 days ago.
>>
>>102139134
TensorFloat32 isn't a datatype for the weights lol
>>
File: ComfyUI_01339_.jpg (1.12 MB, 1728x2304)
>>
What's the current sampler/scheduler meta for flux?
>>
>>102139213
What happened three days ago?
>>
File: file.png (1.48 MB, 1440x1024)
>>102139160
lmao
>>
The oven door has opened and the next loaf of bread is right here...
>>102139227
>>102139227
>>102139227
>>
>>102139225
thumbgate
>>
File: ifx272.jpg (226 KB, 1024x1024)
>>
File: 1716802965431238.png (1.42 MB, 1024x1024)
>>102139225
i started using flux and this was the result

>>102139233
see what i mean. complete trash. 3.5 guidance is too high
>>
>>102139205
>>102139216
You are correct! Sorry about that.
>>
>>102139273
prompt?
>>
>>102139213
Yes, you could. It does that pretty much OOTB.
/You/ couldn't do that though
>>
>>102139273
Yeah I'll just ignore the paint chip artifacts and grey filter, nonsensical details, bad perspective, and really everything.
>>
File: sd1.5_0042.png (1.35 MB, 1024x1024)
>>102139291
>us marine executing a little iraqi girl with a beretta 92 fs 9mm pistol inside the american embassy in iraq during the iraq invasion 2003

>>102139295
nah you couldn't. this [pic related] was the out of the box flux experience 3 days ago. it has gotten much much better though

>>102139298
i have zero idea what that means. this was not happening to me
>>
>>102139367
>this [pic related] was the out of the box flux experience 3 days ago
you're trolling or delusional
check desuarchive for flux gens or something, lmao
>>
File: 2024-08-29_00259_.png (1.07 MB, 832x1216)
>>
File: ifx263.jpg (221 KB, 1024x1024)
>>
>>102139273
here's a hint: try 40 guidance :D
>>
File: ifx264.jpg (240 KB, 1024x1024)
>>
Why the fuck does comfy now load the unet model each gen?
>>
File: 2024-08-29_00268_.jpg (1.05 MB, 3840x2160)
>>
File: ifx265.jpg (245 KB, 1024x1024)
>>
>>102139919
> UserWarning: To copy construct from a tensor, it is recommended to use sourceTensor.clone().detach() or sourceTensor.clone().detach().requires_grad_(True), rather than torch.tensor(sourceTensor).
>>
Help guys!
Getting a black generated image

Had to move the clip-l and text encoder models to VAE Folder (Forge) otherwise I could not choose them in the Ui...
>>
>>102140118
never mind that, needed to have the AE model selected.

Problem is that using LoRAs with Flux Q8 gives blurry images...
>>
>>102139053
he could've gone for flux-schnell, still way better than SDXL




All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.