[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (1.07 MB, 3264x3264)
1.07 MB
1.07 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102092937

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
We shall leave our shitflinging to our last thread
>>
>>102095590
sorry there's a couple of people with extreme depression that find something to bitch about every thread
they basically hate anyone doing anything but they bitch because no one is doing anything
>>
>>102095564
>it is bad because it is bad and synonym of bad
>(insert opposing axiom here)
I told you what would happen if you tried to formulate a rational argument. You can't. You programming does not allow you to think in those terms. You're like a low beaked model trying to solve a riddle. You're not going to get anywhere, and I don't want to see you try, so let's leave it at that.
I'm a bad person because I defended bad thing. You won discussion because you said how bad it was.
>>
>>102095590
Reminder: if the poster you're arguing with seems too retarded to be real, it's because they are. Schizo baiting is rampant these days, don't let it tempt you. Take a moment to consider why you even need to convince some retard of a different perspective in the first place.
>>
Post your s/it if you use AMD
>>
File: 2024-08-26_00297_.jpg (1.41 MB, 3840x2160)
1.41 MB
1.41 MB JPG
>>102095567
ty baker
>>
>>102095648
>if the poster you're arguing with seems too retarded to be real, it's because they are
Very good point
>>
>>102095641
just say you're an authoritarian and move on, it's not that hard
>>
File: 00004-3073918038.jpg (1.74 MB, 1680x2980)
1.74 MB
1.74 MB JPG
power of 1girl compels you
>>
>>102095649
>if you use AMD
whose gonna tell him...
>>
>>102095581
lol that's not how it works. You have no authority, and neither do I. How we get an authority to do the policing is another topic entirely. But this is not a political forum and you are not capable of holding this discussion in a civilized manner.
So reply one last time with another platitude (unless you manage to hold your emotional conditioning long enough to let me have the last word) and that will be the end of it.
>>
>>102095641
Rationally speaking, justifying things with the logic of "it's for the greater good of society" can lead to things like genocide.
Would society be better if nobody had certain thoughts? Yes.
That doesn't mean it's morally sound to kill everyone with those thoughts.
>>
>>102095688
>>102095668
Read:
>>102095648
>>
>>102095671
The fact that flux is shit at lingerie is so fucking sad
>>
File: 2024-08-26_00301_.png (1.19 MB, 720x1280)
1.19 MB
1.19 MB PNG
I just leave this here.. wonder if someone will pick it up. Back to scifi gens
>>
>>102095702
>everyone I don't like is a schizo
>>
>>102095692
>Take a moment to consider why you even need to convince some retard of a different perspective in the first place.
>>
>>102095569
care to elaborate on the process and the results? maybe also their feedback, if there was any.
>>
>>102095714
you're just a proomptlet
https://github.com/ataylorm/FluxAIGridComparisons/blob/main/LingeriePrompts/FullGrid.jpg
>>
>>102095715
Maybe 24B
>>
File: 00025-3109227115.png (1.28 MB, 808x1216)
1.28 MB
1.28 MB PNG
>>
>>102095722
everyone has good intentions, even Hitler and Stalin thought they were the good guys
>>
File: duty_calls.png (14 KB, 300x330)
14 KB
14 KB PNG
>>102095722
>>
>>102095744
4chan has become Reddit so gradually I had not realized until now.
>>
>>102095751
you're joking or something? 4chan was the origin of many memes back in the days, he's literally bring up the spirit of old 4chan
>>
>>102095744
kek love that image, too real every time
someone make it in flux
>>
File: bComfyUI_110852_.jpg (296 KB, 1920x1088)
296 KB
296 KB JPG
>>
File: grid-0086.jpg (327 KB, 1536x2304)
327 KB
327 KB JPG
>>102095714
not flux, 1.5
>>
We probably won't have a model capable of doing good lingerie until we have a model trained exclusively on extremely high resolution imagery.
>>
File: 2024-08-26_00308_.png (1.23 MB, 832x1216)
1.23 MB
1.23 MB PNG
>>102095737
ya, but atleast it labels the panels nicely.. but if you wanted to .. you could just gen the individual panels with pretty accurate detail and stick em together
>>
File: SEI_121965633.jpg (448 KB, 2048x1366)
448 KB
448 KB JPG
>>102095516
Hi, Mao, you are still alive? Come see me sometime bro.
>>
>>102095840
kek
>>
https://civitai.com/models/681566
>heres your cool flux loras bro
>>
>>102095736
Cool link thanks anon
>>
>>102095910
... pepsi can rendering, anal prolapse and gore lora for flux. I guess we are reaching new heights of artist expression this evening.
>>
File: 00005-3073918038.jpg (1.46 MB, 2160x2160)
1.46 MB
1.46 MB JPG
>>
File: ezgif-1-386276b82d.png (501 KB, 1024x1024)
501 KB
501 KB PNG
Schnell went full Jeet KEK
This big the important
>an image from the side of a cute anime girl with green hair sitting at a desk infront of a PC. above her is a speech bubble that says "I can't, this is important - someone is WRONG on the internet". the word WRONG is underlined to emphasize it. behind her, in the opposite direction, is a speech bubble that reads "are you coming to bed?" then another beneath that, also from the opposite direction, that says "what?". the image has a manga, painterly style. the girl's hair has 4 very short pigtails, two on the top and two on the bottom, like the yotsuba 4chan mascot.
>>
>>102095955
BRAAAAAAPPPPP
>>
>>102095996
This is why I still just use GIMP to insert text and speech bubbles.
>>
File: Untitled.jpg (8 KB, 300x168)
8 KB
8 KB JPG
>>102095792
>>102095671
>>102095955
>1.5
The evolution of technology.
>>
File: 2024-08-26_00316_.jpg (1.49 MB, 3840x2160)
1.49 MB
1.49 MB JPG
Don't judge me.
>>
File: file.png (2.71 MB, 1024x1024)
2.71 MB
2.71 MB PNG
https://civitai.com/images/26274940
Flux would be a lot more fun if there were more styles inside of it it...
>>
File: 1713882675134529.png (1.77 MB, 1152x896)
1.77 MB
1.77 MB PNG
>>102095944
are there any actual guro/gore loras/checkpoints for 1.5/XL/Pony? Not the body horror monsters, but actual guro stuff?

>>102095567
I've come back to AI after taking a long break for work and it feels like I'm taking crazy pills. For some reason all my images look out of focus and... washed out? At least on base dev and dev quant versions, but a lot more detailed and contrast in the finetunes.

Picrel is example from forge (but exactly the same result in Comfy). Everything latest version.

Has anyone experienced something similar?
>>
File: 2024-08-26_00320_.jpg (1.33 MB, 3840x2160)
1.33 MB
1.33 MB JPG
>>
>>102096169
>At least on base dev and dev quant versions, but a lot more detailed and contrast in the finetunes.
there's finetunes of flux?
>>
File: 1699845371873340.png (382 KB, 1857x917)
382 KB
382 KB PNG
>>102096169
settings
>>
File: file.png (2.78 MB, 1024x1024)
2.78 MB
2.78 MB PNG
>>102096126
>>
>>102096179
yeah, like FluxUnchained
>>
>>102096197
nick fe tier gen
>>
>>102096200
not sure if that is a real fine tune or just some lora merges, there even is a lora version of it (its insane 1gb)
>>
>SD1.5 is still best for realism
>>
>>102096238
bait
>>
>>102096085
Never will you ever see a model trained with a dataset as diverse, unsafe and comprehensive as SD 1.5.
>>
File: 1695310805780331.png (1.79 MB, 1024x1024)
1.79 MB
1.79 MB PNG
>>
>>102096248
>Never will you ever see a model trained with a dataset as diverse, unsafe and comprehensive as SD 1.5.
because it was never meant to be released in that uncucked state in the first place, Runaway got some balls to give it to us
>>
>>102096169
>are there any actual guro/gore loras/checkpoints for 1.5/XL/Pony?
yes, but you can not find em on civitai, if they end there they get scrapped fast, cause its against their TOS .. don't ask me for em, I did not collect em.. not my fetish, but surely they are somewhere still
>>
is sdg and ldg like smackdown vs raw?
>>
File: 00006-3073918038.jpg (442 KB, 1296x1728)
442 KB
442 KB JPG
v-prediction models don't work with new forge, I don't think it reads .yaml files

>>102096085
I think some small flux model will replace it at some point

>>102096117
It's great
>>
File: 1707976062636425.png (1.78 MB, 1024x1024)
1.78 MB
1.78 MB PNG
>>
File: file.png (2.45 MB, 1024x1024)
2.45 MB
2.45 MB PNG
>>102096126
works well on Comfy's workflow example
>>
File: bComfyUI_111242_.jpg (356 KB, 1024x1024)
356 KB
356 KB JPG
>>
Flux could be 8B no problem, I know, you know it, everyone knows it.
>>
File: 1722600893229142.png (1.72 MB, 1024x1024)
1.72 MB
1.72 MB PNG
>>
>>102096329
I like it at 12b, it's better and the quality is virtually the same at Q8_0 (can easily fit on a 16gb card)
>>
>>102096329
dare I say VRAMlet?
>>
File: 1721220504848083.png (1.93 MB, 1024x1024)
1.93 MB
1.93 MB PNG
>>
File: file.png (2.66 MB, 1024x1024)
2.66 MB
2.66 MB PNG
>>
>>102096329
If its like the LLMs it might not need the 12B
https://youtu.be/yBL7J0kgldU?t=2220
>>
File: ezgif-1-c6a0de7986.png (592 KB, 1024x1024)
592 KB
592 KB PNG
>>102095996
>>
File: 1724541500220457.png (1.72 MB, 1024x1024)
1.72 MB
1.72 MB PNG
>>
File: bComfyUI_111713_.jpg (734 KB, 1920x1088)
734 KB
734 KB JPG
>>
>>102096395
Glorious
>>
File: file.png (2.62 MB, 1024x1024)
2.62 MB
2.62 MB PNG
>>
File: 1699002319649447.png (1.86 MB, 1024x1024)
1.86 MB
1.86 MB PNG
the stupid AI can't spell necronomicon right
>>
>>102096598
that might be the lora messing up the text
>>
>>102096653
are you shilling here again? how about you stop promoting yourself here, the anons here are obviously not your target audience
>>
>>102096653
For once I'm plesently surprised that even the ledditors aren't buying into his shit anymore
>>
>>102096671
Dr. Furkan is too busy to care about 4chan.
>>
>>102096653
the fucking retard successfully managed to push Reddit over the edge
>>
>>102096126
>https://civitai.com/images/26274940
This guy has been putting cookie cutter Loras of artists. I don't see them being very good. I'll try some when there's an artist I am interested in.
>>
>>102096671
Sorry for shilling, I will delete my comment. I just wanted to defend out favourite guy.
>>
File: fs_0168.jpg (95 KB, 1024x1024)
95 KB
95 KB JPG
trying some other datasets I made from 1.5 days, no captions on the latest tests seems to do pretty well actually
>>
>>102096743
>This guy has been putting cookie cutter Loras of artists. I don't see them being very good.
it's undertrained, if you go for selfie it's working as intended, but once you want to go for specific poses it goes back to flux slop
>>
>>102096500
If you stare at him long enough he stops being Will Smith.
>>
File: ifx21.png (1.61 MB, 1024x1024)
1.61 MB
1.61 MB PNG
>>102096653
>https://www.reddit.com/r/StableDiffusion/comments/1f1tb4x/75_gb_flux_lora_training_has_arrived_for_even_8gb/
SUS
>>
File: 1701514142606555.png (1.89 MB, 1024x1024)
1.89 MB
1.89 MB PNG
nvm
>>
>>102096646
The lora is messing with everything >>102096795
>>
File: aigrifter3.png (9 KB, 489x213)
9 KB
9 KB PNG
AHAHAHAHHAA
even redditards turned on him
>>
>>102096793
kek this keeps up and even his most devote npc defenders will turn
>>
File: aigrifter2.png (793 KB, 1024x1024)
793 KB
793 KB PNG
>u mad?
>>
File: flux0330.jpg (1.96 MB, 2304x1792)
1.96 MB
1.96 MB JPG
>>
File: 1697772319472148.png (1.72 MB, 1024x1024)
1.72 MB
1.72 MB PNG
>>
https://civitai.com/articles/6792/flux-captioning-differences-training-diary
I don't know how he didn't come to the conclusion that Joy Caption is the best. Also I still think the best is a hybrid approach. <unifying keyword>, <long description>, <tags>
>>
File: flux0329.jpg (1.45 MB, 2304x1792)
1.45 MB
1.45 MB JPG
>>
>>102096793
>>102096864
Can I get a QRD on the drama?
>>
bros, im kinda tired of redditors posting their shitty finds as gospel
>>
File: bComfyUI_111734_.jpg (1.17 MB, 1088x1920)
1.17 MB
1.17 MB JPG
>>
File: 1698187408642628.png (1.77 MB, 1024x1024)
1.77 MB
1.77 MB PNG
>>
>>102096976
some grifter is shilling his howtos, that are essentially copied Kohya script manuals, for making FLUX loras like everywhere, including here (even tho he disguises himself as a hater) .. he has em locked behind a $5 patreon paywall and sells em as if he made em
>>
>>102096976
Grifter who gives Lora tutorials has been spamming new threads and links to his Patreon. He's particularly obnoxious because his tutorials are always "How to train yourself on X model in Y GB of RAM" and he learns how to do this by annoying contributors on Github and then consequently paywalls is. This is a guy who, at this point, should be able to do this on his own as his own contributor at this point.
>>
>>102096976
Just some guy that pops up in ever reddit, github, discord, twitter etc thread with helpful information about how to train SD, flux etc, which is often locked behind his patreon, he puts his own face in all the images which triggers many people because they have had enough of him appreaing everywhere.

He's gotten so powerful that his beautiful aura has now affected 4chan threads.
>>
File: ComfyUI_00567_.png (757 KB, 1024x1024)
757 KB
757 KB PNG
how do I make characters with the same clothes? i am using character design lora for flux
>>
File: 1697857668187240.png (1.93 MB, 1024x1024)
1.93 MB
1.93 MB PNG
>>
File: 1703882778210446.png (1.83 MB, 1024x1024)
1.83 MB
1.83 MB PNG
>>
File: aigrifter.png (1.15 MB, 1024x1024)
1.15 MB
1.15 MB PNG
>>102096976
>AI grifter who hangs out in every AI related github repos (kohya, a1111) and reddit subreddits for over a year now
>As soon as some other user post a finding / new tool / anything new image gen related, he posts a "tutorial" or "guide" copying and pasting the same stuff the user posted for "helping purposes"
>These "tutorials" eventually turn into youtube videos
>Start spamming said videos in every github/reddit discussion
>These "guides" and "videos" eventually turn into paywalled content (patreon)
>Still hangs out in every github discussion just copy and pasting info and passing it as his own
>When facing a setback, he asks devs and other users for help and then paywalls the solution and passes it as his own creation
>Flux Loras Traning are in high demand right now
>AI grifter teases a low vram training method only available for his patreons
>Get called on for users who already know him, other stupid redditors defend him (its really him using alt. accounts)
>After all this, keep spamming and teasing paywalled content
>even reddit turns on him
>you're here
>>
File: 1713762175661594.png (1.72 MB, 1024x1024)
1.72 MB
1.72 MB PNG
>>
File: bComfyUI_111735_.jpg (1.31 MB, 1088x1920)
1.31 MB
1.31 MB JPG
>>
>>102097182
What broke the camel's back was when he replied to every comment his Patreon
>>
>>102097150
kino indeed it is
>>
Speaking of stealing tutorials for content, how does one merge Loras into Flux in a measured way?
>>
File: 1698270782958332.png (1.78 MB, 1024x1024)
1.78 MB
1.78 MB PNG
>>
File: SECgrifter.png (167 KB, 1375x921)
167 KB
167 KB PNG
>>102097209
He has always been like that
>>
>>102097209
I hate seeing his face being spammed on github discussions, already so hard to find solutions
>>
File: file.png (9 KB, 568x82)
9 KB
9 KB PNG
can someone give me a QRD on what difference these options make and what they are best set to?
in case it matters i have a 3070 and 32gb ram running purely nf4 (no special encoders, just whatever is baked into the nf4 file itself)
>>
>>102097182
Excited to see how he evolves his tactics to beat this backlash, I'm sure he will figure something out
>>
File: aigrifter4.png (40 KB, 883x491)
40 KB
40 KB PNG
>>102097331
just search SECourses in every ai related github repo you like and you will find posts from him spamming his crap
>>
File: 00020-3523320166.png (1.14 MB, 896x1152)
1.14 MB
1.14 MB PNG
>>102097341
>>
File: 00025-2574366989.png (1.07 MB, 896x1152)
1.07 MB
1.07 MB PNG
>>102097399
oops didn't mean to reply to that comment

Sus shadow on her tshirt here
>>
File: 00056-1872297070.png (1.22 MB, 832x1216)
1.22 MB
1.22 MB PNG
>>
File: 2024-08-26_00368_.jpg (1.76 MB, 3072x3072)
1.76 MB
1.76 MB JPG
>>102097421
that is .. eeh... just a long Tengu nose
>>
>>102097574
Yeah I thought that too.
>>
File: bComfyUI_111745_.jpg (989 KB, 1088x1920)
989 KB
989 KB JPG
>>
File: 00028-1613427591.png (1.03 MB, 896x1152)
1.03 MB
1.03 MB PNG
>>
File: ComfyUI_Flux_10988.jpg (176 KB, 1024x576)
176 KB
176 KB JPG
>>
File: 2024-08-26_00372_.jpg (873 KB, 2160x3840)
873 KB
873 KB JPG
>>
File: fs_0186.jpg (86 KB, 1024x1024)
86 KB
86 KB JPG
>>
File: FLUX_00116_.png (1.79 MB, 896x1152)
1.79 MB
1.79 MB PNG
c'mon flux, that was an easy one
A brick wall with two windows. The window on the left has 2 stickers on it, and the window on the right has double the amount of stickers on it.
>>
>ye i am used to that. i have been rented 8x A6000 GPU machine over a week now. people can train with sub-par configs

Absolutely maximum seethe
>>
File: 00032-3797100804.png (1.02 MB, 896x1152)
1.02 MB
1.02 MB PNG
>>
>>102097840
Are people now testing how good flux handles puzzles?
>>
>>102097840
Plot twits, all the reflections in the window on the right are stickers.
>>
>>102097867
Are people now asking for confirmation on things they've just witnessed?
>>
>>102097876
Are they?
>>
>>102097876
Is this a reply to an anon on 4chan?
>>
>>102097876
Are implies more than one.
>>
File: ComfyUI_Flux_0265.jpg (486 KB, 2048x1152)
486 KB
486 KB JPG
>>
>>102097696
https://www.youtube.com/watch?v=p1XYZI0i4fs
>>
File: 2024-08-26_00382_.jpg (1.63 MB, 2160x3840)
1.63 MB
1.63 MB JPG
>>
>>102097020
Don't forget his obsession with abstracting his hard work into "discovering new hyperparameters"
>>
>>102097294
Why can't you just use the built-in lora loader?
>>
how do i undervolt my nvidia gpu on linux?
>>
>>102097988
Haha nice catch.
>>
>>102098130
I want to make a frankenmodel
>>
>>102097331
Ooh it's that weirdo, lol.

>>102097182
In an alternate reality he would have made a public wiki and maintained it, and then asked for money as a tip jar.
>>
File: FLUX_00122_.png (1.18 MB, 896x1152)
1.18 MB
1.18 MB PNG
>>
>>102098184
The more ethical model is providing the knowledge up front and selling the labor on the backend. It's not hard to monetize people's laziness. But the problem with most grifters is they essentially try to monetize via knowledge monopoly.
>>
File: bComfyUI_111747_.jpg (612 KB, 1920x1088)
612 KB
612 KB JPG
>>
File: FLUX_00123_.png (1018 KB, 896x1152)
1018 KB
1018 KB PNG
that was a mistake
>>
I'm using easy diffusion yet when I upload a file to img2img and said "giant arms" since I wanted to see if the prompt would increase the arm size of the person in the image that I uploaded nothing happened and I added 125 steps. Whats going on??? How do you actually make edits to features of people on easydiffusion? Is easydiffusion the best one for what I'm looking for?
>>
>>102097359
anyone?
i mostly just wanna know the difference between queue and async
>>
File: 2024-08-26_00384_.jpg (1.68 MB, 2160x3840)
1.68 MB
1.68 MB JPG
>>102098192
cool
>>102098251
cool but extreme nightmare fuel
>>
>>102098251
Looks like showonhead cosplaying as marge
>>
File: FLUX_00125_.png (882 KB, 896x1152)
882 KB
882 KB PNG
>>102098366
she got that chipmunk face
>>
>>102098139
thanks, and nice meme easter egg
>>
File: fs_0224.jpg (119 KB, 1024x1024)
119 KB
119 KB JPG
>>
>>102098192
Would
>>
File: 2024-08-26_00400_.jpg (1.63 MB, 3840x2160)
1.63 MB
1.63 MB JPG
>>
File: ComfyUI_04525_.png (1.4 MB, 896x1088)
1.4 MB
1.4 MB PNG
>reinstall windows
>gen times are a lil quicker
>can actually load fp16 flux-dev now without crashing

it's looking up
>>
>>102098499
ya a messy install with to much shit running will give windows big headaches loading fp16, as the load alone gobbles about 70-75GB of SystemRAM+SWAP
>>
File: 1712633962299190.png (2.95 MB, 2496x1481)
2.95 MB
2.95 MB PNG
Is YouTube radicalizing Taylor Swift fans?
>>
File: 2024-08-26_00404_.jpg (1.63 MB, 3840x2160)
1.63 MB
1.63 MB JPG
>>
File: FLUX_00127_.png (836 KB, 896x1152)
836 KB
836 KB PNG
I can see chloe in a simpsons porn parody
>>
>>102098192
why do you not share the lora, bro
>>
>>102098620
legal repercussions. If you promise to say you never got it from me, you can have it
also, do you even know who that is?
>>
File: fs_0244.jpg (104 KB, 1024x1024)
104 KB
104 KB JPG
>>
>>102098571
It's weird I like the cartoon lookalikes of her or using her as a basis but not her real look.
>>
>>102098644
>If you promise to say you never got it from me
I promise.
>do you even know who that is?
No idea.
>>
>>102098602
whoa someone else know knows about chloe toy thats surprising
she was hot as fuck for a second then got too skinny
>>
so what was the conclusion on Long_ClipL?
>>
File: FD_00004_.png (1.54 MB, 1024x1024)
1.54 MB
1.54 MB PNG
>>
File: 2024-08-27_00002_.jpg (1.74 MB, 3840x2160)
1.74 MB
1.74 MB JPG
>>
>>102095567
>top right picture in the collage
is beautiful. how was it made?
>>
>>102098683
I also cloned her voice
https://vocaroo.com/1li9bpTXryS4

>>102098664
https://files.catbox.moe/x5sdo8.safetensors
>>
>>102098809
is this a local thing
if so how
can you tie this in with a text bot
holy shit
>>
>>102098809
>>102098822
oh and im not british lol
>>
>>102098805
ow wait.. I answred the wrong question.. top right? Looks like pre-raphaelit ..

>A painting in pre rapahelit style <bla bla>
>>
File: FD_00008_.png (1.71 MB, 1024x1024)
1.71 MB
1.71 MB PNG
>>
File: ComfyUI_33047_.png (1.3 MB, 1280x720)
1.3 MB
1.3 MB PNG
>>
>>102098805

here is the original post >>102095089
the anon who made this is gone tho .. I guess its a prompted art style of something like Romantics
>>
File: FD_00007_.png (1.33 MB, 1024x1024)
1.33 MB
1.33 MB PNG
>>
>>102098843
>>102098874
thanks. i'm going to ask some people about it
>>
>>102098809
Nice, thanks. Didn't know about this model.
>>
>>102098864
cute
Does flux understand chibi ?
>>
>>102098929
yes
>>
>>102098981
nice
>>
File: _tod_.png (3.54 MB, 1328x1328)
3.54 MB
3.54 MB PNG
someone posted cool orc-killing pics in a previous thread so i upscaled one. i cant fix this so gonna leave as is. would like to see more difficult combat poses and fewer 1girls/cats
>>
File: 2024-08-26_00203_.jpg (1.55 MB, 3072x3072)
1.55 MB
1.55 MB JPG
>>102099035
that was me, here have it at 3000x3000
>>
File: 2024-08-27_00018_.jpg (1.87 MB, 3840x2160)
1.87 MB
1.87 MB JPG
>>
>>102099035
the one with an assault rifle and orcs in the background was pretty good
>>
>>102098251
Would impregnate just to see what happens
>>
File: _tod___.png (2.23 MB, 1018x1018)
2.23 MB
2.23 MB PNG
>>102099172
this one?
>>
I don't get it. What's so special about Prodigy?
>>
seems SDXL is over trained on World of Warcraft. need to remember to put "World of Warcraft" in negative prompts because it seeps into everything
>>
>>102099250
ya thats the one
>>
File: tod.png (3.18 MB, 1536x1536)
3.18 MB
3.18 MB PNG
>>
File: 2024-08-27_00025_.jpg (1.3 MB, 3840x2160)
1.3 MB
1.3 MB JPG
>>
File: ComfyUI_33055_.png (833 KB, 1280x720)
833 KB
833 KB PNG
>>
>>102099065
>3000x3000

what GPU are you using brother
>>
>>102099262
the voodoo!
>>
>>102099262
https://www.youtube.com/watch?v=WY87o9IZXWg
>>
File: 2024-08-27_00029_.jpg (1.31 MB, 3840x2160)
1.31 MB
1.31 MB JPG
>>102099317
4090 .. I am using SDUltimateUpscale and make these in chunks of 1024x1024 .. so what you are seeing is one original 1024x1024 gen and 9 tiles .. so 10 flux gens made into one

pic related is 12 chunks
>>
>>102099262
is this tech or a request for a firestarter joke.

Last week betterbird got me in the stupid joke actual product game.
>>
>>102099317
if you use something like chaiNNer you get get 4096 on a 12GB card. Then divide the image into 4 and then make sure not to inpaint on the seams.

I know you won't because it is annoying. Just pointing out that you don't need to be limited by your tech.

Freakin capcha. Multiple rejections on correct solves.
>>
>>102099350
kino. that orc looks too human. maybe ill play with it later
what settings/model do you use in SDUltimateUpscale? it might crash my 3080 if i tried that
>>
What local diffusion allows me to make chicks with bikini tops go topless and actually looks good?
>>
>>102099433
Sorry but I can't help with that.
>>
LoRA schizo who was complaining about Flux LoRAs suffering from concept bleed from last night here.
I was very unhappy with the results I got from basic tagging and decided to try highly verbose boomer prompting for the dataset this time with captions from GPT4, I then went through the captions and changed the word "She" to the character's name rougly 50% of the time, the reasoning being that the more that character's name appears in the caption, the stronger it may associate that name with that character.

I also bumped the network rank up to 64 and batch size to 4 at 512x512, let's take a look at the results.

1: Aqua standing in a livingroom setting, she is wearing her signature outfit and holding a bottle of wine.

Not a bad result
>>
>>102099393
you don't need to inpaint at all with SDUltimateUpscale and flux, you don't even have to seam fix .. it just works.. FLUX is the best model for upscaling I ever used

>>102099420
pic related are the settings .. I use flux.dev fp16 for both original picture and upscale

the orc looks like that cause its
>This photo realistic cinematic shot of a fantasy scene of...
is at the beginning of the prompt, flux tries to make it hyper realistic at its best .. lemme change style for the next one
>>
>>102099433
its easy af but i wont make coomer shit for you
>>
File: ComfyUI_02428_.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
>>102099456
I then tested the other characters

2: Megumin in her signature outfit against a gradient background.

Here, once again, we saw pretty severe concept bleed between characters
>>
>>102099458
>you don't need to inpaint at all with SDUltimateUpscale and flux
you need to find alternatives if you are OOM
>>
>>102099433
There's no a single actual good nsfw model, all look like shit because all base models are censored.
You'll get maybe 1 good gen each 100-200 gens.
If you are talking about taking an existing pic and use that as a base then grab any sdxl nsfw model from civitai and use img2img in forge or auto1111, almost all of them are the same trash tho.
>>
File: ComfyUI_02430_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
>>102099476
Finally I tried Darkness,

3:
Darkness wearing plate armor, she has long blonde hair.

I tried Darkness without mentioning her hair color, but as others pointed out, the word Darkness was pretty strong and it delivered emo versions of the characters.

So what's next? The images are better with boomer prompting, that's for sure, but the concept bleed is still present. I am now training at a whopping rank 128 to see if that helps with the bleed, I remain skeptical
>>
File: 2024-08-23_00267_fp16.png (1.77 MB, 1280x1280)
1.77 MB
1.77 MB PNG
>>102099489
>>
File: 2024-08-27_00036_.jpg (1.76 MB, 3840x2160)
1.76 MB
1.76 MB JPG
>>102099420
you can try to set the slice size to 768x768 and just do 2x upscaling .. also the upscale model is 4x_NMKD_siax

pic related is
>This digital illustration of a fantasy scene of a knight in shiny armor and an orc fighting. The knight holds and spear and stabs an Orc into the heart. Blood splatters and gore. Flesh and blood explodes out of the orcs back. The orc has an expression of surprise. Several arrows are stuck in the Orcs back.
>It is a dynamic battle scene and in the background is an army of knights and orcs fighting.
I gonna be honest I prefer the realistic one..
>>
>>102099533
how inclusive you asshat. I want everyone to be able to do what I do. That is how we get better.

I say outright I have the hardware I need and your a punk for assuming I don't. Nice 1280x1280 chump.
>>
File: ComfyUI_24623226007_.png (1.21 MB, 1024x1024)
1.21 MB
1.21 MB PNG
>>
So has this thread become haunted by an actual schizo or something? It feels like every second post just incoherent angst.
>>
>>102099433
Topless is easy. They all do that.
AI just can't into vaginas because vaginas aren't very aesthetic to begin with and there's so much variation that it all turns to weird folds of discolored flesh.
>>
>>102099571
Holy yaoi hands
>>
>>
File: ComfyUI_24623225846_.png (1.76 MB, 1024x1024)
1.76 MB
1.76 MB PNG
>>102099602
lmao I'm just stoked my lora didn't totally butcher them. finding the balance between 'is it still the right style' and 'do the fingers look like they went through a blender' has been a challenge
>>
File: ComfyUI_33060_.png (935 KB, 1280x720)
935 KB
935 KB PNG
>>
File: 2024-08-27_00044_.jpg (1.16 MB, 3840x2160)
1.16 MB
1.16 MB JPG
>>
File: ComfyUI_00873_.png (1.28 MB, 1024x1024)
1.28 MB
1.28 MB PNG
>still working on my Dataset and keep finding new pictures to add to it
man its a lot of work, I'm already working for 3 days on it.
already got 108 pics so far, I make sure that every picture shows the Subject from every possible angle.
>>
File: AIComic.png (643 KB, 1024x1024)
643 KB
643 KB PNG
We're getting so close to getting something usable for production.
>This image is a digital cartoon drawing depicting a domestic scene in a kitchen. The background features light-colored wooden cabinets with simple handles and a tiled backsplash in shades of beige and white. The kitchen floor is covered in a checkered pattern of light and dark gray tiles. In the foreground, there are two characters: a mother and her daughter. The mom, on the left, has short, dark purple hair and is wearing a green t-shirt and dark blue pajama pants. She is holding a white towel and a plate and looks towards the child with a concerned expression. The daughter, standing to the right of her, has brown hair tied back in a ponytail and is wearing a green t-shirt and purple pants. The child is looking up at a refrigerator adorned with colorful drawings of various animals, including a cat, a dog, and a rabbit, all drawn in different styles. The refrigerator also has a few magnets, including one with a picture of a sunflower. The image is accompanied by a caption in the bottom saying, "Did you replace my drawings with AI generated ones?". The overall style is light-hearted and humorous, with a playful tone.
>>
File: t-o-d.png (2.51 MB, 1728x976)
2.51 MB
2.51 MB PNG
>>102099695
different styles mate
>>
>>102099433
Literally any SD1.5 realistic checkpoint
Or any of the XL porn checkpoints (non pony)
Some skill required though
>>102099490
>>102099593
You have no clue what you are talking about
>>
File: 2024-08-27_00049_.jpg (1.14 MB, 3840x2160)
1.14 MB
1.14 MB JPG
>>102099801
ya ya
>>
File: ComfyUI_33063_.png (1006 KB, 1280x720)
1006 KB
1006 KB PNG
>>
>>102099809
>(non pony)
if this wasn't /g/ I would show you how wrong you are. Agreed this is not the place to start.
>>
>>102099809
>You have no clue what you are talking about

correct

>>102099905

also correct
>>
File: 2024-08-27_00054_.jpg (1.59 MB, 3840x2160)
1.59 MB
1.59 MB JPG
>>
>>102099809
>You have no clue what you are talking about
I do, fucking retard.
All the nsfw models feel more like a frankenstein lora instead of a finetune, you are delusional if you think otherwise, unless you are talking about anime shit.
Also, pony is dogshit, it does the same gens over and over again because anime fat fucks only draw the same shit over and over again so it's always the same angle for each pose. Is dogshit.
>>
File: Graphfail.png (619 KB, 1280x768)
619 KB
619 KB PNG
Failure to understand graph trends ruins the joke.
>This image is a digital cartoon drawing in a humorous and satirical style. The scene takes place in an office setting, with a man talking to two girls sitting around a large, round table. The man standing in the foreground, on the left side of the image, is dressed in a blue suit with a white shirt and a green tie. He has glasses, a bald head, and is speaking animatedly with his hand gesturing. His mouth is open, indicating he is mid-sentence. Behind him on the wall are two graphs labeled "Stability AI" and "Black Forest". The graph labeled "Stability AI" has a downward red line, indicating a negative trend, while the graph labeled "Black Forest" has a green upward line, indicating a positive trend. Both graphs have a simple grid pattern with no additional labels or data points. The background features a gradient from light pink at the top to a darker pink at the bottom, suggesting a sunrise or sunset. The walls are adorned with framed pictures, but these are not detailed in the image. The girls in the background are dressed in business attire and appear to be listening intently to the speaker.
>>
File: ComfyUI_33065_.png (929 KB, 640x1280)
929 KB
929 KB PNG
>>
>>102099433
>looks good?

95% of the time you get extra or not enough fingers, derpy eyes, fused teeth, and other body horrors. By the time you get a good picture, you're already too crossed out to goon.
>>
Anybody else using an LLM to prompt? I've always been big into wildcards but I feel like LLMs are a better fit for Flux's preferred prompting style. Actually I've been using wildcards to prompt the LLM, but even so I find that the prompts overuse certain concepts. For example, many images have murals, intricate engravings, paintings, etc on the walls; characters often have braids or tattoos; certain things like maps and fountains appear a lot. Part of it I'm sure is that I can only fit Gemma 2b alongside Flux, but from experience I know that a lot of bigger models tend to recycle prompt elements, too.
>>
now we're getting somewhere
it's weird, it won't refuse like a typical LLM, it'll just revert to its default of describing everything how it wants to
>>
kek
>>
>>102099905
realistic poni never looks right. It's like the model knows it's not supposed to do hairless monkeys
>>
File: long road to 6000.jpg (1 KB, 100x26)
1 KB
1 KB JPG
I'M COOKING
>>
File: ComfyUI_33069_.png (1.05 MB, 848x1280)
1.05 MB
1.05 MB PNG
>>
one more penny
>>
File: 2024-08-27_00060_.jpg (1.65 MB, 3840x2160)
1.65 MB
1.65 MB JPG
>>
File: 1715870137773478.png (417 KB, 896x512)
417 KB
417 KB PNG
>>
File: Big Black Cock.png (645 KB, 1024x1024)
645 KB
645 KB PNG
>>102100059
I just pick an existing image from google and put it through Joy Caption, then I modify it for my needs. I haven't found another way. This one went perfectly.
>This image is a digital cartoon drawing featuring anthropomorphic chickens in a humorous office setting. The scene is set in a yellow-walled room with a large, round table in the center. Four chickens, all wearing red combs and wattles, are seated around the table. They have human-like expressions and are dressed in business attire, including suits and ties. The chickens on the left side of the table are looking towards the right, where a big black cock, is standing in an open doorway. The cock is wearing a blue tie and a red vest over a white shirt. The background includes a window on the left side with green curtains and a view of a sunny, mountainous landscape. Above the window, a clock shows the time as 4:15. The floor is light brown, and the room has a simple, minimalist design. The style of the cartoon is clean and colorful, with bold outlines and vibrant colors. The caption at the bottom reads, "Well, look who decided to join us."
>>
File: ComfyUI_33072_.png (2.7 MB, 1920x1088)
2.7 MB
2.7 MB PNG
>>
File: ComfyUI_33073_.png (1.1 MB, 1280x720)
1.1 MB
1.1 MB PNG
>>
>>102100059
Currently using Qwen2-1.5b for the same thing, just trying to hone in on a good system message that doesn't make the LLM hallucinate or return garbage.

>You are an assistant who improves and extends descriptions. Provide detailed descriptions for all parts of the input, specifying where it has been vague.
>>
File: ComfyUI_33075_.png (1.27 MB, 1280x720)
1.27 MB
1.27 MB PNG
>>
File: Failtext.png (506 KB, 1280x768)
506 KB
506 KB PNG
Almost. Hmmm, is it that text coherency suffers when you move away from 1024x1024?
>This image is a digitally drawn cartoon in a typical comic strip format. The scene is set in an art gallery, with a girl on the left side wearing a teal blazer and light brown pants, pointing to a framed painting on the wall. The painting, which is green with a yellow border, depicts a bowl of fruit including apples, grapes, and bananas, with a price tag of "$500" attached to the lower right corner. Another identical painting, identical in style and content, hangs on the wall to the right, priced at "$1500". In the foreground, two people are standing, observing the paintings. One person, a bald man with a blue plaid shirt and brown pants, is looking at the paintings with a confused expression. The other person, a woman with dark hair and a sleeveless dress, is standing behind the bald man, watching the scene with a neutral expression. The background features a beige wall with a few other paintings, and the gallery is lit with soft, even lighting. A humorous caption at the bottom of the image reads: "It is more expensive because it took the artist several weeks to paint it, while the other one was generated in 10 seconds on my computer."
>>
File: bComfyUI_111989_.jpg (846 KB, 1920x1088)
846 KB
846 KB JPG
>>102098805
Alphonse Mucha bro, surprised more people haven't tried him already with how overused he was in SD.

>>102100240
nice, got more? i've been wanting to see how well flux can do with a cosmic horror or spaceship blasting a planet.
>>
>>102100517
>Alphonse Mucha
looks nothing like his style tho
>>
File: still cookin.jpg (2 KB, 106x29)
2 KB
2 KB JPG
I'm still cooking bros.

also I'm surprised how good the results are already after 1000 steps.
>>
>>102100580
It's a style lora, right? If the results are already good, you probably won't need more than 2000-2500 steps.
>>
File: 1713828400833321.png (989 KB, 768x1024)
989 KB
989 KB PNG
They put a fucking black spade on my big titty goth gf WHY?!
>>
File: ComfyUI_00875_.png (1.57 MB, 1024x1024)
1.57 MB
1.57 MB PNG
also why the fuck is the thread so slow suddenly?

>>102100612
>It's a style lora, right?
its a person. someone a lot coomers will be thankful for.
>you probably won't need more than 2000-2500 steps.
I need it to do more steps because there is a lot of training material to go through and the more the better.
>>
File: ComfyUI_00202_.png (396 KB, 512x512)
396 KB
396 KB PNG
>>102100627
>mfw
>>
mfw I call out CeJewkan on leddit and he bans me
>>
>>102100649
the turk guy with the patreon?
>>
>>102100627
A black heart isn't a black spade.
>>
>>102100627
rent free
>>
File: ComfyUI_33079_.png (733 KB, 1280x720)
733 KB
733 KB PNG
>>
File: ComfyUI_04544_.png (1.68 MB, 832x1216)
1.68 MB
1.68 MB PNG
only good 1girl of the night
>>
File: bComfyUI_112019_.jpg (820 KB, 1920x1088)
820 KB
820 KB JPG
>>102100532
yeah no shit it's flux after all but that art style will pop up quite often if you use his name in the prompt.
>>
>>102100690
lovely eyes
>>
File: ComfyUI_00657_.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>102100697
I can still see the Buttchin
>>
>>102100698
that pic is Kino af dude.
to get this style you just use: ...in the style of Alphonse Mucha ?
>>
File: 2024-08-27_00058_.jpg (1.8 MB, 3840x2160)
1.8 MB
1.8 MB JPG
>>102100517
>nice, got more? i've been wanting to see how well flux can do with a cosmic horror or spaceship blasting a planet.
quite well, although you have to prompt artists for cosmic horror or Cthulhu looks like plastic
>>
>>102100649
He is very ESL, so if you want to be able to continuously throw shade his way, you need to simply allude to his bad practices, other people pick up on it and he remains oblivious.
>>
File: 2024-08-27_00012_.jpg (1.76 MB, 3840x2160)
1.76 MB
1.76 MB JPG
>>
File: 2024-08-27_00056_.jpg (1.02 MB, 3840x2160)
1.02 MB
1.02 MB JPG
>>
File: sunflo.png (703 KB, 1280x768)
703 KB
703 KB PNG
Do you call that a sunflower hedgehog? :(
>This image is a colorful, digital cartoon drawn in a humorous, exaggerated style. The scene takes place in a forest with tall, thin, brown tree trunks and a clear blue sky above. In the foreground, there are three main characters: two brown bears and a sunflower hedgehog. The bear on the left is facing the viewer, standing on all fours with its mouth open, seemingly mid-speech. The bear on the right is also standing on all fours, facing the sunflower hedgehog, with a similar expression. The sunflower hedgehog, with spiky quills and a cute face, is positioned between the two bears, facing the bear on the right. In the background, a little girl stands to the right, wearing a yellow shirt and orange shorts, holding a camera. She is looking at the sunflower hedgehog, suggesting they are taking a photo. Above the sunflower hedgehog, a speech bubble reads: "You know, they say sunflower seeds are a good replacement for meat, I wonder is she got some." The text is written in a playful, cartoonish font. The overall tone of the image is humorous and light-hearted, emphasizing the absurdity of the situation.
>>
>>102100738
The funniest bit is he is a mod of the sub. They even have a self promotion rule that he is allowed to ignore because he is a mod.
What a piece of shit man.
Any time I can undermine this guys patreon and provide the information he is trying to sell for free, I will.
>>
>>102100768
>piece of shit man
That's DOCTOR piece of shit man to you. I agree though. I can't believe people have passively let him get this far. If you have something to contribute to the discussion, he will steal it and sell it.
>>
File: fs_0264.jpg (99 KB, 1024x1024)
99 KB
99 KB JPG
>>
>>102100709
Why is it there so often, it's strange
>>
>>102100709
1 in 4 americans have butt chin. Blame the americans.
>>
File: bComfyUI_112012_.jpg (890 KB, 1920x1088)
890 KB
890 KB JPG
>>102100730
that or by Alphonse Mucha, i'm sure there's better way to phrase the prompt for that exact artstyle but his name is a pretty good wildcard for me.

>>102100737
>>102100760
damn i'll have to try for it tomorrow, good gens tho man
>>
>>102100890
You have to specify more details about her looks to change the general look to lean more towards different training images.

Woman etc seems to lean a lot towards those model look with the buttchins
>>
File: ComfyUI_33086_.png (823 KB, 1280x720)
823 KB
823 KB PNG
>>
>>102100892
thanks bro, very cool
>>
File: 0.jpg (602 KB, 1024x1280)
602 KB
602 KB JPG
>>
>>102100900
I see, I'll make a list and ask a LLM to make me sentences then when I do that
>>
File: FLUX_01300_.png (1.17 MB, 832x1216)
1.17 MB
1.17 MB PNG
>>102100928
Is he right?

Shoulw we stop captioning pics?

https://civitai.com/articles/6982
>>
JoyCaption pro tip: if you're captioning NSFW real images, swap out the base llama 3.1 model for stheno 3.2, and use the official instruction format for that model. I've tested the base model, Hermes, stheno 3.2 and 3.4. Stheno 3.2 is by far the best, even 3.4 is somehow much worse.
>>
>>102101053
no it's absolutely retarded and borderline mystical and absolutely the garbage Reddit gobbles up
it operates on the same principles as any other diffusion model, the only difference is the T5 is significantly smarter than CLIP at understanding the nuances of language and thus creating the perfect conditioning tool for training images with captions

https://civitai.com/articles/6792/flux-captioning-differences-training-diary is the only guide that actually has some proof and shows that captions still win
>>
>>102101065
does joycaption only works with Llama or you can use other models like Mistral?
>>
>>102101127
Only llama 3 8b, or anything derived from that. It works by projecting the image's CLIP embedding into the token embedding space of the model, so only that model or things finetuned from it work.
>>
>>102101065
>>102101163
retard here that never used a local LLM

what is the best way to install aand run Joy Caption?
>>
>>102101065
I was going to try running InternVL2-40B, how is it compared to that?
>>
>>102101179
clone the repo and run it
>>
File: 00005-3045120450.png (1015 KB, 1216x832)
1015 KB
1015 KB PNG
>>
File: fs_0284.jpg (83 KB, 912x1280)
83 KB
83 KB JPG
>>
File: 00006-3045120451.png (967 KB, 1216x832)
967 KB
967 KB PNG
>>
>>102101209
InternVL2-40b is much better. More consistent, less likely to hallucinate, more direct and less GPT-slopped language. But the descriptions are a lot shorter than joycaption's (maybe you can change this with prompting). It sometimes misses or skips details that joycaption gets.

You will need an 80GB GPU or at least three 3090s to run the 40b locally though, assuming you're using the 8 bit quantization option.
>>
File: 00010-3045120455.png (1004 KB, 1216x832)
1004 KB
1004 KB PNG
The blur :(
>>
>>102101330
OK thanks anon.
>>
>>102101346
woohoo!
>>
File: bComfyUI_112106_.jpg (738 KB, 1920x1088)
738 KB
738 KB JPG
>>
File: 00013-3045120458.png (1.03 MB, 1216x832)
1.03 MB
1.03 MB PNG
>>
File: ComfyUI_33089_.png (832 KB, 720x1280)
832 KB
832 KB PNG
>>
File: ComfyUI_33092_.png (775 KB, 720x1280)
775 KB
775 KB PNG
>>102100702
>>
>>102099035
Why is flux bad at violence?


I can understand porn was poisoned, but violence? We have plenty of movies of people killing each other.

Is there a workaround?
>>
Another bread

>>102101429
>>102101429
>>102101429
>>
>>102101427
yes indeed
>>
>>102101434
I'll be honest, the people who do gore with AI are on the level of rekt on /b/ and it disgusts normal people. Porn is skipped for obvious reasons, gore is skipped because it's disgusting on the same level as scat for normal people.
>>
File: ComfyUI_02435_.png (976 KB, 1024x1024)
976 KB
976 KB PNG
Okay, on my latest test at rank 128 with gpt4o caption prompts I am STILL getting concept bleed through characters.
I just want to confirm with everyone that the concept bleeding is real. You can more or less get the character if you describe them in full, but that kind of defeats the purpose of tagging them in the first place.
My next step is to try the various repos that allow for the training of the clip_l model during LoRA training and report back.
>>
>>102098092
Neat
>>
>>102098222
>>102098184
>>102097209
the way that one speaks further demonstrates that turks are chineses in origin
>guuuh duuuuuh



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.