[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (1008 KB, 3264x3264)
1008 KB
1008 KB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102565263

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/kohya-ss/sd-scripts/tree/sd3

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
PONY AUTISM ANONS AWAKEN
>>
>>102594258
what do you want?
>>
File: IMG_0286.jpg (84 KB, 1024x1024)
84 KB
84 KB JPG
Is this a good art style or a bad one?
I am a 3D plebeian so they all look the same to me.
>>
File: pogchamp_5_7.0_002.png (1.03 MB, 896x1152)
1.03 MB
1.03 MB PNG
>>102594270
.....poggerss...

https://civitai.com/models/799765/pogchamp-face-concept-lora-pony?modelVersionId=894297
>>
>>102594404
Bad
>>
I can't find the branch or whatever that was to train flux loras in kohya, does anyone have the install?
>>
>>102594414
Fuck
>>
>>102594404
It's fine, but unpopular. It's associated with browser games and mobile games. In general I'd say it's an art style used by cheap video games.
>>
>>102594404

Looks like those abandoneware Western Erotic Games.
>>
>>102594549
>v0.0.1a
>>
>>102594740
>Enshittification
using that from now on
>>
>>102594832
It's a fancy and funny way to just say "Bait and switch" kek, they make a good product at the begining, and once they have enough userbase and loyalty they can switch to be a lucrative service a.k.a something so shit you wouldn't even have considered it if it was like that at its launch
>>
File: pogchamp_5_7.0_016.png (1.12 MB, 896x1152)
1.12 MB
1.12 MB PNG
>>102594881
I've been raging about the enshittification of the internet for years, probably since i was a teenager funnily enough
I've hated google since 2012 maybe and i was 15 then
absolutely nothing has changed except it seems everyone wants to BE like google, which is the absolute beyond the bottom of the barrel worst case scenario we could've gotten and i wish i could go back in time to warn my younger self and to pick better hobbies before it's too late.

>oh well at least imagegen AI rocks and will stay that way
>>
question
i'm using the fast stable diffusion A111 colab notebook
i can download ponyxl from civit just fine, but not autismxl which returns a 401 error instead
why? and how do i fix it?
>>
File: file.png (5 KB, 644x112)
5 KB
5 KB PNG
>>102595075
because of this
>>
>>102595120
but I am logged in.
do i need to pass an API key or something to colab?
>>
>>102595131
I never used collab, but if collab is sending the requests from their own server, then they're not logged in
https://www.reddit.com/r/StableDiffusion/comments/19f8wdm/need_help_with_civitai_model_download/
>>
>>102595146
okay, i figured it out
you create an API token and add &token=INSERTTOKEN at the end
i did try that, but i guess i copied the api key wrong the first time?
>>
>>102595008
Finally worked with this here, thanks for the effort
>>
For the person who asked: redhead in the previous thread was made with natvis v20 + lora.
>>
File: ComfyUI_Horror_00007_.png (617 KB, 640x768)
617 KB
617 KB PNG
Hnng. I'm celebrating upcoming Halloween by generating zombies. It's so icky in a good way. Adding a butterfly makes them cute though. Still wouldn't hug
>>
>>102595499
damn you've inspired me to try and focus on halloween themed gens for next month.. now that there's an idea..
>>
Are xformers worth bothering with?
>>
File: file.png (126 KB, 1450x819)
126 KB
126 KB PNG
>>102595292
I give up, it never works
>>
>>102595530
The python stack dump. Somehow, this is better than the segfault.
>>
File deleted.
Is forbidding damaged clothing a form of flux's censorship? For some reason wedding dress on my zombie is not torn or damaged.

>>102595511
Yeah, it's so unusual comparing to normal cute waifu.
Also "Rains from the mouth" with heun sampler is something to behold.
>>
>>102595766
flux is just weird about details. I've tried every variation I can think of to conceal, cover, clothe, and dress parts of subjects, and it just doesn't get it, either. have you tried a negative input or negative weighting tokens?
>>
>>102595705
yeah the gui launched at least, but training fails as soon as it tries to actually do something
>>
Well, at least I learned that word replacement is a stupid meme. Only leaving the natural language one running. Next up is redoing the dataset to have a higher ratio of full body shots to close ups. Maybe that will help it understand placement better.
https://files.catbox.moe/cp1ir5.jpeg
>>
File: kohya SS GUI flux path.png (75 KB, 1690x325)
75 KB
75 KB PNG
>>102595873

>No Data

Hey, did you put something in the model path directory?
>>
>>102596060
Yeah I filled all 3 there with the same ones that work in ComfyUI
>>
OP linked wrong previous thread. Previous thread is >>102580786
>>
>>102591483
xl models don't handle img2img the same way FLUX does, I can't explain it but I see it when I use the respective models. Beyond that I can't say because I'm not sure what PDXL is
>>
File: Koh lora.png (14 KB, 828x178)
14 KB
14 KB PNG
>>102596073

What the hell is split model? Are we on the same page?
>>
>>102596132
I'm going to launch it again, sec
>>
>>102594422
https://github.com/bmaltais/kohya_ss/tree/sd3-flux.1
>>
>>102596132
>>102596142
Here's one example, but I tried many others including the AdamW optimizer with default settings etc.
>>
>>102596132
>>102596209
And here's the error of that config, but it's always the same error
>>
>>102596217
Try updating Torch.
>>
File: file.png (11 KB, 650x199)
11 KB
11 KB PNG
>>102596359
How and to what?
>>
>>102595454
ty
>>
>>102596217
>Copy error
>Paste to Chatgpt
>Get answer
>>
>>102595766
I had no trouble getting some torn and damaged clothing with flux dev nf4
>>
File: file.png (54 KB, 958x768)
54 KB
54 KB PNG
>>102596465
I doubt modifying the source code of the repo is the solution
>>
>>
>>102596514
try it, I do it to fix kohya errors all the time
>>
>>
>>102596515
>>102596515
I really don't understand the point of making blurry, pixelated images this is such an obscure fetish
>>
>>102596553
Neck yourself
>>
>>102596553
It's fine that you don't understand
>>
>>
what's that version of SD1.5 that was trained to gen at 1024*1024 natively called again?
>>
>>102596499
This reminds me of some erotic/fetish artist who would draw pictures of women walking around normally in public in Victorian dress style but the tits were out and the front was open. Can't remember the name but I'd love a Lora with that concept.
>>
File: file.png (28 KB, 687x226)
28 KB
28 KB PNG
>>102596536
I'm tired, boss...
>>
>>102596660
bro there's like a million victorian style pony loras probably, couple that with the right booru tag and you can gen a thousand for the rest of the weekend
hell maybe ill try that prompt right now while im learning style merging
>>
>>102596664
How do you get Pony gen dresses where the skirt has front open.
>>
File: additional flux.jpg (37 KB, 1709x136)
37 KB
37 KB JPG
>>102596217

--fp8_base --highvram --timestep_sampling sigma --model_prediction_type raw --discrete_flow_shift 1.0 --loss_type l2 --cache_text_encoder_outputs


Try this additional parameters. Failing that, grab a flux training config on this thread.

https://github.com/bmaltais/kohya_ss/issues/2701
>>
File: file.png (1.64 MB, 1280x749)
1.64 MB
1.64 MB PNG
https://huggingface.co/enhanceaiteam/midflow
Babe wake up, a new open source base model arrived
>>
>>102596778
Tried the configs in the thread, but same error
I think your arguments are for the CLI version but I use the GUI
>>
>>102596884

Looks mid. Why would you put mid in the name? lmao.
>>
>>102596884
>"inspired by Stable Diffusion and Flux 1"
>no other information
>literal mid example imgs
>>
>>102596884
>file sizes aren't egregiously huge AND we get a good amount of choice
we're EATIN' GUD
>so t minus 5 minutes until fluxfags call it shit
>>102596913
>>102596918
Or 3 minutes
>>
>>102596918
Credits:
>Pranav Ajay
>Stability AI
>OpenAI
>Google
>Black Forest Labs
>Fal AI
>CivitAi
It's just 1 pajeet and then all the other big companies
>>
>>102596209
Are you using another person's config file? If you are, did you change the file paths?
>>
>>102596922
>fluxfags
I take offense at such accusations. I use SD1.6
>>
File: for honor kneel.gif (1.27 MB, 372x374)
1.27 MB
1.27 MB GIF
>>102596961
apologies for the friendly fire, i'm on edge (and edging) today.
>>
File: p.png (95 KB, 362x648)
95 KB
95 KB PNG
>>102596884
Sus
>>
>>102596946
I tried both my own and other people's, and default presets, and yeah I changed the filepaths everytime
>>
>>102596884
>Pranav Ajay
I'll be skipping this one
>>
>>102596884
interesting to note this model isn't even available on their website for onsite generation
>flux uncensored is though
>>
File: IMG_0301.jpg (115 KB, 1024x1024)
115 KB
115 KB JPG
>>
>>102596998
Try running setup.bat and choose option 1.
>>
>>102597066
>flux uncensored
uncensored?
>>
>>102597168
Already did a few times sadly
>>
File: CHILD HUH.png (916 KB, 1670x843)
916 KB
916 KB PNG
>>102597228
yeah flux uncen-WOAH WOAH WOAH TIME OUT WAIT WHAT
CHILD HUH?
FLUX UNCENSORED WITH MIDDLE-WHAT?

https://huggingface.co/enhanceaiteam/Flux-uncensored?not-for-all-audiences=true
>>
File: pron sir welcome.png (85 KB, 1680x758)
85 KB
85 KB PNG
>child WHAT sir?
>welcome sir
>>
>>102597243
That's uhh... quite the example prompt he has...
>>
>>102597243
>has that as an example
>it doesn’t even show anything, which is good because WTF, but also bad because it’s a model failure
>>
>>102595131
You need to create a civitai token and add &token=XXXX where XXXX is the token at the end of the url.
And then the authors know the identity of everyone downloading their model.
>>
>>102597292
Example 3 is the same prompt but it's actually nude
She looks like a hag though
>>
>>102597307
>And then the authors know the identity of everyone downloading their model.
They don't actually, it's just to "prevent bots from downloading it"
>>
>>102597348
Yeah I’m not going to find out fuck that
The only thing I did with that model was check if it could do dicks (it can’t) and leave
Honestly at this point BFL is a homophobic terrorist organization
>>
>>102597365
>BFL is a homophobic terrorist organization
BASED
>>
>>102597365
>The only thing I did with that model was check if it could do dicks (it can’t) and leave
https://civitai.com/models/751269/xuncut-flux
>>
>>102597365
>Honestly at this point BFL is a homophobic terrorist organization
lmao
>>
>>102596645
ZootVision, recommending Epsilon version:
https://civitai.com/models/490451?modelVersionId=573612
Apparently censored by Google? It didn't appear in the first page.
Also, it sucks at photorealism, so if you're into that, move on.
>>
>>102597456
Thank you so much, I just spent a retarded amount of time trying to find something on Google or g archives
>>
>>102597456
perhaps the most visually offensive model on the market
>>
>>102597586
>perhaps the most visually offensive model on the market
No, that's BetterPony:
https://civitai.com/models/544876?modelVersionId=605949
You have to have score_9, score_8_up, score_7_up to avoid that.
>>
>>102597243
Huggingface examples are rotten because you can put a real picture there and claim your model made it.
I mean, civitai examples are like that, except at HF it looks like the API made them with the prompt, but you can put any pic and claim any prompt made it.
>>
File: blindqueen.jpg (869 KB, 1265x1625)
869 KB
869 KB JPG
i need this pic in original or higher resolution please
>>
>>102598080
you got it boss
>>
>>102598096
Thank you anon do you have upscaled the pic ?
>>
>>102598096
oh yes didnt see the filename
>>
gen me a cat singing
>>
>>102597403
>limp elephant trunks with un contained ginger bush
Chaz bono has a better cock than these
https://files.catbox.moe/5hyiho.jpeg
>>
File: catSuckMikkA.jpg (32 KB, 465x700)
32 KB
32 KB JPG
>>102598193
>>
I hope that anon who used ai-toolkit instead of koyah is doing okay
>>
File: gen me a cat singing.png (1.55 MB, 1597x747)
1.55 MB
1.55 MB PNG
>>102598193
you got it boss
>>
>so this is the power of flux dev
>>
>>102598213
>>102598238
>>102598266
meow meow meow meow meow meow meow meow, meow meow meow meow meow meow meow meow, meow meow meow, meow meow meow meow meow meow, meow meow meow meow meow meow meow meow meow. Meow, meow! Meow meow meow meow meow meow meow meow, meow meow meow meow meow meow meow meow meow meow
>>
>>102598266
>Clip skip 2
what? I thought you couldn't do that on flux
>>
File: iffyMix.png (428 KB, 768x768)
428 KB
428 KB PNG
>>102598266
Welp, looks like SD1.5 models have aged badly.
>>
File: IMG_0313.jpg (761 KB, 1125x1128)
761 KB
761 KB JPG
>>102598281
Your gen sir
>>102598224
they ask you how you are, and you just have to say that you're fine, when you're not really fine, but you just can't get into it because they would never understand
>>
>>102598358
Idk how to say this but that cat looks like it fucks human men
>>
File: CATSINGING.png (539 KB, 1024x1024)
539 KB
539 KB PNG
>>102598193
>cute chibi cat, standing on two legs, microphone in front, singing with musical notes around, heart above head, big expressive eyes, happy expression, blushing cheeks, fur pattern with black and white stripes, fluffy tail, small pink nose, tiny paws, wearing a colorful flower crown, background is transparent checkered pattern, digital art, cartoon style, vibrant colors, detailed shading, soft textures, light source from above, no background elements, no people, no clothing, no accessories, kawaii, playful, adorable, cheerful, joyful, cute cat, microphone stand on the ground, musical theme, singing cat, chibi art, cute animal, kawaii art, digital illustration, cartoon style, chibi cat, kawaii art, cute chibi cat, kawaii illustration, kawaii digital art, kawaii chibi cat, kawaii illustration, kawaii digital art, kawaii chibi cat, kawaii illustration, kawaii digital art, kawaii chibi cat, kawaii illustration, kawaii digital art, kawaii chibi cat, kawaii illustration, kawaii digital art, kawaii chibi cat, kawaii illustration, kawaii digital art, kawaii chibi cat, kawaii illustration, kawaii digital art, kawaii chibi cat, kawaii illustration, kawaii digital art, kawaii chibi cat, kawaii illustration, kawaii digital art, kawaii chibi cat, kawaii illustration, kawaii digital art
>>
File: file.png (53 KB, 1207x765)
53 KB
53 KB PNG
>>102596884
>Babe wake up, a new open source base model arrived
uh oh...
>>
Could someone explain what the relatively best way to start image gen these days is? I have a 4090. A long time ago I messed around with downloading SD, and used ComfyUI and A111 or whatever to generate a few images. This was when SD leaked and everything went bananas. >What model should I use to start with?
>Do people still get custom LORAs or something similar off of sites like CivitAI? Are those 'ideas' all still simliar?
I want to generate cool fantasy images and weeb images. Bonus points for mecha images, mostly lore stuff for an autistic little world I want to create
>>
>>102598888
Pony and the autism stuff off civitai.
>>
File: ComfyUI_34173_.png (884 KB, 800x1072)
884 KB
884 KB PNG
>>102598193
>>
>>102598992
thats adorable, now gen her naked and peeing on that table.
>>
>>102598933
>Pony
is pony a base model like SD? or is it something you apply on top of something else?
>>
>>102598888
>A111
Use forge. A1111 is kill. Other than that, all still the same. Except we now have Flux with contains an better LLM and thus understands normal language prompts better. Pony Diffusion for coom.
>>
>>102599093
>Use forge
no, use comfyui, that shit is aids and kill too, not even reforge is good either.
t. former forge acolyte
>>
>>102599060
It is a finetune of SDXL for anime but with much better understanding of hentai concept than the original models. It now has finetunes itself and is generally treated like an own ecosystem.
>>
>>102599106
Yes true. If you have an autistic mental condition that is.
>>
>>102599159
okay champ, you go and fix that huge bug with a1111 that crashes your shit if you go over 500 loras loaded in the UI
oh wait, you can't, you won't, and you'll even deny it exists.kill yourself.
>>
>>102599203
Who uses A1111? It hasn't been updated in half a year.
>>
File: adolf computer.gif (1.03 MB, 320x240)
1.03 MB
1.03 MB GIF
>>102599223
aaaand there it is.
>>
File: ComfyUI_34177_.png (1.37 MB, 848x1024)
1.37 MB
1.37 MB PNG
>>
>>102599229
Skill issue
>>
File: ComfyUI_34184_.png (1.41 MB, 848x1024)
1.41 MB
1.41 MB PNG
>>
>>102599126
Why the fuck won’t they make a flux version
WHYYHH
>>
if you prompt something really simple and generic, and you get a series of images from that model that are all very similar, does that imply the model is heavily trained with that subject in mind?
asking since im pretty new to this and trying to understand how that works.
>>
>>102598871
Wasn't the entire point that anybody could clone a repository with one click?
And nobody cloned Midflow?
>>
>>102599582
bro we just laughed at it for being indian i don't think anyone here even tested it
>>
File: MistoonEmerald2.png (828 KB, 768x768)
828 KB
828 KB PNG
>>102599126
But can it do this?
>>
>>102599602
this... maybe this was an insane model and we missed it because we love to make fun of pajeets too much :(
https://youtu.be/GzlKja1ySzo?t=9
>>
>>102599223
I don't have a GPU so I have to rely on A1111 WebUI CPU.
On a huggingface space at 20 minutes per generation.
>>
>>102599551
It's just Flux's lack of creativity deciding that's how that prompt should look like.
If you want more creativity try Kolors, AuraFlow, Pixart, or any SD1.5 merge like PixelKicks.
>>
what does clip skip do?
>>
File: 3339839997.png (1.85 MB, 832x1216)
1.85 MB
1.85 MB PNG
>>
>>102599735
The history is very big and complicated.
So people were all living happily without clip skip, then Novelai, a private model, leaked, and it was the best anime model by far, so people started finetuning and merging it.
But they found they had trained with clip skip, so now, any finetuning with clip skip looked better.
And so SD1.5 models were split, the ones based on branches without it, and the ones that had Novelai (and later AnythingV3) merged into it, that would look better with Clip skip, and if you merged any of them into your model, now it requires it to look good.
Hopefully this is what you wanted to know, and not the technical specifications of what it actually does.
>>
File: ComfyUI_34195_.png (1.55 MB, 848x1024)
1.55 MB
1.55 MB PNG
>>
>>102599821
whys do people use it with sdxl modelz den?
>>
>>102599476
Because of the license.
>>
>>102599696
https://stable-diffusion-art.com/forge-colab/
>>
>>102599843
I like the style, how are you making these?
>>
File: ClipSkip.jpg (68 KB, 976x600)
68 KB
68 KB JPG
>>102599862
For the same reason Novelai trained with it, some people like it better for some reason, even though it just make the CLIP encoder less accurate.
Note Clip Skip is always active as the default is 1, when people mean using it, they mean setting it to 2 or higher, or going from 12 to 11 on the diffusers version.
Here's a pic comparing without it and with it, and it's done by the people showing how it's better to use it, but honestly, I like the one on the left better.
It's mostly a subjective thing. Let's never talk about the 1337 thing.
>>
>>102599908
>Due to the computing resources required (High RAM), you need a Google Pro and Pro+ to run Forge on Colab.
Fuck you, if I wanted to spend money on this, I'd just buy a new computer with a giant GPU..
>>
File: ComfyUI_34198_ copy.png (1.4 MB, 848x1024)
1.4 MB
1.4 MB PNG
>>102599918
Here's the catbox https://files.catbox.moe/6j8ogg.png
I'm using this lora https://mega.nz/folder/DwEjQJSI#udon_Z-X99ZHJ4IwCfh_hg
>>
>>102600027
So what was your complaint again?
>>
>>102600036
Thanks!
>>
>>102600058
That Google Colab used to allow people to use their services for free for image generation and then they started kicking people out and closing their accounts for using Stable Diffusion... unless they had a Pro account, which is extortion and people supporting Google should be ashamed.
So the only way to use all the power of image generation FOR FREE without limits is with huggingface CPU spaces, which is very slow.
My computer is 14 years old so it may die at any moment and I'll buy a new one capable of doing all this locally, but I refuse to do it now just for one thing (image generation) because I can do everything else I want on it already.
>>
>>102599735
It stops the gens at a specific layer. The higher the skip number, the less the resulting image conforms with your prompt.
>>
>>102599893
Schnell?
>>
File: raindevil_.png (1.98 MB, 1263x818)
1.98 MB
1.98 MB PNG
>>
>>102600293
He will train on Auraflow and Schnell and release the one that looks better.
>>
>>102600282
Wouldn't using a lower CFG do that as well?
>>
>>102600317
Oh nice.
>>
>>102600238
>Company should hand out free gibs
They don't.
>14 years old
>because I can do everything else I want on it already.
You can do anything you want on a new one even better. Because 14 years old that's like 10 times slower than my phone.
>>
>>102600347
No, not nice, whatever happens it'll never look as good as what finetuned Flux Dev would have looked like.
Because of greed.
>>
>>102600357
>>Company should hand out free gibs
I hate when my toys are taken away. They WERE HANDING OUT free gibs, and they stopped, I was entitled to them.
>Because 14 years old that's like 10 times slower than my phone.
For what? Running games? My computer manages to run chess and HTML5 games just fine, and I can emulate any classic console at full speed, 4 CPU Cores are enough and its only problem is that it's incompatible with modern GPUs.
And I also have a phone as fast as your phone, anyway, no idea what I'm supposed to be missing, other than fast image generation.
>>
>>102600341
Clip skip works differently. Each layer you skip makes the details more generalized. For example, if you prompt for a dog, if you skip several layers, you might get a mammal that's not a dog but looks like one.

CFG is controlling the strength of the expression of the details.
>>
>>102600582
See, your toys can be taken away at any time if you don't own them. Conclusion: Own your toys and stop crying.
>For what
Idk, for what ever floats your boat.
>>
>>102599735
Image SD is navigating through a city using a map. Every corner it stop, looks at the map and judges for itself if it should turn.

Now imaging that instead of every corner it only looks at the map every 2 corners. The route is going to be mostly the same, but there will be more influence from SD (aka the model).

>>102600341
using the same analogy. Higher CFG will lower the amount of time the driver has to decide on the corner and make "more random" decision.

Something I can't prove, but seems like it happens is ultra low CFG (2 or 3) will give you the source images of a lora.
>>
>>102600303
Very cool
>>
>make shitload of nested 1girl wildcards for all possible angles, framings, characters, bodytypes, expressions, poses etc
>write an all-encompassing 1girl factory prompt
>run infinitely

this shit's dangerous, too much coom
>>
>>102600536
Fair. I wish he’d just release the dataset.
>>
>>102600967
Why don't you let us participate in your wildcard coom?
>>
>>102601087
i deleted it after cooking overnight, it was taking over my life
>>
>>102601114
sure
>>
File: Clipboard01.jpg (452 KB, 2327x926)
452 KB
452 KB JPG
I just can't wrap my brain around this how anyone can really go down and use something like this and even spend hours in this bullshit genning stuff and then even think that this is the best thing since sliced bread. It's as comfy as falling asleep on a bed of gravel.
>>
>>102600825
>Something I can't prove
What stops you?
>>
>>102601200
its priority on the list. Sooooo many projects and things I want to get done.
>>
>>102601085
There is nothing special about Pony's dataset, in fact, you could improve over it by not including the bad pictures they included with the Score_6 to Score_1 aesthetic because nobody really wants to generate bad quality and everyone is doing at least Score_7_up anyway.
That means you could reach its quality with just 600K picture.
What we're missing is someone with the money to train such a model for Flux, nobody could do anything with Pony's dataset without that capital.
>>
>>102601219
I hope you're not an ENFP (starting more projects than you finish.)
>>
>>102601299
no, I'm a top. I always finish ;)
>>
File: 0.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
>>
>>102601280
I have the money to train it but not the free time to build up a dataset. I’m being compulsive about it so just getting and doing captions for less than a thousand took probably 10H
>>
File: 00271-1343970263.png (3.16 MB, 1120x1440)
3.16 MB
3.16 MB PNG
rebellious subjects EMEMIES TO PEACE
>>
>>102601280
>you could improve over it by not including the bad pictures they included with the Score_6 to Score_1 aesthetic because nobody really wants to generate bad quality and everyone is doing at least Score_7_up anyway
t. somebody who doesn't understand anything about machine learning.
>>
>>102601569
Flux doesn’t even have proper negative prompts.
>>
File: 0.jpg (235 KB, 1024x1024)
235 KB
235 KB JPG
>>
File: 1701879365388151.png (264 KB, 684x504)
264 KB
264 KB PNG
>>102601177
Try genning something more fun then
>>
>>102601177
some people like spaghetti
>>
>>102601177
I’m a really visual/geometric thinker so I like having an easy way to conceptualize what’s happening. Normal UIs have this airplane cockpit effect where I have no idea WTF the settings actually do.
>>
>>102601280
>in fact, you could improve over it by not including the bad pictures they included with the Score_6 to Score_1 aesthetic
you have no idea what you're talking about and you're making a fool of yourself
>>
File: _flux10.jpg (1.26 MB, 2144x2144)
1.26 MB
1.26 MB JPG
>>102601693
These are great. Love your gens.
>>
File: 1712158244192247.png (120 KB, 512x512)
120 KB
120 KB PNG
>>102601873
Thanks
>>
could someone explain what pony is? is it standalone? do i need a certain version of stable diffusion to use it? what models are most of you guys using? sdxl? flux?
>>
>>102601799
>>102601628
>>
>>102601952

PonyXL is uncensored SDXL Fine tune. Can do NSFW stuff and was popular. IllustriousXL is posed to overtake PonyXL as the next coomer model. Using both PonyXL and Flux at the moment. Flux has higher fidelity, but cannot do NSFW easily. You can find PonyXL v6 and Illustrious 0.1 on CivitAI
>>
>>102601320
So your problem is you always keep adding things to the list in the middle so the bottom things on the list never get done?
>>
>>102601485
Wow, where have you been? If we could organize something and give you the dataset we could probably have the Flux finetune everyone is dreaming of.
>>
>>102601177
You just set up the work flow once and do many gens with it. The customization is one of the main positives of Comfy. For some people, this process very hard for them. This is why other UIs exist.
>>
>>102601799
Teach me master. Say all the pictures with the tag Score_1 are removed from the dataset, so now people can't generate bad pictures like that. It trans faster because they're fewer pictures.
In theory, it'd improve the model's quality because anything you want to draw without mentioning Scores will not be negatively affected by those bad pictures.
What is wrong with this thought process?
>>
>>102602032
IllustriousXL looks very preliminary. Is there some reason this will succeed? Release date is very soon so I'll download it anyways.

>>102602039
I'll a little worried this is going to turn into some weird /g/ ERP. I'm going to tap out now.
>>
>>102602081
>I'm going to tap out now.
Well, there goes my plan of making you increase the priority of finding out if low CFG can recreate the training images of Loras on the list.
>>
>>102602076
do you ever stop to notice that every shit model follows your philosophy, and every single good comprehensive base model does not?
>>
>>102602076
you're probably baiting but i'll bite. you have to know what bad is to avoid doing it. that's the whole reason the score tagging is included, so that you can learn the concept but then avoid the neural pathways that lead to it being a primary feature of the image. moreover there's all sorts of concepts for which there simple aren't any high quality images, so you have no choice but to learn them from low quality ones. simple example, if you say the simpsons are low quality art because they're simple cartoons and not fancy paintings/digital AW, and you prune them from your training data, now your model doesn't know the simpsons and can't draw them. pony is already extremely overcooked on a much-too-small dataset which led to plenty of catastrophic forgetting. pony can't draw a coherent room interior, it can't draw a toilet or a sink, it doesn't know what a figure skater is, and those are just examples i've run into myself.
>>
>>102602250
>you have to know what bad is to avoid doing it.
Everyone says this but the more I've thought about it the less sense it makes. How can the model make something bad if you never teach it something bad? Or are we assuming it will somehow extrapolate bad from high quality images then make bad we have no word association for to avoid it?
When we make Loras, we don't include bad images to teach it the difference. Infact, one or two bad images can mean needing to rebake a lora if it honed in on their shittiness. It seems more like this is the bakers cope because they simply don't have an actually high quality dataset or the means to sort through it. Im starting to think it's really nonsense "logic"...
>>
File: IMG_0329.jpg (78 KB, 1024x1024)
78 KB
78 KB JPG
Accidentally making an amerime Christina Hendricks lora
>>
>>102602360
You’re completely right and flux proves it.
How is flux censored? It literally just don’t have naked people in the dataset. And it turns out teaching it new body parts after it’s fully cooked is really hard.
Training on anything that isn’t perfect is suboptimal.
>>
>>102602052
Idk if you’re being sarcastic but I’ve lent out servers for people to train models before. I don’t even need the dataset — everyone I’ve lent to before had a private dataset and I wiped the machine before and after they used it.
>>
>>102602206
No, the only difference is the size of the dataset, Pony has a one so big that it offset the poor decisions of the trainer.
>>
>>102602250
>it can't draw a toilet

I knew this was bait, I still had to see.
https://litter.catbox.moe/b04nwm.png

It was bait. And yes, my base pony skills are ass.
>>
>>102602446
>It literally just don’t have naked people in the dataset.
It did and people have found ways to make it draw naked people just fine, the secret is to avoid the poisoned terms that put clothes on and poisoned terms that turn nudity into cherries and blur.
>inb4 no penis in the dataset
What? Who would want to draw one of those?
>>
>>102602511
The first has no tank and the other three are unbalanced/lopsided/deformed.
Also, ew.
>>
File: BURN.png (1 MB, 1000x804)
1 MB
1 MB PNG
>>102602501
>I don’t even need the dataset — everyone I’ve lent to before had a private dataset and I wiped the machine before and after they used it.
You are picrel and the reason we can't have nice things.
The people with the money to train and the access to the datasets run them to the ground.
>>
>>102602551
All you can get are lollipop swirly nipples and a mound of venus with no vulva.
No labia, no clit, no anus, no balls, no dick. It’s PG-13.
>>
>>102602511
Wait! How did you...
Oh wait, I thought we were talking about Flux for a moment...
>>
File: dndDM.png (762 KB, 1903x1013)
762 KB
762 KB PNG
working on a local dnd DM rn
>>
>>102597365
>if it could do dicks (it can’t) and leave
Dicks are the actually the ultimate nsfw test for a model, unsurprisingly the first model that could actually do cocks was a furry model, even the NAI leaked model sucked at dicks and had to be merged with furry models to do it well
>>
File: skating.png (999 KB, 1164x1002)
999 KB
999 KB PNG
Ahhh. Models with no fine tunes are trash.

>>102602567
it gets a lot better if I am not using base model. My point is it is in there.

>>102602579
You might have been using safe. It just won't do anything. using rating_explicit...well it works. It just includes other things by default it seems.
>>
>>102602588
Nice
>>
>>102602511
>this is what a sloptard considers an accurate toilet/bathroom layout
it's honestly a little sad how rotted your brain has become
>>
>>102602575
>lollipop swirly nipples
Skill issue, haven't you seen the perfect male nipples? You just trick it into drawing them on boobs.
>no vulva
You mean no vagina, you can get a vulva.
>It’s PG-13.
No way, people have needed to catbox Flux outputs because they're too hot for a blue board.
>>
File: 8421.png (3.34 MB, 1160x1496)
3.34 MB
3.34 MB PNG
something something big bara tiddies
>>
>>102602592
Oh I know. I’ve had something running that has a 3:1 penis:vagina ratio and while women have been basically fine since <1000 steps the penises are only just now starting to behave consistently at 12K+.
>>
>>102602614
Please show me your base flux dev labia minora, wizard
>>
>>102602630
SUCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC and licc and little cheeky nibbles until he mewls like a cat
>>
File: v.png (650 KB, 600x762)
650 KB
650 KB PNG
>>102602664
The labia is the external part of the vagina, for caring so much about female anatomy you don't really know what things are what.
A vulva looks like picrel, you may have seen a lot of them where Flux drew girls with hands around their crotches, and if you masked it to make it out of context, her fingers would look like she's showing her vulva.
So Flux knows how to draw them and you just need to make it do it in the place that you need it (not unlike making naked girls show this "two-fingers"-like vulva in the DreamlikePhotoReal 2.0 model.)
>>
>>102602360
You are correct, negative prompts are a hack, you don't have to teach whats bad so it doesnt do it, you just have to train it well
>>
>>
>>102602789
Vulva is the whole external thing from mons to perineum; I was being too ambiguous and meant the entire vulva isn’t possible, not no part of it is possible. You can get a blurred skin tone labia majora crease that looks like shit, or you can get deformed inhuman gore. There is no way to get a coherent full majora+minora+clit etc out of base dev any more than you can get a frenulum out of it
>>
File: floating toilet.png (1.08 MB, 896x1152)
1.08 MB
1.08 MB PNG
>>102602607
I didn't mention accurate toilet/bathroom at all. Image composition != objects in model. It's honestly a little sad how you are cherry picking to insult my rotted brain.

I added the "it's honestly" part as I believe it is a way to sound more uneducated. Its weird how somebody would think that a person would lie about their own personal opinion.

Anyways, I'll be posting 1girls with misplaced toilets till supper. Enjoy.
>>
File: 0.jpg (233 KB, 2048x1024)
233 KB
233 KB JPG
>>
>>102602871
ayo drop the workflow ninja
>>
I can always tell immediately when a gen was made with empty latent. Something about the way the colors and light and shade all 'balance out', if you look past the thumbnail you can almost see the gray square from which the image began.
>>
>>
>>102603101
Then you have a skill I lack.
>>
>>102603117
Why don’t you just add the JPEG after the generation is done?
>>
waiting
room
for so long
>>
>>102603163
why don't you think about it a little and see if you can figure it out yourself
>>
>>102603487
Lazy?
>>
>>102602588
Based
>>
>>102602887
A part of a vulva still counts as a vulva, you just conceded.
>>
File: 1713782503158646.png (50 KB, 965x406)
50 KB
50 KB PNG
As stupid as it feels to upgrade from 12GB to just 16GB, it's tempting when shit like this happens
>>
>>102603650
Ok retard
>>
File: 00000-4115924867.jpg (136 KB, 1352x976)
136 KB
136 KB JPG
>>
File: ComfyUI_temp_ekjap_00009_.png (1.53 MB, 1024x1024)
1.53 MB
1.53 MB PNG
it hasn't quite learned that the 1girl booru token doesn't literally mean a younger girl at this epoch, but it seems like its learning Aoin's artstyle alright. still no idea if the --weighted_captions ARG actually did anything or if I just poisoned my captions with (1girl:1.00) kek
>>
>>102603896
based voxels
>>
File: latent_grid.jpg (954 KB, 6400x4800)
954 KB
954 KB JPG
>>102603101
prove it.

latents tested the grey one is the empty one.
>>
File: lat_test_01.jpg (2.07 MB, 3584x3456)
2.07 MB
2.07 MB JPG
>>102604091
in random order of course.
>>
>>102604101
baller, shot caller, my man rolling 20 inch blades on his impala
>>
>>102604101
NTA but my guts telling me top left. don't give it away yet though I wanna see og anon's answer
>>
File: ComfyUI_00238_.png (860 KB, 1216x832)
860 KB
860 KB PNG
I just installed a 4060ti 16gb and gave flux a try, copied the workflow from some website and here's my first gen, something like "sexy asian french maid, cleavage, shiny lipstick" didn't fuck with anything at all yet
i'm looking forward to creating more gf's in this shit
way better than SDXL ootb
>>
>>102599720
IMO Kolors mogs both Pixart and Hunyuan by default and is way easier to train Loras on (you can just use similar settings to XL, in my experience)
>>
>>102604240
you load Kolors Loras just with stock "LoraLoaderModelOnly" in Comfy BTW, since "CLIP" isn't a thing for it at all, it only uses ChatGLM, and nobody is training that obviously
>>
File: ComfyUI_temp_ekjap_00038_.png (1.74 MB, 1024x1024)
1.74 MB
1.74 MB PNG
>try prompting just the trigger word
>receive picrel
this is getting weird, but ok flux, fair move, fair move..
>>
File: ComfyUI_temp_ekjap_00039_.png (1.84 MB, 1024x1024)
1.84 MB
1.84 MB PNG
>>102604293
it was more conservative on the next epoch/seed
>>
File: 00374-1789008862.png (2.04 MB, 896x1152)
2.04 MB
2.04 MB PNG
hey, must be the money
>>
File: lat_test_02.jpg (2.67 MB, 3584x3456)
2.67 MB
2.67 MB JPG
I triple checked the settings on this one. They are correct. I was really hoping it changing the initial latent had more effect.

I very lazily smudged the nipples out. Litterbox if anyone cares.
15.4MB png - https://litter.catbox.moe/wdyn58.png

and answer key:
https://litter.catbox.moe/jwkp1f.png
>>
>>102604229
I gotta take off, but I'll call myself "latent anon" so you can search the thread tomorrow when I post the answer. I feel like OG anon split, I hope not, I do want to see his powers.
>>
>>102604338
your cfg is extremely high
>>
>>102602446
All foundational models past SD 1.5 are intentionally "censored" in different ways and anyone who has ever claimed otherwise it utterly retarded. SDXL could do female nipple unintentionally in the sole context of literal oil painting gens where they weren't even the main focus.

SD3 actually draws naked people totally unprompted VERY often but they simply don't have nipples or visible genitalia, it's just more skin. SD3 is bad because they simply didn't train it for long enough and they also used a weird nonstandard approach to noise scheduling, not because of "censorship".

Flux draws (very bad, unusable) female nipples outright for some reason (it's unclear why they went that far knowing they were aesthetically poor in all cases, would have made more sense to just not draw them), and like the other models doesn't do anything for the "downstairs" region.

TLDR all of these companies have precisely the same kind of corporate safety goals and all of them always will, there is no good guy rebel company where they actually give a shit about John Coom complaining about subpar nudity in the outputs, he's not the target audience and never will be. It's the same reason that AstraliteHeart is delusional for thinking that any of these companies (SAI, BFL, etc) would logically have ever "responded to any of his emails" about Pony partnerships or whatever, the people who actually make these foundational models from scratch don't fucking give a shit about your finetunes, like why would they conceivably be impressed by that in any way.
>>
>>102604366
thanks anon, this was fun. I definitely didn't guess right on your second post, seems like my eye wanted to lean towards green/blue = empty latent, probably some personal bias towards grey = blue based & empty latent = grey square in my brain or something retarded RIP
>>
>>102604380
why would you even want female nipples if you aren't married to her
>>
>>102604417
lol asking the pertinent questions I see
>>
File: ComfyUI_temp_ekjap_00046_.png (1.66 MB, 1024x1024)
1.66 MB
1.66 MB PNG
>>102604380
>it's unclear why they went that far knowing they were aesthetically poor in all cases, would have made more sense to just not draw them
I seriously hate this, because it bleeds into male gens too. instead of having a gen that just has no nipples, I get weird deformity nips that look like swirl tooled gumdrops even on shirtless anime dudes even when the source dataset doesn't have nipples on them to begin with. thanks BFL, I hate it
>>
>>102604101
Can't tell with cfg this high on a 'strong' finetune. They tend to exhibit very little of the initial latent because they learn such strong token-image associations that the final image is almost completely determined by the prompt; which is why there is so little variation from one gen to another by noise seed.

In base model prompting or lower cfg prompting it's much more apparent.
>>
>>102604338
you're not going to believe me but I guessed correctly on this one.
>>
>>102604338
interesting that yellow/red/blue/cyan/black are all very apparent, but green/purple are almost too subtle to detect. Suggests that of R, G, and B, excess green in the initial noise is not predictive of image color balance, but red and blue are. But when you have both red and blue (purple), they cancel out.
>>
>>102603693
What gpu?
>>
>>102604657
2060 12GB. I can run Flux1-dev-Q4 and about 250MB's of loras before overflowing into computer RAM. A 4060 Ti 16GB would give a lot more wiggle room and despite the gimped bus would still be at least 2-fold faster from what I've seen, but I want to wait for at least a 20GB card in the ~$600 or less range.
>>
Is it just me, or did the previous versions of the lora look better
https://www.reddit.com/r/StableDiffusion/comments/1fro5z1/instagram_edition_v5_amateur_photography_lora/
>>
File: ComfyUI_temp_ekjap_00075_.png (1.46 MB, 1024x1024)
1.46 MB
1.46 MB PNG
>>
File: ComfyUI_temp_ekjap_00092_.png (1.67 MB, 1024x1024)
1.67 MB
1.67 MB PNG
>>
File: latent_girl.jpg (164 KB, 896x1152)
164 KB
164 KB JPG
>>102604370
dynamic thresolding is set to mimic 2.5. The KSampler is set to 3.5. Using dmp_2_ancentral.

>>102604405
it was for me too. One thing I "latent anon" might try is feeding different colors in if backgrounds aren't happening in pony. I think doing it for color is pretty mostly useless. Anon here >>102604539 had some idea about specifics that seemed promising

>>102604450
sd15, CFG 7. pic related incoming. Incoming pic for high CFG. I don't think it is a factor. I think model weight far exceeds importance with this. Key is in the top right corner.

>>102604797
4 looks better. 5 looks like instagram with everything burnt with auto AI cleaning.
>>
File: lat_03_combined.jpg (2.12 MB, 4690x3456)
2.12 MB
2.12 MB JPG
>>102605224
sd15, CFG 7 test as promised.
>>
File: ComfyUI_temp_ekjap_00094_.png (1.91 MB, 1024x1024)
1.91 MB
1.91 MB PNG
>>
File: lat_test_04.jpg (2.3 MB, 3584x3456)
2.3 MB
2.3 MB JPG
>>102605233
sd15, CFG 22 with dynamic down to 9. I am sure you can guess the key.
>>
File: latents vs results.jpg (1.97 MB, 2688x3040)
1.97 MB
1.97 MB JPG
ponyxl, starting image vs result, just plugged these into an old pixelization 1girl workflow. It's 100% denoising, 8 cfg. took the first result unless it was unpostable.

>>102605224
it was either her or the one on the far right middle. I guess far right was probably purple gradient then
>>
File: 00018-438794818.png (3.62 MB, 1344x1728)
3.62 MB
3.62 MB PNG
where is the hypernetwork tab in forge webui?
>>
>>102605297
beside the lora one.
>>
>>102605314
its gone
>>
>>102605338
it shares with textual inversion.
>>
File: 0006.jpg (1.38 MB, 1664x2304)
1.38 MB
1.38 MB JPG
>>
File: 00338-2731714749.png (630 KB, 512x768)
630 KB
630 KB PNG
>>102605345
its doesn't anon..
>>
>>102605402
It was gutted it seems:
https://github.com/lllyasviel/stable-diffusion-webui-forge/issues/275

I still stand by that is where it lives. I am sorry it lost support. Always sucks when that happens.
>>
File: 1716677469823136.png (985 KB, 896x1152)
985 KB
985 KB PNG
>>
File: ComfyUI_34212_.png (1.05 MB, 848x1024)
1.05 MB
1.05 MB PNG
>fishnet stockings reaching the middle of her thighs
Well, technically they are reaching the middle of her thighs.
>>
>>102604797
Dude is a localtard who thinks 0.00001 is a good learning rate for anything at all apparently, he literally says "Prodigy was good but I moved away from it. I trained using 210 images at 0.00001 lr, AdamW8Bit for 9000 steps. So far I'm happy with the results:"

TLDR If you're thinking about not copying the default settings CivitAI's trainer uses for that kind of thing (even if training locally) just don't, your retarded AdamW / Adafactor Pony style loras are all worse than the ones people train with normal settings on Prodigy. And so on.
>>
File: 00028-823506358.png (3.03 MB, 1344x1728)
3.03 MB
3.03 MB PNG
>>102605519
oh well, back to OG auto1111
>>
File: ComfyUI_34208_.png (875 KB, 848x1024)
875 KB
875 KB PNG
A lora based on https://xcancel.com/4yan1de
https://mega.nz/folder/mtknTSxB#cGzjJnEqhEXfb_ddb6yxNQ (shiyuki folder)
>>
File: ComfyUI_34210_.png (970 KB, 848x1024)
970 KB
970 KB PNG
>>
>>102604696
That's not bad
>>
>>102605792

Thanks anon. How do I resize LoRas?
>>
>>102605792
Why don't you post the loras on civitai?
>>
>>102605599
lol
>>
File: resize.png (61 KB, 1479x656)
61 KB
61 KB PNG
>>102605815
With kohya ss
>>102605825
Don't want to.
>>
>>102605599
>JC Chudson. 23 years old. No friends. No job. No gf. No thing ever happ-
The anon who made that lora is based
>>102605810
Yeah it does the but it'd be nice to be able to play GZDoom and proompt at the same time.
>>
File: hands.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
>>102605846
Is there a name for the style of rnata/toyo/shi with realistic background and anime on top of it?
>>
>>
File: ComfyUI_34217_.png (1.32 MB, 848x1024)
1.32 MB
1.32 MB PNG
>>102605868
I dunno.
>>
File: ComfyUI_34233_.png (1.22 MB, 848x1024)
1.22 MB
1.22 MB PNG
>>
>>102605271
that would be a hell of a gimmick, genning nothing but triangular bitches. and they'd call you triangleanon or something like it
>>
n slur
>>
>schizophrenic thread
>>
File: 2391669107.png (1.18 MB, 1024x1088)
1.18 MB
1.18 MB PNG
>>
>>102607552
very nice stocking, what style lora is this?
>>
>>102604338
>Litterbox if anyone cares.
I care that it's already gone :'(
>>
File: 3036981150.png (1.43 MB, 896x1152)
1.43 MB
1.43 MB PNG
>>102607620
huaishen on civitai
>>
File: ghettoldg.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
goddamn flux pro is better than flux dev. until I get a 5090 its hard to justify the continued struggle with local gens. its like trying to use medieval technology while others run ncircles around you. of course I will stick to local based on principle, but don't mind if use my free tokens every 6 hours on fluxpro.art because we are falling behind, anons and stop denying it
>>
File: 3506749378.png (1.21 MB, 1152x896)
1.21 MB
1.21 MB PNG
>>
>>102607725
thanku brother
>>
>>102607731
Normie thinks everything is a race.
You are a normie because you are unable to understand one simple thing:
Expression.
You are only able to express your anguish over excel sheets because you're a normie.
>>
File: elf_0902.jpg (757 KB, 1664x2432)
757 KB
757 KB JPG
Does anyone know of a workflow that combines lora and controlnet with flux and doesn't use those xlab nodes which seems to be less memory efficient than native comfy nodes? I can't make it work with a 4090.
>>
any good xl checkpoints come out lately? I'm still using biglust or natvis
>>
>https://github.com/THUDM/CogView3
>https://github.com/THUDM/CogView3/blob/main/sat/README.md
new chanur 3b model with 16ch vae and t5, has a distilled model too for 4 step and 8 step
>>
File: 0.png (923 KB, 2048x1024)
923 KB
923 KB PNG
HPH2P
>>
File: superdave shocked 1.gif (2.44 MB, 400x224)
2.44 MB
2.44 MB GIF
>>102608673
>TTI
>512x res
>((17G memory usage))
that memory usage better be in shared system and vram and they're just not really explaining it well
>>
File: file.png (11 KB, 800x260)
11 KB
11 KB PNG
>>102608673
is this an out of season April Fools joke?
>>
>>102603487
is this by flux? looks super realo. how are you making these?
>>
File: 0.jpg (80 KB, 1024x1024)
80 KB
80 KB JPG
>>
File: file.png (89 KB, 833x448)
89 KB
89 KB PNG
>>102609343
i assume most of that is just the t5, no explicit mention of vram
>>102609693
no, the cog guys just like releasing ten thousand variants of the same model for some reason like they did with cog video
>>
>>102607731
It could be as good as literally reviving Van Gogh himself, but what the fuck do I care if it has filters than won't let me generate what I want?
Just be patient and wait for the cloud models to eventually leak to local, faggot. When they do you will get bored of them anyway, you dopamine addicted niggerbrain.
>>
>>102609975
So the "plus" is a finetune or it's still a base model? that's really confusing
>>
>>102610121
if you thought that's confusing wait till you read this
>https://github.com/THUDM/CogView3/blob/main/sat/README.md
they have even MORE models. from the model description it seems like what they are would be explained in the paper, but im illiterate
>>
>>102610162
>CogView-3-Plus-3B
>CogView-3-Base-3B (with 4 and 8 steps distil)
>CogView-3-Base-3B-Relay (with 2 and 1 steps distil)
If someone wanted to define autism, you just have to link this readme kek
>>
>>102607725
>>102607824
Neat
>>
Bread...
>>102610271
>>102610271
>>102610271
>>
>>102610199
i think it's pretty cute and nice of them to release all that, unlike black forest labs that just distilled 2 models from flux pro and gave us that. no clue if the model is good but they've already won my heart
>>
>>102609941
yeah it's flux dev nf4. the trick is I take a social media image in photoshop and add the dust and scratches filter, 120 radius 4 threshold levels, then use that as img2img input image with ~0.9 denoising (with sgm_uniform scheduling, maybe different number for different scheduling idk). then in the prompt I avoid words like "girl"/"woman", instead prefer "my friend Luann" etc

flux behaves oddly with img2img, worth experimenting with more
>>
>>102609975
its also for batch size 4 for some insane reason



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.