Discussion of creative use behind free and open source text-to-image models

Last time on /ldg/ : >>103280382

Decadent Edition

>Beginner UI
Metastable: https://metastable.studio
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Models, LoRAs, training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://tensor.art/models
https://liblib.art
https://imgsys.org/rankings
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3

>SD3.5L/M
https://huggingface.co/stabilityai/stable-diffusion-3.5-large
https://replicate.com/stability-ai/stable-diffusion-3.5-large
https://huggingface.co/stabilityai/stable-diffusion-3.5-medium
https://huggingface.co/spaces/stabilityai/stable-diffusion-3.5-medium

>Sana
https://github.com/NVlabs/Sana
https://sana-gen.mit.edu

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd
https://rentry.org/sdvae

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
hi
>>103289661
hello
Hi, reposting from >>103290280
Say, I'm tired of running inference on my 1660 Ti.
What's best, a 3060 12G or a 4060 8G?
From what I understand, I should prefer VRAM over speed, no?
Is it worth waiting early 2025?
My budget is ~300
>>103291783
>From what I understand, I should prefer VRAM over speed, no?
Tough one. Technically you want more VRAM, but apparently the newer versions have stuff like CUDA 12 support, which comes in handy with txt2img and LLMs. Then again, I think the 3060 does make the cut for the CUDA architecture jump, so it might be a sweet spot. For comparison, I'm sitting on a Max-Q 2080 with 8GB VRAM, and it's just enough for SDXL equivalents, with ~20/25 seconds per standard gen. With 12GB you'd be slightly above the comfortable minimum, so maybe ~15/20 secs per gen? Don't take my word for it, but I think I'd go with the 3060, unless there's a caveat I'm not aware of.
>Is it worth waiting early 2025?
I'm also considering an upgrade, but with the upcoming 50 series, I'd personally wait to see how the market develops after that. Either earlier generations get cheaper, or there are more interesting alternatives within the "budget" 50s. Especially since Nvidia is going balls deep on AI, so we can expect minor or major architecture upgrades that are designed specifically for deep learning, as compared to earlier models where it was more of an afterthought really.
Are all the training images of people holding flasks from safety PSAs on how not to do it?
Where can you download sholi models? CivitAi doens't have them, right? But all the pixiv AISlopers are making them somehow right?
>>103291896
>GPU
Yeah, I think I will go for the brand new 3060 12G then.
Except if I can find a better used card that doesn't look like a scam.
>wait until 2025
Well, I think NVidia will kuck us all.
I fear that the 50xx will use even more power, and not that much VRAM.
So, yeah, on paper more computational power, but one would need to buy a new power supply.
Also, I think prices will not go down either.
>>103292431
>I think NVidia will kuck us all.
Can't argue with that, unavoidable with monopolies.
>>103289445
>>103292495
how tf did I miss that souls with guns kino
>>103291810
i like this style, anon
>>103290208
>>103292758
Have you tried illustrious/noob?
>>103292825
Yeah I tried, but no luck so far. Oddly enough I've had a much better time with ntrMIXIllustriousXL_v40. No clue what's the deal with it. Maybe tomorrow I'll give 0.65S a try and mess around with sampler settings. Either it wants particular settings, artist prompts, at least one lora, or it just doesn't like vanilla Forge.
>>103292878
I've been enjoying epsilonpred1.0 more than vpred only because the SMEA sampler fries the latter and not the former. .65s fries less than .6 but still isn't 100% working IMO. >>>/h/hdg will figure it out soon I'm sure.
>Either it wants particular settings, artist prompts, at least one lora
SMEA sampler is definitely better with anatomy than others I've tried. Artist prompts are finicky because you're SUPPOSED to prefix them with "artist:" while also only using ones from booru but I've gotten good outputs forgoing both those ideas. The artist prefix does seem to help outputs look better however. I have yet to feel the need to use a lora desu. If I used Forge I'd post a catbox for you but it'd been so long since I last used any AUTO-like UI that, the last time I tried it, I felt unable to get anything good. I didn't use pony so I am not a good judge on which one is "better", all I know is I have not yearned to go back to 1.5 anime... yet.
>>103292930
is this your tinder pic?
>>103293030
no, it's hatsune gigu
>>103292995
>/h/dg will figure it out soon I'm sure
Without a doubt, never underestimate the power of coomers and autists combined.
>which one is "better"
I'm knee-deep into pony, but from what I've seen out of anons and what little I've used of that mix I mentioned, it looks very promising. It's a bit rougher/messier on details, at least for now, but it does seem to have a better understanding of composition. With pony you have to squeeze it out, so as not to end up with a 1girl standing in an empty void.
>>103293030
omg it migu
>>103293103
What was the prompt for >>103292758 ? I'd like to see what I can do with it with Noob after I finish up some IRL shit. Out of curiosity if anything.
>>103292552
so much kino was posted in previous frfr nocap
>>103293156
I'd love to, but can hardly recall with the amount of img2img, inpaints and swapping prompts/samplers around. Along the way I've probably used:
>defeated wraith laying on snow, dutch angle, arched back, leaning backwards, swayed hips, metal gauntlet on chest, fluffy winter cloak, white cloth, black hair, cold, dying, smoke, smoldering
>>103289445
>>SD3.5L/M
>https://huggingface.co/stabilityai/stable-diffusion-3.5-large
>https://replicate.com/stability-ai/stable-diffusion-3.5-large
>https://huggingface.co/stabilityai/stable-diffusion-3.5-medium
>https://huggingface.co/spaces/stabilityai/stable-diffusion-3.5-medium
>
>>Sana
>https://github.com/NVlabs/Sana
>https://sana-gen.mit.edu
do these really need to be in OP still? seems like dead archs that could be replaced with more flux/illustrious/noob/pony/training info
>>103293471
Been a while since I baked, so didn't have time to give it a think. Open to suggestions.
Sunday Slowday
Better calm than cold.
>>103293862
much better than the spam trash that has been happening.
>>103293471
1st fuck SAI.
2nd SD3.5 isn't dead yet. I did a search sorted by downloads and limited to a month. It isn't ready to be removed. I would side with Civit's tagging on importance. Illustrious has an IL tag. NoobAI doesn't. I believe we can treat it completely the same as XL. If there is some background stuff (outside of training methods and it is BETTAR) that makes it different somebody inform me.
>>103293471
Sana is getting a 4-5b version but SD3.5 is absolutely dead
>>103293862
I don't know which thread to post in, this one has the correct formatting but the other one has more posts
>>103294412
We can't tell you where to post anon. Whichever you feel more comfortable in.
>>103294117
>SD3.5 isn't dead yet.
Might as well be considering no one talks about it
>>103294117
illustrious is like pony, and noob is like if someone took that and added back artist tags and then some. if anything noob should have an IL tag since it's downstream
I've revised the OP a bit, hopefully you won't hate it next time you see it.
>>103294628
Looks like it finally got a couple of finetunes and loras, but oh boy do they look like a step back.
>>103292930
Omg it Bigu!
Are there any comfyui upscalers that denoise for a few steps, then adjust the seed, then denoise for a few steps, then adjust the seed, repeating?
Because doing this seems to be really good at fixing hands and mistakes. Even a really really bad hand that looks like it's been through a woodchipper.
As we all know, using a high denoise % pigeon-holes you towards an image that doesn't match. So that's why you use a low %, but it's also why you should adjust the seed a bunch of times, so you're not pigeon-holed towards one seed.
>>103294785
you'd probably have to set up a separate sampler for each seed, i don't think there's a single custom node that does that
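The loop asked about above (a few low-denoise steps, new seed, repeat) can be sketched outside any UI. This is only an illustration under loud assumptions: `img2img` is a hypothetical callable standing in for whatever sampler pass you use, not a ComfyUI node or API, and the default pass count, denoise and step values are made-up starting points rather than recommendations from the thread.

```python
from typing import Callable, TypeVar

Image = TypeVar("Image")


def reseeded_refine(
    image: Image,
    img2img: Callable[..., Image],  # hypothetical: (image, seed=, denoise=, steps=) -> image
    base_seed: int,
    passes: int = 4,       # assumed default, tune to taste
    denoise: float = 0.3,  # assumed default: kept low so the image stays recognisable
    steps: int = 8,        # assumed default: "a few steps" per pass
) -> Image:
    """Run several short low-denoise passes, each with a different seed.

    The output of one pass is the input of the next, so no single seed
    gets to steer the whole refinement.
    """
    for i in range(passes):
        image = img2img(image, seed=base_seed + i, denoise=denoise, steps=steps)
    return image
```

In a node graph this is the same thing as chaining `passes` sampler nodes by hand, each with its own seed, which matches the reply above.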
>>103293094
This one, then
>>103295013
have you tried LTX video anon? it's faster to render than cog and I feel it's slightly better in quality
I shall plant many a kiss upon those cheeks...
>>103295027
I haven't tried it because all the examples I've seen were pretty incoherent, so I'm sceptical of your better quality claim
It's true that cog 1.5 is slow as fuck though, that vid took 13 minutes on my 3090
>>103295044
>It's true that cog 1.5 is slow as fuck though, that vid took 13 minutes on my 3090
for a 3 sec video, damn that's rough. with LTX you only have to wait 1 min for a 5 sec video. yeah the quality is hit and miss, but at least you have the luxury to quickly retry another seed, and they'll improve their shit, as the version they've released so far is only v0.9
One thing, when i try to run the webui it only gives me the option to open it in either windows explorer or chrome, i can't pick other browsers, how to fix
>>103295058
true, I'll try it out later today I guess. if it's that fast then it'll be worth it even if only 1 seed out of 10 is coherent
>>103295071
One workaround would be to save a bookmark of the address it gives you and open it in the browser of your choice.
>>103295071
I assume Windows. Open a webpage (any webpage) and save it. In Explorer, hold shift and right-click the saved file. Choose the option that says "Open with..." or "other". Find your browser of choice. The association should show up in the future.
I tried genning something, but it keeps making images look like liminal horror or some shit. The character is right, it's just that everything is all messed up
>>103295245
Another example
>>103295245
show us a screen of your workflow, you probably messed up some settings
>>103295245
reeks of VAE issues. Post your settings/workflow.
>>103295256
>>103295259
Damn i wonder what i did wrong
>>103295282
>cfg 13
bruh that's too high
>>103295282
use the dynamic thresholding addon if you want those high levels of CFG
>>103295282
CFG more than double what it should be, and resolution too low for an XL model. They're designed to generate about 1 megapixel, you're using 512x512 which is 0.25 megapixels. XL can't think at that resolution.
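The megapixel arithmetic above is easy to check, and the same idea gives a width/height pair near 1 MP for other aspect ratios. A small sketch: the ~1 megapixel target is the figure from the post above, while snapping to multiples of 64 is an assumption made here for convenience, and `size_for_aspect` is a made-up helper name.

```python
import math


def megapixels(width: int, height: int) -> float:
    """Pixel count in millions."""
    return width * height / 1_000_000


def size_for_aspect(aspect: float,
                    target_pixels: int = 1024 * 1024,  # ~1 MP, per the post above
                    multiple: int = 64):               # assumption: snap sides to 64
    """Return (width, height) close to target_pixels for a width/height ratio."""
    width = math.sqrt(target_pixels * aspect)
    height = width / aspect

    def snap(value: float) -> int:
        # Round to the nearest multiple, never below one multiple.
        return max(multiple, round(value / multiple) * multiple)

    return snap(width), snap(height)
```

For reference, `megapixels(512, 512)` is about 0.26, `megapixels(896, 896)` about 0.80 and `megapixels(1024, 1024)` about 1.05, so 896x896 is already most of the way to the target.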
>>103295296
What's the recommended amount?
>>103295297
What's that? Sounds cool
>>103295306
>What's the recommended amount?
Start with 3.5
>>103295306
>What's the recommended amount?
test that out, go lower until you find the sweet spot for your taste, everyone has their own cfg, but no one has it at 13 lol
>>103295323
>no one has it at 13 lol
anon clearly does, I guess it's also fun to put high/low CFG garbage through img2img and see what it makes of it with more reasonable settings
https://files.catbox.moe/6psmdg.png
>>103295335
same series
>>103295306
I would probably use concepts that you wouldn't understand. I'll say it allows for more creativity and reduces your chance of burning the image like you are doing. It depends on your UI. You can find it in the extensions tab for your UI. Here's the Forge explanation: https://github.com/lllyasviel/stable-diffusion-webui-forge/issues/122
I won't post the McMonkey one because he was a dick to me on discord.
https://files.catbox.moe/du1pez.png
>>103295304
getting a full megapixel is too high, i outright get a "not enough memory" error. Best i can do is 896x896
>>103295318
>>103295323
5 seems good enough
https://files.catbox.moe/tkp3c6.png
>>103295346
impressive gen anon, you made that with a lora?
https://files.catbox.moe/i7tse8.png
>>103295346
Yes, the Amiga Deluxepaint Lora
https://files.catbox.moe/0915uh.png
>>103295376
Yes, the Amiga Deluxepaint Lora
https://civitai.com/models/875790/amiga-deluxepaint-or-fluxd
>>103295335
Welcome back.
This "noobai" model is really nice, it can even fake artstyles
>>103295380
I love your images, thanks for sharing your Lora!
>>103289013
Reposting pic rel because those kinds of pictures are super useful.
>>103295402
Wanna ensure one of my pics makes it to the collage next time round, so I'm hedging my bets and poasting to both
https://files.catbox.moe/r7p9oz.png
>>103295419
You're welcome bby :*
https://files.catbox.moe/8blggg.png
bored rn so i'm open to gen ideas btw
>>103289445
What did they mean by this?
>>103295435
I wanted to see if IPAdapter would be able to replicate the Amiga Deluxepaint style but meh... it's not even close. desu more effort should go into improving that model instead of making thousands of loras. imagine a model that can make any style just by looking at one picture, that's the holy grail right there
>>103295481
That'd be a game-changer for image genning. Imagine not needing to prompt for style. It'd be the end of the novel-length prompt
>>103295473
Typical CYA legalese. They ain't worried about it, neither should you
https://files.catbox.moe/izeh17.png
>>103295509
>Typical CYA legalese. They ain't worried about it, neither should you
Then what's the point of even putting that clause there? If they don't give a shit why does it matter? I'm currently in their discord channel and one of the devs straight up said they rarely ever remove models, whether it's of a real person or an art style or a character.
>>103295473
you should only care if it is a real person. If it is a real person prepare to get banned. Research other real people loras and rework your offering following their examples. If it is a lora of a big player you may get some legal headache. So far marvel/nintendo are the only ones doing anything serious. Most other people that have an issue ask for the removal and it ends there.
>>103295556
>rarely ever remove models
think about how many models/loras they have and assign a number to what rarely would be.
>>103295556
>>103295565
If that's CivitAI I've literally never seen them removing Loras of a real person, even of people like Taylor Swift or Kelly Balthazar, two VERY lawsuit-friendly people
They don't give a shit, that's just boilerplate. In the unlikely event they get sued they can say "But the user pinky-swore it had your consent! We can't be legally liable for damages!"
https://files.catbox.moe/gvnjq7.png
>>103295613
To add on to what you say, I think the only time they would ever have to remove anything is if someone requests them to remove a data set. Apparently you can share both the models and the data sets you used to train them. Lora networks that recreate a style or likeness aren't copyright infringement in any way but sharing a dataset TECHNICALLY is, depending on how you define copyright infringement. Even with that in mind only a big time celebrity would be able to twist their arm enough to remove the data set and only that. If it's an artist that doesn't have the assets to sue them, they'll just be politely told to pound sand
>>103295613
That disclaimer specifies people who "are not public figures" so no posting models of your crush or mom
I for one would be happy to see the celeb jeetslop wiped from Civitai but they're safe
is there a way to change default parameters or do i gotta set it up each time?
stable diffusion ui
>>103295613
>Taylor Swift or Kelly Balthazar
weird, it is like they're on the news and bitching. It is like they like that spotlight on them and attracting more attention. Really strange their massive teams couldn't force civit to take this stuff down. It isn't like they are aging like crazy and becoming irrelevant.
>>103295644
Even if it would limit the 8 pictures to a new lora that don't actually use the lora.
>>103295661
there are some extensions. You can also just drag an old image into the one tab and click send to txt2img.
Why cant i see any lora, i dragged a safetensors file in my lora folder (ignore characters subfolder its empty) and there is nothing on the web ui?
>>103295692
refresh button bruh
>>103295704
I pressed it 20 times, even restarted the webui, there is NOTHING
>>103295716
check the log. I get some corrupted downloads sometimes. It will say something about rejecting it. As an alternative, I would check the sha256 against the download page.
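Checking the sha256 mentioned above takes a few lines with Python's standard `hashlib`. A minimal sketch: the function names are made up for illustration, and the expected value is whatever hash the download page lists for the file.

```python
import hashlib


def sha256_of(path: str, chunk_size: int = 1024 * 1024) -> str:
    """Hash a file in chunks so multi-GB checkpoints don't need to fit in RAM."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: str, expected_hex: str) -> bool:
    """Compare against the hash shown on the download page (case-insensitive)."""
    return sha256_of(path) == expected_hex.strip().lower()
```

If the hash doesn't match, the download is corrupted or incomplete and re-downloading is the first thing to try.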
>>103295724
>>103295044
New SageAttention 2.0.0 works with Cog Video 1.5 and gives a big speed increase, supposedly. I can't get it installed
But at the end of the day we're still just coping until MochiHD and img2vid releases
>>103295675
It may not be worth the Streisand Effect it would generate (the Balthazars learnt that the hard way), esp. since FLUX is open-source, so scrubbing it may not be possible. Since DALL-E 3 and MJ are proprietary it's way easier to get them to stop.
Anyway, no use speculating when there's genning to be done
https://files.catbox.moe/q40z5k.png
>>103295774
>at the end of the day we're still just coping until MochiHD and img2vid releases
I still don't get why they decided to work on the vid2vid vae first, like no one gives a fuck about that
>>103295793
You're right, it is strange
i found a trained flux checkpoint i like, but it requires me to add my own vae and clip, unlike the compact one i've been using. is there some way to turn the trained flux checkpoint into a compact one, or does it have to be trained like that?
How feasible is it to train a LoRA for a celebrity you have a few hundred pictures of? Would you need to label them first?
>>103295435
Oh, if you still take requests, can you generate the title screen for a retro MS-DOS medieval adventure game named "XIBRIFI CASTLE"?
I tried to generate fake boxes a while back (well, 11 months and three weeks ago to be exact) and I botched it because I didn't have much time.
>>103296056
with flux you need about 20-30 of them. you should add tags to all the images.
>>103296056
>How feasible is it to train a LoRA for a celebrity you have a few hundred pictures of?
Very feasible, people have been doing that for years at this point
>Would you need to label them first?
Yes, you can also have AI do that for you
>>103295026
what are you using to gen these? looks amazing, pls catbox
>>103296073
Put it on the queue. Will post results when it's done
https://files.catbox.moe/5a6cii.png
>>103291810
Disco Elysium lora?
>>103296073
https://files.catbox.moe/q26gb8.png
>>103296073
Same series
>>103296073
Now, some game ads
>>103296073
Same series. Settings in this catbox:
https://files.catbox.moe/o858ps.png
About Flux... While using .GGUF model I need VAE, CLIP and T5? Why does it say "CUDA Out of memory" with .gguf but when using Flux .safetensors model it doesn't show any error?
>
>get ComfyUI
>virus
>Get LORA and models from Civit
>virus
so THIS is the power of local AI
>>103297457
>be too dumb to use the internet
many such cases
>>103297457
We got him boys
>>103297457
don't download pickletensors files you silly dick and never trust them from here especially
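On the pickle point above: a .safetensors file starts with an 8-byte little-endian length followed by a JSON header, so it can be sanity-checked without unpickling anything. A minimal sketch, assuming nothing beyond that layout; `looks_like_safetensors` is a name made up here, and passing this check only means the file parses as safetensors, not that the source is trustworthy.

```python
import json
import os
import struct


def looks_like_safetensors(path: str) -> bool:
    """True if the file has a plausible safetensors header (length prefix + JSON)."""
    size = os.path.getsize(path)
    with open(path, "rb") as f:
        prefix = f.read(8)
        if len(prefix) < 8:
            return False
        # First 8 bytes: unsigned 64-bit little-endian size of the JSON header.
        (header_len,) = struct.unpack("<Q", prefix)
        if header_len == 0 or header_len > size - 8:
            return False  # a renamed pickle usually fails here
        raw_header = f.read(header_len)
    try:
        header = json.loads(raw_header.decode("utf-8"))
    except (UnicodeDecodeError, json.JSONDecodeError):
        return False
    return isinstance(header, dict)
```

A .ckpt or .pt renamed to .safetensors will normally fail this check, because its first bytes decode to a nonsensical header length or to something that isn't JSON.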
>>103295013
damn
>>103296173
Flux de distilled and a Rapunzel lora I trained, so I'm not sure a catbox will be much use to you.
Lora doesn't work too well with distilled Flux and needs 50+ steps for a good result which means SLOW gen times. It's probably not worth sharing on Civitai
>>103297859
He's not even a real doctor.