Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102367811

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/pol/uncensored+ai
Blessed thread of frenship
Anyone got links to all the flux loras anon has created / posted?
>>102384686
https://civitai.com/models/161068/stoiqo-newreality-or-flux-sd-xl-lightning?modelVersionId=728048
https://civitai.com/models/645943/flux-unchained-by-scg?modelVersionId=768009
https://civitai.com/models/673188/acorn-is-spinning-flux?modelVersionId=757421
1girlying in flux
Diffussy is born
>>102384956that's a pretty good looking sd1.5 gen
>>102384956alright. I'll throw an image out then.
>>102385001what in the fuck is wrong with her face? what race is she?
>>102385016
>what race is she?
fried
>>102385016>>102385031you guys flatter me.
I spent 2k buzz on the new fast LoRA training on civitai and it's fucking shitty, oh well -2$
>>102385187I hadn't realized the currency was so inflated. Did it just not work? Lacks settings?
>>102385211
you can't use settings on the fast trainer, apparently it just runs a single epoch in 5 minutes. Don't care though, I have my 3090 to train at home.
The results are very bad, this was using the same dataset I used for my successful averi lora.
>>102385282Averi is great
close enough
>>102385282
>No settings
>A single epoch
What the fuck is the point, are they trying to make their site content even worse somehow
>>102385282
Post the link to the lora, you get 50 buzz each time someone posts a picture to the lora
>>102385696https://civitai.com/models/709854/flux-averi-lora
>>102385696that's a good deal
>he hears you saying she has a big head
>>102384976
what are you doing anon? looks interesting
is it possible to tell a1111 to output the image before doing adetailer or highres fix? so i have an image at each step to make it easier to photoshop and blend errors
https://youtu.be/WtmKyqi_aFM?t=1176
Anyone tried playing around with clip and t5xxl modifiers?
>>102386201blending errors is a thing
>>102386243I am willing to try anything to get better styling out of Flux, but in the video Van Gogh is in the prompt and the images he gets don't seem to change in any meaningful form towards VG style. Are styles (as in artist recognition) hidden in clip? or the model just doesn't know artists' names?
>>102386463It's just cornmeal and vitamin E.
>>102386488
>>102386468flux doesn't understand styles or artists. use a style lora.
>>102386463>>102386509LOL
>>102386201yes. It is in the settings. You might need an upscaler add-on for it though. I am pretty sure ultimate SD upscale does it.
>>102386922kek
Bigma? Can you hear me, Bigma?
>>102386201
>>102387057
I was sure there was an Ultimate option. Anyways, here are the base settings ones.
This technology is useless if it cannot generate high quality porn images
yes, that's bait
>>102387057>>102387149thanks, i also found the option to save before detailer but looks like it only saves before all detailers, not after each detailer is applied. im using a body then face detailer. but not really a big deal
>tfw lull in new developments
Twobee
>>102386468
>the model just doesn't know artists' names?
yeah it doesn't know a lot of stuff, that's the biggest thing that should be fixed if someone decided to make a finetune of it
>>102385282With loras trained on civiati, I have to bump the strength up a ton to see any meaningful change. I don't know if that's due to the settings or the way the dataset was tagged however.
>>102388409civitai**
>>102388409NTA but I uploaded my local trained lora to Civit and I noticed if I try to gen on Civit it looks really weak, but gens as intended with local. No special WF or anything on my end either. It's weird.
>>102388443I was referencing a model trained on civitai used locally. I've never used that website to actually generate an image. It may very well be that I'm misreading the reply chain.
>>102385458
>What the fuck is the point,
money? profiting off retards will always be the best tactic for companies
>>102388409
>With loras trained on civiati, I have to bump the strength up a ton to see any meaningful change.
that's because it's undertrained as fuck, a good lora should work on every prompt but that's not the case for those sloptunes
>>102388409Flux seems to have trouble learning certain concepts regardless of learning rate and other settings.
>>102388099cozy bred desu
>>102387294I'm interested
>>102385001waow
>a car made out of clam shells and cardboard pasted together
where's that prompt adherence?
>>102389814It comes at the cost of requiring verbose prompting
>>102389814
>wheres that prompt adherence?
at CFG 6 my dude
Using easy training script doing 1024x1024 training on a 3090. https://rentry.org/zhrdo94i
It maxes out my gpu and doesn't go a single step, am I doing something wrong here? I haven't trained since the 1.5 days and i know 1024x1024 is more vram intensive but i feel like it should be able to handle this
>>102389899
I still don't get why the Flux guys decided to ditch CFG altogether, it's obvious that this shit gives way better prompt adherence than their Distilled Guidance meme, is it because Flux dev is a distilled model that it can't do CFG > 1 naturally? That means that flux pro can do CFGmaxxing right?
>>102386529
>flux doesn't understand styles or artists. use a style lora.
Terrible bait
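For reference, a minimal sketch of what classifier-free guidance does at each sampling step, assuming the standard formula `uncond + cfg * (cond - uncond)`. The function name and the toy numbers are made up for illustration; real predictions are tensors, not 3-element lists, and this says nothing about how Flux's distilled guidance works internally.

```python
def apply_cfg(cond, uncond, cfg_scale):
    """Blend conditional and unconditional predictions (standard CFG formula)."""
    return [u + cfg_scale * (c - u) for c, u in zip(cond, uncond)]

# Toy values standing in for noise predictions (hypothetical numbers).
cond = [1.0, 2.0, 3.0]     # prediction with the positive prompt
uncond = [0.5, 1.0, 1.5]   # prediction with the negative/empty prompt

at_one = apply_cfg(cond, uncond, 1.0)  # collapses to cond
at_six = apply_cfg(cond, uncond, 6.0)  # pushed further away from uncond
print(at_one, at_six)
```

At scale 1 the unconditional term cancels out, which is why a negative prompt only starts to matter once CFG goes above 1.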
>>102386529Absolute nonsense
>>102389956>>102389976samefag
>>102389980I'm sorry you were called out
>>102389916if you literally max out the VRAM and get into RAM/CPU mode it will take forever per step
https://github.com/Vchitect/Vchitect-2.0
looks worse than CogVideoX... and god knows how much I love to clown Cog kek
>>102390063
It's maxing out at a batch of 1. 24gb of vram
I tried training a lora on ~8k booru images, think this is worth exploring more? It doesn't seem to change flux much...
>>102390318
>~8k booru images
Has to take forever to train
>realtime flux
What is the state of 6gb vram mafia?
>>102388594
I don't think you were/are able to avoid that with other model types either. Training is difficult.
>>102390318
I don't think it really worked. Maybe if you also train CLIP? But I'd probably reduce the dataset at least for now. The training settings or pre-training processing need to be different IMO.
>>102390697Shieeeet
>>102390697>>102390831kek
>get in on the action with the novelAI leak 2 years ago, 4gb VRAM with 16GB ram
>slowish but decent results
>try it again now, same specs
>150s/it
what??? is it the model? im using ponydiffusionxl but back then i used the NAI leak, that's probably it right?
>>102390888One possibility is that you hit the memory limit of your GPU and the Nvidia drivers now use your system RAM which is SLOW.
>>102390888Could it be that you're just a little dense in general, anon?
>>102391048entirely possible. please explain to me like a retard what i'm missing, other than a brain, hurr durr
>>102390215you clearly know what you're doing then
>>102390891>>102390940>>102391131hello, I am with Netflix and we would like to give you millions of dollars to make these happen, how can I contact you?
>>102391157
>>102390888my nai model is about 4 giggy. but the pony models are all 6 and a half so its probably overflowing harder. but hey, why not play with a lightning model in the meantime while you figure out what to do? it generates images in less steps if you dont mind a couple extra fingers. just remember to adjust cfg and sampler too.
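A rough back-of-the-envelope check of the overflow theory, using only the sizes quoted in the posts above (about 4 GB for the NAI checkpoint, about 6.5 GB for the Pony ones, 4 GB of VRAM). This is a sketch under heavy assumptions: it compares file size against card size and ignores activations, precision casting and any low-VRAM offloading the UI does, so treat the numbers as an illustration rather than a measurement. The function name is made up.

```python
def overflow_gb(checkpoint_gb, vram_gb):
    """Rough amount of weight data that cannot sit in VRAM (ignores activations and precision)."""
    return max(0.0, checkpoint_gb - vram_gb)

VRAM_GB = 4.0  # card size from the post above

nai_overflow = overflow_gb(4.0, VRAM_GB)   # ~4 GB checkpoint
pony_overflow = overflow_gb(6.5, VRAM_GB)  # ~6.5 GB checkpoint
print(f"NAI overflow: {nai_overflow} GB, Pony overflow: {pony_overflow} GB")
```

Anything above zero here is data that has to live in system RAM or be swapped in and out, which is where the per-iteration time can blow up.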
>>102391182
>lightning model
not sure where i'd find that or what it is, i assume some scaled-down model or something.
i know it's kind of retarded to assume a checkpoint from now would have the same hardware reqs as one from 2 years ago but i figured there's something out there that's far more efficient than anything that used to exist that utilizes low-end hardware like mine. just don't know where to find one unless anyone has any recommendations.
>>102391190>indians instead of JewsI think I want to cheer the black nazis
>>102391286
go to civit and look for lightning models. some of them are ponyxl if that's what you're after.
it's not silly to assume things get more efficient over time. supposedly there is some text encoder logic from flux that might be compatible with xl which might mean speedier xl models at some point.
what are some webzones that allow ai art? is pixiv the main one? twitter sucks
>>102388524I understood that, I was offering my similar but reverse experience.
>>102391604This thread looks like a good enough place desu
>>102391604A lot of social media sites allow it.
>>102389916
You need to use the split mode extra arg like it says on kohya's GitHub if you're making flux Loras
>>102392160Ignore me just looked and saw it's pony
>>102384976Post imgs
>buzz beggars board on the front page of Civitai
working on a lora to make those hyper realistic ball jointed dolls like ringdoll desu
Doesn't fooocus already come with python/anaconda? I used it on windows just fine without having to install those things. Yet linux guides keep saying to download them?
you wastrels, you <insert invective here>! how dare you even exist? i am very angry! this is my """)))grrr(((""" face
I want a model like autismmix (pony anime model) that can do almost any character even without loras
loras are fine, but it's nice when a model can do a lot of stuff without a lora too
>>102393036the computer says no.
>>102393048you could even make something with pony and img2img it in flux, but that's not quite the same
>>102393059https://www.youtube.com/watch?v=CpV-dHDTf9Y
>>102389899is there some crucial setting i can only reach in comfyui?
whey hey me hearties
>>102393111no, it hinges on how much you love connecting nodes together and figuring out absolutely everything, from scratch, on your own. comfyui is a misnomer. it's not bad, but it's not very comfy, either.
>>102393036>I wantGood for you
>>102393176well it's not really necessary, for anime stuff I have sdxl/pony, but flux is getting a shitload of loras daily and can also do anime. The most creativity is possible with flux prompting.
>>102393192I didnt ask
>>102393159
well i wonder what to do about blurry images then. im sure this looks nice under the vaseline.
comfy does look more futuristic, which combined with the nature of this computer program could bring about a sense of optimism.
i feel like a wizard, but maybe feeling like a mad scientist would be fun too.
>>102393225the blurriness is not caused by your choice of UI. it's something else. i mean, i guess it could be caused by some systemic ineptness in piloting of said UI, but that seems unlikely. either ur prompt is bad, or your settings are bad, and all of that applies to auto1111 and comfy.
>>102393262
should have said
>i mean, i guess it could be caused by some systemic ineptness in piloting of said UI, but that seems likely
but still, it's not a UI problem. PEBCAK, yeah?
>>102393273long story short: your gens look like shit because you are shit. ipso facto. qed
ever have monsters pop up in your gen for no reason? i've noticed it with skibidi mix and pixel mix.
>>102394447Her pelvic region is so wide it must whistle like a jug in the wind when she walks
>>102393966how are you generating booba that big with Flux?
>FaceDetailerPipe
>ModuleNotFoundError: No module named 'mediapipe'
Using comfyui. Installed the impact pack via manager. What am I doing wrong?
>>102386529
I haven't found Loras for a lot of artists I would want to try, but even with the few I have found, Loras have been underwhelming, especially because mixing style Loras degrades the model a lot. I am back to img2img Flux gens with SDXL.
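That error usually means the package is missing from the Python environment ComfyUI actually runs in, which is not necessarily the system one. A small sketch to check, using only the standard library; it needs to be run with the same interpreter ComfyUI uses (which interpreter that is depends on your install, so that part is an assumption). The helper names are made up.

```python
import importlib.util
import sys

def is_installed(module_name):
    """Return True if the current interpreter can find the module."""
    return importlib.util.find_spec(module_name) is not None

def install_hint(module_name):
    """Build a pip command bound to this interpreter, so it installs into the right env."""
    return f'"{sys.executable}" -m pip install {module_name}'

if not is_installed("mediapipe"):
    print("mediapipe not found for", sys.executable)
    print("try:", install_hint("mediapipe"))
```

Calling pip through the interpreter path rather than a bare `pip` avoids installing into a different Python than the one that raised the error.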
>>102394592You dont know how whistling works apparently
>>102394629It's pony. I'm using pixelmix.
>>102394757and I'm using pixelization node
>>102393966put toy, doll, in the negatives.
>>102394831didn't work chief
>>102394948
use negpip
https://github.com/hako-mikan/sd-webui-negpip
I used crowded in the negative this time around idk if it'll stick with the next gens
>>102394948
can't post catbox there's a problem with uploads rn
>>102394968
I'll try that
Where do I go to learn the more technical aspects of this tech? I know how to use it but I don't know how it works. I'd like to change that by learning things such as how the models are trained, how training even works, and how the models turn latent vector spaces into images.
>>102394994well no more issues it seems thanks everyone
>>102395172and a retro 90s Kagefusa before I go
>>102393225
>well i wonder what to do about blurry images then.
that's because you need to add the AutomaticCFG node to make it work at CFG > 1
https://github.com/Extraltodeus/ComfyUI-AutomaticCFG
>>102395631nta but what settings do you use for cfg and positive/negative cfg guidance?
Anons, is local diffusion inferior? I'm coming to a sad realization all the "awesome things" we can do with ComfyUI pale in comparison to what these Japanese people on X are able to do with Midjourney. I try to replicate the high resolution, clear, ultra stylish, highly detailed anime pictures I see on X using ComfyUI, and fail miserably with these lousy Flux dev upscales
>>102395800
cfg 6, positive/negative cfg guidance 3.5
>>102395808
Midjourney is trash. I can use Flux on huggingface that is trained on a Realism LoRa and it is far better than anything I have seen on Midjourney. Also the prompts aren't censored and I can generate girls with big booba
You got memed
>>102395808care to show an example on what MJ can do that SD/Flux can't?
>>102395808>Japanese people on Xyou mean anyone using Niji
>>102395874Precisely. How do we make stuff as good as Niji with local diffusion. Is it a hardware limitation and upscale limits
How do you know how expensive it is to train on Flux?
>>102395890
>How do we make stuff as good as Niji with local diffusion. Is it a hardware limitation and upscale limits
Mj is the only non cucked entity with NAI, they train on every single copyrighted artist drawing to get the quality, the rest of the group (Flux, SD, Pony) are cucks to artists and will only train on generic shit, there's your answer
>>102395808
You don't necessarily know but
>initial gen + photoshop fixes
>upscale with sd15 using specific artist tags
Flux isn't the end-all be-all, you'll need to find the workflow for what you want.
>>102395896well, what we know is that to make pony-v6, it cost more than 10k dollars, that was for a 2.7b SDXL model, now imagine doing the same thing for a 12b model
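Purely as arithmetic on the numbers in the post above: if training cost scaled linearly with parameter count (a big assumption, since dataset size, resolution and hardware efficiency all matter, and the 10k figure was given as a lower bound), the 12b estimate would come out like this. The function name is made up.

```python
def scaled_cost(base_cost_usd, base_params_b, target_params_b):
    """Naive linear scaling of training cost with parameter count (assumption, not a law)."""
    return base_cost_usd * target_params_b / base_params_b

# 10k USD for a 2.7b model, scaled to 12b, using the figures quoted in the post.
estimate = scaled_cost(10_000, 2.7, 12)
print(f"~${estimate:,.0f}")
```

That is only a floor under these assumptions; real costs for a larger model could be well off this line in either direction.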
>>102395918Which 1.5 model/workflow to upscale? I dont like ult upscale because it's slow, I use a plain upscale node
>>102393262
>>102393225
The blurriness is a prompt issue, there are some tricks to remove it completely, you need to use tags on clip and boomer prompting on t5
How the hell do i use GGUF flux models on my forge? It's not appearing on my checkpoint dropdown.
>>102394746English must not be your first language if you think that the use of the word whistling was incorrect.
https://openrouter.ai/models/mistralai/pixtral-12b:free
I think Pixtral is free on OpenRouter, it's a captioning model from Mistral
>>102396505
You could not have made a more ironic post. That's literally not even what he said lol.
>>102389899Just copied that workflow. Let's see if it improves anything. Can I use negative prompts with that?
>>102397112
>Sign in to OpenRouter
No thanks
https://huggingface.co/spaces/aixsatoshi/Pixtral-12B
>They
>Doesn't get she's eating with her foot
to the trash it goes I guess
>>102397201
>Can I use negative prompts with that?
I think so, but it's not consistent at all, sometimes it works, sometimes not
https://reddit.com/r/StableDiffusion/comments/1eq214z/text_encoders_are_really_bad_at_negations_thats/
damn those mosquito bites showing up through the leather bikini
>>102390891>>102391157>>102391171kek
>>102395808ALL the major local pretrains use extremely safety cucked datasets
>>102397547this
>>102397547Good. People are too addicted to porn
>>102397638desu I agree with that, still trying to get rid of that addiction, it's fucking unhealthy and makes me tired all the time
>>102397746What do you do all day. Most times people use porn as a stopgap for other activities.
>>102397776nothing special lol, I fap to sleep and that's the issue, yeah I sleep well but when I wake up it's horrible
>>102397638yes because the difference in datasets between mj and local is clearly porn and not the exclusion of artists
>>102395808Show a midjourney image you could not make in flux
>>102397823I'm sorry you are so upset
>>102397796
>nothing special lol, I fap to sleep
Here's the issue. You have energy because you haven't done anything all day. Try having an engaging hobby or volunteering if you are a NEET or something that will use your energy and entertain you
What's the fastest way to do flux inference on cpu?
>>102397944
>fast
>cpu
>>102397944You open your browser quickly to https://www.fastflux.ai/ because nvidia is worth so much for a reason.
Is it just me or do you get blank image output with comfyui when you have characters like comma in the prompt
>>102397944
anon, you really think Nvidia would be worth trillions if cpu was fast enough for inference??
>>102398039>>102397990Maybe a better phrasing would have been "least slow."
does this work?
>>102398217Yeah, probably but why do this? It's more difficult to control the prompt this way. What are you trying to achieve?
>>102398217Yes but why not just use this? Also if we're talking flux it's pretty much useless. SDXL can somewhat benefit with splitting tokens, just like with using BREAK in Forge, but not flux.
>>102398255
Thanks I didn't see that one. Yeah, it's Pony.
>>102398246
I'm trying to give equal weight to both Character Description and Action regardless of their length.
What does ModelSamplingFlux do? (max_shift,base_shift)
>>102398328if the resolution ratio is 1:1 (like 1024x1024) then base_shift does nothing, max_shift on the other hand seems to deal with luminosity and contrast, that's how I experienced it
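For what it's worth, here is a sketch of how I understand ComfyUI's ModelSamplingFlux node to derive its shift value: a linear interpolation between base_shift and max_shift over the latent token count. The anchor points (256 and 4096 tokens), the 16-pixel patch size and the 1.15/0.5 defaults are from my reading of the node and should be treated as assumptions; check the node's source before relying on them. The function name is made up.

```python
def flux_shift(width, height, max_shift=1.15, base_shift=0.5):
    """Interpolate the shift between base_shift (256 tokens) and max_shift (4096 tokens)."""
    tokens = (width * height) / (16 * 16)  # 8x VAE downscale, then 2x2 patches (assumed)
    slope = (max_shift - base_shift) / (4096 - 256)
    intercept = base_shift - slope * 256
    return tokens * slope + intercept

for w, h in [(256, 256), (512, 512), (1024, 1024)]:
    print(w, h, round(flux_shift(w, h), 4))
```

Under these assumptions a 1024x1024 image lands exactly on max_shift, so changing base_shift alone does nothing at that size. It would be the pixel count rather than the 1:1 ratio that decides this, so other resolutions should respond to both values.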
how to make flux do crying expressions, like bawling her eyes out
Anyone tried this yet?
https://civitai.com/models/756183/raemu-flux
>>102399621
Looks like a generic sd1.5/sdxl slop mix.
>1,500 carefully curated, high-quality aesthetic anime images
That's, like, nothing, it needs many more images from various artists for any interesting results. Also kek'd at the schizo in the comments
>>102399621
Some schizo from 4chan escaped the containment board to comment in there it seems kek
>>102399621
looks very slopped, he probably used synthetic pictures to make this finetune
>>102399720>>102399694>Kotflocke
>>102399621
>all the images are sameface/samestyle
I suspect this is a base model with a lora baked in
Yo
>>102384776>>102384776>>102384776
So I wanted to replace my old A1111 install with forge (I was using Comfy for a long while) and was following steps from https://github.com/Panchovix/stable-diffusion-webui-reForge
but when I'm trying to launch I get:
CUDA Stream Activated: False
D:\SD MASTER\stable-diffusion-webui\venv\lib\site-packages\transformers\utils\hub.py:127: FutureWarning: Using `TRANSFORMERS_CACHE` is deprecated and will be removed in v5 of Transformers. Use `HF_HOME` instead.
  warnings.warn(
Traceback (most recent call last):
  File "D:\SD MASTER\stable-diffusion-webui\launch.py", line 51, in <module>
    main()
.[bunch of other file calls].
  File "D:\SD MASTER\stable-diffusion-webui\ldm_patched\modules\model_base.py", line 6, in <module>
    from ldm_patched.ldm.modules.diffusionmodules.openaimodel import UNetModel, Timestep
  File "D:\SD MASTER\stable-diffusion-webui\ldm_patched\ldm\modules\diffusionmodules\openaimodel.py", line 22, in <module>
    from ..attention import SpatialTransformer, SpatialVideoTransformer, default
  File "D:\SD MASTER\stable-diffusion-webui\ldm_patched\ldm\modules\attention.py", line 21, in <module>
    import xformers
ModuleNotFoundError: import of xformers halted; None in sys.modules
Press any key to continue . . .
what do?
>day 15 of pixart pride month
>>102400384
If you have tried the whole "switch with git" method i would recommend a clean full new install
Saves a lot of hassle, especially when you have an old install and dont update frequently
>>102400384
Google the line above Press any key to continue and you will have your answer
>hint: run with --xformers
>>102400800>>102400812>>102400826>>102400842very nice
>>102397925Cool
>>102401616ty
>>102384776Is it possible to undervolt a laptop GPU or limit the amount of resources stable-diffusion uses to a safe limit so the device doesn't burn down with excessive prompting?
>>102391604I post my effortgens (animation or lots of inpainting) on deviantart. Mainly because I used to post my fanart there a long time ago.
>>102401760I want to believe
>>102398246NTA but either concat or combine is the equivalent of BREAK in Comfy AFAIK.
is there a new meta for training LORAs locally? I trained an xl on my 8gb card months back and COMPLETELY alzheimer'd all the details, i don't even remember what particular program i used to do it and cant find the fucker on any of my 6 drives..Im finding you don't really need a lot of images to get something really good trained for xl and pony if you're just doing likeness so i want to try a few again, fucked up last time because my dataset was r tarded and inconsistently sized.
>>102394757Pixelization node gives me weird artifacts, can i see your workflow?
>>102399720It looks like Mastodon. That's normal.
So which one is the best for creating real life human and some kind of tacticool stuff?