Discussion of free and open source text-to-image modelsPrevious /ldg/ bread : >>102460029>Beginner UIFooocus: https://github.com/lllyasviel/fooocusEasyDiffusion: https://easydiffusion.github.ioMetastable: https://metastable.studio>Advanced UIForge: https://github.com/lllyasviel/stable-diffusion-webui-forgeAutomatic1111: https://github.com/automatic1111/stable-diffusion-webuiComfyUI: https://github.com/comfyanonymous/ComfyUIInvokeAI: https://github.com/invoke-ai/InvokeAISD.Next: https://github.com/vladmandic/automaticSwarmUI: https://github.com/mcmonkeyprojects/SwarmUI >Use a VAE if your images look washed outhttps://rentry.org/sdvae>Model Rankinghttps://imgsys.org/rankings>Models, LoRAs & traininghttps://civitai.comhttps://huggingface.cohttps://aitracker.arthttps://github.com/Nerogar/OneTrainerhttps://github.com/derrian-distro/LoRA_Easy_Training_Scriptshttps://github.com/kohya-ss/sd-scripts/tree/sd3>Fluxhttps://huggingface.co/spaces/black-forest-labs/FLUX.1-schnellhttps://comfyanonymous.github.io/ComfyUI_examples/flux>Pixart Sigma & Hunyuan DIThttps://huggingface.co/spaces/PixArt-alpha/PixArt-Sigmahttps://huggingface.co/comfyanonymous/hunyuan_dit_comfyuiNodes: https://github.com/city96/ComfyUI_ExtraModels>Index of guides and other toolshttps://rentry.org/sdg-linkhttps://rentry.org/rentrysd>Try online without registrationtxt2img: https://www.mage.spaceimg2img: https://huggingface.co/spaces/huggingface/diffuse-the-restsd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium>Maintain thread qualityhttps://rentry.org/debo>Related boards>>>/aco/sdg>>>/aco/aivg>>>/b/degen>>>/c/kdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/tg/slop>>>/trash/sdg>>>/u/udg>>>/vt/vtai
>>102480518>>The loras trained on Flux are A LOT better than the loras trained on SD1.5 and SDXL. It's a night and day difference.Any examples?
So it isn't possible to create a model for us mortals, only finetuning already existing models?
Bigma status?
>>102480808It is but currently only for specific things like Pixart Sigma.I don't think you have the hardware to do Lumina or Flux full model training at reasonable speed from scratch... and also no money to rent it to get a good result for a more generalist model.
Redeemed
>>102480791>Any examples?my favorite lora so far https://civitai.com/models/7227/satoshi-urushihara-style
>>102480808I don't really get why people want to, or maybe people have these insane ideas of a "perfect" model and think if they just include porn in the dataset shit will be perfect. I think both pony and animagine show that finetunes can take you really far and it is better to finetune that build shit from scratch.
>>102480791top right of OP has made some really good ones https://mega.nz/folder/mtknTSxB#cGzjJnEqhEXfb_ddb6yxNQ
>>102480940>I think both pony and animagine show that finetunes can take you really far and it is better to finetune that build shit from scratch.I don't want to sound like a doomer but even if you managed to make a model from scratch, you won't beat Flux, this shit is good and it's probably because it's a fucking 12b model, can't train a fish to run like a cat or some shit
>>102480973I wonder if some people took those loras and put them on civitai, and if it never happened, would you mind if someone would do that (just to preface, I won't do that lol)
>>102481024>buttchin
>>102480871Is it better to make a lora than pixart sigma? How does it go?Is a diffusion model just not feasible?>>102480940>>102480984Best flux model to finetune for cartoon/anime stuff?
>>102481095>Best flux model to finetune for cartoon/anime stuff?you only have 2 options, dev and schnell
>>102481189prompt?
>>102481095It is not just "better" than having a LoRa DORA or whatever for Flux or Pony, no. And yet Pixart Sigma *is* a full diffusion model type which you can train at fairly reasonable speeds from scratch on a 4090 (or a few other not insanely expensive pieces of hardware) and get somewhere.
>>102481095>https://huggingface.co/Raelina/Raemu-Fluxconsidering the timeframe and dataset, that ain't bad. its too soon for finetunes on flux. come back in six months and lets talk.
animagine can pull it out sometimes
>>102481219Kek
>>102481219so why does no one use pixart? havent tried it but im gonna right now when the model finished downloading, looks like SDXL at half the size, but i dont have a lot of example gens to go off of because no one even uses it.apparently the next Pony is based off it.
>>102481292lmao
>>102481292kekle
>>102481267No 16 channel vae kills tho
>>102481470Because it's not the best trained model at the moment. If you had training data and let it train for a longer while you could probably surpass pony and sdxl eventually. But this isn't easy, just feasible.It's also not necessarily ending up as or more capable as Flux. It currently isn't.
red room
>>102482477October soon
pasta theme, lets go
>>102480758Nice
>>102480945>>102482842>>102483045kino
Why is it you get a gen that's almost perfect and when you try any permutations for variants to fix the flaws it just ends up fucking up the good partsargh
>>102480945Very cool
>
>>102484027Prompts not good enough
Just how much more intensive is it to run flux over something like SDXL?
>>102486522>2s/it>to 368s/it
>>102486522a lot, enough for me not to bother with it
>>102487102Vegeta what does the scouter say about the seconds per it?
>>102486522It is slow and totally consumes the graphics card. I have a 4090 and with SDXL I could play computer games while genning with SDXL. Flux I feel like I need a headless computer dedicated just for the task.
>>102482975>barbar
>>102487656i want her to gobble me up
new snowpony dropped, colors are noticeably darker/moodier. very nice>possibly more detailed prompt adherence?https://civitai.com/models/522596/snowpony
>>102488535>checkpoint mergeIt'll take an entirely new finetune to affect prompt adherence
>>102488618Is that how it works? i've no idea, just doing side by sides of these seeds it looks like the new version better follows my strict detail demands in my prompts, which no other model does as well. especially autismix.
dam
I have a workfllow that does a flux gen, then image 2 image with SDXL. My usual plan is to run flux gens adjusting the prompt and other details until I get what I want, then enable the SDXL group, and adjust it until I get what I want, then enable the upscale group, etc. until I finish it. Now ComfyUI tends to try to hold things in VRAM to make it run faster and Flux-D is quite demanding of VRAM. I have a 4090 but when I do a flux gen I am pretty much capped out of VRAM whether I do 8bit or 16bit models.All of this is to say, when I start a new image in my workflow, I have everything turned off but flux. I get about 6 seconds per iteration as I am doing just flux. Then I get to an image I like, and I switch on SDXL, which takes a moment to load but then runs like normal after that (the seeds on constant on the flux group so it doesn't have to rerun anything there). Eventuallly I finish the image, save, then move onto the next one, turning off every group except flux. But now, flux gets like 16 seconds per iteration. Somehow it is trying to hold stuff in VRAM that is not being used. If it can do this and not slow down my gens that would be great, but ComfyUI does not seem to be managing this properly.
>>102489829Is there a reason why you are running the Flux gen through SDXL? Assuming you are just using Flux for the composition and then SDXL for the character features/details that Flux isn't trained for.
>>102489931I don't have style loras for the styles I like yet. They aren't on civitai and I haven't gotten the hang of training my own yet. Going to work on that tomorrow a bit.
>RuntimeError: mat1 and mat2 shapes cannot be multiplied (4032x64 and 256x768)I'm getting this error, what does it mean? Other flux models run on the same settings without problems.
>run lightning model hoping to cut down the time needed to gen this simple art style>still in order to actually get good results i need 12 steps on that model and high upscaling to fix problems with the lora>ends up taking just as much time as the standard modelfuck.
I just got Automatic1111 working but it seems to be using my CPU rather than GPU. I think I can possibly change that using the --device-id=x argument, but how can I identify the device ID of my GPU? I'm just trying numbers until my GPU fans kick on but nothing is happening.
>15 hour thread>39 imageswhere were you when ldg kill
i posted like 10 of the pics myself... few threads back was almost entirely me.. this is dead
>>102490973nothing's happening man... I'm still waiting for PuLID on ComfyUi
>>102490973its kinda surprising to me how much /lmg/ survives solely off of whining and doomposting, here it's like everyone's just content to gen stuff they like and be happy, but comes with the consequence of mostly dead generals, especially this onereally shows how shit 4chan is for actual conversation huh
Between t5_fp8 and t5_q8_0.gguf which one is better?
>>102491115fp16 > q8 >>> fp8
Any news about Flux support coming to AUTOMATIC1111? I really want to avoid doing another local install if I can.
>>102491153>AUTOMATIC1111get with the times old man, comfy or reforge, and both have had support for a while.
>>102491153what's wrong with force my dood?
>>102491167Ugh ok, I guess I’ll have to learn comfy.
>>102491199why not go for forge? it's basically A1111 that can run Flux
>>102491256forge is stricken with AIDS as of late, many breaking updates and retarded changes + memory leak problems (main dev doesn't know how to unload models properly)the replacement ive been using, reforge, is behind pretty hard on updates and looks like it might get dropped. Honestly people probably should just jump to comfy and get used to its quirks, better than the trashfire of forge.
Ever since I switched to linux on every browser I've tried to use with forge I can no longer paste images from GIMP directly into img2img or inpaint, which basically destroyed my entire forge/A111 workflow. I am all comfy now as a result because I can still paste into it. Even if I get the image in by saving it them manually loading it in forge, zooming in and panning in inpaint has visual artifacts that I didn't used to get it. Probably a skill issue on my part not to fix it somehow but why bother.
>>102491256A1111 and its forks are ironically more complex than Comfy when you try to do more than basic prompting and inpainting.
What are my options for creating realistic animations with 8 GB VRAM, ideally on SDXL?Last time I tried animatediff on SDv1.5 about half a year ago and the result was blurry, ugly and low resolution.Considering my low VRAM is there any method that doesn't require creating all the images in one go as a single collage image, but can somehow maintain cohesion through multiple smaller generations, like one per frame?
>>102491343I went through the same thing recently but found I couldn't paste into comfy either. Instead I click the upload button and paste the image path, just one extra click. On Linux Mint.
>>102491471the problem is I often don't have a path because if it's in GIMP it's usually an unsaved project I created just to make little tweaks to the picture. I was being forced to export my images every time so I gotta have the paste image.
>>102491508Ah yeah that makes sense. Wish I could track down why the paste doesn't work :(
>>102491471>>102491508linux is open source.. just fix it you retards
>>102491538>Really basic thing works in windows>linuxfags: "UHH OPEN SORES JUST FIX IT DUH"this shit's why i'm glad to hold out until LTSC runs out and then i really have no choice but to bear that autistic nonfunctioning ecosystem.
>>102491588just message Linus bro, im sure he can fix it bro
>>102491538I had a solution: switch to comfyui. it was a lot faster making a few custom nodes to make comfyui tolerable than going down that particular rabbit hole.
latin kings throw down
>>102491659Switch to a real OS like Windows
>>102491735Nah, though I have a windows box still laying around if I need it in a pinch
>>102490973Flux inference takes so long but the fidelity is so much higher than 1.5 and XL that it's rare to get something I want to post. Anon may have been right in that Flux caused more harm than good. Plus the only exciting thing as of late is video gen but the local option is so much shittier than SaaS. We need Bigma.
>>102491588This doesn’t even work, every time I search for an issue for a broken feature I find it and often there’s a merge request that claims to fix it … from 8 years ago, with some developer complaining his OS is fine and it’s your fault you don’t like the broken feature
>>102491801rare to get something I want to post if I do chose to use a previous generation model**
https://www.reddit.com/r/StableDiffusion/comments/1fm9pxa/joycaption_free_open_uncensored_vlm_alpha_one/wake up babe new joycaption just dropped
>>102491937that's a finetune of joycaption?
>>102491997No it's a whole new version, from the original creator. Reddit post has a lot of details. Also I saw this:>200k brand new captions were added with varying lengths to help it learn how to write more terselyJesus just how big is the dataset for this thing? No wonder it punches well above its weight for such a small model.
>>102491937Damn I just wasted a few hours training with captions made mostly by the old one. Thankfully that run is a bit borked anyway (2.5e-5 is way too low to teach objects apparently).
>>102491937Sick
>>102491937meh
>>102492399>>102492428the duality of anon
>>102491679kek
>>102480758Does this autistic shit come in a nice easy to install container or something yet? Flatpak? Docker?
>>102483658lora/prompt? bretty gud
Is there any way to visualize what part of the caption the model thinks corresponds to what part of the image?
Why does the Lora Loader in InspirePack allow turning on/off 58 layers for Flux where there are only 57 layers total (19 for double and 38 for single)? What's the actual number of layers?
>>102493173Closest is the windows portable comfyui installer. What, you can't manage a venv?
coziest bred too date
>>102493905who's breeding you?
>>102493940computer
>>102492399whew
>>102492302Well, it only takes 40 images for schnell to learn very well what a penis is.Where it is, it does not learn so quickly.
>>102494405
Nice penishand
>>102494414https://files.catbox.moe/d3eo68.jpeg
>>102494130>>102494452These are what AI was made for
>>102494452top kek
>>102494452>she never looks at me the way he looks at his penis arm
>>102494452Edward penishands
>>102494452i'd spend buzz for that lora as is
>>102494511Reminds me of Roger Dean's works
>>102491801>that it's rare to get something I want to post.I don't think anything I generate is interesting enough to post or has an aspect I don't like, but fuck it here's something
Any good sketchy (non-anime, non-hyper realistic, non 2.5d slop) styles for Flux? Trying to find something to generate consistent portraits for NPC DnD portraits.
>>102494452I can only imagine this is what every womans dream is
>>102494452yooooo im wet
>>102494607It’s still training. I think it’s starting to figure it out. I am watching an ai learn that a penis is not an arm in real timehttps://files.catbox.moe/r72zbm.jpeg
>Stop my lambda lab instance to test my LoRa epochs.>System file availability became unavailable. Fug... I can't play games and bake a LoRa now.
>>102495086bottom img is already pretty realistic (for me)
>>102488632autismmix is old as fuck bro
>>102495086the dickarm lora we've always wanted
>>102495050
>>102480758oh my god, all the disgusting foot fungus on the watermelon, disgusting!
can i run my flux-dev loras with flux-pro somewhere for free?
>>102496018>>102495982MALK
Kinda prefer my own lora to the Doom one uploaded
>hf uploading is downGahhhh
>>102496259i kinda want to see the penis arm lora mixed with doom now
>havingnot gonna make it
>>102496259bottoms more accurate
>>102496558Yeah it's better for non-coom
>>102490351use forge or reforge instead
>>102496302No more arms. It’s just his little disembodied pet dick now.https://files.catbox.moe/ey7xai.jpeg
>>102496619>reforgewhat is this now?
>>102496688forge = controlnet creator guy made auto1111 but actually updated (but then abandoned it for like a year and only recently un-abandoned it)reforge = some anon here made reforge but actually updated itrecently the reforge guy got contributor for forge so eventually you'll want to use thatits the same shit as auto1111 including ui but its way faster
>>102494452kek
>>102496597
>>102496619Thanks for the tip and explanation. I'm having a lot off issues being AMD and Linux. Very Hard mode and I'm a total noob.
>>102496785
>>102496937
Got flux workingI thought it couldn't do lewds.It gave me nipples first go round
>>102497162>>102494414
>>102497201https://files.catbox.moe/h8r9gw.pngYou may
>>102493287there was a heatmap extension for a1111 so it exists
>>102497235do those look like nipples to you
>>102497306its funny that we can still get something close-ish to them even with BFL's bullshit
>>102492428Yeah, I'm not impressed.
>>102494452>>102494607It didn’t work outhttps://civitai.com/models/784981?modelVersionId=877829
>>102497235>>102497162its mindboggling that the AI doesnt know what a nipple is.what a cursed world we live in lmao
>>102498084Ikr, I fucking hate this era, everyone is acting like a fucking cuck
>>102498128Do the anatomy loras fix it or are their galleries just false advertising?
>>102498430I think that's just cope, you can't fix anatomy with a fucking lora that has 100 pictures used for training, this flux model has probably seen millions of pictures, only a serious finetune could change anything substential
finally took the q4pill
>>102498455>Everything you want to see >Everything you want to hearMost impressive Asuka lora
has chang abandoned us?
>>102498933>https://litter.catbox.moe/ohfkww.png>https://litter.catbox.moe/qqfcu3.safetensorstake it for a spin
>>102499086chang was the friends we made along the way
anyone uses cogstudio?I installed both manual and auto but it didn't work. I think it uses libs on system environment variables instead of its venv folder. How do you change it?
>>102499168
>>102499699well that is unique, very modern art
>>102499722joycaption of a cooler img https://www.instagram.com/p/C90HORFPPGM/
>>102499168looks like I got here an hour too late and it's already expired. any chance of a re-up? thanks in advance anon
>>102499799https://litter.catbox.moe/lprvqw.safetensors
>>102499799>https://litter.catbox.moe/g8hn61.png>https://litter.catbox.moe/obczv5.safetensorsi meant to click 24 hour, oh well, too drunk
>>102498759What do you like more about it?
>>102499868>>102499862thank you anon(s)
celebrity loras being removed from civitai?
>>102500257Surprised it wasn't happening earlier
alright fellow weebs.how do you guys do your fucking hands? negative embeds, loras, your checkpoint, photoshop?This looks fine but I know the AI can do better.
>>102500257Yeah they’re purged and when you upload a new one it makes you check a box saying it isn’t a celebrity, and tells you those aren’t allowed on the site.It was clear they were shitheads from the beginning but still, ugh/
>>102500648Odd
Any updates on Flux for lower vsm cards? Ive been using the nf4 model and was wondering if Theres anything better than that besides the gguf quants because they dont work on my card for some reason
>>102500663It’s the AI grifter way. Gain a userbase allowing adult content and legally grey stuff, monetize and rip it all away. It’s an intentional strategy. They have no spine or principles.
>>102499722meant to link https://www.instagram.com/p/DALs9EKou9w
>>102500701Try updating your pytorch version.
what's the easiest way to extract a lora from an sdxl checkpoint? I can't find any comfy nodes that can do this
>>102501135For what reason
>>102500701The update is we kill ourselves
give me negative prompt or give me death
>>102501366Are you ok
>>102501358>obvious shill is obviouswell memed nonetheless
>>102501506Go outside dude. Some people just enjoy a nice monster. Not everything is nefarious.
>>102501259Can you use the 8-bit version of the T5 encoder? If not, then you need to update your Pytorch.
>>102501587draußen lauert der virus, commie
>>102501587Damn all of a sudden I have a hankering for Monster Energy. I don't even drink that stuff...
>>102501358>>102501587Yeah, it's totally organic for someone to create a lora for an energy drink.
>>102501681the cans look cool so no it's not that weird
>>102501681>he doesn't know how how to use flux it's not a lora kek
>>102501676i wuz drinking it every day and then i started having apnea and pain in my chest. I'm not drinking this shit again
>>102501700yeah be careful with that shit.t. have had 4 kidney stones
>>102501698you're fooling anyone, shill.
>>102501681Its not a Lora, keep believing what you are believing I dont really care I'm having fun making cool gens
Do we have a local option for 3D AI model generation yet?
>>102501681>its not normal for people to put iconic things in their images
monster energy marketing team banging their heads on the wall in frustration after failing to shill to all 2 /ldg/ anon (singular)
>>102493677There's another layer or block that is like a "everything else" layer. When I try to run a lora without it enabled the lora won't work, so I'm guessing it's like the essential bits or something idfk what I'm talking about desu that's just the best I could figure out
>>102501776I've never seen anyone creating images featuring other consumer products. It's always that same monster energy drink.
>>102501872That's because there's like 5 people poster here and one of those five people likes genning monster energy drinks you retard, it's not rocket science
I'm trying to give her a coat over her Kimono like Shiki from Kara no Kyoukai. I was going to do a leather jacket, but those tend to be a bit much for this kind of character.Any tips?
>>102501890>posterposting. fuck you autocorrect, I'm going to bed where you can't hurt me any longer
>>102501898fuck I'm not used to the catbox extension yet sorry.Here's the tags I'm currently using.
>shit posting on a phone even
>>102501898>take images of other character that has kimono as you want it >put them into joy caption >use the resulting outputs to figure out how to describe what you want otherwise i2i and draw it on roughly in mspaint or whatever
>>102501872I dotn really care
>>102501913I read 4chan on my phone to go to sleep as my last masochistic action of the day, fuck off faggot
>>102501820>>102501890>2 /ldg/ anon>5 people posterCheck the clock and day of the week and the fact that flux came out years ago
>>102501890I'm talking about the entire 4chan site. Someone or a group of anons are creating meme images featuring monster energy drinks. I can't imagine a sane person doing this for years unless it's for marketing.
>>102501954link to a post on another board
>>102501954its just popular dude, i read some article that said monster energy is basically the most successful brand by growth rate ever
>>102501954I think you need to take your meds.>>102501924>joy captionI'll give it a shot, thank you.
>>102501954Nobody cares about what you can imagine. I can imagine leprechauns, dosjt make them real. You care too much about this.
>>102501954what, that is absurd
>>102501981>You care too much about this.I've only made a few posts.>>102501978Follow your own advice.
>>102502114repost + fake meme news
>>102502087>I've only made a few posts.Like I said, you care too much. Its just a gen. Calm down.
>>102501954It bothers me on a spiritual level that I sincerely think you're like this and not trolling...
>>102502129You could've just ignored me but you went berserk over a comment. You should go out and take a walk.>>102502131I find it amusing that there exists people that don't believe online marketing exists.
>>102502162>still posting about a drink
>>102502170>Get going apeshit
>>102502175>>102502170
making a couple salty posts over the course of an hour is not "going apeshit" on 4chan
>>102502184>Still mad
I was literally just making cool gens what happened
>>102502157Lovely
>>102502201>projecting
i wouldn't mind /ldg/ getting sponsored by pixart
>third world marketing team coming out of the wood work
>>102502259>hes still on this
>>102502277>still mad
>>102502281>>102502220
>>102502289Yep, still mad.
>>102502299>>102502220
how much they pay thodesu
>>102502305Calm down, man.
>>102502214thanks
>>102503568make her scottish
https://fancyfeast-joy-caption-alpha-one.hf.space/?__theme=system
what happens if you use this lora with negative weighthttps://civitai.com/models/735558
>>102503691idk y dont u try it
>>102503583No
>>102504020Yes
The next bread is here...>>102504144>>102504144>>102504144
>>102498629I have a theory that you could jailbreak it with a few hundred or a few thousand if you do certain things, but it would require you to write sexual words in l337sp34k for every prompt because bfs has essentially poisoned their dataset for all sexual words.