Discussion of free and open source text-to-image modelsPrevious /ldg/ bread : >>102242966>Beginner UIEasyDiffusion: https://easydiffusion.github.ioFooocus: https://github.com/lllyasviel/fooocusMetastable: https://metastable.studio>Advanced UIAutomatic1111: https://github.com/automatic1111/stable-diffusion-webuiComfyUI: https://github.com/comfyanonymous/ComfyUIForge: https://github.com/lllyasviel/stable-diffusion-webui-forgeInvokeAI: https://github.com/invoke-ai/InvokeAISD.Next: https://github.com/vladmandic/automaticSwarmUI: https://github.com/mcmonkeyprojects/SwarmUI >Use a VAE if your images look washed outhttps://rentry.org/sdvae>Model Rankinghttps://imgsys.org/rankings>Models, LoRAs & traininghttps://civitai.comhttps://huggingface.cohttps://aitracker.arthttps://github.com/Nerogar/OneTrainerhttps://github.com/derrian-distro/LoRA_Easy_Training_Scripts>Fluxhttps://huggingface.co/spaces/black-forest-labs/FLUX.1-schnellhttps://comfyanonymous.github.io/ComfyUI_examples/flux>Pixart Sigma & Hunyuan DIThttps://huggingface.co/spaces/PixArt-alpha/PixArt-Sigmahttps://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiThttps://huggingface.co/comfyanonymous/hunyuan_dit_comfyuiNodes: https://github.com/city96/ComfyUI_ExtraModels>Index of guides and other toolshttps://rentry.org/sdg-linkhttps://rentry.org/rentrysd>Try online without registrationtxt2img: https://www.mage.spaceimg2img: https://huggingface.co/spaces/huggingface/diffuse-the-restsd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium>Maintain thread qualityhttps://rentry.org/debo>Related boards>>>/h/hdg>>>/e/edg>>>/c/kdg>>>/d/ddg>>>/b/degen>>>/vt/vtai>>>/aco/sdg>>>/u/udg>>>/tg/slop>>>/trash/sdg>>>/pol/uncensored+ai
poop
Blessed thread of frenship
>>102247060ty baker
>>102247120kick this kitten for $10000000000000000
>>102247141noooooo!
>>102246847how did you prompt this? im having issues
>>102247084Want to see poop gens?
>>102247161canel
>>102247179yeah it's a shame, but it was better than the others.
What's the Lora.FA training mode for? The GUIs describe it as a way to reduce memory usage by freezing one of two matrices in the model. But no matter how much I try it, memory remains unaffected, but the character reproducibility noticeably suffers and raising the weight burns the images much quicker than regular loras. Is it really just harmful with no upsides?
>>102247178no
gentlemen
>>102247203cool o.O
>>102247195Some settings are always useless or detrimental to the results.
breasts?
>>102247218ty
I know this isn't the thread but I trust you chaps know your onionswhat's the catch with hailuoai
>>102247270reminds me of le petit prince
>>102247277why would there be a catch, it's another AI company burning through money
>>102247277there's no catch, it's a great video model and you don't need to make an account to generate videos, that's really cool
>>102247290its Tapestry of Bayeux lora mixed with the Disgaea lora
>>102247314yeah but really thois it gonna ask me for a phone number, or to sign up to somethingthere's no such thing as a free meal, or compute
>>102247327no I'm serious, you just go to the site, you type a prompt, you enter generate and that's it, no account shit, no phone shit, nothing
>>102247327>>you don't need to make an accountcan you read?
>>102247327it only asks for a phone number if you load the mobile version, your mobile browser can request the desktop version
>>102243364I'm sure it would nail the chin on the left.
>>102247327In the long term, no. But I had plenty of fun training for free during Leonardo's and Scenario's beta phases.So make a use of it while it lasts.
>>102247327why so jaded and cynical? some people are just nice and let you have massive compute power for free, no strings attached
>>102247406This is impressive. LoRA? Finetune? PROMPT??
i'm pretty happy with my workflow, but any tips on improving it?i'm a VRAMlet
>>102247341
>>102247420go for automaticCFG instead of Dynamic Thresholding, it burns the image less + gives you better prompt understandinghttps://reddit.com/r/StableDiffusion/comments/1eza71h/four_methods_to_run_flux_at_cfg_1/
>>102247417https://civitai.com/models/721039/retro-anime-flux-style
>>102247420if you go for cfg > 1, use adaptive guider, that will make your gens fasterhttps://reddit.com/r/StableDiffusion/comments/1enxcek/improve_the_inference_speed_by_25_at_cfg_1_for/
>>102247439>There are still 91 people ahead, so the wait is expected to be 5 minutes.
>>102247450>18k stepsI experienced this with my own attempts at loras, but it seems Flux requires higher amounts of steps than other models?
>>102247465>>102247447thanks!
>>102247450>1gb lorafucking based
>>102247513
>>102247417prompt was thishttps://fluxpro.art/prompts/cm0p10b5g0co111m70q6rk0pw
>>102247277>what's the catch with hailuoaiAbsolutely nothing, it's a great model, now I hope that BFL will release something at that level
>>102247612so I can just gen shit all day every day for free, maybe even write a script to gen random shit 24/7I wouldn't, but some cunt would, and this is why we can't have nice things
>>102247685it sure won't last anon, they are doing this free shit as a giant advertisement, and once people are convinced by the quality, they'll have enough fame to make it pay for it, and that's completely fair lol, my advise would be to have some fun with it before it ends, you won't get this chance a second time
asdf
>>102247835that's impressive how realistic it is, the chinks really stepped up their game there
>>102247706You can do it locallyhttps://huggingface.co/THUDM/CogVideoX-5b
>>102247903:^)
>>102247900I can't webm, the mp4 is cleaner
>>102247921you need to change your settings to get higher quality
https://github.com/PowerHouseMan/ComfyUI-AdvancedLivePortrait
>>102247612kek, we definitely need a video AI thread now
>>102248147Damn, OpenAI really shoot themselves in the foot by not making Sora public sooner, now no one will care if they do it at the end, it's too late
>>102247205my sweet
https://huggingface.co/datasets/bigdata-pw/TheSimpsonsframes from every episode and 3 florence-2-large captions, caption, detailed_caption and more_detailed_captionunsurprisingly the captions are all shit
>A couple of people that are standing next to each other.>The image shows Homer Simpson and Marge Simpson from The Simpsons wearing Santa Claus outfits, standing in front of a backdrop of light poles and a starry night sky.>The image is a still from the animated TV show, The Simpsons. It shows two characters, Homer Simpson and Marge Simpson, standing side by side in front of a blue background with white stars. Homer is wearing a red and white striped suit with a black belt and a black hat with a white pom-pom on top. Marge is also wearing a blue hoodie and a red Santa hat with white fur trim. They are both looking at Homer with a surprised expression on their faces.>Homer Simpson and Marge Simpson
>>102248293>florence-2-largeyou used the base version, not finetune?
>>102248346you haven't tried the finetune versions, have you?
>>102248375oh are they worse?
>>102248331
>>102248380>oh are they worse?they aren't good that's for sure >>102245900
>>102248331flux does img2img, and it's pretty good
>>102248380yes>This is an animated image. In this image we can see two persons standing. In the background there are street lights and sky.
>>102248417meant for >0000000000
>>102248418Yeah my bad it indeed is shit. Didn't even realize
>>102247406berserk, saint seiya, masterpiece
well, I finally did it. I quit testing, nitpicking, rebaking, procrastinating and posted my first flux lora to shitvitai feels kinda good desu
>>102248858link?
>have sudden urge to look up a massive slut I knew from high school>she took my virginity in 9th grade>tfw she's been through 3 divorces by the age of 28, has 4 kids, and lives in some hick town in Missouri nowheh, guess life isn't so bad after all.
>>102248944Can it do anything besides dancing
>>102248959cool story didnt happen tho your life still sucks buddy
>>102248962check the pol thread or the hailuoai homepage
>>102248993Don't care if you believe me, it has not impact on anything. It's just funny to how how she ended up.
>>102249045and that made you come to /ldg/ to tell us all?
>>102248887err.. please ignore my general faggotry..https://civitai.com/models/724454
>>102248995it can do a lot of funny shit, you have to watch them on /pol/ though >>>/pol/480733043
>>102249052I was posting images here throughout the day, and posted it along with an image because i thought it was funny. Seems to have made you really butthurt for some reason though.
>>102249045its obvious you're new to larping
>>102249089nobody asked bro
>>102249089yea but why would we care if a fake slut that you pretend took your virginity is divorced and has kids
>>102249092>>102249096>>102249098Oh, I get it... my mistake /ldg/ was definitely the wrong place to post this. It's full of angsty virgins.
This is insane, Holywood is dead
>>102249118stop projecting your sadness of still being a virgin onto is my guy
>>102249121it's still gacha, Hollywood dies when you can do img2img keyframes, however.
>>102249132It's funny because that wasn't even the main point of the story, seems to be what ticked you all off lmao
>>102249144RAWWWWRRRRR..... make me angry again. I . Dare. You.
>>102249121lmao, this shit is good
People are still using the base model, right? I'm waiting for checkpoints.>see checkpoint on civitai>look inside>it's just 33 loras merged with the base modelgod dammit
>>102249177>People are still using the base model, right? I'm waiting for checkpoints.no one made a real finetune of flux yet, it's asking for more than 24 gb of vram
What to use from here?https://huggingface.co/zer0int/CLIP-GmP-ViT-L-14/tree/main
>>102249189Gotcha. And I'm checking the right place, right? The entire civit page was just garbage.
>>102249189Impossible really to finetune it because none of the trainers even properly support it. Let's say you were willing to rent the servers required to train Flux, no trainer supports it out of the box.
>>102249206there's only 2 places to look at and it's civitai and huggingface yeah
>>102249059How did you upload so many images one by one to that lora?
this shit is insane
>>102249201that one is the besthttps://huggingface.co/zer0int/CLIP-GmP-ViT-L-14/blob/main/ViT-L-14-BEST-smooth-GmP-TE-only-HF-format.safetensors
which pytorch version is everybody using on comfy? I just realized im still using 2.3.0, also read that 2.4.0 sucks ass, should i upgrade to 2.5.0, im on winblows btw
>>102249248Why TE only? And what's HF
>>102249253im using 2.1.0
>>102249259>Why TE only?because we're only using the text encoder (TE), not the text decoder >And what's HFdunno what that means either
>>102249238painfully
>>102249253>which pytorch version is everybody using on comfy?2.3.1>>102249277>im using 2.1.0wtf that's an old one, why?
>>102249297no clue lmao i just never update it ima update to 2.4
>>102249253nice style, are you using a lora for that one?
>>102249238>>102249284jokes aside, I just copy+pasted the upload URL and then did them one by one. you could probably get chatgpt to write you a quick script where it automates this process pretty easily using selenium webdriver or something desu
>>102249284Why even do that lmao, press create - post images and batch upload>>102249280Isn't flux using both?
>>1022493032.4 sucks, it gives fucked up pictures, 2.5.0 seems to have fixed it though, and it's faster too
>>102249321how do I get 2.5.0 all it seems i can get is 2.4.1
>>102249319>Isn't flux using both?no diffusion models use the text decoder, it's for LLMs only
>>102249319>Why even do that lmao, press create - post images and batch uploadI didn't see a batch upload option lmao fuck me. I thought it was either post one image, or post 20 at once that go into some kind of mini album which I don't really like
>>102249280>not the text decoderthere is no text decoder in CLIP, but there is an image decoder which we don't use thus the file is Text Encoder only>dunno what that means eitherHuggingFace format, the tensors inside the file have a different layout and name. although I don't think it's an actual standard HF has created
>>1022493352.5.0 is the nightly version, you can do this:1) Go on the ComfyUI_windows_portable\update folder2) use this cmd command:..\python_embeded\python.exe -s -m pip install --upgrade --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/cu121
>>102249346*an image encoderI meant to write.
>>102249336I saw most people load this one ViT-L-14-BEST-smooth-GmP-ft.safetensors
>>102249335do this but change>https://download.pytorch.org/whl/nightly/cu121to>https://download.pytorch.org/whl/nightly/cu124torch 2.5.0 works better with cuda 12.4
>>102249374that's the same thing, it's the bigger version that has some shit Flux will never use
>>102249386>torch 2.5.0 works better with cuda 12.4oh yeah? like better quality images?
>>102249407Oh, is it already time for your shift, Sergeant Johnson?
>>102249400nta and I haven't tested 1:1 myself, but someone posted some comparisons the other day and 2.5.0 with 12.4 seemed to be better quality. it was only like 3 different prompts iirc, but the difference was pretty noticeable to me. no idea if it changes performance speed at all between the 121 vs 124
>>102249307yeah, i used <lora:anime_lora_comfy_converted:1> for that gen, I totally forgot about that lora lol>>102249297thanks, can't update for now because my ISP is down and im connecting thru my cellphone with shitty speeds T_T
>>102249045holy esl SAAR DO NOT REDEEM
>>102249400yes, better colors, and slight details are better, pic related, looks for details like that she is smiling, the pearl necklace and the TV details
>fast delete of the glowiebased jannies
>>102249500oh cool, do you have other image comparisons like that? I wanna know if it's still a downgrade compared to torch 2.3.1 + cu121 or not
>>102249459><lora:anime_lora_comfy_converted:1>what's that? can you provide a civitai link? I really dig that style
>>102249526not with 2.3.1 .. can't switch versions now and test, training a lora sorry .. but I am overall happy with 2.5.0+cu124 .. anatomy is correct and pretty much behaves like 2.3.1
>>102249606i'm on 2.5.0+cu124 as well and havent had any issues (3090ti)
>>102249687leave some women for the rest of us
>>102249390So I did some testing>So in order of best to worst of your clips, it's:ViT-L-14-TEXT-detail-improved-hiT-GmP-TE-only-HF.safetensorsViT-L-14-BEST-smooth-GmP-TE-only-HF-format.safetensorsViT-L-14-GmP-ft-TE-only-HF-format.safetensorsRight?I was using 3 anyway and found it a slight improvement over base CLIP, and in certain usecases a big improvement, so I'm keen to get the time to test 1 and 2>That's how I would rate it, yes. 1. and 2. are about on par with regard to benchmarks (accuracy on zeroshot, for example). 1. is objectively better at text, over all. The rest is a bit of a subjective thing, but - yes, this would be my ranking. Albeit 2 can sometimes generate superior detail (non-text detail). It really depends on what you're prompting.I prefer the smooth one, it adds more small details to the image, TEXT makes it more slop
Is it just me, or are all the workflows on the sharing sites just dogshit?I tried using ComfyUI launcher, and importing workflows through that, and even they either fail to run.
>>102249716I'm not a big fan of "detail improved", yeah it improves the text, but for the rest it just makes it worse than smooth
>>102249721They work fine, you need to go to manager and install missing custom nodes
>>102249253That is really nice lora aside do you have a catbox?I promise to keep it sfw this time so the jannies don't get into a tizzy.
I won't care about their text to video until they support image to video.
>>102249828see vidu https://www.vidu.studio/
>>102249828>>102249836or simply Luma
>>102249836But why are people posting videos from hailuo instead of vidu.studio? Are they any good?
>>102249871text to video is so much better
>>102249847I had to stop watching Luma videos because of the morphing, I started having dreams where I was watching videos that morphed like that and realized I must stop until there are good video generators out there.
>>102249871hailuo is probably the best one so far, maybe a bit behind Sora but Sora is DOA so...>>102247314>>102247612>>102248147>>102248408>>102249121
>>102249538https://huggingface.co/XLabs-AI/flux-lora-collection/tree/main
>>102249747Yah, this is /g/ I'm not foolish enough to have done that already but fair. I keep getting weird issues like random seeds not working in Ksampler, even though I've obviously clicked the random button.Importing Via image alone also yields strange results like 'NoneType' object has no attribute 'lower' efficency node
Wait a minute, Germany beat the rest of the world in AI image generation? How did that happen?
>>102250036I don't want to simplify it too much, but it's really not that hard to make an image model. What's shocking is there's not more millionaire bankrolling models. You can legit make a decent 2B model for like $50k right now.
>>102250036a German invented latent diffusion
>>102250005Post the workflow
>>102250036German science is the best in the world.
>>102250036Not just AI image gen, this is not surprising desu.
>>102250036>>102250063Isn't twitter using flux? Maybe Elon funded it
what does this do? can i preview it?
>>102250082Nope, Elon just saw how good it was and how he couldn't make porn or nudity with it so it was safe to allow people to use it on twitter, and it was absorbed by Grok.It was never disclosed how this happened but he probably made a deal with BFL that was profitable for both, this was the stolen we dream of Stability AI, "X's image generator is powered by Stable Diffusion" would have been incredible for them.Fuck, Elon bought twitter, he could probably buy BFL too.
>>102250168Too bad SAI is now being used to AWS
Just found out that the bogdanoff LoRA I made is pretty good at bogging known characters,
>>102247205yeah I'm gonna need uhhhhhhhhhh prompt and loras if any please
I think the funny part of all this is the hentai spammer is going to force AI censorship but not in the way he thinks. The day of the AI janny comes ever closer.
updated forge and it apparently no longer supports SDV so I'm looking for a new UI. probably also going to learn flux. is comfy it or is there something else going on these days?
>>102250375Flux on comfy
You ever seen a muppet get bogged?
>>102250405>>102250428KEK
>>102248995No
https://gofile.io/d/6eKSIoHere is the bogged LoRA. activation phrase is "igor bogdanoff"
>>102250537based
>>102250396*thanks anon
>>102250624What lora is this?
>>102250747The speech bubble over the top is such a jarring break from the overall aesthetic that it borders on parody and loops back around to being funny.
>>102250649that's a raw bing output, sorry anon
has comfyui become bloatware? I remember you opened up a new window, it instantly loaded, now it takes time to load up a new canvas
>>102250851Like half of the custom nodes in comfy are spyware.
I will NOT post in the thread
>>102250855How?
>>102250855that's a pretty big accusation to swing around anon
>>102250893>>102250878>Hey guys just download this node to use an LLM in comfy!>Just plug it in there and don't even look at what it's doing
>>102250893not much else to do when other UIs fail to support the latest developments.
After training a few LoRAs using grids as training images, I'm convinced it's actually a pretty good method. It functions similarly to batching, but actually runs faster and lets you train in 1024x1024
>>102250909kek that was deb* fault, he listed that shit in his news for a whole month, that idiot never tested anything
>>102250920>>102251057YAWN1 GIRLYAWN
>>102250933>lets you train in 1024x1024How would the VRAM requirement lower when you use grid images?
>>102251057it was much better when you were generating stuff like this >>102247205you should at least go back to posting that if you're refusing to share prompt/lora for it
>>102251096It doesn't lower the requirements, but 1024 at a batch size of 1 gives you arguably better and faster results than a batch size of 4 at 512
https://imgsli.com/Mjk0NTI1Any guess for what LoRA I'm trying to train now?
>>102251115That's not me, genius. He even has an auto/forge filename.
>>102251150nta but your shit looks like 1.5 slopyou could generate exactly the same stuff using way less compute and time
What graphics card is good for local image gen that doesn't cost an arm and a leg
>>102251163probably a good roof desu
>>102251180incredible insight, nogen
>>102251297a 4060 is practically mandatory for local gens at this pointyou can get by with a 3060 for now but it won't stay that way for much longer
>>102251362NTA but >>102251180 is right. I feel uninspired just looking at your gens.What are you even bringing to the table? I feel less confident in flux just by looking at your work.
>>102251381NTA but nogen opinions are worth less than debo replies
>>102251120When you was training at 512, did you use bucketing or manually resized your images to 512x512 squares? I wonder if the results are better because of the square training data instead of buckets that keep the original aspect ratio as long as the amount of pixels is less or equal to 262144 (512*512).
>>102251390
>>102251163Needs some badgers.
>>102251413I rely on bucketing when doing 512, but slapping the images into a roughly grid shaped collages at 1024 is my go to method these days.
>>102251416nta but you radiate schizo-anon energy, >>102251362 has some nice upscaling quality, catbox?
>>102251462I am that anon and I think I'm justified in in criticizing boring 1 girl posts. The LoRA clearly is overtrained too looking at her nonsensical clothing.
>nogens complaining about 1girl boobapathetic
>>102251485you just sound salty that you can't upscale that well imo
>>102251495But where's the SNAAAAAAAAAAAAAKE?
>>102251508
>>102251533Perfect!
>>102251571thanks
https://civitai.com/models/714022/neonfantasyflux-style-lora?modelVersionId=798521
https://civitai.com/models/715731/the-sims-1-style-f1d
>>102251807>.webmNope
>nogen Excuse me that's text-fag to you, anon
What was that site to use for image to text prompt?
>>102247060>Disinfo-Copium-Machine go BRRRRRRRRWhat a silly thing to lie about lol https://x.com/halphelt/status/1831316915551137918?t=Q7enYETzZ5jvJzeufY6TIA&s=19
>>102252194This one?
>>102249117ooh
>>102252261https://huggingface.co/spaces/fancyfeast/joy-caption-pre-alphaForgot the link lol
>>102250073see attachedhttps://files.catbox.moe/21emc2.json
>>102251839Nice
>>102252321forgot to say what the problem with this one is, the seed doesn't' randomize, and the Apply RUEnet is busted.
>>102252260That user is lying to discourage other artists from adopting AI tools, but he's actually using it himself.
>>102252446not bad not bad at all
vaguely inspired by kafka
>>102252490>>102252446Why your images look like the chanel ones? Are you using the same lora?
>>102252503just prompt, but there's nothing in common with the chanel prompts.
hailuoai does not understand how typewriters work
>>102252588LOCAL diffusion general
>>102252449>but he's actually using it himself.Citation needed
>>102252588it doesn't understand anything though
I'm convinced that Dynamic Thresholding, AutomaticCFG, Skimmed CFG, and Adaptative Guidance are all a scam.
>>102252652This
once more with feeling
>>102252652Dynamic Thresholding isn't a scam, you just gotta read how to use itAutomaticCFG and SkimmedCFG however are made by the same autistic dev who doesn't even detail anything how his stuff works, he just puts a vague description and some example without workflow or details, its just says "recommended, just trust me bro"
fine I'll be the one to make the 300th post then
>>102252652How rude, they're just as effective as dev+schnell merges.
>>102253182>There's a full time Janny dedicated to watching this thread.The absolute state of this place.
Let's get some fresh bread up in here...>>102253191>>102253191>>102253191
>>102250036undertraining
agreed
well of course, thanks
really
>>102253206nice img
>>102253130thank you for your service
hit it
>>102253014>AutomaticCFG and SkimmedCFG however are made by the same autistic dev who doesn't even detail anything how his stuff workswho gives a fuck? at the end his anti burner works better than dynamic thresholdinghttps://reddit.com/r/StableDiffusion/comments/1eza71h/four_methods_to_run_flux_at_cfg_1/