General dedicated to free and open source text-to-image models.

Previous /ldg/ bread: >>101118874

Arguable Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio
EasyDiffusion: https://easydiffusion.github.io

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
StableSwarmUI: https://github.com/Stability-AI/StableSwarmUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
ComfyUI: https://github.com/comfyanonymous/ComfyUI

>Auto1111 forks
SD.Next: https://github.com/vladmandic/automatic
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
Anapnoe UX: https://github.com/anapnoe/stable-diffusion-webui-ux

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
Comfy Nodes: https://github.com/city96/ComfyUI_ExtraModels
*SD.Next also works with PixArt-Sigma

>Animation
https://rentry.org/AnimAnon
https://rentry.org/AnimAnon-AnimDiff
https://rentry.org/AnimAnon-Deforum

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Share image prompt info
https://rentry.org/hdgcb
https://catbox.moe

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>they were so scared that people could directly compare XL outputs to the chink models that they made up an excuse to ban it
>>101128654And they almost got away with it. Good job scoob
it was just an attempt of the sdg schizo to move people back into sdg, he has done this before in the last split. spammed the thread with bs. then samefagged bs about an XL ban and made a OP banning XL/SD3 during the dead hours where few would contest it.
Blessed thread of frenship
where's the first thread? was having a convo with an anon there, any link?
It's all so tiresome, but if it helps even one or two anons become artists in this new and fascinating medium, it will all have been worth the bit of trouble.
>>101129084
May your gens remain bountiful.
>>101129186sure it's right here https://desuarchive.org/g/thread/101128489/
>>101129224Why did the thread get jannied? I see no suspect material
>>101129284Spamming/Flooding
Good morning
collageanon failed us.
terrible choices today
>>101129630You can always make your own collage. No need for this
>>101129630jannied thread had a sick collagethis is pretty weak, discord-tier stuff
>>101129689It was old collage someone else made. 0/10 for the effort.
>>101129714
>thats a bad collage
>use our discord collage
>thats a bad thread
>use our discord thread
>thats bad doxx
>use our discord doxx
Another day, another failure. Let's see
>>101129834
I don't get it but it's ok
>>101129907No other way around it, than through trial and error. Best of luck anon. Just remember to switch up your methodology. Can hardly expect different results from the same approach.
>>101129689any idea whyd that thread get deleted?
>>101129996Mass reports from discord
>>101128616Nice collage TY baker!
>>101129969
>Just remember to switch up your methodology
I've been running prodigy for ages. Should probably switch it for a while
>>101128616Great collage! Loving it
(Posted in old thread on accident, oops.)Just an FYI, the fag that's crying over adetailer usage/bad gens and saying people need to be shipped back to /sdg/ is actually a furfag who has been harassing the /aco/ threads as well, and admitted to being a rapefugee from the /trash/ threads. You can see him cry over it here ->>>/aco/8319401>>>/aco/8331732
>>101130056
Prodigy sucks unless you liked it fried. For all around CAME is best, for "can't fuck it up" use AdamW. AdamW is even better when using it 8 bit because you get efficiency.
>>101130056Not necessarily, but you might know better. Even changes in your dataset or the way it's captioned could be enough.
>>101129204>It's all so tiresometrue but they will never stop us
>>101130112
>Prodigy sucks unless you liked it fried
Lower d_coef setting and scale weight norms should prevent it
>AdamW is even better when using it 8 bit because you get efficiency
With what settings, rates etc?
>>101130117
>Even changes in your dataset or the way it's captioned could be enough
Indeed. Some of the best loras I've made were pretty much manually captioned. It just becomes too much work with 500+ images
>>101128616horrible collage
>>101130112
>Prodigy sucks unless you liked it fried.
Works for me on SD/SDXL with a slow learning rate at the start and the rate adjustments like d_coef.
But not well on Onetrainer/Sigma, something is IMO going wrong there. It also unexpectedly fails with other optimizers tho.
trolls really have nothing better to do, huh
>>101130567>a slow learning rate at the startdo you mean warmup or what?
>>101130776Mostly just a low d0 like 1e-7. I think I prefer it to warmup.
>>101130836I had the assumption that it was just 1 automatically with prodigy
>>101128616>le collage
>>101129290moody
>>101130853
I think LR is 1 (adjusted by prodigy); d_coef, d0 and the other stuff are configurable.
>>101131235
>d0
Asking for a friend what does it mean?
if a checkpoint creator suggests i use dpmpp_sde for the sampler, what scheduler is he implying i should use? on auto1111 i didnt have this option
>>101131467SDE Karras?
>>101132104do you inpaint to get two different people or do you have some neat workflow?
>>101131405its the initial d at d=0
>>101132400Sorry, I meant epoch 0. d gets changed by prodigy according to d_coef
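For anons following the d0 / d_coef talk, here is a minimal sketch of how those knobs could be passed to a kohya-style trainer. It only builds the argument list and does not run any training. Assumptions: the flag names `--optimizer_type`, `--optimizer_args` and `--learning_rate` are taken from kohya's sd-scripts as I remember them, so check your trainer's own help output; `prodigy_cli_args` is a made-up helper name; d0=1e-7 and LR=1 come from the posts above, while d_coef=0.5 is only a placeholder for "lower than default", not a recommendation.

```python
def prodigy_cli_args(d0=1e-7, d_coef=0.5, learning_rate=1.0):
    """Build a kohya-style argument list for the Prodigy optimizer.

    d0 is the initial estimate of d, which Prodigy then adapts during training.
    d_coef scales that estimate; lowering it makes the effective rate gentler.
    The default values here are illustrative assumptions.
    """
    return [
        "--optimizer_type", "Prodigy",
        # Prodigy expects LR around 1 and does the adapting itself.
        "--learning_rate", str(learning_rate),
        # Extra optimizer settings are passed as key=value strings.
        "--optimizer_args", f"d0={d0}", f"d_coef={d_coef}",
    ]


if __name__ == "__main__":
    print(" ".join(prodigy_cli_args()))
```

The point of keeping d0 tiny is the same as the "slow learning rate at the start" mentioned earlier: the optimizer begins with very small steps and grows d from there, instead of relying on a separate warmup schedule.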
In this case with well known people i started with a simple prompt and spun the RNG a while.Then i started to dig in with pic related.If you have any tips&tricks im all ears, because i cant get a single gen without inpainting.
>>101132400
lul
>>101132419
TY. Gotta switch to Onetrainer for now. CAME and Huber loss look pretty interesting
ong bak gives some nice scenes
>>101133027cool, I think the main guy almost got killed in this or some other movie
Can someone do a gen of Yup or Ran sucking on that debodick
>>101133095that could be, some stunts are quite nice
>>101133143Sure
>>101132697
Huber loss is very helpful as far as I can tell, particularly the SNR variant in kohya_ss: it rarely seems to hurt and sometimes seems to help massively. CAME also seems good, but likewise I only tried it in the Sigma training scripts.
maybe its more mortal kombat than ong bak
>python.exe installer.py local' returned non-zero exit status 1
just fuck my shit up
>>101133174make sure they all look underage
>>101134156how about Xena
>>101134370oh wow, this one turned out much more interesting, and all it took was a vertical swap
>>101134485
it's fun to put images like these through interrogator
>a painting of a tree in the middle of a lake, concept art, inspired by Cyril Rolando, fantasy art, ferrofluid oceans, floating kelp, 4 k detail fantasy, peter mohrbacher and dan mumford, flowing tendrils, cute detailed artwork
https://suno.com/song/382d4dfe-e694-466e-9463-6c77104d019b
>>101134563
>peter mohrbacher
oh I like this one
Wonder how the new microsoft? interrogation model would do. What was it called again?
>>101134608I think one anon who is finetuning sigma is using it, it looked really decent from his screenshot
>>101134645Ah! Florence 2, wasn't it.
>>101134746yeh
>>101133143sorry that gay stuff is reserved for the other thread
Any illustrators that img2img their stuff here?
>>101135412probably not, and they may not say so even if
>>101135412I know some who use this for tracing and making quick drafts
>>101135412controlnet is much more useful
Does anyone know what models nemusona was using? I know it was anythingv4.5, but I tried a few checkpoints called anything4.5 (and v3 too), but they look very different. Or does anyone have suggestions on how to make something that looks similar
>>101135496yes
>>101135561Depending on the model i find that img2img helps with exploration, and rigging the control net skeleton was such a pain in the ass for me, to each their own tho. Pretty sick how i can just download a Rei lora for this sketch
>>101135847Bada bing bada boom, stellar reference for the body no lora required. Shit is freelo
>>101135847i produce internal concept art for some mundane applications. usually i produce some starting images either traditionally or gen and then produce a set of variants with a weakened controlnet, get feedback on the preferred results, and recycle until done
>>101135847jesus christ how horrifying>>101135957sweet jesus how nice
>>101135957You could've just scribbled a blob man approximating the pose if you're taking it that far
>>101135985Horrifying is the direction anon
>>101136134I enjoy her personality
>>101136134>>101135496more please
>>101132400Based Boomer.
>>101131101moody bloodborne prompts
Some color, I wonder what the machine will spit out from this. I leave the layer with the wings out cause thats just gonna make a mess
>>101136730nice vertical split gen
>>101136730nice
>>101136765Thanks m8
>>101136901cheers m8
>>101137744
>>101137829
>>101138011
I miss chang
decent gens ITT
>>101138554I'm tired of building these sand castles anon.
>>101138565
>>101138565theyre very nice castles tho
>>101136925>>101136821>>101136806prompt ?
>>101138806
1grill, face down ass up, cooking
>>101139021what model?
>>101139108PixArt -> Pony -> imageupscale
>>101139158pony model over photorealism?
>>101139175I need something to stabilize the image and if I put it through another PixArt it just eats my Video card like a fat kid through 2 cakes and asks for more.
>>101139215Ah alright. That looks very good, even canvas texture is top notch
>>101139175>>101139215Base Image
>>101139254Upscaled
>>101139272Upscaled using pixart again (half power because I don't have the juice)
>>101139277would
>>101139288>hot moms in your area
>>101139320careful you got a pokie A good sign of a woman who gives zero fucks is the how dingy the carpet is
>>101139272
https://files.catbox.moe/ohxzw5.jpg
>>101139384cool style in that catbox
>>101139393Had to put it there because it's over 4mb
>>101139408
>>101139443
>>101139477
Anyone know how novelAI's vibe transfer works, or more specifically how to recreate it with stable diffusion A1111? I tried t2i-adapter, base controlnet, and IP adapter with the Ponyv6 model but they didn't work very well at transferring styles let alone specific characters.
>>101139485
>>101139507Sorry m8, don't know what you're talking about so I can't even help.
>>101139507We can only guess at their secret sauce. What have you tried doing so far?
I wonder why Kohya uses folder names for repeats etc. instead of settings from the GUI. Very unintuitive.
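For reference, the folder naming in question is usually `<repeats>_<name>`, e.g. `10_mystyle` meaning "repeat these images 10 times per epoch". A minimal sketch of reading that convention, assuming that exact pattern; `parse_repeats` and the example folder names are made up for illustration and this is not kohya's own parsing code.

```python
import re

# Assumed pattern: leading integer, underscore, then the concept name.
FOLDER_RE = re.compile(r"^(\d+)_(.+)$")


def parse_repeats(folder_name):
    """Return (repeats, concept_name), or None if the name doesn't match."""
    match = FOLDER_RE.match(folder_name)
    if match is None:
        return None
    return int(match.group(1)), match.group(2)


if __name__ == "__main__":
    for name in ["10_mystyle", "3_rei ayanami", "no_repeats_here"]:
        print(name, "->", parse_repeats(name))
```

Whatever the reason for the design, it does mean the repeat count travels with the dataset folder rather than living in a GUI preset.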
>>101139551IP-adapter clip, some T2i-adapter models, and openpose, lineart, and depth control net models. They don't seem to have a huge impact on the result unless the prompt is simple. NovelAI seems to be able to generate pretty complex stuff
>>101139655I've just split mine in to Model\date\Seed#Posting\JPG\Seed#
>>101139706
>>101139680
IPAdapter, if used a certain way, leads to the input images having a GREATER impact on the output than the prompt regardless of its length. I may be wrong but it sounds like you're expecting the free tools to be as easy to use as the paid ones which will never be the case. I understand MJ and NAI make it incredibly simple to drop a few reference images in and press go (and obviously whatever they're doing behind the scenes far outclasses what one can accomplish quickly on their own computer) but IMO you're missing out on the breadth of customizability afforded when you own the tools, among other things. Digging through the links in OP will likely bring elucidation but if all else fails: experimentation is key.
>>101139507pony might be shit with it have you tried a different model?
>>101139818
>Digging through the links in OP will likely bring elucidation but if all else fails: experimentation is key.
I've been experimenting, but LoRAs seem to be the only way to get stable diffusion to generate characters accurately, and that's a lot less flexible since I would have to train my own. I haven't had a lot of luck with complex poses even with loras
>>101139835
The base SDXL model, but it's not that great at anime characters. Also some pony derived checkpoints
>>101131467
Karras most of the time. auto1111 has a scheduler select now.
sometimes Exponential is good with dpmpp_sde[_gpu]
>>101140023Give euler_ancestral with ddim_uniform a try
Good night
>>101138943thanks
>>101128616ty collageanon
>>101139955ipadapter is the poormans lora after all
>>101128690 Based. >>101129714Very blatant indeed.
>>101132454What are you trying to accomplish?
ded
A new image model will be open-sourced, this one is a 5.6B "replication" of SD3
https://x.com/FAL/status/1805306666831036863
https://old.reddit.com/r/StableDiffusion/comments/1cswloa/cloneofsimo_start_training_sd3_replication_weight/
>>101141573
https://new.reddit.com/r/Open_Diffusion/comments/1dn7h53/open_diffusion_mission_statement_10/
>The goal of Open Diffusion is to create Open Source resources and models for all generative AI creators to freely use. Unrestricted, uncensored models built by the community with the single purpose of being as good as they can be.
BASED
I'm sorry if this is retarded, but how do I train a style LoRA rather than a character one? Same shit just don't tag characters and tag artist/style instead?
>>101141582now that's interesting
>>101141583
Kind of the opposite. When you train a style lora, I think you should just tag pretty much everything, except traits characteristic of the style. If it's dark and moody, or bright and colourful, or the characters always have big heads, don't mention any of those. Caption everything else, from subject, to action, context, background, etc. Caption it like you would prompt for this very image whilst using your desired lora.
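A minimal sketch of that captioning rule, assuming comma-separated tag captions: drop whatever describes the style itself and keep everything else. `strip_style_tags` and the example tags are made up for illustration; which tags count as "style" is a judgment call for each dataset.

```python
def strip_style_tags(caption, style_tags):
    """Remove style-describing tags from a comma-separated caption.

    Everything that is NOT part of the style (subject, action, background)
    stays, so the LoRA absorbs the style as the uncaptioned common factor.
    """
    style = {tag.strip().lower() for tag in style_tags}
    kept = []
    for tag in caption.split(","):
        tag = tag.strip()
        if tag and tag.lower() not in style:
            kept.append(tag)
    return ", ".join(kept)


if __name__ == "__main__":
    caption = "1girl, dark, moody, sitting, forest background"
    print(strip_style_tags(caption, ["dark", "moody"]))
```

The same filter works on auto-tagger output: run the tagger, then strip the handful of tags that describe the look you are trying to bake in.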
>>101141582Oh wow! It's Simo and his friends while at it. Now that's a real OG for /sdg/, rather than their thread celebrities.
>>101141672I don't know those guys? Are they actually good?
Morning anons
>>101141633Caption drop rate should also work well with this
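A rough sketch of what a caption drop rate does, under the assumption that it simply blanks the whole caption for a random fraction of training samples. `apply_caption_dropout` is a made-up name; real trainers implement this internally (kohya exposes it as a caption dropout rate setting, as far as I know) and details may differ.

```python
import random


def apply_caption_dropout(caption, drop_rate, rng):
    """Return an empty caption with probability drop_rate, else the caption.

    Training on some uncaptioned samples pushes the shared look of the
    dataset (the style) into the LoRA regardless of the prompt.
    """
    if not 0.0 <= drop_rate <= 1.0:
        raise ValueError("drop_rate must be between 0 and 1")
    return "" if rng.random() < drop_rate else caption


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the demo is repeatable
    captions = ["1girl, sitting, forest background"] * 10
    dropped = [apply_caption_dropout(c, 0.1, rng) for c in captions]
    print(sum(1 for c in dropped if c == ""), "of", len(dropped), "captions dropped")
```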
>>101141677Don't know about others, but Simo themself is one of the folks, if not THE person behind our tools for lora training.
>>101141685>Caption drop rateNever tried it, but now that you mention it, does make sense..
>>101141696Holy shit, that sounds good, I also saw that from their statement, that's based af.>As *Open* Diffusion, we wish to produce models that are useful for the entire community. Questions of morality and ethics beyond the law are beyond the scope of this project. We are not an ethics board or a group of philosophers.
Can anyone recommend an anime dataset with "proper" descriptions? I don't mind if they were generated by llava or other image-to-text model, I do mind if they are just tags from yetanotherbooru.
There's Borismile/Anime-dataset but it's too small and most of it has no captions (or upper cases).
Picrelated: "a painting of a person walking in a field with a flag in the background and people walking in the distance"
>>101141708I hope this works out and we can finally let SAI die in peace
>>101141719
>implying you need more than 1girl
I wonder what the person behind Pony uses, especially now that they mentioned better comprehension of non-booru tags.
>>101141705It's a nice compromise since those style loras can work pretty well without captioning at all
>>101141708>>101141720That's what SAI should've done in the first place, train their model however they want in the scope of legality, the rest is useless and they shouldn't act like they are the master of morality, that's fucked up
Man, I can't wait to switch off Pony for something less vram hungry. I would love to finally try and train my own finetune on the breathtaking 8vram of my gpu.
>>101141809what's preventing you trying the SD1.5 models again?
>>101141809
>I can't wait to switch off Pony
Same, but not due to vram but because I hate how ugly pony looks in terms of the actual image quality.
It's like the whole dataset had jpeg artifacts and a piss filter thrown over it.
>>101141818
Nothing really, I've just spent enough time with them. As much nostalgia as I have for 1.5, and as much as I have done with them, even for the reduced speeds of PDXL, the bump in quality has been more than worth staying with it. Even something like hands became way less of a hassle. If I'm to return into lower parameters similar to that of 1.5, I'd rather wait for more pixart support, or any other alternatives for anons on a vram budget. Also 1.5 was indeed very hard to finetune, what little experience I had with it. Tried making my own base by training own loras/lycoris and merging them into it, but I failed miserably.
>>101141836
>It's like the whole dataset had jpeg artifacts
Because it actually did!
>JPEG Artifacts
>An issue I hadn't initially noticed in V6, which was brought to my attention by several users, is the presence of JPEG artifacts. Although this problem is only evident in certain styles, I am committed to addressing it. The issue appears to stem from two main sources: some of the source material already contains artifacts, and my pipeline, which involves saving images at 95% quality twice, likely exacerbates the problem.
>To resolve this, I am making adjustments to the pipeline to ensure images are directly transferred from the source to VAE encoding without intermediate quality reductions. Additionally, I am developing methods to detect and either automatically correct or exclude images with noticeable artifacts. This should significantly reduce the presence of JPEG artifacts in the output of V7.
Sauce: https://civitai.com/articles/5069/towards-pony-diffusion-v7
I personally don't take issue with its quality, but I have plenty of my own gripes with it. Still, probably the model I've had most fun with.
>>101141899>and my pipeline, which involves saving images at 95% quality twice, likely exacerbates the problem.wtf, that's retarded, how could he make such a basic mistake in the first place?
>>101141918the simplest mistakes are also the simplest to make
>>101141899I'm not so hyped of his V7 anymore, he started cucking his models on V6 by removing the artist tags, his models will be more and more cucked as the versions will go on
>>101141573that thread is a month old
>>101141954
The tweet is 1 day old, and this announcement is also 1 day old
https://reddit.com/r/Open_Diffusion/comments/1dn7h53/open_diffusion_mission_statement_10/
>>101141947
I remain optimistic, since I don't see the lack of artist tags as an issue. These very same artists likely remain in the dataset, meaning it's learning good traits from them anyway. Instead I'm looking forward to see how it better handles non-booru prompts, any improvements on realism, and getting rid of the lengthy score_schizo.
>>101141767
i think he said he uses wd and then expands those tags with llava
>>101142162Sounds plausible. I just checked one of the datasets for that Open Diffusion model and it did include both.
>>101142162
>>101142201
that's a quite terrible approach desu, you can have a shit ton of solutions when transforming tags into real sentences
1girl, table, sitting, chair could be "a girl is sitting on a chair in front of a table" or "a girl is sitting on a table in front of a chair"
>>101142221sure but it just has to be better than the alternatives
>>101142230why can't he use CogVLM or Florence instead?
>>101142221Less efficient in tokens, but there is merit in this meaning better results for using natural language in prompting, rather than trying to fit into a particular captioning system.
>>101142236>FlorenceDidn't it come out like just a couple of days ago?
>>101142236
ask him, its pretty possible hes using something else now
but i really dont see why florence is so hyped, sure it is decent for its size and is fast, but the results i was getting when trying it werent really anything amazing compared to the better captioners
>>101142257
>the better captioners
which one allows NSFW in the first place?
>>101142267they might not know nsfw, but i dont think they reject you either, and florence isnt really descriptive about sex either and might fuck up completely when trying to describe it, since i assume there are not very many images involving it in the dataset
>>101142319
someone should finetune CogVLM or florence (or whatever good captioner model) only with nsfw to make them better at describing people doing anything else than just standing imo
>>101142328
someone tried/is trying to make a nsfw tune vlm but i assume its a huge undertaking to caption it alone
https://www.reddit.com/r/LocalLLaMA/comments/1d4ru63/phi3hornyvision128kinstruct_image_captioning/
>>101135631What happened to it anyway? I checked and there was no announcement in his twitter but he's still active.
>>101144027>>101144059>>101144131>>101144359debo
>>101144417debo btw
>>101144547>debo btwchill. no one cares.
>>101144724
it's weird that he is allowed to post here while pretending to be other anons, man
im just pointing it out
>>101144787
we can't disallow anyone from posting on a mongolian basketweaving forum, now please either stay on topic, or don't bother posting
>>101144547thanks, adding to the filters
>>101144787i can only understand that you are somebody incredibly unfamiliar with this general and ill excuse your ignorance as the ignorance of a newfriendbut, to be clearyou have absolutely no idea the person you're defending right now or their ideals and past actions. you are basically asking this comfy, fine, good general to go exactly the same way as /sdg/, a literal and metaphorical cess pit of retardation
>>101144383>>101144547Whom? Why are you labeling me with some random cunts name?
>>101144846I'm one of the folks who bakes these threads, and among other reasons, I bake them precisely to avoid this off-topic bullshit slipping over from the other general. I only have a vague idea of what's going on there, and I want none of it here. We're not janitors, you're free to report if there's a valid reason. The less it's mentioned here, the better for this general, and let's leave it at that.
>>101144868>Whom?Don't try to use grammar you clearly don't understand. It's just "Who?", you're not using the pronoun in the objective sense in your post.Classic Debo move and an absolutely pathetic attempt to pretend to be someone more educated than you are to try and throw us off the obvious scent.Back to /sdg/ with you, please. And promptly.
>>101144903>Whom?>folksFuck off with this weird corpo personality debo. Please keep this dumb ass shit in /sdg/. No need for it here.
>/sdg/ containment status: breached
>>101144925Chinese models are superior to anything StabilityAI has ever created because China believes in workers.
>>101145038based
Hunyan Lubu is going to be released this Friday. I cannot reveal more details except it is The Dream.
>>101144975Here we can see true work of an auteur. This what only Chinese models can achieve.
>>101144027>>101144059>>101144359>>101144417Very funny gens. I know you tried your best but I can't stop laughing.
>>101145158That's not all of them. Keep tagging all my guns in the thread
>>101145217I don't care.
>>101145256Ya you do you cheeky cunt. Otherwise you wouldn't tag me.
>>101145278debo
>>101145314Retard
>>101145348The most obvious sign that Debo is trolling a thread is exactly this type of response
>>101145314whats's that?sounds like troonshit
Debo is going wild both here and /sdg/
just gen
you're jealous cuz only certain people are skilled enough to create interesting images
>>101145437that's why debo is all over this thread
>all it takes is one retard replying on cooldown
So PAPi is the new debo tag for /ldg/ to add to the filter?We might consider adding some of this stuff to the pastebin, has anyone seen Ran lately?
>>101116250>>101145430You're absolutely correct sir.
>>101145430>just genI endorse this message.
>>101139726I like the style on this one
>>101145481
Oh, I can give you all my tags.
ComfyUIUIPAPAPiSDXL
PAPi and SDXL end in .jpg since they are high quality. The rest are .png
>>101145553Can we add this to the pastebin?
>>101145084Cheers anon
>>101145553>>101145555The IQ of these posters is through the roof here.
remember gyate
Bring out your gens, it's here to collect, and soon you will reap fruits of what you've sown.
>>101145624
Don't you mean gyat?
tfw it's almost time for new collage !!!!!!
>>101145659You are too stupid to know what gyate means, slav nigger.
>101145919not very gyate of you
>>101145624lora training on onetrainer fell flat
https://www.reddit.com/r/StableDiffusion/comments/1do5gvz/the_open_model_initiative_invoke_comfy_org/
>tl;dr
Teams behind Invoke, ComfyOrg, CivitAI and Laion are coordinating together with a couple of goals in mind:
>True open source: Permissively licensed using an approved Open Source Initiative license, and developed with open and transparent principles
>Capable: A competitive model built to provide the creative flexibility and extensibility needed by creatives
>Ethical: Addressing major, substantiated complaints about unconsented references to artists and other individuals in the base model while recognizing training activities as fair use.
>>101145930
Appreciated.
>>101145954>Ethical>/sdg/ discordtroon involvedit's shit
>>101145962Looks gyate to me desu
>>101145954Time to scrape all loras and good models off civitai
>>101145954
Ethical means death. The only ethics is: "was your image available in a public space, yes or no". If your name and art is public and publicly available, it's fair game.
>>101146026that's just about the closest it got to being gyate
>>101145954So what does this mean for the little man (me)
>>101146157It's an initiative focused on actually open AI image models but run by cowards. But honestly if they just focus more on the tech and training and less on the training itself, that's fine. We really need more people like Pixart that focus on proofs of concepts that try to push the bounds of efficiency. For example, they really should figure out distributed training.
>>101145954>ethicalwhen will they learn?
>>101146026so basically I'm waiting on kohya or something for another attempt
>>101146154nice
>>101146183Then Nvidia will stop making big bucks
don't even know where to start, looking at a pony diffusion v6 model that's lora but i want to run it locally and no idea where to fucking start.looking at multiple tensors, and i have webui by auto1111 but idk where to go from here
>>101146211someone else can make it unethical, I suppose?
>>101146239
They'll make more than enough big bucks selling 5090s and anything with raw compute. A million 3060s will never be good even if they could theoretically be used.
Mend the schismUnite the threadsThat is all
>>101146242
obviously you load pony diffusion as main model / checkpoint and the lora as lora in your UI
in comfyui its checkpoint->lora->sampler
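For the auto1111 side, a small sketch of how that usually looks in practice: the checkpoint file goes under `models/Stable-diffusion`, the lora file under `models/Lora`, and the lora is switched on from the prompt with a `<lora:name:weight>` tag. The helper below only builds that tag; `lora_tag`, the file name `my_style` and the 0.8 weight are placeholders, not anything from the anon's actual download.

```python
def lora_tag(name, weight=1.0):
    """Build an auto1111-style prompt tag that enables a LoRA.

    name is the LoRA file name without its extension; weight scales its effect.
    """
    return f"<lora:{name}:{weight:g}>"


if __name__ == "__main__":
    # Hypothetical prompt: pick the Pony checkpoint in the UI, then add the tag.
    prompt = "1girl, sitting, forest background " + lora_tag("my_style", 0.8)
    print(prompt)
```

Since the lora was trained on Pony, it should be used with Pony (or a Pony-derived checkpoint) selected as the main model; loading it on an unrelated base usually gives poor results.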
>>101146268The SAI shills and autists are not worth being with.
>>101146254it's already in the process >>101141573>>101141582
>>101146254
you can't be the sole judge of ethics, that's dangerous to think you are some kind of god who knows better than the others what's "good" and "bad"
>>101146049Put 'em on the torrents.
Ethics are for the little people.
A fresh loap of bread? Save me a loaf:
>>101146316
>>101146316
>>101146316
>>101146298society isn't ready for excellent ethics
>>101146299There's already one floating around with 2022 and down models and loras
oh no, I epic brainfarted