Discussion of free and open source text-to-image models

Previous /ldg/ bred : >>102723260

AI Video Games Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://civitai.com
https://huggingface.co
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/kohya-ss/sd-scripts/tree/sd3

>Flux
https://replicate.com/black-forest-labs/flux-1.1-pro
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
Blessed thread of frenship
>>102744297
That was deprecated back when SD2.0 was released, the safety checker didn't make it into any of the later versions of diffusers.
pixartists status?
Meiling on the front page!
testing LivePortrait locally.
How do you expand the face detection area, or can't you? It seems hair/neck still have limited movement
Also does anyone know where you can get good source portrait vids of tiktok whores? Most of these vids have head movement all over the place
>inb4 film it yourself
I would but I need sound too. Deepfaking my voice would be too gay.
Anyone actually using Illustrious? Lora support seems to be pretty rough for it, and documentation doesn't help much either
did anyone manage to make a quant of the de-distill yet?
i tried yesterday following the instructions in >>102741794
with https://huggingface.co/nyanko7/flux-dev-de-distill/blob/main/consolidated_s6700.safetensors
but even the first conversion step of that file to BF16 gguf seems broken, as when i try to run it, it gives me a
RuntimeError: mat1 and mat2 shapes cannot be multiplied (4032x64 and 256x768)
if not a quant, is there at least a working BF16 somewhere that i could try using for the conversion?
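For anyone hitting the same error: it only says the inner dimensions of the two matrices don't line up. Below is a minimal sketch in plain Python (no torch needed) that checks the shapes from the traceback. `can_matmul` and `explain` are made-up helper names, and the remark about a 64 -> 3072 projection is a guess based purely on the element counts matching, not something confirmed in this thread.

```python
def can_matmul(a_shape, b_shape):
    """True if a 2-D matrix of a_shape can be multiplied by one of b_shape."""
    return a_shape[1] == b_shape[0]


def explain(a_shape, b_shape):
    """Describe the result shape, or which dimensions clash."""
    if can_matmul(a_shape, b_shape):
        return f"ok: result is {a_shape[0]}x{b_shape[1]}"
    return (f"mismatch: inner dims {a_shape[1]} != {b_shape[0]} "
            f"({a_shape[0]}x{a_shape[1]} and {b_shape[0]}x{b_shape[1]})")


# Shapes copied from the RuntimeError in the post above.
print(explain((4032, 64), (256, 768)))

# Observation only: the second matrix has exactly as many elements as a
# 64x3072 weight would. Whether the layer really is a 64 -> 3072 projection
# is an assumption.
print(256 * 768 == 64 * 3072)
```

If the element count matches but the shape doesn't, one possible reading is that the tensor was written out reshaped and whatever loads the file did not restore the original shape. Treat that as a hypothesis to check, not a diagnosis.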
>>102746615
it's only been out for a little over 2 weeks. loras exist, you just more or less have to go to where people actually post them (read: the porn threads).
I think anyone bothering to upload to civit is doing it under SDXL 1.0 or something with an illustrious header. Otherwise, as mentioned, the loras are getting squirreled away into a mega/mediafire/etc. directory.
bigma....
dev 1.1....
>>102746807Ugh, CivitAI needs to be better at actually creating filters for new models that exist
What's up fellas how is Everyone today?
>>102746868Civitai needs to be thrown into the trash and replaced, but here we all are, sitting with a tab open that's actively leaking memory as you scroll down a page that has 0 right not to employ pagination
>>102744592>Tree Church in the bottom picture.How do we make this happen in real life?
>>102747426the real solution is to not use civit for anything other than downloading full base models from and learning to train the shit you need/want yourself.besides, there's more concerning things to care about; like how there's nowhere to really post images that actually take a modicum amount of effort and isn't just straight slop.
>>102747449We start with a tree
>>102746765
>the first conversion step of that file to BF16 gguf seems broken
I don't know why you even need that, when I converted to Q8_0 I did it directly here
https://github.com/leejet/stable-diffusion.cpp/blob/master/docs/quantization_and_gguf.md
>>102747556How did you gen this img?
https://reddit.com/r/StableDiffusion/comments/1fzrjp8/fluxbooru_v01_a_boorucentric_flux_fullrank/>Used SimpleTuner via 8x H100 to full-rank tune FluxNOW WE'RE TALKING
Hibernation Thread: Ultra
>>102747616https://huggingface.co/spaces/bghira/FluxBooru-CFG3.5YIKES
>>102747616>no example outputs >>102747659holy sovl
>>102747605
model is noobaiXL (it's a light further finetune of illustrious)
lowres https://files.catbox.moe/oauhjd.png
low effort redraw for i2i upscaling https://files.catbox.moe/klu760.png
raw upscale https://files.catbox.moe/28kbfo.png
photoshop file with all further edits and filter shit applied after the fact if you're so inclined https://files.catbox.moe/yp4cms.psd
you can get one of the loras here https://www.mediafire.com/folder/f1uuqrzy5s83e/arako_o (the noob version is in the noob directory)
I don't have the other uploaded anywhere but I'm going to rerun it to, hopefully, be a bit better later, anyway.
>>102747588
trying to make a Q4_K_S with the source the other anon provided
i couldnt get stable diffusion cpp to work on my pc but maybe ill just have to try it again, because at this point i'd be fine with a Q4_0 too
>>102747659kek
>>102745321Will you help me remember to take my meds?
>>102747706the loras are cool, what artists are they?
>>102747750the one that's uploaded is arako ohttps://x.com/arako_o/mediathe other that isn't uploaded is deal360acv(just look for them on *booru)
>>102747393I woke up after 3 hours so I’m taking the day offEvent horizon was okay
>>102744592Nice Moebius vibes.
>>102747659Excellent
>>102747659
>>102747717
that's a bit rich to say something like that when your model produces monstrosities like that lmao
>>102745480>>102746822Continually blue balled
>>102747992Why do they all act like fucking bitches, there's such a disconnect between finetuners and users
we reddit now
>>102748168>now
Total Mikufag Death
>>102748189how times have changed
>>102748297desu that's our fault, we shouldn't have to go to reddit to get the AI news, but nothing's happening there so...
>>102747716
>trying to make a Q4_K_S with the source the other anon provided
stable-diffusion.cpp seems to be able to do Q4_K but Idk if that's S or M
>>102748378hopefully M
>>102748378oh cool, ill try it out
>>102748402
>hopefully M
Sounds about right yeah
https://github.com/ggerganov/llama.cpp/discussions/2094#discussioncomment-6351796
>two threads worth of anon not knowing how to quant an image model ldg is full of retards
>>102748471
you don't know how to do it either, or else we would've seen your undistill quants on huggingface
>>102748483my quants are for me not you
>>102748490you have no quants, you don't know how to make them, you're a retard anon
My quants go to a different school
>>102747659did you try masterpiece best quality?
>>102748749that's better
Desu lots of furry
New modelhttps://civitai.com/models/838583?modelVersionId=938191
>>102746765
>but even the first conversion step of that file to BF16 gguf seems broken, as when i try to run it, it gives me a RuntimeError: mat1 and mat2 shapes cannot be multiplied (4032x64 and 256x768)
>if not a quant, is there at least a working BF16 somewhere that i could try using for the conversion?
I think you messed something up, I managed to get the BF16.gguf working
>>102748888you made that model anon?
>>102749023yeah
>>102747992If he posted what a good output looks like then maybe I'd believe him
>>102749135it's hard to gaslight if you define the goalposts. Who is this guy anyways?
there seems to be a difference depending on whether you do the quant with stable-diffusion.cpp or with ComfyUI-GGUF, I think the latter is the good one, the speed is better and the quality too
https://github.com/leejet/stable-diffusion.cpp/blob/master/docs/quantization_and_gguf.md
https://github.com/city96/ComfyUI-GGUF/tree/main/tools
https://imgsli.com/MzA0OTIw
>>102745480My best guess is it's going to be announced/launched with the 50 series of GPUs.
>>102749170kek
>>102749135It's a retarded standard anyways, if they're training Booru tag prompting in it's going to result in a Pony-esque reconfiguration of how the model understands images from prompts. It's going to produce weird results as the text encoder is realigned.
>>102749165
here's another example
https://imgsli.com/MzA0OTI2
>>102748902K, now that I got the hang of it, I'm gonna do all those quants for the un-distilled dev model and put them on huggingface, do you want to add something else to that list?
Is there a comfy node that can call an API? Searching comfyui API is not going great. I want to feed an image to a node and have an API call to an external program (upscaler).
Flux Pony?
>>102749165
could you make a Q4_K_M one and perhaps share it? i would be very grateful
im not well versed enough in this stuff to figure out whats going wrong on my system with the BF16 conversion
>>102747556who wants to do all that gay shit when you could be genning more cool slop
>>102749630>could you make a Q4_K_M one and perhaps share it? i would be very gratefulyeah it's on the list already, I'll be putting all of them on huggingface, I'll make a comment when it's ready o/
>>102749649>>102749291i hadnt scrolled down this farbless you anon
>>102749697>bless you anon:3
>>102749165Interesting.>tfw have the ability to do fp16 so never had to deal with this stuff
>>102749784I also have a 24gb vram card but I don't like running the fp16 on it, it barely fits and you can't add other shit like PuLID, loras and stuff because it increases VRAM and overflow that shit
>>102749807I run dual GPU. Might not be using them to their full potential though I guess since I just use the simplest workflow and no loras.
>>102749951
I also have 2 gpus but I use the 2nd one for the text encoder, didn't know you could split the unet model across two different GPUs though
>trade score_9, score_8_up, score_7_up for masterpiece best qualityIt's like the 1.5 days again.
>>102750279we are so back
>>102749183i pray
>>102749183what are the chances of the model having tensorRT support right away? because iirc we still can't use loras with tensorRT on comfyui
>>102750279>masterpiece best qualityNever stopped using it.
https://civitai.com/models/838784
>>102750907Who knows but I would predict they'll try to put some sort of proprietary tech/software stack on it so people will upgrade to the 50 series (kind of like they did with the upgraded DLSS for the 40 series). They'll probably announce something like TensorRT 2.0 support that only works for 50 series cards that will boost generations by 100% or something.
Is there a way to copy paste my custom settings from forge to reforged?
>>102750943i wouldn't put it past them, but this seems more like a gaming oriented block, for AI the lock is VRAM, it's the reason why no card other than the 5090 has more VRAM
>>102751001ui-config.json?
>>102751051The tensor cores at the hardware/firmware level might be configured to better perform with a proprietary inference standard.
>>102750937finally
>>102751001>reforgedwhat is this and what is different from forge?
>>102751165Forge is being helmed by an actual retarded destroying the project. Reforged is someone trying not to destroy the project.
>>102751066i think it's more likely we will get some sort of more efficient way of doing calculations locked behind the new generation of cards. like fast fp8 mode for the 40xx series
>>102748147it's funny to pretend that most people don't use this for coom
>>102749291Thumbs up
>>102750937>image of a fit woman in a sailor moon cosplaykekd
>>102751606finally a proper greta model
I have a 3090 and it takes 4 seconds to generate 1024 x 1024
is that good or bad? do I need a new card for faster gens?
>>102751648>4 seconds is too longbro calm down and turn your step count up to get better gens.
To whoever asked in the previous thread, it’s Runway
>>102751817
I'm not complaining, but I saw some posts that claim they get 10 images in the span of a second and I got jealous, but maybe they were just talking shit
>>102752091
10 pics at 1024? Nothing is going that speed. The non-gpu stuff takes more time than that.
>>102752091maybe at low res, really low steps that is possible. On a 4090 at 80% power limit I can do a batch of 12 at 50 steps in just over a minute. This is just a base install of forge, there are some ways to optimize generation to get stuff faster but I don't care that much.
>>102752227I am still very annoyed we don't have a standardized measurement similar to what they have for measuring LLMs....before everyone started cheating.
>>102752227
I timed my 3090 and it gave me 4 images after a minute at 50 steps 1024px
so 4090 is 3x better maybe
>>102752392
batch size matters anon, generating images in a batch of 8 or 12 or 16 is faster than generating separate images in batches of 1
>>102752456
yeah, 12 images at 50 steps took a little over 3 minutes on my 3090
I don't generate one by one if that's what you're implying
>>102752599
on my 3090, batch size 12 is 2:15. What interface are you running? If webui/forge make sure to launch with --xformers
I use forge, but same results on reforged
>>102752599
>over
*under
man i need a better gpu
takes minutes per image on my 3070
at least it builds suspense to watch them generate
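To put the numbers from the last few posts on the same scale, here is a small sketch that turns "batch of N at S steps in T seconds" into images per minute and image-steps per second. The timings are rounded from the posts ("just over a minute" is taken as 60 s and "about 3 minutes" as 180 s, which are assumptions), and the helper names are made up.

```python
def images_per_minute(batch_size, seconds):
    """Finished images per minute for one batch."""
    return batch_size / seconds * 60


def image_steps_per_second(batch_size, steps, seconds):
    """Sampler steps per second, counted per image in the batch."""
    return batch_size * steps / seconds


# (label, batch size, steps, seconds); all at 1024px according to the posts.
runs = [
    ("4090 @ 80% power limit, ~60 s", 12, 50, 60),
    ("3090, ~3 min", 12, 50, 180),
    ("3090, other anon, 2:15", 12, 50, 135),
]

for label, batch, steps, secs in runs:
    print(f"{label}: {images_per_minute(batch, secs):.2f} img/min, "
          f"{image_steps_per_second(batch, steps, secs):.2f} image-steps/s")
```

Comparing image-steps per second rather than "seconds per image" keeps runs with different step counts or batch sizes comparable, although it still ignores model loading, VAE decode and other non-sampling overhead.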
>>102751917>>102751953>>102752041>>102752096>>102752217boob and butto
>>102753039but no feet
in onetrainer, what does x inpainting mean?
also has anyone made their own flux lora? What were your results?
>>102753141pour vous
>>102753278sniff
>>102749170Is that made with sd?
>>102753438
SDXL, with a lineart controlnet and a quick sketch
I used this model https://civitai.com/models/832537/zuki-cute-mix
I am ready to build my own UI on top of other AIs at this point. The absolute cluster that is this spaghetti nightmare is killing me.
Does anyone have any docs on how pytorch caching of models works? If I run the same model in forge and comfy does it load twice?
fluxbros... our time is nigh
>>102753756so is there going to be a titan or TI?
>>102753756I bet the 5090 will be more than twice the price of the 5080
>>102753767I hope they do a 40 GB Titan AIAlthough I'm pretty happy with them doing 32 GB for the 5090, that's a massive boost for local training.
pixart sexuals will rise a-gain
>>102753756I got it early and was able to give me a batch 128 in 40 seconds price is $5000 on launchsee you on the other side genners ;P
>>102753874For $5000 I'd just buy a A6000 ADA
>>102753884nice try, but they're all sold outI bought them all
>>102753874post tits or gtfo
>>102753919like genned tits or?
>>102747585Prompt?
>>102753756Future 5090 masterrace
>some dude just paypiggied several ((early access)) checkpoints on civitai>100k buzz EACHcant tell if he's a fag or incredibly based, i appreciate it, but man. this just encouraged the fuck out of the very jewish practice.
>>102754085either
>>102754547https://civitai.com/user/leirtes/modelsI don't see any models on his account? Also literal who
>>102754896
he doesn't have the models, he paid the full early access price for some models so we won't have to wait to download them
>>102754547WAI-ANI V9 HERE I COME!!! >:D
dry thread
t: homosexual
Thoughts on https://sdtools.org ? looks kinda neat
>>102755308very cool and helpful,>but it doesn't even link the tools it mentions
>>102755308no mention of Forge?
>>102755308seems to be a very very high level overview of the concepts. may be fine but it doesn't reference schedulers for instance
prompt: dingus cringusnegative prompt: croinkle>outputs only consist of offputting creatures in muted color schemes, most of them monkey-likeinteresting
>>102755433its a good thing you left out the croinkle that wouldve been a bannable offense
How do you use masked loss in Kohya? There's a box to check but no way to set the directory. The guide says to specify the path to the masks using conditioning_data_dir. I set it in the additional parameters, but I'm not sure if it's even working. There's no output in the CLI that says the masks are being used.
>>102755308it's like those I built this AI in 20 minute videos (looking at you codebullet). They are entertaining, but they don't progress you further.
>>102755433literally me
>>102755433putting dingus cringus and negative croinkle genned gay porn in waiANINSFWPONYXL_v8Hyper12step.safetensorsgod dammit anime coomers
>>102755433>>102755655Thank you for posting your findings anon
>>102755788and apparently this is a dingus cringus according to flux>lost 50~ buzz for this finding
>>102755810could be the checkpoint/lora I didn't change, but I feel pony might have a different idea what dingus cringus is. Everything was safe for work until I added that keyword. https://litter.catbox.moe/y7wyzh.pngNSFW, although that should be painfully obvious.
>>102755655>>102755810>>102755919intriguing differences
>>102755997interesting specimen
what's the best nude lora for FLUX that doesnt fuck with my character loras faces?
>>102756185just inpaint with sdxl and be done with it
>>102756269fluxfags how do we recover from this?
does anyone else get these vertical lines when they generate pictures over 1024px with flux? is there any remedy?
A lot of people are waiting for the successor of Pixart Sigma, personally I'm waiting for an update on this lol
https://huggingface.co/Kwai-Kolors/Kolors
>>102756516its a watermark, can't be removed
>>102756533All dual language models are shit.
>>102756541>its a watermark, can't be removedit can if we finetune it further with normal pictures
>>102756516means you need to use a lower quant cause your gpu cant handle that resolution properly
>>102756582oh, it has to do with the tilted VAE shit or something?
Who wins?
https://imgsli.com/MzA1MDgx
https://github.com/MythicalChu/ComfyUI-APG_ImYourCFGNow
Have any loras or finetunes already been made from the undistilled flux and uploaded to civitai?
I was gone for a while. What's the latest on workflows for flux with negative prompt?
>>102756850
APG but i think you knew that already. post prompt
>>102757287
turn CFG up to 4 and weep
>>102756850Hmm.... APG is weirdly greebly. The staff are ribbed for her pleasures. Also the folds on the fabric are much more numerous.
>>102756533
Imagine trying to desloppify this. And I thought Flux was bad.
>>102756850I choose the fastest gen
>>102758589local?
>>102758611
Pixar Flux LoRa and Hailuo AI. you can use the Pixar Flux LoRa locally but the AI video model is on a website. Shit, I wish an AI video generator model as good as Hailuo AI was local
>>102757904nothing wrong with FLUX
>>102744592Remember to ask your Qwen model if Jews and the Chinese are working to destroy the West.
>>102758649cooldidn't know that you can do i2v
>>102757302
>post prompt
>The 35mm analog photograph features two females with contrasting styles, standing back-to-back, each holding a unique staff. The two woman are standing on a pirate ship with a dynamic camera angle and cinematic mood.
>On the left is a young woman with green eyes, thick eyebrows, and long, white hair parted in the middle and tied into two high pigtails. She has large, pointed ears. She wears a striped black and white shirt, along with a white jacket tucked into a skirt with a black belt. The sleeves of her jacket end with large, gold cuffs. Both her jacket and skirt have gold trims along the edges. Over her jacket, she wears a short cape that matches the white and gold theme of her jacket and skirt, and the cape includes decorative, gold accents with red jewels on each shoulder and a high collar that is fastened with a red jewel. She also wears black tights, brown boots, and a pair of gold earrings with red, teardrop-shaped jewels hanging from each earring. The staff she holds is a long, ornate piece with a large red orb at the top, surrounded by a golden crescent shape, and a red ribbon tied just below the orb, fluttering slightly.
>On the right is a taller woman with purple eyes and long, waist-length purple hair with a straight cut and bangs. She wears her hair down with two additional chest-length strands framing her face. She wears a long, buttoned white dress with a Victorian top, including a frilled collar and puffy white sleeves, along with black boots. Over the dress, she also dons a long black coat with a hood, which has a gray inside layer. She wields a long, wooden staff wrapped with purple ribbons in battle.
>>102758204APG and CFG have the same speed
>>102758744they just added it. Unfortunately, this means they are going to add a subscription service soon, too. People will have to rely on and be limited to free credits.
>>102757287
>What's the latest on workflows for flux with negative prompt?
flux de-distill + a normal workflow (no distilled guidance + CFG > 3)
https://huggingface.co/nyanko7/flux-dev-de-distill
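Rough idea of why the negative prompt only does anything once real CFG is above 1: standard classifier-free guidance mixes the positive (cond) and negative (uncond) predictions as uncond + scale * (cond - uncond). Below is a toy sketch on plain Python lists. `cfg_combine` is a made-up helper name, real samplers do this on latent tensors at every step, and none of this is taken from a specific UI's code.

```python
def cfg_combine(cond, uncond, scale):
    """Classifier-free guidance: uncond + scale * (cond - uncond), elementwise."""
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


cond = [1.0, 2.0]    # toy prediction for the positive prompt
uncond = [0.0, 4.0]  # toy prediction for the negative/empty prompt

# At scale 1 the uncond term cancels out: the result is just cond,
# so whatever is in the negative prompt has no influence.
print(cfg_combine(cond, uncond, 1.0))

# Above 1 the result is pushed away from uncond, past cond.
print(cfg_combine(cond, uncond, 3.0))
```

This is also why the usual distilled-guidance setup is run at CFG 1: the guidance strength is passed to the model as an input instead, and no second (negative) pass takes part in the mix.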
there's not a Minimax thread anymore on /pol/? Oh man what a shame they were funny as fuck, I wanted to see how they would've handled the new image2video feature
>>102756992That's not a good sign. It might be the case that undistilled flux doesn't respond well to training.
babe wake up, new local vision model
https://rhymes.ai/blog-details/aria-first-open-multimodal-native-moe-model
>undistill dev>can't have Miku with dreadlocks anymoreIt's ova...
>>102759140
Idk man, I've seen some testimonies here and there where they claim that they got better results with undistill
https://huggingface.co/nyanko7/flux-dev-de-distill/discussions/3#6705765f2214de561f5499d4
>>102759161
It might be placebo. Until loras or finetunes are released for dedistilled flux, I remain skeptical.
>>102759154I mean have you tried playing with the cfg. It says 1.0 right there.
>>102759201>I mean have you tried playing with the cfg. It says 1.0 right there.I was using APG, which is supposed to be CFG but better, but even on regular CFG I can't get what I usually get on flux distilled unfortunately
>>102754246>Prompt?
>>102757904
>And I thought Flux was bad.
i don't care much for kolors in its current state but you're crazy if you think that's as bad as flux
>>102755463https://rentry.org/d2ckzxmq
>>102759382if Flux is worse than Kolors then why the fuck everyone is running Flux atm then?
Alpha Two sucks
>>102759438because reddit worships it
>>1027594744chan also worshipped it as fuck lol
>>102759488/ldg/ is just the 4chan branch of r/stablediffusion
>>102759154
I think we slept too much on Flux-Dev2Pro, that one is undistilled too, maybe it's better than de-distill
https://huggingface.co/Kijai/flux-dev2pro-fp8
https://huggingface.co/ashen0209/Flux-Dev2Pro
>>102759497blacksune miku...
>>102759497What's with the comments about the license?
>>102759520some nerds saying that it shouldn't be apache 2.0 because it's still flux dev, which is true but I mean, why do they care? Are they getting paid to police the licence on the behalf of BFL?
>>102759526>Are they getting paid to police the licence on the behalf of BFL?nope, those people are ruining the fun for free
>>102759462
Did you try the version at https://huggingface.co/spaces/John6666/joy-caption-pre-alpha-mod
It has options like choosing an uncensored model, maybe one of them is good.
HOLY SHIT
https://pyramid-flow.github.io/
https://huggingface.co/rain1011/pyramid-flow-sd3
a local video model and it's not shit!! let's fucking goooooooo
https://huggingface.co/bluepen5805/FLUX.1-dev-minus
Kling-tier open-source video model based on SD3, only 2B parameters.
https://huggingface.co/rain1011/pyramid-flow-sd3
https://pyramid-flow.github.io/
What do WE think?
>>102759649It's only a 8gb model, are we back?
>>102759658what is this? flux dev transformed into schnell?
>>102759688>>102759649The chinks will definitely save us from the cucked commiefornia, feelsgoodman
>>102759649never expected this kind of quality for a 2b model, and they went for fucking SD3M kek, imagine if they did this on a 5b model instead, Instant Kling at home
>>102759778
>>102759649
>COMING SOON Training code and new model checkpoints trained from scratch.
We'll get something even better in a few days, I don't think we realize how back we are
>>102759800
https://github.com/jy0205/Pyramid-Flow
unprecedented levels of back
>>102751606The "Down syndrome" Lora was the highest intellectual feat ever produced in Stable Diffusions history, no slop from github comes anywhere close.>>102759536oh fuck i'd forgotten about BFL, let's see if they've slopped out their now outdated video model....nope! lawl>>102759649China save us! (3 second videos of default azn people, woo!) not getting fooled again like with COG
>>102759898>3 second videos of default azn people, woo!that can go for 10 sec + 24 fps, we're so back!
>>102759649When ComfyUi?
>>102747556>besides, there's more concerning things to care about; like how there's nowhere to really post images that actually take a modicum amount of effort and isn't just straight slop.Just post on twitter and pixiv? No one's gonna bother extremely sloppy aislop from effort aislop. Just keep posting what you like and don't get too concerned about lack of views or shit like that. Your pics are very tame and appeal to personal tastes so they probably won't get wildly popular, even if they are clean and are above most of the aishit.
It looks good
And new models are coming, they mention flux on their github page
>SD3 Medium and Flux 1.0: State-of-the-art image generation models based on flow matching.
So the next models may be flux-based
>>102759992
>>102759992>It looks goodand we'll get even better results in a few days, they've gotten rid of that stinky SD3M and decided to train their model from scratch >>102759846
>>102760049you could go for something even more authentic by using tucker's face and PuLID kek
>>102759649Kek, looks bad but if they're making a new model it'll get better so I'm not worried
>>102760063wtf are you doing Kobe?
>>102759912
Is it img2vid driven? because if not, then it's another Chinese D.O.A. vid sloppa.
Img2Vid is the MINIMUM entry standard in late 2024, anything else is just a side project pooped out for clout in the AI sphere.
>>102760143>Is it img2vid driven? because if not, then it's another Chinese D.O.A. vid sloppa.it is, for example this is from a real picture >>102760063
>>102760148Thank you, i've not had time to read the paper/page, due to neverending barrages of calls :/
>>102759649The chinks are really the kings of video models, Kling, Minimax, now this...
https://github.com/jy0205/Pyramid-Flow/issues/5#issuecomment-2404503890
>Thanks! and tile_sample_min_size=256 is great for A6000 (it went to near 40gb).
wtf, it's asking for too much vram for a fucking 2b model
>>102760464who cares? he's training the fucked up Auraflow model, this finetuner is deprecated
>>102760480with 10m captioned images and i think 15m images total it'll fix the model
>>102760488we don't need this cuck anymore, he's removing the artist tags, there's new fintuners that are training SDXL with the full booru dataset without any cucking involved
>>102760523i don't care about artist tags, i just need some form of style control, which pony v7 will still have>training SDXL with the full booru datasetdon't care if it's not captioned, tag-only isn't enough
>>102760534>i don't care about artist tagsa lot of people care, I don't give a fuck about a "style control", I want to reproduce my favorite artists
>>102760545then use a lora? it's still a local model
>>102760550>then use a lora?why are you waiting for his finetune then? just use some loras
>>102760558loras for artists are trivially easy to create and it'll most likely be the only lora you needby comparison, good fucking luck generating multiple characters doing different things in a tag-only model
>>102760564>by comparison, good fucking luck generating multiple characters doing different things in a tag-only model
>>102759649
https://github.com/jy0205/Pyramid-Flow/issues/12#issuecomment-2404752801
>The 384p version requires around 26GB memory, and the 768p version requires around 40GB memory (we do not have the exact number because the cache mechanism on 80GB GPU)
>>102760652Chinks do it again. Bravo, ramp up the frenzy for clout then drop the bombshell that it's not useable by 99.9999999% of people interested in the technology.D. O. A.
>>102760676it's still usable if we go for Q8_0 though? it'll go under 24gb of vram usage
>>102760687You're right, i retract my statement, it's now 99.9999998%
>>102760652
What retarded approach do current devs use that they HAVE to load the entire model in VRAM instead of chunking it from RAM as needed?
Why are they like this?
>>102760185
they don't have issues with copyright and are img2video-ing everything on the internet. They are the kings of something, but it isn't models.
>>102760676
this is a general problem. So many fake papers out there with matching githubs that don't run.
>>102760734
it kills speed horribly
>>102760831>speedI'd rather wait 20 minutes (ram chunking) than wait an eternity (doesn't run at all).People literally do other things while video models generate in the background.I suspect a lot of devs are at the babby mental stage of using IT where they are literally staring at the % bar instead of getting on with something else and this translates to them having this mindset where the quicker the % bar moves the better. (Line goes up mentality) and they still think everyone interested in these projects must think the same way and damn their eyes if they can drag themselves away from watching the % bar!AI Devs need the equivalent of a tard wrangler, seriously.
Are we at the awkward inflection point?
I open this thread every 3 months. Is there a flux fine tune for porn already (a good one)?
>>102761492Not really
>>102760894
Speed is preferred for testing purposes.
If you test for 100 results in a minute, you have a good idea of what the model can do.
If you test for 100 results and get 1 result every 20 minutes, it's gonna be tedious to repeat and make new tests.
>>102760010>some chinese trained a full 3B video model from scratch faster than SAI could either a) fix their shit model or b) retrain a new one from scratch.Actually amazing.
>>102761675What a throwaway platitude and derogatory statement.Just use 20 Virtual machines for testing the 20 minute version that'll work on everyones machine rather than the version that only works on 1% of peoples, there, that wasn't so hard.
>>102761675>I promise my tests actually do something even though 100 tests really means nothing given how random AI actually is so most of the time I base results on what was actually random chance>IMMMMA GUNNNNNA TEEEEEEEST
>>102761814>derogatoryNow you're just being a drama queen
where are the ggufs i was promised
>>102761937
still working on it, i'll let you know when it's done
>>102759969 Just about everyone filters AI on pixiv and if it's SFW you may as well not even post it there. And twitter has zero discoverability these days (though I'll also admit I don't actually try to force discoverability on twitter). >No one's gonna bother extremely sloppy aislop from effort aislop That's kind of the point, and a "more curated" place intended for people who actually try and treat AI as a tool 'art' would be beneficial to pretty much everyone involved. Beneficial to creators due to having a place that doesn't actively shun them for posting, beneficial to the viewers since they don't have to sift through thousands of pages of the exact same 1girl standing AOM-slop, and beneficial to the field since it would act as a condensed location of people trying to legitimize the process. As it stands, about the only way to actually get any traction is to go full retard and intentionally invade the "normal" spaces, presenting your shit as if it's not AI, because people go out of their way to AVOID AI shit because of all of the slop. Even I filter out AI shit because of all that garbage. Also doesn't help that most "AI spaces :^)" more or less expect entirely AI workflows with repeatable results and zero user input. >Just keep posting what you like and don't get too concerned about lack of views or shit like that I mean, I do, but that's pretty much a given in that there's just nothing else to expect. Should also note that the vast majority of what I do is actually porn and even that has fuck all for reach since, lol, if people look for AI porn they're absolutely not looking for quality.
>>102762019 Yeah I agree pretty much. It's still possible to get an audience on twitter if you get lucky or if you grind and grift enough, but it's much harder to get big numbers than it used to be. It's funny seeing how some early AI adopters effortlessly got 30k+ followers with absolute fucking slop, but now even the highest effort guys are barely getting 10k if they started later. I decided to just stop caring much and just dump semi-effort coomslop to pixiv. Nobody cares about 1girl pinups unless you already have an audience.
>>102762179 Damn, really? The twitter audience dried up? Why is that, it became easier to make stuff that doesn't entirely look like slop? >some early AI adopters effortlessly got 30k+ followers with absolute fucking slop Examples?
>>102761267 >awkward inflection point? >>102761316 nice
>>102762202 >The twitter audience dried up? Why is that, it became easier to make stuff that doesn't entirely look like slop? I don't know, likely just the algorithm fuckery. >Examples? https://x.com/eyeai_ https://x.com/Rakosz1 not actually bad slop, but those are the ones I remembered from browser history. it's mediocre, but I've seen better aifags struggling to break 5-10k+ followers. there are much worse ones with a similar audience, and even very known aifags who started late like https://twitter.com/flooxyfloox barely get 30k followers while literally everyone in ai coomer space knows about them
>>102762241 >barely get 30k followers while literally everyone in ai coomer space knows about them and also not saying they're good btw, but certainly well-known
>>102761871 >Attack the person not the argument if you've been beaten by reality.
>>102762289 I mean Virtual Machines don't run on air, so you're just taxing resources more.
>>102762202 a big issue with twitter is that discoverability of NSFW got tanked hard between the time when AI first hit and now. And when shit was new and shiny (literally, even) people weren't adjusted to or aware enough to actively avoid the shit. If you're "just starting" now and aren't actively trying to crash in on spaces where people are not expecting there to be AI in, the only interaction/views you'll get are from automated bots. Something I post to pixiv that can hit 1k views/hundred or so likes/favorites with an AI tag would probably get 5-10x the amount if I submit it without the AI tag (or possibly more). The whole scene is just fucked in terms of discoverability and interaction. Unironically, posting to 4chinz is going to ensure that you get more unique, actually tangible views than posting to twitter if you don't already have a follower base.
>>102759492 >/ldg/ is just the 4chan branch of r/stablediffusion Where did this cope come from? I'm out of the loop.
what should I use for things like logos and stylized letters
>>102762357 learn how to use adobe illustrator.
>>102761316 fun
>>102762346 when is the last time you saw anyone talk about something that wasn't the newest model, random github repo or a reddit link? I haven't seen a discussion about workflows, colors, or *gasp* coding in this general in a very long time.
>>102762432 Are you saying there's another more in-depth /g/ ai thread or are you just upset in general? Is model quantization not tech enough for you? >workflows Where else did you find flux workflows the day it released? They all came from here. >colors Elaborate.
>>102762432 cope & seethe
>>102762357 if you only care about typography SD3 might work kek
CogVideoX finetuning. https://github.com/a-r-r-o-w/cogvideox-factory requirements.txt requires diffusers>=0.30.4, but the latest version on pypi is 0.30.3. How is this resolved?
>>102762919 >How is this resolved? you tell me
>>102762919 it's likely the dev branch
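To spell out the version mismatch from the posts above: a pin of diffusers>=0.30.4 can't be met by a 0.30.3 release, but a source build from the development branch normally carries a higher version string. Below is a minimal sketch of that comparison. The helper names and the "0.31.0.dev0" string are illustrative assumptions, and the parser deliberately ignores PEP 440 details like pre-release ordering. A source install is usually done with pip install git+https://github.com/huggingface/diffusers.

```python
from typing import Tuple


def release_tuple(version: str) -> Tuple[int, ...]:
    """Return the leading numeric components of a version string.

    Simplification (assumption): suffixes such as ".dev0" are dropped,
    so this is not a full PEP 440 comparison.
    """
    parts = []
    for piece in version.split("."):
        if not piece.isdigit():
            break
        parts.append(int(piece))
    return tuple(parts)


def satisfies_minimum(installed: str, minimum: str) -> bool:
    """True if the installed release is at least the required minimum."""
    return release_tuple(installed) >= release_tuple(minimum)


# The PyPI release mentioned in the thread does not satisfy the pin.
print(satisfies_minimum("0.30.3", "0.30.4"))
# A hypothetical dev-branch version string would satisfy it.
print(satisfies_minimum("0.31.0.dev0", "0.30.4"))
```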
>>102763018 Thanks.
is there a flamethrower lora?
>>102763234 nvm there isn't
>>102760687 >>102760702 There's probably a lot of optimizations that can bring the VRAM usage down. For example, a quick scan of their github code seems to indicate that flash attention is off (fucking can't link it, 4chan thinks the post is spam...). Vanilla attention is quadratic in both compute and memory usage; flash attn makes the memory usage linear. For a video model, there's a time dimension, so the attention operations scale like (width * height * time)^2. So flash attention reducing that from quadratic to linear (for the memory usage) should be massive savings.
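The scaling argument above can be put into rough numbers. This is a back-of-the-envelope sketch, not a measurement of any real model: the latent grid size, head dimension and fp16 storage are all made-up assumptions, and it counts a single attention head only.

```python
def attention_tokens(width: int, height: int, frames: int) -> int:
    """Token count when every latent position in every frame is one token."""
    return width * height * frames


def score_matrix_bytes(tokens: int, bytes_per_value: int = 2) -> int:
    """Memory for one full tokens x tokens score matrix (vanilla attention)."""
    return tokens * tokens * bytes_per_value


def linear_bytes(tokens: int, head_dim: int = 64, bytes_per_value: int = 2) -> int:
    """Memory for per-token Q, K, V and output of one head (linear in tokens)."""
    return 4 * tokens * head_dim * bytes_per_value


# Hypothetical latent grid: 48 x 32 positions over 16 frames.
tokens = attention_tokens(width=48, height=32, frames=16)
print(tokens)                               # token count
print(score_matrix_bytes(tokens) / 2**30)   # GiB for the quadratic score matrix
print(linear_bytes(tokens) / 2**20)         # MiB for the linear part
```

Doubling the frame count doubles the linear term but quadruples the score matrix, which is why skipping the materialized matrix matters so much more for video than for single images.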
>>102763431 So what you're saying is that these devs are somewhat fucking useless at holding from waving their dicks and posting on redi1t as soon as their code runs once without errors?
>>102762346 >Where did this cope come from? what cope? post flux r/ldg is 80% reddit, 20% lmg and 100% retarded autistic furries. flux for /ldg/ is what biden was for america
cope & seethe
>>102763585 you're wrong about the furries
>>102763618 >>102763623 post your fursona
>>102763633
nice images retards
Anybody have recommendations for a (porn) image interrogator besides deepdanbooru?
>he actually posts images
nice gens
>>102759141 Doesn't see text but you can give it videos which I think is neat.
https://files.catbox.moe/ad8nhv.webm
what would be the 512x512 equivalent resolution of a picture in portrait?
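If "equivalent" means roughly the same pixel count as 512x512, the portrait size can be worked out from the aspect ratio. A minimal sketch follows; the function name is made up, and snapping to multiples of 64 is an assumption (some UIs allow finer steps).

```python
import math
from typing import Tuple


def equal_area_resolution(base: int, ratio_w: int, ratio_h: int,
                          multiple: int = 64) -> Tuple[int, int]:
    """Width and height with about base*base pixels at ratio_w:ratio_h.

    Both sides are snapped to the nearest multiple (assumed 64 here),
    so the resulting area is only approximately equal.
    """
    area = base * base
    width = math.sqrt(area * ratio_w / ratio_h)
    height = area / width

    def snap(value: float) -> int:
        return max(multiple, round(value / multiple) * multiple)

    return snap(width), snap(height)


# A 2:3 portrait with roughly the same pixel count as 512x512.
print(equal_area_resolution(512, 2, 3))
```

Note that the common 512x768 portrait size is not equal-area: it keeps the width and adds pixels, so it costs more compute than 512x512.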
>>102759141 pixart bigma will release before we see llama.cpp support for this
>>102763793 512x512
nerds
>>102763585 >continues to cope
>>102763889 turn up your rep pen, buddy. your llm is repeating itself.
>>102763952 >>102758839
>>102763965 >>102746385 (image of me)
>>102759649 https://github.com/AIFSH/PyramidFlow-ComfyUI?tab=readme-ov-file comfy node
https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main for the anon who wanted Q4_K, here you go, I'm trying to upload the rest but huggingface can't stop giving me errors, this sucks!
>>102762919 >CogVideoX finetuning. CogVideo is deprecated now that this exists lol >>102759649
Fresh >>102764387>>102764387>>102764387
Will PyramidFlow finally lead to a local porn video model? Quality looks pretty good. Only 2b parameters, so with optimizations it should be usable for inference in 24GB, and for training you might need to rent A100s but it should be pretty computationally efficient (meaning not that expensive to finetune). Their paper says they only used 10M videos, and many times more images. So it might only take a couple 10s of thousands of images, and maybe like 1000 videos to get a decent NSFW finetune.
>>102764328 hmm im still getting the same error with this as i got when i tried to quant it myself, running it on forge. are you able to run it on comfy?
>>102764456 >are you able to run it on comfy? yep, works fine on comfy, I think Forge is fucked that's all lol
>>102764328 >yuri based
>>102764538 >based thank you fellow man of culture
>>102764456 https://github.com/lllyasviel/stable-diffusion-webui-forge/issues/1285#issuecomment-2345381995 lmao, that's because Forge still doesn't support the QK quants, what a deprecated software
>>102764456 please don't tell me you have an amd card... https://github.com/lllyasviel/stable-diffusion-webui-forge/issues/1269#issuecomment-2308971752
>>102764514 >>102764599 damn, its gonna be a pain to get the workflow ported over to comfy. i much prefer the simplicity of forge, shame the development sucks >>102764622 no i have a 3070
>>102764653 (me) i just checked though and the normal flux dev Q4_K_S by city96 works fine
>>102764653 >, its gonna be a pain to get the workflow ported over to comfy you just load the workflow of someone else? here, take my workflow: https://files.catbox.moe/bxn9tj.png
>>102764678 maybe it has to do with the fact it's an undistilled model and you have to deactivate distilled guidance on forge? what if you put cfg > 1 + distilled guidance at 1 or 0?
>>102764725 doesnt work
>>102764782 make an issue on Forge I guess, it's most likely an issue on his side
>>102764696 here comes the fun
>>102765004 you just click on "Install Missing Custom Nodes"?
>>102765004 you don't need any of them, you can delete them: Override -> for multiple GPUs; Playsound -> it makes a sound when the generation is over; Xyz plot -> to make xy plots; Select Inputs -> same thing
>>102765029 wow i didnt know that node existed but its super useful, thanks anon >>102765047 good to know because the manager couldnt find the override node which i dont need anyways
>>102764696 is there a reason to use APG instead of CFG with the de-distilled model? also, do i set the value for APG in the node itself or in the CFG parameter in KSampler, or both?
>>102765309 >do i set the value for APG in the node itself or in the CFG parameter in KSampler, or both? if you want to use APG, set APG > 1 and CFG = 1 on the KSampler; if you want to use CFG, remove the APG node (or bypass it by right clicking on it -> Bypass) and set CFG > 1 >is there a reason to use APG instead of CFG with the de-distilled model? You get fewer burns at higher values on APG than CFG >>102756850
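Why CFG = 1 on the KSampler amounts to "CFG off" can be seen from the guidance formula itself. The sketch below uses plain Python lists in place of tensors. The cfg function is the standard classifier-free guidance formula; projected_guidance is only a simplified illustration of the projection idea behind APG, under the assumption that the parallel component of the guidance update is down-weighted by a factor eta. The published method also describes norm rescaling and momentum, which are left out here, and what the ComfyUI node does internally was not checked.

```python
from typing import List

Vector = List[float]


def cfg(cond: Vector, uncond: Vector, scale: float) -> Vector:
    """Classifier-free guidance: uncond + scale * (cond - uncond)."""
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


def projected_guidance(cond: Vector, uncond: Vector, scale: float,
                       eta: float = 0.0) -> Vector:
    """Simplified APG-style update (assumption, see lead-in).

    The difference (cond - uncond) is split into a part parallel to cond
    and a part orthogonal to it; the parallel part is scaled by eta.
    With eta = 1 this reduces exactly to plain CFG.
    """
    diff = [c - u for c, u in zip(cond, uncond)]
    cond_sq = sum(c * c for c in cond)
    coeff = 0.0 if cond_sq == 0.0 else sum(d * c for d, c in zip(diff, cond)) / cond_sq
    parallel = [coeff * c for c in cond]
    orthogonal = [d - p for d, p in zip(diff, parallel)]
    update = [o + eta * p for o, p in zip(orthogonal, parallel)]
    return [c + (scale - 1.0) * x for c, x in zip(cond, update)]


cond, uncond = [1.0, 0.0], [0.0, 1.0]   # toy two-element "predictions"
print(cfg(cond, uncond, 1.0))           # scale 1 returns the conditional prediction
print(cfg(cond, uncond, 3.0))
print(projected_guidance(cond, uncond, 3.0, eta=0.0))
```

At scale 1 the unconditional term cancels, so the sampler contributes no guidance of its own and the separate guidance node is the only thing steering the result.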
>>102765384 >You get less burns at higher values on APG than CFG i thought the point of de-distill was to remove those burns?
>>102765460 it does, but like every undistilled model, you can't go too high on the CFG, so that's up to you. it works fine until cfg 4; if you want even better prompt understanding you go for APG
*taps mic* this thing still on?