Discussion of free and open source text-to-image modelsPrevious /ldg/ bred : >>102744592Chink Edition>Beginner UIFooocus: https://github.com/lllyasviel/fooocusEasyDiffusion: https://easydiffusion.github.ioMetastable: https://metastable.studio>Advanced UIForge: https://github.com/lllyasviel/stable-diffusion-webui-forgereForge: https://github.com/Panchovix/stable-diffusion-webui-reForgeAutomatic1111: https://github.com/automatic1111/stable-diffusion-webuiComfyUI: https://github.com/comfyanonymous/ComfyUIInvokeAI: https://github.com/invoke-ai/InvokeAISD.Next: https://github.com/vladmandic/automaticSwarmUI: https://github.com/mcmonkeyprojects/SwarmUI>Use a VAE if your images look washed outhttps://rentry.org/sdvae>Model Rankinghttps://imgsys.org/rankings>Models, LoRAs & traininghttps://aitracker.arthttps://huggingface.cohttps://civitai.comhttps://github.com/Nerogar/OneTrainerhttps://github.com/derrian-distro/LoRA_Easy_Training_Scriptshttps://github.com/kohya-ss/sd-scripts/tree/sd3>Fluxhttps://replicate.com/black-forest-labs/flux-1.1-prohttps://huggingface.co/spaces/black-forest-labs/FLUX.1-schnellhttps://comfyanonymous.github.io/ComfyUI_examples/flux>Pixart Sigma & Hunyuan DIThttps://huggingface.co/spaces/PixArt-alpha/PixArt-Sigmahttps://huggingface.co/comfyanonymous/hunyuan_dit_comfyuiNodes: https://github.com/city96/ComfyUI_ExtraModels>Index of guides and other toolshttps://rentry.org/sdg-linkhttps://rentry.org/rentrysd>Try online without registrationtxt2img: https://www.mage.spaceimg2img: https://huggingface.co/spaces/huggingface/diffuse-the-restsd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium>Maintain thread qualityhttps://rentry.org/debo>Related boards>>>/aco/sdg>>>/aco/aivg>>>/b/degen>>>/c/kdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/tg/slop>>>/trash/sdg>>>/u/udg>>>/vt/vtai
Blessed thread of frenship
>>102764387What's bigma?
Qu2t hallucinating.
>>102764413pixart bigma
https://github.com/AIFSH/PyramidFlow-ComfyUI?tab=readme-ov-fileHow much VRAM does it ask for?
>>102764413i dunno wassa bigma with you?
>>102764575>>102760652>https://github.com/jy0205/Pyramid-Flow/issues/12#issuecomment-2404752801>>The 384p version requires around 26GB memory, and the 768p version requires around 40GB memory (we do not have the exact number because the cache mechanism on 80GB GPU)
>>102764387>/ldg/ returning to it's chang rootsnature is healing
AMD unveils new AI chips to compete with Nvidia.
>>102764727we're more likely to see a completely new AI company making hardware from China than AMD seriously competing in AI
>>102764727it's useless, they'll always be below Nvdia because of CUDA
>>102764768Make the chips compatible with CUDA. Simple. Right?
>>102764785it's been more than 5 years they tried that, they got somewhere but it's still not closehttps://github.com/vosen/ZLUDA
Babe wake up, they improved SDXLhttps://huggingface.co/comin/IterComphttps://civitai.com/models/840857/itercomp
>>102764930>SDXLI sleep. Why are people wasting so much money on an objectively bad architecture.
>>102764595GGUF when?
>>102765034Ikr, today we got that video model that uses SD3 (to be fair they said they're retraining everything from scratch) and now this IterComp for SDXL, it's flux who needs love, not deprecated models
>>102765061I wouldn't say no to someone retraining SD3 or just making a 3B model.
https://github.com/jy0205/Pyramid-Flowhttps://huggingface.co/spaces/Pyramid-Flow/pyramid-flowthere's a demo now
>>102765147If you go for 24fps you'll only get 1 sec lol
>>102765147>>102765236tried a few times to get this to look at the camera but it just sorta wiggled like yours each time :/oh well not paying gpu minutes to try more, will wait for it to run in under 24gb
>>102765236>>102765147went for 8 fps and... kek
>>102765061wow, surprise, turns out all the people who know what they're doing came to the conclusion that flux is rigid, overhyped, and not worth the training costs. it's simply not a 12b-tier model. bloated with synthetic garbage and still requires sdxl refiner to unslop. bake again
>>102765279>will wait for it to run in under 24gbthis model will be history anyway, they're retraining it from scratch to get the best model possiblehttps://github.com/jy0205/Pyramid-Flow
bigma will save us
>>102765304>turns out all the people who know what they're doing came to the conclusion that flux is rigid, overhyped, and not worth the training costs.and so for you going for the most broken base model ever (SD3M) was a good idea? get the fuck out of there
>>102765304>flux is rigid, overhyped, and not worth the training costs. it's simply not a 12b-tier model. bloated with synthetic garbageAll the CFG antiburners are cope too
>>102765339>All the CFG antiburners are cope toogood thing we don't need any CFG antiburners anymore with the undistilled modelshttps://huggingface.co/nyanko7/flux-dev-de-distillhttps://huggingface.co/ashen0209/Flux-Dev2Pro
>>102765279what is putting this thing so high VRAM? The models seems all under 10GB
>>102765305Wow, SD3 is so shit that even CCP reject this shit.
>>102765370because making pictures asks for vram anon, and 24fps + 10 sec means 240 pictures that have to be rendered at the same time, it's like you went for 240 batch size on SD models
>>102765365once someone makes a killer full real fintune then ill be interested
>>102765083>I wouldn't say no to someone retraining SD3 or just making a 3B model.I would take retrained 1.5 at this point. V-prediction if possible
>>102765481Just train Pixart Sigma then.
>>102765481> retrained 1.5 why? unet is definitely inferior to a DiT architecture>V-predictionwhat's that?
>>102765402gotcha. I have two cards. I have been looking for a way to split the VRAM requirements. I am not seeing anywhere if that is supported. Models on one card and processing on the other seems like you could get 26GB pretty easily. The larger two text encoders are enough to drop it below 24GB.
>>102765537>The larger two text encoders are enough to drop it below 24GB.what text encoders are they using? T5?
>>102765548I have no idea. There are just folders named text_encoder_1, text_encoder_2 and text_encoder_3. I don't see them used in the code either so I am not sure what is going on. I assume you need them, but I haven't dug that far. Hopefully another anon will know.
>>102765147>based on SD3Myeah I can see that
better start saving up bros
>>102765642who the fuck is gonna buy the 5080 and the 5070? Do they pretend the 3090 and the 4090 doesn't exist?
Is there any video model that doesn't do the thing where when you give it a painting, it just kind of does a Ken Burns slow panning effect on it instead of animating it?
>>102765680Minimax actually animate shit but it's not a local model so...
>>102765673aren't they discontinuing the 4090 already?
>>102765733why would they keep manufacturing 4090s?
>>102765746easy money
>>102765781you clearly have never run a businesswhat happens when they release the 5090?the factory has capacity limits you know?why would they sell new 4090s and 5090s side by side?can you do a business plan that doesn't involve you, as a greedy poorfag, getting a new 4090 for $1000?
>>102765805what are you on about?
Pyramid 8 fps img2vid: A middle aged female scientist watches a fantastic machine the spins and whirrs with sparks until a peice of fried chicken falls out from the glowing blue middle of the machineSeems you get 1 gen then have to wait.
>>102765833I personally want to win a billion dollars
>>102765842not bad with interpolation
>>102764387>pic>authors: ching chong ping pong suk mai ding dongdropped
>>102765885like it or not anon, but the chinks are the kings of video models, Kling, Minimax, CogVideo, Pyramid...
>>102765781that's what the 5080/5070 is for
>>102765885might as well drop this entire hobby then lol
do you think any of them are cute chinese girls?
>>102765949i like to imagine the sweat and juices of many underpaid chinese jade beauties that touched my nvidia gpu during production
which version of pytorch should i use with comfy? i remember seeing a comparison image that showed some versions are better than others but forgot whichor is it all just placebo?
>>102764930Been testing this. Seems pretty decent.
>>102765895It's because boob jiggle triggers all the safety teams
>>102765877I'd duplicate the space and try native 24fps but im not paying for it.
>>102766001do you happen to have any examples that arent super sloppafied like picrel
>>102766044we can finally move on from flux
>>102765323yeah and they'd rather train their own model from scratch than use flux
>>102766115that's because flux is too big, their 3b model already asks for fucking 40gb of vram
>>102766001>Been testing this. Seems pretty decent.care to show some examples
>Most of the sample pictures on all loras are done with controlnet/img2img so expect different results if you trying to remix with the civitai generator.You stupid buzz farming asswipes. Documentation is the most important part of all of this. t. personal blog man
>>102766057"finally"hasn't it only been out like 2 months
>>102766057>we can finally move on from fluxso you heard one comment from a single anon (that has no images on top of that) and that's it? it's enough for you to make this insane conclusion?>t. the least disingenuous Flux hater
>>102766057i kekd
>>102766217hello saar, how much did the black forest labs pay you?
>>102766001is this command line only at this point?>>102766196the hype cycle has been at least 8. I want to say it started when comfyanon got shit canned (yes, that is bait).
>>102766249I ask you this question saar, how much did SAI pay you to smear Flux like that?
Bigma
>>102766266no need to pay me anything to smear flux saar, if you want smear just generate realistic gen with base flux, skin already look smear saar
>>102766287Explain why Flux is so hyped even though for you it's the worst model ever, Lykon.
>>102766170Here's something>>102766263>is this command line only at this point?I'm using the safetensor conversion
>>102766332>I'm using the safetensor conversionon comfyUi? Forge?
>>102766310>hypedthat's all it is saar, hyped. people used it during a great image gen drought, was impressed by prompt understanding and text capabilities, then they saw through it's cracked and got bored. it's been months and nothing has happened. flux isn't even open source.
>>102766345>flux isn't even open source.Schnell is Apache 2.0, SD3 has a shit licence, nice bait saar
>>102766342>on comfyUi? Forge?reforge
>>102766358>SchnellSch-BRAAAAAAAAAAAAAAAAAAP 8 step unfinetunable distilled BRAAAAAAAAAAAAAAAAAAP
>>102766382>unfinetunable distilledUh oh...https://huggingface.co/ostris/OpenFLUX.1
>>102766399spoken like a true saar!>they left us their dookie doo doo to eati'll be waiting for progress!
>>102766382https://huggingface.co/stabilityai/stable-diffusion-3-medium>Downloads last month 42,476https://huggingface.co/black-forest-labs/FLUX.1-dev>Downloads last month 1,130,973lmao
>>102766425looks overcooked as fuck, maybe your CFG is too high
>>102765949no. girls should stay far far away from this area. they'll simply fuck everything up by lobomotizing the models to make them "safe for women". we need the undivided attention of touch-starved chinks to fuel progress and women will, at best, be a major distraction.
>>102766467>no. girls should stay far far away from this area. they'll simply fuck everything up by lobomotizing the models to make them "safe for womethis, we've seen the disaster when women went onto the video game industry, they made every female MC ugly because they're jealous of beautiful women
>>102766467this. and if you're desperate just i2i a picture of your face
>>102765949>do you think any of them are cute chinese girls?I don't really care who's behind this, the only thing that matter to me is the result, I just want a good product at the end.
>>102766587but it would be cooler if some of the ones behind it were cute girls who are cute to look at
People love to scrutinize the small details in AI images. So don't give them any. You need to be blurmaxxing
>>102767125The perspective is fucked up which is ironic because the blur makes it even more apparent
>>102764930>they improved SDXLCan this be used on Flux aswell?
>>102767125based and blurpilled
>>102767125>>102767217>>102767273>generating supersized thumbnailsbut why
>>102767317I am assuming /sdg/ is shitposting/spamming the thread
>>102767349why would that be your first assumption?
>>102767366hes retarded
>>102767366there is a history of them trolling the thread and it has been stupid women shit, flux vs sd things and more images than this thread usually supports. If it smells like a duck and it clearly underage /sdg/ wants to fuck it.
>>102767317There is no such thing as style. Style IS content. An image is whole, contiguous, a fully-connected network of latent layers.
>>102767317Do you not how how latent space works?
>>102765642stop posting slop rumors you gossipy troon
>/ldg/ gens a few months from nowyo guys check out my gen!
>>102767486>/ldg/ gens a few months from nowat the current rate it's optimistic to predict that there will be /ldg/ gens a few months from now
>1.5: lacks the prompt coherence of later models>XL: lacks the level of detail present in later models>Pixart Sigma: lacks enough training>Kolors: lacks comprehension of the english language>HunyuanDiT: lacks non asian girl selfie dataset >SD3: lacks anatomy>Flux: lacks reasonable hardware requirements It will never be as good as it once was.
>>102767515bigma will save us im sure of it
>>102766622>>102766703>>102766425someone please fucking fix the AI lighting problem already. i've seen more realistic shit on deviantart
>>102767515only the stongest will survive
>>1027675155090 and Titan AI will make bespoke 1B-3B models very common very soon.I'm hoping the new Pixart architecture is friendly to this but if not Pixart Sigma is more than capable. I'll likely make a pretrained 16 channel VAE that is designed for training on 5090s for the purpose of truly having interesting full fine tunes rather than stacking Loras.
>>102765147Did you know that the guys who open sourced Pyramid flow are the same guys who made Kling? https://www.youtube.com/watch?v=GD6qtc2_AQA
>>102767747Pyramid - Zhicheng Sun - Peking University - Haidian, Beijing, ChinaKuaishou AI - Haidian District, BeijingYou got anything to backup this bullshit claim?
>>102767952I should have said that they only thing that connects these things are they exist in the same location.
>>102767952there's some guy from the Kuaishou Technology, it's the company that made Kling innit?
>>102767877she's cute
>>102768016funding a uni project is a far different being the guy who made Kling. He will probably be working for Kling shortly, but I can't find anything that says that the he does now or has the past.
>>102768046>funding a uni project is a far different being the guy who made Kling.they're not just funding it, there's literally guys who are in the company that made Kling who participed in this paper, what else do you want?
>>102768057linkedin or Chinese equivalent. Zhicheng Sun seems legit. I could be hoping that he doesn't have such ties to corporate ideals.
>>102768016In china, is the government who direct all, there are nor companies.
>>102768172So you're telling me that it's Xi Jinping who decided to give us all good local models for free? Damn he's based! I love china now!
>>102768197Yes also, with the western restriction, they cannot buy their models so openly like JewAI, so their response would make their model open and free, so they reduce the gains of Jews.
>>102768219Who would've guessed that the chinks would be the ones who would save us all during this AI clown circus show? Not me, I'm pleasently surprised, any help is a good help
https://sihyun.me/REPA/this shit is interesting, it makes the model learn concepts way faster than the usual
>>102768437you have a link that I can trust?
>>102768478Surehttps://github.com/sihyun-yu/REPA
>>102768437Going to pull apart this buddy, I'm dying to do a new diffusion model. 17x is insane
>>102768487thanks. Looks promising enough to ignore the python3.9 version.
>>102768497>17x is insanenot just that, the final loss function is even lower at the end, so your model will be even better with that technique
>>102768437I love those papers, the more we improve the training process, the more accessible it'll be for everyone, at some point we won't have to rely on multi million dollar companies to make good shit
Is MeshGraphormer still the goto for hands?>>102768721mod approved edit. Stupid accidental cameltoe
>>102764387
>>102767832Very cool
>>102768571you will because you still need the huge datasets that we don't have.
>>102767545>flux lacks reasonable hardware req512x512 flux-dev-nf4 works fine with midrange cards
>>102770061>you still need the huge datasets that we don't have.it's not hard to get a dataset, you use Laion, you scrap some of them on the internet...
>>102770786was't laion taken down? due to CSAM or something?I'll always be haunted by the time I CLIP searched Laion for "pretty college girl cleavage" and a literal picture of my old next-door neighbor was in the results
>>102770815>was't laion taken down?no they brought that back recently after cleaning it
>>102770722"works fine" more like "cool to see for the first time, then you realize it's not worth it"
>>102770786laion being garbage is the reason sd1.5 and XL are so rudimentary.
>>102770835now you're making a different complaint. one I disagree with
>>102770844i should have phrased it differently. recommended hardware requirements. it's a big model. quants don't really improve speed just space optimization.
>>102770815I scraped millions of images using duckduckgo, it's not hard. Just get ChatGPT to generate thousands of search queries and download everything high resolution.
>>102770901>it's a big model. quants don't really improve speed just space optimization.it's true, I wished it would be faster to render a single image on Flux, especially when I'm CFGmaxxing
>>102771230lol the one on the left bed>"Sir you need to put your blankets over your lower body..."
>>102764413bigma ballz
>>102765147https://pyramid-flow.github.io/I have a serious question here, why are the scores so close to each other? Kling is miles ahead that Pyramid model yet the number suggests they're on the same level, that's complete bullshit
>>102771632it only took you two years to realize benchmarks are meaningless
>>102770904>and download everything high resolution.And why would I want to limit myself that way?
Time for some REPA of ass. Trying 16 channel VAE training, too bad it's based on just a 256px crop model.
>>102771829can REPA be used for finetunes aswell? we would improve Flux a lot with it
>>102771856You're essentially use CLIP as a regulation technique when computing losses, so yes. I'm sure there are other ways to apply it too.
>>102771867>You're essentially use CLIP as a regulation technique when computing lossesimagine if you use T5, goddam the possibilities are endless
>>102772076They're using the image features, so it would be more like using Florence to create losses.
>>102767515Preach it sister!
is this whole gay ass fucking website dead? maybe the nukes started flying in the mid east and we didn't hear about it yet? >>102772216hell yeah brother, comfysisters btfo
I uploaded the Q8_0 version of dev2pro (another undistilled dev model)https://huggingface.co/TheYuriLover/Flux-Dev2Pro-GGUF/tree/mainI still prefer de-distill but it's not that bad
heh
>>102772246>is this whole gay ass fucking website dead?A lot of people have been banned.
>>102772746probably for the best, but i dismay at the apparent attrition for our diffusion threads.
Pyramid is saved! (after the comfy integration FUCKED some peoples Comfy setups, mine included lol (downgrade numpy then use checkpoint to repair shit/uninstall problem nodes, delete the integration and re add the broken nodes))(really bad release so far but they are retraining to shake off the SD3 sauce)
i waste this gen on you lotnot because i must,but because i can,ps 49 times...
>>102772912I lost. Fuck that guy. Fuck anyone who just posts solutions at random. >python 3.8 is historyEnd of life was Monday. Fuck him and his fucking waste of resources that he causes.
>>102772912I'm not gonna go that path, they're retraining their model so I'll wait until they got the best one out of the nature
>>102773170The code is there on github, why doesnt he just rewrite it to be compatible with 3.10?Oh yeah i forgot, he's a money grabbing women-like (complain, don't offer a solution or do any coding work towards it then hold up a sign behind a paywall that says "I made this" while pointing to the work of others that you complained about) grifter.
>>102764387ahh so thats what sailor moon would look like if she had downs syndrome
>>102765642I could believe the 5090 but the others seem implausible. Anything less than 16GB for a xx70 seems pointless, and $1k+ for a 16GB card also seems like a hard sell. I could imagine them nickel and diming with 20GB for the xx80 though.
>>102773815>$1k+ for a 16GB card also seems like a hard sell.don't forget those are graphic cards, you don't need much more than 16gb to run the latest games, so people won't mind if it gets better speed than the 4090
>>102773821I don't know about that, you don't really need more than 16GB of RAM for games either, but people still buy 64GB
>>102767562Is this AI? Model/catbox?
>>102771148Love this
I tried to get flux to do some lazy halloween costumes. I created an image and then got flux to do i2i. Left is when I let flux have high denoise then how much flux is used lowers as it goes left. Is this because I had Halloween words in there and it wanted to turn it animated or simply a skill issue. I saw the strap. I don't care if I am testing.
https://xcancel.com/cubiq/status/1844332817767072128#mkek, Pyramid Flow looks fun to play with, too bad it's asking for too much VRAM though
>>102773913sorry i didn't bother saving that specific gen but this should have the same prompt and settings i used>https://files.catbox.moe/t1y61h.pngit's llustriousXL_smoothftSPO with 10 sampling steps downscaled to 256x256 to make it extra blurry and appealing for the average flux user. i stole most of the prompt from an anon in /h/
>>102774404>i stole most of the prompt from an anon in /h/>>>8251510this one
>>102774460nice
>>102774460how do i crosspost>>>/h/8251510
>>102774492>how do i crosspostyeah it worked anon, everyone is talking about that illustrous model, did it deprecate pony or it's not cooked enough yet?
>>102774502>did it deprecate pony or it's not cooked enough yet?for me, both. looks way better than ponysloppusion but i don't do hardcore sex gens so not sure about that but it's also undercooked, kind of unstable but the smoothftspo tune helps alot with that. it knows alot of artists but because it's undercooked they are only really useful for mixing styles, i recommend it.
>>102766382>dedistill>finetune>enjoy
>>102774788ftfy
>>102774891Did you just inpaint the nipples
>>102775030yeah
>>102775107Nice
> | 7/8 [15:33<02:36, 156.20s/it]> | 7/8 [15:25<02:33, 153.38s/it]My 16GB on Flux Q8 ;_;
>>102774693Nice
>>102775463is your batch size higher than one?also use a lower quant, Q6_K should be about as good, the K stands for quality
>>102775640Batch size is a mere 1.
>>102775664are you loading T5 and clip on cpu or gpu?
>>102775691t5xx16fp, >clip on cpu or gpu?Swap location: Shared
>>102775729forge?try the other swap location
>>102775760>the other swap locationThat's slow (4.3something s/it) on Q4 and NF4 already.
>>102775765i think swap location might be for the model layers then, and not the text encodersim actually not sure where it loads T5 and clip, and if you have an option to change itmaybe check your memory stats while everything is loading so you can identify what goes where, and consider trying it with comfy instead or just swapping to a lower quantalso, if that 16GB happens to be an AMD card, i think it is going to be slower regardless and you should look online for how other people deal with it
>>102775824The card is a 4060TI. And I don't think it lets me set where to load T5 and Clip, according to console it looks like it puts everything in VRAM.>Skipping unconditional conditioning when CFG = 1. Negative Prompts are ignored.>[Unload] Trying to free 13464.34 MB for cuda:0 with 0 models keep loaded ... Done.>[Memory Management] Target: JointTextEncoder, Free GPU: 14539.60 MB, Model Require: 9569.49 MB, Previously Loaded: 0.00 MB, Inference Require: 1024.00 MB, Remaining: 3946.11 MB, >All loaded to GPU.>Moving model(s) has taken 24.69 seconds>Distilled CFG Scale: 3.5>[Unload] Trying to free 17053.25 MB for cuda:0 with 0 models keep loaded ... Current free memory is 4883.03 MB ... Unload model JointTextEncoder Done.>[Memory Management] Target: KModel, Free GPU: 14530.14 MB, Model Require: 12125.39 MB, Previously Loaded: 0.00 MB, Inference Require: 1024.00 MB, Remaining: 1380.74 MB, All loaded to GPU.>Moving model(s) has taken 69.72 seconds>100%| | 8/8 [18:50<00:00, 141.33s/it]>[Unload] Trying to free 4495.77 MB for cuda:0 with 0 models keep loaded ... Current free memory is 3353.52 MB ... Unload model KModel Done. | 8/8 [18:42<00:00, 167.05s/it]>[Memory Management] Target: IntegratedAutoencoderKL, Free GPU: 14528.17 MB, Model Require: 159.87 MB, Previously Loaded: 0.00 MB, Inference Require: 1024.00 MB, Remaining: >13344.30 MB, All loaded to GPU.>Moving model(s) has taken 171.20 seconds>Total progress: 100%| | 8/8 [21:35<00:00, 161.89s/it]
>>102775869looks like it does but also unloads them and then fully loads the Q8 into vram so there's no way it should be that sloware you trying to gen images in 4k or something?
>>102764387Never tough id say this, but that downie is looking kind of hot XD
>>102775897>unloads them and then fully loads the Q8 into vramPossibly there's a part from previous gen with NF4 model. Loading the Q8 took more the 24s what it shows in the pasta.>are you trying to gen images in 4k or something?Just 1mp with the default preset of Forge for Flux at 896x1152
>>102775928if you switched from NF4 to this Q8 without restarting once ever since, unironically try turning it off and on againforge is not free of bugs unfortunately
>>102776031I don't know, I rather would not restart Forge because one of the bugs is that it removes the generated image from the UI when it finished a new gen after a restart. Need to reload UI too which resets all the current parameters and prompt and resets to default preset.However switching from Flux to SDXL and back or swapping different Flux models doesn't impact the speed of Flux nor XL.
>>102776106it might remove it from the UI but should still be in the outputs folderthere is a PNG info tab that you can drag your image into and then click "Send to txt2img"you can also store your settings as a preset with a plugin, look it up
Noob question. But when I use a lora. Does the lora eat the steps in my settings or does it use its own steps?
>>102776179the weights get merged onto your model during inference, before the steps, so yeah same settings
>>102776163Yeah I know but it's still annoying, that I have to do that, open file browser, navigate to folder, drag it into the info tag... I rather set the three numbers I changed again and copy the prompt before reload. But then reloading itself again takes a time.
>>102776237just try it once to see if it fixes the issue
When will Nvidia increase the number of threads?
I fucking hate python's dependecy system and conda I finally got Pyramid Flow running on my computer
>>102776429>python's dependecy system and condaAnd they're unaware that they suck and package management.
>>102776429Not as slow as I thought it would be
>>102776429>filtered by python -m venv venv
>>102764387https://civitai.com/models/836888/flux1-schnell-fp8This one is roughly 16 GB https://civitai.com/models/622579/flux1-dev-fp8This one is around 11 GBhttps://huggingface.co/city96/FLUX.1-schnell-gguf/tree/mainAnd these ring from two gigs to 20 gigs plus. What would be the best one to use if file size reduction and generation speed are priority for you? Also how are people even pruning these models? Does anyone know how to do that?
>>102776557The packages that came on requirements.txt weren't compatible with each other and I had to modify the code to make it work because these chinks don't know how numpy arrays workAnd conda was a pain in the ass to set up
>>102776546Nvm s/it blew up in the next few steps and now it's at 52s/it at 12th iteration
>>102776578>bloo bloo bloo it's not compatible with my bastard comfyui setup with dozens of custom modules with their own requirements
>>102776602I just want things to werk, I got things to do besides modifying retarded code and figuring out which combination of versions of 30 modules makes the retarded code work.
Wanted to test how well the model knows real life physics, it's better than I expected but I asked for the avocado to be falling inside a bucket full of water, not the water to fall in a bucket full of avocado
>>102776641Your expectations don't align with the cutting edge software you're working with. Whether you like it or not you're not working with consumer tools or software. Feel free to come back in 10 years when it's all packaged into an app for your phone.
1girl supremacy
>>102776429What are you running to make that possible? >>102776576The Q1 version. Be aware you asked a speed, size, quality question. QuantificationYes. There are many how to quant guides out there. >>102776818262/86 ratio with low/no 1girl yous. Plus the asswipe flooding the thread with blurred pics. I weep for the lack of 1girl supremacy.
>>102776882>asswipe flooding the thread with blurred picsyou wouldn't get it
>>102776882>What are you running to make that possible?3090, it's using 23.5GB
Gunna REPA the Sigma in the butt
>>102776578It's very telling that the chink devs CANNOT construct a requirements.txt that works in a new environment.Personally i do not trust this project, they seem to have the skill level of undergrads who have copied someone elses work and really have no idea how to present it to the outside world.
txt2vid in pyramid is surprisingly good, kudos to the creatorsbut the img2vid is very bad
>>102776576>generation speedThey don't speed up inference like that unfortunately. Flux will always be a monster.
>>102777189the models itself is pretty good and I don't think a bunch of undergrads would have access to 20k hours of A100, maybe they were using some other version of numpy or torch or whatever but they should indicate that imo
>>102777218you're talking to a seething no coder whose experience with software is downloading apps on Android
>>102777228that's me you fucking retardyou can't even follow the order in a conversation, how would you feel if you hadn't eaten breakfast?
>>102777254I don't care, you're both retarded.>someone made a model I really really want to use>but they must be incompetent thoughCan you at least be a tad more intelligent? Or are you really just an entitled faggot that is mad people who give things for free to him isn't doing it to the standards of his silver spoon life?
we were never meant to have local video gen its too powerful an idea
>>102777296Im telling you I had to fix their own code because they were trying to convert a python array to a tensor using a numpy methodYou sound underage, go back to wherever you came from
>>102777296>Basic intelligence is a gift you pig!Maybe in your world, not in AI land, your world being Chinese land btw.
>>102777320clearly the code work on their systemI don't careI'm more inclined to believe you are a retard
>>102777333Feel free not to use the model since China is le ebil, but it makes me laugh how you have to use it
>>102777336>works on my machineso you are the retarded nocoder? fucking hell leave 4chan you sound new and tryhardy
>>102777367I know you must be retarded but "it works on my machine" basically says it's an ID-10-T error. Troubleshoot the problem between the chair and the computer. After being here long enough I've realized you people can't follow basic instructions.
>>102776882>the asswipe flooding the thread with blurred picsblurred 1girl pics*
Holy shit, REPA just werks
>>102764387What Local Model is 100% privacy friendly, not allowing anything to go out from your computer?
>>102777687>which CSV is 100% privacy friendly
>>102777600HYPE
>>102777739I wonder what happens if you stack of perceptual loss, since you're already doing CLIP which requires images you could put perceptual loss on it as well and you're probably going to get some great results and alignment.
I miss titty elfs
>>102775916>Never tough id say this, but that downie is looking kind of hot XDLike wise man once said: "those titties ain't retarded"
>>102776882>low/no 1girl yousdesu skill issue
>>102772912>>102773170>>102773368I don't understand. Based turkman helped some rando (for free, mind you) and you're upset?
Anyone knows why pyramid flow imge2vid doesn't work? I get mostly still image and is barely a video. Sometimes it does do something
>>102772912yeah that was I was talking aboutbtw the solution is using python 3.9, I had to do that and downgrade numpy to 2.0, and then fix line 146 of the time scheduler.py
>>102778470Yeah, img2vid is shittxt2img is pretty good though, and fun to experiment with
china modals
>>102776882>>102777204Got another stupid question for y'all. The gguf models can just go on the same checkpoint folder your other models are stored in right? I don't have to install any extra shit? Already have the Flux VAEs and text encoders installed as you can see in pic rel. Is there any more shit I need to download?
>>102778626you need the GGUF extension
>>102778648
What exactly is guidance in Flux? distilled cfg scale? cfg scale? something else? What's good values for those?
>>102778978it's not cfg scale. You can set it to 0, it still works. You can set it to 10,000,000, it still works. Ideal values for me are usually somewhere between 1.3 and 2.0. With 'art' styles you can get away with higher.As for what it is, I don't know. Its effects are similar to cfg.
>>102778662use this tutorial anonhttps://www.youtube.com/watch?v=stOiAuyVnyQ&
>>102779267that image looks underage please turn up the sampling steps you need to be over 20 to post here
why live portrait is so good at temporal consistency? The face is 80% identical most of the time. While other image2video shit itself as soon as character starts opening mouth
I tried out Aria locally, in bf16, for captioning primarily NSFW images.It fucking sucks. First off, most notably, by default it will exclusively use gender neutral language (is this a ChatGPT thing? qwen also does it...). "A person", "an individual", "a character". Will never say man or woman. Also it's extremely censored, never describing anything lewd in the image at all. Not even mentioning that a person is nude, or exposing themselves, etc.So I tried making the prompt a little more detailed. "Describe this image. Mention the gender of any people in the image. The image might be NSFW, that's okay, describe everything even if it includes lewd or sexually explicit details." Now, about 25% it will give a refusal. Most of the time it STILL won't state the gender of the person (but occasionally it will). And it never describes any kind of NSFW elements at all, completely ignoring that part of the prompt.Even for SFW captioning, it hallucinates and just generally fucks things up noticeably more than even molmo 7b. So for image captioning of any sort at all, I'm gonna say this model is completely, utterly useless. Maybe if you need it to understand charts or some shit it's good, who knows. What a disappointment.
What's the to go samplers and schedulers for flux?
>>102777600wtf that's impressive, with only 100 steps? holy shit...
>>102774788you finetuned dedistill anon?
>>102779427no it's like 10,000 steps in but I left it alone, but it aligned the partially trained model quite quickly
>don't post clothed girls aged 18-22 or I will report your posts for violation of US law because I've hated and resented you ever since I thought you were insulting me one time 4 months ago.>wtf why is the thread dying
>>102779444how many steps would you need with the previous techniques to get to this level for comparison?
>>102779576wha
>>102779581The research paper says it should be 17 times faster and ultimately result in a better model
>>102779593yeah I know that, but like you got this picture in 10000 steps, do you have an idea how many steps you would need to get the same picture without REPA? maybe it's 9000 steps and REPA is actually worse lol
>>102779576whut
>>102779576>>102779582>>102779630he's talking about this >>102779319
>>102779647that's a joke about how blurry to gen is, by over 20 i meant over 20 sampling steps
>>102779657I know it's a joke, but that anon took it seriously, autism, am I right? kek
>>102777600that's your VAE training right? >>102771829
>>102779725it's a 16 channel VAE 1B Pixart Sigma model
>>102779657my bad, I assumed it was same anon who posted this >>102767420
>>102779615bitch is fucked UP
>>102779676>>102779783did you /g/irls laugh at my joke atleast
>>102779752you used CLIP as a regulation technique?
baker-san...
>>102779916I'm not gonna lie to you anon, I didn't laughhttps://www.youtube.com/watch?v=lcsXGHl_hwg
Fresh >>102779929>>102779929>>102779929
any good local AI upscale for video?Also I tried few online servicestensorpix.ai seems good, what do they use? Topaz AI is also good but you need to manually tune it.