Discussion of Free and Open Source Text-to-Image/Video ModelsPrev: >>107449372https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/kohya-ss/sd-scriptshttps://github.com/tdrussell/diffusion-pipehttps://github.com/ostris/ai-toolkit>Z Image Turbohttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>WanXhttps://github.com/Wan-Video/Wan2.2>NetaYumehttps://civitai.com/models/1790792?modelVersionId=2298660https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
>>107452633bottom right is sick
Blessed thread of no /adt/ spidershizo
>anno domini 3025>everyone in the world has no more mouths but must scream>/ldg/ still exists>it still has permanent 10 autists still posting the same thing they did for a thousand years>wan 2.2 is still being used>most wan 2.2 gens are still made with bad quants, lightning loras and settings so they somehow still have color shift at the start despite that problem being solved near the very beginning of lightnings loras v2 existance>some wan 2.2 gens are still not interpolated, with users citing that the picoseconds it takes to run film-vfi-v40 is too much of a cost>a month after v79 release you still see daily "z image incorporated will NEVER release z-image-base-agi-v80"
>>107452641still using Z, but I switch between it and chroma. Z is a great for iterating on a prompt and trying stuff out, then copying the prompt to chroma for better aesthetics.
>>107452670accurate and balanced, as it should be
>>107452670>anno domini 3025>everyone in the world has no more mouths but must scream>despite omnimodal AGI fitting into 2GB of holographic crystal glass floppy disc storage you still see hourly posts about "local will die in 2 weeks after the this tech gigabubble finally pops and all this developed tech just disappears overnight like the internet did after the dot com bubble">the Global Government's Ministry of Technological HyperAdvancement based in Tel Aviv releases the newest quarterly consumer GPU to be sold, the RTX 900,000,050, now with 8.01GB of VRAM***>(*Assuming the user wins the "NVIDIA's Official Fair Sillicone Lottery System" which has a Snopes Fact Verified(tm) high* chance* of happening, resulting in the grand* megajackpot prize of those huge, extra 0.01GB VRAM (with a bonus microlottery attached which also rolls for a chance of ALL (yes, ALL :) ) GPU ROPS being included AT THE SAME TIME.)
lowres zimage feels like back to 1.5 but without fucked up hands
>>107452756>(*This will also require the newest optimization technology CuSpeed which, in order to compliment the huge VRAM improvement, requires all of those GPUs to have 127bit width memory bus to ensure full CloudfyUI Nodes 2.01 workflow stability, preventing over-the-network capability of Clodflare to accidentaly* remotely set your* GPU on fire after decrypting its homomorphic encryption to ensure the safety of your generated integers.)>(*The user must also install the latest open-source* NVIDIA driver with the Windows 10010 (The official last version* of Windows ever) with a system-on-a-chip security related* NVIDIA Management Engine forwarding all queued matrix multiplications to Palantir for scanning*.)>"But wait--there is also even more good* news*! The user that wins this--will also be allowed to apply for his weekly tour to the Wailing Wall--to be fast tracked for free* too!" (Available only after a 35 shekel returnable* safety* deposit* and NVIDIA STADIA PRO MAX PLUS++ users* (*that are only paying their weekly debt-based subscription in desoldered 2MB RAM chips from scavanged computer DDR2 RAM sticks from the old world.)
>>107452641z image is good apart from the fact that it keeps generating the same handful of images
whats blud goin on about
>>107452761wow...1girl...
>>107452788That’s all we genned back then, and it still felt like a miracle
When did you first get into local image gen?https://poal.me/vgi7svhttps://poal.me/vgi7svhttps://poal.me/vgi7svhttps://poal.me/vgi7svhttps://poal.me/vgi7svhttps://poal.me/vgi7sv
>>107452808>back thenthat's all we do NOW
>>107452816What if it was before 2022
>>107452788the slow struggle of more 1girl
>>1074528162017https://github.com/makegirlsmoe
where the fuck all the good posters go to? feels like there was a spike in creativity first day after zit release and now it's nothing but complete garbage
>>107452853this isn't an art dump, you dweeb. this is for local diffusion discussion.
>>107452853>spike in creativityKEK
dont worry anon ill get back to posting kino soon
I was promised one (1) base model
>>107452853Most good posters from /ldg/ and /lmg/ migrated to Discord a while ago. those threads are just for shitposting
>>107452883which discord?
>>107452835I remember running dalle mini and similar stuff on Google Colab before Huggingface Spaces was a thing. I remember having to wait several minutes to get a single shitty (but sometimes funny/soulful) gensSome anons used to post threads with Colab links for those things
i think it's time to move on from zit
Why are your 1girls always standing? Can they do ANYTHING ELSE?!?!
>>107452887you're there if you belong
>>107452853It dragged me back to imagegen for awhile to see what it can do, and the speed makes it a lot of fun, but I guess you quickly run into the limits and people move on until progress happens again
>>107452889time to pop this zit?
>>107452756>despite omnimodal AGI fitting into 2GB of holographic crystal glass floppy disc storage you still see hourly posts about "local will die in 2 weeks after the this tech gigabubble finally pops and all this developed tech just disappears overnight like the internet did after the dot com bubble"Lmfao, made my night
>>107452852further back is a bad idea
>>107452892so you can see the whole body
>>107452816I remember using this thing called Looking Glass that was basically pre-SD image training.
>>107452853Thread was botted by chinks heavily to create more hype, but now that they decided to delay the model to censor it, they probably decided to disable most of the bots
>>107452853Happens every single time a new model drops. People pump what out limited ideas they have, get bored, move on with their lives. It's mainly schizos who refuse to move on + coomers + schizocoomers. No one sane is dedicating their life to entertaining this site with image gen.
Can someone smarter than me tell me why adding steps makes her stop yapping?
>>107452898This. I usually check out a promising model for a bit and then stop genning entirely until the next release.
>>107452925just a proof that ai shitters are nothing but adhd zoomers who want the new big thing in a vain attempt to fill the void inside (it never works)"replace artists" my ass lol
>>107452950you sound bitter about this KEK. fuck artists, if i see your art and i kind of like it, i'll train a lora on it and post it to civit for all to have.
>>107452816Honestly 2022.I was running a shitty GAN called FuseDream in 2022."Regular hobby" is 2023 though.
>>107452853Most people see the schizo spam in these threads and move on to somewhere more productive
why the fuck is he trying so hard to create artificial discussion with this non-stop samefagging
Since we're not getting the base model, i'm training a lora on this overnight. I'll get back to you with the results tomorrow
>>107452970can you see into the code? is HE in your walls?
>>107452969All AI threads are like this, but ldg feels especially shitty, it really is a fucking dumpster. But at least it was fast (at being fucking garbage), other threads are just dead
>>107452975who asked
*yawn*
>>107452980this guy asked >>107452871
>>107452975I am actually curious if this meaningfully helps lora training.I understand that it would make a large scale finetune easier than the distill, but I wonder if it also aids loras.It might end up hurting them since it has noticeably lower quality than the distill.
I think this scene/hobby will get a life support once someone releases a local Sora 2 equivalent that doesn't take forever to gen
>he's madoh im laffin
>>107452991see you in 15 years then
>>107452975>I'll get back to you with the results tomorrowbased and thanks looking forward to it
>>107452975I recommend trying at 512px first (could be 1h) and then doing the higher resolution trainings if you feel it might be useful
https://desuarchive.org/g/thread/100607453/#q100607453good mems
>>107452991I am skeptical that is ever happening, at least any time soon.I am fine with a model that isn't cancer to run inference on with midrange GPUs and has reasonably good quality for its size though.Also should respond well to training, hope that some makes an actual NSFW video model.
based samefag
>>107452975How does the De Turbo actually work? Why are they confident Loras trained for it will be compatible with the base model once it releases?>>107452991>a local Sora 2 equivalent that doesn't take forever to genThis will come in 2026 but you will need a 5090 or better and it will still take 10 minutes perr video But this WILL come in 2026. It could come as early as by February when LTXVideo2 comes out, if it comes out
>>107452997>>107453017People thought the same when openai released dalle3 and the original Sora, and here we areof course I don't expect a local model to "know" as much as Sora 2 does since it's likely a gorillion parameters model, but a multiscene video model with audio with at least 10 seconds with at least the quality of Wan 2.2 but with better prompt alignment would do the job just fine
>>107452975here comes the cope
>>107453034no model is even close to dalle3
>>107453035Is this Chroma?
>>107453035why do you only generate these ugly bimbos?
fuck this gook model im done
>>107453052Generating ugly whores and jews is ldg culture, chuddie.
>>107453038For an aesthetics standpoint even OG Flux is better than dalle3, especially with lorasbut yeah, dalle3 has a fuckton of "world knowledge" of little concepts and celebrities that no local model has because the model saw a lot more data (not because it's a technical marvel or has tons of parameters, I bet it's actually smaller than 10b)
>>107453021It released alreadyI don't know how it is actually done but it beats CFG >1 and negative prompts back into model.The model loses quality on top of the initial distillation loss (so significantly downgraded compared to base) but in this state it responds better to finetuning.>Why are they confident Loras trained for it will be compatibleJust a guess essentially but we know that the base is also 6B so Turbo is just a low step LCM distill. Every other similar distills (Think of SDXL Turbo, Wan 2.2 distill loras) enjoys good enough lora compatibility with the base models so it is a safe bet.
>>107453035catbox pls?
>>107453035You realize the path to Chroma included dedistilled(deturbo) experiments, right? Or are you really that new?
>>107452997>>107453017>>107453034It took 9 months to go from Sora to HunyuanVideoIt took like 4 months to go from Sora to Kling There are already Sora 2 competitors such as Wan2.5There are already Sora 2 competitors released locally (that aren't good enough)There are already Sora 2 competitors that are scheduled to be released locallyThere is literally no reason we won't get Sora 2 at home in 2026But it is going to need a 5090. Prepare to cope with Q6_K or be a rentoid
>>107453056>>107453034The world knowledge is what I believe that isn't going to be replicated by any local video diffusion model anytime soon. Weight count in low billions for sane consumer GPU inference speeds can only hold so much data. You need fuck ton of weights to hold information about every single character, style, etc. out of the box like Sora does.Will local video get better?Likely.Will we have a local Sora 2 this decade (Aside from local in name only memes like Hunyuan 3)?No.
>>107453052The bimbo aesthetic requires a refined pallet to truly appreciate.
>>107452816i voted but im not happy about it because the year i started wasnt a choice fuck you
>>107452990>I am actually curious if this meaningfully helps lora training.Probably not, training on AI gens is retarded and zturbo was good in the first precisely because they didn't do that.
>>107453052>>107453091I thought "bimbo" necessitated exaggerated plastic surgery. But I looked it up and it turns out I was wrong, huh.
>>107453084>It took 9 months to go from Sora to HunyuanVideo>It took like 4 months to go from Sora to KlingIt's going to keep getting better exponentially forever!>There are already Sora 2 competitors such as Wan2.5Wan 2.5 is nothing like Sora 2.>There are already Sora 2 competitors released locally (that aren't good enough)So not competitors.>There are already Sora 2 competitors that are scheduled to be released locallySuch as?>There is literally no reason we won't get Sora 2 at home in 2026Holy copium>But it is going to need a 5090.If an actual Sora 2 competitor (as in comparable in quality and knowledge) arrives you better pray for RTX 6000 pro./ldg/ is in denial of reality /lmg/ accepted long ago.
>>107453091fucking vomitwhy are gooners like this
>>107453052Because I can
>>107453090>You need fuck ton of weights to hold information about every single character, style, etc. out of the box like Sora does.You literally don't and the Pareto principle implies you can get very close to Sora for a fraction of the price anyways>>107453127Wan2.5 is literally the same thing as Sora 2. Everything we need to make Sora 2 architecturally already is known openly. It has already been tried stuff such as OviIt's going to happen by 2027 And imagine having frogs on your hard drive ew lol.
reminder that nsfw posting is against the board rules. you know what to do anons
>>107453115You're thinking of "bimboification" the weird niche porn fetish. Bimbo is just an insult for a stupid woman.
>>107453139>You literally don't and the Pareto principle implies you can get very close to Sora for a fraction of the price anywaysYeah that's why 20B LLM models are indistinguishable from 100B ones, oh wait.
>>107453147>Bimbo is just an insult for a stupid woman.That's not true, bimbo strongly implies body work done, at the very least bleached hair or enhanced lips/breasts/ass
>>107453133At least it's not a troonime poster. Thanks for that.
i know anon misses my base not merge noobai schizo kino and i am sorry
>>107453091>>107453134these don't have fucked up faces and proportions though
>>107453163"Bimbo" dates back to the 1910s before that shit even existed, so no.
who is he talking to
>>107453079Chroma should fucking fix that fucking lines problem, sadly its a flawed model and it wont get better, we got what we got
>>107453162>that's why 20B LLM models are indistinguishable from 100B ones, oh wait.Correct, they are absolutely more than 80% the capability of the 100b models while being only 20% the size, which is exactly what the Pareto principle is>>107453173And the meaning of the word has changed since the 1910s, just like the word "gay" and countless others. So yes. Bimbo in the modern sense of the word heavily implies body modification to go along with the stupidity. I'm not denying that the stupidity is the most essential component though beyond anything else
>>107452975The v2 adapter does a better job of de-distilling the model than this finetune.Nobody measures validation loss but me so I'm apparently the only person who knows this.
>>107453127>If an actual Sora 2 competitor (as in comparable in quality and knowledge)You don't need to have the same "knowledge" to be good when Loras and in-context learning are a thing (edit models).Nano Banana Pro is likely a very large model close to 1T param yet you can get close to some of its outputs aesthetically if you train loras for z-image (a small 6b model), wan or qwen-imgMost people would be perfectly happy with a Wan 2.2 tier model with audio, that does multiscene and isn't limited to 5 seconds
>>107453184What does that have to do with you being a retard?
>>107453148z?
>>107452975based
>more and more zit loras getting trained on pony tags and datahttps://youtu.be/DjJPzEF5bHg?t=40
>>107453167
>>107453188Once again, you're getting it mixed up with the porn idea of "bimboification" which makes sense because you frequent this site, which is to a large extent a porn site. When you say "in the modern sense of the word" what you really mean is "in the 4chan sense of the word." In which case, you are correct. I just double checked urbandictionary to see if maybe you are correct and normies really have started using it this way but I see nothing about plastic surgery. https://www.urbandictionary.com/define.php?term=bimbo
>>107453199>Nano Banana Pro is likely a very large model close to 1Trepeated a lot, but I doubt this. it's too inefficient. I bet it's a moderate size around 20b, paired with a vlm that mediates between the user and actual image generation. may or not be autoregressive, doesn't matter.
>>107453222Dude go to a university and talk to 10 normie girls and ask them if getting lip fillers and fake tits and ass is an essential part of being a bimbo and all 10 will say yes holy shit go fuck yourself and your urban dictionary
bimbo = retarded woman and most retarded women have injections but not all
>>107453214It's a NoobAI mix.
>>107453188I don't know, body modification weirdos seem like they're in a category all their own, or at least should be lumped in with whatever those medical surgery fetish freaks are called. Being a pretty, dumb girl is so far removed from all that.
>>107453240why are you using z-image? too low vram?
>>107453231I had a college girl do make up for me for a Halloween party and she went to put some lip stuff on but had to stop because it's the plumping shit. Turns out that stuff burns. I had no idea that was a thing.
Is this some Discord raid again? What's with the pedo spam
>>107453247Who said I wasn't?
low cfg bro status??!?
>>107453012i do feel like making an attempt at zit-gyate
>>107453266same pose every tiem
>>107453240oh okI was hoping someone found a way to extract more variety from z.
>>107453266why are you using an outdated model then
>>107453288every. time.
>>107453241It's less about body modification being essential and more about vanity and looks over smarts, which body modification is a strong modern signal forI'm not arguing that it's the most essential component, but it's the logical continuation of bleached blonde hair and what doing that to your body represents, and again go ask 10 college girls, this is the aesthetic that the word "bimbo" captures in someone's mind. Because of insta thots the blonde hair and white skin is less essential than ever, actually >>107453254Yeah there's the fake plumping stuff that's temporary but there's also the more permanent lip fillers that are injections and last close to a yearThere was a lora called "Botox lips* that made comically fake big lips for wan 2.2 and it had to be pointed out to the guy who made the lora that it's not actually Botox that gets injected into the lip.>>107453297>el bimbissimo
>>107453358got it working eventually huh?
>>107453286you should
>>107453372You're thinking of a different anon. I just stole this prompt.
>>107452816jan 2023
>>107453053lmao deal with it gweilo
>shift = 100
>>107453462Give it to me so I can steal it as well.
how do I load this in comfy? The workflow for the full model throws an error when I try to load ithttps://huggingface.co/mzbac/Z-Image-Turbo-Fun-Controlnet-Union-8bit
>>107453614https://github.com/comfyanonymous/ComfyUI/pull/11062
>>107453190> Nobody measures validation loss but me> trusting validation loss
>>107453591From here >>107375814
>>107375814The actual absolute state of the seed variance tho, lol.
>>107453664pretty sure I had to heavily edit that prompt when I made this >>107376054
>>107453695
>>107453219not only they are training only on booru tags, but they also don't facial expressions at all, it seems. you try to gen one of "smirk, smug, sad, crying, happy" and all gens come out with the same blank expression
>>107453765>don't facialdon't tag
>trained on high quality sdxl outputs (my creation)
>>107453429let's see (training with a fast 512px)
>>107453012>pixart sigma in op>embeddings
>>107453199>You don't need to have the same "knowledge" to be good when Loras and in-context learning are a thing (edit models).This is the peak of cope. Why are you not understanding that you shouldn't need loras for every little thing?Loras should be for super niche stuff, not general knowledge. There are guys making prime lens loras cause local models lack so much basic knowledge.
>>107453139>Wan2.5 is literally the same thing as Sora 2.glad no one fell for this bait
>>107453807You're right about the cope but let's not pretend like a majority of LoRAs aren't concepts present in the model that the author is unable to prompt because they are mouth breathing jeets
>>107453647if you aren't trusting stabilized validation loss then what exactly are you trusting when you train a model
>>107453839how else does one observe but with the eyes faggot lets see your loras
>>107453848How can you possibly know if you have set the learning rate way too high, or too low, if not by looking at the effect on the validation loss trajectory? With a lora on a small amount of data you can just train the whole thing a bunch of times but that doesn't work with a large dataset.
Could you take the Gemma SDXL adapter and modify it so it takes Qwen and connect that to zimage?
why do you people even both with local when API can do nearly everything better?
>>107454001if the past is anything to go by it would need to be retrained but im a lowly retard idk
>>1074540011.Wtf do you mean?2.Zimage's text encoder is better than gemma so why would you even want to do whatever that exactly is?
>>107454014APIkeks are VRAMlets. sad.
>>107454014>API can do nearly everything>nearlyI would pay $200 in monero a month to anonymously be able to use Sora 2 with absolutely zero restrictions or rejections. I would rape everyone and everything and poop on everything and give myself erectile dysfunction irl and lose my job but most importantly I would pay $200 a month for itBut that service will never exist, so I care about a local version of Sora 2, which will hopefully come by next summer
>>107454014Using local models is their whole identity, same as ricer linuxtroons for example. Local models do have their unique use - porn, but considering this isn't a porn board... kek
>>107453664Thanks I'll give it a go later
>>107454081This is a feet, lips, buttcheek, thigh, tummy, shoulder, armpit and eyes thread you uncultured swine
>>107454069are you enjoying using WAN 2.5? oh wait...
>>107454050he probably thinks it would help with anime, switching encoders. everyone's had a similar thought one time or another.
>>107454014Pedos making cp
>>107454143cp and bestiality. and cp bestiality.
>>107454179they already have pony
>>107454179I like to shit on the furries but I bet the first good lewd audio tune for a video model to make moans and stuff will be the afterthought of a furry model making roars or yiffs or something gay and we're all still going to use it like the early PonyXL days
zit doesn't know what a crossbow is. It's over.
>>107454050some sdxl clones have a Gemma llm instead of clip. So take that concept, but adapt it so it takes a Qwen that would serve as a text encoder and add a second phase where it would use itself to prompt expand.
>>107452633>Take a three weeks vacation to focus on LLM, gets a nice LLM server going.>Boot up my CUMfyUI again>Everything has changed>New buttons everywhere>Big banner "would you like to test our new CUMFY interface">Lost all my opened sessions>the fuck>Launch a qwen image gen>It's only pure black noise images now>No error, it just doesn't generateI hate ComfyUI so much, so fucking much, you have no idea how much I hate this piece of shit, FUCK OFF, FUCK OIFFFFF
>>107454324Just RouWei or are there others?
i wish i wasn't interested in image gensharing a general with actual pedophiles really sucks
Just finished moving 30,000 1girls from my 'keepers' folder into sorted subfolders. The last time I did this cleanup was in January. So that's 30,000 1girls that I kept in a bit less than one year.Something's not right with this hobby.
>>107454402Are those pedophiles in this room right now
>>107454402it's correct to be attracted to a woman if she had her period
>>107453788"trained on synthetic" should be a category on civitai so you can filter out the slop
>>107454445apart from literal fucking cp posters? okay>107454385>107453799>107453245>107453053
>>107454396Doesn't Noob also have gemma?
>>107454486Why don't you use actual links spergo?
>>107454486all coomers in general are cancer, don't care about muh childruns
>107454486the images you posted are just random anime postsdid you take your meds today
saars
107453053 is anime
what causes this?
>troonime>>>/b/
>>107454486Chill
her age?
timeless
butiful 6 finger masterpis
https://x.com/nvidianewsroom/status/1996976316453679381Do you think "CUDA Tile" relates at all to tiled upscaling?
What's your image to image workflow in comfy (sdxl)?
>Do you think "CUDA Tile" relates at all to tiled upscaling?absolute state of ldg
>>107454849>inb4 radiance-type pixel space model
>>107454069Prompt for the background motion blur? Looks neat.
>>107454862you have literally no idea yourself, because it's a new nvidia feature. there's a possibility it could relate to something similar.
>>107454849Kill yourself fucktard
>>107454902>grug see similar word>grug think it related
what are best lightweight llms for vramlets to format prompts for z-image?
>>107455026Qwen3 VL 4B instruct. Or 8B
>>107452825>NOWthat's all humanity does since FOREVER
>>107455031how censored is it do you think?
>>107455026>>107455031Also don't forget the mmproj file if you are using goofs. Like this.
>>107455031>Qwen3 VL 4B instructor shit, i forgot. uncensored llms
>>107455026get a job
>>107455045>>107455042They are not. The abliterated/heretic and some uncensored version do have an effect, but it still won't give you porn. But it won't at least refuse soft lewds that often. It's just an TE, not the model itself.
>>107454849
>>107455057ah so like chunks or blocks, but tiles
CUDA Tile is based on the Tile IR specification and tools, including cuNNy, which is the user-facing language support for CUDA Tile IR (Intermediate Representation) in Python (and, in the future, C++). The NVIDIA Python implementation of this tile-based programming model is cuNNy Python.
>CUDA tile is supported on NVIDIA Blackwell.of course, those niggers
>>107455026By prompt if you mean verbose "enhancements" and paraphrasing Just use Grok.Before I get lynched in the LOCAL diffusion general, no local VRAMlet model comes close to it for this task, stop coping.
>>107452926
Give one (1) good reason why ANY company should sell their hardware to consumers when there are companies that will buy up their entire output at any price?
>>107455043Is the abliterated version better at anything?
>No base model>No updated QIE>No Wan 2.5>No more RAMWhen is it officially over?
>>107455172it already is niglet
>>107455153As a Text encoder? Not really, it's mostly cargo cultism. Diffusion models are trained downward of their text encoders, they can't use newfound capabilities from ablated models because they haven't been trained with andfor it.Some people say it makes a minute difference when you ask for some poses sometimes, but really, not really no. It doesn't harm yourself though, as ablated models can do everything non-ablated model can do, and often better.As a model to rework your prompt? Yeah, because non ablated model will just tell you to fuck yourself if you want to rework a prompt that it doesn't like.
>>107455110I mean that's like taking a rocket launcher to kill a ant, I mean sure it will work but all you are doing just adding a little verbiage to your prompt lol
>"it's gonna be released before this weekend chud"2 weekends later...
>>107455199I'm actually giddy with how right I am. This feel like calling out the series finale of a TV series several weeks in advance after everyone called you a retard.
>>107455172I get the feeling we might barely get to play with longcat (released H O U R S ago) before the QIE update or some other such thing.>>107455190Ah, there are refusals if you prompt for a changed prompt? Makes sense that you'd want the other model. Thanks.
>>107455212Don't celebrate too early, they still haven't announced base will be API (because at some point they will if they really want to fuck us)
>>107453799>>107455153gyate!
>>107455230why did they fuck people over like that?
>>107455247It would be funny. But it's important to understand how the Chinese mind works. Sometimes they just go back on things because they feel like it or a better deal comes along. It's no secret that the Chinese are notoriously fickle trading partners. I don't why people forget this fact when it comes to AI stuff.
A chink has tested all the schedulers + samplers combinaison at batch 4 to see which one is the best and gives the best seed variation, I give you the spoiler in pircelhttps://www.youtube.com/watch?v=e8aB0OIqsOc
>>107455221>Ah, there are refusals if you prompt for a changed prompt? Makes sense that you'd want the other model. Thanks.Official model released have a first layer of refusal, which is "if the user mention pussy in the prompt, just tell him to fuck off. Or if he mention breasts. Or want to get a girl naked. Or anything like that."If you ever played with ChatGPT, Grok, Deepseek, LLama, or other LLMs, that's the answer "I can't do that, dave". The LLM has been trained specifically and especially to give you that answer, every time you ask those questions.An abliterated model is just the same model who had that automatic reaction excised. It isn't more intelligent, more clever, or more knowledgeable, it isn't a panacea, but that mental block has been removed.
>>107455259Those are the ideal ones?
>>107455246z-image-turbo gyate!i'll try another training run with DDG and blank prompt preservation, maybe they do <thing>
>>107455257>I don't why people forget this fact when it comes to AI stuff.because Alibaba has always been clean with us during the whole 2025 year
>>107455265apparently, he showed some interesting examples, euler gives you too many noise for example
>>107455269why is is so ugly and melted
>>107455271>because Alibaba has always been clean with us during the whole 2025We only found out Wan 2.5 would be API only a day before release and only through a random twitter account. At no point did Alibaba step in with any of its other partners and insiders and correct them with the fact it would not be open source. No matter how you cut it, they lied through omission and were happy to let rumors of open source do the legwork for them. You'd have to be especially dim-witted to believe they didn't know what they were doing.
>>107455300>We only found out Wan 2.5 would be API only a day before release and only through a random twitter account.they never said they were gonna release wan 2.5 though, so it can't be a broken promise if there is no promise in the first place
I'm grateful that LMGchads exist as guides for LDGchads diving into VLMs. 1girl together strong.
>>107455230
>>107455323conceptually superior to every Sora 2 generation that has ever been or ever will be in existence
>>107455318Literally nobody assumed 2.5 would be API only before it go released. Nobody. It was a complete blindsiding. They never said it was going to be API only because the assumption was it never would be.
>>107455292over 20/200 vision might have been a mistake when using AI
>"before the weekend">cope: meaning next week>"it will be open-source!">cope: you are hereJust remember these lies. Z-Image base is closed source.
>>107455330>Literally nobody assumed 2.5 would be APIthat's your problem, you assumed things, it's not alibaba's fault you're making fanfictions in your head, if they don't promise anything you can't say you are owed anything
local always get saas sloppy seconds at bestthis is your fate
>>107455341I got a diy project for you.All you need is a rope and your neck
>>107455341I still remember how Flux 1.1-dev was supposed to be released open weight.1.5 years ago. We got the release of the useless Flux 2 before the release of the Flux 1.1-dev. For fuck sake.
>>107455345>you assumed thingsJust like how you assume the base model for zimage will be released. I'm just gonna sit here, knowing how Chinese people operate while you keep using magical thinking to reason why another week goes by and we still don't have the base model.
>>107455371>I still remember how Flux 1.1-dev was supposed to be released open weight.really? I didn't remember them saying it would be opensourced, I could be wrong though
>>107452816>over half are 2023 or newer newfags owari da...
>>107455379>you assumeI don't assume anything, they repeatedly said they were gonna release the model, it's not a fanfiction it's a fact
>>107455380It was and is still on several of their documentations, their githubs, their webpages. Black ink on white. "We'll release 1.1 dev later this year". They tried to remove all of this from the internet but they didn't quite succeed.Three months ago, the official page about 1.1 still told how they would release the dev open-weight "later in 2025" and such bullshit.
>>107455386I'm not going to debunk this image for the fourth day in a row. Nowhere in this is an unambiguous claim that it will be open sourced. To the contrary, this is likely a tacit admission that there is red-tape involved with releasing it.
>>107455386>Alibaba: "we are actively working on getting the model open-sourced">>107455407>Nowhere in this is an unambiguous claim that it will be open sourced.
>>107455406lmao, I mean this is BFL we're talking about, they were always giant snakes, it's obvious they're bad people look at their model card of Flux 2 dev, Alibaba is far from being this evil desu
>>107455259yet AAAAAAAAANOTHER "benchmark" prompt that is 1girl doing nothingwooooooooooooooooooooooooooooooow you can really stress test the model and realism with this
>>107455416Just like I am actively trying to quite smoking but somehow end up with a pack of cigarettes. It's the same deal here. This isn't a technical hurdle. This is an internal politics issue and the suits always have their way.Learn. Chinese. Culture.
>>107455434they still promised they will open source it, on their github page it says 2 things>to be released>by releasing this checkpointif that doesn't happen I consider this as a betrayal and I won't trust those Alibaba chinks anymore, we'll have to wait and seehttps://github.com/Tongyi-MAI/Z-Image?tab=readme-ov-file#-z-image
bros the quicker we move on the quicker we can achieve kino again
>>107455429>yet AAAAAAAAANOTHER "benchmark" prompt that is 1girl doing nothingtrue, imo the best way to find the best (scheduler + sampler) combo is by looking at the text, the one combo that makes the least errors on text wins, and desu euler + normal seems to be better than euler ancestral + ddim_uniform
>>107455448>I won't trust those Alibaba chinks anymorehow are they going to cope
>>107455448>I won't trust those Alibaba chinks anymoreI will accept your apology with open arms in a few weeks.
oh époux
>>107455466>deepthroating a multibillion dollar company
>>107455448is this your first time?
>>107455456Kek
>>107455473Does the realistic parody request thread know about Z yet
>>107455480it's been 1 year we're having Alibaba on the AI local ecosystem, and so far they never betrayed us, let's see if this is the time
>>107455490Surely
>>107455494>so far they never betrayed usIf you don't consider Wan 2.5 a betrayal you are a giga cuck.
>>107455507you were betrayed by your own expectations, nothing more
>>107455456Euler ancestral is a meme
>"ayo gibsmedat fo free" general
>>107455448>theyThey are just a small lab within Alibaba that has no control over what gets released and what doesn't. At the end of the day, if Alibaba couldn't care less, it's not happening.
>>107455523>no control over what gets released and what doesn'tif it's true they wouldn't have promised the release, they're confident it's gonna happen
>confident >"base, i guess"
>>107455511>Go to a self serve ice cream machine>1st time>Ice cream comes out>2nd time>Ice cream comes out>3rd time>Everyone cheering about the third ice cream now. It's so good.>A giant steaming turd comes out>"Erm, you were betrayed by your own expectations."You're a fucking retard.
>>107455523Analogy, you join OpenAI, then make a cool new model that's 2-4B and 4o tier, but then you plan on releasing it... Yeah, that wouldn't turn out well.
>>107455534>they're confident it's gonna happenNothing about their current communication tells me they're confident it will be released. I know I keep saying, this, but do you actually understand how Chinese people operate? Like really. I'm not meming here. They do not communicate the same way you and I do.
>>107455543again, you're the one assuming they should only release local models, I'm not a retard I know some of their models will be local and some will be API only, again, they didn't promise Wan 2.5 to be open source so I don't see the issue there, if they promised it would be a betrayal, just stop making fanfictions?
>>107455558I challenge you to find even one individual who thought it would be API only before it was released. Just because they never said it would be API only explicitly doesn't mean they were entirely truthful with their communication.
>>107455576>thoughtagain with the assumptions, I don't speak the wishful thinking language, I'm talking the fact language, there was no evidence it was gonna be open sourced, if they don't say anything, don't assume anything, it's not alibaba's fault, that's a you problem
>>107455513kek, uni_pc is completly broken on Z-image turbo
>>107455593>I don't speak the wishful thinking languageYou speak it every time you use that discord screenshot of evidence the model is coming.Releasing the distill before the base model? Come on dude. You know what that means. Their open source claims are simply goals waiting approval that will never come. You need to stop taking any words from such a fickle business culture with any value.
Remember that historically, BFL guys had to leave SAI to establish BFL. Tongyi guys will probably end up doing something similar once Alibaba tears down their open source dreams.
>>107455607>Releasing the distill before the base model? Come on dude. You know what that means.I'm assuming (see? we can assume but we have to warn people it's just an assumption and it has to be taken with a grain of salt) they rushed the turbo release distilled from an unfinished base model because they wanted to release it at the same time as flux 2 and kill its momentum, and guess what? it worked perfectly, that's genius marketing right there
>>107455621You are actually delusional. They didn't release the base model because they were legally incapable of doing so. The distill was always their planned released anything more than that was a lofty goal by the engineering team.
>tfw doesn't know any Star Trek characters
>>107455612BSL has an history of almost 3 years of releasing broken models, shitting all over everyone trying to use it, promising a new model that never went out, get the most stringent licence possibles for outdated models they released 1 years ago, and just being absolute shitter on a level almost impossibly imaginable. Their new license is made to fuck everyone ever. Their new models don't work, and was only released because _they_ had to. They're the most unwilling participant of the community ever, they're trying their hard and they're actively, 100% toxic.BSL has a history of fucking the fanbase (if even one person is a fan nowaday) for now almost three years straight. I hate chinks, but Alibaba doesn't have that history, yet. If it's between them and BSL, I'd choose Alibaba every fucking time.
>>107455663neither do I, looks fine to me
>>107455663What is that weird half space half black woman in the back left? A nigula?
>>107455543that's cute that you think if more people wish to turn shit into ice cream, then that's a factor worth mentioning. I can see what I'm dealing with now
[WORKER] Stopping job 7cc93471-a5db-4942-b3b6-298f8cb89704 on GPU(s) 0[WORKER] Stopping job 7cc93471-a5db-4942-b3b6-298f8cb89704 on GPU(s) 0[WORKER] Stopping job 7cc93471-a5db-4942-b3b6-298f8cb89704 on GPU(s) 0[WORKER] Stopping job 7cc93471-a5db-4942-b3b6-298f8cb89704 on GPU(s) 0[WORKER] Stopping job 7cc93471-a5db-4942-b3b6-298f8cb89704 on GPU(s) 0[WORKER] Stopping job 7cc93471-a5db-4942-b3b6-298f8cb89704 on GPU(s) 0
>>107455698I can tell I'm dealing with a 50 IQ gorilla because you're incapable of pattern recognition.
>>107455714I think you just accidentally conceded that your own machinations betrayed you, that does not surprise me
>>107455724No, when you create a pattern then intentionally break it people will register it as unfair. The fact you cannot understand patterns and see no issue with their conduct means you never understood it in the first place.
>>107455638So for you, Tengyi is Alibaba's bitch (fair assumption I think the same) but at the same time Tengyi has the courage to fight against Alibaba and say publicly it's gonna be open source? yeah.. I don't bite that shit, every single word they say publicly is being reviewed by Alibaba, so when they wrote "to be released" it had the approval of Alibaba themselves
Is there a best version of SDXL to use or is the base just fine?
>>107455773So sweet and naive that I got diabetes.
>>107455699working around that bug is a rite of passage for all newfag lora trainers now
Is it possible to use the z-image controlnets with ultimate sd upscale? I try to include it in the workflow but I noticed it applies the controlnet on every tile instead of tiling the controlnet as well.
The biggest killshot argument against "they will never release zimage base" to prove that those posters are low iq retards is the fact that even in the worst case scenario, where alibaba realizes they can somehow earn much more from the model than they thought, they can simply release an undertrained, mid, or even gimped base model and say thats it, and then properly finish training it fully and then just publish that as API later, keeping their izzat high while getting the bag.So they will publish the model no matter what, the question is will it actually be good compared to the increased compute cost
>>107455810I find it more likely that the base never gets past the Chinese red tape and the thought of training and releasing a gimped base model is frankly too much work.
>>107455810>where alibaba realizes they can somehow earn much more from the model than they thought, they can simply release an undertrained, mid, or even gimped base model and say thats it, and then properly finish training it fully and then just publish that as API later, keeping their izzat high while getting the bag.Isn't that the BFL strategy? I didn't work on most people, though some low IQ people lapped it up.
>gimped versionYeah thats turbo
>>107455810>keeping their izzat high while getting the bag.they could do that, give us a cucked base model and then say>"Iz normal it look like shit, iz a base model chud1!!!1!1!"
how is it even with the latest models out now SD still yet to be defeated?
>>107455825>thought of training and releasing a gimped base modelOnly a single epoch with a slight misconfiguration can fuck it up enough and be quick to train but anyway,but the easiest solution is to just release an older checkpoint where the quality isnt that great
>>107455796i saw reference to cron in the terminal and for some reason i still tried to reboot to bandaid fix. i should go to bed kek.
>>107455852>the easiest solution is to just release an older checkpoint where the quality isnt that greatturbo is already a distilled version of an early epoch of the base model, and it turned out to be a great model, those chinks know that even with a mid base model, with good finetuning you can make gold out of it
What if they make us wait and wait and then a new great base model arrives and takes the spotlight? that would be funny
>the model will be released when they're ready, that's itand it'll be ready in 2029, that's the catch
>>107455887Only google can distill nano into some small shit that fits into 16gb at fp8 to overthrow zimage, but that wont happen, google were too big of a pussy to even release gemma llms at 70b let alone anything else lol
>>107455915>>107455915>>107455915>>107455915
>>107455699https://github.com/ostris/ai-toolkit/issues/306why does this cancer exists holy fuck
>>107455809Just i2i the image with sdxl