A Giant Wake Up Call Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107374545

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
ARE YA READY??? https://www.youtube.com/watch?v=xb2fjZa_L74
Chroma..
>>107375744inb4 untrainable 30b blob
>>107375744Nothing to be ready for, until the danbooru fork drops I've got nothing to do with Z
>>107375744>i guess
why arent these included in training data? a close up of a pussy i can understand, but why not teach it what a nipple look like?
>>107375744in my experience, teased releases never happen. AceStep 1.5 will likely never get released. It's on the "roadmap".
>>107375759Nipples look fine 90% of the time.
>>107375744>before this week end>we are already in the week end
>>107375759Nipples don't exist, it's an oversexualized fantasy
comfy should be dragged out on the street and shot
>>107375759otoh it's useful to have models that are ignorant of certain things, like you know they won't produce certain things.
>>107375744
Remember that Z-Image Turbo's text encoder can handle up to 256k tokens, so don't hesitate to yap it likes it lool.
>>107375744I hope they won't do this.
>>107375767the end of the weekend
>>107375776>implying i need more than "a photograph of a sexy woman"
>>107375744Base on what
>>107375790out of 10
QIE 2511 should be coming in the next 12 hours if it hasn't been cancelled
>>107375759In zimage? I've seen proper nipples.It's flux 2 that has no idea mammals have nipples.
>>107375801>QIE 2511 should be coming in the next 12 hours if it hasn't been cancelledI have a feeling the 2 teams will release their models at the same exact time just for the fun of it lmao
>>107375801"this weekend" means "next weekend" in mandarin
i've joined the z club
Reminder that it is Chinese culture to go back on promises if doing so will cause enough emotional pain.
>>107375822>fp8poorfag
>>107375818
2511 means November 2025 anon
>>107375826i am in your club, theres nothing you can do about it
>>107375822>stable_diffusionthat's lumina 2 no?https://comfyanonymous.github.io/ComfyUI_examples/z_image/
>>107375805>I've seen proper nipples.
>>107375844i have no idea what im doing, im just following along this videohttps://www.youtube.com/watch?v=itOSk0woXo8
>>107375852just download his workflow (his image) and load it on comfyui anon >>107375844
>>107375850I remember one time I hooked up with this girl and her nipples were no shit as long as my thumb. Like she had literal udders.
>>107375822Better off running the Q8No i will not elaborate
>>107375864
i downloaded both, i'll try that next
>>107375744what if the loras trained on base don't look any better than the ones trained on turbo
>>107375842You're in our lobby.
This thing is cool as hell.>>107375876I'll cry
>>107375873>3 toesin any case, mind posting workflow or is it just the default?
>>107375880same difference
>>107375873lord have mercy.
how do I use torch compile on clip in comfyui?
>>107375873>nipples are more than finei think they trained on infected pimples instead
You can tell z image is going to be big because all the SDXL vramlets who have basically had to change nothing for the last two years are suddenly coming out of the woodwork asking retard questions.
>>107375873>the WAN 2.5 rugpull traumatized the AI community so badly kekwell, it's the same company that rugpulled us with wan 2.5 that is supposed to give us that base model so...
>>107375780:)the tit adder is coming, you can't stop it.>>107375837oh no it fluxes the backgrounds
>>107375898wait for the sd1.5 neanderthals
>>107375863dog. a new fantasy king has arrived.I only see a modest amount of toe and hand issues (bottom left foot, redhead's right hand). it gets the prompt.
>>107375898Z-Image is more realism focused. Unless we get an anime fine tune similar to illustrious/noob, then they won't be migrating.I don't think illustrious is going to abandon SDXL until 3.5 gets funded(never happening)
>>107375898>implying there's anything better than SDXLUnless you're just prompting realistic stuff in which case your gay
>>107375906>wait for the sd1.5 neanderthalskeek, I thought they died of old age
>>107375916Hasnt anime already been perfected?
>>107375915
that was with the extended prompt from >>107375634, the first attempt looked like this >>107375678
>censored>can't change camera angle>understand only basic posesand they said z img is good
First test training of a Z-Image Turbo lora, used Diffusion-Pipe and good old adamw 0.0001 LR for 100 epochs
You can most likely finetune the settings more, but the result was good (hopefully you can see who it is!), particularly for training on a distilled model like Turbo and only training at 512 pixel resolution
Training was really fast, ~50 minutes for 25 images * 100 epochs on a 5060 Ti 16gb
Anyway, here's the lora if anyone wants it: https://files.catbox.moe/4pfomp.safetensors
No 'trigger' needed, lora strength seems best at ~0.75-0.8 from my tests
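For anyone sizing their own run off those numbers, here is a rough back-of-envelope sketch in Python. It only uses the figures quoted in the post (25 images, 100 epochs, ~50 minutes). The function names are made up for illustration, and it assumes training time scales linearly with images seen, which ignores batch size, resolution and caching.

```python
# Rough throughput estimate from the quoted run:
# 25 images * 100 epochs in ~50 minutes on a 5060 Ti 16gb.
# Assumption: time scales linearly with the number of images seen.

def seconds_per_sample(num_images: int, epochs: int, minutes: float) -> float:
    """Average wall-clock seconds spent per image seen during training."""
    total_samples = num_images * epochs
    return (minutes * 60.0) / total_samples


def estimate_minutes(num_images: int, epochs: int, sec_per_sample: float) -> float:
    """Projected training time in minutes for a different dataset size."""
    return num_images * epochs * sec_per_sample / 60.0


if __name__ == "__main__":
    rate = seconds_per_sample(25, 100, 50)
    print(f"{rate:.2f} s per image seen")
    # Hypothetical 40-image dataset at the same epoch count
    print(f"{estimate_minutes(40, 100, rate):.0f} min projected")
```

That works out to about 1.2 seconds per image seen, so treat any projection as a ballpark for similar hardware and 512px training only.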
>>107375927Of course not. Regional prompting(multiple unique characters) sucks with SDXL and using booru tags is simply inferior to natural language. The QUALITY of anime images is near perfect, sure, but that's not enough.
>>107375938Does the lora make the anatomy worse?
>>107375950Nothing that I've noticed, hands etc looks just as good as before
>>107375949meh, anime is all soulless slop to me. even the non ai shit the chinks produce
>>107375873go back to /b/ xd
Hello anons, I want to train a lora of my waifu using Z Image Model, what's the bare minimum dataset size I need, software and VRAM? First time training, complete noob here, is there any rentry?
>>107375966>prompting for realism Failed normalfag behavior
I'm not downloading shitty turbo loras
>>107375978don't lose your head over all the details
frieren is overrated trash hyped by zoomer tourists. fuck off
>>107375978wait for the base model its not out yet
>>107376003zoomer tourists hobbyzoomer tourists generalzoomer tourist boardzoomer tourist website
>>107375978
12gb vram and ai toolkit
>>107375978>is there any rentry?Retard. The DEMO of the model, not the full model but a distilled aesthetic tune, released literally two days ago. And the base model isn't even out yet. Fucking calm down.
>>107375862>I remember one time I hooked up with this girl and her nipples were no shit as long as my thumb. Like she had literal udders.should have put a ring on that>>107375888>3 toesyeah toes absolutely fucking suck on Z image turbo. I pray its just a turbo thing>in any case, mind posting workflow or is it just the default?it is the default workflow with the only two """objective""" "enhancements" that most people agree with right now: the TAEF1 encoder and un-bypassing the shift node with a value of 7>>107375893>lord have mercy.literally 30% of slightly overweight persian girls have the exact same face as the one on the right, if that's what you're into.>>107375897>i think they trained on infected pimples insteadthey're like 96% good enough. its not triggering my uncanny valley personally. I am getting nipples poking through bikinis often and bad nipples aren't uncommon either maybe 25% of nipples are bad (not 25% of gens with nipples, there is a 25% chance of a "nipple ngmi" occuring so thats 31% chance of a gen having good nipples with 4 nipples which is why I do a batch size of 4)
>it's another Chinese company promises to release a model then doesn't episode
>>107376004
It's a good time to practice with the turbo model right now, plus I'm a VRAMlet anyway
any tips on making the scene dark
>>107375776this is so close on being nano banana tier in terms of manga, the text is readable for the most part, maybe the base model will reach the next model and make comic rendering usable
>>107376021>the TAEF1 encoderhuh, isn't that just for previews? where exactly are you using it
>>107376021Nah she had a boyfriend. It was kind of a one night thing. I'm a certified architect too and she was pretty slim.
>>107376028funs FOR senpai
>>107376026youre a self admitted newfag whose never trained before you have no idea what youre talking about
VRAMlets, has our time come?
>>107376021>they're like 96% good enoughit's not even 50% good enough. It's literally just this. there's no detail at all, just a brown circle.
>>107376026>I's a good time to practice around with the turbo model right nowit's not. you can train using ram. you will have to do everything again when the base model drops (extremely soon)
>>107376026Please stop avatarfagging or go to /sdg/
>>107375814
I had to change your prompt to actually get it to work
The fun thing about doing an infographic that compares 75 images from each model is genning the Z-image images and knowing after the first 3 what every single image is going to look like
>>107376061we fixed this issue 2-3 threads ago
Noooo I still have ideas for SDXL image sets it can't be over so soon nooooooo
>>107376026once again proving frieren fags are fucking retarded zoomers.
>>107376048>you will have to do everything againi will have to change the ai toolkit job model from z image turbo to z image base and click the start button AGAIN?>????? its so over
>>107376067Nta but that was not a fix unless your goal was to prompt 1 girls. It kills complex prompt adherence
I'll wait for Onetrainer to implement ZIT
WE ARE FVCKING BACK BROSSS
>>107376073you are the kind of ai person who does more harm to imggen than anti ai pple
>>107376086
>>107376086based
>>107376067it was a shitty band-aid solution that only works conditionally.
i can't stop
>>107376067Tell me the fix and I'll throw that in as another column.
>>107375898yup, it's sad we take the opinions of vramlets seriously here.
>>107376090im not the initial zoomer retard that you replied to
>>107376090Not than anon but why are you being such a prick over a guy who just wants to train a LoRA? It's his time to spend how he pleases and you're acting like a total faggot over it.
>>107376099>it was a shitty band-aid solutionit works well you're just yelling at clouds now >>107372870
Let's face it there won't be any good Zbooru models until 2026. There's still time for one last SDXL hurrah
>>107376027we're gonna be able to AI generate an entire magazine with articles and stuff too very soon, like you can do it now with effort and photoshop but soon it'll all just be from a promptpeople discussed making dark stuff recently. starting with a black image and img2img is one thing, i forget the rest >>107376031>huh, isn't that just for previews? where exactly are you using itas the vae. picrelthere's a guide on reddit>>107376036>I'm a certified architect too and she was pretty slim.based but the point of being an architect is to architect her yourself like how master architect pierce brosnan did but i understand>>107376047lol alright niggy have a (You) because the video was slightly funny
>>107376106
model vramlets can run = lively ecosystem and constantly developing
model vramlets can't run = DOA
>>107376113>>107375873>the WAN 2.5 rugpull traumatized the AI community so badly kek
>>107375876They will look better, what I'm wondering is how well they'll work on the turbo model.
>>107376028now it's ESL tier but readable lool
>>107376128yeah i just came up with a workaround>>107376105basically this but the second clip should be empty
>>107376028FUCK if this gets SDXL levels of danbooru support I'm going to SHIT myself
>>107376158i alreeady shat myself
>>107375787where is that picture from. is this from part of a magazine or webcomic or something wtf lol
>>107376090He's right though.
>>107376167https://en.wikipedia.org/wiki/Sega_Poweri assume its from this
>>107376164what are your good/bad feet percentages asian foot anon? how many bad gens before a good gen like that
>>107376102Do some "Vogue" covers to mix it up.
damn, 80s/it on my rx6600
>>107375787Why's he killing the Italian dog though?
>>107376185for z image
what model is anon playing with now?
>>107376193he's tired of this shit
>>107376171
It still runs pretty slow on older GPUs, it seems to be optimized for RTX4+, runs much faster there even with the same VRAM
>Anime finetune of z-image>Wan2.2The finetune doesn't even have to be danbooru style. This is it. I can feel it, the stars are aligning. Thank you China.
>>107376146cute macorot
>>107376150Prompt editing to force variety (you could get even more with a wildcard in the first step, like in the old days) is a form of cheating on this kind of test.
>>107376171kek
>>107376207Nah doesn't do it for me unless I can prompt my favorite artists, I need muh booru forks
>>1073761781 out of 3 have good enough feet >>107376179vogue has jewish vibes fuck thatill change it up
>>107376172>i assume its from thisso you're telling me they put a nazi sonic child's drawing into a UK magazine? no vey
>>107376207
2.2 still can't do anime style animation
>>107376227>1 out of 3 have good enough feetyou've found a good pocket of training data because for me it's like 1/8 or even rarer
>>107376158Imagine the prompt following. Imagine having more than 75 tokens usable before the quality takes a dip. Holy fuck.
>>107376026>>107375978Well?? Will you help me?
>>107376229>no veyholy shit it did lmao
So far I've generated images for SD3.5 medium, Chroma, and Z-Image. What other local models should be in the comparison? (For photo images, not anime)
>>107376256Qwen Image
>>107376249>tranime brown avatarfag zoomer retard needs to be spoonfed every button clickread and shut the fuck up >>107376018
>>107376256Pony v6, BigASP SDXL
>>107376185Use lower quants.
>>107375776>Just write a giant prompt bro the model will understand.China is amazing.https://files.catbox.moe/yy9o5m.txt
>>107376341>tfw accidentally the rape demon
>>107376300https://www.youtube.com/watch?v=sFl5rKWlOS8
>>107376158
Wouldn't the model degrade in quality if you finetune it with danbooru tags/dataset? Like how chroma's finetune made hands, feet and limbs worse than the base model
>>107376380top jej
>>107376380>Mario and his big ass nose>"Jude"sounds about right
>>107376351
>>107376249the dataset is very important. make sure you cut it down to only the most salient features. like the head for example. cut off the head of all your photos before you train. then all of this data has to be customized to your available vram. scaled, if you will. if you're not careful, a model that seems very small can suddenly grow very large, and then it's all over.
>>107376410how did china produce SOTA text with a tiny local model
>>107376410
god I need -edit right now
>>107376425
this is a great question, the text encoder is only a 4b model, this is black magic to me, I have no other explanation
>>107376444king. what are you training?
>>107376271
>fp8 is 19gb
>most I can do is probably Q4_0 with offloading
This'll be tough but I'll give it a shot.
>>107376312
I haven't even done base SDXL. There's no way BigAsp will do well at this sort of image so that seems like a waste of time.
>>107376379SDXL only got better the more it got rapedVanilla SDXL is kinda shit
>>107376444unless you have 4k+ images, I have no idea why you'd be doing that many steps.
>>107376425they distilled from a mysterious bigger model that only they have access toif chinese llms are anything to go by, this means that they simply trained on gens stolen from the western proprietary SOTA
>>107376425
too much is left on the table, i mean imagine these models in 5-7 years, an omnimodal small model you can tell what you want with words and it just instantly changes the image. you can have something similar right now if you glue things together, but not a natively multimodal model.
but as long as scaling is cheaper than r&d people will bruteforce that as the safe route towards progress, until fewer and fewer companies can afford to scale more and more, and then those will focus on research, get a breakthrough, and the cycle begins anew
>>107376474everyone is saying Z-Image-Base is only 6B though, according to the paper that another Anon insists on mentioning.
>>107376425This is a big misconception, 6B parameters with a 4B LLM is already huge. You can only scale so much before you reach diminishing returns, and most researchers are just lazy with their data curation. Read the paper.
my gpu keeps dropping from 100% usage to 0% usage, what's causing this? the temps look ok
>>107376494>6B parameters with a 4B LLM is already hugenot compared to other modern models tho no?>Read the paper.i glossed as much as i could understand
>>107376474A seething jew, on my /ldg/ ?More likely than you think
>>107376486>scaling is cheaper than r&dBut US companies are spending literal trillions on scaling as Chinese are outperforming them out of moms basement with r&d
>>107376494I think it will be difficult to reach higher resolutions without scaling the model size. We've been stuck < 4MP for years now.
>>107376507>i glossed as much as i could understandask gemini to make a summary for you kek
>>107376486>>107376512read the paper
Total ComfyUI Victory
>>107376459Z Image Turbo DALL-E 3-like Girls style lora that I had the dataset ready for>>107373741>>107376467The lora keeps getting better so I keep training, I'll train until it breaks completely for multiple checkpoints in a row
what Z-Image are you guys using?
>>107376507You don't have to speculate, try it yourself. What's better, Qwen Image or Z Turbo?
>>107376421
Thanks, do you recommend training my lora on SDXL first?
>>107376526its like burning the lora is a feature
>>107376433
Okay. Hear me out. X-image
>>107376341>>107376357catbox / prompt?
>>107376185
Another Vramlet with a GTX 1080 here, our cards should be about equally fast for games, but AMD sucks a bit more at AI.
At what resolution, what CFG and Quant are you using?
With Q5_K_S and CFG at 1 i get about 4s/it at 512x512 and 14s/it at 1024x1024
With Q5_K_S and CFG at 2 i get about 7s/it at 512x512 and 28s/it at 1024x1024
Can anyone else post their results?
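A quick sketch of what those GTX 1080 numbers imply, in Python. The timings are copied from the post. The helper names are made up for illustration, and the 8-step default is an assumption based on the step count other anons mention for Turbo. With standard classifier-free guidance, CFG above 1 runs a second (unconditional) pass each step, which would explain the roughly 2x hit.

```python
# s/it figures quoted in the post (GTX 1080, Q5_K_S quant).
# Keys are (cfg, resolution); values are seconds per iteration.
timings = {
    (1, 512): 4.0,
    (1, 1024): 14.0,
    (2, 512): 7.0,
    (2, 1024): 28.0,
}


def cfg_slowdown(res: int) -> float:
    """How much slower CFG 2 is than CFG 1 at a given resolution."""
    return timings[(2, res)] / timings[(1, res)]


def resolution_slowdown(cfg: int) -> float:
    """How much slower 1024x1024 is than 512x512 at a given CFG."""
    return timings[(cfg, 1024)] / timings[(cfg, 512)]


def total_seconds(cfg: int, res: int, steps: int = 8) -> float:
    """Sampling time for one image, ignoring model load and VAE decode."""
    return timings[(cfg, res)] * steps


if __name__ == "__main__":
    for res in (512, 1024):
        print(f"{res}px: CFG 2 is {cfg_slowdown(res):.2f}x slower than CFG 1")
    for cfg in (1, 2):
        print(f"CFG {cfg}: 1024px is {resolution_slowdown(cfg):.2f}x slower than 512px")
    print(f"8 steps, CFG 1, 1024px: {total_seconds(1, 1024):.0f} s")
```

So 4x the pixels costs about 3.5-4x the time on that card, and staying at CFG 1 roughly halves it.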
>>107376546makes you think, what happened to Y-image?
>>107376474>stolen from the western proprietary SOTAwho cares? they released open weight model, and proprietary SOTA models are also going to benefit from this
>>107376558>what happened to Y-image?the same thing that happened to Iphone 9 and Windows 9
>>107376546ZA-image
>>107376474>this means that they simply trained on gens stolen from the western proprietary SOTAthey did the opposite actually
>>107376526looks good, any loras?
>>107376552thats what i generated i did a 512x512 test with 25.83s/it
>>107376572>any loras?? I'm training it
god damn I slept through two and a half threadsis the base model out yet?
>>107376587if it was it wouldn't have been 2.5 threads
>>107376587two more weeks
>>107376590shitty samplersstop it
>>107376544
more like zzzzzzzzzzzzzzz-image
>>107375776>>107376351>>107376410>>107376433>>107376544
>>107376600I think he used that lorahttps://civitai.com/models/2175050/vhscommercial?modelVersionId=2449356
>>107376351nice morklow
>>107376645thanks, and based piercel kek
Zimage lora from pic rel?
>>107376661
>Zimage lora from pic rel?
when we get z-image edit there won't be any need for a character lora anymore, can't wait
>>107376580What program, quant and CFG did you use for that one?
if i'm not getting any speed up from using lower quants of zit then the bottleneck is in my old vramlet gpu cores, not the memory?
when they said zimage is uncensored they werent kidding wow
>>107376645that's fucking crazy good text how many steps and how many attempts did this take
>>107376600>>107376636yeah its the lora
>>107376670comfyui, i dont know what a quant is i just started generating again since 2024, cfg 1, 8 steps
Anyone managed to produce good results with WAN i2v on 8Gb VRAM? I got it working, but the videos are slow-mo garbage with some lethargic motion.
>>107376677it doesn't know penis and the vagoopers and nipples look a bit odd, but that's not censoring they just didn't include alot of it in their training
>2mins per image for a mere 20 step genQwen is going to take 2.5hrs to generate the images I need. Painful. I'll have to do this overnight.
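The 2.5hr figure lines up if this is the same 75-images-per-model comparison mentioned earlier in the thread (that link is an assumption). A tiny Python sketch of the arithmetic, with a hypothetical helper name:

```python
# Batch time budget: images * minutes per image, expressed in hours.
# 75 images is assumed from the comparison described earlier in the thread;
# 2 minutes per 20-step gen is the figure quoted in the post.

def batch_hours(num_images: int, minutes_per_image: float) -> float:
    """Total wall-clock hours to generate a batch sequentially."""
    return num_images * minutes_per_image / 60.0


if __name__ == "__main__":
    print(f"{batch_hours(75, 2):.1f} hours for the full set")
```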
>>107376681
2 more weeks and we'll be able to finish Berserk all by ourselves
>>107376679
30 steps, did 5 batches of 2 at a time while tweaking the prompt
>still no real long video for wan
>every long video lora, node or workflow still relies on last frames
>all produce sudden jerky movement
The color shift doesn't even bother me, there must be a way to "smooth" out transitions between each generated video on the fly? The painter long vid is a good starting point https://github.com/princepainter/ComfyUI-PainterLongVideo if any smart anons can remedy this.
I tried wan windows context nodes and even riflexrope, no luck :(
>>107376707I already finished Berserk by prompting Griffith as my wife and Casca as my dog. Miura would approve.
>>107376691Dunno how well Comfy can make use of your AMD card, maybe someone with one can comment on that and give his s/it or it/s for comparison.
>>107376681
>>107376695it feels a bit like the model thinks peepees and virginas are the same thing and it tries to build one amalgamation out of the two
>>107376737the 6000 series cards are done i thinkthe new triton and miopen updates arent supported for 6000 series
>i have a dream, that one day models will never again be trained on futas, so that they know that a woman should never have a penis
>sir we've found the man who invented bokeh, you know, that background blu-AAAAAAAAAACKKKKKKKKKKKK
>>107376782>ve found the man who invented bokehgod?
The last time I proompted was nearly two years ago. This is fun.anything I should change from the default comfy workflow?
>>107376791truly the best proof this world is made by an evil deity and hell itself.
>>107376767not familiar with amd cards running on comfy, but i faintly remember something about them running on linux with comfy, if you are on windows then WSL (Windows Subsystem for Linux) might be an option, but you gotta research that yourself. good luck fellow vramlet
>>107376774kek
>>107376717>yeah its the lorayes, it makes sense. interesting
>>107376782>>107376791got this shit in 30 seconds, this is magic dude lmao
>>107375938hot damn
>>107376552
1080 here too, at Q6_K and CFG 1 i get 19s/it at 1024x1024
>>107376845lmaoo I didn't expect Claude to accept that prompt rewrite
>>107376694Most people use lightning loras and apparently that's where the slow-mo comes from. Still, 8GB, that's rough buddy.
>>107376872how long is that prompt?
>>107376879https://files.catbox.moe/iuiw1m.txt
Z makes everyone with light skin a little asian
>>107376886this is nuts. i've been writing 70 token prompts
>>107376929Ikr, this model is fucking insane
>>107376876that's pretty neat
>not enough compute to run image model, captioning model, and llm all at the same time:(
>>107376872
>>107376942so sequentialI'd love to pipe my zit outputs straight to wan but that ain't happening
>>107376943>Interview: Tongyi-MAI vs Black Forest Labsoh I definitely know where this is goinghttps://www.youtube.com/watch?v=VIjlkGu_RwA
>>107375729
Would a 3090 be enough to run img2vid comfortably locally or do you need 5090 shit? I currently have a 3060 12gb and get a disconnecting error in ComfyUI when using higher quants or things like that. It just needs enough vram to load in memory right?
>>107376942cant you use the already loaded qwen model for textgen?
>>107376960i wish but it doesnt work like that
>>107376954
>It just needs enough vram to load in memory right?
no, the vram needed can be higher or lower depending on the resolution you are genning and some other things too
you should buy a 5090, if you can, if you want to be serious with local AI. and hopefully you have at minimum 32, preferably 64gb of ddr5 ram as well
>>107376954i run everything on 8gb vram + 64gb, q8 wan i2v and t2v ggufs, bf16 zimage
>realistic images
>>107376954
I have a 3090. It's probably the bare minimum for worry-free video generation. You could probably get away with a little less, but that is the minimum without having to use low vram tricks like lightning Lora.
>>107376978
how many hours left before it's officially monday in china?
>>107376985it's sunday morning, 9:45am
>>107376983there isn't a little less, it's 16gb or 24
>>107376994So we only have 14 hours left before we can doom, grim...
>>107376996Why no 20 gb cards tho.
>>107377003good fucking question
>>107376979is that bald notch lmao
>>107376978
fellas do you know if it's possible to merge noobxl and rouwei clip together? rouwei clip has more styles and characters. noobxl has anatomical and compositional advantage
Z Image Turbo DALL-E 3 style lora 11500 steps in, keeps going.
I wonder if my previous version of this lora on Qwen Image wasn't training this well because I used double the learning rate and trained only until it broke down for the first time, but now I kept going and it fixed itself. I'll have to retrain the Qwen one. https://civitai.com/models/2093591
I only briefly tried to train some basic things on flux and sdxl once before, but still, to me even Turbo Z Image seems quite incredible for training in comparison, it just gets what it needs to do.
I want to sleep, but I don't want to wake up after everyone has celebrated the release of Z-image base...
>>107377043i dont even call it sleep anymore. i call it lora training
>>107376872>straight path is pink
>>107377038waow
Going outside good training pockets and getting slop is worse than blueballs I need base
it does minecraft pretty well
>>107377064How well does it know 4chan?
chinese century
>>107377064>it does minecraft pretty wellforget about minecraft, did you notice in your image it can also render Hatsune Miku? Damnnn, best model eva!
>>107377074the century of humiliation for the US has officially begun
>be me>going through hundreds of my training lora sampled images>suddenly the next image is blurry across the entire face of the woman>FUCK FUCK FUCK FUCK WHICH IMAGE IN THE DATASET WASNT TAGGED PROPERLY.png>realize that the blur moves and that it's actually my eyes that are blurring from a whole day of barely blinkingphew, the gens are safe.
>>107377073Not at all
>>107377043probably not coming until after /next/ weekend
>>107377038Her leg doing something weird
>>107375906That's me back from my 2yr break. And, no I will never upgrade from my GTX 1060. Whoever's maintaining that python cuda bullshit stopped supporting my card, but I figured how to get the old cuda running.
>>107377087Good.
pony v7 > z
>>107377081>the century of humiliation for the US has officially beguni might start learning Mandarin. i dont give a shit about chinese people killing chinese people on june 1989 and the porn ban doesn't affect me as long as I have AI. I for one welcome the replacement of our Jewish overlords with Chinese ones. and I never really had a problem with Chinese immigrants either, just the tourists
>>107375784>the end of the weekendat no point they said "the end of the week end"https://github.com/Tongyi-MAI/Z-Image/issues/7#issuecomment-3586493968
>>107377134based, they're the lesser evils at this point
>>107377134I'm waiting for a speech recognition model that isn't just speech2text2speech so that he can precisely teach how to pronounce words before starting
>>107377132
>>107377137I never bothered to check into the claims but I assumed the whole weekend claim was bullshit. Chinese people have weekends too.
>>107377134Unironically China has more freedom than my shithole country at the moment.But at least houses cost over a million dollars, that's a good thing right? ...right?
>>107377134tianamen? it wasn't the state that did the violence.. it was the student protesters
>>107371032How come it's not possible to use this thing on Comfy?
>>107377205>vladmandic/sdnextThis is like the worst possible, most bloated UI you could support. No neo, no Comfy... It's insane.
>its the end of november>no ltx2i weep
i can say without a doubt z image shits on any model i tested previously on my rx 6600
its insane, it just generated images i thought it could never generate, no details on what was generated
this is the best model out there right now, i think we're going to get nano banana pro levels of detail with image editing
>>107377234they literally said mid-decemberbut they're also kikes, so who nose>>107377199>tianamen? it wasn't the state that did the violence.. it was the student protestersdid i stutter nigger? i just said I don't give a shit
>>107377107that's not her leg anon...
I have an increasing number of minor gripes with Z-Image but for what it is at the speed it does it, it's hard to fault
e.g. fairly similar results here between it and Flux.2, I think both are good, Flux.2 of course took a zillion years longer though
>>107377333I wish it could do anything but the most basic of poses.
>>107377333Try 8 steps using euler with the normal scheduler instead, the comfy workflow adds a bunch of extra noise for no reason.8 steps + euler normal is basically what z-image used in their reference setup.
>>107377266"huehue my country sucks i should move to another country, but i really don't care about my previous programming where other country=bad"ya you'll get far
is it normal for my gpu to not be 100% utilised with max power draw at all times when generating shit
What comes first, Z controlnet or edit model? Img2img is too finicky, changes too much
>>107377363Not unless you’re genning videos or with insufficient vram.
>>107377370so i need to repaste the heatsink it keeps jumping from 100 to 0% utilisation
>>107377333
Did it lose the texture on the right, or was it without the Ben Day dots from the beginning?
>>107377366>What comes first, Z controlnet or edit model?Z-Base
>>107377363no, something must be bottlenecking it
>>107377394the temp on the hotspot sometimes quickly reaches 80 degrees c i am being cucked by the thermal paste i think
>>107377420your thermal paste is fucking your wife in front of you?
>>107377420Yeah could be, I had to RMA one of my cards because it would overheat/throttle and even freeze during gens, this one I've got can go all day at full tilt.
>>107377428its gotten bad lately
Is there a node out there for wan 2.2 to do its generation in 1 go instead of high noise, pausing, then low noise? I know phr00t's wan exists but it's a little too slopped
someone bake a new cake i need to show my beautiful image to everyone i generated just now it took me 1300 seconds
>>107377445Are you using a Voodoo 2 or something?
>>107377350Yeah. In any case while Flux.2 is a very very good model both editing wise and T2I wise I think that BFL building both capabilities into the same model was a giant blunder, there's no way around it lending to the perception of it having a bad "size to quality" ratio given that you still have to load the entire thing no matter what. And I also feel that there's TEs that BFL could have used that are smaller with equivalent or better performance than Mistral-Small-3.2-24B-Instruct-2506 (maybe GLM-4.1V-9B-Thinking for example).
>>107377458the prompt was quite hefty
>>107377355??? I used 9 per pass based on the Z huggingface which states 9 inference steps actually amounting to 8 proper DiT steps. NOT doing the upscale and trying to go right to 1536 in one go would for sure be worse if that's what you meant, that wouldn't prove anything / wasn't the point here at all.
>>107376243nice. catbox?
>>107377458It's all math at the end of the day, you could spend your entire life diffusing an image by hand
>>107377420If your temperature delta between core and hot spot is more than 14C, you probably have thermal paste pump out. Phase change pad like PTM7950 will fix that.
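That rule of thumb as a minimal Python sketch. The 14C threshold is the one from the post and the 80C hotspot is the reading mentioned upthread. The 62C core temperature and the function names are hypothetical, so plug in whatever your own monitoring tool reports under load.

```python
# Rule of thumb from the post: a core-to-hotspot delta above ~14C
# suggests thermal paste pump-out. The threshold is the poster's figure,
# not a vendor spec.
PUMP_OUT_DELTA_C = 14.0


def hotspot_delta(core_c: float, hotspot_c: float) -> float:
    """Difference between hotspot and core (edge) temperature in Celsius."""
    return hotspot_c - core_c


def likely_pump_out(core_c: float, hotspot_c: float,
                    threshold_c: float = PUMP_OUT_DELTA_C) -> bool:
    """True if the delta exceeds the rule-of-thumb threshold."""
    return hotspot_delta(core_c, hotspot_c) > threshold_c


if __name__ == "__main__":
    core, hotspot = 62.0, 80.0  # core value is a made-up example reading
    print(f"delta {hotspot_delta(core, hotspot):.0f}C, "
          f"pump-out likely: {likely_pump_out(core, hotspot)}")
```

Read both temperatures at the same moment mid-gen, since the hotspot spikes faster than the core.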
>>107377478>1gril, iphone_selfie_lora_9999.safetensors, very generic, (uninteresting:1.5)
Fresh>>107377493 >>107377493>>107377493>>107377493Fresh
>>107377488you've done him
>>107376797>antanholorry
>>107377467You could've just posted it on catbox or something