Discussion of Free and Open Source Diffusion ModelsPrev: >>107757207https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/kohya-ss/sd-scriptshttps://github.com/tdrussell/diffusion-pipe>Z Image Turbohttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>WanXhttps://github.com/Wan-Video/Wan2.2https://comfyanonymous.github.io/ComfyUI_examples/wan22/>NetaYumehttps://civitai.com/models/1790792?modelVersionId=2485296https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe|https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
8 minutes 47 seconds
>>107762587https://www.reddit.com/r/comfyui/comments/1pgvdgo/impact_pack_is_trying_to_connect_to_youtube/
>>107762645thx anon, good reminder to use a standalone computer for this shit
>>107762645that's ultralytics (YOLO) telemetry though, nothing to do with image gen or what you're promptingalso it can be disabled
base when
idk maybe it's just my dataset but it feels like ZiT has some problems with picking up bodytypes that are noticeable different from the default one. eg shortstack girl comes out as almost normal sized, you can kinda wrangle ZiT with prompts but it feels like it's strugglng to pick up on physique, while the likeness comes out perfectly. At least that's my impression, I'm using old datasets, and with XL models the bodytype came out way closer.
A little tip: use something like https://setpose.com/ to get the perfect pose and angle for i2i and play with denoise values. I mostly use it with zit. Play around with all the different settings to see what the free version can do. Also, change the color of the model to the skin color you want. Zit is pretty good at following the poses.
>>107762645>>107762672yeah this isn't from comfy it's from ultralytics (which is still kinda shitty), but i had this happening from when i was training vision models for another task already. disabled it now, still gonna use the nodes.>>107762699my body type loras are almost too strong
What aboutWhat about three steps Ksampling?Or more.
>>107762705seems straightforward enough
>>107762592working on doing a Lora of an OC, my workflow at the moment is to use a base SDXL model to make 20 images, various poses, expression ect, with aorund 15 of them containing her regular outfit and 5 in normal closes, im color matching the skin and hair, eyes ect to try and get them as close to each other as posible and then plan to use them to make a Lora using SDXL v6, thing is im not really happy with the results, things youd never consider like eye size being slightly off, head shape, things like that theyre so hard to control and get sonsistent that im thinking of starting again from scratch, has anyone had any luck doing this? if so what model did you use, was considering Illustrous based ones if this doesnt work out
>>107762751
are lustify/chroma still the best nsfw image models? anything new worth checking out?
>>107762724>my body type loras are almost too strongodd idk, I mean i can see it kinda picking up the bodytype but not quite, like that fitness influencer dataset I have, she is kinda muscular but has kinda short legs, I have enough clear full body pics but it always ends up giving her normal length legs and so on
>>107762751wups>>107762805if you have your dataset captioned try uncaptioning it and see what results you get
>>107762762
>>107762882whats the denoise you use
>>107762850>if you have your dataset captioned try uncaptioning it and see what results you getYeah its captioned, worth a try I guess, I always used captioned sets so far, might be interesting what it spits out this way
>>107762850question, how is Z turbo with Anime style and can you make Lora's for it yet?
>>107762675Nice style
>>1077629010.78You can get away with high values when the pose is very readable and easy for it to interpret, just as long as your prompt aligns nicely with the pose you're feeding in.
>>107762592>4/10 of the gens are mineholy kino thread of ultra friendship or whatever!!!
>>107762945tran is too much of a spiteful bitch for the collage to be taken seriously. the thread blessings are tarnished by a half nigger tranny with bad taste. you should feel bad for supporting a drama faggot and avifag worse than debo
As a techlet retard, why do some models seem to pick up the concept being trained gradually, with the sample pics looking coherent but slowly changing, while with other models the whole thing suddenly collapses into an garbled`barely recogizable mess when it starts cooking, with the training pics only slowly becoming more coherent again as time progresses?
>>107762919It can't do any styles but you can make loras. Could from week 1 even.
>>107757565nmp
>>107762919i don't do a lot of anime but it's pretty great out of the box if you're just referencing stuff
>>107763003>>107762983Nice
>>107763131you have zero reasons for living. all you do is think about ani. cry about it
>me when I read someone sneeding about the fagollage
posting 1girl standing looking at viewer? couldnt be me!
we finna be gay af underwater!
>>107763267
>>107763305>brown frostingcoincidence? I think not
>>107763226why would you gen small breasts
>>107763305Fitting brown frosting
>>107763319you realize not everyone likes cow tits right? I prefer normal sized tits, if you think those are small, man, you might be a coomerbrained porn addict fag
Blessed thread of frenship
>>107763339these threads have been cursed for a long time
>>107763321>>107763315tfw they get the frosting joke but not the pink roses>an anime still, a man in a cool outfit leans against a car, in the style of..No oneHayao MiyazakiMamoru HosodaMakoto ShinkaiSatoshi KonShinichirō WatanabeSunao KatabuchiYoko KannoNaoko YamadaMamoru NagaiHiroyuki Imaishi
>>107763377wrong image, it's this one
>post in /ldg/>get mass deleted due to some kind of mixup with the actual spammersad
>>107763401do you masturbate to controllers retard?
>>107763412This might be shocking to hear but some people gen things they don't masturbate to
how can I post on 4chan after my pass expires, they block all vpns and random range bans blocking images
>>107763412n- no...
>>107763305
>>107763439what the fuck?
>>107763401stop spamming your shitty controller you dumb fucking nigger retard.what are you? retarded?gen some new material fuckin moron
Proud of yourselves?
>>107763591alright pack it up everyone, a women has got the ick
>>107763605billions in compensation awarded, all schools to teach on the dangerous of virtual rape
>>107763591kek>I know it wasn't me, but I FEEL it was menice bookend
>>107762850I will be so happy when girls go ultra thin route mental illness instead of mega fat routethey'll go mental illness either way for the concerned demographic
>>107763591>I got robbed! Buy my nudes for 15$ on onlyfan!
>>107763591>posts her age, real name and countless attentionwhore photos of herself on the internet>she seethes at thisIs this the norm nowadays? Are women unironically this fucking retarded in 2026?
>>107763685incentivised retardationwhat other mind could concieve of 'sexist air conditioning' or 'pink tax'
https://github.com/huggingface/transformers/pull/43100>GLM-Image AR Model Supportmaybe we'll be saved from (((Alibaba))), I always loved the LLM GLM series, they are more soveful than the rest of the LLMs, let's hope that's also the case for their upcoming image model
>>107763439Dude that's impossible.
>>107763808i haven genned past 10k images at this pointmasturbated to around a dozen or so
>>107763584holy meltie over controller anons kino
>>107763584>a billion flowd fent obsessive images, more floyd than even media at the peak of their obsession with him>10 billions hitlers>but clearly, the dozen controllers posted are way too muchthanks for taking a stance
>>107763806>maybe we'll be saved from (((Alibaba)))and then it's another incredible turbo model and we'll also have to wait for base for that one
>>107763806>ARso it's an autoregressive model right? we never had a good AR local model, so let's hope that one will break that cycle
>may 24 2023 /sdg/ nah. what gives.
why have we regressed
what happened to this expressiveness
>snubbedFuck you all. Tasteless retards.
>>107763975>why have we regressedwe haven't, Z-image turbo is a way powerful model than anything we got before
this was peak soul
>>107763878>but clearly, the dozen controllers posted are way too muchthe other niggers are annoying too but you are not different from them.you are the same.stop doing that nigger retard faggot assfeetus
take me back >>107763996that's the best you can do with cutting edge models?
slop hours
we have been rug pulled and didn't even realize
>>107764009that image looks like ass the architecture is completly broken, why are you pretending this is the peak of local lol
>>107763806Why's some Russian faggot re-linking this everywhere? His Babushka die or something?
>>107763806I think it'll be good, they have no reason to release an image model if they know everyone will ignore it if it's worse than ZiT
>>107764025where did all of that potential go
>>107763806https://github.com/huggingface/transformers/blob/cd8d78fcb4067979e921b20163d62035c51b4e7f/src/transformers/models/glm_image/modular_glm_image.py#L794>=== Case 1: Image-to-Image Generation (single or multiple source images + 1 target image_grid) ===>=== Case 2: Text-to-Image Generation (no source images + 2 image_grids for multi-resolution) ===it's an edit model
>>107763993>>107764014same person
>>107764076wrong
>>107764041>I think it'll be goodI hope so, that'll force Tongyi to release the base model if it turns out we'll move on without them lmao
did anyone ever figure out how to make wan video lora locally on 16gb ramlet card?
>>107764009Even at the time I thought this was ugly and an abomination. I suspect only subhuman retards find it appealing.
>>107764009I like the idea, I wonder if it can be recreated with less impossible stuff.
>Even at the time I thought this was ugly and an abomination. I suspect only subhuman retards find it appealing.
Base is a collective hallucination.
whats anonies issue?
>>107764194>a collective hallucinationthere's a french meme about that sentence lolhttps://youtu.be/MhIbTEue2ew?t=90
>>107762705i use magicposer for poses. very simple and intuitive.
>>107764219holy shit, the memories, I vaguely remember people driving this guy nuts, kind of sad, they could have left him alone
what's the best penis substitute for z-image? i've ben using a carrot
>>107764248Do it trib style. Take a pic of your own cock over the gen.
>>107763591she feels humiliated for that weak grok. i can't even imagine her reaction, if she saw my bbc punishments, lol
>>107764268qrd or keyword for me to do my reps?
>>107763591>Proud of yourselves?should've read the ToS, by uploading her pictures on Grok she agreed on having people making parodies of her
>>107762592Hi /g/,I tried AI generation in the past but my GPU sucked. I got a better one now and I'm trying to get a hang of the basics (light/colors/camera/styles) on SDXL before I try to do anything with flux or illustriousI suck at prompting though, and I can't get Comfy to do what I want it to doCan anybody give me suggestions on how to improve my prompting?I'm assuming there is an AI tool that can help with that?
>>107764219c'est pas drôle du tout çabase ou je lance une bombe nucléaire vers la Chine
>>107764329désolé monsieur anonyme, mais la culture chinoise est plus forte que tout
>>107764308prompting depends on the model: get z-image turbo, and just ask any llm to improve the prompt for you. you shouldnt use tags like everyone did before.
>>107764308>Can anybody give me suggestions on how to improve my prompting? proficiency is gained only through trial and error. spend four years prompting and itll start to click.
>>107763806If they made an AR image model based on GLM-4.6V-Flash 9B it might be interesting, or else it's DoA
>>107763591>le Musk did this!hate the journo faggots so much. Musk or Grok didn't do anything you dumb cunt. It was the person who used the tool. Other women are more than happy to have a tool that can put them in a bikini. What about their rights to enjoy the bikini tool? What about their rights to enjoy others putting them in a bikini? Fuck off if you don't like it.
>>107764470Musk should have changed the name to Le Musk, that would have been on brand for his sense of le humor.
>>107764470>Musk or Grok didn't do anything you dumb cunt. It was the person who used the tool.it's even worse when you know it's those bitches that are doing this to themselves, if they don't want to use a website that allows image editing, they can just leave the site
>>107764248use a banana for scale
>>107764308When you have an image in your head you want to generate, really think about not only what you're 'seeing', but also what you're not actually seeing; don't EVER mention the things you're not seeing in detail, or it will try and generate it. Say, for example, you want someone with their hands handcuffed behind their back, don't prompt 'with their hands handcuffed behind their back', for it will try to generate their hands in handcuffs in the image and will likely go schizo about the handcuffs specifically. Instead, don't say anything about the handcuffs, just say 'hands hidden behind their backs', for what the hands are doing behind the back (whether they're handcuffed/tied or not) really doesn't matter if you can't see them anyway. Apply this thought process to everything when prompting. Be detailed about the things you want to see, not the things you don't. It doesn't need to know everything in your head or the intent/context of the image, it only needs to know exactly what you can see in your head and no more.
>>107764484Isn't that the spider bitch from Wicked City?
SVI 2.0 is pretty impressive desu, now if only making one minute video wouldn't take ages it would be fun to do as well
>>107764599I guess you used the same prompt for the whole minute?Can you make her turn/spin, I wonder if it's able to maintain her face for a whole minute.
>>107764351>>107764381>>107764514thanks!
>>107764630>I wonder if it's able to maintain her face for a whole minute.I won't do that again it's too long, but you can keep her face consistent yes, look at that examplehttps://www.youtube.com/watch?t=603&v=PJnTcVOqJCM&feature=youtu.be
>>107764599>SVI 2.0what is it? never heard of it before.>making one minute video wouldn't take ageshow long did it take you? hours? also what gpu?
>>107764655>>SVI 2.0>what is it? never heard of it before.they finally found a way to make Wan do longer videos without having it look like garbagehttps://github.com/vita-epfl/Stable-Video-Infinity
>>107764551the one and only
>>107764678woah thats neat.whats the catch?takes 10 hours to gen a 1 minute video?
>>107764719well, imagine the time it takes to do a 5 seconds Wan 2.2 video, now multiply that time by 12 to get a 60 seconds video
>>107764646how the hell is it able to keep face consistent??
Turbo can't deliver the same natively, I cry.
>>107764719it takes me 13min to make a 6s video in 720p that looks ok, so 2h for 1minthat's something I can run at night
>>107764678Can you edit parts of it, aka not forced to generate everything at once?
>>107764599If you sped that up to look real-time, it would be 5sec long...
>>107764738>how the hell is it able to keep face consistent??black magic
Does training on dedestilled ZIT make a noticeable difference?
>>107764599is there a 10-15 sec svi, and for comfyui nodes?
>>107764828you can go for any length you want, look at this youtube tutorial he explains it and provides a workflow on the video descriptionhttps://www.youtube.com/watch?v=XGB4qBkCFSM
>>107764828The workflow for making 10 seconds and 1 min is nearly identical. Just use less samplers
>>107764755you can yeah, since you have to provide a new prompt for ever 5 seconds
>update a few nodes but not comfy>everything works but wanwrapper workflows are now broken>had a previous comfy backed up with a working wanwrapper>revert to older wanwrapper>still fucking brokenWHY
why do you guys even update comfy?
>>107764870i don't knowbut i bricked my install. im using the goyapp now
>>107764870>why do you guys even update comfy?it's not like we have a choice when there's a new good model that comes out and that we have to update it to run it
>>107764870a lot of us dont. not sure what the ones who do are doing to get the latest comfy to work but every time i try to update it, since december, everything breaks. good thing i have working backups
>>107764856It means you can export the latent video and reinject it later?
why did comfy remove the old job queue and replace it with a system where you now need two windows, one for the queue and one for the finished results, and also make it so that you have to double click the image to see it, and also make it so that if you hover over a job you get a third job details window that will often get stuck if you dont hover over it? anyone know why he did this?
>>107764870Never had any problem doing so.
>>107764897>when there's a new good modellike...?
>>107764870to complain about the UI more>>107764909comfy never was able to do frontend so he hired jeets to do it
>>107764918Z-image turbo, and I guess that upcoming model >>107763806
>>107764897no model is good enough on release. it needs to be fine tuned and redistilled and have the first wave of loras and controlnets done before it can be used seriously
>>107764870New model support and also I hope the new release will unfuck things the old release fucked up.
>>107763806>>107764938what the fuck is glm image and why would it be good?
>>107764926>comfy never was able to do frontendI feel him, I'm way better on the backend side>he hired jeets to do itthat was a big mistake, he's not that good on the frontend but he's way better than your random jeet
>>107764945>why would it be good?the glm guys are dominating the local ecosystem on LLMs, so maybe they'll do the same on image models, they're not randoms, far from it
when will bfl create a safer flux? I could generate non ugly women with flux 2 wtf
>>107764956GLM is good but I'd say the most impressive team is the deepseek one, I wish they had any interest in image/video, these guys also publish every trick they use.
>>107764947he just used the light graph framework and didn't touch anything but I agree that nothing should have been touched if this nu-ui is the result
>>107764956All the Chinese LLMs are within spitting distance of each other, GLM 4.7 was just the latest release. They all borrowed a lot from DeepSeek.
>>107764998>feeling cute today, might never release baseCHAAANGGGGGGGG
SeedVR2 doesn't bring me joy consistently.
>>107764870>>107764725nooooticing
Am I missing something with Chroma lora training? I'm trying to train a character lora and it's barely picking up the likeness of the character. I'm using >40 images>adamw>batch size 2>lr 2e-4>alpha/rank both 16 >100 epochsI don't really know what's happening or if I should just train further.
>>107765006you mirin that culture, huh?
>she is spreading her legs revealing her panties>a third leg pops out of her crotcheverytime.pozzed model
Enjoying your base model you gullible retards?
>>107764956>dominatingdominating is a stretch but they have by far the most sovlful models, let's hope it's also the case for their image model
>>107758571>Enjoying your base model you fucking retards?>>107765046>Enjoying your base model you gullible retards?I wonder what slight variation he's gonna bring to the table on the next bread kek
>>107765033what resolution? Chroma sometimes struggles to pick up details if you use 512+ resolution.
>>107765057so where base
>>107765067how should I know, I'm not a Tongyi employee :(
im beginning to hate the chinese
>>107765064I'm using 512 right now, OneTrainer's default settings for Chroma downscale them to that. Should I bump up the resolution to 1024?
>>107765067>where baseIn China
>>107765047>they have by far the most sovlful modelsWhat exactly do you mean by that? How can an LLM have sovl?
>>107765067because you touch yourself at night
>>107765074I hate all researchers equally. all of them are sellouts
>>107765033The images are too different or your captions are bad or too short. Sometimes, it just doesn't work for a particular model. Try training it on only one image and see if the likeness if picked up in the sample images.
>>107765072she looks like Simona Halep
>>107765122only oldfags know who that is
I neither hate nor love, I just enjoy the free stuff we get every other week.
>>107765103>What exactly do you mean by that?all the LLMs sound the same, they are boring and write corporate bullshit, except the GLMs models, they write like a human would do, and they have way more imagination, maybe they're not using as much synthetic AI on their dataset training as the others
>>107765127desu her "doping" scandal is pretty recent
>>107765082>Should I bump up the resolution to 1024?No, it will learn even slower. Could it be bad captions? Here's some settings I used that worked just fine. I had rank 64 and alpha 32
>>107765132>GLMs models, they write like a human would dodo you mean less sappy shit like "mischievous glint in the eye", "ball in your court", "half lidded eyes", etc
>>107765145yeah
>>107765132>except the GLMs models, they write like a human would doYou sound like a shill.
>>107765074I gradually came to hate them
>>107765151oh, I'll take a look then
>>107762592>none of the pixelart kino made it into OP Absolute disgrace. Shame on you.
why isn't the new qwen popular here?
>>107765142I'm guessing it's probably the captions. I'll redo those and do another run with those settings you posted. Any recommendations for how to caption? I've been using joycaption, not sure if I should use something else.
Chinese culture
Remember when you were all jerking each other off over a misinterpreted discord message that the base model would be out ON the weekend the turbo model came out?You are all so clueless about Chinese culture it's obscene. >But it got merged into diffusersMerge it into my ass because that doesn't mean anything.>But the chinese guy on twitter said it was comingHe said wan 2.5 would be open source too>But they said wait a little more on the discordWait for what exactly? You and I both know the people there have no say over what gets released.At best you're getting an API model. Chinese. Culture.
>>107765187Joycaption works just fine for Chroma. 2-3 concise sentences is enough. You could also try without captions. I used ChromaHD as base.
>>107765131This. Imagine complaining about free gimmies. Must be a burger thing.
Still on a ddr4 system right now and wondering if I should stick with it and get 64gb ram or just do a full upgrade.
>>107765211>that doesn't mean anythingwhy? because you said so? why would they even put the effort on bringing the inference code on diffusers if they were sure they were gonna go API?
>>107765219we complain about lies and broken promises, if they said from the begining they were not gonna release base no one would bat a fucking eye
>>107765231Chinese culture.
>>107765006This reminded me of the unfathomable number of githubs projects last updated 4+ years ago with "model to be released soon" on the readme I've seen, ogre
>>107764998>>107765257is this a reference to deep dark fantasy? loolhttps://www.youtube.com/watch?v=Tg82nutmTwI
>>107765257>unfathomable number of githubs projects last updated 4+ years ago with "model to be released soon" on the readmeSurprisingly good understanding of Chinese culture in /ldg/? Wow
Base model will be heavily censored.
>>107765280Black Forest Labs promised a video model all the way up to 2024, I guess that was the G E R M A N C U L T U R E kicking in this time
https://xinyu-andy.github.io/SelfE-project/Trust me bro, this one will definitely replace Self Forcing!
>>107765289That's just cultural appropriation.
>>107765033With Flux (same difference), Cosign with Restarts and Prodigy (I think? that auto one) always worked best for me. Rarely had to go over 3k steps with 45-50 images before it nicely converged.Could just be Chroma though, I've never had a LoRA outright fail even without captions. It always picked up something.
>>107765269nah, just my kink
>>107765331>Hitler if he was accepted to that painting school:(
>2026>we're still on Wan 2.2 for videoAPI chads are laughing at us.
>>107765346Z-video base will save us
Floowandereeze & Robina
>>107765346>are laughing at usthey're censored and limited. even api professionals use local models, for serious projects
>>107765419very nice
>>107765430>even api professionals use local models, for serious projectsand they end up making bad shit like that coca cola ad lmaohttps://www.youtube.com/watch?v=Yy6fByUmPuE
xixxix anon pls upload your 'jak lora
>>107765456>vaporwave>90serm
>>107765464i guarantee there's hundreds of videos like that on youtube, regardless of the accuracy of the premise
>>107765435ty same to you
maybe autoregressive open weights zai-org/GLM-Image model soonhttps://www.reddit.com/r/StableDiffusion/comments/1q42gv8/glmimage_ar_model_support_by_zrzrzrzrzrzrzr_pull/https://github.com/huggingface/transformers/pull/43100/files
Haruhi is waiting
>>107765509yeah we know >>107763806
Is there NAG for forge neo? My zimage gens on neo are coming out ass compared to cumfart.
>>107765509Watch this get released very soon with no Z image base. And people will still make excuses as to why it hasn't be released yet. A key misunderstanding of Chinese culture.
>>107765509>that'll be 72gb of vram
>>107765523if that glm image is as good or better than z-image turbo while being a base model, I won't need to wait for Z-image base anymore, Alibaba can fuck themselves for what I care
>>107765510loooooooooool
>>107765509>>107765528if it's based on glm 4.6 9b flash we might be eating good
>>107765443Retraining atm. Trying regularization dataset because it overfits so fast
>>107765509>>107765543>if it's based on glm 4.6 9b flashWhen you look at the configuration_glm_image.py script you see this>hidden_size: 4096>num_hidden_layers: 40>num_attention_heads: 32>intermediate_size: 13696which is the exact same as 9b flash, so you're right anon, it's a 9b modelhttps://huggingface.co/zai-org/GLM-4.6V-Flash/blob/main/config.json
>>107765532>>107763806>I always loved the LLM GLM series, they are more soveful than the rest of the LLMsI think they are the best of all local models for creative writing right now, including having the best soul out of the big models. Although I think all LLMs since big Deepseek/Qwens and especially since old big Mistral releases lost a nice amount of actual soul overall, it's just that new models make it up with IQ.I don't think we'll get any big architectural breakthroughs with GLM, but it will probably be a good incremental leap.
>>107765607flash was released 29 days ago, and they took them less than a month to transform it into an image model? that's fast wtf
>>107765628q2 sounds like a meme quantization
>>107764998If he didn't have granny skin he would be pretty cute
>>1077656641 hour in AI is 3 days in real life
>>107765690if you die in ai you die in real life
>>107764998>>107765687>If he didn't have granny skin he would be pretty cutethe interesting thing with Z-image turbo is that you can stop the generation a few steps before (with KSampler advanced) than the expected number of steps (let's say we stop at step 18/20), and that can get rid of that overdetailled skin
>>107765676Q2 on really big models isn't a big deal>>107761981
>>107765761That seems reasonable
>>107765761Even Q1 of 671b R1 was SOTA for months. When the models are gigantic it's not the same, especially for LLMs.
>>107765710>20 stepsI also noticed that the more steps I added the worse it got, did you notice anything similar?
>>107765800for text I feel that 8 steps isn't enough, that's why I went for more, but you're right 20 might be overkill, this model wasn't trained on so many steps
>>107765509oh god please make z-image base obsolete before it ever gets released, that would be so funny
>>107765509>Chongdanov, we finally got a worthy rival to our Z-image base model>drop it
>still no base or editis qwen the best edit cope for the time being?
>>107765828Z Image Turbo won because it's small enough AND good enough. Unless GLM is as small, size/quality won't balance.
>>107765872>Z Image Turbo wonyou cant even combine two loras
>>107765872>Unless GLM is as smallit's based on glm v4.6 9b flash, so it's the same size as Chroma >>107765607
>>107765878to be fair, the fact it's working on loras at all is a miracle on itself, it's a double (guidance + steps) distilled model after all
>>107765878play with the layers of each
thoughts on the dataset i'm building?
>>107765927>thoughtsLooks like a god's chosen dataset to me!
>>107765346they're too busy spending hours trying to jailbreak the model to show a nipple for half a second
>>107765927delete this
>z-image vs flux 2Flux 2 is truly a western model
>>107765927>most are from the shoulders up, portraits lol
>>107765966>I love shekme too
>>107765969>westernI'd say european, burgers have no issue with gunshttps://www.youtube.com/watch?v=Y-3IV11_ZgA
>>107765977I tested again and I think I did something wrong, the girl is still ugly though.The dirt on the mirror looks good, so maybe its not that bad of a model.
>>107766050I better hope a 32b model is "not that bad" kek
>>107766050She wears that and has a dirty mirror, of course the model will default to ugly.
>>107765927add this one too
>>107766101thank you, i do need more. i just googled ugly jew and took the best ones
>>107765927waste of time, need hot girls not this crap
>>107765969flux 2 made a safer less feminine pose lmao
>>107766050show booty
>>107766158she has an uglier face too
>>107766379>slight pantyshotvery important details, thanks anon
>1girl
1girl Is All You Need
1girl is great, it links us to our ancestors doing the same thing since forever
>>107763403This is literally me
Inside the car there is 1girl.
>>107766419https://github.com/comfyanonymous/ComfyUI/pull/11632We'll get LTX2 before z-image base looooool
>>107766434im sorry but that doesnt count as 1girl
>>107766434quite the beauty
>>107766439>its audio + videoHUGE, I mean the audio sucks but MAN WE WONNED!!!!!!!!
ready when fresh >>107766478>>107766478>>107766478>>107766478
>>107766469rude>>107766465the car identifies as a girl
>using wan loras for i2v>works really well for a number images>suddenly just doesn't work at all and just makes the people awkwardly shift around for every genWhy does this black magic get so temperamental for no reason
>>107766472>>107766608put those on the next bread dude, they are good lol
>>107764599how long did it take you? I'm gonna try this too once I finish the setup
>>1077666513 mn per 5 second split, so 3x12 = 36 minutes
>>107766608based
>>107766608Link?
>>107762882that from zimage or qwen edit?also holy fucking shit these captchas are garbage
>>107764017how do this? this that QR code generator thing?
>>107766782literally 3 year old stable diffusion could do this
Are there any uses for SD1.5?
>>107767841for the sovl
>>107766480thanks for the fresh bread
What would a genuine AI enthusiast buy now for 15000$ ? rtx 6000? A100? multi 5090 setup?
>>107769647I'd get three 5090s, a threadripper or whatever and as much ram as possible