Discussion of Free and Open Source Text-to-Image/Video ModelsPrev: >>107390428https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/kohya-ss/sd-scriptshttps://github.com/tdrussell/diffusion-pipehttps://github.com/ostris/ai-toolkit>Zhttps://huggingface.co/Tongyi-MAI/Z-Image-Turbohttps://comfyanonymous.github.io/ComfyUI_examples/z_image/>WanXhttps://comfyanonymous.github.io/ComfyUI_examples/wan22/>NetaYumehttps://civitai.com/models/1790792?modelVersionId=2298660https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
I tried expanding my single-sentence prompts (An empty Japanese school classroom after a zombie apocalypse. There are two sleeping spots and other mess) with qwen-4b, and I'm quite happy with the results. It completely fixes seed similarity since every prompt is very different. However, it's not efficient to run an llm, then unload it, and load the same model as text encoder.Does anyone have ideas how to reuse qwen without unloading it? Speaking of Comfy, obviously.
>>107392979>qwen-4bthe base model? the instruct model? the thinking model?
>>107392990For testing, I just used the free version on OR. I think it's thinking. But I think zit uses the original instruct one.
>>107393008that's too bad the text encoder is the base model, we could've used it to rewrite the prompt as well
>nigbo
Comfy must be dragged onto the streets and
thanked
https://civitai.com/models/2182094/z-image-asian-girl-33?modelVersionId=2457065Finally, a female asian lora, exactly what this model needed
MushroomBase is going to be worse than Turbo according to their paper, right?
>>107393046maybe its a particular aesthetic?
>>107393055since turbo was finetuned so I guess?
>>107393055Your mum should've pulled out, right?
>>107393058Perhaps it's a model for a real person and he's trying to get past the real person filter by saying it's an asian girl? I wouldn't be able to tell since you already know why
CivArchive still doesnt have a Z section. Wonder how many have been purged from Civit
>>107393055prompt?
>>107393058>>107393046no it's some jeet reusing an old data set to train garbage on site in order to farm yellow buzz, just like the majority of the slop you find on civitai
>>107393086Celebrities are not allowed there due to the (((payment processors)))It's perfectly legal to train / generate on them though, just not posting anything sexually explicitSomeone trained some Zit celebrity loras and put them here, maybe there will be more: https://huggingface.co/malcolmrey/zimage/tree/main
>>107393118ffs man why would jeets train asians? if anything theyll train the ones theyre stalking locally
https://civitai.com/models/2176378/afterdark-z-image-turbo?modelVersionId=2450841literally sovl vs sovless
>>107393119oh yeah I remember this guy! He had hundreds of models uploaded
my bloody rx 6600 scored 18000 in this test it shouldnt be so slowaverage is 15000 for the same modeli have repasted the gpu and its doing even better than before
Local qwen 2.5What can I do about this?
>>107393065My mum? I'm confused by the logistics.>>107393094Studio Ghibli Dark Fairytale, intricate expressionistic Portrait of a female myconid, a humanoid fungi entity with a headpiece shaped like a chanterelle mushroom. She wears a long yellow dress made of natural materials and leaves, her skin is tanned and she is standing in a dark and humid cave, surrounded by glowing algae, moss and fungi. Her eyes are glowing in a red colour and she is completely expressionless. She has a regal appearance
>>107393131>170mb lora trained on 20 imagesi bet it's just 512x512 images of smiling faces as well
>>107393161>qwen 2.5why not go for qwen 3?>What can I do about this?use an uncucked finetune of qwen I guess?
>>107393162box it pls? thats super clear af
>>107393170meant for >>107393119
>>107393153this is hugei repasted and now the gpu is generated 4 times faster??? it was doing 400s per iteration on the same workflow
>>107393188I agree that 170mb for a single person lora seems too much unless they trained many images at 1024, no idea what the quality is like though
>>107393172It's the only one I knew how to get idk what to do with the split models I see on hf>>107393192shits slow as hell.. is that what amd has to deal with?
>>107393233Have you undervolted your gpu? Could help a lot.
>>107393233Use a qwen abliterated, it's literally just a qwen that got its automatic answer for naughty things removed out, no further finetuneIt doesn't make it more knowledgeable about naughty things that it hasn't been trained for, but you won't get "I'm sorry Dave but I can't talk about that."
Is there a way to run the qwen prompt enhancer as a part of a WF so I don't have to switch to HF? Can I reuse the LLM loader for gemma models that some animoo models use?
>>107393055Damn, mushroom girls.(Please be gentle, mods. It's censored.)
>>107393279I think you replied to the wrong guy>>107393307I use the workflows from this: https://github.com/Koko-boya/Comfyui-Z-Image-Utilities
>>1073932336000 series yesits a lot better for 7000 series
>>107393329isnt that API shit?
>>107393348local with ollama
>>107393329>https://github.com/Koko-boya/Comfyui-Z-Image-Utilities>pircelbased, he knows what he's doing>>107393355>ollamawhat's that? I don't have to install something else on top of comfyui right?
>>107393355Can it load goofs? Gonna need to run it at Q8
>>107393170that's the malcolmrey special!
>>107393279yes i didits stable as well
I got no imagination
>>107393388Impressive it did the top of the box.
>>107393372https://github.com/Koko-boya/Comfyui-Z-Image-Utilities?tab=readme-ov-file#installationfor the package installation, you can make your life easier by simply doing it on the comfyui manager options>Instal PIP packages
Qwen Image is such slop compared to zit... Qwen has absolutely no idea how things in non-pristine condition look like. It's a real issue with most models, but zit is at least trying.
>>107393380you should be able to >>107393372>I don't have to install something else on top of comfyui right?You do it's a local llm server you can run offline
>>107393402That fucking cobweb on the bottom lmao.
its doing ok for 9 steps
>>107393402Yeah, Z just directly BTFOs qwen and i assume the same will happen once z-edit gets released.
I can't use Resize V2 in this particular workflow, how can I set this up to flip between the two nodes with a button? I'm not a coder, but I know it's possible to do a true/false combo.
>>107393343>taylor bitch >she killed dall-e and civitai>she's finally back for free locally and without lorafucking karma. based xi
>>107393425>You do it's a local llm server you can run offlinemeh, that one can run the instruct model on comfyui directlyhttps://github.com/1038lab/ComfyUI-QwenVL
>>107393451im going to try lewd images of taylor swifted next
>>107393451https://github.com/Azornes/Comfyui-Resolution-Master
>>107393462>meh, that one can run the instruct model on comfyui directlythat one can do, if you go for the "direct" workflow
>>107393474Oh baby, now that's a node.
If the past is to be learned, if a model wasn't released right away, it's never getting open-sourced later on
>>107393503Yeah, node made for retards. People complain about bloat in ComfyUI but this is true bloat.
>>107393461Ikr, feels good when the bad guys lose at the end
>>107393511Yeah, ten nodes to put in some fucking numbers all over the place is super dank. I agree. UX is for fags.
>>107393451you have to use an if-else/conditional custom node, there are dozens such custom nodes but none in comfy core afaik
ZIT's asian bias is pissing me off
>>107393329>https://github.com/Koko-boya/Comfyui-Z-Image-Utilitiesif you want to go for a specific gguf, like Q8, what should you write here? since the huggingface repos usually have all the Q8 quants in therehttps://huggingface.co/mradermacher/Josiefied-Qwen3-8B-abliterated-v1-GGUF/tree/main
>>107393558the endpoint part? just leave it empty.just choose the quant (4 or 8) & it downloads
>>107393556skill issue
>>107393620no for the full bf16 safetensors model I get it, but what about gguf?
yeah... you're not missing much lol >>107391202
>>107393651The right image is way more "slavic" though.
>>107393651> optimized for high-quality text renderinganon...
>>107393651kek
Just woke upHow's zimage base?
>>107393680zimage shits all over ovis wow
>>107393680>Ovis outchink'ed the females compared to z-image turbothat's an achievement
>>107393680does ovis do nipples?
>>107393680for a model specialized on text I find it pretty mid and unnatural desu
https://xcancel.com/bdsqlsz/status/1995455379128410131#myeah, pretty much confirmed that the training of base is over, now we wait for them to release it
>>107393651>>107393680>>107393719Is prompting like an esl the best practice for chinkoid models?
>>107392912ANIME DIFFUSION NEWS ANCHOR!>Noob Models!SeeleNoobAI (2048 native resolution): https://civitai.com/models/1445275/seele-noobai-sdxlChenkin Noob XL:(NoobAI ESP with new dataset of character)https://civitai.com/models/2167995/chenkin-noob-xlWAI Shuffle Noobhttps://civitai.com/models/989367/wai-shuffle-noob>Anime LoRa Making Guide!https://civitai.com/models/22530/guide-make-your-own-loras-easy-and-free>Model News!ZiT Zeta Image Turbo Model: a new 6b model, It's fast, open-source but the main problem is it doesn't understand booru tags.UIs that supports it: Comfy, Krita AI Diffusion, Neo Forge, Swarm, SD Next>Anime ZiT LoRas!:Frieren LoRAhttps://civitai.com/models/2176854/frieren-beyond-journeys-end-sousou-no-frieren-z-image-loraFlat Anime Style:https://civitai.com/models/2175307/z-image-flatanimestyleRa Lilium Style:https://civitai.com/models/2125529/ra-lilium-styleNyalia Style:https://civitai.com/models/2180136/nyalia-styleAnime Flat Style:https://civitai.com/models/1952560/anime-flat-styleALSO ANIME CHARACTER LORA REQUESTS GO HERE!
>>107393757Embarrassing.
>>107393757kys
Penis lora for z-image is outI repeat, penis lora for z-image Is. OUT!!
>>107393757go back
why does some random chink have access to the base already?>>107393324cute and soft girl, plz box
>>107393782nicetime to make brazilian shemales
>>107393751This is true for any model that uses anything other than CLIP for text encoding. You can not reach the full depth and scope of the text encoder not prompting in this manner and if you look at how the models are trained, no one uses tags or keywords because it doesn't carry enough semantic meaning to precisely compose an image and doesn't allow you to actually teach the model said semantics and meaning. This allows you prompt it to actually do complex prompts for images it has never seen before that aren't 1girls which older models fall flat on their face even attempting and missing like half to most of the details.
>>107393790Fucking faggot.This thread is done :) Blame yourself when it happens. Cocksuckers just can't stay at their containment.
>>107393787he's not some random chink, he signed a NDA contract with Alibaba so he's probably working as a freelance or some shithttps://xcancel.com/bdsqlsz/status/1994103556312584685#m
>>107393119>sks woman trigger wordthis absolute faggot has learned nothing
>base z image knows the blackpink members but not jenna ortega
>>107393809
qwen image edit is blowing my mind and i cant wait to test zimage edit what the fuck
>>107393826ok
>>107393329>https://github.com/Koko-boya/Comfyui-Z-Image-Utilities>puts "keep_model_loaded" on off>the model doesn't unload anywayfuck ass node
>>107392912Fuck me adorably, the second girl on the left looks like a porn witch.
>>107393826>:)
>>107393757Based, anime is an important part of this hobby so it's good to have their dedicated anchor
>>107393856
>>107393630You are very skillful, Sir. Haven't seen 1female image like this before.
>>107393119>Eyelash lora>it gives her small titsyup, trained on faces only
https://civitai.com/models/2183666wHY DO THESE FUCKS KEEP TRAINING SHIT LIKE THIS?! LIKE WHO ASKED?
why is this such a hard concept for these models to understand?
>>107393904Wan 2.2 does this no problem.
>>107393890kek
>>107393904Is it?
>>107393918>>107393904i realize the image wasnt very clear. i mean oiled skin.
>>107393929thers an oily skin lora for qwen already.
>>107393886
>>107393890i know i am
>>107393774>>107393786Anime website retard
>>107393971you already have an AI anime thread, get the fuck out
>>107393055Base scores higher than Turbo, but it will be the same size, slower and knock out some of the downsides like seed variety and etc. In the paper, on a certain T2I benchmark, it's self reported that in a row, Base is better than Seedream 3.0 which is better than Turbo. However, most T2I benchmarks show that it is at least tied with Seedream 3.0 if that provides a good enough reference point for the improvement. Again, Turbo is turbo for a reason. Compare A which is an unfinished Base model against D, which is Turbo. They say they use 100 steps for the Base Model to teach Turbo but I think instead of 8, it will just be something like 20-30 steps to get an image. Not the worse thing in the world if you want the slightly improved quality and seed variety.
>>107393986Seems like you are butthurt. Why?
>>107393986NTA but I gen anime and I'm not moving from here.
>>107393986Anine website, keep seething snowflake
>>107393989>sft, dmd, d-dmd, d-dmd+dmdrwhat?
>>107394009gen it all you want, just don't post it
>>107394032(You)
>>107394032Kek, go back to your shithole. I will post anime whenever I want.I repeat againANIME WEBSITE
>>107393757Based, fuck off /adt/ schizos
>>107394009>I gen anime>>107394042>I will post anime whenever I want.no one said anything about not posting anime, but this shit is not meant to be put here >>107393757
>>107393886>>107393940nice, you using a lora?
>>107394058I know the anime news anchor tradition started in the schizo thread, but there's no reason why /ldg/ can't have something like that. I was curious to check out the zit loras and Chenkin's checkpoint is definitely relevant to this general
>>107393757First one for containment anchor!Unpopular opinion: ZiT Zucks!
>>107393757No really a character but Made in Abyss style landscapes/bg/visuals perchance?
I'm getting the urge to post anime
>>107393940>she still won't eye contact me
How come there's no option to select how many tiles? Or does it do it by default if you set the tile size below the image size half width/height?
>>107394116>nooooo you must post in our tranny general nooooo
>>107394019I like the fantasy settings, because that's actually my main gen type. Have you had any luck getting deformity to gen? I haven't tried it with zit yet.Like amputees, of various kinds. above the elbow, below the elbow etc
>>107394104>ZiT Zucks!i don't think most would disagree that IL>zit for anime but once base gets tunes the chances of z replacing sdxl for both realistic and anime are very high
Qwen-Image-2511 when? It's already 2512
>>107394104>ZiT Zucks!I'm ready for it to replace Pony and Illustrious
>>107393918prompt?
https://civitai.com/models/156345our favorite lora is up bros
>>107394164FUCK THIS GUYive blocked him
>>107393119>chloewhy...?
>>107394025I'm not here to spoonfeed you the paper, get AI to do it for you.
>>107394152I'm not ready but I'm expecting it. I just hope you can still use tag-based prompting. It's so predictable and reliable. But can you even use prompt weights with the Qwen encoder? In SDXL I'm used to just jacking up the tag weight if e.g. a style isn't applied strongly enough. It's a big functionality loss if Z doesn't have an equivalent.
I fucking love this model. Almost as good as chroma in certain aspects, better in the other. a properly trained base model will probably become an open source standard for years to come.
>>107394164Thanks
consensus on samplers and schedulers with zit?res_multistep seems to make things noisy.
>>107394209euler simple is all you need
>>107393680>europian
I still don't understand why a 4kx4k image takes up 32gb of vram in comfy while forge does it just fine.
>>107394192I agree with you that tag based prompting is superior, I just hope we can brute force it in the future. I feel like the model never listens to me without tag based prompts.
>>107393852you too? gotta get my hands dirty with lora training. btw where the fuck are the definitive guides?
>>107393903It's just a pretty black woman. chill
>>107394248there isn't one, everybody has their own method and they all think theirs is the best
>>107394252it's 666avoid racemixing!!!
>>107394258>>107394252it isnt even about race fags. its about these imaginary ppl they make.
Is there a functioning UI/ZLUDA implementation for RDNA4 yet? Mate of mine just bought a 9070 and is interested in doing basic SDXL shit with it.I do it on a 6900 XT which works well enough but last I checked there was no HIP SDK for anything past RDNA3 on Windows.
>>107394129Don't u worry buddy, this one will make all the eye contact u crave
zit was made for feet.
>>107394303meant to post this one, sorry
>>107394192Tag-based prompting is a steaming pile of shit that was useful only for single-digit-iq models like sdxl. Tags carry zero positional information. They barely work for 1girling with well-established characters, which you can prompt with a couple of tags. And even for 1girling you can't specify if you want a hair ornament on the right or on the left. Do you want right or left hand on hip? etc, etc... Tags almost completely break apart if you have 2 or more people. You can't say "the girl looks at the boy while the boy looks at the camera"
>>107394192>>107394235
>>107394346yeah, there needs to be a formal description system.like you can't even describe nose type.
>>107394346This, I'm hoping by the end of the year there's a properly VLM/manually captioned Danbooru dataset of some kind. Even if you're just genning coomslop there's a real need for it.
>>107394300
VAE is outdated and needs to be replaced. I see VAE artifacts in every image now.. It's over.. it's so fucking over I can't take it anymore. Replace VAE NOW! We need a next gen replacement NOW!
>>107394346Tags combined with natural language is the way. Booru tags bring at least some degree of standardization and also some concepts are almost impossible to describe with natural language. I'd say NAI's approach is very nice with their latest model, you can specify character relations with tags and target/source system and it's much more dynamic and convenient than regional prompter.It isn't perfect but that's what I'd love to see on local, not pure NL prompting
>>107394362the ambiguity isn't gone.A woman IS sitting on a table.orA woman WILL BE sitting on a table.orA woman once was sitting on a table.orA woman thought for some time, should it come to pass that we should do it, and is it profitable that I, a woman, or he, a man, should be, but it's far from myself a humble woman to consider, so it shall be perhaps one is sitting on a table.
>>107394231Because you are using all these custom nodes. Some of these might not even use cum ui's own memory management but do something on their own etc.
New adapter
>>107394346tags aren't perfect but neither is having to write paragraphs long prompts, tags are information dense and models should be trained on them too
>>107394362One thing anime gooners don't realize is that the stupid tag based prompting will still work even on nlp based models, picrel is ZIM with 1girl, solo, chair, cafe, sitting, coffee, long hair, twintails, red eyes, sundress, indoors cafe, smile, happy,
>>107394432tags can't even prompt amputation effectively.
>>107394429FUCK I JUST FINISHED TRAINING A LORA!!
>>107394392
the world's best gen in the oven.>>107394440(they could have, though)
>>107394444holy quad 4s whatd u train sar?
>>107394397it's called pixel diffuser
>>107394440this single sentence tells me all I need to know about you
>>107394429all of this effort so that in a few days it'll be deprecated since we'll all be using base models
>>107394461Counterpoint, you are so insignificant you can't have needs.
>>107394474you are a weird man
>>107394346My ideal prompting style is tag-based, with the occasional natural language dip for positioning when needed. I also want prompt weights, which I'm not sure yet can be used with Qwen. And the last obstacle which might be a deal breaker for me is style mixing (which often relies on prompt weights, incidentally). I've not seen yet any natural language-based model coming even close to IL in terms of style control. If Z doesn't deliver I'll use it for anime the same way I use current natural language-based models: I create a ControlNet source image with them, then I switch over to IL.
>>107394415>some concepts are almost impossible to describe with natural languageThis doesn't make sense. Booru tags are a subset of natural language. Style tags? Just write "In the style of..." or even shorter "Drawn by..."But booru tags can be fed into VL together with an image and instructed, "Describe this image and use the provided tags as ground truth. They should be incorporated into your description."
>alibaba>releases ovi image model that nobody cares about>while people are literally waiting for wan 2.5the fuck is wrong with china?
>>107394432Use prompt enhancers. They'll rewrite your "1girl, looking at viewer, smiling" into a full paragraph.
>>107394427Comfy doesn't have nodes that does this apparently.
>>107394508no, I don't think I will
>>107394508>1girl, looking at viewer, smilingguys just want one thing, and it's disgusting.
>>107394486I agree, weights are useful, but think they're a feature of CLIP. It's not related to tags or NL. But CLIP is ancient. Nobody uses it in newer models. So, we can say goodbye to weights.
>>107394531I wish a girl looked at me and smiled
>>107394446
>>107394312
>>107394486Style mixing worked fine on Lumina so I doubt it will be a issue, assuming they tag it right during training.
>>107394539see?
>>107394535I wonder if the old LLM prompt adherence tricks are also applicable to image gen prompts. Like "1girl, large breasts (this is very important to my career)"
>>107394429welp, time to retrain (and then retrain in just 2 more weeks when base comes out)
>z can generate nasty ass foot content just fine>ask it for superior armpit content, with some chef's kiss stubble>it only generates baby smooth skin or full on armpit hairchiiiiiiiiiiiinks
>>107394569I had a very hard time steering style in Neta Lumina, but it was likely severely undertrained. I haven't gotten the chance to try Neta Yume though.
This is if you prompt the enhancer system prompt.
>>107394592armpit stubble is pretty niche tag even on boorus, sadly
>>107394619I've had illustrious generate armpit stubble more readily and accurately than generating pubic stubble. I couldn't get z to gen pubic stubble either.
>>107394497>alibaba>releases zit model that nobody expected>while people are literally waiting for wan 2.5Survivorship bias
>>107394592When will we finally get generative models that can actually average and generate things between two extremes which the model has seen in training or extrapolate and generate things more extreme that is has seen?
>>107394617he be doin mane the lify men
>>107394628Uhh that's weird, pubic stubble always worked for me on noob, but never the armpit stubble
>>107394514I don't think you know what I'm talking about, techlet.
comfy pls
>>107394180>chloe>why...?Is that you Pablo ?
If I denoise a 0.50 by 0.50 do I get 1 or am I retarded?
>>107394717no... NOOOOOOOOOOOOOO
>>107394723Pablo Sanchez?
>>107394729yes
>>107394642If you can't doin mane the lify men, don't do the crime
>>107394717I noticed immediately that the power lora loader is fucked, so I turned on 'legacy' nodes
>>107394717feeling uncomfortable yet?
>>107394732
It feels like zit has extremely low variation between seeds and the same prompt. Like sometimes it barely changes the poses and angles. Is this expected behavior or some config issue?
so no base?
>masterpiece, best quality, ultra-detailed, absurdres, intricate details, ultra high resolution, 8k, 4k, HDR, UHD, professional photography, sharp focus, extremely detailed, realistic, photorealistic, photorealism, hyperrealistic, hyperrealism, cinematic lighting, studio lighting, soft lighting, volumetric lighting, perfect lighting, award winning, finely detailed, high quality, ultra quality, extremely delicate and beautiful, stunningly beautiful, breathtaking, magnificent, spectacular, remarkable, fascinating, incredible, gorgeous, elegant, exquisite, flawless, perfect, immaculate, pristine, ultra clean, crisp, crystal clear, ultra sharp, razor sharp, tack sharp, highly detailed skin, detailed skin texture, realistic skin, pore level detail, subsurface scattering, smooth skin, no artifacts, clean render, perfect anatomy, ideal proportions, depth of field, bokeh, film grain, f/1.8, lens flare, chromatic aberration, color graded, post-processing, tone mapping, ray tracing, global illumination, god rays, dramatic atmosphere, moody, epic, majestic, sublime, transcendent, divine beauty, absolute perfection, ultimate quality, pinnacle of art, artistic genius, visually stunning, jaw dropping, mind blowing, awe inspiring, revolutionary, groundbreaking, legendary, iconic, timeless masterpiecethoughts?
>>107394759What did he see?
>>107394743How do I turn that on?
>>107394806down syndrome
>>107394806>generates 1girl sameface
>>107394397Lodestone will save you
>>107394806>>masterpiece, best quality, ultra-detailed, absurdres, intricate details, ultra high resolution, 8k, 4k, HDR, UHD, professional photography, sharp focus, extremely detailed0/10
>>107394814a white woman
>>107394802That makes me confident, since it means they really want to get it right and as small as they can.They could have done a quick release since the Alibaba teams have tons of compute at their disposal, but they are cleary finetuning this to perfection.
>>107394814You don't want to know
>>107394806>>masterpiece, best quality, ultra-detailed, absurdres, intricate details, ultra high resolution, 8k, 4k, HDR, UHD, professional photography, sharp focus, extremely detailed, realistic, photorealistic, photorealism, hyperrealistic, hyperrealism, cinematic lighting, studio lighting, soft lighting, volumetric lighting, perfect lighting, award winning, finely detailed, high quality, ultra quality, extremely delicate and beautiful, stunningly beautiful, breathtaking, magnificent, spectacular, remarkable, fascinating, incredible, gorgeous, elegant, exquisite, flawless, perfect, immaculate, pristine, ultra clean, crisp, crystal clear, ultra sharp, razor sharp, tack sharp, highly detailed skin, detailed skin texture, realistic skin, pore level detail, subsurface scattering, smooth skin, no artifacts, clean render, perfect anatomy, ideal proportions, depth of field, bokeh, film grain, f/1.8, lens flare, chromatic aberration, color graded, post-processing, tone mapping, ray tracing, global illumination, god rays, dramatic atmosphere, moody, epic, majestic, sublime, transcendent, divine beauty, absolute perfection, ultimate quality, pinnacle of art, artistic genius, visually stunning, jaw dropping, mind blowing, awe inspiring, revolutionary, groundbreaking, legendary, iconic, timeless masterpieceit worked
>>107394862>the final feel
i asked grok to give me a long list of every racial slur it could think off, and then generated a picture using it as the prompt
Another wonderful day of not being a comfy cuck
>>107394935What UI are you using?
>>107394862This actually made my gens look better...
>>107394946Not comfy lol
>>107394935hence no gen
>>107394954I'm just getting into this thing and Comfy looks too complicated. What do you recommend?
>>107394946Neoforge
>>107394946The truth is I don't gen, I'm just sour grapes because I'm too tarded to use comfy
>>107394963>>107394967Jokes aside, yeah don't start with comfy. Forge is easier for beginners.
>>107394235>>107394192>>107394346Z Image was trained on tags also
>>107394946the inference script provided on the huggingface repo for the checkpoint
>>107393757I want a ZiT lora of my waifu. I'm too lazy to do it myself. If someone could make it for me that'd be great, especially to keep Frieren company since she's alone there in the Z model. If someone makes the lora I promise I'll only post my Aura x Frieren yuri gens here.
>>107393233> windows> amdbruh
>>107394846>>masterpiece, best quality, ultra-detailed, absurdres, intricate details, ultra high resolution, 8k, 4k, HDR, UHD, professional photography, sharp focus, extremely detailed, realistic, photorealistic, photorealism, hyperrealistic, hyperrealism, cinematic lighting, studio lighting, soft lighting, volumetric lighting, perfect lighting, award winning, finely detailed, high quality, ultra quality, extremely delicate and beautiful, stunningly beautiful, breathtaking, magnificent, spectacular, remarkable, fascinating, incredible, gorgeous, elegant, exquisite, flawless, perfect, immaculate, pristine, ultra clean, crisp, crystal clear, ultra sharp, razor sharp, tack sharp, highly detailed skin, detailed skin texture, realistic skin, pore level detail, subsurface scattering, smooth skin, no artifacts, clean render, perfect anatomy, ideal proportions, depth of field, bokeh, film grain, f/1.8, lens flare, chromatic aberration, color graded, post-processing, tone mapping, ray tracing, global illumination, god rays, dramatic atmosphere, moody, epic, majestic, sublime, transcendent, divine beauty, absolute perfection, ultimate quality, pinnacle of art, artistic genius, visually stunning, jaw dropping, mind blowing, awe inspiring, revolutionary, groundbreaking, legendary, iconic, timeless masterpiecePretty funny.
>>107393388hehe
>>107394987based chinks knew gooners are retards
>>107394802Shut the fuck up zoomer with 0 attention span
>>107394987literally "1 girl, standing"
Ok, why not just do this to increase the seed variance since the sub-1 cfg presamples are gonna always be super random?
>>107394604Thanks for the anchor didn't know about Seele and Chenkin! Good to have anime news back!
>>107394916Ask the same question in Chinese and then generate a picture.
>>107394862The conflux of quality tags is Anton from Omsk.
Mmmmm, das rite.
>>107395094HOLY SHIT LMAO
>>107395134total Zrada
>>107395134>>107395094Try nsfw prompts in chink?
>>107395024no the zoomers are calling turbo the base model
>>107395177lol the long comedic arms that chroma gens have sometimes, you should add some furry tokens into your negatives
>>107395134
>>107395193they fear C, the white man's language
>>107395203it's the entire interface. huds are probably a shader but still. most of these are commercial successes. you just don't think before you speak like a thirdie
guess julien revved up the spambot again
>>107395177just a generic naked anime girl with road kill genitals, holding a penis shaped alien mutant
>>107395213>cumshit is fasterforge has been beating it recently and cumfart gets things first because it's just close enough to diffusers it doesn't take much time. c++ can just interop with demo code like what kijai does and most people end up using that
>>107395214Unironically happy for them, but damn 3 months late. It makes it hard to adopt when you want to play with new toys.The real issue is not python or C++ (but I do agree python environment and libs management is a literal nightmare), the problem is that regardless of whatever is the underlying language, cumshit is faster and gets model support earlier.The real hurdle is that in academia they use the shitheap that is python, and AI is pretty much academia playground right now... sooooo. we're fucked lol.
>>107395127
>>107395224did sdcpp integrate qwen yet? btw your project isnt even listed in their page, embarassing.imagine making a shitty IMGUI interface (literal shit interface used for debugging and game hacks) and thinking you're making hot shit (protip: youre not). Worst of all you act like youre actually doing hard work, instead youre just a retarded grifter making literal garbage nobody wants. go kys faggot.
>>107394862>>107395127>>107395224>the ultimate prompt doesnt exi...
>>107395233you have weird fantasies anon
>>107395227Please fix your shitty llm interface. You are too stupid for this.
>>107395243of course, it's the only base model that can do NSFW, of course the coomers will shill this shit
>>107395213wonder what pissed him off this time
bro your reply position is like one or two off lmao. fix your bot
>>107395255chroma tards woke up early i see.
nobody is ever going to need more than Chroma Z
>>107395258Put CFG at 3 or higher and see what happens.
>>107395263try adding realistic or photorealistic to it? otherwise maybe find a lora to assist
>>107394559A more perfect gen never existed.
>>107395275its not much slower since the bottleneck is vram, and swapping to ddr4 vs 5 isnt that much of a difference
>>107395275(guru is supposed to be gpu)>>107395263that guy hangs out with JEWS
>>107395283ai has been happening and completely murdered GPU prices for the last 4 years already (even more if you consider crypto came before it), why the sudden change?
>>107394992>I want a ZiT lora of my waifu.when we'll get the edit model we won't need character loras anymore lol
>>107395288yeah I'm looking into that right now. I needed more ram anyways running my txt2img clientunfortunately the best I can do is another 16 gigs for 48 total because the model I currently have is out of stock everywhere and it's 4 times the price to upgrade to new 2x32
Mona Lisa is always overcooked bro, on every models she can't take a break
>>107395263> Chroma i would not expect, lodestone has proven he's unable to cookradiance status btw?
>>107395300Trust me bro, local real time interactive gaussian splatting VR with voice commands is coming any day now
>>107395302it's all right for anime, completly useless for realistic humans, the likeliness is not here and the skin is plastic
I think this thread sucks.
>>107395304f wasted compute and americans water evaporated for cooling
>>107395319U mad?
>>107395094trying异性肛交uhit's an abomination (canceled)trying阴道插入...
>>107395319Even if we conceivably manage to generalize brushstroke and lineart look, composition, proportions and palette, there is still a ton of 'indescribable' details in each artist style which will slip through the cracks.
>>107395328if you like the motion, do you reconnect to 2nd sampler and keep the same seed and it starts the low pass directly i assume?
>>107395331this is the local general, retard
>>107395300didnt u just post this on leddit
>>107395336love this gen. did that work with base wan? more? girl can be older to not bother people, i just like the concept
>>107395328Nwe use closed loop
>>107395348it isn't insurmountable and you're just looking at the simple bits anyhowthis is the simplified structure how stuff works. learn as much as you need to. or use an even simpler thing with less capabilities if that makes you happier.
>>107395354how, do you need extra nodes for that?
>>107395359tf?!
>>107395366NTA but I'm also trying to set up wan on a 12gb 4070 and I must be retarded because I can't find these Q6/5/4/whatever models everybody is talking about anywhere
>>107395348I found this on leddit lol>>107395366it's the schizo bot that got activated now, it gives you random posts from before, don't engage
>>107395379comfytroons do have a meltdown if anyone criticizes their shit UI, so who knows
>>107395379finally a good gen
>>107395382nothing I said relates to VRAM
princess mandarininstead of peachbecause mandarin is also a fruit but it's... never mind.
>>107395382ffs that thing is back huh
>>107395388but how would you query for a very specific artist style without mentioning the artist himself? like check the 3 netayume 2girls that were posted, what would the prompt-fu required to do it?
>>107395352it may be pretty random what you get.Not sure what's happening here. my own emphasis isn't porn, but occasional nudity means Gemini is pointless.>wanI probably should into use it, but I haven't yet. I have downloaded qwen but not used it yet either.zit is a big deal because it's so fast.
>>107395391artist styles are helpful to have and removing them damages the model.however, we do need to move beyond using them as a proxy for style control and add more granular instruction following.
>>107395379Cute!!!
>>107395392there's no females in there, or else you would have seen some naked guys cringe shit, we only have 1girls (thank god for that)
>>107395397>tbf it's all COPE, why do all that work in old architectures?I guess it's less expensive to modify a base model than make a new one from scratch, but yeah it's definitely cope
Don't forget to thank our hero
>>107395391where is the pee?!!?
>>107395403Huh? What tool? You don't mean lora, per chance, because loras fuck up model's ability to resolve non-lora related details in a way no gemma-as-te will ever fix. So they absolutely doesn't apply to rouwei.
>>107395386I know this is the bot but god damn so true
>>107395413ok I tried distorch with q8 and somehow it solves the blockiness movement with q6 (e.g. a patch of the skin texture stays at a fixed location and then suddenly jumps to another fixed location). not sure why but I guess it's q8 for me now
>>107394987Based!
>>107395416come on bro, if animate my gens, reach higher
>>107395420From Q8 not that much. I only noticed much difference on the full model when I ran it directly on runpod on a h100 to test speeds.
>>107395425nevermind. there's no input for extra_args on the node. ended up doing a seperate comfy install
>>107395413wouldn't the best team be deepmind? why lie?
>>107395361I meant cooling
>>107395438qwen edit protip: use "keep his expression the same" if you want the same face. still good though.
>>107395440it should not take 10 mins for 4 steps and with lightx2v lora, that sounds like it's using CPU and not GPUuse the fp8 model or try q8 with multigpu and adjust the virtual vram option (shouldn't be needed but can use it)
Feels bad taking first 75 gens with no cherry-picking to do a proper fair model comparison. I have to look at the other gens I made after, some very nice, and they can't go in the image... it goes against all my instincts to make an image as beautiful as possible, even an infographic. But it would harm the deeper beauty which is that the image is true...ANyway here is a cute Qwen Image gen which can't be included in the infographic because not in the first 75
>>107395134Now do 'jew'
>/adt/ troons are mad>botterinos are releasedReally makes you think, huh
>>107395450thats exactly what im using with a 4080, 10mins + for a 10 second clip
>>107395382still slays me she trooned out rather than just date an incel.
>julien is mad>botterinos are releasedhuh
>>107395451Stop shilling this shit kudasai
>>107395453doesn't take long if you use the 4step lora.. takes like 1 - 2 minutes for me
>>107395379Nice colors, thanks for posting cute anime girls.
Oh look the spammer is back. Cool, cool.
>>107395458Are you using it with the light lora or is it naked qwen edit?
>>107395464why dont you use the rg3 lora loader? its so much cleaner
>>107395472yeah but i got plenty of cash so it all evens out
>>107395475The spammer no, the /adt/ spammer
>>107395413>and now with our company we are putting the main bulk of our efforts in to censoring the models and prevent any subsequent attempts at adding anything we censored back into the models...
>>107395475Good thing we can use run it locally still with comfyui api nodes.
>>107395485Why wouldn't you do 2.2 first? Who the fuck uses 2.1 anymore?
>>107395487If you build it, I will coom
ZIT shift vs stepshttps://files.catbox.moe/9ud8da.png
>>107395485What's the difference? Looks like the same guy to me.
>>107395505You need to put on your big boy pants and nut up for a GPU that doesn't belong in the flintstones era, anon.
new>>107395519>>107395519>>107395519>>107395519
>>107395413that's the man who invented the latent space? BASED
>>107394429>I added a v2 of the z-image-turbo training adapter. It is 2x as big and has been trained for a significantly longer time. With these adapters, it is a balance of getting the max amount of de-distillation without diverging from the base too much so the LoRAs maintain maximum compatibility. So I want to test this a bit more before setting it as the default. But if you want to test, just chanve the v1 to v2 in the config, and please report back if you find it works better / worse than the v1 adapter.https://x.com/ostrisai/status/1995504226295009558
>>107395450> cutethat's a troon bruhare you gay or what
>>107394814My theory is that she sharted in his face when she walked by
>>107395391catbox? all my diapers ended up drawn as panties. or do you already have a lora?
>>107395551why would you thank a tool of demons???
>>107394132Yes it uses as many tiles as it needs to cover the whole image.