Discussion and Development of Local Image, Video, and Music ModelsPrevious: >>109188075https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUISDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineageWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, & Upscalershttps://huggingface.co/modelshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.info>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/tdrussell/diffusion-pipehttps://github.com/kohya-ss/sd-scriptshttps://github.com/kohya-ss/musubi-tuner>Krea 2https://huggingface.co/krea/Krea-2-Rawhttps://huggingface.co/krea/Krea-2-Turbo>Zhttps://huggingface.co/Tongyi-MAI/Z-Image>Animahttps://huggingface.co/circlestone-labs/Animahttps://tagexplorer.github.io/https://animadex.net>Qwenhttps://huggingface.co/collections/Qwen/qwen-image>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>LTX-2.3https://huggingface.co/collections/Lightricks/ltx-23>Wanhttps://github.com/Wan-Video/Wan2.2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkCollage: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
>inb4 n*gbo ban evading yet again
Blessed thread of frenship
>mfw Resource news07/03/2026>Multi-Resolution Flow Matching: Training-Free Diffusion Acceleration via Staged Samplinghttps://github.com/Xingyu-Zheng/MrFlow>DiffRGD: An Inference-Time Diffusion Guidance Through Riemannian Gradient Descenthttps://diffrgd.github.io>Representation Distribution Matching for One-Step Visual Generationhttps://alan-lanfeng.github.io/rdm>SAB-LVLM: Significance-Aware Binarization for Large Vision-Language Modelshttps://github.com/LyuQi127/SAB_LVLM>Style-CCL: Content-Preserving Style Transfer via Curriculum Continual Learninghttps://github.com/witcherofresearch/Qwen-Image-Style-Transferhttps://github.com/Tele-AI/TeleStyle>ByteDance-Seed / PARhttps://huggingface.co/ByteDance-Seed/PAR07/02/2026>PAPA: Online Personalized Active Preference Alignmenthttps://github.com/NasikNafi/papa>Condensing Large-Scale Datasets Directly with Minimal Information Losshttps://github.com/LINs-lab/CIM>VisReason: A Large-Scale Dataset for Visual Chain-of-Thought Reasoninghttps://y-research-sbu.github.io/VisReason>Asset Generator for 2D & 3D: Blender add-on that generates assets from text prompts https://github.com/tin2tin/Asset_Generator-2D-3D>ComfyUI-TrixLoader: All-in-One Image Loader, Editor, and Resizer node for ComfyUIhttps://github.com/trx7111/ComfyUI-TrixLoader07/01/2026>Elastic Diffusion Transformer: Accelerating SOTA generation modelshttps://github.com/wangjiangshan0725/Elastic-DiT>Boogu-Image-0.1-Edit-Turbohttps://huggingface.co/Boogu/Boogu-Image-0.1-Edit-Turbo>GEAR: Guided End-to-End AutoRegression for Image Synthesishttps://github.com/Tencent-Hunyuan/GEAR>SpheRoPE: Zero-Shot Optimization-Free 360 Panorama Generation with Spherical RoPEhttps://orhir.github.io/SpheRoPE>ADAPT: Attention Dynamics Alignment with Preference Tuning for Faithful MLLMshttps://github.com/yao-ustc/ADAPT>Phase-Aligned RoPE for Mixed-Resolution Diffusion Transformerhttps://hao-yu-wu.github.io/mixed_res
KREA 2 > Z IMAGE TURBO
>mfw Research news07/03/2026>OrbitQuant: Data-Agnostic Quantization for Image and Video Diffusion Transformershttps://arxiv.org/abs/2607.02461>Anti-Prompt: Image Protection against Text-Guided Image-to-Video Generationhttps://arxiv.org/abs/2607.01499>NeoMap: Training-free Novel-View Synthesis from Single Images and Videoshttps://arxiv.org/abs/2607.01962>PixGS: Pixel-Space Diffusion for Direct 3D Gaussian Splat Generationhttps://arxiv.org/abs/2607.01803>DetailAnywhere: Fashion Detail Generation via Cross-Modal Feature Alignment Distillationhttps://arxiv.org/abs/2607.02220>ICDepth: Taming Video Diffusion Models for Video Depth Estimation via In-Context Conditioninghttps://xuanhuahe.github.io/ICDepth>Alignment Is All You Need For X-to-4D Generationhttps://arxiv.org/abs/2607.02516>QWERTY: Training-Free Motion Control via Query-Warped Video Diffusion Transformershttps://arxiv.org/abs/2607.01869>DisciplineGen-1M: A Large-Scale Dataset for Multidisciplinary Visual Generation and Editinghttps://arxiv.org/abs/2607.02290>MIBE: Multi-subject Interaction Benchmark and Evaluator for Personalized Image Generationhttps://arxiv.org/abs/2607.01383>Optimizing Visual Generative Models via Distribution-wise Rewardshttps://arxiv.org/abs/2607.02291>Wavelet-Guided Semantic Signal Compensation for Inversion-Free Image Editinghttps://arxiv.org/abs/2607.02421>Efficient PEFT Methods with Adaptive Checkpointing for Vision Models and VLMs on Resource Constrained Consumer-GPUshttps://arxiv.org/abs/2607.02158>From SRA to Self-Flow: Data Augmentation or Self-Supervision?https://arxiv.org/abs/2607.02508>KathaTrace: Diagnosing Semantic Trajectory Collapse in Generated Visual Narrativeshttps://arxiv.org/abs/2607.01312>LASER: A Corrective Lens for LVLMs via Visual Attention Preservation and Sink Suppressionhttps://arxiv.org/abs/2607.01707
>>109192143>>109192148Fuck off off topic spammer
>>109192148>>109192143thanks!
>>109191553>The datasets are being made public BWAHAHAHAHAHAHAHAH we all fucking wish that would happen but there's no way.
is 50-70 images a overkill for character likeness training for krea2 and z image base?
>>109192156my pleasure :)
>>109192160no
>>109192168Crazy how proper upscaling is way too hard for you after 4 years lol
>>109191443funny init
>>109192178mama
Whoever said use anima as first pass for krea2 you might be on to something.
>>109192178do you have a prompt for this beautiful jungle worrior? also are you using the regular krea2 turbo model or a mixed tuned model?
is debo ban evading again
I just saw this a krea 2 controlnet for depthhttps://huggingface.co/Patil/Krea-2-depth-controlnetrequires depthanything2 https://github.com/kijai/ComfyUI-DepthAnythingV2for the retarded it needs to be placed in.ComfyUI/models/depthanything/we are eating so good
>>109192160No, but they should all be different. You should remove those redundant images in your dataset (same outfit, similar pose/composition).
>>109192160no such thing. the more the better, assuming they are all varied.
>>109192192can you share the workflow for it?
>>109192178what sampler/scheduler are you using?
you cant make me use two different models for a single output i wont do it
krea users in with their astonishing deformities.It makes sense that anti-racism is so popular. If you can't even comprehend anatomical mass, how can you comprehend political mass?
>>109192221Euler/Simple.
>>109192210nice find
>>109192210nice thanks
>>109192210ah wait a second i don't know how the fuck this works, i don't think the node can see it hmm.
>>109192238ty>>109192216https://files.catbox.moe/vkr3yc.pngsorry it's messy.
>>109192192You could've made this without using Anima? I don't get it.
>>109192210>>109192244ah i should have read the setup, it seem its need some node to be installed.
cozy breas
>>109192212>>109192215>>109192171thanks anons for the advice.
>>109192247thanks, hope comfy will unload anima before loading krea
>>109192254Show us, then.
>>109192258i dont think this works with comfyui
>>109192192Anima knows anatomy.
>>109192269Show you what, a basic goth thot taking a selfie with saggy tits? Seriously?
>>109192240>>109192239https://github.com/Tanmaypatil123/Krea-2-controlneti don't think its for comfyui at this stage what a shame...
Why does Krea start to fry really bad anytime the prompt includes a person and is for a photograph or real image
>>109192272yeah, someone needs to make it work with depthanything, i wonder if simply adding it to the directory and changing the code to include it would work some how? They say you need depthanything2
>>109192279Sounds like you can't. :(
>>109192302maybe ask claude to built a custom node to do it. im busy at the moment
>>109192308You're right. I can't be assed to waste my time on trivial 1girl gens.
>>109192266I can fit both at the same time. if it doesn't there's a custom node that exists to force unloading. idk if it's worth doing if you have to unload tho.>>109192254Anima is uncensored by default, has a lot more seed variation and has booru knowledge. Basically I find krea2 to be quite boring in its output compared to Anima so this way you get the best of both worlds. Animas NSFW knowledge and creativity + Krea2 rendering.
I assume 12GB VRAM and 32GB RAM is too weak to train a Krea2 lora right?
>>109192335i've seen a few anons claim you can
>>109192312I'm being retarded anon, it would need to be loaded with the actual controlnet nodes, the depthmap can be generated from anything but this file is what controls the conditioning from what i gather.
is it possible to do image editing like face or background swapping with local models like grok does or is that still needing manual inpainting
>>109192291>cringe inducing. idk. visceral. I could inspect to find out why.
>>109192320I'll find you a good example.
>>109192335it'll be slow due to ram offloading
>>109192353The right leg is too long.
>>109192198Can you do her but in the same post and outfit and body as >>109192178
>>109192160No. it can work ok with half the images but this is fine. Just try it.Don't hesitate to drop photos that seem to cause issues tho, you have plenty.
>>109192363You are right, it is
>>109192160i suddenly want this lora
>>109192288Pls anon halp
>>109192210the guy who made it left a post on reddithttps://www.reddit.com/r/StableDiffusion/comments/1uia5v4/krea2_control_net_pose_or_depth/
txt2img slop is boring, I want a Flux Klein level DranGAN
>>109192348there are a bunch of different ways to do it but you'll need to set up a workflow that uses reference images and stuff
>>109192387Is that sydney sweeney lora with strength turned down?
>>109192165catbox?
>>109192401he's a faggot but he's right, it's been a while we haven't gotten a new decent edit model>DranGANwhat's that?
Working on Chroma-Krea wf optimization. The RAW model is really good at rendering humans with the distill lora and filter bypass. It's also very diverse like OG Chroma was. The only issue is that depending on the output, the limbs/fingers may be slightly inaccurate and in wrong location, I guess that's the cost of diversity, or I could test lowering or increasing weights to see where it goes. I'm very close. Perhaps a third pass through regular Turbo would fix that.
>>109192414https://dragganai.io/
>>109192415jesus wtf is that
>>109192407I did put Sydney Sweeney in the prompt but Krea doesn't seem to know her very well
>>109192404so i need a shitload of reference pictures for the person i want to faceswap in? i wonder how the big cloud models make do without it
>>109192415There also is one extra knob tweaked to improve realism, Krea2NegPip with (illustration:-1.2) to avoid anime/drawings since the RAW model doesn't naturally do it, works to some extent, but needs improvement
>>109192387If she tried to touch me I would break out my tailor's tape measure from my murse and give her a proper dressing down.
>>109192424In that case the Turbo LoRA was turned down, so output was a little messy, but increasing its weight to 1 it becomes much more coherent. The expressiveness on her face is from the filter bypass.
>>109192414>>109192423neat
>>109192462it's just a bundle of bad loras.
>>109192429Tifa's fine too.
So I decided to try the int8 convrot for krea 2 and it is actually much slower (30 seconds on a gen that will take 70) 16 GB 4060 Ti + 64 GB Ram, I don't understand it was supposed to be faster for any modern architecture not only Ampere. Any clue? I am not using custom nodes, just the official convrot model. Is comfy's implementation wrong again?
>>109192462nice
>>109192488desu i thought convrot was poorfag only im still using fp8 scaled
man, those are shit feet.
>>109192491no. absolutely not. horrid
>>109192478Setup generally works, just needs a bit more tweaking to improve fine details. Maybe the filter bypass isn't flawless and it's generally better without, will test that next
>>109192497mxfp8 here. pretty sure they mention what quant you should use in the repo.
>>109192478>>109192511>>109192521post something better (you wont)
>>109192488I wanted to use NVFP4 but I think only turbo has that available right now
>>109192523This one, right?https://huggingface.co/Winnougan/Krea-2-Base-Turbo-NVFP4-FP8-INT8It looks like FP8 is the best for me
>>109192571Yeah that's the one.
>>109192439no no, you just need one reference image. getting the workflow you want set up is the heavy lift but the input images are straight forward
>>109192530:^) I'm genning music with Ace Step 1.5 XL SFT.
>>1091925228 or 9 steps seem optimal, past it and it will hallucinate details. Filter bypass I don't think was doing much, skin stays realistic without it (though it clearly helps in the case it's a single generation through Krea as opposed to a 2 pass workflow).
>>109192488I have the 4060 Ti 16GB too and int8-convrot is faster than fp8 for meis your ComfyUI up to date?
>>109192694people are afraid of updating comfyui because they might add a feature they don't use
:^) guess who actually owns 4chan
>>109192719Ah yes it's definitely not because you were derailing the thread with an off-topic discussion...
>>1091927194chan is a honeypot
>>109192373i going to try and trim it down to 60, captioning is going to be repetitive but I'm getting good at it. >>109192389i will share the gofile of it when its finished, she's a DNP so posting her huggingface is a high risk. going to take awhile.
>>109192744shit caption desu
>>109192738krea is shit not honey
>>109192768It's not shit, I like it
>>109192744why the fuck are you using booru style captioning for newer models? dude, use joy caption or some LLM. those tags are useless
>>109192768I like shit, the steamier the better
>>109192634Unlike OG version of the workflow, not a lot of issues left after 2nd pass. Initial Chroma gen still plays a bit of role in final output though, E.G. it should be close enough but not entirely inaccurate with things such as feet size and positioning etc... Though higher denoise values help, I tend to stick to around 0.5-0.65 for best results. At 0.65 it generalizes and doesn't break the realism style. It cleans up background really well. No stack of LoRAs will ever compare to doing this a 2 pass with Chroma. Similarly with anime, no stack of LoRAs compare to a 2 pass with Anima. The base models being passed are too strong at what they do, and only full finetunes could ever hope to compare. Krea 2 is very good at approximating styles with img2img, and its RAW model is even better at it than Turbo, which is why all this works. Hence, the wf is very useful for realism NSFW.
>>109192744why the fuck are you using ugly women? dude, use an attractive girl or something. those bitches are useless
>>109192694Yes. I updated to test convrot. I don't know what can it be. I am using Krea 2 turbo + 4-5 loras. ComfyUI 0.27.0ComfyUI_frontend v1.45.20Templates v0.11.1DiscordComfyOrgrgthree-comfy v1.0.2606200020EasyUse v1.3.7ComfyUI-Manager V3.41
>>109192719>makes an off topic post>gets warned because of the off topic post"huhhh??? I dun get ittt, I didun du nuthin"
>>109192786post wf?
>>109192792>ugly women?nta but I thought she was pretty hot.
>>109192786I too get that stretched artifact on the bottom some times, like the resolution is wrong or something
>>109192799Get your eyes checked then
>>109192497int8 convrot is higher quality than fp8 scaled. comfy said the default comfy repo will only include convrot in the future
>>109192793>I am using Krea 2 turbo + 4-5 loras.okay, what about the speed with no loras?
>>109192794He objectively didn't say "kys kys". Run md5 on the string.
base krea doesn't know what the inside of a pussy looks like.
>>109192822What's ideo do if you dig a hole there?
>>109192818kek, good point
>>109192822neither do you
>>109192799she looks like mrs potato head that got hit with down syndrome
I can tell this retard used Qwen Image Edit kek
>>109192851I'm eternally grateful for the fact I don't need to see this sort of excrement because I don't use linkedin.
>>109192860Don't need to use it if you're unemployed
>>109192851Don't be silly anon, it's clear that he had a great co-working day!
>>109192868Your post hit me like a physical blow.
>>109192841>>109192807Ok? more fun for me?
>>109192816Slow, the loras are not related.
>>109192851Do wagecucks enjoy humiliating themselves?
>>109192896no... anon we're making FUN of (You)
Training a real person character lora on Anima Base and then running it at like 0.8 strength on top of the official Turbo lora at around 0.7 strength gives similarly good results to training one on Chroma HD and then running it on the Chroma Flash checkpoint at 0.8 strength
>>109192177Pedo board
>>109192902Spooky how she looks like the ex I was dating in college. I wish she stayed this cute though, she became an alcoholic junkie and aged like shit.
>>109192922Did the blog factory explode
>>109192917>>109192177normies need to be gate kept. stay on reddit. you never belonged here
>>109192922You could have saved her
>>109192830kek >>109192808In your dream?
>>109192632
>>109192935>You could have saved herShe got empowered by feminism, 3 months of dating her was depressing enough.
>>109192910idk man, it's kinda your loss if you don't find someone attractive.
>>109192941https://www.reddit.com/r/comfyui/comments/1uk6q5m/comfyui_v0270_now_officially_supports_convrot/>It has better quality than fp8 while generally being faster.>It has better quality than fp8 while generally being faster.>We will likely be updating our default templates to use the convrot int8 models by default because they give better quality and performance for most people.>We will likely be updating our default templates to use the convrot int8 models by default because they give better quality and performance for most people.
https://github.com/Comfy-Org/ComfyUI/issues/14735>Finally, the Comfy community currently only integrates ConvRot W8A8. I want to emphasize that the real advantage of ConvRot lies in W4A4. I look forward to the Comfy community using ConvRot W4A4 to provide users with an even more outstanding performance experience.It seems we have to look forward to INT4, which will mean even greater speeds
>>109192781>>109192759what? the booru style captioning worked well with my previous loras for z image base and krea 2. i find it better and more precise than a long ass continuous run off sentence structure and paragraph structure.
>>109192955All forms of fp8 are dogshit How does convrot int8 compare to Q8_0 GGUF?
>>109193014>than a long ass continuous run off sentencedid you mean run-on?just don't write run-on sentences you fucking retard
>>109193018>How does convrot int8 compare to Q8_0 GGUF?https://github.com/BobJohnson24/ComfyUI-INT8-Fast/blob/main/Metrics.md
>>109193018>How does convrot int8 compare to Q8_0 GGUF?They're supposed to be basically the same.
>>109192178Unreal
>>109193018q8 gguf has slightly better quality in some circumstances but int8 is faster, but take my word with a grain of salt and compare/contrast yourself if you are interested. I personally didnt find much of any difference between q8 and int8
>>109193030>>109193029>>109193038Ok cool q8_0 is still better and remains the greatest quantisation format of all time. I can't imagine how you would fundamentally do better so it makes sense thanks anons
>>109193051did you skip the part about convrot being faster and providing nearly identical quality. that makes it better
>>109192955Didn’t know comfyfags were plebbitors
when int4-convrot rolls out, gguf will be obsolete.
>>109193064>not keeping up with ai tech on various platformsimagine /g/ being your only source of news
>>109192797WFhttps://files.catbox.moe/u8t01a.pngYou need to grab models from hereINT8 Convrot Chroma 1 HD Flashhttps://gofile.io/d/QlXI2iT5 Text encoderhttps://gofile.io/d/l93BuNDownside so far is it doesn't work well at 2k. Chroma 1 Flash also suffers from a strange issue with counting females (which the HD version doesn't suffer from) It's one area that Lodestone could've improved for the Flash version, but I don't see any newer Flash models for Radiance, 2K, etc...The benefits this wf are immense, same as last time, all the bad details from the Chroma gens are gone, nonsensical backgrounds, extra fingers, etc...Beforehttps://files.catbox.moe/y7p7mk.pngAfter>>109192786Beforehttps://files.catbox.moe/yc8wf6.pngAfter (Pic rel)Beforehttps://files.catbox.moe/6s8m05.pngAfterhttps://files.catbox.moe/ewble1.pngAnd of course the model can do much more, and NSFW is here at full capacity.
ok we pull
>>109193051it's a bit worse than Q8, but it's 2x faster, there's no way I'm gonna ignore that
>>109193051Q quants lose speed with loras
mxfp8 > convrot Cuda is especially tuned for it. >muh speed only poorfags with old hardware want dogshit lower quality models
>>109192210Anima bross? Why they aren't doing controlnet for us!?!?!
>>109193075>2x faster*On 30xx cards. Why do people keep leaving this out? iIt's objectively worse on 40 and 50 cards
>>109193060I'm gonna need to do my own tests, the cosine similarity of only 0.98xxx for Q8_0 GGUF sounds low >>109193075>2x faster than Q8_0Oh I see it's GPU optimized, I guess this is the q8 killer then yeah Maybe this will make me get into LTX finally
>>109193128>*On 30xx cards. Why do people keep leaving this out?not only on 30xx cards, it also works on 20xx cards, where did you get the info it's only for the 30xx cards?
>>109193128I have yet to see the speed benefits being claimed on my 3060.Either VRAM is not enough and voids any speed gains or my comfy install is shit.
>>109193113https://huggingface.co/TaihoC/Anima-ControlNet-VACE-DepthUmm retard?
>>10919315840 and 50 cards natively support FP8
>>109193190int8 is faster than fp8 on my 4060 thobeit
>>109193190you still get a speed increase on the 40 and 50 cards, 10 to 20% speed increase, and since int8 is a better quality quant than fp8, fp8 is basically deprecated
>>109193128>Which Format Should I Use?>RTX 30xx INT8 ConvRot for best quality, >INT8 for fastest>RTX 40xx FP8>RTX 50xx Blackwell NVFP4, MXFP8, or FP8, your choicehttps://huggingface.co/Winnougan/Krea-2-Base-Turbo-NVFP4-FP8-INT8I guess it makes a difference (likely speed or accuracy, who knows) but I use INT8 since I'm a 3090let.
what local models do i need for video to video
>>109193222scail2wan vaceltx2.3
>>109193128It's still 2x over all the other variants on my 5090. My ComfyUI is on cu130 with matching pytorch so I don't know what's going on there.
>>109193231gracias
>>109193126chroma krea? did lodes train a krea finetune?
>>109193282Nope, I wish. Rn he's focusing on finishing radiance for Chroma.1 and other models. Maybe he will train Krea, but its license is not the best. That is just a very fancy wf where I'm doing a 2 pass, first genning with Chroma, then passing it to Krea as img2img to fix mistakes which works well and is very fast on my 3090 >>109193069
made a custom node that sends a toast notification if a fixed seed is being used. i think its a pretty neat idea dont you think? sometimes i save a workflow with a fixed seed, open it up the next day and forget it's not randomized then wonder why my gens all are similar.
>>109192210Retard here, how do I use this for gens? Please post workflow.
>>109193354It's not made for Comfy yet.
>>109193397Then why was he yapping about ComfyUI/models/depthanything/?Fucking LLM bot spam.
>>109193006The original Convrot paper was actually mostly about int4, it specially advocated for keeping weights primarily in 4 bit while keeping a handful of important layers in 8 bit. (Both convrot variants) It even envisioned that this could be combined with a nunchaku style outlier decompose lora for much more efficient results.
>>109193460Cornrot is another germick, psycho hump.
>>109193460Oh he actually said these in his post.Well maybe I should have read his post before posting, lol.
>>109193302Is he actually starting radiance training again? I thought he was still trudging through zeta.
>>109193505He restarted Radiance "training" weeks ago.He is "training" Zeta, Radiance and some useless 2.5B pixel space model from scratch simultaneously (Thanks to Comfy being a retard gifting him 2 6000 pros).
>>109193415>filtered ngmi
>>109193069FUCK OFF FAGGOT STICK CHROMA UP YOUR ASS. YOUR CHROMA EXAMPLES ARE SHIT. WHY DO YOU KEEP INSISTING ON USING THAT FAILED, YEAR OLD MODEL WITH A DEV WHO IS MORE OF A CLOWN THAN AN ACTUAL DEV? CHROMA IT'S OUTDATED, BUILT ON THE FLAWED SCHNELL ARCHITECTURE, UNSTABLE AS HELL, AND ONLY SEEMS TO UNDERSTAND SLOPPY PROMPTS. BY INSISTING ON GOING BACK TO THAT MODEL, ALL YOU'RE DOING IS PROLONGING THE DAMAGE CHROMA HAS ALREADY DONE TO THE LOCAL COMMUNITY.
Krea 2 is a realistic /ss/ machine, it wonned
>>109193588proofs?
>>109193639Ideogram is really impressive ngl
>>109193648nice try fedboy
Help me understand Krea 2 better, anons.>Prompt structureSo what's up with it? To my understanding, it's supposed to be>Subject, background/environment and mood, camera angle/framing, style and lightingIn that order, right? And I should be verbose and descriptive instead of throwing a tag soup.However, at times I have trouble forcing the model to adhere to my prompts. For context, I'm using both a the filter bypass and the "instruct abliterated" clip for prompt adherence, as well as nsfw loras. When I prompted for "youthful woman", "thin wasp-waist" and "massive tits" it always gave me very petite results with small breasts, it wasn't until I removed "youthful" that it started generating something closer to my prompt.There's also face: How well does it understand describing facial features, and what's the best way to do so? Would ethnicity work? Should I get a lora for this too?>PosingThis is another wall I'm regularly hitting. I've tried my best to describe a pose, even get gemini and claude to describe it in order to prompt Krea 2 and still don't get anything remotely similar.From what my uneducated ass gathers, I can't use controlnet with this model. So what's the best way to get it to adhere to a reference pose?
>>109193666but it is too big and slow for my hardware so actually it is pretty bad and no one should want it
>>109193666nah
>>109193676>There's also face: How well does it understand describing facial features, and what's the best way to do so? Would ethnicity work? Should I get a lora for this too?use a celeb, it knows many
>>109193678it's pretty small as far as the newer models are concerned and most people can run it
>>10919369418.6B+8B is not small
>>109193685What if I don't necessarily want my result to look like a celeb?[spoiler]What if I want the face of someone it doesn't know?[/spoiler]
>>109192160You get better likeness with fewer images.
>>109193708then you have to train a lora for it know
>>109193703it's a MoE model, no one says Wan 2.2 is a 28b model
>>109193720it's not a MoE model just because it is split into two files>no one says Wan 2.2 is a 28b modelhigh IQ people do
>>109193676Krea2's prompt interpreter works almost like SDXL. The model prioritizes the words in the front of the prompt. The order of the words in the prompt matter more than the content of your prompt. There's often concept bleeding.
>>109193666we can't use other loras with the turbo lora. and and the standard version is fucking slow. also, complex prompts...
>>109193505Seems to be doing 1024p training Radiance alongside Zeta.>>109193575Why does the SOTA at NSFW realism trigger some anons so much? Lol, also Chroma 1 Flash is special and fixes many problems that plagued its OG version. Chroma.1 HD unfortunately suffers too many issues for me to go back to it, but a mix with HD Flash does fix them (at the cost of being huge to run, I'll figure out if that can be fixed). >UNSTABLE AS HELL, AND ONLY SEEMS TO UNDERSTAND SLOPPY PROMPTSSkill issue. Its NSFW understanding is far beyond Krea, good luck prompting any of these safetymaxxed models for "panties slightly visible", and unlike Krea it understands every single expression that girls can do from nuanced prompts without needing to resort to bypass tricks.>PROLONGING THE DAMAGE CHROMA HAS ALREADY DONE TO THE LOCAL COMMUNITY.Your screeching is akin to>why do you keep using what's available to youThere's no reason why someone who's seeking to decensor Krea 2 shouldn't use just use Chroma in 2-pass setup, otherwise they'd be limited to the sameface, poses, lack of coherence, lack of prompting freedom, and a bunch of other issues that are limitations of LoRAs.Also, it goes without saying that a Chroma-Krea wf doesn't suffer from most of the limitations of Chroma except for the limited areas where it falls short. Even then, just increase denoise, adjust prompt slightly on Krea side and you're good again.
>>109193772I thought they all worked like that.
Prompt understanding/control of Krea, but with Chroma's uncensored and unblurred backgrounds and subjects, both fixed. It's a win/win for realism.
Controlnet works with this actually.https://github.com/facok/comfyui-krea2-controlnetDamn I hoped it would overpower censorship when it comes to naughty stuff but alas.Besides that it's meh. It's too loose for my taste when it comes to following the controlnet image, results are a bit weird, it also adds stuff to image that wasn't in the control image.Thanks to the person who made it for the effort nonetheless.
>>109193871I'm not running two large models
>>109193879You don't need to. Just generate the Chroma image separately and then do I2I in Krea2 with low denoise.
>>109193825I've yet to see any evidence that Krea 2 without loras is in any way "good" at NSFW. No one has ever posted a pic made with vanilla Krea 2 that went beyond basic solo booby, anywhere that I've ever seen.
>>109193887yeah, that's running two large models
>>109193397>It's not made for Comfy yet.
Based Klein
>>109193895I assumed you already had chroma gens you wanted to improve. Either way, the NSFW Krea loras don't match what all chroma can do. For example none of them can do anal good, so it just made my results worse
>>109193906Even for ZiT like, the Mystic NSFW ones were pretty good, if Krea 2 is just more of that IDK why people are hyping it up so much
>>109193900he literally just made this an hour agohttps://github.com/facok/comfyui-krea2-controlnet
Comfy really needs to work on making the UI just werk if this is going to become a meaningful industry standard.
>>109193913Mystic NSFW changes the appearance too much for me.
>>109193925SNOFS is better
>>109193924actual enterprise companies use cloud models so i dont think it matters
>A muscular beautiful young Japanese woman with long dark hair, wearing a wet black high-cut one-piece swimsuit, clear platform high heels, and gold earrings. Her skin is shiny with oil or water, highlighting her muscular physique. She poses with one hand on her hip and the other extended flexing, looking directly at the camera.>A small golden triangular logo with an eye symbol and the word 'Milly' in cursive script, located in the top-left corner.>gray painted brick wall with visible mortar lines, light-colored tiled floor.>studio lighting, even and bright, highlights muscle definition and sheen
>>109193879Not that large. It's just 8 steps an INT8 version for Chroma 1 HD Flash (a flash version quantized at INT8 and running 2x the speed, only 30 secs on 3090), plus the usual Turbo speeds with Krea for 8 steps (also 30 secs). Think of it this way, the cost of the workflow assuming you have at least 64GB of RAM (the more, the better) to fit both models is just 2x the speed of your current Krea workflow (during inference), and the benefit is deslopping the model for almost all NSFW tasks you'd need to.
>>109193925Yeah it's not perfect, but it was still quite good as far as ZiT is concerned IMO. Anyways BigAsp V3 on Klein 9B is looking good IMO, hope it gets either a Turbo version or at least Turbo lora of some kind eventually when it comes out.
>>109193928eh, i'm really not a big fan of any AIO NSFW lora. I prefer targeted concept loras. Those tend to work the best.
>>109193932ideogram looks like it has some shitty grainy filter applied, no thanks.krea2 is better.
>>109193934Starting Chroma image.https://files.catbox.moe/4898v0.pngSame seed as >>109193934, slightly higher denoise from 0.50 to 0.55. https://files.catbox.moe/exphbd.pngMy usual denoise level of 0.60 (most slopped in this case)https://files.catbox.moe/9eddx8.pngThe model also doesn't understand as many hand signs as Chroma it seems and it needs decensor.
>>109193968better if you also thought pre-2512 Qwen Image was somehow good at producing realistic images maybe lol
>>109193986it doesn't matter either way. ideogram is abandoned as far as the community goes. everyone is making loras for krea 2. you can keep using a dead model all you like though. good luck
Why are there still people praising Ideogram when it can't even do SFW gens? It censors bread for christ sakes.
>>109193998Ideogram is abandonned because it decided to kill itself by not allowing coom loras, not because Krea 2 is better, that's a huge difference
>>109193996troon hands
>>109194017https://civitai.red/models/2688234/realism-engine-ideogram-4-krea-2?modelVersionId=3047310
>>109194017ok anon you win. now can you stop posting useless ideogram comparisons?
when will we finally get a model that's small enough to be widely used and receive community development but still enough of an improvement to replace XL instead of these bloatmodel abortions that are forgotten as soon as the novelty wears off
>>109194022why? you're afraid to measure Krea 2 to other models? that doesn't scream "this is the best model ever" to me
>>109194027comparing things to dead models is a waste of time.
>>109194021You still need to do BBOX gymnastics to get that lora to work.
>>109194023Anima is kinda that DESU
The comparisons will continue. I like both models btw
>>109193998My point is moreso I do not understand how Krea 2 is supposed to be better than really any vaguely recent model, not just Ideogram
>>109194056Uhh it knows celebrities or something
>>109194021>That json promptingInto the trash it goes. I'm a human being, not a REST API. No, LLM prompt enhancing are not as good as just raw prompting (the latter is much more creative than the former as LLMs use an autoregressive architecture)
literally me
>>109193932
Where are the america gens for America's birthday
>>109193354>>109193415cool it fucking retards i just found it when it was just released and no support for comfyui and jumped the gun. But here you miserable fucks.https://github.com/facok/comfyui-krea2-controlnetThings are moving fast it seems.
>>109194114nice toes
>>109194162Image blocked by safety filter.
no. i dont think i will
>>109193874>Damn I hoped it would overpower censorship when it comes to naughty stuff but alas.I was already 70% there just denoising an image from anima, disabling safety filter and using some nsfw lora's for krea. with the controlnet I think I'll be able to 100% denoise the image or at least a little higher without losing too much composition. There was not many images it couldn't do at 80% denoise, so maybe its skill issue on your part? I'll maybe share a workflow if I get it working but do not pester me.
>>109194162
>>109193919>>109194180This node seems superior to the brute force workaround i made. I'm about to have a lot of fun with this
>>109194361I get RuntimeError: No compatible Krea2 control LoRA block weights were found in the selected file. when running the node =/ don't know why because I checked the hash of the file and it is the correct file. any hints?
mondo girl clocking off
>>109194361>>109194381well they changed something 16 minutes ago, they might have broken it because its not loading that lora at all. Unless I'm being retarded which I don't think I am because I followed everything exactly.