Monke Edition
Discussion and Development of Local Image, Video, and Music Models
Previous: >>109153577
https://rentry.org/ldg-lazy-getting-started-guide
>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, & Upscalers
https://huggingface.co/models
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner
>Krea 2
https://huggingface.co/krea/Krea-2-Raw
https://huggingface.co/krea/Krea-2-Turbo
>Z
https://huggingface.co/Tongyi-MAI/Z-Image
>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net
>Qwen
https://huggingface.co/collections/Qwen/qwen-image
>Klein
https://huggingface.co/collections/black-forest-labs/flux2
>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23
>Wan
https://github.com/Wan-Video/Wan2.2
>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage
>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg
>Local Text
>>>/g/lmg
>Maintain Thread Quality
https://rentry.org/devil
https://rentry.org/angle
>shitbake
>>109157211
>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>mfw Resource news
06/28/2026
>Clark Air: A Sana 1.6B text-to-image transformer compressed to ternary
https://huggingface.co/clark-labs/clark-air-sana-1.6b-1.58bit
>ComfyUI Krea 2 NegPiP
https://github.com/blue-pen5805/ComfyUI-krea2-negpip
>FLUX.1-Kontext RefControl LORAs
https://huggingface.co/collections/thedeoxen/flux1-kontext-refcontrol-lorareference-control-input
>Un-0: Generating Images with Coupled Oscillators
https://unconv.ai/blog/introducing-un-0-generating-images-with-coupled-oscillators
>ComfyUIQuantFunc: Run quantized models at 2x–11x speed with zero Python model dependencies
https://github.com/RealJonathanYip/ComfyUI-QuantFunc
06/27/2026
>ComfyUI-VAEFrequencyBlend: Blend images decoded by different VAEs
https://github.com/thezveroboy/ComfyUI-VAEFrequencyBlend
>Ideogram4 & Krea2 Inpainting with LanPaint Support
https://github.com/scraed/LanPaint/releases/tag/1.5.5
06/26/2026
>OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation
https://correr-zhou.github.io/OmniShow
>Adobe to Acquire Topaz Labs
https://news.adobe.com/news/2026/06/adobe-to-acquire-topaz-labs
>LiveEdit: Towards Real-Time Diffusion-Based Streaming Video Editing
https://live-edit.github.io
>PhysRAG: Enhancing Physics-Awareness in Video Generation via Retrieval-Augmented Generation
https://github.com/sediment1024/PhysRAG
>SAM2Matting: Generalized Image and Video Matting
https://henghuiding.com/SAM2Matting
>Unison: Benchmarking Unified Multimodal Models via Synergistic Understanding and Generation
https://github.com/FudanCVL/Unison
>ComfyUI-AppleSilicon-FP8 - a compatibility layer custom node for Apple Silicon
https://github.com/pawel-mazurkiewicz/ComfyUI-AppleSilicon-FP8
06/25/2026
>Bernini-R - GGUF (high & low noise experts)
https://huggingface.co/neuregex/Bernini-R-GGUF
>Physics Question Scene Graph: Fine-grained Evaluation of Physical Plausibility in Text-to-Video Generation
https://github.com/atinpothiraj/pqsg
>mfw Research news
06/28/2026
>RayPE: Ray-Space Positional Encoding for 3D-Aware Video Generation
https://arxiv.org/abs/2606.27345
>IDAG-Edit: Multi-Object Video Editing via Instance-Decoupled Attention and Guidance
https://arxiv.org/abs/2606.22042
>FLAT: Feedforward Latent Triangle Splatting for Geometrically Accurate Scene Generation
https://flat-splat.github.io
>DE-FIVE: Detecting Malicious Image Prompts via Fourier Features and Image Vector Embeddings
https://arxiv.org/abs/2606.22779
>Token-to-Token Alignment of Text Embeddings for Semantic Blending
https://arxiv.org/abs/2606.24021
>TriMotion: Modality-Agnostic Camera Control for Video Generation
https://arxiv.org/abs/2606.20774
>T-VSS: Test-Time Visual Subspace Steering for Adversarial Robustness of Vision-Language Models
https://arxiv.org/abs/2606.23132
>Modular Diffusion Models for Structured Visual Recognition
https://arxiv.org/abs/2606.22702
>Autonomous Video Generation with Counterfactual Controllability for Self-Evolving World Models
https://arxiv.org/abs/2606.24152
>ScalePredictor: Instance-aware Scale Learning for Accurate Quantization of Vision Transformers
https://arxiv.org/abs/2606.21947
>Layer-Specific Prompt Fusion Discovery via Differentiable Search in Vision Foundation Models
https://arxiv.org/abs/2606.26379
>TaskTok: Delving into Task Tokens for Task-driven Image Restoration
https://arxiv.org/abs/2606.26615
>Einstein World Models
https://arxiv.org/abs/2606.26969
>Hallucination in World Models is Predictable and Preventable
https://www.nicklashansen.com/mmbench2
>Fast LeWorldModel
https://arxiv.org/abs/2606.26217
>Beyond the Hard Budget: Sparsity Regularizers for More Interpretable Top-k Sparse Autoencoders
https://arxiv.org/abs/2606.27321
>From Hallucination to Grounding: Diagnosing Visual Spatial Intelligence via CRISP
https://arxiv.org/abs/2606.26535
>mfw API news
>ByteDance launches Seed Audio 1.0 Unified AI Audio Generation for Speech, Music and Ambient Sound Creation
https://fal.ai/models/bytedance/seed-audio-1.0
>Midjourney goes from generating cat images to full-body ultrasound scans
https://www.theverge.com/ai-artificial-intelligence/952011/midjourney-medical-ai-ultrasound-scan
>Alibaba releases HappyHorse 1.1 Available on Alibaba Cloud
https://www.alibabacloud.com/blog/happyhorse-gets-stronger-motion-expressiveness-higher-generation-consistency-and-enhanced-visual-quality_603293
>ByteDance's New AI Video Model Can Make 30-Second Clips From a Single Prompt
https://www.cnet.com/tech/services-and-software/bytedance-introduces-new-seedance-2-5-video-model/
>Luma Introduces Ray3.2 Model & API: Complete Creative Control for Video Generation
https://lumalabs.ai/news
>The Layout Bet - Reve 2.0
https://blog.reve.com/posts/the-layout-bet
>Introducing Gemini Omni - Google’s multimodal video creation/editing model
https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-omni/
>Nano Banana 2 and Nano Banana Pro are generally available via Gemini Enterprise Agent Platform
https://cloud.google.com/blog/products/ai-machine-learning/nano-banana-2-and-nano-banana-pro-are-generally-available
>Grok Imagine 1.5 Preview
https://x.ai/news/grok-imagine-1-5
>Seedance 2.0 in Runway API
https://docs.dev.runwayml.com/api-details/api_changelog/
okay, we pull
>>109157436
this is off-topic
>>109157459
I agree. He should revise it and add local-API news, like the new Seedance 2.0 mini arriving on ComfyCloud
https://blog.comfy.org/p/seedance-20-mini-and-4k-is-now-available
>>109157467
is normal seedance too?
>>109157211
so you guys are back to wrongbaking
interesting interesting
dogshit thread made by a retard
>>109157447redownload everything LOL
>>109157211
at least keep the thread image to something bearable
i dont usually hide threads but this is atrocious
>>109157211>>109157377>>109157382>>109157436Fuck off /s*g/ faggots
>complete meltdown over schizo wallsgo outside
>>109157525wheres her dick bulge
>>109157534its in her ass
>>109157537there is no need to be so retarded either yet here we are
>>109157537you're a good puppet
>>109157537kek
so the containment general is so dead even koff and nigbo prefer to post in /ldg/ kek
>>109157566look it's OP
>>109157537just so i understand the story: you're like nigbo's sidechick right now? i'm trying to understand the lore
>>109157588
>>109157595seems like you're quite invested into this
>>109157588it's probably debo himself or his tranny fuck boy.debo lives to personally ruin this thread because he has nothing in his life worth living for.>>109157595like this. only a millennial troon would find this amusing in any way. it's peak reddit.
>>109157602>>109157605
>An attractive goofy Asian woman, who is reclining on a yellow, modern chair. She has long, straight black hair with bangs and is smiling directly at the camera with buck teeth. She is dressed in a frilly french maid costume with white panties and round eye glasses. Her bare feet are prominently displayed as she holds her right foot up high by the ankle showing her sole to the viewer, her left foot remains low.
>The background is a serene, outdoor garden setting with lush, green foliage and a potted plant with rounded leaves
>medium shot, amateur photography, natural, candid, everyday
https://huggingface.co/Beinsezii/Krea-2-Turbo-Projector-Scale-LoRA-Diffusers
This is what I'm using to bypass the Krea censor, not sure if there's something better
>>109157595*oh and please post the next ones you prepared, you seem to put quite some time into this
>>109157289
>I use Q8
Nta but that's the slowest quant you can use on Comfy
>>109157619I like quaaaality
hey debo i hope you do one of the mistakes that the rocketgurlp*do did :)
>>109157611
Of course there's something better anon-kun: my Chroma-Krea workflow, which I'm in the process of refining by converting everything to INT8. Almost done with T5, then I'll share it.
My workflow works better than every NSFW tune that's been released for Krea.
>>109157611
zit>id4>krea>klein
zit hits the mark on all
id4 is technically the best but looks too much like a glossy magazine print
krea is weird and smooth
klein has lol anatomy like usual, but has a nice background
how can a monkey be so triggering
>>109157634I look forward to it
>>109157648it's not about the retarded monkey it's the obvious removal of debo's rentry
>>109157648well you act like debo's bitch, it's not the monkey "koff"
>>109157663>well you act like debo's bitchit's probably just debo
so is anima outdated already thanks to krea?? did tdruss blow his load too early training a shitty dead-end base model like cosmos that nobody ever heard of (for good reason)?
>>109157675yeah
>>109157675anima has its place with vramlets. its not going anywhere due to that. however with that said he will not be making his money back.
>>109157681can krea do better?
>>109157690
for porn and anime it's a WIP but we're getting there pretty quickly. krea is just a great base model in general.
https://files.catbox.moe/uib9m2.png
https://files.catbox.moe/6d8ubk.png
I genned these earlier
I completely skipped over anima and only got back into ldg for ideogram and krea. Anima just seemed like an incremental improvement over illustrious/noobai
>>109157681i tried krea turbo and anima and they were both around the same speed? in fact i torch compiled anima too so anima might actually be slower than krea turbo
>>109157211>sdg tier OP as ldg OPgrim
>>109157712
yeah it's a super incremental improvement over illustrious. the prompting is better and that's about it.
>>109157720
I mean speed is important, but anima has the advantage of being super trainable by people with little vram, which is why I say it will probably always stay relevant: it gives them that freedom. krea is just better in pretty much every regard except for anime/porn out of the box.
>>109157716
anima is great for 1girl standing because of the improved vae over the shitty xl one, but the lower parameter count and other factors cause it to actually be worse at sex posing than illustrious. a krea finetune would be crazy considering how good even the slopped nsfw loras are. it's a bigger model but it seems to learn faster, so perhaps it wouldn't need as many epochs
>>109157211why no collage?
>>109157760debo decided that right now is a good chance to shit up /ldg/ for some reasonwatch him cry when the revenge hits
>>109155169>>109155437>>109155497thank you>>109154269
>>109157844
>>109157712
>incremental improvement
Anima has a low diversity problem. It's not an improvement over illustrious/noob in general.
>>109157844Been a while since I've seen a classic flux buttchin
>>109157712If Anima taught us anything, it's that it was the last of its breed when it comes to fine tuning. The era of fine tuning a base model, or stacking loras on top of one is over. If you want a real illustration or anime model now, you have to build it from scratch. New models just don't take to fine tuning the same way anymore, those days are gone.
I can't believe claude 1-shot this script
>>109157595hey where's the rest?
>>109157595debo-esque
>>109157738
>seems to learn faster
Learns faster is a meaningless metric and always has been. What matters is how a model reacts to new info, how well it interprets it, and how well it integrates it with what it already knows. Problem is you can't measure any of that without dumping money into training runs first. Funny enough, that's exactly what we learned with Anima. The dev spent like 100k on finetuning it, and without that this whole chat would still be pure speculation on a mongolian forum, same as it is now for every other model nobody's bothered to fund like Krea, Klein, ZImage, Qwen
>>109157869the /h/ thread has had a stealth metadata viewer since forever ago desu not really surprising
>>109157738>but the lower parameter count and other factors cause it to actually be worse at sex posing than illustriousKEK good one anon
>>109157611
Haven't been around for a while. Is Krea more of a Chroma type of model, or more of a Z-Image type of model
>>109157712As someone who was the biggest Illustrious shill, Anima mogs the fuck out of it.
>>109157928it's more like z-image in the fact that it's actually coherent, but more like chroma in the way that it's full of artifacts/noise
"Z image Turbo Base"
https://github.com/blue-pen5805/ComfyUI-krea2-negpip
>KREA 2 NEG PIP
https://github.com/blue-pen5805/ComfyUI-krea2-negpip
>KREA 2 NEG PIP
https://github.com/blue-pen5805/ComfyUI-krea2-negpip
>KREA 2 NEG PIP
>>109157712
>an incremental improvement
I don't know if people who say this are bad faith anima anti-shills, or just prompting for the most generic 1girl shit imaginable. Anima's VAE alone is a massive boost over SDXL, you can generate at 1024 res and actually not have melted eyes on 100% of the gens. And the moment you go for multi-character interactions or less common concepts, Anima will one-shot things that SDXL simply can't do coherently no matter how many times you reroll.
It isn't a small improvement, it's a night-and-day difference. how tf can you not instantly see that.
>anima is night-and-day better than illustrious
>krea is night-and-day better than anima
holy fuarkkkk can we get a double-upgrade this year?? where is laxsaar to finetune krea
turbo models just arent really that good yet
>>109158036The average genner's card is nearly a decade old they cannot afford to admit turbo models are bad
I have been trying to make a nude with krea using an abliterated qwen3 vl and using different vae. I was not able to do it. How are people using it?
>>109158063are you using one of the uncensor loras? i dunno if abliterated TE actually does much unless youre using the prompt enhancer
>>109158067
i've used the one posted here, projector scale. But it seems to make the prompt adherence not that good
>>109158021
>actually not have melted eyes
fuck illustrious for this. noob makes some interesting gens, but illustrious was nothing but blurry messes for me most of the time. i don't know what >>109157712 was talking about.
>>109158073
this is the one im using, strength 100
https://civitai.red/models/2728234/krea2filterbypass?modelVersionId=3067151
>>109158107nice, thanks anon
>>109158007thanks!
have you come to your senses and realized that krea 2 is dog shit yet? weebtrannies need not apply
>>109158260
i rarely post in this shithole but from what i've tested myself krea has a really good understanding of background and composition, but is incredibly retarded when it comes to actual people
it also has a really huge library of celebs and characters as far as i can tell even though they all get a retard filter applied
what really pisses me off is that it has ZERO understanding of photography prompts / photo settings outside of muh 35mm kodak film
it's like if sdxl and flux had a baby that was an asperger autist
and shorter less verbose prompts seem to be the key
>>109158260you just wait, russ will change your mind.russ..?? where are you??? hellooooooooooooooooooo???
>>109158007
>KREA 2 NEG PIP
>sample has cfg at 1
is that how it's supposed to work?
>>109158360
you are supposed to use cfg 1 with turbo
>>109158406
don't negative terms only work when cfg is above 1?
>>109158429that's why you use negpip
>>109158429it allows negative weights in the positive nigga
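For anyone following the cfg question above, a minimal sketch of the usual classifier-free guidance mix in plain Python. This is the textbook formula with toy numbers, not code from the negpip repo, and `cfg_mix` is a made-up name.

```python
def cfg_mix(uncond, cond, scale):
    """Standard classifier-free guidance: uncond + scale * (cond - uncond)."""
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]

cond = [1.0, 2.0, 3.0]     # toy prediction for the positive prompt
neg_a = [0.0, 0.0, 0.0]    # toy prediction for one negative prompt
neg_b = [5.0, -1.0, 2.0]   # toy prediction for a different negative prompt

# At scale 1 the negative branch cancels out: both results equal cond.
same_a = cfg_mix(neg_a, cond, 1.0)
same_b = cfg_mix(neg_b, cond, 1.0)

# Above 1 the choice of negative prompt changes the result.
diff_a = cfg_mix(neg_a, cond, 2.0)
diff_b = cfg_mix(neg_b, cond, 2.0)
```

So at cfg 1 a normal negative prompt does nothing, which is why negpip moves the negative weight into the positive conditioning instead.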
>>109157702
Can you please post the metadata and prompt for the Chun-li pic? I can't get it even with catbox.
Also what are the best workflows for:
- Illustrious?
- Anima?
- Krea?
I'm using https://civitai.red/models/1386234/comfyui-image-workflows but wonder if there is better.
Also on Anima, my pics are kinda blurry even at high res.
Also wouldn't mind a wildcards setup. Is there a better one than jbye's wildcards?
Can anyone do a collage for last thread, the monkey can be ignored
OH RUSSELL~~~
>>109158051zit is nicer than krea
>>109158330>photography prompts / photo settingsWhat models are good with those?
>>109158658flux whatever
>>109158626
I appreciate how Ideogram makes them look more like shitty cosplayers rather than the character if she were real. But it's really hard to get a genuine amateur looking photo
>>109158658
honestly, shouldn't you use the /b/ thread for non-blue purposes?
As far as I'm concerned, local isn't even capable of such sordid details.
>>109158678not allow
>>109158713No one implied porn except you.
>>109158732even worse he implied lolis
>>109158713>/b/ threadthat place is a shithole full of namefagging pedos.if you think we've got a lot of mentally ill resident poster here, it's 10x worse over there.
>>109158626>>109158708i caught a chie...
>>109158747What a total snake.
>>109158752>shitholeooohohhhhh this is different?
>>109158752I love namefags. They are the easiest to ignore.
>>109158845so true
>>109158865what's "debo"
when will i get a ai waifu robot anons?
>>109158881You can go ahead and text chat with her. She's not able to get a shell just yet. gemma 4 31B.
>>109157869>>109157909How do you get this?
>>109157611
>>109157634
Chroma-Krea workflow. Bring Chroma's natural prompt understanding to Krea with INT8 loaders so it goes fast. Still trying to figure out what the best settings are for this, but so far these are the best I've found.
Grab the wf here
https://files.catbox.moe/3dykt9.png
This is by far the fastest and easiest way to uncensor and deslop the model all in one.
Absolutely no LoRA needed.
This LoRA on the fly fixes bad background and incoherent/bad anatomy from Chroma. If Chroma's gen is really bad (duplicate limbs) I recommend just increasing denoising, or changing the seed. Play around with denoise anywhere from 0.4-0.7.
It is plug and play compatible with all NSFW LoRAs made, but make sure to disable the rebalance node in that case, and I can't guarantee it will work as well with those (best results seem to be with LoRAs completely disabled)
Grab INT8 Krea
https://huggingface.co/Winnougan/Krea-2-Base-Turbo-NVFP4-FP8-INT8
INT8 Convrot Chroma 1 HD Flash
https://gofile.io/d/QlXI2i
T5 Text encoder
https://gofile.io/d/l93BuN
I'll also look into converting OG Chroma 1 HD to INT8 for its styles
https://files.catbox.moe/sfvcmk.png
https://files.catbox.moe/kp6tqt.png
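Not part of the workflow above, just a rough sketch of what the denoise knob does in an img2img pass like this one. It assumes the sampler stretches the schedule to steps/denoise and only runs the last `steps` of it (my understanding of how ComfyUI's KSampler treats denoise, so treat that as an assumption). The linear schedule and the name `tail_schedule` are made up for illustration.

```python
def tail_schedule(steps, denoise, sigma_max=1.0):
    """Toy linear noise schedule; keep only the last `steps` intervals."""
    if not 0.0 < denoise <= 1.0:
        raise ValueError("denoise must be in (0, 1]")
    total = int(steps / denoise)  # length of the stretched full schedule
    full = [sigma_max * (1 - i / total) for i in range(total + 1)]
    return full[-(steps + 1):]    # start partway down, always end at 0

# The first value is the noise level the input image gets pushed to
# before the sampler starts cleaning it up again.
for denoise in (0.4, 0.5, 0.7, 1.0):
    sigmas = tail_schedule(8, denoise)
    print(denoise, len(sigmas) - 1, "steps, start sigma", round(sigmas[0], 3))
```

Lower denoise starts from less noise, so more of the Chroma gen survives; higher denoise lets Krea redraw more of it.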
great slop today fellas
>>109158966
>>109158985
>>109158985
>>109158988
there's some splotchiness to the gens, what's going on with that
>>109158966krea has samefoot.
>>109158966
>This LoRA on the fly fixes bad background
WF*
For those not aware, this WF fixes Chroma backgrounds in one shot.
https://desuarchive.org/g/thread/109143837/#q109144703
Now the general skin texture consistency has been improved and I figured out what was causing initial artifacts (LoRAs).
>>109158966I have the thing setup thank you very much.
>>109158992
combining meltychroma with splotchyvae qwen will do that
>>109158983thanks
>>109158966Nice work
I don't think I have any problems with the skin texture. The colors bother me a little though.
>>109158992Wdym?
>>109159017Thanks
>>109159036>>109159042Those melons look tasty
>>109159005>>109158992Nice way of replying to your own posts. What's going on with THAT?
>>109159036
there are noticeably sized blobs across your images. look at the sand behind her, there's these uniform pinprick sized dots
>>109159082he's making free apples for all our pals with aphantasia
>>109159036>>109159042idk man there's consistent points across your gens for some reason. if this is just your watermark or something i'm sorry for prying
>>109159111nta but i thought you were talking about the weird chroma fry the image had. how did you even notice that?
>>109159111
>>109159111*anons first time noticing when the seed is fixed
>>109159180ass shot of her shiny bike shorts?
>>109159204
Anyone had any luck integrating an LLM into their workflow? I'm trying QwenVL 3.8 (or something, same used for Krea2) and it rarely adds anything. And I think my system prompt is bad, as it either removes stuff I want or breaks things.
>>109158966
Some more findings: different samplers may occasionally yield better results with teeth/eyes on some long distance gens. heun or res_2s (as in this case) might yield the best (most detailed) results, but then it introduces weird noise similar to Ideogram/Krea. In pic rel's case I had to increase the Scale Image To Pixels node to help mitigate this issue a bit, but the image took a while to generate. I'll look into different ways to help mitigate this, e.g. better settings or perhaps a face detailer.
https://files.catbox.moe/jkeu9u.png
>>109159306
>Ideogram/Krea
Ideogram/Klein*
>>109159019If you're referring to the Chroma-Krea wf, colors can be muted by turning Rebalance Node down from 10 to 1 or disabling it. It will look exactly like ZIT, but I prefer Chroma's colors.
>>109159221Imbressive!
god fucking damn it! anima being 2b on a shitty base model is such tragedy.
>>109159482Good enough for 1girl genning. But I guess I should switch to Krea2 for more involved stuff.
>>109159489if i just wanted to 1girl - noobai was sufficient enough.if only krea2 had style transfer... can't have shit as a localkek!
>>109159500>if only krea2 had style transferhttps://www.reddit.com/r/StableDiffusion/comments/1uhpiov/krea2_style_transfer_first_release/
> Chromafuck off
>>109159548Best thing one could do since all Chroma iterations have failed and no one is properly finetuning Chroma. I don't think bigASP will be the answer, but we'll see.
>>109159509
>IPAdapter has arrived so quickly
Holy shit kek, so the model was uncensored via a rebalance node, Krea devs planned to keep their style transfer feature under API, then a few days later IPAdapter (which usually would've given them months) is out. This release is an absolute disaster for their team, but at the same time so glorious for local.
>>109159645But does it work for anime?
>>109159659There's a manga example there, copying rough sketches and everything, so it should work. I'll test it out in a sec
>>109159670>>109159670Well?
>>109159780death by disappointment.
https://huggingface.co/Comfy-Org/Boogu-Image/discussions/10#6a404740de5b188a16cb8471
>I won't be doing fp8 because int8-convrot is just much better, faster and better quality on all Nvidia GPUs. I've uploaded that now, but currently it needs very latest ComfyUI and comfy-kitchen versions.
based, total fp8 death!
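For context on what an int8 weight quant is, a minimal sketch of plain symmetric absmax int8 quantization in pure Python. This is the generic textbook scheme, not the int8-convrot method from the quote (the post does not say how that works), and the function names are hypothetical.

```python
def quantize_int8(weights):
    """Symmetric per-tensor int8: one float scale, integers in [-127, 127]."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to the int8 range.
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize_int8(q, scale):
    """Map the stored integers back to approximate float weights."""
    return [v * scale for v in q]

weights = [0.5, -1.27, 0.0, 1.0]
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)
# Each restored weight is within half a quantization step of the original.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
```

Each weight costs one byte, same as fp8, so whatever advantage the quote is claiming comes from how the values are laid out and multiplied, not from file size.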
im using mxfp8. it should be better for 5xxx series.
>>109159812> Nvidia GPUsnigger
>>109159831why the fuck are you an AMDfag, are you retarded???
>>109159836because fuck nvidia
haven't been doing this since SDXL was the latest and greatest. what's the primary one people use these days?
>>109159780>>109159797Relax anon, it works! This is without even tuning the parameters
>>109159877Need more examples.
>>109159877
>>109159509Can't get it to work, with any of the fixes in that thread.
Forge couple is so fucking bad man, this extension is fucking dreadful to use and virtually unusable. No wonder comfy won the local wars when forge people are forced to use this unironic dogshit.
>>109159925Worked first try for me, just make sure to copy and paste krea2.py exactly as the instructions say. Now I will test realism styles
>this threadsfw vageen department is impressed
>>109159941try
>>109159500>if only krea2 had style transfer... can't have shit as a localkek!they kept the transfer adapter for themselves, and on their technical report they said they're using Krea 2 with Flux.2 vae (while local peasants are using Qwen vae), localkeks are using an inferior version with a safety filter on top of it, yay, please hype!I'm not a big fan of ideogram, but those fags did gave us the exact same version as their API model, just saying
which models are good for loli/shota and feral?
>>109158036>turbo models just arent really that good yetnonsense, Z-image turbo is a great model
>>109160027> Z-image turbo is a great modelfor chinese
>>109158330
>what really pisses me off is that it has ZERO understanding of photography prompts / photo settings outside of muh 35mm kodak film
it's not pissing me off but it surprised me. Krea emphasized that this model was supposed to be great at styles, so I was expecting the model to have a lot of knowledge on camera lenses and shit. this model is so superficial desu, they focused too much on adding The Rock and other celebrity slop and not enough on adding precise styles
>>109160031>only Chinese people have good tastesad, we're talking about people eating dogs lol
>>109159941
Oh as usual, human error. I was loading the 8b instead of the 4b clip model.
It even works with sage attention enabled.
>>109159812Anima int8-convrot when?
>>109159941Pure kino. It works fine but to get good results with style ref, images need to be as close to square or a Krea 2 supported resolution as possible since the WF isn't adapting the res based on that. There's also some weird fuzz, not sure what's causing it. But also not entirely useless, a second pass at low denoise could potentially fix this and make it pure kino.
I'm pleasantly surprised. I was waiting on a fix for this tech when I saw it earlier this morning. I've been looking forward to something like this for a long time.
I feel that the devs will release their own version of it after seeing the community make it themselves.
>>109160044> 1girl fish head standing dull colors is good taste
>>109160067
>>109160074
>>109160071>dull colorsthis nigga spent too much time looking at slopped images he can't understand that Z-image turbo is the model that actually renders colors accurately
>>109160071>1girl fish headI don't get this meme, Z-image turbo can do so much more than close up portraits, what are you talking about??
>>109160080There's a bunch of values. It seems it's reading both the style and the composition of the image separately, so I think you can get rid of parts of the reference image bleeding over.
>>109160092Might be difficult to see in the resolution I'm uploading, but it's picking up on the smaller details like grain, crosshatching etc.
>>109160117>when there are lots of checkpointsgood joke, we only got 2 good checkpoints this year, Klein 9b, and Krea 2 Turbo
>>109160091now compare with other models
>>109160125Sure
>>109160001
>>109160123*sad Anima noises*
>>109160138Still on about that nitpicked gen? Kek, I guess you need a certain kind of autism to see that this model is much better than ZIT. I mean, there's no way ZIT could even do style transfer this well simply because the base model is weaker and img2img on it sucks, so you could tell it wouldn't succeed.
>>109160168
>it's nitpicked because I said so
prove it, show a comparison that displays Krea 2 being on par with ZiT on realism (he won't make the effort and show any sign of good faith, he's just a troll after all)
>>109160140impressive, what about real artists?please also try turned 180
>>109160168
>there's no way ZIT could even do style transfer this well
why are you saying random shit all the time? with the style transfer node it works fine on ZiT
https://github.com/BigStationW/ComfyUi-Untwisting-RoPE
>>109160024sdxl/anima
holy fuck i need the rope now
>>109157211brehs, how come (You)'re blessed with high quality (open source?) local models
Did anyone run malware bytes on the rope because it does seem kinda sketch
>>109160307I've run the codex, there is no network calls or anything malicious.
>>109160305
nice image, keeping the anime style while the character is far away is a tricky thing to do, glad this model can do it
>>109160305I like bigasp but without a specialized turbo lora it's just too long to render, I got spoiled with fast quality gens with ZiT and K2
>>109160319It's just easy to train style loras with flux 2 vae. Just take high res images and training goes brr. For older VAEs, I usually crop faces and zoom them in, here I think there is no need to do that.
Happy Monday anons, can't wait for whatever model is going to dethrone Krea this week,
>>109160350
>It's just easy to train style loras with flux 2 vae.
I can confirm, that's why I'm not hyped at all by Krea 2. I can tell this model will be a bitch to train properly, and even if you have the patience to get the perfect settings, it'll never reach the heights of a model with a better vae
so has krea2 just straight up replaced ideogram 4 already?
>>109160341
Anime-style images are 20 steps euler with klein distill lora at 0.2, cfg 4.9, takes me about 10 sec for a 1k image on a 375w 4090. (fp8, all --fast)
It's okay-ish, chroma, for example, was way worse (because you need to roll a lot on the base chroma base and flash just slops everything).
>>109160368Ideogram just killed itself with the bbox autism, no one was using it before Krea 2 already
>>109160386>Ideogram just killed itself with the bbox autismpersonally I don't want to use it because it's using the retarded 2 models MoE shit (like wan 2.2)
>>109160341
I wouldn't be too worried, once he finishes the training it'll be 2x faster as soon as someone makes an int8-convrot quant of it
>>109160407why would it using 2 models impact how you use the model
>>109160368
Ideogram 4 was pretty much DOA. The control it offers is of niche use, 99% of people genning don't need or even want that kind of control, since it removes the whole 'let's see what we get' anticipation, and of course it slows down trying different prompt ideas. License is a trainwreck, you can't even legally train on anything NSFW which means 90% of community interest is gone, and on top of it the model is slow.
Krea 2 obviously hammers more nails into Ideogram's coffin, but it was never going to take off. It could see some use in the semi-pro community but since you can't use it in any commercial capacity whatsoever that's not going to happen either.
>>109160427because you have to unload the first model and then load the second model, the total is 18b, you can't put all of that in your GPU unless you're some richfag that has a 5090, and why would there be a second model in the first place? ZiT doesn't need a second model, Krea 2 doesn't use a second model, I value elegance and simplicity, not bloat
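Back-of-envelope for the VRAM point: weights-only size is just parameter count times bytes per parameter. The 18b total is the poster's figure. The bytes-per-parameter table is approximate (it ignores quantization scales), and the text encoder, VAE and activations are not counted.

```python
# Approximate storage cost per parameter; quantization scale overhead is ignored.
BYTES_PER_PARAM = {"bf16": 2.0, "fp8": 1.0, "int8": 1.0, "4bit": 0.5}

def weight_gib(params_billions, precision):
    """Weights-only memory in GiB for a model of the given size."""
    return params_billions * 1e9 * BYTES_PER_PARAM[precision] / 2**30

for precision in BYTES_PER_PARAM:
    print(f"18b @ {precision}: {weight_gib(18, precision):.1f} GiB")
```

At 8 bits that is already past a 16 GB card for weights alone, which is why the two models get swapped in and out instead of sitting in VRAM together.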
>>109160439>License is a trainwreck, you can't even legally train on anything NSFWjesus... I'm not that surprised because the guy who created it is arab (so probably muslim, PORN IS HARAM)
>>109160447obviously no one should expect a training free style transfer method to be as good as something like a style transfer adapter (but the krea fags won't release that locally, and to be fair I understand why, it's the main appeal of their Krea 2 API model, without that it's useless to go to their site)
I don't get it why people are saying Krea 2 can't do facial expressions. Every expression I've tried has worked so far. Even the ones that didn't work in Z image.
>>109160456
>I don't get it why people are saying Krea 2 can't do facial expressions.
I guess some people triggered the safety filter and got some false positive bullshit, so from time to time the model won't listen to some benign prompts, filters were a mistake...
>>109160450>the guy who created it is arab (so probably muslim, PORN IS HARAM)Time to make some sexy hijab girls.
>>109160473Okay I guess I've been using one of the bypass filters the whole time. The model seems to have very good understanding of expressions. I also tagged them in my dataset.
Klein 9B is still top dog as far as edit models go, right? I know K2 has edit capabilities but from what I've seen, it does not compare to Klein.
>>109160509>I know K2 has edit capabilitiesit doesn't, it's just a normal image model
>>109160526could've sworn someone mentioned it can do edits. oh well, guess I'll use it as my ZIT replacement
>[INFO] Model Krea2 prepared for dynamic VRAM loading. 12530MB Staged. 256 patches attached. Force pre-loaded 160 weights: 2824 KB. 0%| | 0/8 [00:00<?, ?it/s, Model Initializing ... ]
2 minutes and counting. Is this the power of dynamic VRAM?
>>109160178
I already proved it's better in this thread many times over
>>109158966
That is just me doing img2img, check the last catboxes as well. Not doing anything crazy, can you do that on ZIT with pure img2img? The skin tone and overall style would likely not carry over and just default to Z's slop, I know because I have used it for refinements before. Again, this model is way better.
>>109160187
Take a closer look at that result, kek. There is bleed everywhere. Same face pose. Same colors. "Works fine on ZIT" lol, don't forget the model lacks variety. Krea was trained from the ground up to do style references. The Z team never released their style transfer model.
>>109160573> he didnt disable dynamic vram
>>109160575>The Z team never released their style transfer model.because they never made one in the first place, are you retarded?
>>109160578anon... you won't be able to disable dynamic vram in the future, and vu will be happy
>>109160581
>Creative Image Editing: Z-Image-Edit shows a strong understanding of bilingual editing instructions, enabling imaginative and flexible image transformations.
https://github.com/Tongyi-MAI/Z-Image
Don't cope, that's not what their paper implied
>>109160588>that's not what their paper impliedindeed, that's not what their paper implied
>>109160578
It was working until it stopped working, I just loaded a new Lora. The thing is that it completes the task, it just randomly takes 5 minutes to perform an 80-second task. Then the next one initializes quickly, then it doesn't.
>>109160585
It's concerning because it won't work perfectly and the result is what I've been suffering: It becomes stuck for minutes and then resumes.
>>109160140getting shit with fp8 scaled
>>109160615obviously, fp8 isn't good, glad that int8-convrot exists now
>>109160533
No, the Krea 2 devs said they are working on an edit model that they plan on releasing, no idea of when though
>>109160623
>no idea of when though
they said "in the coming months", I really hope they'll change the vae for that one, imagine editing models with fucking qwen vae, lmao
Huh? Another txt2img only model with 0.1 nsfw and 0.1 booru knowledge, and you're all simping this hard for it? Nah, fuck this, Krea and Ideogram are the same fucking garbage, this is just rehashing 2025 all over again, WAN is way better at txt2img anyway, and Qwen Edit dropped last year, what the fuck is this shit? We're going in circles.
>>109160612>It's concerning because it won't work perfectlythey don't care, like every company in a position of monopoly, they can fuck users and get away with it
>>109160655truth nuke, wake me up when we get a local model close to gpt image 2, then we'll be talking
>>109160509Klein can't do style transfers as well, but it's a natural language edit model unlike K2 (edit model unreleased for now). Flux.3 will probably have all we need now that this level of competition has reached local.
>>109160669>Flux.3 will probably have all we needthere won't be a flux 3, at some point they won't be allowed to release models that are too powerful locally
>>109160655NOOOO!!! IT LITERALLY DOES UNDERBOOB AND UPSKIRT WITHOUT ANY FINETUNING OR LORAs!!! YOU'RE SO WRONG!!! IT'S THE BEST LOCAL MODEL SINCE CHROMA, YOU ABSOLUTE CLOWN!!!
>>109160655
>0.1 nsfw
Grim
>0.1 booru
Eh, I can live with that. Gotta learn to train Loras somehow.
>>109160655What are you on about? The rebalance node gives it NSFW knowledge. It has no booru knowledge but now there's style transfer. There's also simple img2img passes on the material you're trying to fix (the approach I'm taking for my Chroma-Krea wf, and it also works perfectly for Anima).
>>109160698>The rebalance node gives it NSFW knowledge.it makes the images more slopped and the prompt adherence takes a hit overall, you have to pay a price for bypassing the filter anon, it's not free food
>>109160698>there's style transferNTA but you call that style transfer?
>>109160698>now there's style transferit's not that good, a real style transfer method requires using an adapter, the Krea team has it but won't give it to the goycattle
>>109160707Well, what is it to you? You take a reference image that contains the style you like, and you get a very close result. Well-trained LoRAs will always be better but this is pretty good, probably on par with Dall-e 3 which some anons were complaining about just a thread ago >>109155039
>>109160698>rebalance nodes>style transferwe need to start calling out these half baked trash with another name because they literally scam newbies and people with some aesthetic vision, stop lying.
>>109160733
>probably on par with Dall-e 3
Probably with a 2024 model?
Okay...
>>109160751
>Probably with a 2024 model?
*2023
>>109160739
I won't be that harsh, having these nodes is better than having nothing at all, but yeah, we shouldn't pretend they're that good either
>>109160751
>>109160756
Which searched the internet and did style transfer, which is how all newer API models work.
>>109160720>it's not that goodDon't be silly. The results in this thread are wayyy better than anything else we had locally for this same task.
>>109160763I don't think dalle 3 searched the internet, in 2023, tool calling shit wasn't a thing at all
>>109160772>tool calling shit wasn't a thing at allClosedAI... They wouldn't tell you, there was never a proper paper for that model (aside from how it was captioned). Plus Dalle-3 had suspicious knowledge of IP. It could've been an enormous model, but it could've easily done just that.
>>109160771
>The results in this thread are wayyy better than anything else we had locally for this same task.
is this a joke? IP adapter has been a thing since 2022, even Flux 1 had an IP adapter, and a training-free method will never beat a style transfer adapter, that's delusional
>>109160789but still, let's pretend it searched the internet and let's say it gathered a super mario 64 screenshot as a style reference to make some migu on skateboard mario 64 style shit, even nowdays this can be done with our modern models, Klein is so fucking bad at style transfer and it's an edit model :(
>>109160791>is this a joke? IP adapter has been a thing since 2022That doesn't mean it was good. The model doing the transfer matters, and so does the overall coherence of the transfers. Unless you think SDXL compares to Krea somehow.
So local models are still six years behind API models, and we are celebrating another day of choking down Python slop nodes (made with Claude) like it's a real achievement?
>>109160819
>So local models are still six years behind API models
It's even worse for video models, Wan 2.2 is still the best thing we have, now compare that to seedance 2.5 lmaoo
https://www.youtube.com/watch?v=huXWUv9JXEE
>>109160809
>Unless you think SDXL compares to Krea somehow.
you are right, SDXL doesn't compare to Krea, it's superior, it knows many more styles out of the box
>>109160615>>109160187>>109160140Style transfer's only function is slop transfer, useless to a gooner, useless to a hobbyist, useless to a graphic designer. Straight to the toilet with you included.
Is it a schizo or is it a bot? The world may never know.
>>109160852who is this schizo talking to?
>quote no one>immediately get a (You)It's a sign of deep insecurity.
>>109160876>quote no oneIt's a sign of deep fear.
>>109160852
No, that's a real person with real expectations and real disappointments. The fact that you're brainwashed and can't see things as they actually are is a separate issue.
By the way, Chroma status? Personally I think v37 is better than v40. Lodestone said something cryptic on his Discord, so maybe he's cooking something good.
>>109160894based, you bodied that freak hard
It's like little chickens cackling into the void.
>>109160800
> even nowdays this can be done with our modern models
Perhaps because you have a skill issue? If you tried it on the Krea 2 side of things, remember what I said about needing images close to a supported resolution so it's coherent.
>>109160906that thing runs in two passes right? so expect x2 gen times?
>>109160906>useless to a gooner, useless to a hobbyist, useless to a graphic designer. Straight to the toilet with you included
>>109160916
>that thing runs in two passes right? so expect x2 gen times?
yes, you have to invert the image first (technically you can do an instant inversion by choosing "linear" but the quality won't be as good), but Krea is pretty fast with int8-convrot so it's not that bad
>>109160916INT8 models are very fast on my 3090, but yes.
It's really funny seeing this unhinged anger. When you have a sane mind it looks really goofy and difficult to take seriously.
>>109160906catbox workflow? that looks interesting
>>109160906What's neat is that on the preview I saw it getting closer and closer each step, maybe increasing them will increase likeness and style adherence even more.
>>109160937>schizo is talking to the void again
>>109160932>>109160934damn, i'm on ancient 20 series. maybe one day we'll get this tech to be one shot.
>>109160906Fun method, it works even for other DiT models, so it's not in any shape or form Krea exclusive.
>continues to give me (You)s out of his deep insecurity that he's been spamming this general all day long with his circular whining
>>109160955>i'm on ancient 20 series.I think it also works with the 20 series
>>109160939
https://files.catbox.moe/q4tua6.png
Same as Reddit post from >>109159509
This was the ref image
https://files.catbox.moe/kmbfdu.jpg
I'm still using Illustrious. Other models are kinda shit for 2d.
>>109160993
did you make the int8 of that qwen3 vl 4b yourself or is there a link?
>>109160993https://huggingface.co/Winnougan/Krea-2-Base-Turbo-NVFP4-FP8-INT8
>>109161000Which is the best on blackwell
SDXL vs. Anima, using the same prompt, resolution, and settings.
https://civitai.red/images/135215680
Left is SDXL, right is Anima. Yeah, Anima doesn't recognize the character, but that's the least important part.
>>109161031
>Aeterna Opus
kek that merge looks fried as fuck
>>109161041
Yeah, I'm giving Anima a pretty big handicap here.
https://civitai.red/images/135215673
Same deal: left is SDXL, right is Anima. Same prompt as the previous URL, same sampler, same resolution, and the same settings.
>>109161068>>109161068
>>109161031
Teriteri a cute!
>Yeah, Anima doesn't recognize the character
?? Both are Teriteri.