Discussion and Development of Local Image, Video, and Music ModelsPrevious: >>109138222https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUISDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineageWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, & Upscalershttps://huggingface.co/modelshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.info>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/tdrussell/diffusion-pipehttps://github.com/kohya-ss/sd-scriptshttps://github.com/kohya-ss/musubi-tuner>Krea 2https://huggingface.co/krea/Krea-2-Rawhttps://huggingface.co/krea/Krea-2-Turbo>Zhttps://huggingface.co/Tongyi-MAI/Z-Image>Animahttps://huggingface.co/circlestone-labs/Animahttps://tagexplorer.github.io/https://animadex.net>Qwenhttps://huggingface.co/collections/Qwen/qwen-image>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>LTX-2.3https://huggingface.co/collections/Lightricks/ltx-23>Wanhttps://github.com/Wan-Video/Wan2.2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkCollage: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
>>109140721>Comfy is on the collage againcan't blame OP, he's so handsome (no homo)
>kreausecase?
Blessed thread of frenship
Does sd.cpp have all the samplers / schedulers / flow scheduling / negpip / etc stuff yet?
>>109140749>localusecase?
>>109140793so I can train images of your mother
Has anyone tried boogu for its edit capabilities? I guess it's not better than Klein right?
>>109140776If something is not in sd.cpp, it's snake oil anyway
>>109140721How do you guys maintain consistent characters across images? I get the impression faceID is kind of old.
>>109140837
>>109140802Why the hostility? You should be happy, you are a proud local user after all :^)
https://huggingface.co/easygoing0114/qwen_image_clear_vaehttps://youtu.be/Iloby6ZXRjI?t=5
>>109140837>>109140842I wish ideogram hadn't that retarded prompting structure
>not posting ZIT to comparelol
>>109140837>>109140842
>not posting SDXL to comparelol
>>109140820So it doesn't? Darn. Hopefully soon then.
>>109140863Ideogram looks pretty good, but I won't use bboxes, fuck that shit, what were they thinking??
>not posting dall-e mini to comparelol
>>109140853>I wish ideogram hadn't that retarded prompting structureif it was optional it would've been good, but forcing this shit too us is just too much to ask, I want my gacha slop, having to control anything is lame
>>109140721Comfy is such a god damn CUTIE
Is using a local agent to generate rotating email scripts for trial scumming API services a local thing?
>>109140890Yeah, I would just use it as refining img2img model, the skin of ideogram is really good
z-image stills mogs krea and ideogram
>>109140274>>109140356Krea2 turbo runs on 4GB VRAM and the gen speed is the same as Anima at 30 steps. If chronically VRAMlets like Oekaki can run Krea2 on basically nothing and still get better prompt adherence and quality, it might just become the community's model.
>>109140874>not posting dall-e mini to compareAll right, you asked for it!
>>109140918masterpiece
>>109140917>it might just become the community's modelnot with that retarded VAE
>>109140926Ikr, models used to have sovlhttps://www.youtube.com/watch?v=XQr4Xklqzw8
>>109140831many ways. sometimes with reference images, sometimes with lora, sometimes the model just knows some character or person or has a specific idea of blonde azerbaijani women or w/eand yea there are also still tricks in use that are some supplementary thing like faceid or controlnets or w/e
>>109140928everybody enjoying krea 2 you gotta stop. pack up your bags this guy said it was bad. i'm deleting it off my hard drives right now to install zit for +500 social points
So his newest shitposting angle is now "VAEs are le bad"?
>>109140944>>109140955>everybody enjoying krea 2you and the voices in your head?
VAEtroons need to stop
>>1091409622 many word, learn 2 meme lil blud
>>109140967>can't read more than 2 wordsyour average Kreatard, ladies and gentlemen
kreatards on top zit faced niggas on bottom
VAEgods need to continue
https://huggingface.co/ilkerzgi/fal-Krea-2-Style-LoRAs>Here, more style loras!I don't want loras I want the style adapter :(
>>109140854>>109140863ZIT
>>109140986clothing is worse, sword is floating, me in the background preparing to rape
>>109140986Thanks for the tests anon
>>109140994now do anima
>>109140994>>109140994And Klein kek. Prompt was>a beautiful young Japanese actress. Cosplaying as 2B from Nier:Automata. Wearing her iconic black dress with feathered sleeves. She sits on a director's chair drinking from a thermos. Short black hair tied back in a tight bun. Thigh high stockings. She is Barefoot.>Prop sword leaning against the chair. Unworn white 2B wig resting nearby on a table. Her unworn black leather thigh-high boots are discarded on the floor nearby>Background a busy outdoor movie set with a green screen. Candid, natural BTS photographyReformatted for the ideogram gen obviously
>>109140863 Memes aside, Krea and Ideogram are at a level where they can compete with or be compared to Nano Banana and GPT Image 2 instead of fighting in the mud for the lowest spot with Z Image or Chroma.
>>109140999Klein is so bad at anatomy lol
>>109140986>turbopromptlet
>>109141017>Memes asideand then he proceedes to say a meme opinion lol
>>109140999flux always fucks up toes in stockings like that. worst part of the model.
>>109140999Anima (2B)
>>109140994Can you give me the proompt? I want to try seedream at least
>>109141092>>109140999
>>109141040kinda crazy what this model can do with only 2b params and a 0.6b text encoder.
Anyone with a krea2 workflow that bypasses any filters? I tried one but it failed.It's friday night and I need to goon.
>>109141113>I need to goon.https://civitai.red/models/2409949/sam-anima-realisticJust use this instead my man.
>>109141019seedream4.5>>109141025I don't know, what do you think? I believe Krea2 and Ideogram are already on the same level. It's time to raise the stakes and not be afraid of API services.
>>109141113>bypasses any filtersthe absolute state of localkeks
>>109141101> what what?
>mfw Resource news06/26/2026>Adobe to Acquire Topaz Labshttps://news.adobe.com/news/2026/06/adobe-to-acquire-topaz-labs>LiveEdit: Towards Real-Time Diffusion-Based Streaming Video Editinghttps://live-edit.github.io>PhysRAG: Enhancing Physics-Awareness in Video Generation via Retrieval-Augmented Generationhttps://github.com/sediment1024/PhysRAG>SAM2Matting: Generalized Image and Video Mattinghttps://henghuiding.com/SAM2Matting>Unison: Benchmarking Unified Multimodal Models via Synergistic Understanding and Generationhttps://github.com/FudanCVL/Unison>ComfyUI-AppleSilicon-FP8 - a compatibility layer custom node for Apple Siliconhttps://github.com/pawel-mazurkiewicz/ComfyUI-AppleSilicon-FP806/25/2026>Bernini-R — GGUF (high & low noise experts) https://huggingface.co/neuregex/Bernini-R-GGUF>Physics Question Scene Graph: Fine-grained Evaluation of Physical Plausibility in Text-to-Video Generationhttps://github.com/atinpothiraj/pqsg>VPA-Guard: Defending and Benchmarking Image-to-Video Generation Against Visual Prompt Attackshttps://huggingface.co/datasets/CSU-JPG/VVA-Bench>Minimalist Preprocessing Approach for Image Synthesis Detectionhttps://github.com/vohoaidanh/adof06/24/2026>Krea-2-Turbo Training Adapter https://huggingface.co/ostris/krea2_turbo_training_adapter>Vera: A Layered Diffusion Model for Content-Preserving Video Editinghttps://vera-layered-diffusion.github.io>Advancing WordArt-Oriented Scene Text Recognition: Datasets and Methodshttps://github.com/YesianRohn/WATER>DramaDirector: Geometry-Guided Short Drama Generationhttps://github.com/iLearn-Lab/DramaDirector>PG-MAP: Joint MAP Optimization for Inference-Time Alignment of Diffusion and Flow-Matching Modelshttps://github.com/sophialanlan/PG-MAP>Safe Few-Step Generation via Velocity Editinghttps://uzn36.github.io/VESFlow>Co-occurring associated retained concepts in Diffusion Unlearninghttps://github.com/damilab/CARE
>>109141040I have said it before and I will keep saying it: Anima needs a proper realism finetune (100k images or so, none of this 200 image lora shit) and it will be nearly on par with the big boi models, while being a fraction of the size and with full uncensored booru knowledge.
>mfw Research news06/26/2026>From Celebrities to Anyone: Characterizing AI Nudification Content, Technology, and Community Dynamics on 4chanhttps://arxiv.org/abs/2606.27234>LearniBridge: Learnable Calibration of Feature Caching for Diffusion Models Accelerationhttps://arxiv.org/abs/2606.26778>LCG: Long-Context Consistent Image Generation with Sparse Relational Attentionhttps://arxiv.org/abs/2606.26171>Disco-LoRA: Disentangled Composition of Content, Style, and Motion for Multi-concept Video Customizationhttps://arxiv.org/abs/2606.26668>ResilPhase: Plug-and-Play Phase Mapping for Diffusion Accelerationhttps://arxiv.org/abs/2606.26769>NaviCache: Test-Time Self-Calibration Caching for Video Generationhttps://arxiv.org/abs/2606.26795>DanceDuo: Bridging Human Movement and AI Choreographyhttps://arxiv.org/abs/2606.26507>PhyEditBench: A Real-World Multi-Stage Benchmark for Physics-Aware Image Editinghttps://arxiv.org/abs/2606.26551>TMP: Tree-structured Mixed-policy Pruning for Large-scale Image Generation and Editinghttps://arxiv.org/abs/2606.27089>DanceOPD: On-Policy Generative Field Distillationhttps://danceopd.github.io>Do Image Editing Models Understand Lighting?https://arxiv.org/abs/2606.26738>Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generationhttps://arxiv.org/abs/2606.26907>Adversarial Diffusion Across Modalities: A Fusion Survey of Attacks, Defenses, and Evaluation for Text, Vision, and VLMshttps://arxiv.org/abs/2606.26566>Safe Autoregressive Image Generation with Iterative Self-Improving Codebookshttps://arxiv.org/abs/2606.27147>SpatialFlow-GRPO: Where Spatial Credit Drives Image Editinghttps://arxiv.org/abs/2606.26872>Ask, Solve, Generate: Self-Evolving Unified Multimodal Understanding and Generation via Self-Consistency Rewardshttps://arxiv.org/abs/2606.27376>Scaling Multi-Reference Image Generation with Dynamic Reward Optimizationhttps://arxiv.org/abs/2606.26947
>>109141139>on par with the big boi models>with Qwen Image VAEkeek
>>109141132>>109141140Fuck off debo
>>109141144boogu is so slopped it's comical, China doesn't know how to make kino models (Z-image turbo was the exception not the rule)
>>109141139no one wants to dare to do that because of the obvious totally uncensored 3D realistic loli issue that comes with that model.
anima needs the noobai/illust treatment, thats it, is anyone training that?
>>109141156me in the background
>>109141163>is anyone training that?why would they do that when krea 2 is ripe
>>109141140>From Celebrities to Anyone: Characterizing AI Nudification Content, Technology, and Community Dynamics on 4chanhttps://arxiv.org/abs/2606.27234Any of the Top 10 active providers in the thread today?
>>109141140>>From Celebrities to Anyone: Characterizing AI Nudification Content, Technology, and Community Dynamics on 4chan>https://arxiv.org/abs/2606.27234we're being studied like lab rats
Arrested? For sharing tips??
First semi interesting paper post in literal years lmao
>>109141177lmao damn https://arxiv.org/abs/2604.12190
>>109141186Very constitutional
>>109141186what country is that? bonglang again?
>>109141194holy fuck... they know...
>no explicit mention of ldg bros....
>>109141152>It's already an issue.
>>109141237Clearly they're only talking about /lmg/. I can't imagine why they would take the other generals filled with browns and schizos seriously.
>>109141194>they went on /b/
>>109141259So, are you brown or schizo?
>>109141261
>>109141166I like the view more from here.
>>109141179>>109141194Is this why they detailed /r/ kek
>>109141266Also they say this:> Technically-sophisticated actors gravitate toward certain communities (e.g., 4chan), while lower-sophistication end-users are more active on others (e.g., Reddit).This was only published to troll Reddit, wasn't it?
>>109141194Heartbreaking finding; requests NEVER get fulfilled
>>109141288This deserves it's on separate study
the corpos don't need to do ethical training just you the citizens need to be ethical.
>>109141288>tfw actual scientists call you sophisticatedFirst general I post in that gets me this compliment.
>>109141300requesting naked ryan gosling with long foreskin dripping precum lora
>>109141331>[REQUESTER]
>>109141331Chroma might be your best choice for a base model, OP has guides for training a lora
>>109141225I am the rare GENNER-EDUCATOR-SOCIALIZER. they would have referenced me by name if they studied /g/ instead of /b/
is this ethical?https://files.catbox.moe/3iinxk.png
>>109141368>interracialnot ethical at all!
>>109141368it's ethnical
>>109141356I was just joking as a>[REQUESTER]as per >>109141225
>>109141383kek
>>109141179>we're being studied like lab ratsnice. they are also documenting open discussion of total kike death and it will be saved forever for posterity, it does get my a bit giddy
>>109141368sex with animals is not ethical, anon
>>109141288I don't know why this is surprising honestly. Practically every major AI software dev originated from this site or at least posted at some point.
>>109140999
>>109141368I don't know but she doesn't look impressed.
>>109141412She's used to BBC after all, can't blame her
>>109140943What tool do you use that lets you use a reference photo as an input?
>>109141397lmaoo
>>109141399I like to believe 4chan appeals to individuals who value open discussion of ideas above all else.
Those researchers are the ones who saw my supreme kino sovl gens and replied with "Nice."
>>109141279>In this work, we present the first large-scale measurement study of 4chan’s Adult Requests board>The Adult Requests board primarily serves as a venue for exchanging AI-generated nudification content. Manual inspection of a random sample of 200 videos confirm that over 98% are AI-generated sexual content/r/ used to be a decent board. I guess hiromoot got word of this paper and SHUT IT DOWN
>>109141464They only sampled /b/
>>109141412>>109141427Easy fixhttps://files.catbox.moe/82i41h.png
files.catbox. moe/7hcc73. jpg
>>109141485This one >>109141179 is about /r/
>>109141307>the corpos don't need to do ethical training just you the citizens need to be ethical.Instruction unclear, made 700 000 gpu hours of safety tuning
>>109141498>files.catbox. moe/7hcc73. jpgnewfag?
>>109141496if she's eating while having sex she isn't probably enjoying it much.
So every everyanon who keeps saying plebbit is better for imggen has been officially BTFOd right
>>109141542I mean, eating good stuff is a pleasure, so that always adds up
>>109141139these exist, the issue is that youre pushing the model to its limits at some point. and at 100k photographs youre starting to approach the area where it might just be easier to finetune a better arch on booru images instead, not least because its easier to get data for that
I wish the glowies and researchers actually funded 4chan servers as well, it's only fair for such an easy honeypot
>>109141194Researcher Wellbeing. Researchers involved in this studywere informed in advance that they would be exposed to sexually explicit content as part of the data collection and analysis process. To mitigate potential psychological harm, researchers have access to mental health resources throughoutthe study period. And any researcher who feels distressed atany point can seek support and request a break without anynegative consequences.
>>109141538Any upscaling or does it come like that out of the model? Sampler/Scheduler?One of the few krea2 gens I like.
>>109141225do you get a special label for disruptive shitposter
>>109141595heh, pussies
>>109141439you cannot post on reddit without a million hoops and licking jannies feet, it's not for smart people
>>109141595I REQUIRE MENTAL HEALTH RESOURCES TO INTERACT WITH
>>109141602no upscaling, realism engine v2 lora, prompt by grok>Boring casual selfie of a 19-year-old Japanese woman with a fit toned body and large breasts. Early 2000s gothic rockabilly fashion: tight black lace-trimmed tank top with a low V-neckline that accentuates her deep cleavage, small black leather jacket, delicate silver cross choker. Long straight jet-black hair, fair porcelain skin, very thin styled brows, striking dark brown eyes with subtle smoky makeup. She playfully frames her chin with her right hand, thumb and index finger forming a gentle L-shape touching her jawline, cheeky vixen smile. Soft natural window light, intimate and slightly sensual mood, cute amateurish make-up and a look that is filled with love.
>>109141518>muh misuse that will be irrelevant and impossible to prevent in the near future.God I hate them. Skynet can't come soon enough.
>>109141631Guess it's the lora, then. Thanks Anon.
>>109141595>"I saw post calling a gen "cucked" and "retarded"">"Time for a 2 days break"And AI researchers dare to say they don't have plum jobs?
>>109141194>>109141657Researcher Well-Being. This work required direct engagement with disturbing text content by researchers. The researchers ultimately decided that this exposure was worthwhile and necessary to expand our understanding of the resources employed by these illicit communities. To minimizeresearcher exposure to traumatic content, we opted to excludeimage-level analysis, instead analyzing only sample text anda boolean variable indicating whether or not an image wasincluded. Researchers also took breaks as needed throughoutthe data analysis process.
>2606.27234Oh, so that's why they killed /r/. Probably asked the higher-ups if they wanted to answer questions about their study, which made them shit their pants and burn it all down before others started dropping by to rubberneck.>>109141356I think my favorite part of the Krea 2 release so far has been the Chroma-glazer's triumphant return. I hadn't seen him post his Asian foot-fetish content is so long.
>>109141640I think it's probably better to run lora with krea2 all the time, even at 0.05 str
>>109141680shes got the perfect claw hand for that popcorn
I was promised nudity with this workflow. I have been lied to.
>>109141700i think nude works a lot better than naked
>>109141700https://civitai.red/models/2727829/krea-2-enable-nsfw-prompt-adherence-krea2-nsfw?modelVersionId=3066310
>>109141152I have loli uncensored even in ideogram...
>>109141686how you getting expressions? k2 seems to always gimme blank expressions no matter what
>>109141700sometimes it works sometimes it doesn't.
>>109141709Same thing.>>109141719404.
>>109141751>404you must be britishhttps://huggingface.co/Beinsezii/Krea-2-Turbo-Projector-Scale-LoRA-Diffusers
>>109141751you need to be logged in to see nsfw stuff on civit
>>109141733sure sweety
do you think the researchers are watching us now
>>109141742>how you getting expressions?Need to use lora, seemingly any lora
Oh wow, krea can blend styles really good.>>109141758I tried that one as well without success, strange.I'll look around on civitai a bit.
>>109141777Probably not, they're gooning on the red boards
>>109141680Those blockshit eyes. Man, I can't believe it. Krea 2 was so close to being the one model but it just falls flat as soon as the camera zooms out .Please tell me there's a solution to this.
>>109141792Nothing a little detailer pass can't fix.
promt: muhammad
>>109141806the smokey plastic looks good
>>109141700which exact model are you using?
civitai try not to be down for a day
>>109141832yeah, it has that cheap and mass produced aesthetic
>>109141838Standard krea2 turbo.
>>109141595This my disturbing data for those little goys. YOU LOST THE IRAN WAR
>>109141846give me 10 minutes
>>109141838Please stop im still at the office and I want to wait until im home to jerk off
>>109141782seems to be down atm.
>>109141768skill issue?
hey researchers, if you were not generationally incompetent you would be making bank in industry right now not working paycheck to paycheckHere is to the bottom of the class staying behind to monitor 4chan
>>109141877the pursuit of knowledge is a purpose than the pursuit of money
>>109141888higher purpose*fugg
>>109141846yea use nsfw one. I use civitai.red/models/958009?modelVersionId=3066243 btw
>>109141888most of the ai knowledge is also coming from google, facebook etc not bottom tier universities who can cope with investigating social impact of ai or whatever waste of time
>>109141898google/meta/adobe/etc spend a lot of money funding academic research. sometimes those projects do produce commercially useful outputs, but they also train up the researchers that will go on to become researchers in their corpo labs. research is a giant ecosystem
Good boy!
just made a quick trip to /b/. it's basically 100% CP at this point. Isn't this what Snake tried to warn us about?
>>109141933>Snakewho
>>109141931now do a good goy
krea can do trans porn/thread
loser nigbo getting his first good paper post since he started spamming here now he thinks hes hot shit fucking kekd
>>109141941
>>109141935well it wasn't technically Snakehttps://www.youtube.com/watch?v=C31XYgr8gp0
>>109141896It seems I had to use a massive ai prompt, but that model works much better, thanks.
>>109141966kek. touché.
>>109141967Who wrote the script for this game, wasn't Kojima. Awesome plot
>>109141993Kojima and his co-writer Tomokazu Fukushima
>>109142006wasnt the guy lost somewhere and the whole plot mystery? dont google it and ruin this story with facts
Local Diffusion?
>>109141696>>109141792That's just the epoch I was using at the moment. I've been reworking my dataset all week for the new training techniques (noise+depth) so things are softer/noisier than usual because there's no synthetic images included at the moment. I need those to really increase detail because there are no high quality images of Jenny on the internet (trust me, I have them all).I'm also still on ZIM+ZIT... Comfy looks like a train wreck right now, so I haven't updated in a while.Thanks for reading my blog!
actually meant to post this onehttps://www.youtube.com/watch?v=-gGLvg0n-uY
Computer, watermark my a r t for me.
>>109141896https://image-b2.civitai.com/file/civitai-media-cache/c5105f56-e633-4d91-9d1f-390aab414e24/original
>>109142015No idea. I am Indian I just looked it up for you.>>109142020Metal Gear Diffusion: The Phantom Model
>>109142040>civitaiwtf?
>>109142040she seems to be having good time.
Damn, a cucked model.
>>109142029hmmm, this Jenny does seem somehow more authentic to me now on closer inspection.I'd say "good work" , but I have to wonder about the mental state of a man who pursues this kind of work.
oh im laffin
I like testing models outside the expected normal subjects, but now I'm starting to regret it.
So what's Krea bad at?Anime? Because I've seen it make Final Fantasy-like pics and they look pretty good.
>>109142247it's *okay* at anime. nothing some training and loras can't fix. it's not like you don't already need a lora for everything in anima anyways
>>109142247It's not particularly bad at anything, but it's also not exceptional at anything. It's a real jack of all trades master of none type model
>>109142263soooo........ its a foundational model?
>A jack of all trades is a master of none, but oftentimes better than a master of one.
>>109142262How hard is to train loras for Krea?
>>109142247shit realism compared to ZITshit expressionscensoredslowok ip knowledge but 80% there, making the likeness that is there mostly worthless aside from memesin my experience it trains worse than ZIT too
>>109142267except that you can't train it since it's turbo distilled
>>109142247>So what's Krea bad at?realism and details (because of that retarded VAE)
>>109142278not sure I haven't tried I think the barrier of entry is a 24gb card but I could be wrong I'm on 12gb so not sure if I can train but it gens perfectly fine for me. anima community will likely stay strong due to low barrier of entry for training.
Prompt variability in Krea 2 also seems even less than with ZiT. I guess it's because it expects you to be very verbose with it. You have to specify every detail so there's a whole lot less of those happy accident moments.
I was definitely going for 'cross eyed' without knowing it myself. Thanks krea.
>>109142294I am haunted by blockshit eyes
>censoredI can gen full on porn but okay
>>109142302that's not cross-eyed, you asshole, she looking at >>109142302
>>109142247It doesn't really understand certain posing or character positioning and its really bad at making dynamic manga pages or long form text.>>109142278You need a 5090, unless you wanna train with shit settings.People *say* that 12gb is possible but I dont want my lora coming out looking like ass so just spin up the runpod.
so just to get this straight. If I want to generate a portrait image instead of a landscape image with Comfy Krea 2 I have to invert the weight and height imputs manually by disconnecting them and crossing them over? There is no simpler way to do this?
>>109142340>It doesn't really understand certain posing or character positioning and its really bad at making dynamic manga pages or long form text.since it has a cuck filter I'm always pondering whether the poor prompt understanding comes from the model or because the filter did some false positive triggering shit, that's annoying as fuck
>>109142288>you can't train it since it's turbo distilledthe base krea2 model exist doesnt it? or is even that one untrainable?
>>109142345?????just hook a resolution selector the empty latent and select the proportion / resolution you want
>>109142345what?
>>109142365they call it raw and yeah it's trainable, it's not good for much else
>>109142366>>109142368Sorry, retard moment, nvm
>>109142295>I think the barrier of entry is a 24gb cardthis is with diffusion-pipe and 1024 resolution
>>109141538>INT8 ConvRotI downloaded the model and updated comfy to 0.26 but I am not noticing any time difference, is there something I need to toggle or some kind of node I need to use?
anyone trained krea2 with 16gb vram yet?
>>109142386I mean this doesn't really explain anything. It will use more vram if available.
>>109142397just set 65% or whatever offload in ai toolkit
Is there a stabilizing node like zit has for very large resolutions for krea2? It's able to handle very high res, but distorts the bodies.
>>109142140>left aryangram>right krudea
in dire need of dedicated anal lora
>>109142412>just set 65% or whatever offload in ai toolkitand how fucking slow is this going to be?
>>109142415zit already does 2kx2k natively>>109142425i mean you do sleep at some point, 1 or 2 nights depending on the settings
>>109142247>So what's Krea bad at?at being efficient, 12b is bloat, Z-image turbo is a 6b model and gives better quality images
>>109142247>So what's Krea bad at?it's not very good at anything
Why do people enjoy ZiT when it has so little output diversity?
>this Z-image grifternobody wants to use that shite
bruh, Krea has been overtrained on memes or what? it can spit out the training data just like that
>>109142495Because you train or use a lora with it that easily adds the diversity
>>109142560But the lora training is the worst in the industry so its useless. RIP!
>bruh look at all those heckin awesome references and memes in the model haha disregard the shit output but look at all the cool characters it knows
local is still nowhere near API level
>>109142495I was expecting Krea 2 to have output diversity too but it's not the case at all
>>109142566>visual consistencyyes I love people being able to recognize it as gpt yellow tinted slop within seconds, valuable brand recognition
>>109142566>gpt image
>>109142298Use Raw. Turbo is obviously rigid like all other distilled models.
>>109142578why are you posting your ideogram chatlogs?
>>109142495>Why do people enjoy ZiT when it has so little output diversity?Prompt for the diversity you damn dirty ape
>>109142495Because using Base is too hard for them and they already have to wait awhile on their old card so asking them to wait even longer for base to finish is impossible.
>>109142578no refund
>>109142561Zit trains great
>>109142495the same reason why wai was so "popular", all it takes is like three words to get an average looking image of booba. but it'll never be more than that without loras or empty conditioning on early steps cope.
>it'll never be more than that without lorasthat's every local model
>>109142592Most people either want tranime or realism, ZIT is for realism, ZIB is for nothing. It also trains worse, and has a somewhat melted look. No point in using it.
>>109142495>ZiTDiT models are all the same, they won't give you diversity, I miss the unet era, when you go for a prompt on SDXL it'll give you 4 completly different interpretations
>>109142612>Yes my innie penis will never satisfy a woman
>>109142612except Anima
The more I use GPT Image the more I realize local is fundamentally lacking in essential technology. No omnimodel, no search tooling, no edit. API is so far ahead, I'm hyped for what China can deliver in the next Seedream model
>>109142615>It also trains worse, and has a somewhat melted lookWith suboptimal settings yes
>another "I don't know how to emulate API functionality on local therefore it does not exist" episode
>another never got a date I guess I'm a dirty homo episode
Has anon messed with the shift value for Krea?
as if dirty homos will ever admit to being dirty homos.
this is somebodys life rn
>>109142673Use caps lock, dummy
>>109142718me in the back with the fucked up hand
>>109142568krea is doing that styling with prompting + loras maybe
>>109141357problematic nigbo
>>109141150
>>109141430actually a bunch, but most popular is probably still flux klein, speedy and pretty competent.
>>109143000people forget that black forest labs actually sells klein as a SAAS product so it's not like they're just giving away some shit they don't think is good.Klein is made for editing and fast iteration.
>>109141186>SNEACIWho the fuck gets paid to invent retarded LGBTQIOABBQ+ tier bullshit jargon like this?
>>109143126Me. Clever isn't it?
>>109142566>GPT Image |X| Not local>Can't prompt or train to your heart's contentwhy is the biggest con missing from that image?
eyes looking rough
bout that time I have to nuke my ComfyUI installation and copy+paste commands from Grok to make krea 2 work. If only I was tech literate. Fucking python conflicts on this fedora
>>109143200>tech literate>has to copy paste commands from grokyou're doing a good job, little buddy
>>109141186>>109141194>>109143126
>>109143184That a LoRA?
>>109143184fixed the eyes for you
https://photos.google.com/share/AF1QipNIONmNur4qtfMg7ar2MD5z-1opZQBBzoefJfVEAKLyjwmU-wOphoVyyUuKK6gcWA?key=VC0zX0ZUd0diQUJpTWRxYThBelA5QWNQc3EzT3p3krea2 knows ball>>109143224yes, not quite satisfied yet>>109143242based
>>109143251man some of these shouldn't even be on there due to insanely inaccurate they look. Like look at lorde that looks literally nothing like her.
krea2 is too fucking good bros. :) hope to god some man of culture is cooking character loras from senran kagura, ikkitosen, rayman origins, isekai Meikyuu de Harem wo, vermeil, freezing, final fantasy, queen's blade and nekopara.
prompt: 4chan chud
>>109143280wow, i'm just like a 4chan chud; i'm bricked up
>>109143266nobody is touching this model, it's dead-on-arrival just like ideogram. stopgap filler without edit capabilities
>>109143293nobody is touching you and I'm not on here crying about it
>>109143256slop
>>109143302make a better space gen (you won't)
Krea 2 is a model that's the exact same number of parameters as the original Flux.1 Krea, but with an objectively worse VAE and largely worse out of the box photographic realism. I really don't get the hype at all. Sure the Turbo version is faster than OG Krea which was only guidance-distilled as opposed to step-distilled, but like a lot of models have Turbos now so that's not exactly a selling point IMO.
>>109143200Are you sure your 'venv' is correct, I mean you need to use a venv. Your distro name or version doesn't matter
>>109143200This isn't a problem on Nixos
>>109143392localkeks are desperate for scraps after months of zero progress. they will gladly gobble up whatever slop is put before them. krea 2 offers nothing new yet people are so starved for content they delude themselves into thinking this is somehow a revolutionary new model when it's literally a flux.1 reskin but worse.
>>109143435pretty sure it's at most a handful of the same shills
After a system update on arch, comfy started hanging so I finally git pulled. For some reasons it started asking for nvidia drivers despite me using an amd card with rocm
>>109143392what about generation time? if Krea 1 is slow, then Krea 2 wins
>>109143392Idk anon, maybe try prompting original Krea for anime, or anything challenging? Krea 1 had very limited understanding of anatomy because it was significantly more censored, plus it also knew far less styles out of the box.
>>109143251posted this link on shitcord and it got removed because redditor thought it was a virus
>>109143435nah. krea2 is a serious game changer for local open source image gen. it's nanobana pro levels of visual fidelity and text generation.
>>109143200why are you on /g/
>>109143452Perhaps read the installation instructions on their github page.
>>109143502It's worse at text than Ideogram. It has worse overall prompt adherence than nearly all other recent models. It's also rather poor at realism out of the box.
something went wrong but at the same time i don’t hate how it turned out
>>109143502What do you use to uncensor?
>>109143461how is Krea 2 is so "uncensored" exactly compared to anything else recent in a way that matters?. If you mean it can do booba and nips out of the box literally no one cares, so can fucking Klein 4B Distilled even:https://files.catbox.moe/voubcn.png
>>109143251a lot of these are awful lmao, his head is like uniformly bigger on the left side for some reason
>>109143502Is that krea? Can I get a catbox/workflow?
holy fuckin KEK
>>109143251for real not going to lie not clicking on that link bro
>>109143565it's a gallery of celeb headshot attempts by Krea 2
>>109143502>>109143502If some anon with proxy access to a corpo GPU asked you where he should train his illustration lora, would you tell him Anima or Krea2?
>>109143565retard
>>109143512another one from the same batch
>>109143568Yeah joking aside, that's a great resource for their lawyers. Expect something to happen in few weeks.
>>109143538It knows anime, manga, realism coherently and way more converged because the base model can do vaginas after some re-alignment hacks. That aside, if you prompt Klein for something more complex likehttps://files.catbox.moe/z6axb5.pngorhttps://files.catbox.moe/9rqlck.pngor anything involving any kind of complex smut in any modality E.G.https://files.catbox.moe/6ew6bm.pngIt probably won't succeed. Check Klein's understanding of anatomy. Check every new model's understanding across all domains, than you will see why Krea 2 simply is superior.
god i cant wait for my 64gb of ram to come in so i will have a total of 80gb. i can offload all the experts on qwen3.6 35b a3b and use comfy ui on my 4070 with qwen and not rely on the slow ass generation of my 1060 anymore.
>>109143599That's a great feeling isn't it
>>109143597Is there room for improvement? Yes, but this is good enough, also in this model, because styles by default vary so widely, the amounts of slop also vary a bit. It will all be fixed with a finetune.
>>109143598I want to climb those legs
>>109143538Klein generates body horror for anything that isn't a portrait or standing position.
>>109143625Same prompt. Even with this realism LoRA, anime and all the modalities it knows are still strong. The model has been trained so well it knows how to blend all the three things together when asked.
>>109143609it will be. monday is the day. kinda wish i went the whole 128gb route but paying $300 for 2 sticks of 32gb ddr4 already felt like ass rape. idk if i want to get a v100 32gb card next for $700 or just take the ass rape again for the other 2 sticks of 32gb ddr4 ram.
>>109143635She won't allow it
Oh no, he is still in hospital..!
>>109143683I'VE JUST PURCHASED THE REQUIRED $40K WORTH OF GOOGLE PLAY CARDS TELL ME WHERE TO SEND THEM BRAD
>>109142436Ostris has a lot of bugs that the author refused to fixed even when users made solutions for them. It shouldn't be this slow.
>>109143599>>109143652Did the blog factory explode
>>109143518i use that 160 byte lora that was shared around here but it fucks with the photorealism skin texture and prompt adherence . pic related wasn't gen with it.>>109143569i would beg him to use krea2. anima is a dead-end model for vramlets. >>109143562it's krea2 and using wan2gp. there is no workflow. use the uncensored 160byte lora plus my lunafreya lora
>>109143597Klein recreated the Yoga ladies easily from a Gemini recaption of the original pic lol
>>109143636Only giga ESLs who don't know how to write prompts in non-broken English (or at least how to ask LLMs to do it for them) think this
>>109143714yeeethanks for reading mine
>>109143726Lol
>>109143562here is the lora. https://gofile.io/d/CRsvcthere is the catbox but its made with wan2gp: https://files.catbox.moe/ng6am2.png
>>109143597Btw, Klein is absolutely abysmal for styles. It was a joke compared to Flux.2 (32B) for painting/anime styles. I don't just mean bad, but I mean Flux.1 bad at so many of them that the model was DOA for anything not photorealism.>>109143719Interesting result, Klein usually gave me body horror for multiple women facing the camera, missing fingers etc... still not as good on average as Krea though, too many melty looking toes, though it's a touch more realistic because its VAE just happens to be superior.
And here's the yoga-watching girl one on Klein (or at least I assume the original pic actually intended for her to be watching the laptop screen, which makes way more sense)
>>109143792that's not true at all lmao, Klein is quite good at styles, it's nothing like Flux.1 at all. (Assuming we're not talking about muh named artists which NO model knows "well" frankly unless it's a finetune like Anima or whatever.)Also last recreation here, the asian clone girls
>>109143837>>109143837>>109143837>>109143837
>>109143841What a lousy bake. You truly are blind and stupid.
>>109143106I like this Miku
>>109141439>I like to believe 4chan appeals to individuals who value open discussion of ideas above all else.Not really. I have a hard time getting people to actually post sources for their claims. I've been on 4chan for years and getting people to act intellectually honest is nearly impossible in some discussions. might be a /pol/ thing to some extent but i see it on other boards too.>>109141614I'd say its just the barrier to entry on reddit is lower. most normies dont like being insulted over trivial things or some of the more negative aspects of 4chan.people exaggerate how bad the janitors are on reddit. its probably better for discussion in a lot of areas than here.
>>109143926>I have a hard time getting people to actually post sources for their claims.This problem is worse on reddit. Posting sources (even legit sources) on reddit will get you banned.>most normies dont like being insulted over trivial things This problem is an order of magnitude worse on reddit.