Discussion and Development of Local Image, Video, and Music ModelsPrevious: >>109138222https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUISDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineageWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, & Upscalershttps://huggingface.co/modelshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.info>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/tdrussell/diffusion-pipehttps://github.com/kohya-ss/sd-scriptshttps://github.com/kohya-ss/musubi-tuner>Krea 2https://huggingface.co/krea/Krea-2-Rawhttps://huggingface.co/krea/Krea-2-Turbo>Zhttps://huggingface.co/Tongyi-MAI/Z-Image>Animahttps://huggingface.co/circlestone-labs/Animahttps://tagexplorer.github.io/https://animadex.net>Qwenhttps://huggingface.co/collections/Qwen/qwen-image>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>LTX-2.3https://huggingface.co/collections/Lightricks/ltx-23>Wanhttps://github.com/Wan-Video/Wan2.2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkCollage: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
>>109140721>Comfy is on the collage againcan't blame OP, he's so handsome (no homo)
>kreausecase?
Blessed thread of frenship
Does sd.cpp have all the samplers / schedulers / flow scheduling / negpip / etc stuff yet?
>>109140749>localusecase?
>>109140793so I can train images of your mother
Has anyone tried boogu for its edit capabilities? I guess it's not better than Klein right?
>>109140776If something is not in sd.cpp, it's snake oil anyway
>>109140721How do you guys maintain consistent characters across images? I get the impression faceID is kind of old.
>>109140837
>>109140802Why the hostility? You should be happy, you are a proud local user after all :^)
https://huggingface.co/easygoing0114/qwen_image_clear_vaehttps://youtu.be/Iloby6ZXRjI?t=5
>>109140837>>109140842I wish ideogram hadn't that retarded prompting structure
>not posting ZIT to comparelol
>>109140837>>109140842
>not posting SDXL to comparelol
>>109140820So it doesn't? Darn. Hopefully soon then.
>>109140863Ideogram looks pretty good, but I won't use bboxes, fuck that shit, what were they thinking??
>not posting dall-e mini to comparelol
>>109140853>I wish ideogram hadn't that retarded prompting structureif it was optional it would've been good, but forcing this shit too us is just too much to ask, I want my gacha slop, having to control anything is lame
>>109140721Comfy is such a god damn CUTIE
Is using a local agent to generate rotating email scripts for trial scumming API services a local thing?
>>109140890Yeah, I would just use it as refining img2img model, the skin of ideogram is really good
z-image stills mogs krea and ideogram
>>109140274>>109140356Krea2 turbo runs on 4GB VRAM and the gen speed is the same as Anima at 30 steps. If chronically VRAMlets like Oekaki can run Krea2 on basically nothing and still get better prompt adherence and quality, it might just become the community's model.
>>109140874>not posting dall-e mini to compareAll right, you asked for it!
>>109140918masterpiece
>>109140917>it might just become the community's modelnot with that retarded VAE
>>109140926Ikr, models used to have sovlhttps://www.youtube.com/watch?v=XQr4Xklqzw8
>>109140831many ways. sometimes with reference images, sometimes with lora, sometimes the model just knows some character or person or has a specific idea of blonde azerbaijani women or w/eand yea there are also still tricks in use that are some supplementary thing like faceid or controlnets or w/e
>>109140928everybody enjoying krea 2 you gotta stop. pack up your bags this guy said it was bad. i'm deleting it off my hard drives right now to install zit for +500 social points
So his newest shitposting angle is now "VAEs are le bad"?
>>109140944>>109140955>everybody enjoying krea 2you and the voices in your head?
VAEtroons need to stop
>>1091409622 many word, learn 2 meme lil blud
>>109140967>can't read more than 2 wordsyour average Kreatard, ladies and gentlemen
kreatards on top zit faced niggas on bottom
VAEgods need to continue
https://huggingface.co/ilkerzgi/fal-Krea-2-Style-LoRAs>Here, more style loras!I don't want loras I want the style adapter :(
>>109140854>>109140863ZIT
>>109140986clothing is worse, sword is floating, me in the background preparing to rape
>>109140986Thanks for the tests anon
>>109140994now do anima
>>109140994>>109140994And Klein kek. Prompt was>a beautiful young Japanese actress. Cosplaying as 2B from Nier:Automata. Wearing her iconic black dress with feathered sleeves. She sits on a director's chair drinking from a thermos. Short black hair tied back in a tight bun. Thigh high stockings. She is Barefoot.>Prop sword leaning against the chair. Unworn white 2B wig resting nearby on a table. Her unworn black leather thigh-high boots are discarded on the floor nearby>Background a busy outdoor movie set with a green screen. Candid, natural BTS photographyReformatted for the ideogram gen obviously
>>109140863 Memes aside, Krea and Ideogram are at a level where they can compete with or be compared to Nano Banana and GPT Image 2 instead of fighting in the mud for the lowest spot with Z Image or Chroma.
>>109140999Klein is so bad at anatomy lol
>>109140986>turbopromptlet
>>109141017>Memes asideand then he proceedes to say a meme opinion lol
>>109140999flux always fucks up toes in stockings like that. worst part of the model.
>>109140999Anima (2B)
>>109140994Can you give me the proompt? I want to try seedream at least
>>109141092>>109140999
>>109141040kinda crazy what this model can do with only 2b params and a 0.6b text encoder.
Anyone with a krea2 workflow that bypasses any filters? I tried one but it failed.It's friday night and I need to goon.
>>109141113>I need to goon.https://civitai.red/models/2409949/sam-anima-realisticJust use this instead my man.
>>109141019seedream4.5>>109141025I don't know, what do you think? I believe Krea2 and Ideogram are already on the same level. It's time to raise the stakes and not be afraid of API services.
>>109141113>bypasses any filtersthe absolute state of localkeks
>>109141101> what what?
>mfw Resource news06/26/2026>Adobe to Acquire Topaz Labshttps://news.adobe.com/news/2026/06/adobe-to-acquire-topaz-labs>LiveEdit: Towards Real-Time Diffusion-Based Streaming Video Editinghttps://live-edit.github.io>PhysRAG: Enhancing Physics-Awareness in Video Generation via Retrieval-Augmented Generationhttps://github.com/sediment1024/PhysRAG>SAM2Matting: Generalized Image and Video Mattinghttps://henghuiding.com/SAM2Matting>Unison: Benchmarking Unified Multimodal Models via Synergistic Understanding and Generationhttps://github.com/FudanCVL/Unison>ComfyUI-AppleSilicon-FP8 - a compatibility layer custom node for Apple Siliconhttps://github.com/pawel-mazurkiewicz/ComfyUI-AppleSilicon-FP806/25/2026>Bernini-R — GGUF (high & low noise experts) https://huggingface.co/neuregex/Bernini-R-GGUF>Physics Question Scene Graph: Fine-grained Evaluation of Physical Plausibility in Text-to-Video Generationhttps://github.com/atinpothiraj/pqsg>VPA-Guard: Defending and Benchmarking Image-to-Video Generation Against Visual Prompt Attackshttps://huggingface.co/datasets/CSU-JPG/VVA-Bench>Minimalist Preprocessing Approach for Image Synthesis Detectionhttps://github.com/vohoaidanh/adof06/24/2026>Krea-2-Turbo Training Adapter https://huggingface.co/ostris/krea2_turbo_training_adapter>Vera: A Layered Diffusion Model for Content-Preserving Video Editinghttps://vera-layered-diffusion.github.io>Advancing WordArt-Oriented Scene Text Recognition: Datasets and Methodshttps://github.com/YesianRohn/WATER>DramaDirector: Geometry-Guided Short Drama Generationhttps://github.com/iLearn-Lab/DramaDirector>PG-MAP: Joint MAP Optimization for Inference-Time Alignment of Diffusion and Flow-Matching Modelshttps://github.com/sophialanlan/PG-MAP>Safe Few-Step Generation via Velocity Editinghttps://uzn36.github.io/VESFlow>Co-occurring associated retained concepts in Diffusion Unlearninghttps://github.com/damilab/CARE
>>109141040I have said it before and I will keep saying it: Anima needs a proper realism finetune (100k images or so, none of this 200 image lora shit) and it will be nearly on par with the big boi models, while being a fraction of the size and with full uncensored booru knowledge.
>mfw Research news06/26/2026>From Celebrities to Anyone: Characterizing AI Nudification Content, Technology, and Community Dynamics on 4chanhttps://arxiv.org/abs/2606.27234>LearniBridge: Learnable Calibration of Feature Caching for Diffusion Models Accelerationhttps://arxiv.org/abs/2606.26778>LCG: Long-Context Consistent Image Generation with Sparse Relational Attentionhttps://arxiv.org/abs/2606.26171>Disco-LoRA: Disentangled Composition of Content, Style, and Motion for Multi-concept Video Customizationhttps://arxiv.org/abs/2606.26668>ResilPhase: Plug-and-Play Phase Mapping for Diffusion Accelerationhttps://arxiv.org/abs/2606.26769>NaviCache: Test-Time Self-Calibration Caching for Video Generationhttps://arxiv.org/abs/2606.26795>DanceDuo: Bridging Human Movement and AI Choreographyhttps://arxiv.org/abs/2606.26507>PhyEditBench: A Real-World Multi-Stage Benchmark for Physics-Aware Image Editinghttps://arxiv.org/abs/2606.26551>TMP: Tree-structured Mixed-policy Pruning for Large-scale Image Generation and Editinghttps://arxiv.org/abs/2606.27089>DanceOPD: On-Policy Generative Field Distillationhttps://danceopd.github.io>Do Image Editing Models Understand Lighting?https://arxiv.org/abs/2606.26738>Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generationhttps://arxiv.org/abs/2606.26907>Adversarial Diffusion Across Modalities: A Fusion Survey of Attacks, Defenses, and Evaluation for Text, Vision, and VLMshttps://arxiv.org/abs/2606.26566>Safe Autoregressive Image Generation with Iterative Self-Improving Codebookshttps://arxiv.org/abs/2606.27147>SpatialFlow-GRPO: Where Spatial Credit Drives Image Editinghttps://arxiv.org/abs/2606.26872>Ask, Solve, Generate: Self-Evolving Unified Multimodal Understanding and Generation via Self-Consistency Rewardshttps://arxiv.org/abs/2606.27376>Scaling Multi-Reference Image Generation with Dynamic Reward Optimizationhttps://arxiv.org/abs/2606.26947
>>109141139>on par with the big boi models>with Qwen Image VAEkeek
>>109141132>>109141140Fuck off debo
>>109141144boogu is so slopped it's comical, China doesn't know how to make kino models (Z-image turbo was the exception not the rule)
>>109141139no one wants to dare to do that because of the obvious totally uncensored 3D realistic loli issue that comes with that model.
anima needs the noobai/illust treatment, thats it, is anyone training that?
>>109141156me in the background
>>109141163>is anyone training that?why would they do that when krea 2 is ripe
>>109141140>From Celebrities to Anyone: Characterizing AI Nudification Content, Technology, and Community Dynamics on 4chanhttps://arxiv.org/abs/2606.27234Any of the Top 10 active providers in the thread today?
>>109141140>>From Celebrities to Anyone: Characterizing AI Nudification Content, Technology, and Community Dynamics on 4chan>https://arxiv.org/abs/2606.27234we're being studied like lab rats
Arrested? For sharing tips??
First semi interesting paper post in literal years lmao
>>109141177lmao damn https://arxiv.org/abs/2604.12190
>>109141186Very constitutional
>>109141186what country is that? bonglang again?
>>109141194holy fuck... they know...
>no explicit mention of ldg bros....
>>109141152>It's already an issue.
>>109141237Clearly they're only talking about /lmg/. I can't imagine why they would take the other generals filled with browns and schizos seriously.
>>109141194>they went on /b/
>>109141259So, are you brown or schizo?
>>109141261
>>109141166I like the view more from here.
>>109141179>>109141194Is this why they detailed /r/ kek
>>109141266Also they say this:> Technically-sophisticated actors gravitate toward certain communities (e.g., 4chan), while lower-sophistication end-users are more active on others (e.g., Reddit).This was only published to troll Reddit, wasn't it?
>>109141194Heartbreaking finding; requests NEVER get fulfilled
>>109141288This deserves it's on separate study
the corpos don't need to do ethical training just you the citizens need to be ethical.
>>109141288>tfw actual scientists call you sophisticatedFirst general I post in that gets me this compliment.
>>109141300requesting naked ryan gosling with long foreskin dripping precum lora
>>109141331>[REQUESTER]
>>109141331Chroma might be your best choice for a base model, OP has guides for training a lora
>>109141225I am the rare GENNER-EDUCATOR-SOCIALIZER. they would have referenced me by name if they studied /g/ instead of /b/
is this ethical?https://files.catbox.moe/3iinxk.png
>>109141368>interracialnot ethical at all!
>>109141368it's ethnical
>>109141356I was just joking as a>[REQUESTER]as per >>109141225
>>109141383kek
>>109141179>we're being studied like lab ratsnice. they are also documenting open discussion of total kike death and it will be saved forever for posterity, it does get my a bit giddy
>>109141368sex with animals is not ethical, anon
>>109141288I don't know why this is surprising honestly. Practically every major AI software dev originated from this site or at least posted at some point.
>>109141394>>109141357pedo KIKE
>>109140999
>>109141368I don't know but she doesn't look impressed.
>>109141412She's used to BBC after all, can't blame her
>>109140943What tool do you use that lets you use a reference photo as an input?
>>109141397lmaoo
>>109141399I like to believe 4chan appeals to individuals who value open discussion of ideas above all else.
Those researchers are the ones who saw my supreme kino sovl gens and replied with "Nice."
>>109141279>In this work, we present the first large-scale measurement study of 4chan’s Adult Requests board>The Adult Requests board primarily serves as a venue for exchanging AI-generated nudification content. Manual inspection of a random sample of 200 videos confirm that over 98% are AI-generated sexual content/r/ used to be a decent board. I guess hiromoot got word of this paper and SHUT IT DOWN
>>109141464They only sampled /b/
>>109141412>>109141427Easy fixhttps://files.catbox.moe/82i41h.png
files.catbox. moe/7hcc73. jpg
>>109141485This one >>109141179 is about /r/
>>109141307>the corpos don't need to do ethical training just you the citizens need to be ethical.Instruction unclear, made 700 000 gpu hours of safety tuning
>>109141498>files.catbox. moe/7hcc73. jpgnewfag?
>>109141496if she's eating while having sex she isn't probably enjoying it much.
So every everyanon who keeps saying plebbit is better for imggen has been officially BTFOd right
>>109141542I mean, eating good stuff is a pleasure, so that always adds up
>>109141139these exist, the issue is that youre pushing the model to its limits at some point. and at 100k photographs youre starting to approach the area where it might just be easier to finetune a better arch on booru images instead, not least because its easier to get data for that
I wish the glowies and researchers actually funded 4chan servers as well, it's only fair for such an easy honeypot
>>109141194Researcher Wellbeing. Researchers involved in this studywere informed in advance that they would be exposed to sexually explicit content as part of the data collection and analysis process. To mitigate potential psychological harm, researchers have access to mental health resources throughoutthe study period. And any researcher who feels distressed atany point can seek support and request a break without anynegative consequences.
>>109141538Any upscaling or does it come like that out of the model? Sampler/Scheduler?One of the few krea2 gens I like.
>>109141225do you get a special label for disruptive shitposter
>>109141595heh, pussies
>>109141439you cannot post on reddit without a million hoops and licking jannies feet, it's not for smart people
>>109141595I REQUIRE MENTAL HEALTH RESOURCES TO INTERACT WITH
>>109141602no upscaling, realism engine v2 lora, prompt by grok>Boring casual selfie of a 19-year-old Japanese woman with a fit toned body and large breasts. Early 2000s gothic rockabilly fashion: tight black lace-trimmed tank top with a low V-neckline that accentuates her deep cleavage, small black leather jacket, delicate silver cross choker. Long straight jet-black hair, fair porcelain skin, very thin styled brows, striking dark brown eyes with subtle smoky makeup. She playfully frames her chin with her right hand, thumb and index finger forming a gentle L-shape touching her jawline, cheeky vixen smile. Soft natural window light, intimate and slightly sensual mood, cute amateurish make-up and a look that is filled with love.
>>109141518>muh misuse that will be irrelevant and impossible to prevent in the near future.God I hate them. Skynet can't come soon enough.
>>109141631Guess it's the lora, then. Thanks Anon.
>>109141595>"I saw post calling a gen "cucked" and "retarded"">"Time for a 2 days break"And AI researchers dare to say they don't have plum jobs?
>>109141194>>109141657Researcher Well-Being. This work required direct engagement with disturbing text content by researchers. The researchers ultimately decided that this exposure was worthwhile and necessary to expand our understanding of the resources employed by these illicit communities. To minimizeresearcher exposure to traumatic content, we opted to excludeimage-level analysis, instead analyzing only sample text anda boolean variable indicating whether or not an image wasincluded. Researchers also took breaks as needed throughoutthe data analysis process.
>2606.27234Oh, so that's why they killed /r/. Probably asked the higher-ups if they wanted to answer questions about their study, which made them shit their pants and burn it all down before others started dropping by to rubberneck.>>109141356I think my favorite part of the Krea 2 release so far has been the Chroma-glazer's triumphant return. I hadn't seen him post his Asian foot-fetish content is so long.
>>109141640I think it's probably better to run lora with krea2 all the time, even at 0.05 str
>>109141680shes got the perfect claw hand for that popcorn
I was promised nudity with this workflow. I have been lied to.
>>109141700i think nude works a lot better than naked
>>109141700https://civitai.red/models/2727829/krea-2-enable-nsfw-prompt-adherence-krea2-nsfw?modelVersionId=3066310
>>109141152I have loli uncensored even in ideogram...
>>109141686how you getting expressions? k2 seems to always gimme blank expressions no matter what
>>109141700sometimes it works sometimes it doesn't.
>>109141709Same thing.>>109141719404.
>>109141751>404you must be britishhttps://huggingface.co/Beinsezii/Krea-2-Turbo-Projector-Scale-LoRA-Diffusers
>>109141751you need to be logged in to see nsfw stuff on civit
>>109141733sure sweety
do you think the researchers are watching us now
>>109141742>how you getting expressions?Need to use lora, seemingly any lora
Oh wow, krea can blend styles really good.>>109141758I tried that one as well without success, strange.I'll look around on civitai a bit.
>>109141777Probably not, they're gooning on the red boards
>>109141680Those blockshit eyes. Man, I can't believe it. Krea 2 was so close to being the one model but it just falls flat as soon as the camera zooms out .Please tell me there's a solution to this.
>>109141792Nothing a little detailer pass can't fix.
promt: muhammad
>>109141806the smokey plastic looks good
>>109141700which exact model are you using?
civitai try not to be down for a day
>>109141832yeah, it has that cheap and mass produced aesthetic
>>109141838Standard krea2 turbo.
>>109141595This my disturbing data for those little goys. YOU LOST THE IRAN WAR
>>109141846give me 10 minutes
>>109141838Please stop im still at the office and I want to wait until im home to jerk off
>>109141782seems to be down atm.
>>109141768skill issue?
hey researchers, if you were not generationally incompetent you would be making bank in industry right now not working paycheck to paycheckHere is to the bottom of the class staying behind to monitor 4chan
>>109141877the pursuit of knowledge is a purpose than the pursuit of money
>>109141888higher purpose*fugg
>>109141846yea use nsfw one. I use civitai.red/models/958009?modelVersionId=3066243 btw
>>109141888most of the ai knowledge is also coming from google, facebook etc not bottom tier universities who can cope with investigating social impact of ai or whatever waste of time
>>109141898google/meta/adobe/etc spend a lot of money funding academic research. sometimes those projects do produce commercially useful outputs, but they also train up the researchers that will go on to become researchers in their corpo labs. research is a giant ecosystem
Good boy!
just made a quick trip to /b/. it's basically 100% CP at this point. Isn't this what Snake tried to warn us about?
>>109141933>Snakewho
>>109141931now do a good goy
krea can do trans porn/thread
loser nigbo getting his first good paper post since he started spamming here now he thinks hes hot shit fucking kekd
>>109141941
>>109141935well it wasn't technically Snakehttps://www.youtube.com/watch?v=C31XYgr8gp0
>>109141896It seems I had to use a massive ai prompt, but that model works much better, thanks.
>>109141966kek. touché.
>>109141967Who wrote the script for this game, wasn't Kojima. Awesome plot
>>109141993Kojima and his co-writer Tomokazu Fukushima
>>109142006wasnt the guy lost somewhere and the whole plot mystery? dont google it and ruin this story with facts
Local Diffusion?
>>109141696>>109141792That's just the epoch I was using at the moment. I've been reworking my dataset all week for the new training techniques (noise+depth) so things are softer/noisier than usual because there's no synthetic images included at the moment. I need those to really increase detail because there are no high quality images of Jenny on the internet (trust me, I have them all).I'm also still on ZIM+ZIT... Comfy looks like a train wreck right now, so I haven't updated in a while.Thanks for reading my blog!
actually meant to post this onehttps://www.youtube.com/watch?v=-gGLvg0n-uY
Computer, watermark my a r t for me.
>>109141896https://image-b2.civitai.com/file/civitai-media-cache/c5105f56-e633-4d91-9d1f-390aab414e24/original
>>109142015No idea. I am Indian I just looked it up for you.>>109142020Metal Gear Diffusion: The Phantom Model
>>109142040>civitaiwtf?
>>109142040she seems to be having good time.
Damn, a cucked model.
>>109142029hmmm, this Jenny does seem somehow more authentic to me now on closer inspection.I'd say "good work" , but I have to wonder about the mental state of a man who pursues this kind of work.
oh im laffin
I like testing models outside the expected normal subjects, but now I'm starting to regret it.
So what's Krea bad at?Anime? Because I've seen it make Final Fantasy-like pics and they look pretty good.
>>109142247it's *okay* at anime. nothing some training and loras can't fix. it's not like you don't already need a lora for everything in anima anyways
>>109142247It's not particularly bad at anything, but it's also not exceptional at anything. It's a real jack of all trades master of none type model
>>109142263soooo........ its a foundational model?
>A jack of all trades is a master of none, but oftentimes better than a master of one.
>>109142262How hard is to train loras for Krea?
>>109142247shit realism compared to ZITshit expressionscensoredslowok ip knowledge but 80% there, making the likeness that is there mostly worthless aside from memesin my experience it trains worse than ZIT too
>>109142267except that you can't train it since it's turbo distilled
>>109142247>So what's Krea bad at?realism and details (because of that retarded VAE)
>>109142278not sure I haven't tried I think the barrier of entry is a 24gb card but I could be wrong I'm on 12gb so not sure if I can train but it gens perfectly fine for me. anima community will likely stay strong due to low barrier of entry for training.
Prompt variability in Krea 2 also seems even less than with ZiT. I guess it's because it expects you to be very verbose with it. You have to specify every detail so there's a whole lot less of those happy accident moments.
I was definitely going for 'cross eyed' without knowing it myself. Thanks krea.
>>109142294I am haunted by blockshit eyes
>censoredI can gen full on porn but okay
>>109142302that's not cross-eyed, you asshole, she looking at >>109142302
>>109142247It doesn't really understand certain posing or character positioning and its really bad at making dynamic manga pages or long form text.>>109142278You need a 5090, unless you wanna train with shit settings.People *say* that 12gb is possible but I dont want my lora coming out looking like ass so just spin up the runpod.
so just to get this straight. If I want to generate a portrait image instead of a landscape image with Comfy Krea 2 I have to invert the weight and height imputs manually by disconnecting them and crossing them over? There is no simpler way to do this?
>>109142340>It doesn't really understand certain posing or character positioning and its really bad at making dynamic manga pages or long form text.since it has a cuck filter I'm always pondering whether the poor prompt understanding comes from the model or because the filter did some false positive triggering shit, that's annoying as fuck
>>109142288>you can't train it since it's turbo distilledthe base krea2 model exist doesnt it? or is even that one untrainable?
>>109142345?????just hook a resolution selector the empty latent and select the proportion / resolution you want
>>109142345what?
>>109142365they call it raw and yeah it's trainable, it's not good for much else
>>109142366>>109142368Sorry, retard moment, nvm
>>109142295>I think the barrier of entry is a 24gb cardthis is with diffusion-pipe and 1024 resolution
>>109141538>INT8 ConvRotI downloaded the model and updated comfy to 0.26 but I am not noticing any time difference, is there something I need to toggle or some kind of node I need to use?
anyone trained krea2 with 16gb vram yet?
>>109142386I mean this doesn't really explain anything. It will use more vram if available.
>>109142397just set 65% or whatever offload in ai toolkit
Is there a stabilizing node like zit has for very large resolutions for krea2? It's able to handle very high res, but distorts the bodies.
>>109142140>left aryangram>right krudea
in dire need of dedicated anal lora
>>109142412>just set 65% or whatever offload in ai toolkitand how fucking slow is this going to be?
>>109142415zit already does 2kx2k natively>>109142425i mean you do sleep at some point, 1 or 2 nights depending on the settings
>>109142247>So what's Krea bad at?at being efficient, 12b is bloat, Z-image turbo is a 6b model and gives better quality images
>>109142247>So what's Krea bad at?it's not very good at anything
Why do people enjoy ZiT when it has so little output diversity?
>this Z-image grifternobody wants to use that shite
bruh, Krea has been overtrained on memes or what? it can spit out the training data just like that
>>109142495Because you train or use a lora with it that easily adds the diversity
>>109142560But the lora training is the worst in the industry so its useless. RIP!
>bruh look at all those heckin awesome references and memes in the model haha disregard the shit output but look at all the cool characters it knows
local is still nowhere near API level
>>109142495I was expecting Krea 2 to have output diversity too but it's not the case at all
>>109142566>visual consistencyyes I love people being able to recognize it as gpt yellow tinted slop within seconds, valuable brand recognition
>>109142566>gpt image
>>109142298Use Raw. Turbo is obviously rigid like all other distilled models.
>>109142578why are you posting your ideogram chatlogs?
>>109142495>Why do people enjoy ZiT when it has so little output diversity?Prompt for the diversity you damn dirty ape
>>109142495Because using Base is too hard for them and they already have to wait awhile on their old card so asking them to wait even longer for base to finish is impossible.
>>109142578no refund
>>109142561Zit trains great
>>109142495the same reason why wai was so "popular", all it takes is like three words to get an average looking image of booba. but it'll never be more than that without loras or empty conditioning on early steps cope.
>it'll never be more than that without lorasthat's every local model
>>109142592Most people either want tranime or realism, ZIT is for realism, ZIB is for nothing. It also trains worse, and has a somewhat melted look. No point in using it.
>>109142495>ZiTDiT models are all the same, they won't give you diversity, I miss the unet era, when you go for a prompt on SDXL it'll give you 4 completly different interpretations
>>109142612>Yes my innie penis will never satisfy a woman
>>109142612except Anima
The more I use GPT Image the more I realize local is fundamentally lacking in essential technology. No omnimodel, no search tooling, no edit. API is so far ahead, I'm hyped for what China can deliver in the next Seedream model
>>109142615>It also trains worse, and has a somewhat melted lookWith suboptimal settings yes
>another "I don't know how to emulate API functionality on local therefore it does not exist" episode
>another never got a date I guess I'm a dirty homo episode
Has anon messed with the shift value for Krea?