Pear-Shaped EditionDiscussion and Development of Local Image and Video ModelsPrevious: >>108668921https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/tdrussell/diffusion-pipe>Zhttps://huggingface.co/Tongyi-MAI/Z-Image>Animahttps://huggingface.co/circlestone-labs/Animahttps://tagexplorer.github.io/>Qwenhttps://huggingface.co/collections/Qwen/qwen-image>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>LTX-2https://huggingface.co/Lightricks/LTX-2>Wanhttps://github.com/Wan-Video/Wan2.2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>Illustrioushttps://rentry.org/comfyui_guide_1girl>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkCollage: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
Is there some gigachad with a chad gpu to try this? I wanna see how well it fares against this prompt >>108671341https://github.com/inclusionAI/LLaDA2.0-Unihttps://huggingface.co/inclusionAI/LLaDA2.0-Uni
don't let kekestone see this lolhttps://pixeldit.github.io/https://github.com/NVlabs/PixelDiThttps://huggingface.co/nvidia/PixelDiT-1300M-1024px
>>108672564No one wants to see this though
>>108672564>don't let kekestone see this lolWhy? Because it has a woman without a dick?
>>108672564>kekestonehttps://xcancel.com/LodestoneRock/status/2046437094479020543#m>recursive model distillation be likehe's not wrong though, that's what happens when you train on synthetic data
>>108672554Damn shit has built-in turbo?
>>108672490Cosmos 2 didn't look that great but it was compositionally coherent and good enough at texthere all three were genned at 1072x1440, hi-res-fix upscaled up to 1872x2512
>>108672628>that oversaturationthat explains a lot... Anima has a huge problem with saturation as well now I know why
>>108672628Bruh cosmos is fried as fuck.
calm down cloud shill kek
>>108672647you have to go back >>108653190
>>108672647damn, this is actually all true too
flux klein edit is still so neat, Q8 model works fast too, one or two image inputs.give the anime girl a white racing suit with "Marin" in stitched black lettering on the front.
>>108672564>https://pixeldit.github.io/damn look at those veins, that dude is surely juicing kek
>>108672683apparently you can get even better results with kleinhttps://www.reddit.com/r/StableDiffusion/comments/1somo2r/coming_up_tomorrow_flux2klein_identity_transfer/
>>108672642I did say it didn't look that great
what's the best realism editing model that is good at following instructions for moving camera perspective? nano banana isn't working good since if i tell it to raise the camera a little higher, it creates a satellite image of the scene
>>108672683>pre-milf miyako saitou before her hair turns fully desaturated-red
>>108672699>filenamethat's a finetune of klein? looks pretty good
>>108672683business suit + blouse
>>108672641>>108672699why are you posting Gippity Image 1.5 gens lol
>>108672710it looks like low-detail dogshit, and it's definitely GPT Image 1.5, not even 2
>>108672719>it's definitely GPT Image 1.5proof?
Im pixeldiffusiiiiing
>>108672596
>>108672722Hive is trained on a gorillion outputs from each model, you're not gonna get a 1.0 return for GPT Image 1.5 unless it actually is specifically that
>>108672747
>>108672564>1girl, standingWOAH, FUCKING GROUNDBREAKING
>>108672758what if it's finetuned from GoyPT outputs
>>108672773>missed the point award
>>108672564Apparently OOMs on 12gb because it tries to load TE and unet model at the same time.Was gonna make some garbage for the memes but thanks for wasting my time downloading.I can probably change load precision in the inference script to get it to work but I think I am just going to delete it.
>>108672770yeah I've heard that ernie works really well with lora, I have to check that on civitai, unless you already have some examples to showcase?
>>108672695https://github.com/capitan01R/ComfyUI-Flux2Klein-Enhancer/blob/main/example_workflow/iden_wf%20(1).jsonoh shit it's not a snakeoil, it works well
>mfw Resource news04/23/2026>ParetoSlider: Diffusion Models Post-Training for Continuous Reward Controlhttps://shelley-golan.github.io/ParetoSlider-webpage>DynamicRad: Content-Adaptive Sparse Attention for Long Video Diffusionhttps://github.com/Adamlong3/DynamicRad>Normalizing Flows with Iterative Denoisinghttps://github.com/apple/ml-itarflow>LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Modelhttps://github.com/inclusionAI/LLaDA2.0-Uni>Illustrious XL & NoobAI-XL Style Explorerhttps://github.com/ThetaCursed/Illustrious-NoobAI-Style-Explorer>AI Model & โMAGAโ Influencer Emily Hart Unmasked as Indian Manhttps://www.yahoo.com/news/articles/ai-model-maga-influencer-emily-091027504.html04/22/2026>Embedding Arithmetic: A Lightweight, Tuning-Free Framework for Post-hoc Bias Mitigation in Text-to-Image Modelshttps://github.com/cvims/EMBEDDING-ARITHMETIC>Denoising, Fast and Slow: Difficulty-Aware Adaptive Sampling for Image Generationhttps://github.com/CompVis/patch-forcing>TS-Attn: Temporal-wise Separable Attention for Multi-Event Video Generationhttps://github.com/Hong-yu-Zhang/TS-Attn>AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Modelhttps://yutian10.github.io/AnyRecon>SmartPhotoCrafter: Unified Reasoning, Generation and Optimization for Automatic Photographic Image Editinghttps://github.com/vivoCameraResearch/SmartPhotoCrafter>Soft Label Pruning and Quantization for Large-Scale Dataset Distillationhttps://github.com/he-y/soft-label-pruning-quantization-for-dataset-distillation>Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representationhttps://github.com/AMAP-ML/EMF>Enhancing Continual Learning of Vision-Language Models via Dynamic Prefix Weightinghttps://github.com/YonseiML/dpw>IR-Flow: Bridging Discriminative and Generative Image Restoration via Rectified Flowhttps://github.com/fanzh03/IR-Flow
>mfw Research news04/23/2026>Image Generators are Generalist Vision Learnershttp://vision-banana.github.io>Camera Control for Text-to-Image Generation via Learning Viewpoint Tokenshttps://randdl.github.io/viewtoken_control>Hallucination Early Detection in Diffusion Modelshttps://arxiv.org/abs/2604.20354>Wan-Image: Pushing the Boundaries of Generative Visual Intelligencehttps://arxiv.org/abs/2604.19858>MMCORE: MultiModal COnnection with Representation Aligned Latent Embeddingshttps://arxiv.org/abs/2604.19902>Rethinking Where to Edit: Task-Aware Localization for Instruction-Based Image Editinghttps://arxiv.org/abs/2604.20258>Amodal SAM: A Unified Amodal Segmentation Framework with Generalizationhttps://arxiv.org/abs/2604.20748>FluSplat: Sparse-View 3D Editing without Test-Time Optimizationhttps://arxiv.org/abs/2604.20038>HumanScore: Benchmarking Human Motions in Generated Videoshttps://arxiv.org/abs/2604.20157>Render-in-the-Loop: Vector Graphics Generation via Visual Self-Feedbackhttps://arxiv.org/abs/2604.20730>Mitigating Hallucinations in Large Vision-Language Models without Performance Degradationhttps://arxiv.org/abs/2604.20366>Cognitive Alignment At No Cost: Inducing Human Attention Biases For Interpretable Vision Transformershttps://arxiv.org/abs/2604.20027>X-Cache: Cross-Chunk Block Caching for Few-Step Autoregressive World Models Inferencehttps://arxiv.org/abs/2604.20289>Self-supervised pretraining for an iterative image size agnostic vision transformerhttps://arxiv.org/abs/2604.20392>Efficient INT8 Single-Image Super-Resolution via Deployment-Aware Quantization and Teacher-Guided Traininghttps://arxiv.org/abs/2604.20291>From Diffusion to Flow: Efficient Motion Generation in MotionGPT3https://arxiv.org/abs/2603.26747
>>108672628People who say Cosmos was bad are dumb. It has an atrocious default aesthetic tune (by design, it's a fucking robotics world model), but in terms of prompt understanding, coherence, and breadth of knowledge it was basically Flux 1 level but only 2b parameters.
Updated comparison, adjusted the prompt a bit to try to force more similar results out of all the modelsPrompt:A fair-skinned young Irish woman with long, sleek copper-red hair and blue eyes stands centrally on a weathered stone walkway, posing daintily and smiling directly for the camera. She wears a whimsical pastel lavender mini-dress featuring a tiered skirt, ruffled bodice with lace trim, and sheer long sleeves, accessorized with a metallic gold crossbody bag. Her legs are clad in intricate white patterned lace tights, ending in chunky two-tone black and white platform oxford shoes. She is situated in a formal garden setting, flanked by stone balustrades topped with large white classical urns containing manicured green bushes. Immediately behind her stands a white architectural frame structure bearing the text "1GIRL GARDENS" in bold serif capital letters. The background reveals terraced flower beds, classical white statues, and a green hillside dotted with buildings. The lighting is soft, flat, and diffused from an overcast sky, creating shadow-free illumination that enhances the soft pastel colors of her dress and the even tones of her complexion. Style: whimsical DSLR street fashion photography. Mood: sweet, composed, and serene. Aspect ratio: 3:4.
Anyone tryed and used the official Circlestone Greg Rutkowski lora appart from reading the metadata? Isn't it total shit? The demo images look nothing like his art but I downloaded it anyway and used it and it's super cherry picked. Only works with vague prose style and with the slop "dramatic oil painting" prefix. If you use some anime character it completely loses the oil painting brushstroke dark fantasy effect, what a garbage lora.
>>108673089proofs?
>>108672641>>108672715this model has a weird noise pattern, doesn't it?
>>108672991Sorry but Cosmos is literally incapable of disentangling styles and characters. I'm the anon from >>108673089. The only interesting thing it has is prompt adherence. Cosmos can't grasp the concept that an anime character can have a different style applied. I downloaded an Elden Ring style lora too and same shit, only works for vague characters. The second you mention a specific anime character the lora effect stops working and loses all the aesthetic. This never happened to me with Neta Lumina or even SDXL.
>>108673089Works on my machine
>>108673106it looks like badly upscaled SDXL with like DPM++ 2S Ancestral or something lol, same kinda gooey looking trees in the background and stuff
>>108673104Not at home now, but will share because it's very noticeable. This happens with a lot of style loras. If you prompt an anime character the lora stops working and it defaults to a diluted style. I'm using the lora trigger words and the anime character without any artist styles or coloring or mediums.But if you're on a pc download -The official Anima Greg Rutkowski lora-search CivitAI for an Elden Ring lora-and whatever other style loras you want. Test the lora without a specific anime character, then test with a specific character and look at the difference in style.
>he doesn't neg the series name to get rid of its style influence NGMI
>>108673135>Cosmos is literally incapable of disentangling styles and characters
>>108673170ive noticed this with artists. extremely popular characters work fine since the dataset contains them drawn in different styles but less popular characters are way more rigid
WHY DOES FLASH ATTENTION TAKE SO LONG TO COMPILE
I see people talking about graphs and watching the loss and knowing from that if the model is undercooked or overcooked... And I wonder what the fuck is that?I didn't know you could monitor models at training
>>108672615there's something wrong with your sampler / scheduler settings
I don't know if my model was trained wrong or if I just suck at genning. Any tips?
>>108672951
>>108673081have a zit again
>>108673178thanks im adding that to my grifter keychain ;)
>>108673200Loss is the primary way training happens for AI model.Though it doesn't work the way it usually does for diffusion.It zigzags, but it is overall supposed to decrease a bit initially and then remain same for a while.In rare case it can spike a lot suddenly up or down, meaning something has gone wrong.But most of the time you let it churn at the final value for the correct time, if you do it longer than it needs, it would result in a fried model.Shorter would result in an underbaked model.Validation loss is supposed to be a way to independently determine if the model is starting to get fried.Though I never managed to get it working properly.
>>108673222wtf sampler did you use lmao, it looks weird
>>108673236How do you measure that? I always go in the blind and make some gens. But I don't know if they are undercooked or overcooked, lately I can't manage to make good models, please help me anons.
>>108673272>How do you measure that?There is literally no way to measure that besides trial and error and experience nonnie.It depends on which model, what type of lora, dataset quality/quantity, LR, batch, and other training parameters.You are going to underbake or fry until it clicks.3-4k steps is generally a good starting point for loras.Don't pay too much attention to the loss.>But I don't know if they are undercooked or overcookedIf you undercook, the character/style/concept has poor resemblance to the training dataset.If you overcook, you will get weird gens showing that irrelevant details from the dataset has been learned (this can also happen due to poor dataset diversity, quantity or very poor captioning, so in effect you can undercook and overcook at the same time!).
>>108673301>Don't pay too much attention to the loss.I mean to say don't pay too much attention to the loss as a beginner.
>>108672637I tested those three, and cosmos is equally as saturated as the flux one, both being more saturated than the z image.I don't get your complaint about Anima. It seems mostly undersaturated to me.
Local lost so bad...
>>108673260euler
>>108673331how good are the dicks it makes?
>>108673331pic unrelated?
So, I've been using automatic1111 forever, and, I'm trying to transition to confyui. Got it installed, it works , but, I need some basic workflows... Like, one that has hires fix and lora settings? I've tried making some and "wiring" it up... Well, Im just big dumb. Where to get some out of the box workflows for SD?
>>108673361Templates button is your friend.
>>108673360don't you like the api cope of making everything look like it was shot on a 2006 nokia flip phone? half the shit they post as a brag looks like gguf z image gens.
>>108673081i hate how they massacred nbp. theyre definitely running a quantized version nowpre quantized nbp was truly something else, $100 a month for 1000 4k images a day was so worth it at the time
>>108673320>It seems mostly undersaturated to me.It seems like it just accurately recreates what it was trained on to me. If you prompt artists that make desaturated art it will look desaturated, saturated artists will produce saturated results. Or you could use keywords like colorful, monochrome, etc. in the positive or negative depending on your aims to counteract that. Don't tell me you guys are just prompting "masterpiece, best quality" for aesthetics and nothing else?
>>108673382That's why local is important.The weights are yours and they can't take it.The API jews will gladly charge you hundreds of dollars a month and serve the shittiest quality possible that they think they can get away with it.I also think they are straight up doing undisclosed distills and still serve those models as the same model.
>>108673379Damn... Ok, I'll give it another look through... It was just so "API" heavy I didn't look too much in there. Thanks!
Can you guys give me some style Lora parameters? I've never seen a good style Lora, dunno if they exist. If you know tell me pls.
>>10867339835 stars status?
>>108673402Model? How many images in your dataset?
[BREAKING NEWS]https://comfy.org/countdownComfyOrg is counting down to a major release?? What could this be?
>>108673555anima preview 4 lets goooo
>>108673555must be the anima release
>>108673555announcement of the grant winner (ani)
>>108673555>A big day for open creative AIehh, it doesn't say model, probably another useless thing they made on ComfyUi like Node 3.0 or some shit
>>108673331it's rather sad for API studios. they quickly fall into oblivion, and the local community manages to create similar things, even with outdated tools kek
>>10867347390 images, I'm trying to make a style Lora. I think I tagged all correctly. I'm experimenting with some values but I can't manage to get things right. I think I'm overcooked, the models look "creamy" and oversaturated sometimes, like greasy.0.0003 learning rate, 2 repeats, 15 epochs.
>>108673555Api nodes 2.0 is finally here??
>>108673555Comfy posting hole in LDG screenshot this
>>108673555This is it, LTX 3, 8b model, as good at Seedance 2.0
>>108673331this looks like shit though, there's absolutely no fine detail at all
>>108673555what's your pronostic?>Z-image edit?>Qwen image 2.0>Another nothingburger like Comfy Nodes 3.0?>LTX 3?>Happyhorse??>Something new?
>>108673397>I also think they are straight up doing undisclosed distills and still serve those models as the same model.this happens for sure, but sadly even the distilled versions are better than localso if you want to use sota models youre kinda forced to get scammed. hopefully the gpt 2 release changes the landscape a little>>108673555theyre going to intergrate openclaw into comfyui
I have never used Local Diffusion, but I need to make an image of Laurie Wired in her pantsu how much power do I need to make that happen?
>>108673633>in her pantsuDream bigger. Generate a video of her getting nutted on. Which is possible locally.
>>108673555owo, what's this?
>>108672554Ok. I am trying better parameters.This shit also uses absolutely deranged amount of VRAM (Saw 74 gigs, and no not my GPU sadly), though I think I set the parameters wrong.Trying again
>>108673615no you don't get it bro it's realistic bro
>>108673582Model?
>>108673679>1 hour and 74gb of vram for thislmao
>>108673555that madlad finally did it
>>1086736871 hour is me trying to compile Flash attention with wrong parameters because I am retarded.Be fair to the model.
>>108673615>>108673685dont act as if you wouldnt cream your little undies if you could gen images like that locally
>>108673710I'd cream your undies. It's in the terms of service, I can do that.
Opaline
>>108673555it's probably anima, they finished the training, noice
>>108673697>trying to compile Flash attentiongeeeeg
>>108673694if he did, she would break up with his ass for a sweet cheeks nigger
What this obvious API node shilling activity going on with the countdown?
>>108673555civitai clone but deeply integrated with comfy cloudand honestly civit is so shit I wouldn't even mind it
>>108673744civitai pains me to use as a "creator" and a user. I hope it dies. I upload loras for fun not for money.
>>108672554>>108673679Ok still not a great image at least it doesn't look like a lodestone model anymore.
>>108673744i doubt comfy would want to deal with any payment processor issues that come with hosting community modelsthey need to keep their squeaky clean track record in order to get those api partnerships
>>108673555my dick is ready
>>108672527rate my outfit /g/
A few years ago comfy would be here telling us ahead of time, but the faggot schizos drove him away and now we get his sloppy Reddit seconds.
>>108673807they want to do the same to big russ
>>108673744I might eat this words but I honestly struggle to believe Comfy can create something worse even if he actively tried.I would welcome it.
by the time my lora is done for anima preview 3 the final model is gonna release :crying emoji:
>>108673761is this the super duper text model that was supposed to be better than gpt 2 image?LOOOOOOOOOOOL
>>108672527damn, this ai stuff is pretty funny
>>108673802I look just like that dude in the middle the one with the black shoes and white socks
It would be great if tdrusell discovered a format converter for Lora from SDXL to Anima. If you can convert a jpg file to png and a zip file to rar or Word to PDF, why not SDXL to Anima? This is the zeitgeist that is holding back local.
>>108673555Nodes 2.0 and autocompleter!!!!
Your Taylor Swift that took dozens of gigabytes and 5 minutes on an RTX PRO 6000 sirs.(I had a gen with better facial likeness but the rest was not good)Maybe they fucked the inference script on both the github and hf, but maybe it's not good and they cherry picked the images.I am gonna run a few more tests though, I am 3 dollars into this.Sunken cost is a bitch.
ZAMN! 128 ranking made my lora like 529mb big. Dats a big lora. The samples are awful (picrel) so far but I'm thinking it's going to level out much better over all.
>>108673382That's z-image?
>>108674089128 rank? that's all? you need 1024 minimum if you want to get details right
>>108674089It's normal to get a bit weird images in the early samples, but this looks like the lora got raped already.You are using tdrussell's recommended settings, right anon?
>>108674104I'm gonna piss in your mouth>>108674106It's epoch 3 literally the 3rd sample. On my smaller lora at ranking 16 it was fine I think this bitch is just taking a minute to digest everything. There is 1800+ images plus tags.
>>108674089You'll share the finished kino lora with the thread that helped you figure out anima, right anon?
>>108674039I think someone did this for going from SD1.5 to Pony or something and no one used it.
Is there a way to use new schedulers and samplers on forge neo?
>>108674118I always upload and share my shit. It's just been a while I haven't seriously trained anything since pony.
>>108673166>>108673106damn I knew I've seem this one beforehttps://x.com/SwayStar123/status/1960325938706554950
>>108674039This is not gonna work since most SDXL loras are unet+te and you only train unet on anima.For the unet part you might be able to apply delta between SDXL baseline and lora to anima, but between wildly different architectures this is not likely to work well.
>>108673555If Anima 4 releases tomorrow, I will create the biggest FUD campaign in internet history.
>>108674143Wait around 2 more years. We can already convert all types of files. And we can emulate any PlayStation or NES game on PC. It's just a matter of time before someone figures it out.
>>108672554>Make a detailed infographic about ComfyUI.This is with "thinking" enabled by the way. Here is what it "thought":>Thinking: ComfyUI's user interface is showcased in a clean, minimalist design, featuring a central modal window with a white background and a blue header labeled "ComfyUI." The window displays a list of four options, each accompanied by distinct icons: "General Settings" with a gear icon, "Accessibility" with a wheelchair icon, "Accessibility" with a wheelchair icon, and "Accessibility" with a wheelchair icon. Below the list, a blue button labeled "Continue" is prominently positioned. The surrounding interface includes a sidebar on the left with icons for "Home," "Profile," and "Settings," and a top navigation bar with options such as "Home," "Profile," "Settings," and "Help." The overall layout is organized and user-friendly, emphasizing clarity and accessibility, with a focus on simplicity and ease of navigation. The visual style is modern and functional, utilizing a limited color palette of blue, white, and gray to create a professional and approachable aesthetic.LMAO fuck this garbage.
>>108674177where's the spaghetti
>>108674144why
>>108674039>tdrusell discoveredWhy him and not someone like, for example, Fizzledorf?
>>108673555i hope comfyui won't be unstable again. It's finally stable...
>>108674212We are always a single commit away from all of our workflow breaking.Such is the spaghetti life.
>>108674122it's a complete waste of time given training loras never takes that long, and if the lora degrades at all in the conversion you might as well retrain it, it's also a chance to update with new data etc
13 hours until pony v8, astrolite and comfy have built a new combined environment based on stable diffusion 3
>>108673183Is this the new fud? >@stonetoss, Ayanami rei wearing a white bodysuit.
>>108674367@stonetoss asuka langley squatting over rei ayanami in dominance
>>108673555inb4 is a API nothingburguer
>>108672554It also can make you wait minutes for a mid image caption that a 2024 LLM would give, and would have done so two orders of magnitude faster!What an amazing model hahahahaha!I feel like a complete sucker for wasting time and money on this.On the bright side, I lost my text diffusion virginity today.I am not even gonna bother testing its edit capabilities, yep I am good.I hope the curiosity of the anon who originally posted it is satisfied.
the two anime girls have their hands on their hips instead of in the air.klein edit q8 distilled, pretty cool.
>>108674436looks very much in the air
>>108674436oops, thats not the right image.
>>108674445>hey mister do you play basketball?
>>108674396
>>108673957Looking healthy anon
>>108674467lmao, hell yeah dude. all that's missing in these stonetoss gens is the secret hidden amogus
>>108674467NTA but BASEDBASEDBASED
Which model would be best for genning poses with mannequins to use as drawing references?
>>108674547No fucking idea, try shitgle model since was trained based on videos, or any model trained on real videos and not anime garbage
How the fuck do you have a website this big with only 30% up time?It's like every time I want to use or upload anything the site is shitting itself.
>>108673555
>>108674833Give me something I can do on AMD your fucks
>>108674436Link to the model?
>>108674848They'd rather implement digital ID than improve local
>>108674467i hope he sees them anon
>>108674467May I ask how you curated the dataset anon? Most stonetoss panels are pretty low res. Did you paint over speech bubbles and upscale them? I doubt you trained on bunch of multiple views and 4komas.
>>108674865He mocked the anti-AI hysterics multiple times, if there is an artist that doesn't mind his art being stolen, that's probably him.
>>108674881only sfw sample I can share but it's getting better. man I hope it's an anima release but ngl regardless if I have to restart my training
>>108674856https://huggingface.co/unsloth/FLUX.2-klein-9B-GGUF/tree/mainusing the q8 from there, any will do but best quality for the file size, comfy has templates for klein edit.
>>108674436hey I recognize this!!!
>>108674903>I hope it's an anima releaseit wont be
>>108674975fennec man what is it...I remember you mentioning that you had a better quant method in store compared to shartchaku, is that it????????????
the painting of the anime girl on the car is wearing a black business suit.neat
>try prompt relay to make long clips work better"the woman pulls out a revolver and aims it at the camera and fires off one round with large muzzle flash as the camera falls back looking at the sky while the woman stands above the camera looking down and laughs at the camera.|the woman raises her hand holding a large revolver. she aims the revolver at the camera. she then pulls the trigger and the revolver fires with a large muzzle flash.|the camera falls backwards tilting the camera upwards revealing the blue sky.|the woman walks into frame laughing while looking down on the camera."https://litter.catbox.moe/7znvt3pb46nyp421.mp4 (EXTREMELY LOUD)Another schizo node, what a shame.
>>108674436i love klein edit so much. what used to take me 30 minutes 5 years ago I can now do in just 5 seconds.
>download interesting workflowAUGH
>oh civitai finally has seedance 2>test it out with an image that looks stylized but with detailed shading>filtered for detecting a "real person"I can't believe how cucked it became. It's legitimately unusable unless you're doing literal cartoons.
Oh nice, that klein consistency lora does wonders.
do you guys know a way to run zit on mac with controlnet? drawthings has been the fastest program, and i had issues and slow speeds with comfy unless i'm being noob.
>>108675181how many snake oil nodes?
>>108675274does pytorch even have a metal backend?
redpill me on the best realism editing model
>>108674924sankyu
I hate that everything is built around CancerUI now
>>108675463see: >>108672647they're shilling api because they know they can shove as many ads into the ui as they want because local has no other options.
>>108675463It can always be worse.
>>108675463NAI no
>>108675371Just some retard chink making shit overly complex. And nothing is translated.
>>1086754770 ads in my install, you can keep fudding tho, retard.
>>108672527>thank you tdrusellCucks
>>108675463you can cope and seethe about it but python and nodes are the most flexible and sane option
>>10867558135 stars status?
>>108673555Sorry Comfy I will spoil your news>HANASEE will release its proprietary image generation model, โHANASEE-image-1.0,โ specialized for vertical-scrolling manga expression.>This model is based on an open-source image generation model and has been further trained with supervision and collaboration from professional manga artists. Its strengths include consistent character representation, a manga panelโoptimized art style with visual coherence, and composition tailored specifically for vertical-scrolling manga.
>>108675779Oh good, is this the model Ani was working on? Did you receive the grant?
>>108675585meds
>>108674177>"Accessibility" with a wheelchair icon, "Accessibility" with a wheelchair icon, and "Accessibility" with a wheelchair icon.So it IS familiar with comfyui users
>>108675779>vertical scrolling>mangalooooool gookshills are trying so hard
>>108675779Ani! Good to see your anime Japanese model. Well deserved!
>>108675779>vertical scrollinfIs it true that Anistudio is going to be a UI for mobile devices?
>>108673331Looks like anon cloudfag have never run comfyui...Also return to your boring thread with pegi12+ gen>>108653190
>>108675779>verticalnot manga
>>108675779Looks so good! Did lodestone help you?
>>108675799yeah lmao imagine calling manwha manga ROFL fucking gooktoon sloppers garbage
Ani Bee delivered!>>108675788>>108675805lol, no, that model is not mine. In the end that project did not go anywhere, I had fun but I really want to move onto the next project I have planned >>108675821man, I want to get this in my app soon. I want to be able to put workflows into masking, brush and selection tools to customize everything in your mobile
>>108675913Pedo
of course its not yours that was the joke lmaeo
>responds a question he asked himselfhe admitted to doing this btw he literally generates fake interestprotip failed dev: no one gives a fuck about your garbage IMGUI wrapper
>julien
>>108675913 here's a * just for you, treasure it
>>108675913Hi Ani! I really missed you.How was your trip to Japan?
>>108675942yup. kek. >>108675977I should have went back this week with my business partner and I miss the japan team though but I'm too busy rn for travelling as much as I did for the past two years.
>>108674089I gave up on sampling and just disable it now. It never seems to accurately represent what the model ends up generating in comfy.
>>108675998>yup. kek.no like in the sense that you never complete anything you say you will
>>108675998That's a shameWhen can we expect new anistudio updates btw? No rush, just wondering
>>108675949>he admitted to doing this btw he literally generates fake interestKek I forgot about this. What a loser.
>>108676013go back ran
did that one sting a little? lulz
>>108675998Damn, did you bring some spicy doujins at least?~
do not look up his "work" on /d/ guys
>>108675949bullshit lol, making up stories again faggot?
>>108673686Noob and illustrious
i tried that anima -> zit workflow but im missing some stuff it seems but now im nervousi don't want to spend 8 hours debugging my comfyui install by updating anythingthe [object Object] one is the RTX UPSCALE LATENT nodewhat do
>>108675584people are allowed to have opinions without some faggot like you running in telling them their opinions are run and call it cope and seethe. you're the one coping and seething faggot
>>108676178don't you have ComfyUi manager installed? that way you can use it to install the missing custom nodes
>>108676209those are core features, not custom nodes
>>108676178install https://github.com/Comfy-Org/Nvidia_RTX_Nodes_ComfyUIthe rtx upscaler isn't a latent space upscaler, so whatever you are looking at is probably a subgraph with a vae decoder and the rtx upscaler to keep the wf clean.
>>108676223oh yeah you're right, you don't have much choice but to update comfyui anon, or you simply don't use the node
>>108676178Do apython -m pip install -U --no-build-isolation nvidia-vfx --index-url https://pypi.nvidia.cominside your python_embeded folder.
python -m pip install -U --no-build-isolation nvidia-vfx --index-url https://pypi.nvidia.com
Yoo whats the best photo to drawing model/lora nowadays? I was browsing and found this ai site ads. Tried it and it was actually not bad? Pic is the result.
>>108675779gay
>>108675779Attention whore kek
>>108673398You can filter those out. Click the "Runs On" dropdown menu and check the "ComfyUI" box.
>>108674869It's been a while since I collected the images so my memory might be off a little, I originally made a lora on pony with them way back. I split the panels with kumiko:https://github.com/njean42/kumikoThen went through them by hand and deleted any that got messed up or seemed like a bad image to train on (unusual content, bad framing, etc). I wanted the font so I kept in images with speech/text, trying to keep it around 50% text/no text. After that I upscaled all the remaining images with waifu2x. For this lora I regenerated all the captions with gemma 4 and took out the tags (mostly out of laziness).
>>108675913>>108675977>>108675998>>108676017>>108676032stop replying to yourself, you subhuman raped retard
>>108672683> gives a suit> fucks up fingers
>>108673555>a few cominghmm...
>>108673081Wan?
>>108676750what could it even be? they usually roll out new api and local support unceremoniously, like a blog post saying they now support seedance 2.0, have they done countdowns in the past? i never pay attention to hype.
>>108673555Almost certainly a nothingburger or or a paid promo.
>>108676816Maybe they will introduce loot box nodes and ComfyCoin BattlePass?
>>108673555 could be a decent thing involving a fennec girl
>>108676816>have they done countdowns in the past?nope it's the first time, must be something really important, but I really don't know what that is
>>108676183> to have opinionsani please
>>108675998Hey julien you're literal human garbage
>>108677014trvkie nvkie
best node for llm prompt rewrite?
>>108673555This shit is gonna be ComfyClaw or similar bloatslop
>>108677089best how? they don't optimize anything in particular. the biggest versions of current qwen can certainly pad out stuff into a pretty decent story with a LOT of VRAM and compute, but in most models that doesn't necessarily result in better scoring for x thing and for the most part throwing in random *booru tags or nouns or w/e also gives you additional random things if that's the goalbasically LLM prompt rewrite is dubious. maybe for future video models where there's a sequence of events to invent and you want something that might often happen
<4 hours for christmashttps://comfy.org/countdownhttps://www.youtube.com/watch?v=tapCjTA2E9Q
>>108677389and then we get something like>Nodes 2.0 is finalized now no more beta. Say good bye to the old comfyui and welcome our new comfyUI.. UI
>>108677404>we also force removed the old nodes and removed the arg to freeze your frontend version :^)
>>108677404>>108677407I hate Nodes 1.0. Nodes 2.0 have less visual distraction, theyโre more muted, easier to see, and easier to organize, which I really like. Nodes 1.0 look like mini clowns.
>>108673331>>108674003>still puts people out in the middle of a streetEven the big boy models haven't overcome that old SD1.5 quirk, huh?>>108677404Maybe he got the performance up to a massive 15fps when typing now. You never know!
>>108677389my best is on anima finally finished, since ComfyUi is the one who gave money to tdrussell he's also the one to make the announcement
>>108677389my optimistic guess is that they are gonna start training their own models
>>108677699>my optimistic guess is that they are gonna start training their own modelsim 99% sure thats way too expensive and has close to 0 ROI
>>108677686preview 3 only came out 2 weeks ago, but god it would be based if if they did a countdown and a big red carpet for anima.i think that anti-anima fellow would legit kill himself. >>108677699i could actually see something like this happening if they really wanted to lean into comfy cloud.
it won't be anima, unfortunately
>>108677699>they are gonna start training their own modelswould be weird no? they already paid someone to do the training for them
fuck, body horror is caused by fp16 accumulation. turned it off and anatomy improved significantly. this sucks, because it gave like 50% more performance.
>>108677755you can't get like 50% speed increase without paying the price unfortunatley
>z image base bf16, good results>z image base fp8, fucked up dark resultswhy
quants suck for image models
>>108677805because fp8 is not that good, go for Q8 instead
>have to make an account to download klein
>>108677887Just make a throwaway
>>108673402>>108676104There is quite a bit of different between both (v-pred and epsilon)But here is what I post when people ask for noob parameters:python sdxl_train_network.py --v_parameterization --pretrained_model_name_or_path ~/models/NoobAI-XL-Vpred-v1.0.safetensors --tokenizer_cache_dir ~/lora/tokenizercache/ --train_data_dir ~/lora/images/ --shuffle_caption --caption_separator , --caption_extension .txt --keep_tokens 1 --resolution 1024 --cache_latents --cache_latents_to_disk --enable_bucket --min_bucket_reso 256 --max_bucket_reso 2048 --bucket_reso_steps 64 --dataset_repeats 8 --output_dir ~/lora/output/ --save_precision fp16 --train_batch_size 2 --max_token_length 225 --xformers --max_train_epochs 10 --persistent_data_loader_workers --max_data_loader_n_workers 1 --seed 44453 --gradient_checkpointing --mixed_precision bf16 --logging_dir ~/lora/logs --log_with tensorboard --zero_terminal_snr --loss_type l2 --training_comment "Trigger word is blabla" --save_model_as safetensors --optimizer_type Prodigy --learning_rate 1.0 --max_grad_norm 1.0 --optimizer_args weight_decay=0.01 decouple=True d_coef=1 use_bias_correction=True safeguard_warmup=True betas=0.9,0.999 --lr_scheduler cosine --lr_warmup_steps 0 --min_snr_gamma 5 --prior_loss_weight 1.0 --network_dim 16 --network_alpha 1 --network_dropout 0.08 --network_module networks.lora --save_every_n_epochs 1
>>108676688Interesting tool. Thanks for the response anon.
>>1086779238 repeats dataset for a style? That seems crazy. Snr gamma 5 seems a bit low.
It seems that the guy making the dvine models got his account deleted. I can no longer find the first versions of dvine, only the new ones, anyone has a link?
>>108677940That wasn't a style lora.Repeat information is useless without knowing dataset size.What are you even basing "Snr gamma 5 seems a bit low." on? It's the value suggested in its paper and still enough to fuck up some loras sometimes.
Do you anons have artstyles you like that are purely based on AI styles or mixes?
What's a good model that also doesn't have any use guidelines, censorship or rules?
>>108677950are you referring to playtime...?
>>108677950if you're playing about playtime here's herehttps://huggingface.co/Playtime-AI
>>108678077ZIT is closest to what you are asking for
>>108675463comfy is just a bunch of json, you can simply use that.
>>108678084No? There is a series of models named dvine.
>>108678077it depends...if you're a pedophile, go with animaif you want straight up porn, find a zit mix (be prepared for body horror)if you want tasteful lewds, go with gpt image 2
Playtime managed to get suspended from plebbit too KEKHe can't catch a break
>>108678318banned from huggingface too lol
>>108678094thanks for this, reported :)
>>108678330sorry anon, someone else did the free janny job for you already
>>108678326His models were there just a few seconds agoHe might legit rope at this point(Jokes aside, damn...)
>>108678345I have no idea who this guy is, what did he do to be banned everywhere?
>>108678355video loras(Got jannied for being hecking deepfakes)
>>108678318>>108678326Damn, I was just going to post about the civitai hateboner against this guy too, what did he do? civitai mods just got them banned from reddit, huggingface, I think its bghira to be honest
what social media should i use to promote my art work?
>>108678390kek wtf
fucking redditors I swearhttps://www.reddit.com/r/StableDiffusion/comments/1suctuw/removed_by_reddit/
>>108678390>I think its bghira to be honestprobably, he's even lurking here kek >>108678330
>>108678390you might find them onhttps://civitaiarchive.com/
>>108676846imagine>ComfyUI is now part of the huggingface family!
>>108678410that might be possible, huggingface has bought llamacpp after all
>>108678390>No idea who the fuck bghira is>Google the word>Furtroon pfp on hfI would believe it. I don't need further evidence at all.As for civit jannies, they are total faggots like all jannies everywhere and they want to divest from "high risk" shit like video loras because of deepfake legal risk.They are too pussy to ban it officially, so they slowly boil the frog by sporadically nuking major creators one by one.
>>108678339do you seriously think i'd explain my plan if there were even the slightest possibility you could affect the outcome?
>>108678421I know, imagine piotr unleashing his 'skills' in comfyui
>>108678407at this point civitai is so cucked you have to go to civitaiarchive to see the good loras kek
>>108678426>civitai 'divesting' from 'high risk'the joke writes itself kek
>>108678312Go back to your room retard>>108653190
>>108678427what plan? you just pressed a button this isn't prison break
>>108678407https://www.reddit.com/r/unstable_diffusion/comments/1srqlkb/ltx23_titty_drop_lora_by_playtime_ai_link_inside/Damn, and just when he published this loraI need a tittydrop lora for wan.2.2 so bad, I just have an old 2.1 one that works so-soI guess I'll train one on my own. Dude needs to go the telegram group route, its the only safehaven for realistic ai nsfw content, fuck these sites (reddit, civitai, huggingface)
>>108678427>>108678330Kill yourself you mentally ill furfaggot troon
I want to sleep but I'm gonna miss that, you better deliver Comfy!
>>108678312Anima would be also the best choice for non-pedophiles. It outclasses any zit-mix for straight-up porn easily. Just train a realism lora.
>>108678485I can see someone making a light realistic finetune of that model just to unleash its power
>>108678094>>108678459https://gofile.io/d/xsGBHeLTX-2.3 - Titty Drop.safetensors9CC9B261405DEC6AF8ED76BAB198BB72literally downloaded this and saw his account was banned when i refreshed the page, what a save
>>108678519not all hearos wear capes anon
>>108678473please don't lose sleep for Nodes 2.0
>>108678519>https://gofile.io/d/xsGBHeGod bless you anon, gotta download fast before bghira reports it (he is lurking here)
>>108678519>>108678553too late I guess
>>108678562works on my machine
Fresh >>108678585>>108678585>>108678585>>108678585
>>108678562I'm not even getting that. Just clicking any download button only refreshes the page
>>108678562kek, gofile was ip-range banning me, nothing that a good VPN cannot fix ;)Thanks anon again
>>108678519what are the trigger words? HF already banned the entire card.
>>108678676She [pulls up | lifts] her [shirt | top | bra | etc.], exposing her breasts.
>>108678730thanks
>>108674129just vibe code them into forge KEK