Discussion and Development of Local Image, Video, and Music Models
Previous: >>109041690
https://rentry.org/ldg-lazy-getting-started-guide
>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner
>Z
https://huggingface.co/Tongyi-MAI/Z-Image
>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net
>Qwen
https://huggingface.co/collections/Qwen/qwen-image
>Klein
https://huggingface.co/collections/black-forest-labs/flux2
>Wan
https://github.com/Wan-Video/Wan2.2
>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23
>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage
>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg
>Local Text
>>>/g/lmg
>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
can Klein 9B work with anime also can it do nsfw edits aka give a fat titty anime bitch a nipple ring?Is the performance between quants really that big?
>mfw Resource news
06/13/2026
>PRXPixel (text-to-image, pixel space)
https://huggingface.co/Photoroom/prxpixel-t2i
>SCAIL Auto Extend
https://github.com/Brobert-in-aus/scail-auto-extend
>MotionBricks: Scalable Real-Time Motions with Modular Latent Generative Model and Smart Primitives
https://nvlabs.github.io/motionbricks
>dyfuzor-web: turns an Excalidraw scene into an Ideogram-4 structured JSON
https://github.com/karolrybak/dyfuzor-web
>sageattention-autotune: Autotuned block sizes and other QoL improvements
https://github.com/woct0rdho/sageattention-autotune
06/12/2026
>ComfyUI-Flux2Klein-Enhancer: Conditioning enhancement and reference latent control
https://github.com/capitan01R/ComfyUI-Flux2Klein-Enhancer
>InterleaveThinker: Reinforcing Agentic Interleaved Generation
https://zhengdian1.github.io/InterleaveThinker-proj
>Experimental Anima LLLite Regional Controlnet
https://huggingface.co/Sen-sou/Anima-LLLite-Regional-Controlnet
>World Tracing: Generative Pixel-Aligned Geometry Beyond the Visible
https://haoz19.github.io/world-tracing-page
>VietFashion: Benchmarking Sketch-Text Composed Image Retrieval for Cultural Outfits
https://hng0303.github.io/VietFashion
>Modality Forcing for Scalable Spatial Generation
https://modality-forcing.github.io
>VideoMDM: Towards 3D Human Motion Generation From 2D Supervision
https://videomdm.github.io
>EvTexture++: Event-Driven Texture Enhancement for Video Super-Resolution
https://github.com/DachunKai/EvTexture
>Budget-Constrained Step-Level Diffusion Caching
https://github.com/Westlake-AGI-Lab/BudCache
>ECA: Efficient Continual Alignment for Open-Ended Image-to-Text Generation
https://github.com/Snowball0823/ECA
>i1-3B: A Simple and Fully Open Recipe for Strong Text-to-Image Models
https://huggingface.co/zlab-princeton/i1-3B
>mfw Research news
06/13/2026
>MoVerse: Real-Time Video World Modeling with Panoramic Gaussian Scaffold
https://orange-3dv-team.github.io/MoVerse
>Learning to Solve Generative ODEs Beyond the Linear Span
https://arxiv.org/abs/2606.08672
>Echo-Memory: A Controlled Study of Memory in Action World Models
https://arxiv.org/abs/2606.09803
>Beyond Consistency: Preserving Temporal Structure in Zero-Shot Video Editing
https://arxiv.org/abs/2606.08780
>BioVid: Autoregressive Video Generation with Biological Behavior Semantic Comprehension
https://arxiv.org/abs/2606.08674
>CSFlow: Aligning Flow Matching with Human Contrast Sensitivity
https://arxiv.org/abs/2606.08833
>MaskAlign: Token-Subset Representation Alignment for Efficient Diffusion Training
https://arxiv.org/abs/2606.08788
>Where the Score Lives: A Wavelet View of Diffusion
https://arxiv.org/abs/2606.08309
>MOFA-VTON: More Fashion Possibilities with Fine-Grained Adaptations in Virtual Try-On
https://arxiv.org/abs/2606.11148
>Next Forcing: Causal World Modeling with Multi-Chunk Prediction
https://gangweix.github.io/next-forcing
>MotionEnhancer: Leveraging Video Diffusion for Motion-Enhanced Vision-Language Models
https://arxiv.org/abs/2606.06853
>Can Image Models Imagine Time? ImageTime: A Novel Benchmark for Probing Visual World Modeling Through Spatiotemporal Consistency
https://arxiv.org/abs/2606.10620
>Rethinking 3D Shape Generation: Diffusion over Superquadrics
https://arxiv.org/abs/2606.08957
>Vision-Language Asymmetry in Bistable Image Captioning
https://arxiv.org/abs/2606.08031
>Do Vision-Language Models See or Guess? Measuring and Reducing Textual-Prior Reliance with a Phrasing-Controlled Benchmark
https://arxiv.org/abs/2606.10400
>RAPID: Layer-Wise Redundancy-Aware Pruning and Importance-Driven Token Merging for Efficient ViT
https://arxiv.org/abs/2606.08156
>A Unifying View of Attention Sinks: Two Algorithms, Two Solutions
https://arxiv.org/abs/2606.08105
>visited tiktok for the first time in years looking for dance videos
>99% of the results were just kling motion slop
uhhh... wtf
anyone know a good place to find reference videos?
>>109047336>>109047344Can you please stop spamming this, you have been caught linking anons malware multiple times which is shown in the OP.
All I want for christmas is a model that knows how guns and shooting works.
>>1090473632 more yearstrust
Remember to put aliasing in the negative prompt
>>109047336>>109047344Fuck off thread schizo
I wish there was an LTX SCAIL, or a Wan2.2 one at least
Using anima and can't get rid of dark skin on pov hands etc. I can't self insert onto this. How fix? Preferably without negatives for turbo, or I guess I'll figure out negpip finally
>>109047477Even negpip can't fix this lol
>>109047358>>109047456Agreed, this spam has no purpose
>>109047470should I cum to this /g/?
>>109047470just use v2v if you're so desperate for those models
>>109047456>>109047488Why is he posting this here again?I thought he only posted this in his containment thread?
>>109047436pleasu understand, work in progress
>>109047495idk, should you?
>>109047484There's no way of prompting around it? It's just too baked into the data? I'm messing around with negpip now and I can get them grey which is better I suppose...Could I get around it with a lora? If so, how many images would I need? I assume fully synthetic data is bad but I guess I can just grab booru images and their captions and just edit the image to match what I need?
>n*gbo is lonely again in his containment generalFuck off
>>109047470>>109047509cant remove the e-girl cringe even with AI
>>109047550I didthanks anon
>>109047592Must be a sad life to be this obsessed with some illusionary nemesis.
almost forgotmaintain thread qualityhttps://rentry.org/LDG_vital_info
>>109047687based
>spent hours tinkering and perfecting a style with anima
>noob nails it on the first try
Is Anima seriously meant to be the replacement for SDXL?
>>109047805what do I need to do stuff like this? man I feel like shit changed in the span of 2 weeks and I'm fully out of the loop.can you point me in the right direction pls
>>109047831
update comfy and use this workflow
https://github.com/user-attachments/files/28759255/Wan21_SCAIL2_Testing.2.json
https://huggingface.co/Comfy-Org/SCAIL-2/tree/main/diffusion_models
>>109047764How is this illusionary when the anon spreads malware?
>>1090478672B and a C
>>109047470left mogs
>>109047823You are unable to tell the difference between low channel and high channel VAEs?
>>109047939vae is sloppers copel, if the style is shit, the vae is shit
>>109047940>no I cannot tell the difference Okay.
>>109047509>THREE FINGERS
>>109047956gen her kissing her wife
>>109047867
>>109048005smooth movement
why is my shit so fucked can someone help me?im using the mxfp8 with default workflow
>>109047867>>109048005weirding me out how her waist looks like a cylindrical lego piece on top of a pelvis block
I am worried that this hobby has taken over my life. I don't do anything other than genning cute 1girls. All day every day.
>>109048095go outside and look at the 1girls irl
>>109048095Same. I mean I do other things as well but I'm starting to wonder if I'm addicted to genning. If my computer is turned on I'm genning all the time.
>>109048100but then he'll be charged with looking at 1girls
i dream of my 1girls
>>109048109anon's from the UK?
>>109048118
>>109048095Tell me about, I have to meet this girl tonight and I'd rather stay at home, smoke some weed, gen some 1girls and then use SCAIL to generate coom videos
>>109048055
increase the fps on the save node
or use my workflow
it picks the fps automatically
https://github.com/user-attachments/files/28759255/Wan21_SCAIL2_Testing.2.json
>>109048187>I'd rather stay at home, smoke some weed, gen some 1girls and then use SCAIL to generate coom videosDo it. That's what all the cool anons do.
>>109047313Is Earnie any good? Is it worth downloading and trying?
>>109047564It's not anima, you lie anon
>>109048139This guy sucked
>>109048264Woman appears left on chair, like horror movie
>>109048401I have been waiting months for anon to animate that gen.
the towel is floating
>>109048512very original fellas
>>109048189
i swapped to the fp8 model and that seems to have fixed it
thanks for the help
https://files.catbox.moe/qxl7gi.mp4made some progress
>>109048637Anon, this is pdf territory.
>>109048663that's an actual commercial that aired on television though
Why does it seem like no one cares about 3D stuff? Like I haven't seen anyone try to implement Kimodo in Comfyui except some seemingly broken node pack I found on google. I feel like Kimodo would be really useful when paired with SCAIL2
>>1090486772d = soul3d = soulless
>>109048688retard
Having a skill issue. How do I prompt ideogram to get real life pokemans instead of whatever the fuck this is?
>>109048705now I'm tempted to revive my pokemon wildcard workflow
>>109048637shit taste, but good work. poor guy. he's gonna catch std at a young age kek
>>109048759Looks like shit>>109048705post the prompt and we can help you
>>109048797no one on this board has ever been able to top my random pokemonI know ityou know iteveryone knows it
Computer generate Hebi Nyoubou from Fate/Grand Order doing a rimjob invitation
>>109048835Ok what the fuck is that thing
>>109048797>post the prompt and we can help youNormal amateur photography prompt. That pic was just "vaporeon". "Real life creature or animal" created even worse monstrosities. Now I found that "highly detailed 3D render of a Vaporeon with fur simulation" looks decent I guess.>>109048759gib me
>>109048858gen 25 pokemanground/poison, maybe?
>>109048874>Normal amateur photography prompt. That pic was just "vaporeon". "Real life creature or animal" created even worse monstrosities. This isn't useful information to help you, now you're entertaining the malware spreader.please go to /sdg/ if you keep being obtuse like this.
RIP, this thread fucking sucks now with the lame ass deepfake posters. Thankfully theres alternatives.
>>109048969thanks for you contribution, nogen
>>109047844thanks broalso does this run on a 4090 and 64gb ram?or would I need more?
>>109047353/wsg/
> >109049207fuck off
I've asked every fucking online AI to composit this image with the happy merchant image (basically put it in the Polaroid photo's frame) and every one of them has refused to do it because it violates guardrails.is there a way to do it locally with any of the tools?
>>109049288yeah, use GIMP
feels like we're stuck in 2023. plastic skin shitgens, unironic wan2.2 posting, tardbo sdxl-tier slop. local is dead
>>109049304Be the change you want to see.
>>109049294GIMP is incredibly hard for this sort of stuff. I was hoping some AI could do it. I know ChatGPT can do this sort of shit but it refuses in this case.
Anima desu :(
>>109049304Actually, you are wrong. AI has advanced by great strides over the past few years. As a pioneer of such new technologies, it may appear to be moving at a slower pace since you are more up-to-date with such news in comparison to the average person.
>>109049364it takes like 2 seconds to shoop this dude, you dont even need AI for that.
>>109049364yet edits like that have been done millions of times by hobbyists
Downloading girls I know from insta and facebook and undressing them turns out to be quite some fun.
>>109049383>Downloading girls I knowI dont know any 3DPDs at all and thus have nobody to undress.
>>109049372actually it's more like API innovates while local stagnates
>>109048360By CoD rules, he has the higher K/D ratio so he pwns u n00b
>>109049366
>>109049391Might be a skill issue, I guess. Try touching grass first.
>>109049399I understand you lack the vocabulary to properly speak your mind but do not despair! Open weights are advancing at a steady pace, if a few steps behind paid API models. Companies and open source communities alike benefit from the valuable research from open weight models, so it is highly unlikely to truly "die".
>>109049399>API innovatesOr maybe they have access to huge models? Also what I saw from GPT didn't blow my mind actually. Lots of slop too.
Kijai deleted the Bermini models? What happened?
>>109049453Also it's Bernini. Don't know why I kept spelling it with an M
>>109049288klein or qwen should be able to do this trivially
Catjack desu :(
Do you need regional prompting with anima?I'm skeptical that it can pull it off with natural language
>>109049569Thanks so much anon!
>>109047313Anyone with 7900xtx? Can't run Ltx2.3 I2V. Getting stuck or OOM.
is scail only 1girl or can it do more?
>>109049698
anima isn't the worst with composition instruction but it'll need help if you wanna do precise/intricate stuff. there's a regional conditioning node as well as a regional cn you could play with
https://github.com/Sen-sou/Comfyui-Anima-Regional-Conditioning
https://huggingface.co/Sen-sou/Anima-LLLite-Regional-Controlnet
>>109049884Sorry but with your track record of spreading malware I will ignore your suggestions.
>>109049884Go back to your containment general deboYou're not welcome here
>>109049884catbox?
>>109049930Haven't been here in months why is he posting here?Wasn't he happy in his own thread?
>>109049884thread schizo
>>109049938this mixed model mostly ignores anima artist tags, sadlyhttps://files.catbox.moe/2apo25.png
>>109049968thanks
Got sick of the image segmentation not working properly so I've vibe coded a node which lets you draw points/bounding boxes around your inputs so you can specify exactly which image character maps to which video character.
>>109047805kek'd
>>109047873looks more like 2EE to me
>>109050133and of course after perfectly masking out each character in both the input image and video it ignores their mask colours and just matches them left to right
>>109048811fffff
>>109049288yes there is
>>109050172how?
>>109050208carefully
>install old 2080 XC Ultra that was just sitting in storage
>6.25 slots of GPU
>4090 Strix OC just a compute node now
>no pesky config in BIOS/Windows
>both use the right amount of PCIE lanes automagically (had to move the 4090 to the lower slot so they'd both fit - you can fit a whole sheet of paper between them!)
>mrw everything just werks
Damn, multi-GPU sure is a lot better today than the last time I used it (8800GTs). Basically zero setup now.
>>109050307
i took out my dual gpu setup because the powerful card on top was dying of suffocation. i should try again with a riser cable so the bottom card can hang down
>>109050307horse face
>>109050320
My 4090 is about 4" closer to the two 140mm fans on the bottom of my case, so it falls back to ~32C about 30 seconds after finishing its workload. The fans almost never turned on on the 2080 doing Windows stuff, so I'm not that worried about it having air to breathe.
>>109050400
Horses wish they were that cute!
https://files.catbox.moe/n9qo78.mp4
>>109050468wtf is this real?
>>109050449
>>109050484Yes. Teleporter was vibe coded by fable.
>>109047908left is brown
>>109050492meant to reply to >>109050468
>>109050435How did you get both?
>>109050504By prompting for both.
>>109050435https://animadex.net/?mode=characters
>>109050523oops, misquoted >>109050504
>>109049698>I'm skeptical that it can pull it off with natural languageAre you unable to try it out for yourself?
>>109050519Didn't think it actually works this way in Anima with unrelated characters from different fandoms, I'm slightly behind the curve.>>109050529Cool shit, thank you.
>>109050561Just one thing. If you use character loras, there's a chance it will leak onto the non-lora character.
>>109050492meant for >>109050402
>>109050492meant for >>109050545
How is Wan2_Bernini? Mogged by Wan21_SCAIL-2? Everyone seems to switch to scail2 immediately.
https://files.catbox.moe/b34hy8.mp4
kenji is getting handsy
>>109050654meant for >>109050660
brat
>>109050156ah, that sucksso you'd even have to use the perfect mask to rearrange the characters left to right on a temp image?
>>109050156It freaks me out to think about how many of the people who made original animation died in a fire.
https://github.com/Brobert-in-aus/scail-auto-extend
My SCAIL-2 auto extend now has an improved SAM3 identity tracker which lets you draw bounding boxes around each subject.
It's not a perfect solution, the model only takes the masks as a suggestion, which is why it didn't order the characters as they were masked in this gen:
>>109050133
>>109050156
The main benefit of this new node is that you can force every character present to be identified and improve the chances they are accurately tracked through the video.
Technical breakdown if anyone gives a shit.
SCAIL-2 has only one reference latent, reference_latents[-1]. This means all characters live as spatial regions inside a single composited frame. Their appearance is encoded by where they are in that frame.
The colour signal is an additive embedding: x = x + patch_embedding_mask(ref_mask_latents) and scail_x = scail_x + patch_embedding_mask(sam_latents), competing against RoPE positional encoding inside full attention. When the reference composite is a horizontal row that mirrors the driving row, same-x reference tokens are positionally "closest," so position dominates the additive colour nudge. And you can't sidestep it with true per-identity references, because the model only ever consumes reference_latents[-1].
The model also requires all characters to be present in the first frame of the video. SAM3 will happily mask out late arrivals, but the model won't, again due to the single reference latent which insists there are more characters and will force them into the frame.
This also bleeds through to jump cuts. If there are two characters and they've swapped places after the jump cut (e.g. the camera jumps to the other side of them), then the spatial relationship in the reference latent will override the mask and cause a character mixup (as you can see in the attached).
>>109050921
Yep, arrange the input image characters left to right matching their intended replacements.
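For anyone who wants to sanity check their layout before burning a gen: a minimal sketch of the left-to-right pairing described above, assuming the model matches purely by horizontal position. The function names, the bbox format (x_min, y_min, x_max, y_max) and the character names are all made up for illustration, none of this is from the SCAIL-2 code or the auto-extend node.

```python
from typing import Dict, List, Tuple

# Hypothetical bbox format: (x_min, y_min, x_max, y_max) in pixels.
BBox = Tuple[float, float, float, float]


def order_left_to_right(bboxes: Dict[str, BBox]) -> List[str]:
    """Return character names sorted by the horizontal centre of their bbox."""
    return sorted(bboxes, key=lambda name: (bboxes[name][0] + bboxes[name][2]) / 2)


def predicted_pairing(reference: Dict[str, BBox],
                      driving: Dict[str, BBox]) -> List[Tuple[str, str]]:
    """Pair reference and driving characters by left-to-right rank only.

    This is an assumption that mirrors the behaviour observed in the thread
    (position beating mask colour), not the model's actual matching logic.
    """
    return list(zip(order_left_to_right(reference), order_left_to_right(driving)))


if __name__ == "__main__":
    reference = {"teto": (600, 0, 900, 800), "miku": (100, 0, 400, 800)}
    driving = {"dancer_a": (50, 0, 300, 700), "dancer_b": (500, 0, 800, 700)}
    for ref_name, drv_name in predicted_pairing(reference, driving):
        print(f"{ref_name} -> {drv_name}")
```

If the printed pairing isn't the one you want, rearranging the reference image is probably cheaper than fighting it with mask colours.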
>>109050962Nice work anon. Seems like you can just stop genning at the cut away frame and retrack with a second group. Not an insurmountable flaw. I am glad we got Wan21SCAIL2
>>109050865I'd love a cigarette right now
>>109050962>The model also requires all characters to be present in the first frame of the video. SAM3 will happily mask out late arrivals, but the model won't, again due to the single reference latent which insists there are more characters and will force them into the frame.>This also bleeds through to jump cuts.I was actually wondering about this. Good to know.I recall bernini was better at this in terms of actually tracking identities? Haven't tried it yet (and I don't really understand the model papers, just the empirical experimentation which of course is slow).
>>109051020
I did some digging through the model card and there's (very experimental) support for multiple reference images.
In theory, if each character+mask is in its own reference image then the spatial relationship goes away and the model is forced to rely on colour.
Will require some fuckery around having either multiple input images or automatically separating by mask (IIRC the SAM3 nodes already natively support this so might not be too painful).
>>109051061
Bernini is next on my list once I'm bored of SCAIL, optimising it is tickling my 'tism so probably going to be a little while though.
>>109051061
Bernini does not have extension built in, which kind of makes it worse than SCAIL2. Kijai didn't include an extension function, so I assume it was not easily extendable.
>>109050865south korea is such a dirty shitty place
>>109051061
I tried bernini, sometimes it outright didn't replace the subject. It simply rendered the same video. Huge waste of time.
>>109051075 >>109051099Maybe SCAIL-2 is overall better and perhaps it'll be easier to just work cut by cut? I guess I'll have to try at some point later.
where scail2 workflow
>>109051141I think you want the extension and workflow linked as git repo in >>109050962
The hair error at the end is unfortunate, and caused by the male dancer being momentarily entirely occluded by the female dancer, but the footwork is flawless with no body horror, which I don't think any other method can do.
>>109050962>>109051149is this sota?
>>109051162it's good for character replacement.IDK if it's SOTA for it, never mind other tasks
>>109051162fuck no, not even close. api surpassed this by miles
>>109051159the arms also aren't tracked entirely exactly but yes, it's greathair appearing on teto is a funny quirk
>prompt: she lip bites in the end
>>109051204Proof?
>>109051159u need to increase source video FPS to get better result, but that also means shorter video
>>109051226im using infinvideo
>>109051099
Try the r2v workflow here
https://github.com/amao2001/ganloss-latent-space/tree/main/workflow/2026-06-10%20bernini
The set/get nodes were breaking the outputs for some reason so I got rid of all of those and just connected them manually.
>>109051250so benis better than scail?
>>109051268Doesn't look better than scail from my testing but they're both used differently. Bernini is used to edit videos or for image references while scail is for direct video motion transfer.
Sometimes it captures facial expression perfectly, sometimes it doesn't
>>109051268
Tried both. SCAIL is the better character tracker. Bernini takes too long and maxes out at 81 frames, and it needs dual samplers for the high and low models. SCAIL2 is based on wan2.1 with a single sampler and can extend indefinitely.
Maybe you can use Bernini to edit videos, but with any major flaw you can just regenerate a new video anyway.
ain't nobody told me if scail can do goon yet
>>109051315dude, 75% of the videos in these threads now are whores bouncing their poopers
>>109051333that's not even softcore
>>109051333>whores bouncing their poopersvanilla fag
Bumping up resolution (unsurprisingly) boosts quality of the gen.
>>109051226
I'll try boosting the framerate, but wan is trained on 16 fps so it might get weird.
>>109051315
It can do goon exactly as well as base WAN 2.1, though I haven't tried adding any loras beyond lightx2v
>>109051366i dont see his reflection
>>109051366damn that's smooth
>>109051268Wrong question.Bernini is better than VACE.
>>109051366has no one mentioned the compression like artifacts?
>>109051380what does the vp have to do with bernini?
>>109051375
>>109051366>>109051290>>109051216>>109051159>>109051102>>109050962>>109050734so what's the use case for this?
>>109051425goonsex
>>109051428I mean besides goon material for jeets
>>109051366why don't people interpolate their video? adds only few sec extra gen time
>>109051438what do you use diffusion for? show us
>>109051439workflow for interpolation?
>>109051425
idk, it is a cool demonstration
neither economic feasibility nor practicality is my concern
>>109051425for fun
>>109051446
>>109051475thats an image
>>109051479r u retarded
>>109051443book cover and character portraits for my webnovel series>>109051464>>109051453don't you want to make money with this?
>>109051496benchod
>>109051475based>>109051479cringe
>>109051425Probably the best way to insert a character into a scene, but you need a reference video so it kind of sucks. Maybe if you use it on 3D animated videos you could create cinematic scenes with it.
>>109051501>making moneyi'd personally be ashamedquality is getting really close to there but not really 'there' yet
>>109051519
Quality has been usable since wan 2.2. But no one here is an artist so they don't know how to actually use any of this outside of gooning.
>>109051501>my webnovel seriespost 1 page
trolled!!!!!!!!!
>>109051529i am an artist and idk how i can use those as-is, reallybesides placeholders or brainstorming where those enable me to easily check and dispose many ideas in a clearer form until the vision seems alright
>>109051541asuka got me actin' like a migrant in berlin
>>109050962
>My SCAIL-2 auto extend now has an improved SAM3 identity tracker which lets you draw bounding boxes around each subject.
great feature, ty!
the actual UI to define the bboxes doesn't seem to quite work right here (e.g. right click to delete a bbox, after some other UI interactions) but could anyhow be better if copied from the one in kj's ideogram prompt builder if that's feasible. tho having the additional point instead of a bbox is very nice.
also good that the v2 workflow has frame interpolation and defaults to saving in a dated subfolder
>>109051535
>>109051545>i am an artist and idk how i can use those as-is, reallyI've had 10+ projects. Used wan + conversion for simple idle animations few times. Rest is image generation/editing
>>109051566
please tell me you're memeing
what happened to tdrussell?
>>109051582https://huggingface.co/weeblabs
>>109051439
I was running it before, turned it off since I'm experimenting and just want the gens done as fast as possible.
>>109051549
thanks for the feedback, I'm deep in testing around exactly how the model handles masking and will push an update with those fixes once it's all properly working (if it works)
>>109051608
another bit: purely usability wise it'd also be nice to have the identity tracker node include the object detection thresholds on the reference/driving images for text-based conditionings too (or split it out to the text conditioning and have inputs for one reference and one driving conditioning?)
you'd be able to use the text based functionality from the v1 workflow too without rewiring nodes
When are we getting realistic anima?
>animaoutdated slopware. after using ideogram, anima feels like sdxl in comparison. even the basic ideogram nsfw loras on civitai are more powerful than any chromakek or boorubrown slop.
>>109051625
* or just allow feeding it as driving_track_data / ref_track_data like in v1
>>109051632
people already started making loras and finetunes for that, with some success
>>109051632if you cant into training then wait for anon to bless you with an upload
>>109051654hi anon can you bless me?
based sperg out
>>109051632>he doesn't knowbase model + turbo lora + negpip + klein 9b edit as a refiner, already does near-perfect realism even for nsfw
>>109051714>negpipqrd
>>109051720
you can put (some bullshit:-1) in the positive prompt and it acts like a negative, so it can be used on turbo cfg1 models. Even on normal undistilled models it's stronger and better than an actual negative.
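To make the syntax concrete, here's a rough sketch that just pulls the (text:weight) spans out of a prompt and lists the ones with a negative weight. It only illustrates the prompt notation described above; it is not how negpip or any ComfyUI node actually applies the weights, and the regex and function names are my own assumptions.

```python
import re
from typing import List, Tuple

# Matches "(some text:-1)" or "(some text:1.2)". Nested parentheses are not handled.
WEIGHTED_SPAN = re.compile(r"\(([^():]+):\s*(-?\d+(?:\.\d+)?)\)")


def split_weighted(prompt: str) -> List[Tuple[str, float]]:
    """Return (text, weight) for every '(text:weight)' span in the prompt."""
    return [(m.group(1).strip(), float(m.group(2)))
            for m in WEIGHTED_SPAN.finditer(prompt)]


def negative_terms(prompt: str) -> List[str]:
    """Return only the spans whose weight is below zero."""
    return [text for text, weight in split_weighted(prompt) if weight < 0]


if __name__ == "__main__":
    prompt = "1girl, city street, (aliasing:-1), (detailed background:1.2), (blurry:-0.8)"
    print(negative_terms(prompt))
```

Handy as a quick check that the negative-weight terms you think are in the positive prompt are actually written in a form the parser would see.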
>>109051733This one? https://github.com/BigStationW/ComfyUI-ppm
Need to update the inpainting logic and add more logic of what >>109051714 is saying for better composition. I have an inpainting bug where I can't detect clothes with florence2 so I need to fix that up. I need better adetailer logic to identify characters for more facial refinement but... I could just do an inpaint pass and ignore adetailer altogether honestly
>>109051740yea
>>109051750thanks mane
>>109051748what is it written in? also are you using like qwen 3.6 for the chat? pretty neat
>>109051685
>>109051777This is all in typescript/python Also I'm using gemma-4-26B-A4B-it for speed, I might use a smaller model once I decide to use heavier models but for testing models this is enough once jail broken.The dense gemma models are trivial to jailbreak so they won't fuss if I tell these two to start licking clits or some shit. I want to use 31B but I think I'm hitting diminishing returns for these types of task.
>>109046565kys fren
>>109051878you are brown
It seems that Wan2.1 and Wan2.2 LoRAs may work with SCAIL2 to some degree. At least the output isn't broken.
>>109051885proof?
>>109051648
>>109051625
Updated the Identity Tracker and pushed a V3 workflow, now with improved character segmentation.
If auto_detect is true, then it's allowed to add more masks beyond what bboxes exist (up to the model limit of 6) using the text conditioning.
That only applies to the reference image, the driving video is capped at the number of masks on the reference image (you can't have five image characters replacing four video characters, for example).
Creating masks via text conditioning is always optional, if it can't find any objects to mask (or all maskable objects are already inside bounding boxes) it won't create any.
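The capping rules above can be sketched roughly like this. Only the limit of 6 and the auto_detect name come from the post; the function names and the idea of passing in plain counts are assumptions for illustration, not the node's actual code.

```python
MODEL_MASK_LIMIT = 6  # model limit stated in the post


def reference_mask_count(num_bboxes: int, num_extra_detections: int,
                         auto_detect: bool) -> int:
    """Masks used on the reference image.

    num_extra_detections counts text-conditioned detections that are not
    already covered by a hand-drawn bbox (hypothetical input).
    """
    extra = num_extra_detections if auto_detect else 0
    return min(MODEL_MASK_LIMIT, num_bboxes + extra)


def driving_mask_count(num_driving_detected: int, num_reference_masks: int) -> int:
    """The driving video never gets more masks than the reference image has."""
    return min(num_driving_detected, num_reference_masks)


if __name__ == "__main__":
    ref = reference_mask_count(num_bboxes=2, num_extra_detections=3, auto_detect=True)
    drv = driving_mask_count(num_driving_detected=6, num_reference_masks=ref)
    print(ref, drv)
```

With auto_detect off, the reference count is just the number of bboxes you drew, and zero extra detections is always a valid outcome.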
>she took the forksrip kenji
>over 1300 gens without issue
>launch Comfy
>subgraph cannot be connected to anything else
>nothing has changed and everything is still connected the same way it was since I last closed it
>unpack and repack it
>rewire everything back up exactly as it was before
>it works
Man... I don't need the random skill-check, Comfy. Fix your shit!
>>109051425Paid shill behavior. Real anons don't run around doing free promo for some model.
>>109052135>1300 gens without issueOf Jenny?
>>109052135
My rgthree nodes break like 20 times a day, no conflicts mentioned. It doesn't even appear as broken when inside subgraphs either.
>>109052135>subgraphSlopper
Looks like lodestones is back trying to make another pixel t2i fork from chroma called zeta. I suspect more money is being burnt in real time.
>>109048034How are you doing this on Anima? Is it a specific 2B cosplay lora or a realism finetune?
>>109052146Mostly, but a lot of it is just testing. Still hunting for that sweet spot where the LoRAs in use aren't blowing up the latent space (it's a lot of different scaling with all the varying steps, samplers & schedulers involved).>>109052167>It doesn't even appear as broken when inside subgraphs eitherYeah, it sucks that there's almost no indication of what's broken and why. I had to go through everything with a fine-tooth comb when I was building my workflow out.>>109052172I wanted to better group the settings I frequently touch, sorry for trying to be efficient!
>>109052135I have had this happen before. Next time it happens, try bypassing the entire subgraph, then unbypassing. No promises, but for whatever reason cycling it like that fixes it for me ~80% of the time without having to renoodle.
>>109052270I'll definitely keep that in mind, thanks!
Seen people drool over Ideogram 4 on a bunch of places, but I only trust you anons. What is the catch?
>>109052337
nothing but shills here bucko
use anima
>>109052337> What is the catch?Requires good video card.
>>109052392how good? Will 4 GB not work?
>>109047844Where to get the scail mask nodes?>SCAIL2ColoredMask
>>109052337the catch is that local is dead and you should buy claude fable>b-b-but Fable got shut downthen ai is dead ig
>>109052397nightly comfyui
>>109052337
The time spent setting up the json, even with a prompt generating node and/or the nodes which let you drag and resize the bounding boxes, makes it functionally a very slow model.
You can't just set up your prompt and batch gen, you need to edit each prompt.
If you're willing to do that it's great, and indications so far are that it's very easy to train so plenty of loras will be arriving, but it's slow even if you're not a promptlet, and if you are it's giga-slow.
Also I found an error in my workflow (the Scail-2 Autoextender one), the image sizes need to be divisible by 32 or you get the top pixels of the image wrapped around to the bottom, like in the attached, so anyone using it should grab the updated version.
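If you're feeding your own sizes in and want to avoid the wraparound, a minimal sketch of snapping width/height to a multiple of 32 looks like this. It rounds down so it never asks for more pixels than you gave it; the helper names are made up, and the actual workflow may round differently.

```python
from typing import Tuple


def snap_to_multiple(value: int, multiple: int = 32) -> int:
    """Round value down to the nearest multiple, but never below one multiple."""
    return max(multiple, (value // multiple) * multiple)


def snap_size(width: int, height: int, multiple: int = 32) -> Tuple[int, int]:
    """Snap both dimensions so each is divisible by `multiple`."""
    return snap_to_multiple(width, multiple), snap_to_multiple(height, multiple)


if __name__ == "__main__":
    # 1280 is already divisible by 32; 720 is not (720 // 32 = 22, 22 * 32 = 704).
    print(snap_size(1280, 720))
```

Rounding down means a small crop or resize of the input, which is usually less noticeable than padding.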
ldg approved artist tags?
gov is starting to use png info scrapers so they can scan for bad words used in prompts if they get ur pc
crazy hamburger!!!https://files.catbox.moe/diilw1.mp4
>>109052236Left is from 6 weeks ago. It definitely produces less outright body horror. I guess we will see whether it converges to anything useful. Two more months.
>>109052245Realism lora I've been working on, does everything
>>109052544sovl vs soulless
>>109052392Alright, so it's slow. Seen ballparks of about 2min on 4090s for 2MP.>>109052410JSON prompts look really ass, that's for sure. But then again, it's probably worth it instead of Klein 9b base, since that's what I've used up until now.
>>109049884why do you use rtx upscale 3x then rtx upscale 2x, then downscale?
>>109052615this is why i got bullied on the way to school
https://files.catbox.moe/yk01xe.mp4
>>109052254>I wanted to better group the settings I frequently touch, sorry for trying to be efficient!Using subgraphs is an implicit concession saying that the node design is flawed, that some nodes are bloated and try to do too much, while others are overly narrow or hyper specific and there is no coherent underlying logic governing how they were organized
>>109052478id4?
>32 notifications on civitai
>>109052569more like shit vs piss
>>109052846>>109052846