Discussion and Development of Local Image and Video ModelsPrevious: >>108590807https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/tdrussell/diffusion-pipe>Zhttps://huggingface.co/Tongyi-MAI/Z-Imagehttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>Animahttps://huggingface.co/circlestone-labs/Animahttps://tagexplorer.github.io/>Qwenhttps://huggingface.co/collections/Qwen/qwen-image>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>LTX-2https://huggingface.co/Lightricks/LTX-2>Wanhttps://github.com/Wan-Video/Wan2.2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>Illustrioushttps://rentry.org/comfyui_guide_1girl>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkCollage: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
Remember Anima discussion belong to Anime generals, dont be spiters
>>108598018>Remember Anima discussion belong to Anime generals
>>10859801835 stars status?
>>108597999:]
Blessed thread of frenship
>>108597963>no Anima gens>no anime gensCan someone tell me why tdrusell chose this general to shill Anima?
>crying over the faggollage
>>108598106>why tdrusell chose this generalbecause it's a great general, the proof is that you lurk here often, meaning that you also enjoy it very much
>post in my shitty general!
https://huggingface.co/duongve/AnimaYumewhy are they finetuning an unfinished base model? lool
>>108598123No, if I’m here it’s because tdrusell post here. I’m not interested in seeing 3dPG, Zimage slop, or Chroma slop or some new scuffed DOA local model release.
>>108598145>Zimage slopit's a good model anon, it can even do good anime images out of the box :(
>>108598106He caters to me personally.
>>108598054
>>108598176Where did you find this pic of me?
>>108598186you need to sleep anon!
>>108598186Took a selfie and asked Gemma4 to caption it. The result? "average /ldg/god"
>mfw Resource news04/13/2026>LTX 2.3 Distilled v1.1https://huggingface.co/Lightricks/LTX-2.3/blob/main/ltx-2.3-22b-distilled-1.1.safetensors>UniCom: Unified Multimodal Modeling via Compressed Continuous Semantic Representationshttps://huggingface.co/tencent/Unicom-Unified-Multimodal-Modeling-via-Compressed-Continuous-Semantic-Representations>CatalogStitch: Dimension-Aware and Occlusion-Preserving Object Compositing for Catalog Image Generationhttps://catalogstitch.github.io>Realizing Immersive Volumetric Video: A Multimodal Framework for 6-DoF VR Engagementhttps://github.com/Metaverse-AI-Lab-THU/ImViD>Seeing is Believing: Robust Vision-Guided Cross-Modal Prompt Learning under Label Noisehttps://github.com/gezbww/Vis_Prompt>MixFlow: Mixed Source Distributions Improve Rectified Flowshttps://github.com/NazirNayal8/MixFlow>VisionFoundry: Teaching VLMs Visual Perception with Synthetic Imageshttps://zlab-princeton.github.io/VisionFoundry>Tango: Taming Visual Signals for Efficient Video Large Language Modelshttps://github.com/xjtupanda/Tango>VL-Calibration: Decoupled Confidence Calibration for Large Vision-Language Models Reasoninghttps://github.com/Mr-Loevan/VL-Calibration>pixlstash v1.0https://github.com/Pikselkroken/pixlstash/releases/tag/v1.0.0>SD Forge — CivitAI Helperhttps://github.com/ArthureCodage/sd-forge-civitai-helper>Is AI the greatest art heist in history?https://www.theguardian.com/books/2026/apr/12/is-ai-the-greatest-art-heist-in-history>VisionCaptioner: Automated image & video captioning using Qwen-VL and SAM3https://github.com/Brekel/VisionCaptioner04/12/2026>Stretchy Studio: FOSS 2D animation tool for turning static illustrations into mesh-deformable charactershttps://github.com/MangoLion/stretchystudio>LTX-2 VBVR LoRA - Video Reasoninghttps://huggingface.co/LiconStudio/Ltx2.3-VBVR-lora-I2V04/11/2026>ComfyUI-RookieUI: The ultimate A1111-style sidebarhttps://github.com/rookiestar28/ComfyUI-RookieUI
>mfw Research news04/13/2026>InsEdit: Towards Instruction-based Visual Editing via Data-Efficient Video Diffusion Models Adaptationhttps://arxiv.org/abs/2604.08646>CT-1: Vision-Language-Camera Models Transfer Spatial Reasoning Knowledge to Camera-Controllable Video Generationhttps://arxiv.org/abs/2604.09201>On Semiotic-Grounded Interpretive Evaluation of Generative Arthttps://arxiv.org/abs/2604.08641>SCoRe: Clean Image Generation from Diffusion Models Trained on Noisy Imageshttps://arxiv.org/abs/2604.09436>Training-free, Perceptually Consistent Low-Resolution Previews with High-Resolution Image for Efficient Workflows of Diffusion Modelshttps://arxiv.org/abs/2604.09227>ELT: Elastic Looped Transformers for Visual Generationhttps://arxiv.org/abs/2604.09168>EGLOCE: Training-Free Energy-Guided Latent Optimization for Concept Erasurehttps://arxiv.org/abs/2604.09405>Post-Hoc Guidance for Consistency Models by Joint Flow Distribution Learninghttps://arxiv.org/abs/2604.08828>MeshOn: Intersection-Free Mesh-to-Mesh Compositionhttps://threedle.github.io/MeshOn>BlendFusion -- Scalable Synthetic Data Generation for Diffusion Model Traininghttps://arxiv.org/abs/2604.09022>Strips as Tokens: Artist Mesh Generation with Native UV Segmentationhttps://arxiv.org/abs/2604.09132>Region-Constrained Group Relative Policy Optimization for Flow-Based Image Editinghttps://arxiv.org/abs/2604.09386>Detecting Diffusion-generated Images via Dynamic Assembly ForestsDetecting Diffusion-generated Images via Dynamic Assembly Forestshttps://arxiv.org/abs/2604.09106>RIRF: Reasoning Image Restoration Frameworkhttps://arxiv.org/abs/2604.09511>AniGen: Unified S3 Fields for Animatable 3D Asset Generationhttps://arxiv.org/abs/2604.08746>Do Vision Language Models Need to Process Image Tokens?https://arxiv.org/abs/2604.09425>LADR: Locality-Aware Dynamic Rescue for Efficient T2I Generation with Diffusion LLMshttps://arxiv.org/abs/2603.13450
>>108598135I dunno, if you download any of these anima "finetunes" and drop in the workflow of one of your gens and regen with it you get almost the exact same picture so I can only assume they want to steal credit for how good the model is.
>>108598242"Is AI the greatest anime tiddie in history?"
>>108598018but it's as good for generating realism as zit
>>108598330proof?
>>108598135Why not? These finetunes work.
>>108598334zimg>>108597460>>108597374>>108594855anima>>108591129>>108591142
>>108598380really? looks like some Qwen Image slop, the skin is smooth as fuck
>>108598401so as zit2b model the best for local anime and as good for realism as big chink models that's crazy
>>108597866Why are offloading all models? Disable layer offloading.Probably set TE precision higher and enable unload TE.Disable caption dropout probably.Differential Guidance meme didn't workout too well for me for other models, but try your luck I guess.
>>108598449thanks for the response anon. going to try this another day when I'm moralized again :'( . i wasted a whole 8 hours of the bullshit and was bored as fuck. I wish there lora commissioners available for high end models like ltx and qwen.
What's the best way to local gen images on Android these days?
>>108597963Question of about lora trainingWhat happened if i overtagging?Like, i used 2 model to tags image, and both of them give different tags
i'd post my gens but this bitch ass cuck asshole nigger of a board won't let post images in incognito mode and i'm rangebanned fuck this
>>108598714not a big lossyour images are shit anyway
>>108598897this
Local is dead
>>108592106I really liked this one, what's the model being used Anon?
Hello, I uploaded another lora, feel free to post your questions here.https://civitai.com/models/2540444/anima-highresaesthetic-boost>>108598018>>108598106I’m not going through all the 4chan generals, I’ll just post here.
>>108598971Based kingruss
>>108598971Do you need datasets for non anime artists?
>>108598971Thanks!
uh oh meltiy incoming
>>108598971>IYou aren't me. Don't believe his lies.But the lora is pretty useful for generating at >1024 res. Aesthetic effect is more subtle but I don't think that's necessarily a bad thing.
>>108598971Thanks for sharing! Do you have plans to make a furry finetune in the future?”
Is it possible to use ZiT or Klein on a 12GB card?
>>108599035please post in anime generals i beg you!
still waitin on the realism lora
>>108598971>feel free to post your questions here.I'm not seeing any images on civitai :(
>>108598971Russ I am busy this week so I will probably make my huggingface post about it next week, but I should give you a heads up so that you can hopefully take your time to test it on your own.Have you compared character knowledge of preview 3 vs preview 2? I see it struggling with some characters that preview 2 could do easily, but now preview 3 is struggling to do them with same consistency. I love your work with anima but it got me worried a bit.
>>108599058>i beg youlmao, get fucked
>>108598971>feel free to post your questions here.maybe it's a dumb question but, why a lora? why can't it be part of the Anima finetune?
>>108599064Civit has to "analyze" the image for safety before it shows up, and that service seems to be slow or broken right now.
>>108599095then show your images here in the meanwhile, I wanna see how your high res images look like
>>108599103They're all paired images showing before/after and are over 4MB, just wait a few minutes for Civit to unfuck themselves.
>>108598971Dude your model still doesn't recognize Rin Tezuka, c'mon :d (apart that complain, your model is really solid, good job)
>>108599083Anima has flawed architecture, get over it.
>>108598988>>108599114you are hard dude to reach
>>108599124keep crying, you lost
>>108598971Good model CHADrusell!
>>108598971is there a point in going for higher res? does that improve the hands for example?
>>108598971WHY DO YOU POST HERE? DO YOU LOVE CATJACK?
>>108599170>realistic backgroundpure slop lool
>>108599170pure kino
>>108599119>doesn't recognize Rin Tezuka? it does
>>108598971is there a reason why you decided to go for nvidia chronos as a base model? I mean, c'mon!
>>108598971fix the shitty hands
>the scamming jeets are shitting the Tongyi discord placegood, that's all they deserve for not releasing Z-image edit model kek
>>108598971Please, kingruss throw as a bone in /hgg/, we are using your model everyday!>>>/h/8860124>>>/h/8860086>>>/h/8860048>>>/h/8859813
>>108599213Why would he post in a dedicated hentai thread? Like why do you think every example image on the Civit page is SFW? Same shit as Noob, everyone knows what the model can do but there's reasons to not openly advertise that.
Anatomy seems worse at higher rez with the lora and it was already a bit of a problem. I think I'm just gonna continue upscaling. In particular some body parts get loooong.
>>108599257/adt/ is sfw...but i lurk here anyways so it doesnt matter to me where he posts
>>108599279/adt/ is fucking dead sometimes there are 24 hour periods with 5 posts
https://huggingface.co/lightx2v/Wan2.2-Distill-Models/tree/main>they're still making new lightning loras of wan 2.2lmaoo
>>108599274Works alright in my experience.
>>108599297>works alright>posts an image of a girl with a dislocated shoulderlmao
why /adt/ is dead? :(
>>108599260>>108599297the colors are too saturated, decrease the cfg I guess
>>108599308He’s a dev, not a serious anime genner and he posts his gens in the sloppiest, most casual diffusion threads.
>>108599213Thanks for the (You)s, kind stranger.
mugen and chenkin status? any gens posted with them yet?
>>108598971finally, we can see the images
>>108599350>its slopOHNONONO
>>108599353it's just a lora after all, once he ends up the finetuning of Anima with high res it'll be better
>>108599350Uh oh, nice WAI lora tdrussell!
>>108599364I really hope it doesn't end up looking like this then
>>108599346Worthless finetroons of an ancient model
>>108599350Thanks I missed WAI so much!
>>108599257it's a falseflag, this dude comes to hgg to do the same shit, nobody else cares
>>108599382hello /hgg/ lurker! welcome to /ldg/ the only serious anime general
>>10859939635 stars status?
SOUL - SOULLESS
>>10859941035 post per day status?
WAI won
>>108599433yeah it's completly useless if it slopify the output
>>108599433But you only posted the latter...
>>108599434>people notice I'm ani 35 times per dayoof you're pretty bad at hiding :d
>he is oofing
>>108599162uh oh, melty
>>108598971Works fine for me when going to 2MP and a bit beyond compared to without, minimal influence on artist tags. Nice job, eagerly on my knees for preview4/the final release with the 1536 pass.
>>108599576based WAI enjoyer
>>108599583I was just testing the lora lil bro, never gen that high to begin with anyway.
>>108599565Based TamziyGOD bulling Ani
>>108598971Based /ldg/ enjoyer
>>108599322this is really goodbox please?
https://www.reddit.com/r/StableDiffusion/comments/1skds12/update_distilled_v11_is_live/>no examplesI won't fall for your jewish tricks
>>108599350>>108598971Just call it illustrious 2.0 lora
I wanna make some cuck sloppa, any recommendations
>>108598971>https://civitai.com/models/2540444/anima-highresaesthetic-boostNoob here, what model do I need to use this with?
how can i get a realistic effect in anima preview 3? i've tested all the realistic triggers from pony-Illus…
>>108599894The page contains enough information:>Base Model>Anima>About this version>Trained on preview3I will handhold you further though:https://huggingface.co/circlestone-labs/Anima/tree/main/split_files
Anima is getting decent at replicating characters, but the details are still missing sadly.
>>108599911Ok, I dont know which anime model though, so this one wont work:>https://civitai.com/models/2458426/anima-officialanyways thanks
>>108599960>anima knows this generic chink slop but not canari_(pokemon)hmm sus
>>108599975by details, I meant smaller designs/patterns/etc. on characters.
>>108599960Flawed architecture, but still better than SDXL. I’m waiting for Noob 2.0 meanwhile using Chenkin 5.0 or base Noob as a hires pass/ detailer desu
>>108599960>the details are still missing sadly.that's what happens when you go for a meme base model with a subpar vae >>108596443
>>108600019just wait for bluvoll's Anima Flux2VAE rectified flow
>>108600095lmao'd.
Why does it take so fucking long to release the full model of Anima?What the fuck are they even doing?
>>108599902>realistic triggersIf you mean tags like realistic, photo-realistic these will just create slop.Just write a natural language description. It's hit and miss when it comes to realism though.
I still can't do two unique characters without it morphing them into one or mutating them. I've used Forge Couple, etc, but it just doesn't work. I had to give up and use Nano Banana...
>>108600233what if you give them names?
>>108600233what if you give them dicks?
>>108600245They do have names. They even have their own Booru tags.
>>108600233I mean, obviously a 0.6 TE won't do miracles here
If they won’t use Zimage, then Noob2 should be trained on Mugen, and the money saved should be invested in recaptioning the dataset with current VLMs. UNet is still kino in some respects.
>>108600233NovelAI has a special framework designed to solve this. This is why SaaS is superior to local. SaaS actually addresses problems that users face
>>108600283There isnt and wont be a good for all model, SDXL is better at quickly capturing styles and merging aesthetics. SDXL for the hires fix pass is kino in Anima and Anima has much better composition than SDXL. To me the two should coexist and complement each other.
>>108600296V5 is coming, Anima should sell their stocks before the big arrive.
>>108600296the "problem" already has like 6 different local solutions.
>>108600159share your catbox bro. anima works fine with anime. i just want to try the photorealistic part
https://huggingface.co/obsxrver/wan2.2-i2v-lightx2v-260412/tree/mainnew lightning loras
>wan 2.2local really is dead
>>108600613What was improved/fixed?
>>108600613>check user's profileno
>>108600233>what is regional conditioning
Please ... I need to artist mix ........
I don't get why this thread keeps saying LTX has no loras. There's plenty of decent NSFW loras on civitai
>>108600961I've never seen that said anywhere. Just that LTX has garbage quality.
>>108600715apparently is extracted from kijai's lorashttps://huggingface.co/lightx2v/Wan2.2-Distill-Models/tree/main
i just started genning locally with Comfy, and its kind of confusing.How do I set it up so that I can start generating stuff that doesn't look like complete dogshit
>>108601095went from 160.33s to 143.41 using this
>>108601238download a workflow for a model you want. google comfyui workflow + model name
>>108601238check other's prompt on civitai for the model ou are using (and similar models), also since this is sdxl, minimum of 1mp resolution
To jailbreak anima to do realism I think you need to do nat lang postive prompt. booru tags in negative only.
Okay, so there's a million fuckin AI GF websites that go:>Pick realistic or anime>Pick race>Pick hair color>Pick bust size>Pick ass size>Pick relationship>Pick 3 hobbies from a long list, a lot of which are hyper-specific>Then it asks you to log in with Google or make an account, then asks for credit cardHell, there's a 50% chance your 4chan ad is for one right now.Their urls and specific genned images and videos are different but they're clearly all running the exact same software. And for there to be that many, it's probably something prebuilt and easy to set up.So, any ideas where to find the guts? I don't want to make my own website, I just want to run it locally. I have the hardware, so fuck paying those blatant scammers.
>>108601279I've noticed you can weight tags extremely high on anima without it breaking the image, so try (real photo,:3.0) or some shit. Also try (cartoon, drawing, 2D,:3.0) in the negative.
anima highres lorahttps://civitai.com/models/2540444/anima-highresaesthetic-boost?modelVersionId=2855073with/without
>>108601309There's no clip to interpret those weightings, you're just asking an LLM to interpret those strings anon
>>108601380catbox?
>>108601385https://gofile.io/d/AvR7P6
>>108601372Try it, retard, it works.
>>108601306the guts are probably just a base model and half a dozen loras and a basic bitch llm to write a prompt.>user selects photorealistic black woman with big tits and a small ass.>user selects surfing and cooking as hobbies.>load realism model + black woman lora + big tits lora + small ass lora>llm prompt big titty black bitch lying on surfboard and rubbing her vagene with a bigmac.
>>108601415thanks
>>108599549>oh i am oofing
>>108601388What is this sexy outfit called
>>108601385can't>>108600538
I've looked through a lot of this info and a lot of these options but I can't find what I'm looking for: I want something akin to chatgpt's "upload an image and a prompt to edit it" where i can do something like post a picture of a green ball and say make it red with blue stripes. Any good options you guys know?
>>108601471once it makes a "character" i'm fairly certain the outputs are consistent though.wouldn't be much of a girlfriend if she looked like an entirely different girl every time she sent a photo.
>>108601648Flux2 KleinQwen Image Edit
>>108601654my fucking hero, thank you king. qwen is perfect aside from not seeming to have an offline/local version, is there a way to do that with it?
>>108601682yeshttps://www.youtube.com/results?search_query=Run+Qwen-Image-Edit+Locally
>>108601726thank you again for the spoonfeed, i found unsloth and that seems pretty cool so im trying that currently
beautiful baby girls deserve my kisses
>>108601653well it's not that hard to get consistent gens for generic 1girl shots, worst case you could gen a big batch and then run them through a face analyzer and only output the best matches. if you are using loras it just gets easier.
>>108601306Most popular sites like that are sold in a white glove service by several sites, this is onehttps://www.scrile.com/aiIts basically pay and deploy but I don't know if you can tinker the workflows or stuff like that, setting an AI adult site from the ground can be tricky, since you will have to invest money and time on hosting, coding (even with vibecoding), marketing, payment processors, creating and setting up the characters, it could take you several months
>>108601380>>108601744>>108601749>>108601751>>108601778Damn does this bitch just never wash her clothes?
>>108601823do helen frankenthaler
>>108601831
i managed using realistic anima. cleary better than klein, qwen,sdxl. and no need millions loras for the body, yay
>>108601949Thanks for posting an example gen, anon!
I iterated over this with dozens of different prompts and tried three different models and every time it adds a weird light in the middle of the scene.
>>108601949post gen.
>>108601962I tried describing a gunfight (can't say firefight or it will think like putting out fires) and all the projectile trails (can't say tracers or your image is overwatch themed now) always are coming from the light in the center of the image, often in a pillar going vertical into the sky. If I describe soldiers or silhouettes in the perimeter it places them surrounding whatever pyre is in the middle, half of the are bowing to it like in worship or something. Now it keeps adding a dog in it for no reason in every seed.I was looking through images on civitai thinking that I'm just shit at prompting. But it turns out every prompt in there is half ignored anyway. Like I saw one with "disembodied limb" that didn't feature a disembodied limb in the image. This shit is a complete joke.It's not even that people can't generate good looking images. I can make convincing images that I find interesting but it's never actually what I had in mind or intended. And it's clear none of the stuff other people make is any different. It's all so fucking typical.Like why not throw a fucking dog into my image right? People love dogs. I didn't ask for one in my prompt but what do I know so fuck me right?
>>108601998thats just how it is with classifier free guidance and rng. if you want a controlled composition you need to control it, either with weighted tokens, clusters of prompts to reinforce concepts, controlnets, proper negative prompts, etc.
kek, i have no idea how to upscale this without slopping it thoughHigh quality cosplay photo of a young and pretty japanese woman with long pink hair cosplaying as power from chainsaw man. The bedroom is full of toys and plushies. The woman is wearing gym shorts with her panties exposed. She is lying on her stomach and looking at the viewer. She is looking back. She has a toned body and the photo has an ass focus. A gaming computer is visible in the background. Her computer has a picture of Donald Trump.Negative prompt: anime, illustration, cartoon, stable diffusion, worst quality, low quality, score_1, score_2, score_3, bad hands, bad fingers, bad feet, bad anatomy, ai-generated, ai-assisted, bad quality, normal quality, average quality, adversarial noise, resized, downscaled, source larger, lowres, jpeg artifacts, compression artifacts, blurrySteps: 40, Sampler: ER SDE, Schedule type: Beta, CFG scale: 5, Shift: 3, Seed: 2054072178, Size: 896x1152, Model hash: 14fffe8ad5, Model: anima-preview3-base, Clip skip: 2, RNG: CPU, MaHiRo: True, Beta schedule alpha: 0.6, Beta schedule beta: 0.6, Version: neo, Module 1: Qwen_Image-VAE, Module 2: qwen_3_06b_base
>>108602047>cosplay photoah, genius.
>>108602167supreme leader khomeowni
>
>>108602306>hecking gigabytescatbox isn't going to make it
>>108602331He said hundreds per day ma'am.
>>108602306https://youtu.be/DjJPzEF5bHg?t=40
>>108602306>agenic AI Thankfully gens use good old fashioned regular AI mhm
>>108602306did it really take that faggot this long to find out
babe wake up, a base model that doesn't use VAE (pixel space) got releasedhttps://huggingface.co/blog/sensenova/neo-unify
>>108602575>We are actively preparing for open source as well as a detailed tech report. You will see them soon.delete your post again anon, nothing got released :)
>>108602590>nothing got releasedthat didn't prevent anons on talking about the (soon to be released) Z-image base over and over :(
Another chinese team, huh?
>>108602575>unifiedthat means it doesn't use a text encoder anymore? damn that looks interesting, it's just 3 models (TE + diffusion model + VAE) in one, I like that
>>108602575
>>108602605>Mar 9>We are actively preparing for open source as well as a detailed tech report. You will see them soon.lmao
>VAElessLodestonechads... our response?
>>108602625awooooooo~
>>108602595>Z-image basewe had turbo thobeit
>>108602636we're still saying "When Z-image edit?" to this day? more than 4 months after the Z-image series got revealed to the world lol
>>108602625>VAEless>Text-encoder-lessmake this shit a 15b model and I'm sold
>>108602575so far I've only seen pixel space only image model, but can it be done for video models too?
>>108602575its not in the news anchor so its not real news
>>108602575its not in the news anchor so its real news
>>108602575I really hope they'll release it, it's small (2b), and can do edit, if those Anima mf would've trained on this based model, NAI would be fucking dead lmao
>>108602575>pixel space>still has loss reconstruction and color shiftwhy? isn't it supposed to be a lossless process?
>>108602602>Womb EmbeddingHad to double take.
>>108602748>he hasn't demo'd it yet rat bastardo
https://huggingface.co/lodestones/Zeta-Chromathe loss curve is flattened, meaning that the training is over, yet the images it produces are still so fucking ASS
>>108602575One month later and nothing released.Judging by the slop look, we are probably not missing out much.Though it seems like they managed to make it converge into something besides crunchy blurslop. Kekstone might benefit from that.>>108602843His schizo meme architecture is unable to converge. Retard is just pointlessly wasting electricity instead of admitting that he fucked (again) with vibe training slop.
>>108602843>>108602858that's really disappointing. I was hoping that amazing tunes of z-base would be around by now but it's kind of dead
so many open source ai models die off and get no traction. This one got release yesterday yet not a peep from anyone.https://huggingface.co/tencent/Unicom-Unified-Multimodal-Modeling-via-Compressed-Continuous-Semantic-Representationshttps://github.com/Tencent-Hunyuan/UniComhttps://miazhao7708.github.io/UniComPage/
>>108603004https://huggingface.co/tencent/Unicom-Unified-Multimodal-Modeling-via-Compressed-Continuous-Semantic-Representations/tree/main/siglip2-so400m-patch16-naflex>using clip in the year of our lord 2026
>>108603004anon, you know it's a meme model when they're not comparing with the best edit models like Qwen Image Edit or Klein
>>108602575>2BSeems small. Too bad there's nothing there to try.>>108602843He's got that Civitai mindset; just fry that bitch 'till it's charred.
>>108603004>>108603004this is so ass, it completly changed the poor squirel
>>108602843>trusting the furry to not deliver garbageLOL!I'm sure the next experimental attempt will produce good results! XD
>>108603004I was interested in trying it out before I saw that it's 15gb.>>108603013Siglip isn't clip?>>108603041It uses older flux vae which is suboptimal for edit tasks now.
>>108603054>Siglip isn't clip?https://medium.com/@jiangmen28/siglip-vs-clip-the-sigmoid-advantage-457f1cb872abit's like saying Jake Paul is better than KSI, when ultimately we want fucking Mike Tyson (LLMs text encoders)
>>108603040based jenner is still alive
https://xcancel.com/bdsqlsz/status/2043981799693660215#mwhere did he get those images?
>>108603546https://github.com/Comfy-Org/ComfyUI/pull/13369#issuecomment-4237642159get chinese culture'ed (again)
Anything interesting happen in the last few days?>>108598018Guess not.
https://xcancel.com/toyxyz3/status/2044019214047162601#m>v1.0 has a normal accent>v1.1 has a jeet accentthe jokes write themselves lmao >>>/wsg/6128132
>>108603546https://github.com/Comfy-Org/workflow_templates/blob/main/templates/image_ernie_image-1.webpCome on Comfy, the dude has 6 fingers on his left hand
>>108603660kek the shoelaces
>>108603585Wait a minute...https://civitai.com/models/2540444/anima-highresaesthetic-boost>high-res support released as an official loraLoras can do that? Sweet, gonna try it out.>also comfy 0.19 came out, with an Intel portable releaseCongrats to Intelbros
>>108603546that's bullshit, but I believe it
>>108603758Left without lora, right with lora. The lighting gets fancier and proportions change a bit.
>>108601778What model is this?
>>108603552lol
>>108603878A taller pic. The composition changed a lot with this one, even with the same seed and inputs unless I missed something. The periphery's less fuzzy, but her details look a bit more slopped.
>>108603985catbox?
>>108604000prompt was just:cat, @umi \(srtm07\), smoking cigarette, spiral eyesno negative prompt
>>108604015The Japanese text on the previous one surprised me because anima isn't supposed to do that usually. Not that it's meaningful.I guess just a lucky slop. Thanks.
>>108603983Same prompt and seed, but 1280x1600. Different composition, butterface.
>>108604058I don't want to "backseat" tdrussell, but usually you need actual finetuning rather than making an adapter to change resolution target of the model effectively.
>>108604083> I don't want to "backseat" tdrussellsure thing ani
>>108604343So you want to kill the next general thread schizo? Why do you post here? You're the reason for the whole situation so fuck off
https://huggingface.co/baidu/ERNIE-Imagehttps://huggingface.co/baidu/ERNIE-Image-Turbocomfy workflow when? it seems it was already patched in but i don't see any nod
>>108604511you can download the workflow herehttps://github.com/Comfy-Org/workflow_templates/blob/main/templates/image_ernie_image.json
>>108604511This looks good? Unless the images are aggressively cherry picked, we seem to have a decently capable model with non-slopped look and SOTA text capability at just 8B. Hopefully it isn't slow as shit to run inference. And responds well to training.>>108604519Kino gen
>>108604511>>108604578https://huggingface.co/Comfy-Org/ERNIE-Imageok that's pretty good
>>108604511>>108604578>>108604636>ERNIE-Image: Our SFT model, delivers stronger general-purpose capability and instruction fidelity>ERNIE-Image-Turbo: Our Turbo model, optimized by DMD and RL, achieves faster speed and higher aestheticsI'm getting mixed signials, which one is the least slopped ultimately?https://yiyan.baidu.com/blog/posts/ernie-image
>>108604659>coffee fag is also an aisoyboi why am i not surprised
>>108604636downloadingwas getting tired of ZIB + ZIT, now it's gonna be EIB + EIT :D
>>108604659the text seems next level, and it doesn't look really slopped, can't believe Z-image turbo got beaten so quickly lmao (4chan get your shit together why are you bugging now we have a new decent model I wanna discuss about it!!)
>>108604659>https://yiyan.baidu.com/blog/posts/ernie-imageAnima btfo!!
Fresh when ready >>108604726>>108604726>>108604726
>>108604511looks like generic slopped dogshit #5849641 to me
>>108604729It's not perfect (green eye in the middle for example), but impressive character consistency for a local model doing multiple views gen.I am still downloading and haven't tested yet so I don't want to jinx it but we might be eating good with this one.