[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


No longer down for maintenance!

[Advertise on 4chan]


Discussion and Development of Local Image and Video Models

Previous: >>108590807

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Remember Anima discussion belong to Anime generals, dont be spiters
>>
File: 1764952745517916.png (140 KB, 1150x312)
140 KB
140 KB PNG
>>108598018
>Remember Anima discussion belong to Anime generals
>>
>>108598018
35 stars status?
>>
>>108597999
:]
>>
Blessed thread of frenship
>>
>>108597963
>no Anima gens
>no anime gens
Can someone tell me why tdrusell chose this general to shill Anima?
>>
>crying over the faggollage
>>
>>108598106
>why tdrusell chose this general
because it's a great general, the proof is that you lurk here often, meaning that you also enjoy it very much
>>
File: converted.jpg (1.13 MB, 1536x2560)
1.13 MB
1.13 MB JPG
>post in my shitty general!
>>
File: 1769913703128476.jpg (1.81 MB, 2048x3072)
1.81 MB
1.81 MB JPG
https://huggingface.co/duongve/AnimaYume
why are they finetuning an unfinished base model? lool
>>
>>108598123
No, if I’m here it’s because tdrusell post here. I’m not interested in seeing 3dPG, Zimage slop, or Chroma slop or some new scuffed DOA local model release.
>>
File: Z-image turbo.png (2.98 MB, 1536x864)
2.98 MB
2.98 MB PNG
>>108598145
>Zimage slop
it's a good model anon, it can even do good anime images out of the box :(
>>
File: 1639326039084.png (1.33 MB, 1600x672)
1.33 MB
1.33 MB PNG
>>108598106
He caters to me personally.
>>
>>108598054
>>
>>108598176
Where did you find this pic of me?
>>
>>108598186
you need to sleep anon!
>>
>>108598186
Took a selfie and asked Gemma4 to caption it. The result? "average /ldg/god"
>>
>mfw Resource news

04/13/2026

>LTX 2.3 Distilled v1.1
https://huggingface.co/Lightricks/LTX-2.3/blob/main/ltx-2.3-22b-distilled-1.1.safetensors

>UniCom: Unified Multimodal Modeling via Compressed Continuous Semantic Representations
https://huggingface.co/tencent/Unicom-Unified-Multimodal-Modeling-via-Compressed-Continuous-Semantic-Representations

>CatalogStitch: Dimension-Aware and Occlusion-Preserving Object Compositing for Catalog Image Generation
https://catalogstitch.github.io

>Realizing Immersive Volumetric Video: A Multimodal Framework for 6-DoF VR Engagement
https://github.com/Metaverse-AI-Lab-THU/ImViD

>Seeing is Believing: Robust Vision-Guided Cross-Modal Prompt Learning under Label Noise
https://github.com/gezbww/Vis_Prompt

>MixFlow: Mixed Source Distributions Improve Rectified Flows
https://github.com/NazirNayal8/MixFlow

>VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images
https://zlab-princeton.github.io/VisionFoundry

>Tango: Taming Visual Signals for Efficient Video Large Language Models
https://github.com/xjtupanda/Tango

>VL-Calibration: Decoupled Confidence Calibration for Large Vision-Language Models Reasoning
https://github.com/Mr-Loevan/VL-Calibration

>pixlstash v1.0
https://github.com/Pikselkroken/pixlstash/releases/tag/v1.0.0

>SD Forge — CivitAI Helper
https://github.com/ArthureCodage/sd-forge-civitai-helper

>Is AI the greatest art heist in history?
https://www.theguardian.com/books/2026/apr/12/is-ai-the-greatest-art-heist-in-history

>VisionCaptioner: Automated image & video captioning using Qwen-VL and SAM3
https://github.com/Brekel/VisionCaptioner

04/12/2026

>Stretchy Studio: FOSS 2D animation tool for turning static illustrations into mesh-deformable characters
https://github.com/MangoLion/stretchystudio

>LTX-2 VBVR LoRA - Video Reasoning
https://huggingface.co/LiconStudio/Ltx2.3-VBVR-lora-I2V

04/11/2026

>ComfyUI-RookieUI: The ultimate A1111-style sidebar
https://github.com/rookiestar28/ComfyUI-RookieUI
>>
>mfw Research news

04/13/2026

>InsEdit: Towards Instruction-based Visual Editing via Data-Efficient Video Diffusion Models Adaptation
https://arxiv.org/abs/2604.08646

>CT-1: Vision-Language-Camera Models Transfer Spatial Reasoning Knowledge to Camera-Controllable Video Generation
https://arxiv.org/abs/2604.09201

>On Semiotic-Grounded Interpretive Evaluation of Generative Art
https://arxiv.org/abs/2604.08641

>SCoRe: Clean Image Generation from Diffusion Models Trained on Noisy Images
https://arxiv.org/abs/2604.09436

>Training-free, Perceptually Consistent Low-Resolution Previews with High-Resolution Image for Efficient Workflows of Diffusion Models
https://arxiv.org/abs/2604.09227

>ELT: Elastic Looped Transformers for Visual Generation
https://arxiv.org/abs/2604.09168

>EGLOCE: Training-Free Energy-Guided Latent Optimization for Concept Erasure
https://arxiv.org/abs/2604.09405

>Post-Hoc Guidance for Consistency Models by Joint Flow Distribution Learning
https://arxiv.org/abs/2604.08828

>MeshOn: Intersection-Free Mesh-to-Mesh Composition
https://threedle.github.io/MeshOn

>BlendFusion -- Scalable Synthetic Data Generation for Diffusion Model Training
https://arxiv.org/abs/2604.09022

>Strips as Tokens: Artist Mesh Generation with Native UV Segmentation
https://arxiv.org/abs/2604.09132

>Region-Constrained Group Relative Policy Optimization for Flow-Based Image Editing
https://arxiv.org/abs/2604.09386

>Detecting Diffusion-generated Images via Dynamic Assembly ForestsDetecting Diffusion-generated Images via Dynamic Assembly Forests
https://arxiv.org/abs/2604.09106

>RIRF: Reasoning Image Restoration Framework
https://arxiv.org/abs/2604.09511

>AniGen: Unified S3 Fields for Animatable 3D Asset Generation
https://arxiv.org/abs/2604.08746

>Do Vision Language Models Need to Process Image Tokens?
https://arxiv.org/abs/2604.09425

>LADR: Locality-Aware Dynamic Rescue for Efficient T2I Generation with Diffusion LLMs
https://arxiv.org/abs/2603.13450
>>
>>108598135
I dunno, if you download any of these anima "finetunes" and drop in the workflow of one of your gens and regen with it you get almost the exact same picture so I can only assume they want to steal credit for how good the model is.
>>
>>108598242
"Is AI the greatest anime tiddie in history?"
>>
>>108598018
but it's as good for generating realism as zit
>>
>>108598330
proof?
>>
>>108598135
Why not? These finetunes work.
>>
>>108598334
zimg
>>108597460
>>108597374
>>108594855

anima
>>108591129
>>108591142
>>
File: 1760560355557145.png (1.25 MB, 896x1152)
1.25 MB
1.25 MB PNG
>>108598380
really? looks like some Qwen Image slop, the skin is smooth as fuck
>>
>>108598401
so as zit
2b model the best for local anime and as good for realism as big chink models that's crazy
>>
>>108597866
Why are offloading all models? Disable layer offloading.
Probably set TE precision higher and enable unload TE.
Disable caption dropout probably.
Differential Guidance meme didn't workout too well for me for other models, but try your luck I guess.
>>
>>108598449
thanks for the response anon. going to try this another day when I'm moralized again :'( . i wasted a whole 8 hours of the bullshit and was bored as fuck. I wish there lora commissioners available for high end models like ltx and qwen.
>>
What's the best way to local gen images on Android these days?
>>
File: file.png (3.32 MB, 1152x2048)
3.32 MB
3.32 MB PNG
>>
>>108597963
Question of about lora training
What happened if i overtagging?
Like, i used 2 model to tags image, and both of them give different tags
>>
File: deMA_zi_00007_.png (2.05 MB, 1792x977)
2.05 MB
2.05 MB PNG
>>
i'd post my gens but this bitch ass cuck asshole nigger of a board won't let post images in incognito mode and i'm rangebanned fuck this
>>
File: 00005-2336170344.jpg (1.88 MB, 2016x2592)
1.88 MB
1.88 MB JPG
>>
>>108598714
not a big loss
your images are shit anyway
>>
>>108598897
this
>>
File: 597411051901623.png (769 KB, 832x1216)
769 KB
769 KB PNG
>>
File: 884561598637606.png (1023 KB, 832x1216)
1023 KB
1023 KB PNG
>>
Local is dead
>>
File: 314212686033261.png (605 KB, 832x1216)
605 KB
605 KB PNG
>>
>>108592106
I really liked this one, what's the model being used Anon?
>>
Hello, I uploaded another lora, feel free to post your questions here.
https://civitai.com/models/2540444/anima-highresaesthetic-boost
>>108598018
>>108598106
I’m not going through all the 4chan generals, I’ll just post here.
>>
>>108598971
Based kingruss
>>
>>108598971
Do you need datasets for non anime artists?
>>
>>108598971
Thanks!
>>
uh oh meltiy incoming
>>
File: _AnimaPreview3_00041_.jpg (555 KB, 1608x1248)
555 KB
555 KB JPG
>>
>>108598971
>I
You aren't me. Don't believe his lies.

But the lora is pretty useful for generating at >1024 res. Aesthetic effect is more subtle but I don't think that's necessarily a bad thing.
>>
>>108598971
Thanks for sharing! Do you have plans to make a furry finetune in the future?”
>>
Is it possible to use ZiT or Klein on a 12GB card?
>>
>>108599035
please post in anime generals i beg you!
>>
File: _AnimaPreview3_00048_.jpg (495 KB, 1248x1608)
495 KB
495 KB JPG
>>
still waitin on the realism lora
>>
File: 1774617995278017.png (308 KB, 2222x1294)
308 KB
308 KB PNG
>>108598971
>feel free to post your questions here.
I'm not seeing any images on civitai :(
>>
>>108598971
Russ I am busy this week so I will probably make my huggingface post about it next week, but I should give you a heads up so that you can hopefully take your time to test it on your own.
Have you compared character knowledge of preview 3 vs preview 2? I see it struggling with some characters that preview 2 could do easily, but now preview 3 is struggling to do them with same consistency. I love your work with anima but it got me worried a bit.
>>
>>108599058
>i beg you
lmao, get fucked
>>
>>108598971
>feel free to post your questions here.
maybe it's a dumb question but, why a lora? why can't it be part of the Anima finetune?
>>
>>108599064
Civit has to "analyze" the image for safety before it shows up, and that service seems to be slow or broken right now.
>>
File: _AnimaPreview3_00054_.jpg (401 KB, 1248x1608)
401 KB
401 KB JPG
>>
>>108599095
then show your images here in the meanwhile, I wanna see how your high res images look like
>>
>>108599103
They're all paired images showing before/after and are over 4MB, just wait a few minutes for Civit to unfuck themselves.
>>
>>108598971
Dude your model still doesn't recognize Rin Tezuka, c'mon :d (apart that complain, your model is really solid, good job)
>>
>>108599083
Anima has flawed architecture, get over it.
>>
>>108598988
>>108599114
you are hard dude to reach
>>
>>108599124
keep crying, you lost
>>
>>108598971
Good model CHADrusell!
>>
>>108598971
is there a point in going for higher res? does that improve the hands for example?
>>
File: _AnimaPreview3_00066_.jpg (540 KB, 1248x1608)
540 KB
540 KB JPG
>>
>>108598971
WHY DO YOU POST HERE? DO YOU LOVE CATJACK?
>>
File: 615463074307759.png (1.57 MB, 832x1216)
1.57 MB
1.57 MB PNG
>>
>>108599170
>realistic background
pure slop lool
>>
>>108599170
pure kino
>>
File: needahand.png (570 KB, 896x1152)
570 KB
570 KB PNG
>>108599119
>doesn't recognize Rin Tezuka
? it does
>>
>>108598971
is there a reason why you decided to go for nvidia chronos as a base model? I mean, c'mon!
>>
>>108598971
fix the shitty hands
>>
File: based.png (717 KB, 2901x1740)
717 KB
717 KB PNG
>the scamming jeets are shitting the Tongyi discord place
good, that's all they deserve for not releasing Z-image edit model kek
>>
>>108598971
Please, kingruss throw as a bone in /hgg/, we are using your model everyday!
>>>/h/8860124
>>>/h/8860086
>>>/h/8860048
>>>/h/8859813
>>
File: _AnimaPreview3_00090_.jpg (434 KB, 1248x1608)
434 KB
434 KB JPG
>>
>>108599213
Why would he post in a dedicated hentai thread? Like why do you think every example image on the Civit page is SFW? Same shit as Noob, everyone knows what the model can do but there's reasons to not openly advertise that.
>>
File: 408309586443103.png (2.42 MB, 1824x1248)
2.42 MB
2.42 MB PNG
>>
Anatomy seems worse at higher rez with the lora and it was already a bit of a problem. I think I'm just gonna continue upscaling. In particular some body parts get loooong.
>>
>>108599257
/adt/ is sfw...but i lurk here anyways so it doesnt matter to me where he posts
>>
File: 00006-3867242695.jpg (937 KB, 1728x1344)
937 KB
937 KB JPG
>>
>>108599279
/adt/ is fucking dead sometimes there are 24 hour periods with 5 posts
>>
https://huggingface.co/lightx2v/Wan2.2-Distill-Models/tree/main
>they're still making new lightning loras of wan 2.2
lmaoo
>>
File: 123800191121980.png (2.77 MB, 2016x1152)
2.77 MB
2.77 MB PNG
>>108599274
Works alright in my experience.
>>
>>108599297
>works alright
>posts an image of a girl with a dislocated shoulder
lmao
>>
why /adt/ is dead? :(
>>
>>108599260
>>108599297
the colors are too saturated, decrease the cfg I guess
>>
File: _AnimaPreview3_00108_.jpg (757 KB, 1248x1608)
757 KB
757 KB JPG
>>
>>108599308
He’s a dev, not a serious anime genner and he posts his gens in the sloppiest, most casual diffusion threads.
>>
>>108599213
Thanks for the (You)s, kind stranger.
>>
mugen and chenkin status? any gens posted with them yet?
>>
File: 1773217571435873.jpg (962 KB, 2698x1728)
962 KB
962 KB JPG
>>108598971
finally, we can see the images
>>
>>108599350
>its slop
OHNONONO
>>
>>108599353
it's just a lora after all, once he ends up the finetuning of Anima with high res it'll be better
>>
>>108599350
Uh oh, nice WAI lora tdrussell!
>>
>>108599364
I really hope it doesn't end up looking like this then
>>
>>108599346
Worthless finetroons of an ancient model
>>
>>108599350
Thanks I missed WAI so much!
>>
>>108599257
it's a falseflag, this dude comes to hgg to do the same shit, nobody else cares
>>
>>108599382
hello /hgg/ lurker! welcome to /ldg/ the only serious anime general
>>
>>108599396
35 stars status?
>>
File: 1747796594292521.png (1.78 MB, 1802x1152)
1.78 MB
1.78 MB PNG
SOUL - SOULLESS
>>
>>108599410
35 post per day status?
>>
WAI won
>>
>>108599433
yeah it's completly useless if it slopify the output
>>
>>108599433
But you only posted the latter...
>>
>>108599434
>people notice I'm ani 35 times per day
oof you're pretty bad at hiding :d
>>
>he is oofing
>>
>>108599162
uh oh, melty
>>
File: ComfyUI_09456_.png (783 KB, 1024x1024)
783 KB
783 KB PNG
>>
File: 333539074583949.png (2.31 MB, 1248x1824)
2.31 MB
2.31 MB PNG
>>
>>108598971
Works fine for me when going to 2MP and a bit beyond compared to without, minimal influence on artist tags. Nice job, eagerly on my knees for preview4/the final release with the 1536 pass.
>>
>>108599576
based WAI enjoyer
>>
>>108599583
I was just testing the lora lil bro, never gen that high to begin with anyway.
>>
>>108599565
Based TamziyGOD bulling Ani
>>
File: 780117049481247.png (3.04 MB, 1824x1248)
3.04 MB
3.04 MB PNG
>>
>>108598971
Based /ldg/ enjoyer
>>
>>108599322
this is really good
box please?
>>
File: deMA_zi_00012_.png (2.54 MB, 1792x977)
2.54 MB
2.54 MB PNG
>>
File: 102148519303150.png (2.53 MB, 1824x1248)
2.53 MB
2.53 MB PNG
>>
https://www.reddit.com/r/StableDiffusion/comments/1skds12/update_distilled_v11_is_live/
>no examples
I won't fall for your jewish tricks
>>
>>108599350
>>108598971
Just call it illustrious 2.0 lora
>>
I wanna make some cuck sloppa, any recommendations
>>
>>108598971
>https://civitai.com/models/2540444/anima-highresaesthetic-boost
Noob here, what model do I need to use this with?
>>
how can i get a realistic effect in anima preview 3? i've tested all the realistic triggers from pony-Illus…
>>
>>108599894
The page contains enough information:
>Base Model
>Anima
>About this version
>Trained on preview3
I will handhold you further though:
https://huggingface.co/circlestone-labs/Anima/tree/main/split_files
>>
File: 835673734828.jpg (1.8 MB, 1536x2432)
1.8 MB
1.8 MB JPG
Anima is getting decent at replicating characters, but the details are still missing sadly.
>>
>>108599911
Ok, I dont know which anime model though, so this one wont work:
>https://civitai.com/models/2458426/anima-official
anyways thanks
>>
>>108599960
>anima knows this generic chink slop but not canari_(pokemon)
hmm sus
>>
>>108599975
by details, I meant smaller designs/patterns/etc. on characters.
>>
>>108599960
Flawed architecture, but still better than SDXL. I’m waiting for Noob 2.0 meanwhile using Chenkin 5.0 or base Noob as a hires pass/ detailer desu
>>
>>108599960
>the details are still missing sadly.
that's what happens when you go for a meme base model with a subpar vae >>108596443
>>
File: 1769747479955588.jpg (578 KB, 1328x1640)
578 KB
578 KB JPG
>>
>>108600019
just wait for bluvoll's Anima Flux2VAE rectified flow
>>
>>108600095
lmao'd.
>>
Why does it take so fucking long to release the full model of Anima?
What the fuck are they even doing?
>>
>>108599902
>realistic triggers
If you mean tags like realistic, photo-realistic these will just create slop.
Just write a natural language description.
It's hit and miss when it comes to realism though.
>>
File: 20260318_123209.jpg (195 KB, 864x1240)
195 KB
195 KB JPG
I still can't do two unique characters without it morphing them into one or mutating them. I've used Forge Couple, etc, but it just doesn't work. I had to give up and use Nano Banana...
>>
>>108600233
what if you give them names?
>>
>>108600233
what if you give them dicks?
>>
File: rtthuc.jpg (1.04 MB, 2018x1639)
1.04 MB
1.04 MB JPG
>>108600245
They do have names. They even have their own Booru tags.
>>
>>108600233
I mean, obviously a 0.6 TE won't do miracles here
>>
If they won’t use Zimage, then Noob2 should be trained on Mugen, and the money saved should be invested in recaptioning the dataset with current VLMs. UNet is still kino in some respects.
>>
>>108600233
NovelAI has a special framework designed to solve this. This is why SaaS is superior to local. SaaS actually addresses problems that users face
>>
>>108600283
There isnt and wont be a good for all model, SDXL is better at quickly capturing styles and merging aesthetics. SDXL for the hires fix pass is kino in Anima and Anima has much better composition than SDXL. To me the two should coexist and complement each other.
>>
>>108600296
V5 is coming, Anima should sell their stocks before the big arrive.
>>
>>108600296
the "problem" already has like 6 different local solutions.
>>
File: deMA_zi_00016_.png (2.07 MB, 1792x977)
2.07 MB
2.07 MB PNG
>>
File: o_00232_.png (1.03 MB, 1536x512)
1.03 MB
1.03 MB PNG
>>
>>108600159
share your catbox bro. anima works fine with anime. i just want to try the photorealistic part
>>
File: o_00233_.png (1.93 MB, 896x1152)
1.93 MB
1.93 MB PNG
>>
https://huggingface.co/obsxrver/wan2.2-i2v-lightx2v-260412/tree/main
new lightning loras
>>
>wan 2.2
local really is dead
>>
File: 00405-1635270013.png (1.66 MB, 1472x848)
1.66 MB
1.66 MB PNG
>>
>>108600613
What was improved/fixed?
>>
>>108600613
>check user's profile
no
>>
>>108600233
>what is regional conditioning
>>
Please ... I need to artist mix ........
>>
I don't get why this thread keeps saying LTX has no loras. There's plenty of decent NSFW loras on civitai
>>
>>108600961
I've never seen that said anywhere. Just that LTX has garbage quality.
>>
>>108600715
apparently is extracted from kijai's loras

https://huggingface.co/lightx2v/Wan2.2-Distill-Models/tree/main
>>
File: 385188574252897.png (2.27 MB, 1824x1248)
2.27 MB
2.27 MB PNG
>>
File: 1774466994663736.png (553 KB, 1553x872)
553 KB
553 KB PNG
i just started genning locally with Comfy, and its kind of confusing.

How do I set it up so that I can start generating stuff that doesn't look like complete dogshit
>>
>>108601095
went from 160.33s to 143.41 using this
>>
>>108601238
download a workflow for a model you want. google comfyui workflow + model name
>>
>>108601238
check other's prompt on civitai for the model ou are using (and similar models), also since this is sdxl, minimum of 1mp resolution
>>
>>
To jailbreak anima to do realism I think you need to do nat lang postive prompt. booru tags in negative only.
>>
Okay, so there's a million fuckin AI GF websites that go:
>Pick realistic or anime
>Pick race
>Pick hair color
>Pick bust size
>Pick ass size
>Pick relationship
>Pick 3 hobbies from a long list, a lot of which are hyper-specific
>Then it asks you to log in with Google or make an account, then asks for credit card
Hell, there's a 50% chance your 4chan ad is for one right now.

Their urls and specific genned images and videos are different but they're clearly all running the exact same software. And for there to be that many, it's probably something prebuilt and easy to set up.

So, any ideas where to find the guts? I don't want to make my own website, I just want to run it locally. I have the hardware, so fuck paying those blatant scammers.
>>
>>108601279
I've noticed you can weight tags extremely high on anima without it breaking the image, so try (real photo,:3.0) or some shit. Also try (cartoon, drawing, 2D,:3.0) in the negative.
>>
File: 327.jpg (1.26 MB, 4096x2048)
1.26 MB
1.26 MB JPG
anima highres lora
https://civitai.com/models/2540444/anima-highresaesthetic-boost?modelVersionId=2855073
with/without
>>
>>108601309
There's no clip to interpret those weightings, you're just asking an LLM to interpret those strings anon
>>
File: file.png (3.21 MB, 1152x2048)
3.21 MB
3.21 MB PNG
>>
>>108601380
catbox?
>>
File: 115754419284491.png (1.69 MB, 832x1216)
1.69 MB
1.69 MB PNG
>>
>>108601385
https://gofile.io/d/AvR7P6
>>
File: 54885957594595.png (1.59 MB, 1152x2016)
1.59 MB
1.59 MB PNG
>>
>>108601372
Try it, retard, it works.
>>
>>108601306
the guts are probably just a base model and half a dozen loras and a basic bitch llm to write a prompt.
>user selects photorealistic black woman with big tits and a small ass.
>user selects surfing and cooking as hobbies.
>load realism model + black woman lora + big tits lora + small ass lora
>llm prompt big titty black bitch lying on surfboard and rubbing her vagene with a bigmac.
>>
File: 541646248919810.png (612 KB, 1216x832)
612 KB
612 KB PNG
>>
>>108601415
thanks
>>
>>108599549
>oh i am oofing
>>
File: 420273897860536.png (1.3 MB, 1248x1824)
1.3 MB
1.3 MB PNG
>>
>>108601388
What is this sexy outfit called
>>
>>108601385
can't
>>108600538
>>
I've looked through a lot of this info and a lot of these options but I can't find what I'm looking for: I want something akin to chatgpt's "upload an image and a prompt to edit it" where i can do something like post a picture of a green ball and say make it red with blue stripes. Any good options you guys know?
>>
>>108601471
once it makes a "character" i'm fairly certain the outputs are consistent though.

wouldn't be much of a girlfriend if she looked like an entirely different girl every time she sent a photo.
>>
>>108601648
Flux2 Klein
Qwen Image Edit
>>
>>108601654
my fucking hero, thank you king. qwen is perfect aside from not seeming to have an offline/local version, is there a way to do that with it?
>>
>>108601682
yes
https://www.youtube.com/results?search_query=Run+Qwen-Image-Edit+Locally
>>
>>
>>
>>108601726
thank you again for the spoonfeed, i found unsloth and that seems pretty cool so im trying that currently
>>
>>
beautiful baby girls deserve my kisses
>>
>>108601653
well it's not that hard to get consistent gens for generic 1girl shots, worst case you could gen a big batch and then run them through a face analyzer and only output the best matches.
if you are using loras it just gets easier.
>>
>>
>>108601306
Most popular sites like that are sold in a white glove service by several sites, this is one

https://www.scrile.com/ai

Its basically pay and deploy but I don't know if you can tinker the workflows or stuff like that, setting an AI adult site from the ground can be tricky, since you will have to invest money and time on hosting, coding (even with vibecoding), marketing, payment processors, creating and setting up the characters, it could take you several months
>>
File: output.webm (3.87 MB, 768x1360)
3.87 MB
3.87 MB WEBM
>>108601380
>>108601744
>>108601749
>>108601751
>>108601778

Damn does this bitch just never wash her clothes?
>>
File: Untitled.png (892 KB, 1168x816)
892 KB
892 KB PNG
>>
>>108601823
do helen frankenthaler
>>
File: Untitled.png (937 KB, 1168x816)
937 KB
937 KB PNG
>>108601831
>>
File: 1742437393421802.jpg (89 KB, 1440x833)
89 KB
89 KB JPG
i managed using realistic anima. cleary better than klein, qwen,sdxl. and no need millions loras for the body, yay
>>
>>108601949
Thanks for posting an example gen, anon!
>>
I iterated over this with dozens of different prompts and tried three different models and every time it adds a weird light in the middle of the scene.
>>
>>108601949
post gen.
>>
>>108601962
I tried describing a gunfight (can't say firefight or it will think like putting out fires) and all the projectile trails (can't say tracers or your image is overwatch themed now) always are coming from the light in the center of the image, often in a pillar going vertical into the sky. If I describe soldiers or silhouettes in the perimeter it places them surrounding whatever pyre is in the middle, half of the are bowing to it like in worship or something. Now it keeps adding a dog in it for no reason in every seed.

I was looking through images on civitai thinking that I'm just shit at prompting. But it turns out every prompt in there is half ignored anyway. Like I saw one with "disembodied limb" that didn't feature a disembodied limb in the image. This shit is a complete joke.

It's not even that people can't generate good looking images. I can make convincing images that I find interesting but it's never actually what I had in mind or intended. And it's clear none of the stuff other people make is any different. It's all so fucking typical.

Like why not throw a fucking dog into my image right? People love dogs. I didn't ask for one in my prompt but what do I know so fuck me right?
>>
>>108601998
thats just how it is with classifier free guidance and rng.
if you want a controlled composition you need to control it, either with weighted tokens, clusters of prompts to reinforce concepts, controlnets, proper negative prompts, etc.
>>
File: 00382-2054072178.jpg (287 KB, 896x1152)
287 KB
287 KB JPG
kek, i have no idea how to upscale this without slopping it though

High quality cosplay photo of a young and pretty japanese woman with long pink hair cosplaying as power from chainsaw man. The bedroom is full of toys and plushies. The woman is wearing gym shorts with her panties exposed. She is lying on her stomach and looking at the viewer. She is looking back. She has a toned body and the photo has an ass focus. A gaming computer is visible in the background. Her computer has a picture of Donald Trump.
Negative prompt: anime, illustration, cartoon, stable diffusion, worst quality, low quality, score_1, score_2, score_3, bad hands, bad fingers, bad feet, bad anatomy, ai-generated, ai-assisted, bad quality, normal quality, average quality, adversarial noise, resized, downscaled, source larger, lowres, jpeg artifacts, compression artifacts, blurry
Steps: 40, Sampler: ER SDE, Schedule type: Beta, CFG scale: 5, Shift: 3, Seed: 2054072178, Size: 896x1152, Model hash: 14fffe8ad5, Model: anima-preview3-base, Clip skip: 2, RNG: CPU, MaHiRo: True, Beta schedule alpha: 0.6, Beta schedule beta: 0.6, Version: neo, Module 1: Qwen_Image-VAE, Module 2: qwen_3_06b_base
>>
>>108602047
>cosplay photo
ah, genius.
>>
File: Flux2-Klein_00071_.png (1.71 MB, 1024x1024)
1.71 MB
1.71 MB PNG
>>
File: f2k9b_00034.png (2.24 MB, 960x1536)
2.24 MB
2.24 MB PNG
>>
>>108602167
supreme leader khomeowni
>>
File: file.png (95 KB, 278x420)
95 KB
95 KB PNG
>
>>
>>108602306
>hecking gigabytes
catbox isn't going to make it
>>
>>108602331
He said hundreds per day ma'am.
>>
>>108602306
https://youtu.be/DjJPzEF5bHg?t=40
>>
>>108602306
>agenic AI
Thankfully gens use good old fashioned regular AI mhm
>>
>>108602306
did it really take that faggot this long to find out
>>
File: Flux2-Klein_00119_.png (123 KB, 1037x80)
123 KB
123 KB PNG
>>
File: 1746365606508663.jpg (1.28 MB, 2132x1280)
1.28 MB
1.28 MB JPG
babe wake up, a base model that doesn't use VAE (pixel space) got released
https://huggingface.co/blog/sensenova/neo-unify
>>
>>108602575
>We are actively preparing for open source as well as a detailed tech report. You will see them soon.
delete your post again anon, nothing got released :)
>>
>>108602590
>nothing got released
that didn't prevent anons on talking about the (soon to be released) Z-image base over and over :(
>>
Another chinese team, huh?
>>
File: file.png (375 KB, 1280x591)
375 KB
375 KB PNG
>>108602575
>unified
that means it doesn't use a text encoder anymore? damn that looks interesting, it's just 3 models (TE + diffusion model + VAE) in one, I like that
>>
File: file.png (2 KB, 203x48)
2 KB
2 KB PNG
>>108602575
>>
>>108602605
>Mar 9
>We are actively preparing for open source as well as a detailed tech report. You will see them soon.
lmao
>>
>VAEless
Lodestonechads... our response?
>>
>>108602625
awooooooo~
>>
>>108602595
>Z-image base
we had turbo thobeit
>>
>>108602636
we're still saying "When Z-image edit?" to this day? more than 4 months after the Z-image series got revealed to the world lol
>>
File: yayy.png (83 KB, 259x194)
83 KB
83 KB PNG
>>108602625
>VAEless
>Text-encoder-less
make this shit a 15b model and I'm sold
>>
>>108602575
so far I've only seen pixel space only image model, but can it be done for video models too?
>>
>>108602575
its not in the news anchor so its not real news
>>
>>108602575
its not in the news anchor so its real news
>>
>>108602575
I really hope they'll release it, it's small (2b), and can do edit, if those Anima mf would've trained on this based model, NAI would be fucking dead lmao
>>
File: Capture.png (2.06 MB, 2594x1541)
2.06 MB
2.06 MB PNG
>>108602575
>pixel space
>still has loss reconstruction and color shift
why? isn't it supposed to be a lossless process?
>>
>>108602602

>Womb Embedding

Had to double take.
>>
File: When baidu ERNIE?.png (73 KB, 1235x445)
73 KB
73 KB PNG
>>
>>108602748
>he hasn't demo'd it yet
rat bastardo
>>
File: 1773121883282732.png (655 KB, 2100x6300)
655 KB
655 KB PNG
https://huggingface.co/lodestones/Zeta-Chroma
the loss curve is flattened, meaning that the training is over, yet the images it produces are still so fucking ASS
>>
>>108602575
One month later and nothing released.
Judging by the slop look, we are probably not missing out much.
Though it seems like they managed to make it converge into something besides crunchy blurslop. Kekstone might benefit from that.
>>108602843
His schizo meme architecture is unable to converge. Retard is just pointlessly wasting electricity instead of admitting that he fucked (again) with vibe training slop.
>>
>>108602843
>>108602858
that's really disappointing. I was hoping that amazing tunes of z-base would be around by now but it's kind of dead
>>
so many open source ai models die off and get no traction. This one got release yesterday yet not a peep from anyone.
https://huggingface.co/tencent/Unicom-Unified-Multimodal-Modeling-via-Compressed-Continuous-Semantic-Representations

https://github.com/Tencent-Hunyuan/UniCom

https://miazhao7708.github.io/UniComPage/
>>
File: Son I'm crine.png (437 KB, 976x549)
437 KB
437 KB PNG
>>108603004
https://huggingface.co/tencent/Unicom-Unified-Multimodal-Modeling-via-Compressed-Continuous-Semantic-Representations/tree/main/siglip2-so400m-patch16-naflex
>using clip in the year of our lord 2026
>>
File: 1770827840901658.png (64 KB, 449x1653)
64 KB
64 KB PNG
>>108603004
anon, you know it's a meme model when they're not comparing with the best edit models like Qwen Image Edit or Klein
>>
File: ComfyUI_21230.png (2.23 MB, 1200x1600)
2.23 MB
2.23 MB PNG
>>108602575
>2B
Seems small. Too bad there's nothing there to try.

>>108602843
He's got that Civitai mindset; just fry that bitch 'till it's charred.
>>
File: Capture.jpg (436 KB, 3714x1521)
436 KB
436 KB JPG
>>108603004
>>108603004
this is so ass, it completly changed the poor squirel
>>
>>108602843
>trusting the furry to not deliver garbage
LOL!
I'm sure the next experimental attempt will produce good results! XD
>>
>>108603004
I was interested in trying it out before I saw that it's 15gb.
>>108603013
Siglip isn't clip?
>>108603041
It uses older flux vae which is suboptimal for edit tasks now.
>>
>>108603054
>Siglip isn't clip?
https://medium.com/@jiangmen28/siglip-vs-clip-the-sigmoid-advantage-457f1cb872ab
it's like saying Jake Paul is better than KSI, when ultimately we want fucking Mike Tyson (LLMs text encoders)
>>
File: 00008-2187282030.png (3.29 MB, 2304x1248)
3.29 MB
3.29 MB PNG
>>
>>108603040
based jenner is still alive
>>
File: Ernie-image.png (465 KB, 1076x559)
465 KB
465 KB PNG
https://xcancel.com/bdsqlsz/status/2043981799693660215#m
where did he get those images?
>>
File: 1749297024678439.png (67 KB, 2291x415)
67 KB
67 KB PNG
>>108603546
https://github.com/Comfy-Org/ComfyUI/pull/13369#issuecomment-4237642159
get chinese culture'ed (again)
>>
Anything interesting happen in the last few days?

>>108598018
Guess not.
>>
https://xcancel.com/toyxyz3/status/2044019214047162601#m
>v1.0 has a normal accent
>v1.1 has a jeet accent
the jokes write themselves lmao >>>/wsg/6128132
>>
File: 00015-2179401690.png (1.24 MB, 1168x816)
1.24 MB
1.24 MB PNG
>>
File: Ernie-image.png (1.64 MB, 2400x1200)
1.64 MB
1.64 MB PNG
>>108603546
https://github.com/Comfy-Org/workflow_templates/blob/main/templates/image_ernie_image-1.webp
Come on Comfy, the dude has 6 fingers on his left hand
>>
>>108603660
kek the shoelaces
>>
>>108603585
Wait a minute...

https://civitai.com/models/2540444/anima-highresaesthetic-boost
>high-res support released as an official lora
Loras can do that? Sweet, gonna try it out.

>also comfy 0.19 came out, with an Intel portable release
Congrats to Intelbros
>>
File: 1759411545203475.png (52 KB, 1229x329)
52 KB
52 KB PNG
>>108603546
that's bullshit, but I believe it
>>
>>108603758
Left without lora, right with lora. The lighting gets fancier and proportions change a bit.
>>
>>108601778

What model is this?
>>
File: miku.png (1.87 MB, 896x1152)
1.87 MB
1.87 MB PNG
>>108603552
lol
>>
File: miku 3.png (1 MB, 896x1152)
1 MB
1 MB PNG
>>
>>108603878
A taller pic. The composition changed a lot with this one, even with the same seed and inputs unless I missed something. The periphery's less fuzzy, but her details look a bit more slopped.
>>
File: o_00235_.png (302 KB, 896x1152)
302 KB
302 KB PNG
>>
File: miku 4.png (1.49 MB, 896x1152)
1.49 MB
1.49 MB PNG
>>
File: miku 5.png (745 KB, 896x1152)
745 KB
745 KB PNG
>>108603985
catbox?
>>
File: o_00236_.png (277 KB, 896x1152)
277 KB
277 KB PNG
>>108604000
prompt was just:
cat, @umi \(srtm07\), smoking cigarette, spiral eyes
no negative prompt
>>
File: miku 6.png (1.53 MB, 896x1152)
1.53 MB
1.53 MB PNG
>>108604015
The Japanese text on the previous one surprised me because anima isn't supposed to do that usually. Not that it's meaningful.
I guess just a lucky slop. Thanks.
>>
>>108603983
Same prompt and seed, but 1280x1600. Different composition, butterface.
>>
>>108604058
I don't want to "backseat" tdrussell, but usually you need actual finetuning rather than making an adapter to change resolution target of the model effectively.
>>
File: miku 7.png (773 KB, 896x1152)
773 KB
773 KB PNG
>>
>>108604083
> I don't want to "backseat" tdrussell
sure thing ani
>>
File: 00001-3142776389.jpg (1.34 MB, 1728x2880)
1.34 MB
1.34 MB JPG
>>
File: deMA_zi_00020_.png (2.09 MB, 1792x977)
2.09 MB
2.09 MB PNG
>>
File: _AnimaPreview3_00229_.jpg (564 KB, 1160x1696)
564 KB
564 KB JPG
>>
>>108604343
So you want to kill the next general thread schizo? Why do you post here? You're the reason for the whole situation so fuck off
>>
File: o_00241_.png (990 KB, 1280x768)
990 KB
990 KB PNG
>>
https://huggingface.co/baidu/ERNIE-Image
https://huggingface.co/baidu/ERNIE-Image-Turbo

comfy workflow when? it seems it was already patched in but i don't see any nod
>>
File: 1749532566066051.jpg (920 KB, 1040x1520)
920 KB
920 KB JPG
>>
File: o_00244_.png (1.02 MB, 1280x768)
1.02 MB
1.02 MB PNG
>>
File: 1757044410024398.jpg (2.18 MB, 2400x1787)
2.18 MB
2.18 MB JPG
>>108604511
you can download the workflow here
https://github.com/Comfy-Org/workflow_templates/blob/main/templates/image_ernie_image.json
>>
>>108604511
This looks good? Unless the images are aggressively cherry picked, we seem to have a decently capable model with non-slopped look and SOTA text capability at just 8B. Hopefully it isn't slow as shit to run inference. And responds well to training.
>>108604519
Kino gen
>>
File: Ernie.png (1.29 MB, 1376x768)
1.29 MB
1.29 MB PNG
>>108604511
>>108604578
https://huggingface.co/Comfy-Org/ERNIE-Image
ok that's pretty good
>>
File: 1760655397965910.png (3.57 MB, 1484x1676)
3.57 MB
3.57 MB PNG
>>108604511
>>108604578
>>108604636
>ERNIE-Image: Our SFT model, delivers stronger general-purpose capability and instruction fidelity
>ERNIE-Image-Turbo: Our Turbo model, optimized by DMD and RL, achieves faster speed and higher aesthetics
I'm getting mixed signials, which one is the least slopped ultimately?
https://yiyan.baidu.com/blog/posts/ernie-image
>>
>>108604659
>coffee fag is also an aisoyboi
why am i not surprised
>>
>>108604636
downloading
was getting tired of ZIB + ZIT, now it's gonna be EIB + EIT :D
>>
>>108604659
the text seems next level, and it doesn't look really slopped, can't believe Z-image turbo got beaten so quickly lmao (4chan get your shit together why are you bugging now we have a new decent model I wanna discuss about it!!)
>>
File: 1749319865213888.png (1.5 MB, 1200x896)
1.5 MB
1.5 MB PNG
>>108604659
>https://yiyan.baidu.com/blog/posts/ernie-image
Anima btfo!!
>>
Fresh when ready

>>108604726
>>108604726
>>108604726
>>
>>108604511
looks like generic slopped dogshit #5849641 to me
>>
>>108604729
It's not perfect (green eye in the middle for example), but impressive character consistency for a local model doing multiple views gen.
I am still downloading and haven't tested yet so I don't want to jinx it but we might be eating good with this one.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.