/g/ - /ldg/ - Local Diffusion General - Technology

[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]

Board

▼ Settings Mobile Home

/g/ - Technology

Return Catalog Bottom Refresh

[Post a Reply]

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. You may highlight syntax and preserve whitespace by using [code] tags.


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

[Advertise on 4chan]

[Return] [Catalog] [Bottom]

Anonymous
/ldg/ - Local Diffusion Genera(...) 04/30/26(Thu)21:59:31 No.108727613

File: highlights_g_108718184_17(...).jpg (2.56 MB, 4702x3528)

2.56 MB JPG

/ldg/ - Local Diffusion General Anonymous 04/30/26(Thu)21:59:31 No.108727613

Discussion and Development of Local Image and Video Models

Previous: >>108718184

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon

Anonymous
04/30/26(Thu)22:02:46 No.108727622

Anonymous 04/30/26(Thu)22:02:46 No.108727622

File: _AnimaPreview3_00753_.jpg (371 KB, 1248x1608)

371 KB JPG

Anonymous
04/30/26(Thu)22:04:27 No.108727629

Anonymous 04/30/26(Thu)22:04:27 No.108727629

File: IMG_2299.jpg (455 KB, 1179x1912)

455 KB JPG

Anonymous
04/30/26(Thu)22:07:49 No.108727645

Anonymous 04/30/26(Thu)22:07:49 No.108727645

>>108727629
>No 2D

3DPD, dead on arrival.

Anonymous
04/30/26(Thu)22:09:51 No.108727653

Anonymous 04/30/26(Thu)22:09:51 No.108727653

>>108727629
>illegal
>things I didn't like
>no anime
dead

Anonymous
04/30/26(Thu)22:10:33 No.108727656

Anonymous 04/30/26(Thu)22:10:33 No.108727656

>>108727629
>me and a team of people
are claude agents people?

Anonymous
04/30/26(Thu)22:11:43 No.108727663

Anonymous 04/30/26(Thu)22:11:43 No.108727663

>>108727656
>are claude agents people?
+ his daughter/wife from Silly Tavern

Anonymous
04/30/26(Thu)22:13:05 No.108727671

Anonymous 04/30/26(Thu)22:13:05 No.108727671

File: 1758126248178747.png (3.76 MB, 2048x1152)

3.76 MB PNG

Anonymous
04/30/26(Thu)22:18:44 No.108727700

Anonymous 04/30/26(Thu)22:18:44 No.108727700

>mfw Resource news

04/30/2026

>ProcFunc: Function-Oriented Abstractions for Procedural 3D Generation in Python
https://github.com/princeton-vl/procfunc

>Efficient, VRAM-Constrained xLM Inference on Clients
https://github.com/deepshnv/pipeshard-mlsys26-ae

04/29/2026

>Z-Anime | Full Anime Fine-Tune on Z-Image Base
https://huggingface.co/SeeSee21/Z-Anime

>QuantVideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization
https://github.com/svg-project/Quant-VideoGen

>World-R1: Reinforcing 3D Constraints for Text-to-Video Generation
https://github.com/microsoft/World-R1

>Benchmarking Layout-Guided Diffusion Models through Unified Semantic-Spatial Evaluation in Closed and Open Settings
https://github.com/lparolari/cobench

>VibeToken: Scaling 1D Image Tokenizers and Autoregressive Models for Dynamic Resolution Generations
https://github.com/SonyResearch/VibeToken

>OmniVTG: A Large-Scale Dataset and Training Paradigm for Open-World Video Temporal Grounding
https://github.com/oceanflowlab/OmniVTG

>Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models
https://github.com/LeapLabTHU/RvR

>SketchVLM: Vision language models can annotate images to explain thoughts and guide users
https://sketchvlm.github.io

>Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation
https://tuna-ai.org/tuna-2

>Prefill-Time Intervention for Mitigating Hallucination in Large Vision-Language Models
https://github.com/huaiyi66/PTI

04/28/2026

>Illustrious XL & NoobAI-XL Style Explorer
https://github.com/ThetaCursed/Illustrious-NoobAI-Style-Explorer

>LTX Desktop 1.0.5
https://github.com/Lightricks/LTX-Desktop/releases/tag/v1.0.5

>Meta-CoT: Enhancing Granularity and Generalization in Image Editing
https://shiyi-zh0408.github.io/projectpages/Meta-CoT

04/27/2026

>PixlStash 1.1.0 Update
https://pixlstash.dev/whatsnew.html

Anonymous
04/30/26(Thu)22:19:44 No.108727704

Anonymous 04/30/26(Thu)22:19:44 No.108727704

>mfw Research news

04/30/2026

>ACPO: Anchor-Constrained Perceptual Optimization for Diffusion Models with No-Reference Quality Guidance
https://arxiv.org/abs/2604.26348

>SpatialFusion: Endowing Unified Image Generation with Intrinsic 3D Geometric Awareness
https://arxiv.org/abs/2604.26341

>Delta Score Matters! Spatial Adaptive Multi Guidance in Diffusion Models
https://arxiv.org/abs/2604.26503

>Beyond Fixed Formulas: Data-Driven Linear Predictor for Efficient Diffusion Models
https://arxiv.org/abs/2604.26365

>MetaSR: Content-Adaptive Metadata Orchestration for Generative Super-Resolution
https://arxiv.org/abs/2604.26244

>SEAL: Semantic-aware Single-image Sticker Personalization with a Large-scale Sticker-tag Dataset
https://cmlab-korea.github.io/SEAL

>SnapPose3D: Diffusion-Based Single-Frame 2D-to-3D Lifting of Human Poses
https://arxiv.org/abs/2604.26620

>Co-generation of Layout and Shape from Text via Autoregressive 3D Diffusion
https://arxiv.org/abs/2604.16552

>TAP into the Patch Tokens: Leveraging Vision Foundation Model Features for AI-Generated Image Detection
https://arxiv.org/abs/2604.26772

>AnimateAnyMesh++: A Flexible 4D Foundation Model for High-Fidelity Text-Driven Mesh Animation
https://arxiv.org/abs/2604.26917

>Delineating Knowledge Boundaries for Honest Large Vision-Language Models
https://arxiv.org/abs/2604.26419

Anonymous
04/30/26(Thu)22:20:29 No.108727707

Anonymous 04/30/26(Thu)22:20:29 No.108727707

File: _AnimaPreview3_00794_.jpg (320 KB, 1248x1608)

320 KB JPG

Anonymous
04/30/26(Thu)22:22:01 No.108727713

Anonymous 04/30/26(Thu)22:22:01 No.108727713

File: 1_00045_.jpg (3.25 MB, 2392x3870)

3.25 MB JPG

Anonymous
04/30/26(Thu)22:23:39 No.108727718

Anonymous 04/30/26(Thu)22:23:39 No.108727718

By anons hand blessed be this thread

Anonymous
04/30/26(Thu)22:25:21 No.108727723

Anonymous 04/30/26(Thu)22:25:21 No.108727723

File: KleinTrueV2_00527_.png (1.95 MB, 1440x944)

1.95 MB PNG

Anonymous
04/30/26(Thu)22:37:12 No.108727777

Anonymous 04/30/26(Thu)22:37:12 No.108727777

File: 00318-102200667re.png (1.63 MB, 1088x1408)

1.63 MB PNG

>made it in

Anonymous
04/30/26(Thu)22:43:42 No.108727803

Anonymous 04/30/26(Thu)22:43:42 No.108727803

File: KleinTrueV2_00535_.png (2.49 MB, 1440x944)

2.49 MB PNG

Anonymous
04/30/26(Thu)22:44:53 No.108727808

Anonymous 04/30/26(Thu)22:44:53 No.108727808

>>108727723
80s b group with lambos, i'd watch it

Anonymous
04/30/26(Thu)22:50:19 No.108727833

Anonymous 04/30/26(Thu)22:50:19 No.108727833

>>108727723
>>108727803
Do you have lora for each car or does klein know them?

Anonymous
04/30/26(Thu)22:54:08 No.108727854

Anonymous 04/30/26(Thu)22:54:08 No.108727854

>>108727833
Just passing a high res reference image of each car, it's a klein finetune btw.

Anonymous
04/30/26(Thu)23:10:32 No.108727930

Anonymous 04/30/26(Thu)23:10:32 No.108727930

> >108727700
> >108727704
fuck off

Anonymous
04/30/26(Thu)23:21:08 No.108727969

Anonymous 04/30/26(Thu)23:21:08 No.108727969

File: _AnimaPreview3_00886_.jpg (380 KB, 1248x1608)

380 KB JPG

Anonymous
04/30/26(Thu)23:26:33 No.108728001

Anonymous 04/30/26(Thu)23:26:33 No.108728001

File: ComfyUI_00001_.png (1.4 MB, 1024x1024)

1.4 MB PNG

Anonymous
04/30/26(Thu)23:47:33 No.108728070

Anonymous 04/30/26(Thu)23:47:33 No.108728070

File: 00628-947710387-08d6a59f-(...).png (2.75 MB, 1344x1728)

2.75 MB PNG

Anonymous
05/01/26(Fri)00:51:26 No.108728298

Anonymous 05/01/26(Fri)00:51:26 No.108728298

File: _AnimaPreview3_00890_.jpg (316 KB, 1248x1608)

316 KB JPG

Anonymous
05/01/26(Fri)00:56:24 No.108728311

Anonymous 05/01/26(Fri)00:56:24 No.108728311

File: mayli.jpg (800 KB, 2304x1492)

800 KB JPG

Anonymous
05/01/26(Fri)01:00:34 No.108728321

Anonymous 05/01/26(Fri)01:00:34 No.108728321

>>108728311
why?

Anonymous
05/01/26(Fri)01:07:15 No.108728340

Anonymous 05/01/26(Fri)01:07:15 No.108728340

>>108728070
Neat breastplate design.

Anonymous
05/01/26(Fri)01:21:29 No.108728402

Anonymous 05/01/26(Fri)01:21:29 No.108728402

File: ComfyUI_23085.png (2.06 MB, 1200x1600)

2.06 MB PNG

>>108727777
And quads! Time to run to the gas station for a couple of scratch-offs before your luck wears off.

Anonymous
05/01/26(Fri)01:29:09 No.108728438

Anonymous 05/01/26(Fri)01:29:09 No.108728438

File: 00089-2062955554-b8c108a2(...).png (2.25 MB, 1344x1728)

2.25 MB PNG

>>108728340
Thanks

Anonymous
05/01/26(Fri)01:32:36 No.108728457

Anonymous 05/01/26(Fri)01:32:36 No.108728457

File: 00092-3337743569-9d0bbb49(...).png (3.32 MB, 1344x1728)

3.32 MB PNG

Anonymous
05/01/26(Fri)01:43:38 No.108728503

Anonymous 05/01/26(Fri)01:43:38 No.108728503

File: file.png (2 KB, 352x51)

2 KB PNG

44 mentions of /ldg/ in the current /hgg/ thread. Any idea why would someone keep shilling your thread there for weeks?

Anonymous
05/01/26(Fri)01:45:31 No.108728509

Anonymous 05/01/26(Fri)01:45:31 No.108728509

>>108728503
>44 mentions of /ldg/ in the current /hgg/ thread.
kekd
ldg has mind broken some to such a degree. its crazy.

Anonymous
05/01/26(Fri)01:53:17 No.108728539

Anonymous 05/01/26(Fri)01:53:17 No.108728539

>>108728402
ngl those loras literally make me lol

Anonymous
05/01/26(Fri)01:54:43 No.108728545

Anonymous 05/01/26(Fri)01:54:43 No.108728545

is the acestep.cpp guy here? I am not able to get dcw to produce clean output, with base xl.

Tonight, I'll try sft and turbo.

Anonymous
05/01/26(Fri)01:58:54 No.108728565

Anonymous 05/01/26(Fri)01:58:54 No.108728565

File: deCS_anima_00025_.png (2.41 MB, 2048x1117)

2.41 MB PNG

Anonymous
05/01/26(Fri)02:22:18 No.108728653

Anonymous 05/01/26(Fri)02:22:18 No.108728653

>>108727629
Am beta tester, it does anime i2v without issues.

Anonymous
05/01/26(Fri)02:23:51 No.108728662

Anonymous 05/01/26(Fri)02:23:51 No.108728662

>>108728653
proof: i dreamt the output

Anonymous
05/01/26(Fri)02:33:40 No.108728694

Anonymous 05/01/26(Fri)02:33:40 No.108728694

>>108727671
>>108728438
>>108728457
>>108728070
>>108727803
nice

Anonymous
05/01/26(Fri)02:35:21 No.108728699

Anonymous 05/01/26(Fri)02:35:21 No.108728699

>>108727629
125k videos reads like a serious finetune.
Obviously millions would be desirable, or a better base model than LTX (Not that many alternatives exists) but this has a shot at being something cool I guess.
>>108728653
Does it do "her clothes disintegrate and she starts bouncing on cock" slop?
I might finally download LTX if so.

Anonymous
05/01/26(Fri)02:37:07 No.108728701

Anonymous 05/01/26(Fri)02:37:07 No.108728701

File: 4chan_g_animanon-troll-MO.png (116 KB, 1463x615)

116 KB PNG

>>108728503
Smells like Animanon* stirring shit again. He does like >90% of the trolling here, usually anti-Anima.

*: May or may not be the actual Ani.

Anonymous
05/01/26(Fri)02:48:33 No.108728737

Anonymous 05/01/26(Fri)02:48:33 No.108728737

>>108727777
check'd
did you do anything for that font or was it just a consequence of the style?

Anonymous
05/01/26(Fri)02:48:56 No.108728739

Anonymous 05/01/26(Fri)02:48:56 No.108728739

>>108728402
MOAR JENNIFER

Anonymous
05/01/26(Fri)02:51:30 No.108728751

Anonymous 05/01/26(Fri)02:51:30 No.108728751

What cloudkek will never experience:
Anima to sulfur beta
https://files.catbox.moe/qa1rn2.mp4

Anonymous
05/01/26(Fri)02:52:18 No.108728752

Anonymous 05/01/26(Fri)02:52:18 No.108728752

>It's pretty good on realism, though every like 1/3 gens you do get body horror which can be annoying. If you run the model fully undistilled that problem goes down to like 1/8, but regardless.
Well at least he is honest.

Anonymous
05/01/26(Fri)02:54:14 No.108728758

Anonymous 05/01/26(Fri)02:54:14 No.108728758

>>108728751
LTX can do 20 second gens?

Anonymous
05/01/26(Fri)02:55:14 No.108728764

Anonymous 05/01/26(Fri)02:55:14 No.108728764

>>108728752
You can lower a lot using i2v. Most issue are from T2V gen

Anonymous
05/01/26(Fri)02:56:19 No.108728767

Anonymous 05/01/26(Fri)02:56:19 No.108728767

>>108728758
Up to 30 but gen times goes insane higher than 20 in my experience.

Anonymous
05/01/26(Fri)02:56:38 No.108728771

Anonymous 05/01/26(Fri)02:56:38 No.108728771

>>108728758
hardware permitting, but that's generally the limit of consumer cards, you could get more framees by lowering the resolution but that seiously kills quality

Anonymous
05/01/26(Fri)02:57:37 No.108728774

Anonymous 05/01/26(Fri)02:57:37 No.108728774

>>108728402
She looks autistic anon...

Anonymous
05/01/26(Fri)02:59:43 No.108728779

Anonymous 05/01/26(Fri)02:59:43 No.108728779

End of this month is two years of LDG :-)

Anonymous
05/01/26(Fri)03:06:22 No.108728796

Anonymous 05/01/26(Fri)03:06:22 No.108728796

>>108728402
Can we see her fee anon

Anonymous
05/01/26(Fri)03:08:59 No.108728806

Anonymous 05/01/26(Fri)03:08:59 No.108728806

>>108728751
But Apicucks can make catalogs and lame meme...

Anonymous
05/01/26(Fri)03:16:19 No.108728831

Anonymous 05/01/26(Fri)03:16:19 No.108728831

>>108728779
Neat, do you know how to use acestep.cpp's dcw? I'm having to use really low values with base xl.

Anonymous
05/01/26(Fri)03:22:30 No.108728867

Anonymous 05/01/26(Fri)03:22:30 No.108728867

>>108728779
crazy how little local progressed
at least API is welcome here now thanks to comfyui

Anonymous
05/01/26(Fri)03:35:46 No.108728937

Anonymous 05/01/26(Fri)03:35:46 No.108728937

>>108728867
local community is full of envious and bitter people. comfy made the right call when he betrayed local, nobody actually cares about freedom in computing anymore. any attempt to change the status quo is sabotaged and ridiculed

Anonymous
05/01/26(Fri)03:40:42 No.108728962

Anonymous 05/01/26(Fri)03:40:42 No.108728962

Holy sperg out with those previous two posts

Anonymous
05/01/26(Fri)03:45:39 No.108728983

Anonymous 05/01/26(Fri)03:45:39 No.108728983

>>108728962
It's like one troll who has been doing a lot of overtime since GPT Image-2 release.
I wonder what he gets out of it.

Anonymous
05/01/26(Fri)03:46:46 No.108728989

Anonymous 05/01/26(Fri)03:46:46 No.108728989

>>108728774
Oh, she's genuinely very autistic. Big part of her charm, really.

>>108728796
Here's an oldie courtesy of Wan, but her actual feet don't feature heavily in my dataset (just two pics with visible toes). I could potentially revise that though.

Anonymous
05/01/26(Fri)03:48:31 No.108728995

Anonymous 05/01/26(Fri)03:48:31 No.108728995

>>108728989
she shits in her hands and smears it on the walls?
I've dealt with bonafide autistic people before, it's a fucking nightmare. Only the truly divorced from reality could romanticise that

Anonymous
05/01/26(Fri)03:53:07 No.108729008

Anonymous 05/01/26(Fri)03:53:07 No.108729008

>>108728995
When people say "autistic" they mean neurotypical but somewhat socially awkward in an endearing way.
Yes dealing with low functioning autistics (even many high functioning ones) is hell.

Anonymous
05/01/26(Fri)03:54:52 No.108729013

Anonymous 05/01/26(Fri)03:54:52 No.108729013

>>108729008
Read it, he said "very autistic"

Anonymous
05/01/26(Fri)04:06:22 No.108729064

Anonymous 05/01/26(Fri)04:06:22 No.108729064

>>108728751
>short hair
>french
>brown
Into the trash it goes

Anonymous
05/01/26(Fri)04:08:31 No.108729070

Anonymous 05/01/26(Fri)04:08:31 No.108729070

I will be dumpster diving, I guess.

Anonymous
05/01/26(Fri)04:23:28 No.108729132

Anonymous 05/01/26(Fri)04:23:28 No.108729132

File: Jenny Being Weird.webm (3.92 MB, 960x1280)

3.92 MB WEBM

>load up Wan 2.2 after a several months
>lora key not loaded: diffusion_model.blocks.ALL.OF.THEM
Guh, what did Comfy do to these now, do I need some special loader and redo my whole workflow, or what?

>>108728995
>>108729013
BUT! I also didn't say "non-verbal and mentally retarded".

Anonymous
05/01/26(Fri)04:26:33 No.108729148

Anonymous 05/01/26(Fri)04:26:33 No.108729148

>>108728298
Arguably the most impressive line of LoRAs in the history of LoRAs. I kneel.

Anonymous
05/01/26(Fri)04:38:18 No.108729181

Anonymous 05/01/26(Fri)04:38:18 No.108729181

File: AB_Adjust.webm (3.91 MB, 748x800)

3.91 MB WEBM

>>108729132
Oof, 16m25s for this with the broken LoRAs. 7m of that was "Model Initializing". I don't even...

Anonymous
05/01/26(Fri)04:41:05 No.108729194

Anonymous 05/01/26(Fri)04:41:05 No.108729194

>>108729148
Latest one turned out really good

>>108729181
Just saw Escape from New York. Carpenter was one lucky dude

Anonymous
05/01/26(Fri)04:44:44 No.108729202

Anonymous 05/01/26(Fri)04:44:44 No.108729202

>>108728503
35 stars

Anonymous
05/01/26(Fri)04:46:17 No.108729208

Anonymous 05/01/26(Fri)04:46:17 No.108729208

>>108728653
When you say anime are you refering to actual anime or 2.5D slop like >>108728751

Anonymous
05/01/26(Fri)04:48:24 No.108729219

Anonymous 05/01/26(Fri)04:48:24 No.108729219

>>108729202
unironically hope you blow your brains out with a shotgun as soon as possible. in 3 years you've been obsessing about this hobby you contributed nothing to the community, in fact you only ever made it worse. i can guarantee that everyone hates your guts irl as well

Anonymous
05/01/26(Fri)04:48:40 No.108729221

Anonymous 05/01/26(Fri)04:48:40 No.108729221

I'm learing ace step.

Anonymous
05/01/26(Fri)04:57:16 No.108729264

Anonymous 05/01/26(Fri)04:57:16 No.108729264

haha melty

Anonymous
05/01/26(Fri)04:59:19 No.108729273

Anonymous 05/01/26(Fri)04:59:19 No.108729273

>>108729221
Is it Somali friendly

Anonymous
05/01/26(Fri)04:59:51 No.108729276

Anonymous 05/01/26(Fri)04:59:51 No.108729276

>>108729273
yes very quality

Anonymous
05/01/26(Fri)05:18:53 No.108729353

Anonymous 05/01/26(Fri)05:18:53 No.108729353

acestep in 500 steps, yes this will be the gen that really starts the local ai music revolution!!!

When it finishes '-.-

Anonymous
05/01/26(Fri)05:57:27 No.108729492

Anonymous 05/01/26(Fri)05:57:27 No.108729492

File: AB_Adjust_2.webm (3.91 MB, 750x800)

3.91 MB WEBM

>>108729181
>fixed the LoRA problem (one of them defaulted and was pointing to nothing)
>same 960x1024 res and prompt
>25m gen time
Goddamn, whatever this "Model Initializing" is doing it now does it four times per Wan 2.2 gen (3min, 6min, 4min, 6min). This same workflow could do higher resolutions in about 5min total back in the day. It barely uses my GPU now too, it just spikes up and down and the 3D portion only fills up a quarter of the way with VRAM usage topping out at 22GB (resting at about 17-18GB for most of it). It's all fucked up...

I had no idea it got this fucking shitty for video (LTX isn't as slow, but the output is also ass) with all their worthless memory "optimizations". I'm gonna have to setup and freeze an old version just for Wan because this shit as it currently stands is totally unusable.

>>108729353
>500 steps
More steps always seems to help audio diffusion (I use the full 100 with VibeVoice for instance), but 500 seems like overkill.

Anonymous
05/01/26(Fri)06:38:22 No.108729634

Anonymous 05/01/26(Fri)06:38:22 No.108729634

>>108728402
idk what the story is with this character but face looks like my sister when she was younger and it creeps me out a bit.

Anonymous
05/01/26(Fri)06:39:10 No.108729638

Anonymous 05/01/26(Fri)06:39:10 No.108729638

>>108728503
Only you think that an anon from ldg is gonna shill their general indiscriminately and namedrop it. Probably someone trying to make ldg look bad.

Anonymous
05/01/26(Fri)06:43:22 No.108729656

Anonymous 05/01/26(Fri)06:43:22 No.108729656

File: 1773359657745467.png (3.58 MB, 2048x1152)

3.58 MB PNG

Anonymous
05/01/26(Fri)06:44:54 No.108729662

Anonymous 05/01/26(Fri)06:44:54 No.108729662

>>108728758
i can do up to 1 minute on my 12gb card but the problem with ltx is that it was trained on 5 second clips so the further that you go out of bounds, the more stuttering you get. i find it is better to keep it at 5 seconds but do it at the highest FPS that you can go, then just keep extending the clips forever since it degrades slower when the frame rate is higher

Anonymous
05/01/26(Fri)06:49:34 No.108729681

Anonymous 05/01/26(Fri)06:49:34 No.108729681

>>108729656
Local?

Anonymous
05/01/26(Fri)06:58:48 No.108729722

Anonymous 05/01/26(Fri)06:58:48 No.108729722

File: image-31.png (50 KB, 990x226)

50 KB PNG

...

Anonymous
05/01/26(Fri)07:04:12 No.108729743

Anonymous 05/01/26(Fri)07:04:12 No.108729743

>>108729722
This is related to what? Missing context

Anonymous
05/01/26(Fri)07:11:20 No.108729768

Anonymous 05/01/26(Fri)07:11:20 No.108729768

File: ythfd.png (1.67 MB, 816x1952)

1.67 MB PNG

Anonymous
05/01/26(Fri)07:11:43 No.108729771

Anonymous 05/01/26(Fri)07:11:43 No.108729771

>>108729743
Context window limit reached

Anonymous
05/01/26(Fri)07:22:13 No.108729819

Anonymous 05/01/26(Fri)07:22:13 No.108729819

>>108729722
SaaS discussion: upgrades, initiatives, discoveries, and research that lead to factual improvements, boosting user experience, quality and speed, SOTA

Local discussion: money, "we give money to him but only a little guys, don't worry", Python, making the user experience worse, DOA models, more money talk, subjetive improvements, 1:2 fixes to bugs ratio, vibecoding, chinks&jeets

Anonymous
05/01/26(Fri)07:26:14 No.108729841

Anonymous 05/01/26(Fri)07:26:14 No.108729841

>>108729819
Local talk: weekly types of CFG discovered that shows opinionated prompt understanding

Anonymous
05/01/26(Fri)07:27:12 No.108729846

Anonymous 05/01/26(Fri)07:27:12 No.108729846

>>108729819
don't mention sora

Anonymous
05/01/26(Fri)07:31:57 No.108729856

Anonymous 05/01/26(Fri)07:31:57 No.108729856

>haven't proompted for months
>boot up
>my old wf from January doesn't even start ('output error' or something), even though it worked before and I haven't updooted since then
>say fuck it and pull
>not only does it work now, but the update didn't break a single node
What kind of black magic is that? Is comfy /based/ again?

Anonymous
05/01/26(Fri)07:33:38 No.108729869

Anonymous 05/01/26(Fri)07:33:38 No.108729869

>>108728298
Anywhere I can get this lora?

Anonymous
05/01/26(Fri)07:35:40 No.108729873

Anonymous 05/01/26(Fri)07:35:40 No.108729873

>>108729662
Thanks anon. If it is not too much trouble:
Which 12gb card, what are the generation speeds like and how much system memory do you have? Also are you running bf16, fp8, int8, q8 or some other quant?
I wonder if I should finally try LTX with my 3060 and 32gb system memory before that finetune releases.

Anonymous
05/01/26(Fri)07:40:33 No.108729899

Anonymous 05/01/26(Fri)07:40:33 No.108729899

>>108729873
3080ti and 32gb and int8 distilled. it takes 5 minutes per second of 4k resolution 24fps, so you can imagine that it's quite good if you want to do low resolution and then process it afterwards

Anonymous
05/01/26(Fri)07:42:02 No.108729902

Anonymous 05/01/26(Fri)07:42:02 No.108729902

>do celeb lora
>result is meh
>use photo as base for img2img
>likness is perfect
explain this to a brainlet

Anonymous
05/01/26(Fri)07:45:44 No.108729916

Anonymous 05/01/26(Fri)07:45:44 No.108729916

>>108729902
undertrained lora

Anonymous
05/01/26(Fri)07:55:13 No.108729959

Anonymous 05/01/26(Fri)07:55:13 No.108729959

>>108729869
https://civitai.red/models/2280663?modelVersionId=2908963

Anonymous
05/01/26(Fri)08:02:20 No.108729986

Anonymous 05/01/26(Fri)08:02:20 No.108729986

>>108729959
>.red

Anonymous
05/01/26(Fri)08:19:31 No.108730061

Anonymous 05/01/26(Fri)08:19:31 No.108730061

>>108729899
Thanks for the response anon. That's encouraging.
>>108729902
Klein? It has bad facial likeness in general.

Anonymous
05/01/26(Fri)08:23:45 No.108730078

Anonymous 05/01/26(Fri)08:23:45 No.108730078

>gemma 4 as prompt enhancer for z image base
it's good. anon should try it

Anonymous
05/01/26(Fri)08:24:25 No.108730083

Anonymous 05/01/26(Fri)08:24:25 No.108730083

>>108730078
give prompt

Anonymous
05/01/26(Fri)08:28:23 No.108730093

Anonymous 05/01/26(Fri)08:28:23 No.108730093

>>108730083
https://huggingface.co/spaces/Tongyi-MAI/Z-Image-Turbo/blob/main/pe.py

Anonymous
05/01/26(Fri)08:33:42 No.108730122

Anonymous 05/01/26(Fri)08:33:42 No.108730122

fartfags >>> footfags

Anonymous
05/01/26(Fri)09:03:57 No.108730282

Anonymous 05/01/26(Fri)09:03:57 No.108730282

what do you use to organize and apply your LoRAs in comfy?

Anonymous
05/01/26(Fri)09:11:38 No.108730325

Anonymous 05/01/26(Fri)09:11:38 No.108730325

File: ComfyUI_01785_.jpg (914 KB, 2304x1792)

914 KB JPG

>>108728321
I thought it was funny. Is it too mean? Originally was gonna do one with her eating popcorn watching herself.

Anonymous
05/01/26(Fri)09:14:42 No.108730343

Anonymous 05/01/26(Fri)09:14:42 No.108730343

>>108730325
Nono I was just surprised at seeing her. I'm also pretty sure 18 year old Mayli would have loved this

Anonymous
05/01/26(Fri)09:33:44 No.108730437

Anonymous 05/01/26(Fri)09:33:44 No.108730437

File: 1763464683290271.png (1.93 MB, 1023x1537)

1.93 MB PNG

>>108727629
>look my new model saar
>join my discord saar
this jeet just spent 8k to make a worse version of ltx 2.3
everyone laugh at him
https://www.reddit.com/r/StableDiffusion/comments/1t0auqh/comment/oj7txk3/

Anonymous
05/01/26(Fri)09:43:34 No.108730489

Anonymous 05/01/26(Fri)09:43:34 No.108730489

File: Cynthia_00919_.png (988 KB, 896x1152)

988 KB PNG

Anonymous
05/01/26(Fri)09:44:35 No.108730497

Anonymous 05/01/26(Fri)09:44:35 No.108730497

File: shauna_monogatari_.png (845 KB, 1152x896)

845 KB PNG

Anonymous
05/01/26(Fri)09:46:48 No.108730509

Anonymous 05/01/26(Fri)09:46:48 No.108730509

File: Dawn_Bathing.png (866 KB, 896x1152)

866 KB PNG

Anonymous
05/01/26(Fri)09:49:40 No.108730519

Anonymous 05/01/26(Fri)09:49:40 No.108730519

>>108728989
Kino... Sad we lost focus on her face

Anonymous
05/01/26(Fri)09:54:33 No.108730551

Anonymous 05/01/26(Fri)09:54:33 No.108730551

>>108730437
Worse for pron? You have no idea what you're talking about

Anonymous
05/01/26(Fri)09:56:33 No.108730564

Anonymous 05/01/26(Fri)09:56:33 No.108730564

>>108730551
just use a lora lol

Anonymous
05/01/26(Fri)10:10:01 No.108730644

Anonymous 05/01/26(Fri)10:10:01 No.108730644

So what's the current local meta way to upscale images?
I am asking for real images, not gens.
Like if I have a photo from 2000s, what can I run that will still make it look like photo from 2000s, just higher res?
Most upscale GANs I know either excessively blur the image of sharpen the artifacts to an unnatural degree.

Anonymous
05/01/26(Fri)10:21:37 No.108730719

Anonymous 05/01/26(Fri)10:21:37 No.108730719

>>108728737
Dunno I just put "huge stylized impact font text" but it's probably a combination of both.

Anonymous
05/01/26(Fri)10:23:28 No.108730732

Anonymous 05/01/26(Fri)10:23:28 No.108730732

>>108730644
you could try seedvr2, YMMV if that is "accurate" enough for you. like GAN it obviously has to fill in the additional pixels

Anonymous
05/01/26(Fri)10:31:32 No.108730778

Anonymous 05/01/26(Fri)10:31:32 No.108730778

>>108730732
Workflow? How much does 3B vs 7B matter?

Anonymous
05/01/26(Fri)10:34:11 No.108730796

Anonymous 05/01/26(Fri)10:34:11 No.108730796

>>108729656
scissor time

Anonymous
05/01/26(Fri)10:34:25 No.108730798

Anonymous 05/01/26(Fri)10:34:25 No.108730798

>>108730778
https://github.com/numz/ComfyUI-SeedVR2_VideoUpscaler/tree/main/example_workflows don't have anything else at hand rn

dunno, i just used 7b

Anonymous
05/01/26(Fri)10:36:37 No.108730811

Anonymous 05/01/26(Fri)10:36:37 No.108730811

>>108730798
Ok thanks I will see if it works well enough.

Anonymous
05/01/26(Fri)10:45:59 No.108730870

Anonymous 05/01/26(Fri)10:45:59 No.108730870

File: Jenny Droog.webm (3.92 MB, 960x1280)

3.92 MB WEBM

>>108730437
Any leaked clips yet? Or are we going to have to wait until the public unveiling?

>>108730519
Oh wow, it got deleted... you know, I think that might be a first. It's never been deleted over on >>>/tv/ that I can recall.

Anonymous
05/01/26(Fri)10:59:14 No.108730948

Anonymous 05/01/26(Fri)10:59:14 No.108730948

>>108730870
>Any leaked clips yet?
nothing worth leaking, its just run of the mill body horror slop

Anonymous
05/01/26(Fri)11:21:44 No.108731074

Anonymous 05/01/26(Fri)11:21:44 No.108731074

>>108730870
jebby stuffing her foot in my mouth and pouring that milk down her leg and into my mouth via her foot

Anonymous
05/01/26(Fri)11:23:28 No.108731087

Anonymous 05/01/26(Fri)11:23:28 No.108731087

>>108730078
And who has 32gb vram to run gemma?

Anonymous
05/01/26(Fri)11:26:40 No.108731108

Anonymous 05/01/26(Fri)11:26:40 No.108731108

>>108731087
Gee gee uuufs!

Anonymous
05/01/26(Fri)11:31:05 No.108731136

Anonymous 05/01/26(Fri)11:31:05 No.108731136

>>108731087
Q5 with enough context fits in 20GB. Or just offload to CPU. You don't need to keep it loaded after inference.

Anonymous
05/01/26(Fri)11:34:16 No.108731154

Anonymous 05/01/26(Fri)11:34:16 No.108731154

>>108730732
>>108730811
I am finding it to be unusable for any real content.
Way too many changes to small details for me.
Extra storage saved on my drive I suppose.

Anonymous
05/01/26(Fri)11:34:57 No.108731157

Anonymous 05/01/26(Fri)11:34:57 No.108731157

>>108727629
Why not just tune wan or ltx?

Anonymous
05/01/26(Fri)11:41:36 No.108731197

Anonymous 05/01/26(Fri)11:41:36 No.108731197

>>108731154
And I don't mean super small and irrelevant shit neither.
Legible text on book covers turn into AI slop gibberish.

Anonymous
05/01/26(Fri)11:55:04 No.108731277

Anonymous 05/01/26(Fri)11:55:04 No.108731277

File: Flux2-Klein_00159_.jpg (547 KB, 2048x2048)

547 KB JPG

According to ACEStep's dev, 30% of ACEStep was trained using LM codes. 50% was trained without it. As he explains

>The purpose of this design is to give you that udio-like experience.
>Because diffusion models are really good at creation.
>LM, on the other hand, easily falls into overfitting, though it has better prompt adherence and higher accuracy.

>Our trade-off is to make diffusion training harder and less dependent on LM codes.
Otherwise, it would lose creativity and degenerate into a mere renderer that decodes codes into latents, rather than having its own personality.

That makes perfect sense. I've been getting "boring outputs" that while accurate aren't on par with Udio, so I thought it's a creativity/RLHF issue. But to my surprise turning the LM off has pleasing results on XL.

This is a LoRA I'm testing, it's about 75% done. But anyways, the result is very high quality, that I couldn't get out of the regular LM. As with image models, the DiT is very creative, and likely the only way to get Udio-like creativity. The cost is just a bit of prompt following (still good lyrics), with seed variance not an issue.

First try of DiT only output on a WiP Kawaii Super Bass LoRA
https://vocaroo.com/1hxGbL1pBTaI

After hearing that, I was certain. Udio is a diffusion model. Which is why it's so damn good and creative. To squeeze even more creativity out of ACEStep, turning the LM off helps bridge that gap even further.

>>108728545
>I am not able to get dcw to produce clean output, with base xl.

DCW has different values that work with it according to the creator. I have not tried the Base XL model, I reserve it exclusively for LoRA training. My advice is to not use DCW with SFT, you will get the most advantages out of the Turbo model, which is more creative than SFT. But if you must, the 0.05 DCW values given by the creator on Turbo have to be lowered on SFT (and possibly also on Base) for it to work properly. Make sure you pair DCW with Scrag's custom VAE.

Anonymous
05/01/26(Fri)11:56:56 No.108731288

Anonymous 05/01/26(Fri)11:56:56 No.108731288

never touched AI before. I just want to make 5 second loops of secks out of the blender 3D characters I make for my own personal enjoyment. any recs on what I should use?

Anonymous
05/01/26(Fri)11:58:34 No.108731302

Anonymous 05/01/26(Fri)11:58:34 No.108731302

>>108731288
Wan 2.2 I2V + some sex lora.
Render characters into appropriate sex position beforehand.

Anonymous
05/01/26(Fri)11:58:58 No.108731304

Anonymous 05/01/26(Fri)11:58:58 No.108731304

>>108731288
Have you tried Blender?

Anonymous
05/01/26(Fri)12:01:31 No.108731317

Anonymous 05/01/26(Fri)12:01:31 No.108731317

>>108731154
>>108731197
Honestly doesn't feel great for artistic stuff neither. I gave it an image where the character was crying and it decided to wipe off the tears in upscale, alongside other changes.
Maybe I am missing something but yeah I think I am done.
I am not trying to be combative with the guy who initially suggested it btw, I appreciate the help. But these were my impressions.

Anonymous
05/01/26(Fri)12:06:05 No.108731347

Anonymous 05/01/26(Fri)12:06:05 No.108731347

>>108731302
thanks, I'll check that out. for a sex lora, should I be looking for one specifically for 3D or does that not matter?

>>108731304
>AI general
>"have you tried not using AI?"
animation is an enormous time sink. just to get down the fundamentals takes a long time, let alone getting it to look good. I'd rather craft characters and make environments. I just wanna see it in action as I've seen impressive work from r34 """artists""" who take 3D CG images and make great looking AI animations from them

Anonymous
05/01/26(Fri)12:11:43 No.108731376

Anonymous 05/01/26(Fri)12:11:43 No.108731376

>>108731347
>does that not matter?
Shouldn't matter if it is trained well but I don't think you are gonna find 3d specific ones anyway.
Speaking of which sex loras for image to video models get nuked from most places under deepfake rules.
They are difficult to find and you need to get them from some place like civitaiarchive (poorly maintained but most accessible)

Anonymous
05/01/26(Fri)12:11:58 No.108731378

Anonymous 05/01/26(Fri)12:11:58 No.108731378

File: dude weed lmao.png (1.54 MB, 2048x1100)

1.54 MB PNG

I figured I'd test the Anima RL Lora with a simple prompt and no artist tag, since russell said it's trained to emulate high danbooru scores. The effect isn't as strong as I expected though.

Anonymous
05/01/26(Fri)12:13:18 No.108731393

Anonymous 05/01/26(Fri)12:13:18 No.108731393

what do you use to organize or apply your LoRAs in comfy? or do you just download and chain them manually?

Anonymous
05/01/26(Fri)12:16:14 No.108731407

Anonymous 05/01/26(Fri)12:16:14 No.108731407

>>108731393
rgthree power loader

Anonymous
05/01/26(Fri)12:17:58 No.108731420

Anonymous 05/01/26(Fri)12:17:58 No.108731420

>>108731393
I use ComfyUI-Lora-Manager.
It runs alongside comfy as an extension. It's very nice you can save recipes and keep your stuff visually organized.

Anonymous
05/01/26(Fri)12:19:54 No.108731433

Anonymous 05/01/26(Fri)12:19:54 No.108731433

File: Anima0.3+RL+Turbo_00006_c(...).png (2.28 MB, 2048x1100)

2.28 MB PNG

>>108731378
Here's what happens if you throw in the Turbo Lora and use 8 steps, cfg 1.

Anonymous
05/01/26(Fri)12:22:19 No.108731448

Anonymous 05/01/26(Fri)12:22:19 No.108731448

>>108731393
I chain them manually because I rarely use more than 1 or 2 loras.

Anonymous
05/01/26(Fri)13:25:53 No.108731822

Anonymous 05/01/26(Fri)13:25:53 No.108731822

cozy

Anonymous
05/01/26(Fri)13:28:50 No.108731845

Anonymous 05/01/26(Fri)13:28:50 No.108731845

File: ComfyUI_Anima_00057_.png (1.24 MB, 1024x1024)

1.24 MB PNG

>>108731277
>First try of DiT only output on a WiP Kawaii Super Bass LoRA

Strategy applied to a Miku LoRA I trained. This is the first result. Keep in mind it was not trained at all on this type of cutesy/techno prompt, it's generalizing the genre (what it saw was exclusively some DECO-27 songs).

https://vocaroo.com/1gonlSXjg3b3

This musicality is insane. I know it's a LoRA, but I don't wanna hear anyone telling me Suno v5.5 is better kek (minus out of the box world knowledge, they're about on par now).

For comparison, not same prompt, but before turning off the LM, this was the best sort of output I was getting from this Miku LoRA.

https://vocaroo.com/1hANeTMzTMF1

This was not a first try, took lots of nitpicking, and it's quite meh, I couldn't get it to produce music exactly how I wanted and it was poorly composed, every other output with the LM was also meh.

Lastly, because I want the difference in musicality to be clear, this is the equivalent of that very first prompt with the LM turned on (5Hz LM codes not removed on ACEStep cpp). Though it's not the same seed, you get the point. Note I also did not bother to master this awful output (and I also noticed with LM off the output quality pre-mastering is higher, though it might be RNG).

https://vocaroo.com/14UM0LfFwjfl

I now have to retest all my "failed" LoRAs kek. If this is v1.5, v2 with its enhanced world knowledge is gonna be some fun stuff.

Anonymous
05/01/26(Fri)13:32:59 No.108731872

Anonymous 05/01/26(Fri)13:32:59 No.108731872

>>108731845
No offence but suno is way better

Anonymous
05/01/26(Fri)13:35:59 No.108731887

Anonymous 05/01/26(Fri)13:35:59 No.108731887

>>108731872
>the closed model with loads of funding and money is better than free thing
wow, I'm SHOCKED. utterly SHOCKED!

Anonymous
05/01/26(Fri)13:38:14 No.108731898

Anonymous 05/01/26(Fri)13:38:14 No.108731898

>>108731845
More interested in an ACE-Step lora training writeup.

Anonymous
05/01/26(Fri)13:38:25 No.108731902

Anonymous 05/01/26(Fri)13:38:25 No.108731902

File: 05821-2606046613.png (795 KB, 896x1152)

795 KB PNG

Can I train Anima Loras with AI toolkit? what architecture?

Anonymous
05/01/26(Fri)13:43:27 No.108731936

Anonymous 05/01/26(Fri)13:43:27 No.108731936

>>108731902
iirc ostris said there is a license conflict so he can't implement anima.
just use the standalone trainer
https://github.com/gazingstars123/Anima-Standalone-Trainer

Anonymous
05/01/26(Fri)13:44:56 No.108731944

Anonymous 05/01/26(Fri)13:44:56 No.108731944

>>108731902
No
https://github.com/ostris/ai-toolkit/issues/791

Anonymous
05/01/26(Fri)13:47:32 No.108731965

Anonymous 05/01/26(Fri)13:47:32 No.108731965

>>108731872
Suno peaked at v4.5. It hasn't improved much since then, in fact at v5.5 the audio quality has regressed. ACEStep 1.5 XL has already caught up in musicality.

Anonymous
05/01/26(Fri)13:49:49 No.108731985

Anonymous 05/01/26(Fri)13:49:49 No.108731985

>>108731936
Well that's too bad. Thanks, i'll look into standalone.
But isn't Anima basically based on Qwen? or was it Flux, and if so, couldn't you technically just train on those architectures? Well I guess not, I suppose, is the correct answer, Otherwhise Ostris wouldn't specifically mention it. But I kind of feel like I want to experiment with it none the less.

Anonymous
05/01/26(Fri)13:52:41 No.108732009

Anonymous 05/01/26(Fri)13:52:41 No.108732009

>>108731944
>https://github.com/ostris/ai-toolkit/issues/791
>>108731985
nevermind, so much for that bright idea.

Anonymous
05/01/26(Fri)13:53:09 No.108732013

Anonymous 05/01/26(Fri)13:53:09 No.108732013

>>108727700
>>108727704
thanks!

Anonymous
05/01/26(Fri)13:57:26 No.108732051

Anonymous 05/01/26(Fri)13:57:26 No.108732051

>>108732009
https://github.com/gazingstars123/Anima-Standalone-Trainer

Anonymous
05/01/26(Fri)13:57:27 No.108732052

Anonymous 05/01/26(Fri)13:57:27 No.108732052

>>108731985
Anima is Cosmos base

Anonymous
05/01/26(Fri)13:57:56 No.108732054

Anonymous 05/01/26(Fri)13:57:56 No.108732054

>>108732009
it's a very easy model to train either way.

Anonymous
05/01/26(Fri)14:03:54 No.108732096

Anonymous 05/01/26(Fri)14:03:54 No.108732096

File: ss_20260501_130046.png (393 KB, 1412x883)

393 KB PNG

Am I missing anything in my program saars?

Anonymous
05/01/26(Fri)14:04:46 No.108732100

Anonymous 05/01/26(Fri)14:04:46 No.108732100

File: 05789-3192374835.png (997 KB, 896x1152)

997 KB PNG

>>108732052
>>108732054
Thanks!

Anonymous
05/01/26(Fri)14:10:27 No.108732140

Anonymous 05/01/26(Fri)14:10:27 No.108732140

>>108732096
temp and other sampler settings?

Anonymous
05/01/26(Fri)14:11:24 No.108732149

Anonymous 05/01/26(Fri)14:11:24 No.108732149

>>108732096
Is it not inheriting your global app theme? Or are your window decorations just different? Other than that looks kewl anon

Anonymous
05/01/26(Fri)14:12:54 No.108732160

Anonymous 05/01/26(Fri)14:12:54 No.108732160

>>108732149
it's not inheriting. I probably should make it do that since it's just QT. thanks.
>>108732140
true!

Anonymous
05/01/26(Fri)14:17:07 No.108732182

Anonymous 05/01/26(Fri)14:17:07 No.108732182

We need an anima swastika lora, and a separate Adolf Hitler lora. So we can fight those bad guys.

Anonymous
05/01/26(Fri)14:17:46 No.108732193

Anonymous 05/01/26(Fri)14:17:46 No.108732193

>>108732096
Is this for anima? A short natural language paragraph will probably perform better than pseudo-tags. Not that I think it will matter too much for a simpler lora like yours.
Jessica is also a bit too generic jessica (rick and morty) might perform better.

Anonymous
05/01/26(Fri)14:20:18 No.108732209

Anonymous 05/01/26(Fri)14:20:18 No.108732209

>>108732193
Oh wait you are asking for the software.
Listing tags by frequency, ability to remove a single undesired tag from all captions with a simple click (blacklisting), automatically adding a tag to all captions.

Anonymous
05/01/26(Fri)14:22:32 No.108732229

Anonymous 05/01/26(Fri)14:22:32 No.108732229

>>108732193
she's the only person without a real last name for some reason but yeah I'm just testing right now. every image tagger i've used has been too shit/bloated or won't let you change out to whatever model you want so I made this. thanks for the advice, though.

>>108732209
>automatically adding a tag to all captions.
I have that with tag prefix or do you mean something else? also thanks again.

Anonymous
05/01/26(Fri)14:23:41 No.108732234

Anonymous 05/01/26(Fri)14:23:41 No.108732234

>>108732193
>A short natural language paragraph will probably perform better than pseudo-tags
Natural language + booru tags or alternating tags between pure tags and natural description is the way to go. Max the batch size before anything.

Anonymous
05/01/26(Fri)14:28:21 No.108732266

Anonymous 05/01/26(Fri)14:28:21 No.108732266

>>108731902
use the easyscripts fork
https://github.com/67372a/LoRA_Easy_Training_Scripts/tree/refresh

Anonymous
05/01/26(Fri)14:28:33 No.108732272

Anonymous 05/01/26(Fri)14:28:33 No.108732272

>>108732096
Next step: vibe out inference, booru, and gen history support. Then you'll never have to leave the program.

Anonymous
05/01/26(Fri)14:29:37 No.108732282

Anonymous 05/01/26(Fri)14:29:37 No.108732282

>>108732096
>>108732272
*and support for whatever training scripts you use :3

Anonymous
05/01/26(Fri)14:30:52 No.108732286

Anonymous 05/01/26(Fri)14:30:52 No.108732286

>>108732229
>do you mean something else?
Besides prefix. Can't think of a good example right now but let's say you noticed that your LLM hasn't added large breasts to any of the captions and you want that captioned. A function to easily add that to all captions (if it doesn't exists already) is useful in some cases.
>>108732234
Did you actual test this? Not that I did neither, but I've heard that just natural language performs best and the official lora only had natural language captions.
Btw:
>booru tags
Your LLM isn't exactly outputting proper booru tags anima was trained on. I think the TE inside of it is smart enough that this shouldn't matter too much, but I feel the need to emphasize what you are doing is different than running WD14 or any other proper booru tagger.

Anonymous
05/01/26(Fri)14:31:25 No.108732289

Anonymous 05/01/26(Fri)14:31:25 No.108732289

>>108732282
>>108732272
now that's overscope bloat. no thanks

Anonymous
05/01/26(Fri)14:36:21 No.108732317

Anonymous 05/01/26(Fri)14:36:21 No.108732317

>>108732266
this looks kinda cool. and so well documented. thanks.

Anonymous
05/01/26(Fri)14:37:38 No.108732328

Anonymous 05/01/26(Fri)14:37:38 No.108732328

trying to learn how to use dcw with ace step base xl

so far, it's worse with it, but I'm using very small amounts and may figure out good settings.

Anonymous
05/01/26(Fri)14:39:09 No.108732340

Anonymous 05/01/26(Fri)14:39:09 No.108732340

>>108732286
>Did you actual test this?
yes

Anonymous
05/01/26(Fri)14:39:57 No.108732343

Anonymous 05/01/26(Fri)14:39:57 No.108732343

just use diffusion-pipe. youre not a brainlet... right anon?

Anonymous
05/01/26(Fri)14:40:51 No.108732350

Anonymous 05/01/26(Fri)14:40:51 No.108732350

>>108729181
why such a framerate. just quadruple it with lossless scaling

Anonymous
05/01/26(Fri)14:41:28 No.108732355

Anonymous 05/01/26(Fri)14:41:28 No.108732355

>>108732343
im retarded sorry man, i need everything in a colorfui gui

Anonymous
05/01/26(Fri)14:45:07 No.108732375

Anonymous 05/01/26(Fri)14:45:07 No.108732375

>>108732343
>just use diffusion-pipe. youre not a brainlet... right anon?
I have big brian and I'm on windows, so no ty

Anonymous
05/01/26(Fri)14:53:32 No.108732422

Anonymous 05/01/26(Fri)14:53:32 No.108732422

Any news in Anima ControlNet or Anima edit?

Anonymous
05/01/26(Fri)14:53:57 No.108732426

Anonymous 05/01/26(Fri)14:53:57 No.108732426

>>108732422
base model isnt even done yet bro

Anonymous
05/01/26(Fri)14:54:22 No.108732430

Anonymous 05/01/26(Fri)14:54:22 No.108732430

File: 00226-2339287984-37ba0ae1(...).png (2.44 MB, 1344x1728)

2.44 MB PNG

Anonymous
05/01/26(Fri)14:55:39 No.108732434

Anonymous 05/01/26(Fri)14:55:39 No.108732434

>>108732426
Fuck but I need something to add detial to background and textures with good color quality without altering the base image, Klein does not respect style and SDXL ControlNet is washed out

Anonymous
05/01/26(Fri)14:56:21 No.108732438

Anonymous 05/01/26(Fri)14:56:21 No.108732438

>>108732434
sounds like a problem for YOU to solve

Anonymous
05/01/26(Fri)14:57:05 No.108732440

Anonymous 05/01/26(Fri)14:57:05 No.108732440

>>108731154
could be. maybe you just need to do bicubic or w/e no actual change upscales for now and call it a day.

Anonymous
05/01/26(Fri)14:57:45 No.108732445

Anonymous 05/01/26(Fri)14:57:45 No.108732445

>>108732438
Wtf? Give me the solution, bro, I don’t have time for jokes

Anonymous
05/01/26(Fri)15:02:01 No.108732472

Anonymous 05/01/26(Fri)15:02:01 No.108732472

Anima controlnet status? fuck tdrusell stop sharing experiment slop loras and make a controlnet model bro, deph and canny first

Anonymous
05/01/26(Fri)15:03:27 No.108732484

Anonymous 05/01/26(Fri)15:03:27 No.108732484

File: FluxKlein4BDistilled_Outp(...).jpg (2.36 MB, 1664x2496)

2.36 MB JPG

Anonymous
05/01/26(Fri)15:04:25 No.108732491

Anonymous 05/01/26(Fri)15:04:25 No.108732491

>>108732434
How exactly are you proooompting Klein for "improve but no change"? It can do that fairly well.

Anonymous
05/01/26(Fri)15:05:20 No.108732500

Anonymous 05/01/26(Fri)15:05:20 No.108732500

File: _AnimaPreview3_00092_.jpg (440 KB, 1248x1608)

440 KB JPG

Anonymous
05/01/26(Fri)15:06:00 No.108732507

Anonymous 05/01/26(Fri)15:06:00 No.108732507

>>108732422
>>108732472
Unofficial Controlnet LLLite released for anima. Google it.

Anonymous
05/01/26(Fri)15:06:25 No.108732509

Anonymous 05/01/26(Fri)15:06:25 No.108732509

>>108731985
No, Anima is exactly Cosmos-2 2B modified to use Qwen3 0.6B as a text encoder instead of T5-XXL, and then fine tuned on a few million images.

Anonymous
05/01/26(Fri)15:12:42 No.108732557

Anonymous 05/01/26(Fri)15:12:42 No.108732557

Any of you have a good captioning prompt for Gemma 4?

Anonymous
05/01/26(Fri)15:16:48 No.108732577

Anonymous 05/01/26(Fri)15:16:48 No.108732577

File: chun.jpg (334 KB, 864x1216)

334 KB JPG

i am bored

Anonymous
05/01/26(Fri)15:19:50 No.108732595

Anonymous 05/01/26(Fri)15:19:50 No.108732595

>>108732577
Try genning arm amputees.

Anonymous
05/01/26(Fri)15:28:17 No.108732644

Anonymous 05/01/26(Fri)15:28:17 No.108732644

>>108732557
https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Just build a template in extra option and copy it to gemma?

Anonymous
05/01/26(Fri)15:29:56 No.108732658

Anonymous 05/01/26(Fri)15:29:56 No.108732658

Hello? Where's the arm amputee lora with trigger words for different types of amputation?

Anonymous
05/01/26(Fri)15:31:34 No.108732667

Anonymous 05/01/26(Fri)15:31:34 No.108732667

>>108732658
https://civitai.com/models/1088883/armless-double-forequarter-amputee
this is so inadequate.

Anonymous
05/01/26(Fri)15:33:10 No.108732675

Anonymous 05/01/26(Fri)15:33:10 No.108732675

File: 2026-04-25-14h25m50s_seed(...).webm (3.99 MB, 480x832)

3.99 MB WEBM

>>108732595

Anonymous
05/01/26(Fri)15:36:02 No.108732698

Anonymous 05/01/26(Fri)15:36:02 No.108732698

>honglu pic
nice but thats actually the art splash from ring id

Anonymous
05/01/26(Fri)15:37:59 No.108732713

Anonymous 05/01/26(Fri)15:37:59 No.108732713

Limbus out of nowhere

Anonymous
05/01/26(Fri)15:39:00 No.108732722

Anonymous 05/01/26(Fri)15:39:00 No.108732722

>>108732675
If she were real, her jaw couldn't open that wide.

Anonymous
05/01/26(Fri)15:45:15 No.108732769

Anonymous 05/01/26(Fri)15:45:15 No.108732769

>>108732557
This is what I personally use with both Gemini 3.1 Pro and Gemini 3 Flash at least (via their direct APIS, and with high thinking on):
https://pastes.io/hdDaVoFt

It covers basically all possible use cases SFW or NSFW and pretty much takes the most possible advantage of their ability to comprehend images, with formatting / phrasing that usually is basically ready to use as-is.

Anonymous
05/01/26(Fri)15:47:08 No.108732786

Anonymous 05/01/26(Fri)15:47:08 No.108732786

>>108732440
There are a couple images in my dataset I want to keep but they are too small for 1MP training.
I guess I will just let training script's lanczos do its job. Their numbers are too few to hurt lora quality too much either way, and I would rather not teach AI synthetic nonsense data.
Upscale models disappoint me, they either change too much or too little.

Anonymous
05/01/26(Fri)15:48:09 No.108732796

Anonymous 05/01/26(Fri)15:48:09 No.108732796

>>108732675
I like how the arm magically appears when she hits the ground.

Anonymous
05/01/26(Fri)15:49:11 No.108732806

Anonymous 05/01/26(Fri)15:49:11 No.108732806

>>108732786
Why don't you just do multires bucketing, where it will still train images that are below the "base" res just at whatever their proper original low res bucket is?

Anonymous
05/01/26(Fri)15:50:26 No.108732816

Anonymous 05/01/26(Fri)15:50:26 No.108732816

>>108732796
yea thats because her stump was off screen for a moment. i have genned better ones but nsfw uhu

Anonymous
05/01/26(Fri)15:51:24 No.108732825

Anonymous 05/01/26(Fri)15:51:24 No.108732825

>>108732713
glory to limboos company

Anonymous
05/01/26(Fri)15:59:08 No.108732877

Anonymous 05/01/26(Fri)15:59:08 No.108732877

>>108732491
Prompts:
Enhance colors, shaders and lighting = character slop style face and everything
High res = slop lineart and slop details
Sharpness = the same
Klein defaults to a slop style. I would love if a controlnet exists for Klein or a denoise control to control how much I want to change of the gen

Anonymous
05/01/26(Fri)15:59:45 No.108732882

Anonymous 05/01/26(Fri)15:59:45 No.108732882

>>108732806
In sd-scripts there is do not upscale option which buckets low res images without upscaling, I think, but I don't mind images just below the threshold, say 950000 pixels, to get resized a bit and put into a proper bucket with the rest.
I guess I can edit the script, but is it worth it for like <1% of my dataset? This will also create tiny bucket below batch size probably.

Anonymous
05/01/26(Fri)16:00:48 No.108732892

Anonymous 05/01/26(Fri)16:00:48 No.108732892

>>108732507
Will try it. Thanks

Anonymous
05/01/26(Fri)16:06:29 No.108732925

Anonymous 05/01/26(Fri)16:06:29 No.108732925

>>108732877
You should be saying more like:

"significantly improve the XYZ of the WhateverStyle image 1 while keeping the ThingYouMayWantToPreserve and OtherThing and blah blah exactly the same as they are."

Anonymous
05/01/26(Fri)16:10:54 No.108732942

Anonymous 05/01/26(Fri)16:10:54 No.108732942

>>108732925
Really? But that's very gay. Aren't there more manly alternatives like denoise value or ControlNet? I don't want to speciify like a troon.

Anonymous
05/01/26(Fri)16:16:12 No.108732966

Anonymous 05/01/26(Fri)16:16:12 No.108732966

>>108732942
how else would it work?

Anonymous
05/01/26(Fri)16:24:08 No.108733005

Anonymous 05/01/26(Fri)16:24:08 No.108733005

File: FluxKlein9BDistilled_Outp(...).jpg (1.59 MB, 1792x2304)

1.59 MB JPG

Anonymous
05/01/26(Fri)16:35:48 No.108733068

Anonymous 05/01/26(Fri)16:35:48 No.108733068

>>108733005
It think it needs a >zit>sxdl refine

Anonymous
05/01/26(Fri)16:37:56 No.108733084

Anonymous 05/01/26(Fri)16:37:56 No.108733084

File: ss_20260501_153457.png (297 KB, 1412x883)

297 KB PNG

Chat, we're cookin'
Need to work on my prompt but this works nicely. Love me some ollama.

Anonymous
05/01/26(Fri)16:49:39 No.108733164

Anonymous 05/01/26(Fri)16:49:39 No.108733164

>>108727613
noob ass question: I just want to get started making nsfw gens, pictures and videos.... where tf do i start? it's overwhelming

Anonymous
05/01/26(Fri)16:50:53 No.108733174

Anonymous 05/01/26(Fri)16:50:53 No.108733174

>>108733164
2D or 3D?

Anonymous
05/01/26(Fri)17:00:27 No.108733235

Anonymous 05/01/26(Fri)17:00:27 No.108733235

>>108733174
3d

Anonymous
05/01/26(Fri)17:00:46 No.108733236

Anonymous 05/01/26(Fri)17:00:46 No.108733236

>>108733164
for images, pony and illustrious models in Forge UI.
Pinokio app with wan2gp for videos (wan 2.2) or editing pics (flux 2 klein).
thats just what i do. if you wanna make it complicated try comfyUI instead of these

Anonymous
05/01/26(Fri)17:10:54 No.108733279

Anonymous 05/01/26(Fri)17:10:54 No.108733279

>>108733236
i have no idea what any of that means but i'll check the OP for info and I'll figure it out. thanks for the tips. i hope its easy lol

Anonymous
05/01/26(Fri)17:11:57 No.108733284

Anonymous 05/01/26(Fri)17:11:57 No.108733284

File: file.png (74 KB, 588x708)

74 KB PNG

>>108733235
https://civitai.red/models - filter to sdxl 1.0 base model, the naughty bits will be better than flux, z-image, and others and probably gen faster too. if you have a 24 + GB gfx card then by all means go for something newer than 12 and 16 vram plebs struggle with.

>>108733236
i second using forge

Anonymous
05/01/26(Fri)17:14:36 No.108733296

Anonymous 05/01/26(Fri)17:14:36 No.108733296

>>108733284
ah, i have 12gb vram. i'll try some simple stuff out. if i need a new card then ill get one fuck it. idk what civitai is or sdxl but im guessing they are like models or something

Anonymous
05/01/26(Fri)17:18:04 No.108733305

Anonymous 05/01/26(Fri)17:18:04 No.108733305

>pony and illustrious models
What is this 2024? KEK you need to upgrade anon

Anonymous
05/01/26(Fri)17:27:21 No.108733353

Anonymous 05/01/26(Fri)17:27:21 No.108733353

>>108732769
>never do x
>dont do x
>dont do this
Putting those tokens in its context sure does make it more likely to produce those outputs, anon

Anonymous
05/01/26(Fri)17:28:24 No.108733358

Anonymous 05/01/26(Fri)17:28:24 No.108733358

>>108732966
A Flux Klein compendium of prompt synthax for basic image editing prompts.

Anonymous
05/01/26(Fri)17:30:04 No.108733368

Anonymous 05/01/26(Fri)17:30:04 No.108733368

>>108733279
sry it wasn't clear.
install forge UI (1 minute), then download an illustrious or pony base model on civitai.
put that in the models folder of your Forge UI installation.
then pick it in the drop down menu in Forge UI and start generating.

alternatively do whatever the cool kids recommend but i think that's a pretty good start

Anonymous
05/01/26(Fri)17:30:56 No.108733372

Anonymous 05/01/26(Fri)17:30:56 No.108733372

>>108733305
no one here still uses sdxl let alone posts about it so why would anon even recommend it? idgi is he purposely trolling?

Anonymous
05/01/26(Fri)17:31:35 No.108733376

Anonymous 05/01/26(Fri)17:31:35 No.108733376

File: 675Wvvdjwi.jpg (100 KB, 1146x648)

100 KB JPG

I build the llamacpp memeversion to quant image models but I still get shape errors after quanting. how do I unfuck this?

Anonymous
05/01/26(Fri)17:32:27 No.108733379

Anonymous 05/01/26(Fri)17:32:27 No.108733379

>>108733372
anon asked for nsfw
here is a blue board

Anonymous
05/01/26(Fri)17:32:33 No.108733381

Anonymous 05/01/26(Fri)17:32:33 No.108733381

>>108733368
i meant in the folder models>Stable-diffusion
to be more clear

Anonymous
05/01/26(Fri)17:33:44 No.108733388

Anonymous 05/01/26(Fri)17:33:44 No.108733388

>>108733379
what does that have to do with recommending an antiquated model tho

Anonymous
05/01/26(Fri)17:34:29 No.108733393

Anonymous 05/01/26(Fri)17:34:29 No.108733393

>>108733372
most of the pics posted here are crap and i generate better stuff with sdxl, that's why i recommended that.
then again i just tried to help, i don't know all the newer stuff but i'm not very impressed

Anonymous
05/01/26(Fri)17:37:10 No.108733406

Anonymous 05/01/26(Fri)17:37:10 No.108733406

hes so confident in his sdxl slop that he never posts it here lul

Anonymous
05/01/26(Fri)17:37:57 No.108733411

Anonymous 05/01/26(Fri)17:37:57 No.108733411

>>108733388
others who use z-image, flux, qwen, just about anything newer or different on other boards use sdxl models to fill in the nsfw details those models are terrible with.

Anonymous
05/01/26(Fri)17:40:48 No.108733430

Anonymous 05/01/26(Fri)17:40:48 No.108733430

>>108733376
>llamacpp memeversion
Que?
I know how you quant llms with default llama cpp but no idea what is specially needed for diffusion models.

Anonymous
05/01/26(Fri)17:41:30 No.108733435

Anonymous 05/01/26(Fri)17:41:30 No.108733435

>>108733411
unfortunate that so many suffer from skill issues with anima but thats usually how it goes with new model

Anonymous
05/01/26(Fri)17:47:13 No.108733469

Anonymous 05/01/26(Fri)17:47:13 No.108733469

>>108733430
https://github.com/city96/ComfyUI-GGUF/blob/auto_convert/tools/README.md

Anonymous
05/01/26(Fri)17:48:33 No.108733477

Anonymous 05/01/26(Fri)17:48:33 No.108733477

>>108733435
I've started experimenting with anima this week. And while I see the potential, it also gives horrible anatomy fails a lot, despite the way better composition prompting options.
It also can't gen my wAIfu until someone trains a lora (or I have enough time to learn this, so probably not anytime soon)

Anonymous
05/01/26(Fri)17:51:21 No.108733492

Anonymous 05/01/26(Fri)17:51:21 No.108733492

>>108733477
>I've started experimenting with anima this week
alright well then let the pros give the recommendations while you work on getting better

Anonymous
05/01/26(Fri)17:52:10 No.108733497

Anonymous 05/01/26(Fri)17:52:10 No.108733497

>>108733492
i look forward to seeing your nsfw anima gens

Anonymous
05/01/26(Fri)17:52:58 No.108733501

Anonymous 05/01/26(Fri)17:52:58 No.108733501

File: 18240707824087234698478.jpg (446 KB, 1024x768)

446 KB JPG

>>108733435
yeah i feel like anima is going to be the base layer in all of my gens for the foreseeable future. it does everything i want on the cartoon side and it's decent enough at realism with some finagling.

Anonymous
05/01/26(Fri)17:53:05 No.108733502

Anonymous 05/01/26(Fri)17:53:05 No.108733502

>>108733497
if youd spent some time lurking youd have already seen mine and others

Anonymous
05/01/26(Fri)17:56:13 No.108733515

Anonymous 05/01/26(Fri)17:56:13 No.108733515

>>108733492
pros wouldn't dismiss illustrious yet since it still may be the best tool for the job.

Anonymous
05/01/26(Fri)17:58:26 No.108733522

Anonymous 05/01/26(Fri)17:58:26 No.108733522

>>108733497
>nsfw anima gens
is this the new meme, anima can't do nsfw?

Anonymous
05/01/26(Fri)18:00:22 No.108733533

Anonymous 05/01/26(Fri)18:00:22 No.108733533

>>108731887
cope

Anonymous
05/01/26(Fri)18:01:06 No.108733537

Anonymous 05/01/26(Fri)18:01:06 No.108733537

Can you run full WAN 2.2 with RX 9070 XT and if you can, how slow is it?

Anonymous
05/01/26(Fri)18:02:48 No.108733546

Anonymous 05/01/26(Fri)18:02:48 No.108733546

>>108733533
>cope is when you realize the facts

Anonymous
05/01/26(Fri)18:03:42 No.108733553

Anonymous 05/01/26(Fri)18:03:42 No.108733553

>>108733515
why would anyone want to use a shitty 4ch VAE model that doesnt even know its lefts from its rights
>nb4 upscaling and regional prompting cope

Anonymous
05/01/26(Fri)18:04:00 No.108733555

Anonymous 05/01/26(Fri)18:04:00 No.108733555

>>108733164
new anon wants to get into nsfw
>>108733235
3D nsfw
>>108733236
>>108733284
good, helpful suggestions based on experience
>>108733372
resident shit stirrer chimes in
>>108733522
new meme is born ig

Anonymous
05/01/26(Fri)18:07:06 No.108733568

Anonymous 05/01/26(Fri)18:07:06 No.108733568

File: ComfyUI_Anima_00061_.png (1.33 MB, 1024x1024)

1.33 MB PNG

>>108731898
>More interested in an ACE-Step lora training writeup.

Made an XL 1.5 LoRA training rentry
https://rentry.co/s8fg8ber

If you do everything right then you will get high quality results similar to
>>108731845

I did not cover LoKR etc... because it's experimental and I never had good results with that anyways.

Anonymous
05/01/26(Fri)18:08:28 No.108733573

Anonymous 05/01/26(Fri)18:08:28 No.108733573

File: 1764604388052366.jpg (562 KB, 2048x3072)

562 KB JPG

Anonymous
05/01/26(Fri)18:09:01 No.108733577

Anonymous 05/01/26(Fri)18:09:01 No.108733577

>he can't get anima to do 3d
HOLY skill issue

Anonymous
05/01/26(Fri)18:09:20 No.108733578

Anonymous 05/01/26(Fri)18:09:20 No.108733578

>>108733553
because the only other choice for anime is anima, which is still in its infancy period.
But soon™

Anonymous
05/01/26(Fri)18:11:31 No.108733595

Anonymous 05/01/26(Fri)18:11:31 No.108733595

>>108733577
And yet you wasted electricity and bandwidth for this post.

Anonymous
05/01/26(Fri)18:12:07 No.108733600

Anonymous 05/01/26(Fri)18:12:07 No.108733600

>>108733236
>>108733284
>>108733368
>>108733381
thanks dude(s)
I'm just trying img2img gens with a prompt and oh man they are wildy retarded. i guess i'll have to learn what the fuck all these settings and models/loras are... idk just wanted to make my wife naked from some images lel

Anonymous
05/01/26(Fri)18:14:14 No.108733609

Anonymous 05/01/26(Fri)18:14:14 No.108733609

>>108733577
>3d
ewww

Anonymous
05/01/26(Fri)18:15:40 No.108733615

Anonymous 05/01/26(Fri)18:15:40 No.108733615

>>108733577
The classic "I can't figure out how to work with this model therefore it's the models fault". We saw the same thing when noob first released.

Anonymous
05/01/26(Fri)18:19:22 No.108733629

Anonymous 05/01/26(Fri)18:19:22 No.108733629

>>108733469
Dunno. It sounds simple enough, it's seems to be just adding an extra dimension to the tensor, that someone on the correct trannycord somewhere can tell you how it is fixed most likely.
README is 9 months old and probably not up to date.
There are some discussions when searching for "is:issue z-image", see anything useful?
Or just dump every relevant piece of code into Gemini API and it can most probably also unfuck this.
Sorry anon well past my bedtime.

Anonymous
05/01/26(Fri)18:20:48 No.108733634

Anonymous 05/01/26(Fri)18:20:48 No.108733634

>>108733568
Nice, thanks.
>10–25 high-quality songs
Feels like unless you want to slop loras by the dozens all the captioning automation isn't necessary.
>16 GB (minimum)
Fuck, are goofs no good? I got 16 but part is used by the monitor.

Anonymous
05/01/26(Fri)18:22:45 No.108733643

Anonymous 05/01/26(Fri)18:22:45 No.108733643

>>108727613
about time for my bi-annual check - does any of this shit run reasonably on AMD video cards yet?

I'm not going to spend the equivalent of a used car on a fucking nvidia GPU just to mess around with running AIgen shit locally

Anonymous
05/01/26(Fri)18:25:13 No.108733654

Anonymous 05/01/26(Fri)18:25:13 No.108733654

File: SD35Medium_Output_272627.png (2.97 MB, 1216x1600)

2.97 MB PNG

Trve diffusionists KNOW that SD 3.5 Medium prompted a very particular way at the top end of its resolution range is still more kinosovl than anything else since

Anonymous
05/01/26(Fri)18:26:13 No.108733658

Anonymous 05/01/26(Fri)18:26:13 No.108733658

>>108733643
>bi-annual check
that's a pretty long time for ai development speeds.
> does any of this shit run reasonably on AMD video cards yet?
Yes, but it also did 2 years ago.

Anonymous
05/01/26(Fri)18:26:23 No.108733660

Anonymous 05/01/26(Fri)18:26:23 No.108733660

File: Untitled.png (39 KB, 815x505)

39 KB PNG

>>108733372
Speak for yourself, non artist tech addict troon. Just because I don’t post anime in non anime generals doesn’t mean no one uses SDXL. Comfy, Krita, and tool automation allow me to make better and more decisions with SDXL than just prompting and waiting for results to appear on the screen for a dopamine spike, you worthless techtroon.

Anonymous
05/01/26(Fri)18:28:40 No.108733675

Anonymous 05/01/26(Fri)18:28:40 No.108733675

>>108733546
>It's better than Suno 5.5!
>Acktually stop comparing it to Suno, it's local.
Full damage control lol

Anonymous
05/01/26(Fri)18:29:03 No.108733677

Anonymous 05/01/26(Fri)18:29:03 No.108733677

>>108733658
>Yes, but it also did 2 years ago.
bullshit
I've tried to get something usable up and running every few months for the last few years on an ATI/AMD setup and shit always fucking breaks and keeps breaking until I say fuck it and give up

Anonymous
05/01/26(Fri)18:29:07 No.108733678

Anonymous 05/01/26(Fri)18:29:07 No.108733678

What's with the "ldg is not an anime thread" meme? Most of the posts here and in the faggollages are in fact anime desu?

Anonymous
05/01/26(Fri)18:30:33 No.108733687

Anonymous 05/01/26(Fri)18:30:33 No.108733687

File: ComfyUI_temp_dqupy_00001_.png (2.97 MB, 1152x1344)

2.97 MB PNG

>>108733629
Yeah I fixed it. Needed a fork of the gguf repo.
>>108733678
It's the 35stars doing his daily assault.

Anonymous
05/01/26(Fri)18:31:37 No.108733689

Anonymous 05/01/26(Fri)18:31:37 No.108733689

File: Untitled.jpg (434 KB, 1568x1568)

434 KB JPG

>>108733305
the slop is too good to give up... There are very few things people want that SDXL can't do

Anonymous
05/01/26(Fri)18:32:43 No.108733694

Anonymous 05/01/26(Fri)18:32:43 No.108733694

>>108733634
Honestly, I've got 24GB so I can't tell you exactly how much is required for lower ones. I grabbed the requirement straight from the official training guide (which is specific to the Gradio). Side-Step is different, so I'll update that part of the guide, but I'm currently not sure if this applies to XL
https://github.com/koda-dernet/Side-Step#vram-profiles

>Feels like unless you want to slop loras by the dozens all the captioning automation isn't necessary.
When I made the script I was working with 50+ songs on non-XL 1.5, but that's how I unfortunately found out about a bunch of bad practices like automatic lyrics being wrong, training Turbo instead of base, etc...

Anonymous
05/01/26(Fri)18:33:44 No.108733699

Anonymous 05/01/26(Fri)18:33:44 No.108733699

>>108733305
Anima is still not at SDXL level. Wake me up once it's out of the preview

Anonymous
05/01/26(Fri)18:34:22 No.108733704

Anonymous 05/01/26(Fri)18:34:22 No.108733704

>>108733677
I have it running on 2 and a half years old hardware, my first image on it was genned on August 11th, 2024, But that's just when I finally took the time to set it up, it was possible before that date as well.

Anonymous
05/01/26(Fri)18:35:03 No.108733707

Anonymous 05/01/26(Fri)18:35:03 No.108733707

>>108733678
/ldg/ is not anime and never will be. Nobody wants to post their gens in a fast moving, low effort slop general.

Anonymous
05/01/26(Fri)18:35:22 No.108733711

Anonymous 05/01/26(Fri)18:35:22 No.108733711

>>108733687
Cool.
>a fork of the gguf repo
Just incase I run into something similar myself, which fork?

Anonymous
05/01/26(Fri)18:35:24 No.108733712

Anonymous 05/01/26(Fri)18:35:24 No.108733712

All these XLslop defenders coming out of literally nowhere

Anonymous
05/01/26(Fri)18:37:23 No.108733721

Anonymous 05/01/26(Fri)18:37:23 No.108733721

>>108733711
>fork
I mean branch
https://github.com/city96/ComfyUI-GGUF/tree/auto_convert
the autoconvert one. But the issue was the main didn't have z-image support.

Anonymous
05/01/26(Fri)18:37:24 No.108733722

Anonymous 05/01/26(Fri)18:37:24 No.108733722

>>108733699
Did you know that most illust merges and tunes were based on 0.1 even though 1.0 and 1.1 were already out? Talk about a "preview" lol

Anonymous
05/01/26(Fri)18:37:52 No.108733724

Anonymous 05/01/26(Fri)18:37:52 No.108733724

>>108733353
No it doesn't, not in thinking mode. Did you last use an LLM in like 2022 or something?

Anonymous
05/01/26(Fri)18:40:25 No.108733734

Anonymous 05/01/26(Fri)18:40:25 No.108733734

>>108733722
Because they were made before 1.0 came out, and after it many shitmerges mixed that into themselves. Also, 1.0 and 1.1 were disappointing anyway.

Anonymous
05/01/26(Fri)18:41:20 No.108733738

Anonymous 05/01/26(Fri)18:41:20 No.108733738

>>108733734
You clearly were not around during that time. Best to keep quiet.

Anonymous
05/01/26(Fri)18:43:19 No.108733746

Anonymous 05/01/26(Fri)18:43:19 No.108733746

>>108733734
IDK why there's not many 2.0 based ones though, it was much much better than 0.1

Anonymous
05/01/26(Fri)18:43:31 No.108733748

Anonymous 05/01/26(Fri)18:43:31 No.108733748

>>108733734
>. Also, 1.0 and 1.1 were disappointing anyway.
And what makes you think we’re actually progressing, and that Anima Base 1.0 will be better than the others?

Anonymous
05/01/26(Fri)18:44:50 No.108733760

Anonymous 05/01/26(Fri)18:44:50 No.108733760

File: Screenshot_20260501_18442(...).jpg (961 KB, 1080x2400)

961 KB JPG

It's up

Anonymous
05/01/26(Fri)18:45:49 No.108733766

Anonymous 05/01/26(Fri)18:45:49 No.108733766

>advertising

Anonymous
05/01/26(Fri)18:45:58 No.108733767

Anonymous 05/01/26(Fri)18:45:58 No.108733767

>>108733760
I'm still waiting for a Dreamshaper ZTurbo finetune...

Anonymous
05/01/26(Fri)18:46:58 No.108733771

Anonymous 05/01/26(Fri)18:46:58 No.108733771

>>108733746
Because training checkpoints and LoRAs is a matter of chance, anon, it’s not a linear progression. People who trained on 0.1, if they repeat the same work on 2.0, will get different results. It’s all trial and error, and in this hobby you can end up somewhere good without really knowing how you got there or how to replicate the result.

Anonymous
05/01/26(Fri)18:46:59 No.108733772

Anonymous 05/01/26(Fri)18:46:59 No.108733772

>>108733738
I was around. 1.0 was long awaited, delayed, and when it finally came people were not so happy with it.

>>108733748
nta, but we'll have to see where it goes.
At least they didn't close up after 0.1 and keep releasing their progress, so there is hope it actually improves.

Anonymous
05/01/26(Fri)18:47:06 No.108733774

Anonymous 05/01/26(Fri)18:47:06 No.108733774

>>108733760
Do you think propensity for slop, as demonstrated in that gen, is a matter of IQ or your upbringing. I ask myself this question all the time.

Anonymous
05/01/26(Fri)18:47:18 No.108733777

Anonymous 05/01/26(Fri)18:47:18 No.108733777

>>108733760
It's up is an ldg meme anon

Anonymous
05/01/26(Fri)18:47:26 No.108733778

Anonymous 05/01/26(Fri)18:47:26 No.108733778

File: screenshot.1777675622.jpg (202 KB, 694x689)

202 KB JPG

spark chroma HD is 90% complete btw

Anonymous
05/01/26(Fri)18:49:45 No.108733788

Anonymous 05/01/26(Fri)18:49:45 No.108733788

>>108733760
Sars we will redeem!

Anonymous
05/01/26(Fri)18:52:30 No.108733802

Anonymous 05/01/26(Fri)18:52:30 No.108733802

>>108733778
>3 months to train on 2500 images

Anonymous
05/01/26(Fri)18:58:28 No.108733830

Anonymous 05/01/26(Fri)18:58:28 No.108733830

>>108730870
I need a fully jenny pron site now...

Anonymous
05/01/26(Fri)19:00:01 No.108733835

Anonymous 05/01/26(Fri)19:00:01 No.108733835

>>108731288
Send one, I'll show you what you can get

Anonymous
05/01/26(Fri)19:05:41 No.108733864

Anonymous 05/01/26(Fri)19:05:41 No.108733864

What's the SOTA for prompt part-of-speech comprehension? like understanding the difference between 'candle lights fire' and 'fire lights candle'?

Anonymous
05/01/26(Fri)19:05:56 No.108733866

Anonymous 05/01/26(Fri)19:05:56 No.108733866

>i can suck
why are localkeks so pathetic?
https://litter.catbox.moe/5b9fwjrhcpjlogei.mp4

Anonymous
05/01/26(Fri)19:12:07 No.108733898

Anonymous 05/01/26(Fri)19:12:07 No.108733898

>>108733866
stop trolling, go away

Anonymous
05/01/26(Fri)19:14:17 No.108733905

Anonymous 05/01/26(Fri)19:14:17 No.108733905

>>108733866
Kys

Anonymous
05/01/26(Fri)19:15:18 No.108733911

Anonymous 05/01/26(Fri)19:15:18 No.108733911

>>108733643
Depends on your GPU model, but the situation's improved for newer hardware. See if you can find yours in this table:
https://rocm.docs.amd.com/en/7.12.0-preview/compatibility/compatibility-matrix.html

ComfyUI also has AMD portable builds that you can try unzipping and running to see if things just werk. (Might still need the latest AMD Adrenalin drivers installed.)

Anonymous
05/01/26(Fri)19:15:25 No.108733912

Anonymous 05/01/26(Fri)19:15:25 No.108733912

Which company will give us a true base model in 2026 like how we got SD 1.5?

Anonymous
05/01/26(Fri)19:15:30 No.108733913

Anonymous 05/01/26(Fri)19:15:30 No.108733913

File: 1696443400513372.jpg (35 KB, 381x380)

35 KB JPG

>>108733866
you made this movie?

Anonymous
05/01/26(Fri)19:16:31 No.108733919

Anonymous 05/01/26(Fri)19:16:31 No.108733919

>>108733864
wat

Anonymous
05/01/26(Fri)19:17:24 No.108733926

Anonymous 05/01/26(Fri)19:17:24 No.108733926

>>108733802
I mean he's doing it on just one 4090

Anonymous
05/01/26(Fri)19:17:40 No.108733928

Anonymous 05/01/26(Fri)19:17:40 No.108733928

I may - or may not - get a supermodel gf at church because I nofapped for a week.

Anonymous
05/01/26(Fri)19:18:16 No.108733930

Anonymous 05/01/26(Fri)19:18:16 No.108733930

File: file.png (3.06 MB, 1248x1824)

3.06 MB PNG

Anonymous
05/01/26(Fri)19:19:04 No.108733933

Anonymous 05/01/26(Fri)19:19:04 No.108733933

>>108733919
part-of-speech: understand what in the sentence is doing the action (subject), what action is being done (verb), and what in the sentence the action is being done to (object)

Anonymous
05/01/26(Fri)19:19:35 No.108733935

Anonymous 05/01/26(Fri)19:19:35 No.108733935

>>108733930
she's not a 10.

Anonymous
05/01/26(Fri)19:21:06 No.108733949

Anonymous 05/01/26(Fri)19:21:06 No.108733949

File: a77f060f-57f0-4ac5-909b-8(...).png (2.59 MB, 2496x3648)

2.59 MB PNG

Why do people upload this as the cover image for their checkpoints?

Anonymous
05/01/26(Fri)19:22:15 No.108733953

Anonymous 05/01/26(Fri)19:22:15 No.108733953

>>108733930
You guys goon to that? How old are you, 60?

Anonymous
05/01/26(Fri)19:22:33 No.108733955

Anonymous 05/01/26(Fri)19:22:33 No.108733955

>>108733935
10s are overrated. your point?

Anonymous
05/01/26(Fri)19:23:24 No.108733960

Anonymous 05/01/26(Fri)19:23:24 No.108733960

>>108733953
idk man, do you start being a man after a certain age? or you're just gay forever?

Anonymous
05/01/26(Fri)19:23:33 No.108733962

Anonymous 05/01/26(Fri)19:23:33 No.108733962

>>108733930
this is slop, the head coloring and lighting is detached from the body

Anonymous
05/01/26(Fri)19:25:01 No.108733969

Anonymous 05/01/26(Fri)19:25:01 No.108733969

>>108733949
Because the average slopper believes this is the final form of every anime model.

Anonymous
05/01/26(Fri)19:25:48 No.108733979

Anonymous 05/01/26(Fri)19:25:48 No.108733979

File: 1767180195709811.jpg (127 KB, 1024x1024)

127 KB JPG

Where can I find a lora dataset to understand what a good dataset looks like and how it should be tagged? For an anime videogame character. Don't even know if I should rely on 2d fanart or take screenshots of the 2.5d ingame models? Tried with the former and while it picks up elements they're weak, rough and generated images look low quality. Guessing it's because of bad tagging and bad images used for training, but don't know what they should be looking like. Still learning the ropes

Anonymous
05/01/26(Fri)19:26:12 No.108733981

Anonymous 05/01/26(Fri)19:26:12 No.108733981

>>108733949
whats wrong with it

Anonymous
05/01/26(Fri)19:26:41 No.108733985

Anonymous 05/01/26(Fri)19:26:41 No.108733985

>>108733955
If you're gonna attract a 10, you gotta send off getting ten waves.

Anonymous
05/01/26(Fri)19:27:19 No.108733988

Anonymous 05/01/26(Fri)19:27:19 No.108733988

>>108733960
okay boomer

Anonymous
05/01/26(Fri)19:27:33 No.108733990

Anonymous 05/01/26(Fri)19:27:33 No.108733990

the average slopper has a micro penis.

Anonymous
05/01/26(Fri)19:28:45 No.108733996

Anonymous 05/01/26(Fri)19:28:45 No.108733996

Fresh

>>108733994
>>108733994
>>108733994
>>108733994

Fresh

Anonymous
05/01/26(Fri)19:31:00 No.108734009

Anonymous 05/01/26(Fri)19:31:00 No.108734009

>>108733985
I literally just said 10s are overrated. the most beautiful woman I ever saw was a 6, at best.

Anonymous
05/01/26(Fri)19:32:20 No.108734016

Anonymous 05/01/26(Fri)19:32:20 No.108734016

>>108733068
ZIT can't into 1792x2304 anon, and refining with a 4-channel VAE model like SDXL is retarded

Anonymous
05/01/26(Fri)19:36:28 No.108734039

Anonymous 05/01/26(Fri)19:36:28 No.108734039

>>108734016
i only refine the bodies for a little more detail that I feel Zit tends to struggle with. ZIT seems to struggle with bodies in general imo. It gives you perfect faces but the bodies of the people you're genning just aren't exactly what you're used to on that person sometimes.
Anyway. Zit is kinda hazy on the details and SXDL really shines with those details. And inpainting is an amazing tool because it can pick up whatever style is on the image you're inpainting.

Anonymous
05/01/26(Fri)19:38:28 No.108734054

Anonymous 05/01/26(Fri)19:38:28 No.108734054

>>108734039
that's a lot of cope anon

Anonymous
05/01/26(Fri)19:39:38 No.108734062

Anonymous 05/01/26(Fri)19:39:38 No.108734062

"cope" is Swahili for "artistry". What a beautiful language.

Anonymous
05/01/26(Fri)19:42:26 No.108734075

Anonymous 05/01/26(Fri)19:42:26 No.108734075

>>108734054
perhaps. but it's true cope. besides. I'd rather settle for a smaller image that looks good than a large image that looks like ai slop.

Anonymous
05/01/26(Fri)19:49:38 No.108734116

Anonymous 05/01/26(Fri)19:49:38 No.108734116

>>108733996
no anime collage no posting

Anonymous
05/01/26(Fri)20:02:15 No.108734189

Anonymous 05/01/26(Fri)20:02:15 No.108734189

>>108734016
It totally can

[Return] [Catalog] [Top]

Post a Reply

Return Catalog Top Refresh

[Advertise on 4chan]

Delete Post: [File Only] Style:

[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.