/g/ - Technology






Ingrate Contrarian Dipshittery Edition

Discussion of Free and Open Source Diffusion Models

Prev: >>108027322

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
https://github.com/Haoming02/sd-webui-forge-classic/issues/671 He noticed
>>
>>108028569
>>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
we know you are mentally ill but could you stop including your schizo nonsense in the OP?
>>
Blessed thread of frenship
>>
>oh, a new ldg thread, surely the schizo op listened and included my wrapper!
>ctrl-f ani
>THERE IT IS HAHAHAHA IM IN THE O-
>Anima
ACKKKKKKKKKKKKKKKKKKKKKKKKK
>>
File: ComfyUI_temp_sgxha_00016_.jpg (312 KB, 2560x1440)
Interesting, large resolutions seem to work.
>>
>>108028599
>ACKKKKKKKKKKKKKKKKKKKKKKKKK
thank god you are finally hanging yourself troon
>>
File: spam.png (1.17 MB, 1152x864)
>>108028599
>it's a comfy collab
>if i spam fud in the general comfy will go bankrupt
>if comfy goes bankrupt people will buy commercial licenses for my wrapper
>>
>>108028627
what exactly is working? it looks terrible
>>
>Z-image
What went wrong?
>>
File: kj7il.png (3.57 MB, 2048x1152)
Schizophrenia
>>
>score_9

Ew, proper sloppa tag.

>>108028635
Overall the characters aren't broken.
>>
>e621 is required for a kino anime model
sad. ill wait a bit longer then.
>>
File: anima_output_121414.png (1.14 MB, 832x1216)
>>
File: Anima_00088_.png (637 KB, 896x1152)
>>108028108
0.6B bf16 is 1.2gb, which is roughly the size of clip_l and clip_g combined.
Maybe it's meant to be a true SDXL successor or some shit.
>>
>>108028721
What? You can prompt anima just fine with danbooru tags or NLP.
>>
>>108028721
e621 has never made anime models better
>>
>>108028735
danbooru isn't enough. i loathe 90% of the posts on e621 but somehow it gives models sovl.
>>
File: 877844554877.jpg (512 KB, 2138x1209)
The model truly is impressive because it's more refined in certain aspects, but in my tests it still is behind Newbie (and I'm guessing NetaYume as well) at prompt following (though it's better with text and results are very aesthetic). Newbie still has a better understanding of raw artistic style control.

Here's the prompt I gave Anima (re-formatted from the Newbie XML version):

>masterpiece, best quality, score_9, year 2025, highres, safe, 1girl, 2b_(nier:automata), nier:automata, painterly, impressionism, brushstrokes, An artistic, monochrome black-and-white illustration of 2B from NieR:Automata sitting at a restaurant table. The style is a unique blend of detailed manga linework and painterly impressionism, featuring thick, visible brushstrokes and impasto textures. 2B has her signature short white hair and black headband, leaning one hand against her chin while her other hand gently pets a cat lounging on the table beside her. In the foreground, a wine bottle and a half-filled wine glass sit next to a plate of food. The background consists of blurred restaurant windows and shelves, rendered with soft, atmospheric strokes that contrast with the sharp, rhythmic hatching of the character's clothing and the cat's fur.
>>
>>108028721
just put only score_9 and you will instantly get your ponyv6 sepia sovl
>>
2026 and we still gotta do that "\" shit before a parenthesis if there's a parenthesis in the series or artist name... fuck...
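if you script your prompts the escaping is at least trivial. minimal sketch, assuming an a1111-style parser where bare parens mean attention weights (function name is made up):

def escape_tag(tag: str) -> str:
    # escape literal parens so "2b_(nier:automata)" isn't parsed as emphasis
    return tag.replace("(", r"\(").replace(")", r"\)")

print(escape_tag("2b_(nier:automata)"))  # -> 2b_\(nier:automata\)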
>>
>>108028735
yeah it handles either much like NetaYume, it's just mostly way more stable than NetaYume so far. And faster
>>
>>108028748
yeah in my 10 minutes of testing i couldn't really get it to do painterly stuff at all sadly.
>>
Soul.
>>
>>108028748
That kinda stuff looks better on newbie because newbie can only do that kinda stuff. Almost all gens on it look smeary, which just kinda works out in this scenario
>>
So uh... what megapixel size can I gen at without getting these gay ass
"W (188, 125) should be divisible by spatial_patch_size 2"
errors
>>
>>108028731
Even then, the 1.7B model seems like a better sweetspot
>>
>>108028772
writing this down for the retrain/re-license
>>
>>108028680
Outdated mediocre model. In 2024-2025 it would be good, but in 2026 it's meh.
>>
>>108028680
Chinese eyes too small to see BRC (Big Russell Cock) coming their way
>>
Damn it, I have to go to the gym.

I need to set up a way to gen on my pc through my phone.
>>
>>108028770
wat
>>
File: Anima_00089_.png (566 KB, 896x1152)
>>108028772
Obviously a larger te would have been better. 1.7B is also still vramlet friendly, can be run under 4GB vram fine.
But still, the 0.6B one is working surprisingly well for its size.
>>
>>108028791
it puts in perspective how outdated 12B T5-XXL is
>>
>>108028770
Anima? 0.5m to 1m, it's been mostly trained on 512 pixels so far
Any other model? 1m to 2m
>>
>>108028794
what the fuck even are those errors he's getting though, like where / how is that a thing
>>
>>108028770
125 is indeed not divisible by 2 anon
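(the latent is pixel dims / 8 from the VAE, and each latent dim then has to divide the DiT's spatial_patch_size, so with patch 2 keep pixel dims at multiples of 16. a minimal sketch of the snapping, assuming the usual 8x VAE factor; the names are mine:)

def snap(px: int, vae_factor: int = 8, patch: int = 2) -> int:
    # round a pixel dimension down to the nearest valid multiple
    m = vae_factor * patch
    return max(m, (px // m) * m)

print(snap(1000))  # 1000 -> 992, since 1000/8 = 125 and 125 isn't divisible by 2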
>>
>tdrussel doesn't have a discord server

I don't think you guys understand just how much we have won here. There won't be any sabotaging by duplicitous furries.

Every time some promising model comes around, furries rock up just to poison the dataset or training. I've heard of some even lobbying for synthetic (slop) datasets, or for the addition of low-quality niche fetish content. I swear they do this intentionally too.
>>
File: 4545564456451.jpg (355 KB, 2138x1209)
>>108028748
Actually that's cut off kek
https://files.catbox.moe/sl91it.png

Here's another one, Newbie understood this prompt a bit more since it needed to split it into panels
https://files.catbox.moe/7stv4h.png
>>
>>108028805
i don't think that's a thing, Chroma is actually LESS similar to Fluffyrock than I expected, overall. Anyone who actually fell for the "muh Chroma anime" or "muh Chroma realism" meme was always retarded
>>
>>108028770
What latent size are you using? I realized that going to 1532 or beyond causes the same error. If you want a good aspect ratio, decrease the lower number instead (e.g. 1280x720)
>>
>>108028808
I'll wait for Anima lora or any finetune, as the default art style feels too “AI” for my taste.
>>
>>108028819
>muh fynetoonz
every time
>>
bloatmodel fetishists getting BTFO hard lately
>>
>>108028786
Look into Tailscale, and if you want to self host your own server Headscale is what you want.
>>
>>108028826
acckkk
>>
File: 1743112217405131.png (1.25 MB, 832x1248)
>>
>omg it's small and fast!
dalits self-owning lately so much lol
>>
>>108028833
what's a dalit
>>
>>108028816
Chroma is different because the creator was too autistic and determined for others to sabotage him.

It was also different because up until that point, all large scale finetunes were ruined by stupid decisions to cater to furries, or models were shadowdropped.

Anyone remember when noob was training? It had so much potential until some dickhead added a differently captioned non quality-filtered dataset full of furry shit from e621 and pushed for the text encoder to be finetuned too. This caused the model to freak out so hard that it couldn't do basic human anatomy without filling the negative prompt with every furry tag imaginable.
>>
weird, my nano banana pro generates images at 4k in under 10 seconds. maybe something is wrong with your comfyui workflow?
>>
>>108028584
based and Haomingd
>>
Now that we have ramtorch and similar systems, local models should be ~32B parameters; there is literally no reason to use toy models. And yes I know some thirdie will think this post is insincere
>>
>>108028738
never used noobvpred?
>>
>>108028824
yes, waiting~
>>
>>108028848
i'll ram my torch up your bum if you catch my drift
>>
>another troonjak rentries thread
>another failed garbage troon thread
why are anons so fucking retarded? why do they keep falling for ran bullshit?
>>
File: i2iUpscale_00003_ (3).jpg (2.4 MB, 3840x2144)
>upscaling with zit
Kino.

>>108028828
Thanks.
>>
>>108028845
how many ((((((((komfybucks)))))))) does it cost tho
>>
>>108028855
they don't really care that the board has a shit skidmark of unfilled /ldg/ threads. it's rude
>>
File: Anima_00098_.png (1.04 MB, 832x1216)
>>108028819
Just use @artist tag.
You can give generic style descriptions as NLP too. It works
>>108028826
Kek so true.
>my ten thousand dollar RTX 6000 pro purchase will give me superior gens over those filthy vraml-AAAAAACCCCCCKKKKKK!
>>
>>108028848
genuinely, why not just train big models and distill them properly? z-base is dogshit compared to z-turbo but when z-turbo released everyone celebrated it as the greatest thing ever. so clearly distills give you faster speed and better quality, so why not just train something big and good like qwen 2512 and distill that
>>
File: 1505741832521.jpg (26 KB, 293x251)
>distills
>better quality
>>
>>108028860
anons really are gullible as fuck. they don't stand for what they believe in (if they even believe in anything). they want cozy breads but don't do anything to achieve them
>>
anon shouldve asked tdrussell why he chose to include pony style score tags, i fucking hate them so much
>>
>>108028863
cause you still have to load the model into memory. Qwen 2512 could be better also, it's not even on Qwen 3 for the TE
>>
>>108028870
oh god i'm scooooooring
8 uppppaahahaadsdsgsgf
>>
>>108028862
>@artist tag
aighty, imma try em, thanks!
>>
>>108028872
>cause you still have to load the model into memory
ok but did anyone who wasn't brown have an issue doing that with z-image?
>>
File: file.png (1.41 MB, 1024x1024)
>>108028837
This guy is a dalit, you may also hear him called a brahman sometimes but I assure you they are the same thing.
>>
>>108028870
the censor tags are bloated too and he says they are required sometimes since the model will produce non euclidean slop without them
>>
>>108028855
>>108028860
>>108028868
Do you think anon will fall for your posts this time
>>
>>108028879
no but qwen way chungser than Z
>>
>>108028837
Another word for indian
>>
File: 1744686988958542.png (1.84 MB, 832x1248)
>>
>>108028886
i can gen pretty decently on a 4090 with fp8, 24gb is the minimum anyway and a qwen turbo would be insane. lets leave those poorfags in the dust together, just you and me
>>
>>108028855
Kill ani
>>
File: 4545456465456.jpg (311 KB, 2138x1209)
>>108028808
The fridge POV results are much better than what I could get with Newbie on average though in terms of prompt adherence and overall polish
https://files.catbox.moe/ucp7u9.png

I found this model has a way better representation of objects and backgrounds in certain prompts (though not all).
>>
File: 1745142563531500.png (1.8 MB, 832x1248)
>>108028833
>>108028882
only jeets care to differentiate jeets
>>
>>108028896
isn't this post against united states law?
>>
>>108028903
where is the noob/ill comparison? nobody actually uses newbie or neta
>>
>>108028908
That's why miscasteing makes them seethe so much :)
>>
>>108028911
It is. Ran probably doesn't care because he's spamming threads from his proxies.
>>
>>108028794
>>108028817
so im just not able to gen at a higher res than 1mp? not even upscale and a second pass?
>>
>>108028915
how would you know what makes indians seethe? sus
>>
>>108028911
Probably
Kill ani
>>
>>108028922
damage control? big izzat loss!
>>
File: z-image_00017_.png (944 KB, 1024x1024)
i don't know what z-image has against reimu but my test booru finetune so far uniquely renders her this way lol
tbf 1:1 aspect is relatively underrepresented in the dataset so maybe that's the issue. i should try a tall aspect
z-image is godly though. training on a wide dataset of booru images, it produces images that look like real drawings. anima is too much of a sidegrade from SDXL
>>108028731
i spent a while last year attempting to distillation train Qwen3 0.6B on T5-xxl as an experiment. i have a lot of output chroma images with Qwen3 0.6B. i don't think it's capable of matching T5-xxl but it's absolutely capable of producing coherent semi-prompt-following images. I wish I'd tried Qwen3 4B because i'm pretty sure it would be able to work as an alternative... 1.7B maybe too, I tried it once and it was doing an okay job.
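the gist, if anyone wants to replicate: it's feature distillation, you regress the student's hidden states onto the frozen teacher's through a learned projection. a minimal sketch only — student, teacher, loader, and the widths are all placeholders, and it assumes captions padded to the same sequence length for both tokenizers:

import torch
import torch.nn as nn

proj = nn.Linear(1024, 4096)  # student hidden width -> T5-XXL width
opt = torch.optim.AdamW(
    list(student.parameters()) + list(proj.parameters()), lr=1e-4)

for ids_s, ids_t in loader:  # same caption tokenized for each model
    with torch.no_grad():
        target = teacher(ids_t).last_hidden_state  # frozen T5-XXL features
    pred = proj(student(ids_s).last_hidden_state)
    loss = nn.functional.mse_loss(pred, target)  # pull student toward teacher
    opt.zero_grad(); loss.backward(); opt.step()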
>>
File: 1765311608330200.png (1.75 MB, 832x1248)
>>
File: z-image_00208_.png (1.79 MB, 944x1280)
>>
File: q3_20250927164457.png (567 KB, 512x512)
>>108028937 (Me)
Actually I'm going to post some of these bad Qwen 0.6B + Chroma gens
>>
File: file.png (747 KB, 1024x1024)
>>108028908 >>108028915 >>108028922 >>108028926
Call a jeet Pakistani sometime they'll freak out.
>>
File: 1752437772713222.png (772 KB, 832x1248)
>>108028937
>anima is too much of a sidegrade from SDXL
i think it'll come down to what happens first: tongyi releasing the turbo sauce / anon figuring it out OR anima being "finished". the comparatively slower speed of base seems to be a determinant for many. and "sdxl anime but with a 16ch vae" is what anons wanted for a long time.
>>
What's the token limit of the 0.6B qwen? Still 8k or smaller?
>>
File: 1766883730315627.png (940 KB, 1024x1024)
>>108028808
Seems to work if you start the prompt with "it's a 2 panel manga"
>>
File: s_mask_2_2e4.png (384 KB, 512x512)
this is the power of chroma
>>
File: Anima_00099_.png (1.24 MB, 832x1216)
This model sometimes decides that there has to be a second person or another viewing angle in your prompt and really fights you for it lol. Need to experiment more for a consistent workaround.
>>108028937
>i spent a while last year attempting to distillation train Qwen3 0.6B on T5-xxl as an experiment.
Interesting. I know this is a long shot but would you mind sharing training code? I am curious about how that's done.
>i don't think it's capable of matching T5-xxl
So far I haven't run into anything it consistently can't do that t5 excelled at. There are certain "world knowledge" limitations with small models, but it's also so much newer and better "per weight", for lack of a better word.
>>
>>108028632
literally why are you so obsessed? is it impossible for different people to be angry with "comfy" org? is it some retarded shill tactic to call anyone against "comfy" ani?
>>
>>108028963
>This model sometimes decides that there has to be a second person or another viewing angle in your prompt and really fights you for it lol. Need to experiment more for a consistent workaround.
The model inherited a lot of sdxl-style nonsense.
>>
>>108028964
Yes. That's the typical cumfart damage control. Before that it was "voldy shills".
>>
File: 5644564564565.jpg (543 KB, 2138x1209)
>>108028903
It does quite well with giantess but it's one area where Newb still edges it out a bit

https://files.catbox.moe/gr45j8.png

Another comparison (more difficult so both are unpolished, but Newbie has the basic composition a bit more correct, just with objects/streets/people less refined than Anima)
https://files.catbox.moe/mdk00i.jpg
https://files.catbox.moe/o765mf.png

These are both models with great potential once they're both fully trained. Room for improvement? Anima could use a bit more painterly style knowledge to let the DiT shine.
>>
>>108028956
You are also forgetting the modern text encoder that can handle complex natural language, and the rectified flow that anima brings over sdxl.
>>
File: 1768852714319145.jpg (1.62 MB, 2048x2048)
>comfy is getting objectively more bloated, unstable and insecure with every new release
>more and more new saascuck shit added all the time, new api shit, new comfy coin bullshit
>You're definitely shilling some other interface if you're against all of that though!!
>>
>>108028937
>z-image is godly though. training on a wide dataset of booru images, it produces images that look like real drawings
Unless there is a turbo I really don't care, waiting that long for images is a big killer for me. No point in this stuff anymore at that point
>>
File: ComfyUI_00578_.png (763 KB, 1024x1024)
>>108028960
Nice, but she should be interacting with the cat on the left panel directly, should be a bit more like this (first Newbie gen I got for the prompt)
>>
>>108028987
but we certainly don't need more efficient uis. you're crazy to suggest that
>>
>>108028956
Yeah but those of us finetuning z-image base are going to do step distillation one way or another. AFAIK the thing missing from our knowledge is the RL model they used. And i'll be honest, I think base is still better than any other local model and definitely any local model with a permissive license. But we'll find a way to match the RL model. I think the architecture itself is the important part. Which is why finetuning still produces real-looking images
Re: the image - this is an image you get when you start distillation training the Qwen3 0.6B text encoder for Chroma
>>
File: z-image_00211_.png (1.4 MB, 944x1280)
>>
>>108028993
relax. cumfart is all there is and will be forever
>>
>>108028848
> ramtorch
Does it work on AMD or Intel cards?
>>
>>108028993
Where did UI come into this? I am sure he will add support for anima on forge neo if that's your thing
>>
>>108028855
What can we do?
>>
File: 1753352763518810.png (1.72 MB, 832x1248)
>>
>>108028998
It's just a memory management strategy, it shouldn't require any special hardware
>>
>>108028998
why would you care about poothon garbage
>>
File: t4_12000_t_mask.png (491 KB, 512x512)
Here's the Qwen 0.6B Chroma image that proves you can smash precise text recognition into a tiny model
>>
>>108028999
all python uis will be memory hogs forever by definition
>>
File: 1764576137578245.png (1.86 MB, 832x1248)
>>
>>108028999
His bot broke and replied to the wrong post.
>>
alright I trained f2k and zim on the same dataset of a person with very similar settings and while both produce solid results i think f2k takes the cake simply because you can use the lora on the distill without any visible degradation, while zit completely shits the bed with a zim character lora.
also f2k seems to properly learn details like moles while zim completely ignores them. i like that zim can use negative prompts but i guess you can also do that with f2k with some workaround
zim takes about a minute for 30 steps on my 5070ti while the f2k distill took 20 seconds for 8 steps (both res_2s)
>>
>>108029014
>you're a bot if you don't like comfyui
Maybe you are a bot?
>>
>>108029018
Looking at the image comparison you posted makes me agree with you wholeheartedly
>>
File: 1744649646463269.png (1.67 MB, 832x1248)
>>
File: 1755959921656426.png (1.44 MB, 832x1248)
>>
>>108028998
>>108029006
No, it's not
> A memory-efficient linear layer implementation that keeps parameters on CPU
> and transfers them to GPU on-demand using asynchronous CUDA streams.
>
> This approach interleave compute and data transfer, making it useful for:
> - Very large models that don't fit in GPU memory
> - Scenarios where GPU memory is limited but CPU memory is abundant
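Which in torch terms is roughly this (my own sketch of the trick, not ramtorch's actual code): params sit in pinned RAM, and a side CUDA stream copies the next layer's weights up while the default stream computes the current one.

import torch
import torch.nn as nn

class OffloadedLinear(nn.Module):
    def __init__(self, in_f: int, out_f: int):
        super().__init__()
        # weight lives in page-locked RAM so async H2D copies are possible
        self.w = torch.randn(out_f, in_f, dtype=torch.float16).pin_memory()
        self.copy_stream = torch.cuda.Stream()
        self.w_gpu = None

    def prefetch(self):
        # called while the *previous* layer is still computing
        with torch.cuda.stream(self.copy_stream):
            self.w_gpu = self.w.to("cuda", non_blocking=True)

    def forward(self, x):
        if self.w_gpu is None:
            self.prefetch()
        # make the compute stream wait until the upload has finished
        torch.cuda.current_stream().wait_stream(self.copy_stream)
        return x @ self.w_gpu.t()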

>>108029007
Because AI, especially training, is Python.
>>
>>108028913
>where is the noob/ill comparison
I didn't make one because I don't test prompt following on noob/ill but it'll probably be utter humiliation because CLIP doesn't follow prompts as well as any of the newer text encoders.
>>
>>108028950
>>108028962
>>108028994
>>108029009
Not gonna spam you guys anymore but anyway, discarding the whole 0.6B Qwen Chroma thing for a second, all the z-image nonbelievers are going to feel quite stupid soon. A booru model trained on this is going to be insane. It's literally going to be print-your-own-booru-image. Luddos will scream & cry trying to identify the image as synthetic
>>
File: 1750459231438837.png (1.72 MB, 832x1248)
yeah this killed illust
>>
File: jgt6.png (2.14 MB, 2048x2048)
>>
>>108029045
You just described a memory management strategy. It's literally just layer offloading but in a way that doesn't slow down inference speed
>>
>>108029075
btw can i just say, thank fuck for musubi tuner and its dev. every other training project is shit
>>
>>108028828
>not just setting up openvpn or wireguard
>implement PROPRIETARY solution instead
lol, bunch of literal retards
>>
>>108029066
>Luddos will scream & cry trying to identify the image as synthetic
I don't get this kind of childish spite desu, why do people keep trying to shove AI art into the face of people who don't wanna see it, this only creates hostility, I don't pretend my gens are hand drawn, what do these people get out of it
>>
File: 1753878176446491.png (2.45 MB, 3138x768)
>>108029018
>>108029030
i'm still experimenting with zit because it's really stubborn. made some improvements, so now it doesn't *completely* shit the bed but it's still pretty uncanny to me
>>
File: ihu7.png (1.82 MB, 2048x2048)
>>
File: 1762954427053381.png (40 KB, 225x225)
>>108028986
>>
>>108029018
klein has a turbo lora available so that you can control how much distilled is in the base gen. Why isn't there an equivalent for z-image?
>>
How could i hate api nodes when theyre just so damn good?
>>
>>108029085
Tailscale is just Wireguard without having to open ports or fuck with config files, the client is open source and if you want to self host Headscale is an open source server (you will have to open ports for that or faff around to run it over a tor hidden service). Also the Wireguard data never goes through their servers they just coordinate clients without opening ports/behind cgnat.
You clearly have no clue what you're talking about. https://tailscale.com/blog/how-tailscale-works go have a read, if you're literate and capable of silencing your schizophrenia demons long enough.
>>
>>108029018
Yeah, that's Flux.2's VAE in action for those closeup details, plus I'm sure BFL did extra tuning to ensure it's not slopped and the colors are accurate this time around, unlike Z where colors feel washed out.
>>
>>108029149
you're still relying on a relay/coordination server outside of your control to do the discovery phase, and that's 100% closed source
tailscale is NOT local, fuck off shill
>>
>>108029120
>gretafag
Oh you should've said that initially so I could more easily disregard your opinion
>>
>>108029136
Just wait a few days until someone inevitably makes it or create the diff lora on your own.
It's not difficult.
>>
>>108029173
eh, i just needed a character to experiment with and its not like we need yet another sydney sweeney or scarlett johansson lora so whatever
>>
Which model can cope better with shitty datasets? I tried some old datasets that were small and/or blurry on ZIT and the resulting loras were pretty impressive given what it had to work with. How do ZIM and the kleins fare in this regard?
>>
>>108028850
They are just trolling/retarded, anyone with even a remote speck of understanding of this garbage knows that including e621 would colossally improve the model's capabilities, not only because e621 tagging is far superior but because it's just more datapoints; even r34 is a good source in this regard. But when you say that, the only thing a retard sees is "furry = bad". That being said, including e621 would also greatly increase training costs, which is probably why they never include it.
>>
File: file.png (422 KB, 1024x768)
>>108029161
I self host my Headscale server and mentioned Anon could do the same, it absolutely is local and 100% open source and enables me and potentially him to access local and open source diffusion models running on our own hardware while outside of our home networks.
>>
File: 4830789756.png (273 KB, 768x768)
>>108029205
I got banned for racism once for posting a image like that
>>
File: ComfyUI_09370.png (3.03 MB, 1440x2160)
>>108029120
My ZIM LoRa on ZIT needs 1.75 strength before I'd consider it usable, and 2.0 performs better when the face gets smaller or has to change direction. The overall performance though is a lot better than my ZIT-trained one (fewer occurrences of body horror and other little things a LoRa can introduce).

Z-Image using Flux.1's VAE also means you can use one of the EQ-VAEs out there for even better quality (those helped out big time on Flux). I haven't trained one with EQ yet, but that's on my to-do list.
>>
>>108029214
Some jannies also don't like Hitler gens
>>
>>108029204
I would say that a small model might degrade from a massive influx of furry slop, when there is a very limited amount of weights to hold data and you are going to get contention at some point.
It's possible that noob for example failed to learn some characters and styles it otherwise could have learned due to furry data. Though again most likely superior tagging and additional data helped more than hurt it.
Furshit is fine as long as it is tagged out clearly.
BUT I don't mind models not touching it either in the age of NLP. Anima can survive without e621 tags.
>>
>seaart pruned the loras of my waifu
sigh, is there anywhere to find loras of actresses/celebrities these days?
>>
>>108029232
Trained by yourself on your own hard drive. It's the only way.
>>
>>108029232
Probably on Chinese sites since cheeto hitler made deepfakes illegal in burger land.
>>
>>108029249
>>108029250
that's unfortunate. thanks fellas
>>
>>108029232
depends on the base model. check out /r/ realistic parody OP
i think there is a lot of zit
>>
>>108029120
>zit would just produce sameface
Time to get zim up for some hot Greta gooning
>>
>>108029197
Use edit model to unblur the images first.
>>
File: Anima_00114_.png (763 KB, 832x1216)
>>108029214
Whatever helps them feel like a woman, amirite?
>>108029232
huggingface but they get periodically jannied.
Is civarchive still around? Maybe there too.
But training your own is the best.
Most lora trainers are jeets who can't train anything even remotely passable.
>>
>>108029197
>How do ZIM and the kleins fare in this regard?
absolutely unforgiving imo
i ((enhanced)) my datasets with klein 9b
something along the lines of "remove artifacts and make high quality"
just be careful it doesnt change too much
>>
>>108029305
kek also thats a nice style anon
>>
>>108029250
What do chinks use? Does modelscope have loras? Tensorart and Seaart are both just as dogshit as civit.
>>
>>108029314
Something unpronounceable I'm sure.
>>
>>108029298
Yeah I know but I'm just interested, tells you something about a model's learning ability, and sometimes unblurring changes the image too much or introduces unwanted things etc
>>
File: 1747105793653478.png (2.41 MB, 1088x1632)
>>
https://www.modelscope.cn/models?name=z-image&page=1&tabKey=other&tags=LoRA
Oh they have. But the filtering is fucking ass
>>
>>108029075
That's from the ramtorch sources, and that's an implementation of the strategy using cuda.
>>
>>108029323
Catbox please
>>
>>108029225
Danbooru is nowhere near close enough data to saturate that model, doubt they have an anime dataset large enough to, it's just much much cheaper (because money is always the constraint with this stuff) since it's less stuff to learn.
>>
File: 1767669869294236.png (2.46 MB, 1088x1632)
>>108029343
still learning the model https://files.catbox.moe/n5vvfd.png
>>
File: 1753954352906631.png (2.92 MB, 1088x1632)
>>
>>108029204
Actual bullshit, there is no evidence e621 data made it good and way more evidence that it made it worse. Case in point: the vpred Illustrious model that was never released was way better than noob vpred, as was NAIV3, which is still the best XL anime model. Neither has a lick of e621 in it.
>>
>>108029361
Thanks.
Interesting to see so much style stuff in the negatives. Also having TT in the negatives while prompting raven is certainly a choice.
And ughh can't say I am a fan of trannies neither but I don't think schizo prompts about trans stuff in the negatives are helpful.
Just add futanari if you are afraid of the model accidentally genning dick girls.
>>
File: file.png (132 KB, 1385x1203)
>>108028964
>is it impossible for different people to be angry with "comfy" org?
>>
>>108029393
>the vpred Illustrious model that was never released was way better than noob vpred
Yes because the only difference between the two was the dataset and not any hyperparameter amirite?
>as was NAIV3 which is still the best XL anime model
Now that's a real argument, but again, how do we know NAIV3 wouldn't have been better if it included e621? Because I'm pretty sure the difference between it and the other models is not merely dataset.
>>
>>108029306
I assume 9b is better at preserving things like facial features or overall coherence than 4b right? I upscaled things with 4b and it often introduced a shiny slopped look to skin, is 9b better here?
>>
File: 642.png (1.02 MB, 768x1344)
gm ai sisters
>>
File: 1768643812126900.png (2.93 MB, 1088x1632)
>>108029399
>Also having TT in the negatives while prompting raven is certainly a choice.
its an old trick from nai/illust/noob to rid characters of their canon style. im not editing any of these old noob prompts save for the primer
>I don't think schizo prompts about trans stuff in the negatives are helpful.
they are basically required for naked noob which is where these prompts are from
>>
>>108029407
Illustrious creator himself blamed the e621 dataset for why noob was worse.
For NAI they did train an e621 model later on, which was not that well liked if I remember. There is no way they didn't try what you said; they must have seen the results weren't worth it, which is why they made it separate.
>>
>>108029425
Yeah that makes sense.
I had a long schizo list of negatives for noob too. (Although I switched to a more concise one later on)
>>
>>108029189
find some hotty
it's like the anons endlessly experimenting with epstein or trump, at least have the grace of playing with nice girls instead of this genuinely mind rotting stuff
>>
>>108029430
makes sense, hopes up that cumfy's dataset can match any resemblance of a quality model
>>
>Multiple threads-long meltdown over an anime booru tag prompting model
Was it worth it?
>>
>>108029430
>Illustrious creator himself blamed e621 dataset on why noob was worse.
The illust creator was seething at Noob too? I should've guessed fucking keeeekkkkkkkkkk
>>
>>108029328
>nothing nsfw
I guess they block it all? Unless it's behind login
>>
>>108029446
Don't they have their own civit clone? NSFW will probably be buried like on HF. Only for people in the know.
>>
Anons playing with video models, is ltx2 finally getting rid of its awful sound quality and random unmoving photo gens?
And is nsfw finally working without looking ridiculously bad?
>>
>>108029413
>is 9b better here?
honestly i felt it was pretty good, yeah. just try it yourself. sometimes it introduced wrinkles that were not there before, but you can always reroll
>>
>>108029461
No and not really. It will be some time before it catches up to 2.2, and I expect wan 3 will be a thing by then.
>>
File: juri han anima 2.png (628 KB, 832x1216)
Haven't seen this many watermark/text hallucinations in the bottom for a really long while. And text, watermark are in the negatives too. (Though it is a lot more coherent than when sdxl hallucinates them, you can really feel the qwen vae.)
>>
>>108029461
Sorta kinda, it will honestly need that vae+model update they have planned to really fix everything, but they released a new sampler and some settings you can fiddle around with to improve quality. Model has potential but consider this one a beta and just fuck around with it
>>
>>108029489
Is this with watermarks in negatives?
>>
ltx-2 is so fun (default i2v template in comfy, set frames to 240 or 10s)

https://files.catbox.moe/6fivxj.mp4
>>
>>108029535
"with an american style accent":

https://files.catbox.moe/4h0cyf.mp4
>>
>>108029535
How come nobody has taken this whole thing to refer to him as Dorito pope again?
>>
>>108029529
Yes but I think it misunderstood the prompt altogether. The "You are an AI assistant..." stuff started to appear on other seeds at the top. It's kinda wild how it interprets the prompt sometimes.
>>
>>108029551
>You are an AI assistant
Nigga why are you using this shit. It's a text encoder.
>>
>>108029551
Anima doesn't need that part, only the lumina models do
>>
>flash attention + torch compile + klein 9b
why doesn't it work?
>>
>>108029489
you can have a 2nd pass with klein and something like "Remove watermarks and text"
>>
>>108029549
everyone thinks he is shill man now.

also there is a ltx2 video extend workflow, which can clone voices or movements, pretty neat imo

https://huggingface.co/RuneXX/LTX-2-Workflows/tree/main
>>
>>108029562
did you remember to set the CUDA_ENABLE_SPEEDHACKS=1 environmental variable?
>>
should i directly train on multiple resolutions or start with 512 for a couple thousand steps and then move to something higher? assuming the use of buckets of course
>>
>>108029555
Not out of necessity, but to improve quality:
You are an assistant designed to produce aesthetically pleasing, high quality images based on user prompts. <Prompt Start>
>>108029557
Not needed, no, but from limited testing (admittedly needs more) it seemed to improve quality to me. This is the first problem I have run into with it, after more than a hundred gens. Might still be worth keeping even if it causes problems with prompts very occasionally.
>>108029568
I mean I can also just cut images too.
But good idea if watermark is too big and destructive to remove by cutting.
>>
>>108029569
lmao

it worked well with the geoff clip, video extend workflow: do skip first frames setting to pick a good start spot, then frame load cap 49 (or whatever) to pick a good end point.

https://files.catbox.moe/d5o0y8.mp4
>>
>>108029415
>zomg! faggotry guys!!111
>>
>>108029578
Just realized how many extra its are there lol.
>>
>>108029578
>it seemed to improve quality to me
Because it just crunched the numbers. It's the same as the padding slop for ZiT. If you want system prompts, you'd need to use a direct LLM loader that supports it and then turn the output into conditioning.
>>
File: 9187229.png (1.36 MB, 1024x1024)
>>108029562
I asked Klein and he said picrel
>>
>>108029562
FA doesn't work with imagegen. Torch compile is pure ass and you'll have to recompile anytime you change prompt or add lora or change lora strength, wasting any time you could've saved anyway. Use --fast fp16_accumulation if you want speedhacks.
>>
>>108029580
https://huggingface.co/RuneXX/LTX-2-Workflows/blob/main/LTX-2%20-%20V2V%20(extend%20any%20video).json

this is a very good workflow, the other one works but this is more refined imo
>>
File: 9731595259.png (1.67 MB, 880x1536)
>>108029581
Really glad you liked it anon
>>
>>108029651
>xhe only posts gens
>xhe doesn't actually sell the figurines that'd sell very well
>>
>>108029647
for example, doritos pope with the workflow (if you use distilled ltx bypass the lora below the model loader)

10s extend of the first part: it also has nodes for smoothing out the audio.

https://files.catbox.moe/kmsa2l.mp4
>>
>>108029578
>>108029591
Actually despite removing it the problem persisted in some other gens.
So it seems to be something else.
>>
File: Výstřižek.png (43 KB, 1058x589)
Tried a ZiB lora (on fp8)
Barely any change in output pic. Do I let it cook for more epochs or bump up the LR even more?
>>
144 seconds to extend/make a 10s clip on a 4080, ltx2 q8 distilled. this model is meme magic.

https://files.catbox.moe/bljr06.mp4
>>
>>108029699
i had to train twice as much compared to my character lora i trained on zit
>>
GEOFF IS BREAKING THE CONDITIONING

https://files.catbox.moe/wu3xj3.mp4
>>
>>108029742
More epochs or LR?
>>
>>108029748
I've doubled the epochs and it was still only able to achieve ~70% likeness, but that also might be my sampling settings. Gonna try tripling it and see what happens
>>
NovaAnimeXL 15 description
>I think this will be the last version before Z-Image version drops
NovaAnimeXL 16 description
>Z-Image Base came out last month so I guess I'll switch the base model into it after someone create Z-Image Illustrious model. Nobody knows whether this is the last version or not but I'm looking forward for newer structures
Why did I expect anything from that retard
>>
>>108029443
Anime booru tag prompting models are literally the only thing of value to come out of AI
>>
>>108029762
skill issue
>>
how does z have more realistic gens than flux 2? is it an architectural difference?
>>
>>108029773
Flux always was shit however Zit is very rigid. Unfortunately Zib is not a replacement so that's the trade off you have. If you want more flexibility you are generally forced to use Flux.
>>
>>108029773
it all boils down to the dataset they used for rlhf https://arxiv.org/abs/2512.11883
>>
>>108029773
It doesn't.
>>
>>108029763
I can see what you're prompting right in this thread buddy and I don't see a lot of skill
>>
>>108029549
>this whole thing
you mean the thing that came out a year and a half ago?
>>
File: ComfyUI_09412.png (3.25 MB, 1440x2160)
>>108029773
The excessive quantization to get it to run on consumer hardware certainly isn't helping.
>>
>>108029651
I didn't one bit. I think you should have your own board.
>>
Is there a way to leverage VRAM from 2 NVIDIA GPUs for image generation in SDXL-like models (Illustrious/Pony for example) on WebUI Forge?

I got a 3060 Ti and a 5070 at hand and I'm wondering if I could use one for something like inference only and the other to load weights separately.
>>
>>108029824
How much quantization? A 24GB or 32GB GPU isn't enough?
>>
>>108029834
absolutely not
>>
>>108029699
DO NOT USE WARMUP STEPS WITH PRODIGY
Prodigy has its own warmup-like logic. It messes with it.
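if you're on the prodigyopt package that means lr stays at 1.0 and scheduler warmup stays at 0; there's a safeguard_warmup flag for the internal D-estimate instead. rough sketch, model is a placeholder:

from prodigyopt import Prodigy

# Prodigy estimates its own step size, so lr is left at 1.0 and the
# scheduler gets zero warmup steps; safeguard_warmup guards the D-estimate
opt = Prodigy(model.parameters(), lr=1.0, weight_decay=0.01,
              safeguard_warmup=True, use_bias_correction=True)
# pair with a constant or cosine schedule, warmup_steps=0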
>>
>>108029839
>>
>>108029834
Nope. Extremely difficult to implement under normal circumstances and basically impossible with your dinky little forge UI.
>>
>>108029839
no you can run full flux2 with a 3090 (if you have a shit ton of ram (like me ;)))
>>
>>108029699
>01/02/2026
>thinks epochs convey any meaningful information
oh well good luck on your trials and errors
>>
>>108029854
Damn. I passed on buying a 3090 to pair with my 96GB of RAM when I had the chance for a reasonable price, so I have been condemned to use a 12GB GPU as punishment.
>>
https://files.catbox.moe/x7u325.mp4
>>
>>108029856
It was 13 steps per epoch. 60 pic dataset.
>>
>>108029842
Good to know.
>>
>>108029863
>the random guy going "yeah"
kek
>>
>>108028791
I don't really know how these text encoder + image diffusion models work, but is there a reason the text part couldn't be done in RAM? 30b LLMs offer usable performance when most of it is in RAM. And here text is only a part of what needs to be done.
>>
>>108029848
12GB cuckbros... we wasted our savings...
>>
>>108029861
>I have been condemned to use a 12GB GPU as punishment.
16GB is available for under 500 on the used market.
>>
I made this thread
>>
File: Anima_00148_.png (1.17 MB, 832x1216)
Not a /u/ fag but kino prompt there https://civitai.com/images/119355904
Ran it on anima instead.
>>108029888
You can do that?
There is some node, MultiGPU I think, that lets you run shit on CPU. It will be slower on system memory though, obviously.
Comfy will typically unload the text encoder from VRAM before running the unet anyway, so there is not too much point in that.
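minimal transformers sketch of the idea, model name just an example:

import torch
from transformers import AutoModel, AutoTokenizer

tok = AutoTokenizer.from_pretrained("Qwen/Qwen3-0.6B")
te = AutoModel.from_pretrained("Qwen/Qwen3-0.6B",
                               torch_dtype=torch.float32)  # stays on CPU

with torch.no_grad():
    ids = tok("1girl, absurdres", return_tensors="pt")
    emb = te(**ids).last_hidden_state  # computed entirely in RAM

cond = emb.to("cuda")  # only this small tensor ever touches VRAM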
>>
>>108029849
What about asyncdiff and stuff like that?
>>
>>108029912
make me out
>>
>>108029888
>is there a reason the text part couldn't be done in RAM?
isnt that people do with colossal models? cpu for text embedding and let the unet on vram?
>>
>>108029913
anima is such a slop
>>
>>108029840
Not even model sharing?
>>
>>108029808
but it is
>>
where is the self-refining video comfyui implementation?

get to it nerds

chop chop
>>
>>108029824
flux2 is okay once you really nail down what you want, and describe it in absurdly redundant detail
I think its problem is that it's too general, it has a bajillion parameters, but nobody gives a rats ass about 90% of them
klein is what happens when you strip the fat and focus on the one thing anybody cares about, bitches
>>
>>108029956
i tried it but it has errors with res4yl
>>
File: 00001-4017129897.png (1.24 MB, 896x1152)
>Prompt: a cute bitch in the desert
>>
>>108029969
proof thats a bitch?
>>
>>108029971
Would it lie to me?
>>
>>108029986
>would a diffusion model lie?
tourist retard
>>
klein edit 9b to make fent man

ltx2 to animate it (i2v workflow from here is nice: https://huggingface.co/RuneXX/LTX-2-Workflows/tree/main)

https://files.catbox.moe/kmh6ua.mp4
>>
>>108029986
if you let it
>>
>>108029989
>tourist calling someone else tourist
>>108029992
I didn't
>>
File: 379708.png (1.86 MB, 1024x1024)
>>108029989
the only retarded ones are the ones who stick to this website
picrel is a tourist retard, he looks happy, happier than anyone who seems to frequent here enough to know about the """culture"""
>>
the culture btw is an anonymous schizo blood feud
>>
For me the biggest improvement in achieving likeness was to be organized when collecting the dataset. Have a folder structure that has all angles and shot types and fill it and you will know what you are missing and what you have. Also you should have cropped 1:1 headshots from all angles.

And if you can't be bothered to manage the bucket sizes just use 1:1 ratio.
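something dumb like this makes the gaps obvious before you train (folder layout is just my own convention):

from pathlib import Path

root = Path("dataset")
buckets = ["front", "profile", "three_quarter", "back",
           "headshot", "upper_body", "full_body"]
for b in buckets:
    # count images per angle/shot-type bucket
    n = sum(1 for ext in ("*.png", "*.jpg") for _ in (root / b).glob(ext))
    print(f"{b:>14}: {n}" + ("  <-- MISSING" if n == 0 else ""))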
>>
File: Flux2-Klein_00059_.png (2.18 MB, 1088x944)
time to bake, schizo
>>
>>108029912
Good job
>>
File: 003453456_.jpg (243 KB, 1344x768)
Welp, seems like the character threshold is around 500-600 pics on danbooru at the very least, I tried with characters around 300 and 400 pics but it couldn't shit them out, not even close
>>
>>108029699
>he fell for the fp8 meme
>>
>>108030066
prompt? source is obviously a pepe but is it 1 or 2 images
>>
>>108029996
I made this general mongrel
>>
>>108030090
Presumably it should be because it's not completely trained, else it's total horseshit
>>
File: Flux2-Klein_00060_.png (2.19 MB, 1024x1008)
>>108030093
just 1 image
>Transfer the image into a vibrant graffiti street-mural style.
>>
>>108030104
i dont believe you
>>
File: 1753147770685426.png (590 KB, 589x665)
https://files.catbox.moe/ch5bgx.mp4
>>
>>108030107
jej
>>
Anybody else got this issue with Comfy where it corrupts checkpoints or TEs? Had this happen on Klein and now on Anima; for Klein I had to keep redownloading the TE because it got fucked up after every use, and now for Anima it's the checkpoint. Every time I close Comfy after running those models and then open it again later they're fucked up and I don't know what's causing it. It doesn't do that with WAN or LTX or SDXL.
>>
File: Flux2-Klein_00107_.png (504 KB, 704x768)
>>108030108
>>
A LOT OF LOYALTY FOR AN OPENAI SHILL

https://files.catbox.moe/o1hgfk.mp4
>>
>>108030115
>FUD attempt #24526
>>
Let's say I am into realistic nude dolphins.
Would Wan2.2 still be the most adequate for the task?
>>
>>108030115
I have never heard of that and cannot even fathom how that would happen. Are you saying the weights on your computer have become corrupted?
>>
>>108030108
I cringed
>>
>>108030119
can you make him give birth
>>
>>108030149
it would be extremely painful
>>
File: o_00266_.png (3.76 MB, 2560x1536)
>>
>>108030152
for you
>>
File: o_00268_.png (3.96 MB, 2560x1536)
>>
>>108030166
enough shrooms
>>
If im training a character lora for klein9B with 70 images, should i also include a regularization dataset? I also want to combine it with other loras and use it for editing.
>>
File: o_00269_.jpg (1.43 MB, 2560x1536)
>>
>>108030171
yeah
>>
>>108030175
Any good regularization dataset you can recommend? The ones i found online all look like sdxl slop. Or should i just grab 30 images off google and caption them myself?
>>
>>108030171
On paper, you should always use a regularization dataset, but in practice few people ever do.
>>
>>108030182
I'm checking, gimme a sec
>>
>>108030145
Yeah I have no idea how it's happening either. When I redownload the models it works just fine, but after closing Comfy and reopening it and loading the models again they either produce only patterned noise (Anima), or Comfy gives me an error for the TE (Klein).
AND JUST AS I MAKE THIS POST AND RUN MORE TESTS IT STOPS DOING IT WTF. Guess it's "nvm fixed :)" now.
>>
File: ComfyUI_temp_vvofp_00001_.jpg (740 KB, 1024x1536)
>>108030196
NO THERE IT IS AGAIN AAAAAAAAAAAAA
This is the Anima output after reloading it.
>>
>>108030171
What do you want to regularize for? No one uses regularization images because no one cares about overfitting stray tags since it's a single hot-swappable lora and not a fine-tune
>I also want to combine it with other loras and use it for editing.
Won't work 100% well unless they are trained together, lora weights conflict when they are trained separately
https://arxiv.org/abs/2311.13600
https://arxiv.org/abs/2412.04465
>>
>>108030033
its a good train of thought, but you're never gonna have enough training data to cover all those categories with quality images. your dataset should always be focused on teaching what you want to reproduce. for me it's loose dresses or tops that show the natural breast shape, sucking on things, sticking tongue out, breast shape when lying on her back, etc. of course you should balance all this with close-ups, expressions you like, full body images, different lighting, poses, etc. but if you're too focused on just autistically trying to cover every single angle, you're gonna get a lora that's good at making images of her just standing, and not much else.
>>
File: 1740305203967921.png (111 KB, 700x691)
lmao video extend is gold. cia talking about moot:

https://files.catbox.moe/7zlwbv.mp4
>>
File: 1739568488558801.png (259 KB, 916x779)
>its another catbox video gen episode
>>
>>108030229
where is my fent spam
>>
File: Flux2-Klein_00110_.png (527 KB, 704x768)
>>108030229
>>
>>108030229
Wow
Incredible
10 seconds of shitty ai generated voice
>>
I'll take fentposting over obsessed namefaggots any day
>>
new thread

>>108030237
>>108030237
>>108030237
>>
>>108030220
The base model should take care of the poses. What you are teaching is the shape. And, yes, it's quite easy to fill all those categories most of the time.
>>
>>108030204
Are you using the Flux2-specific latent?
>>
>>108028761
soulkino, catbox?


