/g/ - /ldg/ - Local Diffusion General - Technology

[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]

Board

▼ Settings Mobile Home

/g/ - Technology

Return Catalog Bottom Refresh

Thread archived.
You cannot reply anymore.

[Advertise on 4chan]

[Return] [Catalog] [Bottom]

Anonymous

/ldg/ - Local Diffusion Genera(...) 07/06/24(Sat)18:05:58 No.101301739

File: long dick general.jpg (3.95 MB, 3264x2790)

3.95 MB JPG

/ldg/ - Local Diffusion General Anonymous 07/06/24(Sat)18:05:58 No.101301739 Archived

General dedicated to creative use of free and open source text-to-image models.

Previous /ldg/ bread : >>101292106

Renaissance Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio
EasyDiffusion: https://easydiffusion.github.io

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
StableSwarmUI: https://github.com/Stability-AI/StableSwarmUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
ComfyUI: https://github.com/comfyanonymous/ComfyUI

>Auto1111 forks
SD.Next: https://github.com/vladmandic/automatic
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
Anapnoe UX: https://github.com/anapnoe/stable-diffusion-webui-ux

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Models, LoRAs & Training
https://civitai.com
https://huggingface.co
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
Comfy Nodes: https://github.com/city96/ComfyUI_ExtraModels
*Also supported by SD.Next

>Animation
https://rentry.org/AnimAnon
https://rentry.org/AnimAnon-AnimDiff
https://rentry.org/AnimAnon-Deforum

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Share image prompt info
https://rentry.org/hdgcb
https://catbox.moe

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg

Anonymous
07/06/24(Sat)18:07:20 No.101301751

Anonymous 07/06/24(Sat)18:07:20 No.101301751

Blessed thread of frenship

Anonymous
07/06/24(Sat)18:13:41 No.101301814

Anonymous 07/06/24(Sat)18:13:41 No.101301814

File: 0.jpg (438 KB, 2048x1024)

438 KB JPG

Anonymous
07/06/24(Sat)18:18:18 No.101301861

Anonymous 07/06/24(Sat)18:18:18 No.101301861

>>101301739
nice collage

Anonymous
07/06/24(Sat)18:18:29 No.101301865

Anonymous 07/06/24(Sat)18:18:29 No.101301865

File: 00001-3183164047.jpg (554 KB, 1344x2016)

554 KB JPG

Anonymous
07/06/24(Sat)18:25:01 No.101301938

Anonymous 07/06/24(Sat)18:25:01 No.101301938

File: tmp0_i6_9_9.png (1.14 MB, 1344x768)

1.14 MB PNG

Anonymous
07/06/24(Sat)18:26:46 No.101301959

Anonymous 07/06/24(Sat)18:26:46 No.101301959

File: cce3.jpg (115 KB, 1024x1024)

115 KB JPG

Anonymous
07/06/24(Sat)18:30:11 No.101301995

Anonymous 07/06/24(Sat)18:30:11 No.101301995

File: 00214-3467391479.jpg (398 KB, 1176x1764)

398 KB JPG

Anonymous
07/06/24(Sat)18:33:06 No.101302023

Anonymous 07/06/24(Sat)18:33:06 No.101302023

>>101301617
Anybody knows?

Anonymous
07/06/24(Sat)18:33:58 No.101302032

Anonymous 07/06/24(Sat)18:33:58 No.101302032

>>101302023
youtube has a tutorial on it

Anonymous
07/06/24(Sat)18:36:20 No.101302064

Anonymous 07/06/24(Sat)18:36:20 No.101302064

File: 00232-3467391481.jpg (721 KB, 1260x1680)

721 KB JPG

Anonymous
07/06/24(Sat)18:43:27 No.101302140

Anonymous 07/06/24(Sat)18:43:27 No.101302140

File: 00261-3467391478.jpg (381 KB, 1260x1680)

381 KB JPG

ripley

Anonymous
07/06/24(Sat)18:51:09 No.101302212

Anonymous 07/06/24(Sat)18:51:09 No.101302212

>>101301625
please put the prompt in SD3 and post the results. Let's see if there is any "improvement".
And of course there is a large memory requirement. You need to swap the CLIP with beefy LLM in order to get good textual understanding. LLMs require lots of memory. There will be optimizations down the line, but the truth is simply that the next generation of image models require significantly more memory than Stable Diffusion 1.4 or SDXL.

If you are looking at buying GPUs and you want to play with best local models in the future, you will need to invest into something that has at least 16GB of vram. It's the price you pay for superior textual understanding. If not, then you can continue to use Stable Diffusion 1.4 or whatever on your 4GB card. These researchers are always targeting the 90 class card for their releases. Something that a normal consumer could get access to.
If the 5090 has 32GB of vram, then you can bet your ass that the next generation of diffusion models from research labs are targeting that.

Anonymous
07/06/24(Sat)18:51:55 No.101302220

Anonymous 07/06/24(Sat)18:51:55 No.101302220

File: 1713153346418885.png (24 KB, 751x415)

24 KB PNG

https://github.com/plemeri/InSPyReNet
i'm looking for an .onnx for this model but i cant for the life of me find it, if it even exists. i've found a .pth but the imagesegmentation on comfyui requires it to be .onnx as i understand it.

Anonymous
07/06/24(Sat)18:52:00 No.101302222

Anonymous 07/06/24(Sat)18:52:00 No.101302222

File: 2356215439.jpg (76 KB, 896x768)

76 KB JPG

Anonymous
07/06/24(Sat)18:58:18 No.101302271

Anonymous 07/06/24(Sat)18:58:18 No.101302271

File: 00316-3467391479.jpg (618 KB, 1260x1680)

618 KB JPG

Anonymous
07/06/24(Sat)18:58:58 No.101302279

Anonymous 07/06/24(Sat)18:58:58 No.101302279

File: kolor vs sd3.jpg (607 KB, 1696x1732)

607 KB JPG

>>101301625
>>101302212

Anonymous
07/06/24(Sat)19:49:51 No.101302792

Anonymous 07/06/24(Sat)19:49:51 No.101302792

File: 00473-663714552.jpg (330 KB, 941x1260)

330 KB JPG

Anonymous
07/06/24(Sat)19:51:19 No.101302807

Anonymous 07/06/24(Sat)19:51:19 No.101302807

>>101302279
can this model run the llm first, then do the inference to save vram like SD3? or since it's unet, both have to be used at once?

Anonymous
07/06/24(Sat)19:56:22 No.101302875

Anonymous 07/06/24(Sat)19:56:22 No.101302875

File: 240707_122.jpg (63 KB, 512x768)

63 KB JPG

it's not local but fun. cant stop generating since yesterday.
I want strong machine

Anonymous
07/06/24(Sat)20:02:40 No.101302949

Anonymous 07/06/24(Sat)20:02:40 No.101302949

File: 00004-663714549.jpg (1.1 MB, 1890x2520)

1.1 MB JPG

>>101302875
fun hobby, can't recommend enough

Anonymous
07/06/24(Sat)20:08:18 No.101303002

Anonymous 07/06/24(Sat)20:08:18 No.101303002

File: 00005-299241702.jpg (1.03 MB, 1890x2520)

1.03 MB JPG

Anonymous
07/06/24(Sat)20:18:33 No.101303107

Anonymous 07/06/24(Sat)20:18:33 No.101303107

File: 00006-2111909322.jpg (1.08 MB, 1890x2520)

1.08 MB JPG

Anonymous
07/06/24(Sat)21:31:55 No.101303902

Anonymous 07/06/24(Sat)21:31:55 No.101303902

File: bunline2k1024512_1024V08s(...).jpg (289 KB, 768x1024)

289 KB JPG

For some reason this model likes to use yellow background

Anonymous
07/06/24(Sat)21:31:55 No.101303904

Anonymous 07/06/24(Sat)21:31:55 No.101303904

Any way to run Kolors with comfy yet?

Anonymous
07/06/24(Sat)21:34:40 No.101303930

Anonymous 07/06/24(Sat)21:34:40 No.101303930

File: canvas (1).png (1.56 MB, 840x1256)

1.56 MB PNG

>>101302949
>>101303002
>>101303107
Model?

Anonymous
07/06/24(Sat)21:44:29 No.101304030

Anonymous 07/06/24(Sat)21:44:29 No.101304030

File: PA_0001.jpg (795 KB, 2560x1536)

795 KB JPG

Anonymous
07/06/24(Sat)21:49:53 No.101304093

Anonymous 07/06/24(Sat)21:49:53 No.101304093

>>101303904
https://github.com/kijai/ComfyUI-KwaiKolorsWrapper

Anonymous
07/06/24(Sat)21:52:19 No.101304116

Anonymous 07/06/24(Sat)21:52:19 No.101304116

File: PA_0002.jpg (763 KB, 2560x1536)

763 KB JPG

Anonymous
07/06/24(Sat)21:53:29 No.101304135

Anonymous 07/06/24(Sat)21:53:29 No.101304135

File: PA_0003.jpg (812 KB, 2560x1536)

812 KB JPG

Anonymous
07/06/24(Sat)22:07:52 No.101304302

Anonymous 07/06/24(Sat)22:07:52 No.101304302

File: PA_0023.jpg (763 KB, 2560x1536)

763 KB JPG

Anonymous
07/06/24(Sat)22:10:13 No.101304326

Anonymous 07/06/24(Sat)22:10:13 No.101304326

>>101304302
Sick pixel art

Anonymous
07/06/24(Sat)22:12:46 No.101304353

Anonymous 07/06/24(Sat)22:12:46 No.101304353

File: PA_0026.jpg (685 KB, 2560x1536)

685 KB JPG

Anonymous
07/06/24(Sat)22:15:31 No.101304378

Anonymous 07/06/24(Sat)22:15:31 No.101304378

File: PA_0027.jpg (634 KB, 2560x1536)

634 KB JPG

>>101304326
Thanks

Anonymous
07/06/24(Sat)22:18:40 No.101304403

Anonymous 07/06/24(Sat)22:18:40 No.101304403

>>101304378
Np, I just realized this one >>101304116 is also very cool

Anonymous
07/06/24(Sat)22:20:38 No.101304419

Anonymous 07/06/24(Sat)22:20:38 No.101304419

>>101304093
Where is quantized text encoder?

Anonymous
07/06/24(Sat)22:20:42 No.101304420

Anonymous 07/06/24(Sat)22:20:42 No.101304420

File: PA_0029.jpg (942 KB, 2560x1536)

942 KB JPG

>>101304116
>>101304135
This prompt always gens awesome things

Anonymous
07/06/24(Sat)22:27:55 No.101304504

Anonymous 07/06/24(Sat)22:27:55 No.101304504

>>101304420
what was the prompt? still using the new bunline? really awesome

Anonymous
07/06/24(Sat)22:32:19 No.101304558

Anonymous 07/06/24(Sat)22:32:19 No.101304558

File: KOLORS_00001_.png (3.71 MB, 1920x1088)

3.71 MB PNG

Anonymous
07/06/24(Sat)22:34:07 No.101304585

Anonymous 07/06/24(Sat)22:34:07 No.101304585

Is kolors a model created from scratch or is it just a hypertuned XL?

Anonymous
07/06/24(Sat)22:34:16 No.101304591

Anonymous 07/06/24(Sat)22:34:16 No.101304591

>>101304504
That's what creates pixels all over it.

Noita is a magical action roguelite set in a world where every pixel is physically simulated. Fight, explore, melt, burn, freeze and evaporate your way through the procedurally generated world using spells you've created yourself.

Anonymous
07/06/24(Sat)22:34:27 No.101304595

Anonymous 07/06/24(Sat)22:34:27 No.101304595

>>101304558
all we need is a comfy node, the UNET is about the same size as XL.

Anonymous
07/06/24(Sat)22:36:05 No.101304613

Anonymous 07/06/24(Sat)22:36:05 No.101304613

File: PA_0034.jpg (681 KB, 2560x1536)

681 KB JPG

Anonymous
07/06/24(Sat)22:36:28 No.101304618

Anonymous 07/06/24(Sat)22:36:28 No.101304618

>>101304595
You forgot the giganiga bytes of text encoder. That;s the big problem for vramlets!

Anonymous
07/06/24(Sat)22:37:54 No.101304631

Anonymous 07/06/24(Sat)22:37:54 No.101304631

File: PA_0035.jpg (860 KB, 2560x1536)

860 KB JPG

Anonymous
07/06/24(Sat)22:39:00 No.101304647

Anonymous 07/06/24(Sat)22:39:00 No.101304647

File: PA_0036.jpg (993 KB, 2560x1536)

993 KB JPG

>>101304504
It's a mix of Bunlinev8 and booruMadness

Anonymous
07/06/24(Sat)22:39:21 No.101304651

Anonymous 07/06/24(Sat)22:39:21 No.101304651

>>101304618
can't you run that on RAM? or do it in a similar way to sd3? also >>101304419

Anonymous
07/06/24(Sat)22:40:59 No.101304665

Anonymous 07/06/24(Sat)22:40:59 No.101304665

>>101304647
>booruMadness
you try https://civitai.com/models/505948/pixart-sigma-1024px512px-animetune ?

Anonymous
07/06/24(Sat)22:41:21 No.101304669

Anonymous 07/06/24(Sat)22:41:21 No.101304669

>>101304618
Can you use that text encoder for PixArt?

Anonymous
07/06/24(Sat)22:41:25 No.101304671

Anonymous 07/06/24(Sat)22:41:25 No.101304671

File: PA_0037.jpg (1 MB, 2560x1536)

1 MB JPG

Anonymous
07/06/24(Sat)22:42:41 No.101304678

Anonymous 07/06/24(Sat)22:42:41 No.101304678

File: PA_0038.jpg (986 KB, 2560x1536)

986 KB JPG

>>101304665
No, I don't think I have. Don't usually gen 1girl unless the thread derails

Anonymous
07/06/24(Sat)22:43:18 No.101304686

Anonymous 07/06/24(Sat)22:43:18 No.101304686

>>101304419
can you use the SD3 T5 on this model?

Anonymous
07/06/24(Sat)22:43:49 No.101304689

Anonymous 07/06/24(Sat)22:43:49 No.101304689

>>101304678
i imagine you might be able to coax some non-1girl out of it but im just speculating ive never tried myself

Anonymous
07/06/24(Sat)22:43:57 No.101304691

Anonymous 07/06/24(Sat)22:43:57 No.101304691

>>101304669
From the workflow image it seems different. It's called ChatGLM3 model.

Anonymous
07/06/24(Sat)22:44:20 No.101304694

Anonymous 07/06/24(Sat)22:44:20 No.101304694

File: PA_0039.jpg (917 KB, 2560x1536)

917 KB JPG

Anonymous
07/06/24(Sat)22:46:41 No.101304716

Anonymous 07/06/24(Sat)22:46:41 No.101304716

File: KOLORS_00013_.png (1.96 MB, 1024x1024)

1.96 MB PNG

KOLORS works pretty well right out of the box. Really impressive for a base model.
I think they really overstated the prompt adherence though.

Anonymous
07/06/24(Sat)22:49:23 No.101304746

Anonymous 07/06/24(Sat)22:49:23 No.101304746

File: PA_0041.jpg (628 KB, 2560x1536)

628 KB JPG

>>101304689
I like that the model maker posted some Training stats

Below I will list the GPU and training time I used for my training. Please use it as a reference for your training!

If you want to know the exact settings, please download the onetrainer data.

GPU: RTX 4060 Ti 16GB

■512px

Batch size: 48

70,000 / 48 = 1,500 steps

1 epoch: 5 hours

15 epochs: 75 hours

GPU usage: 13GB

Anonymous
07/06/24(Sat)22:52:26 No.101304785

Anonymous 07/06/24(Sat)22:52:26 No.101304785

>>101304716
the model is uncesored, but what about copyright, can you throw some anime characters in there? maybe some artist names?

Anonymous
07/06/24(Sat)22:52:33 No.101304788

Anonymous 07/06/24(Sat)22:52:33 No.101304788

File: KOLORS_00028_.png (962 KB, 768x1024)

962 KB PNG

Anonymous
07/06/24(Sat)22:53:32 No.101304801

Anonymous 07/06/24(Sat)22:53:32 No.101304801

>>101304788
how are you making these?

Anonymous
07/06/24(Sat)22:53:35 No.101304803

Anonymous 07/06/24(Sat)22:53:35 No.101304803

File: KOLORS_00029_.png (1.12 MB, 768x1024)

1.12 MB PNG

>>101304785
Give me a prompt and I'll see what it spits out

Anonymous
07/06/24(Sat)22:54:38 No.101304814

Anonymous 07/06/24(Sat)22:54:38 No.101304814

please keep your fatjak on his leash, he's running free in sdgs yard again

Anonymous
07/06/24(Sat)22:56:52 No.101304841

Anonymous 07/06/24(Sat)22:56:52 No.101304841

File: KOLORS_00033_.png (1.24 MB, 1024x1024)

1.24 MB PNG

>>101304801
idk, it was posted here
https://github.com/kijai/ComfyUI-KwaiKolorsWrapper

Anonymous
07/06/24(Sat)22:59:37 No.101304861

Anonymous 07/06/24(Sat)22:59:37 No.101304861

>>101304841
thanks i will check it out

Anonymous
07/06/24(Sat)23:01:10 No.101304876

Anonymous 07/06/24(Sat)23:01:10 No.101304876

File: bunline2k1024512_1024V08s(...).png (2.52 MB, 1536x1177)

2.52 MB PNG

For 2D gen dpm++2sa prob not a mandatory

Anonymous
07/06/24(Sat)23:03:07 No.101304903

Anonymous 07/06/24(Sat)23:03:07 No.101304903

>>101304803
NTA
90s retro anime screencap, an illustration inspired by the anime 'Ghost in the Shell', Depict a woman in a stasis tank, capturing the iconic aesthetic and atmosphere of the anime.

Anonymous
07/06/24(Sat)23:04:58 No.101304918

Anonymous 07/06/24(Sat)23:04:58 No.101304918

i will wait for the quant 8/4 version and a proper implementation before trying it. the life of a 8gb VRAMlet is not easy.

Anonymous
07/06/24(Sat)23:07:26 No.101304939

Anonymous 07/06/24(Sat)23:07:26 No.101304939

File: KOLORS_00061_.png (1.51 MB, 1024x1024)

1.51 MB PNG

>>101304903

I regret to inform you that it bungled the assignment

Anonymous
07/06/24(Sat)23:08:28 No.101304953

Anonymous 07/06/24(Sat)23:08:28 No.101304953

File: KOLORS_00056_.png (1.46 MB, 1024x1024)

1.46 MB PNG

>>101304939

Another one

Anonymous
07/06/24(Sat)23:09:26 No.101304959

Anonymous 07/06/24(Sat)23:09:26 No.101304959

File: PA_0042.jpg (257 KB, 2560x1536)

257 KB JPG

>>101304689
Didn't do anime for me.

Anonymous
07/06/24(Sat)23:10:04 No.101304966

Anonymous 07/06/24(Sat)23:10:04 No.101304966

File: KOLORS_00063_.png (1.51 MB, 1024x1024)

1.51 MB PNG

>>101304953
>>101304939
Here's one after putting things like 3D, CGI etc in the negatives.

Anonymous
07/06/24(Sat)23:14:32 No.101305008

Anonymous 07/06/24(Sat)23:14:32 No.101305008

File: KOLORS_00075_.png (1.59 MB, 1024x1024)

1.59 MB PNG

I just don't think it gets what a stasis tank is.

Anonymous
07/06/24(Sat)23:16:27 No.101305016

Anonymous 07/06/24(Sat)23:16:27 No.101305016

>>101304939
>>101304953
>>101304966
I don't see the resemblance, it does understand retro/90s though

Anonymous
07/06/24(Sat)23:18:50 No.101305034

Anonymous 07/06/24(Sat)23:18:50 No.101305034

File: KOLORS_00084_.png (1.49 MB, 1024x1024)

1.49 MB PNG

>>101305016

I think it suffers from the same issue pixart has in that most of the captioning is done by an LLM and the LLM is just describing what it sees. So while a person captioning might say, "That's ghost in the shell" the LLM will say "That's a 90's anime that appears to depict a woman with purple hair" etc etc

Anonymous
07/06/24(Sat)23:27:16 No.101305114

Anonymous 07/06/24(Sat)23:27:16 No.101305114

>>101305016
Maybe you can save some tokens if you put

>timeless 1girl

Anonymous
07/06/24(Sat)23:39:59 No.101305245

Anonymous 07/06/24(Sat)23:39:59 No.101305245

I can't go back to unet after trying out dit thodesu

Anonymous
07/06/24(Sat)23:45:13 No.101305294

Anonymous 07/06/24(Sat)23:45:13 No.101305294

File: file.jpg (880 KB, 1664x2304)

880 KB JPG

>>101301739
Nice

Anonymous
07/06/24(Sat)23:56:25 No.101305404

Anonymous 07/06/24(Sat)23:56:25 No.101305404

>>101305034
>most
i thought it was more like 50/50?

Anonymous
07/07/24(Sun)00:04:33 No.101305494

Anonymous 07/07/24(Sun)00:04:33 No.101305494

File: file.jpg (995 KB, 1664x2304)

995 KB JPG

>>101303930
maybe https://civitai.com/models/550737?modelVersionId=612841

Anonymous
07/07/24(Sun)00:04:59 No.101305497

Anonymous 07/07/24(Sun)00:04:59 No.101305497

File: ComfyUI_3D_epicphotogasm_(...).png (2.62 MB, 1920x1072)

2.62 MB PNG

>2023-12-26
1.5 just HITS DIFFERENT

Anonymous
07/07/24(Sun)00:06:37 No.101305515

Anonymous 07/07/24(Sun)00:06:37 No.101305515

File: ComfyUI_3D_epicphotogasm_(...).png (1.97 MB, 1600x896)

1.97 MB PNG

Anonymous
07/07/24(Sun)00:08:33 No.101305536

Anonymous 07/07/24(Sun)00:08:33 No.101305536

File: bunline2k1024512_1024V08s(...).jpg (485 KB, 768x1024)

485 KB JPG

Anonymous
07/07/24(Sun)00:09:06 No.101305546

Anonymous 07/07/24(Sun)00:09:06 No.101305546

File: ComfyUI_3D_epicphotogasm_(...).png (1.28 MB, 896x1280)

1.28 MB PNG

Anonymous
07/07/24(Sun)00:12:38 No.101305578

Anonymous 07/07/24(Sun)00:12:38 No.101305578

>>101305536
nice

Anonymous
07/07/24(Sun)00:35:03 No.101305770

Anonymous 07/07/24(Sun)00:35:03 No.101305770

>>101304959
are you using booru tags?

Anonymous
07/07/24(Sun)00:42:00 No.101305822

Anonymous 07/07/24(Sun)00:42:00 No.101305822

File: PA_0001.jpg (497 KB, 2560x1536)

497 KB JPG

>>101305770
It's a weird model, had to use makers workflow to test things out (didn't go well) so I'm back to my original one modified a bit to look like his. Finally getting some results.

Anonymous
07/07/24(Sun)01:05:53 No.101305986

Anonymous 07/07/24(Sun)01:05:53 No.101305986

File: PA_0011.jpg (446 KB, 2560x1536)

446 KB JPG

Anonymous
07/07/24(Sun)01:18:05 No.101306054

Anonymous 07/07/24(Sun)01:18:05 No.101306054

File: file.png (3.7 MB, 1664x2304)

3.7 MB PNG

Anonymous
07/07/24(Sun)01:34:00 No.101306157

Anonymous 07/07/24(Sun)01:34:00 No.101306157

Does the new update to a1111 make it as good as forge? Has anyone tried?

Anonymous
07/07/24(Sun)01:42:25 No.101306210

Anonymous 07/07/24(Sun)01:42:25 No.101306210

File: PA_0012.jpg (333 KB, 2048x2048)

333 KB JPG

>>101305986
>>101305822

Anonymous
07/07/24(Sun)01:55:16 No.101306307

Anonymous 07/07/24(Sun)01:55:16 No.101306307

File: file.png (3.06 MB, 1664x2304)

3.06 MB PNG

>>101306210
nice sigma anime

Anonymous
07/07/24(Sun)02:04:54 No.101306390

Anonymous 07/07/24(Sun)02:04:54 No.101306390

File: PA_0391.png (1.3 MB, 1024x1024)

1.3 MB PNG

https://files.catbox.moe/n348s1.png

All the progress I've made with it as a workflow.

I would also suggest using for description ( about a paragraph) of what you're trying to make.
In example.
Gandalf is described as a tall, slender man with a long white beard and bushy eyebrows that stick out beyond the brim of his hat. He wears a tall pointed blue hat, a long grey cloak, and a silver scarf. He is often depicted as having a wise and authoritative presence.

Anonymous
07/07/24(Sun)02:07:43 No.101306419

Anonymous 07/07/24(Sun)02:07:43 No.101306419

File: kolors.png (1.69 MB, 1024x1024)

1.69 MB PNG

tldr; on kolors? tried on it a huggingface space and it's looking pretty good! is there a way to offload the text encoder to cpu like with pixart or sd3? so far we now have this and hunyuan. there's also the new lumina and pixart bigma models to be released. seems like /ldg/ will be eatin good soon.

Anonymous
07/07/24(Sun)02:37:43 No.101306650

Anonymous 07/07/24(Sun)02:37:43 No.101306650

>>101306419
samples look good but I can't run it either

Anonymous
07/07/24(Sun)02:50:50 No.101306759

Anonymous 07/07/24(Sun)02:50:50 No.101306759

File: ComfyUI_temp_eclgh_00030_.png (1.65 MB, 1024x1024)

1.65 MB PNG

My thoughts on kolors so far: It's extremely good for a base mode. Like really good.
As for prompt adherence, it's kinda hit or miss. It doesn't understand brands or people or IPs unless they're ludicrously famous. Jesus, pikachu etc. Don't type in megumin and expect to get megumin. Even if she's in the dataset, she probably wasn't tagged as such because it looks like it was tagged by an LLM.

Anonymous
07/07/24(Sun)02:51:17 No.101306763

Anonymous 07/07/24(Sun)02:51:17 No.101306763

>>101306759
hands lookin good my man

Anonymous
07/07/24(Sun)02:52:47 No.101306778

Anonymous 07/07/24(Sun)02:52:47 No.101306778

>>101306759
if its prompt adherence is so so, what do you like about it?

Anonymous
07/07/24(Sun)02:59:30 No.101306819

Anonymous 07/07/24(Sun)02:59:30 No.101306819

File: file.png (3.68 MB, 1664x2304)

3.68 MB PNG

Anonymous
07/07/24(Sun)03:03:49 No.101306853

Anonymous 07/07/24(Sun)03:03:49 No.101306853

>>101306778
I think the outputs look good on their own, but it's held back by its prompt adherence. It honestly might be because it was trained on Chinese as well as English. Like is a strong word though. I'm lukewarm on it.

Anonymous
07/07/24(Sun)03:05:12 No.101306862

Anonymous 07/07/24(Sun)03:05:12 No.101306862

File: bunline2k1024512_1024V08s(...).jpg (506 KB, 768x1024)

506 KB JPG

what matters prob just 'vintage'

Anonymous
07/07/24(Sun)03:16:12 No.101306936

Anonymous 07/07/24(Sun)03:16:12 No.101306936

>>101306759
Characters and concepts can be added later easily. When I look at base model, all I care about are just three things:
>Prompt following
How complicated can my prompt be with multiple elements/concepts
>Image quality
How nice looking and crips are the images.
>Anatomy
Are humans and animals anatomically correct with no extra or merged limbs.

Only thing I don't like about Kolors is the prompt following. It feels like it does better in Chinese and always translating your shit is a hassle, but doable. It has pretty basic image quality. Lower than base SDXL.
It however has superior anatomical understanding. It's on par with Dalle 3 with that regard.
I wonder if you could swap the LLM out to something more basic and still keep the anatomical understanding. There is no need for Chinese understanding for those that speak English and also the prompt following abilities are just not there, I don't think the LLM is very useful and it's just pure bloat, if the image quality and anatomy stays same without it.

Anonymous
07/07/24(Sun)03:20:57 No.101306960

Anonymous 07/07/24(Sun)03:20:57 No.101306960

I hold a doctorate in applied synthography

Anonymous
07/07/24(Sun)03:22:11 No.101306967

Anonymous 07/07/24(Sun)03:22:11 No.101306967

>>101306960
Applied Synthology

Anonymous
07/07/24(Sun)03:28:53 No.101306990

Anonymous 07/07/24(Sun)03:28:53 No.101306990

>>101306960
pfff thats nothing compared to promptsmithing.
Get a real job loser

Anonymous
07/07/24(Sun)03:29:56 No.101306997

Anonymous 07/07/24(Sun)03:29:56 No.101306997

>manual image generation

Anonymous
07/07/24(Sun)03:37:39 No.101307051

Anonymous 07/07/24(Sun)03:37:39 No.101307051

File: file.png (3.57 MB, 1664x2304)

3.57 MB PNG

Anonymous
07/07/24(Sun)04:01:32 No.101307190

Anonymous 07/07/24(Sun)04:01:32 No.101307190

>>101303930
>>101305494
yeah that's the one

Anonymous
07/07/24(Sun)04:08:19 No.101307234

Anonymous 07/07/24(Sun)04:08:19 No.101307234

>>101306936
Too true, anyone who complains about characters or artists I think are silly because an individual can cook up an artist lora or character lora so easily on their own and share with others.

Anonymous
07/07/24(Sun)04:15:32 No.101307281

Anonymous 07/07/24(Sun)04:15:32 No.101307281

File: HANDS.png (2.43 MB, 2048x1024)

2.43 MB PNG

>>101304716
>>101302279
Yeah, that's probably the best local model we got there, it has good anatomy, is uncensored, can do hands! Too bad it's fucking unet and not DiT though...

Anonymous
07/07/24(Sun)04:25:59 No.101307341

Anonymous 07/07/24(Sun)04:25:59 No.101307341

File: file.png (1.65 MB, 1024x1024)

1.65 MB PNG

man, 1girl posters would love this model... if they could run it. fingers crossed they release a version where you can offload the llm onto cpu.
>i could not gen it with her fingers crossed

Anonymous
07/07/24(Sun)04:26:47 No.101307343

Anonymous 07/07/24(Sun)04:26:47 No.101307343

File: file.jpg (988 KB, 1920x2176)

988 KB JPG

Anonymous
07/07/24(Sun)04:28:41 No.101307360

Anonymous 07/07/24(Sun)04:28:41 No.101307360

>>101307281
>unet and not DiT
Does it matter if it works? Is DiT more efficient?

Anonymous
07/07/24(Sun)04:31:53 No.101307375

Anonymous 07/07/24(Sun)04:31:53 No.101307375

>>101307360
>Is DiT more efficient?
It's way more efficient, look at Sora for example, that's an example of a good DiT model

Ultimately, Colors could possibly be the step forward SDXL, but we can do even better with DiT models (unfortunately we got shit DiT models like SD3, pixart, hunuyuan)

Anonymous
07/07/24(Sun)04:34:08 No.101307388

Anonymous 07/07/24(Sun)04:34:08 No.101307388

>>101307375
Isn't pixart's only "fault" being under trained? I don't think that puts it on the same level of suck as the other two.

Anonymous
07/07/24(Sun)04:35:00 No.101307391

Anonymous 07/07/24(Sun)04:35:00 No.101307391

File: 16.jpg (32 KB, 344x281)

32 KB JPG

>>101307375
>unfortunately we got shit DiT models
surely there is no correlation between DiT and bad quality

Anonymous
07/07/24(Sun)04:37:22 No.101307408

Anonymous 07/07/24(Sun)04:37:22 No.101307408

>>101307391
Did you purposely stoped reading the part where I said that Sora is a DiT model?
https://www.youtube.com/watch?v=h37A4zocIFg

Anonymous
07/07/24(Sun)04:47:16 No.101307469

Anonymous 07/07/24(Sun)04:47:16 No.101307469

>>101302279
>>101307281
>>101304716
the only mildly interesting images ive seen from that model

Anonymous
07/07/24(Sun)04:49:01 No.101307476

Anonymous 07/07/24(Sun)04:49:01 No.101307476

>>101307469
For a base model that's insane, way better than anything we ever got, finetuning this shit will be a blast

Anonymous
07/07/24(Sun)04:50:17 No.101307489

Anonymous 07/07/24(Sun)04:50:17 No.101307489

>>101307476
fair, the system reqs sound a bit harsh however

Anonymous
07/07/24(Sun)05:00:42 No.101307560

Anonymous 07/07/24(Sun)05:00:42 No.101307560

File: file.jpg (825 KB, 1920x2176)

825 KB JPG

1girl

Anonymous
07/07/24(Sun)05:01:53 No.101307568

Anonymous 07/07/24(Sun)05:01:53 No.101307568

File: file.png (1.08 MB, 1024x1024)

1.08 MB PNG

kolors does some pretty good expressions like base pixart, doesn't feel soulless.

Anonymous
07/07/24(Sun)05:06:12 No.101307595

Anonymous 07/07/24(Sun)05:06:12 No.101307595

File: bunline2k1024512_1024V08s(...).jpg (447 KB, 768x1024)

447 KB JPG

Anonymous
07/07/24(Sun)05:07:22 No.101307603

Anonymous 07/07/24(Sun)05:07:22 No.101307603

The Kolors has some fucky licensing situation going on. It claims to be apache-2.0
>The code of this project is open-sourced under the Apache-2.0 license
and then next sentence is
>We sincerely urge all developers and users to strictly adhere to the open-source license
>https://huggingface.co/Kwai-Kolors/Kolors/blob/main/MODEL_LICENSE
And in that new license they forbid any commercial usage and require you to contact them for "new license" if you intend to use it for commercially.

All the projects have apache-2.0 tags. They claim that the code only is under apache and that the model is under their own license. I wonder if it is all legally sound.

Anonymous
07/07/24(Sun)05:07:22 No.101307604

Anonymous 07/07/24(Sun)05:07:22 No.101307604

>>101307568
i'm sorry to break your enthusiasm anon, but there's nothing about it's expressions that speaks sovl

Anonymous
07/07/24(Sun)05:08:30 No.101307616

Anonymous 07/07/24(Sun)05:08:30 No.101307616

>>101307595
sovl
>>101307568
sovlless

Anonymous
07/07/24(Sun)05:08:53 No.101307622

Anonymous 07/07/24(Sun)05:08:53 No.101307622

>>101307604
im talking compared to the plastic dogshit sd models usually shit out

Anonymous
07/07/24(Sun)05:09:13 No.101307624

Anonymous 07/07/24(Sun)05:09:13 No.101307624

>>101307603
>And in that new license they forbid any commercial usage and require you to contact them for "new license" if you intend to use it for commercially.
Caught my attention as well. That's basically the SAI license terms with chink characteristics.

Anonymous
07/07/24(Sun)05:10:56 No.101307639

Anonymous 07/07/24(Sun)05:10:56 No.101307639

>>101307622
They're both equally unremarkable in terms of sovl. Even this >>101307616 anon gets it.

Anonymous
07/07/24(Sun)05:15:58 No.101307678

Anonymous 07/07/24(Sun)05:15:58 No.101307678

File: license.png (22 KB, 669x356)

22 KB PNG

>>101307624
I had GPT-4o read trough their new license and analyze it. These are the main issues it points out.

Anonymous
07/07/24(Sun)05:18:38 No.101307693

Anonymous 07/07/24(Sun)05:18:38 No.101307693

File: file.png (615 KB, 1024x1024)

615 KB PNG

>>101307639
i was planning on genning an image with extra sovl to make you eat your words, but i ended up just proving you right instead

Anonymous
07/07/24(Sun)05:19:28 No.101307700

Anonymous 07/07/24(Sun)05:19:28 No.101307700

>>101307693
i do indeed enjoy that one

Anonymous
07/07/24(Sun)05:20:20 No.101307707

Anonymous 07/07/24(Sun)05:20:20 No.101307707

File: file.png (1.99 MB, 1024x1024)

1.99 MB PNG

Anonymous
07/07/24(Sun)05:21:31 No.101307718

Anonymous 07/07/24(Sun)05:21:31 No.101307718

File: 3078d809-03e2-4ed4-a723-0(...).png (1.03 MB, 1024x1024)

1.03 MB PNG

>>101307700
i like it too, but it cannot compare to what base pixart gave me

Anonymous
07/07/24(Sun)05:21:42 No.101307720

Anonymous 07/07/24(Sun)05:21:42 No.101307720

>>101307603
>code has license A
>model has license B
it's not a difficult concept

Anonymous
07/07/24(Sun)05:24:03 No.101307736

Anonymous 07/07/24(Sun)05:24:03 No.101307736

File: file.png (1.21 MB, 1024x1024)

1.21 MB PNG

Anonymous
07/07/24(Sun)05:25:08 No.101307745

Anonymous 07/07/24(Sun)05:25:08 No.101307745

But can it do booba

Anonymous
07/07/24(Sun)05:26:39 No.101307761

Anonymous 07/07/24(Sun)05:26:39 No.101307761

>>101307720
But there is more. I looked up their questionnaire that you have to fill and send to them and you are also required to accept some new agreements.

https://kolors.kuaishou.com/agreement
https://kolors.kuaishou.com/policy

These fucking people.

Anonymous
07/07/24(Sun)05:32:41 No.101307805

Anonymous 07/07/24(Sun)05:32:41 No.101307805

>>101307761
What did you expect? It's tech demo bait, same as SD3

Anonymous
07/07/24(Sun)05:34:44 No.101307820

Anonymous 07/07/24(Sun)05:34:44 No.101307820

>>101307805
>What did you expect? It's tech demo bait, same as SD3
the problem is that it's a GOOD demo bait, SD3 sucks ass so we don't bother with this shit, but we'd love to tinker with Colors

Anonymous
07/07/24(Sun)05:34:55 No.101307822

Anonymous 07/07/24(Sun)05:34:55 No.101307822

File: awdefsrd.png (370 KB, 606x678)

370 KB PNG

>>101307761
>https://kolors.kuaishou.com/agreement
>https://kolors.kuaishou.com/policy

Anonymous
07/07/24(Sun)05:36:33 No.101307833

Anonymous 07/07/24(Sun)05:36:33 No.101307833

>>101307822
kek

Anonymous
07/07/24(Sun)05:47:37 No.101307896

Anonymous 07/07/24(Sun)05:47:37 No.101307896

File: file.png (1.72 MB, 1024x1024)

1.72 MB PNG

>>101307745
yeah.. er.. meow?
>>101307822
it's all so tiresome

Anonymous
07/07/24(Sun)05:50:21 No.101307914

Anonymous 07/07/24(Sun)05:50:21 No.101307914

>>101307896
Just curb your enthusiasm, curiously observe and adapt. It's simple as.

Anonymous
07/07/24(Sun)05:50:23 No.101307915

Anonymous 07/07/24(Sun)05:50:23 No.101307915

File: 00722-2857992323.jpg (365 KB, 1260x1680)

365 KB JPG

>>101307595

Anonymous
07/07/24(Sun)05:52:08 No.101307925

Anonymous 07/07/24(Sun)05:52:08 No.101307925

in pony prompts are you supposed to use underscores or not? what people do doesn't seem consistent because you see the typical score_9, score_8_up, etc, but you also see all other tags like "green eyes," which have no underscore and yet they're trained on booru tags which actually do have underscores on those websites. but I also see a lot of "source_cartoon, source_anime".

Anonymous
07/07/24(Sun)06:01:05 No.101307972

Anonymous 07/07/24(Sun)06:01:05 No.101307972

File: 00049-2079579033.png (1.57 MB, 1024x1024)

1.57 MB PNG

Anonymous
07/07/24(Sun)06:10:46 No.101308027

Anonymous 07/07/24(Sun)06:10:46 No.101308027

>>101307925
>in pony prompts are you supposed to use underscores or not?
You only really want to use them for two specific cases. The full score_schizo prefix, and source_mongolian. When it comes to regular prompts, with the way the tokenizer seems to work, it makes a very marginal difference. I just ran a couple of quick comparisons and it's very hit and miss in terms of quality. I'd say it's not worth the bother of adding underscores to prompts/tags. It understands them just as well without it.

Anonymous
07/07/24(Sun)06:16:36 No.101308072

Anonymous 07/07/24(Sun)06:16:36 No.101308072

>>101304716
Yeah, prompt adherence is SDXL level at best. Really disappointed trying this out myself after having read the paper.

Anonymous
07/07/24(Sun)06:16:42 No.101308075

Anonymous 07/07/24(Sun)06:16:42 No.101308075

File: 00796-2857992323.jpg (378 KB, 1075x1613)

378 KB JPG

Euler A AYS seems decent

>>101307972
castle by hr giger?

Anonymous
07/07/24(Sun)06:18:16 No.101308084

Anonymous 07/07/24(Sun)06:18:16 No.101308084

File: tmpovqfgi61.png (1.72 MB, 2048x1182)

1.72 MB PNG

Out of curiosity I'll also do a couple of comparisons for score_schizo without the underscore.

Anonymous
07/07/24(Sun)06:18:33 No.101308086

Anonymous 07/07/24(Sun)06:18:33 No.101308086

>>101302140
Blade Runner vibes. Very nice.
>>101308027
Good to know. Pony prompting is very strange. Do you know how often one should use BREAK?
>>101308075
Is that some new sampler?

Anonymous
07/07/24(Sun)06:20:12 No.101308101

Anonymous 07/07/24(Sun)06:20:12 No.101308101

>>101308072
the worst part is that it's asking for a shit ton of VRAM to run the LLM that is supposed to make it "good" at prompt understanding when it's not good at all

Anonymous
07/07/24(Sun)06:20:46 No.101308104

Anonymous 07/07/24(Sun)06:20:46 No.101308104

>>101308086
>Do you know how often one should use BREAK?
I never BREAK, especially ever since I started to use Pony, since it seems to impact performance, and I'm on a tight vram budget to be able and test it properly. I experimented with breaking back in sd 1.5 and I still don't understand how it works.

Anonymous
07/07/24(Sun)06:21:43 No.101308117

Anonymous 07/07/24(Sun)06:21:43 No.101308117

>>101308086
>Is that some new sampler?
not new, just didn't bother testing

https://research.nvidia.com/labs/toronto-ai/AlignYourSteps/

Anonymous
07/07/24(Sun)06:28:50 No.101308171

Anonymous 07/07/24(Sun)06:28:50 No.101308171

File: file.jpg (985 KB, 1920x2176)

985 KB JPG

Anonymous
07/07/24(Sun)06:29:19 No.101308173

Anonymous 07/07/24(Sun)06:29:19 No.101308173

>>101308075
Nope. Gateway to hell by Beksinski and Haeckel.

Anonymous
07/07/24(Sun)06:30:07 No.101308182

Anonymous 07/07/24(Sun)06:30:07 No.101308182

File: tmpj8b1uekm.png (2.87 MB, 2048x1182)

2.87 MB PNG

>>101308086
>Pony prompting is very strange.
Aside from having to add the score_prefix, not really. I never really found the need for source_prefix, since I rely on style loras. Otherwise I just prompt with more or less natural language, be it a mix of short tags or longer descriptions. I only really had to hop off the vanilla pony model for the sake of autismmix, since the regular version was sometimes a pain in the ass to get good results from.

Protip: don't experiment with changing the score_prefix, leave it at the recommended score_9, score_8_up, score_7_up, score_6_up, score_5_up, score_4_up. Putting any of the scores in negatives, or leaving any of them out does impact quality, detail and prompt adherence.

Anonymous
07/07/24(Sun)06:33:07 No.101308201

Anonymous 07/07/24(Sun)06:33:07 No.101308201

File: tmpgaqo6gm3.png (2.51 MB, 2048x1182)

2.51 MB PNG

Anonymous
07/07/24(Sun)06:33:42 No.101308205

Anonymous 07/07/24(Sun)06:33:42 No.101308205

>>101308171
nice

Anonymous
07/07/24(Sun)06:34:33 No.101308213

Anonymous 07/07/24(Sun)06:34:33 No.101308213

File: file.png (317 KB, 444x453)

317 KB PNG

Could anyone throw me a bone?

Whenever I do inpainting, I get bruising. For example, I tried on a drawing of mine, and the blonde hair turned into this pink/purple-ish mess. Equally, whatever I inpaint on, starts to turn purple and loses a huge amount of loss.

Why the fuck does this happen? I'm using Autism DPO and the sampler is DPM++ SDE Karras. Never happened on SD 1.5 models.

Anonymous
07/07/24(Sun)06:35:25 No.101308221

Anonymous 07/07/24(Sun)06:35:25 No.101308221

>>101308182
thx for the tip and cool gen

Anonymous
07/07/24(Sun)06:40:02 No.101308242

Anonymous 07/07/24(Sun)06:40:02 No.101308242

>>101308182
I agree with not changing "score_9, score_8_up, score_7_up, score_6_up, score_5_up, score_4_up" however according to my tests it does seem like a good idea to put that whole string at the end of the prompt, because if it's at the start then it seems to hurt the strength of the other tags.

Anonymous
07/07/24(Sun)06:43:57 No.101308262

Anonymous 07/07/24(Sun)06:43:57 No.101308262

File: 00875-2857992324.jpg (375 KB, 1075x1613)

375 KB JPG

Anonymous
07/07/24(Sun)06:45:38 No.101308269

Anonymous 07/07/24(Sun)06:45:38 No.101308269

>>101308205
thanks anon

Anonymous
07/07/24(Sun)06:48:01 No.101308281

Anonymous 07/07/24(Sun)06:48:01 No.101308281

File: tmp8gpcqc86.png (2.04 MB, 2048x1182)

2.04 MB PNG

>>101308213
Hard to say what exactly causes this behaviour, so you can only play around with setting and see what works for this particular case:

Try changing the sampler, increasing steps to ~45 can help with blending and preserving original content, gradually lower the denoise, CFG or masked padding. At some point it should budge.

DPM++ SDE Karras is a bit of a peculiar sampler, so that alone might be reason enough. Some samplers handle way differently with img2img and it's settings.

>>101308242
>it does seem like a good idea to put that whole string at the end of the prompt
That's actually a very good point. Wouldn't surprise me, since prompts at the beginning always get more attention. I guess the only difference would be what are your priorities. Do you want more emphasis on quality and have it loosely inspired by the prompt, or are you more concerned with prompt adherence, even if at the cost of aesthetics. Damn, might want to test it myself now.

Anonymous
07/07/24(Sun)06:50:32 No.101308297

Anonymous 07/07/24(Sun)06:50:32 No.101308297

>>101308281
Which sampler do you recommend? If I try with Euler A for example, it gets way worse.

Anonymous
07/07/24(Sun)06:54:04 No.101308312

Anonymous 07/07/24(Sun)06:54:04 No.101308312

>>101308086
>Do you know how often one should use BREAK?
I'm relatively new to all of this, but I believe it works like this. Typically a prompt's word strength is by the order the words appear in so if we have 8 words:
word, word, word, word, word, word, word, word, word, word
You could imagine their strengths as:
1.0, 0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1,
But if you put a BREAK in the middle, their word strengths would become like this:
1.0, 0.8, 0.6, 0.4, 0.2 BREAK 1.0, 0.8, 0.6, 0.4, 0.2

Someone correct me if I'm wrong.

Anonymous
07/07/24(Sun)06:54:27 No.101308314

Anonymous 07/07/24(Sun)06:54:27 No.101308314

>>101308297
I've fallen behind on samplers, and I've been using Euler A lately myself. I remember DPM++ 2s a / Karras being good at img2img refining, but it might be dated and slow. You could try DDPM I guess?

Anonymous
07/07/24(Sun)06:55:05 No.101308317

Anonymous 07/07/24(Sun)06:55:05 No.101308317

>>101308312
meant to say 10* words

Anonymous
07/07/24(Sun)07:01:13 No.101308367

Anonymous 07/07/24(Sun)07:01:13 No.101308367

>>101308312
75 tokens per chunk, using break allows you to define you own chunks
essentially yeah

Anonymous
07/07/24(Sun)07:09:53 No.101308426

Anonymous 07/07/24(Sun)07:09:53 No.101308426

when will they invent ai smell generators? i want to know what my waifu smells like

Anonymous
07/07/24(Sun)07:14:59 No.101308460

Anonymous 07/07/24(Sun)07:14:59 No.101308460

File: file.jpg (849 KB, 1920x2176)

849 KB JPG

Anonymous
07/07/24(Sun)07:23:07 No.101308518

Anonymous 07/07/24(Sun)07:23:07 No.101308518

>>101308426
>when will they invent ai smell generators? i want to know what my waifu smells like
Wake up, anon: https://civitai.com/product/odor

Anonymous
07/07/24(Sun)07:36:32 No.101308613

Anonymous 07/07/24(Sun)07:36:32 No.101308613

>>101308518
s-s-sniffa sniffa?

Anonymous
07/07/24(Sun)07:39:05 No.101308632

Anonymous 07/07/24(Sun)07:39:05 No.101308632

>>101308613
c-cringe

Anonymous
07/07/24(Sun)08:25:38 No.101309037

Anonymous 07/07/24(Sun)08:25:38 No.101309037

File: ComfyUI_01745_.jpg (366 KB, 2048x1024)

366 KB JPG

Anonymous
07/07/24(Sun)08:27:11 No.101309052

Anonymous 07/07/24(Sun)08:27:11 No.101309052

>>101309037
wtf model is that

Anonymous
07/07/24(Sun)08:27:21 No.101309054

Anonymous 07/07/24(Sun)08:27:21 No.101309054

File: ComfyUI_Kolors_00101_.png (1.55 MB, 1216x832)

1.55 MB PNG

I'm impressed by Kolors so far.

Prompt:

 Outside of a supermarket in a cyberpunk dystopian future, a wizard casts a spell to open a portal to the world of infinite eggs. It is stormy, windy and thundering. Magical portal. Photography.

Anonymous
07/07/24(Sun)08:28:55 No.101309070

Anonymous 07/07/24(Sun)08:28:55 No.101309070

>>101309054
that's really good. How about cartoon banana that is stripping itself like the old meme

Anonymous
07/07/24(Sun)08:32:30 No.101309096

Anonymous 07/07/24(Sun)08:32:30 No.101309096

>>101309052
most of the style comes from this lora https://civitai.com/models/84527/chinese-style-illustration

Anonymous
07/07/24(Sun)08:33:32 No.101309107

Anonymous 07/07/24(Sun)08:33:32 No.101309107

>>101302279
how do i run kolor locally? does it run in comfyui?

Anonymous
07/07/24(Sun)08:34:23 No.101309120

Anonymous 07/07/24(Sun)08:34:23 No.101309120

File: ComfyUI_Kolors_00116_.png (1.26 MB, 1216x832)

1.26 MB PNG

>>101309070
Kind of? It doesn't do peeled bananas, it looks like.

>>101309107
https://github.com/kijai/ComfyUI-KwaiKolorsWrapper

Anonymous
07/07/24(Sun)08:39:11 No.101309158

Anonymous 07/07/24(Sun)08:39:11 No.101309158

>>101309054
can you try complex poses like handstand or backflip?

Anonymous
07/07/24(Sun)08:41:44 No.101309186

Anonymous 07/07/24(Sun)08:41:44 No.101309186

File: ComfyUI_Kolors_00134_.png (1.34 MB, 1216x832)

1.34 MB PNG

>>101309158

 A man doing a handstand.

Anonymous
07/07/24(Sun)08:43:30 No.101309206

Anonymous 07/07/24(Sun)08:43:30 No.101309206

>>101309186
that's not bad at all, probably the best base model of them all, can you do the same for a woman in a bikini? :^) just to see the anatomy, yes the anatomy :^)

Anonymous
07/07/24(Sun)08:43:37 No.101309209

Anonymous 07/07/24(Sun)08:43:37 No.101309209

File: ComfyUI_Kolors_00140_.png (1.37 MB, 1024x1024)

1.37 MB PNG

>>101309186
It can't do backflips, though.

Anonymous
07/07/24(Sun)08:44:55 No.101309220

Anonymous 07/07/24(Sun)08:44:55 No.101309220

File: long dick general (2).jpg (3.6 MB, 2365x3264)

3.6 MB JPG

Can't say I'm a fan of using score as suffix over prefix, it's hit and miss. The differences are very marginal, but it does adhere to prompts slightly better. Then again, devil is in the details, so there might be some merit to it. There was a couple of cases where the suffix version did have better quality, more detail or interesting composition.

Overall there isn't a big discrepancy in their behaviour between steps or CFG. Even longer and shorter prompts tend to behave similarly.

Anonymous
07/07/24(Sun)08:54:54 No.101309310

Anonymous 07/07/24(Sun)08:54:54 No.101309310

>>101309220
one day i'll be able to smell her

Anonymous
07/07/24(Sun)08:56:52 No.101309337

Anonymous 07/07/24(Sun)08:56:52 No.101309337

File: ComfyUI_Kolors_00161_.png (1.08 MB, 1216x832)

1.08 MB PNG

>>101309206
I tried to gen this north of 30 times I think. This was the only time it gave me a woman actually doing something that looked like a handstand. It was usually just frontal shots of women with flat chests or outright manly pectorals in bikinis.

Anonymous
07/07/24(Sun)08:58:31 No.101309349

Anonymous 07/07/24(Sun)08:58:31 No.101309349

>>101309337
lmao it looks like a deflated sex doll

Anonymous
07/07/24(Sun)08:58:36 No.101309351

Anonymous 07/07/24(Sun)08:58:36 No.101309351

File: ComfyUI_Kolors_00164_.png (1.3 MB, 832x1216)

1.3 MB PNG

>>101309337
Sometimes stuff like this.

Anonymous
07/07/24(Sun)09:06:09 No.101309431

Anonymous 07/07/24(Sun)09:06:09 No.101309431

File: 0.jpg (358 KB, 1024x1024)

358 KB JPG

Anonymous
07/07/24(Sun)09:07:25 No.101309445

Anonymous 07/07/24(Sun)09:07:25 No.101309445

>>101309337
>>101309351

It's really good at wallpaper material though.

Anonymous
07/07/24(Sun)09:07:38 No.101309448

Anonymous 07/07/24(Sun)09:07:38 No.101309448

File: long dick general.jpg (3.19 MB, 2306x3264)

3.19 MB JPG

Anonymous
07/07/24(Sun)09:08:30 No.101309456

Anonymous 07/07/24(Sun)09:08:30 No.101309456

File: ComfyUI_Kolors_00188_.png (1.73 MB, 1216x832)

1.73 MB PNG

>>101309445
Forgot my pic. Whoops!

Anonymous
07/07/24(Sun)09:12:31 No.101309489

Anonymous 07/07/24(Sun)09:12:31 No.101309489

File: 0.jpg (292 KB, 1024x1024)

292 KB JPG

Anonymous
07/07/24(Sun)09:15:10 No.101309516

Anonymous 07/07/24(Sun)09:15:10 No.101309516

File: ComfyUI_Kolors_00205_.png (1.55 MB, 1216x832)

1.55 MB PNG

>>101309456

Anonymous
07/07/24(Sun)09:17:36 No.101309535

Anonymous 07/07/24(Sun)09:17:36 No.101309535

>>101309220
>>101309448
mind if i ask for a catbox/prompt? how do you get dynamic pictures like that? sd loves to be symmetrical

Anonymous
07/07/24(Sun)09:18:20 No.101309538

Anonymous 07/07/24(Sun)09:18:20 No.101309538

File: file.png (1.35 MB, 1323x634)

1.35 MB PNG

>>101308314
No luck with anything. I'm tempted to just re-install.

Maybe it's a problem with forge? If anyone's ever experienced this, pls reply I'm going nuts here.

Anonymous
07/07/24(Sun)09:19:03 No.101309545

Anonymous 07/07/24(Sun)09:19:03 No.101309545

File: kolors_00105_.png (1.38 MB, 1024x1024)

1.38 MB PNG

Anonymous
07/07/24(Sun)09:23:47 No.101309601

Anonymous 07/07/24(Sun)09:23:47 No.101309601

File: ComfyUI_Kolors_00222_.png (1.85 MB, 1216x832)

1.85 MB PNG

Kolors generates faster than Pixart Sigma and SD3, too, which is a real bonus.

Anonymous
07/07/24(Sun)09:26:36 No.101309623

Anonymous 07/07/24(Sun)09:26:36 No.101309623

File: ComfyUI_Kolors_00229_.png (1.98 MB, 1216x832)

1.98 MB PNG

Anonymous
07/07/24(Sun)09:26:40 No.101309625

Anonymous 07/07/24(Sun)09:26:40 No.101309625

>>101309545
the hands look really really good, I think we found the next model to move forward, thank god

Anonymous
07/07/24(Sun)09:32:44 No.101309680

Anonymous 07/07/24(Sun)09:32:44 No.101309680

File: kolors_00137_.png (1.49 MB, 1024x1024)

1.49 MB PNG

Anyone else notice kolors really having trouble producing images of people below the shoulders?

Anonymous
07/07/24(Sun)09:33:33 No.101309692

Anonymous 07/07/24(Sun)09:33:33 No.101309692

>>101309680
This will always be the problem with models obsessed with aesthetics metrics.

Anonymous
07/07/24(Sun)09:35:00 No.101309707

Anonymous 07/07/24(Sun)09:35:00 No.101309707

File: long dick general (1).jpg (3.05 MB, 2306x3264)

3.05 MB JPG

>>101309538
>Maybe it's a problem with forge?
Doubt it. Hell, you could try using the soft inpainting mode it has pre-installed. Can you give me an example of your settings, mask, prompt? Anything and everything.
>>101309535
>how do you get dynamic pictures like that?
Honestly, no clue. Tough luck with catbox, but I can throw in a hint or two. Autismmix (Pony), 1344x768, 25/45 Euler A steps, 7 CFG, so nothing out of the ordinary. As for prompts, it's pretty much "random bullshit go". I think the combo of horizontal resolutions coupled with dutch angle, from above/below tends to give gens a lot of depth. Sometimes I barely have a prompt in there, other than what I just mentioned, and it does well either way. I think horizontals in general just tend to be more dynamic due bias in training material. Same might be the case for verticals? 1:1 just tend to be simpler in composition. Also good style loras go a long way.

Anonymous
07/07/24(Sun)09:35:46 No.101309715

Anonymous 07/07/24(Sun)09:35:46 No.101309715

>>101309337
Artificial filter layers that when activated will fuck up the image perhaps? Chinese cucks. Nothing that can't be removed of course.

Anonymous
07/07/24(Sun)09:35:46 No.101309716

Anonymous 07/07/24(Sun)09:35:46 No.101309716

File: kolors_00143_.png (1.34 MB, 1216x768)

1.34 MB PNG

>>101309692
It's true, the images look REALLY good, but it's hard to wrangle this thing to produce anything other than a portrait.

Anonymous
07/07/24(Sun)09:36:25 No.101309722

Anonymous 07/07/24(Sun)09:36:25 No.101309722

>>101309680
If you can, try changing the resolution to something asymmetrical, see if that changes anything. Resolution ratios REALLY make a huge impact.

Anonymous
07/07/24(Sun)09:37:12 No.101309728

Anonymous 07/07/24(Sun)09:37:12 No.101309728

>>101309680
Yeah, looks like you have to use some faggot verbose prompt to get what you want
https://huggingface.co/Kwai-Kolors/Kolors/discussions/7#668a52edcf56ff052f2886b9

Anonymous
07/07/24(Sun)09:37:52 No.101309738

Anonymous 07/07/24(Sun)09:37:52 No.101309738

File: kolors_00145_.png (1.73 MB, 1216x768)

1.73 MB PNG

Anonymous
07/07/24(Sun)09:38:34 No.101309744

Anonymous 07/07/24(Sun)09:38:34 No.101309744

>>101309728
>it's your bad prompt.
I'm starting to dislike this model even more with every post itt.

Anonymous
07/07/24(Sun)09:39:29 No.101309753

Anonymous 07/07/24(Sun)09:39:29 No.101309753

>>101309744
he sounded like Lykon not gonna lie: "hurdur skill issue"

Anonymous
07/07/24(Sun)09:39:36 No.101309754

Anonymous 07/07/24(Sun)09:39:36 No.101309754

File: kolors_00147_.png (1.76 MB, 1216x768)

1.76 MB PNG

Anonymous
07/07/24(Sun)09:39:38 No.101309755

Anonymous 07/07/24(Sun)09:39:38 No.101309755

>>101309744
It's a retarded take because it ultimately ignored the pope keyword which means it's bad at prompt adherence.

Anonymous
07/07/24(Sun)09:41:51 No.101309780

Anonymous 07/07/24(Sun)09:41:51 No.101309780

File: kolors_00153_.png (1.57 MB, 1216x768)

1.57 MB PNG

Anonymous
07/07/24(Sun)09:42:07 No.101309787

Anonymous 07/07/24(Sun)09:42:07 No.101309787

File: ComfyUI_Kolors_00234_.png (1.74 MB, 1216x832)

1.74 MB PNG

>>101309680
It's not so bad if you're generating dudes, but with women it's painful trying to get anything other than a face shot.

Anonymous
07/07/24(Sun)09:45:45 No.101309818

Anonymous 07/07/24(Sun)09:45:45 No.101309818

File: kolors_00160_.png (1.48 MB, 1216x768)

1.48 MB PNG

Anonymous
07/07/24(Sun)09:45:51 No.101309820

Anonymous 07/07/24(Sun)09:45:51 No.101309820

>>101309755
this, I hate those mf who gaslight people into writing a fucking bible to describe mundane stuff everyone would understand anyway

Anonymous
07/07/24(Sun)09:45:55 No.101309822

Anonymous 07/07/24(Sun)09:45:55 No.101309822

>>101309753
>>101309755
A base model indeed shouldn't require much thought to have decent output by default. Even prompts like "masterpiece, good quality, highly detailed, score_9" or whatever should be their default per se, nevermind fucking prompt adherence, which doesn't even make sense, since more prompts in the long run means more of them will get lost and ignored along the way.

Anonymous
07/07/24(Sun)09:48:14 No.101309848

Anonymous 07/07/24(Sun)09:48:14 No.101309848

>>101309787
guess that women flooded the internet with their useless selfies, and they trained the model with only that kek

Anonymous
07/07/24(Sun)09:49:08 No.101309852

Anonymous 07/07/24(Sun)09:49:08 No.101309852

can someone generate a big pair of breasts? i'd like to look at some

Anonymous
07/07/24(Sun)09:50:15 No.101309857

Anonymous 07/07/24(Sun)09:50:15 No.101309857

>>101309852
Surprisignly, Kolor can do that, the chinks are way less prude than the western fags, that's how far the west has fallen, we're loosing to a communist country that banned porn

Anonymous
07/07/24(Sun)09:56:16 No.101309915

Anonymous 07/07/24(Sun)09:56:16 No.101309915

>>101309848
No, if you only train using high aesthetics metrics you start filtering out even images of people standing. Portraits are going to score higher. It's why I think it's one of the reasons that hold base models back especially SAI's because they incompetently set their filters.

Anonymous
07/07/24(Sun)09:56:40 No.101309922

Anonymous 07/07/24(Sun)09:56:40 No.101309922

>>101309707
Do you mind if it's a NSFW image? I'm willing to post my WiP image with metadata on catbox.

Anonymous
07/07/24(Sun)09:57:34 No.101309934

Anonymous 07/07/24(Sun)09:57:34 No.101309934

>>101309922
shoot

Anonymous
07/07/24(Sun)10:00:19 No.101309960

Anonymous 07/07/24(Sun)10:00:19 No.101309960

>>101309738
>>101309754
>>101309818
damn great

Anonymous
07/07/24(Sun)10:02:59 No.101309991

Anonymous 07/07/24(Sun)10:02:59 No.101309991

>>101309857
>chinks are way less prude than the western fags
chinks are pretty horny, their fetishes are always on the more extreme side

Anonymous
07/07/24(Sun)10:17:48 No.101310131

Anonymous 07/07/24(Sun)10:17:48 No.101310131

File: 00007-2857992324.jpg (552 KB, 1344x2016)

552 KB JPG

Anonymous
07/07/24(Sun)10:19:04 No.101310151

Anonymous 07/07/24(Sun)10:19:04 No.101310151

>>101309934
https://files.catbox.moe/1blbj3.png
Sorry for the wait.
I'd post the mask but it's literally visible given it became a discolored blob on her abdomen.

Anonymous
07/07/24(Sun)10:20:15 No.101310167

Anonymous 07/07/24(Sun)10:20:15 No.101310167

>>101309934
>>101310151
Oh yeah, and my settings on forge are default except Eta noise seed delta at 31337 and clip skip 2.

Anonymous
07/07/24(Sun)10:33:54 No.101310291

Anonymous 07/07/24(Sun)10:33:54 No.101310291

>>101310151
Good stuff. Simplify your inpainting workflow and slowly build up from a minimum. Chances are you overcomplicated your prompt and so the inpainting gets lost with your input.

Try removing the BREAK, reduce your prompt to essentials like the score_prefix and something like dark shiny/glossy skin, naked, maybe laying back. Get rid of the negatives, score especially shouldn't be there. I'm not sure what the original input looks like, but you can try lowering denoise to 45 or 35.

Anonymous
07/07/24(Sun)10:36:06 No.101310312

Anonymous 07/07/24(Sun)10:36:06 No.101310312

File: 00974-2857992327.jpg (644 KB, 1344x1781)

644 KB JPG

Anonymous
07/07/24(Sun)10:40:46 No.101310360

Anonymous 07/07/24(Sun)10:40:46 No.101310360

File: file.png (363 KB, 636x608)

363 KB PNG

>>101310291
Thanks. Original input looks like pic related.
I'll try what you said, hope to god it works. Even if it doesn't, thank you for your help so far. I appreciate it, anon.

Anonymous
07/07/24(Sun)10:51:16 No.101310470

Anonymous 07/07/24(Sun)10:51:16 No.101310470

File: 0.jpg (412 KB, 1024x1024)

412 KB JPG

Anonymous
07/07/24(Sun)10:56:30 No.101310527

Anonymous 07/07/24(Sun)10:56:30 No.101310527

File: 0.jpg (406 KB, 1024x1024)

406 KB JPG

Anonymous
07/07/24(Sun)10:58:54 No.101310557

Anonymous 07/07/24(Sun)10:58:54 No.101310557

This is what kolors thinks nipples look like.

https://files.catbox.moe/k0vps6.png

Anonymous
07/07/24(Sun)10:59:22 No.101310567

Anonymous 07/07/24(Sun)10:59:22 No.101310567

File: 01004-2857992326.jpg (590 KB, 1344x1781)

590 KB JPG

Anonymous
07/07/24(Sun)11:00:39 No.101310581

Anonymous 07/07/24(Sun)11:00:39 No.101310581

File: PA_0014.jpg (411 KB, 2048x2048)

411 KB JPG

>>101309489
Very nice>>101310527

Anonymous
07/07/24(Sun)11:10:33 No.101310663

Anonymous 07/07/24(Sun)11:10:33 No.101310663

File: shutterstock_129368246-771x572.jpg (66 KB, 771x572)

66 KB JPG

>>101310557
looks fine to me

Anonymous
07/07/24(Sun)11:13:22 No.101310689

Anonymous 07/07/24(Sun)11:13:22 No.101310689

File: 00014-1847830168.jpg (238 KB, 1400x1008)

238 KB JPG

>>101309818
Nice

Anonymous
07/07/24(Sun)11:24:16 No.101310783

Anonymous 07/07/24(Sun)11:24:16 No.101310783

>>101310557
WTF????

Anonymous
07/07/24(Sun)11:30:23 No.101310835

Anonymous 07/07/24(Sun)11:30:23 No.101310835

>>101310557
can confirm. looks like a nice base for finetunes though

https://files.catbox.moe/jgsrs2.jpg

Anonymous
07/07/24(Sun)11:31:57 No.101310849

Anonymous 07/07/24(Sun)11:31:57 No.101310849

>>101310835
yeah, only the nipples look weird, the rest of the anatomy is really great, sounds easy enough to fix

Anonymous
07/07/24(Sun)11:33:19 No.101310863

Anonymous 07/07/24(Sun)11:33:19 No.101310863

File: ComfyUI_Kolors_0025.jpg (193 KB, 1024x1024)

193 KB JPG

Anonymous
07/07/24(Sun)11:44:41 No.101310958

Anonymous 07/07/24(Sun)11:44:41 No.101310958

For everyone wanting to test Kolors out, you can use this ComfyUi wrapper
https://github.com/kijai/ComfyUI-KwaiKolorsWrapper

Anonymous
07/07/24(Sun)11:45:08 No.101310963

Anonymous 07/07/24(Sun)11:45:08 No.101310963

File: ComfyUI_Kolors_0035.jpg (210 KB, 1024x1024)

210 KB JPG

Anonymous
07/07/24(Sun)11:47:39 No.101310993

Anonymous 07/07/24(Sun)11:47:39 No.101310993

It should be criminal to ship a new model without fine tuning code.

Anonymous
07/07/24(Sun)11:47:47 No.101310995

Anonymous 07/07/24(Sun)11:47:47 No.101310995

File: ygwfy21ec3bd1.png (2.23 MB, 768x1344)

2.23 MB PNG

You can test Kolors on this huggingface demo:
https://huggingface.co/spaces/gokaygokay/Kolors

Anonymous
07/07/24(Sun)11:48:50 No.101311010

Anonymous 07/07/24(Sun)11:48:50 No.101311010

File: BRUH.jpg (56 KB, 1096x799)

56 KB JPG

>>101310993
https://github.com/Kwai-Kolors/Kolors
Holy fuck it's not even on their plan

Anonymous
07/07/24(Sun)11:54:17 No.101311073

Anonymous 07/07/24(Sun)11:54:17 No.101311073

>>101310291
It's the fucking noise multiplier for img2img and vae. It was at 0.7.

For some reason it's still "tinted" towards purple/pink on forge but normal on A1111.

Anonymous
07/07/24(Sun)12:02:21 No.101311145

Anonymous 07/07/24(Sun)12:02:21 No.101311145

File: grege.png (1.85 MB, 1024x1024)

1.85 MB PNG

>>101310995
Noice, I have trouble removing the blur though, even by putting "bokeh, blur" on the negative prompt

Anonymous
07/07/24(Sun)12:04:00 No.101311163

Anonymous 07/07/24(Sun)12:04:00 No.101311163

>>101311010
Chads

Anonymous
07/07/24(Sun)12:04:40 No.101311167

Anonymous 07/07/24(Sun)12:04:40 No.101311167

File: image (2).png (1.71 MB, 1024x1024)

1.71 MB PNG

>>101310995
>Hatsune Miku shaking hands with Gawr Gura, studio ghibli drawing style
It doesn't know who Gura is :(

Anonymous
07/07/24(Sun)12:13:02 No.101311233

Anonymous 07/07/24(Sun)12:13:02 No.101311233

>>101307281
Could I get a prompt please?

Anonymous
07/07/24(Sun)12:14:02 No.101311243

Anonymous 07/07/24(Sun)12:14:02 No.101311243

>>101311233
it was "drawing of hands"

Anonymous
07/07/24(Sun)12:24:41 No.101311333

Anonymous 07/07/24(Sun)12:24:41 No.101311333

>>101310557
>>101310835
It's hit or miss, but it can generate decent nipples:
https://files.catbox.moe/vmjuus.webp
https://files.catbox.moe/d5iigy.webp

Anonymous
07/07/24(Sun)12:35:52 No.101311455

Anonymous 07/07/24(Sun)12:35:52 No.101311455

>>101311333
For a base model that's insane how good it looks, I knew the chinks would save us from cucked SAI

Anonymous
07/07/24(Sun)12:36:08 No.101311461

Anonymous 07/07/24(Sun)12:36:08 No.101311461

>>101309337
Try writing it in chinese

Anonymous
07/07/24(Sun)12:46:21 No.101311557

Anonymous 07/07/24(Sun)12:46:21 No.101311557

>>101307925
>>101308027
Aren't underscores useful to make something one term so you won't confuse the definitions

Like if I use feather_duster, I'm going to get feather dusters. But if I use feather duster with a space in between I might see rogue feathers in my images, or someone will be wearing a long jacket

Anonymous
07/07/24(Sun)12:46:44 No.101311561

Anonymous 07/07/24(Sun)12:46:44 No.101311561

File: image (3).png (1.62 MB, 1024x1024)

1.62 MB PNG

>>101310995
I'm really impressed by this model, finally we can be free of cucked SAI

Anonymous
07/07/24(Sun)12:48:19 No.101311578

Anonymous 07/07/24(Sun)12:48:19 No.101311578

official pixart bigma and lumina 2 waiting room

Anonymous
07/07/24(Sun)12:48:40 No.101311583

Anonymous 07/07/24(Sun)12:48:40 No.101311583

>>101311557
I can't be 100% sure, but that would mean "doggy style" would create dogs wouldn't it? I think it identifies it as a booru tag.

Anonymous
07/07/24(Sun)12:49:13 No.101311592

Anonymous 07/07/24(Sun)12:49:13 No.101311592

>>101311073
Weird, I'm also using Forge and the multiplier is at 1.0 for me by default.

Anonymous
07/07/24(Sun)12:50:34 No.101311612

Anonymous 07/07/24(Sun)12:50:34 No.101311612

>>101311561
without finetunes? not really.

Anonymous
07/07/24(Sun)12:51:40 No.101311623

Anonymous 07/07/24(Sun)12:51:40 No.101311623

File: Kolors.jpg (1.48 MB, 2532x2572)

1.48 MB JPG

>>101311612
Kolors is probably the best base model we ever had, that will probably be the model going forward for finetunes and shit

Anonymous
07/07/24(Sun)12:53:01 No.101311637

Anonymous 07/07/24(Sun)12:53:01 No.101311637

>>101311623
50 yuan has been deposited to your account

Anonymous
07/07/24(Sun)12:53:27 No.101311647

Anonymous 07/07/24(Sun)12:53:27 No.101311647

>>101311623
Curb you enthusiasm anon, lest you want to end up disappointed.

Anonymous
07/07/24(Sun)12:54:15 No.101311659

Anonymous 07/07/24(Sun)12:54:15 No.101311659

>>101301938
Ha, I did a few gens of something like this (though my version was more "monstrous", with claws and a tail)

Anonymous
07/07/24(Sun)12:54:17 No.101311660

Anonymous 07/07/24(Sun)12:54:17 No.101311660

>>101311623
It looks like one of the better bases, it was also published without model creation / finetuning code.

Anonymous
07/07/24(Sun)12:54:24 No.101311664

Anonymous 07/07/24(Sun)12:54:24 No.101311664

>>101311623
the model is good but i have this bad feeling

Anonymous
07/07/24(Sun)12:54:28 No.101311665

Anonymous 07/07/24(Sun)12:54:28 No.101311665

>>101311637
>>101311647
Seriously, look at that, the potential is here, it's completely uncensored and the anatomy is good, remember this is just a base model, imagine with a nice finetune on top of that >>101311333

Anonymous
07/07/24(Sun)12:57:45 No.101311719

Anonymous 07/07/24(Sun)12:57:45 No.101311719

>>101311665
>Seriously, look at that
That, anon, is an extremally small sample of a model barely anyone of us can run or finetune locally, with questionable licensing and arguable quality. I'm having hunyan flashbacks already.

Anonymous
07/07/24(Sun)12:58:24 No.101311734

Anonymous 07/07/24(Sun)12:58:24 No.101311734

>>101311557
>>101311583
I don't think that particular example is likely because there would be a lot of examples in its training data for "doggy style" whereas doggy and style are going to less represented in a porn data set

I think it happens if one of your terms is more common than both combined. Like feather would surely be more common than feather duster.

Anonymous
07/07/24(Sun)12:58:54 No.101311740

Anonymous 07/07/24(Sun)12:58:54 No.101311740

>>101311719
Hunyuan isn't that good compared to Kolors though, people will work hard to optimise Kolors, like they did for the SAI models, that's how it always worked in the imagegen ecosystem

Anonymous
07/07/24(Sun)13:03:08 No.101311790

Anonymous 07/07/24(Sun)13:03:08 No.101311790

Has anyone been able to cover up nudity with objects? Like the subject holding, like, flowers or a doll something to their chest?

Anonymous
07/07/24(Sun)13:04:35 No.101311810

Anonymous 07/07/24(Sun)13:04:35 No.101311810

>>101311740
I'll give you that from a glance I see more quality in kolors, as compared to hunyuan, but I remain sceptical nontheless. Licensing remains questionable, and hardware requirements an obstacle.

Anonymous
07/07/24(Sun)13:05:10 No.101311818

Anonymous 07/07/24(Sun)13:05:10 No.101311818

File: 0.jpg (384 KB, 1024x1024)

384 KB JPG

>>101310581
thanks, wizard.

Anonymous
07/07/24(Sun)13:06:03 No.101311834

Anonymous 07/07/24(Sun)13:06:03 No.101311834

>>101311790
inpainting, regional prompting. or you can train a concept lora

Anonymous
07/07/24(Sun)13:07:18 No.101311860

Anonymous 07/07/24(Sun)13:07:18 No.101311860

>>101311810
What even are the hardware requirements to train Kolors?

Anonymous
07/07/24(Sun)13:07:36 No.101311865

Anonymous 07/07/24(Sun)13:07:36 No.101311865

>>101311810
it asks for 20gb of vram because of the model + llm, the llm can be quantized and put on the cpu, and the model can be put at 8bit without much accuracy drop, the optimisation will be easy to do, for the licence yeah it needs clarification, it say it's MiT but it's actually not

Anonymous
07/07/24(Sun)13:07:38 No.101311866

Anonymous 07/07/24(Sun)13:07:38 No.101311866

>>101311834
The thing about training, let alone finding a lora is I don't even know what that sort of thing is even called

Anonymous
07/07/24(Sun)13:08:18 No.101311873

Anonymous 07/07/24(Sun)13:08:18 No.101311873

>>101311790
haven't tried it but "convenient censoring" is well tagged on danbooru

Anonymous
07/07/24(Sun)13:08:37 No.101311876

Anonymous 07/07/24(Sun)13:08:37 No.101311876

>>101311860
we don't know, we haven't the training code
https://github.com/Kwai-Kolors/Kolors/tree/master

Anonymous
07/07/24(Sun)13:09:31 No.101311887

Anonymous 07/07/24(Sun)13:09:31 No.101311887

>>101311866
are you new to this?

Anonymous
07/07/24(Sun)13:12:11 No.101311923

Anonymous 07/07/24(Sun)13:12:11 No.101311923

>>101311866
search for "censor" on civitai, they have loras for tails, hair, steam, soap, a general convenient censorship, etc

Anonymous
07/07/24(Sun)13:12:24 No.101311931

Anonymous 07/07/24(Sun)13:12:24 No.101311931

>>101311623
>that will probably be the model going forward for finetunes and shit
Stop repeating this.

Anonymous
07/07/24(Sun)13:13:11 No.101311941

Anonymous 07/07/24(Sun)13:13:11 No.101311941

>>101311931
https://www.youtube.com/watch?v=yWULCfJ2PGA

Anonymous
07/07/24(Sun)13:18:44 No.101311995

Anonymous 07/07/24(Sun)13:18:44 No.101311995

can kolors generate girls with armpit hair

Anonymous
07/07/24(Sun)13:21:36 No.101312025

Anonymous 07/07/24(Sun)13:21:36 No.101312025

>>101311876
>no training code
So it's as useful to anon as SD3. Nice.

Anonymous
07/07/24(Sun)13:23:27 No.101312037

Anonymous 07/07/24(Sun)13:23:27 No.101312037

File: PA_0016.jpg (894 KB, 2560x1536)

894 KB JPG

Anonymous
07/07/24(Sun)13:23:45 No.101312040

Anonymous 07/07/24(Sun)13:23:45 No.101312040

>>101311995
try it
https://gokaygokay-kolors.hf.space/

Anonymous
07/07/24(Sun)13:26:53 No.101312070

Anonymous 07/07/24(Sun)13:26:53 No.101312070

File: PA_0017.jpg (633 KB, 2560x1536)

633 KB JPG

Anonymous
07/07/24(Sun)13:26:58 No.101312072

Anonymous 07/07/24(Sun)13:26:58 No.101312072

File: Kolors-Miku.png (2.09 MB, 1024x1024)

2.09 MB PNG

>>101312025
I'm sure some autist will make a training code, the worst part was getting a good base model

Anonymous
07/07/24(Sun)13:28:28 No.101312093

Anonymous 07/07/24(Sun)13:28:28 No.101312093

>>101312072
Is it possible to create training code without the weights?

Anonymous
07/07/24(Sun)13:29:53 No.101312107

Anonymous 07/07/24(Sun)13:29:53 No.101312107

>>101312093
Why do you ask this question, we have the weights already
https://huggingface.co/Kwai-Kolors/Kolors

Anonymous
07/07/24(Sun)13:30:19 No.101312112

Anonymous 07/07/24(Sun)13:30:19 No.101312112

>>101312072
>I'm sure some autist will make a training code
just like AMD expects their userbase to make up for their shortcomings?

Anonymous
07/07/24(Sun)13:31:02 No.101312118

Anonymous 07/07/24(Sun)13:31:02 No.101312118

>>101312107
Fair enough I am mildly retarded

Anonymous
07/07/24(Sun)13:31:53 No.101312126

Anonymous 07/07/24(Sun)13:31:53 No.101312126

>>101311333
Holy shit that looks decent

Anonymous
07/07/24(Sun)13:31:55 No.101312128

Anonymous 07/07/24(Sun)13:31:55 No.101312128

>>101312112
you could have chosen a better example, like SAI asking us to fix their shitty base model since the begining (2022), it's been 2 years we were polishing their turds, nothing new in the sun

Anonymous
07/07/24(Sun)13:33:03 No.101312138

Anonymous 07/07/24(Sun)13:33:03 No.101312138

Asking again; is Kolors a model trained from scratch or is it simply a hypertuned XL

Anonymous
07/07/24(Sun)13:33:34 No.101312144

Anonymous 07/07/24(Sun)13:33:34 No.101312144

>>101312128
fair nuff

Anonymous
07/07/24(Sun)13:34:03 No.101312152

Anonymous 07/07/24(Sun)13:34:03 No.101312152

>>101312138
it's a model trained from scratch, they said it on their paper
https://github.com/Kwai-Kolors/Kolors/blob/master/imgs/Kolors_paper.pdf

Anonymous
07/07/24(Sun)13:35:04 No.101312166

Anonymous 07/07/24(Sun)13:35:04 No.101312166

>>101312118
it's all right kek

Anonymous
07/07/24(Sun)13:37:04 No.101312191

Anonymous 07/07/24(Sun)13:37:04 No.101312191

Kolors added to...
>>101312179
>>101312179
>>101312179

Anonymous
07/07/24(Sun)13:37:10 No.101312193

Anonymous 07/07/24(Sun)13:37:10 No.101312193

File: PA_0023.jpg (858 KB, 2560x1536)

858 KB JPG

Anonymous
07/07/24(Sun)13:37:17 No.101312194

Anonymous 07/07/24(Sun)13:37:17 No.101312194

File: file.png (1.42 MB, 1024x1024)

1.42 MB PNG

>>101312040
no hair :(

Anonymous
07/07/24(Sun)13:40:44 No.101312235

Anonymous 07/07/24(Sun)13:40:44 No.101312235

>>101311876
Let's see if that changes. Team Sigma / Lumina is already ok tho.

Anonymous
07/07/24(Sun)13:44:56 No.101312284

Anonymous 07/07/24(Sun)13:44:56 No.101312284

>>101312194
yeah I just tried with 4 pictures and it doesn't show armpit hair either, it will be fixed with some finetunes though, not the hardest thing in the world to add

Anonymous
07/07/24(Sun)13:45:17 No.101312288

Anonymous 07/07/24(Sun)13:45:17 No.101312288

>>101312194
nice pits but yeah.. looks like mostly trained with chinese-typical pretty but non-fetish art?

Anonymous
07/07/24(Sun)13:47:02 No.101312311

Anonymous 07/07/24(Sun)13:47:02 No.101312311

>>101312288
>non-fetish art?
they can't, porn is illegal in China

Anonymous
07/07/24(Sun)13:49:37 No.101312332

Anonymous 07/07/24(Sun)13:49:37 No.101312332

File: 0.jpg (174 KB, 1024x1024)

174 KB JPG

Anonymous
07/07/24(Sun)14:16:17 No.101312642

Anonymous 07/07/24(Sun)14:16:17 No.101312642

File: 0.jpg (235 KB, 1024x1024)

235 KB JPG

Anonymous
07/07/24(Sun)14:46:47 No.101312958

Anonymous 07/07/24(Sun)14:46:47 No.101312958

>>101312332
>>101312642
i respect the grind

[Return] [Catalog] [Top]

Post a Reply

Return Catalog Top Refresh

[Advertise on 4chan]

Delete Post: [File Only] Style:

[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.