/g/ - Technology




File: tmp.jpg (1.36 MB, 3265x3265)
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>101639278

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae
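If you script with diffusers instead of a UI, swapping in an external VAE is one line; the model names below are just examples, use whatever your checkpoint needs:
import torch
from diffusers import StableDiffusionPipeline, AutoencoderKL

# load a fixed/fine-tuned VAE and hand it to the pipeline in place of the baked-in one
vae = AutoencoderKL.from_pretrained("stabilityai/sd-vae-ft-mse", torch_dtype=torch.float16)
pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", vae=vae, torch_dtype=torch.float16
).to("cuda")
In A1111 the same thing is the "SD VAE" setting; in ComfyUI it's a Load VAE node feeding your VAE Decode.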

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://www.modelscope.cn/home
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Kolors
https://gokaygokay-kolors.hf.space
Nodes: https://github.com/kijai/ComfyUI-KwaiKolorsWrapper

>AuraFlow
https://fal.ai/models/fal-ai/aura-flow
https://huggingface.co/fal/AuraFlows

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
blessed thread of frenship
>>
SAAAAAAAAAAAAAAAAAAS!!!!!!!!!!!!!!!!!!!!!!!
>>
THERE'S A FUCKING SAAS IN THE COLLAGE!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
>>
File: Sigma_12120_.jpg (1.88 MB, 2688x1536)
>>101655632
>>101655654
Baker has been compromised!
>>
https://files.catbox.moe/2cbnsx.png
>>
>>101655488
got a message for the top right corner image, *hyuckptui*
>>
>>101655785
why so smug
>>
>https://github.com/jhc13/taggui/releases/tag/v1.30.0
>Phi-3-vision-128k-instruct
I wonder if it's crap
>>
>>101656014
probably censored to hell and back. remember seeing people mention how some models would only use gender neutral pronouns and refuse to mention skin color, not sure if phi is like that but i wouldn't be surprised.
>>
>>101655818
When you're the savior of the Six Faced World, you tend to be that way.
>>
>>101656062
I love when the caption is "the person is holding their breasts"
>>
>>101656062
>probably censored to hell and back
I wouldn't be surprised. For now I just need something that captions objects and colors really well. Previous MS models have been decent and lightweight

>>101656085
I like "person is getting her tongue and her tongue pushed up"
>>
>https://huggingface.co/SmilingWolf/wd-vit-large-tagger-v3
have to test this too
>>
>>101656269
>Trained on Danbooru images
You have my attention.
>>
File: Untitled.jpg (2.92 MB, 3840x4356)
which one should i use
>>
>https://huggingface.co/docs/peft/main/en/package_reference/boft
and what the fuck is this

>Diag-OFT
waht

>>101656515
seems pretty good, tried on photos
>>
official bigma
>>
File: 00108-1537472161.jpg (271 KB, 1552x1200)
>>
File: Sigma_12126_.jpg (2.95 MB, 2688x1536)
>>101655488
No Sigma the past two collages. SaaS in latest collage.

>Is this an anime betrayal?

>>101641968
>>101642309
>>101642323
>>101646306
>>101646584
>>101646714
>>101648131
>>
File: 1722456844462.jpg (1 MB, 804x1430)
>>101655488
Can anyone replicate this style, it is sharp and soft at the same time
>>
File: Sigma_12129_.jpg (3.16 MB, 2688x1536)
>>101656616
>boft
Interesting, will need to read a paper on it. The description left me with more questions than answers
>>
File: Sigma_12137_.jpg (2.4 MB, 1536x2688)
Maybe?
>A realistic full color detailed drawing of a beautiful woman with tribal tattoos and clothed in a fur bikini looking at the camera
>>
>>101657543
>https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
Dev branch has it implemented, but I cannot test it yet. I'll have time tomorrow
>>
File: Sigma_12133_.jpg (3.07 MB, 2688x1536)
>>101657622 is for >>101657498
Low sleep, many mistakes

>>101657628
ty, will reference that against what I read
>>
>>101657622
Its sharp but not soft, and definitely not beautiful
>>
File: Sigma_12130_.jpg (3.54 MB, 2688x1536)
>>101657756
Oh well
>>
File: Sigma_12143_.jpg (1.35 MB, 2048x2048)
>>
>IPNDM sampler
worth using?
>>
>>101657423
two imgs in the last collage were sigma, and before that, three! :P
i dont think ive even seen a completely non sigma collage desu
>>
File: Sigma_12147_.jpg (2.9 MB, 2048x2048)
>>101657970
I thought not initially, but it might be sharper than deis when testing again

>>101657984
Which ones? I see SD and Cascade. There were a bunch of OP's w/o Sigma, but I never cared until SaaS creeped in. Tired and tilted so maybe don't mind me
>>
File: Sigma_12148_.jpg (3.03 MB, 2048x2048)
>>
File: Sigma_12152_.jpg (2.3 MB, 2816x1408)
>>
File: Sigma_12154_.jpg (2.16 MB, 2816x1408)
>>
File: 3717041298.jpg (183 KB, 1024x768)
>>
>>101656576
you must test every single option
>>
>>
Anyone have tips for prompting for a character holding weapons? I find it's really rare to get a gen where it doesn't break everything.
>>
>>101659255
Any decent model will respond well to prompts like "wielding" or "holding"
>>
>>101655745
nice
>>
>>
File: Sigma_12160_.jpg (2.28 MB, 2816x1408)
>>101659255
I often see gens with the weapon to the side

>>101660179
ty
>>
File: Sigma_12162_.jpg (2.61 MB, 2816x1408)
>>
File: Sigma_12163_.jpg (2.34 MB, 2816x1408)
>>
File: Sigma_12164_.jpg (3.12 MB, 2816x1408)
>>
File: Sigma_12166_.jpg (2.54 MB, 2816x1408)
>>
File: Sigma_12167_.jpg (2.16 MB, 2816x1408)
>>
File: Sigma_12172_.jpg (1.68 MB, 2816x1408)
I summon the latent underworld to correct what was done.

We must flush this thread from our memories
>>
>>101655488
oni SEX
>>
File: Sigma_12174_.jpg (1.89 MB, 2816x1408)
>>
File: Sigma_12175_.jpg (1.66 MB, 2816x1408)
>>
File: Sigma_12176_.jpg (1.76 MB, 2816x1408)
>>
File: Sigma_12180_.jpg (1.49 MB, 2816x1408)
>>
File: Sigma_12181_.jpg (1.56 MB, 2816x1408)
>>
File: Sigma_12182_.jpg (2.2 MB, 2816x1408)
>>
File: Sigma_12183_.jpg (2.33 MB, 2816x1408)
>>
File: Sigma_12184_.jpg (1.97 MB, 2816x1408)
>>
File: Sigma_12188_.jpg (2.49 MB, 2816x1408)
>>
File: 1722296032849388.jpg (3.72 MB, 1536x2560)
first time using comfyui
>>
File: Sigma_12189_.jpg (2.28 MB, 2816x1408)
We must unburden ourselves from what has been
>>
File: Sigma_12190_.jpg (1.76 MB, 2816x1408)
>>101661333
Good first go!
>>
File: Sigma_12191_.jpg (2.45 MB, 2816x1408)
>>
File: Sigma_12193_.jpg (1.62 MB, 2816x1408)
>>
File: Sigma_12194_.jpg (2.05 MB, 2816x1408)
>>
File: Sigma_12196_.jpg (2.31 MB, 2816x1408)
>>
File: Sigma_12199_.jpg (1.45 MB, 2816x1408)
>>
File: Sigma_12203_.jpg (3 MB, 2816x1408)
>>
File: Sigma_12205_.jpg (2.61 MB, 2816x1408)
>>
File: Sigma_12206_.jpg (2.28 MB, 2816x1408)
>>
File: rabbit_bible_00002_.png (3.19 MB, 1728x1344)
>>101661556
i like this one
>>
>>101661802
Mid Century Art of a cat
>>
File: Sigma_12213_.jpg (2.69 MB, 2816x1408)
>>101661895
>>
File: Sigma_12214_.jpg (2.15 MB, 2816x1408)
>>
File: Sigma_12217_.jpg (2.18 MB, 2816x1408)
>>
File: Sigma_12225_.jpg (1.79 MB, 2816x1408)
>>
>>101661083
>>101661333
>>101661802
>>101662136
VERY nice
>>
File: Sigma_12244_.png (3.59 MB, 2816x1408)
>>101662519
ty
>>
absurd quality ITT
>>
File: Sigma_12250_.jpg (1.81 MB, 2048x2048)
>>101656616
>what the fuck is this
Apparently there's a guide and it's really good imo https://huggingface.co/docs/peft/main/en/conceptual_guides/oft
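Rough idea of how it hooks up through peft, from skimming that page; treat the exact config argument names as assumptions to check against the docs, I haven't run this:
from peft import BOFTConfig, get_peft_model

# orthogonal butterfly finetuning: instead of LoRA's low-rank additive update,
# the weight gets multiplied by a sparse product of orthogonal "butterfly" blocks
config = BOFTConfig(
    boft_block_size=4,              # size of each orthogonal block (assumed knob)
    boft_n_butterfly_factor=2,      # how many butterfly factors to stack (assumed knob)
    boft_dropout=0.1,
    target_modules=["to_q", "to_k", "to_v", "to_out.0"],  # typical attention projections, adjust per model
)
peft_model = get_peft_model(base_model, config)  # base_model = whatever transformer/unet you loaded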

>>101662662
Nice
>>
>>101662687
my feeling is that there isn't much "abstract photography" in the dataset but maybe im just failing to elicit it from the model
>>
>>
regardless, SD is outdated at this point. pixart looks better and better everyday and constantly surprises me
>>
File: Sigma_12251_.jpg (1.97 MB, 2048x2048)
>>101662740
100% boring normie IRL stuff
>>
File: Sigma_12252_.jpg (2.91 MB, 2048x2048)
>>101662817
>constantly surprises me
Same experience here. It doesn't stop. How is 2k the same param count as 1k? Controlnet, etc. are definitely missing for Sigma though
>>
Pixart IPAdapter pls
>>
File: Sigma_12253_.jpg (2.85 MB, 2048x2048)
>>101662923
Seems simple to train.. https://github.com/tencent-ailab/IP-Adapter/blob/main/tutorial_train.py

On the project list!
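The trainable part is tiny anyway. From memory it's basically a projection of the CLIP image embedding into a few extra cross-attention tokens (plus new K/V projections in each attention layer); the sketch below is my paraphrase of the idea, not the actual tutorial_train.py:
import torch.nn as nn

class ImageProjModel(nn.Module):
    # maps one pooled CLIP image embedding to N extra "image tokens"
    # that get concatenated with the text tokens for cross-attention
    def __init__(self, clip_embed_dim=1024, cross_attention_dim=2048, num_tokens=4):
        super().__init__()
        self.num_tokens = num_tokens
        self.cross_attention_dim = cross_attention_dim
        self.proj = nn.Linear(clip_embed_dim, num_tokens * cross_attention_dim)
        self.norm = nn.LayerNorm(cross_attention_dim)

    def forward(self, image_embeds):  # (batch, clip_embed_dim)
        x = self.proj(image_embeds).reshape(-1, self.num_tokens, self.cross_attention_dim)
        return self.norm(x)
Training is then the usual denoising loss with those tokens appended to the text conditioning.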
>>
File: Sigma_12208_.jpg (1.78 MB, 2816x1408)
Good night
>>
File: image.png (1.6 MB, 1792x1152)
>>
>>101661333
What model?
>>
>>101663059
>Seems simple to train..
That's what they said about pixart itself yet there are only a few people doing it :d
>>
>>101663139
kolors, it's surprisingly good for not being a finetune
>>
>>101663076
>>101663097
Very cool
>>
>>101663558
Thanks, I notice it now
>>
>>
>>101664006
How do you prompt for this?
>>
>>101664101
lucky seed
https://files.catbox.moe/9bj8li.png
>>
>>
File: 116775249263549068-SD.png (3.93 MB, 1200x1776)
>>
File: Image.jpg (734 KB, 1792x1152)
>>
>>
File: 116775249263549073-SD.png (2.65 MB, 1200x1200)
>>101664439
>>
>>
File: 00_sig12.jpg (308 KB, 1336x1336)
>>101657675
Cool
>>
I'm unable to refrain from exploring latent space
>>
File: Image.jpg (1.96 MB, 1792x2304)
>>
>>101662687
>Apparently there's a guide and it's really good imo
Did you try it? I wonder if it's upgrade for loras
>>
>>101664551
what model is that? gnarly nails aside it looks quite nice, good gen.
>>
official pixart bigma and lumina 2 and that hunyuan finetune waiting room
>>
>>
>>101665627
Pony realism
>>
File: 1693970610832092.png (909 KB, 749x753)
Are there any models or local apps that can do video inpainting like pika with 8gb of vram? I want to use ai to make women look pregnant
>>
>>101666646
ty, mind if i ask for catbox? really digging the style.
>>
>>101663299
There's no one training anything, it's a dumb metric.
>>
>>101666880
>its my comfy workflow and i get to choose the fetishes
keke
>>
https://blackforestlabs.ai/announcing-black-forest-labs/
>>
File: Sigma_12254_.jpg (1.93 MB, 2048x2048)
>>101663299
Hey anon, I train at night so there's fun results to check in the morning. Maybe speak for yourself

>>101663690
ty

>>101664710
ty

>>101664958
>I'm unable to refrain from exploring latent space
This ^

>>101665134
Seems like a more intelligent way of preserving "Hyperspherical Energy" without adding another linear layer like DoRA.. tldr; probably better and smaller but need to test still

>>101667606
>no one training anything
No u
>>
>>101667993
>12b
w-what a formidable fatty. any vram rich wanna give this a spin? seems like it's already supported by comfyui according to their huggingface repo.
>>
>>101667993
>training
No matches found
>>
>>101668071
i would but i don't see any comfyui workflow and i really don't want to fuck around with settings only to wind up disappointed
>>
>>101668015
>I train at night
What did you train last night?
>>
>>101668015
My point is saying "Sigma isn't being trained" is stupid because there's basically no movement anywhere, it's just a couple people, as always, doing 99% of the work.
>>
File: GRID_2.png (1.31 MB, 1024x585)
A new 12B parameter model just got open sourced, the examples are looking pretty good too.
https://blackforestlabs.ai/announcing-black-forest-labs/
>>
>>101668456
It's not open source, there is no training code. It's open weights and not even that, they're delivering only the cucked distilled version. Means no training.
>>
>>101668468
There's two versions, if you're talking about schnell: https://huggingface.co/black-forest-labs/FLUX.1-dev
>>
>>101668495
>FLUX.1 [dev] is an open-weight, guidance-distilled model for non-commercial applications.
>>
comfy thread
>>
>>101668427
you're mom
>>
File: Sigma_12256_.jpg (2.69 MB, 2048x2048)
>>101668427
Sigma 2k

>>101668438
It's okay anon, nobody expects you to know everything. Sigma _is_ being trained. There have been a lot of Sigma fine tunes released recently, and lately I've been training it nightly.

>>101668456
Impressive but so huge. The gens coming out are absurdly good

>>101668946
kek
>>
>>101668015
>>101669184
why do you think there arent MORE people training pixart? there are people doing it, this is true. i just expected more adoption at this point.
>>
>>101669354
no one is training anything, if you stand back for a second you'd realize that Pony is the only real training SDXL ever got.
>>
>>101669364
>no one is training anything
fair
>Pony is the only real training SDXL ever got.
perhaps
so why arent furfags jumping to sigma?
>>
>>101669364
>pixart
>hunyuan
>kolors
>lumina
>auraflow
>flux
all of this junk and not a single good finetune for any of them. this is how it's going to be for the next 5 years. endless pumping out 'almost good enough' base models that get forgotten in a week thanks to boring datasets and local finetuners lacking the compute needed to make anything with them.
>>
>>101668094
>comfyui workflow
https://comfyanonymous.github.io/ComfyUI_examples/flux/
>>
>>101669411
Because people are dumb and need to be led to water. Also base Sigma simply doesn't have enough parameters for something like Pony, so you need someone to do something like 1.3B.

>>101669427
Auraflow is still in training, why would anyone fine tune something that is still in the oven? Pixart Next will be coming and that's got Nvidia money and their team actually gives a fuck about local training. Kolors is DOA because it's Unet. Hunyuan and Lumina require 40GB+ VRAM computers to train.
>>
bigma status?
>>
File: file.png (54 KB, 256x256)
>>101669475
>white dog
It seems to be working on finer details now but things are a lot more exploded than usual.
>>
>>101669488
this made me smile :)
>>
>>101669488
I love him.
>>
>>101668456
>12B
will that work on my 24gb gpu though?
>>
>>101668468
>It's not open source, there is no training code. It's open weights and not even that, they're delivering only the cucked distilled version. Means no training.
that's pretty easy to make the training code, stop bitching we got the weights lol
>>
>>101669559
I expect to see your training code soon then.
>>
>>101669567
anon, the hard part of an imagegen model is spending millions of dollars to train a good one; making some code with chatgpt is easy as fuck in comparison, why are you crying?
>>
>>101668456
https://huggingface.co/black-forest-labs/FLUX.1-dev/discussions/1#66ab9dc4fd4ae9a7c49be855
>I have a 3090 with 24gb vram. But 12b parameters in float16 format are still ~24GB and this does not include the two text encoders nor the internal state of the model.
lmaoooo, what's the point then if no one can run it?
>>
>>101669636
Don't have to worry about competition if you make your model too big to run. It's what I would've done if I was SAI unironically.
>>
File: 1699741388573.png (594 KB, 1602x900)
>>
>>101669657
How do you convince your investors to spend millions of dollars on a model no regular user can use though? Sounds like suicide
>>
>>101669706
wtf? it's the FLUX.1-dev model?
>>
>>101669732
under the false premise that you can make money selling access via an api
>>
>>101669706
what's this picture? can you provide the source? that's interesting
>>
File: Sigma_12259_.jpg (1.78 MB, 2048x2048)
>>101669354
Two months ago there were 0 fine tunes. Momentum starts slow. How long was SDXL out before Pony?

>>101669458
>Because people are dumb and need to be lead to water.
It's worse. They need someone they already trust to tell them it's drinkable.

>>101669488
HYPE!!! Mostly white background too
>>
here are some 12b gens by an /lmg/ anon
>>101668789
>>101668964
>>101669042
>>101669149
>>
>>101655488
>>
>>101669873
lmg sama i love you
>>
>>101669873
I'll see if I can get it running since bigma crashed
>>
>>101669912
>bigma crashed
NOOOOOOOOOOOOOOOOOOOOOOOO!!!!!!!!!!!!!!!!!!!
>>
>>101669925
No, training sometimes crashes the GPUs and I have to restart the computer and it takes 15 minutes to load all the images so might as well dick around for a bit.
>>
>>101669873
that's insane, are we back? I waited so long for that day to happen!
>>
>>101669873
Wow looks really good, ty for linking!

>>>/g/ldg eating good
>>
>>101669939
if you get it running could you try this prompt? >>101669706
>>
File: Sigma_12260_.jpg (1.97 MB, 2048x2048)
>>101669912
Bigma anon doesn't stop winning even during a crash. I've never had it crash during training btw. Are you getting random OOM's or something else?
>>
>>101669982
It's probably because I use my computer while it's training. I assume there's some sort of memory leak at the Nvidia driver level; it completely kills the video drivers, the screen starts to stutter and then freezes.
>>
File: file.jpg (156 KB, 1024x1024)
it doesnt fuck up anatomy of crouching/sitting subjects like SD3 holy shit thats nice
>>
File: 5111951916.png (781 KB, 1367x824)
bruh it uses 999gb of vram, swapping all that shit takes 2 min+ to make an image, barely uses the GPU (4090)
>>
File: file.png (322 KB, 1024x1024)
you can try out the model for free on replicate
>https://replicate.com/black-forest-labs/flux-dev
>>
File: file.png (1.18 MB, 1024x1024)
tried this >>101669706
oh wow
>>
File: sample.jpg (403 KB, 1024x1024)
>>101670100
https://replicate.com/black-forest-labs/flux-pro
This one also works without an account, crank the safety tolerance to 5 so it doesn't stall on you.
>>
>>101670081
fellow 4090 user, same here i'm trying it out on /h/. the swapping is fucking brutal which is a shame because the gen speeds aren't that bad if it could actually stay loaded
>>
File: 4799.png (2.39 MB, 1024x1024)
>>101670081
Meant RAM*, it hits the disk swap like crazy on 32gb
>>101670155
yeah, hopefully the model can be trimmed a bit to fit
>>
File: file.png (1.06 MB, 1024x1024)
>>101670148
>without an account
im using it fine without an account
>flux-pro
i believe flux-pro is their api-only version, flux-dev is the model they released
>>
File: sample (1).jpg (261 KB, 1024x1024)
>>
File: file.png (1.03 MB, 1024x1024)
>touhou, crino, 1girl, she has gigantic ass tits wooow
>>
>>
File: file.png (1.27 MB, 1024x1024)
>an image of hatsune miku holding out both her hands, on her right hand is a red pill, on her left hand is a blue pill
>>
File: file.png (1.29 MB, 1024x1024)
>an image of hatsune miku, a large number of blue and red pills are coming out of her nostrils
i'll try "nose" instead next
>>
>>
File: file.png (1.34 MB, 1024x1024)
>>101670277
>an image of hatsune miku, a large number of blue and red pills are coming out of her nose
>>
>>101670246
We're so back.
>>
File: file.png (1.31 MB, 1024x1024)
>>
File: sample (3).jpg (413 KB, 1024x1024)
>>
File: file.png (1.68 MB, 1024x1024)
>a picture of new york city, there is line of giant blue and red pill shaped buses on the road. the blue pill buses all have the face of hatsune miku on them
>>
File: sample (4).jpg (313 KB, 1024x1024)
>>
>>
File: sample (5).jpg (269 KB, 1024x1024)
>>101670326
wrong Pitbull
>>
>>
File: file.png (1.39 MB, 1024x1024)
>a deranged serial killer using a crude cutout of hatsune miku's face as a mask, it is taped onto his face. he is very muscular with big nipples that look like sharp spears
not what i asked for but alright. maybe it will do better with non esl prompting
>>
>>101669427
>>101669636
Maybe this is a good time for me to complain for a bit.

How come no existing training scripts can make efficient use of multiple consumer GPUs? I made an LLM training script (qlora-pipe on Github) that does pipeline parallelism. With that + full bf16 training + Kahan summation in the optimizer, I can match the performance of mixed precision while full finetuning something like llama 3 8b on 4x4090. But with SDXL, despite being a mere 2.6B parameters, I can't FFT it (not without compromises) using any training script.

OneTrainer doesn't even support multi-GPU (lol, lmao even). With kohya, FSDP doesn't work. Deepspeed only got support recently, but DS Zero forces you into mixed precision training (where weights, grads, and optimizer state are all kept in fp32). Plus Zero has high inter-GPU bandwidth requirements and a decent amount of VRAM overhead it seems like. Basically I can't do a proper FFT of SDXL even on a fucking 4x4090 machine.

Full bf16 training + adam with kahan summation uses 10 bytes per parameter. SDXL should easily be able to be FFT'd on just 2 3090s, which is a common setup for AI enthusiasts (at least in LLM land). No training script can even get close to this. And for the new flux 12b for example, a pipeline parallel training script ought to be able to do a decent rank lora on 2x3090 as well.

At this point I just need to make a pipeline parallel training script for diffusion models I guess.
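For anyone wondering what "full bf16 + Kahan summation" means in practice, here is a toy single-tensor Adam step; it's my own sketch of the idea, not lifted from qlora-pipe, with weights, grads, both moments and the compensation buffer all in bf16, i.e. the 10 bytes/param above:
import torch

@torch.no_grad()
def adam_step_bf16_kahan(p, grad, exp_avg, exp_avg_sq, comp, step,
                         lr=1e-5, beta1=0.9, beta2=0.999, eps=1e-8):
    # all five tensors are bf16 (2 bytes each = 10 bytes per parameter); math uses fp32 temporaries
    g = grad.float()
    m = exp_avg.float().mul_(beta1).add_(g, alpha=1 - beta1)
    v = exp_avg_sq.float().mul_(beta2).addcmul_(g, g, value=1 - beta2)
    exp_avg.copy_(m)
    exp_avg_sq.copy_(v)
    update = -lr * (m / (1 - beta1 ** step)) / ((v / (1 - beta2 ** step)).sqrt() + eps)
    # Kahan-style compensation: keep the bits that bf16 rounding of the weight would throw away,
    # so thousands of tiny updates don't silently vanish like they do with plain bf16 weights
    new_p = p.float() + update + comp.float()
    p_rounded = new_p.to(torch.bfloat16)
    comp.copy_((new_p - p_rounded.float()).to(torch.bfloat16))
    p.copy_(p_rounded)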
>>
File: 1734.jpg (1.18 MB, 3072x2048)
fluxsisters... I need more ram
>>
If Flux can't do female feet, it's not worth using.
>>
File: Sigma_12261_.jpg (2.15 MB, 2048x2048)
>>101670013
Whoa.. I can even game while training on arch/kde. Base KDE/X uses 1GB VRAM and no more

>>101670080
Look at that hand!

>>101670222
Why so blurry?

>>101670246
Amazing
>>
File: file.png (1.22 MB, 1024x1024)
since it's a transformer i hope it can be quanted like llms
>a holy painting of jesus christ caressing his big pregnant belly. there is a speech bubble above him saying "I shall name him PixArt Bigma"
>>
>>101670414
Because I asked for it. It's unfortunate that low-quality stuff like that is unreliable at best with these models, they filter out the low-quality images for training and that's all I want to generate.
Maybe in the future there's gonna be image upload/customization options like Midjourney, that would really be something.
>>
>>101670441
kekd
>>
>>101670395
Basically no effort went into this because they know that true democratization of training kills them. There's a reason why they keep the requirements >24 GB.
>>
>>101670184
I don't see much difference in image quality between pro and dev, that's cool
>>
File: file.png (1.4 MB, 1024x1024)
>an image of a cat just sitting there looking at the viewer. a speech bubble above him says "Near a tree by a river
There's a hole in the Ground
Where an old man of Aran
Goes around and around
And his mind is a beacon
In the veil of the Night
For a Strange kind of Fashion
There's a wrong and a Right"
>>
>>101670413
>female feet
>>
File: file.png (1.71 MB, 1024x1024)
>an image of "the last supper" but everybody has been replaced with drag queens and shemales, there is a child in a yellow coat for some reason
>>
>ai generated
>>
>>101670556
I'm counting 5 toes on each foot, it's worth using.
>>
File: file.png (1.18 MB, 1024x1024)
>an image of hatsune miku holding out both her hands, on her right hand is a red pill, on her left hand is a blue pill, there is a speech bubble above her saying "the right makes you constipated, the left gives you diarrhea. choose wisely.", 1badass
it gave me a 1cute instead
>>
File: 489362.png (1.19 MB, 1216x832)
KEK
>>
>ERROR
>You have reached the free time limit.
Death to non-local
>>
>>101670629
hunyuan feet anon is going to love this
>>
File: 9762.png (1.3 MB, 1216x832)
>>101670632
>Death to non-local
>ERROR
>You are out of memory.
>>
File: image.jpg (121 KB, 1024x1024)
>>
>woman bowing forward, seen from behind, bikini
will not make her bow at all
>>
File: image (1).jpg (66 KB, 1024x1024)
>>
this had to be trained on synthslop, i can just tell. betting the paper will confirm another journeyDB masterpiece
>>
>>101670725
yeah it looks like it desu
>>
>>101670725
yes look at those feet
>>
File: file.png (852 KB, 1024x1024)
>an image of a cute little chibi anime girl smiling at the viewer, a speech bubble above her says "i'm going to say the nigger word"
>>
File: image (2).jpg (124 KB, 1024x1024)
>>
https://replicate.com/black-forest-labs/flux-pro/examples
>Hardware: CPU
>Total duration 20.7s
wtf? how can it be so fast with cpu?
>>
>>101670767
AMD MI300
>>
File: sillytest.png (606 KB, 815x721)
i'm impressed by stable-fast-3d with how it handles non character images

https://huggingface.co/stabilityai/stable-fast-3d
>>
File: file.png (1.18 MB, 1024x1024)
>>101670303
KEK
>>
>>101670767
https://replicate.com/pricing
Also says the CPU is 4X. Is it regular 4 cores? or are they using some server grade 64 core x 4 = 256 cores?
>>
File: image (3).jpg (120 KB, 1024x1024)
>>101670725
>synthslop
yep
>>
>>101670148
I'm not feeling this model at all. I am not happy with the results from the pro version and that is the one you don't get to download. Dev is supposedly even worse. 12B model too.
I ran a bunch of tests and I prefer Dall-E 3 outputs over this pro model.

This does not seem like a local model competitor, but more like a DE3, SD3 Large and Midjourney competitor.
>>
>>101670643
It's pretty bad at making feet actually. Worse than the Chinese models.
Also I think you can easily finetune the china models for feet, but not this one so much.
>>
File: image (4).jpg (114 KB, 1024x1024)
>>
File: flux.jpg (328 KB, 768x1280)
>>101670733
something about it looks like Midjourney v4 outputs aesthetically. doesn't look authentic. ai trained on ai vibe. cool comprehension and a fun model but doesn't seem like the improvements line up with increased resource requirements. feels like an 8b model at max
>>
>>101670794
I really don't give a shit about 3D modeling
>>
File: file.png (1004 KB, 1024x1024)
>>101670836
hunyuan feet anon is going to hate this...

>Patchouli Knowledge from touhou, her hair and eyes are purple and has many ribbons tied to her hair and other parts of her clothing. She wears pink pajama-like clothing and a night-cap with a gold crescent moon on it. Her dress has stripes of purple and violet, she is eating a cigarette
>>
>>101670811
you can test out the dev version here
https://replicate.com/black-forest-labs/flux-dev
it's not that different to pro imo, and it's easily the best local model we ever had, that's a great day for me, fuck SAI
>>
File: mohammed.jpg (129 KB, 1024x1024)
>>
>>101670148
>>101670424
>Great image quality
>Ok prompt understanding
>Can do NSFW
>Nice anatomy
>Apache 2.0 Licence
That's insane, I never thought we would see this day, WE ARE SO BACK
>>
https://huggingface.co/black-forest-labs/FLUX.1-schnell/tree/main

24GB model. Need a distilled version under 8GB
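Back-of-envelope for the weights alone (text encoders and activations not counted):
params = 12e9
print(f"bf16/fp16: {params * 2 / 2**30:.1f} GiB")    # ~22.4 GiB, the ~24 GB checkpoint
print(f"fp8:       {params * 1 / 2**30:.1f} GiB")    # ~11.2 GiB
print(f"4-bit:     {params * 0.5 / 2**30:.1f} GiB")  # ~5.6 GiB
So "under 8GB" realistically means either a much smaller student model or roughly 4-bit quantization.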
>>
File: image (6).jpg (74 KB, 1024x1024)
>>101670932
nah i dont think this is it
>>
File: file.png (976 KB, 1024x1024)
>a little anime girl sitting on a dirty couch looking wasted, dark circles under her eyes. she has a cigarette in her mouth, hand and nose. next to her is a beer can with a cigarette in it instead of a straw. her room is dusty and decrepit, there is a thought bubble forming above her head, inside it is an image of a pack of cigarettes
>>
>>101670081
why are you using the schnell one? it's the worst version
>>
>>101670942
what about the dev version? it's the better one no?
>>
https://comfyanonymous.github.io/ComfyUI_examples/flux/
>If you don’t have t5xxl_fp16.safetensors or clip_l.safetensors already in your ComfyUI/models/clip/ directory you can find them on: this link. You can use t5xxl_fp8_e4m3fn.safetensors instead for lower memory usage but the fp16 one is recommended if you have more than 32GB ram.
How does that work? you can run the model on the GPU and the text encoder on the CPU?
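Pretty much; ComfyUI shuffles the pieces between RAM and VRAM for you. If you'd rather script it, here's a sketch of the same idea with diffusers (assuming its FluxPipeline and the usual offload helper; prompt and settings are just examples):
import torch
from diffusers import FluxPipeline

pipe = FluxPipeline.from_pretrained("black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16)
# keep components in system RAM and move each one (text encoders, transformer, VAE)
# onto the GPU only while it is actually running
pipe.enable_model_cpu_offload()
image = pipe("oni girl in a fur bikini, film photo",
             num_inference_steps=28, guidance_scale=3.5).images[0]
image.save("flux_test.png")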
>>
>>101670954
The dev version is a bit better and is distilled from the PRO version
>>
File: file.png (1.47 MB, 1024x1024)
>A tense diplomatic negotiation in a grand hall, featuring representatives from 20 different countries, each wearing traditional attire. The scene should include interpreters, aides whispering to their leaders, and visible emotional reactions ranging from frustration to hope.
Duonald trump
>>
>>101670947
idk, I picked the one that was running on the HF space
>>
>>101670303
where is he off to?
>>
File: sample.jpg (322 KB, 1440x1440)
>>101670932
yes anon, we're at insane level of back
>>
>>101670932
>Can do NSFW
it can not, try to do a woman from behind bowing
>>
Dallefags in shambles. We actually won
>>
>>101670980
it can do nude, and has great anatomy, finetunes will help for the poses
>>
>>101670978
/sdg/
>>
>>101670932
>requires RTX 4090
>most likely bitch to train and make loras for
>anatomy still lacking compared to Kolors for example

I doubt that this is it, but maybe. Apache 2.0 license is the biggest thing that sets it apart from all the others.
>>
File: sillytest2.png (1.45 MB, 1198x1111)
>>101670856
wasn't this supposed to be a safe space for alternative things?
>>
File: Sigma_12267_.jpg (2.26 MB, 2048x2048)
>>101670725
Can't copyright AI output so likely the investor-safe route. Still Sigma 0.6B btfo by this 20x larger model though got dayum
>>
>>101671010
>Apache 2.0 license is the biggest thing that sets it apart from all the others.
not just that, the image quality is insane, and it has great prompt understanding and is perfect at text, this one is truly at API levels >>101670979
>>
File: image (8).jpg (185 KB, 1024x1024)
>>101670979
>>
>>101671030
it's also great at hands, holy fuck I never expected to get such a great local model in my lifetime
https://reddit.com/r/StableDiffusion/comments/1ehknmh/new_ai_model_flux_fixes_hands/
>>
https://comfyanonymous.github.io/ComfyUI_examples/flux/
Can someone provide the links for the flux nodes?
>>
>>101671016
that's pretty cool ngl kek
>>
File: file.png (1.75 MB, 1024x1024)
>>
https://huggingface.co/camenduru/FLUX.1-dev/tree/main
>flux1-dev.sft
what's a .sft? that's the model? why isn't it a safetensor model like the others?
>>
File: Sigma_12269_.jpg (1.78 MB, 2048x2048)
>>101671016
It's not a safe space but he doesn't even have a gen. Fuck that guy and keep posting your local gens like the OP suggests
>>
>>101671091
>SFT
>SaFeTensors
>>
>>101671065
comfy has native support for this type of model. doesn't need extra nodes.
>>
>>101671091
.sft is safetensors according to comfy
>>
>>101671065
flux came with day 1 native comfy support
>>
File: ComfyUI_0036.jpg (99 KB, 1024x1024)
>>
File: file.png (983 KB, 1024x1024)
>>
File: file.png (1.02 MB, 1024x1024)
>ginger woman squatting, she is wearing round glasses and a stripped top with overalls a small frilled skirt and knee high pink boots,
>>
>>101671030
I'll believe it when I see people easily making finetunes and loras for this beast. I want to see a good paper with no ClosedAI bullshit where all the sauce is hidden. I want training examples, training code, etc.

Just dumping the weights on the internet is not "open source" enough for me.
>>
File: file.png (1.09 MB, 1024x1024)
>a profound image of an anime girl in deep meditation, a white glow emanating from her head as she attains nirvana, the mighty glow from her eyes cause the entire image to tremble and warp. there is a speech bubble above her head saying "what if pixart 12b"
>>
>>101671054
But terrible at feet. What a price to pay. Still better than SD3.
>>
>>101671104
>>101671100
oh ok my b I'm a retard, thanks kek

>>101671146
it's way harder to get millions of dollars to train a 12b model than to write training code; don't worry about it, the model is so good everyone will make training work
>>
>>101671169
Also, keep in mind file extensions are just cosmetic; they merely inform programs of what to expect. You can rename a model to .jpg and still load it just fine, as long as the program knows it should try to load .jpg files as tensors.
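e.g. this loads no matter what the file is called, as long as it really is safetensors data underneath (path is hypothetical):
from safetensors.torch import load_file

state_dict = load_file("flux1-dev.sft")  # the header inside the file matters, not the extension
print(len(state_dict), "tensors")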
>>
File: file.png (1.72 MB, 1024x1024)
shrek lying on a recliner next to a pool, hes is drinking margarita and saying on a speech bubble "life is good"
>>
>>101670478
>they keep the requirements >24 GB
I mean it's not some big conspiracy. Larger models are better, and will require high VRAM GPUs or multiple smaller GPUs to even train a lora. That's fine. I'm just surprised that with how popular imagegen is, none of the training script creators have put much effort into efficiently splitting models across 2+ GPUs so they can be trained with consumer hardware. It's possible and not even particularly difficult, it's just nobody seems to care. Like I said, at this point I'm seriously considering making my own pipeline parallel training script (will be open source if I do it), especially if this flux model or the new larger pixart model are any good.
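The core trick really is small. Toy sketch of a naive two-GPU split below (plain model parallelism; a real training script would add microbatching so both cards stay busy, which is where the actual work is):
import torch
import torch.nn as nn

class TwoGPUSplit(nn.Module):
    # first half of the blocks on cuda:0, second half on cuda:1;
    # activations hop across the PCIe link once per forward (and once again on backward)
    def __init__(self, blocks, split):
        super().__init__()
        self.first = nn.Sequential(*blocks[:split]).to("cuda:0")
        self.second = nn.Sequential(*blocks[split:]).to("cuda:1")

    def forward(self, x):
        x = self.first(x.to("cuda:0"))
        return self.second(x.to("cuda:1"))

blocks = [nn.Linear(1024, 1024) for _ in range(8)]  # stand-in for transformer blocks
model = TwoGPUSplit(blocks, split=4)
loss = model(torch.randn(2, 1024)).float().pow(2).mean()
loss.backward()  # autograd handles the cross-device hops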
>>
File: file.jpg (267 KB, 1360x768)
we're so back
>>
>>101671222
wtf pikachu
>>
>>101671216
>it's just nobody seems to care
not enough people care, images are useless, meanwhile LLMs can actually do things, hence most people who know about ml are working on that instead
>>
>>101671103
>comfy has native support for this type of model. doesn't need extra nodes.
I always thought I would never use this spaghetti shit but here we go... the model is too good to be avoided at this point
>>
>>101671216
I'm just saying that as a business decision targeting >24 GB is a smart choice as it gives you brownie points and publicity having a "local" model while forcing most people to use your API. Honestly the best license would be something like:
"Commercial use except for on-demand image generation via an API"
>>
>>101671222
This image made me realise I definitively have some weird cloth fetish.
>>
File: file.png (1.4 MB, 1024x1024)
>image of an heavenly immortal anime girl seated in deep mediation, her cultivation breaking through to 12b pixart biggerma realm, heaven and earth shatter as a speech bubble appear above her head saying "what if pixart 12b"
>>
>>101671216
>I mean it's not some big conspiracy. Larger models are better
this, don't blame the model creators, blame Nvidia for nerfing the VRAM, it's been stuck at 24gb since FUCKING 2018 (Rtx Titan)
>>
>>101671241
That is changing now as all the modalities seem to be converging. Audio, text and image all on a single multimodal LLM.
>>
>>101671286
a colossal model that no one will be able to run too
>>
File: file.png (1.23 MB, 1024x1024)
>photo electric effect
>>
File: ComfyUI_00226_.jpg (749 KB, 2048x2048)
The way it works with texts, even in cursive, is fucking amazing
>>
>>101671366
LMAOOOOOO
>>
>>101671366
so you made in run in comfy ui anon? how much VRAM does it ask? (image model + text encoder)
>>
>>101671241
For productive use, LLMs are more impactful than imagegen, yes. But for hobbyist use for "fun" (porn), I think imagegen is way more popular than anything people do with LLMs. Look how many loras and models are on civit compared to community LLM finetunes on huggingface.

You see all these anons in this thread complaining that larger models are way too hard to finetune. This is only because existing training scripts are shit and can't do it, theoretically it's easily achievable. With a 2x3090 machine you should be able to FFT SDXL or train a lora on Hunyuan or the new flux 12b model.

Fuck it, this weekend I'll make an attempt at a pipeline parallel training script for diffusion models, at least just to try to judge how much work it would be. Probably it shouldn't even be that much work if I reuse all the dataset loading code from kohya and make it based on HF Diffusers.
>>
File: file.png (10 KB, 287x213)
>>101671401
Even in offload mode it maxes out my 4090
>>
File: file.png (894 KB, 1024x1024)
>A colossal anime woman towers above a plain field, her gigantic form stretching across the sky. The left side is bathed in a brilliant blue sky, while the right side is shrouded in a deep, velvety night sky, which she wears like a cape. Stars twinkle like diamonds across her gown, and the moon casts a silver glow on her majestic form.
not what i asked but looks quite nice
>>
https://huggingface.co/camenduru/FLUX.1-
>ae.sft
>clip_l.safetensors
>flux1-dev.sft
>t5xxl_fp16.safetensors
>t5xxl_fp8_e4m3fn.safetensors
can someone help a retard who's using Comfy for the first time in his life? do I have to download everything? what do those files mean?
>>
Fresh bread is ready to eat...
>>101671236
>>101671236
>>101671236
>>
File: file.png (1.35 MB, 1024x1024)
>>101671446
>A colossal anime woman towers above a plain field, her gigantic form stretching across the sky. But it's what she's wearing that's truly striking: the night sky itself, draped across her shoulders like a majestic cape. Stars twinkle like diamonds, and the moon casts a silver glow on the folds of her celestial garment, as if the very fabric of the universe has come to life to adorn her.
>>
>>101671426
>Even in offload mode it maxes out my 4090
what do you offload? the text encoder?
>>
>>101671453
ty baker
>>
>>101655488
I haven't come to /g/ in ages. What's the difference between this general and the stable diffusion one? It seems like anons are posting the exact same type of content in both.
>>
>>101671426
what if you load in 8bit?
>--fp8_e5m2-text-enc --fp8_e5m2-unet
>>
>>101671544
/sdg/ allows saas gens, /ldg/ is local only
>>
>>101671544
the amount of free mental healthcare available in the country of the frequenters
>>
>>101671544
>exact same type of content in both.
lurk long enough and you'll realize sdg is just a discord chatroom kek
>>
>>101671617
>lurk long enough and you'll realize 4chan is just a discord chatroom kek
>>
File: 1720013034361301.png (1.16 MB, 1024x1024)
>>
>>101671630
>he doesn't know most of sdg is avatarfags saying gm and gn to eachother and sharing suno songs
>>
>>101671544
There's none. This thread has no right to exist.
>>
Ok. I came here because of Flux. I've noticed /ldg/ and /sdg/ for some time now but I don't know the exact difference. I guess /ldg/ is more about tech than sharing gens, where /sdg/ is just about sharing gens? Or is there some drama that I missed that explains the split? If you could explain it as a veteran /sdg/ fag I'd be grateful
>>
>>101672532
/sdg/ allows dalle and gemma gens, /ldg/ is strictly local
>>
>>101672553
hmm. That doesn't sound right. Why is it called /sdg/ then? Also... that's it? No tripcode drama war or spamming autists fighting each other?
>>
>>101672586
it's avatarfag and blogpost central as well
>That doesn't sound right. Why is it called /sdg/ then?
no clue, they just stopped caring for whatever reason
>>
>>101672614
fwiw, I like the "local [ai topic] general" naming better (like /lmg/) although iirc it didn't exist when /sdg/ came out. (or maybe it did? I forget which came first) I know /lmg/ split from /aicg/ which is actually cancerous.
>>
>>101672685
anon frequently compares sdg to aicg and ldg to lmg, this is true
>>
>>101672685
>I know /lmg/ split from /aicg/ which is actually cancerous.
/ldg/ split from /sdg/ for similar reasons, it happened a while before the sd3 launch
>>
nice
>>
>>101675016
sure...
>>
>posting in the previous previous bred
>>
Oni girl rocks
>>
>>101675045
Thanks
>>
>>101675016
>>101675031
what did she mean by this
>>
just wanted you guys to know that i have a boner
>>
would you like help with that, anon
>>
>>101671423
>But for hobbyist use for "fun" (porn), I think imagegen is way more popular than anything people do with LLMs
lmao no, ERP is addictive as crack, at least the first few months
the reason you see fewer loras for LLMs is that the smallest LLMs are 4 times as big as the biggest image gen models


